15 213 The Class That Gives CMU Its Zip Introduction to Computer Systems Frank Pfenning January 17 2006 Topics Theme and objective Five great realities of computer systems How this fits within CS curriculum Course mechanics and overview 01 overview ppt 15 213 S 06 Course Theme Abstraction is good but programs run on real hardware Courses to date emphasize abstraction Abstract data types Asymptotic analysis These abstractions have limits Need to understand underlying implementations Performance time and space Useful outcomes Become more effective programmers Able to find and eliminate bugs efficiently Able to tune program performance Prepare for later systems classes in CS ECE Compilers Operating Systems Networks Computer Architecture Embedded Systems 2 15 213 S 06 Great Reality 1 Int s are not Integers Float s are not Reals Examples Is x2 0 Float s Yes Int s 40000 40000 1600000000 50000 50000 Is x y z x y z Unsigned Signed Int s Yes Float s 1e20 1e20 3 14 3 14 1e20 1e20 3 14 3 15 213 S 06 Computer Arithmetic Does not generate random values Arithmetic operations have important mathematical properties Cannot assume usual properties Due to finiteness of representations Integer operations satisfy ring properties Commutativity associativity distributivity Floating point operations satisfy ordering properties Monotonicity values of signs Observation Need to understand which abstractions apply in which contexts 4 Important issues for compiler writers and serious application programmers 15 213 S 06 Great Reality 2 You ve got to know assembly Chances are you ll never write a program in assembly Compilers are much better and more patient than you are Understanding assembly key to machine level execution model Behavior of programs in presence of bugs High level language model is inadequate Tuning program performance Understanding sources of program inefficiency Implementing system software Compiler has machine code as target Operating systems must manage process state 5 15 213 S 06 Measuring Time Trickier than it Might Look Many sources of variation Example Sum integers from 1 to n n 100 1 000 1 000 10 000 10 000 1 000 000 1 000 000 1 000 000 000 6 Cycles 961 8 407 8 426 82 861 82 876 8 419 907 8 425 181 8 371 2305 591 Cycles n 9 61 8 41 8 43 8 29 8 29 8 42 8 43 8 37 15 213 S 06 Great Reality 3 Memory Matters Random Access Memory is an un physical abstraction Memory is not unbounded It must be allocated and managed Many applications are memory dominated Memory referencing bugs especially pernicious Effects are distant in both time and space Memory performance is not uniform Cache and virtual memory effects can greatly affect program performance Adapting program to characteristics of memory system can lead to major speed improvements 7 15 213 S 06 Memory Referencing Bug Example main main long long int int a 2 a 2 double double dd 3 14 3 14 a 2 a 2 1073741824 1073741824 Out Out of of bounds bounds reference reference printf d printf d 15g n 15g n d d exit 0 exit 0 Alpha MIPS Linux g 5 30498947741318e 315 3 1399998664856 3 14 O 3 14 3 14 3 14 Linux version gives correct result but implementing as separate function gives segmentation fault 8 15 213 S 06 Memory Referencing Errors C and C do not provide any memory protection Out of bounds array references Invalid pointer values Abuses of malloc free Can lead to nasty bugs Whether or not bug has any effect depends on system and compiler Action at a distance Corrupted object logically unrelated to one being accessed Effect of bug may be first observed long after it is generated How can I deal with this Program in Java Lisp ML or Cyclone Understand what possible interactions may occur 9 Use or develop tools to detect referencing errors 15 213 S 06 Memory System Performance Example void copyij int int int i j for i 0 i for j 0 dst i j src 2048 2048 dst 2048 2048 2048 i j 2048 j src i j 59 393 288 clock cycles void copyji int int int i j for j 0 j for i 0 dst i j src 2048 2048 dst 2048 2048 2048 j i 2048 i src i j 1 277 877 876 clock cycles 21 5 times slower Measured on 2GHz Intel Pentium 4 Hierarchical memory organization Performance depends on access patterns Including how step through multi dimensional array 10 15 213 S 06 The Memory Mountain Pentium III Xeon 550 MHz 16 KB on chip L1 d cache 16 KB on chip L1 i cache 512 KB off chip unified L2 cache 1200 Read throughput MB s copyij 1000 L1 800 copyji 600 400 xe L2 200 11 2k 8k 32k 128k 512k 2m 8m s15 s13 s9 s11 Stride words Mem s7 s5 s3 s1 0 Working set size bytes 15 213 S 06 Memory Performance Example Implementations of Matrix Multiplication Multiple ways to nest loops ijk ijk for for i 0 i 0 i n i n i i for for j 0 j 0 j n j n j j sum sum 0 0 0 0 for for k 0 k 0 k n k n k k sum sum a i k a i k b k j b k j c i j c i j sum sum 12 jik jik for for j 0 j 0 j n j n j j for for i 0 i 0 i n i n i i sum sum 0 0 0 0 for for k 0 k 0 k n k n k k sum sum a i k a i k b k j b k j c i j c i j sum sum 15 213 S 06 Great Reality 4 There s more to performance than asymptotic complexity Constant factors matter too Easily see 10 1 performance range depending on how code written Must optimize at multiple levels algorithm data representations procedures and loops Must understand system to optimize performance How programs compiled and executed How to measure program performance and identify bottlenecks How to improve performance without destroying code modularity and generality 13 15 213 S 06 Great Reality 5 Computers do more than execute programs They need to get data in and out I O system critical to program reliability and performance They communicate with each other over networks Many system level issues arise in presence of network Concurrent operations by autonomous processes Coping with unreliable media Cross platform compatibility Complex performance issues 14 15 213 S 06 Role within Curriculum CS 412 Operating Systems CS 441 Networks Network Protocols CS 212 Execution Models Processes Mem Mgmt CS 411 Compilers Machine Code Optimization Data Structures Applications Programming 15 ECE 349 Embedded Systems Exec Model Memory System CS 213 Systems CS 211 Fundamental Structures ECE 447 Architecture CS 113 C Programming Transition from Abstract to Concrete From high level language model To underlying implementation 15 213 S 06 Course Perspective Most Systems Courses are Builder Centric Computer Architecture Design pipelined processor in Verilog Operating Systems …
View Full Document