15 213 The course that gives CMU its Zip Machine Level Programming I Introduction Sept 7 2007 Topics Assembly Programmer s Execution Model Accessing Information Registers Memory class04 ppt Arithmetic operations 15 213 F 07 x86 Processors Dominate the Desktop Laptop and Server Markets Evolutionary Design Starting in 1978 with 8086 Added more features as time goes on Still support old features although obsolete Complex Instruction Set Computer CISC Many different instructions with many different formats But only small subset encountered with Linux programs Reduced Instruction Set Computers RISC enjoyed a performance advantage during the late 80s early 90s Until a CMU alumnus Bob Colwell changed that with Pentium Pro Since Pentium Pro x86 has been a performance leader 2 15 213 F 07 x86 Evolution Programmer s View Abbreviated Name 8086 Transistors 1978 29K 16 bit processor Basis for IBM PC DOS Limited to 1MB address space DOS only gives you 640K 386 3 Date 1985 275K Extended to 32 bits Added flat addressing Capable of running Unix Referred to as IA32 32 bit Linux gcc uses no instructions introduced in later models 15 213 F 07 x86 Evolution Programmer s View Machine Evolution 486 1989 1 9M Pentium 1993 3 1M Pentium MMX 1997 4 5M PentiumPro 1995 6 5M Pentium III 1999 8 2M Pentium 4 2001 42M Watershed design Added Features Instructions to support multimedia operations Parallel operations on 1 2 and 4 byte data both integer FP Instructions to enable more efficient conditional operations Linux GCC Evolution 4 None 15 213 F 07 Itanium a 64 bit architecture Name Date Transistors Itanium 2001 10M Extends to IA64 a 64 bit architecture Radically new instruction set designed for high performance Can run existing IA32 programs On board x86 engine Joint project with Hewlett Packard Itanium 2 2002 221M Big performance boost Itanium 2 Dual Core 2006 1 7B Itanium has not taken off in marketplace as Intel had originally hoped 5 15 213 F 07 x86 Clones Advanced Micro Devices AMD Historically AMD has followed just behind Intel A little bit slower a lot cheaper Starting in roughly 2001 Recruited top circuit designers from Digital Equipment Corp and other downward trending companies Exploited fact that Intel was distracted by Itanium Started making very competitive products especially at the high end Developed x86 64 its own extension to 64 bits Started eating into Intel s high end server market 6 15 213 F 07 Intel s Response to AMD s x86 64 2004 Intel Announces EM64T extension to IA32 7 Extended Memory 64 bit Technology Very similar to x86 64 Our Saltwater fish machines 15 213 F 07 The Rate of Single Thread Performance Improvement has Decreased Figure courtesy of Hennessy Patterson Computer Architecture A Quantitative Approach V4 8 15 213 F 07 Impact of Power Density on the Microprocessor Industry Pat Gelsinger ISSCC 2001 The future is not higher clock rates but multiple cores per die 9 15 213 F 07 x86 Evolution Recent History Year Transistors Clock GHz Power W Pentium 4 2000 42M 1 7 3 4 65 89 Pentium M 2003 140M 1 4 2 1 21 Core Duo 2006 151M 2 3 2 5 Core 2 Duo 2006 291M 2 6 2 9 Core 2 Quad 2006 2x291M 2 6 2 9 Intel Core 2 Duo Conroe Copyright Intel Copyright Intel To learn more about parallel processing take 15 418 in Spring 08 10 15 213 F 07 Our Coverage IA32 The traditional x86 x86 64 The emerging standard Presentation Book has IA32 Handout has x86 64 Lecture will cover both Labs Lab 2 x86 64 Lab 3 IA32 11 15 213 F 07 Assembly Programmer s View CPU Registers P C Condition Codes Memory Addresses Data Instructions Object Code Program Data OS Data Programmer Visible State PC Stack Program Counter Address of next instruction Called EIP IA32 or RIP x86 64 Register File Heavily used program data Condition Codes Store status information about most recent arithmetic operation Used for conditional branching 12 Memory Byte addressable array Code user data some OS data Includes stack used to support procedures 15 213 F 07 Turning C into Object Code Code in files p1 c p2 c Compile with command gcc O p1 c p2 c o p Use optimizations O Put resulting binary in file p text C program p1 c p2 c Compiler gcc S text Asm program p1 s p2 s Assembler gcc or as binary Object program p1 o p2 o Static libraries a Linker gcc or ld binary 13 Executable program p 15 213 F 07 Compiling Into Assembly C Code int sum int x int y int t x y return t Generated IA32 Assembly sum pushl ebp movl esp ebp movl 12 ebp eax addl 8 ebp eax movl ebp esp popl ebp ret Obtain with command gcc O S code c Produces file code s 14 15 213 F 07 Assembly Characteristics Minimal Data Types Integer data of 1 2 or 4 bytes Data values Addresses untyped pointers Floating point data of 4 8 or 10 bytes No aggregate types such as arrays or structures Just contiguously allocated bytes in memory Primitive Operations Perform arithmetic function on register or memory data Transfer data between memory and register Load data from memory into register Store register data into memory Transfer control Unconditional jumps to from procedures Conditional branches 15 15 213 F 07 Object Code Code for sum Assembler Translates s into o 0x401040 sum Binary encoding of each instruction 0x55 Total of 13 0x89 Nearly complete image of executable bytes 0xe5 code Each 0x8b instruction 1 Missing linkages between code in 0x45 2 or 3 bytes different files 0x0c Starts at 0x03 address Linker 0x45 0x401040 0x08 Resolves references between files 0x89 Combines with static run time libraries 0xec E g code for malloc printf 0x5d 0xc3 Some libraries are dynamically linked Linking occurs when program begins execution 16 15 213 F 07 Machine Instruction Example C Code int t x y Add two signed integers Assembly addl 8 ebp eax Long words in GCC parlance Same instruction whether signed Similar to expression x y Or Add 2 4 byte integers or unsigned Operands x y t int eax int ebp eax ebp 2 Register eax Memory M ebp 8 Register eax Return function value in eax Object Code 0x401046 17 03 45 08 3 byte instruction Stored at address 0x401046 15 213 F 07 Disassembling Object Code Disassembled 00401040 sum 0 55 1 89 e5 3 8b 45 0c 6 03 45 08 9 89 ec b 5d c c3 d 8d 76 00 push mov mov add mov pop ret lea ebp esp ebp 0xc ebp eax 0x8 ebp eax ebp esp ebp 0x0 esi esi Disassembler objdump d p 18 Useful tool for examining object code Analyzes bit pattern of series of instructions Produces approximate rendition of assembly code Can be run on either a out complete executable or o file 15 213 F 07 Alternate Disassembly
View Full Document