II:1 X86 Assembly - Data. II:2 Admin Quiz?  What happened?  Make-up options? Several missing from lab Attendance at lab is required Passing grade without.

Slides:



Advertisements
Similar presentations
Fabián E. Bustamante, Spring 2007 Machine-Level Programming – Introduction Today Assembly programmer’s exec model Accessing information Arithmetic operations.
Advertisements

Machine Programming I: Basics
Machine-Level Programming I: Basics
Machine Programming – Procedures and IA32 Stack CENG334: Introduction to Operating Systems Instructor: Erol Sahin Acknowledgement: Most of the slides are.
University of Washington Instruction Set Architectures ISAs Brief history of processors and architectures C, assembly, machine code Assembly basics: registers,
Carnegie Mellon 1 Machine-Level Programming I: Basics /18-243: Introduction to Computer Systems 4 th Lecture, Sep. 2, 2010 Instructors: Randy Bryant.
Instruction Set Architectures
Carnegie Mellon 1 Machine-Level Programming I: Basics /18-213: Introduction to Computer Systems 5 th Lecture, Tue. May 27, 2015 Instructors: Nathaniel.
Machine-Level Programming I: Introduction Topics Assembly Programmer’s Execution Model Accessing Information Registers Memory Arithmetic operations CS.
Machine-Level Programming I: Introduction January 29, 2002 Topics Assembly Programmer’s Execution Model Accessing Information –Registers –Memory Arithmetic.
– 1 – ISAs and Microarchitectures Instruction Set Architecture The interface between hardware and software “Language” + programmer visible state + I/O.
Machine-Level Programming I: Introduction Sept. 10, 2002 Topics Assembly Programmer’s Execution Model Accessing Information Registers Memory Arithmetic.
Machine-Level Programming I: Introduction
– 1 – ISAs and Microarchitectures Instruction Set Architecture The interface between hardware and software “Language” + programmer visible state + I/O.
Machine-Level Programming I: Introduction Jan 27, 2004 Topics Assembly Programmer’s Execution Model Accessing Information Registers Memory Arithmetic operations.
Machine-Level Programming I: Introduction Apr. 10, 2006 Topics Assembly Programmer’s Execution Model Accessing Information Registers Memory Arithmetic.
Machine-Level Programming I: Introduction Jan. 22, 2008 Topics Assembly Programmer’s Execution Model Accessing Information Registers Memory Arithmetic.
Machine-Level Programming I: Introduction Apr. 14, 2008 Topics Assembly Programmer’s Execution Model Accessing Information Registers Memory Arithmetic.
Machine-Level Programming I: Introduction Sept. 14, 2004 Topics Assembly Programmer’s Execution Model Accessing Information Registers Memory Arithmetic.
C Prog. To Object Code text text binary binary Code in files p1.c p2.c
Machine-Level Programming I: Introduction Sept. 10, 2007 Topics Assembly Programmer’s Execution Model Accessing Information Registers Memory Arithmetic.
Machine-Level Programming I: Introduction Sept. 10, 2002 Topics Assembly Programmer’s Execution Model Accessing Information Registers Memory Arithmetic.
Carnegie Mellon 1 Machine-Level Programming I: Basics /18-213: Introduction to Computer Systems 5 th Lecture, Sep. 11, 2012 Instructors: Dave O’Hallaron,
1 Machine-Level Programming I: Basics Computer Systems Organization Andrew Case Slides adapted from Jinyang Li, Randy Bryant and Dave O’Hallaron.
Carnegie Mellon 1 Machine-Level Programming I: Basics Lecture, Feb. 14, 2013 These slides are from website which accompanies the.
Carnegie Mellon 1 Machine-Level Programming I: Basics /18-213: Introduction to Computer Systems 5 th Lecture, Jan 28, 2014 Instructors: Seth Copen.
Machine-Level Programming I: Topics History of Intel Processors – short...incomplete Assembly Programmer’s Execution Model Accessing Information Registers.
Lee CSCE 312 TAMU 1 Machine-Level Programming I: Basics Instructor: Dr. Hyunyoung Lee Based on slides provided by: Randy Bryant and Dave O’Hallaron.
Machine-Level Programming I: Introduction Topics Assembly Programmer’s Execution Model Accessing Information Registers Memory Arithmetic operations.
Machine-Level Programming I: Topics Assembly Programmer’s Execution Model Accessing Information Registers Memory Arithmetic operations X86.1.ppt CS 105.
Machine-Level Programming 1 Introduction Topics Assembly Programmer’s Execution Model Accessing Information Registers Memory Arithmetic operations.
Machine-Level Programming I: Introduction Topics Assembly Programmer’s Execution Model Accessing Information Registers Memory Arithmetic operations CS.
Carnegie Mellon 1 Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition Machine-Level Programming I: Basics Sep. 15, 2015.
University of Washington Roadmap 1 car *c = malloc(sizeof(car)); c->miles = 100; c->gals = 17; float mpg = get_mpg(c); free(c); Car c = new Car(); c.setMiles(100);
University of Washington Roadmap 1 car *c = malloc(sizeof(car)); c->miles = 100; c->gals = 17; float mpg = get_mpg(c); free(c); Car c = new Car(); c.setMiles(100);
Machine-Level Programming I: Introduction Topics Assembly Programmer’s Execution Model Accessing Information Registers Memory Arithmetic operations.
Carnegie Mellon 1 Machine-Level Programming I: Basics Slides based on from those of Bryant and O’Hallaron.
Lec 4Systems Architecture1 Systems Architecture Lecture 4: Compilers, Assemblers, Linkers & Loaders Jeremy R. Johnson Anatole D. Ruslanov William M. Mongan.
Machine-Level Programming I: Topics Assembly Programmer’s Execution Model Accessing Information Registers Memory Arithmetic operations CS 105 “Tour of.
University of Washington Basics of Machine Programming The Hardware/Software Interface CSE351 Winter 2013.
Machine-Level Programming I: Topics Assembly Programmer’s Execution Model Accessing Information Registers Memory Arithmetic operations CS 105 “Tour of.
Machine-Level Programming 1 Introduction Topics Assembly Programmer’s Execution Model Accessing Information Registers Memory Arithmetic operations.
Machine-Level Programming I: Introduction Topics Assembly Programmer’s Execution Model Accessing Information Registers Memory Arithmetic operations.
University of Washington x86 Programming I The Hardware/Software Interface CSE351 Winter 2013.
Carnegie Mellon 1 Machine-Level Programming I: Basics Lecture, Feb. 21, 2013 These slides are from website which accompanies the.
IA32 Stack –Region of memory managed with stack discipline –Grows toward lower addresses –Register %esp indicates lowest stack address address of top element.
Machine-Level Programming I: Introduction Jan. 30, 2001 Topics Assembly Programmer’s Execution Model Accessing Information –Registers –Memory Arithmetic.
Machine-Level Programming I: IA32 Assembly Language Topics Assembly Programmer’s Execution Model Accessing Information Registers Memory Arithmetic operations.
Spring 2016Machine Code & C Roadmap 1 car *c = malloc(sizeof(car)); c->miles = 100; c->gals = 17; float mpg = get_mpg(c); free(c); Car c = new Car(); c.setMiles(100);
X86 Machine-Level Programming I: Introduction Topics Assembly Programmer’s Execution Model Accessing Information Registers Memory Arithmetic operations.
1 Binghamton University Machine-Level Programming I: Basics CS220: Computer Systems II.
X86 Assembly - Data.
CS 3214 Computer Systems Lecture 2 Godmar Back.
ISAs and Microarchitectures
Instruction Set Architecture
Machine-Level Programming I:
Machine-Level Programming I:
Credits and Disclaimers
IA32 Processors Evolutionary Design
Roadmap C: Java: Assembly language: OS: Machine code: Computer system:
Machine-Level Programming 1 Introduction
C Prog. To Object Code text text binary binary Code in files p1.c p2.c
Machine-Level Programming I: Introduction Feb. 1, 2000
Machine-Level Programming I: Introduction
X86 Assembly - Data.
Machine-Level Programming: Introduction
Machine-Level Programming I: Basics
Machine-Level Programming I:
Machine-Level Programming I:
Machine-Level Programming I: Introduction Sept. 10, 2002
Presentation transcript:

II:1 X86 Assembly - Data

II:2 Admin Quiz?  What happened?  Make-up options? Several missing from lab Attendance at lab is required Passing grade without labs not possible Need pre-approval to switch lab groups (limited equipment) Labs done in room 335 CTB (except one or two later) Complete in assigned time block.

II:3 Intel x86 Processors Totally dominate desktop/laptop computer market  Dominate total market in $$$  Many more embedded systems in number of CPUs Evolutionary design  Backwards compatible up until 8086, introduced in 1978  Added more features as time goes on Complex instruction set computer (CISC)  Many different instructions with many different formats  But, only small subset encountered with Linux programs  Hard to match performance of Reduced Instruction Set Computers (RISC)  But, Intel has done just that!

II:4 RISC vs. CISC Used to be lists of CPUs, Now just two concepts CISC Complex Instruction Set  ASM level Instructions for everything  EG: 8086 divide  up to 120+ cycles to complete  Very powerful for modern languages  Historically older RISC Reduced Instruction set  One instruction, one clock  Fewer, simpler instructions  Very fast (predictable)  May require many instructions for same task Modern CPUs do some of both & pipelining

II:5 Intel x86 Evolution: Milestones NameDateTransistorsMHz K5-10  First 16-bit processor. Basis for IBM PC & DOS  1MB address space K16-33  First 32 bit processor, referred to as IA32  Added “flat addressing” (Prev block pointer + 16 bit pointer)  Capable of running Unix  32-bit Linux/gcc uses no instructions introduced in later models Pentium 4F M  First 64-bit processor  Meanwhile, Pentium 4s (Netburst arch.) phased out in favor of “Core” line

II:6 Intel x86 Processors: Overview X86-64 / EM64t X86-32/IA32 X Pentium Pentium MMX Pentium III Pentium 4 Pentium 4E Pentium 4F Core 2 Duo Core i7 IA: often redefined as latest Intel architecture time ArchitecturesProcessors MMX SSE SSE2 SSE3 SSE4

II:7 Intel x86 Processors, contd. Machine Evolution  M  Pentium M  Pentium/MMX M  PentiumPro M  Pentium III M  Pentium M  Core 2 Duo M Added Features  Instructions to support multimedia operations  Parallel operations on 1, 2, and 4-byte data, both integer & FP  Instructions to enable more efficient conditional operations Linux/GCC Evolution  Very limited

II:8 New Species: IA64 / Itanium / IPF NameDateTransistors Itanium200110M  First shot at 64-bit architecture: first called IA64  Radically new instruction set designed for high performance  Can run existing IA32 programs  On-board “x86 engine”  Joint project with Hewlett-Packard Itanium M  Big performance boost Itanium 2 Dual-Core B Itanium has not taken off in marketplace  Lack of backward compatibility, no good compiler support, Pentium 4 got too good

II:9 x86 Clones: Advanced Micro Devices (AMD) Historically  AMD has followed just behind Intel  A little bit slower, a lot cheaper Then  Recruited top circuit designers from Digital Equipment Corp. and other downward trending companies  Built Opteron: tough competitor to Pentium 4  Developed x86-64, their own 64-bit extension Recently  Comparable or better performance

II:10 x86 Clones: Advanced Micro Devices (AMD) Historically  AMD has followed just behind Intel  A little bit slower, a lot cheaper Then  Recruited top circuit designers from Digital Equipment Corp. and other downward trending companies  Built Opteron: tough competitor to Pentium 4  Developed x86-64, their own 64-bit extension Recently  Intel much quicker with dual core design  Intel currently ahead in performance  Intel’s EM64T backwards compatible to x86-64

II:11 Intel’s 64-Bit Intel Attempted Radical Shift from IA32 to IA64  Totally different architecture (Itanium)  Executes IA32 code only as legacy  Performance disappointing AMD Stepped in with Evolutionary Solution  x86-64 (now called “AMD64”) Intel Felt Obligated to Focus on IA64  Hard to admit mistake or that AMD is better 2004: Intel Announces EM64T extension to IA32  Extended Memory 64-bit Technology  Almost identical to x86-64! Meanwhile: EM64t well introduced, however, still often not used by OS, programs

II:12 Our Coverage IA32 (through Section 3.12)  The traditional x86 x86-64/EM64T (Section 3.13)  The emerging standard, if time allows

II:13 Definitions Architecture: (also instruction set architecture: ISA) The parts of a processor design that one needs to understand to write assembly code. Microarchitecture: Implementation of the architecture. Architecture examples: instruction set specification, registers. Microarchitecture examples: cache sizes, core frequency. Example ISAs (Intel): x86, IA, Itanium

II:14 CPU Assembly Programmer’s View Programmer-Visible State  PC: Program counter  Address of next instruction  Called “EIP” (IA32)  Register file  Heavily used program data  Condition codes  Store status information about most recent arithmetic operation  Used for conditional branching  Memory  Byte addressable array  Code, user data, (some) OS data  Includes stack used to support procedures PC Registers Memory Object Code Program Data OS Data Addresses Data Instructions Stack Condition Codes

II:15 Integer Registers (IA32) %eax %ecx %edx %ebx %esi %edi %esp %ebp %ax %cx %dx %bx %si %di %sp %bp %ah %ch %dh %bh %al %cl %dl %bl general purpose accumulate counter data base source index destination index stack pointer base pointer Origin (mostly obsolete)

II:16 Instruction Syntax Intel/Microsoft  Op, Dest/Src, Src simple: push ebp mov ebp, esp mov edx, DWORD PTR [ebp+8] mov eax, DWORD PTR [ebp+12] add eax, DWORD PTR [edx] mov DWORD PTR [edx], eax pop ebp ret AT&T/Linux/Gnu  Op, Src, Dest/Src simple: pushl %ebp movl %esp, %ebp movl 8(%ebp), %edx movl 12(%ebp), %eax addl (%edx), %eax movl %eax, (%edx) popl %ebp ret int simple(int *xp, int y) { int t = *xp + y; *xp = t; return t; }

II:17 text binary Compiler ( gcc -S ) Assembler ( gcc or as ) Linker ( gcc or ld ) C program ( p1.c p2.c ) ASM program ( p1.s p2.s ) Object program ( p1.o p2.o ) Executable program ( p ) Static libraries (.a ) Turning C into Object Code  Code in files p1.c p2.c  Compile with command: gcc -O p1.c p2.c -o p  Use optimizations ( -O )  Put resulting binary in file p

II:18 Compiling Into Assembly C Code int sum(int x, int y) { int t = x+y; return t; } Generated IA32 Assembly sum: pushl %ebp movl %esp,%ebp movl 12(%ebp),%eax addl 8(%ebp),%eax movl %ebp,%esp popl %ebp ret Obtain with command gcc -O -S code.c Produces file code.s Some compilers use single instruction “ leave ”

II:19 Difference between ASM and HLL. eg C ASM has one-to-one relationship w. machine code  Program in ASM  think like computer  Every CPU design has own ASM High Level Language (HLL) expresses ideas  Program in HLL  think in problem space  Many possible compilations of HLL to machine  C is HLL designed for Sys Architecture and HW interface  What are Java, Python, JavaScript, PHP each designed for?

II:20 Assembly Characteristics: Data Types “Integer” data of 1, 2, or 4 bytes  Data values  Addresses (untyped pointers) Floating point data of 4, 8, or 10 bytes No aggregate types such as arrays or structures  Just contiguously allocated bytes in memory

II:21 Assembly Characteristics: Operations Perform arithmetic function on register or memory data Transfer data between memory and register  Load data from memory into register  Store register data into memory Transfer control  Unconditional jumps to/from procedures  Conditional branches

II:22 Code for sum 0x : 0x55 0x89 0xe5 0x8b 0x45 0x0c 0x03 0x45 0x08 0x89 0xec 0x5d 0xc3 Object Code Assembler  Translates.s into.o  Binary encoding of each instruction  Nearly-complete image of executable code  Missing linkages between code in different files Linker  Resolves references between files  Combines with static run-time libraries  E.g., code for malloc, printf  Some libraries are dynamically linked  Linking occurs when program begins execution Total of 13 bytes Each instruction 1, 2, or 3 bytes Starts at address 0x sum: pushl %ebp movl %esp,%ebp movl 12(%ebp),%eax addl 8(%ebp),%eax movl %ebp,%esp popl %ebp ret

II:23 Machine Instruction Example C Code  Add two signed integers Assembly  Add 2 4-byte integers  “Long” words in GCC parlance  Same instruction whether signed or unsigned  Operands: x :Register %eax y :MemoryM[ %ebp+8] t :Register %eax –Return function value in %eax Object Code  3-byte instruction  Stored at address 0x int t = x+y; addl 8(%ebp),%eax 0x401046: Similar to expression: x += y More precisely: int eax; int *ebp; eax += ebp[2]

II:24 Disassembled : 0:55 push %ebp 1:89 e5 mov %esp,%ebp 3:8b 45 0c mov 0xc(%ebp),%eax 6: add 0x8(%ebp),%eax 9:89 ec mov %ebp,%esp b:5d pop %ebp c:c3 ret d:8d lea 0x0(%esi),%esi Disassembling Object Code Disassembler objdump -d p  Useful tool for examining object code  Analyzes bit pattern of series of instructions  Produces approximate rendition of assembly code  Can be run on either a.out (complete executable) or.o file

II:25 Disassembled 0x :push %ebp 0x :mov %esp,%ebp 0x :mov 0xc(%ebp),%eax 0x :add 0x8(%ebp),%eax 0x :mov %ebp,%esp 0x40104b :pop %ebp 0x40104c :ret 0x40104d :lea 0x0(%esi),%esi Alternate Disassembly Within gdb Debugger gdb p disassemble sum  Disassemble procedure x/13b sum  Examine the 13 bytes starting at sum Object 0x401040: 0x55 0x89 0xe5 0x8b 0x45 0x0c 0x03 0x45 0x08 0x89 0xec 0x5d 0xc3

II:26 What Can be Disassembled? Anything that can be interpreted as executable code Disassembler examines bytes and reconstructs assembly source % objdump -d WINWORD.EXE WINWORD.EXE: file format pei-i386 No symbols in "WINWORD.EXE". Disassembly of section.text: : :55 push %ebp :8b ec mov %esp,%ebp :6a ff push $0xffffffff : push $0x a:68 91 dc 4c 30 push $0x304cdc91

II:27 Administrivia HW1 due midnight tonight  Paper to locker, file to blackboard Lab2  Can anybody switch to Wednesday session? Quiz1  On blackboard, due Wednesday in class  Print it out, work on paper, bring to class, grade in class  Open book Remind me if you don’t see in the lecture slide online

II:28 Integer Registers (IA32) %eax %ecx %edx %ebx %esi %edi %esp %ebp %ax %cx %dx %bx %si %di %sp %bp %ah %ch %dh %bh %al %cl %dl %bl general purpose accumulate counter data base source index destination index stack pointer base pointer Origin (mostly obsolete)

II:29 Moving Data: IA32 Moving Data  movx Source, Dest  x in { b, w, l }  movl Source, Dest: Move 4-byte “long word” or “double word”  movw Source, Dest: Move 2-byte “word”  movb Source, Dest: Move 1-byte “byte” Lots of these in typical code %eax %ecx %edx %ebx %esi %edi %esp %ebp

II:30 Moving Data: IA32 Moving Data movl Source, Dest: Operand Types  Immediate: Constant integer data  Example: $0x400, $-533  Like C constant, but prefixed with ‘$’  Encoded with 1, 2, or 4 bytes  Register: One of 8 integer registers  Example: %eax, %edx  But %esp and %ebp reserved for special use  Others have special uses for particular instructions  Memory: 4 consecutive bytes of memory at address given by register  Simplest example: (%eax)  Various other “address modes” %eax %ecx %edx %ebx %esi %edi %esp %ebp

II:31 movl Operand Combinations Cannot do memory-memory transfer with a single instruction movl Imm Reg Mem Reg Mem Reg Mem Reg SourceDestC Analog movl $0x4,%eaxtemp = 0x4; movl $-147,(%eax)*p = -147; movl %eax,%edxtemp2 = temp1; movl %eax,(%edx)*p = temp; movl (%eax),%edxtemp = *p; Src,Dest

II:32 Simple Memory Addressing Modes Normal(R)Mem[Reg[R]]  Register R specifies memory address movl (%ecx),%eax DisplacementD(R)Mem[Reg[R]+D]  Register R specifies start of memory region  Constant displacement D specifies offset movl 8(%ebp),%edx

II:33 Using Simple Addressing Modes void swap(int *xp, int *yp) { int t0 = *xp; int t1 = *yp; *xp = t1; *yp = t0; } swap: pushl %ebp movl %esp,%ebp pushl %ebx movl 12(%ebp),%ecx movl 8(%ebp),%edx movl (%ecx),%eax movl (%edx),%ebx movl %eax,(%edx) movl %ebx,(%ecx) movl -4(%ebp),%ebx movl %ebp,%esp popl %ebp ret Body Set Up Finish int main(...) { int x = 3; int y = 5; swap(&x, &y); return 1; }

II:34 Using Simple Addressing Modes void swap(int *xp, int *yp) { int t0 = *xp; int t1 = *yp; *xp = t1; *yp = t0; } swap: pushl %ebp movl %esp,%ebp pushl %ebx movl 12(%ebp),%ecx movl 8(%ebp),%edx movl (%ecx),%eax movl (%edx),%ebx movl %eax,(%edx) movl %ebx,(%ecx) movl -4(%ebp),%ebx movl %ebp,%esp popl %ebp ret Body Set Up Finish

II:35 Understanding Swap void swap(int *xp, int *yp) { int t0 = *xp; int t1 = *yp; *xp = t1; *yp = t0; } # COMMENTS movl 12(%ebp),%ecx# ecx = yp movl 8(%ebp),%edx# edx = xp movl (%ecx),%eax# eax = *yp (t1) movl (%edx),%ebx# ebx = *xp (t0) movl %eax,(%edx)# *xp = eax movl %ebx,(%ecx)# *yp = ebx Stack (in memory) RegisterValue %ecxyp %edxxp %eaxt1 %ebxt0 yp xp Rtn adr Old % ebp %ebp Offset Old % ebx -4

II:36 Understanding Swap movl 12(%ebp),%ecx# ecx = yp movl 8(%ebp),%edx# edx = xp movl (%ecx),%eax# eax = *yp (t1) movl (%edx),%ebx# ebx = *xp (t0) movl %eax,(%edx)# *xp = eax movl %ebx,(%ecx)# *yp = ebx 0x120 0x124 Rtn adr %ebp Offset Address 0x124 0x120 0x11c 0x118 0x114 0x110 0x10c 0x108 0x104 0x100 yp xp %eax %edx %ecx %ebx %esi %edi %esp %ebp0x104

II:37 movl 12(%ebp),%ecx# ecx = yp movl 8(%ebp),%edx# edx = xp movl (%ecx),%eax# eax = *yp (t1) movl (%edx),%ebx# ebx = *xp (t0) movl %eax,(%edx)# *xp = eax movl %ebx,(%ecx)# *yp = ebx Understanding Swap 0x120 0x124 Rtn adr %ebp Offset Address 0x124 0x120 0x11c 0x118 0x114 0x110 0x10c 0x108 0x104 0x100 yp xp %eax %edx %ecx %ebx %esi %edi %esp %ebp0x104 0x120

II:38 Understanding Swap 0x120 0x124 Rtn adr %ebp Offset Address 0x124 0x120 0x11c 0x118 0x114 0x110 0x10c 0x108 0x104 0x100 yp xp %eax %edx %ecx %ebx %esi %edi %esp %ebp 0x120 0x104 movl 12(%ebp),%ecx# ecx = yp movl 8(%ebp),%edx# edx = xp movl (%ecx),%eax# eax = *yp (t1) movl (%edx),%ebx# ebx = *xp (t0) movl %eax,(%edx)# *xp = eax movl %ebx,(%ecx)# *yp = ebx 0x124

II:39 Understanding Swap 0x120 0x124 Rtn adr %ebp Offset Address 0x124 0x120 0x11c 0x118 0x114 0x110 0x10c 0x108 0x104 0x100 yp xp %eax %edx %ecx %ebx %esi %edi %esp %ebp 0x124 0x120 0x104 movl 12(%ebp),%ecx# ecx = yp movl 8(%ebp),%edx# edx = xp movl (%ecx),%eax# eax = *yp (t1) movl (%edx),%ebx# ebx = *xp (t0) movl %eax,(%edx)# *xp = eax movl %ebx,(%ecx)# *yp = ebx 456

II:40 Understanding Swap 0x120 0x124 Rtn adr %ebp Offset Address 0x124 0x120 0x11c 0x118 0x114 0x110 0x10c 0x108 0x104 0x100 yp xp %eax %edx %ecx %ebx %esi %edi %esp %ebp 456 0x124 0x120 0x104 movl 12(%ebp),%ecx# ecx = yp movl 8(%ebp),%edx# edx = xp movl (%ecx),%eax# eax = *yp (t1) movl (%edx),%ebx# ebx = *xp (t0) movl %eax,(%edx)# *xp = eax movl %ebx,(%ecx)# *yp = ebx 123

II: Understanding Swap 0x120 0x124 Rtn adr %ebp Offset -4 Address 0x124 0x120 0x11c 0x118 0x114 0x110 0x10c 0x108 0x104 0x100 yp xp %eax %edx %ecx %ebx %esi %edi %esp %ebp 456 0x124 0x x104 movl 12(%ebp),%ecx# ecx = yp movl 8(%ebp),%edx# edx = xp movl (%ecx),%eax# eax = *yp (t1) movl (%edx),%ebx# ebx = *xp (t0) movl %eax,(%edx)# *xp = eax movl %ebx,(%ecx)# *yp = ebx

II:42 Understanding Swap 0x120 0x124 Rtn adr %ebp Offset Address 0x124 0x120 0x11c 0x118 0x114 0x110 0x10c 0x108 0x104 0x100 yp xp %eax %edx %ecx %ebx %esi %edi %esp %ebp 456 0x124 0x120 0x104 movl 12(%ebp),%ecx# ecx = yp movl 8(%ebp),%edx# edx = xp movl (%ecx),%eax# eax = *yp (t1) movl (%edx),%ebx# ebx = *xp (t0) movl %eax,(%edx)# *xp = eax movl %ebx,(%ecx)# *yp = ebx 123

II:43 Complete Memory Addressing Modes Most General Form D(Rb,Ri,S)Mem[Reg[Rb]+S*Reg[Ri]+ D]  D: Constant “displacement” 1, 2, or 4 bytes  Rb: Base register: Any of 8 integer registers  Ri:Index register: Any, except for %esp  Unlikely you’d use %ebp, either  S: Scale: 1, 2, 4, or 8 (why these numbers?) Special Cases (Rb,Ri)Mem[Reg[Rb]+Reg[Ri]] D(Rb,Ri)Mem[Reg[Rb]+Reg[Ri]+D] (Rb,Ri,S)Mem[Reg[Rb]+S*Reg[Ri]]

II:44 Address Computation Examples %edx %ecx 0xf000 0x100 ExpressionAddress ComputationAddress 0x8(%edx)0xf x80xf008 (%edx,%ecx)0xf x1000xf100 (%edx,%ecx,4)0xf *0x1000xf400 0x80(,%edx,2)2*0xf x800x1e080

II:45 Peer Instruction – Addressing mode Textbook practice problem

II:46 Address Computation Instruction leal Src,Dest  Src is address mode expression  Set Dest to address denoted by expression Uses  Computing addresses without a memory reference  E.g., translation of p = &x[i];  Computing arithmetic expressions of the form x + k*y  k = 1, 2, 4, or 8

II:47 Some Arithmetic Operations Two Operand Instructions: FormatComputation addl Src,DestDest = Dest + Src subl Src,DestDest = Dest - Src imull Src,DestDest = Dest * Src sall Src,DestDest = Dest << SrcAlso called shll sarl Src,DestDest = Dest >> SrcArithmetic shrl Src,DestDest = Dest >> SrcLogical xorl Src,DestDest = Dest ^ Src andl Src,DestDest = Dest & Src orl Src,DestDest = Dest | Src No distinction between signed and unsigned int (why?)

II:48 Some Arithmetic Operations One Operand Instructions incl DestDest = Dest + 1 decl DestDest = Dest - 1 negl DestDest = -Dest notl DestDest = ~Dest See book for more instructions

II:49 Using leal for Arithmetic Expressions int arith (int x, int y, int z) { int t1 = x+y; int t2 = z+t1; int t3 = x+4; int t4 = y * 48; int t5 = t3 + t4; int rval = t2 * t5; return rval; } arith: pushl %ebp movl %esp,%ebp movl 8(%ebp),%eax movl 12(%ebp),%edx leal (%edx,%eax),%ecx leal (%edx,%edx,2),%edx sall $4,%edx addl 16(%ebp),%ecx leal 4(%edx,%eax),%eax imull %ecx,%eax movl %ebp,%esp popl %ebp ret Body Set Up Finish

II:50 Understanding arith int arith (int x, int y, int z) { int t1 = x+y; int t2 = z+t1; int t3 = x+4; int t4 = y * 48; int t5 = t3 + t4; int rval = t2 * t5; return rval; } movl 8(%ebp),%eax# eax = x movl 12(%ebp),%edx# edx = y leal (%edx,%eax),%ecx# ecx = x+y (t1) leal (%edx,%edx,2),%edx# edx = 3*y sall $4,%edx# edx = 48*y (t4) addl 16(%ebp),%ecx# ecx = z+t1 (t2) leal 4(%edx,%eax),%eax# eax = 4+t4+x (t5) imull %ecx,%eax# eax = t5*t2 (rval) y x Rtn adr Old % ebp %ebp Offset Stack z 16

II:51 Understanding arith int arith (int x, int y, int z) { int t1 = x+y; int t2 = z+t1; int t3 = x+4; int t4 = y * 48; int t5 = t3 + t4; int rval = t2 * t5; return rval; } movl 8(%ebp),%eax# eax = x movl 12(%ebp),%edx# edx = y leal (%edx,%eax),%ecx# ecx = x+y (t1) leal (%edx,%edx,2),%edx# edx = 3*y sall $4,%edx# edx = 48*y (t4) addl 16(%ebp),%ecx# ecx = z+t1 (t2) leal 4(%edx,%eax),%eax# eax = 4+t4+x (t5) imull %ecx,%eax# eax = t5*t2 (rval) y x Rtn adr Old % ebp %ebp Offset Stack z 16

II:52 Understanding arith int arith (int x, int y, int z) { int t1 = x+y; int t2 = z+t1; int t3 = x+4; int t4 = y * 48; int t5 = t3 + t4; int rval = t2 * t5; return rval; } movl 8(%ebp),%eax# eax = x movl 12(%ebp),%edx# edx = y leal (%edx,%eax),%ecx# ecx = x+y (t1) leal (%edx,%edx,2),%edx# edx = 3*y sall $4,%edx# edx = 48*y (t4)(48=3x16) addl 16(%ebp),%ecx# ecx = z+t1 (t2) leal 4(%edx,%eax),%eax# eax = 4+t4+x (t5) imull %ecx,%eax# eax = t5*t2 (rval) y x Rtn adr Old % ebp %ebp Offset Stack z 16

II:53 Understanding arith int arith (int x, int y, int z) { int t1 = x+y; int t2 = z+t1; int t3 = x+4; int t4 = y * 48; int t5 = t3 + t4; int rval = t2 * t5; return rval; } movl 8(%ebp),%eax# eax = x movl 12(%ebp),%edx# edx = y leal (%edx,%eax),%ecx# ecx = x+y (t1) leal (%edx,%edx,2),%edx# edx = 3*y sall $4,%edx# edx = 48*y (t4) addl 16(%ebp),%ecx# ecx = z+t1 (t2) leal 4(%edx,%eax),%eax# eax = 4+t4+x (t5) imull %ecx,%eax# eax = t5*t2 (rval) y x Rtn adr Old % ebp %ebp Offset Stack z 16

II:54 Another Example int logical(int x, int y) { int t1 = x^y; int t2 = t1 >> 17; int mask = (1<<13) - 7; int rval = t2 & mask; return rval; } logical: pushl %ebp movl %esp,%ebp movl 8(%ebp),%eax xorl 12(%ebp),%eax sarl $17,%eax andl $8185,%eax movl %ebp,%esp popl %ebp ret Body Set Up Finish movl 8(%ebp),%eax# eax = x xorl 12(%ebp),%eax# eax = x^y sarl $17,%eax# eax = t1>>17 andl $8185,%eax# eax = t2 & 8185

II:55 Another Example int logical(int x, int y) { int t1 = x^y; int t2 = t1 >> 17; int mask = (1<<13) - 7; int rval = t2 & mask; return rval; } logical: pushl %ebp movl %esp,%ebp movl 8(%ebp),%eax xorl 12(%ebp),%eax sarl $17,%eax andl $8185,%eax movl %ebp,%esp popl %ebp ret movl 8(%ebp),%eax# eax = x xorl 12(%ebp),%eax# eax = x^y(t1) sarl $17,%eax# eax = t1>>17(t2) andl $8185,%eax# eax = t2 & 8185 Body Set Up Finish

II:56 Another Example int logical(int x, int y) { int t1 = x^y; int t2 = t1 >> 17; int mask = (1<<13) - 7; int rval = t2 & mask; return rval; } logical: pushl %ebp movl %esp,%ebp movl 8(%ebp),%eax xorl 12(%ebp),%eax sarl $17,%eax andl $8185,%eax movl %ebp,%esp popl %ebp ret movl 8(%ebp),%eax# eax = x xorl 12(%ebp),%eax# eax = x^y(t1) sarl $17,%eax# eax = t1>>17(t2) andl $8185,%eax# eax = t2 & 8185 Body Set Up Finish

II:57 Another Example int logical(int x, int y) { int t1 = x^y; int t2 = t1 >> 17; int mask = (1<<13) - 7; int rval = t2 & mask; return rval; } logical: pushl %ebp movl %esp,%ebp movl 8(%ebp),%eax xorl 12(%ebp),%eax sarl $17,%eax andl $8185,%eax movl %ebp,%esp popl %ebp ret movl 8(%ebp),%eax# eax = x xorl 12(%ebp),%eax# eax = x^y(t1) sarl $17,%eax# eax = t1>>17(t2) andl $8185,%eax# eax = t2 & = 8192, 2 13 – 7 = 8185 Body Set Up Finish