Machine- Level Programming III: Switch Statements and IA32 Procedures

Similar documents
Sungkyunkwan University

IA32 Stack. Stack BoDom. Region of memory managed with stack discipline Grows toward lower addresses. Register %esp contains lowest stack address

Machine- Level Programming III: Switch Statements and IA32 Procedures

Machine- Level Programming II: Control Structures and Procedures

Machine- Level Programming II: Control Structures and Procedures

Region of memory managed with stack discipline Grows toward lower addresses. Register %esp contains lowest stack address = address of top element

CS213. Machine-Level Programming III: Procedures

Assembly III: Procedures. Jin-Soo Kim Computer Systems Laboratory Sungkyunkwan University

ASSEMBLY III: PROCEDURES. Jo, Heeseung

Assembly III: Procedures. Jo, Heeseung

The course that gives CMU its Zip! Machine-Level Programming III: Procedures Sept. 17, 2002

Machine Representa/on of Programs: Control Flow cont d. Previous lecture. Do- While loop. While- Do loop CS Instructors: Sanjeev Se(a

Machine-level Programming (3)

Machine Level Programming II: Arithmetic &Control

X86 Assembly -Procedure II:1

Machine- Level Representa2on: Procedure

Machine-Level Programming III: Procedures

This is a medical robot, guided by a skilled surgeon and designed to get to places doctors are unable to reach without opening a pacent up.

Systems I. Machine-Level Programming V: Procedures

%r8 %r8d. %r9 %r9d. %r10 %r10d. %r11 %r11d. %r12 %r12d. %r13 %r13d. %r14 %r14d %rbp. %r15 %r15d. Sean Barker

Control flow. Condition codes Conditional and unconditional jumps Loops Switch statements

Machine-Level Programming II: Control Flow

hnp://

Giving credit where credit is due

Machine-Level Programming III: Procedures

Stack Discipline Jan. 19, 2018

An Experience Like No Other. Stack Discipline Aug. 30, 2006

Systems Programming and Computer Architecture ( )

Bryant and O Hallaron, Computer Systems: A Programmer s Perspective, Third Edition. Carnegie Mellon

Machine Programming 3: Procedures

IA32 Stack. Lecture 5 Machine-Level Programming III: Procedures. IA32 Stack Popping. IA32 Stack Pushing. Topics. Pushing. Popping

IA32 Stack The course that gives CMU its Zip! Machine-Level Programming III: Procedures Sept. 17, IA32 Stack Popping. IA32 Stack Pushing

University of Washington

Machine-level Programs Procedure

CS 33. Machine Programming (2) CS33 Intro to Computer Systems XII 1 Copyright 2017 Thomas W. Doeppner. All rights reserved.

Three Kinds of Instruc;ons

Machine-Level Programming II: Control

Machine-Level Programming II: Control

Data Representa/ons: IA32 + x86-64

CS429: Computer Organization and Architecture

Turning C into Object Code Code in files p1.c p2.c Compile with command: gcc -O p1.c p2.c -o p Use optimizations (-O) Put resulting binary in file p

CS 33. Machine Programming (2) CS33 Intro to Computer Systems XI 1 Copyright 2018 Thomas W. Doeppner. All rights reserved.

Procedure Calls. Young W. Lim Sat. Young W. Lim Procedure Calls Sat 1 / 27

Assembly Programming IV

x86-64 Programming III & The Stack

Assembly I: Basic Operations. Jo, Heeseung

ASSEMBLY I: BASIC OPERATIONS. Jo, Heeseung

Machine-Level Programming (2)

Assembly I: Basic Operations. Computer Systems Laboratory Sungkyunkwan University

Assembly Programmer s View Lecture 4A Machine-Level Programming I: Introduction

Assembly Language: Function Calls" Goals of this Lecture"

Condition Codes. Lecture 4B Machine-Level Programming II: Control Flow. Setting Condition Codes (cont.) Setting Condition Codes (cont.

Procedure Calls. Young W. Lim Mon. Young W. Lim Procedure Calls Mon 1 / 29

Assembly Language: Function Calls

Compiling C Programs Into X86-64 Assembly Programs

Machine-Level Programming I: Introduction Jan. 30, 2001

Machine Programming 1: Introduction

Assembly Language: Function Calls" Goals of this Lecture"

Procedures and the Call Stack

CS241 Computer Organization Spring Addresses & Pointers

Mechanisms in Procedures. CS429: Computer Organization and Architecture. x86-64 Stack. x86-64 Stack Pushing

CS241 Computer Organization Spring 2015 IA

CF Carry Flag SF Sign Flag ZF Zero Flag OF Overflow Flag. ! CF set if carry out from most significant bit. "Used to detect unsigned overflow

Assembly II: Control Flow. Jin-Soo Kim Computer Systems Laboratory Sungkyunkwan University

ASSEMBLY II: CONTROL FLOW. Jo, Heeseung

Giving credit where credit is due

1 /* file cpuid2.s */ 4.asciz "The processor Vendor ID is %s \n" 5.section.bss. 6.lcomm buffer, section.text. 8.globl _start.

Assembly III: Procedures. Jin-Soo Kim Computer Systems Laboratory Sungkyunkwan University

Page 1. IA32 Stack CISC 360. Machine-Level Programming III: Procedures Sept. 22, IA32 Stack Popping Stack Bottom. IA32 Stack Pushing

Instructor: Alvin R. Lebeck

Condition Codes The course that gives CMU its Zip! Machine-Level Programming II Control Flow Sept. 13, 2001 Topics

Outline. Review: Assembly/Machine Code View. Processor State (x86-64, Par2al) Condi2on Codes (Explicit Se^ng: Compare) Condi2on Codes (Implicit Se^ng)

CS367. Program Control

x86-64 Programming III

Function Calls COS 217. Reading: Chapter 4 of Programming From the Ground Up (available online from the course Web site)

Chapter 3 Machine-Level Programming II Control Flow

Assembly Programming IV

Assembly Language: Function Calls. Goals of this Lecture. Function Call Problems

Process Layout and Function Calls

Machine- level Programming II: Control Flow

Page # CISC 360. Machine-Level Programming I: Introduction Sept. 18, IA32 Processors. X86 Evolution: Programmerʼs View.

CISC 360. Machine-Level Programming I: Introduction Sept. 18, 2008

CISC 360. Machine-Level Programming II: Control Flow Sep 23, 2008

Page # CISC 360. Machine-Level Programming II: Control Flow Sep 23, Condition Codes. Setting Condition Codes (cont.)

CSC 2400: Computing Systems. X86 Assembly: Function Calls"

The Hardware/Software Interface CSE351 Spring 2013

Machine-Level Programming II: Control and Arithmetic

Credits to Randy Bryant & Dave O Hallaron

Machine- level Programming

The Hardware/Software Interface CSE351 Spring 2015

Machine-Level Programming II: Control

Machine-Level Programming II: Arithmetic & Control /18-243: Introduction to Computer Systems 6th Lecture, 5 June 2012

Machine- Level Programming II: Arithme6c & Control

Machine Program: Procedure. Zhaoguo Wang

Sungkyunkwan University

Meet & Greet! Come hang out with your TAs and Fellow Students (& eat free insomnia cookies) When : TODAY!! 5-6 pm Where : 3rd Floor Atrium, CIT

The course that gives CMU its Zip! Machine-Level Programming I: Introduction Sept. 10, 2002

System Programming and Computer Architecture (Fall 2009)

Instruction Set Architecture

Instruction Set Architecture

Transcription:

Machine- Level Programming III: Switch Statements and IA32 Procedures 15-213: Introduc;on to Computer Systems 6 th Lecture, Sep. 9, 2010 Instructors: Randy Bryant and Dave O Hallaron

Today Switch statements IA 32 Procedures Stack Structure Calling Conven;ons Illustra;ons of Recursion & Pointers 2

long switch_eg (long x, long y, long z) { long w = 1; switch(x) { case 1: w = y*z; break; case 2: w = y/z; /* Fall Through */ case 3: w += z; break; case 5: case 6: w -= z; break; default: w = 2; return w; Switch Statement Example MulBple case labels Here: 5 & 6 Fall through cases Here: 2 Missing cases Here: 4 Carnegie Mellon 3

Jump Table Structure Switch Form switch(x) { case val_0: Block 0 case val_1: Block 1 case val_n-1: Block n 1 jtab: Jump Table Targ0 Targ1 Targ2 Targn-1 Targ0: Targ1: Targ2: Jump Targets Code Block 0 Code Block 1 Code Block 2 Approximate Transla;on target = JTab[x]; goto *target; Targn-1: Code Block n 1 4

Switch Statement Example (IA32) long switch_eg(long x, long y, long z) { long w = 1; switch(x) {... return w; Setup: What range of values takes default? switch_eg: pushl %ebp # Setup movl, %ebp # Setup movl 8(%ebp), %eax # %eax = x cmpl $6, %eax # Compare x:6 ja.l2 # If unsigned > goto default jmp *.L7(,%eax,4) # Goto *JTab[x] Note that w not ini;alized here 5

Switch Statement Example (IA32) long switch_eg(long x, long y, long z) { long w = 1; switch(x) {... return w; Setup: Indirect jump Jump table.section.rodata.align 4.L7:.long.L2 # x = 0.long.L3 # x = 1.long.L4 # x = 2.long.L5 # x = 3.long.L2 # x = 4.long.L6 # x = 5.long.L6 # x = 6 switch_eg: pushl %ebp # Setup movl, %ebp # Setup movl 8(%ebp), %eax # eax = x cmpl $6, %eax # Compare x:6 ja.l2 # If unsigned > goto default jmp *.L7(,%eax,4) # Goto *JTab[x] 6

Assembly Setup ExplanaBon Table Structure Each target requires 4 bytes Base address at.l7 Jumping Direct: jmp.l2 Jump target is denoted by label.l2 Jump table.section.rodata.align 4.L7:.long.L2 # x = 0.long.L3 # x = 1.long.L4 # x = 2.long.L5 # x = 3.long.L2 # x = 4.long.L6 # x = 5.long.L6 # x = 6 Indirect: jmp *.L7(,%eax,4) Start of jump table:.l7 Must scale by factor of 4 (labels have 32- bits = 4 Bytes on IA32) Fetch target from effec;ve Address.L7 + eax*4 Only for 0 x 6 7

Jump Table Jump table.section.rodata.align 4.L7:.long.L2 # x = 0.long.L3 # x = 1.long.L4 # x = 2.long.L5 # x = 3.long.L2 # x = 4.long.L6 # x = 5.long.L6 # x = 6 switch(x) { case 1: //.L3 w = y*z; break; case 2: //.L4 w = y/z; /* Fall Through */ case 3: //.L5 w += z; break; case 5: case 6: //.L6 w -= z; break; default: //.L2 w = 2; 8

Handling Fall- Through long w = 1;... switch(x) {... case 2: w = y/z; /* Fall Through */ case 3: w += z; break;... case 3: w = 1; goto merge; merge: case 2: w = y/z; w += z; 9

Code Blocks (ParBal) switch(x) { case 1: //.L3 w = y*z; break;... case 3: //.L5 w += z; break;... default: //.L2 w = 2;.L2: # Default movl $2, %eax # w = 2 jmp.l8 # Goto done.l5: # x == 3 movl $1, %eax # w = 1 jmp.l9 # Goto merge.l3: # x == 1 movl 16(%ebp), %eax # z imull 12(%ebp), %eax # w = y*z jmp.l8 # Goto done 10

Code Blocks (Rest) switch(x) {... case 2: //.L4 w = y/z; /* Fall Through */ merge: //.L9 w += z; break; case 5: case 6: //.L6 w -= z; break;.l4: # x == 2 movl 12(%ebp), %edx movl %edx, %eax sarl $31, %edx idivl 16(%ebp) # w = y/z.l9: # merge: addl 16(%ebp), %eax # w += z jmp.l8 # goto done.l6: # x == 5, 6 movl $1, %eax # w = 1 subl 16(%ebp), %eax # w = 1-z 11

Switch Code (Finish) return w;.l8: # done: popl %ebp ret Noteworthy Features Jump table avoids sequencing through cases Constant ;me, rather than linear Use jump table to handle holes and duplicate tags Use program sequencing to handle fall- through Don t ini;alize w = 1 unless really need it 12

x86-64 Switch ImplementaBon Same general idea, adapted to 64- bit code Table entries 64 bits (pointers) Cases use revised code switch(x) { case 1: //.L3 w = y*z; break;....l3: movq %rdx, %rax imulq %rsi, %rax ret Jump Table.section.rodata.align 8.L7:.quad.L2 # x = 0.quad.L3 # x = 1.quad.L4 # x = 2.quad.L5 # x = 3.quad.L2 # x = 4.quad.L6 # X = 5.quad.L6 # x = 6 13

IA32 Object Code Setup Label.L2 becomes address 0x8048422 Label.L7 becomes address 0x8048660 Assembly Code switch_eg:... ja.l2 # If unsigned > goto default jmp *.L7(,%eax,4) # Goto *JTab[x] Disassembled Object Code 08048410 <switch_eg>:... 8048419:77 07 ja 8048422 <switch_eg+0x12> 804841b:ff 24 85 60 86 04 08 jmp *0x8048660(,%eax,4) 14

IA32 Object Code (cont.) Jump Table Doesn t show up in disassembled code Can inspect using GDB gdb switch (gdb) x/7xw 0x8048660 Examine 7 hexadecimal format words (4- bytes each) Use command help x to get format documenta;on 0x8048660: 0x08048422 0x08048432 0x0804843b 0x08048429 0x8048670: 0x08048422 0x0804844b 0x0804844b 15

IA32 Object Code (cont.) Deciphering Jump Table 0x8048660: 0x08048422 0x08048432 0x0804843b 0x08048429 0x8048670: 0x08048422 0x0804844b 0x0804844b Address Value x 0x8048660 0x8048422 0 0x8048664 0x8048432 1 0x8048668 0x804843b 2 0x804866c 0x8048429 3 0x8048670 0x8048422 4 0x8048674 0x804844b 5 0x8048678 0x804844b 6 16

Disassembled Targets 8048422: b8 02 00 00 00 mov $0x2,%eax 8048427: eb 2a jmp 8048453 <switch_eg+0x43> 8048429: b8 01 00 00 00 mov $0x1,%eax 804842e: 66 90 xchg %ax,%ax # noop 8048430: eb 14 jmp 8048446 <switch_eg+0x36> 8048432: 8b 45 10 mov 0x10(%ebp),%eax 8048435: 0f af 45 0c imul 0xc(%ebp),%eax 8048439: eb 18 jmp 8048453 <switch_eg+0x43> 804843b: 8b 55 0c mov 0xc(%ebp),%edx 804843e: 89 d0 mov %edx,%eax 8048440: c1 fa 1f sar $0x1f,%edx 8048443: f7 7d 10 idivl 0x10(%ebp) 8048446: 03 45 10 add 0x10(%ebp),%eax 8048449: eb 08 jmp 8048453 <switch_eg+0x43> 804844b: b8 01 00 00 00 mov $0x1,%eax 8048450: 2b 45 10 sub 0x10(%ebp),%eax 8048453: 5d pop %ebp 8048454: c3 ret 17

Matching Disassembled Targets Value 0x8048422 0x8048432 0x804843b 0x8048429 0x8048422 0x804844b 0x804844b 8048422: mov $0x2,%eax 8048427: jmp 8048453 <switch_eg+0x43> 8048429: mov $0x1,%eax 804842e: xchg %ax,%ax 8048430: jmp 8048446 <switch_eg+0x36> 8048432: mov 0x10(%ebp),%eax 8048435: imul 0xc(%ebp),%eax 8048439: jmp 8048453 <switch_eg+0x43> 804843b: mov 0xc(%ebp),%edx 804843e: mov %edx,%eax 8048440: sar $0x1f,%edx 8048443: idivl 0x10(%ebp) 8048446: add 0x10(%ebp),%eax 8048449: jmp 8048453 <switch_eg+0x43> 804844b: mov $0x1,%eax 8048450: sub 0x10(%ebp),%eax 8048453: pop %ebp 8048454: ret 18

Summarizing C Control if- then- else do- while while, for switch Assembler Control Condi;onal jump Condi;onal move Indirect jump Compiler generates code sequence to implement more complex control Standard Techniques Loops converted to do- while form Large switch statements use jump tables Sparse switch statements may use decision trees 19

Today Switch statements IA 32 Procedures Stack Structure Calling Conven;ons Illustra;ons of Recursion & Pointers 20

IA32 Stack Region of memory managed with stack discipline Grows toward lower addresses Stack Boiom Increasing Addresses Register contains lowest stack address address of top element Stack Pointer: Stack Grows Down Stack Top 21

IA32 Stack: Push pushl Src Fetch operand at Src Decrement by 4 Write operand at address given by Stack Boiom Increasing Addresses Stack Pointer: - 4 Stack Grows Down Stack Top 22

IA32 Stack: Pop Stack Boiom Increasing Addresses Stack Pointer: +4 Stack Grows Down Stack Top 23

Procedure Control Flow Use stack to support procedure call and return Procedure call: call label Push return address on stack Jump to label Return address: Address of the next instruc;on right aker call Example from disassembly 804854e: e8 3d 06 00 00 call 8048b90 <main> 8048553: 50 pushl %eax Return address = 0x8048553 Procedure return: ret Pop address from stack Jump to address 24

Procedure Call Example 804854e: e8 3d 06 00 00 call 8048b90 <main> 8048553: 50 pushl %eax call 8048b90 0x110 0x110 0x10c 0x10c 0x108 123 0x108 123 0x104 0x8048553 0x108 0x104 %eip 0x804854e %eip 0x8048b90 %eip: program counter 25

Procedure Return Example 8048591: c3 ret ret 0x110 0x110 0x10c 0x10c 0x108 123 0x108 123 0x104 0x8048553 0x8048553 0x104 0x108 %eip 0x8048591 %eip 0x8048553 %eip: program counter 26

Stack- Based Languages Languages that support recursion e.g., C, Pascal, Java Code must be Reentrant Mul;ple simultaneous instan;a;ons of single procedure Need some place to store state of each instan;a;on Arguments Local variables Return pointer Stack discipline State for given procedure needed for limited ;me From when called to when return Callee returns before caller does Stack allocated in Frames state for single procedure instan;a;on 27

Call Chain Example yoo( ) { who(); who( ) { (); (); ( ) { (); Example Call Chain yoo who Procedure () is recursive 28

Stack Frames Contents Local variables Return informa;on Temporary space Frame Pointer: %ebp Stack Pointer: Previous Frame Frame for proc Management Space allocated when enter procedure Set- up code Deallocated when return Finish code Stack Top 29

Example Stack yoo( ) { who(); yoo who %ebp yoo 30

Example Stack yoo( ) { who( ) { who(); (); (); yoo who %ebp yoo who 31

Example Stack yoo( ) { who( ) { ( ) who(); (); { (); (); yoo who %ebp yoo who 32

Example Stack yoo( ) { who( ) { ( ) who(); (); { ( ) (); { (); (); yoo who %ebp yoo who 33

Example Stack yoo( ) { who( ) { ( ) who(); (); { ( ) (); { (); ( ) { (); (); yoo who %ebp yoo who 34

Example Stack yoo( ) { who( ) { ( ) who(); (); { ( ) (); { (); (); yoo who %ebp yoo who 35

Example Stack yoo( ) { who( ) { ( ) who(); (); { (); (); yoo who %ebp yoo who 36

Example Stack yoo( ) { who( ) { who(); (); (); yoo who %ebp yoo who 37

Example Stack yoo( ) { who( ) { ( ) who(); (); { (); (); yoo who %ebp yoo who 38

Example Stack yoo( ) { who( ) { who(); (); (); yoo who %ebp yoo who 39

Example Stack yoo( ) { who(); yoo who %ebp yoo 40

IA32/Linux Stack Frame Current Stack Frame ( Top to Bo_om) Argument build: Parameters for func;on about to call Local variables If can t keep in registers Saved register context Old frame pointer Frame pointer %ebp Caller Frame Arguments Return Addr Old %ebp Caller Stack Frame Return address Pushed by call instruc;on Arguments for this call Stack pointer Saved Registers + Local Variables Argument Build 41

RevisiBng swap int course1 = 15213; int course2 = 18243; void call_swap() { swap(&course1, &course2); Calling swap from call_swap call_swap: subl $8, movl $course2, 4() movl $course1, () call swap void swap(int *xp, int *yp) { int t0 = *xp; int t1 = *yp; *xp = t1; *yp = t0; &course 2 &course 1 Rtn adr Resul;ng Stack subl call 42

RevisiBng swap void swap(int *xp, int *yp) { int t0 = *xp; int t1 = *yp; *xp = t1; *yp = t0; swap: pushl %ebp movl, %ebp pushl %ebx movl 8(%ebp), %edx movl 12(%ebp), %ecx movl (%edx), %ebx movl (%ecx), %eax movl %eax, (%edx) movl %ebx, (%ecx) popl %ebx popl %ebp ret Set Up Body Finish 43

swap Setup #1 Entering Stack Resul;ng Stack %ebp %ebp &course2 yp &course1 xp Rtn adr Rtn adr Old %ebp swap: pushl %ebp movl,%ebp pushl %ebx 44

swap Setup #2 Entering Stack Resul;ng Stack %ebp &course2 yp &course1 xp Rtn adr Rtn adr Old %ebp %ebp swap: pushl %ebp movl,%ebp pushl %ebx 45

swap Setup #3 Entering Stack %ebp Resul;ng Stack &course2 &course1 Rtn adr yp xp Rtn adr swap: pushl %ebp movl,%ebp pushl %ebx Old %ebp Old %ebx %ebp 46

swap Body Entering Stack Resul;ng Stack %ebp Offset rela;ve to %ebp &course2 &course1 Rtn adr 12 8 4 yp xp Rtn adr Old %ebp %ebp Old %ebx movl 8(%ebp),%edx # get xp movl 12(%ebp),%ecx # get yp... 47

swap Finish Stack Before Finish popl %ebx popl %ebp Resul;ng Stack %ebp yp xp Rtn adr Old %ebp Old %ebx %ebp yp xp Rtn adr Observa;on Saved and restored register %ebx Not so for %eax, %ecx, %edx 48

Disassembled swap 08048384 <swap>: 8048384: 55 push %ebp 8048385: 89 e5 mov,%ebp 8048387: 53 push %ebx 8048388: 8b 55 08 mov 0x8(%ebp),%edx 804838b: 8b 4d 0c mov 0xc(%ebp),%ecx 804838e: 8b 1a mov (%edx),%ebx 8048390: 8b 01 mov (%ecx),%eax 8048392: 89 02 mov %eax,(%edx) 8048394: 89 19 mov %ebx,(%ecx) 8048396: 5b pop %ebx 8048397: 5d pop %ebp 8048398: c3 ret Calling Code 80483b4: movl $0x8049658,0x4() # Copy &course2 80483bc: movl $0x8049654,() # Copy &course1 80483c3: call 8048384 <swap> # Call swap 80483c8: leave # Prepare to return 80483c9: ret # Return 49

Today Switch statements IA 32 Procedures Stack Structure Calling Conven;ons Illustra;ons of Recursion & Pointers 50

Register Saving ConvenBons When procedure yoo calls who: yoo is the caller who is the callee Can register be used for temporary storage? yoo: movl $15213, %edx call who addl %edx, %eax ret who: movl 8(%ebp), %edx addl $18243, %edx ret Contents of register %edx overwriien by who This could be trouble something should be done! Need some coordina;on 51

Register Saving ConvenBons When procedure yoo calls who: yoo is the caller who is the callee Can register be used for temporary storage? ConvenBons Caller Save Caller saves temporary values in its frame before the call Callee Save Callee saves temporary values in its frame before using 52

IA32/Linux+Windows Register Usage %eax, %edx, %ecx Caller saves prior to call if values are used later %eax also used to return integer value %ebx, %esi, %edi Callee saves if wants to use them, %ebp special form of callee save Restored to original values upon exit from procedure Caller- Save Temporaries Callee- Save Temporaries Special %eax %edx %ecx %ebx %esi %edi %ebp 53

Today Switch statements IA 32 Procedures Stack Structure Calling Conven;ons Illustra;ons of Recursion & Pointers 54

Recursive FuncBon /* Recursive popcount */ int pcount_r(unsigned x) { if (x == 0) return 0; else return (x & 1) + pcount_r(x >> 1); Registers %eax, %edx used without first saving %ebx used, but saved at beginning & restored at end pcount_r: pushl %ebp movl, %ebp pushl %ebx subl $4, movl 8(%ebp), %ebx movl $0, %eax testl %ebx, %ebx je.l3 movl %ebx, %eax shrl %eax movl %eax, () call pcount_r movl %ebx, %edx andl $1, %edx leal (%edx,%eax), %eax.l3: addl $4, popl %ebx popl %ebp ret 55

Recursive Call #1 /* Recursive popcount */ int pcount_r(unsigned x) { if (x == 0) return 0; else return (x & 1) + pcount_r(x >> 1); AcBons Save old value of %ebx on stack Allocate space for argument to recursive call Store x in %ebx %ebx x pcount_r: pushl %ebp movl, %ebp pushl %ebx subl $4, movl 8(%ebp), %ebx x Rtn adr Old %ebp Old %ebx %ebp 56

Recursive Call #2 /* Recursive popcount */ int pcount_r(unsigned x) { if (x == 0) return 0; else return (x & 1) + pcount_r(x >> 1); movl $0, %eax testl %ebx, %ebx je.l3.l3: ret AcBons If x == 0, return with %eax set to 0 %ebx x 57

Recursive Call #3 /* Recursive popcount */ int pcount_r(unsigned x) { if (x == 0) return 0; else return (x & 1) + pcount_r(x >> 1); movl %ebx, %eax shrl %eax movl %eax, () call pcount_r AcBons Store x >> 1 on stack Make recursive call Effect %eax set to func;on result %ebx s;ll has value of x %ebx x Rtn adr Old %ebp Old %ebx x >> 1 %ebp 58

Recursive Call #4 /* Recursive popcount */ int pcount_r(unsigned x) { if (x == 0) return 0; else return (x & 1) + pcount_r(x >> 1); movl %ebx, %edx andl $1, %edx leal (%edx,%eax), %eax Assume %eax holds value from recursive call %ebx holds x AcBons Compute (x & 1) + computed value Effect %eax set to func;on result %ebx x 59

Recursive Call #5 /* Recursive popcount */ int pcount_r(unsigned x) { if (x == 0) return 0; else return (x & 1) + pcount_r(x >> 1); L3: addl$4, popl%ebx popl%ebp ret AcBons Restore values of %ebx and %ebp Restore Rtn adr %ebp Old %ebp Old %ebx %ebp %ebx Old %ebx 60

ObservaBons About Recursion Handled Without Special ConsideraBon Stack frames mean that each func;on call has private storage Saved registers & local variables Saved return pointer Register saving conven;ons prevent one func;on call from corrup;ng another s data Stack discipline follows call / return paiern If P calls Q, then Q returns before P Last- In, First- Out Also works for mutual recursion P calls Q; Q calls P 61

Pointer Code Genera;ng Pointer /* Compute x + 3 */ int add3(int x) { int localx = x; incrk(&localx, 3); return localx; Referencing Pointer /* Increment value by k */ void incrk(int *ip, int k) { *ip += k; add3 creates pointer and passes it to incrk 62

CreaBng and IniBalizing Local Variable int add3(int x) { int localx = x; incrk(&localx, 3); return localx; Variable localx must be stored on stack Because: Need to create pointer to it Compute pointer as - 4(%ebp) 8 4 x Rtn adr 0 Old %ebp %ebp First part of add3-4 localx add3: = x pushl%ebp -8 movl, %ebp -12 Unused subl $24, # Alloc. 24 bytes movl 8(%ebp), %eax -16 movl %eax, -4(%ebp)# Set localx to x -20-24 63

CreaBng Pointer as Argument int add3(int x) { int localx = x; incrk(&localx, 3); return localx; Use leal instrucbon to compute address of localx Middle part of add3 movl $3, 4() # 2 nd arg = 3 leal -4(%ebp), %eax# &localx movl %eax, () # 1 st arg = &localx call incrk 8 x 4 Rtn adr 0 Old %ebp %ebp -4 localx -8-12 Unused -16-20 3 +4-24 64

Retrieving local variable int add3(int x) { int localx = x; incrk(&localx, 3); return localx; Retrieve localx from stack as return value Final part of add3 movl -4(%ebp), %eax # Return val= localx leave ret 8 x 4 Rtn adr 0 Old %ebp %ebp -4 localx -8-12 Unused -16-20 -24 65

IA 32 Procedure Summary Important Points Stack is the right data structure for procedure call / return If P calls Q, then Q returns before P Recursion (& mutual recursion) handled by normal calling convenbons Can safely store values in local stack frame and in callee- saved registers Put func;on arguments at top of stack Result return in %eax Pointers are addresses of values On stack or global Caller Frame %ebp Arguments Return Addr Old %ebp Saved Registers + Local Variables Argument Build 66