Introduction to Computer Engineering. CS/ECE 252, Spring 2017 Rahul Nayar Computer Sciences Department University of Wisconsin Madison

Similar documents
Introduction to Computer Engineering. CS/ECE 252, Fall 2016 Prof. Guri Sohi Computer Sciences Department University of Wisconsin Madison

Chapter 7 Assembly Language. ACKNOWLEDGEMENT: This lecture uses slides prepared by Gregory T. Byrd, North Carolina State University

Assembly Language. University of Texas at Austin CS310H - Computer Organization Spring 2010 Don Fussell

Computing Layers. Chapter 7 Assembly Language

Chapter 7 Assembly Language

CS 135: Computer Architecture I

Chapter 9 TRAP Routines and Subroutines

Chapter 9 TRAP Routines and Subroutines

Chapter 5 - ISA. 1.The Von Neumann Model. The Stored Program Computer. Von Neumann Model. Memory

Syntax of LC-3 assembly: Language elements. Instructions

TRAPs and Subroutines. University of Texas at Austin CS310H - Computer Organization Spring 2010 Don Fussell

Midterm 2 Review Chapters 4-16 LC-3

System Calls. Chapter 9 TRAP Routines and Subroutines

LC-3 Assembly Language. (Textbook Chapter 7)"

Instruction Set Architecture

Chapter 9 TRAP Routines and Subroutines

LC-3 Instruction Set Architecture. Textbook Chapter 5

LC-3 Subroutines and Traps. (Textbook Chapter 9)"

LC-3 Instruction Set Architecture

Introduction to Computer Engineering. CS/ECE 252, Spring 2017 Rahul Nayar Computer Sciences Department University of Wisconsin Madison

Subroutines & Traps. Announcements. New due date for assignment 2 5pm Wednesday, 5May

Introduction to Computer Engineering. Chapter 5 The LC-3. Instruction Set Architecture

Chapter 6 Programming the LC-3

Introduction to Computer Engineering. CS/ECE 252, Fall 2016 Prof. Guri Sohi Computer Sciences Department University of Wisconsin Madison

UNIVERSITY OF WISCONSIN MADISON

Introduction to Computer Engineering. CS/ECE 252, Spring 2017 Rahul Nayar Computer Sciences Department University of Wisconsin Madison

Assembly Language. 7.1 Assembly Language Programming Moving Up o Level. chapter

Chapter 5 The LC-3. ACKNOWLEDGEMENT: This lecture uses slides prepared by Gregory T. Byrd, North Carolina State University 5-2

Computing Layers. Chapter 5 The LC-3

CS/ECE 252: INTRODUCTION TO COMPUTER ENGINEERING UNIVERSITY OF WISCONSIN MADISON

CS/ECE 252: INTRODUCTION TO COMPUTER ENGINEERING UNIVERSITY OF WISCONSIN MADISON

Assembly Language. 7.1 Assembly Language Programming Moving Up o Level. chapter

LC-3 Assembly Language

Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Computing Layers

COSC121: Computer Systems: Review

COSC121: Computer Systems: Review

ECE 206, Fall 2001: Lab 3

CS 2461: Computer Architecture I

CS/ECE 252: INTRODUCTION TO COMPUTER ENGINEERING COMPUTER SCIENCES DEPARTMENT UNIVERSITY OF WISCONSIN-MADISON

appendix a The LC-3 ISA A.1 Overview

CS/ECE 252: INTRODUCTION TO COMPUTER ENGINEERING UNIVERSITY OF WISCONSIN MADISON. Instructor: Rahul Nayar TAs: Mohit Verma, Annie Lin

EEL 5722C Field-Programmable Gate Array Design

CS/ECE 252: INTRODUCTION TO COMPUTER ENGINEERING COMPUTER SCIENCES DEPARTMENT UNIVERSITY OF WISCONSIN-MADISON

Trap Vector Table. Interrupt Vector Table. Operating System and Supervisor Stack. Available for User Programs. Device Register Addresses

Register Files. Single Bus Architecture. Register Files. Single Bus Processor. Term Project: using VHDL. Accumulator based architecture

LC-3 ISA - II. Lecture Topics. Lecture materials. Homework. Machine problem. Announcements. ECE 190 Lecture 10 February 17, 2011

CS/ECE 252: INTRODUCTION TO COMPUTER ENGINEERING UNIVERSITY OF WISCONSIN MADISON

11/10/2016. Review the Problem to Be Solved. ECE 120: Introduction to Computing. What Shall We Keep in the Registers? Where Are the Pieces in Memory?

20/08/14. Computer Systems 1. Instruction Processing: FETCH. Instruction Processing: DECODE

Implementing Functions at the Machine Level

CS 2461: Computer Architecture I

LC-3 TRAP Routines. Textbook Chapter 9

The LC-3 Instruction Set Architecture. ISA Overview Operate instructions Data Movement instructions Control Instructions LC-3 data path

Chapter 10 And, Finally... The Stack. ACKNOWLEDGEMENT: This lecture uses slides prepared by Gregory T. Byrd, North Carolina State University

That is, we branch to JUMP_HERE, do whatever we need to do, and then branch to JUMP_BACK, which is where we started.

11/16/2016. Review Our Process for Programming. ECE 120: Introduction to Computing. A Typical Programming Process. Can a Computer Help Us Program?

Lab 1. Warm-up : discovering the target machine, LC-3

CIT 595 Spring System Software: Programming Tools. Assembly Process Example: First Pass. Assembly Process Example: Second Pass.

Lec-memMapIOtrapJSR SYS_BUS

CS/ECE 252: INTRODUCTION TO COMPUTER ENGINEERING UNIVERSITY OF WISCONSIN MADISON

Load -- read data from memory to register. Store -- write data from register to memory. Load effective address -- compute address, save in register

Introduction to Computer. Chapter 5 The LC-3. Instruction Set Architecture

Introduction to Computer Engineering. CS/ECE 252, Fall 2012 Prof. Guri Sohi Computer Sciences Department University of Wisconsin Madison

10/31/2016. The LC-3 ISA Has Three Kinds of Opcodes. ECE 120: Introduction to Computing. ADD and AND Have Two Addressing Modes

Memory Usage in Programs

LC-2 Programmer s Reference and User Guide

CS/ECE 252: INTRODUCTION TO COMPUTER ENGINEERING UNIVERSITY OF WISCONSIN MADISON

Ch. 5: The LC-3. PC-Relative Addressing Mode. Data Movement Instructions. James Goodman!

CS/ECE 252: INTRODUCTION TO COMPUTER ENGINEERING COMPUTER SCIENCES DEPARTMENT UNIVERSITY OF WISCONSIN-MADISON

CS/ECE 252: INTRODUCTION TO COMPUTER ENGINEERING UNIVERSITY OF WISCONSIN MADISON. Instructor: Rahul Nayar TAs: Annie Lin, Mohit Verma

CS/ECE 252: INTRODUCTION TO COMPUTER ENGINEERING UNIVERSITY OF WISCONSIN MADISON

ECE/CS 252: INTRODUCTION TO COMPUTER ENGINEERING UNIVERSITY OF WISCONSIN MADISON

컴퓨터개념및실습. 기말고사 review

CS/ECE 252: INTRODUCTION TO COMPUTER ENGINEERING UNIVERSITY OF WISCONSIN MADISON

ECE 411 Exam 1. This exam has 5 problems. Make sure you have a complete exam before you begin.

Homework 06. Push Z Push Y Pop Y Push X Pop X Push W Push V Pop V Push U Pop U Pop W Pop Z Push T Push S

Intro. to Computer Architecture Homework 4 CSE 240 Autumn 2005 DUE: Mon. 10 October 2005

UNIVERSITY OF WISCONSIN MADISON

Binghamton University. CS-120 Summer LC3 Memory. Text: Introduction to Computer Systems : Sections 5.1.1, 5.3

CIT 593: Intro to Computer Systems Homework #4: Assembly Language Due October 18, 2012, 4:30pm. Name

CS/ECE 252: INTRODUCTION TO COMPUTER ENGINEERING COMPUTER SCIENCES DEPARTMENT UNIVERSITY OF WISCONSIN-MADISON

Problem Solving in C. Stepwise Refinement. ...but can stop refining at a higher level of abstraction. Same basic constructs. as covered in Chapter 6

University of Illinois at Urbana-Champaign First Midterm Exam, ECE 220 Honors Section

Introduction to Computer Engineering. CS/ECE 252 Prof. Mark D. Hill Computer Sciences Department University of Wisconsin Madison

ECE/CS 252: INTRODUCTION TO COMPUTER ENGINEERING UNIVERSITY OF WISCONSIN MADISON

ECE 109 Sections 602 to 605 Final exam Fall December, 2007

ADD R3, R4, #5 LDR R3, R4, #5

The Assembly Language of the Boz 5

Assembly: Review. Computer Science and Engineering College of Engineering The Ohio State University. Lecture 14

LC-3 Instruction Processing

ECE/CS 252 Fall 2011 Homework 7 (25 points) // Due in Lecture on Nov. 30, 2011 (Wed)

11/6/2016. Let s Solve a Problem. ECE 120: Introduction to Computing. Overview of Our Task. Programs are Finite State Machines

LC-3 Instruction Processing. (Textbook s Chapter 4)

ECE220: Computer Systems and Programming Spring 2018 Honors Section due: Saturday 14 April at 11:59:59 p.m. Code Generation for an LC-3 Compiler

Lecture V Toy Hardware and Operating System

Guide to Using the Unix version of the LC-2 Simulator

Computer Science and Engineering 331. Midterm Examination #1. Fall Name: Solutions S.S.#:

MIPS Instruction Set Architecture (2)

1/30/2018. Conventions Provide Implicit Information. ECE 220: Computer Systems & Programming. Arithmetic with Trees is Unambiguous

2 Steps Building the Coding and Testing Infrastructure

Transcription:

Introduction to Computer Engineering CS/ECE 252, Spring 2017 Rahul Nayar Computer Sciences Department University of Wisconsin Madison

Chapter 7 & 9.2 Assembly Language and Subroutines

Human-Readable Machine Language Computers like ones and zeros 0001110010000110 Humans like symbols ADD R6,R2,R6 ; increment index reg. Assembler is a program that turns symbols into machine instructions. ISA-specific: close correspondence between symbols and instruction set mnemonics for opcodes labels for memory locations additional operations for allocating storage and initializing data 7-3

An Assembly Language Program ; ; Program to multiply a number by the constant 6 ;.ORIG x3050 LD R1, SIX LD R2, NUMBER AND R3, R3, #0 ; Clear R3. It will ; contain the product. ; The inner loop ; AGAIN ADD R3, R3, R2 ADD R1, R1, #-1 ; R1 keeps track of BRp AGAIN ; the iteration. ; HALT ; NUMBER.BLKW 2 SIX.FILL x0006 ;.END 7-4

Decoding the Lable NUMBER for LD instruction LD R2, NUMBER Addr = x3050 PC= x3051 Addr for Label NUMBER x3059 0010 DR PCOffset Offset Required = x0008 7-5

LC-3 Assembly Language Syntax Each line of a program is one of the following: an instruction an assember directive (or pseudo-op) a comment Whitespace (between symbols) and case are ignored. Comments (beginning with ; ) are also ignored. Can we write two assembly instructions on the same line? An instruction has the following format: LABEL OPCODE OPERANDS ; COMMENTS optional mandatory 7-6

In Class Exercise Do instruction translation by filling in the table below Machine Code Assembly Code Comments 1001 101 010 1 11111 0101 000 011 0 00 001 0001 110 111 1 00011 NOT R5, R2 AND R0, R3, R1 ADD R6, R7, #3 R5 NOT(R2) R0 R3 AND R1 R6 R7 + SEXT(#3) 7-7

Opcodes and Operands Opcodes reserved symbols that correspond to LC-3 instructions listed in Appendix A ex: ADD, AND, LD, LDR, Operands registers -- specified by Rn, where n is the register number numbers -- indicated by # (decimal) or x (hex) label -- symbolic name of memory location separated by comma number, order, and type correspond to instruction format ex: ADD R1,R1,R3 ADD R1,R1,#3 LD R6,NUMBER BRz LOOP R1 R1 + R3 R1 R1 + #3 R6 mem[ ] If Z, PC 0001 001 001 0 00 011 0001 001 001 1 00011 0010 110 xxxxxxxxx 0000 010 xxxxxxxxx 7-8

Labels and Comments Label placed at the beginning of the line assigns a symbolic name to the address corresponding to line ex: LOOP ADD R1,R1,#-1 BRp LOOP Comment anything after a semicolon is a comment ignored by assembler used by humans to document/understand programs tips for useful comments: avoid restating the obvious, as decrement R1 provide additional insight, as in accumulate product in R6 use comments to separate pieces of program 7-9

Assembler Directives Pseudo-operations do not refer to operations executed by program used by assembler look like instruction, but opcode starts with dot Opcode Operand Meaning.ORIG address starting address of program.end end of program.blkw n allocate n words of storage.fill n allocate one word, initialize with value n.stringz n-character string allocate n+1 locations, initialize w/characters and null terminator 7-10

Assembler Directives.ORIG x3010 HELLO.STRINGZ Hello, World! x3010 : x0048 x3011 : x0065 x3012 : x006c x3013 : x006c x3014 : x006f x3015 : x002c x3016 : x0020.. x301c : x0021 x301d : x0000 7-11

Trap Codes LC-3 assembler provides pseudo-instructions for each trap code, so you don t have to remember them. Code Equivalent Description HALT TRAP x25 Halt execution and print message to console. IN TRAP x23 Print prompt on console, read (and echo) one character from keybd. Character stored in R0[7:0]. OUT TRAP x21 Write one character (in R0[7:0]) to console. GETC TRAP x20 Read one character from keyboard. Character stored in R0[7:0]. PUTS TRAP x22 Write null-terminated string to console. Address of string is in R0. 7-12

Style Guidelines Use the following style guidelines to improve the readability and understandability of your programs: 1. Provide a program header, with author s name, date, etc., and purpose of program. 2. Start labels, opcode, operands, and comments in same column for each line. (Unless entire line is a comment.) 3. Use comments to explain what each register does. 4. Give explanatory comment for most instructions. 5. Use meaningful symbolic names. Mixed upper and lower case for readability. ASCIItoBinary, InputRoutine, SaveR1 6. Provide comments between program sections. 7. Each line must fit on the page -- no wraparound or truncations. Long statements split in aesthetically pleasing manner. 7-13

Sample Program Count the occurrences of a character in a file. Remember this? Count = 0 (R2 = 0) Done? (R1?= EOT) YES Convert count to ASCII character (R0 = x30, R0 = R2 + R0) Ptr = 1st file character (R3 = M[x3012]) NO Input char from keybd (TRAP x23) YES Match? (R1?= R0) NO Print count (TRAP x21) Load char from file (R1 = M[R3]) Incr Count (R2 = R2 + 1) HALT (TRAP x25) Load next char from file (R3 = R3 + 1, R1 = M[R3]) 7-14

Char Count in Assembly Language (1 of 3) ; ; Program to count occurrences of a character in a file. ; Character to be input from the keyboard. ; Result to be displayed on the monitor. ; Program only works if no more than 9 occurrences are found. ; ; ; Initialization ;.ORIG x3000 AND R2, R2, #0 ; R2 is counter, initially 0 LD R3, PTR ; R3 is pointer to characters GETC ; R0 gets character input LDR R1, R3, #0 ; R1 gets first character ; ; Test character for end of file ; TEST ADD R4, R1, #-4 ; Test for EOT (ASCII x04) BRz OUTPUT ; If done, prepare the output 7-15

Char Count in Assembly Language (2 of 3) ; ; Test character for match. If a match, increment count. ; NOT R1, R1 ADD R1, R1, R0 ; If match, R1 = xffff NOT R1, R1 ; If match, R1 = x0000 BRnp GETCHAR ; If no match, do not increment ADD R2, R2, #1 ; ; Get next character from file. ; GETCHAR ADD R3, R3, #1 ; Point to next character. LDR R1, R3, #0 ; R1 gets next char to test BRnzp TEST ; ; Output the count. ; OUTPUT LD R0, ASCII ; Load the ASCII template ADD R0, R0, R2 ; Covert binary count to ASCII OUT HALT ; ASCII code in R0 is displayed. ; Halt machine 7-16

Char Count in Assembly Language (3 of 3) ; ; Storage for pointer and ASCII template ; ASCII.FILL x0030 PTR.FILL x4000.end 7-17

Note for HW problems Don t focus on only compiling your program and expect the program to work Think about a good way to test your program Suggestion: write a small script To get register values, at the end of the program set breakpoints at HALT instructions use check r7 x343 Read the questions carefully, always look for start of program address Address is calculated after.orig.orig X4545 LDI R0,#3 Offset is calculated on the PC register value of the current instructions LC3 edit does not generate symbol table Submit HW in *.bin format 7-18

Recap Assembly Language: Written in the same abstraction as binary instructions More readable ADD R6,R2,R6 ; increment index reg. Need an assembler to convert assembly instructions into binary instructions PennSim has in-built assembler Assembly Language can have additional operations for allocating storage and initializing data values In few cases it is possible to write two assembly instructions in the same line LC3 it is possible since LC3 instructions are fixed length 7-19

Recap Each line of a program is one of the following: an instruction an assember directive (or pseudo-op) a comment Whitespace (between symbols) and case are ignored. Comments (beginning with ; ) are also ignored. An instruction has the following format: LABEL OPCODE OPERANDS ; COMMENTS optional mandatory 7-20

Recap:Trap Codes LC-3 assembler provides pseudo-instructions for each trap code, so you don t have to remember them. Code Equivalent Description HALT TRAP x25 Halt execution and print message to console. IN TRAP x23 Print prompt on console, read (and echo) one character from keybd. Character stored in R0[7:0]. OUT TRAP x21 Write one character (in R0[7:0]) to console. GETC TRAP x20 Read one character from keyboard. Character stored in R0[7:0]. PUTS TRAP x22 Write null-terminated string to console. Address of string is in R0. 7-21

Assembly Process Convert assembly language file (.asm) into an executable file (.obj) for the LC-3 simulator. First Pass: scan program file find all labels and calculate the corresponding addresses; this is called the symbol table Second Pass: convert instructions to machine language, using information from symbol table 7-22

First Pass: Constructing the Symbol Table 1. Find the.orig statement, which tells us the address of the first instruction. Initialize location counter (LC), which keeps track of the current instruction. 2. For each non-empty line in the program: a) If line contains a label, add label and LC to symbol table. b) Increment LC. NOTE: If statement is.blkw or.stringz, increment LC by the number of words allocated. 3. Stop when.end statement is reached. NOTE: A line that contains only a comment is considered an empty line. 7-23

Practice Construct the symbol table for the program in Figure 7.1 (Slides 7-15 through 7-17). Symbol TEST GETCHAR OUTPUT ASCII PTR Address x3004 x300b x300e x3012 x3013 7-24

In-Class Exercise (2014 Exam4 Question2b) Construct the symbol table for the program..orig x3000 LEA R2, STRING LD R3, NUMBER HERE ADD R1, R2, R3 ADD R2, R1, #0 LDR R0, R1, #0 BRz DONE OUT BR HERE THIS.BLKW 6 STRING.STRINGZ "2down_3to_go" NUMBER.FILL x4 DONE HALT.END Symbol HERE THIS STRING NUMBER DONE Address x3002 x3008 x300e x301b x301c

Second Pass: Generating Machine Language For each executable assembly language statement, generate the corresponding machine language instruction. If operand is a label, look up the address from the symbol table. Potential problems: Improper number or type of arguments ex: NOT R1,#7 ADD R1,R2 ADD R3,R3,NUMBER Immediate argument too large ex: ADD R1,R2,#1023 Address (associated with label) more than 256 from instruction can t use PC-relative addressing mode 7-26

Practice Identify and correct 5 assembly errors in the following LC3 program:.orig x3001 AND R4, R4, #0 BRz NEXT LD R5, STRING STOP ADD R5, R5, *1 BRp STOP LDR R6, R5, #4 OR R6, R6, #3 ST R4, STRING STOP HALT ZERO.FILL #0 VAL.FILL VALUE STRING.STRINGS "Hello World!".END 7-27

Practice Identify and correct 5 assembly errors in the following LC3 program:.orig x3001 AND R4, R4, #0 BRz NEXT LD R5, STRING STOP ADD R5, R5, *1 BRp STOP LDR R6, R5, #4 OR R6, R6, #3 ST R4, STRING STOP HALT ZERO.FILL #0 VAL.FILL VALUE STRING.STRINGS "Hello World!".END OR Instruction not part of LC3 7-28

Practice Identify and correct 5 assembly errors in the following LC3 program:.orig x3001 AND R4, R4, #0 BRz NEXT LD R5, STRING STOP ADD R5, R5, *1 BRp STOP LDR R6, R5, #4 OR R6, R6, #3 ST R4, STRING STOP HALT ZERO.FILL #0 VAL.FILL VALUE STRING.STRINGS "Hello World!".END No STRING directive, it is.stringz 7-29

Practice Identify and correct 5 assembly errors in the following LC3 program:.orig x3001 AND R4, R4, #0 BRz NEXT LD R5, STRING STOP ADD R5, R5, *1 BRp STOP LDR R6, R5, #4 OR R6, R6, #3 ST R4, STRING STOP HALT ZERO.FILL #0 VAL.FILL VALUE STRING.STRINGS "Hello World!".END Immediate Value using * 7-30

Practice Identify and correct 5 assembly errors in the following LC3 program:.orig x3001 AND R4, R4, #0 BRz NEXT LD R5, STRING STOP ADD R5, R5, *1 BRp STOP LDR R6, R5, #4 OR R6, R6, #3 ST R4, STRING STOP HALT ZERO.FILL #0 VAL.FILL VALUE STRING.STRINGS "Hello World!".END NEXT label undeclared 7-31

Practice Identify and correct 5 assembly errors in the following LC3 program:.orig x3001 AND R4, R4, #0 BRz NEXT LD R5, STRING STOP ADD R5, R5, *1 BRp STOP LDR R6, R5, #4 OR R6, R6, #3 ST R4, STRING STOP HALT ZERO.FILL #0 VAL.FILL VALUE STRING.STRINGS "Hello World!".END STOP label used twice 7-32

Practice Identify and correct 5 assembly errors in the following LC3 program:.orig x3001 AND R4, R4, #0 BRz NEXT LD R5, STRING STOP ADD R5, R5, *1 BRp STOP LDR R6, R5, #4 OR R6, R6, #3 ST R4, STRING STOP HALT ZERO.FILL #0 VAL.FILL VALUE STRING.STRINGS "Hello World!".END VALUE label undeclared 7-33

Practice Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Using the symbol table constructed earlier, translate these statements into LC-3 machine language. Address Statement Machine Language x3001 LD R3,PTR 0010 011 000010001 = x2611 x3004 x300c x3009 ADD R4,R1,#-4 LDR R1,R3,#0 BRnp GETCHAR 0001 100 001 1 11100 = x187c 0110 001 011 000000 = x62c0 0000 101 000000001 = x0a01 Symbol TEST GETCHAR OUTPUT ASCII PTR x3004 x300b x300e x3012 x3013 Address 7-34

In-Class Exercise Construct the symbol table for the program..orig x3000 LD R2, NUMBER LD R1, MASK LD R3, PTR2 LOOP LDR R4, R3, #0 AND R4, R4, R1 BRz NEXT ADD R0, R0, #1 NEXT ADD R3, R3, #1 ADD R2, R2, #-1 BRp LOOP STI R0, PTR1 HALT NUMBER.BLKW 3 MASK.FILL x8000 PTR1.FILL x4000 PTR2.FILL x5000.end Symbol LOOP NEXT NUMBER MASK PTR1 PTR2 Address x3003 x3007 x300b x300e x300f x3010

Practice Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Using the symbol table constructed earlier, translate these statements into LC-3 machine language. Address Statement Machine Language x3000 ADD R1,R2,#4 0001 001 010 1 00100 = x12a4 x300a STI R0,PTR3 1011 000 000 0 01010 = xb00a Symbol ADDRESS AGAIN PTR3 DESTINATION x3012 x3014 x3015 x301a Address 7-36

Note for All HW6 due today Need to submit your binary code at Learn@uw, by 9.55am Also need to hand in physical copy Remember to staple your HW HW7 will be released today Due on April 14th 7-37

Points covered so far Assembly Language Structure Labels Assembler directives Two step assembly process generating the symbol table converting assembly to machine instructions 7-38

LC-3 Assembler Using lc3as (Unix) or LC3Edit (Windows), generates several different output files. This one gets loaded into the simulator. 7-39

Object File Format LC-3 object file contains Starting address (location where program must be loaded), followed by Machine instructions Example Beginning of count character object file looks like this: 0011000000000000 0101010010100000 0010011000010001 1111000000100011....ORIG x3000 AND R2, R2, #0 LD R3, PTR TRAP x23 7-40

Multiple Object Files An object file is not necessarily a complete program. system-provided library routines code blocks written by multiple developers For LC-3 simulator, can load multiple object files into memory, then start executing at a desired address. system routines, such as keyboard input, are loaded automatically loaded into system memory, below x3000 user code should be loaded between x3000 and xfdff each object file includes a starting address be careful not to load overlapping object files 7-41

Linking and Loading Loading is the process of copying an executable image into memory. more sophisticated loaders are able to relocate images to fit into available memory must readjust branch targets, load/store addresses Linking is the process of resolving symbols between independent object files. suppose we define a symbol in one module, and want to use it in another some notation, such as.external, is used to tell assembler that a symbol is defined in another module linker will search symbol tables of other modules to resolve symbols and complete code generation before loading 7-42

Linking P1:.ORIG x3000 ADD R0, R1, #0 ADD R0, R0, #-15 ADD R0, R0, #-10 STI R0, PTR HALT.END P2: PTR.FILL x5000.end 7-43

Linking P1:.ORIG x3000 ADD R0, R1, #0 ADD R0, R0, #-15 ADD R0, R0, #-10 STI R0, PTR HALT.EXTERNAL PTR.END P2: PTR.FILL x5000.end 7-44

Skipping Ahead to Chapter 9 You will need to use subroutines for programming assignments Read Section 9.2 A subroutine is a program fragment that: performs a well-defined task is invoked (called) by another user program returns control to the calling program when finished Reasons for subroutines: reuse useful (and debugged!) code without having to keep typing it in divide task among multiple programmers use vendor-supplied library of useful routines 7-45

Subroutine Motivation Reasons for subroutines: Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display. reuse useful code without having to keep typing it in divide task among multiple programmers 7-46

JSR Instruction Jumps to a location (like a branch but unconditional), and saves current PC (addr of next instruction) in R7. saving the return address is called linking target address is PC-relative (PC + Sext(IR[10:0])) bit 11 specifies addressing mode if =1, PC-relative: target address = PC + Sext(IR[10:0]) if =0, register: target address = contents of register IR[8:6] 7-47

JSR NOTE: PC has already been incremented during instruction fetch stage. 7-48

JSR Example PC is currently x4200 Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display. What is the contents of R7 and PC after the following instruction is executed? 1 0 0 0 0 0 0 0 1 0 0 0x4201 0x4201 100 0000 0100 = 0x404 1111 1100 0000 0100 = 0xFC04 NOTE: PC has already been incremented during instruction fetch stage. 0x3E05 7-49

JSRR Instruction Just like JSR, except Register addressing mode. target address is Base Register bit 11 specifies addressing mode What important feature does JSRR provide that JSR does not? Subroutine (target) address can be anywhere in memory 7-50

JSRR NOTE: PC has already been incremented during instruction fetch stage. 7-51

JSRR Example PC is currently x420a R5 is currently x3002 Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display. What is the contents of R7 and PC after the following instruction is executed? 1 0 1 0x3002 0x420B R5 NOTE: PC has already been incremented during instruction fetch stage. 7-52

Returning from a Subroutine RET (JMP R7) gets us back to the calling routine. just like TRAP 7-53

Example: Negate the value in R0 2sComp NOT R0, R0 ; flip bits ADD R0, R0, #1 ; add one RET ; return to caller To call from a program (within 1024 instructions): ; need to compute R4 = R1 - R3 ADD R0, R3, #0 ; copy R3 to R0 JSR 2sComp ; negate ADD R4, R1, R0 ; add to R1... Note: Caller should save R0 if we ll need it later! 7-54

Passing Information to/from Subroutines Arguments A value passed in to a subroutine is called an argument. This is a value needed by the subroutine to do its job. Examples: In 2sComp routine, R0 is the number to be negated In OUT service routine, R0 is the character to be printed. In PUTS routine, R0 is address of string to be printed. Return Values A value passed out of a subroutine is called a return value. This is the value that you called the subroutine to compute. Examples: In 2sComp routine, negated value is returned in R0. In GETC service routine, character read from the keyboard is returned in R0. 7-55

Using Subroutines In order to use a subroutine, a programmer must know: its address (or at least a label that will be bound to its address) its function (what does it do?) NOTE: The programmer does not need to know how the subroutine works, but what changes are visible in the machine s state after the routine has run. its arguments (where to pass data in, if any) its return values (where to get computed data, if any) function address arguments return values 7-56

Saving and Restore Registers Since subroutines are just like service routines, we also need to save and restore registers, if needed. Generally use callee-save strategy, except for return values. Save anything that the subroutine will alter internally that shouldn t be visible when the subroutine returns. It s good practice to restore incoming arguments to their original values (unless overwritten by return value). Remember: You MUST save R7 if you call any other subroutine or service routine (TRAP). Otherwise, you won t be able to return to caller. 7-57

Example Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display. (1) Write a subroutine FirstChar to: find the first occurrence of a particular character (in R0) in a string (pointed to by R1); return pointer to character or to end of string (NULL) in R2. (2) Use FirstChar to write CountChar, which: counts the number of occurrences of a particular character (in R0) in a string (pointed to by R1); return count in R2. Can write CountChar subroutine using FirstChar Alternatively, can write the second subroutine first, without knowing the implementation of FirstChar! 7-58

FirstChar Algorithm save regs Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display. R3=R0 yes R0 holds character to search R1 is pointer to string R2 is return value pointer to character or end of string R2 <- R1 no R2 <- R2 + 1 R3 <- M(R2) restore regs R3=0 yes no return 7-59

FirstChar Implementation ; FirstChar: subroutine to find first occurrence of a char FirstChar ST R3, FCR3 ; save registers ST R4, FCR4 NOT R4, R0 ; use R4 for comparisons ADD R4, R4, #1 ADD R2, R1, #0 ; initialize ptr to beginning of string FC1 LDR R3, R2, #0 ; read character BRz FC2 ; if null, we re done ADD R3, R3, R4 ; see if matches input char BRz FC2 ; if yes, we re done ADD R2, R2, #1 ; increment pointer BRnzp FC1 FC2 LD R3, FCR3 ; restore registers LD R4, FCR4 ; RET ; and return 7-60

CountChar Algorithm (using FirstChar) save regs call FirstChar R3 <- M(R2) R1 <- R2 + 1 save R7, since we re using JSR R0 holds character to search R1 is pointer to string R2 is return value count of occurrences restore regs R3=0 yes no return 7-61

CountChar Implementation ; CountChar: subroutine to count occurrences of a char CountChar ST R3, CCR3 ; save registers ST R4, CCR4 ST R7, CCR7 ; JSR alters R7 ST R1, CCR1 ; save original string ptr AND R4, R4, #0 ; initialize count to zero CC1 JSR FirstChar ; find next occurrence (ptr in R2) LDR R3, R2, #0 ; see if char or null BRz CC2 ; if null, no more chars ADD R4, R4, #1 ; increment count ADD R1, R2, #1 ; point to next char in string BRnzp CC1 CC2 ADD R2, R4, #0 ; move return val (count) to R2 LD R3, CCR3 ; restore regs LD R4, CCR4 LD R1, CCR1 LD R7, CCR7 RET ; and return 7-62

Subroutine Summary Subroutines are useful reuse of code Need address, function, arguments, and return value Use JSR, JSRR to call subroutine Use RET to return from subroutine PennSim Demo If the return address is written to R7, then can we call a subroutine within a subroutine? Save R7 Can we make recursive calls? Stack, discussed in Chapter 10, but not this course. 7-63