Lecture 2: RISC V Instruction Set Architecture. James C. Hoe Department of ECE Carnegie Mellon University

Similar documents
Lecture 2: RISC V Instruction Set Architecture. Housekeeping

Lecture 3: Single Cycle Microarchitecture. James C. Hoe Department of ECE Carnegie Mellon University

Programmable Machines

Programmable Machines

Reduced Instruction Set Computer (RISC)

ISA and RISCV. CASS 2018 Lavanya Ramapantulu

Reduced Instruction Set Computer (RISC)

ece4750-tinyrv-isa.txt

MIPS Instruction Set

Chapter 2A Instructions: Language of the Computer

Computer Architecture

ENGN1640: Design of Computing Systems Topic 03: Instruction Set Architecture Design

CENG3420 Lecture 03 Review

RISC-V Assembly and Binary Notation

Lecture 5: Instruction Set Architecture : Road Map

Introduction to the MIPS. Lecture for CPSC 5155 Edward Bosworth, Ph.D. Computer Science Department Columbus State University

Course Administration

MIPS R-format Instructions. Representing Instructions. Hexadecimal. R-format Example. MIPS I-format Example. MIPS I-format Instructions

ELEC / Computer Architecture and Design Fall 2013 Instruction Set Architecture (Chapter 2)

Today s topics. MIPS operations and operands. MIPS arithmetic. CS/COE1541: Introduction to Computer Architecture. A Review of MIPS ISA.

Computer Architecture. The Language of the Machine

Lecture 2: Instruction Set Architecture. How to specify what a computer does?

CISC 662 Graduate Computer Architecture. Lecture 4 - ISA MIPS ISA. In a CPU. (vonneumann) Processor Organization

CS3350B Computer Architecture MIPS Instruction Representation

Instructions: MIPS ISA. Chapter 2 Instructions: Language of the Computer 1

Chapter 2. Instructions: Language of the Computer. Adapted by Paulo Lopes

CSCI 402: Computer Architectures. Instructions: Language of the Computer (3) Fengguang Song Department of Computer & Information Science IUPUI.

Instruction Set Architecture part 1 (Introduction) Mehran Rezaei

CISC 662 Graduate Computer Architecture. Lecture 4 - ISA

Lecture 2. Instructions: Language of the Computer (Chapter 2 of the textbook)

Instructions: Language of the Computer

Chapter 2. Instructions: Language of the Computer. HW#1: 1.3 all, 1.4 all, 1.6.1, , , , , and Due date: one week.

Stored Program Concept. Instructions: Characteristics of Instruction Set. Architecture Specification. Example of multiple operands

Processor. Han Wang CS3410, Spring 2012 Computer Science Cornell University. See P&H Chapter , 4.1 4

COMPSCI 313 S Computer Organization. 7 MIPS Instruction Set

EN164: Design of Computing Systems Topic 03: Instruction Set Architecture Design

Anne Bracy CS 3410 Computer Science Cornell University. See P&H Chapter: , , Appendix B

Thomas Polzer Institut für Technische Informatik

Branch Addressing. Jump Addressing. Target Addressing Example. The University of Adelaide, School of Computer Science 28 September 2015

Design for a simplified DLX (SDLX) processor Rajat Moona

Chapter 2. Instruction Set. RISC vs. CISC Instruction set. The University of Adelaide, School of Computer Science 18 September 2017

Computer Organization and Structure. Bing-Yu Chen National Taiwan University

Chapter 2. Computer Abstractions and Technology. Lesson 4: MIPS (cont )

Review: ISA. Review: Compiling Applications. Computer Architecture ELEC3441. Instruction Set Architecture (1) Computer Architecture: HW/SW Interface

Computer Architecture

ECE 154A Introduction to. Fall 2012

Reminder: tutorials start next week!

Computer Organization MIPS ISA

The MIPS Instruction Set Architecture

Procedure Call Instructions

Computer Organization and Structure. Bing-Yu Chen National Taiwan University

comp 180 Lecture 10 Outline of Lecture Procedure calls Saving and restoring registers Summary of MIPS instructions

Computer Architecture. MIPS Instruction Set Architecture

MIPS Memory Access Instructions

Control Instructions. Computer Organization Architectures for Embedded Computing. Thursday, 26 September Summary

Control Instructions

101 Assembly. ENGR 3410 Computer Architecture Mark L. Chang Fall 2009

Midterm. Sticker winners: if you got >= 50 / 67

Instructions: Language of the Computer

ISA: The Hardware Software Interface

CS 61C: Great Ideas in Computer Architecture RISC-V Instruction Formats

9/14/17. Levels of Representation/Interpretation. Big Idea: Stored-Program Computer

Computer Organization and Components

CS 61C: Great Ideas in Computer Architecture More RISC-V Instructions and How to Implement Functions

MIPS Datapath. MIPS Registers (and the conventions associated with them) MIPS Instruction Types

Chapter 2. Instructions: Language of the Computer

INSTRUCTION SET COMPARISONS

Stored Program Concept. Instructions: Characteristics of Instruction Set. Architecture Specification. Example of multiple operands

MIPS ISA and MIPS Assembly. CS301 Prof. Szajda

are Softw Instruction Set Architecture Microarchitecture are rdw

Computer Architecture Instruction Set Architecture part 2. Mehran Rezaei

ECE232: Hardware Organization and Design. Computer Organization - Previously covered

Math 230 Assembly Programming (AKA Computer Organization) Spring 2008


A Processor. Kevin Walsh CS 3410, Spring 2010 Computer Science Cornell University. See: P&H Chapter , 4.1-3

CS 4200/5200 Computer Architecture I

MIPS ISA. 1. Data and Address Size 8-, 16-, 32-, 64-bit 2. Which instructions does the processor support

Lecture 11: Interrupt and Exception. James C. Hoe Department of ECE Carnegie Mellon University

Instruction Set Architecture (ISA)

COMPUTER ORGANIZATION AND DESIGN

Character Is a byte quantity (00~FF or 0~255) ASCII (American Standard Code for Information Interchange) Page 91, Fig. 2.21

CS 61c: Great Ideas in Computer Architecture

CENG3420 L03: Instruction Set Architecture

Overview. Introduction to the MIPS ISA. MIPS ISA Overview. Overview (2)

Mark Redekopp, All rights reserved. EE 352 Unit 3 MIPS ISA

CMSC Computer Architecture Lecture 2: ISA. Prof. Yanjing Li Department of Computer Science University of Chicago

CS31001 COMPUTER ORGANIZATION AND ARCHITECTURE. Debdeep Mukhopadhyay, CSE, IIT Kharagpur. Instructions and Addressing

ECE 4750 Computer Architecture, Fall 2014 T01 Single-Cycle Processors

Lectures 3-4: MIPS instructions

Inf2C - Computer Systems Lectures 3-4 Assembly Language

Instruction Set Principles. (Appendix B)

EEM 486: Computer Architecture. Lecture 2. MIPS Instruction Set Architecture

Chapter 3 MIPS Assembly Language. Ó1998 Morgan Kaufmann Publishers 1

Architecture II. Computer Systems Laboratory Sungkyunkwan University

CS222: MIPS Instruction Set

5/17/2012. Recap from Last Time. CSE 2021: Computer Organization. The RISC Philosophy. Levels of Programming. Stored Program Computers

Recap from Last Time. CSE 2021: Computer Organization. Levels of Programming. The RISC Philosophy 5/19/2011

MIPS Instruction Set Architecture (2)

Architecture I. Computer Systems Laboratory Sungkyunkwan University

M2 Instruction Set Architecture

Transcription:

18 447 Lecture 2: RISC V Instruction Set Architecture James C. Hoe Department of ECE Carnegie Mellon University 18 447 S18 L02 S1, James C. Hoe, CMU/ECE/CALCM, 2018

Your goal today Housekeeping get bootstrapped on RISC V RV32I to start Lab 1 (will revisit general ISA issues in L04) Notices Student survey on Canvas, due Wed Lab 1, Part A, due week of 1/29 Lab 1, Part B, due week of 2/5 Readings P&H Ch2 P&H Ch4.1~4.4 (next time) 18 447 S18 L02 S2, James C. Hoe, CMU/ECE/CALCM, 2018

How to specify what a computer does? Architectural Level a clock has a hour hand and a minute hand,... a computer does.????. You can read a clock without knowing how it works Microarchitecture Level a particular clockwork has a certain set of gears arranged in a certain configuration a particular computer design has a certain datapath and a certain control logic Realization Level machined alloy gears vs stamped sheet metal CMOS vs ECL vs vacuum tubes 18 447 S18 L02 S3, James C. Hoe, CMU/ECE/CALCM, 2018 conceptual physical [Computer Architecture, Blaauw and Brooks, 1997]

Stored Program Architecture a.k.a. von Neumann Memory holds both program and data instructions and data in a linear memory array instructions can be modified as data Sequential instruction processing 1. program counter (PC) identifies current instruction 2. fetch instruction from memory 3. update some state (e.g. PC and memory) as a function of current state (according to instruction) 4. repeat program counter Dominant paradigm since its invention 18 447 S18 L02 S4, James C. Hoe, CMU/ECE/CALCM, 2018 0 1 2 3 4 5...

Very Different Architectures Exist Consider a von Neumann program What is the significance of the program order? What is the significance of the storage locations? Dataflow program instruction ordering implied by data dependence instruction specifies who receives the result instruction executes when operands received no program counter, no intermediate state 18 447 S18 L02 S5, James C. Hoe, CMU/ECE/CALCM, 2018 v := a+ b ; w:= b*2 ; x := v w ; y := v +w ; z := x *y; + *2 + [dataflow figure and example from Arvind] a * z b

Parallel Random Access Memory P 1 P 2 P 3 P n Memory 18 447 S18 L02 S6, James C. Hoe, CMU/ECE/CALCM, 2018 Do you naturally think parallel or sequential?

Instruction Set Architecture (ISA) 18 447 S18 L02 S7, James C. Hoe, CMU/ECE/CALCM, 2018

18 447 S18 L02 S8, James C. Hoe, CMU/ECE/CALCM, 2018 ISA in a nut shell A stable programming target (typically 15~20 yr) binary compatibility for SW investments permits adoption of foreseeable technology Better to compromise immediate optimality for future scalability and compatibility Dominant paradigm has been von Neumann program visible state: memory, registers, PC, etc. instructions to modified state; each prescribes which state elements are read which state elements including PC updated how to compute new values of update state Atomic, sequential, in order

3 Instruction Classes (as convention) Arithmetic and logical operations fetch operands from specified locations compute a result as a function of the operands store result to a specified location update PC to the next sequential instruction Data movement operations (no compute) fetch operands from specified locations store operand values to specified locations update PC to the next sequential instruction Control flow operations (affects only PC) fetch operands from specified locations compute a branch condition and a target address if branch condition is true then PC target address else PC next seq. instruction 18 447 S18 L02 S9, James C. Hoe, CMU/ECE/CALCM, 2018

Complete ISA Picture User level ISA state and instructions available to user programs single user abstraction on top a virtualization For this course and for now, RV32I of RISC V Virtual Environment Architecture state and instructions to control virtualization (e.g., caches, sharing) user level, but for need to know uses Operating Environment Architecture state and instructions to implement virtualization privileged/protected access reserved for OS system arch 18 447 S18 L02 S10, James C. Hoe, CMU/ECE/CALCM, 2018

RV32I Program Visible State program counter 32 bit byte address of current instruction M[0] M[1] M[2] M[3] M[4] **note** x0=0 x1 x2 general purpose register file 32x 32 bit words named x0...x31 M[N 1] 32 bit memory address: 2 32 by 8 bit locations (4 GBytes) (there is some magic going on) 18 447 S18 L02 S11, James C. Hoe, CMU/ECE/CALCM, 2018

Register Register ALU Instructions Assembly (e.g., register register addition) ADD rd, rs1, rs2 Machine encoding 0000000 7 bit Semantics GPR[rd] GPR[rs1] + GPR[rs2] PC PC + 4 Exceptions: none (ignore carry and overflow) Variations Arithmetic: {ADD, SUB} Compare: {signed, unsigned} x {Set if Less Than} Logical: {AND, OR, XOR} Shift: {Left, Right Logical, Right Arithmetic} 18 447 S18 L02 S12, James C. Hoe, CMU/ECE/CALCM, 2018 rs2 5 bit rs1 5 bit 000 3 bit rd 5 bit 0110011 7 bit

Assembly Programming 101 Break down high level program expressions into a sequence of elemental operations E.g. High level Code Assembly Code f = ( g + h ) ( i + j ) suppose f, g, h, i, j are in r f, r g, r h, r i, r j suppose r temp is a free register add r temp r g r h # r temp = g+h add r f r i r j # r f = i+j sub r f r temp r f # f = r temp r f 18 447 S18 L02 S13, James C. Hoe, CMU/ECE/CALCM, 2018

Reg Reg Instruction Encodings 32 bit R type ALU 18 447 S18 L02 S14, James C. Hoe, CMU/ECE/CALCM, 2018 [from page 54, The RISC V Instruction Set Manual]

Reg Immediate ALU Instructions Assembly (e.g., reg immediate additions) ADDI rd, rs1, imm 12 Machine encoding imm[11:0] 12 bit Semantics GPR[rd] GPR[rs1] + sign extend (imm) PC PC + 4 Exceptions: none (ignore carry and overflow) Variations Arithmetic: {ADDI, SUBI} Compare: {signed, unsigned} x {Set if Less Than Imm} Logical: {ANDI, ORI, XORI} **Shifts by unsigned imm[4:0]: {SLLI, SRLI, SRAI} 18 447 S18 L02 S15, James C. Hoe, CMU/ECE/CALCM, 2018 rs1 5 bit 000 3 bit rd 5 bit 0010011 7 bit

Reg Immediate ALU Inst. Encodings sign extended immediate unsigned Note: SLTIU does unsigned compare with sign extended immediate [from page 54, The RISC V Instruction Set Manual] 18 447 S18 L02 S16, James C. Hoe, CMU/ECE/CALCM, 2018 matches R type encoding 32 bit I type ALU

Load Store Architecture RV32I ALU instructions operates only on register operands next PC always PC+4 A distinct set of load and store instructions dedicated to copying data between register and memory next PC always PC+4 Another set of control flow instructions dedicated to manipulating PC (branch, jump, etc.) does not effect memory or other registers 18 447 S18 L02 S17, James C. Hoe, CMU/ECE/CALCM, 2018

Load Instructions Assembly (e.g., load 4 byte word) LW rd, offset 12 (base) Machine encoding offset[11:0] 12 bit Semantics byte_address 32 = sign extend(offset 12 ) + GPR[base] GPR[rd] MEM 32 [byte_address] PC PC + 4 Exceptions: none for now Variations: LW, LH, LHU, LB, LBU e.g., LB :: GPR[rd] sign extend(mem 8 [byte_address]) LBU :: GPR[rd] zero extend(mem 8 [byte_address]) Note: RV32I memory is byte addressable, little endian 18 447 S18 L02 S18, James C. Hoe, CMU/ECE/CALCM, 2018 base 5 bit 010 3 bit rd 5 bit 0000011 7 bit rs1

18 447 S18 L02 S19, James C. Hoe, CMU/ECE/CALCM, 2018 Big Endian vs. Little Endian (Part I, Chapter 4, Gulliver s Travels) 32 bit signed or unsigned integer word is 4 bytes By convention we write MSB on left MSB (most significant) 8 bit 8 bit 8 bit 8 bit On a byte addressable machine....... Big Endian Little Endian MSB LSB byte 0 byte 1 byte 2 byte 3 byte 4 byte 5 byte 6 byte 7 byte 8 byte 9 byte 10 byte 11 byte 12 byte 13 byte 14 byte 15 byte 16 byte 17 byte 18 byte 19 pointer points to the big end MSB byte 3 byte 7 byte 11 byte 15 byte 19 What difference does it make? byte 2 byte 6 byte 10 byte 14 byte 18 byte 1 byte 5 byte 9 byte 13 byte 17 LSB (least significant) LSB byte 0 byte 4 byte 8 byte 12 byte 16 pointer points to the little end check out htonl(), ntohl() in in.h

Load/Store Data Alignment MSB byte 3 byte 2 byte 1 byte 0 byte 7 byte 6 byte 5 byte 4 LSB Common case is aligned loads and stores physical implementations of memory and memory interface optimize for natural alignment boundaries (i.e., return an aligned 4 byte word per access) unaligned loads or stores would require 2 separate accesses to memory Was common for RISC ISAs to disallow misaligned loads/stores; if necessary, use a code sequence of aligned loads/stores and shifts RV32I allows misaligned loads/stores but warns it could be very slow; if necessary,...... 18 447 S18 L02 S20, James C. Hoe, CMU/ECE/CALCM, 2018

18 447 S18 L02 S21, James C. Hoe, CMU/ECE/CALCM, 2018 Store Instructions Assembly (e.g., store 4 byte word) SW rs2, offset 12 (base) Machine encoding offset[11:5] 7 bit rs2 5 bit base 5 bit 010 3 bit ofst[4:0] 5 bit 0100011 7 bit Semantics byte_address 32 = sign extend(offset 12 )+ GPR[base] MEM 32 [byte_address] GPR[rs2] PC PC + 4 Exceptions: none for now Variations: SW, SH, SB e.g., SB:: MEM 8 [byte_address] (GPR[rs2])[7:0]

Assembly Programming 201 E.g. High level Code A[ 8 ] = h + A[ 0 ] where A is an array of integers (4 bytes each) Assembly Code suppose &A, h are in r A, r h suppose r temp is a free register LW r temp 0(r A ) # r temp = A[0] add r temp r h r temp # r temp = h + A[0] SW r temp 32(r A ) # A[8] = r temp # note A[8] is 32 bytes # from A[0] 18 447 S18 L02 S22, James C. Hoe, CMU/ECE/CALCM, 2018

Load/Store Encodings Both needs 2 register operands and 1 12 bit immediate 18 447 S18 L02 S23, James C. Hoe, CMU/ECE/CALCM, 2018 [from page 54, The RISC V Instruction Set Manual]

RV32I Immediate Encoding RV32I adopts 2 different register immediate formats (I vs S) to keep rs2 operand at inst[24:20] always Most RISCs had 1 register immediate format opcode 6 bit rs 5 bit rt 5 bit immediate 16 bit rt field used as a source (e.g., store) or dest (e.g., load) also common to opt for longer 16 bit immediate RV32I encodes immediate in non consecutive bits 18 447 S18 L02 S24, James C. Hoe, CMU/ECE/CALCM, 2018

RV32I Instruction Formats All instructions 4 byte long and 4 byte aligned in mem R type: 3 register operands I type: 2 register operands (with dest) and 12 bit imm S(B) type: 2 register operands (no dest) and 12 bit imm U(J) type, 1 register operation (dest) and 20 bit imm 18 447 S18 L02 S25, James C. Hoe, CMU/ECE/CALCM, 2018 Aimed to simplify decoding and field extraction

C Code Control Flow Instructions Control Flow Graph code A Assembly Code (linearized) code A { code A } if X==Y then { code B } else { code C } { code D } True if X==Y False code B code C code D if X==Y goto code C goto code B code D basic blocks (1 way in, 1 way out, all or nothing) 18 447 S18 L02 S26, James C. Hoe, CMU/ECE/CALCM, 2018

(Conditional) Branch Instructions Assembly (e.g., branch if equal) BEQ rs1, rs2, imm 13 Note: implicit imm[0]=0 Machine encoding imm[12 10:5] 7 bit Semantics target = PC + sign extend(imm 13 ) if GPR[rs1]==GPR[rs2] then PC target else PC PC + 4 How far can you jump? Exceptions: misaligned target (4 byte) if taken Variations BEQ, BNE, BLT, BGE, BLTU, BGEU 18 447 S18 L02 S27, James C. Hoe, CMU/ECE/CALCM, 2018 rs2 5 bit rs1 5 bit 000 3 bit imm[4:1 11] 5 bit 1100011 7 bit

Assembly Programming 301 E.g. High level Code 18 447 S18 L02 S28, James C. Hoe, CMU/ECE/CALCM, 2018 if (i == j) then e = g else e = h f = e fork then else join Assembly Code suppose e, f, g, h, i, j are in r e, r f, r g, r h, r i, r j bne r i r j L1 # L1 and L2 are addr labels # assembler computes offset add r e r g x0 # e = g beq x0 L2 L1: add r e r h x0 # e = h L2: add r f r e x0 # f = e

Function Call and Return... A: BEQ x0, x0, F...... B: BEQ x0, x0, F... F:......... BEQ x0, x0, A+4? A function return need to 1. jump back to different callers 2. know where to jump back to 18 447 S18 L02 S29, James C. Hoe, CMU/ECE/CALCM, 2018

Jump Instruction Assembly JAL rd imm 21 Machine encoding Note: implicit imm[0]=0 imm[20 10:1 11 19:12] 20 bit rd 5 bit 1101111 7 bit Semantics target =PC + sign extend(imm 21 ) GPR[rd] PC + 4 PC target How far can you jump? Exceptions: misaligned target (4 byte) UJ type *Note*: use JAL x0 label instead of BEQ x0 x0 label 18 447 S18 L02 S30, James C. Hoe, CMU/ECE/CALCM, 2018

Jump Indirect Instruction Assembly JALR rd, rs1, imm 12 Machine encoding imm[11:0] 12 bit rs1 5 bit 000 3 bit rd 5 bit 1100111 7 bit Semantics target =GPR[rs1] + sign extend(imm 12 ) target &= 0xffff_fffe GPR[rd] PC + 4 PC target How far can you jump? Exceptions: misaligned target (4 byte) 18 447 S18 L02 S31, James C. Hoe, CMU/ECE/CALCM, 2018

Assembly Programming 401 Caller... code A... JAL x1, _myfxn... code C... JAL x1, _myfxn... code D... Callee _myfxn:... code B... JALR x0,x1,0... A call B return C call B return D... How do you pass argument between caller and callee? IfA set x10 to 1, what is the value of x10 when B returns to C? What registers can B use? What happens to x1 if B calls another function 18 447 S18 L02 S32, James C. Hoe, CMU/ECE/CALCM, 2018

Caller and Callee Saved Registers Callee Saved Registers Caller says to callee, The values of these registers should not change when you return to me. Callee says, If I need to use these registers, I promise to save the old values to memory first and restore them before I return to you. Caller Saved Registers Caller says to callee, If there is anything I care about in these registers, I already saved it myself. Callee says to caller, Don t count on them staying the same values after I am done. Unlike endianness, this is not arbitrary 18 447 S18 L02 S33, James C. Hoe, CMU/ECE/CALCM, 2018 When to use which?

RISC V Register Usage Convention 18 447 S18 L02 S34, James C. Hoe, CMU/ECE/CALCM, 2018 [from page 100, The RISC V Instruction Set Manual]

Memory Usage Convention high address stack space grow down grow up free space stack pointer GPR[x2] dynamic data static data text binary executable low address reserved 18 447 S18 L02 S35, James C. Hoe, CMU/ECE/CALCM, 2018

function Basic Calling Convention 1. caller saves caller saved registers 2. caller loads arguments into a0~a7 (x10~x17) 3. caller jumps to callee using JAL x1 4. callee allocates space on the stack (dec. stack pointer) 5. callee saves callee saved registers to stack... body of callee (can nest additional calls)... 6. callee loads results to a0, a1 (x10, x11) 7. callee restores saved register values 8. JALR x0, x1 9. caller continues with return values in a0, a1 prologue epilogue 18 447 S18 L02 S36, James C. Hoe, CMU/ECE/CALCM, 2018

Terminologies Instruction Set Architecture machine state and functionality as observable and controllable by the programmer Instruction Set set of commands supported Machine Code instructions encoded in binary format directly consumable by the hardware Assembly Code instructions in textual form, e.g. add r1, r2, r3 converted to machine code by an assembler one to one correspondence with machine code (mostly true: compound instructions, labels...) 18 447 S18 L02 S37, James C. Hoe, CMU/ECE/CALCM, 2018

We didn t talk about Privileged Modes user vs. supervisor Exception Handling trap to supervisor handling routine and back Virtual Memory each process has 4 GBytes of private, large, linear and fast memory? Floating Point Instructions 18 447 S18 L02 S38, James C. Hoe, CMU/ECE/CALCM, 2018