Lecture 9: Microcontrolled Multi-Cycle Implementations
|
|
- Robert Nathaniel Allison
- 6 years ago
- Views:
Transcription
1 8-447 Lectre 9: icroled lti-cycle Implementations James C. Hoe Dept of ECE, CU Febrary 8, 29 S 9 L9- Annoncements: P&H Appendi D Get started t on Lab Handots: Handot #8: Project (on Blackboard) Single-Cycle Implementations S 9 L9-2 atches natrally with the seqential and atomic semantics inherent to most ISAs instantiate programmer-visible state one-for-one map instrctions to combinational net-state logic Bt, contrived and inefficient all instrctions rn as slow as the slowest instrction mst provide worst-case combinational resorce in parallel as reqired by any instrction Not necessarily the simplest way to implement an ISA Imagine trying to bild a single-cycle implementation of a CISC ISA
2 Single-Cycle Datapath Analysis Assme memory nits (read or write): 2 ps ALU and adders: ps file (read or write): 5 ps other combinational logic: ps S 9 L9-3 steps IF ID EX E WB resorces mem RF ALU mem RF Delay R-type I-type LW SW Branch Jmp 2 2 Worksheet S 9 L9-4 [25 ] Shift Jmp address [3 ] left PCSrc =Jmp PC+4 [3 28] Add ALU reslt 4 Add RegDst Jmp Branch Shift left 2 PCSrc 2 =Br Taken [3 26] Control em emtoreg ALUOp em ALUSrc Reg PC address [3 ] memory [25 2] [2 6] [5 ] 2 Registers 2 bcond Zero ALU ALU reslt Address Data memory [5 ] 6 32 Sign etend ALU ALU operation [5 ] [Based on figres from P&H CO&D, COPYRIGHT 24 Elsevier. ALL RIGHTS RESERVED.]
3 S 9 L9-5 lti-cycle Implementation: Ver. Why not let each instrction type take only as mch time as it needs Idea rn a 5 ps clock let each instrction type take as many clock cycles as needed programmer-visible state only pdates at the end of an instrction s cycle-seqence an instrction s s effect is still prely combinational from PVS to PVS A more realistic alternative to the variable-length clock design in the tetbook lti-cycle Datapath: Ver. S 9 L9-6 [25 ] Shift Jmp address [3 ] left PCSrc =Jmp PC+4 [3 28] Add ALU reslt 4 Add RegDst Jmp Branch Shift left 2 PCSrc 2 =Br Taken [3 26] Control em emtoreg ALUOp em ALUSrc Reg PC address [3 ] memory [25 2] [2 6] [5 ] 2 Registers 2 bcond Zero ALU ALU reslt Address Data memory [5 ] 6 32 Sign etend ALU PVSEn [5 ] [Based on figres from P&H CO&D, COPYRIGHT 24 Elsevier. ALL RIGHTS RESERVED.]
4 Seqential Control: Ver. S 9 L9-7 IF IF 2 IF 3 IF 4 ID Jmp / PVSEn EX WB Branch / PVSEn I-type or R-type EX 2 LW or SW LW SW / PVSEn E 4 E 3 E 2 E What abot the rest of the signals? icroseqencer: Ver. S 9 L9-8 RO as a combinational logic lookp table ** RO size grows as O(2 n ) as the nmber of inpts literally holds the trth table Combinational logic 2 n rows address Inpts n / inpt Otpts PVSEn Datapath otpts ** RO size grows as O(m) as the nmber of otpts ealy or oore? Inpts from instrction opcode field State Net state [Based on figres from P&H CO&D, COPYRIGHT 24 Elsevier. ALL RIGHTS RESERVED.]
5 icrocoding: Ver. S 9 L9-9 state label cntrl flow conditional targets R/I- type LW SW Br Jmp IF net IF 2 net IF 3 net IF 4 branch ID ID ID ID IF ID net EX net EX 2 branch WB E E IF - E net E 2 net E 3 net E 4 net - WB IF - - WB branch IF IF IF IF IF A systematic approach to FS seqencing/ icroler A stripped-down processor for seqencing and states are like µpc µpc indeed into a µprogram RO to select an µinstrction well-formed -flow architectre fields in the µinstrction maps to signals Why not also spport µprogram-visible states and ALU ops? Very elaborate mlers have been bilt some µlers had fll-fledged ISAs µisas for CISC machines in many ways inspired RISC ISA Adder icrocode storage Inpt Otpts icroprogram conter Address select logic Inpts from instrction opcode field Seqencing S 9 L9- Datapath otpts [Based on figres from P&H CO&D, COPYRIGHT 24 Elsevier. ALL RIGHTS RESERVED.]
6 Performance Analysis S 9 L9- T wall-clock = No.s CPI T clk No. is fied for a given ISA and application mi For a fied ISA and application mi it is meaningfl to compare T avg-inst = CPI T clk or IPS = IPC f clk million instrctions per second freqency in Hz instrctions per cycle Performance Analysis S 9 L9-2 Single-Cycle Implementation,667Hz = 667 IPS lti-cycle Implementation IPC avg 2, Hz = 2235 IPS what is average behavior? Assme: 25% LW, % SW, 45% ALU, 5% Branch and 5% Jmps weighted arithmetic mean of CPI 8.95 weighted harmonic mean of IPC.7
7 S 9 L9-3 Redcing Datapath by Resorce Rese Add How to rese the same adder for both additions in the same instrction PC address memory 4 25:2 2:6 5: RegDest isitype 2 Registers Reg Sign etend 3 ALU operatio Zero ALU ALU reslt ALUSrc isitype Previos eample of rese by mtally eclsive conditions [Based on figres from P&H CO&D, COPYRIGHT 24 Elsevier. ALL RIGHTS RESERVED.] Redcing Datapath by Seqential Rese S 9 L9-4 Add PC address memory 4 IR 25:2 2:6 2 Registers 5: d 2 Reg 4 3 ALU Zero ALU ALU reslt to IR or not to IR? [Based on figres from P&H CO&D, COPYRIGHT 24 Elsevier. ALL RIGHTS RESERVED.]
8 Removing Redndancies S 9 L9-5 PC Address emory Data or emory Data Register # Registers Register # Register # A B ALU ALUOt - Cold also redce down to a single read-write port! - Do we want to remove all redndancies? [Based on figres from P&H CO&D, COPYRIGHT 24 Elsevier. ALL RIGHTS RESERVED.] Control Points S 9 L9-6 PCCond PCSorce PC Address emory emdata [3-26] [25 2] [2 6] [5 ] [5 ] emory PC IorD em em emtoreg IR [25 ] [5 ] Otpts Control Op [5 ] ALUOp ALUSrcB ALUSrcA Reg RegDst 6 2 Registers Sign etend 2 32 Shift left 2 A B Shift left 2 ALU PC [3-28] Zero ALU ALU reslt Jmp address [3-] ALUOt 2 [5 ] [Based on figres from P&H CO&D, COPYRIGHT 24 Elsevier. ALL RIGHTS RESERVED.]
9 New Seqential Control Signals S 9 L9-7 When De-asserted When asserted ALUSrcA IorD IR PC PCCond st ALU inpt from PC st ALU inpt from st RF read port (latched in A) PC spplies memory ALUOt spplies memory address address IR latching disabled no effect IR latching enabled PC latching enabled nconditionally no effect PC latching enabled only if branch condition is satisfied When both PC and PCCond are de-asserted, PC latching is disabled New Seqential Control Signals S 9 L9-8 signal ALUSrcB[:] PCSorce[:] effect 2 nd ALU inpt from 2 nd RF read port (latched in B) 2 nd ALU inpt is 4 (for PC increment) 2 nd ALU inpt is sign-etended IR[ 5: ] 2 nd ALU inpt is sign-etended IR[ 5: ], net PC from ALU net PC from ALUOt net PC from IR (jmp target) t)
10 Old Control Signals (similar to single-cycle) S 9 L9-9 RegDest Reg When De-asserted RF write select according to IR[2:6] RF write disabled When asserted RF write select according to IR[5:] RF write enabled em em emory read disabled emory read port retrn load vale emory write disabled emory write enabled emtoreg Steer ALU reslt (latched in ALUOt) to RF write port steer memory load reslt (latched in DR) to RF write port Synchronos Register Transfers S 9 L9-2 Synchronos state with latch enables PC, IR, RF, E Synchronos state that always latch A, B, ALUOt, DR One can enmerate all possible combinational transfers in the path For eample starting from PC IR E[ PC ] reqires IR=, IorD=, em= PC PC,JmpTarget reqires PC=, PCSorce=2 b PC PC+ALUSrcB DR E[ PC ] ALUOt PC + ALUSrcB
11 Usefl Register Transfers S 9 L9-2 PC PC+4 PC ALUOt (if PCCond is asserted) PC PC[ 3:28 ],IR[ 25: ],2 b IR E[ PC ] A RF[ IR[ 25:2 ] ] B RF[ IR[ 2:6 ] ] ALUOt A + B ALUOt A + sign-etend (IR[ 5: ]) ALUOt PC + ( sign-etend (IR[ 5: ]) <<2 ) DR E[ ALUOt ] E[ ALUOt ] B RF[ IR[ 5: ] ] ALUOt, RF[ IR[ 2:6 ] ] ALUOt RF[ IR[ 2:6 ] ] DR RT Seqencing: R-Type ALU S 9 L9-22 IF ID IR E[ PC ] PC PC+4 A RF[ IR[ 25:2 ] ] B RF[ IR[ 2:6 ] ] EX ALUOt A + B E WB if E[PC] == ADD rd rs rt GPR[rd] GPR[rs] + GPR[rt] PC PC + 4 RF[ IR[ 5: ] ] ALUOt
12 RT Datapath Conflicts S 9 L9-23 PC PC+4 PC ALUOt RF[ IR[ 5: ] ] ALUOt, RF[ IR[ 2:6 ] ] ALUOt PC PC[ 3:28 ],IR[ 25: ],2 b RF[ IR[ 2:6 ] ] DR IR E[ PC ] B RF[ IR[ 2:6 ] ] ALUOt A + sign-etend (IR[ 5: ]) ALUOt A + B A RF[ IR[ 25:2 ] ] ALUOt PC + ( sign-etend (IR[ 5: ]) <<2 ) E[ ALUOt ] B DR E[ ALUOt ] Can tilize each resorce only once per step (cycle) RT Seqencing: R-Type ALU S 9 L9-24 IF ID EX IR E[ PC ] PC PC+4 A RF[ IR[ 25:2 ] ] B RF[ IR[ 2:6 ] ] ALUOt A + B E WB RF[ IR[ 5: ] ] ALUOt step step 2 step 3 step 4
13 RT Seqencing: LW IF ID IR E[ PC ] PC PC+4 A RF[ IR[ 25:2 ] ] B RF[ IR[ 2:6 ] ] EX ALUOt A + sign-etend (IR[ 5: ]) E DR E[ ALUOt ] WB RF[ IR[ 2:6 ] ] DR S 9 L9-25 if E[PC]==LW rt offset 6 (base) EA = sign-etend(offset) + GPR[base] GPR[rt] E[ translate(ea) ] PC PC + 4 RT Seqencing: Branch IF ID EX IR E[ PC ] PC PC+4 A RF[ IR[ 25:2 ] ] B RF[ IR[ 2:6 ] ] ALUOt PC+( sign-etend (IR[ 5: ])<<2 ) PC ALUOt (only if condition is met) E S 9 L9-26 WB if E[PC]==BEQ rs rt immediate 6 target = PC + sign-etend(immediate) 4 +4 if GPR[rs]==GPR[rt] then PC target else PC PC + 4
14 Combined RT Seqencing S 9 L9-27 com mmon steps opcode d ependent steps R-Type LW SW Branch Jmp start: IR E[ PC ] PC PC+4 A RF[ IR[ 25:2 ] ] B RF[ IR[ 2:6 ] ] ALUOt PC + ( sign-etend case (IR[ 5: ]) <<2 ) opcode net PC ALUOt A + ALUOt A + signetend (IR[ 5: ]) etend (IR[ 5: ]) gotoir[25:],2 b net net net start start ALUOt A + sign- PC ALUOt PC[3:28], B RF[ IR[5:] ] DR E[ALUOt] B ALUOt E[ALUOt] goto goto net start start RF[IR[5:]] DR goto start RTs in each state corresponds to some setting of the signals Horizontal icrocode S 9 L9-28 icrocode storage Otpts n-bit µpc inpt Inpt ALUSrcA IorD Datapath IR otpts PC PCCond. k-bit otpt Adder icroprogram conter Seqencing Address select logic Inpts from instrction opcode field [Based on figres from P&H CO&D, COPYRIGHT 24 Elsevier. ALL RIGHTS RESERVED.] Control Store: 2 n k bit (not inclding seqencing)
15 Vertical icrocode icrocode storage Otpts n-bit µpc inpt Inpt PC PC+4 PC ALUOt Datapath PC PC[ 3:28 ],IR[ 25: ],2 b IR E[ PC ] otpts A RF[ IR[ 25:2 ] ] B RF[ IR[ 2:6 ] ].. S 9 L9-29 -bit signal means do this RT (or combination of RTs) m-bit inpt Adder icroprogram conter Seqencing RO Address select logic k-bit otpt [Based on figres from P&H CO&D, COPYRIGHT 24 Elsevier. ALL RIGHTS RESERVED.] Inpts from instrction opcode field ALUSrcA IorD IR PC PCCond. If done right (i.e., m<<n, and m<<k), two ROs together (2 n m+2 m k bit) shold be smaller than horizontal microcode RO (2 n k bit) icrocoding for CISC S 9 L9-3 Can we etend the µler and path to spport a new instrction I haven t thoght of yet to spport a comple instrction, e.g. polyf to spport a comple instrction, e.g. polyf Yes, and probably more if I can seqence an arbitrary RISC instrction then I can seqence an arbitrary RISC program as a µprogram seqence will need some µisa state (e.g. loop conters) for more elaborate µprograms l b ISA f l k lif i more elaborate µisa featres also make life easier µcoding allows very simple path do very powerfl comptation a path as simple as a Trning machine is niversal µcode enables a minimal path to emlate any ISA yo like (with a commensrate slow down)
16 Nanocode and illicode Nanocode a level below µcode µprogrammed for sb-systems (e.g., a complicated cat floating-point nt modle) that acts as a slave in a µled path e.g., the polyf seqence may be generated by a separate nanoler in the FPU illicode a level above µcode ISA-level sbrotines hardcoded into a RO that can be called by the µler t to handle really complicated operations e.g., to add polyf to IPS, one may code p polyf as a software rotine that is called by the µler when the polyf opcode is decoded In both cases, we avoid complicating the main µler with polyf spport The power of abstractions!! S 9 L9-3 Nanocode Concept Illstrated S 9 L9-32 a mcoded processor implementation RO PC processor path We refer to this as nanocode when a mcoded sbsystem is embedded in a mcoded system a mcoded FPU implementation RO PC arithmetic path
Lecture 6: Microprogrammed Multi Cycle Implementation. James C. Hoe Department of ECE Carnegie Mellon University
8 447 Lectre 6: icroprogrammed lti Cycle Implementation James C. Hoe Department of ECE Carnegie ellon University 8 447 S8 L06 S, James C. Hoe, CU/ECE/CALC, 208 Yor goal today Hosekeeping nderstand why
More informationLecture 9: Microcontrolled Multi-Cycle Implementations. Who Am I?
18-447 Lecture 9: Microcontrolled Multi-Cycle Implementations S 10 L9-1 James C. Hoe José F. Martínez Electrical & Computer Engineering Carnegie Mellon University February 1, 2010 Who Am I? S 10 L9-2 Associate
More informationComputer Architecture
Compter Architectre Lectre 4: Intro to icroarchitectre: Single- Cycle Dr. Ahmed Sallam Sez Canal University Based on original slides by Prof. Onr tl Review Compter Architectre Today and Basics (Lectres
More informationComputer Architecture
Compter Architectre Lectre 4: Intro to icroarchitectre: Single- Cycle Dr. Ahmed Sallam Sez Canal University Spring 25 Based on original slides by Prof. Onr tl Review Compter Architectre Today and Basics
More informationThe single-cycle design from last time
lticycle path Last time we saw a single-cycle path and control nit for or simple IPS-based instrction set. A mlticycle processor fies some shortcomings in the single-cycle CPU. Faster instrctions are not
More informationThe final datapath. M u x. Add. 4 Add. Shift left 2. PCSrc. RegWrite. MemToR. MemWrite. Read data 1 I [25-21] Instruction. Read. register 1 Read.
The final path PC 4 Add Reg Shift left 2 Add PCSrc Instrction [3-] Instrction I [25-2] I [2-6] I [5 - ] register register 2 register 2 Registers ALU Zero Reslt ALUOp em Data emtor RegDst ALUSrc em I [5
More informationComputer Architecture Chapter 5. Fall 2005 Department of Computer Science Kent State University
Compter Architectre Chapter 5 Fall 25 Department of Compter Science Kent State University The Processor: Datapath & Control Or implementation of the MIPS is simplified memory-reference instrctions: lw,
More informationThe extra single-cycle adders
lticycle Datapath As an added bons, we can eliminate some of the etra hardware from the single-cycle path. We will restrict orselves to sing each fnctional nit once per cycle, jst like before. Bt since
More informationReview Multicycle: What is Happening. Controlling The Multicycle Design
Review lticycle: What is Happening Reslt Zero Op SrcA SrcB Registers Reg Address emory em Data Sign etend Shift left Sorce A B Ot [-6] [5-] [-6] [5-] [5-] Instrction emory IR RegDst emtoreg IorD em em
More informationExceptions and interrupts
Eceptions and interrpts An eception or interrpt is an nepected event that reqires the CPU to pase or stop the crrent program. Eception handling is the hardware analog of error handling in software. Classes
More informationCSE Introduction to Computer Architecture Chapter 5 The Processor: Datapath & Control
CSE-45432 Introdction to Compter Architectre Chapter 5 The Processor: Datapath & Control Dr. Izadi Data Processor Register # PC Address Registers ALU memory Register # Register # Address Data memory Data
More information1048: Computer Organization
48: Compter Organization Lectre 5 Datapath and Control Lectre5B - mlticycle implementation (cwli@twins.ee.nct.ed.tw) 5B- Recap: A Single-Cycle Processor PCSrc 4 Add Shift left 2 Add ALU reslt PC address
More informationThe multicycle datapath. Lecture 10 (Wed 10/15/2008) Finite-state machine for the control unit. Implementing the FSM
Lectre (Wed /5/28) Lab # Hardware De Fri Oct 7 HW #2 IPS programming, de Wed Oct 22 idterm Fri Oct 2 IorD The mlticycle path SrcA Today s objectives: icroprogramming Etending the mlti-cycle path lti-cycle
More information1048: Computer Organization
48: Compter Organization Lectre 5 Datapath and Control Lectre5A - simple implementation (cwli@twins.ee.nct.ed.tw) 5A- Introdction In this lectre, we will try to implement simplified IPS which contain emory
More informationReview: Computer Organization
Review: Compter Organization Pipelining Chans Y Landry Eample Landry Eample Ann, Brian, Cathy, Dave each have one load of clothes to wash, dry, and fold Washer takes 3 mintes A B C D Dryer takes 3 mintes
More informationReview. A single-cycle MIPS processor
Review If three instrctions have opcodes, 7 and 5 are they all of the same type? If we were to add an instrction to IPS of the form OD $t, $t2, $t3, which performs $t = $t2 OD $t3, what wold be its opcode?
More informationEEC 483 Computer Organization
EEC 483 Compter Organization Chapter 4.4 A Simple Implementation Scheme Chans Y The Big Pictre The Five Classic Components of a Compter Processor Control emory Inpt path Otpt path & Control 2 path and
More informationPART I: Adding Instructions to the Datapath. (2 nd Edition):
EE57 Instrctor: G. Pvvada ===================================================================== Homework #5b De: check on the blackboard =====================================================================
More informationComputer Architecture. Lecture 6: Pipelining
Compter Architectre Lectre 6: Pipelining Dr. Ahmed Sallam Based on original slides by Prof. Onr tl Agenda for Today & Net Few Lectres Single-cycle icroarchitectres lti-cycle and icroprogrammed icroarchitectres
More informationCS 251, Winter 2018, Assignment % of course mark
CS 25, Winter 28, Assignment 4.. 3% of corse mark De Wednesday, arch 7th, 4:3P Lates accepted ntil Thrsday arch 8th, am with a 5% penalty. (6 points) In the diagram below, the mlticycle compter from the
More informationComputer Architecture Lecture 6: Multi-cycle Microarchitectures. Prof. Onur Mutlu Carnegie Mellon University Spring 2012, 2/6/2012
8-447 Compter Architectre Lectre 6: lti-cycle icroarchitectres Prof. Onr tl Carnegie ellon University Spring 22, 2/6/22 Reminder: Homeworks Homework soltions Check and stdy the soltions! Learning now is
More informationLecture 7. Building A Simple Processor
Lectre 7 Bilding A Simple Processor Christos Kozyrakis Stanford University http://eeclass.stanford.ed/ee8b C. Kozyrakis EE8b Lectre 7 Annoncements Upcoming deadlines Lab is de today Demo by 5pm, report
More informationEXAMINATIONS 2010 END OF YEAR NWEN 242 COMPUTER ORGANIZATION
EXAINATIONS 2010 END OF YEAR COPUTER ORGANIZATION Time Allowed: 3 Hors (180 mintes) Instrctions: Answer all qestions. ake sre yor answers are clear and to the point. Calclators and paper foreign langage
More informationPipelining. Chapter 4
Pipelining Chapter 4 ake processor rns faster Pipelining is an implementation techniqe in which mltiple instrctions are overlapped in eection Key of making processor fast Pipelining Single cycle path we
More informationProf. Kozyrakis. 1. (10 points) Consider the following fragment of Java code:
EE8 Winter 25 Homework #2 Soltions De Thrsday, Feb 2, 5 P. ( points) Consider the following fragment of Java code: for (i=; i
More informationQuiz #1 EEC 483, Spring 2019
Qiz # EEC 483, Spring 29 Date: Jan 22 Name: Eercise #: Translate the following instrction in C into IPS code. Eercise #2: Translate the following instrction in C into IPS code. Hint: operand C is stored
More informationEEC 483 Computer Organization
EEC 83 Compter Organization Chapter.6 A Pipelined path Chans Y Pipelined Approach 2 - Cycle time, No. stages - Resorce conflict E E A B C D 3 E E 5 E 2 3 5 2 6 7 8 9 c.y9@csohio.ed Resorces sed in 5 Stages
More informationCS 251, Winter 2018, Assignment % of course mark
CS 25, Winter 28, Assignment 3.. 3% of corse mark De onday, Febrary 26th, 4:3 P Lates accepted ntil : A, Febrary 27th with a 5% penalty. IEEE 754 Floating Point ( points): (a) (4 points) Complete the following
More informationTDT4255 Friday the 21st of October. Real world examples of pipelining? How does pipelining influence instruction
Review Friday the 2st of October Real world eamples of pipelining? How does pipelining pp inflence instrction latency? How does pipelining inflence instrction throghpt? What are the three types of hazard
More informationMIPS Architecture. An Example: MIPS. From the Harris/Weste book Based on the MIPS-like processor from the Hennessy/Patterson book
An Eample: IPS From the Harris/Weste book Based on the IPS-like processor from the Hennessy/Patterson book IPS Architectre w Eample: sbset of IPS processor architectre n Drawn from Patterson & Hennessy
More informationCS 251, Winter 2019, Assignment % of course mark
CS 25, Winter 29, Assignment.. 3% of corse mark De Wednesday, arch 3th, 5:3P Lates accepted ntil Thrsday arch th, pm with a 5% penalty. (7 points) In the diagram below, the mlticycle compter from the corse
More informationMIPS Architecture. Fibonacci (C) Fibonacci (Assembly) Another Example: MIPS. Example: subset of MIPS processor architecture
Another Eample: IPS From the Harris/Weste book Based on the IPS-like processor from the Hennessy/Patterson book IPS Architectre Eample: sbset of IPS processor architectre Drawn from Patterson & Hennessy
More informationLecture 5: The Processor
Lecture 5: The Processor CSCE 26 Computer Organization Instructor: Saraju P. ohanty, Ph. D. NOTE: The figures, text etc included in slides are borrowed from various books, websites, authors pages, and
More information1048: Computer Organization
8: Compter Organization Lectre 6 Pipelining Lectre6 - pipelining (cwli@twins.ee.nct.ed.tw) 6- Otline An overview of pipelining A pipelined path Pipelined control Data hazards and forwarding Data hazards
More informationChapter 6: Pipelining
CSE 322 COPUTER ARCHITECTURE II Chapter 6: Pipelining Chapter 6: Pipelining Febrary 10, 2000 1 Clothes Washing CSE 322 COPUTER ARCHITECTURE II The Assembly Line Accmlate dirty clothes in hamper Place in
More informationWhat do we have so far? Multi-Cycle Datapath
What do we have so far? lti-cycle Datapath CPI: R-Type = 4, Load = 5, Store 4, Branch = 3 Only one instrction being processed in datapath How to lower CPI frther? #1 Lec # 8 Spring2 4-11-2 Pipelining pipelining
More informationCS 251, Spring 2018, Assignment 3.0 3% of course mark
CS 25, Spring 28, Assignment 3. 3% of corse mark De onday, Jne 25th, 5:3 P. (5 points) Consider the single-cycle compter shown on page 6 of this assignment. Sppose the circit elements take the following
More informationLecture 13: Exceptions and Interrupts
18 447 Lectre 13: Eceptions and Interrpts S 10 L13 1 James C. Hoe Dept of ECE, CU arch 1, 2010 Annoncements: Handots: Spring break is almost here Check grades on Blackboard idterm 1 graded Handot #9: Lab
More informationReview. How to represent real numbers
PCWrite PC IorD Review ALUSrcA emread Address Write data emory emwrite em Data IRWrite [3-26] [25-2] [2-6] [5-] [5-] RegDst Read register Read register 2 Write register Write data RegWrite Read data Read
More informationLecture 3: Single Cycle Microarchitecture. James C. Hoe Department of ECE Carnegie Mellon University
8 447 Lecture 3: Single Cycle Microarchitecture James C. Hoe Department of ECE Carnegie Mellon University 8 447 S8 L03 S, James C. Hoe, CMU/ECE/CALCM, 208 Your goal today Housekeeping first try at implementing
More informationHardware Design Tips. Outline
Hardware Design Tips EE 36 University of Hawaii EE 36 Fall 23 University of Hawaii Otline Verilog: some sbleties Simlators Test Benching Implementing the IPS Actally a simplified 6 bit version EE 36 Fall
More informationLecture 10: Pipelined Implementations
U 8-7 S 9 L- 8-7 Lectre : Pipelined Implementations James. Hoe ept of EE, U Febrary 23, 29 nnoncements: Project is de this week idterm graded, d reslts posted Handots: H9 Homework 3 (on lackboard) Graded
More informationEnhanced Performance with Pipelining
Chapter 6 Enhanced Performance with Pipelining Note: The slides being presented represent a mi. Some are created by ark Franklin, Washington University in St. Lois, Dept. of CSE. any are taken from the
More informationCMSC Computer Architecture Lecture 4: Single-Cycle uarch and Pipelining. Prof. Yanjing Li University of Chicago
CMSC 22200 Computer Architecture Lecture 4: Single-Cycle uarch and Pipelining Prof. Yanjing Li University of Chicago Administrative Stuff! Lab1 due at 11:59pm today! Lab2 out " Pipeline ARM simulator "
More informationCPE 335. Basic MIPS Architecture Part II
CPE 335 Computer Organization Basic MIPS Architecture Part II Dr. Iyad Jafar Adapted from Dr. Gheith Abandah slides http://www.abandah.com/gheith/courses/cpe335_s08/index.html CPE232 Basic MIPS Architecture
More informationSolutions for Chapter 6 Exercises
Soltions for Chapter 6 Eercises Soltions for Chapter 6 Eercises 6. 6.2 a. Shortening the ALU operation will not affect the speedp obtained from pipelining. It wold not affect the clock cycle. b. If the
More informationEEC 483 Computer Organization. Branch (Control) Hazards
EEC 483 Compter Organization Section 4.8 Branch Hazards Section 4.9 Exceptions Chans Y Branch (Control) Hazards While execting a previos branch, next instrction address might not yet be known. s n i o
More informationCOMP303 - Computer Architecture Lecture 10. Multi-Cycle Design & Exceptions
COP33 - Computer Architecture Lecture ulti-cycle Design & Exceptions Single Cycle Datapath We designed a processor that requires one cycle per instruction RegDst busw 32 Clk RegWr Rd ux imm6 Rt 5 5 Rs
More informationCSSE232 Computer Architecture I. Mul5cycle Datapath
CSSE232 Compter Architectre I Ml5cycle Datapath Class Stats Next 3 days : Ml5cycle datapath ing Ml5cycle datapath is not in the book! How long do instrc5ons take? ALU 2ns Mem 2ns Reg File 1ns Everything
More informationInstruction fetch. MemRead. IRWrite ALUSrcB = 01. ALUOp = 00. PCWrite. PCSource = 00. ALUSrcB = 00. R-type completion
. (Chapter 5) Fill in the vales for SrcA, SrcB, IorD, Dst and emto to complete the Finite State achine for the mlti-cycle datapath shown below. emory address comptation 2 SrcA = SrcB = Op = fetch em SrcA
More informationEXAMINATIONS 2003 END-YEAR COMP 203. Computer Organisation
EXAINATIONS 2003 COP203 END-YEAR Compter Organisation Time Allowed: 3 Hors (180 mintes) Instrctions: Answer all qestions. There are 180 possible marks on the eam. Calclators and foreign langage dictionaries
More informationOverview of Pipelining
EEC 58 Compter Architectre Pipelining Department of Electrical Engineering and Compter Science Cleveland State University Fndamental Principles Overview of Pipelining Pipelined Design otivation: Increase
More informationComp 303 Computer Architecture A Pipelined Datapath Control. Lecture 13
Comp 33 Compter Architectre A Pipelined path Lectre 3 Pipelined path with Signals PCSrc IF/ ID ID/ EX EX / E E / Add PC 4 Address Instrction emory RegWr ra rb rw Registers bsw [5-] [2-6] [5-] bsa bsb Sign
More informationChapter 3 & Appendix C Pipelining Part A: Basic and Intermediate Concepts
CS359: Compter Architectre Chapter 3 & Appendi C Pipelining Part A: Basic and Intermediate Concepts Yanyan Shen Department of Compter Science and Engineering Shanghai Jiao Tong University 1 Otline Introdction
More informationPS Midterm 2. Pipelining
PS idterm 2 Pipelining Seqential Landry 6 P 7 8 9 idnight Time T a s k O r d e r A B C D 3 4 2 3 4 2 3 4 2 3 4 2 Seqential landry takes 6 hors for 4 loads If they learned pipelining, how long wold landry
More informationThe Processor. Z. Jerry Shi Department of Computer Science and Engineering University of Connecticut. CSE3666: Introduction to Computer Architecture
The Processor Z. Jerry Shi Department of Computer Science and Engineering University of Connecticut CSE3666: Introduction to Computer Architecture Introduction CPU performance factors Instruction count
More informationELEC 5200/6200 Computer Architecture and Design Spring 2017 Lecture 4: Datapath and Control
ELEC 52/62 Computer Architecture and Design Spring 217 Lecture 4: Datapath and Control Ujjwal Guin, Assistant Professor Department of Electrical and Computer Engineering Auburn University, Auburn, AL 36849
More informationMIPS-Lite Single-Cycle Control
MIPS-Lite Single-Cycle Control COE68: Computer Organization and Architecture Dr. Gul N. Khan http://www.ee.ryerson.ca/~gnkhan Electrical and Computer Engineering Ryerson University Overview Single cycle
More informationCOMPUTER ORGANIZATION AND DESIGN. 5 th Edition. The Hardware/Software Interface. Chapter 4. The Processor
COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface 5 th Edition Chapter 4 The Processor Introduction CPU performance factors Instruction count Determined by ISA and compiler CPI and Cycle
More informationChapter 6 Enhancing Performance with. Pipelining. Pipelining. Pipelined vs. Single-Cycle Instruction Execution: the Plan. Pipelining: Keep in Mind
Pipelining hink of sing machines in landry services Chapter 6 nhancing Performance with Pipelining 6 P 7 8 9 A ime ask A B C ot pipelined Assme 3 min. each task wash, dry, fold, store and that separate
More informationCSE 2021 Computer Organization. Hugh Chesser, CSEB 1012U W10-M
CSE 22 Computer Organization Hugh Chesser, CSEB 2U Agenda Topics:. ultiple cycle implementation - complete Patterson: Appendix C, D 2 Breaking the Execution into Clock Cycles Execution of each instruction
More informationComputer and Information Sciences College / Computer Science Department The Processor: Datapath and Control
Computer and Information Sciences College / Computer Science Department The Processor: Datapath and Control Chapter 5 The Processor: Datapath and Control Big Picture: Where are We Now? Performance of a
More informationLecture 10 Multi-Cycle Implementation
Lecture 10 ulti-cycle Implementation 1 Today s enu ulti-cycle machines Why multi-cycle? Comparative performance Physical and Logical Design of Datapath and Control icroprogramming 2 ulti-cycle Solution
More informationProcessor (I) - datapath & control. Hwansoo Han
Processor (I) - datapath & control Hwansoo Han Introduction CPU performance factors Instruction count - Determined by ISA and compiler CPI and Cycle time - Determined by CPU hardware We will examine two
More informationCSE 2021 Computer Organization. Hugh Chesser, CSEB 1012U W9-W
CSE 22 Computer Organization Hugh Chesser, CSEB 2U Agenda Topics:. Single Cycle Review (Sample Exam/Quiz Q) 2. ultiple cycle implementation Patterson: Section 4.5 Reminder: Quiz #2 Next Wednesday (November
More informationECE232: Hardware Organization and Design
ECE232: Harware Organization an Design ectre 11: Introction to IPs path apte from Compter Organization an Design, Patterson & Hennessy, CB IPS-lite processor Compter Want to bil a processor for a sbset
More informationFull Datapath. CSCI 402: Computer Architectures. The Processor (2) 3/21/19. Fengguang Song Department of Computer & Information Science IUPUI
CSCI 42: Computer Architectures The Processor (2) Fengguang Song Department of Computer & Information Science IUPUI Full Datapath Branch Target Instruction Fetch Immediate 4 Today s Contents We have looked
More informationChapter 4. Instruction Execution. Introduction. CPU Overview. Multiplexers. Chapter 4 The Processor 1. The Processor.
COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface 5 th Edition COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface 5 th Edition Chapter 4 The Processor The Processor - Introduction
More informationCOMPUTER ORGANIZATION AND DESIGN. 5 th Edition. The Hardware/Software Interface. Chapter 4. The Processor
COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface 5 th Edition Chapter 4 The Processor COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface 5 th Edition The Processor - Introduction
More informationLecture 5 and 6. ICS 152 Computer Systems Architecture. Prof. Juan Luis Aragón
ICS 152 Computer Systems Architecture Prof. Juan Luis Aragón Lecture 5 and 6 Multicycle Implementation Introduction to Microprogramming Readings: Sections 5.4 and 5.5 1 Review of Last Lecture We have seen
More informationCPE 335 Computer Organization. Basic MIPS Architecture Part I
CPE 335 Computer Organization Basic MIPS Architecture Part I Dr. Iyad Jafar Adapted from Dr. Gheith Abandah slides http://www.abandah.com/gheith/courses/cpe335_s8/index.html CPE232 Basic MIPS Architecture
More informationLab 8 (All Sections) Prelab: ALU and ALU Control
Lab 8 (All Sections) Prelab: and Control Name: Sign the following statement: On my honor, as an Aggie, I have neither given nor received nathorized aid on this academic work Objective In this lab yo will
More informationChapter 5: The Processor: Datapath and Control
Chapter 5: The Processor: Datapath and Control Overview Logic Design Conventions Building a Datapath and Control Unit Different Implementations of MIPS instruction set A simple implementation of a processor
More informationEECS 322 Computer Architecture Improving Memory Access: the Cache
EECS 322 Computer Architecture Improving emory Access: the Cache Instructor: Francis G. Wolff wolff@eecs.cwru.edu Case Western Reserve University This presentation uses powerpoint animation: please viewshow
More informationCOMPUTER ORGANIZATION AND DESIGN. The Hardware/Software Interface. Chapter 4. The Processor: A Based on P&H
COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface Chapter 4 The Processor: A Based on P&H Introduction We will examine two MIPS implementations A simplified version A more realistic pipelined
More informationECE 313 Computer Organization FINAL EXAM December 14, This exam is open book and open notes. You have 2 hours.
This exam is open book and open notes. You have 2 hours. Problems 1-4 refer to a proposed MIPS instruction lwu (load word - update) which implements update addressing an addressing mode that is used in
More informationare Softw Instruction Set Architecture Microarchitecture are rdw
Program, Application Software Programming Language Compiler/Interpreter Operating System Instruction Set Architecture Hardware Microarchitecture Digital Logic Devices (transistors, etc.) Solid-State Physics
More informationProcessor: Multi- Cycle Datapath & Control
Processor: Multi- Cycle Datapath & Control (Based on text: David A. Patterson & John L. Hennessy, Computer Organization and Design: The Hardware/Software Interface, 3 rd Ed., Morgan Kaufmann, 27) COURSE
More informationENE 334 Microprocessors
ENE 334 Microprocessors Lecture 6: Datapath and Control : Dejwoot KHAWPARISUTH Adapted from Computer Organization and Design, 3 th & 4 th Edition, Patterson & Hennessy, 2005/2008, Elsevier (MK) http://webstaff.kmutt.ac.th/~dejwoot.kha/
More informationCO Computer Architecture and Programming Languages CAPL. Lecture 18 & 19
CO2-3224 Computer Architecture and Programming Languages CAPL Lecture 8 & 9 Dr. Kinga Lipskoch Fall 27 Single Cycle Disadvantages & Advantages Uses the clock cycle inefficiently the clock cycle must be
More informationMulticycle Approach. Designing MIPS Processor
CSE 675.2: Introduction to Computer Architecture Multicycle Approach 8/8/25 Designing MIPS Processor (Multi-Cycle) Presentation H Slides by Gojko Babić and Elsevier Publishing We will be reusing functional
More informationLecture 8: Data Hazard and Resolution. James C. Hoe Department of ECE Carnegie Mellon University
18 447 Lecture 8: Data Hazard and Resolution James C. Hoe Department of ECE Carnegie ellon University 18 447 S18 L08 S1, James C. Hoe, CU/ECE/CALC, 2018 Your goal today Housekeeping detect and resolve
More informationChapter 4. The Processor
Chapter 4 The Processor Introduction CPU performance factors Instruction count Determined by ISA and compiler CPI and Cycle time Determined by CPU hardware 4.1 Introduction We will examine two MIPS implementations
More informationAnimating the Datapath. Animating the Datapath: R-type Instruction. Animating the Datapath: Load Instruction. MIPS Datapath I: Single-Cycle
nimating the atapath PS atapath : Single-Cycle npt is either (-type) or sign-etended lower half of instrction (load/store) op offset/immediate W egister File 6 6 + from instrction path beq,, offset if
More informationCSE 2021 COMPUTER ORGANIZATION
CSE 22 COMPUTER ORGANIZATION HUGH CHESSER CHESSER HUGH CSEB 2U 2U CSEB Agenda Topics:. Sample Exam/Quiz Q - Review 2. Multiple cycle implementation Patterson: Section 4.5 Reminder: Quiz #2 Next Wednesday
More informationSystems Architecture I
Systems Architecture I Topics A Simple Implementation of MIPS * A Multicycle Implementation of MIPS ** *This lecture was derived from material in the text (sec. 5.1-5.3). **This lecture was derived from
More informationECE369. Chapter 5 ECE369
Chapter 5 1 State Elements Unclocked vs. Clocked Clocks used in synchronous logic Clocks are needed in sequential logic to decide when an element that contains state should be updated. State element 1
More informationﻪﺘﻓﺮﺸﻴﭘ ﺮﺗﻮﻴﭙﻣﺎﻛ يرﺎﻤﻌﻣ MIPS يرﺎﻤﻌﻣ data path and ontrol control
معماري كامپيوتر پيشرفته معماري MIPS data path and control abbasi@basu.ac.ir Topics Building a datapath support a subset of the MIPS-I instruction-set A single cycle processor datapath all instruction actions
More informationMajor CPU Design Steps
Datapath Major CPU Design Steps. Analyze instruction set operations using independent RTN ISA => RTN => datapath requirements. This provides the the required datapath components and how they are connected
More informationComputer Organization & Design The Hardware/Software Interface Chapter 5 The processor : Datapath and control
Computer Organization & Design The Hardware/Software Interface Chapter 5 The processor : Datapath and control Qing-song Shi http://.24.26.3 Email: zjsqs@zju.edu.cn Chapter 5 The processor : Datapath and
More informationCC 311- Computer Architecture. The Processor - Control
CC 311- Computer Architecture The Processor - Control Control Unit Functions: Instruction code Control Unit Control Signals Select operations to be performed (ALU, read/write, etc.) Control data flow (multiplexor
More informationMark Redekopp and Gandhi Puvvada, All rights reserved. EE 357 Unit 15. Single-Cycle CPU Datapath and Control
EE 37 Unit Single-Cycle CPU path and Control CPU Organization Scope We will build a CPU to implement our subset of the MIPS ISA Memory Reference Instructions: Load Word (LW) Store Word (SW) Arithmetic
More informationTopic #6. Processor Design
Topic #6 Processor Design Major Goals! To present the single-cycle implementation and to develop the student's understanding of combinational and clocked sequential circuits and the relationship between
More informationCSE 141 Computer Architecture Summer Session I, Lectures 10 Advanced Topics, Memory Hierarchy and Cache. Pramod V. Argade
CSE 141 Compter Architectre Smmer Session I, 2004 Lectres 10 Advanced Topics, emory Hierarchy and Cache Pramod V. Argade CSE141: Introdction to Compter Architectre Instrctor: TA: Pramod V. Argade (p2argade@cs.csd.ed)
More informationThe Processor: Datapath & Control
Chapter Five 1 The Processor: Datapath & Control We're ready to look at an implementation of the MIPS Simplified to contain only: memory-reference instructions: lw, sw arithmetic-logical instructions:
More informationPIPELINING. Pipelining: Natural Phenomenon. Pipelining. Pipelining Lessons
Pipelining: Natral Phenomenon Landry Eample: nn, rian, Cathy, Dave each have one load of clothes to wash, dry, and fold Washer takes 30 mintes C D Dryer takes 0 mintes PIPELINING Folder takes 20 mintes
More informationLECTURE 6. Multi-Cycle Datapath and Control
LECTURE 6 Multi-Cycle Datapath and Control SINGLE-CYCLE IMPLEMENTATION As we ve seen, single-cycle implementation, although easy to implement, could potentially be very inefficient. In single-cycle, we
More informationThe overall datapath for RT, lw,sw beq instrucution
Designing The Main Control Unit: Remember the three instruction classes {R-type, Memory, Branch}: a) R-type : Op rs rt rd shamt funct 1.src 2.src dest. 31-26 25-21 20-16 15-11 10-6 5-0 a) Memory : Op rs
More informationChapter 4. The Processor. Computer Architecture and IC Design Lab
Chapter 4 The Processor Introduction CPU performance factors CPI Clock Cycle Time Instruction count Determined by ISA and compiler CPI and Cycle time Determined by CPU hardware We will examine two MIPS
More informationDesign of Digital Circuits Lecture 13: Multi-Cycle Microarch. Prof. Onur Mutlu ETH Zurich Spring April 2017
Design of Digital Circuits Lecture 3: Multi-Cycle Microarch. Prof. Onur Mutlu ETH Zurich Spring 27 6 April 27 Agenda for Today & Next Few Lectures! Single-cycle Microarchitectures! Multi-cycle and Microprogrammed
More information