Chapter 6: Pipelining
|
|
- Shannon Lindsey
- 5 years ago
- Views:
Transcription
1 CSE 322 COPUTER ARCHITECTURE II Chapter 6: Pipelining Chapter 6: Pipelining Febrary 10, Clothes Washing CSE 322 COPUTER ARCHITECTURE II The Assembly Line Accmlate dirty clothes in hamper Place in washer Unload washer into hamper and move to dryer Unload dryer into hamper and move to folding table & fold Place folded clothes in hamper & pt away Compare yo doing yor wash to a landry - with 1 person per fnction YOU: TIE WASH DRY FOLD PUT_AWAY If 1 hor per step, this takes hors Landry: WASH1 DRY1 FOLD1 PACK1 WASH2 DRY2 FOLD2 PACK2 WASH3 DRY3 FOLD3 PACK3 WASH DRY FOLD PACK Still takes hors per patron, BUT!!!! Landry gets 1 patron/hor thrpt Chapter 6: Pipelining Febrary 10,
2 CSE 322 COPUTER ARCHITECTURE II Atomobile Assembly Plant Chemical factory Garden Hose Cooking Other Eamples General Characteristics Complete process broken down into S independent pieces Each piece done independently at a stage Stages arranged in linear order to match process As each stage finishes its piece, it passes piece to net stage Time for one complete processing seqence: sm of all stages BUT: rate at which we can initiate new work = ma of any stage time If times for all S stages are eqal to T: Time for one initiation to complete still ST; Time between 2 initiaties = T NOT ST! Initiations per second = 1/T Pipelining: Overlap mltiple eections of same seqence - Improves throghpt, not the time to perform a single operation Chapter 6: Pipelining Febrary 10, CSE 322 COPUTER ARCHITECTURE II Book s Approach to Draw Pipeline Timing Diagrams Time rns left to right, in nits of stage time Each row below corresponds to distinct initiation Bondary between 2 colmn entries: pipeline ister (eg hamper) st look at colmn contents to see what each stage is doing what Wash1 Dry1 Fold1 Pack1 Wash2 Dry2 Fold2 Pack2 Wash3 Dry3 Fold3 Pack3 Wash Dry Fold Pack Wash5 Dry5 Fold5 Pack5 Wash6 Dry6 Fold6 Pack6 Wash7 Dry7 Fold7 Pack7 Wash8 Dry8 Fold8 Pack8 Wash9 Dry9 Fold9 Pack9 TIE for N initiations = NT + (S-1)T Thrpt: Time per initiation = T + (S-1)T/N ==> T! Chapter 6: Pipelining Febrary 10, 2000
3 CSE 322 COPUTER ARCHITECTURE II An Alternative: Reservation Tables As before, time rns left to right, in nits of stage time Each row below corresponds to a distinct stage of processing Individal initiations are diagonal Stage Wash Dry Fold Pack Easy to see what stage is doing what Harder to see each initiation We will se both Chapter 6: Pipelining Febrary 10, CSE 322 COPUTER ARCHITECTURE II Complications Dividing processing so each stage takes same time Strctral Hazard: cannot spport the combination of operations to do concrrently - Eg. combined washer/dryer, not separate - Eg. only one memory, not two - Soltion: architect so problem does not eist or add hardware Hazard: need opt of one stage for an earlier stage of a scceeding initiation - Eg. deciding on wash time of football niforms - Eg. sing otpt of an add as inpt to a scceeding add - Soltion: forward on a special path back to where its needed Control Hazard: cannot even decide on what to initiation to start net withot completing a crrent one - Eg. branching on vale in a compted by previos instrction - Soltion: predict (i.e. gess) which way to go Stalls: know what to do, bt some stage takes longer than one cycle - Eg. memory takes longer than one cycle - Introdces bbble of nsed time slots into pipeline - Soltion: fill with ot-of-order initiations (aka sperscalar) Chapter 6: Pipelining Febrary 10,
4 Unpipelined: CSE 322 COPUTER ARCHITECTURE II Ideal Digital System Pipeline Speedp Latch Pipelined: Latch combinational logic delay = τ combinational logic delay = τ combinational logic delay = τ combinational logic delay = τ delay for 1 piece of = τ + latch_setp (assme small) appro delay for 1000 pieces of = 000τ combinational logic delay = τ combinational logic delay = τ combinational logic delay = τ combinational logic delay = τ Latch Latch Latch delay for 1 piece of = (τ + latch_setp) appro delay for 1000 pieces of = τ + 999τ = 1003τ 000 speedp for 1000 pieces of = Latch Latch Ideal speedp = nmber of pipeline stages Chapter 6: Pipelining Febrary 10, CSE 322 COPUTER ARCHITECTURE II Processing of IPS Instrctions Increment program conter; fetch net instrction one or two isters according to instrction fields emory- Reference Arithmetic- Logic Branching Calclate effective ess Eecte op-code Compare (check flag) / to memory to ister Change instrction ess Chapter 6: Pipelining Febrary 10,
5 CSE 322 COPUTER ARCHITECTURE II The Instrction Eection Seqence Instrction eection in a processor (and possible times) IF - instrction fetch (2ns) ID - Decode & fetch ister operands (1 ns) EX - Perform operation (2ns) E - Perform memory operation (2ns) WB - reslt (if any) back into ister file (1ns) Hm? - 5 stages => 5X performance increase over single cycle design? Electrical Design Challenge: can we make hardware to do each stage in same time? Chapter 6: Pipelining Febrary 10, CSE 322 COPUTER ARCHITECTURE II Timimg The Instrction Seqence One Initiation: IF ID EX E WB Total time = 8 ns Try to overlap IF ID EX E WB Doesn t line p! IF ID EX E WB Possible soltion: insert 1 ns after ID to allow alignment IF ID EX E WB Strctral Hazard IF ID EX E WB IF ID EX E WB IF ID EX E WB Chapter 6: Pipelining Febrary 10,
6 CSE 322 COPUTER ARCHITECTURE II A Better Soltion Delay ID by 1 ns also 9 ns IF IF ID EX IF ID E WB EX E WB ID EX EWB IF ID EX E WB 15 ns No Strctral Hazard One initiation = 9 ns oe 10 ns (depennding on how yo look at it) initiations = 15 ns => Ave of 1 initiation every 3.75 ns! How long for 1000 initiations? What is eqivalent time between initiations: What is effective speedp? Chapter 6: Pipelining Febrary 10, CSE 322 COPUTER ARCHITECTURE II Why is IPS ISA Good Pipeline Target? All instrctions are the same length - decoder doesn t need to figre ot how long ister specifiers in instrction always in same place - go get isters before knowing instrction type Only LOAD/STORE memory access - need only 1 emory operands all assmed aligned in memory Discssion: what if we did it differently? Chapter 6: Pipelining Febrary 10,
7 CSE 322 COPUTER ARCHITECTURE II Review: The Single-Cycle path Instrction fetch Instrction decode / ister fetch Eecte / ess calclation emory access back ess Instrction emory etend emory EX/E E/WB Flow is left to right, with 2 eceptions -back (into ister file) Selection of net vale (increment or branch ess) Chapter 6: Pipelining Febrary 10, CSE 322 COPUTER ARCHITECTURE II Program eection order (instrctions) ltiple Clock Cycle Pipeline Diagram Time (clock cycles) CC 1 CC 2 CC 3 CC CC5 CC6 CC 7 lw $1, 100($0) I D lw $2, 200($0) I D lw $3, 300($0) I D Sppose each instrction cold have its own path Shows hardware sed by each instrction dring each phase of eection I D Instrction emory ister emory ister Chapter 6: Pipelining Febrary 10,
8 CSE 322 COPUTER ARCHITECTURE II Inserting Pipeline into path Nmber of pipeline bits varies between stages EX/E E/WB ess Instrction emory etend emory Latch Latch Latch Latch Chapter 6: Pipelining Febrary 10, CSE 322 COPUTER ARCHITECTURE II Eecting Instrctions in Pipelined path Following charts describe 3 scenerios: 1. Processing of load word (lw) instrction - bg inclded in design (ake SURE yo nderstand bg!) 2. Processing of lw - bg corrected (ake SURE yo nderstand fi!) 3. Processing of lw followed in pipeline by sb (Set the stage for discssion on HAZARDS and inter-instrction dependencies!) Chapter 6: Pipelining Febrary 10,
9 CSE 322 COPUTER ARCHITECTURE II lw Instrction fetch Load Word: Cycle 1 Note: prple in a latch indicates from that instrction stored there EX/E E/WB ess Instrction emory etend emory Chapter 6: Pipelining Febrary 10, CSE 322 COPUTER ARCHITECTURE II Load Word: Cycle 2 lw Instrction decode EX/E E/WB ess Instrction emory etend emory Chapter 6: Pipelining Febrary 10,
10 CSE 322 COPUTER ARCHITECTURE II Load Word: Cycle 3 lw Eection EX/E E/WB ess Instrction emory etend emory Chapter 6: Pipelining Febrary 10, CSE 322 COPUTER ARCHITECTURE II Load Word: Cycle lw emory EX/E E/WB ess Instrction emory etend emory Chapter 6: Pipelining Febrary 10,
11 CSE 322 COPUTER ARCHITECTURE II Load Word: Cycle 5 Where s the bg? lw back EX/E E/WB ess Instrction emory etend emory Chapter 6: Pipelining Febrary 10, CSE 322 COPUTER ARCHITECTURE II Fiing Load Bg Bg: sorce for is invalid Soltion: Need to preserve ister nmber for write-back additional pipeline bits for write ister ess EX/E E/WB ess Instrction emory etend emory Chapter 6: Pipelining Febrary 10,
12 CSE 322 COPUTER ARCHITECTURE II Processing a Two-Instrction Seqence Eamine mltiple-cycle and single-cycle diagrams for a seqence of 2 independent instrctions (no common isters between instrctions): lw $10, 9($1) sb $11, $2, $3 ltiple-cycle diagram Program eection order (instrctions) Time (clock cycles) CC 1 CC 2 CC 3 CC CC5 CC6 lw $10, 9($1) I D sb $11, $2, $3 I D newest instrction at bottom Chapter 6: Pipelining Febrary 10, CSE 322 COPUTER ARCHITECTURE II lw $10,9($1) Instrction fetch Single-Cycle Diagrams: Cycle 1 EX/E E/WB ess Instrction emory etend emory Chapter 6: Pipelining Febrary 10,
13 CSE 322 COPUTER ARCHITECTURE II sb $11,$2,$3 Instrction fetch Single-Cycle Diagrams: Cycle 2 lw $10,9($1) Instrction decode EX/E E/WB ess Instrction emory etend emory Chapter 6: Pipelining Febrary 10, CSE 322 COPUTER ARCHITECTURE II Single-Cycle Diagrams: Cycle 3 sb $11,$2,$3 Instrction decode lw $10,9($1) Eection EX/E E/WB ess Instrction emory etend emory don t need sign etend, bt don t know this yet Chapter 6: Pipelining Febrary 10,
14 CSE 322 COPUTER ARCHITECTURE II Single-Cycle Diagrams: Cycle sb $11,$2,$3 Eection lw $10,9($1) emory EX/E E/WB ess Instrction emory etend emory Chapter 6: Pipelining Febrary 10, CSE 322 COPUTER ARCHITECTURE II Single-Cycle Diagrams: Cycle 5 sb $11,... emory lw.. back EX/E E/WB ess Instrction emory etend emory Chapter 6: Pipelining Febrary 10,
15 CSE 322 COPUTER ARCHITECTURE II Single-Cycle Diagrams: Cycle 6 sb. back EX/E E/WB ess Instrction emory etend emory Chapter 6: Pipelining Febrary 10, CSE 322 COPUTER ARCHITECTURE II Pipelined Control Potentially very complicated, need to approach this methodically Eample: independent instrctions (no ister operands in common) lw $10, 9($1) sb $11, $2, $3 and $12, $, $5 or $13, $6, $7 add $1, $8, $9 Eample: dependent instrctions ($2 sed in seqential instrctions) sb $2, $1, $3 # ister $2 written by sb and $12, $2, $5 # 1st operand ($2) depends on sb or $13, $6, $2 # 2nd operand ($2) depends on sb add $1, $2, $2 # 1st and 2nd ($2) depends on sb sw $15, 100($2) # inde ($2) depends on sb Problem: write-back for sb won t occr ntil 5th cycle First assme seqence of independent instrctions Later remove this assmption Chapter 6: Pipelining Febrary 10,
16 CSE 322 COPUTER ARCHITECTURE II Control al Smmary Src Src Branch em emto EX/E E/WB ess Instrction emory 1 2 Inst[15-0] 1 2 etend control emory Inst[20-16] Inst[15-11] Dst Op em Chapter 6: Pipelining Febrary 10, CSE 322 COPUTER ARCHITECTURE II Qestions on Control als Following discssion relevant to a single instrction Qestion: Are all control signals active at same time? - Answer: No, different signals needed at different times (cycles) Qestion: Can we generate all these signals at same time? - Answer: Yes, bt... we cannot apply then at same time Chapter 6: Pipelining Febrary 10,
17 CSE 322 COPUTER ARCHITECTURE II Classifying Control Lines by Pipeline Stage Each flow component is active in only one pipeline stage. So, divide control signals into grops according to active component (see page for smmary charts) 1. Instrction Fetch always read instrction memory & write (nothing special to control) 2. Instrction Decode / ister Fetch still nothing special to control, same action every time 3. Eection: following control signals mst be decoded from instrction Dst: does target come from bits or Op: how to control operation Src: does 2nd inpt come from file or sign-etension literal. emory: likewise Branch: sed to generate Src Src: does get incremented or replaced by otpt of branch adder em: signal reads from memory em: signal writes to memory 5. Back: likewise emto: does vale going back to file come from or mem : is there in fact a ister write back to perform Chapter 6: Pipelining Febrary 10, CSE 322 COPUTER ARCHITECTURE II Passing Control with Pipeline Analogy: sending instrctions with a car on an assembly line Install Corinthian leather interior on car 6 when its at stage 3 strip off signals for eection phase WB strip off signals for memory phase Instrction Control Generation EX WB strip off signals for write-back phase WB Dst Op Src Branch em em emto EX/E E/WB Chapter 6: Pipelining Febrary 10,
18 CSE 322 COPUTER ARCHITECTURE II Pipelined path with Control als ed Src WB EX/E Control EX Op Src WB Branch em em E/WB WB emto ess Instrction emory 1 2 Inst[15-0] 1 2 etend Dst control emory Inst[20-16] Inst[15-11] Chapter 6: Pipelining Febrary 10,
Pipelining. Chapter 4
Pipelining Chapter 4 ake processor rns faster Pipelining is an implementation techniqe in which mltiple instrctions are overlapped in eection Key of making processor fast Pipelining Single cycle path we
More informationReview: Computer Organization
Review: Compter Organization Pipelining Chans Y Landry Eample Landry Eample Ann, Brian, Cathy, Dave each have one load of clothes to wash, dry, and fold Washer takes 3 mintes A B C D Dryer takes 3 mintes
More informationEnhanced Performance with Pipelining
Chapter 6 Enhanced Performance with Pipelining Note: The slides being presented represent a mi. Some are created by ark Franklin, Washington University in St. Lois, Dept. of CSE. any are taken from the
More informationChapter 6 Enhancing Performance with. Pipelining. Pipelining. Pipelined vs. Single-Cycle Instruction Execution: the Plan. Pipelining: Keep in Mind
Pipelining hink of sing machines in landry services Chapter 6 nhancing Performance with Pipelining 6 P 7 8 9 A ime ask A B C ot pipelined Assme 3 min. each task wash, dry, fold, store and that separate
More informationWhat do we have so far? Multi-Cycle Datapath
What do we have so far? lti-cycle Datapath CPI: R-Type = 4, Load = 5, Store 4, Branch = 3 Only one instrction being processed in datapath How to lower CPI frther? #1 Lec # 8 Spring2 4-11-2 Pipelining pipelining
More informationLecture 10: Pipelined Implementations
U 8-7 S 9 L- 8-7 Lectre : Pipelined Implementations James. Hoe ept of EE, U Febrary 23, 29 nnoncements: Project is de this week idterm graded, d reslts posted Handots: H9 Homework 3 (on lackboard) Graded
More informationOverview of Pipelining
EEC 58 Compter Architectre Pipelining Department of Electrical Engineering and Compter Science Cleveland State University Fndamental Principles Overview of Pipelining Pipelined Design otivation: Increase
More informationPIPELINING. Pipelining: Natural Phenomenon. Pipelining. Pipelining Lessons
Pipelining: Natral Phenomenon Landry Eample: nn, rian, Cathy, Dave each have one load of clothes to wash, dry, and fold Washer takes 30 mintes C D Dryer takes 0 mintes PIPELINING Folder takes 20 mintes
More informationPS Midterm 2. Pipelining
PS idterm 2 Pipelining Seqential Landry 6 P 7 8 9 idnight Time T a s k O r d e r A B C D 3 4 2 3 4 2 3 4 2 3 4 2 Seqential landry takes 6 hors for 4 loads If they learned pipelining, how long wold landry
More informationTDT4255 Friday the 21st of October. Real world examples of pipelining? How does pipelining influence instruction
Review Friday the 2st of October Real world eamples of pipelining? How does pipelining pp inflence instrction latency? How does pipelining inflence instrction throghpt? What are the three types of hazard
More informationChapter 6: Pipelining
Chapter 6: Pipelining Otline An overview of pipelining A pipelined path Pipelined control Data hazards and forwarding Data hazards and stalls Branch hazards Eceptions Sperscalar and dynamic pipelining
More informationEEC 483 Computer Organization
EEC 83 Compter Organization Chapter.6 A Pipelined path Chans Y Pipelined Approach 2 - Cycle time, No. stages - Resorce conflict E E A B C D 3 E E 5 E 2 3 5 2 6 7 8 9 c.y9@csohio.ed Resorces sed in 5 Stages
More informationThe extra single-cycle adders
lticycle Datapath As an added bons, we can eliminate some of the etra hardware from the single-cycle path. We will restrict orselves to sing each fnctional nit once per cycle, jst like before. Bt since
More informationCS 251, Winter 2019, Assignment % of course mark
CS 25, Winter 29, Assignment.. 3% of corse mark De Wednesday, arch 3th, 5:3P Lates accepted ntil Thrsday arch th, pm with a 5% penalty. (7 points) In the diagram below, the mlticycle compter from the corse
More information1048: Computer Organization
8: Compter Organization Lectre 6 Pipelining Lectre6 - pipelining (cwli@twins.ee.nct.ed.tw) 6- Otline An overview of pipelining A pipelined path Pipelined control Data hazards and forwarding Data hazards
More informationEEC 483 Computer Organization
EEC 483 Compter Organization Chapter 4.4 A Simple Implementation Scheme Chans Y The Big Pictre The Five Classic Components of a Compter Processor Control emory Inpt path Otpt path & Control 2 path and
More informationReview Multicycle: What is Happening. Controlling The Multicycle Design
Review lticycle: What is Happening Reslt Zero Op SrcA SrcB Registers Reg Address emory em Data Sign etend Shift left Sorce A B Ot [-6] [5-] [-6] [5-] [5-] Instrction emory IR RegDst emtoreg IorD em em
More informationThe single-cycle design from last time
lticycle path Last time we saw a single-cycle path and control nit for or simple IPS-based instrction set. A mlticycle processor fies some shortcomings in the single-cycle CPU. Faster instrctions are not
More informationQuiz #1 EEC 483, Spring 2019
Qiz # EEC 483, Spring 29 Date: Jan 22 Name: Eercise #: Translate the following instrction in C into IPS code. Eercise #2: Translate the following instrction in C into IPS code. Hint: operand C is stored
More informationReview. A single-cycle MIPS processor
Review If three instrctions have opcodes, 7 and 5 are they all of the same type? If we were to add an instrction to IPS of the form OD $t, $t2, $t3, which performs $t = $t2 OD $t3, what wold be its opcode?
More informationComputer Architecture Chapter 5. Fall 2005 Department of Computer Science Kent State University
Compter Architectre Chapter 5 Fall 25 Department of Compter Science Kent State University The Processor: Datapath & Control Or implementation of the MIPS is simplified memory-reference instrctions: lw,
More informationEXAMINATIONS 2010 END OF YEAR NWEN 242 COMPUTER ORGANIZATION
EXAINATIONS 2010 END OF YEAR COPUTER ORGANIZATION Time Allowed: 3 Hors (180 mintes) Instrctions: Answer all qestions. ake sre yor answers are clear and to the point. Calclators and paper foreign langage
More informationInstruction fetch. MemRead. IRWrite ALUSrcB = 01. ALUOp = 00. PCWrite. PCSource = 00. ALUSrcB = 00. R-type completion
. (Chapter 5) Fill in the vales for SrcA, SrcB, IorD, Dst and emto to complete the Finite State achine for the mlti-cycle datapath shown below. emory address comptation 2 SrcA = SrcB = Op = fetch em SrcA
More informationPART I: Adding Instructions to the Datapath. (2 nd Edition):
EE57 Instrctor: G. Pvvada ===================================================================== Homework #5b De: check on the blackboard =====================================================================
More informationCS 251, Winter 2018, Assignment % of course mark
CS 25, Winter 28, Assignment 4.. 3% of corse mark De Wednesday, arch 7th, 4:3P Lates accepted ntil Thrsday arch 8th, am with a 5% penalty. (6 points) In the diagram below, the mlticycle compter from the
More information1048: Computer Organization
48: Compter Organization Lectre 5 Datapath and Control Lectre5A - simple implementation (cwli@twins.ee.nct.ed.tw) 5A- Introdction In this lectre, we will try to implement simplified IPS which contain emory
More informationThe multicycle datapath. Lecture 10 (Wed 10/15/2008) Finite-state machine for the control unit. Implementing the FSM
Lectre (Wed /5/28) Lab # Hardware De Fri Oct 7 HW #2 IPS programming, de Wed Oct 22 idterm Fri Oct 2 IorD The mlticycle path SrcA Today s objectives: icroprogramming Etending the mlti-cycle path lti-cycle
More informationComputer Architecture. Lecture 6: Pipelining
Compter Architectre Lectre 6: Pipelining Dr. Ahmed Sallam Based on original slides by Prof. Onr tl Agenda for Today & Net Few Lectres Single-cycle icroarchitectres lti-cycle and icroprogrammed icroarchitectres
More informationChapter 3 & Appendix C Pipelining Part A: Basic and Intermediate Concepts
CS359: Compter Architectre Chapter 3 & Appendi C Pipelining Part A: Basic and Intermediate Concepts Yanyan Shen Department of Compter Science and Engineering Shanghai Jiao Tong University 1 Otline Introdction
More informationEEC 483 Computer Organization. Branch (Control) Hazards
EEC 483 Compter Organization Section 4.8 Branch Hazards Section 4.9 Exceptions Chans Y Branch (Control) Hazards While execting a previos branch, next instrction address might not yet be known. s n i o
More informationThe final datapath. M u x. Add. 4 Add. Shift left 2. PCSrc. RegWrite. MemToR. MemWrite. Read data 1 I [25-21] Instruction. Read. register 1 Read.
The final path PC 4 Add Reg Shift left 2 Add PCSrc Instrction [3-] Instrction I [25-2] I [2-6] I [5 - ] register register 2 register 2 Registers ALU Zero Reslt ALUOp em Data emtor RegDst ALUSrc em I [5
More informationComp 303 Computer Architecture A Pipelined Datapath Control. Lecture 13
Comp 33 Compter Architectre A Pipelined path Lectre 3 Pipelined path with Signals PCSrc IF/ ID ID/ EX EX / E E / Add PC 4 Address Instrction emory RegWr ra rb rw Registers bsw [5-] [2-6] [5-] bsa bsb Sign
More informationLecture 19 Introduction to Pipelining
CSE 30321 Lecture 19 Pipelining (Part 1) 1 Lecture 19 Introduction to Pipelining CSE 30321 Lecture 19 Pipelining (Part 1) Basic pipelining basic := single, in-order issue single issue one instruction at
More informationCS 251, Winter 2018, Assignment % of course mark
CS 25, Winter 28, Assignment 3.. 3% of corse mark De onday, Febrary 26th, 4:3 P Lates accepted ntil : A, Febrary 27th with a 5% penalty. IEEE 754 Floating Point ( points): (a) (4 points) Complete the following
More informationEXAMINATIONS 2003 END-YEAR COMP 203. Computer Organisation
EXAINATIONS 2003 COP203 END-YEAR Compter Organisation Time Allowed: 3 Hors (180 mintes) Instrctions: Answer all qestions. There are 180 possible marks on the eam. Calclators and foreign langage dictionaries
More informationSolutions for Chapter 6 Exercises
Soltions for Chapter 6 Eercises Soltions for Chapter 6 Eercises 6. 6.2 a. Shortening the ALU operation will not affect the speedp obtained from pipelining. It wold not affect the clock cycle. b. If the
More informationCS 251, Spring 2018, Assignment 3.0 3% of course mark
CS 25, Spring 28, Assignment 3. 3% of corse mark De onday, Jne 25th, 5:3 P. (5 points) Consider the single-cycle compter shown on page 6 of this assignment. Sppose the circit elements take the following
More informationExceptions and interrupts
Eceptions and interrpts An eception or interrpt is an nepected event that reqires the CPU to pase or stop the crrent program. Eception handling is the hardware analog of error handling in software. Classes
More informationLecture 7. Building A Simple Processor
Lectre 7 Bilding A Simple Processor Christos Kozyrakis Stanford University http://eeclass.stanford.ed/ee8b C. Kozyrakis EE8b Lectre 7 Annoncements Upcoming deadlines Lab is de today Demo by 5pm, report
More informationProf. Kozyrakis. 1. (10 points) Consider the following fragment of Java code:
EE8 Winter 25 Homework #2 Soltions De Thrsday, Feb 2, 5 P. ( points) Consider the following fragment of Java code: for (i=; i
More information4.13 Advanced Topic: An Introduction to Digital Design Using a Hardware Design Language 345.e1
.3 Advanced Topic: An Introdction to Digital Design Using a Hardware Design Langage 35.e.3 Advanced Topic: An Introdction to Digital Design Using a Hardware Design Langage to Describe and odel a Pipeline
More informationCSE Introduction to Computer Architecture Chapter 5 The Processor: Datapath & Control
CSE-45432 Introdction to Compter Architectre Chapter 5 The Processor: Datapath & Control Dr. Izadi Data Processor Register # PC Address Registers ALU memory Register # Register # Address Data memory Data
More informationHardware Design Tips. Outline
Hardware Design Tips EE 36 University of Hawaii EE 36 Fall 23 University of Hawaii Otline Verilog: some sbleties Simlators Test Benching Implementing the IPS Actally a simplified 6 bit version EE 36 Fall
More informationDesigning a Pipelined CPU
Designing a Pipelined CPU CSE 4, S2'6 Review -- Single Cycle CPU CSE 4, S2'6 Review -- ultiple Cycle CPU CSE 4, S2'6 Review -- Instruction Latencies Single-Cycle CPU Load Ifetch /Dec Exec em Wr ultiple
More information1048: Computer Organization
48: Compter Organization Lectre 5 Datapath and Control Lectre5B - mlticycle implementation (cwli@twins.ee.nct.ed.tw) 5B- Recap: A Single-Cycle Processor PCSrc 4 Add Shift left 2 Add ALU reslt PC address
More informationChapter 3 & Appendix C Pipelining Part A: Basic and Intermediate Concepts
CS359: Computer Architecture Chapter 3 & Appendix C Pipelining Part A: Basic and Intermediate Concepts Yanyan Shen Department of Computer Science and Engineering Shanghai Jiao Tong University Parallel
More informationAnimating the Datapath. Animating the Datapath: R-type Instruction. Animating the Datapath: Load Instruction. MIPS Datapath I: Single-Cycle
nimating the atapath PS atapath : Single-Cycle npt is either (-type) or sign-etended lower half of instrction (load/store) op offset/immediate W egister File 6 6 + from instrction path beq,, offset if
More informationCSE 141 Computer Architecture Summer Session I, Lectures 10 Advanced Topics, Memory Hierarchy and Cache. Pramod V. Argade
CSE 141 Compter Architectre Smmer Session I, 2004 Lectres 10 Advanced Topics, emory Hierarchy and Cache Pramod V. Argade CSE141: Introdction to Compter Architectre Instrctor: TA: Pramod V. Argade (p2argade@cs.csd.ed)
More informationLecture 6: Microprogrammed Multi Cycle Implementation. James C. Hoe Department of ECE Carnegie Mellon University
8 447 Lectre 6: icroprogrammed lti Cycle Implementation James C. Hoe Department of ECE Carnegie ellon University 8 447 S8 L06 S, James C. Hoe, CU/ECE/CALC, 208 Yor goal today Hosekeeping nderstand why
More informationInstruction Pipelining is the use of pipelining to allow more than one instruction to be in some stage of execution at the same time.
Pipelining Pipelining is the se of pipelining to allow more than one instrction to be in some stage of eection at the same time. Ferranti ATLAS (963): Pipelining redced the average time per instrction
More informationComputer Architecture
Compter Architectre Lectre 4: Intro to icroarchitectre: Single- Cycle Dr. Ahmed Sallam Sez Canal University Based on original slides by Prof. Onr tl Review Compter Architectre Today and Basics (Lectres
More information3/12/2014. Single Cycle (Review) CSE 2021: Computer Organization. Single Cycle with Jump. Multi-Cycle Implementation. Why Multi-Cycle?
CSE 2021: Computer Organization Single Cycle (Review) Lecture-10b CPU Design : Pipelining-1 Overview, Datapath and control Shakil M. Khan 2 Single Cycle with Jump Multi-Cycle Implementation Instruction:
More informationComputer Architecture
Compter Architectre Lectre 4: Intro to icroarchitectre: Single- Cycle Dr. Ahmed Sallam Sez Canal University Spring 25 Based on original slides by Prof. Onr tl Review Compter Architectre Today and Basics
More informationPOWER-OF-2 BOUNDARIES
Warren.3.fm Page 5 Monday, Jne 17, 5:6 PM CHAPTER 3 POWER-OF- BOUNDARIES 3 1 Ronding Up/Down to a Mltiple of a Known Power of Ronding an nsigned integer down to, for eample, the net smaller mltiple of
More informationChapter Six. Dataı access. Reg. Instructionı. fetch. Dataı. Reg. access. Dataı. Reg. access. Dataı. Instructionı fetch. 2 ns 2 ns 2 ns 2 ns 2 ns
Chapter Si Pipelining Improve perfomance by increasing instruction throughput eecutionı Time lw $, ($) 2 6 8 2 6 8 access lw $2, 2($) 8 ns access lw $3, 3($) eecutionı Time lw $, ($) lw $2, 2($) 2 ns 8
More informationLecture 9: Microcontrolled Multi-Cycle Implementations
8-447 Lectre 9: icroled lti-cycle Implementations James C. Hoe Dept of ECE, CU Febrary 8, 29 S 9 L9- Annoncements: P&H Appendi D Get started t on Lab Handots: Handot #8: Project (on Blackboard) Single-Cycle
More informationWinter 2013 MIDTERM TEST #2 Wednesday, March 20 7:00pm to 8:15pm. Please do not write your U of C ID number on this cover page.
page of 7 University of Calgary Departent of Electrical and Copter Engineering ENCM 369: Copter Organization Lectre Instrctors: Steve Noran and Nor Bartley Winter 23 MIDTERM TEST #2 Wednesday, March 2
More informationLecture 13: Exceptions and Interrupts
18 447 Lectre 13: Eceptions and Interrpts S 10 L13 1 James C. Hoe Dept of ECE, CU arch 1, 2010 Annoncements: Handots: Spring break is almost here Check grades on Blackboard idterm 1 graded Handot #9: Lab
More informationECE232: Hardware Organization and Design
ECE232: Harware Organization an Design ectre 11: Introction to IPs path apte from Compter Organization an Design, Patterson & Hennessy, CB IPS-lite processor Compter Want to bil a processor for a sbset
More informationLecture 6: Pipelining
Lecture 6: Pipelining i CSCE 26 Computer Organization Instructor: Saraju P. ohanty, Ph. D. NOTE: The figures, text etc included in slides are borrowed from various books, websites, authors pages, and other
More informationReview. How to represent real numbers
PCWrite PC IorD Review ALUSrcA emread Address Write data emory emwrite em Data IRWrite [3-26] [25-2] [2-6] [5-] [5-] RegDst Read register Read register 2 Write register Write data RegWrite Read data Read
More informationThe Pipelined MIPS Processor
1 The niversity of Texas at Dallas Lecture #20: The Pipeline IPS Processor The Pipelined IPS Processor We complete our study of AL architecture by investigating an approach providing even higher performance
More informationProcessor Design CSCE Instructor: Saraju P. Mohanty, Ph. D. NOTE: The figures, text etc included in slides are borrowed
Lecture 3: General Purpose Processor Design CSCE 665 Advanced VLSI Systems Instructor: Saraju P. ohanty, Ph. D. NOTE: The figures, tet etc included in slides are borrowed from various books, websites,
More informationChapter 4 (Part II) Sequential Laundry
Chapter 4 (Part II) The Processor Baback Izadi Division of Engineering Programs bai@engr.newpaltz.edu Sequential Laundry 6 P 7 8 9 10 11 12 1 2 A T a s k O r d e r A B C D 30 30 30 30 30 30 30 30 30 30
More informationSI232 Set #20: Laundry, Co-dependency, and other Hazards of Modern (Architecture) Life. Chapter 6 ADMIN. Reading for Chapter 6: 6.1,
SI232 Set #20: Laundry, Co-dependency, and other Hazards of Modern (Architecture) Life Chapter 6 ADMIN ing for Chapter 6: 6., 6.9-6.2 2 Midnight Laundry Task order A 6 PM 7 8 9 0 2 2 AM B C D 3 Smarty
More informationUnpipelined Machine. Pipelining the Idea. Pipelining Overview. Pipelined Machine. MIPS Unpipelined. Similar to assembly line in a factory
Pipelining the Idea Similar to assembly line in a factory Divide instruction into smaller tasks Each task is performed on subset of resources Overlap the execution of multiple instructions by completing
More informationLab 8 (All Sections) Prelab: ALU and ALU Control
Lab 8 (All Sections) Prelab: and Control Name: Sign the following statement: On my honor, as an Aggie, I have neither given nor received nathorized aid on this academic work Objective In this lab yo will
More informationEE 457 Unit 6a. Basic Pipelining Techniques
EE 47 Unit 6a Basic Pipelining Techniques 2 Pipelining Introduction Consider a drink bottling plant Filling the bottle = 3 sec. Placing the cap = 3 sec. Labeling = 3 sec. Would you want Machine = Does
More informationFunctions of Combinational Logic
CHPTER 6 Fnctions of Combinational Logic CHPTER OUTLINE 6 6 6 6 6 5 6 6 6 7 6 8 6 9 6 6 Half and Fll dders Parallel inary dders Ripple Carry and Look-head Carry dders Comparators Decoders Encoders Code
More informationPipelining: Overview. CPSC 252 Computer Organization Ellen Walker, Hiram College
Pipelining: Overview CPSC 252 Computer Organization Ellen Walker, Hiram College Pipelining the Wash Divide into 4 steps: Wash, Dry, Fold, Put Away Perform the steps in parallel Wash 1 Wash 2, Dry 1 Wash
More information4.13. An Introduction to Digital Design Using a Hardware Design Language to Describe and Model a Pipeline and More Pipelining Illustrations
.3 An Introdction to Digital Design Using a Hardware Design Langage to Describe and odel a Pipeline and ore Pipelining Illstrations This online section covers hardware description langages and then gives
More informationPipelining. CSC Friday, November 6, 2015
Pipelining CSC 211.01 Friday, November 6, 2015 Performance Issues Longest delay determines clock period Critical path: load instruction Instruction memory register file ALU data memory register file Not
More informationCOSC 6385 Computer Architecture - Pipelining
COSC 6385 Computer Architecture - Pipelining Fall 2006 Some of the slides are based on a lecture by David Culler, Instruction Set Architecture Relevant features for distinguishing ISA s Internal storage
More informationCSE Lecture 13/14 In Class Handout For all of these problems: HAS NOT CANNOT Add Add Add must wait until $5 written by previous add;
CSE 30321 Lecture 13/14 In Class Handout For the sequence of instructions shown below, show how they would progress through the pipeline. For all of these problems: - Stalls are indicated by placing the
More informationMa Lesson 18 Section 1.7
Ma 15200 Lesson 18 Section 1.7 I Representing an Ineqality There are 3 ways to represent an ineqality. (1) Using the ineqality symbol (sometime within set-bilder notation), (2) sing interval notation,
More informationImprove performance by increasing instruction throughput
Improve performance by increasing instruction throughput Program execution order Time (in instructions) lw $1, 100($0) fetch 2 4 6 8 10 12 14 16 18 ALU Data access lw $2, 200($0) 8ns fetch ALU Data access
More informationOutline Marquette University
COEN-4710 Computer Hardware Lecture 4 Processor Part 2: Pipelining (Ch.4) Cristinel Ababei Department of Electrical and Computer Engineering Credits: Slides adapted primarily from presentations from Mike
More informationModule 4c: Pipelining
Module 4c: Pipelining R E F E R E N C E S : S T A L L I N G S, C O M P U T E R O R G A N I Z A T I O N A N D A R C H I T E C T U R E M O R R I S M A N O, C O M P U T E R O R G A N I Z A T I O N A N D A
More informationCSSE232 Computer Architecture I. Mul5cycle Datapath
CSSE232 Compter Architectre I Ml5cycle Datapath Class Stats Next 3 days : Ml5cycle datapath ing Ml5cycle datapath is not in the book! How long do instrc5ons take? ALU 2ns Mem 2ns Reg File 1ns Everything
More informationWhat do we have so far? Multi-Cycle Datapath (Textbook Version)
What do we have so far? ulti-cycle Datapath (Textbook Version) CPI: R-Type = 4, Load = 5, Store 4, Branch = 3 Only one instruction being processed in datapath How to lower CPI further? #1 Lec # 8 Summer2001
More informationLecture 3. Pipelining. Dr. Soner Onder CS 4431 Michigan Technological University 9/23/2009 1
Lecture 3 Pipelining Dr. Soner Onder CS 4431 Michigan Technological University 9/23/2009 1 A "Typical" RISC ISA 32-bit fixed format instruction (3 formats) 32 32-bit GPR (R0 contains zero, DP take pair)
More informationMultiple-Choice Test Chapter Golden Section Search Method Optimization COMPLETE SOLUTION SET
Mltiple-Choice Test Chapter 09.0 Golden Section Search Method Optimization COMPLETE SOLUTION SET. Which o the ollowing statements is incorrect regarding the Eqal Interval Search and Golden Section Search
More informationECE260: Fundamentals of Computer Engineering
Pipelining James Moscola Dept. of Engineering & Computer Science York College of Pennsylvania Based on Computer Organization and Design, 5th Edition by Patterson & Hennessy What is Pipelining? Pipelining
More informationCSCI 402: Computer Architectures. Fengguang Song Department of Computer & Information Science IUPUI. Today s Content
3/6/8 CSCI 42: Computer Architectures The Processor (2) Fengguang Song Department of Computer & Information Science IUPUI Today s Content We have looked at how to design a Data Path. 4.4, 4.5 We will design
More informationComputer Architecture Lecture 6: Multi-cycle Microarchitectures. Prof. Onur Mutlu Carnegie Mellon University Spring 2012, 2/6/2012
8-447 Compter Architectre Lectre 6: lti-cycle icroarchitectres Prof. Onr tl Carnegie ellon University Spring 22, 2/6/22 Reminder: Homeworks Homework soltions Check and stdy the soltions! Learning now is
More informationComputer Architecture. Lecture 6.1: Fundamentals of
CS3350B Computer Architecture Winter 2015 Lecture 6.1: Fundamentals of Instructional Level Parallelism Marc Moreno Maza www.csd.uwo.ca/courses/cs3350b [Adapted from lectures on Computer Organization and
More informationFull Datapath. CSCI 402: Computer Architectures. The Processor (2) 3/21/19. Fengguang Song Department of Computer & Information Science IUPUI
CSCI 42: Computer Architectures The Processor (2) Fengguang Song Department of Computer & Information Science IUPUI Full Datapath Branch Target Instruction Fetch Immediate 4 Today s Contents We have looked
More informationPipeline: Introduction
Pipeline: Introduction These slides are derived from: CSCE430/830 Computer Architecture course by Prof. Hong Jiang and Dave Patterson UCB Some figures and tables have been derived from : Computer System
More informationPipelining Analogy. Pipelined laundry: overlapping execution. Parallelism improves performance. Four loads: Non-stop: Speedup = 8/3.5 = 2.3.
Pipelining Analogy Pipelined laundry: overlapping execution Parallelism improves performance Four loads: Speedup = 8/3.5 = 2.3 Non-stop: Speedup =2n/05n+15 2n/0.5n 1.5 4 = number of stages 4.5 An Overview
More informationCOMP2611: Computer Organization. The Pipelined Processor
COMP2611: Computer Organization The 1 2 Background 2 High-Performance Processors 3 Two techniques for designing high-performance processors by exploiting parallelism: Multiprocessing: parallelism among
More informationModern Computer Architecture
Modern Computer Architecture Lecture2 Pipelining: Basic and Intermediate Concepts Hongbin Sun 国家集成电路人才培养基地 Xi an Jiaotong University Pipelining: Its Natural! Laundry Example Ann, Brian, Cathy, Dave each
More informationCSE 2021 Computer Organization. Hugh Chesser, CSEB 1012U W12-M
CSE 22 Computer Organization Hugh Chesser, CSEB 2U W2- Graphical Representation Time 2 6 8 add $s, $t, $t IF ID E E Decode / Execute emory Back fetch from / stage into the instruction register file. Shading
More informationMIPS Pipelining. Computer Organization Architectures for Embedded Computing. Wednesday 8 October 14
MIPS Pipelining Computer Organization Architectures for Embedded Computing Wednesday 8 October 14 Many slides adapted from: Computer Organization and Design, Patterson & Hennessy 4th Edition, 2011, MK
More informationT = I x CPI x C. Both effective CPI and clock cycle C are heavily influenced by CPU design. CPI increased (3-5) bad Shorter cycle good
CPU performance equation: T = I x CPI x C Both effective CPI and clock cycle C are heavily influenced by CPU design. For single-cycle CPU: CPI = 1 good Long cycle time bad On the other hand, for multi-cycle
More informationPipelined Datapath. One register file is enough
ipelined path The goal of pipelining is to allow multiple instructions execute at the same time We may need to perform several operations in a cycle Increment the and add s at the same time. Fetch one
More information10.2 Solving Quadratic Equations by Completing the Square
. Solving Qadratic Eqations b Completing the Sqare Consider the eqation We can see clearl that the soltions are However, What if the eqation was given to s in standard form, that is 6 How wold we go abot
More informationPipelining. Maurizio Palesi
* Pipelining * Adapted from David A. Patterson s CS252 lecture slides, http://www.cs.berkeley/~pattrsn/252s98/index.html Copyright 1998 UCB 1 References John L. Hennessy and David A. Patterson, Computer
More informationBasics of Digital Logic Design
ignals, Logic Operations and Gates E 675.2: Introdction to ompter rchitectre asics of igital Logic esign Rather than referring to voltage levels of signals, we shall consider signals that are logically
More informationMIPS Architecture. Fibonacci (C) Fibonacci (Assembly) Another Example: MIPS. Example: subset of MIPS processor architecture
Another Eample: IPS From the Harris/Weste book Based on the IPS-like processor from the Hennessy/Patterson book IPS Architectre Eample: sbset of IPS processor architectre Drawn from Patterson & Hennessy
More informationPavlin and Daniel D. Corkill. Department of Computer and Information Science University of Massachusetts Amherst, Massachusetts 01003
From: AAAI-84 Proceedings. Copyright 1984, AAAI (www.aaai.org). All rights reserved. SELECTIVE ABSTRACTION OF AI SYSTEM ACTIVITY Jasmina Pavlin and Daniel D. Corkill Department of Compter and Information
More information