190-MHz CMOS 4-Kbyte Pipelined Caches
|
|
- Clarissa Day
- 5 years ago
- Views:
Transcription
1 90-MHz CMOS -Kbyte Pipelined Caches Apoorv Srivastava, Yong-Seon Koh, Barton Sano, and Alvin M. Despain ACAL-TR-9- November 99 ABSTRACT In this paper we describe the design and implementation of a 90-MHz pipelined -Kbyte instruction and data cache. The caches are designed in.0-µm CMOS and measure 0.78 x 0.7 cm. This paper describes the microarchitecture, cache timing, circuit implementation, and layout of both the instruction and the data cache. The key features of these caches are pipelined execution and the use of dynamic single-phase clock logic. We discuss the interface of this cache with the processor core and the off-chip controller. This paper also describes the pipelined structure of the cache and the miss detection and handling logic. Published in ISCAS 9, Seattle 0
2 90-MHz CMOS -Kbyte Pipelined Caches Apoorv Srivastava, Yong-Seon Koh, Barton Sano, and Alvin M. Despain Department of Computer Engineering University of Southern California, Los Angeles, CA phone: () 70-9, fax: () ABSTRACT In this paper we describe the design and implementation of a 90-MHz pipelined -Kbyte instruction and data cache. The caches are designed in.0-µm CMOS and measure 0.78 x 0.7 cm. This paper describes the microarchitecture, cache timing, circuit implementation, and layout of both the instruction and the data cache. The key features of these caches are pipelined execution and the use of dynamic single-phase clock logic. We discuss the interface of this cache with the processor core and the off-chip controller. This paper also describes the pipelined structure of the cache and the miss detection and handling logic. I. INTRODUCTION With the increase in microprocessor speeds [],[] the design of high-speed on-chip caches becomes increasingly important. In this paper we describe the design and implementation of a 90-MHz -Kbyte direct-mapped instruction and data cache in.0-µm CMOS with metal layers. Although metal layers are available, the caches are designed using only two metal layers. The third metal layer is reserved for clock routing. The cache is organized with -word (8-bits) per cache line. The data cache supports the write-back policy. Through the use of pipelining, both the instruction and data caches support a throughput of one operation per clock cycle. Besides pipelining, we used single-phase dynamic logic [],[],[6] to build high-speed caches. These caches are part of the SLAM processor (Southern California Logic Abstract Machine) which aims to investigate deeply pipelined processors. Section II gives a high level description of the cache microarchitecture. In Section III we describe the pipelined execution timing of the instruction and data caches. Section IV discusses the circuit implementation of the major cells. In Section V we give a detailed description of the Ucache, the main module of both the instruction and data caches. In the next section, Section VI, we present the global layout of the cache. Section VII discusses the simulation results and in the last section, Section VIII, we present the conclusions. Published in ISCAS 9, Seattle
3 II. THE CACHE MICROARCHITECTURE Fig. shows a high-level block diagram of the cache microarchitecture of both the instruction and data caches. The cache Off-Chip Cache Controller Miss Detection and Handling Logic Cache Core Address Drv Decode Row Driver Tag Compare RAM CELLS w w w w0 Ucache Column Selector Demux (only in the D-Cache) Cache LCA (Datapath) HCA (Segment Table) Operation Miss Restart Data Write Data Processor Core Fig.. A high-level block diagram of the cache microarchitecture. interfaces with the processor core at the bottom and interfaces with the off-chip cache controller at the top. The processor core has a -bit address bus. Through the use of segment registers (the most significant 6 bits are used to access the segment registers), the processor core provides the cache with a 8-bit address (LCA, Low Address from Core, 6-bits; HCA High Address from Core, -bits). The control-signals to and from the processor to the cache are formed by 7-bit Operation lines, a Miss signal, and a Restart signal. The processor core uses the 7-bit Operation lines to specify various cache operations. The Miss-line signals a cache miss to the processor. To reset a cache miss, the processor asserts the Restart-line. The instruction cache provides the processor core with -bits of data every cycle. The data cache provides the processor core with 6-bits ( words) of data every cycle. The cache consists of the Ucache (unified cache), and the read multiplexer and write demultiplexer circuitry. The instruction cache contains only the read multiplexer circuitry. The Ucache is the portion of the cache common to both the instruction and the data cache. The Ucache contains the cache core, control logic, the cache tag comparison (Tag Compare), and the circuitry for cache miss detection and handling. The cache core consists of 6 cache lines each with -words (8-bits) and -bits of tag and status (8-bits tag and - bits status) (see Fig. ). The cache core includes the cache address drivers, the row decoders, row drivers, the static RAM cells, and the read/write circuitry. On top of the cache core are the read/write ports to and from the processor pins (off-chip memory and off-chip cache controller). At the bottom there are read and write (only for the data cache) ports for the processor core. The column selector circuitry consists of a -to- multiplexer and a -to- multiplexer for the instruction and the data Published in ISCAS 9, Seattle
4 cache, respectively. III. PIPELINED EXECUTION The key to the design of a high-speed cache is pipelined execution. The instruction and data caches use a stage pipeline. Fig. shows a pipeline timing diagram of the major functional units of the cache. The cache uses the first low phase of the phase 6 Φ Adr. from off-chip Decode Address Drive Selected Row Read/Write Data Tag(8-bits) Status (-bits: Excl, Invalid) Mux & Drive Compare Tag Tag Compare Unit n-latch Generate Cache Miss Drive cache miss signal Latch Miss Signal Miss cmux MARn Dec. decoded adr. Row Drive decoded adr. Read Mem Cell Data cmux_n n-latch Adr. from core On-chip Control PLA 60-bits Old_Tag Old_Buf recirculating latch Old_Tag Old_Buf Cache Control Pins 8-bits MARp MARn MARp MARn recirculating latch MARp Operation Bits (7-bits) Supervisor[6] LockUnlock[] LAS[] WrDbl[] WR[] ExpRD[] RD[0] Opern Operp Opern Operp Opern recirculating latch Operp Fig.. Pipelined block diagram of the major functional units of the Ucache. clock (phase -) to decode the address from the processor. For the datapath, the cache also latches the data in phase -. The decoded address is latched at the rising edge (edge ). The row drivers drive the selected row in the second phase (phase -). The third phase (phase -) is the read/write phase. For the instruction cache, the cache drives the data to the processor core during phase -. The processor latches the data from the cache on the falling edge. The cache detects two types of misses:. Address misses. These are generated when the tags do not match.. Consistency misses. These are generated when the processor attempts to access data marked invalid, or perform a Lock/Unlock or a write(wr) operation to shared data. Data is marked invalid or shared by setting, respectively, the invalid or exclusive status bit in the cache. The chain of MARs - Memory Address Register - (MARn, MARp, MARn, MARp, MARn, MARp) and Opers - Operation Registers - (Opernp, Opern, Oper0, Opern, Operp) supports the handling of cache misses in a pipelined cache. A cache miss, indicated by the Miss-signal, is detected during phase -6 and latched on the rising edge 6. The address of the instruction that caused the miss is latched on edge 6 into MARp. The Oper-chain holds the Operation signals (e.g Lock/Unlock or WR - write) from the processor to the cache. Similar to MARp, the cache operation specified by the instruction that caused the miss is latched into Operp on edge 6. On a cache miss recirculating latches (MARn/MARp and Opern/Operp) are frozen. MARn and Opern stop loading in new data and start recirculating the old data. This freezing of the latches holds the values until the off-chip controller has handled the cache-misses. The implementation of Old_Buf and Old_Tag allows the off-chip controller to handle cache misses caused by memory store operations. Since the cache only has one memory port, store operations to memory are implemented as two cycle operations. A store operation is expanded by the processor into two internal operations: a memory read operation (ExpRD - Expanded Read) followed by a write operation (WR - Write). Due to pipelined operations, in the case of cache write misses, a cache write to a particular address corrupts that cache address before the expanded-read operation can generate a cache Published in ISCAS 9, Seattle
5 miss. To allow the off-chip controller to write back the overwritten address, the expanded-read operation causes the data to latch in the Old_Buf and Old_Tag latches. The cache miss signal caused by the expanded-read operation freezes the latches on edge 6. The recirculating latches recirculate data until the off-chip controller handles the cache miss, and the processor asserts the Restart signal and resets the cache miss. IV. CELL DESIGN AND CIRCUIT IMPLEMENTATION Fig. presents the circuits used in the cache core. The figure shows the circuitry of the write driver, the memory cell, the row decoder, and the row driver. The write driver converts a single-ended data signal into a differential signal for the memory cell. The memory cell is built out of a six-transistor static RAM cell (0 x µm ). The row decoder is an 8-bit NAND circuit built using modified domino logic []. The layouts of both the row decoder and row driver are pitch matched (0 µm) to the height of the memory cell. The inverted clock signal,, is generated locally from the single-phase clock,. Write Driver Memory Cell Read Sense Amplifier & Precharge data write enable 6/ write enable 0 0 wr data 0 wr data 0 row select bitline / / bitline row select Diff. Sense Amplifier read data precharge Row Decoder Row Driver. a 7 a 6 a a memop /7 / / /80 to row select. 6 a a a a 0 Fig.. Circuit diagram of the write driver, memory cell, the row decoder, and the row driver. V. THE UCACHE The Ucache contains the peripheral circuitry common to both the instruction and the data cache. Fig. shows the microarchitecture of the instruction and the data cache. The Ucache includes the cache core, the MARs (Memory Address Register) chain, control latches, miss detection circuit, and the tri-state drivers for the external bus. The Ucache has a 77-bit bus interfacing to the off-chip cache controller. The instruction (I-Cache) and the data (D-Cache) share 70 bits. The remaining 7-bits are used to signal the status of the on-chip caches to the off-chip circuitry. The 7-bits signal the status of the data cache as follows: RD (read), ExpRD (expanded read), WR (write operation), WrDbl (write double word operation), LAS (load and set operation), LockUnlock (lock). The Supervisor-bit (-bit) signals the status of the instruction cache. The supervisor-bit is set when the processor is in the supervisor mode. A miss is detected at the third cycle after the cache access. To detect a write miss during a STORE instruction, the core internally expands the STORE operation into an exprd and WR control operation. The off-chip controller controls the cache (e.g. during cache miss handling) through 6 pins: MWait_h, I/D_h, cr/w_h, Buf_h[:0], BOE_h. These pins allow the off-chip controller to read/write to and from the cache. The cache interfaces with the processor through a 6-bit LCA (Lower Address from Core), a -bit HCA (Higher Published in ISCAS 9, Seattle
6 77 0-bit bus Dedicated 7-bit bus In Dcache, [0,,,,,] are connected to bus In Icache, [6] is connected to bus 60-bit bus MWait_h I/D_h cr/w_h Buf_h[:0] BOE_h 6 d_pla/i_pla Xread[0] mux[] mux[] StDrv[] StDrv[] StDrv[] BusDrv[6] Xwe[7] recirculating n- & p-latch. p-latch (latch on rising edge) n-latch (latch on falling edge) low_mar_out [0:9] 8 StDrv[] tag_status_out external_mar_in [0:] [:9] StDrv StDrv [6:9] [0:9] 0 8 mux[] cmux 0 8 [0:7] [8:] 6 [0:] [0:] MARnp [0:7] MARp [0:7] MARn MARp MARn 8 6 cmux 0 mux[] [0:] 8 8 [:9] Ucache StDrv[] 6 [0:] 6 LCA HCA (DP Core) (Seg Tbl) [6:7] [0:] StDrv[] Opernp Operp Opern Opernp StDrv 7 Supervisor[6] LockUnlock[] LAS[] WrDbl[] WR[] ExpRD[] RD[0] Supervisor[6] LockUnlock[] LAS[] WrDbl[] WR[] ExpRD[] RD[0] ([,,,,] grounded in the I-Cache [6] grounded in D-Cache) Miss 7 we<> PLA dirty write drv [:9] 8 Address [7:0] 8 Fig.. Microarchitecture of the instruction and data cache. In the D-Cache: 6-bits Address from Core), a 7-bit Operation signal (ExpRd, WR, WrDbl, LAS, LockUnlock are grounded in the instruction cache; the Supervisor-bit is grounded in the data cache), and the Restart and Miss signal. The SLAM architecture implements a double word databus to the data cache. Since the data cache is organized as 6- bits per cache line, the data cache implements a -to- multiplexer (d_cmux_n) and a -to- demultiplexer to perform the read and the write operations, respectively. The instruction fetch mechanism fetches only word (-bits) from the instruction cache. The instruction cache implements a -to- multiplexer (i_cmux_n) between the Ucache and the processor core. [:0] [:] [,] [0,] int_access 6 Xread[0] Xwe[7] memop Tag int_access_miss Q S R [:0] ([0] not connected in D-Cache) Tag Restart Address Drv cache_line_in [9:0] BusDrv[6] Cache Core Decode [] Row Driver Invalid 8 Excl TAG Compare core_miss 60 8 Column Decoders rr0 = ww Old Tag RAM CELLS word word word word0 8 0 rr0 = ww0 d_cmux_n/i_cmux_n d_cmdrv/i_cmdrv In the I-Cache: -bits BusDrv 60 cache_data_out [7:0] 8 Old Buf 6 D 6 8 to w w w w0 T T 6 Write demultiplexer implemented in the D-Cache Unused in the I-Cache VI. CACHE LAYOUT Fig. shows the layout of the data cache. The layout measures 0.78 x 0.7 cm. VII. SIMULATION RESULTS We ran a switch-level simulation and a circuit-level simulation on the entire cache circuitry using Irsim and CAzM, respectively. CAzM is a table-lookup (Spice-like) circuit simulator. We constructed the instruction and data cache using the Magic [] layout editor and extracted the netlist from the layout. Both the instruction and data cache have approximately 0K transistors. Circuit-level simulation on both caches gives a cycle-time of.ns which corresponds to an operating frequency of 9-MHz. VIII. CONCLUSIONS In this paper we have designed and simulated a 90-MHz -Kbyte direct-mapped instruction and data cache in.0 µm CMOS. The key to high-speed design is the use of pipelined execution and the use of dynamic single-phase clock logic. The critical path of the design is the write circuitry. We are currently working to eliminate this bottleneck. Published in ISCAS 9, Seattle
7 Fig.. Layout of the data cache. ACKNOWLEDGMENTS This work presented here is part of the Hypercomputing & Design (HPCD) project; and it is supported (partly) by ARPA under contracts DABT-6-9-C-006 and J-FBI-9-9. The content of the information herein does not necessarily reflect the position of the Government and official endorsement should not be inferred. REFERENCES [] Morteza Afgahi and Christer Svensson, A Unified Single-Phase Clocking Scheme for VLSI Systems, in IEEE Journal of Solid-State Circuits, Vol., No., February 990. [] Roy W. Badeau et. al., A 00-MHz Macropipelined VAX Microprocessor, in IEEE Journal of Solid-State Circuits, Vol. 7, No., November 99. [] Daniel W. Dobberpuhl et. al., A 00-MHz 6-b Dual-Issue CMOS Microprocessor, in IEEE Journal of Solid-State Circuits, Vol. 7, No., November 99. [] Neil Weste and Kamran Eshraghian, Principles of CMOS VLSI Design A Systems Perspective, Addison-Wesley, ISBN , June, 988. [] John Ousterhout, Magic Tutorial, Computer Science Division, EECS, University of California, Berkeley. [6] Jiren Yuan and Christer Svensson, High-Speed CMOS Circuit Technique, in IEEE Journal of Solid-State Circuits, Vol., No., February 989. Published in ISCAS 9, Seattle 6
A 65nm LEVEL-1 CACHE FOR MOBILE APPLICATIONS
A 65nm LEVEL-1 CACHE FOR MOBILE APPLICATIONS ABSTRACT We describe L1 cache designed for digital signal processor (DSP) core. The cache is 32KB with variable associativity (4 to 16 ways) and is pseudo-dual-ported.
More informationTHE latest generation of microprocessors uses a combination
1254 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 30, NO. 11, NOVEMBER 1995 A 14-Port 3.8-ns 116-Word 64-b Read-Renaming Register File Creigton Asato Abstract A 116-word by 64-b register file for a 154 MHz
More informationIssue Logic for a 600-MHz Out-of-Order Execution Microprocessor
IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 33, NO. 5, MAY 1998 707 Issue Logic for a 600-MHz Out-of-Order Execution Microprocessor James A. Farrell and Timothy C. Fischer Abstract The logic and circuits
More informationLow Power PLAs. Reginaldo Tavares, Michel Berkelaar, Jochen Jess. Information and Communication Systems Section, Eindhoven University of Technology,
Low Power PLAs Reginaldo Tavares, Michel Berkelaar, Jochen Jess Information and Communication Systems Section, Eindhoven University of Technology, P.O. Box 513, 5600 MB Eindhoven, The Netherlands {regi,michel,jess}@ics.ele.tue.nl
More informationColumn decoder using PTL for memory
IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-issn: 2278-2834,p- ISSN: 2278-8735. Volume 5, Issue 4 (Mar. - Apr. 2013), PP 07-14 Column decoder using PTL for memory M.Manimaraboopathy
More informationData Cache Final Project Report ECE251: VLSI Systems Design UCI Spring, 2000
June 15, 2000 Data Cache Final Project Report ECE251: VLSI Systems Design UCI Spring, 2000 Jinfeng Liu Yi Deng ID: 65547013 (jinfengl@ece.uci.edu) ID: 57434732 (dengy@ece.uci.edu) Project Summary In this
More informationHigh Performance Memory Read Using Cross-Coupled Pull-up Circuitry
High Performance Memory Read Using Cross-Coupled Pull-up Circuitry Katie Blomster and José G. Delgado-Frias School of Electrical Engineering and Computer Science Washington State University Pullman, WA
More informationEECS150 - Digital Design Lecture 17 Memory 2
EECS150 - Digital Design Lecture 17 Memory 2 October 22, 2002 John Wawrzynek Fall 2002 EECS150 Lec17-mem2 Page 1 SDRAM Recap General Characteristics Optimized for high density and therefore low cost/bit
More informationChapter 5B. Large and Fast: Exploiting Memory Hierarchy
Chapter 5B Large and Fast: Exploiting Memory Hierarchy One Transistor Dynamic RAM 1-T DRAM Cell word access transistor V REF TiN top electrode (V REF ) Ta 2 O 5 dielectric bit Storage capacitor (FET gate,
More informationfswallace, niravd, limited superscalar principles. Most of them implemented 2 Design
Design and Implementation of a 100 MHz Centralized Instruction Window for a Superscalar Microprocessor Steven Wallace, Nirav Dagli, and Nader Bagherzadeh Department of Electrical and Computer Engineering
More informationISSN: [Bilani* et al.,7(2): February, 2018] Impact Factor: 5.164
IJESRT INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY A REVIEWARTICLE OF SDRAM DESIGN WITH NECESSARY CRITERIA OF DDR CONTROLLER Sushmita Bilani *1 & Mr. Sujeet Mishra 2 *1 M.Tech Student
More informationESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems
ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Lec 26: November 9, 2018 Memory Overview Dynamic OR4! Precharge time?! Driving input " With R 0 /2 inverter! Driving inverter
More informationCHAPTER 5 : Introduction to Intel 8085 Microprocessor Hardware BENG 2223 MICROPROCESSOR TECHNOLOGY
CHAPTER 5 : Introduction to Intel 8085 Hardware BENG 2223 MICROPROCESSOR TECHNOLOGY The 8085A(commonly known as the 8085) : Was first introduced in March 1976 is an 8-bit microprocessor with 16-bit address
More informationA Low Power DDR SDRAM Controller Design P.Anup, R.Ramana Reddy
A Low Power DDR SDRAM Controller Design P.Anup, R.Ramana Reddy Abstract This paper work leads to a working implementation of a Low Power DDR SDRAM Controller that is meant to be used as a reference for
More informationMarching Memory マーチングメモリ. UCAS-6 6 > Stanford > Imperial > Verify 中村維男 Based on Patent Application by Tadao Nakamura and Michael J.
UCAS-6 6 > Stanford > Imperial > Verify 2011 Marching Memory マーチングメモリ Tadao Nakamura 中村維男 Based on Patent Application by Tadao Nakamura and Michael J. Flynn 1 Copyright 2010 Tadao Nakamura C-M-C Computer
More informationInternational Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering
IP-SRAM ARCHITECTURE AT DEEP SUBMICRON CMOS TECHNOLOGY A LOW POWER DESIGN D. Harihara Santosh 1, Lagudu Ramesh Naidu 2 Assistant professor, Dept. of ECE, MVGR College of Engineering, Andhra Pradesh, India
More informationLecture 13: SRAM. Slides courtesy of Deming Chen. Slides based on the initial set from David Harris. 4th Ed.
Lecture 13: SRAM Slides courtesy of Deming Chen Slides based on the initial set from David Harris CMOS VLSI Design Outline Memory Arrays SRAM Architecture SRAM Cell Decoders Column Circuitry Multiple Ports
More informationECE 485/585 Microprocessor System Design
Microprocessor System Design Lecture 4: Memory Hierarchy Memory Taxonomy SRAM Basics Memory Organization DRAM Basics Zeshan Chishti Electrical and Computer Engineering Dept Maseeh College of Engineering
More informationCS250 VLSI Systems Design Lecture 9: Memory
CS250 VLSI Systems esign Lecture 9: Memory John Wawrzynek, Jonathan Bachrach, with Krste Asanovic, John Lazzaro and Rimas Avizienis (TA) UC Berkeley Fall 2012 CMOS Bistable Flip State 1 0 0 1 Cross-coupled
More informationCPE300: Digital System Architecture and Design
CPE300: Digital System Architecture and Design Fall 2011 MW 17:30-18:45 CBC C316 Cache 11232011 http://www.egr.unlv.edu/~b1morris/cpe300/ 2 Outline Review Memory Components/Boards Two-Level Memory Hierarchy
More informationA Comparative Study of Power Efficient SRAM Designs
A Comparative tudy of Power Efficient RAM Designs Jeyran Hezavei, N. Vijaykrishnan, M. J. Irwin Pond Laboratory, Department of Computer cience & Engineering, Pennsylvania tate University {hezavei, vijay,
More informationInfineon HYB39S128160CT M SDRAM Circuit Analysis
September 8, 2004 Infineon HYB39S128160CT-7.5 128M SDRAM Circuit Analysis Table of Contents Introduction... Page 1 List of Figures... Page 2 Device Summary Sheet... Page 13 Chip Description... Page 16
More informationBasics of Microprocessor
Unit 1 Basics of Microprocessor 1. Microprocessor Microprocessor is a multipurpose programmable integrated device that has computing and decision making capability. This semiconductor IC is manufactured
More informationEE577b. Register File. By Joong-Seok Moon
EE577b Register File By Joong-Seok Moon Register File A set of registers that store data Consists of a small array of static memory cells Smallest size and fastest access time in memory hierarchy (Register
More informationMEMORIES. Memories. EEC 116, B. Baas 3
MEMORIES Memories VLSI memories can be classified as belonging to one of two major categories: Individual registers, single bit, or foreground memories Clocked: Transparent latches and Flip-flops Unclocked:
More information(1) Define following terms: Instruction, Machine Cycle, Opcode, Oprand & Instruction Cycle. Instruction:
(1) Define following terms: Instruction, Machine Cycle, Opcode, Oprand & Instruction Cycle. Instruction: Instruction is the command given by the programmer to the Microprocessor to Perform the Specific
More informationLecture 11 SRAM Zhuo Feng. Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 2010
EE4800 CMOS Digital IC Design & Analysis Lecture 11 SRAM Zhuo Feng 11.1 Memory Arrays SRAM Architecture SRAM Cell Decoders Column Circuitryit Multiple Ports Outline Serial Access Memories 11.2 Memory Arrays
More information8Kb Logic Compatible DRAM based Memory Design for Low Power Systems
8Kb Logic Compatible DRAM based Memory Design for Low Power Systems Harshita Shrivastava 1, Rajesh Khatri 2 1,2 Department of Electronics & Instrumentation Engineering, Shree Govindram Seksaria Institute
More informationDesign and Functional Verification of Four Way Set Associative Cache Controller
International Journal of Research in Computer and Communication Technology, Vol 4, Issue 3, March -2015 ISSN (Online) 2278-5841 ISSN (Print) 2320-5156 Design and Functional Verification of Four Way Set
More informationCHAPTER 12 ARRAY SUBSYSTEMS [ ] MANJARI S. KULKARNI
CHAPTER 2 ARRAY SUBSYSTEMS [2.4-2.9] MANJARI S. KULKARNI OVERVIEW Array classification Non volatile memory Design and Layout Read-Only Memory (ROM) Pseudo nmos and NAND ROMs Programmable ROMS PROMS, EPROMs,
More informationMemory in Digital Systems
MEMORIES Memory in Digital Systems Three primary components of digital systems Datapath (does the work) Control (manager) Memory (storage) Single bit ( foround ) Clockless latches e.g., SR latch Clocked
More information! Memory. " RAM Memory. " Serial Access Memories. ! Cell size accounts for most of memory array size. ! 6T SRAM Cell. " Used in most commercial chips
ESE 57: Digital Integrated Circuits and VLSI Fundamentals Lec : April 5, 8 Memory: Periphery circuits Today! Memory " RAM Memory " Architecture " Memory core " SRAM " DRAM " Periphery " Serial Access Memories
More informationIntroduction to CMOS VLSI Design Lecture 13: SRAM
Introduction to CMOS VLSI Design Lecture 13: SRAM David Harris Harvey Mudd College Spring 2004 1 Outline Memory Arrays SRAM Architecture SRAM Cell Decoders Column Circuitry Multiple Ports Serial Access
More informationChapter 1: Basics of Microprocessor [08 M]
Microprocessor: Chapter 1: Basics of Microprocessor [08 M] It is a semiconductor device consisting of electronic logic circuits manufactured by using either a Large scale (LSI) or Very Large Scale (VLSI)
More informationECE 2300 Digital Logic & Computer Organization
ECE 2300 Digital Logic & Computer Organization Spring 201 Memories Lecture 14: 1 Announcements HW6 will be posted tonight Lab 4b next week: Debug your design before the in-lab exercise Lecture 14: 2 Review:
More informationIntroduction to SRAM. Jasur Hanbaba
Introduction to SRAM Jasur Hanbaba Outline Memory Arrays SRAM Architecture SRAM Cell Decoders Column Circuitry Non-volatile Memory Manufacturing Flow Memory Arrays Memory Arrays Random Access Memory Serial
More informationELEC 5200/6200 Computer Architecture and Design Spring 2017 Lecture 7: Memory Organization Part II
ELEC 5200/6200 Computer Architecture and Design Spring 2017 Lecture 7: Organization Part II Ujjwal Guin, Assistant Professor Department of Electrical and Computer Engineering Auburn University, Auburn,
More informationLow Power using Match-Line Sensing in Content Addressable Memory S. Nachimuthu, S. Ramesh 1 Department of Electrical and Electronics Engineering,
Low Power using Match-Line Sensing in Content Addressable Memory S. Nachimuthu, S. Ramesh 1 Department of Electrical and Electronics Engineering, K.S.R College of Engineering, Tiruchengode, Tamilnadu,
More informationModule 2. Embedded Processors and Memory. Version 2 EE IIT, Kharagpur 1
Module 2 Embedded Processors and Memory Version 2 EE IIT, Kharagpur 1 Lesson 11 Embedded Processors - II Version 2 EE IIT, Kharagpur 2 Signals of a Typical Microcontroller In this lesson the student will
More informationMemory. Lecture 22 CS301
Memory Lecture 22 CS301 Administrative Daily Review of today s lecture w Due tomorrow (11/13) at 8am HW #8 due today at 5pm Program #2 due Friday, 11/16 at 11:59pm Test #2 Wednesday Pipelined Machine Fetch
More informationData Acquisition Using INS8048
Data Acquisition Using INS8048 Abstract This application note describes techniques for interfacing National Semiconductor s ADC0833 serial I O and ADC0804 parallel I O A D converters to the INS8048 family
More informationExploring High Bandwidth Pipelined Cache Architecture for Scaled Technology
Exploring High Bandwidth Pipelined Cache Architecture for Scaled Technology Amit Agarwal, Kaushik Roy, and T. N. Vijaykumar Electrical & Computer Engineering Purdue University, West Lafayette, IN 4797,
More informationDigital Integrated Circuits Lecture 13: SRAM
Digital Integrated Circuits Lecture 13: SRAM Chih-Wei Liu VLSI Signal Processing LAB National Chiao Tung University cwliu@twins.ee.nctu.edu.tw DIC-Lec13 cwliu@twins.ee.nctu.edu.tw 1 Outline Memory Arrays
More informationMemories: Memory Technology
Memories: Memory Technology Z. Jerry Shi Assistant Professor of Computer Science and Engineering University of Connecticut * Slides adapted from Blumrich&Gschwind/ELE475 03, Peh/ELE475 * Memory Hierarchy
More informationENGIN 112 Intro to Electrical and Computer Engineering
ENGIN 112 Intro to Electrical and Computer Engineering Lecture 30 Random Access Memory (RAM) Overview Memory is a collection of storage cells with associated input and output circuitry Possible to read
More informationEmbedded Systems Ch 15 ARM Organization and Implementation
Embedded Systems Ch 15 ARM Organization and Implementation Byung Kook Kim Dept of EECS Korea Advanced Institute of Science and Technology Summary ARM architecture Very little change From the first 3-micron
More informationLecture-55 System Interface:
Lecture-55 System Interface: To interface 8253 with 8085A processor, CS signal is to be generated. Whenever CS =0, chip is selected and depending upon A 1 and A 0 one of the internal registers is selected
More informationSRAM. Introduction. Digital IC
SRAM Introduction Outline Memory Arrays SRAM Architecture SRAM Cell Decoders Column Circuitry Multiple Ports Serial Access Memories Memory Arrays Memory Arrays Random Access Memory Serial Access Memory
More informationSemiconductor Memory Classification. Today. ESE 570: Digital Integrated Circuits and VLSI Fundamentals. CPU Memory Hierarchy.
ESE 57: Digital Integrated Circuits and VLSI Fundamentals Lec : April 4, 7 Memory Overview, Memory Core Cells Today! Memory " Classification " ROM Memories " RAM Memory " Architecture " Memory core " SRAM
More informationCMPEN 411 VLSI Digital Circuits Spring Lecture 22: Memery, ROM
CMPEN 411 VLSI Digital Circuits Spring 2011 Lecture 22: Memery, ROM [Adapted from Rabaey s Digital Integrated Circuits, Second Edition, 2003 J. Rabaey, A. Chandrakasan, B. Nikolic] Sp11 CMPEN 411 L22 S.1
More informationFIRST-LEVEL cache RAM blocks for the next generation
1682 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 33, NO. 11, NOVEMBER 1998 64-KByte Sum-Addressed-Memory Cache with 1.6-ns Cycle and 2.6-ns Latency Raymond Heald, Ken Shin, Vinita Reddy, I-Feng Kao, Masood
More informationCENG 3420 Computer Organization and Design. Lecture 08: Cache Review. Bei Yu
CENG 3420 Computer Organization and Design Lecture 08: Cache Review Bei Yu CEG3420 L08.1 Spring 2016 A Typical Memory Hierarchy q Take advantage of the principle of locality to present the user with as
More informationEE414 Embedded Systems Ch 5. Memory Part 2/2
EE414 Embedded Systems Ch 5. Memory Part 2/2 Byung Kook Kim School of Electrical Engineering Korea Advanced Institute of Science and Technology Overview 6.1 introduction 6.2 Memory Write Ability and Storage
More informationPin Description, Status & Control Signals of 8085 Microprocessor
Pin Description, Status & Control Signals of 8085 Microprocessor 1 Intel 8085 CPU Block Diagram 2 The 8085 Block Diagram Registers hold temporary data. Instruction register (IR) holds the currently executing
More information1. INTRODUCTION TO MICROPROCESSOR AND MICROCOMPUTER ARCHITECTURE:
1. INTRODUCTION TO MICROPROCESSOR AND MICROCOMPUTER ARCHITECTURE: A microprocessor is a programmable electronics chip that has computing and decision making capabilities similar to central processing unit
More informationPICo Embedded High Speed Cache Design Project
PICo Embedded High Speed Cache Design Project TEAM LosTohmalesCalientes Chuhong Duan ECE 4332 Fall 2012 University of Virginia cd8dz@virginia.edu Andrew Tyler ECE 4332 Fall 2012 University of Virginia
More informationReRISC: A Reconfigurable Reduced Instruction Set Computer
ReRISC: A Reconfigurable Reduced Instruction Set Computer Andrew Huang and Edward H. Kim Department of EECS, Massachusetts Institute of Technology, Cambridge Abstract The ReRISC processor gives users the
More informationCSE 431 Computer Architecture Fall Chapter 5A: Exploiting the Memory Hierarchy, Part 1
CSE 431 Computer Architecture Fall 2008 Chapter 5A: Exploiting the Memory Hierarchy, Part 1 Mary Jane Irwin ( www.cse.psu.edu/~mji ) [Adapted from Computer Organization and Design, 4 th Edition, Patterson
More informationCENG 4480 L09 Memory 2
CENG 4480 L09 Memory 2 Bei Yu Reference: Chapter 11 Memories CMOS VLSI Design A Circuits and Systems Perspective by H.E.Weste and D.M.Harris 1 v.s. CENG3420 CENG3420: architecture perspective memory coherent
More informationEECS150 - Digital Design Lecture 16 Memory 1
EECS150 - Digital Design Lecture 16 Memory 1 March 13, 2003 John Wawrzynek Spring 2003 EECS150 - Lec16-mem1 Page 1 Memory Basics Uses: Whenever a large collection of state elements is required. data &
More informationLow-Power SRAM and ROM Memories
Low-Power SRAM and ROM Memories Jean-Marc Masgonty 1, Stefan Cserveny 1, Christian Piguet 1,2 1 CSEM, Neuchâtel, Switzerland 2 LAP-EPFL Lausanne, Switzerland Abstract. Memories are a main concern in low-power
More informationSemiconductor Memory Classification
ESE37: Circuit-Level Modeling, Design, and Optimization for Digital Systems Lec 6: November, 7 Memory Overview Today! Memory " Classification " Architecture " Memory core " Periphery (time permitting)!
More informationControl Unit: The control unit provides the necessary timing and control Microprocessor resembles a CPU exactly.
Unit I 8085 and 8086 PROCESSOR Introduction to microprocessor A microprocessor is a clock-driven semiconductor device consisting of electronic logic circuits manufactured by using either a large-scale
More informationDesign and Implementation of Low Leakage Power SRAM System Using Full Stack Asymmetric SRAM
Design and Implementation of Low Leakage Power SRAM System Using Full Stack Asymmetric SRAM Rajlaxmi Belavadi 1, Pramod Kumar.T 1, Obaleppa. R. Dasar 2, Narmada. S 2, Rajani. H. P 3 PG Student, Department
More information+1 (479)
Memory Courtesy of Dr. Daehyun Lim@WSU, Dr. Harris@HMC, Dr. Shmuel Wimer@BIU and Dr. Choi@PSU http://csce.uark.edu +1 (479) 575-6043 yrpeng@uark.edu Memory Arrays Memory Arrays Random Access Memory Serial
More informationReview: Timing. EECS Components and Design Techniques for Digital Systems. Lec 13 Storage: Regs, SRAM, ROM. Outline.
Review: Timing EECS 150 - Components and Design Techniques for Digital Systems Lec 13 Storage: Regs,, ROM David Culler Electrical Engineering and Computer Sciences University of California, Berkeley http://www.eecs.berkeley.edu/~culler
More informationBased on slides/material by. Topic 7-4. Memory and Array Circuits. Outline. Semiconductor Memory Classification
Based on slides/material by Topic 7 Memory and Array Circuits K. Masselos http://cas.ee.ic.ac.uk/~kostas J. Rabaey http://bwrc.eecs.berkeley.edu/classes/icbook/instructors.html Digital Integrated Circuits:
More informationDYNAMIC ASYNCHRONOUS LOGIC FOR HIGH SPEED CMOS SYSTEMS. I. Introduction
DYNAMIC ASYNCHRONOUS LOGIC FOR HIGH SEED CMOS SYSTEMS Anthony J. McAuley Bellcore (Room MRE-2N263), 445 South Street, Morristown, NJ 07962-1910, USA hone: (201)-829-4698; E-mail: mcauley@bellcore.com Abstract
More informationAddress connections Data connections Selection connections
Interface (cont..) We have four common types of memory: Read only memory ( ROM ) Flash memory ( EEPROM ) Static Random access memory ( SARAM ) Dynamic Random access memory ( DRAM ). Pin connections common
More informationInternational Journal of Scientific & Engineering Research, Volume 5, Issue 2, February ISSN
International Journal of Scientific & Engineering Research, Volume 5, Issue 2, February-2014 938 LOW POWER SRAM ARCHITECTURE AT DEEP SUBMICRON CMOS TECHNOLOGY T.SANKARARAO STUDENT OF GITAS, S.SEKHAR DILEEP
More informationMultilevel Memories. Joel Emer Computer Science and Artificial Intelligence Laboratory Massachusetts Institute of Technology
1 Multilevel Memories Computer Science and Artificial Intelligence Laboratory Massachusetts Institute of Technology Based on the material prepared by Krste Asanovic and Arvind CPU-Memory Bottleneck 6.823
More informationMemory. Outline. ECEN454 Digital Integrated Circuit Design. Memory Arrays. SRAM Architecture DRAM. Serial Access Memories ROM
ECEN454 Digital Integrated Circuit Design Memory ECEN 454 Memory Arrays SRAM Architecture SRAM Cell Decoders Column Circuitry Multiple Ports DRAM Outline Serial Access Memories ROM ECEN 454 12.2 1 Memory
More informationMicrocontroller Systems. ELET 3232 Topic 11: General Memory Interfacing
Microcontroller Systems ELET 3232 Topic 11: General Memory Interfacing 1 Objectives To become familiar with the concepts of memory expansion and the data and address bus To design embedded systems circuits
More informationLecture 11: MOS Memory
Lecture 11: MOS Memory MAH, AEN EE271 Lecture 11 1 Memory Reading W&E 8.3.1-8.3.2 - Memory Design Introduction Memories are one of the most useful VLSI building blocks. One reason for their utility is
More informationDesign of 6-T SRAM Cell for enhanced read/write margin
International Journal of Advances in Electrical and Electronics Engineering 317 Available online at www.ijaeee.com & www.sestindia.org ISSN: 2319-1112 Design of 6-T SRAM Cell for enhanced read/write margin
More informationEECS150 - Digital Design Lecture 16 - Memory
EECS150 - Digital Design Lecture 16 - Memory October 17, 2002 John Wawrzynek Fall 2002 EECS150 - Lec16-mem1 Page 1 Memory Basics Uses: data & program storage general purpose registers buffering table lookups
More informationPrototype of SRAM by Sergey Kononov, et al.
Prototype of SRAM by Sergey Kononov, et al. 1. Project Overview The goal of the project is to create a SRAM memory layout that provides maximum utilization of the space on the 1.5 by 1.5 mm chip. Significant
More informationDesign of Low Power Wide Gates used in Register File and Tag Comparator
www..org 1 Design of Low Power Wide Gates used in Register File and Tag Comparator Isac Daimary 1, Mohammed Aneesh 2 1,2 Department of Electronics Engineering, Pondicherry University Pondicherry, 605014,
More informationDP8420V 21V 22V-33 DP84T22-25 microcmos Programmable 256k 1M 4M Dynamic RAM Controller Drivers
DP8420V 21V 22V-33 DP84T22-25 microcmos Programmable 256k 1M 4M Dynamic RAM Controller Drivers General Description The DP8420V 21V 22V-33 DP84T22-25 dynamic RAM controllers provide a low cost single chip
More informationRoberto Muscedere Images and Text Portions 2003 Prentice Hall 1
Microcomputer Structure and Operation Chapter 5 A Microprocessor ( P) contains the controller, ALU and internal registers A Microcomputer ( C) contains a microprocessor, memory (RAM, ROM, etc), input and
More informationCS650 Computer Architecture. Lecture 9 Memory Hierarchy - Main Memory
CS65 Computer Architecture Lecture 9 Memory Hierarchy - Main Memory Andrew Sohn Computer Science Department New Jersey Institute of Technology Lecture 9: Main Memory 9-/ /6/ A. Sohn Memory Cycle Time 5
More informationCpE 442. Memory System
CpE 442 Memory System CPE 442 memory.1 Outline of Today s Lecture Recap and Introduction (5 minutes) Memory System: the BIG Picture? (15 minutes) Memory Technology: SRAM and Register File (25 minutes)
More informationIntroduction to Semiconductor Memory Dr. Lynn Fuller Webpage:
ROCHESTER INSTITUTE OF TECHNOLOGY MICROELECTRONIC ENGINEERING Introduction to Semiconductor Memory Webpage: http://people.rit.edu/lffeee 82 Lomb Memorial Drive Rochester, NY 14623-5604 Tel (585) 475-2035
More informationllla. UNIV CR CENTER FOR INTEGRATED SYSTEMS M HOROWITZ ET AL HDAR93-83-C-9335 STANFORD
MD-A177 9" R 26 HIPS PERK MICROPROCESSOR WITH ON-CHIP CANE (U) v/i STANFORD llla. UNIV CR CENTER FOR INTEGRATED SYSTEMS M HOROWITZ ET AL. 1904 HDAR93-83-C-9335 UNCLSSoFIED F/G 9/2 L HIll1. 33 W2 2.2 t1l.
More informationDesign and Implementation of High Performance Application Specific Memory
Design and Implementation of High Performance Application Specific Memory - 고성능 Application Specific Memory 의설계와구현 - M.S. Thesis Sungdae Choi Dec. 20th, 2002 Outline Introduction Memory for Mobile 3D Graphics
More informationMemory in Digital Systems
MEMORIES Memory in Digital Systems Three primary components of digital systems Datapath (does the work) Control (manager) Memory (storage) Single bit ( foround ) Clockless latches e.g., SR latch Clocked
More informationDP8420A,DP8421A,DP8422A
DP8420A,DP8421A,DP8422A DP8420A DP8421A DP8422A microcmos Programmable 256k/1M/4M Dynamic RAM Controller/Drivers Literature Number: SNOSBX7A DP8420A 21A 22A microcmos Programmable 256k 1M 4M Dynamic RAM
More informationBasic Processing Unit: Some Fundamental Concepts, Execution of a. Complete Instruction, Multiple Bus Organization, Hard-wired Control,
UNIT - 7 Basic Processing Unit: Some Fundamental Concepts, Execution of a Complete Instruction, Multiple Bus Organization, Hard-wired Control, Microprogrammed Control Page 178 UNIT - 7 BASIC PROCESSING
More informationThe Memory Hierarchy Part I
Chapter 6 The Memory Hierarchy Part I The slides of Part I are taken in large part from V. Heuring & H. Jordan, Computer Systems esign and Architecture 1997. 1 Outline: Memory components: RAM memory cells
More informationCS152 Computer Architecture and Engineering Lecture 16: Memory System
CS152 Computer Architecture and Engineering Lecture 16: System March 15, 1995 Dave Patterson (patterson@cs) and Shing Kong (shing.kong@eng.sun.com) Slides available on http://http.cs.berkeley.edu/~patterson
More informationREDUCING POWER IN SUPERSCALAR PROCESSOR CACHES USING SUBBANKING, MULTIPLE LINE BUFFERS AND BIT LINE SEGMENTATION *
REDUCING POWER IN SUPERSCALAR PROCESSOR CACHES USING SUBBANKING, MULTIPLE LINE BUFFERS AND BIT LINE SEGMENTATION * Kanad Ghose and Milind B. Kamble ** Department of Computer Science State University of
More informationA Configurable Radiation Tolerant Dual-Ported Static RAM macro, designed in a 0.25 µm CMOS technology for applications in the LHC environment.
A Configurable Radiation Tolerant Dual-Ported Static RAM macro, designed in a 0.25 µm CMOS technology for applications in the LHC environment. 8th Workshop on Electronics for LHC Experiments 9-13 Sept.
More information8085 Microprocessor Architecture and Memory Interfacing. Microprocessor and Microcontroller Interfacing
8085 Microprocessor Architecture and Memory 1 Points to be Discussed 8085 Microprocessor 8085 Microprocessor (CPU) Block Diagram Control & Status Signals Interrupt Signals 8085 Microprocessor Signal Flow
More informationSTUDY OF SRAM AND ITS LOW POWER TECHNIQUES
INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET) International Journal of Electronics and Communication Engineering & Technology (IJECET), ISSN ISSN 0976 6464(Print)
More informationEND-TERM EXAMINATION
(Please Write your Exam Roll No. immediately) END-TERM EXAMINATION DECEMBER 2006 Exam. Roll No... Exam Series code: 100919DEC06200963 Paper Code: MCA-103 Subject: Digital Electronics Time: 3 Hours Maximum
More informationSummer 2003 Lecture 21 07/15/03
Summer 2003 Lecture 21 07/15/03 Simple I/O Devices Simple i/o hardware generally refers to simple input or output ports. These devices generally accept external logic signals as input and allow the CPU
More informationMULTIPROCESSOR system has been used to improve
arallel Vector rocessing Using Multi Level Orbital DATA Nagi Mekhiel Abstract Many applications use vector operations by applying single instruction to multiple data that map to different locations in
More informationArchitecture of An AHB Compliant SDRAM Memory Controller
Architecture of An AHB Compliant SDRAM Memory Controller S. Lakshma Reddy Metch student, Department of Electronics and Communication Engineering CVSR College of Engineering, Hyderabad, Andhra Pradesh,
More information1 MALP ( ) Unit-1. (1) Draw and explain the internal architecture of 8085.
(1) Draw and explain the internal architecture of 8085. The architecture of 8085 Microprocessor is shown in figure given below. The internal architecture of 8085 includes following section ALU-Arithmetic
More informationMemory Hierarchy and Caches
Memory Hierarchy and Caches COE 301 / ICS 233 Computer Organization Dr. Muhamed Mudawar College of Computer Sciences and Engineering King Fahd University of Petroleum and Minerals Presentation Outline
More information