ELE 455/555 Computer System Engineering. Section 1 Review and Foundations Class 3 Technology

Similar documents
COMP3221: Microprocessors and. and Embedded Systems. Overview. Lecture 23: Memory Systems (I)

ECSE-2610 Computer Components & Operations (COCO)

+1 (479)

Information Storage and Spintronics 10

FPGA Programming Technology

Design and Implementation of an AHB SRAM Memory Controller

Sense Amplifiers 6 T Cell. M PC is the precharge transistor whose purpose is to force the latch to operate at the unstable point.

Memory and Programmable Logic

CMPEN 411 VLSI Digital Circuits Spring Lecture 22: Memery, ROM

The Memory Hierarchy 1

ECE321 Electronics I

Very Large Scale Integration (VLSI)

! Memory Overview. ! ROM Memories. ! RAM Memory " SRAM " DRAM. ! This is done because we can build. " large, slow memories OR

Memory. Outline. ECEN454 Digital Integrated Circuit Design. Memory Arrays. SRAM Architecture DRAM. Serial Access Memories ROM

CENG4480 Lecture 09: Memory 1

Chapter 3 Semiconductor Memories. Jin-Fu Li Department of Electrical Engineering National Central University Jungli, Taiwan

Integrated Circuits & Systems

Semiconductor Memory Classification. Today. ESE 570: Digital Integrated Circuits and VLSI Fundamentals. CPU Memory Hierarchy.

Lecture 20: Package, Power, and I/O

Embedded Systems Design: A Unified Hardware/Software Introduction. Outline. Chapter 5 Memory. Introduction. Memory: basic concepts

Embedded Systems Design: A Unified Hardware/Software Introduction. Chapter 5 Memory. Outline. Introduction

Embedded System Application

MEMORIES. Memories. EEC 116, B. Baas 3

CS 152 Computer Architecture and Engineering

Organization. 5.1 Semiconductor Main Memory. William Stallings Computer Organization and Architecture 6th Edition

William Stallings Computer Organization and Architecture 6th Edition. Chapter 5 Internal Memory

Read and Write Cycles

COMPUTER ARCHITECTURE

NAND Flash Memory. Jinkyu Jeong Computer Systems Laboratory Sungkyunkwan University

CMOS Logic Circuit Design Link( リンク ): センター教官講義ノートの下 CMOS 論理回路設計

Magnetic core memory (1951) cm 2 ( bit)

ECE 152 Introduction to Computer Architecture

ELCT 912: Advanced Embedded Systems

Introduction to ICs and Transistor Fundamentals

CENG 4480 L09 Memory 2

Concept of Memory. The memory of computer is broadly categories into two categories:

CENG3420 Lecture 08: Memory Organization

William Stallings Computer Organization and Architecture 8th Edition. Chapter 5 Internal Memory

Basic Organization Memory Cell Operation. CSCI 4717 Computer Architecture. ROM Uses. Random Access Memory. Semiconductor Memory Types

CS 261 Fall Mike Lam, Professor. Memory

ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems

Semiconductor Memory Classification

Memory Design I. Array-Structured Memory Architecture. Professor Chris H. Kim. Dept. of ECE.

Chapter 2 On-Chip Protection Solution for Radio Frequency Integrated Circuits in Standard CMOS Process

Design and Simulation of Low Power 6TSRAM and Control its Leakage Current Using Sleepy Keeper Approach in different Topology

Chapter 5 Internal Memory

Computer Organization. 8th Edition. Chapter 5 Internal Memory

Memory Overview. Overview - Memory Types 2/17/16. Curtis Nelson Walla Walla University

CHAPTER 12 ARRAY SUBSYSTEMS [ ] MANJARI S. KULKARNI

ECE 485/585 Microprocessor System Design

Advanced 1 Transistor DRAM Cells

TABLE OF CONTENTS 1.0 PURPOSE INTRODUCTION ESD CHECKS THROUGHOUT IC DESIGN FLOW... 2

Address connections Data connections Selection connections

NAND Flash Memory: Basics, Key Scaling Challenges and Future Outlook. Pranav Kalavade Intel Corporation

Memory Pearson Education, Inc., Hoboken, NJ. All rights reserved.

CS311 Lecture 21: SRAM/DRAM/FLASH

Design and Technology Trends

(Advanced) Computer Organization & Architechture. Prof. Dr. Hasan Hüseyin BALIK (5 th Week)

Z-RAM Ultra-Dense Memory for 90nm and Below. Hot Chips David E. Fisch, Anant Singh, Greg Popov Innovative Silicon Inc.

RTL Design (2) Memory Components (RAMs & ROMs)

! Memory. " RAM Memory. " Serial Access Memories. ! Cell size accounts for most of memory array size. ! 6T SRAM Cell. " Used in most commercial chips

Introduction to CMOS VLSI Design. Semiconductor Memory Harris and Weste, Chapter October 2018

Single Channel Protector in a SOT-23 Package and a MSOP Package ADG465

Internal Memory. Computer Architecture. Outline. Memory Hierarchy. Semiconductor Memory Types. Copyright 2000 N. AYDIN. All rights reserved.

CMOS Process Flow. Layout CAD Tools

Power IC 용 ESD 보호기술. 구용서 ( Yong-Seo Koo ) Electronic Engineering Dankook University, Korea

memories The world of Memories DEEP SUBMICRON CMOS DESIGN 10. Memories

Gigascale Integration Design Challenges & Opportunities. Shekhar Borkar Circuit Research, Intel Labs October 24, 2004

Introduction to SRAM. Jasur Hanbaba

Integrated circuits and fabrication

Large and Fast: Exploiting Memory Hierarchy

Computer Organization and Assembly Language (CS-506)

Reduce Your System Power Consumption with Altera FPGAs Altera Corporation Public

VLSI Test Technology and Reliability (ET4076)

Introduction to Semiconductor Memory Dr. Lynn Fuller Webpage:

Memory Challenges. Issues & challenges in memory design: Cost Performance Power Scalability

Lecture 13: SRAM. Slides courtesy of Deming Chen. Slides based on the initial set from David Harris. 4th Ed.

ΔΙΑΛΕΞΗ 5: FPGA Programming Technologies (aka: how to connect/disconnect wires/gates)

Unleashing the Power of Embedded DRAM

Spiral 2-9. Tri-State Gates Memories DMA

FPGA Power Management and Modeling Techniques

Memory Design I. Semiconductor Memory Classification. Read-Write Memories (RWM) Memory Scaling Trend. Memory Scaling Trend

ECEN 449 Microprocessor System Design. Memories

CS152 Computer Architecture and Engineering Lecture 16: Memory System

10. Interconnects in CMOS Technology

Spiral 2-8. Cell Layout

CS 320 February 2, 2018 Ch 5 Memory

Digital Systems. Semiconductor memories. Departamentul de Bazele Electronicii

a) Memory management unit b) CPU c) PCI d) None of the mentioned

Memory classification:- Topics covered:- types,organization and working

Intel s s Memory Strategy for the Wireless Phone

Computer Organization

Unit 7: Memory. Dynamic shift register: Circuit diagram: Refer to unit 4(ch 6.5.4)

Moore s s Law, 40 years and Counting

Digital Electronics. CHAPTER THIRTY TWO. Semiconductor Read-Only Memories

Introduction 1. GENERAL TRENDS. 1. The technology scale down DEEP SUBMICRON CMOS DESIGN

ENEE 759H, Spring 2005 Memory Systems: Architecture and

Summer 2003 Lecture 18 07/09/03

EECS150 - Digital Design Lecture 16 Memory 1

CMPE 415 Programmable Logic Devices FPGA Technology I

Transcription:

ELE 455/555 Computer System Engineering Section 1 Review and Foundations Class 3

MOSFETs MOSFET Terminology Metal Oxide Semiconductor Field Effect Transistor 4 terminal device Source, Gate, Drain, Body Threshold Voltage (Vth or Vt) The voltage from gate to source (Vgs) required to turn on the device Ron / Roff Device impedance in the on or off state Leakage Current Parasitic current from junctions to substrate or gate to junctions 2 tj

MOSFETs MOSFET Terminology N-type (N-channel) Forms a conducting n-channel from source to drain Requires a positive Vgs > Vth to form the channel and turn on the device D G B S 3 tj

MOSFETs MOSFET Terminology P-type (P-channel) Forms a conducting p-channel from source to drain Requires a negative Vgs > Vth to form the channel and turn on the device S G B D 4 tj

MOSFETs MOSFET Terminology CMOS Complementary MOS Contains both P-MOS and N-MOS devices Almost all digital circuits today are built using CMOS technology N-MOS P-MOS 5 tj

CMOS Trends CMOS Process Traditionally referenced to the gate length or metal 1/2 pitch 0.25 micron process, 130nm process Over the last 5 years the relationship between reference name and physical parameters has become tenuous Current generation processes are 22nm shifting to 14nm 22nm = 22x10-9 meters = 22x10-6 millimeters = 22 millionths of a mm 1 silicon atom ~ 5 angstrom spacing = 5x10-10 m 22nm = 44 atoms Gate oxide thickness is 2nm 4 rows of atoms 6 Src: TMS - Vol 57, No. 9 tj

CMOS Trends Processor Performance Faster Transistors Larger Die Pipelining Superscaler OoO Execution SISD -> MIMD Memory Hierarchy Performance improvement of 24,000 X Frequency Improvement of 660 X How? Moore s Law 200,000 X Aug 20, ELE 2013 455/555 Spring 2016 7 tj Source: Computer Architecture, Hennessy and Patterson, 2012 Elsivier Inc

CMOS Trends Transistors per chip src: http://www.saphana.com 8 tj

CMOS Trends SPECint Performance single CPU http://www.saphana.com Gap to Moores law src: http://www.saphana.com 9 tj

CMOS Trends SPECint Performance Including multi-core src: http://www.saphana.com 10 tj

CMOS Trends Half-Pitch Trends 11 tj

CMOS Logic Simplified CMOS Digital Design Use simplified models for the MOSFETs switch either on or off N-MOS devices on when a logic high is applied to the gate off when a logic low is applied to the gate P-MOS devices on when a logic low is applied to the gate off when a logic high is applied to the gate 12 tj

CMOS Logic Simplified CMOS Digital Design N-MOS devices on when a logic high is applied to the gate off when a logic low is applied to the gate D High or 1 G S Vss or GND Body Connection is assumed Vss or GND Low or 0 13 tj

CMOS Logic Simplified CMOS Digital Design P-MOS devices on when a logic low is applied to the gate off when a logic high is applied to the gate S Vdd Body Connection is assumed Vdd Low or 0 G D High or 1 14 tj

CMOS Logic Simplified CMOS Digital Design CMOS Inverter logic low 0 is Gnd logic high 1 is Vdd Vdd Vdd Vdd In Out P-MOS on In low 0 N-MOS off Out high 1 P-MOS off In high 1 N-MOS on Out low 0 Gnd Gnd Gnd 15 tj

CMOS Logic Simplified CMOS Digital Design CMOS Nand Gate Vdd Vdd A B 0 0 A B 0 1 Vdd Vdd Vdd Vdd Vdd A B 1 0 Vdd A B Out 0 0 1 0 1 1 1 0 1 1 1 0 A B 1 1 Vdd Vdd A B Out A B B Out 1 A B B Out 1 A B B Out 1 A B B Out 0 A A A A Gnd Gnd Gnd Gnd Gnd 16 tj

CMOS Logic Gate Level Performance Vdd DC Characteristics I DS = V GS V t V DS V 2 DS 2 - linear region I DS = 2 V GS V 2 t - saturated region = W t ox L n 2.8 p In Out Vout Gnd VDD Idsn Idsp 0 VDD Vout 0 Vin 0 Vtn VDD VDD VDD 2 - Vtp 17 tj

CMOS Logic Gate Level Performance Vdd Transient Characteristics Equalize t r and t f by making Wp 2.8 Wn In Out t r t f K n C L n V DD K p t d t r /2 t f /2 C L p V DD, Kn~Kp~3.5 VDD Vout Gnd Optimize delay reduce C L increase V DD increase reduce t ox why might this be the best approach? 0 0 Vtn VDD VDD Vin VDD 2 - Vtp 18 tj

CMOS Logic Gate Level Power 3 primary components of gate level power Vdd Static Power (leakage) Dynamic Power (CV 2 F) Short Circuit Power (shoot-through) In Out Gnd 19 tj

CMOS Logic Gate Level Power Static Power Leakage currents through the reverse biased diode junctions always present Sub-threshold current current from S-D when the input voltage is below Vt due to voltage drops Gate leakage current current from the gate to S/D/Body due to oxide defects or quantum tunneling In Vdd Out Design Considerations Multiple Vt devices Thick oxide devices Reduce supply voltages why? Gnd 20 tj

CMOS Logic Gate level Power Dynamic Power Vdd Power associated with slewing the load capacitance E = Pdt = IVdt = C dv dt Vdt = CVdv = CV2 energy per transition = C L V dd2 /2 2 In Out Power is energy / time P dynamic = CV 2 f Gnd Design considerations Run circuits at the lowest possible speed Reduce supply voltages Minimize capacitance 21 tj

CMOS Logic Gate Level Power Vdd Short Circuit Power Both devices are on Symmetrical gate In Out P sc 12 V DD 2V t 3 ft rf Design Considerations large relative loads lower Psc small relative loads higher Psc Match t r and t f through the chain Vout/Isc VDD Gnd Reduce supply voltages 0 0 Vtn VDD VDD Vin VDD 2 - Vtp 22 tj

CMOS Integrated Circuits Chip Level Considerations Millions to hundreds of millions of gates Physical Space Sheer numbers of gates Keeping shared resources close to multiple users (memory) I/O pin access and placement Interconnect getting all the wires connected Typical processes have 6 10 layers of interconnect Cell, local, global, power Performance Power / Heat Dissipation 23 tj

CMOS Integrated Circuits Chip Level Considerations Performance Drivers Process Transistor performance Short channel vs. long channel devices High Vt and Low Vt devices Clock Frequency Maximum is set by the longest unit delay Very complex timing tools used to ensure max frequency Interconnect RC delays Capacitive coupling 24 tj

CMOS Integrated Circuits Chip Level Considerations Power Drivers Process - Dynamic, Static, Short circuit ( D/S/SC ) Number of gates D/S/SC Clock Frequency D/SC Dynamic power becomes CV 2 f, where f is clock frequency Short circuit power is also multiplied by f Supply Voltage D/S/SC Routing Efficiency D/SC Minimizing capacitance is critical 25 tj

CMOS Integrated Circuits Chip Level Considerations Power / Performance Balance Device Level Solutions Multiple Vth devices Low Vth devices for high performance paths High Vth devices for low performance paths Chip Level Solutions Reduced interconnect R and C Power islands gating the power to circuits not in use Clock Gating turn off the clocks to circuits not in use 26 tj

CMOS Integrated Circuits Chip Level Considerations Power / Performance Balance System Level Solutions Dynamic Voltage Scaling changing VDD as needed Dynamic Frequency Scaling changing the clock frequency as needed Together these are referred to as DVFS Architectural Solutions Pipelining Multi-core processors Homogeneous dual/quad core Heterogeneous big/little Memory Hierarchy 27 tj

Active Power CMOS Integrated Circuits Power Performance Tradeoffs Standby Peak Performance Thermal Concerns Aug 20, ELE 2013 455/555 Spring 2016 DMIPs lp process/devices common processor hp process/devices super processor 28 tj

Active Power CMOS Integrated Circuits Power Performance Tradeoffs Standby Peak Performance Thermal Concerns Aug 20, ELE 2013 455/555 Spring 2016 DMIPs lp process/devices common processor Processor Architecture System Architecture Design Operation Policy hp process/devices super processor 29 tj

Memory Memory Taxonomy Memory Random Access Sequential Access Read/Write Read Only * Tape Optical Dynamic Static Mask ROM EPROM CCD DRAM Power Required No Power Required PROM EEPROM SRAM Flash MRAM Flash * Write once or seldom FERAM 30 tj

Memory SRAM Static Random Access Memory Memory cell (1 bit) is based on a feedback circuit Bit value is retained as long as power is maintained Fastest read/write (R/W) Highest power Lowest density Used in caches and small data memories weak inverters Bit line Bit line Word line 31 tj

Memory SRAM Static Random Access Memory Write All Word lines low Read Enable (RE) disabled (low) Place B0, B1, B2, B3 on inputs Pull write enable bar (WE) low Strobe the desired word line high Bit lines override the bit cell inverters and store the new value in the cell 32 tj

Memory SRAM Static Random Access Memory Read All Word lines low Write enable bar (WE) high inverters tristated Read Enable (RE) high Strobe the desired word line high Bit cell inverters drive the bit lines and sense amplifiers read the value 33 tj

Memory SDRAM Synchronous Dynamic Random Access Memory Memory cell (1 bit) is based on capacitor charge storage Bit value decays over time must be recharged called a refresh cycle Standard SDRAM transfers 1 word each clock cycle DDR double data rate transfers 2 words each clock cycle DDR2, DDR3, DDR4 transfer 4,8,16 words each array access Medium speed Highest density Used as main memory Bit line Vdd/2 Word line 34 tj

Memory SDRAM Synchronous Dynamic Random Access Memory Write All Word lines low Read Enable (RE) disabled (low) Place B0, B1, B2, B3 on inputs Pull write enable bar (WE) low Strobe the desired word line high Bit lines write to the bit cell capacitors 35 tj

Memory SDRAM Synchronous Dynamic Random Access Memory Read All Word lines low Write enable bar (WE) high inverters tristated Read Enable (RE) high Strobe the desired word line high Sense amplifiers read the value of the capacitors 36 tj

Memory ROM Read Only Memory Memory cell (1 bit) is based on whether a MOSFET is or is not connected between the bit line and word line The MOSFET structure is always part of the bit cell It can be connected to the bit line through a contact or via (via ROM) It can be disable by removing the S/D diffusion (diffusion ROM) Cannot be modified once the part is built Typically used as Boot memory or to hold chip configuration data Weak Pull up Contact or Via Word line Bit line 37 tj

Memory ROM Read Only Memory Read All Word lines low Read Enable (RE) high Strobe the desired word line high Sense amplifiers read the value of the bit lines If connected will read a 0 If not connected will read a 1 38 tj

Memory Flash Memory Memory cell (1 bit) is based on charge stored on a floating capacitor The capacitor modifies the threshold voltage of a MOSFET with negative charge stored need higher gate voltage to turn on the MOSFET Creates 2 possible threshold voltages Vth High is required to turn on the MOSFET if charge is stored Vth Low is required to turn on the MOSFET if no charge is stored Thick Oxide p Bulk Silicon S G polysilicon n+ n+ floating gate gate oxide D tunnel oxide G 39 tj D S B

Memory Flash Memory Cell write High voltage process that allows electrons to be injected into (erase) or tunnel out of (write) the floating gate Once the high voltage is removed the electrons are trapped Cell read Place a voltage on the gate midway between Vth High and Vth Low Use the circuit to determine if the MOSFET is on or off Pull Up Vout If charge stored on capacitor (erased) Vthmid < (Vth = VthHigh ) Vout = high 0 Vth mid If no charge stored on capacitor (programmed) Vthmid > (Vth = VthLow ) Vout = low 1 40 tj

Memory Flash Memory NOR Flash Block Erase Less dense than Nand Flash (extra wires and connections) Slower sequential access than Nand Flash Fast (true) random access Used as program memory 41 tj

Memory Flash Memory NAND Flash Block Erase More dense than Nor Flash Faster sequential access than Nor Flash Random access requires additional parts and time Used as file storage memory (Flash Drives) 42 tj

Memory Flash Memory Shadowing Store large amounts of program and data in Nand Flash At boot, copy a portion of the Nand memory into SRAM or SDRAM Use the SRAM as the processor program and data memory As additional program or data are needed swap out a portion of the SRAM/SDRAM 43 tj

Memory Flash memory XIP Execute in Place Execute directly out of NOR flash Nor Flash densities are growing rapidly Nor Flash speeds are fast enough to support the memory hierarchy Requires a caching system 44 tj

Memory Access Many variations, but all basically the same Two Buses Address Bus Address width (# of bits) is log 2 m where m is the number of memory locations Uni-directional only written by processor, read by memory Data Bus Data width (# of bits) is set by the memory/system interface Bi-directional both memory and processor can R/W this bus 3 Control Signals Chip Enable Used to choose one of several possible memory devices Read Asserted by the procesor to signal a read operation Write Asserted by the processor to signal a write operation 45 tj

Memory Access Simplified Process Read Processor selects the appropriate CE Processor places the desired read address on the address bus Processor asserts the read signal Memory fetches the data and places it on the data bus Processor reads the data bus Write Processor selects the appropriate CE Processor places the desired write address on the address bus Processor places the data on the data bus Processor asserts the write signal Memory reads the data on the data bus and stores it 46 tj

Packaging Ball Grid Array 47 tj

Packaging Ball Grid Array Bonded vs. Flip Chip 48 tj

Packaging Stacked Package 49 tj

Packaging PoP Package 50 tj

Packaging Silicon Through Via Package 51 tj