Where Have We Been? Ch. 6 Memory Technology

Similar documents
Memory Technology March 15, 2001

Memory hierarchy Outline

Performance & Technology. Performance expressed as a time. Todd C. Mowry CS 740. Sept 12, Performance expressed as a rate(cont)

Computer Hardware 2. Type of Computers. User Point of View. The Motherboard. Inside. Content

Performance & Technology. Performance expressed as a time. Todd C. Mowry CS 740. Sept 11, Performance expressed as a rate(cont)

CS429: Computer Organization and Architecture

CS429: Computer Organization and Architecture

CS 201 The Memory Hierarchy. Gerson Robboy Portland State University

CS 33. Memory Hierarchy I. CS33 Intro to Computer Systems XVI 1 Copyright 2016 Thomas W. Doeppner. All rights reserved.

Large and Fast: Exploiting Memory Hierarchy

Storage Technologies and the Memory Hierarchy

CS 261 Fall Mike Lam, Professor. Memory

ECE7995 (4) Basics of Memory Hierarchy. [Adapted from Mary Jane Irwin s slides (PSU)]

ECE468 Computer Organization and Architecture. Memory Hierarchy

Giving credit where credit is due

Random-Access Memory (RAM) CS429: Computer Organization and Architecture. SRAM and DRAM. Flash / RAM Summary. Storage Technologies

CISC 360. The Memory Hierarchy Nov 13, 2008

CS24: INTRODUCTION TO COMPUTING SYSTEMS. Spring 2017 Lecture 13

Memory Hierarchy Technology. The Big Picture: Where are We Now? The Five Classic Components of a Computer

CS152 Computer Architecture and Engineering Lecture 16: Memory System

CpE 442. Memory System

Computer Organization: A Programmer's Perspective

Contents. Memory System Overview Cache Memory. Internal Memory. Virtual Memory. Memory Hierarchy. Registers In CPU Internal or Main memory

CENG4480 Lecture 09: Memory 1

Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted from Hennessy & Patterson / 2003 Elsevier

CENG3420 Lecture 08: Memory Organization

Random Access Memory (RAM)

Semiconductor Memory Types Microprocessor Design & Organisation HCA2102

The Memory Hierarchy Sept 29, 2006

Module 1: Basics and Background Lecture 4: Memory and Disk Accesses. The Lecture Contains: Memory organisation. Memory hierarchy. Disks.

ECE 152 Introduction to Computer Architecture

CPS101 Computer Organization and Programming Lecture 13: The Memory System. Outline of Today s Lecture. The Big Picture: Where are We Now?

Computer Systems. Memory Hierarchy. Han, Hwansoo

Foundations of Computer Systems

Random-Access Memory (RAM) CISC 360. The Memory Hierarchy Nov 24, Conventional DRAM Organization. SRAM vs DRAM Summary.

Topic 21: Memory Technology

Topic 21: Memory Technology

CMSC 611: Advanced Computer Architecture

CMSC 313 COMPUTER ORGANIZATION & ASSEMBLY LANGUAGE PROGRAMMING LECTURE 26, SPRING 2013

EEC 483 Computer Organization

CS 261 Fall Mike Lam, Professor. Memory

Memory and Disk Systems

CSE 153 Design of Operating Systems Fall 2018

CMSC 313 COMPUTER ORGANIZATION & ASSEMBLY LANGUAGE PROGRAMMING LECTURE 26, FALL 2012

UNIT 2 Data Center Environment

Key features. ! RAM is packaged as a chip.! Basic storage unit is a cell (one bit per cell).! Multiple RAM chips form a memory.

The Memory Hierarchy /18-213/15-513: Introduction to Computer Systems 11 th Lecture, October 3, Today s Instructor: Phil Gibbons

UNIVERSITY OF MASSACHUSETTS Dept. of Electrical & Computer Engineering. Computer Architecture ECE 568

Lecture 18: Memory Systems. Spring 2018 Jason Tang

The Memory Hierarchy Part I

The Memory Hierarchy 1

COMPUTER ARCHITECTURE

CSE 153 Design of Operating Systems

CS 320 February 2, 2018 Ch 5 Memory

COSC 243. Memory and Storage Systems. Lecture 10 Memory and Storage Systems. COSC 243 (Computer Architecture)

ECE 250 / CS250 Introduction to Computer Architecture

Embedded Systems Design: A Unified Hardware/Software Introduction. Outline. Chapter 5 Memory. Introduction. Memory: basic concepts

Embedded Systems Design: A Unified Hardware/Software Introduction. Chapter 5 Memory. Outline. Introduction

Storage. CS 3410 Computer System Organization & Programming

UNIVERSITY OF MASSACHUSETTS Dept. of Electrical & Computer Engineering. Computer Architecture ECE 568

Cycle Time for Non-pipelined & Pipelined processors

CS 33. Architecture and Optimization (3) CS33 Intro to Computer Systems XVI 1 Copyright 2018 Thomas W. Doeppner. All rights reserved.

ECE 341. Lecture # 16

Memory Hierarchy Y. K. Malaiya

Computer Architecture and System Software Lecture 08: Assembly Language Programming + Memory Hierarchy

CSE 451: Operating Systems Spring Module 12 Secondary Storage. Steve Gribble

The University of Adelaide, School of Computer Science 13 September 2018

Random-Access Memory (RAM) Lecture 13 The Memory Hierarchy. Conventional DRAM Organization. SRAM vs DRAM Summary. Topics. d x w DRAM: Key features

Semiconductor Memory Types. Computer & Microprocessor Architecture HCA103. Memory Cell Operation. Semiconductor Memory.

CSE 451: Operating Systems Spring Module 12 Secondary Storage

Where We Are in This Course Right Now. ECE 152 Introduction to Computer Architecture. This Unit: Main Memory. Readings

I/O CANNOT BE IGNORED

CSCI-UA.0201 Computer Systems Organization Memory Hierarchy

Overview. EE 4504 Computer Organization. Historically, the limiting factor in a computer s performance has been memory access time

The Memory Hierarchy 10/25/16

BCN1043. By Dr. Mritha Ramalingam. Faculty of Computer Systems & Software Engineering

EE414 Embedded Systems Ch 5. Memory Part 2/2

Chapter 5. Large and Fast: Exploiting Memory Hierarchy

The Memory Hierarchy / : Introduction to Computer Systems 10 th Lecture, Feb 12, 2015

Memory. Lecture 22 CS301

Adapted from instructor s supplementary material from Computer. Patterson & Hennessy, 2008, MK]

Chapter 5B. Large and Fast: Exploiting Memory Hierarchy

Computer Architecture and System Software Lecture 09: Memory Hierarchy. Instructor: Rob Bergen Applied Computer Science University of Winnipeg

Concept of Memory. The memory of computer is broadly categories into two categories:

Advanced Parallel Architecture Lesson 4 bis. Annalisa Massini /2015

5 Computer Organization

CMSC 313 Lecture 26 DigSim Assignment 3 Cache Memory Virtual Memory + Cache Memory I/O Architecture

William Stallings Computer Organization and Architecture 8th Edition. Chapter 5 Internal Memory

CS 471 Operating Systems. Yue Cheng. George Mason University Fall 2017

Lecture-14 (Memory Hierarchy) CS422-Spring

I/O CANNOT BE IGNORED

Technology in Action. Chapter Topics. Participation Question. Participation Question. Participation Question 8/8/11

Mark Redekopp, All rights reserved. EE 352 Unit 10. Memory System Overview SRAM vs. DRAM DMA & Endian-ness

CSC Memory System. A. A Hierarchy and Driving Forces

ECEN 449 Microprocessor System Design. Memories. Texas A&M University

Memory Technology. Chapter 5. Principle of Locality. Chapter 5 Large and Fast: Exploiting Memory Hierarchy 1

CPS104 Computer Organization and Programming Lecture 18: Input-Output. Outline of Today s Lecture. The Big Picture: Where are We Now?

Memories. Design of Digital Circuits 2017 Srdjan Capkun Onur Mutlu.

Computer Organization

COSC 6385 Computer Architecture - Memory Hierarchies (III)

Transcription:

Where Have We Been? Combinational and Sequential Logic Finite State Machines Computer Architecture Instruction Set Architecture Tracing Instructions at the Register Level Building a CPU Pipelining Where Are We Going? Memory Technology 2 Ch. 6 Memory Technology Topics Memory Hierarchy Basics Static RAM Dynamic RAM Magnetic s Access Time Gap Impact of Moore s Law Moore s Law Transistors / Chip doubles every 18 months Performance factors of systems built with integrated circuit technology follow exponential curve e.g., computer speed / memory capacities double every 18 months Implications Computers 10 years from now will run 100 X faster Problems that appear computationally intractable today may be straightforward tomorrow Must not limit future planning with today s technology Example Application Domains Speech recognition Breaking secret codes Digital Video 4

Computer System Processor Reg Cache Cache Memory-I/O bus bus Memory Memory I/O I/O I/O I/O I/O I/O Display Display Network Network 5 Levels in Memory Hierarchy CPU CPU regs regs cache virtual memory C 8 B A 32 B 4/8 KB C Memory Memory H E size: speed: $/Mbyte: block size: Register L1 Cache Memory 256 B 0.3-1 ns 8 B 16KB-64KB 2-3 ns $20/MB 32 B 256-1024 MB 60 ns $0.12/MB 4-8 KB 20-200 GB 8-9 ms $0.002/MB larger, slower, cheaper 6 Static RAM (SRAM) Fast 1~2 nsec access time Persistent as long as power is supplied no refresh required Expensive ~$20/MByte 4/6 transistors/bit Stable High immunity to noise and environmental disturbances Technology for caches 7

Anatomy of an SRAM Cell bit line bit line b b word line Stable Configurations 0 1 1 0 (6 transistors) Terminology: bit line: carries data word line: used for addressing 8 Example SRAM Configuration (16 x 8) W0 b7 b7 b1 b1 b0 b0 A0 A1 A2 A3 W1 Address Address decoder decoder W15 memory cells R/W Input/output lines d7 d1 d0 9 Dynamic RAM (DRAM) Slower than SRAM access time ~50 nsec (true DRAM, not SDRAM) Not persistent every row must be accessed every ~1 ms (refreshed) Cheaper than SRAM ~$0.12 / MByte 1 transistor/bit Fragile electrical noise, light, radiation Workhorse memory technology 10

Anatomy of a DRAM Cell Bit Line Access Transistor Word Line Storage Node C node C BL A DRAM cell is constructed using a capacitor and a single transistor The capacitor is the state storing element. The charge stored in the capacitor slowly drains away (in about 10-100 ms, average is 32 ms). Also, whenever the cell is read the charge on the capacitor is depleted. Refreshing is needed periodically and after each read. 11 Addressing Arrays with Bits Array Size R rows, R = 2 r C columns, C = 2 c N = R * C bits of memory Addressing Addresses are n bits, where N = 2 n Row (address) = address / C leftmost r bits of address Col (address) = address % C rightmost bits of address Example: R = 2 C = 4 N = 8 Addr = 6 12 address = row 0 1 2 3 0 000 001 010 011 1 100 101 110 111 col row 1 col 2 r n c Example 2-Level Decode DRAM (64Kx1) RAS 256 Rows row A15-A0 col Provide 16-bit address in two 8-bit chunks Row Row address address latch latch Column Column address address latch latch 8 \ 8 \ Row Row decoder decoder 256x256 cell cell array array 256 Columns column column R/W column column latch latch and and decoder decoder CAS 13 Dout Din

Observations About DRAMs Timing Access time (= 50ns) < cycle time (= 90ns) Need to rewrite row Must Refresh Periodically Perform complete memory cycle for each row Approximately once every 1ms Sqrt(n) cycles Handled in background by memory Inefficient Way to Get a Single Bit Effectively read entire row of Sqrt(n) bits 14 Magnetic s surface spins at 3600 15,000 RPM The surface consists of a set of concentric magnetized rings called tracks read/write head arm Each track is divided into sectors 15 The read/write head floats over the disk surface and moves back and forth on an arm from track to track. Capacity Parameter 18GB Example Number Platters 12 Surfaces / Platter 2 Number of tracks 6962 Number sectors / track 213 Bytes / sector 512 Total Bytes 18,221,948,928 16

Operation Operation Read or write complete sector Seek Position head over proper track Typically 6-9ms Rotational Latency Wait until desired sector passes under head Worst case: complete rotation 10,025 RPM 6 ms Read or Write Bits Transfer rate depends on # bits per track and rotational speed E.g., 213 * 512 bytes @10,025RPM = 18 MB/sec. Modern disks have external transfer rates of up to 100 MB/sec 17 Performance Getting First Byte Seek + Rotational latency = 7,000 19,000 µsec Getting Successive Bytes ~ 0.06 µsec each roughly 100,000 times faster than getting the first byte! Optimizing Performance: Large block transfers are more efficient Try to do other things while waiting for first byte switch context to other computing task processor is interrupted when transfer completes 18 1. Processor Signals Controller Read sector X and store starting at memory address Y 2. Read Occurs Direct Memory Access (DMA) transfer Under control of I/O 3. I/O Controller Signals Completion Interrupts processor Can resume suspended process / System Interface (2) DMA Transfer Processor Reg Cache Cache Memory-I/O bus bus Memory Memory (1) Initiate Sector Read (3) Read Done I/O I/O 19

Magnetic Technology Seagate ST-12550N Barracuda 2 Linear density 52,187. bits per inch (BPI) Bit spacing 0.5 µm Track density 3,047. tracks per inch (TPI) Track spacing 8.3 µm Total tracks 2,707. tracks Rotational Speed 7200. RPM Avg Linear Speed 86.4 kilometers / hour Head Floating Height 0.13 microns Analogy: put the Sears Tower on its side fly it around the world, 2.5cm above the ground each complete orbit of the earth takes 8 seconds 20 CD Read Only Memory (CDROM) Basis Optical recording technology developed for audio CDs 74/80 minutes playing time 44,100 samples / second 2 X 16-bits / sample (Stereo) Raw bit rate = 172 KB / second Add extra 288 bytes of error correction for every 2048 bytes of data Cannot tolerate any errors in digital data, whereas OK for audio Bit Rate 172 * 2048 / (288 + 2048) = 150 KB / second For 1X CDROM N X CDROM gives bit rate of N * 150 E.g., 12X CDROM gives 1.76 MB / second Capacity 74 Minutes * 150 KB / second * 60 seconds / minute = 650 MB 21 Storage Trends SRAM metric 1980 1985 1990 1995 2000 2000:1980 $/MB 19,200 2,900 320 256 100 190 access (ns) 300 150 35 15 2 150 metric 1980 1985 1990 1995 2000 2000:1980 DRAM $/MB 8,000 880 100 30 1.5 5,300 access (ns) 375 200 100 70 60 6 typical size(mb) 0.064 0.256 4 16 64 1,000 metric 1980 1985 1990 1995 2000 2000:1980 $/MB 500 100 8 0.30 0.05 10,000 access (ms) 87 75 28 10 8 11 typical size(mb) 1 10 160 1,000 9,000 9,000 (Culled from back issues of Byte and PC Magazine) 22

Storage Price: $/MByte 1.E+05 1.E+04 1.E+03 1.E+02 1.E+01 SRAM DRAM 1.E+00 1.E-01 1.E-02 1980 1985 1990 1995 2000 23 Storage Access Times (nsec) 1.E+08 1.E+07 1.E+06 1.E+05 1.E+04 SRAM DRAM 1.E+03 1.E+02 1.E+01 1.E+00 1980 1985 1990 1995 2000 24 Processor clock rates Processors metric 1980 1985 1990 1995 2000 2000:1980 typical clock(mhz) 1 6 20 150 750 750 processor 8080 286 386 Pentium P-III culled from back issues of Byte and PC Magazine 25

The CPU vs. DRAM Latency Gap (ns) 1.E+03 1.E+02 SRAM DRAM CPU cycle 1.E+01 1.E+00 1980 1985 1990 1995 2000 26 Memory Technology Summary Cost and Density Improving at Enormous Rates Speed Lagging Processor Performance Memory Hierarchies Help Narrow the Gap: Small fast SRAMS (cache) at upper levels Large slow DRAMS (main memory) at lower levels Incredibly large & slow disks to back it all up Locality of Reference Makes It All Work Keep most frequently accessed data in fastest memory 27