Memory Hierarchy Technology. The Big Picture: Where are We Now? The Five Classic Components of a Computer

Similar documents
ECE468 Computer Organization and Architecture. Memory Hierarchy

CSC Memory System. A. A Hierarchy and Driving Forces

ECE7995 (4) Basics of Memory Hierarchy. [Adapted from Mary Jane Irwin s slides (PSU)]

CPS101 Computer Organization and Programming Lecture 13: The Memory System. Outline of Today s Lecture. The Big Picture: Where are We Now?

CS152 Computer Architecture and Engineering Lecture 16: Memory System

CpE 442. Memory System

14:332:331. Week 13 Basics of Cache

ECE232: Hardware Organization and Design

Recap: Machine Organization

EE 4683/5683: COMPUTER ARCHITECTURE

Course Administration

Memory Hierarchy: Motivation

Memory Hierarchy: The motivation

CENG3420 Lecture 08: Memory Organization

Review: Performance Latency vs. Throughput. Time (seconds/program) is performance measure Instructions Clock cycles Seconds.

CS 61C: Great Ideas in Computer Architecture. The Memory Hierarchy, Fully Associative Caches

CS 61C: Great Ideas in Computer Architecture. Direct Mapped Caches

Modern Computer Architecture

Memory Hierarchy. Maurizio Palesi. Maurizio Palesi 1

EEC 483 Computer Organization

The Memory Hierarchy & Cache

14:332:331. Week 13 Basics of Cache

CENG4480 Lecture 09: Memory 1

COSC 6385 Computer Architecture. - Memory Hierarchies (I)

Speicherarchitektur. Who Cares About the Memory Hierarchy? Technologie-Trends. Speicher-Hierarchie. Referenz-Lokalität. Caches

CENG 3420 Computer Organization and Design. Lecture 08: Memory - I. Bei Yu

Who Cares About the Memory Hierarchy? Time CS 152 Lec16.6. Performance. Recap

,e-pg PATHSHALA- Computer Science Computer Architecture Module 25 Memory Hierarchy Design - Basics

COEN-4730 Computer Architecture Lecture 3 Review of Caches and Virtual Memory

Pipelining, Instruction Level Parallelism and Memory in Processors. Advanced Topics ICOM 4215 Computer Architecture and Organization Fall 2010

UNIVERSITY OF MASSACHUSETTS Dept. of Electrical & Computer Engineering. Computer Architecture ECE 568/668

Memory Hierarchy. Maurizio Palesi. Maurizio Palesi 1

The Memory Hierarchy & Cache The impact of real memory on CPU Performance. Main memory basic properties: Memory Types: DRAM vs.

EEC 170 Computer Architecture Fall Cache Introduction Review. Review: The Memory Hierarchy. The Memory Hierarchy: Why Does it Work?

CSE 431 Computer Architecture Fall Chapter 5A: Exploiting the Memory Hierarchy, Part 1

CSF Cache Introduction. [Adapted from Computer Organization and Design, Patterson & Hennessy, 2005]

LECTURE 10: Improving Memory Access: Direct and Spatial caches

Let!s go back to a course goal... Let!s go back to a course goal... Question? Lecture 22 Introduction to Memory Hierarchies

Lecture 17 Introduction to Memory Hierarchies" Why it s important " Fundamental lesson(s)" Suggested reading:" (HP Chapter

Question?! Processor comparison!

Page 1. Review: Address Segmentation " Review: Address Segmentation " Review: Address Segmentation "

Caches and Memory Hierarchy: Review. UCSB CS240A, Winter 2016

Chapter 5. Large and Fast: Exploiting Memory Hierarchy

Memory Hierarchy, Fully Associative Caches. Instructor: Nick Riasanovsky

EECS151/251A Spring 2018 Digital Design and Integrated Circuits. Instructors: John Wawrzynek and Nick Weaver. Lecture 19: Caches EE141

COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface

Memory hierarchy Outline

Caches and Memory Hierarchy: Review. UCSB CS240A, Fall 2017

Lecture-14 (Memory Hierarchy) CS422-Spring

Memory Technologies. Technology Trends

CPE 631 Lecture 04: CPU Caches

The Memory Hierarchy & Cache Review of Memory Hierarchy & Cache Basics (from 350):

Memory Hierarchy: Caches, Virtual Memory

Memory. Lecture 22 CS301

Chapter 5. Large and Fast: Exploiting Memory Hierarchy

Chapter 5. Large and Fast: Exploiting Memory Hierarchy

Memory Hierarchy. Caching Chapter 7. Locality. Program Characteristics. What does that mean?!? Exploiting Spatial & Temporal Locality

Chapter Seven. Memories: Review. Exploiting Memory Hierarchy CACHE MEMORY AND VIRTUAL MEMORY

ELEC 5200/6200 Computer Architecture and Design Spring 2017 Lecture 6: Memory Organization Part I

Chapter 7 Large and Fast: Exploiting Memory Hierarchy. Memory Hierarchy. Locality. Memories: Review

Memory Hierarchy Y. K. Malaiya

Registers. Instruction Memory A L U. Data Memory C O N T R O L M U X A D D A D D. Sh L 2 M U X. Sign Ext M U X ALU CTL INSTRUCTION FETCH

Performance! (1/latency)! 1000! 100! 10! Capacity Access Time Cost. CPU Registers 100s Bytes <10s ns. Cache K Bytes ns 1-0.

Memory latency: Affects cache miss penalty. Measured by:

CS162 Operating Systems and Systems Programming Lecture 13. Caches and TLBs. Page 1

Memory Hierarchy. ENG3380 Computer Organization and Architecture Cache Memory Part II. Topics. References. Memory Hierarchy

Where Have We Been? Ch. 6 Memory Technology

Textbook: Burdea and Coiffet, Virtual Reality Technology, 2 nd Edition, Wiley, Textbook web site:

CS152 Computer Architecture and Engineering Lecture 17: Cache System

Welcome to Part 3: Memory Systems and I/O

CS161 Design and Architecture of Computer Systems. Cache $$$$$

CS162 Operating Systems and Systems Programming Lecture 10 Caches and TLBs"

Memory latency: Affects cache miss penalty. Measured by:

Main Memory. EECC551 - Shaaban. Memory latency: Affects cache miss penalty. Measured by:

ECE ECE4680

Chapter 5. Large and Fast: Exploiting Memory Hierarchy

MEMORY HIERARCHY DESIGN

CISC 662 Graduate Computer Architecture Lecture 16 - Cache and virtual memory review

Transistor: Digital Building Blocks

The University of Adelaide, School of Computer Science 13 September 2018

CENG 3420 Computer Organization and Design. Lecture 08: Cache Review. Bei Yu

3Introduction. Memory Hierarchy. Chapter 2. Memory Hierarchy Design. Computer Architecture A Quantitative Approach, Fifth Edition

Recall: Paging. Recall: Paging. Recall: Paging. CS162 Operating Systems and Systems Programming Lecture 13. Address Translation, Caching

LECTURE 11. Memory Hierarchy

registers data 1 registers MEMORY ADDRESS on-chip cache off-chip cache main memory: real address space part of virtual addr. sp.

Handout 4 Memory Hierarchy

EEC 170 Computer Architecture Fall Improving Cache Performance. Administrative. Review: The Memory Hierarchy. Review: Principle of Locality

ECE 152 Introduction to Computer Architecture

CSF Improving Cache Performance. [Adapted from Computer Organization and Design, Patterson & Hennessy, 2005]

CS61C : Machine Structures

Paging! 2/22! Anthony D. Joseph and Ion Stoica CS162 UCB Fall 2012! " (0xE0)" " " " (0x70)" " (0x50)"

Cache Memory and Performance

CS 61C: Great Ideas in Computer Architecture (Machine Structures) Caches Part 1

Memory Hierarchy. 2/18/2016 CS 152 Sec6on 5 Colin Schmidt

Chapter Seven. Large & Fast: Exploring Memory Hierarchy

Lecture 12. Memory Design & Caches, part 2. Christos Kozyrakis Stanford University

Donn Morrison Department of Computer Science. TDT4255 Memory hierarchies

Cycle Time for Non-pipelined & Pipelined processors

CMPT 300 Introduction to Operating Systems

Computer System Components

Transcription:

The Big Picture: Where are We Now? The Five Classic Components of a Computer Processor Control Datapath Today s Topics: technologies Technology trends Impact on performance Hierarchy The principle of locality hierarchy terminology Input Output Hierarchy Technology Random Access: Random is good: access time is the same for all locations DRAM: Dynamic Random Access - High density, low power, cheap, slow - Dynamic: need to be refreshed regularly : Static Random Access - Low density, high power, expensive, fast - Static: content will last forever (until lose power) Non-so-random Access Technology: Access time varies from location to location and from time to time Examples: Disk, CDROM Sequential Access Technology: access time linear in location (e.g.,tape) Main memory: DRAMs + Caches: s cs 152 L1 6.1 cs 152 L1 6.2

Technology Trends Capacity Speed (latency) Logic:2x in 3 years 2x in 3 years DRAM: 4x in 3 years 2x in 10 years Disk: 4x in 3 years 2x in 10 years DRAM Year Size Cycle Time 1000:1! 2:1! 1980 64 Kb 250 ns 1983 256 Kb 220 ns 1986 1 Mb 190 ns 1989 4 Mb 165 ns 1992 16 Mb 145 ns 1995 64 Mb 120 ns cs 152 L1 6.3 Performance cs 152 L1 6.4 Who Cares About the Hierarchy? 1000 Processor-DRAM Gap (latency) 100 10 1 1980 1981 1982 1983 1984 1985 1986 1987 1988 µproc CPU 60%/yr. Moore s Law (2X/1.5yr) Processor- Performance Gap: (grows 50% / year) 1989 1990 1991 1992 Time 1993 1994 1995 1996 DRAM 1997 1998 1999 2000 DRAM 9%/yr. (2X/10 yrs)

Impact on Performance Suppose a processor executes at Clock Rate = 200 MHz (5 ns per cycle) CPI = 1.1 50% arith/logic, 30% ld/st, 20% control Suppose that 10% of memory operations get 50 cycle miss penalty DataMiss (1.6) 49% Inst Miss (0.5) 16% CPI = ideal CPI + average stalls per instruction = 1.1 + ( 0.30 (memory access/ins) x 0.10 (miss/memory access) x 50 (cycle/miss) ) = 1.1 cycle + 1.5 cycle = 2. 6 58 % of the time the processor is stalled waiting for memory! a 1% instruction miss rate would add an additional 0.5 cycles to the CPI! Ideal CPI (1.1) 35% The Goal: illusion of large, fast, cheap memory Fact: Large memories are slow, fast memories are small How do we create a memory that is large, cheap and fast (most of the time)? Hierarchy Parallelism cs 152 L1 6.5 cs 152 L1 6.6

An Expanded View of the System Why hierarchy works Processor The Principle of Locality: Program access a relatively small portion of the address space at any instant of time. Datapath Control Probability of reference 0 Address Space 2^n - 1 Speed: Size: Cost: Fastest Smallest Highest Slowest Biggest Lowest cs 152 L1 6.7 cs 152 L1 6.8

Hierarchy: How Does it Work? Temporal Locality (Locality in Time): => Keep most recently accessed data items closer to the processor Spatial Locality (Locality in Space): => Move blocks consists of contiguous words to the upper levels To Processor From Processor Upper Level Blk X Lower Level Blk Y Hierarchy: Terminology Hit: data appears in some block in the upper level (example: Block X) Hit Rate: the fraction of memory access found in the upper level Hit Time: Time to access the upper level which consists of RAM access time + Time to determine hit/miss Miss: data needs to be retrieve from a block in the lower level (Block Y) Miss Rate = 1 - (Hit Rate) Miss Penalty: Time to replace a block in the upper level + Time to deliver the block the processor Hit Time << Miss Penalty Lower Level To Processor From Processor Upper Level Blk X Blk Y cs 152 L1 6.9 cs 152 L1 6.10

Hierarchy of a Modern Computer System By taking advantage of the principle of locality: Present the user with as much memory as is available in the cheapest technology. Provide access at the speed offered by the fastest technology. Datapath Processor Control Registers On-Chip Cache Second Level Cache () Main (DRAM) Secondary Storage (Disk) Tertiary Storage (Disk) How is the hierarchy managed? Registers <-> by compiler (programmer?) cache <-> memory by the hardware memory <-> disks by the hardware and operating system (virtual memory) by the programmer (files) Speed (ns): 1s 10s 100s Size (bytes): 100s Ks Ms 10,000,000s (10s ms) Gs 10,000,000,000s (10s sec) Ts cs 152 L1 6.11 cs 152 L1 6.12

Static RAM () 4 X 2 D latch 15 Address Din[1] Din[0] C D Q _ Q Chip select Output enable Write enable Din[7 0] 8 ı 32K 8 8 Dout[7 0] Write enable 0 2-to-4ı decoder 1 Select 0 Address Data 0 In Out 2 Three-state buffers Select 1 Data 1 In Out 3 Select 2 Output Data 2 In Out Dout[1] Dout[0] Select 3 Data 3 In Out cs 152 L1 6.13 cs 152 L1 6.14

32K X 8 Dynamic RAM (DRAM) DRAM cell Word line Addressı [14 6] 9-to-512ı decoder 512 Pass transistor Capacitor Bit line 4M X 1 DRAM Addressı [5 0] 64 Rowı ı decoderı 11-to-2048 2048 2048ı array Dout7 Dout6 Dout5 Dout4 Dout3 Dout2 Dout1 Dout0 Address[10 0] Column latches Dout cs 152 L1 6.15 cs 152 L1 6.16