Memory Hierarchy. 2/18/2016 CS 152 Sec6on 5 Colin Schmidt

Size: px

Start display at page:

Download "Memory Hierarchy. 2/18/2016 CS 152 Sec6on 5 Colin Schmidt"

Elmer Wheeler
5 years ago
Views:

1 Memory Hierarchy 2/18/2016 CS 152 Sec6on 5 Colin Schmidt

2 Agenda Review Memory Hierarchy Lab 2 Ques6ons Return Quiz 1

3 Latencies Comparison Numbers L1 Cache 0.5 ns L2 Cache 7 ns 14x L1 cache Main Memory 100 ns 20x L2, 200x L1 Read 4K randomly from SSD Read 1MB sequen6ally from memory 150,000 ns 250,000 ns Read 1MB sequen6ally from SSD 1,000,000 ns 4x Memory Disk Seek 10,000,000 ns Read 1 MB sequen6ally from disk 20,000,000 ns 80x memory, 20x SSD assuming ~1GB/sec SSD hhps://gist.github.com/jboner/

4 DRAM DRAM is complex why put up with it? CHEAP! and very fast compared to disk Many steps in modern dram but at a high level just 3 RAS, CAS, Precharge Why do we need precharge Capacitor is very small only has a lihle bit of charge Sense amps turn this small change into 0 or 1 Most architects just care about avoiding using the DRAM and using the BW effec6vely when needed

5 1000 µproc 60%/year Performance CPU Time Processor-Memory Performance Gap: (growing 50%/yr) DRAM DRAM 7%/year

6 Memory Hierarchy CPU A Small, Fast Memory (RF, SRAM) B Big, Slow Memory (DRAM) holds frequently used data capacity: Register << SRAM << DRAM latency: Register << SRAM << DRAM bandwidth: on-chip >> off-chip On a data access: if data fast memory low latency access (SRAM) if data fast memory high latency access (DRAM) 2/8/2016 CS152, Spring

7 SRAM Cell

8 Caches Who manages? Hardware How do we know what to cache? Locality? Temporal, Spa6al Basic parts of a cache? Tags, lines How does a Hit work vs a Miss? Why do we miss? Compulsory, Capacity, Conflict

9 Big Modern Chip

10 Direct-Mapped Cache Tag Index Block Offset t V Tag k Data Block b 2 k lines = t HIT Data Word or Byte 2/8/2016 CS152, Spring 2016

11 2-Way Set-Associa9ve Cache Tag Index Block Offset b t V Tag k Data Block V Tag Data Block t = = Data Word or Byte HIT 2/8/2016 CS152, Spring 2016

12 V Fully Associa9ve Cache Tag Data Block t = Tag t = HIT Block Offset b = Data Word or Byte 2/8/2016 CS152, Spring 2016

13 Replacement Policies Random LRU Pseudo-LRU FIFO Can explore in lab 2

14 AMAT Average memory access 6me (AMAT) = Hit 6me + Miss rate x Miss penalty Average memory access 6me (AMAT) = Hit 6me + Miss rate 1 x Miss penalty 1 + Miss rate 2 x Miss penalty 2 2/10/2016 CS152, Spring

15 CPU-Cache Interac9on Caches instead of memory blocks (5-stage pipeline) 0x4 Add bubble PC addr inst hit? PCen Primary Instruc6on Cache IR D To Memory Control Decode, Register Fetch E A B MD1 ALU M Y MD2 we addr Primary Data rdata Cache hit? wdata R Stall en6re CPU on data cache miss Cache Refill Data from Outer Levels of Memory Hierarchy 2/10/2016 CS152, Spring

16 Write Policy Choices Cache hit: write through: write both cache & memory Generally higher traffic but simpler pipeline & cache design write back: write cache only, memory is wrihen only when the entry is evicted A dirty bit per line further reduces write-back traffic Must handle 0, 1, or 2 accesses to memory for each load/ store Cache miss: no write allocate: only write to main memory write allocate (aka fetch on write): fetch into cache Common combina6ons: write through and no write allocate write back with write allocate 2/10/2016 CS152, Spring

17 Pipelining Cache Writes Address and Store Data From CPU Tag Index Store Data Tags Delayed Write Addr. =? Load/Store S L Delayed Write Data Data =? 1 0 Load Data to CPU Hit? Data from a store hit wrieen into data porfon of cache during tag access of subsequent store 2/10/2016 CS152, Spring

18 Inclusion Policy Inclusive mul6level cache: Inner cache can only hold lines also present in outer cache External coherence snoop access need only check outer cache Exclusive mul6level caches: Inner cache may hold lines not in outer cache Swap lines between inner/outer caches on miss Used in AMD Athlon with 64KB primary and 256KB secondary cache Why choose one type or the other? 2/10/2016 CS152, Spring

19 Prefetching Speculate on future instruc6on and data accesses and fetch them into cache(s) Instruc6on accesses easier to predict than data accesses Varie6es of prefetching Hardware prefetching Sosware prefetching Mixed schemes What types of misses does prefetching affect? Explore in Lab 2 2/10/2016 CS152, Spring

20 Ques6ons

21 Lab 2 Has anyone looked at it? Ques6ons? Lab 1 Report graded by next discussion

22 Graded Quiz

CS 152 Computer Architecture and Engineering. Lecture 7 - Memory Hierarchy-II

CS 152 Computer Architecture and Engineering Lecture 7 - Memory Hierarchy-II Krste Asanovic Electrical Engineering and Computer Sciences University of California at Berkeley http://www.eecs.berkeley.edu/~krste!