CSE 2021: Computer Organization

Similar documents
CSE 2021: Computer Organization

Chapter 5. Memory Technology

Memory Technology. Caches 1. Static RAM (SRAM) Dynamic RAM (DRAM) Magnetic disk. Ideal memory. 0.5ns 2.5ns, $2000 $5000 per GB

Computer Systems Laboratory Sungkyunkwan University

V. Primary & Secondary Memory!

5. Memory Hierarchy Computer Architecture COMP SCI 2GA3 / SFWR ENG 2GA3. Emil Sekerinski, McMaster University, Fall Term 2015/16

Caches. Jin-Soo Kim Computer Systems Laboratory Sungkyunkwan University

ECE331: Hardware Organization and Design

The Memory Hierarchy Cache, Main Memory, and Virtual Memory

Chapter 5. Large and Fast: Exploiting Memory Hierarchy

Memory. Principle of Locality. It is impossible to have memory that is both. We create an illusion for the programmer. Employ memory hierarchy

Computer Organization and Structure. Bing-Yu Chen National Taiwan University

Chapter 5A. Large and Fast: Exploiting Memory Hierarchy

Chapter 5. Large and Fast: Exploiting Memory Hierarchy

Chapter 5. Large and Fast: Exploiting Memory Hierarchy

Chapter 5. Large and Fast: Exploiting Memory Hierarchy

ECE232: Hardware Organization and Design

Chapter 5. Large and Fast: Exploiting Memory Hierarchy

COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. 5 th. Edition. Chapter 5. Large and Fast: Exploiting Memory Hierarchy

Computer Architecture Computer Science & Engineering. Chapter 5. Memory Hierachy BK TP.HCM

Memory Technology. Chapter 5. Principle of Locality. Chapter 5 Large and Fast: Exploiting Memory Hierarchy 1

Memory Hierarchy. ENG3380 Computer Organization and Architecture Cache Memory Part II. Topics. References. Memory Hierarchy

Memory Hierarchy. Reading. Sections 5.1, 5.2, 5.3, 5.4, 5.8 (some elements), 5.9 (2) Lecture notes from MKP, H. H. Lee and S.

Chapter 5. Large and Fast: Exploiting Memory Hierarchy

COMPUTER ORGANIZATION AND DESIGN

LECTURE 4: LARGE AND FAST: EXPLOITING MEMORY HIERARCHY

CS/ECE 3330 Computer Architecture. Chapter 5 Memory

Advanced Memory Organizations

Chapter 5. Large and Fast: Exploiting Memory Hierarchy

EN1640: Design of Computing Systems Topic 06: Memory System

ECE260: Fundamentals of Computer Engineering

EN1640: Design of Computing Systems Topic 06: Memory System

Computer Architecture Review. Jo, Heeseung

Cache Optimization. Jin-Soo Kim Computer Systems Laboratory Sungkyunkwan University

The Memory Hierarchy. Cache, Main Memory, and Virtual Memory (Part 2)

Chapter 5 Large and Fast: Exploiting Memory Hierarchy (Part 1)

CSF Cache Introduction. [Adapted from Computer Organization and Design, Patterson & Hennessy, 2005]

Chapter 5. Large and Fast: Exploiting Memory Hierarchy. Jiang Jiang

COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface 5 th Edition. Chapter 5. Large and Fast: Exploiting Memory Hierarchy

EEC 170 Computer Architecture Fall Cache Introduction Review. Review: The Memory Hierarchy. The Memory Hierarchy: Why Does it Work?

COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface

14:332:331. Week 13 Basics of Cache

Donn Morrison Department of Computer Science. TDT4255 Memory hierarchies

Memory Hierarchies. Instructor: Dmitri A. Gusev. Fall Lecture 10, October 8, CS 502: Computers and Communications Technology

Memory Hierarchy Y. K. Malaiya

Module Outline. CPU Memory interaction Organization of memory modules Cache memory Mapping and replacement policies.

CS161 Design and Architecture of Computer Systems. Cache $$$$$

Lecture 11: Memory Systems -- Cache Organiza9on and Performance CSE 564 Computer Architecture Summer 2017

registers data 1 registers MEMORY ADDRESS on-chip cache off-chip cache main memory: real address space part of virtual addr. sp.

CENG 3420 Computer Organization and Design. Lecture 08: Cache Review. Bei Yu

COMPUTER ORGANIZATION AND DESIGN

Registers. Instruction Memory A L U. Data Memory C O N T R O L M U X A D D A D D. Sh L 2 M U X. Sign Ext M U X ALU CTL INSTRUCTION FETCH

ECE331: Hardware Organization and Design

CSCI 402: Computer Architectures. Fengguang Song Department of Computer & Information Science IUPUI. Recall

14:332:331. Week 13 Basics of Cache

Lecture 12: Memory hierarchy & caches

COMPUTER ORGANIZATION AND DESIGN ARM

LECTURE 11. Memory Hierarchy

Locality. Cache. Direct Mapped Cache. Direct Mapped Cache

Basic Memory Hierarchy Principles. Appendix C (Not all will be covered by the lecture; studying the textbook is recommended!)

CSE 431 Computer Architecture Fall Chapter 5A: Exploiting the Memory Hierarchy, Part 1

Caches. Hiding Memory Access Times

Chapter Seven. Memories: Review. Exploiting Memory Hierarchy CACHE MEMORY AND VIRTUAL MEMORY

Chapter 7 Large and Fast: Exploiting Memory Hierarchy. Memory Hierarchy. Locality. Memories: Review

registers data 1 registers MEMORY ADDRESS on-chip cache off-chip cache main memory: real address space part of virtual addr. sp.

Memory Hierarchy Design (Appendix B and Chapter 2)

Page 1. Memory Hierarchies (Part 2)

Memory Hierarchy. Memory Flavors Principle of Locality Program Traces Memory Hierarchies Associativity. (Study Chapter 5)

Memory. Lecture 22 CS301

Chapter 7: Large and Fast: Exploiting Memory Hierarchy

EE 4683/5683: COMPUTER ARCHITECTURE

CS 61C: Great Ideas in Computer Architecture. Direct Mapped Caches

EECS151/251A Spring 2018 Digital Design and Integrated Circuits. Instructors: John Wawrzynek and Nick Weaver. Lecture 19: Caches EE141

CSF Improving Cache Performance. [Adapted from Computer Organization and Design, Patterson & Hennessy, 2005]

Page 1. Multilevel Memories (Improving performance using a little cash )

Advanced Computer Architecture

Memory Hierarchy Motivation, Definitions, Four Questions about Memory Hierarchy

Textbook: Burdea and Coiffet, Virtual Reality Technology, 2 nd Edition, Wiley, Textbook web site:

The Memory Hierarchy & Cache Review of Memory Hierarchy & Cache Basics (from 350):

Lecture 9 Pipeline and Cache

data block 0, word 0 block 0, word 1 block 1, word 0 block 1, word 1 block 2, word 0 block 2, word 1 block 3, word 0 block 3, word 1 Word index cache

Memory Hierarchy: Caches, Virtual Memory

Course Administration

CS 61C: Great Ideas in Computer Architecture. The Memory Hierarchy, Fully Associative Caches

Why memory hierarchy? Memory hierarchy. Memory hierarchy goals. CS2410: Computer Architecture. L1 cache design. Sangyeun Cho

COEN-4730 Computer Architecture Lecture 3 Review of Caches and Virtual Memory

CS 61C: Great Ideas in Computer Architecture (Machine Structures) Caches Part 1

Chapter 6 Objectives

Caching Basics. Memory Hierarchies

Cache Memory and Performance

Memory Hierarchy: Motivation

Memory Hierarchy: The motivation

3Introduction. Memory Hierarchy. Chapter 2. Memory Hierarchy Design. Computer Architecture A Quantitative Approach, Fifth Edition

Memory Hierarchy Computing Systems & Performance MSc Informatics Eng. Memory Hierarchy (most slides are borrowed)

ECE232: Hardware Organization and Design

LECTURE 10: Improving Memory Access: Direct and Spatial caches

Memory Hierarchy, Fully Associative Caches. Instructor: Nick Riasanovsky

Memory Hierarchy Computing Systems & Performance MSc Informatics Eng. Memory Hierarchy (most slides are borrowed)

Lecture 12. Memory Design & Caches, part 2. Christos Kozyrakis Stanford University

Lecture 17 Introduction to Memory Hierarchies" Why it s important " Fundamental lesson(s)" Suggested reading:" (HP Chapter

Transcription:

CSE 2021: Computer Organization Lecture-12 Caches-1 The basics of caches Shakil M. Khan

Memory Technology Static RAM (SRAM) 0.5ns 2.5ns, $2000 $5000 per GB Dynamic RAM (DRAM) 50ns 70ns, $20 $75 per GB Magnetic disk 5ms 20ms, $0.20 $2 per GB Ideal memory access time of SRAM capacity and cost/gb of disk CSE-2021 July-26-2012 2

Principle of Locality Programs access a small proportion of their address space at any time Temporal locality items accessed recently are likely to be accessed again soon e.g., instructions in a loop, induction variables Spatial locality items near those accessed recently are likely to be accessed soon e.g., sequential instruction access, array data CSE-2021 July-26-2012 3

Taking Advantage of Locality Memory hierarchy Store everything on disk Copy recently accessed (and nearby) items from disk to smaller DRAM memory main memory Copy more recently accessed (and nearby) items from DRAM to smaller SRAM memory cache memory attached to CPU CSE-2021 July-26-2012 4

Memory Hierarchy Levels Block (aka line): unit of copying may be multiple words If accessed data is present in upper level hit: access satisfied by upper level hit ratio: hits/accesses If accessed data is absent miss: block copied from lower level time taken: miss penalty miss ratio: misses/accesses = 1 hit ratio then accessed data supplied from upper level CSE-2021 July-26-2012 5

Cache Memory Cache memory the level of the memory hierarchy closest to the CPU Given accesses X 1,, X n 1, X n How do we know if the data is present? Where do we look? CSE-2021 July-26-2012 6

Direct Mapped Cache Location determined by address Direct mapped: only one choice (Block address) modulo (#Blocks in cache) #Blocks is a power of 2 Use low-order address bits CSE-2021 July-26-2012 7

Tags and Valid Bits How do we know which particular block is stored in a cache location? store block address as well as the data actually, only need the high-order bits called the tag What if there is no data in a location? valid bit: 1 = present, 0 = not present initially 0 CSE-2021 July-26-2012 8

Cache Example 8-blocks, 1 word/block, direct mapped Initial state: Index V Tag Data 000 N 001 N 010 N 011 N 100 N 101 N 110 N 111 N CSE-2021 July-26-2012 9

Cache Example Word addr Binary addr Hit/miss Cache block 22 10 110 Miss 110 Index V Tag Data 000 N 001 N 010 N 011 N 100 N 101 N 110 Y 10 Mem[10110] 111 N CSE-2021 July-26-2012 10

Cache Example Word addr Binary addr Hit/miss Cache block 26 11 010 Miss 010 Index V Tag Data 000 N 001 N 010 Y 11 Mem[11010] 011 N 100 N 101 N 110 Y 10 Mem[10110] 111 N CSE-2021 July-26-2012 11

Cache Example Word addr Binary addr Hit/miss Cache block 22 10 110 Hit 110 26 11 010 Hit 010 Index V Tag Data 000 N 001 N 010 Y 11 Mem[11010] 011 N 100 N 101 N 110 Y 10 Mem[10110] 111 N CSE-2021 July-26-2012 12

Cache Example Word addr Binary addr Hit/miss Cache block 16 10 000 Miss 000 3 00 011 Miss 011 16 10 000 Hit 000 Index V Tag Data 000 Y 10 Mem[10000] 001 N 010 Y 11 Mem[11010] 011 Y 00 Mem[00011] 100 N 101 N 110 Y 10 Mem[10110] 111 N CSE-2021 July-26-2012 13

Cache Example Word addr Binary addr Hit/miss Cache block 18 10 010 Miss 010 Index V Tag Data 000 Y 10 Mem[10000] 001 N 010 Y 10 Mem[10010] 011 Y 00 Mem[00011] 100 N 101 N 110 Y 10 Mem[10110] 111 N CSE-2021 July-26-2012 14

Address Subdivision CSE-2021 July-26-2012 15

Example: Larger Block Size 64 blocks, 16 bytes/block to what block number does address 1200 map? Block address = 1200/16 = 75 Block number = 75 modulo 64 = 11 31 10 9 4 3 0 Tag Index Offset 22 bits 6 bits 4 bits CSE-2021 July-26-2012 16

Block Size Considerations Larger blocks should reduce miss rate due to spatial locality But in a fixed-sized cache larger blocks fewer of them more competition increased miss rate larger blocks pollution Larger miss penalty can override benefit of reduced miss rate early restart and critical-word-first can help CSE-2021 July-26-2012 17

Cache Misses On cache hit, CPU proceeds normally On cache miss stall the CPU pipeline fetch block from next level of hierarchy instruction cache miss restart instruction fetch data cache miss complete data access CSE-2021 July-26-2012 18

Write-Through On data-write hit, could just update the block in cache but then cache and memory would be inconsistent Write through: also update memory But makes writes take longer e.g., if base CPI = 1, 10% of instructions are stores, write to memory takes 100 cycles effective CPI = 1 + 0.1 100 = 11 Solution: write buffer holds data waiting to be written to memory CPU continues immediately only stalls on write if write buffer is already full CSE-2021 July-26-2012 19

Write-Back Alternative: On data-write hit, just update the block in cache keep track of whether each block is dirty When a dirty block is replaced write it back to memory can use a write buffer to allow replacing block to be read first CSE-2021 July-26-2012 20

Write Allocation What should happen on a write miss? Alternatives for write-through allocate on miss: fetch the block write around: don t fetch the block since programs often write a whole block before reading it (e.g., initialization) For write-back usually fetch the block CSE-2021 July-26-2012 21