Design of Embedded DSP Processors Unit 5: Data access. 9/11/2017 Unit 5 of TSEA H1 1
|
|
- Nigel Preston
- 5 years ago
- Views:
Transcription
1 Design of Embedded DSP Processors Unit 5: Data access 9/11/2017 Unit 5 of TSEA H1 1
2 Data memory in a Processor Store Data FIFO supporting DSP executions Computing buffer Parameter storage Access Data access Memory addressing Cache Scratch pad memory Single/ multi-port Address generator Critical path Silicon cost 9/11/2017 2
3 Memory design: what to do Memory may not be a component in IP Addressing is part of the core design Memory peripheral is the core hardware Design for data access today includes 1. Memory peripheral design fundamentals 2. General memory addressing 3. Special memory addressing accelerations 9/11/2017 Unit 5 of TSEA H1 3
4 Memory issues and challenges An application may need mega B s memories Memory consumes the majority silicon area Move memory off chip will reeduce silicon cost Keep memory on chip will keep the performance On-off chip partition / trade-off is a main challenge for performance, low silicon, and low power Memory issue is the challenge of DSP design Memory hierarchy, partition, access latency, and usage or efficiency (SW-HW co-design) 9/11/2017 Unit 5 of TSEA H1 4
5 On chip memory problems Background: memory speed-up slower than that of logic when scaling silicon 1. Architecture: multi-memory blocks in parallel to support functional acceleration a) Small memory block size: fast, area inefficient, DFT? b) Large memory block size: slow and area efficient 2. Custom addressing V.S. cache (hide complexity) a) Acceleration will reduce latency and cost b) Cache easy programming and SW portability 9/11/2017 Unit 5 of TSEA H1 5
6 Physical memory circuits 9/11/2017 Unit 5 of TSEA H1 6
7 Data access specification Single port dual port Cache or SPM General address GEN Special address GEN such as modulo Memory peripheral functions Cache (I and D) and SPM (D) Size and number of memory modules Unsigned computing for AGU AGU Pipeline tricky Block shutting down Tricky to design for M peripherals Pipelie balancing for critical path and LPD 9/11/2017 7
8 The row decoder with m-lines of inputs and 2 m lines of output Copyright of Linköping University, all rights reserved Basic SRAM and its timing The column decoder and sense amplifiers with k-lines of inputs and 2 k+1 lines of output to the memory and one bit data in-out Data in-out bit 9/11/2017 For teachers using the book 8
9 Column line + Column bar line The row decoder Copyright of Linköping University, all rights reserved Basic SRAM and its timing A memory cell Row line The column decoder and R-W circuit (a) A memory cell 4 Data in-out bits (b) 128x4-bit signal port SRAM 9/11/2017 Unit 5 of TSEA H1 9
10 Scratch pad memory Simple SRAM, not a cache In general, scratch pad memory: Is a Synch, Single port, SRAM SW designer handels data access complexity & gets opportunities Can be DRAM / ROM multi ports 9/11/2017 Unit 5 of TSEA H1 10
11 Processor core, General address translation module Copyright of Linköping University, all rights reserved Cache = Valid Tag Data (256b=16bX16) Tag (21b) Index(9b) offset(4b) Cache come from French, Cache = Hide, meaning to access easier 9/11/2017 Unit 5 of TSEA H1 11
12 Scratch pad or cache? Scratch pad memory Simpler, cheaper, and use less power More opportunities for access acceleration: such as multi separated memories, custom addressing Deterministic, static only, for embedded systems! Cache memory Hide complexity, much more power, cost silicon Cache miss induced cycles costs uncertainty Programming easy, suitable for general computing 9/11/2017 Unit 5 of TSEA H1 12
13 Design of memory peripherals 9/11/2017 Unit 5 of TSEA H1 13
14 Basic SRAM and its timing W 9/11/2017 Unit 5 of TSEA H1 14
15 D flip-flop Memory logic (case 1) Copyright Write of in Linköping data University, all rights reserved Address Memory enable Read out data Logic or SRAM longwire Write enable Memory clock Machine clock The circuitry with problem Machine clock Address Memory enable Write enable Logic delay Read out data Read out register Data valid Before clock duty modification, memory using machine clock Memory clock Address Memory enable Write enable Logic delay Read out data Read out register Data valid 9/11/2017 For teachers using the book 15 After clock duty modification, memory using memory clock
16 Adress register Memory logic (case 2) Copyright of Linköping University, all rights reserved Machine clock Logic or long wire Write in data M Address Memory enable Write enable Memory clock The circuitry with problem SRAM Read out data Machine clock AR out Address Memory enable Write enable old address Decoding time Reading a wrong word Right address comes too late Read out data Incorrect data Read out register Data valid Before phase modification, memory using machine clock Machine clock Memory clock AR out Address Memory enable Write enable old address Decoding time Reading a right word Right address comes in time Data available here Read out data Correct data Read 9/11/2017 out register For teachers using the book 16 Data valid After phase modification, memory clock is delayed
17 Memory addressing 9/11/2017 Unit 5 of TSEA H1 17
18 General memory addressing Addressing Implied addressing Memory direct Segment plus offset Register indirect Register post increment Register pre decrement Index addressing Algorithm Specification Implicitly specified in the OP code A <= immediate data of the instruction A <= SEG + OFFSET A <= Selected GR A <= AP and then INC (AP) /*AP is an address pointer*/ DEC (AP) and then A <= AP A <= SEG + Index GR 9/11/2017 Unit 5 of TSEA H1 18
19 Addressing feedback Memory addressing circuit in general Copyright of Linköping University, all rights reserved Inputs Address calculation logic circuit Initial address Keeper Combinational output Registered output Address pointer 9/11/2017 Unit 5 of TSEA H1 19
20 An addressing circuit example Register value Offset value Addressing calculation logic II Direct address Addressing calculation M5 logic I (same same as II in this figure) M1 M2 FA M3 M M6... address RG1 address RG2 address RG6 address RG7 9/11/2017 For teachers using the book 20
21 Modulo addressing: an addressing acceleration example Register FIFO is needed by algorithms Easy to use, high power/silocn consumptions Modulo FIFO addressing on low power SRAM Using Top blocker, Bottom blocker, Address pointer Top Address pointer Bottom Address pointer <= Address pointer ± 1 If Address pointer (AP) = top then AP=bottom If Address pointer (AP) = bottom then AP=top 9/11/2017 Unit 5 of TSEA H1 21
22 A convolution Copyright of Linköping University, all rights reserved 02 ACR <= 0; 03 LCR <= m; // LCR is loop counter register; 04 CAR <= coefficient_starting_address; 05 DAR <= data_starting_address; // for DM; 06 TAR <= top_address; //of FIFO in DM; 07 BAR <= bottom_address;// of FIFO in DM; 08 DM (DAR) <= input_new_data; 09 OPA <= DM (DAR); 10 OPB <= TM (CAR); 11 BFR <= OPA * OPB; 12 ACR <= ACR + BFR; 13 if DAR == BAR then DAR <= TAR 14 else DEC (DAR); 15 INC (CAR); 16 DEC (LCR); 17 if LCR <> Loop_size then jump to else Y <= Saturate (round (ACR)); 9/11/2017 For teachers using the book end.
23 A convolution Copyright of Linköping University, all rights reserved 02 ACR <= 0; 03 LCR <= m; // LCR is loop counter register; 04 CAR <= coefficient_starting_address; 05 DAR <= data_starting_address; // for DM; 06 TAR <= top_address; //of FIFO in DM; 07 BAR <= bottom_address;// of FIFO in DM; 08 DM (DAR) <= input_new_data; 09 OPA <= DM (DAR); 10 OPB <= TM (CAR); 11 BFR <= OPA * OPB; 12 ACR <= ACR + BFR; 13 if DAR == BAR then DAR <= TAR 14 else DEC (DAR); 15 INC (CAR); 16 DEC (LCR); 17 if LCR <> Loop_size then jump to else Y <= Saturate (round (ACR)); 9/11/2017 For teachers using the book end.
24 The data memory space The FIFO buffer Copyright of Linköping University, all rights reserved Design a FIFO based on SRAM MIN address BAR TAR MAX address DM BAR + 0 BAR + 1 BAR + 2 BAR + 3 BAR + 4 BAR TAR BAR TAR Step 0 Step 1 X (n-3) BAR X (n-4) X (n-4) X (n) X (n) DAR X (n-1) X (n-1) X (n-2) X (n-2) TAR X (n-3) before getting after getting new data new data 1 Step 2 Step 3 X (n) DAR BAR X (n-1) X (n-1) X (n-2) X (n-2) X (n-3) X (n-3) X (n-4) X (n-4) TAR X (n) after getting after getting new data 2 new data 3 DAR DAR Example: The procedure a FIFO getting a new data sample 9/11/2017 Unit 5 of TSEA H1 24
25 Convolution hardware Copyright of Linköping University, all rights reserved Modulo addressing circuit Load data to registers M DM TM TAR + Flag if EQ 1 0 M1 * 0 1 M M2 BAR DAR = Modolu address generator + ACR 9/11/2017 Unit 5 of TSEA H1 25
26 Convolution hardware Copyright of Linköping University, all rights reserved Modulo++ addressing circuit Load data to registers M DM TM BAR + Flag if EQ 1 0 M1 * 0 1 M M2 TAR DAR = Modolu address generator + ACR 9/11/2017 For teachers using the book 26
27 Memory hierarchy 9/11/2017 Unit 5 of TSEA H1 27
28 Memory in an embedded system An application may need >200MB SRAM E.g. a video encoder with high end monitor Frames for camera pre-processing Frames for video encoder and for video decoder Frames for video post processing A 4K frame size is 24 MB (4096 x 2160 x 3B) More than 200MB SRAM needed, it consumes ~ 50mm 2 on chip, expensive! 9/11/2017 Unit 5 of TSEA H1 28
29 Memory in an embedded system An application may need X-MB ROM Except for data buffer, a video encoder with high end monitor needs yet other memories for: Up to xmb camera and other driver setting Up to xmb video CODEC codes Up to xkb codes for post processing Usually CMOS silicon processing masks are not for ROM processing. Extra cost needed! 9/11/2017 Unit 5 of TSEA H1 29
30 Memory in an embedded system 1. We therefore need off chip SDRAM (DDR) and off-chip ROM (or SSD) in a system 2. We speed up on chip SRAM by splitting DM into small memory blocks Memory subsystem design is essential for embedded system 9/11/2017 Unit 5 of TSEA H1 30
31 DSP core External memory I/F Main memory, ROM DSP core DMA controller & I/F Main memory, ROM Copyright of Linköping University, all rights reserved Memory hierarchy I-Scratch pad memory D-Scratch pad memories (a) I-Scratch pad memory I-cache memory D-Scratch pad memory (b) D-cache memory 9/11/2017 For teachers using the book 31
32 Datapath Level 1: Register file DMA controller Level 3: Main memory Copyright of Linköping University, all rights reserved Memory hierarchy on a chip Level 2: Data memory: Scratch pad memory or cache The chip 9/11/2017 Unit 5 of TSEA H1 32
33 DMA Copyright of Linköping University, all rights reserved Memory hierarchy of SoC = DSP+MCU DSP MCU Accelerators L1: RF DP+CP L1: RF DP+CP DP+CP DM1 DMn PM DM1 DMn PM DMn PM I/F DMA DMA I/F DMA I/F SoCBUS and its arbitration / routing / control Main on chip memory Nonvolatile memory I/F Off chip DRAM I/F I/F 9/11/2017 Unit 5 of TSEA H1 33
34 Requirements Memory partition The limit of the on chip Memory size The number of data needed simultaneously Supporting access of different data types Overhead costs from memory peripheral Critical path from memory peripheral Memory shutting down for low power 9/11/2017 Unit 5 of TSEA H1 34
35 Off chip memory extension DDR or Flash M or off-chip SRAM DDR (volatile): 1. Double data rate 2. SDRAM DDR Controller: POR, refresh, access, buffering DDR PHY: circuit and I/O implementation Flash M (no volatile SSD) Floating gate to keep data, control gate to change data NAND: write fast, large size, low cost NOR: read fast, low power, and small Off chip SRAM: fast and parallel, high cost 9/11/2017 Unit 5 of TSEA H1 35
36 DMA basics 9/11/2017 Unit 5 of TSEA H1 36
37 DMA definition and specification DMA: Direct memory access An external device independent to the core Running load and store in parallel with a core DSP processor can do other things in parallel Requirements Large bandwidth and low latency Flexible / support different access patterns Multiple access and Linking table is important 9/11/2017 Unit 5 of TSEA H1 37
38 Direct Memory Access Processor configures a DMA setup Processor asks for the right to use the main memory Gets right Processor releases a memory page (block) and connects it to a DMA port Main memory (or another memory) is connected to another DMA port Run DMA transaction Processor running other programs DMA run the transaction 9/11/2017 For teachers using the book 38 Processor takes back the memory page and release DMA as well the main memory
39 DMA data and control 9/11/2017 Unit 5 of TSEA H1 39
40 DMA Behavior model DMA request Arbiter DMA ACK Config. REG address Configuration vector DMA data in Controller FIFO buffer Address generator 1 Address generator 2 DMA data out Clock 1 Address enable W-enable DMA status Clock 2 Address enable W-enable 9/11/2017 Unit 5 of TSEA H1 40
41 Questions to discuss Why computing for memory addresses are based on unsigned computing, what are benefits Modulo addressing to emulate a FIFO, which is one kind of acceleration for address computing, what kinds of acceleration is need for address calculation of FFT? It is proposed to try (to run DIT butterfly in one cycle) it if you have time. 9/11/2017 Unit 5 of TSEA H1 41
42 Concepts Copyright of Linköping University, all rights reserved Skills Review on Unit 5 System understanding Plan HW schematic HW coding Micro architecture Memory & data access Memory circuit Memory hierarchy General an special addressing for memory access We did not talk about addressing for RF Modulo addressing: A way to use part of SRAM as FIFO for filters and for Java garbage collection in JVM You may need to think how to use D-cache and SPM in parallel Plan for pipeline to balance the critical path and avoid long wire delay Coding skill is specially important for IP reuse because SRAM offered by different IP suppliers might be different 9/11/2017 Unit 5 of TSEA H1 42
43 Self reading after the lecture Function of the memory subsystem Function of memory general/special addressing Read chapter 15 Think about: FIFO Behavior model mapping on SRAM Modulo addressing hardware implementation 9/11/2017 Unit 5 of TSEA H1 43
44 Exciting time now! Let us discuss Whatever you want to discuss and related to HW You will have the chance after each lecture (Fö), do take the chance! Prepare your Qs for the next time 9/11/2017 Unit 5 of TSEA H1 44
45 LOGO Welcome to ask any questions you want to I can answer Or discuss together I want to know what you want Dake Liu, Room 556 coridoor B, Hus-B, phone , dake.liu@liu.se
08 - Address Generator Unit (AGU)
October 2, 2014 Todays lecture Memory subsystem Address Generator Unit (AGU) Schedule change A new lecture has been entered into the schedule (to compensate for the lost lecture last week) Memory subsystem
More informationDesign of Embedded DSP Processors
Design of Embedded DSP Processors Unit 3: Microarchitecture, Register file, and ALU 9/11/2017 Unit 3 of TSEA26-2017 H1 1 Contents 1. Microarchitecture and its design 2. Hardware design fundamentals 3.
More informationDesign of Embedded DSP Processors Unit 2: Design basics. 9/11/2017 Unit 2 of TSEA H1 1
Design of Embedded DSP Processors Unit 2: Design basics 9/11/2017 Unit 2 of TSEA26-2017 H1 1 ASIP/ASIC design flow We need to have the flow in mind, so that we will know what we are talking about in later
More informationDesign of Embedded DSP Processors
Design of Embedded DSP Processors Unit 10: Integration and Verification 10/3/2017 Unit 10 of TSEA26 2017 H1 1 Three integrations 1. Hardware integration (Integration of RTL codes) 2. Integration of the
More information05 - Microarchitecture, RF and ALU
September 15, 2015 Microarchitecture Design Step 1: Partition each assembly instruction into microoperations, allocate each microoperation into corresponding hardware modules. Step 2: Collect all microoperations
More informationDesign of Embedded DSP Processors Unit 7: Programming toolchain. 9/26/2017 Unit 7 of TSEA H1 1
Design of Embedded DSP Processors Unit 7: Programming toolchain 9/26/2017 Unit 7 of TSEA26 2017 H1 1 Toolchain introduction There are two kinds of tools 1.The ASIP design tool for HW designers Frontend
More informationDesign of Embedded DSP Processors Unit 8: Firmware design and benchmarking. 9/27/2017 Unit 8 of TSEA H1 1
Design of Embedded DSP Processors Unit 8: Firmware design and benchmarking 9/27/2017 Unit 8 of TSEA26 2017 H1 1 Contents Introduction to FW and its coding flow 1. Application modeling under HW constraints
More informationEmbedded Systems Design: A Unified Hardware/Software Introduction. Outline. Chapter 5 Memory. Introduction. Memory: basic concepts
Hardware/Software Introduction Chapter 5 Memory Outline Memory Write Ability and Storage Permanence Common Memory Types Composing Memory Memory Hierarchy and Cache Advanced RAM 1 2 Introduction Memory:
More informationEmbedded Systems Design: A Unified Hardware/Software Introduction. Chapter 5 Memory. Outline. Introduction
Hardware/Software Introduction Chapter 5 Memory 1 Outline Memory Write Ability and Storage Permanence Common Memory Types Composing Memory Memory Hierarchy and Cache Advanced RAM 2 Introduction Embedded
More informationEmbedded Systems: Hardware Components (part I) Todor Stefanov
Embedded Systems: Hardware Components (part I) Todor Stefanov Leiden Embedded Research Center Leiden Institute of Advanced Computer Science Leiden University, The Netherlands Outline Generic Embedded System
More informationOrganization. 5.1 Semiconductor Main Memory. William Stallings Computer Organization and Architecture 6th Edition
William Stallings Computer Organization and Architecture 6th Edition Chapter 5 Internal Memory 5.1 Semiconductor Main Memory 5.2 Error Correction 5.3 Advanced DRAM Organization 5.1 Semiconductor Main Memory
More informationTSEA 26 exam page 1 of Examination. Design of Embedded DSP Processors, TSEA26 Date 8-12, G34, G32, FOI hus G
TSEA 26 exam page 1 of 10 20171019 Examination Design of Embedded DSP Processors, TSEA26 Date 8-12, 2017-10-19 Room G34, G32, FOI hus G Time 08-12AM Course code TSEA26 Exam code TEN1 Design of Embedded
More informationCENG4480 Lecture 09: Memory 1
CENG4480 Lecture 09: Memory 1 Bei Yu byu@cse.cuhk.edu.hk (Latest update: November 8, 2017) Fall 2017 1 / 37 Overview Introduction Memory Principle Random Access Memory (RAM) Non-Volatile Memory Conclusion
More informationEE414 Embedded Systems Ch 5. Memory Part 2/2
EE414 Embedded Systems Ch 5. Memory Part 2/2 Byung Kook Kim School of Electrical Engineering Korea Advanced Institute of Science and Technology Overview 6.1 introduction 6.2 Memory Write Ability and Storage
More informationBasic Organization Memory Cell Operation. CSCI 4717 Computer Architecture. ROM Uses. Random Access Memory. Semiconductor Memory Types
CSCI 4717/5717 Computer Architecture Topic: Internal Memory Details Reading: Stallings, Sections 5.1 & 5.3 Basic Organization Memory Cell Operation Represent two stable/semi-stable states representing
More informationWilliam Stallings Computer Organization and Architecture 6th Edition. Chapter 5 Internal Memory
William Stallings Computer Organization and Architecture 6th Edition Chapter 5 Internal Memory Semiconductor Memory Types Semiconductor Memory RAM Misnamed as all semiconductor memory is random access
More informationComputer Organization and Assembly Language (CS-506)
Computer Organization and Assembly Language (CS-506) Muhammad Zeeshan Haider Ali Lecturer ISP. Multan ali.zeeshan04@gmail.com https://zeeshanaliatisp.wordpress.com/ Lecture 2 Memory Organization and Structure
More informationTSEA22, DIGITALTEKNIK LECTURE 7
LINKÖPING UNIVERSITY Department of Electrical Engineering TSEA22, DIGITALTEKNIK LECTURE 7 Mario Garrido Gálvez mario.garrido.galvez@liu.se Linköping, 2018 1 FEEDBACK: POSITIVE Course: Good / Interesting
More informationCharacteristics of Memory Location wrt Motherboard. CSCI 4717 Computer Architecture. Characteristics of Memory Capacity Addressable Units
CSCI 4717/5717 Computer Architecture Topic: Cache Memory Reading: Stallings, Chapter 4 Characteristics of Memory Location wrt Motherboard Inside CPU temporary memory or registers Motherboard main memory
More informationECE 485/585 Microprocessor System Design
Microprocessor System Design Lecture 4: Memory Hierarchy Memory Taxonomy SRAM Basics Memory Organization DRAM Basics Zeshan Chishti Electrical and Computer Engineering Dept Maseeh College of Engineering
More informationa) Memory management unit b) CPU c) PCI d) None of the mentioned
1. CPU fetches the instruction from memory according to the value of a) program counter b) status register c) instruction register d) program status word 2. Which one of the following is the address generated
More informationOverview. Memory Classification Read-Only Memory (ROM) Random Access Memory (RAM) Functional Behavior of RAM. Implementing Static RAM
Memories Overview Memory Classification Read-Only Memory (ROM) Types of ROM PROM, EPROM, E 2 PROM Flash ROMs (Compact Flash, Secure Digital, Memory Stick) Random Access Memory (RAM) Types of RAM Static
More informationCENG3420 Lecture 08: Memory Organization
CENG3420 Lecture 08: Memory Organization Bei Yu byu@cse.cuhk.edu.hk (Latest update: February 22, 2018) Spring 2018 1 / 48 Overview Introduction Random Access Memory (RAM) Interleaving Secondary Memory
More informationChapter 4 Main Memory
Chapter 4 Main Memory Course Outcome (CO) - CO2 Describe the architecture and organization of computer systems Program Outcome (PO) PO1 Apply knowledge of mathematics, science and engineering fundamentals
More informationENGIN 112 Intro to Electrical and Computer Engineering
ENGIN 112 Intro to Electrical and Computer Engineering Lecture 30 Random Access Memory (RAM) Overview Memory is a collection of storage cells with associated input and output circuitry Possible to read
More information! Memory Overview. ! ROM Memories. ! RAM Memory " SRAM " DRAM. ! This is done because we can build. " large, slow memories OR
ESE 57: Digital Integrated Circuits and VLSI Fundamentals Lec 2: April 5, 26 Memory Overview, Memory Core Cells Lecture Outline! Memory Overview! ROM Memories! RAM Memory " SRAM " DRAM 2 Memory Overview
More informationChapter 5 Internal Memory
Chapter 5 Internal Memory Memory Type Category Erasure Write Mechanism Volatility Random-access memory (RAM) Read-write memory Electrically, byte-level Electrically Volatile Read-only memory (ROM) Read-only
More informationCOMP3221: Microprocessors and. and Embedded Systems. Overview. Lecture 23: Memory Systems (I)
COMP3221: Microprocessors and Embedded Systems Lecture 23: Memory Systems (I) Overview Memory System Hierarchy RAM, ROM, EPROM, EEPROM and FLASH http://www.cse.unsw.edu.au/~cs3221 Lecturer: Hui Wu Session
More informationInternal Memory Cache Stallings: Ch 4, Ch 5 Key Characteristics Locality Cache Main Memory
Lecture 3 Internal Memory Cache Stallings: Ch 4, Ch 5 Key Characteristics Locality Cache Main Memory Key Characterics of Memories / Storage (Sta06 Table 4.1) 26.1.2010 2 Goals I want my memory lightning
More informationLecture 18: Memory Systems. Spring 2018 Jason Tang
Lecture 18: Memory Systems Spring 2018 Jason Tang 1 Topics Memory hierarchy Memory operations Cache basics 2 Computer Organization Computer Processor Memory Devices Control Datapath Input Output So far,
More informationMemory. Outline. ECEN454 Digital Integrated Circuit Design. Memory Arrays. SRAM Architecture DRAM. Serial Access Memories ROM
ECEN454 Digital Integrated Circuit Design Memory ECEN 454 Memory Arrays SRAM Architecture SRAM Cell Decoders Column Circuitry Multiple Ports DRAM Outline Serial Access Memories ROM ECEN 454 12.2 1 Memory
More informationComputer Organization. 8th Edition. Chapter 5 Internal Memory
William Stallings Computer Organization and Architecture 8th Edition Chapter 5 Internal Memory Semiconductor Memory Types Memory Type Category Erasure Write Mechanism Volatility Random-access memory (RAM)
More informationCHAPTER 12 ARRAY SUBSYSTEMS [ ] MANJARI S. KULKARNI
CHAPTER 2 ARRAY SUBSYSTEMS [2.4-2.9] MANJARI S. KULKARNI OVERVIEW Array classification Non volatile memory Design and Layout Read-Only Memory (ROM) Pseudo nmos and NAND ROMs Programmable ROMS PROMS, EPROMs,
More informationPicture of memory. Word FFFFFFFD FFFFFFFE FFFFFFFF
Memory Sequential circuits all depend upon the presence of memory A flip-flop can store one bit of information A register can store a single word, typically 32-64 bits Memory allows us to store even larger
More informationLatches. IT 3123 Hardware and Software Concepts. Registers. The Little Man has Registers. Data Registers. Program Counter
IT 3123 Hardware and Software Concepts Notice: This session is being recorded. CPU and Memory June 11 Copyright 2005 by Bob Brown Latches Can store one bit of data Can be ganged together to store more
More informationChapter 5. Internal Memory. Yonsei University
Chapter 5 Internal Memory Contents Main Memory Error Correction Advanced DRAM Organization 5-2 Memory Types Memory Type Category Erasure Write Mechanism Volatility Random-access memory(ram) Read-write
More informationLecture 11 Cache. Peng Liu.
Lecture 11 Cache Peng Liu liupeng@zju.edu.cn 1 Associative Cache Example 2 Associative Cache Example 3 Associativity Example Compare 4-block caches Direct mapped, 2-way set associative, fully associative
More informationMemory hierarchy Outline
Memory hierarchy Outline Performance impact Principles of memory hierarchy Memory technology and basics 2 Page 1 Performance impact Memory references of a program typically determine the ultimate performance
More informationMEMORY AND PROGRAMMABLE LOGIC
MEMORY AND PROGRAMMABLE LOGIC Memory is a device where we can store and retrieve information It can execute a read and a write Programmable Logic is a device where we can store and retrieve information
More informationFive Key Steps to High-Speed NAND Flash Performance and Reliability
Five Key Steps to High-Speed Flash Performance and Reliability Presenter Bob Pierce Flash Memory Summit 2010 Santa Clara, CA 1 NVM Performance Trend ONFi 2 PCM Toggle ONFi 2 DDR SLC Toggle Performance
More informationCycle Time for Non-pipelined & Pipelined processors
Cycle Time for Non-pipelined & Pipelined processors Fetch Decode Execute Memory Writeback 250ps 350ps 150ps 300ps 200ps For a non-pipelined processor, the clock cycle is the sum of the latencies of all
More informationComputer Organization
INF 101 Fundamental Information Technology Computer Organization Assistant Prof. Dr. Turgay ĐBRĐKÇĐ Course slides are adapted from slides provided by Addison-Wesley Computing Fundamentals of Information
More informationLecture Objectives. Introduction to Computing Chapter 0. Topics. Numbering Systems 04/09/2017
Lecture Objectives Introduction to Computing Chapter The AVR microcontroller and embedded systems using assembly and c Students should be able to: Convert between base and. Explain the difference between
More informationThe Memory Hierarchy. Daniel Sanchez Computer Science & Artificial Intelligence Lab M.I.T. April 3, 2018 L13-1
The Memory Hierarchy Daniel Sanchez Computer Science & Artificial Intelligence Lab M.I.T. April 3, 2018 L13-1 Memory Technologies Technologies have vastly different tradeoffs between capacity, latency,
More informationComputer Memory. Textbook: Chapter 1
Computer Memory Textbook: Chapter 1 ARM Cortex-M4 User Guide (Section 2.2 Memory Model) STM32F4xx Technical Reference Manual: Chapter 2 Memory and Bus Architecture Chapter 3 Flash Memory Chapter 36 Flexible
More informationPower Reduction Techniques in the Memory System. Typical Memory Hierarchy
Power Reduction Techniques in the Memory System Low Power Design for SoCs ASIC Tutorial Memories.1 Typical Memory Hierarchy On-Chip Components Control edram Datapath RegFile ITLB DTLB Instr Data Cache
More informationEmbedded Systems: Architecture
Embedded Systems: Architecture Jinkyu Jeong (Jinkyu@skku.edu) Computer Systems Laboratory Sungkyunkwan University http://csl.skku.edu ICE3028: Embedded Systems Design, Fall 2018, Jinkyu Jeong (jinkyu@skku.edu)
More informationCS/EE 3710 Computer Architecture Lab Checkpoint #2 Datapath Infrastructure
CS/EE 3710 Computer Architecture Lab Checkpoint #2 Datapath Infrastructure Overview In order to complete the datapath for your insert-name-here machine, the register file and ALU that you designed in checkpoint
More informationLecture 13: SRAM. Slides courtesy of Deming Chen. Slides based on the initial set from David Harris. 4th Ed.
Lecture 13: SRAM Slides courtesy of Deming Chen Slides based on the initial set from David Harris CMOS VLSI Design Outline Memory Arrays SRAM Architecture SRAM Cell Decoders Column Circuitry Multiple Ports
More information(Advanced) Computer Organization & Architechture. Prof. Dr. Hasan Hüseyin BALIK (5 th Week)
+ (Advanced) Computer Organization & Architechture Prof. Dr. Hasan Hüseyin BALIK (5 th Week) + Outline 2. The computer system 2.1 A Top-Level View of Computer Function and Interconnection 2.2 Cache Memory
More informationLinköping University Post Print. epuma: a novel embedded parallel DSP platform for predictable computing
Linköping University Post Print epuma: a novel embedded parallel DSP platform for predictable computing Jian Wang, Joar Sohl, Olof Kraigher and Dake Liu N.B.: When citing this work, cite the original article.
More informationThe Nios II Family of Configurable Soft-core Processors
The Nios II Family of Configurable Soft-core Processors James Ball August 16, 2005 2005 Altera Corporation Agenda Nios II Introduction Configuring your CPU FPGA vs. ASIC CPU Design Instruction Set Architecture
More informationLecture 17 Introduction to Memory Hierarchies" Why it s important " Fundamental lesson(s)" Suggested reading:" (HP Chapter
Processor components" Multicore processors and programming" Processor comparison" vs." Lecture 17 Introduction to Memory Hierarchies" CSE 30321" Suggested reading:" (HP Chapter 5.1-5.2)" Writing more "
More informationLecture 11 SRAM Zhuo Feng. Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 2010
EE4800 CMOS Digital IC Design & Analysis Lecture 11 SRAM Zhuo Feng 11.1 Memory Arrays SRAM Architecture SRAM Cell Decoders Column Circuitryit Multiple Ports Outline Serial Access Memories 11.2 Memory Arrays
More informationCS650 Computer Architecture. Lecture 9 Memory Hierarchy - Main Memory
CS65 Computer Architecture Lecture 9 Memory Hierarchy - Main Memory Andrew Sohn Computer Science Department New Jersey Institute of Technology Lecture 9: Main Memory 9-/ /6/ A. Sohn Memory Cycle Time 5
More informationMemory technology and optimizations ( 2.3) Main Memory
Memory technology and optimizations ( 2.3) 47 Main Memory Performance of Main Memory: Latency: affects Cache Miss Penalty» Access Time: time between request and word arrival» Cycle Time: minimum time between
More informationInternal Memory. Computer Architecture. Outline. Memory Hierarchy. Semiconductor Memory Types. Copyright 2000 N. AYDIN. All rights reserved.
Computer Architecture Prof. Dr. Nizamettin AYDIN naydin@yildiz.edu.tr nizamettinaydin@gmail.com Internal Memory http://www.yildiz.edu.tr/~naydin 1 2 Outline Semiconductor main memory Random Access Memory
More informationChapter 5B. Large and Fast: Exploiting Memory Hierarchy
Chapter 5B Large and Fast: Exploiting Memory Hierarchy One Transistor Dynamic RAM 1-T DRAM Cell word access transistor V REF TiN top electrode (V REF ) Ta 2 O 5 dielectric bit Storage capacitor (FET gate,
More informationModule 5a: Introduction To Memory System (MAIN MEMORY)
Module 5a: Introduction To Memory System (MAIN MEMORY) R E F E R E N C E S : S T A L L I N G S, C O M P U T E R O R G A N I Z A T I O N A N D A R C H I T E C T U R E M O R R I S M A N O, C O M P U T E
More informationEECS150 - Digital Design Lecture 16 - Memory
EECS150 - Digital Design Lecture 16 - Memory October 17, 2002 John Wawrzynek Fall 2002 EECS150 - Lec16-mem1 Page 1 Memory Basics Uses: data & program storage general purpose registers buffering table lookups
More informationCOSC 6385 Computer Architecture - Memory Hierarchies (III)
COSC 6385 Computer Architecture - Memory Hierarchies (III) Edgar Gabriel Spring 2014 Memory Technology Performance metrics Latency problems handled through caches Bandwidth main concern for main memory
More informationCannon Mountain Dr Longmont, CO LS6410 Hardware Design Perspective
LS6410 Hardware Design Perspective 1. S3C6410 Introduction The S3C6410X is a 16/32-bit RISC microprocessor, which is designed to provide a cost-effective, lowpower capabilities, high performance Application
More informationISSN: [Bilani* et al.,7(2): February, 2018] Impact Factor: 5.164
IJESRT INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY A REVIEWARTICLE OF SDRAM DESIGN WITH NECESSARY CRITERIA OF DDR CONTROLLER Sushmita Bilani *1 & Mr. Sujeet Mishra 2 *1 M.Tech Student
More informationCS 152 Computer Architecture and Engineering. Lecture 7 - Memory Hierarchy-II
CS 152 Computer Architecture and Engineering Lecture 7 - Memory Hierarchy-II Krste Asanovic Electrical Engineering and Computer Sciences University of California at Berkeley http://www.eecs.berkeley.edu/~krste
More informationCSCI-UA.0201 Computer Systems Organization Memory Hierarchy
CSCI-UA.0201 Computer Systems Organization Memory Hierarchy Mohamed Zahran (aka Z) mzahran@cs.nyu.edu http://www.mzahran.com Programmer s Wish List Memory Private Infinitely large Infinitely fast Non-volatile
More informationELCT 912: Advanced Embedded Systems
Advanced Embedded Systems Lecture 2: Memory and Programmable Logic Dr. Mohamed Abd El Ghany, Memory Random Access Memory (RAM) Can be read and written Static Random Access Memory (SRAM) Data stored so
More informationWilliam Stallings Computer Organization and Architecture 8th Edition. Chapter 5 Internal Memory
William Stallings Computer Organization and Architecture 8th Edition Chapter 5 Internal Memory Semiconductor Memory The basic element of a semiconductor memory is the memory cell. Although a variety of
More informationMemory Pearson Education, Inc., Hoboken, NJ. All rights reserved.
1 Memory + 2 Location Internal (e.g. processor registers, cache, main memory) External (e.g. optical disks, magnetic disks, tapes) Capacity Number of words Number of bytes Unit of Transfer Word Block Access
More informationEECS150 - Digital Design Lecture 16 Memory 1
EECS150 - Digital Design Lecture 16 Memory 1 March 13, 2003 John Wawrzynek Spring 2003 EECS150 - Lec16-mem1 Page 1 Memory Basics Uses: Whenever a large collection of state elements is required. data &
More informationk -bit address bus n-bit data bus Control lines ( R W, MFC, etc.)
THE MEMORY SYSTEM SOME BASIC CONCEPTS Maximum size of the Main Memory byte-addressable CPU-Main Memory Connection, Processor MAR MDR k -bit address bus n-bit data bus Memory Up to 2 k addressable locations
More informationSemiconductor Memory Classification. Today. ESE 570: Digital Integrated Circuits and VLSI Fundamentals. CPU Memory Hierarchy.
ESE 57: Digital Integrated Circuits and VLSI Fundamentals Lec : April 4, 7 Memory Overview, Memory Core Cells Today! Memory " Classification " ROM Memories " RAM Memory " Architecture " Memory core " SRAM
More informationVery Large Scale Integration (VLSI)
Very Large Scale Integration (VLSI) Lecture 8 Dr. Ahmed H. Madian ah_madian@hotmail.com Content Array Subsystems Introduction General memory array architecture SRAM (6-T cell) CAM Read only memory Introduction
More informationMemory Systems and Compiler Support for MPSoC Architectures. Mahmut Kandemir and Nikil Dutt. Cap. 9
Memory Systems and Compiler Support for MPSoC Architectures Mahmut Kandemir and Nikil Dutt Cap. 9 Fernando Moraes 28/maio/2013 1 MPSoC - Vantagens MPSoC architecture has several advantages over a conventional
More informationContents. Memory System Overview Cache Memory. Internal Memory. Virtual Memory. Memory Hierarchy. Registers In CPU Internal or Main memory
Memory Hierarchy Contents Memory System Overview Cache Memory Internal Memory External Memory Virtual Memory Memory Hierarchy Registers In CPU Internal or Main memory Cache RAM External memory Backing
More informationINSTITUTO SUPERIOR TÉCNICO. Architectures for Embedded Computing
UNIVERSIDADE TÉCNICA DE LISBOA INSTITUTO SUPERIOR TÉCNICO Departamento de Engenharia Informática Architectures for Embedded Computing MEIC-A, MEIC-T, MERC Lecture Slides Version 3.0 - English Lecture 16
More informationProcessing Unit CS206T
Processing Unit CS206T Microprocessors The density of elements on processor chips continued to rise More and more elements were placed on each chip so that fewer and fewer chips were needed to construct
More informationA 50Mvertices/s Graphics Processor with Fixed-Point Programmable Vertex Shader for Mobile Applications
A 50Mvertices/s Graphics Processor with Fixed-Point Programmable Vertex Shader for Mobile Applications Ju-Ho Sohn, Jeong-Ho Woo, Min-Wuk Lee, Hye-Jung Kim, Ramchan Woo, Hoi-Jun Yoo Semiconductor System
More informationCMPEN 411 VLSI Digital Circuits Spring Lecture 22: Memery, ROM
CMPEN 411 VLSI Digital Circuits Spring 2011 Lecture 22: Memery, ROM [Adapted from Rabaey s Digital Integrated Circuits, Second Edition, 2003 J. Rabaey, A. Chandrakasan, B. Nikolic] Sp11 CMPEN 411 L22 S.1
More informationAdvanced Caching Techniques (2) Department of Electrical Engineering Stanford University
Lecture 4: Advanced Caching Techniques (2) Department of Electrical Engineering Stanford University http://eeclass.stanford.edu/ee282 Lecture 4-1 Announcements HW1 is out (handout and online) Due on 10/15
More informationECE 1160/2160 Embedded Systems Design. Midterm Review. Wei Gao. ECE 1160/2160 Embedded Systems Design
ECE 1160/2160 Embedded Systems Design Midterm Review Wei Gao ECE 1160/2160 Embedded Systems Design 1 Midterm Exam When: next Monday (10/16) 4:30-5:45pm Where: Benedum G26 15% of your final grade What about:
More informationEE4380 Microprocessor Design Project
EE4380 Microprocessor Design Project Fall 2002 Class 1 Pari vallal Kannan Center for Integrated Circuits and Systems University of Texas at Dallas Introduction What is a Microcontroller? Microcontroller
More informationCS252 S05. Main memory management. Memory hardware. The scale of things. Memory hardware (cont.) Bottleneck
Main memory management CMSC 411 Computer Systems Architecture Lecture 16 Memory Hierarchy 3 (Main Memory & Memory) Questions: How big should main memory be? How to handle reads and writes? How to find
More informationMultilevel Memories. Joel Emer Computer Science and Artificial Intelligence Laboratory Massachusetts Institute of Technology
1 Multilevel Memories Computer Science and Artificial Intelligence Laboratory Massachusetts Institute of Technology Based on the material prepared by Krste Asanovic and Arvind CPU-Memory Bottleneck 6.823
More informationThe CPU and Memory. How does a computer work? How does a computer interact with data? How are instructions performed? Recall schematic diagram:
The CPU and Memory How does a computer work? How does a computer interact with data? How are instructions performed? Recall schematic diagram: 1 Registers A register is a permanent storage location within
More informationSummary of Computer Architecture
Summary of Computer Architecture Summary CHAP 1: INTRODUCTION Structure Top Level Peripherals Computer Central Processing Unit Main Memory Computer Systems Interconnection Communication lines Input Output
More informationELEC-H-473 Microprocessor architecture Caches
ELEC-H-473 Microprocessor architecture Caches Lecture 05 Dragomir MILOJEVIC dragomir.milojevic@ulb.ac.be 2013 Previously on ELEC-H-473 Reasons that cause pipeline inefficiency : Variance in execution time
More informationLECTURE 5: MEMORY HIERARCHY DESIGN
LECTURE 5: MEMORY HIERARCHY DESIGN Abridged version of Hennessy & Patterson (2012):Ch.2 Introduction Programmers want unlimited amounts of memory with low latency Fast memory technology is more expensive
More informationMemory. Hakim Weatherspoon CS 3410, Spring 2013 Computer Science Cornell University
Memory Hakim Weatherspoon CS 3410, Spring 2013 Computer Science Cornell University Big Picture: Building a Processor memory inst register file alu PC +4 +4 new pc offset target imm control extend =? cmp
More informationIntroduction to Embedded System Processor Architectures
Introduction to Embedded System Processor Architectures Contents crafted by Professor Jari Nurmi Tampere University of Technology Department of Computer Systems Motivation Why Processor Design? Embedded
More informationMemory Hierarchy and Cache Ch 4-5
Memory Hierarchy and Cache Ch 4-5 Memory Hierarchy Main Memory Cache Implementation 1 Teemu s Cheesecake hand table Register, on-chip cache, memory, disk, and tape speeds relative to times locating cheese
More informationMain Memory Systems. Department of Electrical Engineering Stanford University Lecture 5-1
Lecture 5 Main Memory Systems Department of Electrical Engineering Stanford University http://eeclass.stanford.edu/ee282 Lecture 5-1 Announcements If you don t have a group of 3, contact us ASAP HW-1 is
More informationElectronic Control systems are also: Members of the Mechatronic Systems. Control System Implementation. Printed Circuit Boards (PCBs) - #1
Control System Implementation Hardware implementation Electronic Control systems are also: Members of the Mechatronic Systems Concurrent design (Top-down approach?) Mechanic compatibility Solve the actual
More informationECEN 449 Microprocessor System Design. Memories
ECEN 449 Microprocessor System Design Memories 1 Objectives of this Lecture Unit Learn about different types of memories SRAM/DRAM/CAM /C Flash 2 1 SRAM Static Random Access Memory 3 SRAM Static Random
More informationCS 33. Memory Hierarchy I. CS33 Intro to Computer Systems XVI 1 Copyright 2016 Thomas W. Doeppner. All rights reserved.
CS 33 Memory Hierarchy I CS33 Intro to Computer Systems XVI 1 Copyright 2016 Thomas W. Doeppner. All rights reserved. Random-Access Memory (RAM) Key features RAM is traditionally packaged as a chip basic
More informationMoneta: A High-performance Storage Array Architecture for Nextgeneration, Micro 2010
Moneta: A High-performance Storage Array Architecture for Nextgeneration, Non-volatile Memories Micro 2010 NVM-based SSD NVMs are replacing spinning-disks Performance of disks has lagged NAND flash showed
More informationThe Use Of Virtual Platforms In MP-SoC Design. Eshel Haritan, VP Engineering CoWare Inc. MPSoC 2006
The Use Of Virtual Platforms In MP-SoC Design Eshel Haritan, VP Engineering CoWare Inc. MPSoC 2006 1 MPSoC Is MP SoC design happening? Why? Consumer Electronics Complexity Cost of ASIC Increased SW Content
More informationESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems
ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Lec 26: November 9, 2018 Memory Overview Dynamic OR4! Precharge time?! Driving input " With R 0 /2 inverter! Driving inverter
More informationQuestion?! Processor comparison!
1! 2! Suggested Readings!! Readings!! H&P: Chapter 5.1-5.2!! (Over the next 2 lectures)! Lecture 18" Introduction to Memory Hierarchies! 3! Processor components! Multicore processors and programming! Question?!
More informationDigital Integrated Circuits Lecture 13: SRAM
Digital Integrated Circuits Lecture 13: SRAM Chih-Wei Liu VLSI Signal Processing LAB National Chiao Tung University cwliu@twins.ee.nctu.edu.tw DIC-Lec13 cwliu@twins.ee.nctu.edu.tw 1 Outline Memory Arrays
More information18-447: Computer Architecture Lecture 25: Main Memory. Prof. Onur Mutlu Carnegie Mellon University Spring 2013, 4/3/2013
18-447: Computer Architecture Lecture 25: Main Memory Prof. Onur Mutlu Carnegie Mellon University Spring 2013, 4/3/2013 Reminder: Homework 5 (Today) Due April 3 (Wednesday!) Topics: Vector processing,
More information