Routing Path Reuse Maximization for Efficient NV-FPGA Reconfiguration
|
|
- Christal Alexander
- 6 years ago
- Views:
Transcription
1 Routing Path Reuse Maximization for Efficient NV-FPGA Reconfiguration Yuan Xue, Patrick ronin, hengmo Yang and Jingtong Hu 01/27/2016
2 Outline Introduction NV-FPGA benefits and challenges Routing optimization strategy Proposed Routing Reuse Optimizations Experimental Evaluation onclusion 1
3 Self adaptive systems FPGA offers reconfigurability, flexibility, and low design cost to various embedded systems such as control, signal processing and many other applications areas. 2
4 Drawbacks of traditional FPGAs Unfortunately, traditional SRAM-based FPGAs cannot meet increasing design requirements: Low scalability High leakage power Prone to soft error Volatile 3
5 NVM-based FPGA Non volatile Memories (NVMs) use physical characteristics to represent logic states, such as: Phase hange Memory (PM) Ferroelectric RAM (FRAM) Spin Transfer Torque Magnetic RAM (STT-MRAM) STT-MRAM PM FRAM 4
6 Overcome SRAM limitations? NVM High density Near-zero leakage power Strong error resistance Near-zero power-on delay SRAM Low scalability High leakage power Prone to soft error Volatile However, no rose is without a thorn! 5
7 Any issues or challenges? OMPARISON OF SRAM AND VARIOUS NVM ELLS Type Area (F 2 ) Read time(ns) Write time(ns) Write cycles SRAM PM 4 12 STT-MRAM NOR Flash However, no rose is without a thorn! Two major issues of NVM FPGA: Slow Writes make the reconfiguration time non-trivial! Short Endurance limits device lifetime! 6
8 How to solve? asic scheme: reduce writes and increase reuse with a bitlevel read-before-write (RW) strategy New data compare read write NVM cells NVM cells Why this works for NVM? For PM Without RW 4 writes 400ns With RW 4 reads+ 148ns 1 write Great Improvement! Type For SRAM Read time(ns) Write time(ns) SRAM Without RW PM writes No benefit! 0.8ns With RW 4 reads+ 1ns 1 write 7
9 M NV-FPGA NV-FPGA: Use NVMs as on-chip memories and configurable block units on FPGA. Main blocks include: onfigurable logic blocks (s) onnection blocks (s) Switch box (s) LOK AREA POWER DELAY 90% 85% 80% M M (a) Structure General Input Full rossar (b) Structure arry in Fracturable LE[0] arry out General... Output Fracturable LE[9] LUT arry in FF FF arry out (c)asic Logic Element out1 out2 + 10% 15% 20% All can be NVMs! nv nv nv nv nv nv nv nv nv M NVM based Programmable locks Architecture (d) Structure (e)bidirectional pass switch We target s the dominating blocks on the FPGA 8
10 Three categories: Related work on routing optimization Hierarchical routing match & preserve Idea: onstruct cluster routing graph, match and preserve route hierarchically Related work: M.M Ozdal IAD 09, hing-yu hin IAD 14 Proposed work differs from them in: Single Path level routing optimization oarse-grained partial reconfiguration Idea: Partition bitstream into dynamic and static parts, reuse static parts under partial reconfiguration framework. Related work: E. Vansteenkiste FPL 12,. Al Farisi FPL 13 Fine-grained bit level reuse, require no partial reconfiguration support Incremental design routing reuse Idea: At engineering change order(eo) stage, compare netlists to find possible reusable metal wire sections, and preserve these metal layer wires. Related work: Yun-Ru Wu VLSI-DAT 10, Hsi-An hien ASP-DA 14 an be applied to both highly-similar and dissimilar designs. 8
11 Overview Our goal Minimize bit-flips when reconfiguring on NVM-based FPGA Strategy Maximize bit-level reuse of switch boxes during routing Questions to be addressed How to model reconfiguration cost? What types of flexibilities can be exploited to maximize reuse? How to perform reuse-aware routing while preserving circuit timing? 10
12 Outline Introduction Proposed Routing Reuse Optimizations Modeling routing reconfiguration cost Identifying & Maximizing reusable path Proposed reuse-aware routing algorithm Proposed AD flow Experimental Evaluation onclusion 11
13 Reconfiguration cost Wilton Switch type l k a b c d e switch On/ off l a 1 j b 1 c e 1 switch On/ off 3 changed switches l a 0 j b 0 c e 1 l k a b c d e j i h g Existing configuration f h f 1 k i 0 connected switch Nold=4 Reused switch: R=2 h f 1 k i 1 connected switch Nnew=3 j i h g New configuration f Single reconfig cost: RR =N old + N new 2R NUM RR i Total reconfig cost: ost reconfig = i=1 12
14 Outline Introduction Proposed Routing Reuse Optimizations Modeling routing reconfiguration cost Identifying & Maximizing reusable path Proposed reuse-aware routing algorithm Proposed AD flow Experimental Evaluation onclusion 13
15 Path Definition and haracterization Definition: Path is a single source-to-sink connection. s locally connect s to s, can be omitted in our structure model FPGA hip 14
16 Path Definition and haracterization Definition: Path is a single source-to-sink connection. haracterization: Path P={(i,j),( first last )}. P consists of two sets: set: (i,j) starting point i, ending point j set: ( first last ) all s that P passes through Path reuse could efficiently translate to reuse! Path P FPGA hip 15
17 Two types of Reusable Paths Full reuse: P shares both starting and ending s with P. Partial reuse: P shares starting and last with P. Full Reuse Old Path P New Path P Partial Reuse FPGA hip 16
18 Reusable Path Maximization However, the same starting cannot guarantee path reuse. LUT-to-LE mapping will result in different path reuse and reconfiguration costs. Suppose LUT1 1 and LUT2 3. If LUT1 LE1, path can be reused Otherwise, path can not be reused ross bar LE1 LEn 1 n LUT1 2 Pin1 6 New design LUT Old Path P New Path P FPGA hip 17
19 To maximize path reuse, we exploit LUT-to-LE mapping flexibilities. Assume n LEs in each, the problem can be translated to bipartite graph matching. Weight of edge i j represents the number of reusable switches. For example, LUT1 LE1 Edge 11 =2 LUT1 Reusable Path Maximization 1 2 Pin1 3 Optimal mapping=? mapping! 3 LEa Reuse= LE 1-n LUT 1-n Maximal reuse! LUT2 LEb LUT3 LEc One in new design One block on FPGA Optimal mapping can be identified with Kuhn-Munkras Algorithm. H. Kuhn, The Hungarian method for the assignment problem, in NRLQ, Mar. 1955, pp
20 Outline Introduction Proposed Routing Reuse Optimizations Modeling routing reconfiguration cost Identifying & Maximizing reusable path Proposed reuse-aware routing algorithm Proposed AD flow Experimental Evaluation onclusion 19
21 Reuse-aware Routing Algorithm asic routing: Select routing resources to finish connection in each net. source Net: single source to multiple sinks connection. Each net contains several paths with the same starting point. sink Routing resources pool Iteratively, exist if no congestion Proposed three-stage routing algorithm Fix reusable paths in each net Route other paths Relax reusable path with bad timing Divide resource into two types: general (used for all nets) and dedicated (used by specified net) Rank nets by their sink count, route nets with maximum sinks first When relax, release related dedicated resources 20
22 Stage 1: fix reusable path Stage 2: route other paths Stage 3: relax related paths Repeat 2,3 if congesting Input: Netlist, Placement, Reusable paths, Relax threshold ɛ Output: Routing results Mark all routing resources rr_node:type = general; for (i = 0;i<NUM_reuse_path;i++) { for (n= 0;i<NUM_net;i++) { if (rr_path[i] Nets[n]) { Mark its corresponding rr_node:type = n; Mark rr_path[i] Trace[n]; } } } Sort (nets, unrouted sinks); for (loop = 0;loop<Iteration_limit;loop++) { for (i=0;i<num_net;i++) { n = Order[i]; for (j = 0;j<sink_num[n];j++) { if (path[n][j] reusable path) { if (rr_node:type == (general n) rr_node andidate_set; Pathnder(Trace[n],path[n][j],andidate_set); } } Perform timing analysis and update path criticality values; for (i = 0;i<NUM_reuse_path;i++) } { if (T_rrpath[i] > (1-ɛ) T_critical_path) { rr node:type = general; Remove rr_path[i] from the reuse list; } } if (no congestion exist) Exit ; 21
23 Outline Introduction Proposed Routing Reuse Optimizations Modeling routing reconfiguration cost Identifying & Maximizing reusable path Proposed reuse-aware routing algorithm Proposed AD flow Experimental Evaluation onclusion 22
24 Original AD flow Design Description (HDL) Synthesis and technology map lustering and packing level placement Routing Estimate performance 23
25 Proposed AD flow New Design Design Description (HDL) Retrieve on-chip information Existing Design Synthesis and technology map lustering and packing level placement Phase 1 Generate inner (LUT to LE) placement mapping Phase 2 Path-reuse aware routing Evaluate performance Phase 3 24
26 Outline Introduction Proposed Routing Reuse Optimizations Experimental Evaluation onclusion 25
27 Methodology FPGA architecture: Altera Stratix IV AD toolkit: VTR 7.0 Schemes Read-beforewrite Strategy Experimental configuration Reusable Path Identification Reusable Path Maximization Reuse-aware Routing aseline DIR (ɛ=1%) Proposed (ɛ=1%) DIR: Proposed scheme without reuse maximization ɛ: Threshold for reusable path relaxation, path timing within ɛ=1% of critical path will be relaxed. 26
28 Methodology 10 MN benchmarks, 9 test pairs TP1 TP9 No enchmark # LUT# Net# Track Width 1 bigkey s frisc elliptic spla pdc ex s s clma
29 Results 1/3 Path reuse Full path reuse rate Partial path reuse rate 12.0% 30.0% 10.0% 25.0% 8.0% 20.0% 6.0% 15.0% 4.0% 10.0% 2.0% 5.0% 0.0% TP1 TP2 TP3 TP4 TP5 TP6 TP7 TP8 TP9 0.0% TP1 TP2 TP3 TP4 TP5 TP6 TP7 TP8 TP9 DIR full reuse Proposed full reuse DIR partial reuse Proposed partial reuse Average DIR = 1.7%, Proposed = 3.9%. Average DIR = 4.3%, Proposed = 15.8%. Full + Partial = 40.0% 30.0% 20.0% 10.0% Total path reuse rate 0.0% TP1 TP2 TP3 TP4 TP5 TP6 TP7 TP8 TP9 DIR total reuse Proposed total reuse Average DIR = 6%, Proposed = 19.7%. 28
30 Results 2/3 reconfiguration cost reduction 35.0% 30.0% 25.0% 20.0% 15.0% 10.0% 5.0% 0.0% reconfiguration cost = reconfiguration writes (in bits) reconfiguration cost reduction TP1 TP2 TP3 TP4 TP5 TP6 TP7 TP8 TP9 DIR Reduction Proposed Reduction Average DIR = 9.8%, Proposed = 24.5%. Reconfiguration cost reduction is strongly correlated with reuse rate but not exactly the same, since paths contain different numbers of s. 40.0% 30.0% 20.0% 10.0% 0.0% Total path reuse rate TP1 TP2 TP3 TP4 TP5 TP6 TP7 TP8 TP9 DIR total reuse Proposed total reuse 29
31 Results 3/3 Performances with different ɛ 1.1 ɛ=%1 has the best 1 overall performance. 0.9 Average performance of all test pairs ɛ=0% ɛ=1% ɛ=2% ɛ=3% ɛ=5% ɛ=10% ɛ: threshold to control reusable path relaxation reconfiguration cost(normalized) ritical path delay(normalized) Geomean the smaller, the better
32 Outline Introduction Proposed Routing Reuse Optimizations Experimental Evaluation onclusion 31
33 hallenges: onclusion NVM-based FPGAs are promising to self-adaptive applications, but slow writes and short endurance of NVMs need to be addressed Our goal: Minimize reconfiguration costs of switch boxes through reuse-aware routing Our approaches: Model reconfiguration cost and identify two types of reusable paths Maximize path reuse through exploiting LUT-to-LE mapping flexibilities Enhance VTR AD flow with a reuse-aware routing algorithm Results Summary: Proposed schemes deliver as much as 40% path reuse and 34% reduction in reconfiguration cost, within 3.5% overhead in critical path delay. 32
34 THE END Questions are welcome.
Synthesizable FPGA Fabrics Targetable by the VTR CAD Tool
Synthesizable FPGA Fabrics Targetable by the VTR CAD Tool Jin Hee Kim and Jason Anderson FPL 2015 London, UK September 3, 2015 2 Motivation for Synthesizable FPGA Trend towards ASIC design flow Design
More informationSUBMITTED FOR PUBLICATION TO: IEEE TRANSACTIONS ON VLSI, DECEMBER 5, A Low-Power Field-Programmable Gate Array Routing Fabric.
SUBMITTED FOR PUBLICATION TO: IEEE TRANSACTIONS ON VLSI, DECEMBER 5, 2007 1 A Low-Power Field-Programmable Gate Array Routing Fabric Mingjie Lin Abbas El Gamal Abstract This paper describes a new FPGA
More informationECE 636. Reconfigurable Computing. Lecture 2. Field Programmable Gate Arrays I
ECE 636 Reconfigurable Computing Lecture 2 Field Programmable Gate Arrays I Overview Anti-fuse and EEPROM-based devices Contemporary SRAM devices - Wiring - Embedded New trends - Single-driver wiring -
More informationChristophe HURIAUX. Embedded Reconfigurable Hardware Accelerators with Efficient Dynamic Reconfiguration
Mid-term Evaluation March 19 th, 2015 Christophe HURIAUX Embedded Reconfigurable Hardware Accelerators with Efficient Dynamic Reconfiguration Accélérateurs matériels reconfigurables embarqués avec reconfiguration
More informationMTJ-Based Nonvolatile Logic-in-Memory Architecture
2011 Spintronics Workshop on LSI @ Kyoto, Japan, June 13, 2011 MTJ-Based Nonvolatile Logic-in-Memory Architecture Takahiro Hanyu Center for Spintronics Integrated Systems, Tohoku University, JAPAN Laboratory
More informationDesigning Heterogeneous FPGAs with Multiple SBs *
Designing Heterogeneous FPGAs with Multiple SBs * K. Siozios, S. Mamagkakis, D. Soudris, and A. Thanailakis VLSI Design and Testing Center, Department of Electrical and Computer Engineering, Democritus
More informationRTL Coding General Concepts
RTL Coding General Concepts Typical Digital System 2 Components of a Digital System Printed circuit board (PCB) Embedded d software microprocessor microcontroller digital signal processor (DSP) ASIC Programmable
More informationA Novel Design Framework for the Design of Reconfigurable Systems based on NoCs
Politecnico di Milano & EPFL A Novel Design Framework for the Design of Reconfigurable Systems based on NoCs Vincenzo Rana, Ivan Beretta, Donatella Sciuto Donatella Sciuto sciuto@elet.polimi.it Introduction
More informationMohsen Imani. University of California San Diego. System Energy Efficiency Lab seelab.ucsd.edu
Mohsen Imani University of California San Diego Winter 2016 Technology Trend for IoT http://www.flashmemorysummit.com/english/collaterals/proceedi ngs/2014/20140807_304c_hill.pdf 2 Motivation IoT significantly
More informationHow Much Logic Should Go in an FPGA Logic Block?
How Much Logic Should Go in an FPGA Logic Block? Vaughn Betz and Jonathan Rose Department of Electrical and Computer Engineering, University of Toronto Toronto, Ontario, Canada M5S 3G4 {vaughn, jayar}@eecgutorontoca
More information8. Best Practices for Incremental Compilation Partitions and Floorplan Assignments
8. Best Practices for Incremental Compilation Partitions and Floorplan Assignments QII51017-9.0.0 Introduction The Quartus II incremental compilation feature allows you to partition a design, compile partitions
More informationMaximizing Routing Resource Reuse in a Reconfiguration-aware Connection Router for FPGAs
FACULTY OF ENGINEERING AND ARCHITECTURE Maximizing Routing Resource Reuse in a Reconfiguration-aware Connection Router for FPGAs Elias Vansteenkiste Karel Bruneel and Dirk Stroobandt Elias.Vansteenkiste@UGent.be
More informationProgrammable Logic Devices FPGA Architectures II CMPE 415. Overview This set of notes introduces many of the features available in the FPGAs of today.
Overview This set of notes introduces many of the features available in the FPGAs of today. The majority use SRAM based configuration cells, which allows fast reconfiguation. Allows new design ideas to
More informationCS310 Embedded Computer Systems. Maeng
1 INTRODUCTION (PART II) Maeng Three key embedded system technologies 2 Technology A manner of accomplishing a task, especially using technical processes, methods, or knowledge Three key technologies for
More informationVdd Programmability to Reduce FPGA Interconnect Power
Vdd Programmability to Reduce FPGA Interconnect Power Fei Li, Yan Lin and Lei He Electrical Engineering Department University of California, Los Angeles, CA 90095 ABSTRACT Power is an increasingly important
More informationEN2911X: Reconfigurable Computing Lecture 13: Design Flow: Physical Synthesis (5)
EN2911X: Lecture 13: Design Flow: Physical Synthesis (5) Prof. Sherief Reda Division of Engineering, rown University http://scale.engin.brown.edu Fall 09 Summary of the last few lectures System Specification
More informationAn Introduction to FPGA Placement. Yonghong Xu Supervisor: Dr. Khalid
RESEARCH CENTRE FOR INTEGRATED MICROSYSTEMS UNIVERSITY OF WINDSOR An Introduction to FPGA Placement Yonghong Xu Supervisor: Dr. Khalid RESEARCH CENTRE FOR INTEGRATED MICROSYSTEMS UNIVERSITY OF WINDSOR
More informationMODULAR PARTITIONING FOR INCREMENTAL COMPILATION
MODULAR PARTITIONING FOR INCREMENTAL COMPILATION Mehrdad Eslami Dehkordi, Stephen D. Brown Dept. of Electrical and Computer Engineering University of Toronto, Toronto, Canada email: {eslami,brown}@eecg.utoronto.ca
More informationEfficient Hardware Debugging using Parameterized FPGA Reconfiguration
2016 IEEE International Parallel and Distributed Processing Symposium Workshops Efficient Hardware Debugging using Parameterized FPGA Reconfiguration Alexandra Kourfali Department of Electronics and Information
More information160 M. Nadjarbashi, S.M. Fakhraie and A. Kaviani Figure 2. LUTB structure. each block-level track can be arbitrarily connected to each of 16 4-LUT inp
Scientia Iranica, Vol. 11, No. 3, pp 159{164 c Sharif University of Technology, July 2004 On Routing Architecture for Hybrid FPGA M. Nadjarbashi, S.M. Fakhraie 1 and A. Kaviani 2 In this paper, the routing
More informationResearch Challenges for FPGAs
Research Challenges for FPGAs Vaughn Betz CAD Scalability Recent FPGA Capacity Growth Logic Eleme ents (Thousands) 400 350 300 250 200 150 100 50 0 MCNC Benchmarks 250 nm FLEX 10KE Logic: 34X Memory Bits:
More informationVery Large Scale Integration (VLSI)
Very Large Scale Integration (VLSI) Lecture 6 Dr. Ahmed H. Madian Ah_madian@hotmail.com Dr. Ahmed H. Madian-VLSI 1 Contents FPGA Technology Programmable logic Cell (PLC) Mux-based cells Look up table PLA
More informationDYNAMICALLY SHIFTED SCRUBBING FOR FAST FPGA REPAIR. Leonardo P. Santos, Gabriel L. Nazar and Luigi Carro
DYNAMICALLY SHIFTED SCRUBBING FOR FAST FPGA REPAIR Leonardo P. Santos, Gabriel L. Nazar and Luigi Carro Instituto de Informática Universidade Federal do Rio Grande do Sul (UFRGS) Porto Alegre, RS - Brazil
More informationVdd Programmable and Variation Tolerant FPGA Circuits and Architectures
Vdd Programmable and Variation Tolerant FPGA Circuits and Architectures Prof. Lei He EE Department, UCLA LHE@ee.ucla.edu Partially supported by NSF. Pathway to Power Efficiency and Variation Tolerance
More informationPower Solutions for Leading-Edge FPGAs. Vaughn Betz & Paul Ekas
Power Solutions for Leading-Edge FPGAs Vaughn Betz & Paul Ekas Agenda 90 nm Power Overview Stratix II : Power Optimization Without Sacrificing Performance Technical Features & Competitive Results Dynamic
More informationFPGA for Complex System Implementation. National Chiao Tung University Chun-Jen Tsai 04/14/2011
FPGA for Complex System Implementation National Chiao Tung University Chun-Jen Tsai 04/14/2011 About FPGA FPGA was invented by Ross Freeman in 1989 SRAM-based FPGA properties Standard parts Allowing multi-level
More informationINTRODUCTION TO FPGA ARCHITECTURE
3/3/25 INTRODUCTION TO FPGA ARCHITECTURE DIGITAL LOGIC DESIGN (BASIC TECHNIQUES) a b a y 2input Black Box y b Functional Schematic a b y a b y a b y 2 Truth Table (AND) Truth Table (OR) Truth Table (XOR)
More informationCouture: Tailoring STT-MRAM for Persistent Main Memory. Mustafa M Shihab Jie Zhang Shuwen Gao Joseph Callenes-Sloan Myoungsoo Jung
Couture: Tailoring STT-MRAM for Persistent Main Memory Mustafa M Shihab Jie Zhang Shuwen Gao Joseph Callenes-Sloan Myoungsoo Jung Executive Summary Motivation: DRAM plays an instrumental role in modern
More informationDigital Design Methodology (Revisited) Design Methodology: Big Picture
Digital Design Methodology (Revisited) Design Methodology Design Specification Verification Synthesis Technology Options Full Custom VLSI Standard Cell ASIC FPGA CS 150 Fall 2005 - Lec #25 Design Methodology
More informationAutomated Extraction of Physical Hierarchies for Performance Improvement on Programmable Logic Devices
Automated Extraction of Physical Hierarchies for Performance Improvement on Programmable Logic Devices Deshanand P. Singh Altera Corporation dsingh@altera.com Terry P. Borer Altera Corporation tborer@altera.com
More informationLeakage Efficient Chip-Level Dual-Vdd Assignment with Time Slack Allocation for FPGA Power Reduction
44.1 Leakage Efficient Chip-Level Dual-Vdd Assignment with Time Slack Allocation for FPGA Power Reduction Yan Lin and Lei He Electrical Engineering Department University of California, Los Angeles, CA
More informationArchitectural Aspects in Design and Analysis of SOTbased
Architectural Aspects in Design and Analysis of SOTbased Memories Rajendra Bishnoi, Mojtaba Ebrahimi, Fabian Oboril & Mehdi Tahoori INSTITUTE OF COMPUTER ENGINEERING (ITEC) CHAIR FOR DEPENDABLE NANO COMPUTING
More informationPerformance Improvement and Size Reduction Scheme over Circuits by Using LUT/MUX Architecture
Performance Improvement and Size Reduction Scheme over Circuits by Using LUT/MUX Architecture R. Pradeepa 1, S.P. Senthil Kumar 2 M.E. VLSI Design, Shanmuganathan Engineering College, Arasampatti, Pudukkottai-622507,
More informationAdvanced FPGA Design Methodologies with Xilinx Vivado
Advanced FPGA Design Methodologies with Xilinx Vivado Alexander Jäger Computer Architecture Group Heidelberg University, Germany Abstract With shrinking feature sizes in the ASIC manufacturing technology,
More informationFPGAs & Multi-FPGA Systems. FPGA Abstract Model. Logic cells imbedded in a general routing structure. Logic cells usually contain:
s & Multi- Systems Fit logic into a prefabricated system Fixed inter-chip routing Fixed on-chip logic & routing XBA Partitioning Global outing Technology Map. XBA XBA Placement outing 23 Abstract Model
More informationCMP annual meeting, January 23 rd, 2014
J.P.Nozières, G.Prenat, B.Dieny and G.Di Pendina Spintec, UMR-8191, CEA-INAC/CNRS/UJF-Grenoble1/Grenoble-INP, Grenoble, France CMP annual meeting, January 23 rd, 2014 ReRAM V wr0 ~-0.9V V wr1 V ~0.9V@5ns
More informationAn Introduction to Programmable Logic
Outline An Introduction to Programmable Logic 3 November 24 Transistors Logic Gates CPLD Architectures FPGA Architectures Device Considerations Soft Core Processors Design Example Quiz Semiconductors Semiconductor
More informationFPGA: What? Why? Marco D. Santambrogio
FPGA: What? Why? Marco D. Santambrogio marco.santambrogio@polimi.it 2 Reconfigurable Hardware Reconfigurable computing is intended to fill the gap between hardware and software, achieving potentially much
More informationTiming Optimization of FPGA Placements by Logic Replication
13.1 Timing Optimization of FPGA Placements by Logic Replication Giancarlo Beraudo ECE Department, University of Illinois at Chicago 851 S. Morgan St., Chicago IL, 60607 gberaudo@ece.uic.edu John Lillis
More informationHardware Design Environments. Dr. Mahdi Abbasi Computer Engineering Department Bu-Ali Sina University
Hardware Design Environments Dr. Mahdi Abbasi Computer Engineering Department Bu-Ali Sina University Outline Welcome to COE 405 Digital System Design Design Domains and Levels of Abstractions Synthesis
More informationANALYSING THE BITSTREAM OF ALTERA'S MAX-V CPLDS
ANALYSING THE BITSTREAM OF ALTERA'S MAX-V CPLDS Jean-Francois Nguyen July 14, 2016 1 SUMMARY 1. CPLDs and their design flow 2. MAX V internals overview 3. Analysis of the bitstream 2 CPLD? Complex Programmable
More informationEmerging NV Storage and Memory Technologies --Development, Manufacturing and
Emerging NV Storage and Memory Technologies --Development, Manufacturing and Applications-- Tom Coughlin, Coughlin Associates Ed Grochowski, Computer Storage Consultant 2014 Coughlin Associates 1 Outline
More informationField Programmable Gate Array (FPGA)
Field Programmable Gate Array (FPGA) Lecturer: Krébesz, Tamas 1 FPGA in general Reprogrammable Si chip Invented in 1985 by Ross Freeman (Xilinx inc.) Combines the advantages of ASIC and uc-based systems
More informationEECS150 - Digital Design Lecture 5 - Verilog Logic Synthesis
EECS150 - Digital Design Lecture 5 - Verilog Logic Synthesis Jan 31, 2012 John Wawrzynek Spring 2012 EECS150 - Lec05-verilog_synth Page 1 Outline Quick review of essentials of state elements Finite State
More informationNovel Nonvolatile Memory Hierarchies to Realize "Normally-Off Mobile Processors" ASP-DAC 2014
Novel Nonvolatile Memory Hierarchies to Realize "Normally-Off Mobile Processors" ASP-DAC 2014 Shinobu Fujita, Kumiko Nomura, Hiroki Noguchi, Susumu Takeda, Keiko Abe Toshiba Corporation, R&D Center Advanced
More informationOptimization of Robust Asynchronous Circuits by Local Input Completeness Relaxation. Computer Science Department Columbia University
Optimization of Robust Asynchronous ircuits by Local Input ompleteness Relaxation heoljoo Jeong Steven M. Nowick omputer Science Department olumbia University Outline 1. Introduction 2. Background: Hazard
More informationA Methodology and Tool Framework for Supporting Rapid Exploration of Memory Hierarchies in FPGAs
A Methodology and Tool Framework for Supporting Rapid Exploration of Memory Hierarchies in FPGAs Harrys Sidiropoulos, Kostas Siozios and Dimitrios Soudris School of Electrical & Computer Engineering National
More informationAn FPGA Architecture Supporting Dynamically-Controlled Power Gating
An FPGA Architecture Supporting Dynamically-Controlled Power Gating Altera Corporation March 16 th, 2012 Assem Bsoul and Steve Wilton {absoul, stevew}@ece.ubc.ca System-on-Chip Research Group Department
More informationDE2 Board & Quartus II Software
January 23, 2015 Contact and Office Hours Teaching Assistant (TA) Sergio Contreras Office Office Hours Email SEB 3259 Tuesday & Thursday 12:30-2:00 PM Wednesday 1:30-3:30 PM contre47@nevada.unlv.edu Syllabus
More informationBasic Block. Inputs. K input. N outputs. I inputs MUX. Clock. Input Multiplexors
RPack: Rability-Driven packing for cluster-based FPGAs E. Bozorgzadeh S. Ogrenci-Memik M. Sarrafzadeh Computer Science Department Department ofece Computer Science Department UCLA Northwestern University
More informationProASIC PLUS FPGA Family
ProASIC PLUS FPGA Family Key Features Reprogrammable /Nonvolatile Flash Technology Low Power Secure Single Chip/Live at Power Up 1M Equivalent System Gates Cost Effective ASIC Alternative ASIC Design Flow
More informationLattice Semiconductor Design Floorplanning
September 2012 Introduction Technical Note TN1010 Lattice Semiconductor s isplever software, together with Lattice Semiconductor s catalog of programmable devices, provides options to help meet design
More informationA Path Based Algorithm for Timing Driven. Logic Replication in FPGA
A Path Based Algorithm for Timing Driven Logic Replication in FPGA By Giancarlo Beraudo B.S., Politecnico di Torino, Torino, 2001 THESIS Submitted as partial fulfillment of the requirements for the degree
More informationCache/Memory Optimization. - Krishna Parthaje
Cache/Memory Optimization - Krishna Parthaje Hybrid Cache Architecture Replacing SRAM Cache with Future Memory Technology Suji Lee, Jongpil Jung, and Chong-Min Kyung Department of Electrical Engineering,KAIST
More informationState of the art. 2.1 Introduction to FPGAs
2 State of the art 2.1 Introduction to FPGAs A Field programmable Gate Array (FPGA) is an integrated circuit that is designed to be configured after manufacturing. FPGAs can be used to implement any logic
More informationImplementing Logic in FPGA Memory Arrays: Heterogeneous Memory Architectures
Implementing Logic in FPGA Memory Arrays: Heterogeneous Memory Architectures Steven J.E. Wilton Department of Electrical and Computer Engineering University of British Columbia Vancouver, BC, Canada, V6T
More informationFloorplan and Power/Ground Network Co-Synthesis for Fast Design Convergence
Floorplan and Power/Ground Network Co-Synthesis for Fast Design Convergence Chen-Wei Liu 12 and Yao-Wen Chang 2 1 Synopsys Taiwan Limited 2 Department of Electrical Engineering National Taiwan University,
More informationEnhanced Prediction of Interconnect delays for FPGA Synthesis using MATLAB
Power Matters. TM Enhanced Prediction of Interconnect delays for FPGA Synthesis using MATLAB Geetesh More (Sr. Software Engineer) Kristofer Vorwerk (Principal Software Engineer) Arun Kundu (Director, Software
More informationLogic Block Clustering of Large Designs for Channel-Width Constrained FPGAs
{ Logic Block Clustering of Large Designs for Channel-Width Constrained FPGAs Marvin Tom marvint @ ece.ubc.ca Guy Lemieux lemieux @ ece.ubc.ca Dept of ECE, University of British Columbia, Vancouver, BC,
More informationIntroduction Warp Processors Dynamic HW/SW Partitioning. Introduction Standard binary - Separating Function and Architecture
Roman Lysecky Department of Electrical and Computer Engineering University of Arizona Dynamic HW/SW Partitioning Initially execute application in software only 5 Partitioned application executes faster
More informationEmerging NVM Memory Technologies
Emerging NVM Memory Technologies Yuan Xie Associate Professor The Pennsylvania State University Department of Computer Science & Engineering www.cse.psu.edu/~yuanxie yuanxie@cse.psu.edu Position Statement
More informationPerformance-Preserved Analog Routing Methodology via Wire Load Reduction
Electronic Design Automation Laboratory (EDA LAB) Performance-Preserved Analog Routing Methodology via Wire Load Reduction Hao-Yu Chi, Hwa-Yi Tseng, Chien-Nan Jimmy Liu, Hung-Ming Chen 2 Dept. of Electrical
More informationUnleashing MRAM as Persistent Memory
Unleashing MRAM as Persistent Memory Andrew J. Walker PhD Spin Transfer Technologies Contents The Creaking Pyramid Challenges with the Memory Hierarchy What and Where is MRAM? State of the Art pmtj Unleashing
More informationEEL 4783: HDL in Digital System Design
EEL 4783: HDL in Digital System Design Lecture 13: Floorplanning Prof. Mingjie Lin Topics Partitioning a design with a floorplan. Performance improvements by constraining the critical path. Floorplanning
More informationL14 - Placement and Routing
L14 - Placement and Routing Ajay Joshi Massachusetts Institute of Technology RTL design flow HDL RTL Synthesis manual design Library/ module generators netlist Logic optimization a b 0 1 s d clk q netlist
More informationSpiral 2-8. Cell Layout
2-8.1 Spiral 2-8 Cell Layout 2-8.2 Learning Outcomes I understand how a digital circuit is composed of layers of materials forming transistors and wires I understand how each layer is expressed as geometric
More informationIntroduction to reconfigurable systems
Introduction to reconfigurable systems Reconfigurable system (RS)= any system whose sub-system configurations can be changed or modified after fabrication Reconfigurable computing (RC) is commonly used
More informationCover TBD. intel Quartus prime Design software
Cover TBD intel Quartus prime Design software Fastest Path to Your Design The Intel Quartus Prime software is revolutionary in performance and productivity for FPGA, CPLD, and SoC designs, providing a
More informationEECS150 - Digital Design Lecture 6 - Field Programmable Gate Arrays (FPGAs)
EECS150 - Digital Design Lecture 6 - Field Programmable Gate Arrays (FPGAs) September 12, 2002 John Wawrzynek Fall 2002 EECS150 - Lec06-FPGA Page 1 Outline What are FPGAs? Why use FPGAs (a short history
More informationStatic and Dynamic Memory Footprint Reduction for FPGA Routing Algorithms
18 Static and Dynamic Memory Footprint Reduction for FPGA Routing Algorithms SCOTT Y. L. CHIN and STEVEN J. E. WILTON University of British Columbia This article presents techniques to reduce the static
More informationHybrid Cache Architecture (HCA) with Disparate Memory Technologies
Hybrid Cache Architecture (HCA) with Disparate Memory Technologies Xiaoxia Wu, Jian Li, Lixin Zhang, Evan Speight, Ram Rajamony, Yuan Xie Pennsylvania State University IBM Austin Research Laboratory Acknowledgement:
More informationDigital Design Methodology
Digital Design Methodology Prof. Soo-Ik Chae Digital System Designs and Practices Using Verilog HDL and FPGAs @ 2008, John Wiley 1-1 Digital Design Methodology (Added) Design Methodology Design Specification
More informationOutline. EECS150 - Digital Design Lecture 6 - Field Programmable Gate Arrays (FPGAs) FPGA Overview. Why FPGAs?
EECS150 - Digital Design Lecture 6 - Field Programmable Gate Arrays (FPGAs) September 12, 2002 John Wawrzynek Outline What are FPGAs? Why use FPGAs (a short history lesson). FPGA variations Internal logic
More informationAn automatic tool flow for the combined implementation of multi-mode circuits
An automatic tool flow for the combined implementation of multi-mode circuits Brahim Al Farisi, Karel Bruneel, João M. P. Cardoso and Dirk Stroobandt Ghent University, ELIS Department Sint-Pietersnieuwstraat
More informationCover TBD. intel Quartus prime Design software
Cover TBD intel Quartus prime Design software Fastest Path to Your Design The Intel Quartus Prime software is revolutionary in performance and productivity for FPGA, CPLD, and SoC designs, providing a
More informationRespin: Rethinking Near- Threshold Multiprocessor Design with Non-Volatile Memory
Respin: Rethinking Near- Threshold Multiprocessor Design with Non-Volatile Memory Computer Architecture Research Lab h"p://arch.cse.ohio-state.edu Universal Demand for Low Power Mobility Ba"ery life Performance
More informationCascaded Channel Model, Analysis, and Hybrid Decoding for Spin-Torque Transfer Magnetic Random Access Memory (STT-MRAM)
1/16 Cascaded Channel Model, Analysis, and Hybrid Decoding for Spin-Torque Transfer Magnetic Random Access Memory (STT-MRAM) Kui Cai 1, K.A.S Immink 2, and Zhen Mei 1 Advanced Coding and Signal Processing
More informationHybrid STT CMOS Designs for Reverse engineering Prevention
Hybrid STT CMOS Designs for Reverse engineering Prevention Theodore Winograd George Mason University Hassan Salmani* Howard University Hamid Mahmoodi San Francisco State University Kris Gaj George Mason
More informationAn FPGA Design And Implementation Framework Combined With Commercial VLSI CADs
An FPGA Design And Implementation Framework Combined With Commercial VLSI CADs ReCoSoC 2013 Qian Zhao Motoki Amagasaki Masahiro Iida Morihiro Kuga Toshinori Sueyoshi (, Japan) Background FPGA IP core development
More informationPhase Change Memory An Architecture and Systems Perspective
Phase Change Memory An Architecture and Systems Perspective Benjamin C. Lee Stanford University bcclee@stanford.edu Fall 2010, Assistant Professor @ Duke University Benjamin C. Lee 1 Memory Scaling density,
More informationA Spherical Placement and Migration Scheme for a STT-RAM Based Hybrid Cache in 3D chip Multi-processors
, July 4-6, 2018, London, U.K. A Spherical Placement and Migration Scheme for a STT-RAM Based Hybrid in 3D chip Multi-processors Lei Wang, Fen Ge, Hao Lu, Ning Wu, Ying Zhang, and Fang Zhou Abstract As
More informationA Routing Approach to Reduce Glitches in Low Power FPGAs
A Routing Approach to Reduce Glitches in Low Power FPGAs Quang Dinh, Deming Chen, Martin Wong Department of Electrical and Computer Engineering University of Illinois at Urbana-Champaign This research
More informationResearch Article FPGA Interconnect Topologies Exploration
International Journal of Reconfigurable Computing Volume 29, Article ID 259837, 13 pages doi:1.1155/29/259837 Research Article FPGA Interconnect Topologies Exploration Zied Marrakchi, Hayder Mrabet, Umer
More informationProgrammable Logic Devices
Programmable Logic Devices INTRODUCTION A programmable logic device or PLD is an electronic component used to build reconfigurable digital circuits. Unlike a logic gate, which has a fixed function, a PLD
More informationCompiler User Guide. Intel Quartus Prime Pro Edition. Updated for Intel Quartus Prime Design Suite: Subscribe Send Feedback
Compiler User Guide Intel Quartus Prime Pro Edition Updated for Intel Quartus Prime Design Suite: 18.0 Subscribe Send Feedback Latest document on the web: PDF HTML Contents Contents 1. Design Compilation...
More informationA Novel Net Weighting Algorithm for Timing-Driven Placement
A Novel Net Weighting Algorithm for Timing-Driven Placement Tim (Tianming) Kong Aplus Design Technologies, Inc. 10850 Wilshire Blvd., Suite #370 Los Angeles, CA 90024 Abstract Net weighting for timing-driven
More informationExploring Logic Block Granularity for Regular Fabrics
1530-1591/04 $20.00 (c) 2004 IEEE Exploring Logic Block Granularity for Regular Fabrics A. Koorapaty, V. Kheterpal, P. Gopalakrishnan, M. Fu, L. Pileggi {aneeshk, vkheterp, pgopalak, mfu, pileggi}@ece.cmu.edu
More informationDetailed Router for 3D FPGA using Sequential and Simultaneous Approach
Detailed Router for 3D FPGA using Sequential and Simultaneous Approach Ashokkumar A, Dr. Niranjan N Chiplunkar, Vinay S Abstract The Auction Based methodology for routing of 3D FPGA (Field Programmable
More informationBlock-Based Design User Guide
Block-Based Design User Guide Intel Quartus Prime Pro Edition Updated for Intel Quartus Prime Design Suite: 18.0 Subscribe Send Feedback Latest document on the web: PDF HTML Contents Contents 1. Block-Based
More informationNew Successes for Parameterized Run-time Reconfiguration
New Successes for Parameterized Run-time Reconfiguration (or: use the FPGA to its true capabilities) Prof. Dirk Stroobandt Ghent University, Belgium Hardware and Embedded Systems group Universiteit Gent
More informationChapter 5: ASICs Vs. PLDs
Chapter 5: ASICs Vs. PLDs 5.1 Introduction A general definition of the term Application Specific Integrated Circuit (ASIC) is virtually every type of chip that is designed to perform a dedicated task.
More informationBest Practices for Incremental Compilation Partitions and Floorplan Assignments
Best Practices for Incremental Compilation Partitions and Floorplan Assignments December 2007, ver. 1.0 Application Note 470 Introduction The Quartus II incremental compilation feature allows you to partition
More informationDesign and Implementation of FPGA Logic Architectures using Hybrid LUT/Multiplexer
Design and Implementation of FPGA Logic Architectures using Hybrid LUT/Multiplexer Krosuri Rajyalakshmi 1 J.Narashima Rao 2 rajyalakshmi.krosuri@gmail.com 1 jnarasimharao09@gmail.com 2 1 PG Scholar, VLSI,
More informationIntroduction to Partial Reconfiguration Methodology
Methodology This material exempt per Department of Commerce license exception TSU Objectives After completing this module, you will be able to: Define Partial Reconfiguration technology List common applications
More informationThe Engine. SRAM & DRAM Endurance and Speed with STT MRAM. Les Crudele / Andrew J. Walker PhD. Santa Clara, CA August
The Engine & DRAM Endurance and Speed with STT MRAM Les Crudele / Andrew J. Walker PhD August 2018 1 Contents The Leaking Creaking Pyramid STT-MRAM: A Compelling Replacement STT-MRAM: A Unique Endurance
More informationTestability Optimizations for A Time Multiplexed CPLD Implemented on Structured ASIC Technology
ROMANIAN JOURNAL OF INFORMATION SCIENCE AND TECHNOLOGY Volume 14, Number 4, 2011, 392 398 Testability Optimizations for A Time Multiplexed CPLD Implemented on Structured ASIC Technology Traian TULBURE
More informationCprE 583 Reconfigurable Computing
CprE / ComS 583 Reconfigurable Computing Prof. Joseph Zambreno Department of Electrical and Computer Engineering Iowa State University Lecture #9 Logic Emulation Technology Recap FPGA-Based Router (FPX)
More informationIntel Quartus Prime Pro Edition User Guide
Intel Quartus Prime Pro Edition User Guide Design Compilation Updated for Intel Quartus Prime Design Suite: 18.1 Subscribe Latest document on the web: PDF HTML Contents Contents 1. Design Compilation...
More informationdiscrete logic do not
Welcome to my second year course on Digital Electronics. You will find that the slides are supported by notes embedded with the Powerpoint presentations. All my teaching materials are also available on
More informationReduce Your System Power Consumption with Altera FPGAs Altera Corporation Public
Reduce Your System Power Consumption with Altera FPGAs Agenda Benefits of lower power in systems Stratix III power technology Cyclone III power Quartus II power optimization and estimation tools Summary
More information