An Interconnect-Centric Design Flow for Nanometer Technologies
|
|
- Jonathan Rodgers
- 6 years ago
- Views:
Transcription
1 An Interconnect-Centric Design Flow for Nanometer Technologies Professor Jason UCLA Computer Science Department Los Angeles, CA /~cong Gate Delays vs. Interconnect Delays Source: National Technology Roadmap of Semiconductors (1997) VLSI-TSA'99 Jason Cong 2
2 Interconnect-Centric Design Methodology Proposed transition interconnect device device interconnect device/function centric Analogy Data/Objects Programs interconnect/communication centric Programs Data/Objects VLSI-TSA'99 Jason Cong 3 Interconnect-Centric Design Flow Key steps in an interconnect-centric centric design flow: Interconnect Planning Interconnect Synthesis Interconnect Layout Other supporting tools to enable an interconnect- centric design flow Interconnect performance estimation Interconnect performance verification VLSI-TSA'99 Jason Cong 4
3 Outline of the Talk Interconnect Synthesis Interconnect Performance Estimation Interconnect Planning VLSI-TSA'99 Jason Cong 5 Constraints: Delay Skew Signal integrity... Interconnect Synthesis Optimized interconnect designs: Topology Sizing Spacing Buffer insertion Automatic solutions guided by accurate interconnect models VLSI-TSA'99 Jason Cong 6
4 Example: Single-Net Optimal Wire Sizing (OWS) [Cong-Leung, ICCAD 93] Given: A set of possible wire widths { W, W 1 2,, W r } Find: An optimal wire width assignment to minimize distributed RC delay Wiresizing Optimization Example: Global Interconnect Sizing and Spacing (GISS) [Cong et al, ICCAD 97] Sizing Given: Initial layout of multiple nets Critical sinks and their criticalities Capacitance model and design rules Output: Spacing Sizing and spacing of every net to minimize RC delays with consideration of coupling cap. VLSI-TSA'99 Jason Cong 8
5 Capacitance Model C x Ca c f 2.5D capacitance model [Cong Cong et al, DAC 97] Consider: C (area), C a f (fringing) and C x (coupling) Build capacitance table from 3D field solver (FastCap) Table lookup by interpolation and extrapolation Main Approaches to GISS Heuristic: Optimize one net at a time: bottom- up dynamic programming (optimal for one net) Better approach: Compute upper and lower bounds of opt. wire widths/spacings spacings of all nets Extended local refinement (ELR) using generalized CH-posynomial formulation Or iterative bound refinement (BR) In practice, lower and upper bounds meet most of time => optimal solution. VLSI-TSA'99 Jason Cong 10
6 GISS Optimization Results 16-bit 10mm bus structure equally spaced, with 5 different centerspacings from 2x to 5x min. pitch pitch = min. width + min.spacing Center spacing Average Delays(ns) MIN OWS GISS/S GISS/M 2 x pitch (-17%) 0.82 (-46%) 0.76 (-50%) 3 x pitch (-45%) 0.56 (-58%) 0.50 (-62%) 4 x pitch (-64%) 0.45 (-65%) 0.40 (-69%) 5 x pitch (-70%) 0.37 (-70%) 0.35 (-72%) For non-equal net weights, GISS/M shall have more advantage than GISS/S VLSI-TSA'99 Jason Cong 11 UCLA TRIO Package (Tree, Repeater, Interconnect Optimization) Synthesis/optimization capabilities Interconnect topology optimization Optimal buffer insertion Wiresizing optimization Global interconnect sizing and spacing Simultaneous driver, buffer, and interconnect sizing Simultaneous topology generation with buffer insertion and wiresizing... Efficient polynomial-time optimal/near-optimal algorithms Interconnect performance can be improved by up to 7x! Available on the web: Demo at DAC 99 VLSI-TSA'99 Jason Cong 12
7 Impact of Interconnect Optimization --For a 2cm Global Interconnect Using the TRIO Package Delay (ns) Technology (u m) 2cm DS 2cm BIS 2cm BISWS DS: Driver Sizing only BIS: Buffer Insertion and Sizing BISWS: Simultaneous Buffer Insertion/Sizing and Wiresizing 5x ~ 7x performance improvement! Interconnect Synthesis in Layout Design Flow Chip-planning, Floorplaning, Global Int. Planning and Optimization Timing Driven Placement Delay Budgeting Performance Driven Global Routing with Interconnect Optimization Detailed Routing with Variable Width and Spacing Topology Optimization Buffer Buffer Insertion Device Device Sizing Sizing Wiresizing Interconnect Optimizations Library (e.g. TRIO) VLSI-TSA'99 Jason Cong 14
8 Outline of the Talk Interconnect Synthesis Interconnect Performance Estimation Interconnect Planning VLSI-TSA'99 Jason Cong 15 Interconnect Performance Estimation G 0 G S 1 C s1 Sn Input S 2 C sn C s2 Problem: Estimate the optimized interconnect delay, area, etc., without actually running the optimization algorithms (such as TRIO)!
9 Needs for Interconnect Performance Estimation Models Efficiency need to explore many micro-architectures/ architectures/floorplans => require to process > 1 million nets/second cannot afford actual synthesis/optimization ( nets/second) Abstractionto to hide detailed design information granularity of wire segmentation number of wire widths, buffer sizes,... Explicit relationto to enable optimal design decision at high levels Result: very efficient (constant-time) time) estimation models for various interconnect optimization operations Example: Delay/Area Estimation under OWS Closed-form delay estimation formula T ows where α 1l 2α 1l ( Rd, l, CL) = + + Rdcf + 2 W ( α 2l) W( α 2l) 1 α 1 = 4 rca, 1 α 2 = 2 rc a RdCL W(x) is Lambert s W function defined as we Closed-form area estimation formula Rdrcacfl l w = x A ows ( Rd, l, CL) = r( cf l + 2C 2Rd ca L ) l
10 Delay Comparison of OWS model vs. TRIO ns Model TRIO length(um) OWS delay model consistently matches TRIO. 0.10um technology from NTRS 97. Driver is 100x min. To run TRIO, 40 discrete wire widths are used with the max width set to be 40x min width. VLSI-TSA'99 Jason Cong 19 Average Width (Area) Comparison width(um) length(um) Model TRIO Area estimation model for OWS almost exactly matches TRIO. VLSI-TSA'99 Jason Cong 20
11 Example: Delay Estimation Model for BIWS Problem: estimate interconnect delay with optimal buffer insertion and wire sizing (BIWS) Critical length for BIWS: threshold length over which buffer insertion provides additional delay reduction over optimal wire-sizing (OWS) Critical length for BIWS can be computed efficiently Critical Lengths of Un-Buffered Wires Technology (um) b=10x b=50x b=100x b=200x b=500x unit: mm With optimal wire sizing [Cong-Pan, IWLS 98/ASP-DAC 99] Min. WS Without wire sizing [Otten ISPD 98, Otten-Brayton DAC 98]
12 Example: Delay Estimation Model for BIWS (Cont d) Linear delay estimation model for BIWS: τbiws T biws = τbiws l + is the slope, and can be obtained from optimal wire sizing for critical length t g Comparison of BIWS Model vs. TRIO Delay Modeling Model TRIO ns length(um) n R d0 = r g /10, C L = c g x 10, buffer type is 100 x min. n For expt., max. wire width is 20x min. width, wire is segmented in every 100um.
13 Outline of the Talk Interconnect Synthesis Interconnect Performance Estimation Interconnect Planning VLSI-TSA'99 Jason Cong 25 Interconnect Planning Interconnect architecture planning (pre-design) Decide within freedom of fabrication technology: number of routing layers metal and isolation material at each layer thickness of each metal and isolation layer nominal width and spacing on each layer vertical interconnection schemes (via structure?)... Interconnect planning with RTL-floorplan Interconnect planning with physical-level level floorplan VLSI-TSA'99 Jason Cong 26
14 Interconnect Planning (cont d) Interconnect architecture planning (pre-design) Interconnect planning with RTL-floorplan Define global and local interconnects Estimate overall interconnect distribution Guide RTL-level and logic-level level synthesis/optimization Re-partition of design hierarchy Logic replication Retiming and pipelining... Interconnect planning with physical-level level floorplan VLSI-TSA'99 Jason Cong 27 Interconnect Planning (cont d) Interconnect architecture planning (pre-design) Interconnect planning with RTL-floorplan Interconnect planning with physical-level level floorplan Interconnect topologies Wire ordering Wire width and spacing Number of buffers and their locations VLSI-TSA'99 Jason Cong 28
15 Example: Optimal Wire-Width Planning Given: Certain technology Wire length distribution per layer Find: A small set of globally optimal widths per layer Performance/Area optimization Motivation Simplify interconnect optimization Simply detailed routing, layout extraction,... VLSI-TSA'99 Jason Cong 29 Overall Flow For each metal layer i Assign length range l min and l max ; Find a small set of optimal widths W to minimize l max r r Φ( W, l min, l max) = λ ( l) f ( W, l) dl l h f(w, l): the objective function to be minimized by the design for wire length l, using W hl (l): the weight function for wire lengthl Method: Analytical or numerical min
16 Objective in Our Study r r r j k f ( W, l) = A ( W, l) T ( W, l) A: area T: delay or r r f ( W, l) = T( W, l) f ( W r, l) A( W r, l) T ( W r 4 =, l) (performance only) (performance-driven and area-saving) Recommendation for Future Tech. 2-width design under objective function of AT 4 Wiring hierarchy for both performance and density! Technology (um) Tier1 Range (mm) W (um) Strawman Tier2 Range(mm) [Otten- W1(um) Brayton, W2(um) DAC 98] Tier3 Range(mm) W1(um) um W2(um) Tier4 Range(mm) W1(um) W2(um) um
17 Two Simple Wire Sizing Schemes 2.5 ns Tier1-1WS Tier1-2WS Tier1-OWS Tier4-1WS 0.5 Tier4-OWS length(um) 1-WS and 2-WS have less than 10% difference from OWS for length <4mm in Tier1 Both 1-WS and 2-WS work well in Tier4 up to chip size A Performance-Driven, Area-Saving Metric Opt. width for AT 4. Only increase delay by 10%, save area by 60%! metric AT^2 AT T AT^4 AT^3 width(um) Optimal width for delay T um tech; - Top layer pair; - Length range 8-23 mm; - Assume uniform distribution; - Metric: integral of T, AT, AT 2,, AT 4 - Driver/load 100x min gate
18 Experimental Setting For each metal pair (tier), assume certain wire length range Assume the max length in tier1 is 10,000x feature size, and top tier is L edge (chip dimension) [Fisher+ 98] Intermediate tier length range follows a geometric sequence mm Representative driver size for each metal layer (10x, 40x, 100x, and 250x for tiers 1-4) A Rather Surprising Result: 2 Widths /Per Layer are Sufficient! [DAC 99] pitch-sp=2um pitch-sp=2.9um pitch-sp=3.8um scheme avg-d max-erravg-w avg-d max-err avg-w avg-d max-err avg-w 1-width % % % width % % % 1.41 m-width % % % 1.38 Assumptions: 0.10 um process, layers 7&8 ( mm), under AT 4 metric, limited driver size variation size per layer 2-width design superior than 1-width delay reduction up to 12.4% area saving up to 48%! 2-width design comparable to many-width Avg. delay less than 5% and Max. delay less than 7% Area difference less than 4.7%
19 Paradigm shift Summary Device/function-centric centric => interconnect/communication-centriccentric Key components in an interconnect-centric centric design flow Interconnect planning Interconnect synthesis Interconnect layout Also need estimation, simulation, and verification tools at each stage for interconnect performance and signal integrity VLSI-TSA'99 Jason Cong 37 Acknowledgements Thanks for the supports from Semiconductor Research Corporation (SRC) National Science Foundation (NSF) Defense Advanced Research Project Agency (DARPA) Intel Corporation More information: /~cong VLSI-TSA'99 Jason Cong 38
20 Logic Volume within critical lengths - Defined as the number of min 2-input NAND gates that can be packed within the area of l c /2 * l c /2 Technology (um) NAND (um 2 ) b=10x b=50x b=100x b=200x b=500x unit: million Another Examp: Buffer Block Planning Buffer Blocks Logic Blocks Problem: automatically generates buffer blocks during physical-levellevel floorplan Motivation: Avoid buffer over hard IP-blocks Power/ground network sharing among buffers More regular layout, etc. VLSI-TSA'99 Jason Cong 40
21 Experimental Result: Number of BB Circuit RDM/RES RDM/FR BBP/RES BBP/FR Apte Xerox Hp Ami Ami playout RDM: a buffer is randomly assigned to a feasible location BBP: buffers are clustered appropriately RES: Restricted (delay-minimal) buffer insertion point FR: feasible buffer region for delay constraints Our buffer block planning (B -P) algorithm can reduce the number of buffer blocks to 1/10~1/20 of those from RDM VLSI-TSA'99 Jason Cong 41 Interconnect Layout Need a multi-layer layer general-area area router gridless flexible (variable widths within the same segment, variable spacings for each pair of nets) efficient Will leverage our current research on gridless routing Use of implicit graph representation Use of computational geometry techniques Highly scalable and flexible VLSI-TSA'99 Jason Cong 42
Interconnect Delay and Area Estimation for Multiple-Pin Nets
Interconnect Delay and Area Estimation for Multiple-Pin Nets Jason Cong and David Z. Pan UCLA Computer Science Department Los Angeles, CA 90095 Sponsored by SRC and Avant!! under CA-MICRO Presentation
More informationAn Interconnect-Centric Design Flow for Nanometer Technologies
An Interconnect-Centric Design Flow for Nanometer Technologies Jason Cong UCLA Computer Science Department Email: cong@cs.ucla.edu Tel: 310-206-2775 URL: http://cadlab.cs.ucla.edu/~cong Exponential Device
More informationAn Interconnect-Centric Design Flow for Nanometer Technologies. Outline
An Interconnect-Centric Design Flow for Nanometer Technologies Jason Cong UCLA Computer Science Department Email: cong@cs.ucla.edu Tel: 310-206-2775 http://cadlab.cs.ucla.edu/~cong Outline Global interconnects
More informationAn Interconnect-Centric Design Flow for Nanometer. Technologies
An Interconnect-Centric Design Flow for Nanometer Technologies Jason Cong Department of Computer Science University of California, Los Angeles, CA 90095 Abstract As the integrated circuits (ICs) are scaled
More informationS 1 S 2. C s1. C s2. S n. C sn. S 3 C s3. Input. l k S k C k. C 1 C 2 C k-1. R d
Interconnect Delay and Area Estimation for Multiple-Pin Nets Jason Cong and David Zhigang Pan Department of Computer Science University of California, Los Angeles, CA 90095 Email: fcong,pang@cs.ucla.edu
More informationRetiming & Pipelining over Global Interconnects
Retiming & Pipelining over Global Interconnects Jason Cong Computer Science Department University of California, Los Angeles cong@cs.ucla.edu http://cadlab.cs.ucla.edu/~cong Joint work with C. C. Chang,
More informationBuffer Block Planning for Interconnect Planning and Prediction
Buffer Block Planning for Interconnect Planning and Prediction Jason Cong, Tianming Kong and David Zhigang Pan y Department of Computer Science, University of California, Los Angeles, CA 90095 y IBM T.
More informationRegular Fabrics for Retiming & Pipelining over Global Interconnects
Regular Fabrics for Retiming & Pipelining over Global Interconnects Jason Cong Computer Science Department University of California, Los Angeles cong@cs cs.ucla.edu http://cadlab cadlab.cs.ucla.edu/~cong
More informationChallenges and Opportunities for Design Innovations in Nanometer Technologies
SRC Design Sciences Concept Paper Challenges and Opportunities for Design Innovations in Nanometer Technologies Jason Cong Computer Science Department University of California, Los Angeles, CA 90095 (E.mail:
More informationThermal-Aware 3D IC Physical Design and Architecture Exploration
Thermal-Aware 3D IC Physical Design and Architecture Exploration Jason Cong & Guojie Luo UCLA Computer Science Department cong@cs.ucla.edu http://cadlab.cs.ucla.edu/~cong Supported by DARPA Outline Thermal-Aware
More informationFloorplan and Power/Ground Network Co-Synthesis for Fast Design Convergence
Floorplan and Power/Ground Network Co-Synthesis for Fast Design Convergence Chen-Wei Liu 12 and Yao-Wen Chang 2 1 Synopsys Taiwan Limited 2 Department of Electrical Engineering National Taiwan University,
More informationInterconnect Design for Deep Submicron ICs
! " #! " # - Interconnect Design for Deep Submicron ICs Jason Cong Lei He Kei-Yong Khoo Cheng-Kok Koh and Zhigang Pan Computer Science Department University of California Los Angeles CA 90095 Abstract
More informationUCLA 3D research started in 2002 under DARPA with CFDRC
Coping with Vertical Interconnect Bottleneck Jason Cong UCLA Computer Science Department cong@cs.ucla.edu http://cadlab.cs.ucla.edu/ cs edu/~cong Outline Lessons learned Research challenges and opportunities
More informationLinking Layout to Logic Synthesis: A Unification-Based Approach
Linking Layout to Logic Synthesis: A Unification-Based Approach Massoud Pedram Department of EE-Systems University of Southern California Los Angeles, CA February 1998 Outline Introduction Technology and
More informationA buffer planning algorithm for chip-level floorplanning
Science in China Ser. F Information Sciences 2004 Vol.47 No.6 763 776 763 A buffer planning algorithm for chip-level floorplanning CHEN Song 1, HONG Xianlong 1, DONG Sheqin 1, MA Yuchun 1, CAI Yici 1,
More informationRetiming. Adapted from: Synthesis and Optimization of Digital Circuits, G. De Micheli Stanford. Outline. Structural optimization methods. Retiming.
Retiming Adapted from: Synthesis and Optimization of Digital Circuits, G. De Micheli Stanford Outline Structural optimization methods. Retiming. Modeling. Retiming for minimum delay. Retiming for minimum
More informationPushPull: Short Path Padding for Timing Error Resilient Circuits YU-MING YANG IRIS HUI-RU JIANG SUNG-TING HO. IRIS Lab National Chiao Tung University
PushPull: Short Path Padding for Timing Error Resilient Circuits YU-MING YANG IRIS HUI-RU JIANG SUNG-TING HO IRIS Lab National Chiao Tung University Outline Introduction Problem Formulation Algorithm -
More informationPhysical Design Closure
Physical Design Closure Olivier Coudert Monterey Design System DAC 2000 DAC2000 (C) Monterey Design Systems 1 DSM Dilemma SOC Time to market Million gates High density, larger die Higher clock speeds Long
More informationVdd Programmable and Variation Tolerant FPGA Circuits and Architectures
Vdd Programmable and Variation Tolerant FPGA Circuits and Architectures Prof. Lei He EE Department, UCLA LHE@ee.ucla.edu Partially supported by NSF. Pathway to Power Efficiency and Variation Tolerance
More informationSymmetrical Buffered Clock-Tree Synthesis with Supply-Voltage Alignment
Symmetrical Buffered Clock-Tree Synthesis with Supply-Voltage Alignment Xin-Wei Shih, Tzu-Hsuan Hsu, Hsu-Chieh Lee, Yao-Wen Chang, Kai-Yuan Chao 2013.01.24 1 Outline 2 Clock Network Synthesis Clock network
More informationOpenAccess In 3D IC Physical Design
OpenAccess In 3D IC Physical Design Jason Cong, Jie Wei,, Yan Zhang VLSI CAD Lab Computer Science Department University of California, Los Angeles Supported by DARPA and CFD Research Corp Outline 3D IC
More informationVariation Tolerant Buffered Clock Network Synthesis with Cross Links
Variation Tolerant Buffered Clock Network Synthesis with Cross Links Anand Rajaram David Z. Pan Dept. of ECE, UT-Austin Texas Instruments, Dallas Sponsored by SRC and IBM Faculty Award 1 Presentation Outline
More informationAn overview of standard cell based digital VLSI design
An overview of standard cell based digital VLSI design Implementation of the first generation AsAP processor Zhiyi Yu and Tinoosh Mohsenin VCL Laboratory UC Davis Outline Overview of standard cellbased
More informationCOE 561 Digital System Design & Synthesis Introduction
1 COE 561 Digital System Design & Synthesis Introduction Dr. Aiman H. El-Maleh Computer Engineering Department King Fahd University of Petroleum & Minerals Outline Course Topics Microelectronics Design
More informationSYNTHESIS FOR ADVANCED NODES
SYNTHESIS FOR ADVANCED NODES Abhijeet Chakraborty Janet Olson SYNOPSYS, INC ISPD 2012 Synopsys 2012 1 ISPD 2012 Outline Logic Synthesis Evolution Technology and Market Trends The Interconnect Challenge
More informationCHAPTER 1 INTRODUCTION
CHAPTER 1 INTRODUCTION Rapid advances in integrated circuit technology have made it possible to fabricate digital circuits with large number of devices on a single chip. The advantages of integrated circuits
More informationARCHITECTURE AND CAD FOR DEEP-SUBMICRON FPGAs
ARCHITECTURE AND CAD FOR DEEP-SUBMICRON FPGAs THE KLUWER INTERNATIONAL SERIES IN ENGINEERING AND COMPUTER SCIENCE ARCHITECTURE AND CAD FOR DEEP-SUBMICRON FPGAs Vaughn Betz Jonathan Rose Alexander Marquardt
More informationCluster-based approach eases clock tree synthesis
Page 1 of 5 EE Times: Design News Cluster-based approach eases clock tree synthesis Udhaya Kumar (11/14/2005 9:00 AM EST) URL: http://www.eetimes.com/showarticle.jhtml?articleid=173601961 Clock network
More informationFast Dual-V dd Buffering Based on Interconnect Prediction and Sampling
Based on Interconnect Prediction and Sampling Yu Hu King Ho Tam Tom Tong Jing Lei He Electrical Engineering Department University of California at Los Angeles System Level Interconnect Prediction (SLIP),
More informationUniversity of California at Berkeley. Berkeley, CA the global routing in order to generate a feasible solution
Post Routing Performance Optimization via Multi-Link Insertion and Non-Uniform Wiresizing Tianxiong Xue and Ernest S. Kuh Department of Electrical Engineering and Computer Sciences University of California
More informationMetal-Density Driven Placement for CMP Variation and Routability
Metal-Density Driven Placement for CMP Variation and Routability ISPD-2008 Tung-Chieh Chen 1, Minsik Cho 2, David Z. Pan 2, and Yao-Wen Chang 1 1 Dept. of EE, National Taiwan University 2 Dept. of ECE,
More informationAdvanced Surface Based MoM Techniques for Packaging and Interconnect Analysis
Electrical Interconnect and Packaging Advanced Surface Based MoM Techniques for Packaging and Interconnect Analysis Jason Morsey Barry Rubin, Lijun Jiang, Lon Eisenberg, Alina Deutsch Introduction Fast
More informationLarge Scale Circuit Partitioning
Large Scale Circuit Partitioning With Loose/Stable Net Removal And Signal Flow Based Clustering Jason Cong Honching Li Sung-Kyu Lim Dongmin Xu UCLA VLSI CAD Lab Toshiyuki Shibuya Fujitsu Lab, LTD Support
More informationClock Tree Resynthesis for Multi-corner Multi-mode Timing Closure
Clock Tree Resynthesis for Multi-corner Multi-mode Timing Closure Subhendu Roy 1, Pavlos M. Mattheakis 2, Laurent Masse-Navette 2 and David Z. Pan 1 1 ECE Department, The University of Texas at Austin
More informationGlobal Clustering-Based Performance-Driven Circuit Partitioning
Global Clustering-Based Performance-Driven Circuit Partitioning Jason Cong University of California at Los Angeles Los Angeles, CA 90095 cong@cs.ucla.edu Chang Wu Aplus Design Technologies, Inc. Los Angeles,
More informationReference. Wayne Wolf, FPGA-Based System Design Pearson Education, N Krishna Prakash,, Amrita School of Engineering
FPGA Fabrics Reference Wayne Wolf, FPGA-Based System Design Pearson Education, 2004 Logic Design Process Combinational logic networks Functionality. Other requirements: Size. Power. Primary inputs Performance.
More informationThree-Dimensional Integrated Circuits: Performance, Design Methodology, and CAD Tools
Three-Dimensional Integrated Circuits: Performance, Design Methodology, and CAD Tools Shamik Das, Anantha Chandrakasan, and Rafael Reif Microsystems Technology Laboratories Massachusetts Institute of Technology
More informationSymmetrical Buffer Placement in Clock Trees for Minimal Skew Immune to Global On-chip Variations
XXVII IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, OCTOBER 5, 2009 Symmetrical Buffer Placement in Clock Trees for Minimal Skew Immune to Global On-chip Variations Renshen Wang 1 Takumi Okamoto 2
More informationArchitecture and Synthesis for Multi-Cycle Communication
Architecture and Synthesis for Multi-Cycle Communication Jason Cong, Yiping Fan, Xun Yang, Zhiru Zhang Computer Science Department University of California, Los Angeles Los Angeles CA 90095 USA {cong,
More informationUnit 2: High-Level Synthesis
Course contents Unit 2: High-Level Synthesis Hardware modeling Data flow Scheduling/allocation/assignment Reading Chapter 11 Unit 2 1 High-Level Synthesis (HLS) Hardware-description language (HDL) synthesis
More informationCircuit Model for Interconnect Crosstalk Noise Estimation in High Speed Integrated Circuits
Advance in Electronic and Electric Engineering. ISSN 2231-1297, Volume 3, Number 8 (2013), pp. 907-912 Research India Publications http://www.ripublication.com/aeee.htm Circuit Model for Interconnect Crosstalk
More informationPOWER PERFORMANCE OPTIMIZATION METHODS FOR DIGITAL CIRCUITS
POWER PERFORMANCE OPTIMIZATION METHODS FOR DIGITAL CIRCUITS Radu Zlatanovici zradu@eecs.berkeley.edu http://www.eecs.berkeley.edu/~zradu Department of Electrical Engineering and Computer Sciences University
More informationJames Lin Vice President, Technology Infrastructure Group National Semiconductor Corporation CODES + ISSS 2003 October 3rd, 2003
Challenges for SoC Design in Very Deep Submicron Technologies James Lin Vice President, Technology Infrastructure Group National Semiconductor Corporation CODES + ISSS 2003 October 3rd, 2003 1 Contents
More informationPilot: A Platform-based HW/SW Synthesis System
Pilot: A Platform-based HW/SW Synthesis System SOC Group, VLSI CAD Lab, UCLA Led by Jason Cong Zhong Chen, Yiping Fan, Xun Yang, Zhiru Zhang ICSOC Workshop, Beijing August 20, 2002 Outline Overview The
More informationEE582 Physical Design Automation of VLSI Circuits and Systems
EE582 Prof. Dae Hyun Kim School of Electrical Engineering and Computer Science Washington State University Preliminaries Table of Contents Semiconductor manufacturing Problems to solve Algorithm complexity
More informationNoCIC: A Spice-based Interconnect Planning Tool Emphasizing Aggressive On-Chip Interconnect Circuit Methods
1 NoCIC: A Spice-based Interconnect Planning Tool Emphasizing Aggressive On-Chip Interconnect Circuit Methods V. Venkatraman, A. Laffely, J. Jang, H. Kukkamalla, Z. Zhu & W. Burleson Interconnect Circuit
More informationPlace and Route for FPGAs
Place and Route for FPGAs 1 FPGA CAD Flow Circuit description (VHDL, schematic,...) Synthesize to logic blocks Place logic blocks in FPGA Physical design Route connections between logic blocks FPGA programming
More informationPower dissipation! The VLSI Interconnect Challenge. Interconnect is the crux of the problem. Interconnect is the crux of the problem.
The VLSI Interconnect Challenge Avinoam Kolodny Electrical Engineering Department Technion Israel Institute of Technology VLSI Challenges System complexity Performance Tolerance to digital noise and faults
More informationProblem Formulation. Specialized algorithms are required for clock (and power nets) due to strict specifications for routing such nets.
Clock Routing Problem Formulation Specialized algorithms are required for clock (and power nets) due to strict specifications for routing such nets. Better to develop specialized routers for these nets.
More informationHAI ZHOU. Evanston, IL Glenview, IL (847) (o) (847) (h)
HAI ZHOU Electrical and Computer Engineering Northwestern University 2535 Happy Hollow Rd. Evanston, IL 60208-3118 Glenview, IL 60025 haizhou@ece.nwu.edu www.ece.nwu.edu/~haizhou (847) 491-4155 (o) (847)
More informationA New Methodology for Interconnect Parasitic Extraction Considering Photo-Lithography Effects
A New Methodology for Interconnect Parasitic Extraction Considering Photo-Lithography Effects Ying Zhou, Yuxin Tian, Weiping Shi Texas A&M University Zhuo Li Pextra Corporation Frank Liu IBM Austin Research
More informationCMP Model Application in RC and Timing Extraction Flow
INVENTIVE CMP Model Application in RC and Timing Extraction Flow Hongmei Liao*, Li Song +, Nickhil Jakadtar +, Taber Smith + * Qualcomm Inc. San Diego, CA 92121 + Cadence Design Systems, Inc. San Jose,
More informationSilicon Virtual Prototyping: The New Cockpit for Nanometer Chip Design
Silicon Virtual Prototyping: The New Cockpit for Nanometer Chip Design Wei-Jin Dai, Dennis Huang, Chin-Chih Chang, Michel Courtoy Cadence Design Systems, Inc. Abstract A design methodology for the implementation
More informationArchitecture-Level Synthesis for Automatic Interconnect Pipelining
Architecture-Level Synthesis for Automatic Interconnect Pipelining Jason Cong, Yiping Fan, Zhiru Zhang Computer Science Department University of California, Los Angeles, CA 90095 {cong, fanyp, zhiruz}@cs.ucla.edu
More informationPseudopin Assignment with Crosstalk Noise Control
598 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 20, NO. 5, MAY 2001 Pseudopin Assignment with Crosstalk Noise Control Chin-Chih Chang and Jason Cong, Fellow, IEEE
More informationCan Recursive Bisection Alone Produce Routable Placements?
Supported by Cadence Can Recursive Bisection Alone Produce Routable Placements? Andrew E. Caldwell Andrew B. Kahng Igor L. Markov http://vlsicad.cs.ucla.edu Outline l Routability and the placement context
More informationArchitecture Evaluation for
Architecture Evaluation for Power-efficient FPGAs Fei Li*, Deming Chen +, Lei He*, Jason Cong + * EE Department, UCLA + CS Department, UCLA Partially supported by NSF and SRC Outline Introduction Evaluation
More informationAn Overview of Standard Cell Based Digital VLSI Design
An Overview of Standard Cell Based Digital VLSI Design With examples taken from the implementation of the 36-core AsAP1 chip and the 1000-core KiloCore chip Zhiyi Yu, Tinoosh Mohsenin, Aaron Stillmaker,
More informationReducing Power in an FPGA via Computer-Aided Design
Reducing Power in an FPGA via Computer-Aided Design Steve Wilton University of British Columbia Power Reduction via CAD How to reduce power dissipation in an FPGA: - Create power-aware CAD tools - Create
More informationMapping-Aware Constrained Scheduling for LUT-Based FPGAs
Mapping-Aware Constrained Scheduling for LUT-Based FPGAs Mingxing Tan, Steve Dai, Udit Gupta, Zhiru Zhang School of Electrical and Computer Engineering Cornell University High-Level Synthesis (HLS) for
More informationOn Constructing Lower Power and Robust Clock Tree via Slew Budgeting
1 On Constructing Lower Power and Robust Clock Tree via Slew Budgeting Yeh-Chi Chang, Chun-Kai Wang and Hung-Ming Chen Dept. of EE, National Chiao Tung University, Taiwan 2012 年 3 月 29 日 Outline 2 Motivation
More informationHigh-Level Synthesis (HLS)
Course contents Unit 11: High-Level Synthesis Hardware modeling Data flow Scheduling/allocation/assignment Reading Chapter 11 Unit 11 1 High-Level Synthesis (HLS) Hardware-description language (HDL) synthesis
More informationSatisfiability Modulo Theory based Methodology for Floorplanning in VLSI Circuits
Satisfiability Modulo Theory based Methodology for Floorplanning in VLSI Circuits Suchandra Banerjee Anand Ratna Suchismita Roy mailnmeetsuchandra@gmail.com pacific.anand17@hotmail.com suchismita27@yahoo.com
More informationDouble Patterning Layout Decomposition for Simultaneous Conflict and Stitch Minimization
Double Patterning Layout Decomposition for Simultaneous Conflict and Stitch Minimization Kun Yuan, Jae-Seo Yang, David Z. Pan Dept. of Electrical and Computer Engineering The University of Texas at Austin
More informationMAPLE: Multilevel Adaptive PLacEment for Mixed Size Designs
MAPLE: Multilevel Adaptive PLacEment for Mixed Size Designs Myung Chul Kim, Natarajan Viswanathan, Charles J. Alpert, Igor L. Markov, Shyam Ramji Dept. of EECS, University of Michigan IBM Corporation 1
More informationProcessing Rate Optimization by Sequential System Floorplanning
Processing Rate Optimization by Sequential System Floorplanning Jia Wang Ping-Chih Wu Hai Zhou EECS Department Northwestern University Evanston, IL 60208, U.S.A. {jwa112, haizhou}@ece.northwestern.edu
More informationTHERMAL EXPLORATION AND SIGN-OFF ANALYSIS FOR ADVANCED 3D INTEGRATION
THERMAL EXPLORATION AND SIGN-OFF ANALYSIS FOR ADVANCED 3D INTEGRATION Cristiano Santos 1, Pascal Vivet 1, Lee Wang 2, Michael White 2, Alexandre Arriordaz 3 DAC Designer Track 2017 Pascal Vivet Jun/2017
More informationICS 252 Introduction to Computer Design
ICS 252 Introduction to Computer Design Placement Fall 2007 Eli Bozorgzadeh Computer Science Department-UCI References and Copyright Textbooks referred (none required) [Mic94] G. De Micheli Synthesis and
More informationDigital Design Methodology (Revisited) Design Methodology: Big Picture
Digital Design Methodology (Revisited) Design Methodology Design Specification Verification Synthesis Technology Options Full Custom VLSI Standard Cell ASIC FPGA CS 150 Fall 2005 - Lec #25 Design Methodology
More informationRoutability-Driven Repeater Block Planning for Interconnect-Centric Floorplanning
Routability-Driven Repeater Block Planning for Interconnect-Centric Floorplanning Probir Sarkar and Cheng-Kok Koh, Member, IEEE Abstract In this paper, we present a repeater block planning algorithm for
More informationPlanning for Local Net Congestion in Global Routing
Planning for Local Net Congestion in Global Routing Hamid Shojaei, Azadeh Davoodi, and Jeffrey Linderoth* Department of Electrical and Computer Engineering *Department of Industrial and Systems Engineering
More informationESE535: Electronic Design Automation. Today. LUT Mapping. Simplifying Structure. Preclass: Cover in 4-LUT? Preclass: Cover in 4-LUT?
ESE55: Electronic Design Automation Day 7: February, 0 Clustering (LUT Mapping, Delay) Today How do we map to LUTs What happens when IO dominates Delay dominates Lessons for non-luts for delay-oriented
More informationDigital Integrated Circuits A Design Perspective. Jan M. Rabaey
Digital Integrated Circuits A Design Perspective Jan M. Rabaey Outline (approximate) Introduction and Motivation The VLSI Design Process Details of the MOS Transistor Device Fabrication Design Rules CMOS
More informationBasic Idea. The routing problem is typically solved using a twostep
Global Routing Basic Idea The routing problem is typically solved using a twostep approach: Global Routing Define the routing regions. Generate a tentative route for each net. Each net is assigned to a
More informationDigital Design Methodology
Digital Design Methodology Prof. Soo-Ik Chae Digital System Designs and Practices Using Verilog HDL and FPGAs @ 2008, John Wiley 1-1 Digital Design Methodology (Added) Design Methodology Design Specification
More informationECO-system: Embracing the Change in Placement
Motivation ECO-system: Embracing the Change in Placement Jarrod A. Roy and Igor L. Markov University of Michigan at Ann Arbor Cong and Sarrafzadeh: state-of-the-art incremental placement techniques unfocused
More informationMinimizing Clock Domain Crossing in Network on Chip Interconnect
Minimizing Clock Domain Crossing in Network on Chip Interconnect Parag Kulkarni 1, Puneet Gupta 2, Rudy Beraha 3 1 Synopsys 2 UCLA 3 Qualcomm Corp. R&D Outline Motivation The Router Coloring Problem Approaches
More informationSimultaneous OPC- and CMP-Aware Routing Based on Accurate Closed-Form Modeling
Simultaneous OPC- and CMP-Aware Routing Based on Accurate Closed-Form Modeling Shao-Yun Fang, Chung-Wei Lin, Guang-Wan Liao, and Yao-Wen Chang March 26, 2013 Graduate Institute of Electronics Engineering
More informationRuntime Adaptation of Application Execution under Thermal and Power Constraints in Massively Parallel Processor Arrays
Runtime Adaptation of Application Execution under Thermal and Power Constraints in Massively Parallel Processor Arrays Éricles Sousa 1, Frank Hannig 1, Jürgen Teich 1, Qingqing Chen 2, and Ulf Schlichtmann
More informationDevice And Architecture Co-Optimization for FPGA Power Reduction
54.2 Device And Architecture Co-Optimization for FPGA Power Reduction Lerong Cheng, Phoebe Wong, Fei Li, Yan Lin, and Lei He Electrical Engineering Department University of California, Los Angeles, CA
More informationPhysical Design Implementation for 3D IC Methodology and Tools. Dave Noice Vassilios Gerousis
I NVENTIVE Physical Design Implementation for 3D IC Methodology and Tools Dave Noice Vassilios Gerousis Outline 3D IC Physical components Modeling 3D IC Stack Configuration Physical Design With TSV Summary
More informationFull-chip Routing Optimization with RLC Crosstalk Budgeting
1 Full-chip Routing Optimization with RLC Crosstalk Budgeting Jinjun Xiong, Lei He, Member, IEEE ABSTRACT Existing layout optimization methods for RLC crosstalk reduction assume a set of interconnects
More informationHardware Design Environments. Dr. Mahdi Abbasi Computer Engineering Department Bu-Ali Sina University
Hardware Design Environments Dr. Mahdi Abbasi Computer Engineering Department Bu-Ali Sina University Outline Welcome to COE 405 Digital System Design Design Domains and Levels of Abstractions Synthesis
More informationHigh-Level Synthesis
High-Level Synthesis 1 High-Level Synthesis 1. Basic definition 2. A typical HLS process 3. Scheduling techniques 4. Allocation and binding techniques 5. Advanced issues High-Level Synthesis 2 Introduction
More informationAn integrated placement and routing approach
Retrospective Theses and Dissertations 2006 An integrated placement and routing approach Min Pan Iowa State University Follow this and additional works at: http://lib.dr.iastate.edu/rtd Part of the Electrical
More informationINTEGRATION, the VLSI journal
INTEGRATION, the VLSI journal ] (]]]]) ]]] ]]] Contents lists available at ScienceDirect INTEGRATION, the VLSI journal journal homepage: www.elsevier.com/locate/vlsi Q Improved predictability, timing yield
More informationL14 - Placement and Routing
L14 - Placement and Routing Ajay Joshi Massachusetts Institute of Technology RTL design flow HDL RTL Synthesis manual design Library/ module generators netlist Logic optimization a b 0 1 s d clk q netlist
More informationIterative-Constructive Standard Cell Placer for High Speed and Low Power
Iterative-Constructive Standard Cell Placer for High Speed and Low Power Sungjae Kim and Eugene Shragowitz Department of Computer Science and Engineering University of Minnesota, Minneapolis, MN 55455
More informationTCG-Based Multi-Bend Bus Driven Floorplanning
TCG-Based Multi-Bend Bus Driven Floorplanning Tilen Ma Department of CSE The Chinese University of Hong Kong Shatin, N.T. Hong Kong Evangeline F.Y. Young Department of CSE The Chinese University of Hong
More informationFastPlace 2.0: An Efficient Analytical Placer for Mixed- Mode Designs
FastPlace.0: An Efficient Analytical Placer for Mixed- Mode Designs Natarajan Viswanathan Min Pan Chris Chu Iowa State University ASP-DAC 006 Work supported by SRC under Task ID: 106.001 Mixed-Mode Placement
More informationWHITE PAPER PARASITIC EXTRACTION FOR DEEP SUBMICRON AND ULTRA-DEEP SUBMICRON DESIGNS
WHITE PAPER PARASITIC EXTRACTION FOR DEEP SUBMICRON AND ULTRA-DEEP SUBMICRON DESIGNS TABLE OF CONTENTS Introduction.................................................................................. 1 Design
More informationLab. Course Goals. Topics. What is VLSI design? What is an integrated circuit? VLSI Design Cycle. VLSI Design Automation
Course Goals Lab Understand key components in VLSI designs Become familiar with design tools (Cadence) Understand design flows Understand behavioral, structural, and physical specifications Be able to
More informationIntegrated Floorplanning with Buffer/Channel Insertion for Bus-Based Microprocessor Designs 1
Integrated Floorplanning with Buffer/ for Bus-Based Microprocessor Designs 1 Faran Rafiq Intel Microlectronics Services, 20325 NW Von Neumann Dr. AG3-318, Beaverton, OR 97006 faran.rafiq@intel.com Malgorzata
More informationOvercoming Wireload Model Uncertainty During Physical Design
Overcoming Wireload Model Uncertainty During Physical Design Padmini Gopalakrishnan, Altan Odabasioglu, Lawrence Pileggi, Salil Raje Monterey Design Systems 894 Ross Drive, Suite, Sunnyvale, CA {padmini,
More information10. Interconnects in CMOS Technology
10. Interconnects in CMOS Technology 1 10. Interconnects in CMOS Technology Jacob Abraham Department of Electrical and Computer Engineering The University of Texas at Austin VLSI Design Fall 2017 October
More informationEEM870 Embedded System and Experiment Lecture 2: Introduction to SoC Design
EEM870 Embedded System and Experiment Lecture 2: Introduction to SoC Design Wen-Yen Lin, Ph.D. Department of Electrical Engineering Chang Gung University Email: wylin@mail.cgu.edu.tw March 2013 Agenda
More informationBuffered Steiner Trees for Difficult Instances
Buffered Steiner Trees for Difficult Instances C. J. Alpert 1, M. Hrkic 2, J. Hu 1, A. B. Kahng 3, J. Lillis 2, B. Liu 3, S. T. Quay 1, S. S. Sapatnekar 4, A. J. Sullivan 1, P. Villarrubia 1 1 IBM Corp.,
More informationProgrammable Logic Devices II
São José February 2015 Prof. Hoeller, Prof. Moecke (http://www.sj.ifsc.edu.br) 1 / 28 Lecture 01: Complexity Management and the Design of Complex Digital Systems Prof. Arliones Hoeller arliones.hoeller@ifsc.edu.br
More informationCAD Algorithms. Placement and Floorplanning
CAD Algorithms Placement Mohammad Tehranipoor ECE Department 4 November 2008 1 Placement and Floorplanning Layout maps the structural representation of circuit into a physical representation Physical representation:
More informationPicoServer : Using 3D Stacking Technology To Enable A Compact Energy Efficient Chip Multiprocessor
PicoServer : Using 3D Stacking Technology To Enable A Compact Energy Efficient Chip Multiprocessor Taeho Kgil, Shaun D Souza, Ali Saidi, Nathan Binkert, Ronald Dreslinski, Steve Reinhardt, Krisztian Flautner,
More information