Low energy and High-performance Embedded Systems Design and Reconfigurable Architectures

Similar documents
INTERNATIONAL JOURNAL OF PROFESSIONAL ENGINEERING STUDIES Volume VII /Issue 2 / OCT 2016

Synthesizable FPGA Fabrics Targetable by the VTR CAD Tool

HVSoCs: A Framework for Rapid Prototyping of 3-D Hybrid Virtual System-on-Chips

Implementing Tile-based Chip Multiprocessors with GALS Clocking Styles

A Novel Design Framework for the Design of Reconfigurable Systems based on NoCs

MARKET demands urge embedded systems to incorporate

More Course Information

Design methodology for multi processor systems design on regular platforms

Hardware Design Environments. Dr. Mahdi Abbasi Computer Engineering Department Bu-Ali Sina University

FPGA for Complex System Implementation. National Chiao Tung University Chun-Jen Tsai 04/14/2011

FPGA based Design of Low Power Reconfigurable Router for Network on Chip (NoC)

Development of tools supporting. MEANDER Design Framework

Hardware Modeling using Verilog Prof. Indranil Sengupta Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur

VLSI Design Automation

ProASIC PLUS FPGA Family

ASIC Physical Design Top-Level Chip Layout

FABRICATION TECHNOLOGIES

SPARK: A Parallelizing High-Level Synthesis Framework

ECE 459/559 Secure & Trustworthy Computer Hardware Design

COE 561 Digital System Design & Synthesis Introduction

An Efficient Flexible Architecture for Error Tolerant Applications

Designing Heterogeneous FPGAs with Multiple SBs *

From Temporal Partitioning and Temporal Placement to Algorithmic Skeletons

Hardware Software Codesign of Embedded Systems

Managing Dynamic Reconfiguration Overhead in Systems-on-a-Chip Design Using Reconfigurable Datapaths and Optimized Interconnection Networks

ECEN 449 Microprocessor System Design. FPGAs and Reconfigurable Computing

VLSI Design Automation

Processor Architectures At A Glance: M.I.T. Raw vs. UC Davis AsAP

Workspace for '4-FPGA' Page 1 (row 1, column 1)

Evolution of CAD Tools & Verilog HDL Definition

The Design of the KiloCore Chip

VLSI Design Automation. Calcolatori Elettronici Ing. Informatica

A Methodology and Tool Framework for Supporting Rapid Exploration of Memory Hierarchies in FPGAs

DESIGN AND IMPLEMENTATION OF 32-BIT CONTROLLER FOR INTERACTIVE INTERFACING WITH RECONFIGURABLE COMPUTING SYSTEMS

EEM870 Embedded System and Experiment Lecture 2: Introduction to SoC Design

FPGA: What? Why? Marco D. Santambrogio

Hardware Software Codesign of Embedded System

On Supporting Adaptive Fault Tolerant at Run-Time with Virtual FPGAs

MOJTABA MAHDAVI Mojtaba Mahdavi DSP Design Course, EIT Department, Lund University, Sweden

Systems Development Tools for Embedded Systems and SOC s

System-on-Chip Architecture for Mobile Applications. Sabyasachi Dey

Digital Integrated Circuits A Design Perspective. Jan M. Rabaey

Graphics: Alexandra Nolte, Gesine Marwedel, Universität Dortmund. RTL Synthesis

All MSEE students are required to take the following two core courses: Linear systems Probability and Random Processes

Application-Specific Mesh-based Heterogeneous FPGA Architectures

DIGITAL DESIGN TECHNOLOGY & TECHNIQUES

Design Methodologies and Tools. Full-Custom Design

An Overview of Standard Cell Based Digital VLSI Design

Intro to High Level Design with SystemC

Digital Design Methodology

An overview of standard cell based digital VLSI design

ECE 448 Lecture 15. Overview of Embedded SoC Systems

VLSI Design of Multichannel AMBA AHB

Abbas El Gamal. Joint work with: Mingjie Lin, Yi-Chang Lu, Simon Wong Work partially supported by DARPA 3D-IC program. Stanford University

ECE 111 ECE 111. Advanced Digital Design. Advanced Digital Design Winter, Sujit Dey. Sujit Dey. ECE Department UC San Diego

Fast implementation and fair comparison of the final candidates for Advanced Encryption Standard using Field Programmable Gate Arrays

Research Article Architecture-Level Exploration of Alternative Interconnection Schemes Targeting 3D FPGAs: A Software-Supported Methodology

Designing with VHDL and FPGA

DFT-3D: What it means to Design For 3DIC Test? Sanjiv Taneja Vice President, R&D Silicon Realization Group

DRAF: A Low-Power DRAM-based Reconfigurable Acceleration Fabric

Introduction to System-on-Chip

What is Xilinx Design Language?

Energy-Efficient Dynamic Memory Allocators at the Middleware Level of Embedded Systems

Programmable Logic Devices HDL-Based Design Flows CMPE 415

High performance, power-efficient DSPs based on the TI C64x

Co-synthesis and Accelerator based Embedded System Design

Towards Performance Modeling of 3D Memory Integrated FPGA Architectures

Digital Design Methodology (Revisited) Design Methodology: Big Picture

VLSI Design Automation. Maurizio Palesi

Programmable Logic Devices

EECS 244 Computer-Aided Design of Integrated Circuits and Systems

Calibrating Achievable Design GSRC Annual Review June 9, 2002

Today. Comments about assignment Max 1/T (skew = 0) Max clock skew? Comments about assignment 3 ASICs and Programmable logic Others courses

A Process Model suitable for defining and programming MpSoCs

A Reconfigurable Crossbar Switch with Adaptive Bandwidth Control for Networks-on

Multi processor systems with configurable hardware acceleration

DRAF: A Low-Power DRAM-based Reconfigurable Acceleration Fabric

Embedded Systems. 7. System Components

The Xilinx XC6200 chip, the software tools and the board development tools

ARM Processors for Embedded Applications

Session: Configurable Systems. Tailored SoC building using reconfigurable IP blocks

Implementation of Ripple Carry and Carry Skip Adders with Speed and Area Efficient

Energy Efficient Computing Systems (EECS) Magnus Jahre Coordinator, EECS

Systematic Dynamic Memory Management Design Methodology for Reduced Memory Footprint

Dynamic Memory Management Design Methodology for Reduced Memory Footprint in Multimedia and Wireless Network Applications

3D TECHNOLOGIES: SOME PERSPECTIVES FOR MEMORY INTERCONNECT AND CONTROLLER

NANOELECTRONICS AND DIGITAL TECHNOLOGY ENABLING DISRUPTIVE INNOVATION LUC VAN DEN HOVE - PRESIDENT AND CEO

A CAD Framework for MALIBU: An FPGA with Time-multiplexed Coarse-Grained Elements. David Grant

Lab. Course Goals. Topics. What is VLSI design? What is an integrated circuit? VLSI Design Cycle. VLSI Design Automation

IWES st Italian Workshop on Embedded Systems Pisa September 2016

Microarchitecture Overview. Performance

Electronic Design Automation Prof. Indranil Sengupta Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur

HW and SW Architectures for Over-The-Air Dynamic Reconfiguration by Software Download

Reconfigurable Computing. Introduction

Hardware/Software Codesign

Integrating MRPSOC with multigrain parallelism for improvement of performance

Configurable Processors for SOC Design. Contents crafted by Technology Evangelist Steve Leibson Tensilica, Inc.

Automated RTR Temporal Partitioning for Reconfigurable Embedded Real-Time System Design

Design & Implementation of 64 bit ALU for Instruction Set Architecture & Comparison between Speed/Power Consumption on FPGA.

Chapter 5: ASICs Vs. PLDs

Transcription:

Low energy and High-performance Embedded Systems Design and Reconfigurable Architectures Ass. Professor Dimitrios Soudris School of Electrical and Computer Eng., National Technical Univ. of Athens, Greece dsoudris@microlab.ntua.gr 1

What about Europe? 2

MicroLab: : Profile 3 Professors o o o Prof. Kiamal Pekmestzi Ass. Prof. Dimitrios Soudris Lect. George Economakos 4 Post-doc researchers 7 Ph.D. Students Infrastructure: Hardware, Software and CAD tools Main research topics Embedded systems design, System level design, specification and modelling, Reconfigurable Architectures, Hardware (ASIC & FPGA) implementation of telecom systems and DSP apps 3D IC Integration Cooperation with leading universities, institutes and industry Funding from National, Industrial and EU projects. More 55 projects Running 7th-IST projects: MNEMEE www.mnemee.org and MOSART www.mosart-project.org 3

MicroLab: AWARDS VLSI 2005 Award of 2,000$ sponsored by Intel and IBM, Arizona, USA LPGD project SAMOS 2007 3 rd award 10,000 euros 1st award 5,000 euros 2nd award 3,000 euros AHS 2007 4-th Position for AMDREL project 4

Cooperation Academia & Industry Academia/Institutes IMEC, Leuven, Belgium (strategic partnership) [>10 years!!] TUD: Delft Technical University, Dept. Electrical and Computer Engineering Univ. of Roechester, USA University of Dortmund, Computer Science, Embedded Systems Group Technical Univ. of Eidhoven, The Netherlands University of Gent, Belgium Royal Institute of Technology (KTH) Industry ST Microelctronics, Milano, Italy THALES, France CSEM: Centre Suisse d'electronique et de Microtechnique, Neuchâtel, Switzerland CoWare, Belgium ARTERIS, Paris, France ΙΝΤΡΑCΟΜ Telecom Solutions 5

Embedded Systems 6

Motivation - Vision Current multimedia and wireless network applications Very complex => designed with high-level languages Dynamic Memory (DM) required => several sources of unpredictability: user movements? image features? Final platforms: Portables: limited Resources (e.g. memory, cpu, power) Low power and high performance 7

Methodology for Dynamic Data Type Refinement Management Multimedia and Network Applications Significant improvements in energy consumption >80% Trade-offs of memory footprint, performance, energy consumed in DDT Refinement are possible 8

Custom Dynamic Memory Manager Construction Wireless LAN Application Application: DRR Boundary Tags for every Memory Block Coalescing Memory Blocks Next Fit Algorithm One Pool per Size FIFO Block Order within Pools PROFILING TOOL Physical Memory 9

Customized Dynamic Memory Manager: Reduced Power Consumption Single-functioned Executes a single program, repeatedly Tightly-constrained Low cost, low power, small, fast, etc. Reactive and real-time 84% more Continually than reacts to changes in the system s AMDREL s environment 25% more Must compute certain than results in real-time without delay AMDREL 10

Results for Infineon EasyPort Lower fragmentation level than Lea 2.7.2 Higher performance than the WinXP allocator model 11

Coarse Grained Reconfigurable Architectures For High Performance - DSP Datapath Synthesis Automated Code Mapping and Synthesis C-to-Synthesized Netlist Tool Flow DSP Datapath Synthesis Exploiting 1. Instruction Level Parallelism 2. Inter-Tile Pipelined Execution 3. Intra-Tile Operation Chaining 4. Fast Operation Level Reconfigurability We achieved: Hardware Flexibility + High Performance + Low Area Up to 35% Execution Latency Gains Over CRISP slice Up to 40% Area Gains Over CRISP slice 12

Low Energy FPGA layout Fine-grain Specs -8X8 array -Area: 5.8 X 6.1mm^2 -Up to 330MHz(register-toregister delay) -0.18 um CMOS STM -6 metal layers -Core Power Supply: 1.8V -LSE configuration time: 42ns -Full Configuration time: 2.9us -20 tracks on routing channels I/O PINS -96 Data I/Os -1 Global Reset -1 Global Clock -12-bit Configuration Address Bus -16-bit Configuration Data Bus - Power/Ground pins 13

Design 3D FPGA architectures Architecture level exploration of FPGAs Design an interconnection network based on the connectivity demands Temperature/Power Aware Placement and Routing Alternative interconnection schemes for 3D vias 3D stack: Reduced system size Short interconnects Reduced packaging cost Lower power consumption Higher perfomance 14

CAD Tools for 2D and 3D reconfigurable architectures: MEANDER Design Framework Application description in HDL Synthesis 2D Flow Technology Mapping 3D Flow 2D or 3D Architecture? 2D architecture EX-VPR 3DPRO 3D architecture library 2D P&R PowerModel 3DPower library 3D P&R Bitstream generation Available for on-line execution at http://vlsi.ee.duth.gr/amdrel 15

Case study: MPEG4 Logic Block Memory Block Dual-Port Memory 16 60

MPEG4 implementation: (a) in a 2-D 2 die and into a 3D IC with three dies 2-D die Output from SoCEncounter Cadence tool 17

Current and Future research trends MicroLab Roadmap Static & Dynamic multimedia and network applications System Level modeling, exploration and optimization Run-time resource management Reconfigurable Computing APPLICATIONS METHODOLOGIES AND TOOLS Multi-core architecture Network-on on-chip interconnections FPGAS 3D integrated circuits PLATFORMS SILICON TECHNOLOGY 18

Thank you very much!!! 19