Intel Corporation. Software Development Environment for Reconfigurable Communications Architecture Intel Corporation.

Size: px
Start display at page:

Download "Intel Corporation. Software Development Environment for Reconfigurable Communications Architecture Intel Corporation."

Transcription

1 Software Development Environment for Reconfigurable Communications Architecture Vladimir Ivanov Radio Communications Lab/Corporate Technology Group Contributor: Vicki Tsai Radio Communications Lab/Corporate Technology Group Reconfigurable Computing Tutorial International Symposium on System-on-Chip Conference Tampere, Finland Intel Corporation 18 November 2003 Outline RCA Review What are the specific architectural features which impact software development tools? Programming flow How do specific architectural features impact software development process? Software Development Environment Goals and Challenges Process specifics System-level issues Development Environment Concept Algorithm and compiler point of view 2 1

2 RCA Review What are the specific architectural features which impact software development tools? RCA Review Scalable mesh interconnect of heterogeneous processing elements (s) Interconnect with Nearest Neighbour Mesh Clock frequency dependent on load and process 4 2

3 RCA Review Ubiquitous wireless communication across multiple protocols UMAC UMAC 2 1 UMAC 3 D D 1 2 A I.E. B IO (EC) C IO (EC) D IO (EC) E 1 2 Ultra-wideband WPAN WCDMA WWAN IO (AFE 1) IO (AFE 2) IO (AFE 3) 4 A B C D E CMOS AFE 1 CMOS AFE 2 CMOS AFE a WLAN Figure source Intel research and development A scalable mesh interconnect of heterogeneous processing elements (s): Configurable basebands for multiple (concurrent) PHY/MAC operation Power and Size conserving when compared to multiple dedicate d cores or traditional SDR (S/W defined radio) approaches Tools for simple programming and portability to different arrays of elements 5 Programming Flow How do specific architectural features impact the software development process? 3

4 1. Divide the protocol into modes Preamble Detect: Diversity Selection: Steady-State Data: Each mode refers to a different, non-overlapping period in time 7 2. Partitioning Specify functions for each mode Preamble Detect: AFE1 (ant. 1) AGC Dec. Filter Preamble Det. Diversity Selection: AFE1 (ant. 1) AGC Dec. Filter SNR Calc. Diversity Sel. AFE1 (ant. 2) AGC Dec. Filter SNR Calc. Steady-State Data: AFE1 (ant. 1) Dec. Filter AFC Fixed IQ Imb. Corr. Guard Int. Removal Host IO Descram. Viterbi Deinter. QAM Demap Adapt. IQ Imb Corr FEQ 64-Pt FFT Note: This description is function based and not hardware based. 8 4

5 3. Communication Establish communication structure among functions Preamble Detect: AFE1 (ant. 1) AGC Dec. Filter Preamble Det. Diversity Selection: AFE1 (ant. 1) AGC Dec. Filter SNR Calc. Diversity Sel. AFE1 (ant. 2) AGC Dec. Filter SNR Calc. Steady-State Data: AFE1 (ant. 1) Dec. Filter AFC Fixed IQ Imb. Corr. Guard Int. Removal Host IO Descram. Viterbi Deinter. QAM Demap Adapt. IQ Imb Corr FEQ 64-Pt FFT 9 4. Aggregation Determine onto which types the functions could be mapped Preamble Detect: AFE1 (ant. 1) AGC Dec. Filter Preamble Det. typea typeb typec typed typee Diversity Selection: AFE1 (ant. 1) AGC Dec. Filter SNR Calc. Diversity Sel. AFE1 (ant. 2) AGC Dec. Filter SNR Calc. Steady-State Data: AFE1 (ant. 1) Dec. Filter AFC Fixed IQ Imb. Corr. Guard Int. Removal Host IO Descram. Viterbi Deinter. QAM Demap Adapt. IQ Imb Corr FEQ 64-Pt FFT 10 5

6 5. Check if resources available for the current hardware layout Preamble Detect: AFE1 (ant. 1) AGC Dec. Filter Preamble Det. typea typeb typec typed typee Diversity Selection: AFE1 (ant. 1) AGC Dec. Filter SNR Calc. Diversity Sel. AFE1 (ant. 2) AGC Dec. Filter SNR Calc. Steady-State Data: AFE1 (ant. 1) Dec. Filter AFC Fixed IQ Imb. Corr. Guard Int. Removal Host IO Descram. Viterbi HW topology Deinter. QAM Demap Adapt. IQ Imb Corr FEQ 64-Pt FFT B A Resource Usage (%): typeb typea typee typed typec 11 C C E D C D 6. Mapping Place functions onto specific s Preamble Detect: AFE1 (ant. 1) AGC Dec. Filter Preamble Det. Diversity Selection: AFE1 (ant. 1) AGC Dec. Filter SNR Calc. Diversity Sel AFE1 (ant. 2) AGC Dec. Filter SNR Calc. Steady-State Data: AFE1 (ant. 1) Dec. Filter AFC Fixed IQ Imb. Corr. Guard Int. Removal Host IO Descram. Viterbi Deinter. QAM Demap Adapt. IQ Imb Corr FEQ 64-Pt FFT 12 6

7 7. Generate code for this mapping Preamble Detect: AFE1 (ant. 1) AGC Dec. Filter Preamble Det. Diversity Selection: AFE1 (ant. 1) AGC Dec. Filter SNR Calc. Diversity Sel AFE1 (ant. 2) AGC Dec. Filter SNR Calc. Steady-State Data: AFE1 (ant. 1) Dec. Filter AFC Fixed IQ Imb. Corr. Guard Int. Removal Host IO Descram. Viterbi Deinter. QAM Demap Adapt. IQ Imb Corr FEQ 64-Pt FFT Check if desired performance has been reached HW topology Stimulus Data System Profiler Performance results If desired performance has been met, output the binary images. Otherwise, use the results to adjust the mapping and go to Step 2 or 4 or

8 Programming Flow Summary 1. Divide the protocol into modes 2. Specify functions for each mode Programmer 3. Establish communication structure among functions 4. Determine onto what types the functions could be Tools mapped 5. Check if we have the resources in the hardware 6. Place functions onto specific s 7. Generate code for this mapping If code cannot be generated because the cannot fit the assigned functions, try a different mapping 8. Check if desired performance has been reached If not, try a different mapping Otherwise, output the generated code from Step 6 15 Software Development Environment Goals and Challenges Process specifics System-level issues 8

9 Tools Goals Primary goal is to assure development of effective code for RCA Developed code should effectively use all RCA capabilities Implemented protocols should meet users requirements Abstract code development from hardware If the number of total s change or the number of s of a certain type change, the algorithm does not need to be altered Give reasonable programming abstraction level for the programmer Provide effective environment for development, debugging and testing of software 17 Tools Challenges Reasonable balance for abstracting software development from hardware Classical challenges for parallel architecture Decomposition of program into parallel processes Effective mapping of processes to s Effective communication among processes Synchronization among processes Protocol concurrency implies dynamic RCA resource distribution among protocols Heterogeneity of s mesh Variety of Processing Elements (s) s may not be processor-based Methods to program s differ greatly Guaranteed protocol performance Effective data visualization from multiple s High performance simulation of RCA 18 9

10 Software Development Process Algorithm START redevelopment Redevelopment Algorithm development Source code development Tools for algorithm development Tools for source code development No Program code translation Translation tools, Linkage tools Debugging Debugger, Simulator Performance measurement Profiler Meets user s reqs? Yes Testing AWARD 19 Software Development Environment Hardware Software description description Algorithm and source code development tools Descriptions editor Process diagram editor Source code of processes IDE Source Code Editor Source Source etc etc. Descriptions Translator Specific tools Map Directives Parsed Descriptions (XML) Makefile Translation Translation tool tool Librarian Relocatable Relocatable Execution statistics Mapper Relocatable images of processes User Constraints Processes Layout (XML) Library Library Translation and linkage tools Packager creates Makefile in accordance with layout scheme and runs make for the loadable image building Packager Loadable Makefile Linkage tool Linkage tool Simulation/ Execution Loader 20 10

11 Input Example myfn in data In0 Out0 (Auto) realfir In0 Out0 (Auto) L In0 myfn3 Out0 In1 (Auto) out data myfn.c myfn(int16 in0[], int16 out0[]) { int16 i,x; for (i=0; i<in1len; i++) { x=in1[i] * in1[i]; send_output(0,x); } } myfn2 In0 Out0 ( typea) myfn2.ccs _myfn2:... X0:X7=*P0++8 Y0:Y7=*P1++8 M0=X0*Y0 M1=X1*Y1 M2=X2*Y2 M3=X3*Y3 M4=X4*Y4 M5=X5*Y5 M6=X6*Y6 M7=X7*Y7 A00=M0+M1 A20=M2+M3 A4=M4+M5 A6=M6+M7;....DONE 21 System Simulator Cycle accurate simulation High performance Allow to evaluate latency and computational overhead Possibility to connect two instances of the System Simulator to each other Provide debugging facilities 22 11

12 System Simulator SysSim contains Simulator Core (SC) and Individual Simulators (IS) Two abstraction layers for IS representation High level object Scheduled Object Object design principle: If being in state S1 and got an input signal In than after delay D change the state to S2 and produce an output signal Out Hardware Configuration File Debug queries Debugger User Application JTAG Host Data Driver Control control Debug events / RCA Device Driver responses JTAG Host Data Control RCA System Simulator Simulator Core AFE Data (to data files or to another AFE Data Host Data instance of the Simulator) Individual simulator Scheduled Objects Layer: efficient cycle-accurate scheduling 23 Simulation Performance Source D flip-flop D flip-flop Destination N instances Comparing SystemC core and SysSim core SC_METHOD process was used for SystemC Simulated object is N instances of D flip-flop flop objects Simulation on Intel 2.4 GHz Pentium 4 4*4 Mesh (~1000 objects), 400 MHz 1 sec simulation takes ~100 hours for SystemC Core and ~13 hours for SySim Core Simulation time (sec N of scheduled objects CTL core SystemC core 24 12

13 Development Environment Concept Algorithm and Compiler point of view Tools Development Concepts Naive Phase : Manual program partitioning Manual code optimization Independent compiler tools Static hardware and software Mature Phase: Automatic program partitioning Automatic code optimization Common compiler tools Static hardware and software Advanced Phase: Macro architecture description tools Automatic generation of micro architecture description Automatic software tools generation Protocol partitioning for joint hardware -software optimization 26 13

14 Tools Development Naive Phase Enhanced Traditional Model Networking (communication architecture) Mapping (distributable compilation) Traditional tool-suite for RCA Complete development tool-suite Integration of tools for sequential programming Solution constraints Aided mapping (user-defined mapping of process to ) 27 Enhanced Traditional Model source C source source code for for ii code for C C Compiler Object Object 1 module Linker Executable module Assembly code for for jj code for FMCA C Compiler Assembler Object Object Object module 22 module Linker Executable module Specialized code for VMCA Configurator Reconfiguration vector Linker Executable module Make directives Link directives Description Translator Tiny Mapper RCA Linker Loadable image RCA Simulator 28 Debugger 14

15 Tools Development Mature Phase True distributable compilation Automated mapping Global optimization Intermodule optimization Optimization on heterogeneous environment Enhanced development tools C C Compiler with high-level IR generation High-level IR Linker Retargetable Code Generator 29 Distributable Compilation Architecture source C source source code code Assembly code code Specialized code code C Front-End Assembler Configurator IR IR 1 IR 1 IR IR 2 IR 2 IR IR IR 3 IR Libs IR Linker General IR Mapper IR for 1 IR for 2 IR for 3 CG for 1 CG for 2 CG for 3 Object module 1 Object module 2 Object module 3 Obj Libs Communications Technology 30 Lab 15

16 Tools Development Advanced Phase Distributable compilation Retargetable development tools Retargetable C Compiler (tunable CG and optimization) Retargetable Assembler (target architecture templates) Retargetable Simulator (for RCA configurations) Comprehensive Target Descriptive Language Target Tools Generator HDL code generation Joint hardware and software optimization 31 Co-Design Architecture Software Design Source Code for RCA Tools Generator RCA Hardware Design Comprehensive Target Description HDL Output High Level IR Target Representation Target Tools C Compiler CGi Assemblers RCA Simulator HDL (VHDL) Debugger 32 16

17 Summary RCA programming process characteristics Parallel running processes with message exchange Procedure level parallelism Partitioning-communication-aggregation-mapping based optimization cycle RCA software development env contains standard set of tools for Algorithm and source code development Source code translation and linking RCA software development environment contains specific set of tools for the optimization cycle 3 phases of software tools development Main goal of the naive and mature phases is to assure (manually or automatically) program code effectiveness Main goal of advanced phase is to assure joint hardware- software effectiveness of PHY/MAC algorithms implementation 33 Acknowledgements Ernest Tsui, Vladimir Pudovkin, Vladimir Pavlov, Sergey Mironov, Veronica Mikheeva, Tony Chun, Michael K. Chen 34 17

A Software LDPC Decoder Implemented on a Many-Core Array of Programmable Processors

A Software LDPC Decoder Implemented on a Many-Core Array of Programmable Processors A Software LDPC Decoder Implemented on a Many-Core Array of Programmable Processors Brent Bohnenstiehl and Bevan Baas Department of Electrical and Computer Engineering University of California, Davis {bvbohnen,

More information

OpenRadio. A programmable wireless dataplane. Manu Bansal Stanford University. Joint work with Jeff Mehlman, Sachin Katti, Phil Levis

OpenRadio. A programmable wireless dataplane. Manu Bansal Stanford University. Joint work with Jeff Mehlman, Sachin Katti, Phil Levis OpenRadio A programmable wireless dataplane Manu Bansal Stanford University Joint work with Jeff Mehlman, Sachin Katti, Phil Levis HotSDN 12, August 13, 2012, Helsinki, Finland 2 Opening up the radio Why?

More information

Programmable Logic Devices

Programmable Logic Devices Programmable Logic Devices INTRODUCTION A programmable logic device or PLD is an electronic component used to build reconfigurable digital circuits. Unlike a logic gate, which has a fixed function, a PLD

More information

HW & SW co-verification of baseband HSPA Processor with Seamless PSP

HW & SW co-verification of baseband HSPA Processor with Seamless PSP HW & SW co-verification of baseband HSPA Processor with Seamless PSP Zheng Li, Xuedong Yang, Bing Wang, Zhitao Lu, Lawrence Yang, James Gualdoni, Jagan Raghavendran Steven Swanchara, William Hinkel, Scott

More information

Processor Architectures At A Glance: M.I.T. Raw vs. UC Davis AsAP

Processor Architectures At A Glance: M.I.T. Raw vs. UC Davis AsAP Processor Architectures At A Glance: M.I.T. Raw vs. UC Davis AsAP Presenter: Course: EEC 289Q: Reconfigurable Computing Course Instructor: Professor Soheil Ghiasi Outline Overview of M.I.T. Raw processor

More information

Embedded Systems. 2. Software Development. Lothar Thiele. Computer Engineering and Networks Laboratory

Embedded Systems. 2. Software Development. Lothar Thiele. Computer Engineering and Networks Laboratory Embedded Systems 2. Software Development Lothar Thiele Computer Engineering and Networks Laboratory Remember: Computer Engineering I Compilation of a C program to machine language program: textual representation

More information

A Case Study in Incremental Prototyping with Reconfigurable Hardware: DSRC Software Defined-Radio

A Case Study in Incremental Prototyping with Reconfigurable Hardware: DSRC Software Defined-Radio A Case Study in Incremental Prototyping with Reconfigurable Hardware: DSRC Software Defined-Radio Jared Dulmage jaredd@ee.ucla.edu Minko Tsai minko@ee.ucla.edu Michael Fitz fitz@ee.ucla.edu Babak Daneshrad

More information

Implementation of a Dual-Mode SDR Smart Antenna Base Station Supporting WiBro and TDD HSDPA

Implementation of a Dual-Mode SDR Smart Antenna Base Station Supporting WiBro and TDD HSDPA Implementation of a Dual-Mode SDR Smart Antenna Base Station Supporting WiBro and TDD HSDPA Jongeun Kim, Sukhwan Mun, Taeyeol Oh,Yusuk Yun, Seungwon Choi 1 HY-SDR Research Center, Hanyang University, Seoul,

More information

Evolution of Implementation Technologies. ECE 4211/5211 Rapid Prototyping with FPGAs. Gate Array Technology (IBM s) Programmable Logic

Evolution of Implementation Technologies. ECE 4211/5211 Rapid Prototyping with FPGAs. Gate Array Technology (IBM s) Programmable Logic ECE 42/52 Rapid Prototyping with FPGAs Dr. Charlie Wang Department of Electrical and Computer Engineering University of Colorado at Colorado Springs Evolution of Implementation Technologies Discrete devices:

More information

Interface-Based Design Introduction

Interface-Based Design Introduction Interface-Based Design Introduction A. Richard Newton Department of Electrical Engineering and Computer Sciences University of California at Berkeley Integrated CMOS Radio Dedicated Logic and Memory uc

More information

Software Defined Modem A commercial platform for wireless handsets

Software Defined Modem A commercial platform for wireless handsets Software Defined Modem A commercial platform for wireless handsets Charles F Sturman VP Marketing June 22 nd ~ 24 th Brussels charles.stuman@cognovo.com www.cognovo.com Agenda SDM Separating hardware from

More information

High Data Rate Fully Flexible SDR Modem

High Data Rate Fully Flexible SDR Modem High Data Rate Fully Flexible SDR Modem Advanced configurable architecture & development methodology KASPERSKI F., PIERRELEE O., DOTTO F., SARLOTTE M. THALES Communication 160 bd de Valmy, 92704 Colombes,

More information

Optimizing HW/SW Partition of a Complex Embedded Systems. Simon George November 2015.

Optimizing HW/SW Partition of a Complex Embedded Systems. Simon George November 2015. Optimizing HW/SW Partition of a Complex Embedded Systems Simon George November 2015 Zynq-7000 All Programmable SoC HP ACP GP Page 2 Zynq UltraScale+ MPSoC Page 3 HW/SW Optimization Challenges application()

More information

FPGA BASED SYSTEM DESIGN. Dr. Tayab Din Memon Lecture 1 & 2

FPGA BASED SYSTEM DESIGN. Dr. Tayab Din Memon Lecture 1 & 2 FPGA BASED SYSTEM DESIGN Dr. Tayab Din Memon tayabuddin.memon@faculty.muet.edu.pk Lecture 1 & 2 Books Recommended Books: Text Book: FPGA Based System Design by Wayne Wolf Verilog HDL by Samir Palnitkar.

More information

ECE 637 Integrated VLSI Circuits. Introduction. Introduction EE141

ECE 637 Integrated VLSI Circuits. Introduction. Introduction EE141 ECE 637 Integrated VLSI Circuits Introduction EE141 1 Introduction Course Details Instructor Mohab Anis; manis@vlsi.uwaterloo.ca Text Digital Integrated Circuits, Jan Rabaey, Prentice Hall, 2 nd edition

More information

Compilation of Parametric Dataflow Applications for Software-Defined-Radio-Dedicated MPSoCs DREAM seminar

Compilation of Parametric Dataflow Applications for Software-Defined-Radio-Dedicated MPSoCs DREAM seminar Compilation of Parametric Dataflow Applications for Software-Defined-Radio-Dedicated MPSoCs DREAM seminar Mickaël Dardaillon Research Intern with NOKIA Technologies January 27th, 2015 2 / 33 What we know

More information

Computer Architecture 2/26/01 Lecture #

Computer Architecture 2/26/01 Lecture # Computer Architecture 2/26/01 Lecture #9 16.070 On a previous lecture, we discussed the software development process and in particular, the development of a software architecture Recall the output of the

More information

Simplifying FPGA Design for SDR with a Network on Chip Architecture

Simplifying FPGA Design for SDR with a Network on Chip Architecture Simplifying FPGA Design for SDR with a Network on Chip Architecture Matt Ettus Ettus Research GRCon13 Outline 1 Introduction 2 RF NoC 3 Status and Conclusions USRP FPGA Capability Gen

More information

Implementing Tile-based Chip Multiprocessors with GALS Clocking Styles

Implementing Tile-based Chip Multiprocessors with GALS Clocking Styles Implementing Tile-based Chip Multiprocessors with GALS Clocking Styles Zhiyi Yu, Bevan Baas VLSI Computation Lab, ECE Department University of California, Davis, USA Outline Introduction Timing issues

More information

C-Based Hardware Design Platform for Dynamically Reconfigurable Processor

C-Based Hardware Design Platform for Dynamically Reconfigurable Processor C-Based Hardware Design Platform for Dynamically Reconfigurable Processor September 22 nd, 2005 IPFlex Inc. Agenda Merits of C-Based hardware design Hardware enabling C-Based hardware design DAPDNA-FW

More information

Versal: AI Engine & Programming Environment

Versal: AI Engine & Programming Environment Engineering Director, Xilinx Silicon Architecture Group Versal: Engine & Programming Environment Presented By Ambrose Finnerty Xilinx DSP Technical Marketing Manager October 16, 2018 MEMORY MEMORY MEMORY

More information

Automatic Porting of Binary File Descriptor Library

Automatic Porting of Binary File Descriptor Library Automatic Porting of Binary File Descriptor Library Maghsoud Abbaspour Electrical and Computer Engineering University of Toronto Oct 19, 2001 maghsoud@eecg.toronto.edu Synthesis Reading Group Copyright

More information

Outline. CPE/EE 422/522 Advanced Logic Design L05. Review: General Model of Moore Sequential Machine. Review: Mealy Sequential Networks.

Outline. CPE/EE 422/522 Advanced Logic Design L05. Review: General Model of Moore Sequential Machine. Review: Mealy Sequential Networks. Outline CPE/EE 422/522 Advanced Logic Design L05 Electrical and Computer Engineering University of Alabama in Huntsville What we know Combinational Networks Sequential Networks: Basic Building Blocks,

More information

Networks-on-Chip Router: Configuration and Implementation

Networks-on-Chip Router: Configuration and Implementation Networks-on-Chip : Configuration and Implementation Wen-Chung Tsai, Kuo-Chih Chu * 2 1 Department of Information and Communication Engineering, Chaoyang University of Technology, Taichung 413, Taiwan,

More information

Metodologie di Progettazione Hardware e Software

Metodologie di Progettazione Hardware e Software POLITECNICO DI MILANO Metodologie di Progettazione Hardware e Software Reconfigurable Computing - Design Flow - Marco D. Santambrogio marco.santabrogio@polimi.it Outline 2 Retargetable Compiler Basic Idea

More information

DE2 Board & Quartus II Software

DE2 Board & Quartus II Software January 23, 2015 Contact and Office Hours Teaching Assistant (TA) Sergio Contreras Office Office Hours Email SEB 3259 Tuesday & Thursday 12:30-2:00 PM Wednesday 1:30-3:30 PM contre47@nevada.unlv.edu Syllabus

More information

DSPACE: LISA Methodology

DSPACE: LISA Methodology DSPACE: LISA Methodology Maximilian Odendahl ESA DSP Day, Noordwjik, 28.8.2012 Institute for Communication Technologies and Embedded Systems Outline 2 Introduction DSPACE Requirements LISA Methodology

More information

Reconfigurable Cell Array for DSP Applications

Reconfigurable Cell Array for DSP Applications Outline econfigurable Cell Array for DSP Applications Chenxin Zhang Department of Electrical and Information Technology Lund University, Sweden econfigurable computing Coarse-grained reconfigurable cell

More information

OPERATING SYSTEMS. Prescribed Text Book Operating System Principles, Seventh Edition By Abraham Silberschatz, Peter Baer Galvin and Greg Gagne

OPERATING SYSTEMS. Prescribed Text Book Operating System Principles, Seventh Edition By Abraham Silberschatz, Peter Baer Galvin and Greg Gagne OPERATING SYSTEMS Prescribed Text Book Operating System Principles, Seventh Edition By Abraham Silberschatz, Peter Baer Galvin and Greg Gagne OVERVIEW An operating system is a program that manages the

More information

Often, more information is required when designing system call Information varies according to OS and types of system call

Often, more information is required when designing system call Information varies according to OS and types of system call System Call Parameter Passing Often, more information is required when designing system call Information varies according to OS and types of system call Three general methods used to pass parameters to

More information

Implementing Secure Software Systems on ARMv8-M Microcontrollers

Implementing Secure Software Systems on ARMv8-M Microcontrollers Implementing Secure Software Systems on ARMv8-M Microcontrollers Chris Shore, ARM TrustZone: A comprehensive security foundation Non-trusted Trusted Security separation with TrustZone Isolate trusted resources

More information

Multi-Gigahertz Parallel FFTs for FPGA and ASIC Implementation

Multi-Gigahertz Parallel FFTs for FPGA and ASIC Implementation Multi-Gigahertz Parallel FFTs for FPGA and ASIC Implementation Doug Johnson, Applications Consultant Chris Eddington, Technical Marketing Synopsys 2013 1 Synopsys, Inc. 700 E. Middlefield Road Mountain

More information

Reconfigurable Radio Systems as Enabler for Exploiting the Future Heterogeneous Wireless Communications Landscape

Reconfigurable Radio Systems as Enabler for Exploiting the Future Heterogeneous Wireless Communications Landscape Reconfigurable Radio Systems as Enabler for Exploiting the Future Heterogeneous Wireless Communications Landscape By Markus Mueck, Vladimir Ivanov, Seungwon Choi, Gianmarco Baldini, Antti Piipponen, et

More information

ECE 353 Lab 4. MIDI Receiver in Verilog. Professor Daniel Holcomb UMass Amherst Fall 2016

ECE 353 Lab 4. MIDI Receiver in Verilog. Professor Daniel Holcomb UMass Amherst Fall 2016 ECE 353 Lab 4 MIDI Receiver in Verilog Professor Daniel Holcomb UMass Amherst Fall 2016 Timeline and Grading for Lab 4 Lectures on 11/15 and 11/17 Due on 12/12 Demos in Duda hall Schedule will be posted

More information

FINDING THE OPTIMUM PARTITIONING FOR MULTI-STANDARD RADIO SYSTEMS

FINDING THE OPTIMUM PARTITIONING FOR MULTI-STANDARD RADIO SYSTEMS FINDING THE OPTIMUM PARTITIONING FOR MULTI-STANDARD RADIO SYSTEMS Hans-Martin Bluethgen, Christian Sauer, Matthias Gries, Wolfgang Raab, Dominik Langen, Alexander Schackow, Manuel Loew, Ulrich Hachmann,

More information

Key technologies for many core architectures

Key technologies for many core architectures Key technologies for many core architectures Thierry Collette CEA, LIST thierry.collette@c ea.fr 1 Embedded computing Silicon area offers perfo rmance Applications x 40 from 90 to 45 ns Computing performance

More information

A 167-processor Computational Array for Highly-Efficient DSP and Embedded Application Processing

A 167-processor Computational Array for Highly-Efficient DSP and Embedded Application Processing A 167-processor Computational Array for Highly-Efficient DSP and Embedded Application Processing Dean Truong, Wayne Cheng, Tinoosh Mohsenin, Zhiyi Yu, Toney Jacobson, Gouri Landge, Michael Meeuwsen, Christine

More information

Intel Array Building Blocks

Intel Array Building Blocks Intel Array Building Blocks Productivity, Performance, and Portability with Intel Parallel Building Blocks Intel SW Products Workshop 2010 CERN openlab 11/29/2010 1 Agenda Legal Information Vision Call

More information

Automatic Porting of Binary File Descriptor Library

Automatic Porting of Binary File Descriptor Library Automatic Porting of Binary File Descriptor Library Maghsoud Abbaspour Electrical and Computer Engineering University of Toronto Oct 19, 2001 maghsoud@eecg.toronto.edu Synthesis Reading Group Copyright

More information

NIOS CPU Based Embedded Computer System on Programmable Chip

NIOS CPU Based Embedded Computer System on Programmable Chip 1 Objectives NIOS CPU Based Embedded Computer System on Programmable Chip EE8205: Embedded Computer Systems This lab has been constructed to introduce the development of dedicated embedded system based

More information

Architecture and Automated Design Flow for Digital Network on chip for Analog/RF Building Block Control

Architecture and Automated Design Flow for Digital Network on chip for Analog/RF Building Block Control Architecture and Automated Design Flow for Digital Network on chip for Analog/RF Building Block Control Wolfgang Eberle, PhD IMEC Bioelectronic Systems Bridging software and analog/rf Software defined

More information

Intel Research mote. Ralph Kling Intel Corporation Research Santa Clara, CA

Intel Research mote. Ralph Kling Intel Corporation Research Santa Clara, CA Intel Research mote Ralph Kling Intel Corporation Research Santa Clara, CA Overview Intel mote project goals Project status and direction Intel mote hardware Intel mote software Summary and outlook Intel

More information

ECE 3610 Microprocessing Systems Lab #1 Verilog Design of the TOC Using Quartus II

ECE 3610 Microprocessing Systems Lab #1 Verilog Design of the TOC Using Quartus II ECE 3610 Microprocessing Systems Lab #1 Verilog Design of the TOC Using Quartus II This lab manual presents an introduction to the Quartus II Computer Aided Design (CAD) system. This manual gives step-by-step

More information

Design of Embedded DSP Processors Unit 7: Programming toolchain. 9/26/2017 Unit 7 of TSEA H1 1

Design of Embedded DSP Processors Unit 7: Programming toolchain. 9/26/2017 Unit 7 of TSEA H1 1 Design of Embedded DSP Processors Unit 7: Programming toolchain 9/26/2017 Unit 7 of TSEA26 2017 H1 1 Toolchain introduction There are two kinds of tools 1.The ASIP design tool for HW designers Frontend

More information

SimXMD Co-Debugging Software and Hardware in FPGA Embedded Systems

SimXMD Co-Debugging Software and Hardware in FPGA Embedded Systems University of Toronto FPGA Seminar SimXMD Co-Debugging Software and Hardware in FPGA Embedded Systems Ruediger Willenberg and Paul Chow High-Performance Reconfigurable Computing Group University of Toronto

More information

SimXMD: Simulation-based HW/SW Co-Debugging for FPGA Embedded Systems

SimXMD: Simulation-based HW/SW Co-Debugging for FPGA Embedded Systems FPGAworld 2014 SimXMD: Simulation-based HW/SW Co-Debugging for FPGA Embedded Systems Ruediger Willenberg and Paul Chow High-Performance Reconfigurable Computing Group University of Toronto September 9,

More information

Frequency domain IQ. Richard Maiden - Intel

Frequency domain IQ. Richard Maiden - Intel Frequency domain IQ Richard Maiden - Intel Compliance with IEEE Standards Policies and Procedures Subclause 5.2.1 of the IEEE-SA Standards Board Bylaws states, "While participating in IEEE standards development

More information

VHDL for Synthesis. Course Description. Course Duration. Goals

VHDL for Synthesis. Course Description. Course Duration. Goals VHDL for Synthesis Course Description This course provides all necessary theoretical and practical know how to write an efficient synthesizable HDL code through VHDL standard language. The course goes

More information

A Software Development and Validation Framework for SDR Platforms

A Software Development and Validation Framework for SDR Platforms A Software Development and Validation Framework for SDR Platforms Jeroen.Declerck@imec.be Outline IMEC SDR Platform Problem Statement Framework (XMSF) Implementation XMSS server Graphical logger IMEC SDR

More information

Gated-Demultiplexer Tree Buffer for Low Power Using Clock Tree Based Gated Driver

Gated-Demultiplexer Tree Buffer for Low Power Using Clock Tree Based Gated Driver Gated-Demultiplexer Tree Buffer for Low Power Using Clock Tree Based Gated Driver E.Kanniga 1, N. Imocha Singh 2,K.Selva Rama Rathnam 3 Professor Department of Electronics and Telecommunication, Bharath

More information

All MSEE students are required to take the following two core courses: Linear systems Probability and Random Processes

All MSEE students are required to take the following two core courses: Linear systems Probability and Random Processes MSEE Curriculum All MSEE students are required to take the following two core courses: 3531-571 Linear systems 3531-507 Probability and Random Processes The course requirements for students majoring in

More information

Hardware Design Environments. Dr. Mahdi Abbasi Computer Engineering Department Bu-Ali Sina University

Hardware Design Environments. Dr. Mahdi Abbasi Computer Engineering Department Bu-Ali Sina University Hardware Design Environments Dr. Mahdi Abbasi Computer Engineering Department Bu-Ali Sina University Outline Welcome to COE 405 Digital System Design Design Domains and Levels of Abstractions Synthesis

More information

A PROGRAMMABLE BASEBAND PLATFORM FOR SOFTWARE-DEFINED RADIO

A PROGRAMMABLE BASEBAND PLATFORM FOR SOFTWARE-DEFINED RADIO A PROGRAMMABLE BASEBAND PLATFORM FOR SOFTWARE-DEFINED RADIO Hans-Martin Bluethgen, Cyprian Grassmann, Wolfgang Raab, Ulrich Ramacher, Josef Hausner, Infineon Technologies AG, 81609 Munich, Germany, Hans-Martin.Bluethgen@infineon.com

More information

Ting Wu, Chi-Ying Tsui, Mounir Hamdi Hong Kong University of Science & Technology Hong Kong SAR, China

Ting Wu, Chi-Ying Tsui, Mounir Hamdi Hong Kong University of Science & Technology Hong Kong SAR, China CMOS Crossbar Ting Wu, Chi-Ying Tsui, Mounir Hamdi Hong Kong University of Science & Technology Hong Kong SAR, China OUTLINE Motivations Problems of Designing Large Crossbar Our Approach - Pipelined MUX

More information

Test and Verification Solutions. ARM Based SOC Design and Verification

Test and Verification Solutions. ARM Based SOC Design and Verification Test and Verification Solutions ARM Based SOC Design and Verification 7 July 2008 1 7 July 2008 14 March 2 Agenda System Verification Challenges ARM SoC DV Methodology ARM SoC Test bench Construction Conclusion

More information

Synthesis at different abstraction levels

Synthesis at different abstraction levels Synthesis at different abstraction levels System Level Synthesis Clustering. Communication synthesis. High-Level Synthesis Resource or time constrained scheduling Resource allocation. Binding Register-Transfer

More information

FPGA: What? Why? Marco D. Santambrogio

FPGA: What? Why? Marco D. Santambrogio FPGA: What? Why? Marco D. Santambrogio marco.santambrogio@polimi.it 2 Reconfigurable Hardware Reconfigurable computing is intended to fill the gap between hardware and software, achieving potentially much

More information

ECE332, Week 2, Lecture 3. September 5, 2007

ECE332, Week 2, Lecture 3. September 5, 2007 ECE332, Week 2, Lecture 3 September 5, 2007 1 Topics Introduction to embedded system Design metrics Definitions of general-purpose, single-purpose, and application-specific processors Introduction to Nios

More information

ECE332, Week 2, Lecture 3

ECE332, Week 2, Lecture 3 ECE332, Week 2, Lecture 3 September 5, 2007 1 Topics Introduction to embedded system Design metrics Definitions of general-purpose, single-purpose, and application-specific processors Introduction to Nios

More information

Lecture 41: Introduction to Reconfigurable Computing

Lecture 41: Introduction to Reconfigurable Computing inst.eecs.berkeley.edu/~cs61c CS61C : Machine Structures Lecture 41: Introduction to Reconfigurable Computing Michael Le, Sp07 Head TA April 30, 2007 Slides Courtesy of Hayden So, Sp06 CS61c Head TA Following

More information

RAIN SDR Quick Start Manual RAIN SDR. Quick Start Manual

RAIN SDR Quick Start Manual RAIN SDR. Quick Start Manual RAIN SDR Quick Start Manual The following table RAIN shows SDR the Quick revision Start history Manual Revision History The following table shows the revision history for this document.change bars indicate

More information

Large-Scale Network Simulation Scalability and an FPGA-based Network Simulator

Large-Scale Network Simulation Scalability and an FPGA-based Network Simulator Large-Scale Network Simulation Scalability and an FPGA-based Network Simulator Stanley Bak Abstract Network algorithms are deployed on large networks, and proper algorithm evaluation is necessary to avoid

More information

Overview. Design flow. Principles of logic synthesis. Logic Synthesis with the common tools. Conclusions

Overview. Design flow. Principles of logic synthesis. Logic Synthesis with the common tools. Conclusions Logic Synthesis Overview Design flow Principles of logic synthesis Logic Synthesis with the common tools Conclusions 2 System Design Flow Electronic System Level (ESL) flow System C TLM, Verification,

More information

Computers in Engineering COMP 208. Computer Structure. Computer Architecture. Computer Structure Michael A. Hawker

Computers in Engineering COMP 208. Computer Structure. Computer Architecture. Computer Structure Michael A. Hawker Computers in Engineering COMP 208 Computer Structure Michael A. Hawker Computer Structure We will briefly look at the structure of a modern computer That will help us understand some of the concepts that

More information

FPGA. Agenda 11/05/2016. Scheduling tasks on Reconfigurable FPGA architectures. Definition. Overview. Characteristics of the CLB.

FPGA. Agenda 11/05/2016. Scheduling tasks on Reconfigurable FPGA architectures. Definition. Overview. Characteristics of the CLB. Agenda The topics that will be addressed are: Scheduling tasks on Reconfigurable FPGA architectures Mauro Marinoni ReTiS Lab, TeCIP Institute Scuola superiore Sant Anna - Pisa Overview on basic characteristics

More information

Programming in the Brave New World of Systems-on-a-chip

Programming in the Brave New World of Systems-on-a-chip Programming in the Brave New World of Systems-on-a-chip Arvind Computer Science & Artificial Intelligence Lab Massachusetts Institute of Technology The 25th International Workshop on Languages and Compilers

More information

Using FPGAs in Supercomputing Reconfigurable Supercomputing

Using FPGAs in Supercomputing Reconfigurable Supercomputing Using FPGAs in Supercomputing Reconfigurable Supercomputing Why FPGAs? FPGAs are 10 100x faster than a modern Itanium or Opteron Performance gap is likely to grow further in the future Several major vendors

More information

Part IV: 3D WiNoC Architectures

Part IV: 3D WiNoC Architectures Wireless NoC as Interconnection Backbone for Multicore Chips: Promises, Challenges, and Recent Developments Part IV: 3D WiNoC Architectures Hiroki Matsutani Keio University, Japan 1 Outline: 3D WiNoC Architectures

More information

EE 3170 Microcontroller Applications

EE 3170 Microcontroller Applications EE 3170 Microcontroller Applications Lecture 4 : Processors, Computers, and Controllers - 1.2 (reading assignment), 1.3-1.5 Based on slides for ECE3170 by Profs. Kieckhafer, Davis, Tan, and Cischke Outline

More information

Digital Systems Design. System on a Programmable Chip

Digital Systems Design. System on a Programmable Chip Digital Systems Design Introduction to System on a Programmable Chip Dr. D. J. Jackson Lecture 11-1 System on a Programmable Chip Generally involves utilization of a large FPGA Large number of logic elements

More information

Improving the Operating System with Reconfigurable Hardware

Improving the Operating System with Reconfigurable Hardware Improving the Operating System with Reconfigurable Hardware (FGBS 11) Michael Gernoth System Software Group Friedrich-Alexander University Erlangen-Nuremberg November 11, 2011 supported by Challenges in

More information

EEM870 Embedded System and Experiment Lecture 4: SoC Design Flow and Tools

EEM870 Embedded System and Experiment Lecture 4: SoC Design Flow and Tools EEM870 Embedded System and Experiment Lecture 4: SoC Design Flow and Tools Wen-Yen Lin, Ph.D. Department of Electrical Engineering Chang Gung University Email: wylin@mail.cgu.edu.tw March 2013 Agenda Introduction

More information

A NOVEL BASED METHOD TO DESIGN A 4G NETWORK AND TO IMPLEMENT IN REAL TIME USING DSP INTERFACE

A NOVEL BASED METHOD TO DESIGN A 4G NETWORK AND TO IMPLEMENT IN REAL TIME USING DSP INTERFACE aerd Scientific Journal of Impact Factor(SJIF): 3.134 e-issn(o): 2348-4470 p-issn(p): 2348-6406 International Journal of Advance Engineering and Research Development Volume 2,Issue 3, March -2015 A NOVEL

More information

OpenSMART: Single-cycle Multi-hop NoC Generator in BSV and Chisel

OpenSMART: Single-cycle Multi-hop NoC Generator in BSV and Chisel OpenSMART: Single-cycle Multi-hop NoC Generator in BSV and Chisel Hyoukjun Kwon and Tushar Krishna Georgia Institute of Technology Synergy Lab (http://synergy.ece.gatech.edu) hyoukjun@gatech.edu April

More information

3L Diamond. Multiprocessor DSP RTOS

3L Diamond. Multiprocessor DSP RTOS 3L Diamond Multiprocessor DSP RTOS What is 3L Diamond? Diamond is an operating system designed for multiprocessor DSP applications. With Diamond you develop efficient applications that use networks of

More information

SimXMD Simulation-based HW/SW Co-debugging for field-programmable Systems-on-Chip

SimXMD Simulation-based HW/SW Co-debugging for field-programmable Systems-on-Chip SimXMD Simulation-based HW/SW Co-debugging for field-programmable Systems-on-Chip Ruediger Willenberg and Paul Chow High-Performance Reconfigurable Computing Group University of Toronto September 4, 2013

More information

Flexible wireless communication architectures

Flexible wireless communication architectures Flexible wireless communication architectures Sridhar Rajagopal Department of Electrical and Computer Engineering Rice University, Houston TX Faculty Candidate Seminar Southern Methodist University April

More information

Mapping Stream based Applications to an Intel IXP Network Processor using Compaan

Mapping Stream based Applications to an Intel IXP Network Processor using Compaan Mapping Stream based Applications to an Intel IXP Network Processor using Compaan Sjoerd Meijer (PhD Student) University Leiden, LIACS smeijer@liacs.nl Outline Need for multi-processor platforms Problem:

More information

SYSTEMS PROGRAMMING. Srimanta Pal. Associate Professor Indian Statistical Institute Kolkata OXFORD UNIVERSITY PRESS

SYSTEMS PROGRAMMING. Srimanta Pal. Associate Professor Indian Statistical Institute Kolkata OXFORD UNIVERSITY PRESS SYSTEMS PROGRAMMING Srimanta Pal Associate Professor Indian Statistical Institute Kolkata OXFORD UNIVERSITY PRESS Contents Preface v 1. Scope of Systems Programming 1 1.1 Introduction 7 1.2 Computers and

More information

UWB Wireless Wireless USB Initiative:

UWB Wireless Wireless USB Initiative: UWB Wireless Wireless USB Initiative: First Hi Speed WPAN Interconnect Jeff Ravencraft Technology Strategist Intel Corporation WUSB Promoter Group Chairman February 01, 2005 Intel and the Intel logo are

More information

Parallel Programming Multicore systems

Parallel Programming Multicore systems FYS3240 PC-based instrumentation and microcontrollers Parallel Programming Multicore systems Spring 2011 Lecture #9 Bekkeng, 4.4.2011 Introduction Until recently, innovations in processor technology have

More information

Building Combinatorial Circuit Using Behavioral Modeling Lab

Building Combinatorial Circuit Using Behavioral Modeling Lab Building Combinatorial Circuit Using Behavioral Modeling Lab Overview: In this lab you will learn how to model a combinatorial circuit using behavioral modeling style of Verilog HDL. You will model a combinatorial

More information

Overview. CSE372 Digital Systems Organization and Design Lab. Hardware CAD. Two Types of Chips

Overview. CSE372 Digital Systems Organization and Design Lab. Hardware CAD. Two Types of Chips Overview CSE372 Digital Systems Organization and Design Lab Prof. Milo Martin Unit 5: Hardware Synthesis CAD (Computer Aided Design) Use computers to design computers Virtuous cycle Architectural-level,

More information

Lec 13: Linking and Memory. Kavita Bala CS 3410, Fall 2008 Computer Science Cornell University. Announcements

Lec 13: Linking and Memory. Kavita Bala CS 3410, Fall 2008 Computer Science Cornell University. Announcements Lec 13: Linking and Memory Kavita Bala CS 3410, Fall 2008 Computer Science Cornell University PA 2 is out Due on Oct 22 nd Announcements Prelim Oct 23 rd, 7:30-9:30/10:00 All content up to Lecture on Oct

More information

VHDL. Chapter 1 Introduction to VHDL. Course Objectives Affected. Outline

VHDL. Chapter 1 Introduction to VHDL. Course Objectives Affected. Outline Chapter 1 Introduction to VHDL VHDL VHDL - Flaxer Eli Ch 1-1 Course Objectives Affected Write functionally correct and well-documented VHDL code, intended for either simulation or synthesis, of any combinational

More information

These actions may use different parts of the CPU. Pipelining is when the parts run simultaneously on different instructions.

These actions may use different parts of the CPU. Pipelining is when the parts run simultaneously on different instructions. MIPS Pipe Line 2 Introduction Pipelining To complete an instruction a computer needs to perform a number of actions. These actions may use different parts of the CPU. Pipelining is when the parts run simultaneously

More information

Chapter 12. Microcontroller Application Development Tools

Chapter 12. Microcontroller Application Development Tools Chapter 12 Microcontroller Application Development Tools Lesson 2 Assembler, Compiler, Library File generation and Integrated Development Environment Assembler Compiler Library Manager File Generation

More information

Performance Overhead with High Level Waveform Development

Performance Overhead with High Level Waveform Development Performance Overhead with High Level Waveform Development Stefan Nagel, Michael Schwall, Friedrich K. Jondral 2010 European Reconfigurable Radio Technologies Workshop June 23-25, 2010 Mainz, Germany Communications

More information

CHAPTER 3 ASYNCHRONOUS PIPELINE CONTROLLER

CHAPTER 3 ASYNCHRONOUS PIPELINE CONTROLLER 84 CHAPTER 3 ASYNCHRONOUS PIPELINE CONTROLLER 3.1 INTRODUCTION The introduction of several new asynchronous designs which provides high throughput and low latency is the significance of this chapter. The

More information

Computer Organization and Assembly Language. Lab Session 01

Computer Organization and Assembly Language. Lab Session 01 Objective: Lab Session 01 Introduction to Assembly Language Tools and Familiarization with Emu8086 environment To be able to understand Data Representation and perform conversions from one system to another

More information

MOJTABA MAHDAVI Mojtaba Mahdavi DSP Design Course, EIT Department, Lund University, Sweden

MOJTABA MAHDAVI Mojtaba Mahdavi DSP Design Course, EIT Department, Lund University, Sweden High Level Synthesis with Catapult MOJTABA MAHDAVI 1 Outline High Level Synthesis HLS Design Flow in Catapult Data Types Project Creation Design Setup Data Flow Analysis Resource Allocation Scheduling

More information

Hybrid Threading: A New Approach for Performance and Productivity

Hybrid Threading: A New Approach for Performance and Productivity Hybrid Threading: A New Approach for Performance and Productivity Glen Edwards, Convey Computer Corporation 1302 East Collins Blvd Richardson, TX 75081 (214) 666-6024 gedwards@conveycomputer.com Abstract:

More information

A System IP Approach to High Performance mmwave 5G Wireless Verticals

A System IP Approach to High Performance mmwave 5G Wireless Verticals A System IP Approach to High Performance mmwave 5G Wireless Verticals TechWorks Ray McConnell, CTO Blu Wireless Technology Ltd Gigabit mmwave modem IP Consumer WiGig : IEEE 802.11ad/ay 4G backhaul and

More information

Using Industry Standards to Exploit the Advantages and Resolve the Challenges of Multicore Technology

Using Industry Standards to Exploit the Advantages and Resolve the Challenges of Multicore Technology Using Industry Standards to Exploit the Advantages and Resolve the Challenges of Multicore Technology September 19, 2007 Markus Levy, EEMBC and Multicore Association Enabling the Multicore Ecosystem Multicore

More information

MPSOC 2011 BEAUNE, FRANCE

MPSOC 2011 BEAUNE, FRANCE MPSOC 2011 BEAUNE, FRANCE BOADRES: A SCALABLE BASEBAND PROCESSOR TEMPLATE FOR Gbps RADIOS VICE PRESIDENT, CHAIRMAN OF THE TECHNOLOGY OFFICE PROFESSOR AT THE KATHOLIEKE UNIVERSITEIT LEUVEN STATUS SDR BASEBAND

More information

Wireless Networking: An Introduction. Hongwei Zhang

Wireless Networking: An Introduction. Hongwei Zhang Wireless Networking: An Introduction Hongwei Zhang http://www.cs.wayne.edu/~hzhang Outline Networking as resource allocation A taxonomy of current practice Technical elements Outline Networking as resource

More information

Design methodology for multi processor systems design on regular platforms

Design methodology for multi processor systems design on regular platforms Design methodology for multi processor systems design on regular platforms Ph.D in Electronics, Computer Science and Telecommunications Ph.D Student: Davide Rossi Ph.D Tutor: Prof. Roberto Guerrieri Outline

More information

System Architecture Directions for Networked Sensors[1]

System Architecture Directions for Networked Sensors[1] System Architecture Directions for Networked Sensors[1] Secure Sensor Networks Seminar presentation Eric Anderson System Architecture Directions for Networked Sensors[1] p. 1 Outline Sensor Network Characteristics

More information

Frequency domain IQ. Richard Maiden - Intel

Frequency domain IQ. Richard Maiden - Intel Frequency domain IQ Richard Maiden - Intel Compliance with IEEE Standards Policies and Procedures Subclause 5.2.1 of the IEEE-SA Standards Board Bylaws states, "While participating in IEEE standards development

More information

Characterizing Imbalance in Large-Scale Parallel Programs. David Bo hme September 26, 2013

Characterizing Imbalance in Large-Scale Parallel Programs. David Bo hme September 26, 2013 Characterizing Imbalance in Large-Scale Parallel Programs David o hme September 26, 2013 Need for Performance nalysis Tools mount of parallelism in Supercomputers keeps growing Efficient resource usage

More information