From Temporal Partitioning and Temporal Placement to Algorithmic Skeletons

Size: px
Start display at page:

Download "From Temporal Partitioning and Temporal Placement to Algorithmic Skeletons"

Transcription

1 From Temporal Partitioning and Temporal Placement to Algorithmic Skeletons Florian Dittmann, Franz J. Rammig Heinz Nixdorf Institute University of Paderborn, Germany

2 Motivation Making reconfigurable computing mature - Industrialization Capabilities - Processing in parallel - Runtime reconfiguration - Partial reconfiguration - Space and time -Etc. Abstraction - Layers - Beneficial methods FPGA 2/26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

3 Overview Motivation Partitioning methods and their application - Lessons learned Layered approach - Specification Graph Approach - Reconfiguration Port Scheduling - Algorithmic Skeletons Cooperation Part-E Conclusion 3/26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

4 Partitioning Applying the spectral method on coarse grained systems - Mesh-based nearest neighbor communication - 2D topology Mapping - Data flow graphs - Resource efficient - Communication optimized PE PE y x y x y x /26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

5 Partitioning Temporal + Spatial Partitioning/Placement Basis - ASAP scheduling - Spectral placement Combination - Focusing on one level - Location of the nodes in the spectral placement Placement of the extracted nodes on PE Benefit - ASAP: precedence constraints - Spectral Method: overall closeness respected 5/26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

6 Partitioning Two Slot Model Execution Environment of exactly two slots - Alternating execution of tasks Slot A Controller Slot B - Hiding of the reconfiguration overhead FPGA Freq. Exploiting partial run-time reconfiguration Challenges time - Architecture demands for communication infrastructure - Partial bitstream generation - Task mapping - Partitioning 6/26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

7 Two Slot Model Partitioning of the input algorithms B B d) Partitioning e) Including of Buffers Scheduling - Simple dispatching - Single server for two machines Slot A Slot B time time 7/26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

8 Lessons Learned Partitioning as reasonable/fundamental step Challenges - Placement: fragmentation - Reconfiguration overhead -Etc. Valuable concept - Two phases Reconfiguration phase Execution phase Derivable concepts - Specification graph approach Domain of platform-based design - Reconfiguration Port Scheduling - Algorithmic Skeletons FPGA? 8/26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

9 Layered Model for Design Methods area and phase - varies - varies Challenging design, scheduling, etc. Tasks Layer 1 Layer 2 Layer n Reconfigurable Fabric time Layer model/approach - Abstraction - Specification example Tasks Partitioning Dispatching Execution Environment FPGA 9/26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

10 Specification Graph Approach Tasks Problem graph - Integration of the phase Tasks Problem Graph Mapping Architecture Graph FPGA CPU FPGA FPGA Processing units Architecture graph - Also heterogeneous Mapping - Links tasks, communication and reconfiguration with architectural resources - Hierarchical mapping edges Synthesis - Scheduling, allocation and binding - By evolutionary computing 10/26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

11 1 C0 T1 Tasks Problem Graph 3 C1 T3 C4 C2 C3 T2 2 Task graph G Add reconfiguration phases G* Add Communication Vertexes G P (Problem Graph) Tasks Problem Graph Mapping Architecture Graph FPGA CPU FPGA 11/26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

12 Specification Graph Approach Mapping of Resources problem and architecture graph + mapping edges = specification graph Slot 1 Slot 2 Port 1 T1 C0 3 ms Port 3 C2 C1 C3 T3 C4 T ms 3 ms 12 ms 10 ms 1 ms 15 ms Slot 1 Bus Slot 2 12/26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

13 Specification Graph Approach Extensions C1 1 Port Slot_B Multiple devices Multiple reconfiguration ports T1 Slot_C FPGA1 2 C2 C3 3 Gigabit T2 T3 Gigabit C4 Slot_A FPGA2 G P Port Platform-based design All within the domain of synthesis G A1 G A2 Slot A 2 T2 Slot B 1 T1 Slot C 3 T3 time 13/26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

14 Reconfiguration Port Scheduling Partially run-time reconfigurable FPGAs for real-time processing - Task set executed on FPGA Slot A Slot B Slot C Area assignment? - Prevent fragmentation - Offer communication Bus FPGA Scheduling? - Execution time of tasks - Reconfiguration process Overhead: time + single port At a pace of the environment Reconfiguration port scheduling Task Set Reconfiguration Port Scheduling Execution Environment FPGA 14/26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

15 Reconfiguration Port Scheduling Slotted execution environment Set of n partial Bitstreams Slot A Slot B Slot C d* d Single reconfiguration port Bus FPGA Real-time processing on slotted FPGA architecture - Guarantee meeting of deadlines - Constant reconfiguration phase - Deadline d* - One reconfiguration port mono processor scheduling algorithms Task Sets OS Real-Time Scheduling Execution Environment FPGA 15/26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

16 Performance of d* For aperiodic task sets - d* outperforms d - Of 100 feasible task sets d* finds approx. 90 and d finds approx. 70 d d* R T Performance depends on - ratio to t = l - # of slots t 16/26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

17 Fixed Priority Example Periodic task set scheduling - Static priorities - Preemption Characteristics - Deadlines (D*) shorter than periods Apply deadline monotonic scheduling (DM) 17/26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

18 Fixed Priority Scheduling Schedulability Analysis Parameters Response time analysis - DM with D*: - Schedulable if: τ i : R < D * i i - Calculate R i by R i 1 i R i = t, i + t, j j= 1 Tj - Critical instance: all tasks are released simultaneously Sufficient and necessary for DM with D - Challenging abnormalities for DM with D* 18/26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

19 Algorithmic Skeletons Motivation Tasks - Programmability and portability - Structure and behavior of the tasks Application level Use of algorithmic skeletons - Wrapping of tasks - Programming templates Applications Tasks Algorithmic Skeletons Runtime environment - Partial reconfiguration capabilities Dispatcher Skeleton Dispatching Runtime Environment FPGA 19/26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

20 Algorithmic Skeletons Background Invented for the parallel processing domain - First discussed by Murray Cole in the mid 80ies Objectives - Separate structure of a computation from the computation itself - Free programmer from the implementation details of the structure - Implementation guideline for activities and their interactions Related: design patterns - Differences Design level Final implementation left to the freedom of the designer Algorithmic skeletons force the applications to be well-formed - Enable to extract valuable information - Design space exploration on a high level of abstraction - Static and dynamic optimization of implementations 20/26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

21 Dynamic Reconfiguration Multi-threading on an FPGA - Hosting of more than one task - Share processing resource Challenges - Dispatch newly arriving tasks during run-time Tasks are not known at design time Architecture to facilitate dispatching must exist - Area assignment - Prevent fragmentation - Communication assignment Algorithmic Skeletons Applications Tasks Algorithmic Skeletons Skeleton Dispatching Runtime Environment FPGA 21/26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

22 Dynamic Reconfiguration Example Combination of Pipeline and Farm Skeleton - Slotted architecture - Bus and direct communication in Pipeline: W 1 W 2 W p Farm: in out E W 1 W 2 W p FPGA W1 W3 W2 W1 CE C out 22/26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

23 Cooperation PadErOl (Erlangen [Prof. Teich] and Oldenburg [Prof. Nebel]) - Integrated design flow, see it journal Braunschweig (Prof. Fekete) - Reconfiguration phase scheduling/ single server scheduling - Comparison of methods Erlangen (Prof. Teich) - Bus-based architecture for reconfiguration port scheduling Paderborn (Prof. Platzner) - Local cooperation - Information exchange Erlangen Slot Machine 23/26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

24 Publications 2007 Dittmann, Florian; Frank, Stefan: Hard Real-Time Reconfiguration Port Scheduling. In: Proceedings of the Design, automation and Test in Europe, Nice, France 2007 Dittmann, Florian; Götz, Marcelo; Rettberg, Achim: Model and Methodology for the Synthesis of Heterogeneous and Partially Reconfigurable Systems. In: Proceedings of the Reconfigurable Architecture Workshop, Long Beach, CA, USA 2007 Dittmann, Florian: Algorithmic Skeletons for the Programming of Reconfigurable Systems. In: Proceedings of the SEUS 2007, Santorin, Greece, Mai 2007 Dittmann, Florian; Rammig, Franz Josef; Streubühr, Martin; Haubelt, C.; Schallenberg, Andreas; Nebel, Wolfgang: Exploration, Partitioning and Simulation of Reconfigurable Systems. it - Information Technology (vormals it+ti), 3(7), 1. Jan Dittmann, Florian; Rettberg, Achim; Weber, Raphael: Optimization Techniques for a Reconfigurable Self-Timed and Bit-Serial Architecture. In: Proceedings of the SBCCI 2007, Rio de Janeiro, Brazil, Sep Dittmann, Florian; Frank, S.: Caching in Real-time Reconfiguration Port Scheduling. In: Proceedings of the FPL 2007, Amsterdam, The Netherlands, Aug Dittmann, Florian; Rettberg, Achim; Weber, Raphael: Latency Optimization for a Reconfigurable, Self-Timed and Bit-Serial Architecture. In: Proceedings of the ERSA 2007, Las Vegas, USA, Jun Dittmann, Florian; Heimfarth, Tales: Clock Frequency Variation of Partially Reconfigurable Systems. In: Proceedings of the19th International Conference on Architecture of Computing Systems: Workshop Proceedings, S , Frankfurt, Germany, 1. Jan Warkentin, Alexander; Dittmann, Florian: Data Transfer Protocols for a Two Slot Based Reconfigurable Platform. In: Proceedings of the Reconfigurable Communication-centric SoCs (ReCoSoC), Montpellier, France, 2006 Götz, Marcelo; Dittmann, Florian: Reconfigurable Microkernel-based OS: Mechanisms and Methods for Run-Time Reconfiguration. In: Proceedings of the 3rd International Conference on ReConFigurable Computing and FPGAs 2006 (ReConFig'06), S Dittmann, Florian; Götz, M.: Applying Single Processor Algorithms to Schedule Tasks on Reconfigurable Devices Respecting Reconfiguration Times. In: Proceedings of the 13th Reconfigurable Architectures Workshop (RAW 2006), Rhodes Island, Greece, 2006 Dittmann, Florian; Rettberg, Achim: Design of Partially Reconfigurable Systems: From Abstract Modeling to Practical Realization. In: Proceedings of the 1st International Workshop on Reconfigurable Computing Education, Karlsruhe, Germany, 1. Jan Götz, Marcelo; Dittmann, Florian; Pereira, Carlos E.: Deterministic Mechanism for Run-Time Reconfiguration Activities in an OS. In: Proceedings of the 4th International IEEE Conference on Industrial Informatics (INDIN 2006), Singapore, 2006 Götz, Marcelo; Dittmann, Florian: Scheduling Reconfiguration Activities of Run-time Reconfigurable OS Using an Aperiodic Task Server. In: Proceedings of the ARC 2006, Delft, The Netherlands, Mrz Dittmann, Florian; Rettberg, Achim; Weber, Raphael: Towards the Implementation of Path Concepts for a Reconfigurable Bit-Serial Synchronous Architecture. In: Proceedings of the 3rd International Conference on ReConFigurable Computing and FPGA's, S , San Luis Potosi, Mexico, 2006 Dittmann, Florian; Götz, Marcelo: Reconfiguration Time Aware Processing on FPGAs. In: In Proceedings of the Dagstuhl Seminar Nº on Dynamically Reconfigurable Architectures, Dagstuhl, Germany, /26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

25 Part-E Eclipse based development environment for partial bitstream generation Opensource parte.sf.net Tutorial available Coffee break Bitstreams in 2 min 25/26 Anschlusskolloquium, Lübeck, 24. und 25. Mai 2007

26 Thank you for your attention. Florian Dittmann, Franz Rammig Heinz Nixdorf Institute University of Paderborn Fuerstenallee Paderborn Germany Phone: +49 (0) 52 51/ Fax: +49 (0) 52 51/ Thanks to E. Weber, S. Frank, A Warkentin

COMMUNICATION-AWARE COMPONENT ALLOCATION ALGORITHM FOR A HYBRID ARCHITECTURE

COMMUNICATION-AWARE COMPONENT ALLOCATION ALGORITHM FOR A HYBRID ARCHITECTURE COMMUNICATION-AWARE COMPONENT ALLOCATION ALGORITHM FOR A HYBRID ARCHITECTURE Marcelo Götz, 1 Achim Rettberg 2 and Carlos Eduardo Pereira 3 1 Heinz Nixdorf Institute University of Paderborn, Germany mgoetz@uni-paderborn.de

More information

ReconOS: Multithreaded Programming and Execution Models for Reconfigurable Hardware

ReconOS: Multithreaded Programming and Execution Models for Reconfigurable Hardware ReconOS: Multithreaded Programming and Execution Models for Reconfigurable Hardware Enno Lübbers and Marco Platzner Computer Engineering Group University of Paderborn {enno.luebbers, platzner}@upb.de Outline

More information

Operating System Approaches for Dynamically Reconfigurable Hardware

Operating System Approaches for Dynamically Reconfigurable Hardware Operating System Approaches for Dynamically Reconfigurable Hardware Marco Platzner Computer Engineering Group University of Paderborn platzner@upb.de Outline operating systems for reconfigurable hardware

More information

The Design and Implementation of a Low-Latency On-Chip Network

The Design and Implementation of a Low-Latency On-Chip Network The Design and Implementation of a Low-Latency On-Chip Network Robert Mullins 11 th Asia and South Pacific Design Automation Conference (ASP-DAC), Jan 24-27 th, 2006, Yokohama, Japan. Introduction Current

More information

Reconfigurable Computing. On-line communication strategies. Chapter 7

Reconfigurable Computing. On-line communication strategies. Chapter 7 On-line communication strategies Chapter 7 Prof. Dr.-Ing. Jürgen Teich Lehrstuhl für Hardware-Software-Co-Design On-line connection - Motivation Routing-conscious temporal placement algorithms consider

More information

System Level Modeling and Performance Simulation for Dynamic Reconfigurable Computing Systems in SystemC

System Level Modeling and Performance Simulation for Dynamic Reconfigurable Computing Systems in SystemC In Methoden und Beschreibungssprachen zur Modellierung und Verifikation von Schaltungen und Systemen. by Christian Haubelt, Jürgen Teich (Eds.). GI/ITG/GMM-Workshop, Shaker Verlag, Aachen, March 5-7, 2007

More information

Design methodology for multi processor systems design on regular platforms

Design methodology for multi processor systems design on regular platforms Design methodology for multi processor systems design on regular platforms Ph.D in Electronics, Computer Science and Telecommunications Ph.D Student: Davide Rossi Ph.D Tutor: Prof. Roberto Guerrieri Outline

More information

A SELF-CONTROLLED AND DYNAMICALLY RECONFIGURABLE ARCHITECTURE*

A SELF-CONTROLLED AND DYNAMICALLY RECONFIGURABLE ARCHITECTURE* A SELF-CONTROLLED AND DYNAMICALLY RECONFIGURABLE ARCHITECTURE* Florian Dittmann, Achim Rettberg University of Paderborn, Germany roichen@upb.de, achim@c-lab.de Abstract: Keywords: Reconfigurable systems

More information

Low energy and High-performance Embedded Systems Design and Reconfigurable Architectures

Low energy and High-performance Embedded Systems Design and Reconfigurable Architectures Low energy and High-performance Embedded Systems Design and Reconfigurable Architectures Ass. Professor Dimitrios Soudris School of Electrical and Computer Eng., National Technical Univ. of Athens, Greece

More information

A Process Model suitable for defining and programming MpSoCs

A Process Model suitable for defining and programming MpSoCs A Process Model suitable for defining and programming MpSoCs MpSoC-Workshop at Rheinfels, 29-30.6.2010 F. Mayer-Lindenberg, TU Hamburg-Harburg 1. Motivation 2. The Process Model 3. Mapping to MpSoC 4.

More information

Co-synthesis and Accelerator based Embedded System Design

Co-synthesis and Accelerator based Embedded System Design Co-synthesis and Accelerator based Embedded System Design COE838: Embedded Computer System http://www.ee.ryerson.ca/~courses/coe838/ Dr. Gul N. Khan http://www.ee.ryerson.ca/~gnkhan Electrical and Computer

More information

A Novel Design Framework for the Design of Reconfigurable Systems based on NoCs

A Novel Design Framework for the Design of Reconfigurable Systems based on NoCs Politecnico di Milano & EPFL A Novel Design Framework for the Design of Reconfigurable Systems based on NoCs Vincenzo Rana, Ivan Beretta, Donatella Sciuto Donatella Sciuto sciuto@elet.polimi.it Introduction

More information

Optimization of Run-time Reconfigurable Embedded Systems

Optimization of Run-time Reconfigurable Embedded Systems Optimization of Run-time Reconfigurable Embedded Systems Michael Eisenring and Marco Platzner Swiss Federal Institute of Technology (ETH) Zurich, Switzerland {eisenring platzner}@tik.ee.ethz.ch Abstract.

More information

Resource-Efficient Scheduling for Partially-Reconfigurable FPGAbased

Resource-Efficient Scheduling for Partially-Reconfigurable FPGAbased Resource-Efficient Scheduling for Partially-Reconfigurable FPGAbased Systems Andrea Purgato: andrea.purgato@mail.polimi.it Davide Tantillo: davide.tantillo@mail.polimi.it Marco Rabozzi: marco.rabozzi@polimi.it

More information

Single-Path Programming on a Chip-Multiprocessor System

Single-Path Programming on a Chip-Multiprocessor System Single-Path Programming on a Chip-Multiprocessor System Martin Schoeberl, Peter Puschner, and Raimund Kirner Vienna University of Technology, Austria mschoebe@mail.tuwien.ac.at, {peter,raimund}@vmars.tuwien.ac.at

More information

NoC Simulation in Heterogeneous Architectures for PGAS Programming Model

NoC Simulation in Heterogeneous Architectures for PGAS Programming Model NoC Simulation in Heterogeneous Architectures for PGAS Programming Model Sascha Roloff, Andreas Weichslgartner, Frank Hannig, Jürgen Teich University of Erlangen-Nuremberg, Germany Jan Heißwolf Karlsruhe

More information

Fast dynamic and partial reconfiguration Data Path

Fast dynamic and partial reconfiguration Data Path Fast dynamic and partial reconfiguration Data Path with low Michael Hübner 1, Diana Göhringer 2, Juanjo Noguera 3, Jürgen Becker 1 1 Karlsruhe Institute t of Technology (KIT), Germany 2 Fraunhofer IOSB,

More information

Runtime Adaptation of Application Execution under Thermal and Power Constraints in Massively Parallel Processor Arrays

Runtime Adaptation of Application Execution under Thermal and Power Constraints in Massively Parallel Processor Arrays Runtime Adaptation of Application Execution under Thermal and Power Constraints in Massively Parallel Processor Arrays Éricles Sousa 1, Frank Hannig 1, Jürgen Teich 1, Qingqing Chen 2, and Ulf Schlichtmann

More information

From Dynamic Reconfiguration to Self-Configuration Achievements of the DFG Priority Program 1148 Reconfigurable Computing

From Dynamic Reconfiguration to Self-Configuration Achievements of the DFG Priority Program 1148 Reconfigurable Computing From Dynamic Reconfiguration to Self-Configuration Achievements of the DFG Priority Program 1148 Reconfigurable Computing Jürgen Teich Chair for Hardware-Software-Co-Design University of Erlangen-Nuremberg

More information

SDR Forum Technical Conference 2007

SDR Forum Technical Conference 2007 THE APPLICATION OF A NOVEL ADAPTIVE DYNAMIC VOLTAGE SCALING SCHEME TO SOFTWARE DEFINED RADIO Craig Dolwin (Toshiba Research Europe Ltd, Bristol, UK, craig.dolwin@toshiba-trel.com) ABSTRACT This paper presents

More information

A Device-Controlled Dynamic Configuration Framework Supporting Heterogeneous Resource Management

A Device-Controlled Dynamic Configuration Framework Supporting Heterogeneous Resource Management A Device-Controlled Dynamic Configuration Framework Supporting Heterogeneous Resource Management H. Tan and R. F. DeMara Department of Electrical and Computer Engineering University of Central Florida

More information

Hardware/Software Codesign

Hardware/Software Codesign Hardware/Software Codesign SS 2016 Prof. Dr. Christian Plessl High-Performance IT Systems group University of Paderborn Version 2.2.0 2016-04-08 how to design a "digital TV set top box" Motivating Example

More information

Previous Exam Questions System-on-a-Chip (SoC) Design

Previous Exam Questions System-on-a-Chip (SoC) Design This image cannot currently be displayed. EE382V Problem: System Analysis (20 Points) This is a simple single microprocessor core platform with a video coprocessor, which is configured to process 32 bytes

More information

ReNoC: A Network-on-Chip Architecture with Reconfigurable Topology

ReNoC: A Network-on-Chip Architecture with Reconfigurable Topology 1 ReNoC: A Network-on-Chip Architecture with Reconfigurable Topology Mikkel B. Stensgaard and Jens Sparsø Technical University of Denmark Technical University of Denmark Outline 2 Motivation ReNoC Basic

More information

FPGA. Agenda 11/05/2016. Scheduling tasks on Reconfigurable FPGA architectures. Definition. Overview. Characteristics of the CLB.

FPGA. Agenda 11/05/2016. Scheduling tasks on Reconfigurable FPGA architectures. Definition. Overview. Characteristics of the CLB. Agenda The topics that will be addressed are: Scheduling tasks on Reconfigurable FPGA architectures Mauro Marinoni ReTiS Lab, TeCIP Institute Scuola superiore Sant Anna - Pisa Overview on basic characteristics

More information

A Reconfigurable Crossbar Switch with Adaptive Bandwidth Control for Networks-on

A Reconfigurable Crossbar Switch with Adaptive Bandwidth Control for Networks-on A Reconfigurable Crossbar Switch with Adaptive Bandwidth Control for Networks-on on-chip Donghyun Kim, Kangmin Lee, Se-joong Lee and Hoi-Jun Yoo Semiconductor System Laboratory, Dept. of EECS, Korea Advanced

More information

Enhancing Resource Utilization with Design Alternatives in Runtime Reconfigurable Systems

Enhancing Resource Utilization with Design Alternatives in Runtime Reconfigurable Systems Enhancing Resource Utilization with Design Alternatives in Runtime Reconfigurable Systems Alexander Wold, Dirk Koch, Jim Torresen Department of Informatics, University of Oslo, Norway Email: {alexawo,koch,jimtoer}@ifi.uio.no

More information

Throughput-optimizing Compilation of Dataflow Applications for Multi-Cores using Quasi-Static Scheduling

Throughput-optimizing Compilation of Dataflow Applications for Multi-Cores using Quasi-Static Scheduling Throughput-optimizing Compilation of Dataflow Applications for Multi-Cores using Quasi-Static Scheduling Tobias Schwarzer 1, Joachim Falk 1, Michael Glaß 1, Jürgen Teich 1, Christian Zebelein 2, Christian

More information

An application-based EDF scheduler for OSEK/VDX

An application-based EDF scheduler for OSEK/VDX An application-based EDF scheduler for OSEK/VDX Claas Diederichs INCHRON GmbH 14482 Potsdam, Germany claas.diederichs@inchron.de Ulrich Margull 1 mal 1 Software GmbH 90762 Fürth, Germany margull@1mal1.com

More information

Multi Core Real Time Task Allocation Algorithm for the Resource Sharing Gravitation in Peer to Peer Network

Multi Core Real Time Task Allocation Algorithm for the Resource Sharing Gravitation in Peer to Peer Network Multi Core Real Time Task Allocation Algorithm for the Resource Sharing Gravitation in Peer to Peer Network Hua Huang Modern Education Technology Center, Information Teaching Applied Technology Extension

More information

Review: Creating a Parallel Program. Programming for Performance

Review: Creating a Parallel Program. Programming for Performance Review: Creating a Parallel Program Can be done by programmer, compiler, run-time system or OS Steps for creating parallel program Decomposition Assignment of tasks to processes Orchestration Mapping (C)

More information

Cost-and Power Optimized FPGA based System Integration: Methodologies and Integration of a Lo

Cost-and Power Optimized FPGA based System Integration: Methodologies and Integration of a Lo Cost-and Power Optimized FPGA based System Integration: Methodologies and Integration of a Low-Power Capacity- based Measurement Application on Xilinx FPGAs Abstract The application of Field Programmable

More information

Multimedia-Systems. Operating Systems. Prof. Dr.-Ing. Ralf Steinmetz Prof. Dr. rer. nat. Max Mühlhäuser Prof. Dr.-Ing. Wolfgang Effelsberg

Multimedia-Systems. Operating Systems. Prof. Dr.-Ing. Ralf Steinmetz Prof. Dr. rer. nat. Max Mühlhäuser Prof. Dr.-Ing. Wolfgang Effelsberg Multimedia-Systems Operating Systems Prof. Dr.-Ing. Ralf Steinmetz Prof. Dr. rer. nat. Max Mühlhäuser Prof. Dr.-Ing. Wolfgang Effelsberg WE: University of Mannheim, Dept. of Computer Science Praktische

More information

Performance Analysis of OS Scheduling for a Reconfigurable Computing Environment

Performance Analysis of OS Scheduling for a Reconfigurable Computing Environment Indian Journal of Science and Technology, Vol 8(22), DOI: 10.17485/ijst/2015/v8i22/79107, September 2015 ISSN (Print) : 0974-6846 ISSN (Online) : 0974-5645 Performance Analysis of OS Scheduling for a Reconfigurable

More information

ParalleX. A Cure for Scaling Impaired Parallel Applications. Hartmut Kaiser

ParalleX. A Cure for Scaling Impaired Parallel Applications. Hartmut Kaiser ParalleX A Cure for Scaling Impaired Parallel Applications Hartmut Kaiser (hkaiser@cct.lsu.edu) 2 Tianhe-1A 2.566 Petaflops Rmax Heterogeneous Architecture: 14,336 Intel Xeon CPUs 7,168 Nvidia Tesla M2050

More information

Automated RTR Temporal Partitioning for Reconfigurable Embedded Real-Time System Design

Automated RTR Temporal Partitioning for Reconfigurable Embedded Real-Time System Design Automated RTR Temporal Partitioning for Reconfigurable Embedded Real-Time System Design C. Tanougast, Y. Berviller, P. Brunet and S. Weber L. I. E. N. Laboratoire d Instrumentation Electronique de Nancy

More information

Improving Energy Efficiency of Block-Matching Motion Estimation Using Dynamic Partial Reconfiguration

Improving Energy Efficiency of Block-Matching Motion Estimation Using Dynamic Partial Reconfiguration , pp.517-521 http://dx.doi.org/10.14257/astl.2015.1 Improving Energy Efficiency of Block-Matching Motion Estimation Using Dynamic Partial Reconfiguration Jooheung Lee 1 and Jungwon Cho 2, * 1 Dept. of

More information

CUDA GPGPU Workshop 2012

CUDA GPGPU Workshop 2012 CUDA GPGPU Workshop 2012 Parallel Programming: C thread, Open MP, and Open MPI Presenter: Nasrin Sultana Wichita State University 07/10/2012 Parallel Programming: Open MP, MPI, Open MPI & CUDA Outline

More information

DESIGN AND IMPLEMENTATION ARCHITECTURE FOR RELIABLE ROUTER RKT SWITCH IN NOC

DESIGN AND IMPLEMENTATION ARCHITECTURE FOR RELIABLE ROUTER RKT SWITCH IN NOC International Journal of Engineering and Manufacturing Science. ISSN 2249-3115 Volume 8, Number 1 (2018) pp. 65-76 Research India Publications http://www.ripublication.com DESIGN AND IMPLEMENTATION ARCHITECTURE

More information

Self-Adaptive FPGA-Based Image Processing Filters Using Approximate Arithmetics

Self-Adaptive FPGA-Based Image Processing Filters Using Approximate Arithmetics Self-Adaptive FPGA-Based Image Processing Filters Using Approximate Arithmetics Jutta Pirkl, Andreas Becher, Jorge Echavarria, Jürgen Teich, and Stefan Wildermann Hardware/Software Co-Design, Friedrich-Alexander-Universität

More information

Exploiting Dynamically Changing Parallelism with a Reconfigurable Array of Homogeneous Sub-cores (a.k.a. Field Programmable Core Array or FPCA)

Exploiting Dynamically Changing Parallelism with a Reconfigurable Array of Homogeneous Sub-cores (a.k.a. Field Programmable Core Array or FPCA) Exploiting Dynamically Changing Parallelism with a Reconfigurable Array of Homogeneous Sub-cores (a.k.a. Field Programmable Core Array or FPCA) Sponsored by SRC and NSF as a Part of Multicore Chip Design

More information

Processor Architectures At A Glance: M.I.T. Raw vs. UC Davis AsAP

Processor Architectures At A Glance: M.I.T. Raw vs. UC Davis AsAP Processor Architectures At A Glance: M.I.T. Raw vs. UC Davis AsAP Presenter: Course: EEC 289Q: Reconfigurable Computing Course Instructor: Professor Soheil Ghiasi Outline Overview of M.I.T. Raw processor

More information

Fault Tolerance Analysis of Distributed Reconfigurable Systems Using SAT-Based Techniques

Fault Tolerance Analysis of Distributed Reconfigurable Systems Using SAT-Based Techniques In Field-Programmable Logic and Applications by Peter Y. K. Cheung, George A. Constantinides, and Jose T. de Sousa (Eds.). In Lecture Notes in Computer Science (LNCS), Volume 2778, c Springer, Berlin,

More information

EE382V: System-on-a-Chip (SoC) Design

EE382V: System-on-a-Chip (SoC) Design EE382V: System-on-a-Chip (SoC) Design Lecture 10 Task Partitioning Sources: Prof. Margarida Jacome, UT Austin Prof. Lothar Thiele, ETH Zürich Andreas Gerstlauer Electrical and Computer Engineering University

More information

Generation of Multigrid-based Numerical Solvers for FPGA Accelerators

Generation of Multigrid-based Numerical Solvers for FPGA Accelerators Generation of Multigrid-based Numerical Solvers for FPGA Accelerators Christian Schmitt, Moritz Schmid, Frank Hannig, Jürgen Teich, Sebastian Kuckuk, Harald Köstler Hardware/Software Co-Design, System

More information

A High Integrity Distributed Deterministic Java Environment. WORDS 2002 January 7, San Diego CA

A High Integrity Distributed Deterministic Java Environment. WORDS 2002 January 7, San Diego CA A High Integrity Distributed Deterministic Java Environment WORDS 2002 January 7, San Diego CA João Ventura Skysoft Portugal SA Fridtjof Siebert & Andy Walter aicas GmbH James Hunt Forschungszentrum Informatik

More information

Chapter 19: Real-Time Systems. Operating System Concepts 8 th Edition,

Chapter 19: Real-Time Systems. Operating System Concepts 8 th Edition, Chapter 19: Real-Time Systems, Silberschatz, Galvin and Gagne 2009 Chapter 19: Real-Time Systems System Characteristics Features of Real-Time Systems Implementing Real-Time Operating Systems Real-Time

More information

Uniprocessor Scheduling. Basic Concepts Scheduling Criteria Scheduling Algorithms. Three level scheduling

Uniprocessor Scheduling. Basic Concepts Scheduling Criteria Scheduling Algorithms. Three level scheduling Uniprocessor Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms Three level scheduling 2 1 Types of Scheduling 3 Long- and Medium-Term Schedulers Long-term scheduler Determines which programs

More information

D.A.S.T. Defragmentation And Scheduling of Tasks University of Twente Computer Science

D.A.S.T. Defragmentation And Scheduling of Tasks University of Twente Computer Science D.A.S.T. Defragmentation And Scheduling of Tasks University of Twente Computer Science Frank Vlaardingerbroek Joost van der Linden Stefan ten Heggeler Ruud Groen 14th November 2003 Abstract When mapping

More information

Computers as Components Principles of Embedded Computing System Design

Computers as Components Principles of Embedded Computing System Design Computers as Components Principles of Embedded Computing System Design Third Edition Marilyn Wolf ELSEVIER AMSTERDAM BOSTON HEIDELBERG LONDON NEW YORK OXFORD PARIS SAN DIEGO SAN FRANCISCO SINGAPORE SYDNEY

More information

ReconOS: An RTOS Supporting Hardware and Software Threads

ReconOS: An RTOS Supporting Hardware and Software Threads ReconOS: An RTOS Supporting Hardware and Software Threads Enno Lübbers and Marco Platzner Computer Engineering Group University of Paderborn marco.platzner@computer.org Overview the ReconOS project programming

More information

Hardware Accelerators

Hardware Accelerators Hardware Accelerators José Costa Software for Embedded Systems Departamento de Engenharia Informática (DEI) Instituto Superior Técnico 2014-04-08 José Costa (DEI/IST) Hardware Accelerators 1 Outline Hardware

More information

Managing Dynamic Reconfiguration Overhead in Systems-on-a-Chip Design Using Reconfigurable Datapaths and Optimized Interconnection Networks

Managing Dynamic Reconfiguration Overhead in Systems-on-a-Chip Design Using Reconfigurable Datapaths and Optimized Interconnection Networks Managing Dynamic Reconfiguration Overhead in Systems-on-a-Chip Design Using Reconfigurable Datapaths and Optimized Interconnection Networks Zhining Huang, Sharad Malik Electrical Engineering Department

More information

CSI3131 Final Exam Review

CSI3131 Final Exam Review CSI3131 Final Exam Review Final Exam: When: April 24, 2015 2:00 PM Where: SMD 425 File Systems I/O Hard Drive Virtual Memory Swap Memory Storage and I/O Introduction CSI3131 Topics Process Computing Systems

More information

Workloads Programmierung Paralleler und Verteilter Systeme (PPV)

Workloads Programmierung Paralleler und Verteilter Systeme (PPV) Workloads Programmierung Paralleler und Verteilter Systeme (PPV) Sommer 2015 Frank Feinbube, M.Sc., Felix Eberhardt, M.Sc., Prof. Dr. Andreas Polze Workloads 2 Hardware / software execution environment

More information

OpenMP Device Offloading to FPGA Accelerators. Lukas Sommer, Jens Korinth, Andreas Koch

OpenMP Device Offloading to FPGA Accelerators. Lukas Sommer, Jens Korinth, Andreas Koch OpenMP Device Offloading to FPGA Accelerators Lukas Sommer, Jens Korinth, Andreas Koch Motivation Increasing use of heterogeneous systems to overcome CPU power limitations 2017-07-12 OpenMP FPGA Device

More information

AutoTVM & Device Fleet

AutoTVM & Device Fleet AutoTVM & Device Fleet ` Learning to Optimize Tensor Programs Frameworks High-level data flow graph and optimizations Hardware Learning to Optimize Tensor Programs Frameworks High-level data flow graph

More information

ECE519 Advanced Operating Systems

ECE519 Advanced Operating Systems IT 540 Operating Systems ECE519 Advanced Operating Systems Prof. Dr. Hasan Hüseyin BALIK (10 th Week) (Advanced) Operating Systems 10. Multiprocessor, Multicore and Real-Time Scheduling 10. Outline Multiprocessor

More information

STLAC: A Spatial and Temporal Locality-Aware Cache and Networkon-Chip

STLAC: A Spatial and Temporal Locality-Aware Cache and Networkon-Chip STLAC: A Spatial and Temporal Locality-Aware Cache and Networkon-Chip Codesign for Tiled Manycore Systems Mingyu Wang and Zhaolin Li Institute of Microelectronics, Tsinghua University, Beijing 100084,

More information

Process Scheduling Part 2

Process Scheduling Part 2 Operating Systems and Computer Networks Process Scheduling Part 2 pascal.klein@uni-due.de Alexander Maxeiner, M.Sc. Faculty of Engineering Agenda Process Management Time Sharing Synchronization of Processes

More information

Department of Computer Science Institute for System Architecture, Operating Systems Group REAL-TIME MICHAEL ROITZSCH OVERVIEW

Department of Computer Science Institute for System Architecture, Operating Systems Group REAL-TIME MICHAEL ROITZSCH OVERVIEW Department of Computer Science Institute for System Architecture, Operating Systems Group REAL-TIME MICHAEL ROITZSCH OVERVIEW 2 SO FAR talked about in-kernel building blocks: threads memory IPC drivers

More information

Design Space Exploration

Design Space Exploration Design Space Exploration SS 2012 Jun.-Prof. Dr. Christian Plessl Custom Computing University of Paderborn Version 1.1.0 2012-06-15 Overview motivation for design space exploration design space exploration

More information

Scheduling Algorithm and Analysis

Scheduling Algorithm and Analysis Scheduling Algorithm and Analysis Model and Cyclic Scheduling (Module 27) Yann-Hang Lee Arizona State University yhlee@asu.edu (480) 727-7507 Summer 2014 Task Scheduling Schedule: to determine which task

More information

Remote and Partial Reconfiguration of FPGAs: Tools and Trends

Remote and Partial Reconfiguration of FPGAs: Tools and Trends Remote and Partial Reconfiguration of FPGAs: Tools and Trends Daniel Mesquita, Fernando Moraes, José palma, Leandro Moller, Ney Calazans Laboratoire de Informatique, de Robotique et de Microéletronique

More information

(b) External fragmentation can happen in a virtual memory paging system.

(b) External fragmentation can happen in a virtual memory paging system. Alexandria University Faculty of Engineering Electrical Engineering - Communications Spring 2015 Final Exam CS333: Operating Systems Wednesday, June 17, 2015 Allowed Time: 3 Hours Maximum: 75 points Note:

More information

Infrastructure for Autonomous Mobile Robots Communication and Coordination

Infrastructure for Autonomous Mobile Robots Communication and Coordination 90 Work in Progress Session Infrastructure for Autonomous Mobile Robots Communication and Coordination Marcelo M. Sobral, Leandro B. Becker Dept of Automation and Systems Universidade Federal de Santa

More information

MEMORY/RESOURCE MANAGEMENT IN MULTICORE SYSTEMS

MEMORY/RESOURCE MANAGEMENT IN MULTICORE SYSTEMS MEMORY/RESOURCE MANAGEMENT IN MULTICORE SYSTEMS INSTRUCTOR: Dr. MUHAMMAD SHAABAN PRESENTED BY: MOHIT SATHAWANE AKSHAY YEMBARWAR WHAT IS MULTICORE SYSTEMS? Multi-core processor architecture means placing

More information

2. Introduction to Software for Embedded Systems

2. Introduction to Software for Embedded Systems 2. Introduction to Software for Embedded Systems Lothar Thiele ETH Zurich, Switzerland 2-1 Contents of Lectures (Lothar Thiele) 1. Introduction to Embedded System Design 2. Software for Embedded Systems

More information

MULTI-OBJECTIVE DESIGN SPACE EXPLORATION OF EMBEDDED SYSTEM PLATFORMS

MULTI-OBJECTIVE DESIGN SPACE EXPLORATION OF EMBEDDED SYSTEM PLATFORMS MULTI-OBJECTIVE DESIGN SPACE EXPLORATION OF EMBEDDED SYSTEM PLATFORMS Jan Madsen, Thomas K. Stidsen, Peter Kjærulf, Shankar Mahadevan Informatics and Mathematical Modelling Technical University of Denmark

More information

Implementing Scheduling Algorithms. Real-Time and Embedded Systems (M) Lecture 9

Implementing Scheduling Algorithms. Real-Time and Embedded Systems (M) Lecture 9 Implementing Scheduling Algorithms Real-Time and Embedded Systems (M) Lecture 9 Lecture Outline Implementing real time systems Key concepts and constraints System architectures: Cyclic executive Microkernel

More information

Program-Driven Fine-Grained Power Management for the Reconfigurable Mesh

Program-Driven Fine-Grained Power Management for the Reconfigurable Mesh Program-Driven Fine-Grained Power Management for the Reconfigurable Mesh Heiner Giefers, Marco Platzner Computer Engineering Group University of Paderborn {hgiefers, platzner}@upb.de Outline 1. Introduction

More information

Concurrent activities in daily life. Real world exposed programs. Scheduling of programs. Tasks in engine system. Engine system

Concurrent activities in daily life. Real world exposed programs. Scheduling of programs. Tasks in engine system. Engine system Real world exposed programs Programs written to interact with the real world, outside the computer Programs handle input and output of data in pace matching the real world processes Necessitates ability

More information

Introduction to Real-Time Systems ECE 397-1

Introduction to Real-Time Systems ECE 397-1 Introduction to Real-Time Systems ECE 97-1 Northwestern University Department of Computer Science Department of Electrical and Computer Engineering Teachers: Robert Dick Peter Dinda Office: L477 Tech 8,

More information

A HEURISTIC APPROACH TO SCHEDULE PERIODIC REAL-TIME TASKS ON RECONFIGURABLE HARDWARE

A HEURISTIC APPROACH TO SCHEDULE PERIODIC REAL-TIME TASKS ON RECONFIGURABLE HARDWARE A HEURISTIC APPROACH TO SCHEDULE PERIODIC REAL-TIME TASKS ON RECONFIGURABLE HARDWARE Klaus Danne Department of Computer Science University of Paderborn Marco Platzner Department of Computer Science University

More information

Impact of Runtime Architectures on Control System Stability

Impact of Runtime Architectures on Control System Stability Impact of Runtime Architectures on Control System Stability P. Feiler, J. Hansson Software Engineering Institute, Pittsburgh, PA Abstract: Control systems are sensitive to the endto-end latency and age

More information

FCUDA-SoC: Platform Integration for Field-Programmable SoC with the CUDAto-FPGA

FCUDA-SoC: Platform Integration for Field-Programmable SoC with the CUDAto-FPGA 1 FCUDA-SoC: Platform Integration for Field-Programmable SoC with the CUDAto-FPGA Compiler Tan Nguyen 1, Swathi Gurumani 1, Kyle Rupnow 1, Deming Chen 2 1 Advanced Digital Sciences Center, Singapore {tan.nguyen,

More information

Resource Reservation & Resource Servers

Resource Reservation & Resource Servers Resource Reservation & Resource Servers Resource Reservation Application Hard real-time, Soft real-time, Others? Platform Hardware Resources: CPU cycles, memory blocks 1 Applications Hard-deadline tasks

More information

A Virtualized Quality of Service Packet Scheduler Accelerator. Kangtao Kendall Chuang

A Virtualized Quality of Service Packet Scheduler Accelerator. Kangtao Kendall Chuang A Virtualized Quality of Service Packet Scheduler Accelerator A Thesis Presented to The Academic Faculty by Kangtao Kendall Chuang In Partial Fulfillment of the Requirements for the Degree Master of Science

More information

A Single-Path Chip-Multiprocessor System

A Single-Path Chip-Multiprocessor System A Single-Path Chip-Multiprocessor System Martin Schoeberl, Peter Puschner, and Raimund Kirner Institute of Computer Engineering Vienna University of Technology, Austria mschoebe@mail.tuwien.ac.at, {peter,raimund}@vmars.tuwien.ac.at

More information

Multithreaded Processors. Department of Electrical Engineering Stanford University

Multithreaded Processors. Department of Electrical Engineering Stanford University Lecture 12: Multithreaded Processors Department of Electrical Engineering Stanford University http://eeclass.stanford.edu/ee382a Lecture 12-1 The Big Picture Previous lectures: Core design for single-thread

More information

Performance Comparison between Linear RVE and Linear Systolic Array Implementations of the Smith-Waterman Algorithm

Performance Comparison between Linear RVE and Linear Systolic Array Implementations of the Smith-Waterman Algorithm Performance Comparison between Linear RVE and Linear Systolic Array Implementations of the Smith-Waterman Algorithm Laiq Hasan Zaid Al-Ars Delft University of Technology Computer Engineering Laboratory

More information

Real-Time Mixed-Criticality Wormhole Networks

Real-Time Mixed-Criticality Wormhole Networks eal-time Mixed-Criticality Wormhole Networks Leandro Soares Indrusiak eal-time Systems Group Department of Computer Science University of York United Kingdom eal-time Systems Group 1 Outline Wormhole Networks

More information

Simplified design flow for embedded systems

Simplified design flow for embedded systems Simplified design flow for embedded systems 2005/12/02-1- Reuse of standard software components Knowledge from previous designs to be made available in the form of intellectual property (IP, for SW & HW).

More information

CENTRUM INDUSTRIAL IT - Where IT meets Automation -

CENTRUM INDUSTRIAL IT - Where IT meets Automation - CENTRUM INDUSTRIAL IT - Where IT meets Automation - M.Sc. Jahanzaib Imtiaz (Institut Industrial IT) Prof. Dr.-Ing. Jürgen Jasperneite (Fraunhofer IOSB-INA) Institut Industrial IT/Fraunhofer IOSB-INA Family!

More information

Software Pipelining for Coarse-Grained Reconfigurable Instruction Set Processors

Software Pipelining for Coarse-Grained Reconfigurable Instruction Set Processors Software Pipelining for Coarse-Grained Reconfigurable Instruction Set Processors Francisco Barat, Murali Jayapala, Pieter Op de Beeck and Geert Deconinck K.U.Leuven, Belgium. {f-barat, j4murali}@ieee.org,

More information

Frequently asked questions from the previous class survey

Frequently asked questions from the previous class survey CS 370: OPERATING SYSTEMS [CPU SCHEDULING] Shrideep Pallickara Computer Science Colorado State University L15.1 Frequently asked questions from the previous class survey Could we record burst times in

More information

Thanks to... Composing and synchronizing real-time components through virtual platforms in vehicular systems

Thanks to... Composing and synchronizing real-time components through virtual platforms in vehicular systems Thanks to... Composing and synchronizing real-time components through virtual platforms in vehicular systems Promotor and co-promotor: Prof. Johan J. Lukkien Dr. Reinder J. Bril Martijn van den Heuvel

More information

Multimedia Systems 2011/2012

Multimedia Systems 2011/2012 Multimedia Systems 2011/2012 System Architecture Prof. Dr. Paul Müller University of Kaiserslautern Department of Computer Science Integrated Communication Systems ICSY http://www.icsy.de Sitemap 2 Hardware

More information

High Performance Interconnect and NoC Router Design

High Performance Interconnect and NoC Router Design High Performance Interconnect and NoC Router Design Brinda M M.E Student, Dept. of ECE (VLSI Design) K.Ramakrishnan College of Technology Samayapuram, Trichy 621 112 brinda18th@gmail.com Devipoonguzhali

More information

Performance of Multihop Communications Using Logical Topologies on Optical Torus Networks

Performance of Multihop Communications Using Logical Topologies on Optical Torus Networks Performance of Multihop Communications Using Logical Topologies on Optical Torus Networks X. Yuan, R. Melhem and R. Gupta Department of Computer Science University of Pittsburgh Pittsburgh, PA 156 fxyuan,

More information

UML for SOC Design GRANT MARTIN WOLFGANG MÜLLER. Edited by. Tensilica Inc., Santa Clara, CA, USA. and. University of Paderborn, Germany

UML for SOC Design GRANT MARTIN WOLFGANG MÜLLER. Edited by. Tensilica Inc., Santa Clara, CA, USA. and. University of Paderborn, Germany UML FOR SOC DESIGN UML for SOC Design Edited by GRANT MARTIN Tensilica Inc., Santa Clara, CA, USA and WOLFGANG MÜLLER University of Paderborn, Germany A C.I.P. Catalogue record for this book is available

More information

CONFIGURABLE HYBRIDKERNEL FOR EMBEDDED REAL-TIME SYSTEMS

CONFIGURABLE HYBRIDKERNEL FOR EMBEDDED REAL-TIME SYSTEMS CONFIGURABLE HYBRIDKERNEL FOR EMBEDDED REAL-TIME SYSTEMS Heinz Nixdorf Institute, University Paderborn Fürstenallee 11, 33102 Paderborn, Germany timo.kerstan@uni-paderborn.de, zottel@uni-paderborn.de Abstract:

More information

Abstract A SCALABLE, PARALLEL, AND RECONFIGURABLE DATAPATH ARCHITECTURE

Abstract A SCALABLE, PARALLEL, AND RECONFIGURABLE DATAPATH ARCHITECTURE A SCALABLE, PARALLEL, AND RECONFIGURABLE DATAPATH ARCHITECTURE Reiner W. Hartenstein, Rainer Kress, Helmut Reinig University of Kaiserslautern Erwin-Schrödinger-Straße, D-67663 Kaiserslautern, Germany

More information

A Dynamic NOC Arbitration Technique using Combination of VCT and XY Routing

A Dynamic NOC Arbitration Technique using Combination of VCT and XY Routing 727 A Dynamic NOC Arbitration Technique using Combination of VCT and XY Routing 1 Bharati B. Sayankar, 2 Pankaj Agrawal 1 Electronics Department, Rashtrasant Tukdoji Maharaj Nagpur University, G.H. Raisoni

More information

Mapping of Applications to Multi-Processor Systems

Mapping of Applications to Multi-Processor Systems Mapping of Applications to Multi-Processor Systems Peter Marwedel TU Dortmund, Informatik 12 Germany Marwedel, 2003 Graphics: Alexandra Nolte, Gesine 2011 年 12 月 09 日 These slides use Microsoft clip arts.

More information

Final Lecture. A few minutes to wrap up and add some perspective

Final Lecture. A few minutes to wrap up and add some perspective Final Lecture A few minutes to wrap up and add some perspective 1 2 Instant replay The quarter was split into roughly three parts and a coda. The 1st part covered instruction set architectures the connection

More information

Memory centric thread synchronization on platform FPGAs

Memory centric thread synchronization on platform FPGAs Memory centric thread synchronization on platform FPGAs Chidamber Kulkarni Xilinx Inc San Jose, Ca Chidamber.Kulkarni@xilinx.com Gordon Brebner Xilinx Inc San Jose, Ca Gordon.Brebner@xilinx.com Abstract

More information

Online Hardware Task Scheduling and Placement Algorithm on Partially Reconfigurable Devices

Online Hardware Task Scheduling and Placement Algorithm on Partially Reconfigurable Devices Online Hardware Task Scheduling and Placement Algorithm on Partially Reconfigurable Devices Thomas Marconi, Yi Lu, Koen Bertels, and Georgi Gaydadjiev Computer Engineering Laboratory, EEMCS TU Delft, The

More information

Synthesizable FPGA Fabrics Targetable by the VTR CAD Tool

Synthesizable FPGA Fabrics Targetable by the VTR CAD Tool Synthesizable FPGA Fabrics Targetable by the VTR CAD Tool Jin Hee Kim and Jason Anderson FPL 2015 London, UK September 3, 2015 2 Motivation for Synthesizable FPGA Trend towards ASIC design flow Design

More information

Resource Efficiency of Scalable Processor Architectures for SDR-based Applications

Resource Efficiency of Scalable Processor Architectures for SDR-based Applications Resource Efficiency of Scalable Processor Architectures for SDR-based Applications Thorsten Jungeblut 1, Johannes Ax 2, Gregor Sievers 2, Boris Hübener 2, Mario Porrmann 2, Ulrich Rückert 1 1 Cognitive

More information