From Temporal Partitioning and Temporal Placement to Algorithmic Skeletons
1 From Temporal Partitioning and Temporal Placement to Algorithmic Skeletons
Florian Dittmann, Franz J. Rammig
Heinz Nixdorf Institute, University of Paderborn, Germany
2 Motivation
Making reconfigurable computing mature
- Industrialization
Capabilities
- Processing in parallel
- Runtime reconfiguration
- Partial reconfiguration
- Space and time
- Etc.
Abstraction
- Layers
- Beneficial methods
Anschlusskolloquium, Lübeck, 24 and 25 May 2007
3 Overview
Motivation
Partitioning methods and their application
- Lessons learned
Layered approach
- Specification Graph Approach
- Reconfiguration Port Scheduling
- Algorithmic Skeletons
Cooperation
Part-E
Conclusion
4 Partitioning
Applying the spectral method on coarse-grained systems
- Mesh-based nearest neighbor communication
- 2D topology
Mapping
- Data flow graphs
- Resource efficient
- Communication optimized
[Figure labels: PE, x, y]
5 Partitioning
Temporal + Spatial Partitioning/Placement
Basis
- ASAP scheduling
- Spectral placement
Combination
- Focusing on one level
- Location of the nodes in the spectral placement
  - Placement of the extracted nodes on PE
Benefit
- ASAP: precedence constraints
- Spectral method: overall closeness respected
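The ASAP basis named on this slide can be sketched as follows. This is a minimal illustration, assuming unit-time nodes and a data flow graph given as a successor dictionary; `asap_levels`, `partitions` and the example graph `dfg` are hypothetical names, not taken from the talk. Each ASAP level is a candidate temporal partition whose nodes would afterwards be placed on PEs by the spectral method, which is not modelled here.

```python
from collections import deque

def asap_levels(successors):
    """Return {node: level}: the earliest step at which each node can start,
    assuming every node takes exactly one time step."""
    nodes = set(successors)
    for targets in successors.values():
        nodes.update(targets)
    indegree = {n: 0 for n in nodes}
    for targets in successors.values():
        for t in targets:
            indegree[t] += 1

    level = {n: 0 for n in nodes}
    ready = deque(sorted(n for n in nodes if indegree[n] == 0))
    visited = 0
    while ready:
        n = ready.popleft()
        visited += 1
        for t in successors.get(n, ()):
            # A node starts one step after its latest predecessor.
            level[t] = max(level[t], level[n] + 1)
            indegree[t] -= 1
            if indegree[t] == 0:
                ready.append(t)
    if visited != len(nodes):
        raise ValueError("graph contains a cycle")
    return level

def partitions(level):
    """Group nodes by ASAP level: one group per temporal partition."""
    groups = {}
    for node, lv in level.items():
        groups.setdefault(lv, []).append(node)
    return [sorted(groups[lv]) for lv in sorted(groups)]

# Hypothetical data flow graph: a and b feed c, b feeds d, c and d feed e.
dfg = {"a": ["c"], "b": ["c", "d"], "c": ["e"], "d": ["e"], "e": []}
levels = asap_levels(dfg)
print(partitions(levels))
```

The precedence constraints are respected by construction, because a node never lands on a level at or before any of its predecessors.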
6 Partitioning
Two Slot Model
Execution environment of exactly two slots
- Alternating execution of tasks
- Hiding of the reconfiguration overhead
Exploiting partial run-time reconfiguration
Challenges
- Architecture demands for communication infrastructure
- Partial bitstream generation
- Task mapping
- Partitioning
[Figure labels: Slot A, Controller, Slot B, FPGA, Freq., time]
7 Two Slot Model
Partitioning of the input algorithms
[Figure panels: d) Partitioning, e) Inclusion of buffers]
Scheduling
- Simple dispatching
- Single server for two machines
[Figure labels: Slot A, Slot B, time]
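How alternating execution hides the reconfiguration overhead can be illustrated with a small timing model. This is a sketch under stated assumptions, not the dispatcher from the talk: the partitions form a chain (each one starts only after its predecessor has finished), they alternate strictly between slot A and slot B, and a single reconfiguration port loads one slot at a time. The function names and the example numbers are hypothetical.

```python
def two_slot_schedule(tasks):
    """tasks: list of (reconfig_time, exec_time), executed in the given order.
    Returns one tuple (slot, reconfig_start, exec_start, exec_end) per task."""
    slot_free = [0, 0]   # time at which slot A / slot B becomes free again
    port_free = 0        # the single reconfiguration port
    prev_finish = 0      # ASSUMPTION: tasks form a chain
    schedule = []
    for k, (t_rec, t_exe) in enumerate(tasks):
        slot = k % 2                      # alternate between the two slots
        rec_start = max(port_free, slot_free[slot])
        rec_end = rec_start + t_rec
        exe_start = max(rec_end, prev_finish)
        exe_end = exe_start + t_exe
        port_free = rec_end
        slot_free[slot] = exe_end
        prev_finish = exe_end
        schedule.append(("AB"[slot], rec_start, exe_start, exe_end))
    return schedule

def serial_makespan(tasks):
    """Makespan when every reconfiguration is followed by its execution."""
    return sum(t_rec + t_exe for t_rec, t_exe in tasks)

example = [(2, 5), (2, 5), (2, 5)]   # hypothetical times
plan = two_slot_schedule(example)
for entry in plan:
    print(entry)
print("two-slot makespan:", plan[-1][3], "serial makespan:", serial_makespan(example))
```

In this example only the first reconfiguration remains visible; the later ones overlap with the execution in the other slot, which works as long as the reconfiguration time does not exceed the execution time of the preceding task.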
8 Lessons Learned
Partitioning as reasonable/fundamental step
Challenges
- Placement: fragmentation
- Reconfiguration overhead
- Etc.
Valuable concept
- Two phases
  - Reconfiguration phase
  - Execution phase
Derivable concepts
- Specification graph approach
  - Domain of platform-based design
- Reconfiguration Port Scheduling
- Algorithmic Skeletons
9 Layered Model for Design Methods
Area and phase
- varies
- varies
Challenging design, scheduling, etc.
Layer model/approach
- Abstraction
- Specification example
[Figure labels: Tasks, Layer 1, Layer 2, Layer n, Reconfigurable Fabric, time]
[Figure layers: Tasks, Partitioning, Dispatching, Execution Environment, FPGA]
10 Specification Graph Approach
Problem graph
- Integration of the reconfiguration phase
Architecture graph
- Also heterogeneous
Mapping
- Links tasks, communication and reconfiguration with architectural resources
- Hierarchical mapping edges
Synthesis
- Scheduling, allocation and binding
- By evolutionary computing
[Figure labels: Tasks, Problem Graph, Mapping, Architecture Graph, FPGA, CPU, Processing units]
11
- Task graph G
- Add reconfiguration phases: G*
- Add communication vertices: G_P (problem graph)
[Figure labels: 1, 2, 3, T1, T2, T3, C0, C1, C2, C3, C4]
[Figure layers: Tasks, Problem Graph, Mapping, Architecture Graph, FPGA, CPU]
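The three steps on this slide (task graph G, then G* with reconfiguration phases, then G_P with communication vertices) can be sketched as a small graph transformation. This is an illustration under assumptions: one reconfiguration vertex precedes each task, and only task-to-task edges are split by a communication vertex. The exact topology of the slide's figure cannot be recovered from the transcript, so the vertex names `R_T1`, `C0` and the example edges are hypothetical.

```python
def build_problem_graph(tasks, edges):
    """Derive a problem graph from a task graph.

    tasks: list of task names; edges: list of (source, target) data edges.
    Returns (vertices, edges), where vertices maps each name to its kind.
    """
    vertices = {t: "task" for t in tasks}
    out_edges = []

    # Step 1 (G -> G*): one reconfiguration phase per task, preceding it.
    for t in tasks:
        r = f"R_{t}"
        vertices[r] = "reconfiguration"
        out_edges.append((r, t))

    # Step 2 (G* -> G_P): split every data edge with a communication vertex.
    for k, (u, v) in enumerate(edges):
        c = f"C{k}"
        vertices[c] = "communication"
        out_edges.append((u, c))
        out_edges.append((c, v))

    return vertices, out_edges

# Hypothetical task graph with three tasks.
task_names = ["T1", "T2", "T3"]
task_edges = [("T1", "T2"), ("T1", "T3"), ("T2", "T3")]
gp_vertices, gp_edges = build_problem_graph(task_names, task_edges)
print(sorted(gp_vertices.items()))
print(gp_edges)
```

Making reconfiguration and communication explicit vertices is what allows them to be bound to architectural resources (ports, buses) by mapping edges in the same way as tasks.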
12 Specification Graph Approach
Mapping of Resources
Problem and architecture graph + mapping edges = specification graph
[Figure labels: Port, Slot 1, Slot 2, Bus; T1, T3, C0, C1, C2, C3, C4; annotations in ms: 3, 3, 12, 10, 1, 15]
13 Specification Graph Approach
Extensions
- Multiple devices
- Multiple reconfiguration ports
- Platform-based design
- All within the domain of synthesis
[Figure labels: G_P, G_A1, G_A2, FPGA1, FPGA2, Port, Gigabit, Slot A, Slot B, Slot C, T1, T2, T3, C1, C2, C3, C4, time]
14 Reconfiguration Port Scheduling
Partially run-time reconfigurable FPGAs for real-time processing
- Task set executed on FPGA
Area assignment?
- Prevent fragmentation
- Offer communication
Scheduling?
- Execution time of tasks
- Reconfiguration process
  - Overhead: time + single port
- At the pace of the environment
Reconfiguration port scheduling
[Figure labels: Slot A, Slot B, Slot C, Bus, FPGA]
[Figure layers: Task Set, Reconfiguration Port Scheduling, Execution Environment, FPGA]
15 Reconfiguration Port Scheduling
Slotted execution environment
- Set of n partial bitstreams
- Single reconfiguration port
Real-time processing on slotted FPGA architecture
- Guarantee meeting of deadlines
- Constant reconfiguration phase
- Deadline d*
- One reconfiguration port: mono-processor scheduling algorithms
[Figure labels: Slot A, Slot B, Slot C, Bus, FPGA, d*, d]
[Figure layers: Task Sets, OS Real-Time Scheduling, Execution Environment, FPGA]
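Treating the single reconfiguration port as a mono-processor can be sketched with an earliest-deadline-first dispatcher. The slide does not define d*; the sketch assumes d* = d minus the execution time, i.e. the latest moment at which the reconfiguration may finish so that the execution still meets d. That definition, the non-preemptive port, and all names (`schedule_port`, `jobs`, the numbers) are assumptions made for illustration.

```python
def schedule_port(jobs, t_rec, n_slots, use_d_star=True):
    """Non-preemptive EDF on a single reconfiguration port.

    jobs: list of dicts with 'name', 'arrival', 'exec', 'deadline'.
    t_rec: constant reconfiguration time; n_slots: number of slots.
    Returns (name, reconfig_start, exec_end, deadline_met) per job.
    """
    def key(job):
        # ASSUMPTION: d* = d - execution time (not stated on the slide).
        return job["deadline"] - job["exec"] if use_d_star else job["deadline"]

    pending = list(jobs)
    slot_free = [0] * n_slots
    port_free = 0
    result = []
    while pending:
        slot = min(range(n_slots), key=lambda i: slot_free[i])
        now = max(port_free, slot_free[slot])
        ready = [j for j in pending if j["arrival"] <= now]
        if not ready:
            # Idle until the next job arrives.
            port_free = min(j["arrival"] for j in pending)
            continue
        job = min(ready, key=key)
        pending.remove(job)
        finish = now + t_rec + job["exec"]
        port_free = now + t_rec        # port is busy only while reconfiguring
        slot_free[slot] = finish       # slot is busy until execution ends
        result.append((job["name"], now, finish, finish <= job["deadline"]))
    return result

# Hypothetical aperiodic jobs, all released at time 0.
jobs = [
    {"name": "J1", "arrival": 0, "exec": 1, "deadline": 8},
    {"name": "J2", "arrival": 0, "exec": 6, "deadline": 9},
]
with_d_star = schedule_port(jobs, t_rec=2, n_slots=2, use_d_star=True)
with_d = schedule_port(jobs, t_rec=2, n_slots=2, use_d_star=False)
print(with_d_star)
print(with_d)
```

In this hand-picked instance, ordering by d loads the short job first and the long job then misses its deadline, while ordering by d* meets both deadlines. It is a single example and does not reproduce any measured results.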
16 Performance of d*
For aperiodic task sets
- d* outperforms d
- Of 100 feasible task sets, d* finds approx. 90 and d finds approx. 70
Performance depends on
- ratio to t = l
- # of slots
[Figure labels: d, d*, R, T, t]
17 Fixed Priority Example
Periodic task set scheduling
- Static priorities
- Preemption
Characteristics
- Deadlines (D*) shorter than periods
Apply deadline monotonic scheduling (DM)
18 Fixed Priority Scheduling
Schedulability Analysis
Parameters
Response time analysis
- DM with D*:
- Schedulable if: $\forall \tau_i : R_i < D^*_i$
- Calculate $R_i$ by $R_i = t_{\cdot,i} + \sum_{j=1}^{i-1} \left\lceil \frac{R_i}{T_j} \right\rceil t_{\cdot,j}$
- Critical instance: all tasks are released simultaneously
Sufficient and necessary for DM with D
- Challenging abnormalities for DM with D*
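The response time recurrence is usually solved by fixed-point iteration, starting from the task's own cost. The sketch below assumes integer times, tasks indexed by decreasing priority, and a generic `cost` per task standing in for the slide's t term; the task set is hypothetical. It implements only the classic analysis and does not address the abnormalities mentioned for DM with D*.

```python
def ceil_div(a, b):
    """Integer ceiling of a / b for positive integers."""
    return -(-a // b)

def response_time(i, cost, period, limit):
    """Worst-case response time of task i; tasks 0..i-1 have higher priority.
    Returns None as soon as the iteration exceeds `limit`."""
    r = cost[i]
    while True:
        nxt = cost[i] + sum(ceil_div(r, period[j]) * cost[j] for j in range(i))
        if nxt > limit:
            return None
        if nxt == r:          # fixed point reached
            return r
        r = nxt

# Hypothetical periodic task set with deadlines D* shorter than the periods.
tasks = [
    {"cost": 3, "period": 12, "d_star": 11},
    {"cost": 1, "period": 4, "d_star": 3},
    {"cost": 2, "period": 6, "d_star": 5},
]

# Deadline monotonic: the shorter the relative deadline, the higher the priority.
ordered = sorted(tasks, key=lambda t: t["d_star"])
cost = [t["cost"] for t in ordered]
period = [t["period"] for t in ordered]

times = [response_time(i, cost, period, ordered[i]["d_star"])
         for i in range(len(ordered))]
schedulable = all(r is not None and r < t["d_star"]
                  for r, t in zip(times, ordered))
print(times, schedulable)
```

The iteration terminates because the candidate response time never decreases and is cut off at the deadline.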
19 Algorithmic Skeletons
Motivation
Tasks
- Programmability and portability
- Structure and behavior of the tasks
  - Application level
Use of algorithmic skeletons
- Wrapping of tasks
- Programming templates
Runtime environment
- Partial reconfiguration capabilities
- Dispatcher
[Figure layers: Applications, Tasks, Algorithmic Skeletons, Skeleton Dispatching, Runtime Environment, FPGA]
20 Algorithmic Skeletons
Background
Invented for the parallel processing domain
- First discussed by Murray Cole in the mid-1980s
Objectives
- Separate the structure of a computation from the computation itself
- Free the programmer from the implementation details of the structure
- Implementation guideline for activities and their interactions
Related: design patterns
- Differences
  - Design level
  - Final implementation left to the freedom of the designer
Algorithmic skeletons force the applications to be well-formed
- Enable extraction of valuable information
- Design space exploration on a high level of abstraction
- Static and dynamic optimization of implementations
21 Dynamic Reconfiguration
Multi-threading on an FPGA
- Hosting of more than one task
- Share processing resource
Challenges
- Dispatch newly arriving tasks during run-time
  - Tasks are not known at design time
  - Architecture to facilitate dispatching must exist
- Area assignment
- Prevent fragmentation
- Communication assignment
Algorithmic Skeletons
[Figure layers: Applications, Tasks, Algorithmic Skeletons, Skeleton Dispatching, Runtime Environment, FPGA]
22 Dynamic Reconfiguration
Example
Combination of Pipeline and Farm Skeleton
- Slotted architecture
- Bus and direct communication
[Figure labels: Pipeline: in, W1, W2, Wp; Farm: in, E, W1, W2, Wp, C, out; FPGA: W1, W3, W2, W1, CE]
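The separation of structure from computation, and the combination of a pipeline with a farm, can be sketched in software with higher-order functions. This is an analogy only: threads stand in for workers that would occupy FPGA slots, and neither reconfiguration nor the bus is modelled. `pipeline`, `farm` and the two stage functions are hypothetical names.

```python
from concurrent.futures import ThreadPoolExecutor

def pipeline(*stages):
    """Pipeline skeleton: pass one item through W1, W2, ..., Wp in order."""
    def run(item):
        for stage in stages:
            item = stage(item)
        return item
    return run

def farm(worker, n_workers):
    """Farm skeleton: an emitter hands items to identical workers and a
    collector gathers the results in input order."""
    def run(items):
        with ThreadPoolExecutor(max_workers=n_workers) as pool:
            return list(pool.map(worker, items))
    return run

# The computation itself: two hypothetical stage functions.
def scale(x):
    return x * 2

def offset(x):
    return x + 1

# The structure: a farm whose workers are two-stage pipelines.
combined = farm(pipeline(scale, offset), n_workers=3)
results = combined(range(5))
print(results)
```

Because the skeleton fixes the structure, a dispatcher can read off how many workers exist and how they communicate without inspecting the stage functions, which is the kind of information a runtime environment could use when assigning slots.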
23 Cooperation
PadErOl (Erlangen [Prof. Teich] and Oldenburg [Prof. Nebel])
- Integrated design flow, see it journal
Braunschweig (Prof. Fekete)
- Reconfiguration phase scheduling / single server scheduling
- Comparison of methods
Erlangen (Prof. Teich)
- Bus-based architecture for reconfiguration port scheduling
Paderborn (Prof. Platzner)
- Local cooperation
- Information exchange
Erlangen Slot Machine
24 Publications 2007
- Dittmann, Florian; Frank, Stefan: Hard Real-Time Reconfiguration Port Scheduling. In: Proceedings of the Design, Automation and Test in Europe, Nice, France, 2007
- Dittmann, Florian; Götz, Marcelo; Rettberg, Achim: Model and Methodology for the Synthesis of Heterogeneous and Partially Reconfigurable Systems. In: Proceedings of the Reconfigurable Architecture Workshop, Long Beach, CA, USA, 2007
- Dittmann, Florian: Algorithmic Skeletons for the Programming of Reconfigurable Systems. In: Proceedings of the SEUS 2007, Santorin, Greece, May 2007
- Dittmann, Florian; Rammig, Franz Josef; Streubühr, Martin; Haubelt, C.; Schallenberg, Andreas; Nebel, Wolfgang: Exploration, Partitioning and Simulation of Reconfigurable Systems. it - Information Technology (formerly it+ti), 3(7), 1 Jan
- Dittmann, Florian; Rettberg, Achim; Weber, Raphael: Optimization Techniques for a Reconfigurable Self-Timed and Bit-Serial Architecture. In: Proceedings of the SBCCI 2007, Rio de Janeiro, Brazil, Sep
- Dittmann, Florian; Frank, S.: Caching in Real-time Reconfiguration Port Scheduling. In: Proceedings of the FPL 2007, Amsterdam, The Netherlands, Aug
- Dittmann, Florian; Rettberg, Achim; Weber, Raphael: Latency Optimization for a Reconfigurable, Self-Timed and Bit-Serial Architecture. In: Proceedings of the ERSA 2007, Las Vegas, USA, Jun
- Dittmann, Florian; Heimfarth, Tales: Clock Frequency Variation of Partially Reconfigurable Systems. In: Proceedings of the 19th International Conference on Architecture of Computing Systems: Workshop Proceedings, Frankfurt, Germany, 1 Jan
- Warkentin, Alexander; Dittmann, Florian: Data Transfer Protocols for a Two Slot Based Reconfigurable Platform. In: Proceedings of the Reconfigurable Communication-centric SoCs (ReCoSoC), Montpellier, France, 2006
- Götz, Marcelo; Dittmann, Florian: Reconfigurable Microkernel-based OS: Mechanisms and Methods for Run-Time Reconfiguration. In: Proceedings of the 3rd International Conference on ReConFigurable Computing and FPGAs 2006 (ReConFig'06)
- Dittmann, Florian; Götz, M.: Applying Single Processor Algorithms to Schedule Tasks on Reconfigurable Devices Respecting Reconfiguration Times. In: Proceedings of the 13th Reconfigurable Architectures Workshop (RAW 2006), Rhodes Island, Greece, 2006
- Dittmann, Florian; Rettberg, Achim: Design of Partially Reconfigurable Systems: From Abstract Modeling to Practical Realization. In: Proceedings of the 1st International Workshop on Reconfigurable Computing Education, Karlsruhe, Germany, 1 Jan
- Götz, Marcelo; Dittmann, Florian; Pereira, Carlos E.: Deterministic Mechanism for Run-Time Reconfiguration Activities in an OS. In: Proceedings of the 4th International IEEE Conference on Industrial Informatics (INDIN 2006), Singapore, 2006
- Götz, Marcelo; Dittmann, Florian: Scheduling Reconfiguration Activities of Run-time Reconfigurable OS Using an Aperiodic Task Server. In: Proceedings of the ARC 2006, Delft, The Netherlands, Mar
- Dittmann, Florian; Rettberg, Achim; Weber, Raphael: Towards the Implementation of Path Concepts for a Reconfigurable Bit-Serial Synchronous Architecture. In: Proceedings of the 3rd International Conference on ReConFigurable Computing and FPGA's, San Luis Potosi, Mexico, 2006
- Dittmann, Florian; Götz, Marcelo: Reconfiguration Time Aware Processing on FPGAs. In: Proceedings of the Dagstuhl Seminar on Dynamically Reconfigurable Architectures, Dagstuhl, Germany
25 Part-E
Eclipse-based development environment for partial bitstream generation
- Open source: parte.sf.net
- Tutorial available
Coffee break
- Bitstreams in 2 min
26 Thank you for your attention.
Florian Dittmann, Franz Rammig
Heinz Nixdorf Institute, University of Paderborn
Fuerstenallee, Paderborn, Germany
Phone: +49 (0) 52 51/
Fax: +49 (0) 52 51/
Thanks to E. Weber, S. Frank, A. Warkentin
More informationImpact of Runtime Architectures on Control System Stability
Impact of Runtime Architectures on Control System Stability P. Feiler, J. Hansson Software Engineering Institute, Pittsburgh, PA Abstract: Control systems are sensitive to the endto-end latency and age
More informationFCUDA-SoC: Platform Integration for Field-Programmable SoC with the CUDAto-FPGA
1 FCUDA-SoC: Platform Integration for Field-Programmable SoC with the CUDAto-FPGA Compiler Tan Nguyen 1, Swathi Gurumani 1, Kyle Rupnow 1, Deming Chen 2 1 Advanced Digital Sciences Center, Singapore {tan.nguyen,
More informationResource Reservation & Resource Servers
Resource Reservation & Resource Servers Resource Reservation Application Hard real-time, Soft real-time, Others? Platform Hardware Resources: CPU cycles, memory blocks 1 Applications Hard-deadline tasks
More informationA Virtualized Quality of Service Packet Scheduler Accelerator. Kangtao Kendall Chuang
A Virtualized Quality of Service Packet Scheduler Accelerator A Thesis Presented to The Academic Faculty by Kangtao Kendall Chuang In Partial Fulfillment of the Requirements for the Degree Master of Science
More informationA Single-Path Chip-Multiprocessor System
A Single-Path Chip-Multiprocessor System Martin Schoeberl, Peter Puschner, and Raimund Kirner Institute of Computer Engineering Vienna University of Technology, Austria mschoebe@mail.tuwien.ac.at, {peter,raimund}@vmars.tuwien.ac.at
More informationMultithreaded Processors. Department of Electrical Engineering Stanford University
Lecture 12: Multithreaded Processors Department of Electrical Engineering Stanford University http://eeclass.stanford.edu/ee382a Lecture 12-1 The Big Picture Previous lectures: Core design for single-thread
More informationPerformance Comparison between Linear RVE and Linear Systolic Array Implementations of the Smith-Waterman Algorithm
Performance Comparison between Linear RVE and Linear Systolic Array Implementations of the Smith-Waterman Algorithm Laiq Hasan Zaid Al-Ars Delft University of Technology Computer Engineering Laboratory
More informationReal-Time Mixed-Criticality Wormhole Networks
eal-time Mixed-Criticality Wormhole Networks Leandro Soares Indrusiak eal-time Systems Group Department of Computer Science University of York United Kingdom eal-time Systems Group 1 Outline Wormhole Networks
More informationSimplified design flow for embedded systems
Simplified design flow for embedded systems 2005/12/02-1- Reuse of standard software components Knowledge from previous designs to be made available in the form of intellectual property (IP, for SW & HW).
More informationCENTRUM INDUSTRIAL IT - Where IT meets Automation -
CENTRUM INDUSTRIAL IT - Where IT meets Automation - M.Sc. Jahanzaib Imtiaz (Institut Industrial IT) Prof. Dr.-Ing. Jürgen Jasperneite (Fraunhofer IOSB-INA) Institut Industrial IT/Fraunhofer IOSB-INA Family!
More informationSoftware Pipelining for Coarse-Grained Reconfigurable Instruction Set Processors
Software Pipelining for Coarse-Grained Reconfigurable Instruction Set Processors Francisco Barat, Murali Jayapala, Pieter Op de Beeck and Geert Deconinck K.U.Leuven, Belgium. {f-barat, j4murali}@ieee.org,
More informationFrequently asked questions from the previous class survey
CS 370: OPERATING SYSTEMS [CPU SCHEDULING] Shrideep Pallickara Computer Science Colorado State University L15.1 Frequently asked questions from the previous class survey Could we record burst times in
More informationThanks to... Composing and synchronizing real-time components through virtual platforms in vehicular systems
Thanks to... Composing and synchronizing real-time components through virtual platforms in vehicular systems Promotor and co-promotor: Prof. Johan J. Lukkien Dr. Reinder J. Bril Martijn van den Heuvel
More informationMultimedia Systems 2011/2012
Multimedia Systems 2011/2012 System Architecture Prof. Dr. Paul Müller University of Kaiserslautern Department of Computer Science Integrated Communication Systems ICSY http://www.icsy.de Sitemap 2 Hardware
More informationHigh Performance Interconnect and NoC Router Design
High Performance Interconnect and NoC Router Design Brinda M M.E Student, Dept. of ECE (VLSI Design) K.Ramakrishnan College of Technology Samayapuram, Trichy 621 112 brinda18th@gmail.com Devipoonguzhali
More informationPerformance of Multihop Communications Using Logical Topologies on Optical Torus Networks
Performance of Multihop Communications Using Logical Topologies on Optical Torus Networks X. Yuan, R. Melhem and R. Gupta Department of Computer Science University of Pittsburgh Pittsburgh, PA 156 fxyuan,
More informationUML for SOC Design GRANT MARTIN WOLFGANG MÜLLER. Edited by. Tensilica Inc., Santa Clara, CA, USA. and. University of Paderborn, Germany
UML FOR SOC DESIGN UML for SOC Design Edited by GRANT MARTIN Tensilica Inc., Santa Clara, CA, USA and WOLFGANG MÜLLER University of Paderborn, Germany A C.I.P. Catalogue record for this book is available
More informationCONFIGURABLE HYBRIDKERNEL FOR EMBEDDED REAL-TIME SYSTEMS
CONFIGURABLE HYBRIDKERNEL FOR EMBEDDED REAL-TIME SYSTEMS Heinz Nixdorf Institute, University Paderborn Fürstenallee 11, 33102 Paderborn, Germany timo.kerstan@uni-paderborn.de, zottel@uni-paderborn.de Abstract:
More informationAbstract A SCALABLE, PARALLEL, AND RECONFIGURABLE DATAPATH ARCHITECTURE
A SCALABLE, PARALLEL, AND RECONFIGURABLE DATAPATH ARCHITECTURE Reiner W. Hartenstein, Rainer Kress, Helmut Reinig University of Kaiserslautern Erwin-Schrödinger-Straße, D-67663 Kaiserslautern, Germany
More informationA Dynamic NOC Arbitration Technique using Combination of VCT and XY Routing
727 A Dynamic NOC Arbitration Technique using Combination of VCT and XY Routing 1 Bharati B. Sayankar, 2 Pankaj Agrawal 1 Electronics Department, Rashtrasant Tukdoji Maharaj Nagpur University, G.H. Raisoni
More informationMapping of Applications to Multi-Processor Systems
Mapping of Applications to Multi-Processor Systems Peter Marwedel TU Dortmund, Informatik 12 Germany Marwedel, 2003 Graphics: Alexandra Nolte, Gesine 2011 年 12 月 09 日 These slides use Microsoft clip arts.
More informationFinal Lecture. A few minutes to wrap up and add some perspective
Final Lecture A few minutes to wrap up and add some perspective 1 2 Instant replay The quarter was split into roughly three parts and a coda. The 1st part covered instruction set architectures the connection
More informationMemory centric thread synchronization on platform FPGAs
Memory centric thread synchronization on platform FPGAs Chidamber Kulkarni Xilinx Inc San Jose, Ca Chidamber.Kulkarni@xilinx.com Gordon Brebner Xilinx Inc San Jose, Ca Gordon.Brebner@xilinx.com Abstract
More informationOnline Hardware Task Scheduling and Placement Algorithm on Partially Reconfigurable Devices
Online Hardware Task Scheduling and Placement Algorithm on Partially Reconfigurable Devices Thomas Marconi, Yi Lu, Koen Bertels, and Georgi Gaydadjiev Computer Engineering Laboratory, EEMCS TU Delft, The
More informationSynthesizable FPGA Fabrics Targetable by the VTR CAD Tool
Synthesizable FPGA Fabrics Targetable by the VTR CAD Tool Jin Hee Kim and Jason Anderson FPL 2015 London, UK September 3, 2015 2 Motivation for Synthesizable FPGA Trend towards ASIC design flow Design
More informationResource Efficiency of Scalable Processor Architectures for SDR-based Applications
Resource Efficiency of Scalable Processor Architectures for SDR-based Applications Thorsten Jungeblut 1, Johannes Ax 2, Gregor Sievers 2, Boris Hübener 2, Mario Porrmann 2, Ulrich Rückert 1 1 Cognitive
More information