Cover Page. The following handle holds various files of this Leiden University dissertation:

Size: px
Start display at page:

Download "Cover Page. The following handle holds various files of this Leiden University dissertation:"

Transcription

1 Cover Page The following handle holds various files of this Leiden University dissertation: Author: Spasic, J. Title: Improved hard real-time scheduling and transformations for embedded Streaming Applications Issue Date:

2 Improved Hard Real-Time Scheduling and Transformations for Embedded Streaming Applications Jelena Spasić

3

4 Improved Hard Real-Time Scheduling and Transformations for Embedded Streaming Applications PROEFSCHRIFT ter verkrijging van de graad van Doctor aan de Universiteit Leiden, op gezag van Rector Magnificus Prof.mr. C.J.J.M. Stolker, volgens besluit van het College voor Promoties te verdedigen op dinsdag 14 november 2017 klokke 13:45 uur door Jelena Spasić geboren te Trgovište, Servië in 1984

5 Promotor: Prof. dr. Joost N. Kok Universiteit Leiden Co-Promotor: Dr. Todor P. Stefanov Universiteit Leiden Promotion Committee: Prof. dr. Alix Munier Kordon Université de Paris - LIP6 Prof. dr. Petru Eles Linköpings Universitet Dr. Andy Pimentel Universiteit van Amsterdam Prof. dr. Aske Plaat Universiteit Leiden Prof. dr. Jaap van den Herik Universiteit Leiden Prof. dr. Harry Wijshoff Universiteit Leiden Improved Hard Real-Time Scheduling and Transformations for Embedded Streaming Applications Jelena Spasić. - Dissertation Universiteit Leiden. - With ref. - With summary in Dutch. Copyright c 2017 by Jelena Spasić. All rights reserved. Cover designed by Miloš Ačanski. This dissertation was typeset using L A TEX. ISBN Printed by Ridderprint, Ridderkerk, The Netherlands.

6 Mojoj porodici To my family

7

8 Contents Table of Contents List of Figures List of Tables List of Abbreviations vii xi xiii xv 1 Introduction Trends in the Design of Embedded Streaming Systems Platform Trend: Multi-Processor System-on-Chip (MPSoC) Design Trend: Model-based Design Methodology Design Requirements and Basic Approaches to Meet the Requirements Timing Requirements Energy Requirements Problem Statement Problem Problem Problem Problem Research Contributions Thesis Outline Background Dataflow Models-of-Computations (MoCs) Cyclo-Static Dataflow (CSDF) Polyhedral Process Network (PPN) Real-Time Scheduling Theory Task Model

9 viii Contents System Model Real-Time Scheduling Algorithms Uniprocessor Schedulability Analysis Multiprocessor Schedulability Analysis Hard Real-Time Scheduling Framework Problem Statement Contributions Related Work Motivational Example Improved Hard Real-Time Scheduling of CSDF Deriving Periods of Tasks Deriving the Earliest Start Time of Actor s First Phase Deriving Channel Buffer Sizes Hard Real-Time Schedulability Performance Analysis Deriving the Number of Processors Evaluation Performance of the ISPS Approach Time Complexity of the ISPS Approach Reducing Latency under ISPS Discussion Exploiting Parallelism in Hard Real-Time Systems to Maximize Performance Problem Statement Contributions Related Work Motivational Example New Unfolding Transformation for SDF Graphs The Algorithm for Finding Proper Unfolding Factors Evaluation Efficiency of the Proposed Unfolding Transformation Performance of Algorithm Time Complexity of Algorithm Discussion Exploiting Parallelism in Hard Real-Time Systems to Minimize Energy Problem Statement

10 Contents ix 5.2 Contributions Related Work Motivational Example System Model Energy Model The Proposed Energy Minimization Approach The Data-Parallel Energy Minimization Algorithm Task Classification for Energy Minimization Task Mapping for Energy Minimization Evaluation Comparison with [CKR14], [LSCS15], [SDK13] on Heterogeneous MPSoCs Comparison with [Lee09] on Heterogeneous MPSoCs Comparison on Homogeneous MPSoC Overhead and Time Complexity Analysis Discussion An Accurate Energy Modeling of Streaming Systems Problem Statement Contributions Related Work System Model Application Model Platform Model Application-to-Platform Mapping Energy Model Model Formulation Derivation of Model Parameters Evaluation of the Energy Model Discussion Summary and Conclusions 133 Bibliography 137 Samenvatting 153 List of Publications 156 Curriculum Vitae 159

11 x Contents Acknowledgments 161

12 List of Figures 1.1 An MPSoC platform example Motion JPEG encoder application A CSDF graph G Example of a PPN (a) and the structure of process P3 (b) (a) The SPS and (b) ISPS of graph G in Figure The periodic schedule σ for the CSDF graph G shown in Figure Production and consumption curves on edge e u = (v i, v j ) An SDF graph G Equivalent graphs of the SDF graph in Figure 4.1 by unfolding actor v 2 by factor 2 and v 3 by factor Unfolding channel e 2 from the graph in Figure 4.1 by using Algorithm 3 when f = [1, 2, 3, 1, 1] Comparison of our unfolding transformation to the approaches in [KM08], [FKBS11], [SLA12], [ZBS13] Results of performance evaluation of our proposed approach in comparison to the approach in [ZBS13] Results of time evaluation of our proposed approach in comparison to the approach in [ZBS13] An SDF graph G A CSDF graph G obtained by unfolding SDF graph G in Figure 5.1 with f = [1, 2, 2, 1] Comparison of our proposed DPEM approach with related approaches on heterogeneous MPSoCs Comparison between DPEM and WYL on heterogeneous MP- SoCs Comparison on homogeneous MPSoC

13 xii List of Figures 6.1 The read primitive implemented in software (a) and hardware (b) The architecture template of MPSoC platforms

14 List of Tables 2.1 Summary of mathematical notations Throughput, latency and number of processors for G under different scheduling schemes Benchmarks used for evaluation Comparison of different scheduling approaches Time complexity (in seconds) of different scheduling approaches Time complexity (in seconds) for the calculation of number of processors Performance of the ISPS approach under different latency constraints Results for G transformed by different transformation approaches Results for G transformed and mapped on 2 processors by different approaches Benchmarks used for evaluation Different MPSoC designs for G in Figure Benchmarks used for evaluation Accuracy of the energy model for CB, ShB and P2P MPSoC platforms Accuracy of the energy estimation when contention is not considered in the model

15

16 List of Abbreviations ADF BF BFD CDP CPU CSDF DCT DLP DM DPEM DSE EDF EE ESL FF FFD FFID FIFO Affine Dataflow Best-Fit Best-Fit Decreasing Constrained-Deadline Periodic Central Processing Unit Cyclo-Static Dataflow Discrete Cosine Transform Data-Level Parallelism Deadline Monotonic Data Parallel Energy Minimization Design Space Exploration Earliest Deadline First Energy Efficient Electronic System-Level First-Fit First-Fit Decreasing First-Fit Increasing Deadlines First-In First-Out xv

17 xvi List of Tables FPGA GPU HSDF ICP IDP ILP ISA ISPS ISS ITRS KPN LLF LP LTE MIDCP MJPEG MoC MPSoC NoC NP PE PLP PM PPN Field-Programmable Gate Array Graphics Processing Unit Homogeneous SDF Integer Convex Programming Implicit-Deadline Periodic Integer Linear Programming Instruction-Set Architecture Improved Strictly Periodic Scheduling Instruction Set Simulators International Technology Roadmap for Semiconductors Kahn Process Network Least Laxity First Linear Programming Long-Term Evolution Mixed Integer Disciplined Convex Programming Motion JPEG Model of Computation Multi-Processor System-on-Chip Network-on-Chip Non-deterministic Polynomial-time Performance Efficient Pipeline-Level Parallelism Power Management Polyhedral Process Network

18 List of Tables xvii PS RM RSD RTA RTL SDF SPS STS TLP VFS VLE WCET WF WFD Periodic Scheduling Rate Monotonic Reed Solomon Decoder Response Time Analysis Register-Transfer-Level Synchronous Data Flow Strictly Periodic Scheduling Self-timed Scheduling Task-Level Parallelism Voltage-Frequency Scaling Variable Length Encoder Worst-Case Execution Time Worst-Fit Worst-Fit Decreasing

19

Cover Page. The handle holds various files of this Leiden University dissertation

Cover Page. The handle   holds various files of this Leiden University dissertation Cover Page The handle http://hdl.handle.net/1887/32963 holds various files of this Leiden University dissertation Author: Zhai, Jiali Teddy Title: Adaptive streaming applications : analysis and implementation

More information

Cover Page. The handle holds various files of this Leiden University dissertation

Cover Page. The handle  holds various files of this Leiden University dissertation Cover Page The handle http://hdl.handle.net/1887/32963 holds various files of this Leiden University dissertation Author: Zhai, Jiali Teddy Title: Adaptive streaming applications : analysis and implementation

More information

Extensions of Daedalus Todor Stefanov

Extensions of Daedalus Todor Stefanov Extensions of Daedalus Todor Stefanov Leiden Embedded Research Center, Leiden Institute of Advanced Computer Science Leiden University, The Netherlands Overview of Extensions in Daedalus DSE limited to

More information

Synthesis of a Parallel Data Stream Processor from Data Flow Process Networks

Synthesis of a Parallel Data Stream Processor from Data Flow Process Networks Synthesis of a Parallel Data Stream Processor from Data Flow Process Networks Claudiu Zissulescu-Ianculescu Synthesis of a Parallel Data Stream Processor from Data Flow Process Networks Proefschrift ter

More information

Predictable Mapping of Streaming Applications on Multiprocessors

Predictable Mapping of Streaming Applications on Multiprocessors Predictable Mapping of Streaming Applications on Multiprocessors PROEFSCHRIFT ter verkrijging van de graad van doctor aan de Technische Universiteit Eindhoven, op gezag van de Rector Magnificus prof.dr.ir.

More information

UvA-DARE (Digital Academic Repository) Should I stay or should I go? Revynthi, A.M. Link to publication

UvA-DARE (Digital Academic Repository) Should I stay or should I go? Revynthi, A.M. Link to publication UvA-DARE (Digital Academic Repository) Should I stay or should I go? Revynthi, A.M. Link to publication Citation for published version (APA): Revynthi, A. M. (2017). Should I stay or should I go? The role

More information

Applications of scenarios in early embedded system design space exploration van Stralen, P.

Applications of scenarios in early embedded system design space exploration van Stralen, P. UvA-DARE (Digital Academic Repository) Applications of scenarios in early embedded system design space exploration van Stralen, P. Link to publication Citation for published version (APA): van Stralen,

More information

Computers as Components Principles of Embedded Computing System Design

Computers as Components Principles of Embedded Computing System Design Computers as Components Principles of Embedded Computing System Design Third Edition Marilyn Wolf ELSEVIER AMSTERDAM BOSTON HEIDELBERG LONDON NEW YORK OXFORD PARIS SAN DIEGO SAN FRANCISCO SINGAPORE SYDNEY

More information

Citation for published version (APA): Kleefsman, K. M. T. (2005). Water impact loading on offshore structures. - A numerical study s.n.

Citation for published version (APA): Kleefsman, K. M. T. (2005). Water impact loading on offshore structures. - A numerical study s.n. University of Groningen Water impact loading on offshore structures. - A numerical study Kleefsman, Kornelia Marchien Theresa IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's

More information

Composable and Predictable Power Management

Composable and Predictable Power Management Composable and Predictable Power Management Composable and Predictable Power Management PROEFSCHRIFT ter verkrijging van de graad van doctor aan de Technische Universiteit Delft, op gezag van de Rector

More information

Modelling, Analysis and Scheduling with Dataflow Models

Modelling, Analysis and Scheduling with Dataflow Models technische universiteit eindhoven Modelling, Analysis and Scheduling with Dataflow Models Marc Geilen, Bart Theelen, Twan Basten, Sander Stuijk, AmirHossein Ghamarian, Jeroen Voeten Eindhoven University

More information

A Methodology for Automated Design of Hard-Real-Time Embedded Streaming Systems

A Methodology for Automated Design of Hard-Real-Time Embedded Streaming Systems A Methodology for Automated Design of Hard-Real-Time Embedded Streaming Systems Mohamed A. Bamakhrama, Jiali Teddy Zhai, Hristo Nikolov and Todor Stefanov Leiden Institute of Advanced Computer Science

More information

Cover Page. The handle holds various files of this Leiden University dissertation

Cover Page. The handle   holds various files of this Leiden University dissertation Cover Page The handle http://hdl.handle.net/1887/43469 holds various files of this Leiden University dissertation Author: Cannella, Emanuele Title: Semi-partitioned scheduling and task migration in dataflow

More information

Universiteit van Amsterdam 1

Universiteit van Amsterdam 1 Universiteit van Amsterdam 1 Embedded systems are ubiquitous! Andy D. Pimentel IPA Lentedagen, 22 April, 2010 2 The design of modern embedded systems becomes increasingly complex Need to support multiple

More information

Cover Page. The handle holds various files of this Leiden University dissertation

Cover Page. The handle   holds various files of this Leiden University dissertation Cover Page The handle http://hdl.handle.net/1887/32963 holds various files of this Leiden University dissertation Author: Zhai, Jiali Teddy Title: Adaptive streaming applications : analysis and implementation

More information

QoS framework for video streaming in home networks. Dmitri Jarnikov

QoS framework for video streaming in home networks. Dmitri Jarnikov QoS framework for video streaming in home networks Dmitri Jarnikov QoS framework for video streaming in home networks Printed by: Eindhoven University Press, The Netherlands. Cover design: Inga Douhaya.

More information

Dreams. A distributed framework for synchronous coordination. July 22, 2010

Dreams. A distributed framework for synchronous coordination. July 22, 2010 Dreams A distributed framework for synchronous coordination July 22, 2010 José Proença Dreams A distributed framework for synchronous coordination July 22, 2010 ILLC Dissertation Series DS-200X-NN (almost...)

More information

EE382V: System-on-a-Chip (SoC) Design

EE382V: System-on-a-Chip (SoC) Design EE382V: System-on-a-Chip (SoC) Design Lecture 8 HW/SW Co-Design Sources: Prof. Margarida Jacome, UT Austin Andreas Gerstlauer Electrical and Computer Engineering University of Texas at Austin gerstl@ece.utexas.edu

More information

Co-synthesis and Accelerator based Embedded System Design

Co-synthesis and Accelerator based Embedded System Design Co-synthesis and Accelerator based Embedded System Design COE838: Embedded Computer System http://www.ee.ryerson.ca/~courses/coe838/ Dr. Gul N. Khan http://www.ee.ryerson.ca/~gnkhan Electrical and Computer

More information

Reliable Embedded Multimedia Systems?

Reliable Embedded Multimedia Systems? 2 Overview Reliable Embedded Multimedia Systems? Twan Basten Joint work with Marc Geilen, AmirHossein Ghamarian, Hamid Shojaei, Sander Stuijk, Bart Theelen, and others Embedded Multi-media Analysis of

More information

Computer Engineering Mekelweg 4, 2628 CD Delft The Netherlands MSc THESIS

Computer Engineering Mekelweg 4, 2628 CD Delft The Netherlands   MSc THESIS Computer Engineering Mekelweg 4, 2628 CD Delft The Netherlands http://ce.et.tudelft.nl/ 2012 MSc THESIS Dynamic loading and task migration for streaming applications on a composable system-on-chip Adriaan

More information

Energy efficient code generation for streaming applications She, D.

Energy efficient code generation for streaming applications She, D. Energy efficient code generation for streaming applications She, D. DOI: 10.6100/IR782261 Published: 01/01/2014 Document Version Publisher s PDF, also known as Version of Record (includes final page, issue

More information

Pavel G. Zaykov. Multithreading for Embedded Reconfigurable Multicore Systems

Pavel G. Zaykov. Multithreading for Embedded Reconfigurable Multicore Systems Pavel G. Zaykov Multithreading for Embedded Reconfigurable Multicore Systems Multithreading for Embedded Reconfigurable Multicore Systems PROEFSCHRIFT ter verkrijging van de graad van doctor aan de Technische

More information

MULTI-PROCESSOR SYSTEM-LEVEL SYNTHESIS FOR MULTIPLE APPLICATIONS ON PLATFORM FPGA

MULTI-PROCESSOR SYSTEM-LEVEL SYNTHESIS FOR MULTIPLE APPLICATIONS ON PLATFORM FPGA MULTI-PROCESSOR SYSTEM-LEVEL SYNTHESIS FOR MULTIPLE APPLICATIONS ON PLATFORM FPGA Akash Kumar,, Shakith Fernando, Yajun Ha, Bart Mesman and Henk Corporaal Eindhoven University of Technology, Eindhoven,

More information

Transparent Restructuring of Pointer-Linked Data Structures. Harmen Laurens Anne van der Spek

Transparent Restructuring of Pointer-Linked Data Structures. Harmen Laurens Anne van der Spek Transparent Restructuring of Pointer-Linked Data Structures Harmen Laurens Anne van der Spek Transparent Restructuring of Pointer-Linked Data Structures Proefschrift ter verkrijging van de graad van Doctor

More information

Optimal Partition with Block-Level Parallelization in C-to-RTL Synthesis for Streaming Applications

Optimal Partition with Block-Level Parallelization in C-to-RTL Synthesis for Streaming Applications Optimal Partition with Block-Level Parallelization in C-to-RTL Synthesis for Streaming Applications Authors: Shuangchen Li, Yongpan Liu, X.Sharon Hu, Xinyu He, Pei Zhang, and Huazhong Yang 2013/01/23 Outline

More information

Overview of Dataflow Languages. Waheed Ahmad

Overview of Dataflow Languages. Waheed Ahmad Overview of Dataflow Languages Waheed Ahmad w.ahmad@utwente.nl The purpose of models is not to fit the data but to sharpen the questions. Samuel Karlins 11 th R.A Fisher Memorial Lecture Royal Society

More information

Low power architectures for streaming applications

Low power architectures for streaming applications Low power architectures for streaming applications He, Y. DOI: 10.6100/IR759534 Published: 01/01/2013 Document Version Publisher s PDF, also known as Version of Record (includes final page, issue and volume

More information

Predictable Multi-processor System on Chip Design for Multimedia Applications

Predictable Multi-processor System on Chip Design for Multimedia Applications Predictable Multi-processor System on Chip Design for Multimedia Applications PROEFSCHRIFT ter verkrijging van de graad van doctor aan de Technische Universiteit Eindhoven, op gezag van de rector magnificus,

More information

Mapping of Applications to Multi-Processor Systems

Mapping of Applications to Multi-Processor Systems Mapping of Applications to Multi-Processor Systems Peter Marwedel TU Dortmund, Informatik 12 Germany Marwedel, 2003 Graphics: Alexandra Nolte, Gesine 2011 年 12 月 09 日 These slides use Microsoft clip arts.

More information

Statistical Compiler Tuning

Statistical Compiler Tuning Statistical Compiler Tuning Statistical Compiler Tuning proefschrift ter verkrijging van de graad van Doctor aan de Universiteit Leiden, op gezag van de Rector Magnificus Dr. D.D. Breimer, hoogleraar

More information

HETEROGENEOUS MULTIPROCESSOR MAPPING FOR REAL-TIME STREAMING SYSTEMS

HETEROGENEOUS MULTIPROCESSOR MAPPING FOR REAL-TIME STREAMING SYSTEMS HETEROGENEOUS MULTIPROCESSOR MAPPING FOR REAL-TIME STREAMING SYSTEMS Jing Lin, Akshaya Srivasta, Prof. Andreas Gerstlauer, and Prof. Brian L. Evans Department of Electrical and Computer Engineering The

More information

System-on-Chip. 4l1 Springer. Embedded Software Design and Programming of Multiprocessor. Simulink and SystemC. Case Studies

System-on-Chip. 4l1 Springer. Embedded Software Design and Programming of Multiprocessor. Simulink and SystemC. Case Studies Katalin Popovici Frederic Rousseau Ahmed A. Jerraya Marilyn Wolf Embedded Software Design and Programming of Multiprocessor System-on-Chip Simulink and SystemC Case Studies 4l1 Springer Contents 1 Embedded

More information

Automatic Parallelization of NLPs with Non-Affine Index- Expressions. Marco Bekooij (NXP research) Tjerk Bijlsma (University of Twente)

Automatic Parallelization of NLPs with Non-Affine Index- Expressions. Marco Bekooij (NXP research) Tjerk Bijlsma (University of Twente) Automatic Parallelization of NLPs with Non-Affine Index- Expressions Marco Bekooij (NXP research) Tjerk Bijlsma (University of Twente) Outline Context: car-entertainment applications Mapping Flow Motivation

More information

A Process Model suitable for defining and programming MpSoCs

A Process Model suitable for defining and programming MpSoCs A Process Model suitable for defining and programming MpSoCs MpSoC-Workshop at Rheinfels, 29-30.6.2010 F. Mayer-Lindenberg, TU Hamburg-Harburg 1. Motivation 2. The Process Model 3. Mapping to MpSoC 4.

More information

Ozana Silvia Dragomir. K-loops: Loop Transformations for Reconfigurable Architectures

Ozana Silvia Dragomir. K-loops: Loop Transformations for Reconfigurable Architectures Ozana Silvia Dragomir K-loops: Loop Transformations for Reconfigurable Architectures K-loops: Loop Transformations for Reconfigurable Architectures PROEFSCHRIFT ter verkrijging van de graad van doctor

More information

Hardware-Software Codesign

Hardware-Software Codesign Hardware-Software Codesign 8. Performance Estimation Lothar Thiele 8-1 System Design specification system synthesis estimation -compilation intellectual prop. code instruction set HW-synthesis intellectual

More information

Flexible evolutionary algorithms for mining structured process models Buijs, J.C.A.M.

Flexible evolutionary algorithms for mining structured process models Buijs, J.C.A.M. Flexible evolutionary algorithms for mining structured process models Buijs, J.C.A.M. DOI: 10.6100/IR780920 Published: 01/01/2014 Document Version Publisher s PDF, also known as Version of Record (includes

More information

TABLE OF CONTENTS CHAPTER NO. TITLE PAGE NO.

TABLE OF CONTENTS CHAPTER NO. TITLE PAGE NO. vii TABLE OF CONTENTS CHAPTER NO. TITLE PAGE NO. ABSTRACT LIST OF TABLES LIST OF FIGURES LIST OF SYMBOLS AND ABBREVIATION iii xii xiv xvii 1 INTRODUCTION 1 1.1 GENERAL 1 1.2 TYPES OF WIRELESS COMMUNICATION

More information

Improving Concurrency and Recovery in Database Systems by Exploiting Application Semantics. Wijnand Derks

Improving Concurrency and Recovery in Database Systems by Exploiting Application Semantics. Wijnand Derks Improving Concurrency and Recovery in Database Systems by Exploiting Application Semantics Wijnand Derks Ph.D. defense committee: Prof. dr. W. Jonker, University of Twente, Enschede, Netherlands (promotor)

More information

Customizing and Hardwiring On-Chip Interconnects in FPGAs

Customizing and Hardwiring On-Chip Interconnects in FPGAs Customizing and Hardwiring On-Chip Interconnects in FPGAs Customizing and Hardwiring On-Chip Interconnects in FPGAs PROEFSCHRIFT ter verkrijging van de graad van doctor aan de Technische Universiteit Delft,

More information

Instruction-set architecture synthesis for VLIW processors Jordans, R.

Instruction-set architecture synthesis for VLIW processors Jordans, R. Instruction-set architecture synthesis for VLIW processors Jordans, R. Published: 01/12/2015 Document Version Publisher s PDF, also known as Version of Record (includes final page, issue and volume numbers)

More information

EECS4201 Computer Architecture

EECS4201 Computer Architecture Computer Architecture A Quantitative Approach, Fifth Edition Chapter 1 Fundamentals of Quantitative Design and Analysis These slides are based on the slides provided by the publisher. The slides will be

More information

Multithreaded Coprocessor Interface for Dual-Core Multimedia SoC

Multithreaded Coprocessor Interface for Dual-Core Multimedia SoC Multithreaded Coprocessor Interface for Dual-Core Multimedia SoC Student: Chih-Hung Cho Advisor: Prof. Chih-Wei Liu VLSI Signal Processing Group, DEE, NCTU 1 Outline Introduction Multithreaded Coprocessor

More information

Computer Architecture A Quantitative Approach, Fifth Edition. Chapter 1. Copyright 2012, Elsevier Inc. All rights reserved. Computer Technology

Computer Architecture A Quantitative Approach, Fifth Edition. Chapter 1. Copyright 2012, Elsevier Inc. All rights reserved. Computer Technology Computer Architecture A Quantitative Approach, Fifth Edition Chapter 1 Fundamentals of Quantitative Design and Analysis 1 Computer Technology Performance improvements: Improvements in semiconductor technology

More information

Ctadel: A Generator of Efficient Numerical Codes

Ctadel: A Generator of Efficient Numerical Codes Ctadel: A Generator of Efficient Numerical Codes Ctadel: A Generator of Efficient Numerical Codes PROEFSCHRIFT ter verkrijging van de graad van Doctor aan de Rijksuniversiteit te Leiden, op gezag van

More information

Software Synthesis Trade-offs in Dataflow Representations of DSP Applications

Software Synthesis Trade-offs in Dataflow Representations of DSP Applications in Dataflow Representations of DSP Applications Shuvra S. Bhattacharyya Department of Electrical and Computer Engineering, and Institute for Advanced Computer Studies University of Maryland, College Park

More information

Handbook of Hardware/Software Codesign Soonhoi Ha and Jürgen Teich

Handbook of Hardware/Software Codesign Soonhoi Ha and Jürgen Teich Handbook of Hardware/Software Codesign Soonhoi Ha and Jürgen Teich v Contents 9 Scenario-based Design Space Exploration... 1 Andy Pimentel and Peter van Stralen 9.1 Introduction... 9. Application Dynamism...

More information

HW/SW Cyber-System Co-Design and Modeling

HW/SW Cyber-System Co-Design and Modeling HW/SW Cyber-System Co-Design and Modeling Julio OLIVEIRA Karol DESNOS Karol Desnos (IETR) & Julio Oliveira (TNO) 1 Introduction Who are we? Julio de OLIVEIRA Position: TNO - Researcher & innovation scientist

More information

Predictable embedded multiprocessor architecture for streaming applications Moonen, A.J.M.

Predictable embedded multiprocessor architecture for streaming applications Moonen, A.J.M. Predictable embedded multiprocessor architecture for streaming applications Moonen, A.J.M. DOI: 10.6100/IR642808 Published: 01/01/2009 Document Version Publisher s PDF, also known as Version of Record

More information

MOORE S law predicts the exponential growth over time

MOORE S law predicts the exponential growth over time 542 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 27, NO. 3, MARCH 2008 Systematic and Automated Multiprocessor System Design, Programming, and Implementation Hristo

More information

Modelling and simulation of guaranteed throughput channels of a hard real-time multiprocessor system

Modelling and simulation of guaranteed throughput channels of a hard real-time multiprocessor system Modelling and simulation of guaranteed throughput channels of a hard real-time multiprocessor system A.J.M. Moonen Information and Communication Systems Department of Electrical Engineering Eindhoven University

More information

Programming Heterogeneous Embedded Systems for IoT

Programming Heterogeneous Embedded Systems for IoT Programming Heterogeneous Embedded Systems for IoT Jeronimo Castrillon Chair for Compiler Construction TU Dresden jeronimo.castrillon@tu-dresden.de Get-together toward a sustainable collaboration in IoT

More information

Tag der mündlichen Prüfung: 03. Juni 2004 Dekan / Dekanin: Prof. Dr. Bernhard Steffen Gutachter / Gutachterinnen: Prof. Dr. Francky Catthoor, Prof. Dr

Tag der mündlichen Prüfung: 03. Juni 2004 Dekan / Dekanin: Prof. Dr. Bernhard Steffen Gutachter / Gutachterinnen: Prof. Dr. Francky Catthoor, Prof. Dr Source Code Optimization Techniques for Data Flow Dominated Embedded Software Dissertation zur Erlangung des Grades eines Doktors der Naturwissenschaften der Universität Dortmund am Fachbereich Informatik

More information

Copyright 2012, Elsevier Inc. All rights reserved.

Copyright 2012, Elsevier Inc. All rights reserved. Computer Architecture A Quantitative Approach, Fifth Edition Chapter 1 Fundamentals of Quantitative Design and Analysis 1 Computer Technology Performance improvements: Improvements in semiconductor technology

More information

DOTTORATO DI RICERCA

DOTTORATO DI RICERCA Università degli Studi di Cagliari DOTTORATO DI RICERCA INGEGNERIA ELETTRONICA ED INFORMATICA Ciclo XXV Integrated support for Adaptivity and Fault-tolerance in MPSoCs Settore scientifico disciplinare

More information

Quality-driven model-based design of multi-processor accelerators : an application to LDPC decoders Jan, Y.

Quality-driven model-based design of multi-processor accelerators : an application to LDPC decoders Jan, Y. Quality-driven model-based design of multi-processor accelerators : an application to LDPC decoders Jan, Y DOI: 106100/IR732195 Published: 01/01/2012 Document Version Publisher s PDF, also known as Version

More information

Domain Specific Modeling and Analysis

Domain Specific Modeling and Analysis Domain Specific Modeling and Analysis PROEFSCHRIFT ter verkrijging van de graad van Doctor aan de Universiteit Leiden, op gezag van Rector Magnificus prof.mr. P.F. van der Heijden, volgens besluit van

More information

Dealing with dynamism in embedded system design

Dealing with dynamism in embedded system design Dealing with dynamism in embedded system design Gheorghita, S.V. DOI: 10.6100/IR630369 Published: 01/01/2007 Document Version Publisher s PDF, also known as Version of Record (includes final page, issue

More information

Symbolic Buffer Sizing for Throughput-Optimal Scheduling of Dataflow Graphs

Symbolic Buffer Sizing for Throughput-Optimal Scheduling of Dataflow Graphs Symbolic Buffer Sizing for Throughput-Optimal Scheduling of Dataflow Graphs Anan Bouakaz Pascal Fradet Alain Girault Real-Time and Embedded Technology and Applications Symposium, Vienna April 14th, 2016

More information

System-level design space exploration of dynamic reconfigurable architectures Sigdel, K.; Thompson, M.; Pimentel, A.D.; Stefanov, T.; Bertels, K.

System-level design space exploration of dynamic reconfigurable architectures Sigdel, K.; Thompson, M.; Pimentel, A.D.; Stefanov, T.; Bertels, K. UvA-DARE (Digital Academic Repository) System-level design space exploration of dynamic reconfigurable architectures Sigdel, K.; Thompson, M.; Pimentel, A.D.; Stefanov, T.; Bertels, K. Published in: Lecture

More information

Summary of Contents LIST OF FIGURES LIST OF TABLES

Summary of Contents LIST OF FIGURES LIST OF TABLES Summary of Contents LIST OF FIGURES LIST OF TABLES PREFACE xvii xix xxi PART 1 BACKGROUND Chapter 1. Introduction 3 Chapter 2. Standards-Makers 21 Chapter 3. Principles of the S2ESC Collection 45 Chapter

More information

System-Level Design Space Exploration of Dynamic Reconfigurable Architectures

System-Level Design Space Exploration of Dynamic Reconfigurable Architectures System-Level Design Space Exploration of Dynamic Reconfigurable Architectures Kamana Sigdel 1,MarkThompson 2, Andy D. Pimentel 2, Todor Stefanov 1, and Koen Bertels 1 1 Computer Engineering, EEMCS, Delft

More information

Analysis, Design and Management of Multimedia Multiprocessor Systems.

Analysis, Design and Management of Multimedia Multiprocessor Systems. Analysis, Design and Management of Multimedia Multiprocessor Systems. PROEFSCHRIFT ter verkrijging van de graad van doctor aan de Technische Universiteit Eindhoven, op gezag van de Rector Magnificus, prof.dr.ir.

More information

A Cache-Based Hardware Accelerator for Memory. Data Movements

A Cache-Based Hardware Accelerator for Memory. Data Movements A Cache-Based Hardware Accelerator for Memory Data Movements A Cache-Based Hardware Accelerator for Memory Data Movements PROEFSCHRIFT ter verkrijging van de graad van doctor aan de Technische Universiteit

More information

Mining Semi-Structured Data

Mining Semi-Structured Data Mining Semi-Structured Data Theoretical and Experimental Aspects of Pattern Evaluation E.H. de Graaf Mining Semi-Structured Data Theoretical and Experimental Aspects of Pattern Evaluation proefschrift

More information

Scenario-based Design Space Exploration

Scenario-based Design Space Exploration Andy D. Pimentel and Peter van Stralen Abstract Modern embedded systems are becoming increasingly multifunctional and, as a consequence, they more and more have to deal with dynamic application workloads.

More information

Energy-Aware Scheduling for Acyclic Synchronous Data Flows on Multiprocessors

Energy-Aware Scheduling for Acyclic Synchronous Data Flows on Multiprocessors Journal of Interconnection Networks c World Scientific Publishing Company Energy-Aware Scheduling for Acyclic Synchronous Data Flows on Multiprocessors DAWEI LI and JIE WU Department of Computer and Information

More information

System-level Synthesis of Dataflow Applications for FPGAbased Distributed Platforms

System-level Synthesis of Dataflow Applications for FPGAbased Distributed Platforms System-level Synthesis of Dataflow Applications for FPGAbased Distributed Platforms Hugo A. Andrade, Kaushik Ravindran, Alejandro Asenjo, Casey Weltzin NI Berkeley, NI Austin National Instruments Corporation

More information

Multicore SoC is coming. Scalable and Reconfigurable Stream Processor for Mobile Multimedia Systems. Source: 2007 ISSCC and IDF.

Multicore SoC is coming. Scalable and Reconfigurable Stream Processor for Mobile Multimedia Systems. Source: 2007 ISSCC and IDF. Scalable and Reconfigurable Stream Processor for Mobile Multimedia Systems Liang-Gee Chen Distinguished Professor General Director, SOC Center National Taiwan University DSP/IC Design Lab, GIEE, NTU 1

More information

DIGITAL VS. ANALOG SIGNAL PROCESSING Digital signal processing (DSP) characterized by: OUTLINE APPLICATIONS OF DIGITAL SIGNAL PROCESSING

DIGITAL VS. ANALOG SIGNAL PROCESSING Digital signal processing (DSP) characterized by: OUTLINE APPLICATIONS OF DIGITAL SIGNAL PROCESSING 1 DSP applications DSP platforms The synthesis problem Models of computation OUTLINE 2 DIGITAL VS. ANALOG SIGNAL PROCESSING Digital signal processing (DSP) characterized by: Time-discrete representation

More information

UvA-DARE (Digital Academic Repository)

UvA-DARE (Digital Academic Repository) UvA-DARE (Digital Academic Repository) Daedalus: Toward composable multimedia MP-SoC design Nikolov, H.; Thompson, M.; Stefanov, T.; Pimentel, A.D.; Polstra, S.; Bose, R.; Zissulescu, C.; Deprettere, E.

More information

A Reconfigurable Crossbar Switch with Adaptive Bandwidth Control for Networks-on

A Reconfigurable Crossbar Switch with Adaptive Bandwidth Control for Networks-on A Reconfigurable Crossbar Switch with Adaptive Bandwidth Control for Networks-on on-chip Donghyun Kim, Kangmin Lee, Se-joong Lee and Hoi-Jun Yoo Semiconductor System Laboratory, Dept. of EECS, Korea Advanced

More information

Lecture 4: Synchronous Data Flow Graphs - HJ94 goal: Skiing down a mountain

Lecture 4: Synchronous Data Flow Graphs - HJ94 goal: Skiing down a mountain Lecture 4: Synchronous ata Flow Graphs - I. Verbauwhede, 05-06 K.U.Leuven HJ94 goal: Skiing down a mountain SPW, Matlab, C pipelining, unrolling Specification Algorithm Transformations loop merging, compaction

More information

Focused Information Access using XML Element Retrieval

Focused Information Access using XML Element Retrieval Focused Information Access using XML Element Retrieval Börkur Sigurbjörnsson Promotor: Prof.dr. Maarten de Rijke Co-promotor: Dr.ir. Jaap Kamps Committee: Prof.Dr.-Ing. Norbert Fuhr Prof. Mounia Lalmas

More information

PERFORMANCE ANALYSIS OF REAL-TIME EMBEDDED SOFTWARE

PERFORMANCE ANALYSIS OF REAL-TIME EMBEDDED SOFTWARE PERFORMANCE ANALYSIS OF REAL-TIME EMBEDDED SOFTWARE PERFORMANCE ANALYSIS OF REAL-TIME EMBEDDED SOFTWARE Yau-Tsun Steven Li Monterey Design Systems, Inc. Sharad Malik Princeton University ~. " SPRINGER

More information

Real-Time Dynamic Voltage Hopping on MPSoCs

Real-Time Dynamic Voltage Hopping on MPSoCs Real-Time Dynamic Voltage Hopping on MPSoCs Tohru Ishihara System LSI Research Center, Kyushu University 2009/08/05 The 9 th International Forum on MPSoC and Multicore 1 Background Low Power / Low Energy

More information

Test Point Insertion to improve BIST performance, and to reduce ATPG test time and data volume

Test Point Insertion to improve BIST performance, and to reduce ATPG test time and data volume Test Point Insertion to improve BIST performance, and to reduce ATPG test time and data volume Test Point Insertion to improve BIST performance, and to reduce ATPG test time and data volume Proefschrift

More information

MAMPSx: A Design Framework for Rapid Synthesis of Predictable Heterogeneous MPSoCs

MAMPSx: A Design Framework for Rapid Synthesis of Predictable Heterogeneous MPSoCs MAMPSx: A Design Framework for Rapid Synthesis of Predictable Heterogeneous MPSoCs Shakith Fernando, Firew Siyoum, Yifan He, Akash Kumar 2 and Henk Corporaal Department of Electrical Engineering, Eindhoven

More information

Mapping of Applications to Multi-Processor Systems

Mapping of Applications to Multi-Processor Systems Springer, 2010 Mapping of Applications to Multi-Processor Systems Peter Marwedel TU Dortmund, Informatik 12 Germany 2014 年 01 月 17 日 These slides use Microsoft clip arts. Microsoft copyright restrictions

More information

A Unified HW/SW Interface Model to Remove Discontinuities between HW and SW Design

A Unified HW/SW Interface Model to Remove Discontinuities between HW and SW Design A Unified HW/SW Interface Model to Remove Discontinuities between HW and SW Design Ahmed Amine JERRAYA EPFL November 2005 TIMA Laboratory 46 Avenue Felix Viallet 38031 Grenoble CEDEX, France Email: Ahmed.Jerraya@imag.fr

More information

Adaptive Information Filtering concepts and algorithms

Adaptive Information Filtering concepts and algorithms Adaptive Information Filtering concepts and algorithms Adaptive Information Filtering concepts and algorithms Proefschrift ter verkrijging van de graad van Doctor aan de Universiteit Leiden, op gezag

More information

Easy Multicore Programming using MAPS

Easy Multicore Programming using MAPS Easy Multicore Programming using MAPS Jeronimo Castrillon, Maximilian Odendahl Multicore Challenge Conference 2012 September 24 th, 2012 Institute for Communication Technologies and Embedded Systems Outline

More information

Scalable Video Coding

Scalable Video Coding Scalable Video Coding Scalable Video Coding Proefschrift ter verkrijging van de graad van doctor aan de Technische Universiteit Delft, op gezag van de Rector Magnificus prof.ir. K.C.A.M. Luyben; voorzitter

More information

IN order to increase design productivity, raising the level

IN order to increase design productivity, raising the level IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS 1 Electronic System-Level Synthesis Methodologies Andreas Gerstlauer, Member, IEEE, Christian Haubelt, Member, IEEE, Andy D.

More information

Code Generation for Transport Triggered Architectures

Code Generation for Transport Triggered Architectures Code Generation for Transport Triggered Architectures Code Generation for Transport Triggered Architectures PROEFSCHRIFT ter verkrijging van de graad van doctor aan de Technische Universiteit Delft, op

More information

Contents The Definition of a Fieldbus An Introduction to Industrial Systems Communications.

Contents The Definition of a Fieldbus An Introduction to Industrial Systems Communications. Contents Page List of Tables. List of Figures. List of Symbols. Dedication. Acknowledgment. Abstract. x xi xv xxi xxi xxii Chapter 1 Introduction to FieldBuses Systems. 1 1.1. The Definition of a Fieldbus.

More information

Run-time Spatial Mapping of Streaming Applications to a Heterogeneous Multi-Processor System-on-Chip (MPSOC)

Run-time Spatial Mapping of Streaming Applications to a Heterogeneous Multi-Processor System-on-Chip (MPSOC) un-time Spatial Mapping of Streaming Applications to a Heterogeneous Multi-Processor System-on-Chip (MPSOC) Philip K.F. Hölzenspies, Johann L. Hurink, Jan Kuper, Gerard J.M. Smit University of Twente Department

More information

Embedded Systems. Information. TDDD93 Large-Scale Distributed Systems and Networks

Embedded Systems. Information. TDDD93 Large-Scale Distributed Systems and Networks TDDD93 Fö Embedded Systems - TDDD93 Fö Embedded Systems - 2 Information TDDD93 Large-Scale Distributed Systems and Networks Lectures on Lecture notes: available from the course page, latest 24 hours before

More information

The TM3270 Media-processor. Jan-Willem van de Waerdt

The TM3270 Media-processor. Jan-Willem van de Waerdt The TM3270 Media-processor Jan-Willem van de Waerdt The work described in this thesis was carried out at Philips Semiconductors, San Jose, USA. On the front cover: Realization of the TriMedia TM3270 media-processor

More information

DUE to tight resource constraints, strict real-time performance

DUE to tight resource constraints, strict real-time performance 3126 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 55, NO. 6, JUNE 2007 Parameterized Looped Schedules for Compact Representation of Execution Sequences in DSP Hardware and Software Implementation Ming-Yung

More information

Resource Manager for Non-preemptive Heterogeneous Multiprocessor System-on-chip

Resource Manager for Non-preemptive Heterogeneous Multiprocessor System-on-chip Resource Manager for Non-preemptive Heterogeneous Multiprocessor System-on-chip Akash Kumar, Bart Mesman, Bart Theelen and Henk Corporaal Eindhoven University of Technology 5600MB Eindhoven, The Netherlands

More information

Architectural Notes: a Framework for Distributed Systems Development

Architectural Notes: a Framework for Distributed Systems Development Architectural Notes: a Framework for Distributed Systems Development Luís Ferreira Pires Architectural Notes: a Framework for Distributed Systems Development CTIT Ph. D-thesis series No. 94-01 P.O. Box

More information

A New CDMA Encoding/Decoding Method for on- Chip Communication Network

A New CDMA Encoding/Decoding Method for on- Chip Communication Network A New CDMA Encoding/Decoding Method for on- Chip Communication Network Abstract: As a high performance on-chip communication method, the code division multiple access (CDMA) technique has recently been

More information

EFFICIENT AUTOMATED SYNTHESIS, PROGRAMING, AND IMPLEMENTATION OF MULTI-PROCESSOR PLATFORMS ON FPGA CHIPS. Hristo Nikolov Todor Stefanov Ed Deprettere

EFFICIENT AUTOMATED SYNTHESIS, PROGRAMING, AND IMPLEMENTATION OF MULTI-PROCESSOR PLATFORMS ON FPGA CHIPS. Hristo Nikolov Todor Stefanov Ed Deprettere EFFICIENT AUTOMATED SYNTHESIS, PROGRAMING, AND IMPLEMENTATION OF MULTI-PROCESSOR PLATFORMS ON FPGA CHIPS Hristo Nikolov Todor Stefanov Ed Deprettere Leiden Embedded Research Center Leiden Institute of

More information

Cover Page. The handle holds various files of this Leiden University dissertation

Cover Page. The handle   holds various files of this Leiden University dissertation Cover Page The handle http://hdl.handle.net/1887/21017 holds various files of this Leiden University dissertation Author: Balevic, Ana Title: Exploiting multi-level parallelism in streaming applications

More information

Fundamentals of Quantitative Design and Analysis

Fundamentals of Quantitative Design and Analysis Fundamentals of Quantitative Design and Analysis Dr. Jiang Li Adapted from the slides provided by the authors Computer Technology Performance improvements: Improvements in semiconductor technology Feature

More information

ADSL Transmitter Modeling and Simulation. Department of Electrical and Computer Engineering University of Texas at Austin. Kripa Venkatachalam.

ADSL Transmitter Modeling and Simulation. Department of Electrical and Computer Engineering University of Texas at Austin. Kripa Venkatachalam. ADSL Transmitter Modeling and Simulation Department of Electrical and Computer Engineering University of Texas at Austin Kripa Venkatachalam Qiu Wu EE382C: Embedded Software Systems May 10, 2000 Abstract

More information

CA-MPSoC: An Automated Design Flow for Predictable Multi-processor Architectures for Multiple Applications

CA-MPSoC: An Automated Design Flow for Predictable Multi-processor Architectures for Multiple Applications -MPSoC: An Automated Design Flow for Predictable Multi-processor Architectures for Multiple Applications A. Shabbir,a,, A. Kumar a,b,, S. Stuijk a,, B. Mesman a,, H. Corporaal a, a Eindhoven University

More information

ARTIST-Relevant Research from Linköping

ARTIST-Relevant Research from Linköping ARTIST-Relevant Research from Linköping Department of Computer and Information Science (IDA) Linköping University http://www.ida.liu.se/~eslab/ 1 Outline Communication-Intensive Real-Time Systems Timing

More information