Designing Predictable Real-Time and Embedded Systems

Size: px
Start display at page:

Download "Designing Predictable Real-Time and Embedded Systems"

Transcription

1 Designing Predictable Real-Time and Embedded Systems Juniorprofessor Dr. Jian-Jia Chen Karlsruhe Institute of Technology (KIT), Germany 0 KIT Feb. University 27-29, 2012 of at thetu-berlin, State of Baden-Wuerttemberg Berlin, Germany: and by Dr. Jian-Jia Chen National Laboratory of the Helmholtz Association

2 About Me Education B.s. Chemistry: National Taiwan University Taiwan Ph.D. Computer Science and Information Engineering: National Taiwan University Taiwan Working Experience 2007 : Compulsory Civil Service in Taiwan April : Postdoc at ETH Zurich, Switzerland 2010 May - Present: Juniorprofessor at Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany Research topics: Low-Power and Energy-Efficient Considerations Timing Predictability of MPSoCs (multiprocessor systems on chip) Energy-Harvesting Systems Reliability System Designs Design Automation Chair of Micro Hardware Technologies for Automation Leader: Jian-Jia Chen 2 Mitarbeiter (one will join in Feb. 2012), 1 Doctoral Stipend 1 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

3 Predictability Due to Resource Sharing Task is executed on Multi Core CPU 1 Multi Core CPU 2 Main Memory 2 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

4 Predictability Due to Resource Sharing cache misses Multi Core CPU 1 Multi Core CPU 2 Main Memory 2 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

5 Predictability Due to Resource Sharing cache misses Multi Core CPU 1 Multi Core CPU 2 Main Memory 2 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

6 Predictability Due to Resource Sharing Access memory Multi Core CPU 1 Multi Core CPU 2 Main Memory 2 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

7 Predictability Due to Resource Sharing Task is executed on Multi Core CPU 1 Multi Core CPU 2 Main Memory 2 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

8 Predictability Due to Resource Sharing cache misses Multi Core CPU 1 Multi Core CPU 2 Main Memory 2 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

9 Predictability Due to Resource Sharing is blocked Multi Core CPU 1 Multi Core CPU 2 Main Memory 2 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

10 Predictability Due to Resource Sharing Task is executed on Core 3 Multi Core CPU 1 Multi Core CPU 2 Main Memory 2 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

11 Predictability Due to Resource Sharing cache misses Multi Core CPU 1 Multi Core CPU 2 Main Memory 2 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

12 Predictability Due to Resource Sharing cache misses Multi Core CPU 1 Multi Core CPU 2 Main Memory 2 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

13 Predictability Due to Resource Sharing Memory access is blocked Multi Core CPU 1 Multi Core CPU 2 Main Memory 2 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

14 Predictability Due to Resource Sharing Memory access for core 1 finishes Multi Core CPU 1 Multi Core CPU 2 Main Memory 2 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

15 Simple Problems for Shared Resources Input: Structure of tasks on the core Tasks modeled as sequential / time-triggered superblocks Arbitration policy on the shared resource Static arbiter (TDMA) Dynamic arbiter (FCFS, RR) Adaptive arbiter The timing behavior of the architecture is predictable Output: Resource access time is bounded and tight after granted Only timing interference is considered Spatial interference, such as cache replacement policy, etc., is analyzed beforehand What is the worst-case response time? How do we determine the schedulability due to timing interference? 3 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

16 RTC Event Models Given traces of event arrivals in time domain RTC models event arrivals by arrival curves in interval domain time workload maximum events in 3 units possible events in 3 units minimum events in 3 units time Ernesto Wandeler, Lothar Thiele, Marcel Verhoef, Paul Lieverse: System Architecture Evaluation Using Modular Performance Analysis - A Case Study Software Tools for Technology Transfer (STTT), Springer, Vol. 8, No. 6, pages , October, Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

17 RTC Event Models Given traces of event arrivals in time domain RTC models event arrivals by arrival curves in interval domain time workload maximum events in 3 units possible events in 3 units minimum events in 3 units time Ernesto Wandeler, Lothar Thiele, Marcel Verhoef, Paul Lieverse: System Architecture Evaluation Using Modular Performance Analysis - A Case Study Software Tools for Technology Transfer (STTT), Springer, Vol. 8, No. 6, pages , October, Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

18 RTC Event Models Given traces of event arrivals in time domain RTC models event arrivals by arrival curves in interval domain time workload maximum events in 3 units possible events in 3 units minimum events in 3 units time Ernesto Wandeler, Lothar Thiele, Marcel Verhoef, Paul Lieverse: System Architecture Evaluation Using Modular Performance Analysis - A Case Study Software Tools for Technology Transfer (STTT), Springer, Vol. 8, No. 6, pages , October, Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

19 RTC Event Models Given traces of event arrivals in time domain RTC models event arrivals by arrival curves in interval domain time workload maximum events in 3 units possible events in 3 units minimum events in 3 units time Ernesto Wandeler, Lothar Thiele, Marcel Verhoef, Paul Lieverse: System Architecture Evaluation Using Modular Performance Analysis - A Case Study Software Tools for Technology Transfer (STTT), Springer, Vol. 8, No. 6, pages , October, Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

20 RTC Event Models Given traces of event arrivals in time domain RTC models event arrivals by arrival curves in interval domain time workload maximum events in 3 units possible events in 3 units minimum events in 3 units time Ernesto Wandeler, Lothar Thiele, Marcel Verhoef, Paul Lieverse: System Architecture Evaluation Using Modular Performance Analysis - A Case Study Software Tools for Technology Transfer (STTT), Springer, Vol. 8, No. 6, pages , October, Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

21 RTC Event Models Given traces of event arrivals in time domain RTC models event arrivals by arrival curves in interval domain time workload maximum events in 3 units possible events in 3 units minimum events in 3 units time Ernesto Wandeler, Lothar Thiele, Marcel Verhoef, Paul Lieverse: System Architecture Evaluation Using Modular Performance Analysis - A Case Study Software Tools for Technology Transfer (STTT), Springer, Vol. 8, No. 6, pages , October, Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

22 RTC Event Models Given traces of event arrivals in time domain RTC models event arrivals by arrival curves in interval domain time workload maximum events in 3 units possible events in 3 units minimum events in 3 units time Ernesto Wandeler, Lothar Thiele, Marcel Verhoef, Paul Lieverse: System Architecture Evaluation Using Modular Performance Analysis - A Case Study Software Tools for Technology Transfer (STTT), Springer, Vol. 8, No. 6, pages , October, Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

23 RTC Event Models Given traces of event arrivals in time domain RTC models event arrivals by arrival curves in interval domain time workload maximum events in 3 units possible events in 3 units minimum events in 3 units time Ernesto Wandeler, Lothar Thiele, Marcel Verhoef, Paul Lieverse: System Architecture Evaluation Using Modular Performance Analysis - A Case Study Software Tools for Technology Transfer (STTT), Springer, Vol. 8, No. 6, pages , October, Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

24 RTC Event Models Given traces of event arrivals in time domain RTC models event arrivals by arrival curves in interval domain time workload maximum events in 3 units possible events in 3 units minimum events in 3 units time Ernesto Wandeler, Lothar Thiele, Marcel Verhoef, Paul Lieverse: System Architecture Evaluation Using Modular Performance Analysis - A Case Study Software Tools for Technology Transfer (STTT), Springer, Vol. 8, No. 6, pages , October, Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

25 Graphical Interpretation workload Maximum buffer B [β l, β u ] [α l, α u ] GPC [β l, β u ] [α l, α u ] Maximum response time D time D = sup{inf{τ 0 : R(t) R (t + τ)} t 0 = sup{inf{τ 0 : α u ( ) β l ( + τ)} 0 5 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

26 Outline 6 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

27 State-of-the-Art Multicore Systems Architecture with shared resource shared memory, communication peripherals, I/O peripherals Blocking access to shared resource one request at a time is served stalling due to contention Possible approaches to reduce the contention structure of tasks on the cores arbitration policy on the shared resource (static/dynamic) 7 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

28 Task Model A sequence of subsequent super-blocks Each super-block j of task i is defined by upper e max i,j upper µ max / lower e min i,j / lower accesses µ min execution time i,j i,j to a shared resource static analysis for two phases (1) assuming resource access requires no additional time to get the execution time e i,j (2) only considering the number of shared resource accesses. communication delay c depends on resource Executing periodically Access to shared resources can happen anytime 8 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

29 Worst-Case Execution Time and Resource Accesses Executable Binary Program Control Flow Graph (CFG) Reconstruction Loop Analysis and Unfolding Loop Bounds Static Analysis Value Analyzer Micro Architecture Abstraction Path Analysis ILP Generator /Pipeline Analyzer Timing Information ILP Solver Micro architecture Analysis WCET Visualization and Analysis Evaluation Worst Case Path Analysis 9 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

30 Task Model (cont.) Multiple tasks execute in a time wheel Periodic sequence of statically scheduled tasks gap g between two tasks is variable 10 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

31 Deriving Interfering (Arrival) Curves find all well defined time-windows count number of events that can happen s1,1 s2,1 s3,1 s4,1 s5,1 s6,1 max µ 1,1 exec max 2,1 min µ 1,1 exec min 2,1 S 1 µ max 3,1 µ min 3,1 µ max 4,1 exec max 5,1 µ max 6,1 µ min 4,1 exec min 5,1 µ min 6,1 g s1,1 s2,1 s3,1 s4,1 s5,1 s6,1 max µ 1,1 exec max 2,1 min µ 1,1 exec min 2,1 S 1 µ max 3,1 µ min 3,1 µ max 4,1 µ min 4,1 exec max 5,1 exec min 5,1 µ max 6,1 µ min 6,1 ρ 1,1 ρ 1,1+W1 1 superblock 2 superblocks 4 superblocks t t ˆ =< µ max 1,1,0> ˆ max max max t =< µ 1,1 + µ 1,2, µ 1,1 C> 4 3 ˆ max max min t =< µ k,1, µ k, 1 C+ exec2,1 > k= 1 k= 1 7 superblocks 7 6 ˆ max max min min t =< µ k,1, µ k,1 C+ exec2,1 + exec5,1 + g> k= 1 k= 1 maximize/minimize to compute the gap 2x 5 superblocks 7 6 ˆ max max min t =< µ k,1, µ k,1 C+ exec5,1 + g> k= 3 k= 3 superblocks considerd for gap computation superblocks considerd for time window computation relevant time window 11 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

32 Resulting Interfering Curves Interference of each processing element ᾱ l i ( ) is the interference of PE i onto resource l by assuming PE i executes in isolation α i ( ) { ˆα i ( ) = max α i ( ), α i ( p i ) + (µ max i,j ) j α i ( k p i ) + k j } 0 p p i 2p (µ max i,j ) otherwise 12 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

33 Dynamic Arbitration Interference from other elements as arrival curve Access profile from superblock under analysis 13 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

34 Analysis for Dynamic Arbitration Use Dynamic Programming: 14 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

35 TDMA on the Shared Resource Independence between tasks Single source of interference 15 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

36 Superblocks with Phases Tasks are structured as sequences of superblocks fixed order of execution upper bound on execution and communication phases may be present (acquisition/execution/replication) 16 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

37 Access Models General Model Dedicated Model Hybrid Model S1 W S s s 1,1 2,1 s3,1 s1,1 l1,1 l2,1 l3,1 l1,1 A/E/R A/E/R A/E/R A/E/R... ρ +W ρ1,1 1,1 1 S1 W1 S1 W1 s1,1 s2,1 s3,1 s1,1 l1,1 l2,1 l3,1 l1,1 ρ 1,1 ρ 1,1 A E R A E R A E R A E... ρ +W 1,1 1 S1 W1 S1 W1 s1,1 s2,1 s3,1 s1,1 l1,1 l2,1 l3,1 l1,1 A A/E/R R A A/E/R R A A/E/R R A A/E/R... ρ 1,1+W1 W t t t 17 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

38 Adaptive Arbitration superblock s 1,1 A E R max,a 1,1 exec max max,r 1,1 1,1 superblock s 1,2 A/E/R max 1,2 exec max PU1 PU2 PU3 1,2 shared resource superblock s 1,3 A A/E/R R... max,a 1,3 max,e max,r 1,3 1,3 exec max 1,3 t Use dynamic programming The algorithm is very complicated minislot M 3 3 2,2 1 3, ,1 1, L ,1 2,2 3,3... t static dynamic 18 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

39 Adaptive Arbitration superblock s 1,1 A E R max,a 1,1 exec max max,r 1,1 1,1 superblock s 1,2 A/E/R max 1,2 exec max PU1 PU2 PU3 1,2 shared resource superblock s 1,3 A A/E/R R... max,a 1,3 max,e max,r 1,3 1,3 exec max 1,3 t Use dynamic programming The algorithm is very complicated minislot M 3 3 2,2 1 3, ,1 1, L ,1 2,2 3,3... t static dynamic 18 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

40 Our Status 19 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

41 Where Are We? Safe worst-case response time analysis under the following assumptions: sequential superblock executions known resource arbitration known task assignment synchronized release of the first superblocks VERY pessimistic interference by summing the curves of the interfering cores together Tightness This is safe, but it does not consider the interferences precisely The more cores the system has, the more pessimistic the resulting interference curve is. We believe our results are not very close to the actual worst cases However, these attempts are safe Comparisons of TDMA and Round-Robin/FIFO TDMA provides some resource reservation, but the worst-case is to WASTE it; however, it is easier to be analyzed tightly Round-Robin and FIFO are more difficult to be analyzed tightly 20 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

42 On-going works Dynamic servers may be helpful They provide resource reservations for guaranteeing the service a core may get They reservation is not based on TDMA, but the worst-case behavior may be like TDMA Good candidates: Constant Bandwidth Server (CBS) and Total Bandwidth Server (TBS) Optimizing the configuration of resource arbiters For example, task assignment, slot assignments in TDMA, slot assignments in adaptive arbiters, etc. We are not able to provide any good/reasonable approaches. 21 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

43 Conclusion Resource Sharing in multicore systems is an important issue in terms of Predictability Efficiency Resource Sharing is not An Easy Problem Static arbitration policies Elimination of Interference Dynamic arbitration policies Approximation of Interference 22 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

44 Conclusion Resource Sharing in multicore systems is an important issue in terms of Predictability Efficiency Resource Sharing is not An Easy Problem Static arbitration policies Elimination of Interference Dynamic arbitration policies Approximation of Interference 22 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

45 Other On-Going Works Power Management and Analysis Power management for multiple voltage-island multicore architecture Worst-case thermal analysis for multicore systems, 3D architectures, and schedulability analysis. Power management for smart grid and data centers. Multicore Resource Sharing Combining analysis by using the abstract-level modeling and fine-grained model checking Extending to more general event models Adopting research reservation servers in resource arbitration and timing analysis Schedulability for real-time systems and real-time database systems Working on resource augmentation analysis and design (best paper candidate in RTSS 2011) 23 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

46 Thank you! 24 Feb , 2012 at TU-Berlin, Berlin, Germany: by Dr. Jian-Jia Chen

Real-Time Communications. LS 12, TU Dortmund

Real-Time Communications. LS 12, TU Dortmund Real-Time Communications Prof. Dr. Jian-Jia Chen LS 12, TU Dortmund 20, Jan., 2016 Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 1 / 29 Random Access no access control; requires low medium utilization Prof.

More information

Evaluation and Validation

Evaluation and Validation 12 Evaluation and Validation Peter Marwedel TU Dortmund, Informatik 12 Germany Graphics: Alexandra Nolte, Gesine Marwedel, 2003 2010 年 12 月 05 日 These slides use Microsoft clip arts. Microsoft copyright

More information

Timing analysis and timing predictability

Timing analysis and timing predictability Timing analysis and timing predictability Architectural Dependences Reinhard Wilhelm Saarland University, Saarbrücken, Germany ArtistDesign Summer School in China 2010 What does the execution time depends

More information

Impact of Resource Sharing on Performance and Performance Prediction: A Survey

Impact of Resource Sharing on Performance and Performance Prediction: A Survey Impact of Resource Sharing on Performance and Performance Prediction: A Survey Andreas Abel, Florian Benz, Johannes Doerfert, Barbara Dörr, Sebastian Hahn, Florian Haupenthal, Michael Jacobs, Amir H. Moin,

More information

A Formal Approach to the WCRT Analysis of Multicore Systems with Memory Contention under Phase-structured Task Sets

A Formal Approach to the WCRT Analysis of Multicore Systems with Memory Contention under Phase-structured Task Sets Noname manuscript No. (will be inserted by the editor) A Formal Approach to the WCRT Analysis of Multicore Systems with Memory Contention under Phase-structured Task Sets Kai Lampka Georgia Giannopoulou

More information

-- the Timing Problem & Possible Solutions

-- the Timing Problem & Possible Solutions ARTIST Summer School in Europe 2010 Autrans (near Grenoble), France September 5-10, 2010 Towards Real-Time Applications on Multicore -- the Timing Problem & Possible Solutions Wang Yi Uppsala University,

More information

Timing Analysis on Complex Real-Time Automotive Multicore Architectures

Timing Analysis on Complex Real-Time Automotive Multicore Architectures 2 nd Workshop on Mapping Applications to MPSoCs St. Goar, June 2009 Timing Analysis on Complex Real-Time Automotive Multicore Architectures Mircea Negrean Simon Schliecker Rolf Ernst Technische Universität

More information

Timed Model Checking with Abstractions: Towards Worst-Case Response Time Analysis in Resource-Sharing Manycore Systems

Timed Model Checking with Abstractions: Towards Worst-Case Response Time Analysis in Resource-Sharing Manycore Systems Timed Model Checking with Abstractions: Towards Worst-Case Response Time Analysis in Resource-Sharing Manycore Systems Georgia Giannopoulou Kai Lampka Nikolay Stoimenov Lothar Thiele Computer Engineering

More information

CONTENTION IN MULTICORE HARDWARE SHARED RESOURCES: UNDERSTANDING OF THE STATE OF THE ART

CONTENTION IN MULTICORE HARDWARE SHARED RESOURCES: UNDERSTANDING OF THE STATE OF THE ART CONTENTION IN MULTICORE HARDWARE SHARED RESOURCES: UNDERSTANDING OF THE STATE OF THE ART Gabriel Fernandez 1, Jaume Abella 2, Eduardo Quiñones 2, Christine Rochange 3, Tullio Vardanega 4 and Francisco

More information

Worst Case Analysis of DRAM Latency in Multi-Requestor Systems. Zheng Pei Wu Yogen Krish Rodolfo Pellizzoni

Worst Case Analysis of DRAM Latency in Multi-Requestor Systems. Zheng Pei Wu Yogen Krish Rodolfo Pellizzoni orst Case Analysis of DAM Latency in Multi-equestor Systems Zheng Pei u Yogen Krish odolfo Pellizzoni Multi-equestor Systems CPU CPU CPU Inter-connect DAM DMA I/O 1/26 Multi-equestor Systems CPU CPU CPU

More information

Partitioned Fixed-Priority Scheduling of Parallel Tasks Without Preemptions

Partitioned Fixed-Priority Scheduling of Parallel Tasks Without Preemptions Partitioned Fixed-Priority Scheduling of Parallel Tasks Without Preemptions *, Alessandro Biondi *, Geoffrey Nelissen, and Giorgio Buttazzo * * ReTiS Lab, Scuola Superiore Sant Anna, Pisa, Italy CISTER,

More information

Cross Clock-Domain TDM Virtual Circuits for Networks on Chips

Cross Clock-Domain TDM Virtual Circuits for Networks on Chips Cross Clock-Domain TDM Virtual Circuits for Networks on Chips Zhonghai Lu Dept. of Electronic Systems School for Information and Communication Technology KTH - Royal Institute of Technology, Stockholm

More information

Deterministic Memory Abstraction and Supporting Multicore System Architecture

Deterministic Memory Abstraction and Supporting Multicore System Architecture Deterministic Memory Abstraction and Supporting Multicore System Architecture Farzad Farshchi $, Prathap Kumar Valsan^, Renato Mancuso *, Heechul Yun $ $ University of Kansas, ^ Intel, * Boston University

More information

Formal Modeling and Analysis of Stream Processing Systems

Formal Modeling and Analysis of Stream Processing Systems Formal Modeling and Analysis of Stream Processing Systems Linh T.X. Phan March 2009 Computer and Information Science University of Pennsylvania 1 High performance Highly optimized requirements systems

More information

ait: WORST-CASE EXECUTION TIME PREDICTION BY STATIC PROGRAM ANALYSIS

ait: WORST-CASE EXECUTION TIME PREDICTION BY STATIC PROGRAM ANALYSIS ait: WORST-CASE EXECUTION TIME PREDICTION BY STATIC PROGRAM ANALYSIS Christian Ferdinand and Reinhold Heckmann AbsInt Angewandte Informatik GmbH, Stuhlsatzenhausweg 69, D-66123 Saarbrucken, Germany info@absint.com

More information

ARTIST-Relevant Research from Linköping

ARTIST-Relevant Research from Linköping ARTIST-Relevant Research from Linköping Department of Computer and Information Science (IDA) Linköping University http://www.ida.liu.se/~eslab/ 1 Outline Communication-Intensive Real-Time Systems Timing

More information

Formal Modeling and Analysis of Stream Processing Systems

Formal Modeling and Analysis of Stream Processing Systems Formal Modeling and Analysis of Stream Processing Systems (cont.) Linh T.X. Phan March 2009 Computer and Information Science University of Pennsylvania 1 Previous Lecture General concepts of the performance

More information

Precise Continuous Non-Intrusive Measurement-Based Execution Time Estimation. Boris Dreyer, Christian Hochberger, Simon Wegener, Alexander Weiss

Precise Continuous Non-Intrusive Measurement-Based Execution Time Estimation. Boris Dreyer, Christian Hochberger, Simon Wegener, Alexander Weiss Precise Continuous Non-Intrusive Measurement-Based Execution Time Estimation Boris Dreyer, Christian Hochberger, Simon Wegener, Alexander Weiss This work was funded within the project CONIRAS by the German

More information

Hardware-Software Codesign. 9. Worst Case Execution Time Analysis

Hardware-Software Codesign. 9. Worst Case Execution Time Analysis Hardware-Software Codesign 9. Worst Case Execution Time Analysis Lothar Thiele 9-1 System Design Specification System Synthesis Estimation SW-Compilation Intellectual Prop. Code Instruction Set HW-Synthesis

More information

arxiv: v2 [cs.ds] 22 Jun 2016

arxiv: v2 [cs.ds] 22 Jun 2016 Federated Scheduling Admits No Constant Speedup Factors for Constrained-Deadline DAG Task Systems Jian-Jia Chen Department of Informatics, TU Dortmund University, Germany arxiv:1510.07254v2 [cs.ds] 22

More information

Real-Time Interfaces for Interface-Based Design of Real-Time Systems with Fixed Priority Scheduling

Real-Time Interfaces for Interface-Based Design of Real-Time Systems with Fixed Priority Scheduling Real-Time Interfaces for Interface-Based Design of Real-Time Systems with Fixed Priority Scheduling Ernesto Wandeler Lothar Thiele Computer Engineering and Networks Laboratory Swiss Federal Institute of

More information

A Scalable Multiprocessor Architecture for Pervasive Computing

A Scalable Multiprocessor Architecture for Pervasive Computing A Scalable Multiprocessor Architecture for Pervasive Computing Long Zheng 1,2, Yanchao Lu 3,JingyuZhou 3,,MinyiGuo 3,HaiJin 1, Song Guo 2, Yao Shen 3, Jiehan Zhou 4, and Jukka Riekki 4 1 Huazhong University

More information

Efficient Throughput-Guarantees for Latency-Sensitive Networks-On-Chip

Efficient Throughput-Guarantees for Latency-Sensitive Networks-On-Chip ASP-DAC 2010 20 Jan 2010 Session 6C Efficient Throughput-Guarantees for Latency-Sensitive Networks-On-Chip Jonas Diemer, Rolf Ernst TU Braunschweig, Germany diemer@ida.ing.tu-bs.de Michael Kauschke Intel,

More information

EE382V: System-on-a-Chip (SoC) Design

EE382V: System-on-a-Chip (SoC) Design EE382V: System-on-a-Chip (SoC) Design Lecture 5 Performance Analysis Sources: Prof. Jacob Abraham, UT Austin Prof. Lothar Thiele, ETH Zurich Prof. Reinhard Wilhelm, Saarland Univ. Andreas Gerstlauer Electrical

More information

Memory Efficient Scheduling for Multicore Real-time Systems

Memory Efficient Scheduling for Multicore Real-time Systems Memory Efficient Scheduling for Multicore Real-time Systems by Ahmed Alhammad A thesis presented to the University of Waterloo in fulfillment of the thesis requirement for the degree of Doctor of Philosophy

More information

Workload Characterization Model for Tasks with Variable Execution Demand

Workload Characterization Model for Tasks with Variable Execution Demand Workload Characterization Model for Tasks with Variable Execution Demand Alexander Maxiaguine Simon Künzli Lothar Thiele Computer Engineering and Networks Laboratory Swiss Federal Institute of Technology

More information

Chapter 39: Concepts of Time-Triggered Communication. Wenbo Qiao

Chapter 39: Concepts of Time-Triggered Communication. Wenbo Qiao Chapter 39: Concepts of Time-Triggered Communication Wenbo Qiao Outline Time and Event Triggered Communication Fundamental Services of a Time-Triggered Communication Protocol Clock Synchronization Periodic

More information

Composable Resource Sharing Based on Latency-Rate Servers

Composable Resource Sharing Based on Latency-Rate Servers Composable Resource Sharing Based on Latency-Rate Servers Benny Akesson 1, Andreas Hansson 1, Kees Goossens 2,3 1 Eindhoven University of Technology 2 NXP Semiconductors Research 3 Delft University of

More information

Model-based Analysis of Event-driven Distributed Real-time Embedded Systems

Model-based Analysis of Event-driven Distributed Real-time Embedded Systems Model-based Analysis of Event-driven Distributed Real-time Embedded Systems Gabor Madl Committee Chancellor s Professor Nikil Dutt (Chair) Professor Tony Givargis Professor Ian Harris University of California,

More information

HW/SW Codesign. WCET Analysis

HW/SW Codesign. WCET Analysis HW/SW Codesign WCET Analysis 29 November 2017 Andres Gomez gomeza@tik.ee.ethz.ch 1 Outline Today s exercise is one long question with several parts: Basic blocks of a program Static value analysis WCET

More information

Evaluation and Validation

Evaluation and Validation Springer, 2010 Evaluation and Validation Peter Marwedel TU Dortmund, Informatik 12 Germany 2013 年 12 月 02 日 These slides use Microsoft clip arts. Microsoft copyright restrictions apply. Application Knowledge

More information

On the Use of Greedy Shapers in Real-Time Embedded Systems

On the Use of Greedy Shapers in Real-Time Embedded Systems 1 On the Use of Greedy Shapers in Real-Time Embedded Systems ERNESTO WANDELER, ALEXANDER MAXIAGUINE, and LOTHAR THIELE, ETH Zurich Traffic shaping is a well-known technique in the area of networking and

More information

Modeling Hardware Timing 1 Caches and Pipelines

Modeling Hardware Timing 1 Caches and Pipelines Modeling Hardware Timing 1 Caches and Pipelines Peter Puschner slides: P. Puschner, R. Kirner, B. Huber VU 2.0 182.101 SS 2016 Generic WCET Analysis Framework source code Extraction of Flow Facts Compilation

More information

Combining Simulation and Formal Methods for System-Level Performance Analysis

Combining Simulation and Formal Methods for System-Level Performance Analysis Combining and Formal Methods for System-Level Performance Analysis Simon Künzli ETH Zürich Francesco Poletti Università di Bologna Luca Benini Università di Bologna Lothar Thiele ETH Zürich kuenzli@tik.ee.ethz.ch

More information

Lecture 11: SMT and Caching Basics. Today: SMT, cache access basics (Sections 3.5, 5.1)

Lecture 11: SMT and Caching Basics. Today: SMT, cache access basics (Sections 3.5, 5.1) Lecture 11: SMT and Caching Basics Today: SMT, cache access basics (Sections 3.5, 5.1) 1 Thread-Level Parallelism Motivation: a single thread leaves a processor under-utilized for most of the time by doubling

More information

Lecture: SMT, Cache Hierarchies. Topics: SMT processors, cache access basics and innovations (Sections B.1-B.3, 2.1)

Lecture: SMT, Cache Hierarchies. Topics: SMT processors, cache access basics and innovations (Sections B.1-B.3, 2.1) Lecture: SMT, Cache Hierarchies Topics: SMT processors, cache access basics and innovations (Sections B.1-B.3, 2.1) 1 Thread-Level Parallelism Motivation: a single thread leaves a processor under-utilized

More information

Analysis of AADL Models Using Real-Time Calculus With Applications to Wireless Architectures

Analysis of AADL Models Using Real-Time Calculus With Applications to Wireless Architectures University of Pennsylvania ScholarlyCommons Technical Reports (CIS) Department of Computer & Information Science July 2008 Analysis of AADL Models Using Real-Time Calculus With Applications to Wireless

More information

Resource Sharing and Partitioning in Multicore

Resource Sharing and Partitioning in Multicore www.bsc.es Resource Sharing and Partitioning in Multicore Francisco J. Cazorla Mixed Criticality/Reliability Workshop HiPEAC CSW Barcelona May 2014 Transition to Multicore and Manycores Wanted or imposed

More information

MULTIPROCESSORS AND THREAD-LEVEL. B649 Parallel Architectures and Programming

MULTIPROCESSORS AND THREAD-LEVEL. B649 Parallel Architectures and Programming MULTIPROCESSORS AND THREAD-LEVEL PARALLELISM B649 Parallel Architectures and Programming Motivation behind Multiprocessors Limitations of ILP (as already discussed) Growing interest in servers and server-performance

More information

MULTIPROCESSORS AND THREAD-LEVEL PARALLELISM. B649 Parallel Architectures and Programming

MULTIPROCESSORS AND THREAD-LEVEL PARALLELISM. B649 Parallel Architectures and Programming MULTIPROCESSORS AND THREAD-LEVEL PARALLELISM B649 Parallel Architectures and Programming Motivation behind Multiprocessors Limitations of ILP (as already discussed) Growing interest in servers and server-performance

More information

Administrivia. Mini project is graded. 1 st place: Justin (75.45) 2 nd place: Liia (74.67) 3 rd place: Michael (74.49)

Administrivia. Mini project is graded. 1 st place: Justin (75.45) 2 nd place: Liia (74.67) 3 rd place: Michael (74.49) Administrivia Mini project is graded 1 st place: Justin (75.45) 2 nd place: Liia (74.67) 3 rd place: Michael (74.49) 1 Administrivia Project proposal due: 2/27 Original research Related to real-time embedded

More information

Design and Analysis of Real-Time Systems Microarchitectural Analysis

Design and Analysis of Real-Time Systems Microarchitectural Analysis Design and Analysis of Real-Time Systems Microarchitectural Analysis Jan Reineke Advanced Lecture, Summer 2013 Structure of WCET Analyzers Reconstructs a control-flow graph from the binary. Determines

More information

Managing Memory for Timing Predictability. Rodolfo Pellizzoni

Managing Memory for Timing Predictability. Rodolfo Pellizzoni Managing Memory for Timing Predictability Rodolfo Pellizzoni Thanks This work would not have been possible without the following students and collaborators Zheng Pei Wu*, Yogen Krish Heechul Yun* Renato

More information

Atacama: An Open Experimental Platform for Mixed-Criticality Networking on Top of Ethernet

Atacama: An Open Experimental Platform for Mixed-Criticality Networking on Top of Ethernet Atacama: An Open Experimental Platform for Mixed-Criticality Networking on Top of Ethernet Gonzalo Carvajal 1,2 and Sebastian Fischmeister 1 1 University of Waterloo, ON, Canada 2 Universidad de Concepcion,

More information

Shared Cache Aware Task Mapping for WCRT Minimization

Shared Cache Aware Task Mapping for WCRT Minimization Shared Cache Aware Task Mapping for WCRT Minimization Huping Ding & Tulika Mitra School of Computing, National University of Singapore Yun Liang Center for Energy-efficient Computing and Applications,

More information

Process Scheduling Part 2

Process Scheduling Part 2 Operating Systems and Computer Networks Process Scheduling Part 2 pascal.klein@uni-due.de Alexander Maxeiner, M.Sc. Faculty of Engineering Agenda Process Management Time Sharing Synchronization of Processes

More information

WCET-Aware C Compiler: WCC

WCET-Aware C Compiler: WCC 12 WCET-Aware C Compiler: WCC Jian-Jia Chen (slides are based on Prof. Heiko Falk) TU Dortmund, Informatik 12 2015 年 05 月 05 日 These slides use Microsoft clip arts. Microsoft copyright restrictions apply.

More information

MEMORY/RESOURCE MANAGEMENT IN MULTICORE SYSTEMS

MEMORY/RESOURCE MANAGEMENT IN MULTICORE SYSTEMS MEMORY/RESOURCE MANAGEMENT IN MULTICORE SYSTEMS INSTRUCTOR: Dr. MUHAMMAD SHAABAN PRESENTED BY: MOHIT SATHAWANE AKSHAY YEMBARWAR WHAT IS MULTICORE SYSTEMS? Multi-core processor architecture means placing

More information

Response Time Analysis of Synchronous Data Flow Programs on a Many-Core Processor

Response Time Analysis of Synchronous Data Flow Programs on a Many-Core Processor Response Time Analysis of Synchronous Data Flow Programs on a Many-Core Processor Hamza Rihani, Matthieu Moy, Claire Maiza, Robert Davis, Sebastian Altmeyer To cite this version: Hamza Rihani, Matthieu

More information

DESIGN OF EFFICIENT ROUTING ALGORITHM FOR CONGESTION CONTROL IN NOC

DESIGN OF EFFICIENT ROUTING ALGORITHM FOR CONGESTION CONTROL IN NOC DESIGN OF EFFICIENT ROUTING ALGORITHM FOR CONGESTION CONTROL IN NOC 1 Pawar Ruchira Pradeep M. E, E&TC Signal Processing, Dr. D Y Patil School of engineering, Ambi, Pune Email: 1 ruchira4391@gmail.com

More information

Lecture 5 / Chapter 6 (CPU Scheduling) Basic Concepts. Scheduling Criteria Scheduling Algorithms

Lecture 5 / Chapter 6 (CPU Scheduling) Basic Concepts. Scheduling Criteria Scheduling Algorithms Operating System Lecture 5 / Chapter 6 (CPU Scheduling) Basic Concepts Scheduling Criteria Scheduling Algorithms OS Process Review Multicore Programming Multithreading Models Thread Libraries Implicit

More information

Architecture Modeling and Analysis for Embedded Systems

Architecture Modeling and Analysis for Embedded Systems Architecture Modeling and Analysis for Embedded Systems Overview of AADL and related research activities in RTG Oleg Sokolsky September 19, 2008 Overview Background Architecture description languages Embedded

More information

Multiprocessor scheduling, part 1 -ChallengesPontus Ekberg

Multiprocessor scheduling, part 1 -ChallengesPontus Ekberg Multiprocessor scheduling, part 1 -ChallengesPontus Ekberg 2017-10-03 What is a multiprocessor? Simplest answer: A machine with >1 processors! In scheduling theory, we include multicores in this defnition

More information

Real-Time Internet of Things

Real-Time Internet of Things Real-Time Internet of Things Chenyang Lu Cyber-Physical Systems Laboratory h7p://www.cse.wustl.edu/~lu/ Internet of Things Ø Convergence of q Miniaturized devices: integrate processor, sensors and radios.

More information

Performance Evaluation of Network Processor Architectures: Combining Simulation with Analytical Estimation

Performance Evaluation of Network Processor Architectures: Combining Simulation with Analytical Estimation Performance Evaluation of Network Processor Architectures: Combining Simulation with Analytical Estimation Samarjit Chakraborty, Simon Künzli, Lothar Thiele Computer Engineering and Networks Laboratory

More information

Real-Time Operating Systems Design and Implementation. LS 12, TU Dortmund

Real-Time Operating Systems Design and Implementation. LS 12, TU Dortmund Real-Time Operating Systems Design and Implementation (slides are based on Prof. Dr. Jian-Jia Chen) Anas Toma, Jian-Jia Chen LS 12, TU Dortmund October 19, 2017 Anas Toma, Jian-Jia Chen (LS 12, TU Dortmund)

More information

Hardware-Software Codesign

Hardware-Software Codesign Hardware-Software Codesign 8. Performance Estimation Lothar Thiele 8-1 System Design specification system synthesis estimation -compilation intellectual prop. code instruction set HW-synthesis intellectual

More information

Dynamic Frequency Scaling Schemes for Heterogeneous Clusters under Quality of Service Requirements

Dynamic Frequency Scaling Schemes for Heterogeneous Clusters under Quality of Service Requirements JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 28, 1073-1090 (2012) Dynamic Frequency Scaling Schemes for Heterogeneous Clusters under Quality of Service Requirements JIAN-JIA CHEN 1, KAI HUANG 2 AND LOTHAR

More information

Chapter 5: CPU Scheduling

Chapter 5: CPU Scheduling COP 4610: Introduction to Operating Systems (Fall 2016) Chapter 5: CPU Scheduling Zhi Wang Florida State University Contents Basic concepts Scheduling criteria Scheduling algorithms Thread scheduling Multiple-processor

More information

On mapping to multi/manycores

On mapping to multi/manycores On mapping to multi/manycores Jeronimo Castrillon Chair for Compiler Construction (CCC) TU Dresden, Germany MULTIPROG HiPEAC Conference Stockholm, 24.01.2017 Mapping for dataflow programming models MEM

More information

Distributed Operation Layer Integrated SW Design Flow for Mapping Streaming Applications to MPSoC

Distributed Operation Layer Integrated SW Design Flow for Mapping Streaming Applications to MPSoC Distributed Operation Layer Integrated SW Design Flow for Mapping Streaming Applications to MPSoC Iuliana Bacivarov, Wolfgang Haid, Kai Huang, and Lothar Thiele ETH Zürich MPSoCs are Hard to program (

More information

Uniprocessor Scheduling. Basic Concepts Scheduling Criteria Scheduling Algorithms. Three level scheduling

Uniprocessor Scheduling. Basic Concepts Scheduling Criteria Scheduling Algorithms. Three level scheduling Uniprocessor Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms Three level scheduling 2 1 Types of Scheduling 3 Long- and Medium-Term Schedulers Long-term scheduler Determines which programs

More information

Combining Optimistic and Pessimistic DVS Scheduling: An Adaptive Scheme and Analysis

Combining Optimistic and Pessimistic DVS Scheduling: An Adaptive Scheme and Analysis Combining Optimistic and Pessimistic DVS Scheduling: An Adaptive Scheme and Analysis Simon Perathoner, Kai Lampka, Nikolay Stoimenov, Lothar Thiele Computer Engineering and Networks Laboratory (TIK) Swiss

More information

Precise and Efficient FIFO-Replacement Analysis Based on Static Phase Detection

Precise and Efficient FIFO-Replacement Analysis Based on Static Phase Detection Precise and Efficient FIFO-Replacement Analysis Based on Static Phase Detection Daniel Grund 1 Jan Reineke 2 1 Saarland University, Saarbrücken, Germany 2 University of California, Berkeley, USA Euromicro

More information

Lecture: SMT, Cache Hierarchies. Topics: memory dependence wrap-up, SMT processors, cache access basics and innovations (Sections B.1-B.3, 2.

Lecture: SMT, Cache Hierarchies. Topics: memory dependence wrap-up, SMT processors, cache access basics and innovations (Sections B.1-B.3, 2. Lecture: SMT, Cache Hierarchies Topics: memory dependence wrap-up, SMT processors, cache access basics and innovations (Sections B.1-B.3, 2.1) 1 Problem 0 Consider the following LSQ and when operands are

More information

Kaisen Lin and Michael Conley

Kaisen Lin and Michael Conley Kaisen Lin and Michael Conley Simultaneous Multithreading Instructions from multiple threads run simultaneously on superscalar processor More instruction fetching and register state Commercialized! DEC

More information

TDT Coarse-Grained Multithreading. Review on ILP. Multi-threaded execution. Contents. Fine-Grained Multithreading

TDT Coarse-Grained Multithreading. Review on ILP. Multi-threaded execution. Contents. Fine-Grained Multithreading Review on ILP TDT 4260 Chap 5 TLP & Hierarchy What is ILP? Let the compiler find the ILP Advantages? Disadvantages? Let the HW find the ILP Advantages? Disadvantages? Contents Multi-threading Chap 3.5

More information

Lecture: SMT, Cache Hierarchies. Topics: memory dependence wrap-up, SMT processors, cache access basics and innovations (Sections B.1-B.3, 2.

Lecture: SMT, Cache Hierarchies. Topics: memory dependence wrap-up, SMT processors, cache access basics and innovations (Sections B.1-B.3, 2. Lecture: SMT, Cache Hierarchies Topics: memory dependence wrap-up, SMT processors, cache access basics and innovations (Sections B.1-B.3, 2.1) 1 Problem 1 Consider the following LSQ and when operands are

More information

Programming Languages for Real-Time Systems. LS 12, TU Dortmund

Programming Languages for Real-Time Systems. LS 12, TU Dortmund Programming Languages for Real-Time Systems Prof. Dr. Jian-Jia Chen LS 12, TU Dortmund 20 June 2016 Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 1 / 41 References Slides are based on Prof. Wang Yi, Prof.

More information

Single-Path Programming on a Chip-Multiprocessor System

Single-Path Programming on a Chip-Multiprocessor System Single-Path Programming on a Chip-Multiprocessor System Martin Schoeberl, Peter Puschner, and Raimund Kirner Vienna University of Technology, Austria mschoebe@mail.tuwien.ac.at, {peter,raimund}@vmars.tuwien.ac.at

More information

A Dual-Criticality Memory Controller (DCmc): Proposal and Evaluation of a Space Case Study

A Dual-Criticality Memory Controller (DCmc): Proposal and Evaluation of a Space Case Study A Dual-Criticality Memory Controller (DCmc): Proposal and Evaluation of a Space Case Study Javier Jalle,, Eduardo Quiñones, Jaume Abella, Luca Fossati, Marco Zulianello, Francisco J. Cazorla, Barcelona

More information

IS CHIP-MULTIPROCESSING THE END OF REAL-TIME SCHEDULING? Martin Schoeberl and Peter Puschner 1

IS CHIP-MULTIPROCESSING THE END OF REAL-TIME SCHEDULING? Martin Schoeberl and Peter Puschner 1 IS CHIP-MULTIPROCESSING THE END OF REAL-TIME SCHEDULING? Martin Schoeberl and Peter Puschner 1 Abstract Chip-multiprocessing is considered the future path for performance enhancements in computer architecture.

More information

A Server-based Approach for Predictable GPU Access Control

A Server-based Approach for Predictable GPU Access Control A Server-based Approach for Predictable GPU Access Control Hyoseung Kim * Pratyush Patel Shige Wang Raj Rajkumar * University of California, Riverside Carnegie Mellon University General Motors R&D Benefits

More information

Chapter 5: Process Scheduling

Chapter 5: Process Scheduling Chapter 5: Process Scheduling Chapter 5: Process Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms Multiple-Processor Scheduling Thread Scheduling Operating Systems Examples Algorithm

More information

Buses. Maurizio Palesi. Maurizio Palesi 1

Buses. Maurizio Palesi. Maurizio Palesi 1 Buses Maurizio Palesi Maurizio Palesi 1 Introduction Buses are the simplest and most widely used interconnection networks A number of modules is connected via a single shared channel Microcontroller Microcontroller

More information

Resource Reservation & Resource Servers

Resource Reservation & Resource Servers Resource Reservation & Resource Servers Resource Reservation Application Hard real-time, Soft real-time, Others? Platform Hardware Resources: CPU cycles, memory blocks 1 Applications Hard-deadline tasks

More information

Real-Time Component Software. slide credits: H. Kopetz, P. Puschner

Real-Time Component Software. slide credits: H. Kopetz, P. Puschner Real-Time Component Software slide credits: H. Kopetz, P. Puschner Overview OS services Task Structure Task Interaction Input/Output Error Detection 2 Operating System and Middleware Application Software

More information

EECS 470. Lecture 18. Simultaneous Multithreading. Fall 2018 Jon Beaumont

EECS 470. Lecture 18. Simultaneous Multithreading. Fall 2018 Jon Beaumont Lecture 18 Simultaneous Multithreading Fall 2018 Jon Beaumont http://www.eecs.umich.edu/courses/eecs470 Slides developed in part by Profs. Falsafi, Hill, Hoe, Lipasti, Martin, Roth, Shen, Smith, Sohi,

More information

Scheduling of Parallel Real-time DAG Tasks on Multiprocessor Systems

Scheduling of Parallel Real-time DAG Tasks on Multiprocessor Systems Scheduling of Parallel Real-time DAG Tasks on Multiprocessor Systems Laurent George ESIEE Paris Journée du groupe de travail OVSTR - 23 mai 2016 Université Paris-Est, LRT Team at LIGM 1/53 CONTEXT: REAL-TIME

More information

Embedded Systems 1. Introduction

Embedded Systems 1. Introduction Embedded Systems 1. Introduction Lothar Thiele 1-1 Organization WWW: http://www.tik.ee.ethz.ch/tik/education/lectures/es/ Lecture: Lothar Thiele, thiele@ethz.ch Coordination: Rehan Ahmed, rehan.ahmed@tik.ee.ethz.ch

More information

Efficient Latency Guarantees for Mixed-criticality Networks-on-Chip

Efficient Latency Guarantees for Mixed-criticality Networks-on-Chip Platzhalter für Bild, Bild auf Titelfolie hinter das Logo einsetzen Efficient Latency Guarantees for Mixed-criticality Networks-on-Chip Sebastian Tobuschat, Rolf Ernst IDA, TU Braunschweig, Germany 18.

More information

CPU Scheduling. Operating Systems (Fall/Winter 2018) Yajin Zhou ( Zhejiang University

CPU Scheduling. Operating Systems (Fall/Winter 2018) Yajin Zhou (  Zhejiang University Operating Systems (Fall/Winter 2018) CPU Scheduling Yajin Zhou (http://yajin.org) Zhejiang University Acknowledgement: some pages are based on the slides from Zhi Wang(fsu). Review Motivation to use threads

More information

Blocking Analysis of FIFO, Unordered, and Priority-Ordered Spin Locks

Blocking Analysis of FIFO, Unordered, and Priority-Ordered Spin Locks On Spin Locks in AUTOSAR: Blocking Analysis of FIFO, Unordered, and Priority-Ordered Spin Locks Alexander Wieder and Björn Brandenburg MPI-SWS RTSS 2013 12/04/2013 Vancouver, Canada Motivation: AUTOSAR:

More information

Energy-aware Scheduling for Frame-based Tasks on Heterogeneous Multiprocessor Platforms

Energy-aware Scheduling for Frame-based Tasks on Heterogeneous Multiprocessor Platforms Energy-aware Scheduling for Frame-based Tasks on Heterogeneous Multiprocessor Platforms Dawei Li and Jie Wu Department of Computer and Information Sciences Temple University Philadelphia, USA {dawei.li,

More information

15-740/ Computer Architecture Lecture 12: Advanced Caching. Prof. Onur Mutlu Carnegie Mellon University

15-740/ Computer Architecture Lecture 12: Advanced Caching. Prof. Onur Mutlu Carnegie Mellon University 15-740/18-740 Computer Architecture Lecture 12: Advanced Caching Prof. Onur Mutlu Carnegie Mellon University Announcements Chuck Thacker (Microsoft Research) Seminar Tomorrow RARE: Rethinking Architectural

More information

250P: Computer Systems Architecture. Lecture 9: Out-of-order execution (continued) Anton Burtsev February, 2019

250P: Computer Systems Architecture. Lecture 9: Out-of-order execution (continued) Anton Burtsev February, 2019 250P: Computer Systems Architecture Lecture 9: Out-of-order execution (continued) Anton Burtsev February, 2019 The Alpha 21264 Out-of-Order Implementation Reorder Buffer (ROB) Branch prediction and instr

More information

Integrating Concurrency Control and Energy Management in Device Drivers. Chenyang Lu

Integrating Concurrency Control and Energy Management in Device Drivers. Chenyang Lu Integrating Concurrency Control and Energy Management in Device Drivers Chenyang Lu Overview Ø Concurrency Control: q Concurrency of I/O operations alone, not of threads in general q Synchronous vs. Asynchronous

More information

A Single-Path Chip-Multiprocessor System

A Single-Path Chip-Multiprocessor System A Single-Path Chip-Multiprocessor System Martin Schoeberl, Peter Puschner, and Raimund Kirner Institute of Computer Engineering Vienna University of Technology, Austria mschoebe@mail.tuwien.ac.at, {peter,raimund}@vmars.tuwien.ac.at

More information

NoCo: ILP-based Worst-Case Contention Estimation for Mesh Real-Time Manycores

NoCo: ILP-based Worst-Case Contention Estimation for Mesh Real-Time Manycores www.bsc.es NoCo: ILP-based Worst-Case Contention Estimation for Mesh Real-Time Manycores Jordi Cardona 1,2, Carles Hernandez 1, Enrico Mezzetti 1, Jaume Abella 1 and Francisco J.Cazorla 1,3 1 Barcelona

More information

Thanks to... Composing and synchronizing real-time components through virtual platforms in vehicular systems

Thanks to... Composing and synchronizing real-time components through virtual platforms in vehicular systems Thanks to... Composing and synchronizing real-time components through virtual platforms in vehicular systems Promotor and co-promotor: Prof. Johan J. Lukkien Dr. Reinder J. Bril Martijn van den Heuvel

More information

Supporting Distributed Shared Memory. Axel Jantsch Xiaowen Chen, Zhonghai Lu Royal Institute of Technology, Sweden September 16, 2009

Supporting Distributed Shared Memory. Axel Jantsch Xiaowen Chen, Zhonghai Lu Royal Institute of Technology, Sweden September 16, 2009 Supporting Distributed Shared Memory Axel Jantsch Xiaowen Chen, Zhonghai Lu Royal Institute of Technology, Sweden September 16, 2009 Memory content in today s SoCs 3 Elements in SoC Processing: Well understood;

More information

Performance Evaluation of Scheduling Mechanisms for Broadband Networks

Performance Evaluation of Scheduling Mechanisms for Broadband Networks Performance Evaluation of Scheduling Mechanisms for Broadband Networks Gayathri Chandrasekaran Master s Thesis Defense The University of Kansas 07.31.2003 Committee: Dr. David W. Petr (Chair) Dr. Joseph

More information

CPSC/ECE 3220 Summer 2018 Exam 2 No Electronics.

CPSC/ECE 3220 Summer 2018 Exam 2 No Electronics. CPSC/ECE 3220 Summer 2018 Exam 2 No Electronics. Name: Write one of the words or terms from the following list into the blank appearing to the left of the appropriate definition. Note that there are more

More information

Distributed Operation Layer

Distributed Operation Layer Distributed Operation Layer Iuliana Bacivarov, Wolfgang Haid, Kai Huang, and Lothar Thiele ETH Zürich Outline Distributed Operation Layer Overview Specification Application Architecture Mapping Design

More information

Managing Hybrid On-chip Scratchpad and Cache Memories for Multi-tasking Embedded Systems

Managing Hybrid On-chip Scratchpad and Cache Memories for Multi-tasking Embedded Systems Managing Hybrid On-chip Scratchpad and Cache Memories for Multi-tasking Embedded Systems Zimeng Zhou, Lei Ju, Zhiping Jia, Xin Li School of Computer Science and Technology Shandong University, China Outline

More information

Embedded Systems. 1 Introduction. Lothar Thiele. Computer Engineering and Networks Laboratory

Embedded Systems. 1 Introduction. Lothar Thiele. Computer Engineering and Networks Laboratory Embedded Systems 1 Introduction Lothar Thiele Computer Engineering and Networks Laboratory Lecture Organization 261 1 2 Organization WWW: https://www.tec.ee.ethz.ch/education/lectures/embedded systems.html

More information

Beyond ILP. Hemanth M Bharathan Balaji. Hemanth M & Bharathan Balaji

Beyond ILP. Hemanth M Bharathan Balaji. Hemanth M & Bharathan Balaji Beyond ILP Hemanth M Bharathan Balaji Multiscalar Processors Gurindar S Sohi Scott E Breach T N Vijaykumar Control Flow Graph (CFG) Each node is a basic block in graph CFG divided into a collection of

More information

THE DELAY COMPOSITION THEOREM ON PIPELINE SYSTEMS WITH NON-PREEMPTIVE PRIORITY VARYING SCHEDULING ALGORITHMS YI LU THESIS

THE DELAY COMPOSITION THEOREM ON PIPELINE SYSTEMS WITH NON-PREEMPTIVE PRIORITY VARYING SCHEDULING ALGORITHMS YI LU THESIS THE DELAY COMPOSITION THEOREM ON PIPELINE SYSTEMS WITH NON-PREEMPTIVE PRIORITY VARYING SCHEDULING ALGORITHMS BY YI LU THESIS Submitted in partial fulfillment of the requirements for the degree of Master

More information

Integrating Concurrency Control and Energy Management in Device Drivers

Integrating Concurrency Control and Energy Management in Device Drivers Integrating Concurrency Control and Energy Management in Device Drivers Kevin Klues, Vlado Handziski, Chenyang Lu, Adam Wolisz, David Culler, David Gay, and Philip Levis Overview Concurrency Control: Concurrency

More information

Parallel Computing. Parallel Computing. Hwansoo Han

Parallel Computing. Parallel Computing. Hwansoo Han Parallel Computing Parallel Computing Hwansoo Han What is Parallel Computing? Software with multiple threads Parallel vs. concurrent Parallel computing executes multiple threads at the same time on multiple

More information