From Hardware Trace to. System Knowledge

Similar documents
A High-Level Language and Compiler to Configure the Multi-Core Debug Solution

System Level Instrumentation using the Nexus specification

Nexus Makin Multi- Core Work

Trace-Based Context-Sensitive Timing Simulation Considering Execution Path Variations

On the Use of Context Information for Precise Measurement-Based Execution Time Estimation

TRACE32. Product Overview

Reusing Cache for Real-Time Memory Address Trace Compression

Systematic Hardware Platform Selection - Introduction to Embedded Systems-

Nexus Instrumentation architectures and the new Debug Specification

Using Industry Standards to Exploit the Advantages and Resolve the Challenges of Multicore Technology

SYSTEMS ON CHIP (SOC) FOR EMBEDDED APPLICATIONS

Independent DSP Benchmarks: Methodologies and Results. Outline

Precise Continuous Non-Intrusive Measurement-Based Execution Time Estimation. Boris Dreyer, Christian Hochberger, Simon Wegener, Alexander Weiss

NEWS Communication: The Key to Success. Panel Discussions. Expert Seminars CONTENTS. Committed to Standards

Growth outside Cell Phone Applications

Managing Complex Trace Filtering and Triggering Capabilities of CoreSight. Jens Braunes pls Development Tools

On-Chip Debugging of Multicore Systems

System Performance Optimization Methodology for Infineon's 32-Bit Automotive Microcontroller Architecture

A Seamless Tool Access Architecture from ESL to End Product

Predictable hardware: The AURIX Microcontroller Family

Overview. Technology Details. D/AVE NX Preliminary Product Brief

ARM Processors for Embedded Applications

AUTOBEST: A microkernel-based system (not only) for automotive applications. Marc Bommert, Alexander Züpke, Robert Kaiser.

Cycle Accurate Binary Translation for Simulation Acceleration in Rapid Prototyping of SoCs

Real-Time Debugging Highly Integrated Embedded Wireless Devices

IDE for medical device software development. Hyun-Do Lee, Field Application Engineer

RM3 - Cortex-M4 / Cortex-M4F implementation

Optimizing Cache Coherent Subsystem Architecture for Heterogeneous Multicore SoCs

A Seamless Tool Access Architecture from ESL to End Product. Albrecht Mayer (Infineon Microcontrollers) S4D Conference Sophia Antipolis, Sept.

Debug Support, Calibration and Emulation for Multiple Processor and Powertrain Control SoCs

NEWS 2018 CONTENTS SOURCE CODE COVERAGE WORKS WITHOUT CODE INSTRUMENTATION. English Edition

Support for RISC-V. Lauterbach GmbH. Bob Kupyn Lauterbach Markus Goehrle - Lauterbach GmbH

High Performance Computing on GPUs using NVIDIA CUDA

Introduction to Embedded Systems

Exploring System Coherency and Maximizing Performance of Mobile Memory Systems

ARM CORTEX-R52. Target Audience: Engineers and technicians who develop SoCs and systems based on the ARM Cortex-R52 architecture.

On-Chip Instrumentation

Detecting Memory-Boundedness with Hardware Performance Counters

Running ARM7TDMI Processor Software on the Cortex -M3 Processor

Altix Usage and Application Programming

Error Detection by Code Coverage Analysis without Instrumenting the Code

A Study of the Speedups and Competitiveness of FPGA Soft Processor Cores using Dynamic Hardware/Software Partitioning

Design and Analysis of Time-Critical Systems Introduction

TRACE32 Glossary Terms, Abbreviations, and Definitions... 2

Profiling and Debugging Games on Mobile Platforms

RM4 - Cortex-M7 implementation

Software Quality is Directly Proportional to Simulation Speed

Hardware-assisted runtime verification in COEMS

An Introduction to the MPC57xx Nexus Aurora Interface

Performance Analysis with Vampir

It's not about the core, it s about the system

ARM Cortex-M and RTOSs Are Meant for Each Other

Building High Performance, Power Efficient Cortex and Mali systems with ARM CoreLink. Robert Kaye

BlueBox On-Chip Analyzers

Embedded Systems: Projects

CONTENTS SOURCE CODE COVERAGE WORKS WITHOUT CODE INSTRUMENTATION. 2 Code Coverage Live. Trace-based MCDC Coverage. 5 Tracing via PCI Express

Tracking the Virtual World

2-bit ARM Cortex TM -M3 based Microcontroller FM3 Family MB9A130 Series

USING A RUNTIME MEASUREMENT DEVICE WITH MEASUREMENT-BASED WCET ANALYSIS

Each Milliwatt Matters

Dr. Yassine Hariri CMC Microsystems

Embedded Systems: Projects

Programming Heterogeneous Embedded Systems for IoT

Interoperability: Advantages, Problems and Solutions

Cortex-M3/M4 Software Development

The Challenges of System Design. Raising Performance and Reducing Power Consumption

Combining Arm & RISC-V in Heterogeneous Designs

Chapter 15 ARM Architecture, Programming and Development Tools

Optimizing Emulator Utilization by Russ Klein, Program Director, Mentor Graphics

FlexRay The Hardware View

Overview of Development Tools for the ARM Cortex -A8 Processor George Milne March 2006

Static WCET Analysis: Methods and Tools

Architectural Time-predictability Factor (ATF) to Measure Architectural Time Predictability

FPGA Adaptive Software Debug and Performance Analysis

NEXUS 5001 Forum Debug Interface Standard

How Might Recently Formed System Interconnect Consortia Affect PM? Doug Voigt, SNIA TC

Test and Verification Solutions. ARM Based SOC Design and Verification

Modern Computer Architecture. Lecture 12 embedded Applications, classical DSP, automotive (Tricore)

Effective System Design with ARM System IP

EEMBC s Automotive/Industrial Microprocessor Benchmarks. June 4, 2004

Exploiting Dynamically Changing Parallelism with a Reconfigurable Array of Homogeneous Sub-cores (a.k.a. Field Programmable Core Array or FPCA)

Dr. Ishaq Unwala Rockwell Pl Phone #: (512)

Tesla GPU Computing A Revolution in High Performance Computing

Timing analysis and timing predictability

RA3 - Cortex-A15 implementation

EECS Berkeley EE249 LabVIEW Framework. Hugo A. Andrade Principal Architect LabVIEW Platform & Real Time Group

Ultra Depedable VLSI by Collaboration of Formal Verifications and Architectural Technologies

System Impact of Distributed Multicore Systems December 5th 2012

Embedded Systems. 8. Hardware Components. Lothar Thiele. Computer Engineering and Networks Laboratory

Cycle Approximate Simulation of RISC-V Processors

Embedded Systems: Architecture

MPSOC Design examples

Cortex-M Software Development

Adaptive processor architectures for detector applications

Introducing the Superscalar Version 5 ColdFire Core

Assembling and Debugging VPs of Complex Cycle Accurate Multicore Systems. July 2009

Next Generation Visual Computing

FOUR INDEPENDENT TOOLS TO MANAGE COMPLEXITY INHERENT TO DEVELOPING STATE OF THE ART SYSTEMS. DEVELOPER SPECIFIER TESTER

Multi-core microcontroller design with Cortex-M processors and CoreSight SoC

Debugging of FPGA based Prototypes - A Case Study

Transcription:

Fakultät Informatik, Institut für Technische Informatik, Professur VLSI-Entwurfssysteme, Diagnostik und Architektur From Hardware Trace to Data-intensive Hardware Trace Analysis Andreas Gajda TU Dresden, 25.04.2012 http://esoc.inf.tu-dresden.de

Outline 1 Data-intensive Hardware Trace 2 Hardware Trace Analysis 3 Trace Analysis Framework 4 Conclusion 5 References - 2 - http://esoc.inf.tu-dresden.de

Outline 1 Data-intensive Hardware Trace 2 Hardware Trace Analysis 3 Trace Analysis Framework 4 Conclusion 5 References - 3 - http://esoc.inf.tu-dresden.de

Industrial technologies From Hardware-Trace to Avionics Medical technologies Automotive Mobile Devices - 4 - http://esoc.inf.tu-dresden.de

Hardware Trace Format Img 1: OMAP 44x ARM Ltd. [1] - 5 - http://esoc.inf.tu-dresden.de

Hardware Trace Format Img 1: OMAP 44x ARM Ltd. [1] - 6 - http://esoc.inf.tu-dresden.de

Multiple Vendor Solutions ARM CoreSight TM / ETM TM Infineon Multi-Core-Debug-Solution (MCDS) e.g. TriCore TC1797ED Nexus IEEE-ISTO 5001 e.g. PowerPC Aurora Nexus Trace Img 2: CoreSight TM ARM Ltd. [1] Img 3: MCDS [3] Img 4: NEXUS IEEE-ISTO 5001 [2] - 7 - http://esoc.inf.tu-dresden.de

Packet based From Hardware-Trace to Img 5: NEXUS IEEE-ISTO 5001 [2] - 8 - http://esoc.inf.tu-dresden.de

Analysis POV Hardware Trace Content Control Flow Trace Data Flow Trace Performance Counter Run/Trace Control Program Trace IEEE-ISTO-5001 (NEXUS) Features Data Device ID Ownership Watchpoints Trace Trace X X X (X) X X X (X) X X X Run/Trace Control X Breakpoints and other Features are summarized in Run/Trace Control. - 9 - http://esoc.inf.tu-dresden.de

Hardware Trace Amount EEMBC Benchmarks [6] Cycle accurate Program and Data Trace 20M Ticks covered; 100ms Program Execution 28MiB Trace Data 315 MiB/s on 100MHz Embedded PowerPC Processor - 10 - http://esoc.inf.tu-dresden.de

Outline 1 Data-intensive Hardware Trace 2 Hardware Trace Analysis 3 Trace Analysis Framework 4 Conclusion 5 References - 11 - http://esoc.inf.tu-dresden.de

Current Hardware Trace Analysis Single-Core Call Graph / Timeline Code Coverage Counter Timeline Basic Statistics (Minimum, Maximum, Average) - 12 - http://esoc.inf.tu-dresden.de

Learning from Software Trace Analysis [5] Multi-Core Call Graph / Timeline Extended Statistics (Distribution, Coherence etc.) Counter-Arithmetic Multi-dimensional Counter Timelines / Surface Plots Machine Learning Algorithms Graph based Analysis (WCET etc.) Pattern Matching Model Verification Communication analysis Data for external Tools - 13 - http://esoc.inf.tu-dresden.de

Function-Parallel Trace-Analysis A B - 14 - http://esoc.inf.tu-dresden.de

Data-Parallel Trace-Analysis A 1 A 2 A 3-15 - http://esoc.inf.tu-dresden.de

Function Pipeline Trace-Analysis A 1 A 2 A 3-16 - http://esoc.inf.tu-dresden.de

Outline 1 Data-intensive Hardware Trace 2 Hardware Trace Analysis 3 Trace Analysis Framework 4 Conclusion 5 References - 17 - http://esoc.inf.tu-dresden.de

Objectives High Performance Analysis Near Real-Time Parallel and Scalable Vendor Independent Trace Format Caching or Storage of Intermediate and Final Results Interoperability and Independence of Analyses and Atoms Interoperability with external Tools Img 6: Trace Analysis Framework Structure - 18 - http://esoc.inf.tu-dresden.de

Framework Structure Img 6: Trace Analysis Framework Structure Analysis split into Atoms Combine Atoms Independence of Execution Universal Trace Format HW / SW Trace Analysis-Oriented Simple Compression Conversion into SW-Trace Format Intermediate and Final Results can be stored - 19 - http://esoc.inf.tu-dresden.de

Outline 1 Data-intensive Hardware Trace 2 Hardware Trace Analysis 3 Trace Analysis Framework 4 Conclusion 5 References - 20 - http://esoc.inf.tu-dresden.de

Conclusion Demands on Hardware Trace Analysis are rising. Need for high performance and scalable Trace Analysis Decomposition and Arrangement of Analyses Potential for automated Analysis Hardware Platform independent Trace Analysis Single Trace Format Defined Interfaces Analysis independent from Framework implementation Flexible Data-Management Store of Final and Intermediate Results in Trace Format Input / Output from Trace-Database Conversion of Trace-Data into other Trace-Formats - 21 - http://esoc.inf.tu-dresden.de

Future Work Finish Framework Implement Basic Analysis Implement Visualization Components Implement simple Analysis in programmable Hardware (FPGA) - 22 - http://esoc.inf.tu-dresden.de

Outline 1 Data-intensive Hardware Trace 2 Hardware Trace Analysis 3 Trace Analysis Framework 4 Conclusion 5 References - 23 - http://esoc.inf.tu-dresden.de

References (1) (Mar 2012) ARM Ltd., http://www.arm.com, [online], http://www.arm.com/images/coresight_diagram.jpg (2) (Mar 2012) NEXUS 5001 Forum, http://www.nexus5001.org, [online], http://www.nexus5001.org/sites/default/files/waffles_logo.gif (3) (Mar 2012) IPextreme, http://www.ip-extreme.com/ip/mcds.shtml, [online], http://www.ip-extreme.com/images/products/mcds_diagram3.gif (4) (Sep 2008) ARM Ltd., William Orme, Debug and Trace for Multicore SOCs, Whitepaper, http://www.arm.com/files/pdf/coresightwhitepaper.pdf (5) (Aug 2010) Andreas Gajda, Use Cases of Trace Data Analysis, Unpublished Research Report (6) (2011, Oct) The embedded microprocessor benchmark consortium. Website. [Online]. http://www.eembc.org - 24 - http://esoc.inf.tu-dresden.de