Communication Oriented Design Flow

Similar documents
Address InterLeaving for Low- Cost NoCs

Modeling and Simulation of System-on. Platorms. Politecnico di Milano. Donatella Sciuto. Piazza Leonardo da Vinci 32, 20131, Milano

SoC Communication Complexity Problem

NetSpeed ORION: A New Approach to Design On-chip Interconnects. August 26 th, 2013

The Use Of Virtual Platforms In MP-SoC Design. Eshel Haritan, VP Engineering CoWare Inc. MPSoC 2006

The Challenges of System Design. Raising Performance and Reducing Power Consumption

IMPROVES. Initial Investment is Low Compared to SoC Performance and Cost Benefits

Long Term Trends for Embedded System Design

Design and Test Solutions for Networks-on-Chip. Jin-Ho Ahn Hoseo University

SystemC, OCCN and VPs. Integrated Systems Development NoC Round Table, ESTEC 17 September 2009

SoC Design Lecture 13: NoC (Network-on-Chip) Department of Computer Engineering Sharif University of Technology

FCUDA-NoC: A Scalable and Efficient Network-on-Chip Implementation for the CUDA-to-FPGA Flow

Minje Jun. Eui-Young Chung

Chapter 2 Designing Crossbar Based Systems

Platform-based SoC Design

HotChips An innovative HD video and digital image processor for low-cost digital entertainment products. Deepu Talla.

Platform-based Design

Embedded Systems: Hardware Components (part II) Todor Stefanov

Experiences and Challenges of Transaction-Level Modelling with SystemC 2.0

CHALLENGES OF TODAY'S COMPLEX SOC: PERFORMANCE VERIFICATION PANKAJ SINGH, MALATHI CHIKKANNA

Combining Arm & RISC-V in Heterogeneous Designs

A Reconfigurable Crossbar Switch with Adaptive Bandwidth Control for Networks-on

Mapping Multi-Million Gate SoCs on FPGAs: Industrial Methodology and Experience

Software Driven Verification at SoC Level. Perspec System Verifier Overview

High-Level Simulations of On-Chip Networks

ESE Back End 2.0. D. Gajski, S. Abdi. (with contributions from H. Cho, D. Shin, A. Gerstlauer)

Building High Performance, Power Efficient Cortex and Mali systems with ARM CoreLink. Robert Kaye

System-on-Chip Architecture for Mobile Applications. Sabyasachi Dey

Easy Multicore Programming using MAPS

NETWORKS on CHIP A NEW PARADIGM for SYSTEMS on CHIPS DESIGN

Will Everything Start To Look Like An SoC?

Analyzing and Debugging Performance Issues with Advanced ARM CoreLink System IP Components

Validation Strategies with pre-silicon platforms

Hardware Design and Simulation for Verification

Automatic Instrumentation of Embedded Software for High Level Hardware/Software Co-Simulation

NoC Generic Scoreboard VIP by François Cerisier and Mathieu Maisonneuve, Test and Verification Solutions

Place Your Logo Here. K. Charles Janac

Efficient use of Virtual Prototypes in HW/SW Development and Verification

The Architects View Framework: A Modeling Environment for Architectural Exploration and HW/SW Partitioning

Cadence SystemC Design and Verification. NMI FPGA Network Meeting Jan 21, 2015

Generation of RTL-code and test facilities for on-chip TDM-network

Modular SystemC. In-house Training Options. For further information contact your local Doulos Sales Office.

DESIGN A APPLICATION OF NETWORK-ON-CHIP USING 8-PORT ROUTER

Transaction Level Model Simulator for NoC-based MPSoC Platform

Strato and Strato OS. Justin Zhang Senior Applications Engineering Manager. Your new weapon for verification challenge. Nov 2017

Benefits of Network on Chip Fabrics

Mapping and Physical Planning of Networks-on-Chip Architectures with Quality-of-Service Guarantees

Experience with the NetFPGA Program

Implementing Flexible Interconnect Topologies for Machine Learning Acceleration

Next Generation Verification Process for Automotive and Mobile Designs with MIPI CSI-2 SM Interface

COMMUNICATION AND I/O ARCHITECTURES FOR HIGHLY INTEGRATED MPSoC PLATFORMS OUTLINE

Will Everything Start To Look Like An SoC?

System Level Design with IBM PowerPC Models

Transaction level modeling of SoC with SystemC 2.0

Introduction to System-on-Chip

Introduction to the Qsys System Integration Tool

Growth outside Cell Phone Applications

Virtual PLATFORMS for complex IP within system context

SCope: Efficient HdS simulation for MpSoC with NoC

An Efficient AXI Read and Write Channel for Memory Interface in System-on-Chip

ECE 111 ECE 111. Advanced Digital Design. Advanced Digital Design Winter, Sujit Dey. Sujit Dey. ECE Department UC San Diego

Design methodology for multi processor systems design on regular platforms

OCCN: A Network-On-Chip Modeling and Simulation Framework. M.Coppola, S.Curaba, M.Grammatikakis, R.Locatelli, G.Maruccia, F.Papariello, L.

Multi-core microcontroller design with Cortex-M processors and CoreSight SoC

Future Gigascale MCSoCs Applications: Computation & Communication Orthogonalization

Verification Futures Nick Heaton, Distinguished Engineer, Cadence Design Systems

An Application-Specific Design Methodology for STbus Crossbar Generation

NoCIC: A Spice-based Interconnect Planning Tool Emphasizing Aggressive On-Chip Interconnect Circuit Methods

Yafit Snir Arindam Guha Cadence Design Systems, Inc. Accelerating System level Verification of SOC Designs with MIPI Interfaces

Lecture 2: Topology - I

A Thermal-aware Application specific Routing Algorithm for Network-on-chip Design

SoC Design. Prof. Dr. Christophe Bobda Institut für Informatik Lehrstuhl für Technische Informatik

Embedded HW/SW Co-Development

SYSTEMS ON CHIP (SOC) FOR EMBEDDED APPLICATIONS

DESIGN AND IMPLEMENTATION ARCHITECTURE FOR RELIABLE ROUTER RKT SWITCH IN NOC

Simplifying FPGA Design for SDR with a Network on Chip Architecture

A Methodology for NoC

4. Networks. in parallel computers. Advances in Computer Architecture

Fault Tolerant and Secure Architectures for On Chip Networks With Emerging Interconnect Technologies. Mohsin Y Ahmed Conlan Wesson

ARM Processors for Embedded Applications

FPGA based Design of Low Power Reconfigurable Router for Network on Chip (NoC)

Jump-Start Software-Driven Hardware Verification with a Verification Framework

Esto. An Eclipse-Based Embedded S/W Development Tool. Heung-Nam Kim Director of Embedded S/W Technology Center

CCNoC: Specializing On-Chip Interconnects for Energy Efficiency in Cache-Coherent Servers

The CoreConnect Bus Architecture

Spidergon STNoC: The technology that adds value to your System. Marcello Coppola

Achieving UFS Host Throughput For System Performance

Chapter 6 Storage and Other I/O Topics

Fujitsu System Applications Support. Fujitsu Microelectronics America, Inc. 02/02

On-Chip Communications

A Closer Look at the Epiphany IV 28nm 64 core Coprocessor. Andreas Olofsson PEGPUM 2013

Next Generation Enterprise Solutions from ARM

Conquering Memory Bandwidth Challenges in High-Performance SoCs

SONA: An On-Chip Network for Scalable Interconnection of AMBA-Based IPs*

BeiHang Short Course, Part 5: Pandora Smart IP Generators

Veloce2 the Enterprise Verification Platform. Simon Chen Emulation Business Development Director Mentor Graphics

S2C K7 Prodigy Logic Module Series

Software Defined Modem A commercial platform for wireless handsets

Multimedia in Mobile Phones. Architectures and Trends Lund

A 1-GHz Configurable Processor Core MeP-h1

Transcription:

ARTIST2 http://www.artist-embedded.org/fp6/ ARTIST Workshop at DATE 06 W4: Design Issues in Distributed, Communication-Centric Systems Communication Oriented Design Flow Marcello Coppola Head of AST Grenoble Lab STMicroelectronics

Outline MPSOC trends NoC Introduction STBus Genkit and STNoC extensions TLM Modeling OCCN Modeling Conclusion

SoC challanges Embedded SW : doubles every 10 months complexity Nielsen s law : bandwidth doubles every 24 months 2000 2010 time Moore s law : IC complexity doubles every 18 months

Multimedia Chip today: HW view Video Encoder ST220 DV Decoder MPEG Video Decoder FDMA STmxxxx block diagram EMI COMMs LLI USB H/D Central CPU: SH4 STBus Interconnect L M I Audio Encoder Audio Decoder Display Unit FEI CPXM HDDI ST220 ST220

Multimedia Chip today: SW view Unique Applications and Features App1 App2 App3 Central CPU: SH4 Video Encoder ST220 Audio Encoder ST220 DV Decoder Audio Decoder ST220 MPEG Chassis Control Video Decoder FDMA ST API OS EMI STBus ToolKit Interconnect USER INTERFACE STmxxxx block diagram Audio video switch TUNER TxT SWITCH Install EPG DDEC Channel select i²c RESIDENT APPLICATIONS USB COMMs LLI H/D Services Drivers FEI Display Unit Hardware Layer Silicon CPXM SOC IR TeleWeb Grafix KBD AUDIO Data Server OSD 3 rd Party Middleware SDIN User input VTG C.A VIDEO DSM -CC H/V-SRC JVM SERVICES GAMMA DRIVERS HDDI PLATFORM L M I

Multi Processor SoC issues Scalability is limited by: Complexity Programming models limitations Inadequacy of on-chip communication Power consumption, both dynamic and static Technology Costs issues SW complexity and verification HW verification time and costs

SoC Interconnect issues Complexity Interconnect must scale to manage increased number of IPs and interfaces in the system Interconnect must support QoS for reduced interference of applications at the interconnect level leading to reduced effort for performance verification Interconnect technology must support ease of generation Physical integration Interconnect must deal with increased timing closure issues Interconnect must reduce wire congestion and wire cost Bandwidth Interconnect must support operation at high clock frequency Low Power Interconnect must support low power operation

Interconnect: Global wire delay issues Global w/o repeaters Global with repeaters Local (scaled) From ITRS map on interconnects Gate delay Global wires delay increase! (Logic and local wires delay downscale)

Interconnect: area issues Stbus based,more than 40 bus targets/initiators Large interconnect area Cross-talks problems Clock skew DACs Clocks Display Interface Sub- Sub- System System Video LMI Encoder On-Chip Network up4 DVDecoder up1 EMI up3 up2 Blitter Comms MPEG2

Network on Chip NoC is a consolidated research topic (first explicit literature contributions in year 2000) Convergence of multiple disciplines: NoC is a synthesis (On-chip communication, Multiprocessing, Networking, Parallel computing) NoC is a layered approach for on-chip communication design proposed in literature to cope with issues of current SoC architectures

NoC Paradigm The NoC paradigm is one enabling the integration of a large number of computational logic, and storage blocks in a SoC The adoption and deployment of NoCs face important issues relating to design and test methodologies and automation tools. Effective on-chip implementation of NoC paradigms requires developing and deploying a whole new set of infrastructure IPs and supporting tools and methodologies. NoC is a communication-centric approach

STNoC : flexible packet based Performance: emulate crossbar but at lower cost and higher frequency Technology issue: reduce wires, split long connections Productivity: QoS/methodology, real Fast prototyping, easy backend Reuse: modularity for silicon open platform Configurability/flexibility/scalability

STNoC : architecture Octagon and STBus as a starting point Fixed and regular topology Granularity of 2 nodes Routing algorithm: easy to implement deterministic based on packet source and destination address (no stored table) diameter = ceil(n/4) Wormhole switching technique Output buffers Use of Virtual Channels and smart selection to avoid deadlock

STNoC target: to provide on-chip services for a wealth of systems Computer peripherals Data storage Webcam Printers and imaging Monitor Digital consumer DVD Digital TVs Digital cameras MP3 players Set-top boxes Automotive Car radio Engine/body/safety Car multimedia Communications Wireless handset Wireless infrastructure Access Networking Smartcards/industrial Telephone Banking User ID Security

STM Communication centric Flow Diagram in today SoC System Evaluation Design Entry Size & configure TLM model lib System Simulat. Implementation RTL model lib Config & Size Model generat. Simulat. & Analysis TLM CA model lib Design Complet. Back End Verificat. Synthesis RTL generat.

STM GenKit(*) : Why? To have an STBus & STNoC development platform able to exploit and maximize the main STBus & STNoC features: Flexibility & Configurability Extensibility Fast Integration (*) For more info contact Carlo Pistritto ( carlo.pistritto@st.com)

STM GenKit GUI: Introduction The "GenKit" GUI is the entry point for STBus & STNoC interconnects generation. It allows: Creating projects Instantiating STBus & STNoC interconnects, components, clocks and connecting them SystemC TLM interconnects generation & simulation SystemC BCA interconnects generation, simulation & analysis Power consumption analysis Area estimation analysis RTL scripts for coreassembler generation

GenKit Snapshot for STNoC

STNoC Platform Builder within the Genkit System Evaluation Design Entry Size & configure TLM model lib System Simulat. Implementation RTL model lib Config & Size Model generat. Simulat. & Analysis TLM CA model lib Design Complet. Back End Verificat. Synthesis RTL generat.

STNoC Platform Building in Detail (1/2) Configuration file Platform Building IPTG SystemC Model Next configuration GENERATOR XML parser + validator STNoC based platform RUN DB results Results Analysis VCD trace VCD trace TXT reports TXT reports SysProbe Checking Post elaboration scripts

Platform building in detail Standard Component Library composed by TLMStBus, IPTG, STBusGenKit, TLMInfra using GNU Autotools to be faster and more portable, removing where possible the abuse of generic programming (e.g. using template to propagate constants etc.) to be more flexible STNoC SystemC/OCCN model library Standard configuration interface through XML descriptions using Libxml2 Automatic system binding XML driven

STNoC Platform Building in Detail (2/2) The platform is generate at run-time according to the XML description. It s composed by: Traffic Generators (IPTG) STBus Genkit components: Ideal Targets STBus Nodes STBus Buffers STNoC components: Network interfaces (STBus, AXI) Concentrators Routers TLM/BCA wrappers Each of these components can be configured at run-time All the components are shared libraries linked at dynamically, so it s possible to change one of these without recompiling the entire system

IP Traffic Generator (IPTG) (1/2) IPTG is a block developed in SystemC that read a traffic characterization file as input and generate the corresponding traffic as output on a communication structure. IPTG is instantiated for a specific IP to generate the IP traffic on a platform that may contain components of three different abstraction levels: TLM Transaction Level Modelling BCA Bus Cycle Accurate RTL Register Transfer Level

IP Traffic Generator (IPTG) (2/2) IPTG_NAME IPcore1 $IPTG_COM TLM_STBUS $IPTG_MODEL STBUS_PLUG $IPTG_CHK_SIZE 64 $IPTG_MAX_OPC ST32 $IPTG_MIN_OPC ST32 # $IPTG_FIFO_SIZE 256 $IPTG_BEHAVIOUR_SEQ {1,9,(2,8*32,6)*1} #------------------- 1 -------------------------------------- #Description: do nothing while the system being initialized #------------------------------------------------------------ $BEHAVIOUR_NAME WAIT_RESET $BEHAVIOUR_SEQ NOP 7000 #------------------- 8 -------------------------------------- #Description: #------------------------------------------------------------ $BEHAVIOUR_NAME WRITE_LINE_BLIT1 $BEHAVIOUR_FREQ 200 $BEHAVIOUR_DATA_SIZE 2 $BEHAVIOUR_DATA_LENGTH 1286 # 1920 $BEHAVIOUR_ADDR {(0x10630000~0x106300ff)} $BEHAVIOUR_ADDR_POLICY INCREMENTAL Example of a traffic characterization file

Result Analysis VCD Trace ======================================= IP:Initiator_at_1_ ======================================= STbus type : 3 Data size: 64 bits Simulation Start Time: 0 ps Simulation End Time: 500998000 ps Simulation duration: 500998000 ps Clock period: 4000 ps STbus Frequency: 250.00 Mhz STbus Clock cycles 125248 Window Time Frame 10 LATENCY : Name Min Max Ave Req2Gnt 0 46 0.92 Req2R_Req 25 116 38.62 Req2R_Eop 25 116 38.62 R_Req2R_Gnt 0 0 0.00 VCD trace is parsed by SysProbe in batch modality to obtain: Protocol checking Cycle accurate throughput and latency values Example of SysProbe output

Results Analysis TXT Report Textual reports of all components are elaborated by several Perl scripts in order to obtain: Bandwith statistics Arbiters statistics Performance results Total bytes transferred Power estimation other --- < Average BW in [ 0-0.000501 ] > --- Source 1: 174.946 MB/s Source 2: 31.681 MB/s Source 5: 155.146 MB/s Source 6: 26.954 MB/s Source 17: 153.293 MB/s Source 18: 29.637 MB/s Source 21: 156.104 MB/s Source 22: 30.723 MB/s Source 33: 147.481 MB/s Source 34: 27.337 MB/s Source 37: 154.443 MB/s Source 38: 30.403 MB/s TOTAL Bandwith is 1118.14770459082 Example of a script s output

Example of XML configuration file <platform> <interconnect> <topology>spidergon</topology> <node id="0"> <type> Master </type> <initiator> <config_r>custom_r</config_r> <config_w>custom_w</config_w> </initiator> </node> <node id="1"> <type> Slave </type> <target> <start_address>0x100</start_address> <end_address>0x1ff</end_address> </target> </node> <routing> <req_path>af_0</req_path> <res_path>al_0</res_path> </routing> </interconnect> <simulation> <vcd_trace>enabled</vcd_trace> </simulation> </platform>

Conclusion System level analysis and design flow provide tremendous advantages but we need tools for System level estimation (bandwidth, latency, power), Requirements analysis, Application mapping ST endeavours to develop tools providing a complete and fast path to silicon To meet design constraints With short time-to-market Keeping innovation pace

Marcello.Coppola@st.com Thank you for listening!!!