Hardware Software Bring-Up Solutions for ARM v7/v8-based Designs. August 2015

Similar documents
Combining TLM & RTL Techniques:

Yafit Snir Arindam Guha Cadence Design Systems, Inc. Accelerating System level Verification of SOC Designs with MIPI Interfaces

Embedded HW/SW Co-Development

Verification Futures The next three years. February 2015 Nick Heaton, Distinguished Engineer

Software Driven Verification at SoC Level. Perspec System Verifier Overview

Optimizing Hardware/Software Development for Arm-Based Embedded Designs

Verification Futures Nick Heaton, Distinguished Engineer, Cadence Design Systems

Next Generation Verification Process for Automotive and Mobile Designs with MIPI CSI-2 SM Interface

Optimizing ARM SoC s with Carbon Performance Analysis Kits. ARM Technical Symposia, Fall 2014 Andy Ladd

Extending Fixed Subsystems at the TLM Level: Experiences from the FPGA World

Does FPGA-based prototyping really have to be this difficult?

Validation Strategies with pre-silicon platforms

Modeling Performance Use Cases with Traffic Profiles Over ARM AMBA Interfaces

Performance Optimization for an ARM Cortex-A53 System Using Software Workloads and Cycle Accurate Models. Jason Andrews

100M Gate Designs in FPGAs

Veloce2 the Enterprise Verification Platform. Simon Chen Emulation Business Development Director Mentor Graphics

Getting the Most out of Advanced ARM IP. ARM Technology Symposia November 2013

Embedded Hardware and Software

EEM870 Embedded System and Experiment Lecture 4: SoC Design Flow and Tools

Will Everything Start To Look Like An SoC?

Power: What s the problem?

The Use Of Virtual Platforms In MP-SoC Design. Eshel Haritan, VP Engineering CoWare Inc. MPSoC 2006

Building blocks for 64-bit Systems Development of System IP in ARM

SYSTEMS ON CHIP (SOC) FOR EMBEDDED APPLICATIONS

Will Everything Start To Look Like An SoC?

Designing a Multi-Processor based system with FPGAs

System-on-Chip Architecture for Mobile Applications. Sabyasachi Dey

Parallel Simulation Accelerates Embedded Software Development, Debug and Test

Effective System Design with ARM System IP

MIPI : Advanced Driver Assistance System

NS115 System Emulation Based on Cadence Palladium XP

Early Software Development Through Emulation for a Complex SoC

Introduction to gem5. Nizamudheen Ahmed Texas Instruments

Copyright 2016 Xilinx

Optimizing Cache Coherent Subsystem Architecture for Heterogeneous Multicore SoCs

Designing, developing, debugging ARM Cortex-A and Cortex-M heterogeneous multi-processor systems

Creating hybrid FPGA/virtual platform prototypes

Analyzing and Debugging Performance Issues with Advanced ARM CoreLink System IP Components

Zynq-7000 All Programmable SoC Product Overview

Building High Performance, Power Efficient Cortex and Mali systems with ARM CoreLink. Robert Kaye

Platform-based Design

Place Your Logo Here. K. Charles Janac

Beyond Hardware IP An overview of Arm development solutions

SoC FPGAs. Your User-Customizable System on Chip Altera Corporation Public

Test and Verification Solutions. ARM Based SOC Design and Verification

Copyright 2014 Xilinx

Samsung and Cadence. Byeong Min, Master of Infrastructure Design Center, System LSI Business, Samsung. The Customer. The Challenge. Business Challenge

Oberon M2M IoT Platform. JAN 2016

Contents 1 Introduction 2 Functional Verification: Challenges and Solutions 3 SystemVerilog Paradigm 4 UVM (Universal Verification Methodology)

MemCon 2014 October 15 th, Achieving End- to- E nd QoS Poonacha K ongetir a

The How To s of Metric Driven Verification to Maximize Productivity

HiSilicon STB Push-to-Talk Development Kit for Amazon AVS

Cadence/EURORPACTICE 2011/2012 Release. IC Package. Cadence Advanced Encryption Standard-64bit

S2C K7 Prodigy Logic Module Series

Using Virtual Platforms To Improve Software Verification and Validation Efficiency

Using Formalized Programming Sequences for Higher Quality Virtual Prototypes

Software Defined Modem A commercial platform for wireless handsets

Exploring System Coherency and Maximizing Performance of Mobile Memory Systems

Intel Research mote. Ralph Kling Intel Corporation Research Santa Clara, CA

Designing with ALTERA SoC Hardware

Software Quality is Directly Proportional to Simulation Speed

The Challenges of System Design. Raising Performance and Reducing Power Consumption

ESE Back End 2.0. D. Gajski, S. Abdi. (with contributions from H. Cho, D. Shin, A. Gerstlauer)

On-chip Networks Enable the Dark Silicon Advantage. Drew Wingard CTO & Co-founder Sonics, Inc.

Fujitsu SOC Fujitsu Microelectronics America, Inc.

Embedded Systems: Architecture

Lesson 6 Intel Galileo and Edison Prototype Development Platforms. Chapter-8 L06: "Internet of Things ", Raj Kamal, Publs.: McGraw-Hill Education

Designing with NXP i.mx8m SoC

SoC Platforms and CPU Cores

Combining Arm & RISC-V in Heterogeneous Designs

iw-rainbow-g3 / G3V FAQs:

ECE 448 Lecture 15. Overview of Embedded SoC Systems

Embedded Software: Its Growing Influence on the Hardware world

Strato and Strato OS. Justin Zhang Senior Applications Engineering Manager. Your new weapon for verification challenge. Nov 2017

Matrix. Get Started Guide

Wai Chee Wong Sr.Member of Technical Staff Freescale Semiconductor. Raghu Binnamangalam Sr.Technical Marketing Engineer Cadence Design Systems

Europractice Cadence release. IC Package ASSURA 4.1 ASSURA 4.1 ASSURA 4.1

2. HW/SW Co-design. Young W. Lim Thr. Young W. Lim 2. HW/SW Co-design Thr 1 / 21

Integrating CPU and GPU, The ARM Methodology. Edvard Sørgård, Senior Principal Graphics Architect, ARM Ian Rickards, Senior Product Manager, ARM

ESL design with the Agility Compiler for SystemC

System Level Design with IBM PowerPC Models

Age nda. Intel PXA27x Processor Family: An Applications Processor for Phone and PDA applications

Zynq Architecture, PS (ARM) and PL

MYD-Y6ULX Development Board

Verifying big.little using the Palladium XP. Deepak Venkatesan Murtaza Johar ARM India

ZeBu : A Unified Verification Approach for Hardware Designers and Embedded Software Developers

The Veloce Emulator and its Use for Verification and System Integration of Complex Multi-node SOC Computing System

FPGA Entering the Era of the All Programmable SoC

Software Development Using Full System Simulation with Freescale QorIQ Communications Processors

Use ZCU102 TRD to Accelerate Development of ZYNQ UltraScale+ MPSoC

Dr. Ajoy Bose. SoC Realization Building a Bridge to New Markets and Renewed Growth. Chairman, President & CEO Atrenta Inc.

Simulation Based Analysis and Debug of Heterogeneous Platforms

It's not about the core, it s about the system

Introduction to Embedded Systems

Profiling and Debugging OpenCL Applications with ARM Development Tools. October 2014

Fujitsu System Applications Support. Fujitsu Microelectronics America, Inc. 02/02

Adding C Programmability to Data Path Design

Building Low Power, Modular Systems with Silicon-Proven IP Solutions

IMPROVES. Initial Investment is Low Compared to SoC Performance and Cost Benefits

STM32MP1 Microprocessor Continuing the STM32 Success Story. Press Presentation

Transcription:

Hardware Software Bring-Up Solutions for ARM v7/v8-based Designs August 2015

SPMI USB 2.0 SLIMbus RFFE LPDDR 2 LPDDR 3 emmc 4.5 UFS SD 3.0 SD 4.0 UFS Bare Metal Software DSP Software Bare Metal Software Example ARM-based hardware/software system Software Bare Metal Applications Communications L3 Middleware Communications L2 Operating Systems (OS) Drivers Firmware / HAL Communications L1 RTOS Drivers Firmware / HAL Modem Comms Memory Card FM Receiver DRAM LPDDR NAND NAND FLASH CSI2 CSI3 Camera Interface Application Processor GPS Receiver Cellular Modem DigRF I2C Multimedia Processor Touch Screen Controller Display Driver DSI OCP 2.0 WiFi SDIO LLI OCP 3.0 Audio Interface SLIMbus Bluetooth Motion Sensors cjtag GBT Power Control USB 3.0 OTG HDMI 1.4 SoC System on PCB 2 2015 Cadence Design Systems, Inc. All rights reserved.

ARMv8-based SoC software and verification challenges Multi-core early SW bring-up and integration on ARMv8 64-bit platform Bare-metal SW use case testing to verify multi-core cache and I/O coherency, concurrency, PSO, etc Middleware Operating Systems (OS) Drivers Firmware / HAL Power profiling and analysis of realworld traffic Debugging of complex multi-core SoC SW scenarios on RTL sim/emulation platforms Integrating and verifying 100 s of IP blocks and SW from ARM, internal teams, and 3 rd parties Characterizing and analyzing SoC performance and efficiently debugging issues Verification of IP blocks on AMBA interconnect w/ adherence to cache coherent (ACE) protocol Development flow needs to support hardware/software integration and verification environments on multiple platforms 3 2015 Cadence Design Systems, Inc. All rights reserved.

Software is Key to verification Applications (Basic to Angry Birds) Middleware (Graphics, Audio) OS & Drivers (Linux, Android) Bare metal SW SoC in System May hold final tape out if bug too critical SW Development on Chip System on Chip Sub-System IP Idea to spec Time for critical bugs in System Environment to be removed RTL Becomes stable Only small gate level changes and ECO s Production Post silicon Validation Spec RTL-Design & IP Integration & Verification IP Qualification Netlist to GDSII Fab Post Si Source: Cadence, IBS 4 2015 Cadence Design Systems, Inc. All rights reserved.

ARM partnership to develop and deliver an ARMbased system verification solution 5 2015 Cadence Design Systems, Inc. All rights reserved.

Cadence Verification Solution for ARM v8 SoCs Palladium Hybrid for early SW bring-up and above-os SoC validation Middleware Operating Systems (OS) Drivers Firmware / HAL Perspec System Verifier SW-Driven use case verification Palladium Dynamic Power Analysis profiling real traffic Incisive Debug Analyzer synchronized embedded SW debug on ARM RTL CPU SoC Workbench packaged v8 IP & VIP for rapid SoC integration & verification Interconnect Workbench for SoC verification and performance analysis VIP Portfolio w/amba ACE support on Incisive Simulation, Formal, and Palladium vmanager Planning & Management Virtual System Platform Incisive Formal Incisive Simulation Palladium Acceleration & Emulation System Development Suite Rapid Prototyping Platform Debug Analyzer 6 2015 Cadence Design Systems, Inc. All rights reserved.

Early OS and software bring-up Palladium Hybrid technology ARM Fast Models A57 x 4 A53 x 2 ARMv8 Fast Models configured as per CSS requirements Timers UARTs Virtual emmc Execute software at 100MHz With standard or custom processor models Plug-and-play integration with RTL SoC-specific transactors and RTL I/F Reconfigurable Interconnect TLM Memory Smart DDR TLM / RTL Bridge CPU Sub-system RTL I/F AVIP Interrupt Manager SW Integrator Reset Manager Validate SoC + OS at 5-10MHz High-performance memory coherency Reduce SoC debug effort System messages Hardware/software debuggers Component Color Key Customer RTL RTL TLM Mem I/F ARM CSS RTL qualified and optimized on PXP AXI4 or ACE-Lite SoC Interconnect Fabric Memory Controller DDR3 Smart DDR MMP model Peripheral Fabric GPU Interrupt s UART INTC Timer System Boot Resets Palladium XP II CSI DSI Ethernet Display USB2 USB3 SATA 7 2015 Cadence Design Systems, Inc. All rights reserved.

Close the HW/SW Concurrency Gap Legend SW System HW Next Generation SW-Driven SoC Flow Continuous SW Development & Bringup Continuous System Validation Powered By Platform Hybrids Emulation + Virtual Platform + FPGA HW Development & Verification SW-Enhanced SoC Flow SW Dev On model SW Dev and Bringup On real HW design, Silicon System Validation Enabled By Virtual Platform FPGA Prototype Emulation HW Development & Verification Traditional SoC then SW Flow HW Development & Verification SW Dev and Bringup on Silicon System Validation Tapeout Silicon Samples Product Ships 8 2015 Cadence Design Systems, Inc. All rights reserved.

DDR3 DSI Virtual System Platform Use Models Supported by same VSP System Creator License Early SW Development High performance SW execution Run entire SW stack SW Driver Validation Validate SW driver/firmware Running on the RTL OS and above SW Validation Run entire SW Stack On RTL in Palladium for majority of SoC Arm Processor SRAM GIC Timer UART TLM Wrapper Fast A15 x 4 A7 x 2 Processor Model Timers UARTs VSP emmc PCIe Core PIPE3 PCIe Core SRAM Reconfigurable Interconnect Register Access Linux Shell Cmds TLM Memory Smart DDR TLM / RTL Bridge AVIP Interrupt Manager SW Integrator Reset Manager CDDC PCIe Core Drv CPU Sub-system RTL I/F Linux Kernel Unit Test Func AXI4 or ACE-Lite Inte rru UART pts INTC System Boot Reset s SMP Linux boot: CPU0 615,093,547 instructions CPU1 489,558,519 instructions Total Instructions: 1,104,652,066 In 30 seconds on a laptop computer 36 MIPS Key: UDR kernel Module RTL PCIe RC Initialization Module TLM SW Timer SoC Interconnect Fabric Peripheral Fabric Memory Controller GPU Smart DDR MMP model CSI Ethernet Display Palladium XP USB2 USB3 SATA VSP (SystemC TLM) VSP (SystemC TLM) PXP (RTL) 9 2015 Cadence Design Systems, Inc. All rights reserved.

Challenge is Modeling The Accuracy vs Speed trade-off Direct Memory Interface (DMI) is key to virtual prototype performance: turn off DMI drop 100x in performance To accurately model all memory transactions you must turn off DMI The Accuracy vs. Model Development time trade-off Example modeling projects: TLM 2.0 Loosely Timed Project: 45 models (Avg. time 2 staff weeks per model with a team of experience modelers) TLM 2.0 Approximately Timed DDR Controller project: 1 model, 90% timing accurate (1.5 staff years with a team of experienced modelers) We found a better approach: Hybrid Mixes Virtual Prototyping and RTL to get performance for software and accuracy for hardware 10 2015 Cadence Design Systems, Inc. All rights reserved.

Early, High-Performance SW Execution on Palladium Exclusive Solution combines the best characteristics of emulation and virtual platforms TLM Virtual Platform VSP - Up to 100MHz - Early Availability for SW Developers - Advanced SW Debug - Fast SW Turnaround Time Emulation Palladium XPI/II - Up to 4MHz - From early-rtl to full-soc Validation - Advanced HW Debug - Fast HW Turnaround Time Hybrid Solution with SW Integrator - Boot Complex OS at 48MHz - Speed UP SW-Driven tests 1-10X over emulation. - Early Availability for SW Developers - Advanced HW + SW Debug - Fast HW and SW Turnaround Time 11 2015 Cadence Design Systems, Inc. All rights reserved.

The Palladium/VSP Hybrid Solution Architected for SW Performance Designed to integrate HW and SW flows High-speed virtual platform Does not require changes to HW or SW stacks Asynchronous HW/SW Execution with Interrupt driven sync Virtual connections into SW Engineer s environments High-Speed Multi-Domain Memory Coherency Seamless hybrid execution for both HW and SW users Customer Design in Palladium VSP Virtual Models UART, emmc, USB Integration APIs Customer Virtual Models Virtualized CPU Sub-system CPU AVIP Bridges Smart Memory AMBA, interrupts, resets DDR Memory Controller GPU RTL Fabric IP IP IP SW Integrator VSP Execution Engines Palladium Proven Methodology, Unique Expertise Cross-platform and design integration expertise Exclusive hybrid methodology delivers performance and repeatability Proven during successful application to SW-rich SoCs 12 2015 Cadence Design Systems, Inc. All rights reserved.

A Different Hybrid - Palladium Hybrid Example With virtual platform + Palladium turn-around time and debug Run unmodified, Complex SW such as Android boot, OGL tests at FPGA speeds Add Smart Memory: Patented TLM / RTL coherency Fast Processor Model TLM2 Maia x 4 A57 x 2 CPU Coherent Sub-system Fabric RTL I/F I/F Memory Controller Smart DDR3 DDR MM A57 x 2 CCI400 Timers Reconfigurable Interconnect (memory mapped) TLM Smart Memory DDR Maia x 4 TLM/RTL Bridge Clocks, ACE Resets Compute Subsystem Interrupt Manager GIC400 UARTs Timers Scratch RAM Interrupt s UARTs SW Integrator RTL I/F (ARM TRM standard signals) Boot CPU Customer Design GPU Peripheral Fabric CSI DSI Ethernet Display SW Integrator emmc Reset Manager Component Color Key emmc SDIO USB2 USB3 SATA Customer Cadence RTL RTL TLM Mem I/F Add a minimal virtual model of your CPU sub-system Connect using proven Technology, methodology Use most accurate design model: RTL on Palladium 13 2015 Cadence Design Systems, Inc. All rights reserved.

Hybrid Performance with SW Integrator Compared to an All-RTL in Emulation Configuration Metric All RTL in Palladium* Hybrid** Increase Linux boot (minutes) 30 0.5 60X Android boot (minutes) 900 15 60X Windows RT boot (min.) 1800 30 60X 512x512 2D test (min)*** 30 2 15X # Emulation gates used 70 Million 40 Million 0.6X * 70 million gate application processor, all blocks in Palladium ** Virtualized CPU sub-system with register model of L1 & L2 caches. All other SoC blocks in Palladium. *** Includes Linux boot, data preparation, image processing by HW engine and result checking. 1.3 million memory transactions. All boot numbers are full production images. Linux includes all drivers. Android and Win RT with SW rendering Target Application - Large, compute intensive SoCs Target Users: - HW-Dependent SW engineers, - System validation engineers Accuracy (see notes for details) - SW: Delivers programmers-view accuracy - HW: Full accuracy except for timing between virtual CPU and SoC fabric - Memory: in fast mode, memory transactions are performed back-door. Thus, hybrid models not recommended for power or performance estimation 14 2015 Cadence Design Systems, Inc. All rights reserved.

Customer s Success with Hybrid Global & Local customers

ARM use of Palladium XP platform for HW/SW co-design as part of the Cadence/ARM-based SoC solution Results: 50X faster OS boot-up on ARM Mali -T760 GPU 10X speed-up of overall hardware-software testing Shortened turnaround-time from hours to minutes Source: ARM TechCon 2014 Technology: Palladium XP Hybrid ARM Fast Models Early hardware-software co-development is critical in the design process for advanced, highly integrated projects. By using Cadence Palladium Hybrid technology to combine ARM Mali-T760 emulation with ARM Fast Models, we reduced the OS boot-up time, allowing us to run more extensive system-level software workloads and improve product quality. - Hobson Bullman, general manager, development solutions group, ARM 16 2015 Cadence Design Systems, Inc. All rights reserved.

17 2015 Cadence Design Systems, Inc. All rights reserved.

Palladium/VSP Hybrid with SW Integrator Enable High-Performance execution of SoC SW with RTL Suitable for Bare Metal SW, OS, device drivers, test applications on OS Full memory coherency Boot Linux 30 seconds, run data-intensive SW at near FPGA speeds Increase System Verification Productivity SW run-time reduced by 4-60X compared to standalone emulation Run in simulation, emulation or hot-swap between the two Reduce multiple debug iterations between SW and HW engineers Reuse environments between SW development, HW Verification and system validation Choose the right platform for your requirements Understand your speed vs. accuracy trade-off Required to execute workloads Understand modeling effort trade-offs Utilize hybrid approaches 18 2015 Cadence Design Systems, Inc. All rights reserved.