Extending the Power of FPGAs

Similar documents
Extending the Power of FPGAs to Software Developers:

Optimizing HW/SW Partition of a Complex Embedded Systems. Simon George November 2015.

Design and Verification of FPGA Applications

Integrated Workflow to Implement Embedded Software and FPGA Designs on the Xilinx Zynq Platform Puneet Kumar Senior Team Lead - SPC

Vivado HLx Design Entry. June 2016

All Programmable: from Silicon to System

Adaptable Intelligence The Next Computing Era

SoC Systeme ultra-schnell entwickeln mit Vivado und Visual System Integrator

Designing and Prototyping Digital Systems on SoC FPGA The MathWorks, Inc. 1

Agenda. Introduction FPGA DSP platforms Design challenges New programming models for FPGAs

SoC Systeme ultra-schnell entwickeln mit Vivado und Visual System Integrator

High Performance Memory in FPGAs

NEW FPGA DESIGN AND VERIFICATION TECHNIQUES MICHAL HUSEJKO IT-PES-ES

Simplify System Complexity

Cover TBD. intel Quartus prime Design software

Cover TBD. intel Quartus prime Design software

FPGA Entering the Era of the All Programmable SoC

Design and Verification of FPGA and ASIC Applications Graham Reith MathWorks

Accelerating FPGA/ASIC Design and Verification

Simulation, prototyping and verification of standards-based wireless communications

Hardware-Software Co-Design and Prototyping on SoC FPGAs Puneet Kumar Prateek Sikka Application Engineering Team

Beyond Moore. Beyond Programmable Logic.

MATLAB/Simulink 기반의프로그래머블 SoC 설계및검증

Simplify System Complexity

Design AXI Master IP using Vivado HLS tool

Copyright 2014 Xilinx

Model-Based Design for effective HW/SW Co-Design Alexander Schreiber Senior Application Engineer MathWorks, Germany

Hardware and Software Co-Design for Motor Control Applications

Veloce2 the Enterprise Verification Platform. Simon Chen Emulation Business Development Director Mentor Graphics

Tips for Making Video IP Daniel E. Michek. Copyright 2015 Xilinx.

Shortest path to the lab. Real-world verification. Probes provide observability

Simplifying FPGA Design for SDR with a Network on Chip Architecture

FPGA design with National Instuments

Don t Think You Need an FPGA? Think Again!

Zynq-7000 All Programmable SoC Product Overview

Software Defined Modem A commercial platform for wireless handsets

Key technologies for many core architectures

FPGA 加速机器学习应用. 罗霖 2017 年 6 月 20 日

借助 SDSoC 快速開發複雜的嵌入式應用

Extending Fixed Subsystems at the TLM Level: Experiences from the FPGA World

9 REASONS WHY THE VIVADO DESIGN SUITE ACCELERATES DESIGN PRODUCTIVITY

MPSoC Design Space Exploration Framework

Adaptable Computing The Future of FPGA Acceleration. Dan Gibbons, VP Software Development June 6, 2018

A So%ware Developer's Journey into a Deeply Heterogeneous World. Tomas Evensen, CTO Embedded So%ware, Xilinx

Reducing the cost of FPGA/ASIC Verification with MATLAB and Simulink

CMPE 415 Programmable Logic Devices Introduction

Yafit Snir Arindam Guha Cadence Design Systems, Inc. Accelerating System level Verification of SOC Designs with MIPI Interfaces

10GBase-R PCS/PMA Controller Core

Does FPGA-based prototyping really have to be this difficult?

Protecting Embedded Systems from Zero-Day Attacks

Thomas Lin, Naif Tarafdar, Byungchul Park, Paul Chow, and Alberto Leon-Garcia

Cost-Optimized Backgrounder

Graph-Based Verification in a UVM Environment

Hardware Implementation and Verification by Model-Based Design Workflow - Communication Models to FPGA-based Radio

Embedded Systems. 7. System Components

Verification Futures The next three years. February 2015 Nick Heaton, Distinguished Engineer

Massively Parallel Processor Breadboarding (MPPB)

Design Once with Design Compiler FPGA

SDSoC: Session 1

High-Level Synthesis: Accelerating Alignment Algorithm using SDSoC

A Closer Look at the Epiphany IV 28nm 64 core Coprocessor. Andreas Olofsson PEGPUM 2013

Versal: AI Engine & Programming Environment

FPGA system development What you need to think about. Frédéric Leens, CEO

Burrows-Wheeler Short Read Aligner on AWS EC2 F1 Instances

Validation Strategies with pre-silicon platforms

Optimizing ARM SoC s with Carbon Performance Analysis Kits. ARM Technical Symposia, Fall 2014 Andy Ladd

Design of Transport Triggered Architecture Processor for Discrete Cosine Transform

EEM870 Embedded System and Experiment Lecture 4: SoC Design Flow and Tools

100M Gate Designs in FPGAs

System-on-Chip Architecture for Mobile Applications. Sabyasachi Dey

Navigating the RTL to System Continuum

RFNoC : RF Network on Chip Martin Braun, Jonathon Pendlum GNU Radio Conference 2015

High Performance Embedded Applications. Raja Pillai Applications Engineering Specialist

Vivado Design Suite. By: Tom Feist

Introduction to Embedded System Design using Zynq

Apps with Hardware Enabling Run-time Architectural Customization in Smart Phones

FPGA for Software Engineers

Digital Systems Design. System on a Programmable Chip

SDACCEL DEVELOPMENT ENVIRONMENT. The Xilinx SDAccel Development Environment. Bringing The Best Performance/Watt to the Data Center

RiceNIC. A Reconfigurable Network Interface for Experimental Research and Education. Jeffrey Shafer Scott Rixner

Hardware-Software Codesign. 1. Introduction

Extending Model-Based Design for HW/SW Design and Verification in MPSoCs Jim Tung MathWorks Fellow

Did I Just Do That on a Bunch of FPGAs?

Introducing the FPGA-Based Prototyping Methodology Manual (FPMM) Best Practices in Design-for-Prototyping

ECE 5775 (Fall 17) High-Level Digital Design Automation. Hardware-Software Co-Design

OCP Engineering Workshop - Telco

System Level Design with IBM PowerPC Models

Zhang Tianfei. Rosen Xu

SUSE Linux Entreprise Server for ARM

An Architectural Framework for Accelerating Dynamic Parallel Algorithms on Reconfigurable Hardware

Motor Control: Model-Based Design from Concept to Implementation on heterogeneous SoC FPGAs Alexander Schreiber, MathWorks

FCUDA-SoC: Platform Integration for Field-Programmable SoC with the CUDAto-FPGA

Hardware and Software Co-Design for Motor Control Applications

Revolutioni W zi h Wn e hgn e n F a Mi i s liu lsir u e ro e Cri I ti s Ic N al o t V A e n ri n O fi p c ti a o ti n oo

Early Models in Silicon with SystemC synthesis

The Use Of Virtual Platforms In MP-SoC Design. Eshel Haritan, VP Engineering CoWare Inc. MPSoC 2006

Altera SDK for OpenCL

Targeting Motor Control Algorithms to System-on-Chip Devices

Park Sung Chul. AE MentorGraphics Korea

Designing a Multi-Processor based system with FPGAs

Transcription:

Extending the Power of FPGAs The Journey has Begun Salil Raje Xilinx Corporate Vice President Software and IP Products Development

Agenda The Evolution of FPGAs and FPGA Programming IP-Centric Design with High Level Languages Software Defined Systems

The Evolution of FPGAs and FPGA Programming

The Evolution of Programmable Devices Logic Cells 1M 3D ICs 10K Programmable SoCs FPGAs 100 PLDs 1985 1995 2005 2015 2025

The Progression of FPGA Design Methodology Logic Software Defined Cells 1M IP-Centric with High-level Languages 10K Schematics RTL Programmable SoCs 3D ICs FPGAs 100 PLDs 1985 1995 2005 2015 2025

The Shift in Developer Personas Application Developer Logic Software Defined Cells 1M Hardware Designer Algorithm Developer Embedded SW Dev IP-Centric with High-level Languages 10K Hardware Designer Schematics RTL Programmable SoCs 3D ICs FPGAs 100 PLDs 1985 1995 2005 2015 2025

IP-Centric Design with High Level Languages IP-Centric Design with High Level Languages

Step 1: Leverage Hard and Soft IP + Embedded Processors Example of Hard IP: Zynq MPSOC Examples of Complex Soft IP AXI-MM AXI-Lite AXI-MM interconnect AXI-Lite interconnect AXI-MM AXI-MM AXI-Lite VDMA Deinterlacer V Scaler H Scaler CSC 422-444 420-422 420-422 Letterboxing AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S AXI4-S router 10x10 AXI4-S OTN Subsystem Video Subsystem HMC Controller Digital Pre- Distortion SmartConnect

Step 2: Develop New IP blocks in C/C++ Algorithmic Specification Micro-architecture Exploration RTL Implementation FPGA Integration Create IP from C/C++/System C algorithm specification Abstract algorithm verification 10,000x faster than RTL sim Traditional FPGA design experience not required

Step 3: Use Automated IP Assembly = IP Assembly Example: Zynq Processor Subsystem + Video Subsystem + 6 IP Blocks 4700 lines of VHDL (top-level connectivity only) Video Processing IP Subsystem

High Level Design Case Study: GainSpeed Venture-backed start-up Products for cable operators to: Meet skyrocketing capacity requirements of streaming video Cost-effectively migrate networks to a software-driven, all-ip architecture Need to be 10x better and 10x cheaper than much larger incumbents Have a much smaller team and need to work smarter 11

Previous Approach to Design 100K+ lines of RTL RTL RTL (VHDL) (VHDL) RTL (VHDL) Test Bench Test Test Bench Bench C) C) (System C) Testbench same as driver code Model Sim Minimal Test Cases Synthesis P&R System Debug (Chipscope) Exhaustive Corner Cases Used Virtex-6 240Ts, targeting 200+ MHz Ran P&R on 100 servers Spent 20% of time designing and 80% making it work Took a team of 10 engineers working for 2 years 12

Current Design Methodology : HLS + IPI Low 1000s lines of C code RTL IP RTL Blocks (VHDL) (VHDL) (C code)) Test Bench Test Test Bench Bench (System C) C) (C code) C Compiler Exhaustive Test Cases HLS IPI Synthesis Kintex 480T + off-the-shelf parts Used HLS to build 80% of the IP Blocks DSP functions, closed-loop timing recovery, DMA engines, etc Fast functional simulation in C P&R Much better coverage achieved earlier Team of 2 people working for 6 months System Debug (Chipscope) System-level Debug 13

Automated IP Assembly Eliminated grunt-work in wiring IP 14

Overall Project Results Elapsed time from project start to running system in lab: 6 months Total number of IP blocks integrated: 30+ Leveraged key IP cores: SRIO, 10G Ethernet MAC, MIG controller, FIR Compiler, Reed-Solomon Design running at 368 MHz in Kintex-7 Enabled co-debug with software developers 15

The Era of Software Defined Systems

Why FPGAs for Software Defined Systems? The Era of Virtualization Reconfigurable computing, storage and networking in the cloud The Thirst for Acceleration Heterogeneous computing Compute-intensive algorithms DNA sequencing Search engines Video processing Encryption/Decryption Packet routing FPGAs and Programmable SoCs: Power-efficient Reconfigurable Massively-Parallel Compute Engines

Example of FPGAs as Accelerators Smith-Waterman DNA Sequencing Application Reference Compares Query(N) with Reference(M) genome strings Query Involves MxN Matrix Computation and Dynamic Programming Maximal parallelism along diagonals Xilinx Virtex-7 690T (reference) Intel Xeon E5-2697 12 core Ratio Virtex-7 vs Intel 12 core Intel Xeon Phi 5110P 60 core Ratio Virtex-7 vs Intel 60 Core GCUPS 7700 1975 390 3000 257 Watts 2800 13000 022 22500 012 GCUPS/Watt 275 015 1810 013 2063

SDSoc: Software Defined SoC Development Applications: Machine Vision Driver Assistance/ADAS Software-Defined Radio (SDR) Wireless Radio Surveillance UAV / Drones Full System Optimizing Compiler ARM Code Main( ) C/C++ Development System-level Profiling Mark C/C++ Functions for Acceleration GCC Connectivity HLS+ SP&R Standard Eclipse IDE Accelerator Func( ) Embedded ARM Processor Subsystem Programmable Logic

SDAccel: Software Defined Algorithm Acceleration Sample Applications: Machine Learning Bioinformatics Graph Processing Stringology Data Analytics Modelling Science Codes Signal Processing Video & Image Processing Software-Defined FPGA Acceleration

Platforms Enable Software Defined FPGA Systems Pre-defined Platform Hardware System Performance Partial Design Board Algorithms Support Reconfig Analysis & Host Software Stack

Summary HW designers: SW developers: C-based IP development + highlevel IP assembly are the next step beyond RTL Software-defined algorithm development + platforms will enable you to exploit the power of FPGAs & SoCs We re making major investments in next generation silicon and tools that will revolutionize FPGA design