Software Defined Modem A commercial platform for wireless handsets

Similar documents
Software Defined Modems for The Internet of Things. Dr. John Haine, IP Operations Manager

CEVA-X1 Lightweight Multi-Purpose Processor for IoT

SYSTEMS ON CHIP (SOC) FOR EMBEDDED APPLICATIONS

The Use Of Virtual Platforms In MP-SoC Design. Eshel Haritan, VP Engineering CoWare Inc. MPSoC 2006

Growth outside Cell Phone Applications

Mobile Broadband Comparison. CDMA Development Group March 2008

Fujitsu System Applications Support. Fujitsu Microelectronics America, Inc. 02/02

5g for connected industries

The Nios II Family of Configurable Soft-core Processors

Designing an efficient LTE-Advanced modem architecture with ARM Cortex -R7 MPCore and CEVA XC4000 processors

Connect your IoT device: Bluetooth 5, , NB-IoT

High performance and efficient single-chip small cell base station SoC

EEM870 Embedded System and Experiment Lecture 4: SoC Design Flow and Tools

Building High Performance, Power Efficient Cortex and Mali systems with ARM CoreLink. Robert Kaye

Combining Arm & RISC-V in Heterogeneous Designs

LTE : The Future of Mobile Broadband Technology

Mobile WiMAX in the Evolving Wireless Broadband Landscape

NS115 System Emulation Based on Cadence Palladium XP

The Challenges of System Design. Raising Performance and Reducing Power Consumption

OCP Engineering Workshop - Telco

SIMULTANEOUS BASEBAND PROCESSING CONSIDERATIONS IN A MULTI-MODE HANDSET USING THE SANDBLASTER BASEBAND PROCESSOR

Hardware Software Bring-Up Solutions for ARM v7/v8-based Designs. August 2015

Connect Your IoT Device: Bluetooth 5, , NB-IoT

Versal: AI Engine & Programming Environment

Extending the Power of FPGAs to Software Developers:

Chapter 5. Introduction ARM Cortex series

HW & SW co-verification of baseband HSPA Processor with Seamless PSP

Copyright 2016 Xilinx

Nutaq. PicoSDR FPGA-based, MIMO-Enabled, tunable RF SDR solutions PRODUCT SHEET I MONTREAL I NEW YORK I. nutaq. .com QUEBEC

Leveraging Mobile GPUs for Flexible High-speed Wireless Communication

Nutaq. PicoSDR FPGA-based, MIMO-Enabled, tunable RF SDR solutions PRODUCT SHEET I MONTREAL I NEW YORK I. nutaq. .com QUEBEC

Effective System Design with ARM System IP

Infrastructure Test System

Designing, developing, debugging ARM Cortex-A and Cortex-M heterogeneous multi-processor systems

Maximize energy efficiency in a normally-off system using NVRAM. Stéphane Gros Yeter Akgul

Wireless Networking: An Introduction. Hongwei Zhang

A Closer Look at the Epiphany IV 28nm 64 core Coprocessor. Andreas Olofsson PEGPUM 2013

Fujitsu SOC Fujitsu Microelectronics America, Inc.

Copyright 2014 Xilinx

Software-Defined Radios: Mobile Architectures Come Full Circle

Snapdragon S4 System on Chip

Transcede. Jim Johnston CTO. Solving 4G Challenges for Pico, Micro and Macrocell Platforms

System-on-Chip Architecture for Mobile Applications. Sabyasachi Dey

Pilot RCU 2.0 Release Note V10.10

Imagining Tomorrow's Wireless Landscape

ARM Processors for Embedded Applications

Maximizing heterogeneous system performance with ARM interconnect and CCIX

Datasheet TeleController TC302

Multi-core microcontroller design with Cortex-M processors and CoreSight SoC

Reconfigurable VLSI Communication Processor Architectures

LEON4: Fourth Generation of the LEON Processor

And MTN said: Let there be SPEED!...

ESE Back End 2.0. D. Gajski, S. Abdi. (with contributions from H. Cho, D. Shin, A. Gerstlauer)

Age nda. Intel PXA27x Processor Family: An Applications Processor for Phone and PDA applications

One Box Tester for LTE-Advanced UE Development

Towards 5G: Advancements from IoT to mmwave Communcations. Next Generation and Standards Princeton IEEE 5G Summit May 26, 2015

Wireless systems overview

Extending the Power of FPGAs

Software Driven Verification at SoC Level. Perspec System Verifier Overview

Adaptable Intelligence The Next Computing Era

Pilot Walktour (Android) V3.8 Product Overview

Considerations for SDR Implementations in Commercial Radio Networks

A Software Development and Validation Framework for SDR Platforms

TCL Communication Technology Holdings Ltd.

One Box Tester for LTE-Advanced UE Development

Real-Time Debugging Highly Integrated Embedded Wireless Devices

Next Generation Verification Process for Automotive and Mobile Designs with MIPI CSI-2 SM Interface

Session 2: 4G to 5G networks and standard releases

FPGA Entering the Era of the All Programmable SoC

Next Generation Enterprise Solutions from ARM

Test and Verification Solutions. ARM Based SOC Design and Verification

Algorithm-Architecture Co- Design for Efficient SDR Signal Processing

Verification Futures Nick Heaton, Distinguished Engineer, Cadence Design Systems

SPEAr: an HW/SW reconfigurable multi processor architecture

Introduction to ASIC Design

Content. New Challenges Memory and bandwidth

HSPA+ R8. February 2009

CoreTile Express for Cortex-A5

Cycle Approximate Simulation of RISC-V Processors

High performance, power-efficient DSPs based on the TI C64x

Takashi Shono, Ph.D. Intel 5G Tokyo Bay Summit 2017

Lecture 5: Computing Platforms. Asbjørn Djupdal ARM Norway, IDI NTNU 2013 TDT

Snapdragon S4 System on Chip

Beyond TrustZone Security Enclaves Reed Hinkel Senior Manager Embedded Security Market Develop

/ 3GPP2-Rev C : Future Mobile Broadband Access Solution

Reconfigurable Cell Array for DSP Applications

Jack Kang ( 剛至堅 ) VP Product June 2018

IFX Day Secure Mobile Solutions. Dominik Bilo CMO Secure Mobile Solutions Business Group. November 16, Munich. IFX Day 2004.

RapidIO.org Update. Mar RapidIO.org 1

TI SimpleLink dual-band CC1350 wireless MCU

On the road with 3GPP. 3GPP s Long Term Evolution and System Architecture Evolution projects

Does FPGA-based prototyping really have to be this difficult?

3G Technical Evolution as an evolving broadband solution

Versal: The New Xilinx Adaptive Compute Acceleration Platform (ACAP) in 7nm

Introducing the Latest SiFive RISC-V Core IP Series

Developing and Integrating FPGA Co-processors with the Tic6x Family of DSP Processors

Flash Storage Trends & Ecosystem

Hardware/Software Co-design

Analyzing and Debugging Performance Issues with Advanced ARM CoreLink System IP Components

RISC-V Core IP Products

Transcription:

Software Defined Modem A commercial platform for wireless handsets Charles F Sturman VP Marketing June 22 nd ~ 24 th Brussels charles.stuman@cognovo.com www.cognovo.com

Agenda SDM Separating hardware from software Why now Multimode modem processing requirements A whole system approach Optimising processor resources for the task in hand Low power design and scalability Performance benchmarks achieved Software development methodology Task scheduling and mapping System design, verification and optimisation Bottom up vs Top down Modem Compute Engine A commercially available platform for low-power handheld wireless 2

Bitrate, Mbps Why Now? A changing market Rapid evolution in applications and services Many products need wireless : Smartphones, Tablets, Gaming, Home entertainment Rapidly increasing modem complexity Driven by applications and services Increasing time to market and cost SDM delivers 500 450 400 350 300 250 200 150 100 50 0 GSM GPRS 200MOp/s LTE 50GOp/s HSPA+ 20GOp/s HSPA 5GOp/s LTE-A 1Gbps 300GOp/s 1990 1995 2000 2005 2010 2015 Faster : Time to market and market response Flexibility : One chip = many products / many standards Smaller : Smaller die size = Lower cost 3

Modem Requirements Layer 3 Radio Resource Control Layer 2 Radio Link Control and Media Access Control Layer 1 PHY Control Sub-frame Sequencing Modem & Channel Codec Protocol Stack Real-time control software Time-critical sequencing Hard-real-time signal processing Layer 1 combines real-time control with time critical signal processing 4

Modem Requirements CPHY PHY Layer 3 Radio Resource Control Radio Layer Driver 2 Radio Link Control and Media Access Control Layer 1 PHY Control RF IC Sub-frame Sequencing Modem & Channel Codec Timing Message Interface Layer 1 Control State Machine Sub-frame Control Sub-frame Sequences Signal Processing Traditionally, custom ASIC hardware Layer 1 combines real-time control with time critical signal processing Deterministic operation has traditionally demanded hardware SDM enables the entire modem to be expressed in software 5

Modem Requirements CPHY PHY Message Interface Radio Driver Layer 1 Control State Machine RF IC Timing Sub-frame Sub-frame Control Sequences Signal Processing S/w implementation hosted on heterogeneous processor platform Modem Compute Engine Layer 1 combines real-time control with time critical signal processing Deterministic operation has traditionally demanded hardware SDM enables the entire modem to be expressed in software 6

A System Solution for SDM Layer 1 Partitioning Applications Modem specific operation Protocol Plane Layers 2, 3, 4 Protocol Stack Control Plane Layer 1 SDM-OS Data Plane Physical Layer Libraries Hardware abstraction & API Common services Common functions Code re-use Software Hardware Generic s/w execution environment Protocol Processor System Modem Compute Engine Control Processor Sequence Processor Data Processor Heterogeneous Multi-processing Common Infrastructure The Soft Modem External Software Domain RF 7

Logical Layer 1 Layer 1 Layer 1 Partitioning Layer 1 Control GP processor support for Layer 1 control PHY Task Sequencing Real-time deterministic task sequencing driven by on-air timing Hard-real-time Signal Processing PPA efficient wireless algorithmic processing No modem-specific hardware processing PHY Control PHY Sequences PHY Kernels RF Driver PHY Modem Compute Engine Logical Control Layer 1 Processor PHY Control Sequence PHY Sequences Processor Data PHY Processor Kernels Common Infrastructure The Soft Modem External Software Domain RF 8

Layer 1 System Design Existing Layer 1 design & implementation must be supported Seamless integration with existing protocol stacks Operational control, management & service provision Largely specific to SDM platform : Auto generated code avoids creating extra engineering workload & human error Sub-frame sequencing Extension of present PHY modelling carried out with e.g. Simulink or Co-Ware : Graphical catpure and auto-compilation of flow control Logical Layer 1 PHY Control Uplink / Downlink PHY algorithms PHY Sequences PHY Kernels Compilation of existing fixed-point C-code, with efficient profiling and optimisation tools. Must be able to re-validate against reference models 9

Peripheral APB Bus System AXI Bus Bridge Bridge Ardbeg VSP Ardbeg VSP Ardbeg VSP Ardbeg VSP Ardbeg VSP Ardbeg VSP Introducing Cognovo s Modem Compute Engine (MCE) Real-time L1 / PHY computing system PHY Data Plane Multi-core Vector Signal Processing Designed-for-Wireless ISA Instruction & Data parallelism >> 130 GOp/s @500 MHz 11-way VLIW 32-way complex vector data-path Layer 1 Control Plane ARM CPU hosts SDM OS and Layer 1 Mode independent timing / power ctl Multi-standard Turbo engine RF interface SDM Sequencer Precise scheduling of kernels Autonomous operation removes overhead from L1 CPU Modem Compute Engine Timing and Slow Clock Unit Power Control Watchdog Timers TCM ARM CPU CACHE Bridge System DMA SDM Sequencer Interrupt Ctl System RAM Control 32b Data 128b VSP Sub-system VSP Sub-system VSP Sub-system Datapath DBG HARQ Memory Datapath DBG Register bank Datapath DBG Register bank VLIW Controller CTL Register bank VLIW Controller CTL I-TCM D-TCM VLIW Controller CTL I-TCM Bus Interface D-TCM Unit I-TCM Bus Interface D-TCM Unit Bus Interface Unit Multilayer Vector Interconnect Shared Memory FEC Engine Multilayer Vector Interconnect Bus Interface Unit Multilayer Vector Interconnect Shared Memory Shared Memory Dual VSP Core (MCE120) Cat 4 LTE capable (150Mb DL / 50Mb UL) Simultaneous DL and UL: 62% loaded at 400MHz 7.5 mm 2 and < 100mw in 32nm Similar power consumption to existing baseband silicon but with smaller area RF IF ARM VSP VDMA Datapath Datapath DBG Register bank Datapath DBG Register bank VLIW Controller CTL Register bank VLIW Controller CTL I-TCM D-TCM VLIW Controller CTL I-TCM Bus Interface D-TCM Unit I-TCM Bus Interface D-TCM Unit VDMA VDMA Coresight DBG JTAG Trace 10

Clock Frequency, MHz MCE120 Performance Benchmarks @ 40nm LP 350 300 250 200 150 100 50 R8 LTE-FDD Configurations 5MHz Channel Cat 2 : 5-40 Mbps 10 MHz Channel Cat 3 : 5-80 Mbps 20 MHz Channel Cat 4 : 20-150 Mbps R5 to R8 3G HSPA Configurations R5 W-CDMA : 384 Kbps R6 HSPA : 14 Mbps R7 HSPA+ MIMO : 28 Mbps R8 HSPA+ Dual Carrier : 42 Mbps 0 0 20 40 60 80 100 120 140 160 On-air Bit-rate, Mbps MCE clock frequency vs on-air bit-rate Connectivity Combined DL / UL estimates for 3GPP Rel5 to Rel8 WiFi 802.11g : 54 Mbps DVB-T (HD) : 15 Mbps HSPA evolution and LTE : Clock-rate scaling from 75MHz to 300MHz Clock rate and power can also scale with signal conditions High SNR (light signal processing) to low SNR (heavy signal processing) 11

Peripheral APB Bus System AXI Bus Bridge Bridge Ardbeg VSP Ardbeg VSP Bitrate, Mbps Power & Performance Scalability The same architecture will scale up or down Modem Compute Engine TCM ARM CPU CACHE Bridge Sequencer VSP Sub-system Datapath Register bank VLIW Controller I-TCM D-TCM DBG CTL Datapath Register bank VLIW Controller I-TCM D-TCM DBG CTL Higher clock rate More memory More processing resources: Multi-core / Multi-way 500 450 400 350 300 250 200 150 100 50 0 GSM GPRS 200MOp/s HSPA+ 20GOp/s HSPA 5GOp/s LTE-A 1Gbps 300GOp/s LTE 50GOp/s 1990 1995 2000 2005 2010 2015 Timing and Slow Clock Unit Interrupt Ctl System RAM Bus Interface Unit Bus Interface Unit Multilayer Vector Interconnect Shared Memory VDMA Power Control Control 32b Watchdog Timers Data 128b HARQ Memory Turbo RF IF System DMA ARM VSP Coresight JTAG Trace Lower clock rate Less memory Less processing resources: Multi-core / Multi-way Modem requirements drive Power Performance and Area PPA drives silicon implementation and process choice 12

A System Solution for SDM Development Partitioning Applications Modem specific operation Software Hardware Generic s/w execution environment Protocol Plane Layers 2, 3, 4 Protocol Stack Protocol Processor System Control Plane Layer 1 SDM-OS Data Plane Physical Layer PHY Kernel Library Modem Compute Engine Control Processor Sequence Processor Data Processor Design Environment System SDK Kernel SDK System integration & modelling PHY coding & optimisation Common Infrastructure The Soft Modem External Software Domain RF 13

Platform Resources Operating System API Development Partitioning Protocol Stack System simulation, visualisation and validation LL1-PhyL1 CPHY Logical Layer 1 PHY Layer 1 Operational control and management OS-like device management and service provision PHY Control RF Driver PHY Sequences Sub-frame flow control and sequencing of PHY operations PHY Kernels PHY C-code source compilation, profiling and optimisation Re-validation against reference models Seamless development flow from modem design through implementation and validation is key 14

Traditional Modem Design Flow System Analysis & Modelling Logical L1 Design Software Layer 1 Dev. Software Sw revision Standards Req s & Spec s Algorithm Design & Modelling Functional partition Subframe Sequencing H/w & S/w Signal Processing Hardware Sub-frame Software FPGA Dev. Hardware Integration and Test RTL Redesign for ASIC ASIC Verif., Fab & Test Integration and Test IOT Design Implement Re-Implement Time 15

SDM Design Flow PPA scaling RTL Design for ASIC ASIC Verif., Fab & Test Integrated Development Environment: System level integration Standards Req s & Spec s System Analysis & Modelling Algorithm Design & Modelling Functional partition Logical L1 Design Software Subframe Sequencing Software Signal Processing Software Develop, Integrate and Test on SDM Development Platform and Silicon IOT System Design Tools System Modelling Debug and Real-time Visualisation PHY Kernel development C Compiler Instruction Set Simulator PPA Profiling and Debug Design Implement Faster Silicon development is outside modem critical path Software-only flow allows Develop, Integrate and Test in single step 16

LTE Downlink Control Processing Example UML constructs for control and data flow Parameterised PHY Kernels Co-ordinated by SDM-OS and executed in real-time by SDM Sequencer 17

System Design Tool LTE Example PHY System Design and Capture LTE PHY Frame Sequence Symbol 0 Symbol 1 Symbol 2 Symbol 3 RF Front-End Processing OFDM Demultiplexing RFDMA Signalling Control Region Decoder Symbol 2 Front-End Processing Downlink Control Information Processing Control Region Processing 18

Visualisation Tool LTE Example Real Time PHY System Trace 10ms LTE Frame Real-time Trace 1ms sub-frames 1ms LTE Sub-frame Real-time Trace 1ms Sub-frame 19

Conclusion Strong demand for a new approach to wireless product development Faster : Time to market Flexibility : Single solution for multiple products, markets and standards Smaller : Cost of development and cost of product However, there must be no penalty Existing products must be met or bettered : Power / Cost / Size Legacy investment maintained : e.g. Protocol stack software / multi-mode Seamless design flow SDM can deliver on all of these factors Heterogeneous task optimised processing Highly parallel architecture for scalability Whole System approach to power management, programming and debug Control / Real-time scheduling / Data processing 20