HotChips An innovative HD video and digital image processor for low-cost digital entertainment products. Deepu Talla.

Similar documents
DaVinci. DaVinci Processor CPU MHz

DSP Solutions For High Quality Video Systems. Todd Hiers Texas Instruments

Emerging Architectures for HD Video Transcoding. Jeremiah Golston CTO, Digital Entertainment Products Texas Instruments

Emerging Architectures for HD Video Transcoding. Leon Adams Worldwide Manager Catalog DSP Marketing Texas Instruments

AT-501 Cortex-A5 System On Module Product Brief

Platform-based Design

Development of Low Power and High Performance Application Processor (T6G) for Multimedia Mobile Applications

Product Technical Brief S3C2416 May 2008

Subject to Change OMAP/ARM 最新技術與開發工具. Anderson Hsiao Cat/EEE Business Development Texas Instruments

Age nda. Intel PXA27x Processor Family: An Applications Processor for Phone and PDA applications

Hi3516C Professsional HD IP Camera SoC. Brief Data Sheet. Issue 01. Date Baseline Date

Introduction to Sitara AM437x Processors

Cannon Mountain Dr Longmont, CO LS6410 Hardware Design Perspective

Module Introduction. Content 15 pages 2 questions. Learning Time 25 minutes

STM32F429 Overview. Steve Miller STMicroelectronics, MMS Applications Team October 26 th 2015

Silicon Motion s Graphics Display SoCs

SBC8140 Single Board Computer

Technical Specifications DiMAGE G600

Classification of Semiconductor LSI

Effective System Design with ARM System IP

A 1-GHz Configurable Processor Core MeP-h1

LeopardBoard Hardware Guide Rev. 1.0

Hi3518E V200 Economical HD IP Camera SoC. Brief Data Sheet. Issue 02. Date

New STM32 F7 Series. World s 1 st to market, ARM Cortex -M7 based 32-bit MCU

New System Solutions for Laser Printer Applications by Oreste Emanuele Zagano STMicroelectronics

The New 5.0 Megapixel DiMAGE G500

SPEAr: an HW/SW reconfigurable multi processor architecture

DevKit7000 Evaluation Kit

The Use Of Virtual Platforms In MP-SoC Design. Eshel Haritan, VP Engineering CoWare Inc. MPSoC 2006

D Demonstration of disturbance recording functions for PQ monitoring

ID 730L: Getting Started with Multimedia Programming on Linux on SH7724

Course Introduction. Purpose: Objectives: Content: Learning Time:

STM32F7 series ARM Cortex -M7 powered Releasing your creativity

Techniques for Optimizing Performance and Energy Consumption: Results of a Case Study on an ARM9 Platform

Intelop. *As new IP blocks become available, please contact the factory for the latest updated info.

DevKit8500D Evaluation Kit

SAM A5 ARM Cortex - A5 MPUs

BPI-D1 User Manual. SinoVoip Co.,Limited

SoC FPGAs. Your User-Customizable System on Chip Altera Corporation Public

Embest SOC8200 Single Board Computer

Specifications CAMEDIA C-8080 WIDE ZOOM

ECE 471 Embedded Systems Lecture 2

ARM Processors for Embedded Applications

Copyright 2016 Xilinx

Product Technical Brief S3C2413 Rev 2.2, Apr. 2006

Exploring System Coherency and Maximizing Performance of Mobile Memory Systems

ARM Cortex core microcontrollers 3. Cortex-M0, M4, M7

STM32 Journal. In this Issue:

The Power and Bandwidth Advantage of an H.264 IP Core with 8-16:1 Compressed Reference Frame Store

KeyStone C665x Multicore SoC

An Ultra High Performance Scalable DSP Family for Multimedia. Hot Chips 17 August 2005 Stanford, CA Erik Machnicki

Systems in Silicon. Converting Élan SC400/410 Design to Élan SC520

Digital Signal Processor 2010/1/4

The Benefits of GPU Compute on ARM Mali GPUs

DevKit8000 Evaluation Kit

INTEGRATING COMPUTER VISION SENSOR INNOVATIONS INTO MOBILE DEVICES. Eli Savransky Principal Architect - CTO Office Mobile BU NVIDIA corp.

S2C K7 Prodigy Logic Module Series

1. Data plane blocks can be optimized for different applications. 2. The IP blocks can be reused and the design complexity decreases.

2008/12/23. System Arch 2008 (Fire Tom Wada) 1

Product Technical Brief S3C2412 Rev 2.2, Apr. 2006

3.8cm (1.5 inch) low-temperature polysilicon TFT color monitor with anti-reflection coating CCD electronic shutter and mechanical shutter

Product Technical Brief S3C2440X Series Rev 2.0, Oct. 2003

Blackfin Optimizations for Performance and Power Consumption

System-on-Chip. Outline. Example: iphone 3GS disassembled. System-on-Chip is Everywhere! SoC Challenges. SoC Challenges and Current Solutions

RZ Embedded Microprocessors

Day & Night 6MP 360 Panorama Dome IR IP Camera

University Program Advance Material

EagleEye4 Series Smart Camera

H.264 AVC 4k Decoder V.1.0, 2014

PRODUCT PREVIEW TNETV1050 IP PHONE PROCESSOR. description

1/2.7-type interline primary-colour CCD Total pixels: 3.3 million. Auto and 50,100, 200, and 400 ISO equivalents mm (35mm equivalent: mm)

Fault tolerance in consumer products. Ben Pronk

STM32 F0 Value Line. Entry-level MCUs

PDK (Platform Development Kit) Getting Started. Automotive Processors

GENERAL DESCRIPTION. 1.1 Applications

2-bit ARM Cortex TM -M3 based Microcontroller FM3 Family MB9A130 Series

An H.264/AVC Main Profile Video Decoder Accelerator in a Multimedia SOC Platform

FinePix JX

Growth outside Cell Phone Applications

SoC Platforms and CPU Cores

Multimedia in Mobile Phones. Architectures and Trends Lund

Zynq-7000 All Programmable SoC Product Overview

ECE 471 Embedded Systems Lecture 2

Five Ways to Build Flexibility into Industrial Applications with FPGAs

Software Driven Verification at SoC Level. Perspec System Verifier Overview

1. Parts of the camcorder 1: Preview button 2: Menu button 3: LCD screen 4: HDMI jack 5: Mode dial 6: Power LED 7: Shutter button 8: Charging LED 9:

T he key to building a presence in a new market

KONICA MINOLTA DiMAGE Z2 DIGITAL CAMERA

Low Power System Design Using Atmel ARM Cortex -based Products Copyright Atmel Corporation

VLSI Design Automation. Maurizio Palesi

The Challenges of System Design. Raising Performance and Reducing Power Consumption

Zatara Series ARM ASSP High-Performance 32-bit Solution for Secure Transactions

STM32 Cortex-M3 STM32F STM32L STM32W

STM32F7 series ARM Cortex -M7 powered Releasing your creativity

Renesas Synergy MCUs Build a Foundation for Groundbreaking Integrated Embedded Platform Development

DiMAGE A200 Technical Specifications

Intel CoFluent Studio in Digital Imaging

Asynchronous on-chip Communication: Explorations on the Intel PXA27x Peripheral Bus

Incorporating a Capacitive Touch Interface into Your Design

Building High Performance, Power Efficient Cortex and Mali systems with ARM CoreLink. Robert Kaye

Transcription:

HotChips 2007 An innovative HD video and digital image processor for low-cost digital entertainment products Deepu Talla Texas Instruments 1 Salient features of the SoC HD video encode and decode using 16-bit DRAM interface Up to 75 MP/s of image processing for fast picture capture, processing, and display Low-power consumption for portable end equipments Flexibility and programmability for implementing new use-cases High level of integration coupled with high performance to bring high-end features into low-cost products Belongs to the DaVinci TM family of digital video processors; products with this SoC shipping in the market today 2

Outline Target applications Block diagram of the SoC Architectural details of key components Performance and power specifications Software platform Example reference design (used for 2 target applications) Floorplan and key statistics of the SoC 3 Target applications Digital still camera Solid state camcorder HD docking station Digital photo frame IP security network camera Photo printer Other Low-cost EEs HDTV card reader Low-cost portable media player 4

Block diagram Video Processing Subsystem ARM Subsystem ARM926EJ-S 216 MHz CPU 16KB I$, 8KB D$ 32KB RAM 8 KB ROM Imaging acceleration - programmable SIMD engine MJCP MPEG4 HD engine JPEG compression Front End ISP Image signal processor Back End Enhanced On-Screen Display (OSD) Video Enc (VENC) Image/video Statistics CCD/CMOS controller Resizer 10b DAC CCD/ CMOS Imager Analog/ digital video output Peripherals DMA Data & Configuration Bus Connectivity System McBSP x2 EDMA Serial Interfaces I 2 C SPI x3 UART x3 Motor control DDR EMIF (171 MHz) USB2.0 HS GIO NAND/ECC EMIF Timer x6 Program/Data Storage CE-ATA Over MMC WDT MMC/SD SDIO x2 PWM x4 MS/Pro 5 Video processing subsystem State-of-the-art ISP for camera capture Advanced image processing algorithms in hardware with additional software programmable accelerators for flexibility Handles any sensor read-out pattern Latest generation of lens distortion/shading compensation, noise filters, CFA interpolation, color processing, edge enhancement, and resizing IP Real-time statistics for auto focus, exposure, white balance, video stabilization, image classification, etc Optimized display subsystem Multiple video and graphic window support for enhanced GUI Integrated analog components for direct connect to a TV System tailored master interface Multi-level arbitration to re-order and buffer various module commands/data to simplify SoC level interconnect 6

Video and imaging coprocessors (1 of 2) Flexible SIMD engine Allows for implementing differentiated algorithms at 5-10x speed of an ARM processor high ISO (low light) noise filtering, redeye reduction, face tracking/detection and recognition, image stitching, adaptive contrast enhancement, motion estimation Up to 8 ALUs used per cycle MAC, absolute difference, table lookup, max/min, median, etc Local SRAMs with multiple banks to achieve maximum memory performance Programmed using vector style instructions with each instruction operating on a 2D block of data APIs available for common functions 7 Video and imaging coprocessors (2 of 2) MPEG4 compression engine Processing at HD rates with margin to achieve 30 fps in a system Programmable ME engine for resolution versus bit-rate/psnr tradeoff ME engine performs tasks of 422-to-420 conversion, full and half pel ME, inter/intra coding decision, Y and C prediction, and various block copies 16-way SAD engine Budget less than 1000 cycles per macro-block JPEG compression engine Fixed function block > 40 MP/s sustained performance in a system Can be used for video as a mjpeg sequence 8

Infrastructure components Shared across portfolio of DaVinci processors Reuse at IP, SoC, drivers, and software On-chip bus Split command and data to maximize performance for multiple masters at different priorities, latency & bandwidth requirements, and transfer sizes DRAM memory controller (DDR EMIF) Multi-level arbitration schemes to achieve HD performance in a 16-bit DDR2 environment EDMA Flexible DMA engine with 64 channels to simplify programming of slave data transfers 9 Key application performance specs Raw image processing (ISP and JPEG compression) 75 MP/s Sustained image processing (limited by overall system not the SoC) > 5 fps at 8 MP resolution HD video encode and decode (720P) > 30 f/s MPEG4 in full system including digital video stabilization Advanced image processing (using programmability in the SoC) High ISO image capture (10000) Red-eye removal (< 400 ms for 10MP image) Fast face-tracking (> 30 f/s) Adaptive image contrast enhancement (< 500 ms for 10 MP image) Panoramic image stitching (3 images < 1 second) 10

Power consumption Standby power (deep sleep mode) 1 mw Image and video preview < 200 mw without DRAM power; < 250 mw including DRAM power SD video encode and playback < 250 mw without DRAM power; < 330 mw including DRAM power HD video encode (720P) < 400 mw without DRAM power; < 550 mw including DRAM power High speed image processing (> 50 MP/s) < 300 mw without DRAM power, < 400 mw including DRAM power 11 Software platform overview Customer application/s Reference Framework Basic user interface Application APIs Algorithm abstraction Code Composer Studio Dev Tools Trace Application kernel/software Compliant Plug-in OS abstraction Program Build Program Debug Real-Time Analysis Platform support package Chip support library OS port Board support package Other drivers XDS Emulator Host Computer Embedded emulation components SoC physical hardware 12

Example: HW + SW turnkey reference design for digital imaging products (DSC, DVC) Imager Board CCD or CMOS Sensor AFE, Lens, Motor, Strobe, Focus Assist Display Board (LCD) Digital Media Processor Board SD/MMC Digital Media Processor USB 2.0 USB 1.1 NAND DDR1/2/m Audio Codec WLAN Power / UI Board Battery and DC power Battery Charger, Key scan RTC, User buttons, Mode settings Hardware Processor Board LCD Board Imager Board Power / UI Board Software Drivers Chip Support Libraries Operating System Image Pipeline Image Quality & Tuning Tools QA Testing Support Defined API Video and Audio Codecs Customers can customize or use their own codecs and algorithms 13 Floorplan 90nm with 5 metal layers 1.3V core voltage ~2.6 million gates ~1.7 Mbits of memory 12x12 package Major blocks Dark blue ARM subsystem Yellow Imaging and video compression Light green Video processing subsystem 14

References SoC press release: http://focus.ti.com/docs/pr/pressrelease.jhtml?prelid=sc07043 DaVinci technology overview and specifications: http://focus.ti.com/lit/an/sprt401a/sprt401a.pdf DaVinci technology digital video innovation product bulletin: http://www.ti.com/litv/pdf/sprt378d 15