DSP Solutions For High Quality Video Systems. Todd Hiers Texas Instruments

Similar documents
DaVinci. DaVinci Processor CPU MHz

Emerging Architectures for HD Video Transcoding. Jeremiah Golston CTO, Digital Entertainment Products Texas Instruments

Emerging Architectures for HD Video Transcoding. Leon Adams Worldwide Manager Catalog DSP Marketing Texas Instruments

HotChips An innovative HD video and digital image processor for low-cost digital entertainment products. Deepu Talla.

DaVinci 最新技術與開發工具介紹. Sunny Lee

MICROPROCESSOR SYSTEM FOR VISUAL BAKED PRODUCTS CONTROL

DSP 最新技術與開發工具介紹. Sunny Lee

Scalable Multi-DM642-based MPEG-2 to H.264 Transcoder. Arvind Raman, Sriram Sethuraman Ittiam Systems (Pvt.) Ltd. Bangalore, India

Subject to Change OMAP/ARM 最新技術與開發工具. Anderson Hsiao Cat/EEE Business Development Texas Instruments

The Development and Applications of Multi-core Multimedia Platform. Jefferey Chiu Ph.D. DSP System Senior FAE Texas Instruments

D Demonstration of disturbance recording functions for PQ monitoring

TI TMS320C6000 DSP Online Seminar

Embedded Processing Portfolio for Ultrasound

Classification of Semiconductor LSI

TKT-2431 SoC design. Introduction to exercises. SoC design / September 10

Motivation and scope Selection criteria and methodology Benchmarking options Processor architecture options Conclusions

Digital Signal Processor 2010/1/4

2008/12/23. System Arch 2008 (Fire Tom Wada) 1

An H.264/AVC Main Profile Video Decoder Accelerator in a Multimedia SOC Platform

SBC8140 Single Board Computer

Chapter 7. Hardware Implementation Tools

The World Leader in High Performance Signal Processing Solutions. DSP Processors

KeyStone C665x Multicore SoC

Video Encoding with. Multicore Processors. March 29, 2007 REAL TIME HD

PRODUCT PREVIEW TNETV1050 IP PHONE PROCESSOR. description

Application Report. 1 System Requirements. 2 Using the DM643x Pin Multiplexing Utility. Bernard Thompson...

DevKit8500D Evaluation Kit

Hi3536 H.265 Decoder Processor. Brief Data Sheet. Issue 03. Date

IP Video Phone on DM64x

Introduction to Sitara AM437x Processors

Choosing the Appropriate Simulator Configuration in Code Composer Studio IDE

A design of real-time image processing platform based on TMS320C6678

DaVinci Family Video Performance

The S6000 Family of Processors

Upcoming Video Standards. Madhukar Budagavi, Ph.D. DSPS R&D Center, Dallas Texas Instruments Inc.

An Example of Network Video Monitoring System Based on DM6446. ChaoJun Yan

SoC Platforms and CPU Cores

TI s PCI2040 PCI-to-DSP Bridge

Hi3516C Professsional HD IP Camera SoC. Brief Data Sheet. Issue 01. Date Baseline Date

CS368-LI Digital Media Software Development Kit

Emerging Integrated Drive Controller

TMS320DM644x DMSoC ARM Subsystem. Reference Guide

The Use Of Virtual Platforms In MP-SoC Design. Eshel Haritan, VP Engineering CoWare Inc. MPSoC 2006

FPGA Co-Processing Architectures for Video Compression

MYD-C7Z010/20 Development Board

Emerging H.26L Standard:

Video Compression An Introduction

TMS320C6412 Fixed-Point Digital Signal Processor. Data Manual

MYC-C7Z010/20 CPU Module

Product Technical Brief S3C2416 May 2008

KeyStone C66x Multicore SoC Overview. Dec, 2011

EyeCheck Smart Cameras

Platform-based Design

CS365-TI Digital Media Software Development Kit

CS368-TI Digital Media Software Development Kit

New System Solutions for Laser Printer Applications by Oreste Emanuele Zagano STMicroelectronics

Course Introduction. Purpose: Objectives: Content: Learning Time:

SMT943 APPLICATION NOTE 1 APPLICATION NOTE 1. Application Note - SMT372T and SMT943.doc SMT943 SUNDANCE MULTIPROCESSOR TECHNOLOGY LTD.

Ittiam Systems (Pvt.) Ltd.,

SoC Overview. Multicore Applications Team

Digital Signal Processors: fundamentals & system design. Lecture 1. Maria Elena Angoletta CERN

Advanced Video Coding: The new H.264 video compression standard

DevKit8000 Evaluation Kit

SAM A5 ARM Cortex - A5 MPUs

Evaluating the DSP Processor Options (DSP-522)

Introduction to Video Compression

Keystone Architecture Inter-core Data Exchange

Embest SOC8200 Single Board Computer

XDS560V2 Installation Guide

MYD-C437X-PRU Development Board

Fujitsu SOC Fujitsu Microelectronics America, Inc.

S2C K7 Prodigy Logic Module Series

Zynq-7000 All Programmable SoC Product Overview

EMAC SoM Presentation.

EMAC SoM Presentation

Video Interface Module for TI EVM TMDXEVM8148 and TMDXEVM368

VIDEO COMPRESSION STANDARDS

Critical Link, LLC. 1 Copyright 2013, Critical Link LLC Specifications Subject to Change. FEATURES TI OMAP-L138 Dual Core Application Processor

TMS320C6414T/15T/16T Power Consumption Summary

WHICH MICRO? What does MCU needs to do in my system? What are the tasks? Dr. Adriana Becker-Gomez

Programmable Logic Design Grzegorz Budzyń Lecture. 15: Advanced hardware in FPGA structures

SPEAr: an HW/SW reconfigurable multi processor architecture

Advanced Microcontrollers Grzegorz Budzyń Lecture

MPEG-2. And Scalability Support. Nimrod Peleg Update: July.2004

AT-501 Cortex-A5 System On Module Product Brief

High-Performance, Highly Secure Networking for Industrial and IoT Applications

Intelop. *As new IP blocks become available, please contact the factory for the latest updated info.

STM32F429 Overview. Steve Miller STMicroelectronics, MMS Applications Team October 26 th 2015

Design Choices for FPGA-based SoCs When Adding a SATA Storage }

TKT-2431 SoC design. Introduction to exercises

The Scope of Picture and Video Coding Standardization

RZ Embedded Microprocessors

Hi3520D V300 H.264 CODEC Processor. Brief Data Sheet. Issue 04. Date

UCT Software-Defined Radio Research Group

Technology Overview. DaVinci. CONTENTS

Copyright 2016 Xilinx

An Ultra High Performance Scalable DSP Family for Multimedia. Hot Chips 17 August 2005 Stanford, CA Erik Machnicki

MYC-AM335X CPU Module MYC-AM3352 MYC-AM3359

SoC FPGAs. Your User-Customizable System on Chip Altera Corporation Public

Cannon Mountain Dr Longmont, CO LS6410 Hardware Design Perspective

Transcription:

DSP Solutions For High Quality Video Systems Todd Hiers Texas Instruments

TI Video Expertise Enables Faster And Easier Product Innovation TI has a long history covering the video market from end to end R&D began on image processing in the early 80s TI leverages video systems expertise and R&D across internal design teams to drive innovation and business development Customers can leverage TI s expertise in end-to-end video to quickly launch into multiple video markets

Defining the Video Chain Capture Process Deliver Receive View Acquisition of original video content including A/D and sampling Content is encoded, transcoded, transrated and/or analyzed Content is transported via private or public networks Received content is stored, decoded and/or transcoded Content is accessible through a viewing mechanism

Platforms & Uses Likely Uses Platform Encode Transcode Transrate Decode Multi 6455 645x + FPGA Multi DM643x DM643x + FPGA Multi Cat LC C6424 + FPGA DM644x (DaVinci)

Serial RapidIO TM 64-bit EMIF 2-500 McBSP 0/1 PCI-66 and 1 Gigabit EMAC OR HPI and 1 Gigabit EMAC or UTOPIA Switched Central Resource 128 C6455 Quick Introduction 2 MB L2 (Up to 256KB Cache) 128 EDMA 3.0 4-32x PLL 128 8*32 TMS320C6455 32 KB L1P Cache/SRAM 256 New C64x+ DSP Core 100% Code Compatible 20% Better Cycle Performance 20-30% Better Code Size 32 KB L1D Cache/SRAM Package: 697 BGA, 24 mm, 0.8 mm Pitch, Lead-Free Balls Serial Up to 10 Gb/s Bi-di communication Connect to any RapidIO device New C64x+ Core Up to 1 GHz ISA enhancements for better performance and code size C6455-720 MHz: $179 C6455-850 MHz: $219 C6455-1 GHz: $259 (TMS Production prices 2006 / 07-10Ku) Memory Architecture 2 MB L2 256K Cache/SRAM; Remainder is SRAM only 32 KB L1D Cache/SRAM 32 KB L1P Cache/SRAM Two EMIFs 32-bit 2 (2-500 SDRAM) 64-bit EMIF GEMAC / UTOPIA / PCI-66 / HPI Multiple connectivity options

C642x Quick Introduction Features New C64x+ Core 400 MHz, 500 MHz, 600 MHz Up to 4800 MMACs performance Memory C6421 16-KB L1D and 16-KB L1P cache/sram 64-KB L2 cache/sram Memory C6424 80-KB L1D and 32-KB L1P cache/sram 128-KB L2 cache/sram Peripherals C6421 Two EMIFs 16-bit 2 333MHz; 8-bit EMIF VLYNQ / EMAC (RMII/MII) or HPI / RMII McBSP or McASP Audio and telecom interfacing UART (2), I 2 C, GPIO, PWM (3), 64-bit timers (2) Peripherals C6424 enhancements: 32-bit 2 333MHz; 16-bit EMIF PCI33 or VLYNQ / EMAC (RMII) or HPI / RMII McBSP (2) or McASP Package: 361 pin 16 16-mm BGA, 0.8-mm Pitch, 376 pin 23 x 23 mm BGA, Lead-Free Balls Samples: 4Q06; Production: 4Q07 Price C6421/C6424: $8.95/$15.95 at 400 MHz / 10 KU 2007/08 Benefits C6421: Lowest cost DSP with C64x+ performance and Ethernet Smallest package C64x DSP C6421/24 pin Compatible C6424 2 32 EMIF 16 PCI 33 or VLYNQ RMII or HPI RMII I 2 C OSC PLL JTAG EDMA 3.0 DSP Subsystem L1D 80 KB C64x+ Core L1P 32 KB PLL L2 128 KB Cache/ SRAM Timer WDT GPIO McBSP McBSP or McASP Timer x2 PWM x3 UART x2

DM643x Quick Introduction Features New C64x+ Core C64x+ Core @ Up to 600 MHz Memory 80 KB L1D, 32 KB L1P Cache/SRAM 128 KB L2 Cache/SRAM Peripherals Video Port Sub-System (VPSS): Input (CCDC), Output (w/dacs), Resizer, OSD, and Camera Control Two EMIFs: 2-266: 32 bits, 133 MHz; EMIF 2.1 10/100 Ethernet MAC (MII); PCI 33 MHz; HPI; McASP VLYNQ Serial Interface to FPGAs UART (2), I 2 C, SPI, GPIO, PWM (3), CAN (HECC), 64-bit Timers (2) Package: 16 16mm or 23 23mm, 361 Pin, 0.8mm or 376 Pin 1.0mm Pb-Free Balls Pin Compatible with DM6435/3/1 Samples 4Q06; Production 2Q07 Price: $22.95 @ 600 MHz, 10 KU 2007/08 DSP Subsystem L1D 80KB C64x+ TM DSP 600-MHz Core L1P 32KB Peripherals EDMA Serial Interfaces McBSP 2 or McASP DM6437 PCI 33 I 2 C CAN L2 128 KB Cache Connectivity or UART 2 SPI CCD Controller Video Interface Switch Fabric VLYNQ EMAC Video Processing Subsystem or Front End Back End On-Screen Display (OSD) HPI EMAC 2 Controller (32b) Video Enc (VENC) Timer 64-bit 2 Preview Histogram/3A System WD Timer Program/Data Storage EMIF (8b) Resizer 10b DAC 10b DAC 10b DAC 10b DAC PWM 3 PLL JTAG PLL OSC

DM6443/6 ARM Subsystem ARM926EJ-S 300 MHz CPU Peripherals EDMA Audio Serial Port Video-Imaging Coprocessor (VICP) DSP Subsystem USB 2.0 PHY Serial Interfaces I 2 C SPI DM644x Quick Introduction C64x+ TM DSP 600 MHz Core Connectivity EMAC VLYNQ With MDIO Video Processing Subsystem Front End Resizer CCD Controller Histogram/3A Video Interface Video Processing Subsystem Preview On-Screen Display (OSD) Switched Central Resource (SCR) UART UART UART 2 Controller (16b/32b) General- Purpose Timer Back End Video Enc (VENC) System Async EMIF/ NAND/ SmartMedia Watchdog Timer Program/Data Storage 10b DAC 10b DAC 10b DAC 10b DAC ATA/ Compact Flash PWM PWM PWM MMC/ SD Video-Optimized TMS320C64x+ DSP @ 600MHz H.264 MP@L3, 30fps SD Decoding VC1/WMV9 Full D1 SD Decoding MPEG-2 MP@ML SD Decoding MPEG-4 ASP Full D1 SD Decoding H.264 BP D1 Encoding Simultaneous H.264 BP CIF Coding Dedicated video processing subsystem Back end - Integrated OSD, four video DACs, 24-bit digital RGB output Front end Resizer, Image processing engine, 16-bit digital input 2006 10KU Price: TMS320DM6443 $29.95 TMS320DM6446 $34.95

SRIO Quick Introduction Serial RapidIO is a high-performance, packet-switched, interconnect technology that addresses the embedded industry's need for: Reliability Increased Bandwidth Increased Scalability Serial RapidIO allows chip-to-chip and board-to-board communications at performance levels scaling to ten Gigabits per second and beyond C6455 Serial RapidIO Support IEEE 1149.6 Compliant 1, 2, or 2.5 GBit/sec per link Up to four 1x links (each 1x link is bidirectional) --OR-- One 4x link (bi-directional pipe), which provides up to 10 GBit/sec Supports connecting to any other Serial RapidIO device Supports wide variety of network topologies: Rings, Meshes, Stars, & Others Benefits 1x Link is fast enough to send HD 1080i raw video between devices 4x Link is easily fast enough to send HD 1080p raw video between devices Reduction in chip count, board area and system cost

H.264 Quick Introduction Arbitrary Slice Ordering Flexible Macroblock Ordering Redundant Slices Baseline Common Features Motion Prediction: 7 block sizes, ¼ sample accuracy, multiple reference frames Intra Prediction: 17 modes Reversible Transform & Non-uniform Quantization Universal & Context Adaptive VLC (UVLC/CAVLC) Loop (de-blocking) Filter Main B pictures: several prediction modes Context Adaptive Binary Arithmetic Coding (CABAC) Weighted Prediction Adaptive Frame/Field Coding H.264: Baseline & Main Profiles

H.264 Quick Introduction

Lots of Knobs to Turn Resolution Frame rate Output bitrate Encoder Profile (Base, Main, High) Encoder Features Search Range Search Algorithm Search Partitions Refinement Number of Reference Frames All can affect processing load, video quality

Multi 6455 Homogenous processing on General Purpose DSP Highest Performance Most Flexible Most Scalable Most Software Differentiable

Multi 6455 - Architecture Video IO TVP7000 DRAM DRAM DRAM EMIF C6455 C6455 C6455 SRIO SRIO SRIO SRIO Switch Or Direct Connect

Multi 6455 Encode Partitioning (Functional Partition) Intra Prediction + C6455 DCT Q Entropy Encoding Video Input SRIO SRIO IDCT IQ C6455 ENET/PCI Encoded Output Motion Comp Motion Est Recon. Frame De- Blocking

Multi 6455 - Encode Performance (Functional Partition) Est. Utilization Resource CPU SRIO ENET/PCI 720p30 BP (2 DSPs) 800 MHz 165 MB/s 900 MB/s 5 Mb/s 720p30 MP (2 DSPs) 1200 MHz 165 MB/s 1575 MB/s 5 Mb/s Resource Utilization is worst case of the DSPs in the system 1 Reference Frame, +/- 64 Search range

Multi 6455 Encode Partitioning (Sliced Partition) SRIO C6455 Bitstream Assembler ENET/ PCI Encoded Output SRIO C6455 SRIO C6455

Multi 6455 - Encode Performance (Sliced Partition) Est. Utilization 720p60 BP 720p60 MP 1080i60 BP 1080i60 MP Resource (3 DSPs) (4 DSPs) (4 DSPs) (6 DSPs) CPU 880 MHz 1000 MHz 750 MHz 750 MHz SRIO 210 MB/s 210 MB/s 240 MB/s 240 MB/s 600 MB/s 790 MB/s 510 MB/s 590 MB/s ENET/PCI 10 Mb/s 10 Mb/s 10 Mb/s 10 Mb/s Resource Utilization is worst case of the DSPs in the system 1 Reference Frame, +/- 64 Search range

Multi 6455 Encode Partitioning (Combination Partition) Encoded Output

Multi 6455 - Encode Performance (Combination Partition) Est. Utilization Resource 720p60 BP (4 DSPs) 720p60 MP (4 DSPs) 1080i60 BP (4 DSPs) 1080i60 MP (6 DSPs) CPU 740 MHz 1100 MHz 830 MHz 820 MHz SRIO 525 MB/s 525 MB/s 600 MB/s 600 MB/s 600 MB/s 790 MB/s 510 MB/s 590 MB/s ENET/PCI 20 Mb/s 20 Mb/s 30 Mb/s 30 Mb/s Resource Utilization is worst case of the DSPs in the system 1 Reference Frame, +/- 64 Search range

Multi 6455 Transcode Partitioning Residual Info C6455 De- Blocking SRIO SRIO Motion Comp Intra Prediction Reconstructed Video C6455 + IQ IDCT Entropy Decoding ENET/PCI ENET/PCI Encoded Input Encoded Output

Multi 6455 - Transcode Performance Est. Utilization 720p60 BP 720p60 MP 1080i60 BP 1080i60 MP Resource (3 DSPs) (4 DSPs) (4 DSPs) (2 DSPs) CPU 750 MHz 850 MHz 640 MHz 960 MHz SRIO 110 MB/s 110 MB/s 120 MB/s 120 MB/s 120 MB/s 120 MB/s 135 MB/s 135 MB/s ENET/PCI 20 Mb/s 20 Mb/s 20 Mb/s 20 Mb/s Resource Utilization is worst case of the DSPs in the system 1 Reference Frame, Refinement only

645x + FPGA General Purpose DSP Customizable Acceleration Highest Performance Flexible Scalable DSP is ideal for sequential processing steps FPGA is ideal for highly parallel processing steps

645x + FPGA - Architecture With SRIO Without SRIO DRAM C6455 DRAM C6454 SRIO EMIF DRAM SRIO FPGA DRAM EMIF FPGA Video IO TVP7000 Video IO TVP7000

645x + FPGA Encode Partitioning Intra Prediction + C6455 DCT Q Entropy Encoding Video Input SRIO/EMIF SRIO/EMIF IDCT IQ FPGA ENET/PCI Encoded Output Motion Comp Motion Est Recon. Frame De- Blocking

645x + FPGA - Encode Performance Est. Utilization Resource CPU SRIO/EMIF ENET/PCI 720p30 BP (1 DSP) 570 MHz 165 MB/s 150 MB/s 5 Mb/s 720p30 MP (1 DSP) 850 MHz 165 MB/s 165 MB/s 5 Mb/s Resource Utilization is worst case of the DSPs in the system 1 Reference Frame, +/- 64 Search range

645x + FPGA Transcode Partitioning C6455 De- Blocking Motion Comp Intra Prediction + IQ IDCT Entropy Decoding ENET/PCI Encoded Input SRIO/EMIF Encoded Output SRIO/EMIF Motion Comp Motion Est FPGA Recon. Frame De- Blocking

645x + FPGA - Transcode Performance Est. Utilization Resource CPU SRIO/EMIF ENET/PCI 720p30 BP (1 DSP) 1170 MHz 110 MB/s 385 MB/s 15 Mb/s 720p30 MP (2 DSP) 850 MHz 110 MB/s 235 MB/s 10 Mb/s Resource Utilization is worst case of the DSPs in the system 1 Reference Frame, +/- 64 Search range

Multi DM643x Homogenous processing on DSP with Specialized Video IO Lower cost Flexible Scalable Software Differentiable

Multi DM643x - Architecture DRAM DRAM DRAM DM643x DM643x DM643x VP VP VP Video IO TVP7000 FPGA (Interconnect)

Multi DM643x - Architecture DRAM DRAM DRAM DM643x DM643x DM643x PCI PCI PCI Video IO TVP7000

Multi DM643x Encode Partitioning (Sliced Partition) PCI/VP DM643x Bitstream Assembler ENET/ PCI Encoded Output PCI/VP DM643x PCI/VP DM643x

Multi DM643x - Encode Performance (Sliced Partition) Est. Utilization 480p30 MP 720p30 BP 720p30 MP 1080i60 BP Resource (2 DSPs) (4 DSPs) (4 DSPs) (6 DSPs) CPU 375 MHz 335 MHz 500 MHz 500 MHz VP 10 MB/s 15 MB/s 15 MB/s 20 MB/s 300 MB/s 300 MB/s 520 MB/s 340 MB/s ENET/PCI 10 Mb/s 20 Mb/s 20 Mb/s 20 Mb/s Resource Utilization is worst case of the DSPs in the system 1 Reference Frame, +/- 64 Search range

Multi DM643x Decode Partitioning Decoded Output PCI VP/PCI PCI VP/PCI IDCT Motion Comp + IQ DM643x VLD/ CABAC DM643x De- Blocking ENET/ PCI Encoded Input

Multi DM643x - Decode Performance Est. Utilization 720p30 MP 720p60 BP 1080i60 BP Resource (2 DSPs) (2 DSPs) (2 DSPs) CPU 330 MHz 440 MHz 500 MHz VP 110 MB/s 220 MB/s 240 MB/s 415 MB/s 470 MB/s 530 MB/s ENET/PCI 10 Mb/s 20 Mb/s 20 Mb/s Resource Utilization is worst case of the DSPs in the system

DM643x + FPGA DSP with Specialized Video IO Customizable Acceleration Lower Cost Flexible DSP is ideal for sequential processing steps FPGA is ideal for highly parallel processing steps

643x + FPGA - Architecture DRAM C643x VP DRAM VP FPGA Video IO

DM643x + FPGA Decode Partitioning Decoded Output PCI VP/PCI PCI VP/PCI IDCT Motion Comp + IQ DM643x VLD/ De- Blocking CABAC FPGA ENET/ PCI Encoded Input

DM643x + FPGA - Decode Performance Est. Utilization 720p30 MP 720p60 BP 1080i60 BP Resource (1 DSP) (1 DSP) (1 DSP) CPU 330 MHz 440 MHz 500 MHz VP 60 MB/s 110 MB/s 120 MB/s 150 MB/s 150 MB/s 150 MB/s ENET/PCI 10 Mb/s 20 Mb/s 20 Mb/s Resource Utilization is worst case of the DSPs in the system

Multi C642x Homogenous processing on General Purpose DSP Lowest Cost Flexible Some Scalability Software Differentiable

Multi C642x - Architecture DRAM DRAM DRAM C642x C642x C642x PCI PCI PCI Video IO

Multi C642x Transrate Partitioning C642x + IDCT IQ Entropy Decoding Motion Vectors PCI PCI - DCT Q Residual Info C642x Entropy Encoding ENET/PCI Encoded Input Encoded Output Motion Comp Decode & Re-encode Intra Prediction

Multi C642x - Transrate Performance Est. Utilization Resource CPU PCI ENET/PCI 480p30 MP (2 DSPs) 320 MHz 40 MB/s 155 MB/s 15 Mb/s 720p30 BP (2 DSPs) 570 MHz 105 MB/s 235 MB/s 30 Mb/s Resource Utilization is worst case of the DSPs in the system No Motion Estimation

C642x + FPGA General Purpose DSP Customizable Acceleration Lowest Cost

642x + FPGA - Architecture DRAM C642x PCI DRAM PCI FPGA Video IO

C642x + FPGA Transrate Partitioning C642x + IDCT IQ Entropy Decoding Motion Vectors PCI PCI - DCT Q Residual Info FPGA Entropy Encoding ENET/PCI Encoded Input Encoded Output Motion Comp Intra Prediction

C642x + FPGA - Transrate Performance Est. Utilization Resource CPU PCI ENET/PCI 480p30 MP (2 DSPs) 320 MHz 40 MB/s 155 MB/s 15 Mb/s 720p30 BP (2 DSPs) 570 MHz 105 MB/s 235 MB/s 30 Mb/s Resource Utilization is worst case of the DSPs in the system No Motion Estimation

Multi DM644x (DaVinci ) Homogenous Multiprocessing on Specialized Video DSP On-board accelerators Video IO Wide 3 rd Party & Video Library Support Lower cost

Multi DaVinci - Architecture DRAM DRAM DM644x VP ASP ASP DM644x VP Video IO

Multi DaVinci Encode Partitioning (Sliced Partition) VP DM644x ASP Bitstream Assembler ENET/ PCI Encoded Output DM644x VP ASP

Multi DaVinci - Encode Performance Est. Utilization Resource CPU VP ENET/PCI 480p30 MP (2 DSPs) 310 MHz 10 MB/s 150 MB/s 5 Mb/s 720p30 BP (2 DSPs) 550 MHz 30 MB/s 225 MB/s 10 Mb/s Resource Utilization is worst case of the DSPs in the system 1 Reference Frame, +/- 32 Search range

Multi DaVinci Decode Partitioning VP IDCT IQ DM643x VLD/ CABAC ENET/ PCI Encoded Input + DM644x VP Motion Comp De- Blocking

Multi DaVinci - Decode Performance Est. Utilization Resource CPU VP ENET/PCI 480p30 MP (2 DSPs) 250 MHz 40 MB/s 145 MB/s 5 Mb/s 720p30 BP (2 DSPs) 440 MHz 110 MB/s 220 MB/s 10 Mb/s Resource Utilization is worst case of the DSPs in the system 1 Reference Frame, +/- 32 Search range

Conclusions Elegant solutions exist for video systems that won t fit in one DSP Many different solutions for different Resolutions Quality needs Price points Mixing and combing examples shown creates an even wider spectrum of solutions

DSP Solutions For High Quality Video Systems Todd Hiers Texas Instruments