An H.264/AVC Main Profile Video Decoder Accelerator in a Multimedia SOC Platform

Similar documents
TKT-2431 SoC design. Introduction to exercises. SoC design / September 10

Fujitsu System Applications Support. Fujitsu Microelectronics America, Inc. 02/02

Emerging Architectures for HD Video Transcoding. Jeremiah Golston CTO, Digital Entertainment Products Texas Instruments

TKT-2431 SoC design. Introduction to exercises

Fujitsu SOC Fujitsu Microelectronics America, Inc.

Emerging Architectures for HD Video Transcoding. Leon Adams Worldwide Manager Catalog DSP Marketing Texas Instruments

DSP Solutions For High Quality Video Systems. Todd Hiers Texas Instruments

The Use Of Virtual Platforms In MP-SoC Design. Eshel Haritan, VP Engineering CoWare Inc. MPSoC 2006

Laboratoire d'informatique, de Robotique et de Microélectronique de Montpellier Montpellier Cedex 5 France

LIST OF TABLES. Table 5.1 Specification of mapping of idx to cij for zig-zag scan 46. Table 5.2 Macroblock types 46

Scalable Multi-DM642-based MPEG-2 to H.264 Transcoder. Arvind Raman, Sriram Sethuraman Ittiam Systems (Pvt.) Ltd. Bangalore, India

Video Coding Standards. Yao Wang Polytechnic University, Brooklyn, NY11201 http: //eeweb.poly.edu/~yao

Video Compression Standards (II) A/Prof. Jian Zhang

Upcoming Video Standards. Madhukar Budagavi, Ph.D. DSPS R&D Center, Dallas Texas Instruments Inc.

H.264/AVC und MPEG-4 SVC - die nächsten Generationen der Videokompression

Chapter 11.3 MPEG-2. MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications:

VIDEO COMPRESSION STANDARDS

ECE 111 ECE 111. Advanced Digital Design. Advanced Digital Design Winter, Sujit Dey. Sujit Dey. ECE Department UC San Diego

Lecture 4: Video Compression Standards (Part1) Tutorial 2 : Image/video Coding Techniques. Basic Transform coding Tutorial 2

HotChips An innovative HD video and digital image processor for low-cost digital entertainment products. Deepu Talla.

COMPARATIVE ANALYSIS OF DIRAC PRO-VC-2, H.264 AVC AND AVS CHINA-P7

Video Coding Standards

Advanced Video Coding: The new H.264 video compression standard

Multimedia Decoder Using the Nios II Processor

Digital Video Processing

MPEG-2. And Scalability Support. Nimrod Peleg Update: July.2004

The Scope of Picture and Video Coding Standardization

Product Technical Brief S3C2416 May 2008

EE 5359 H.264 to VC 1 Transcoding

THE H.264 ADVANCED VIDEO COMPRESSION STANDARD

Week 14. Video Compression. Ref: Fundamentals of Multimedia

Video Compression MPEG-4. Market s requirements for Video compression standard

ESE Back End 2.0. D. Gajski, S. Abdi. (with contributions from H. Cho, D. Shin, A. Gerstlauer)

H.264 / AVC (Advanced Video Coding)

100M Gate Designs in FPGAs

Introduction to Video Compression

Product Technical Brief S3C2413 Rev 2.2, Apr. 2006

Introduction to Video Coding

4G WIRELESS VIDEO COMMUNICATIONS

Effective System Design with ARM System IP

Lecture 3: Image & Video Coding Techniques (II) & Standards (I) A/Prof. Jian Zhang

SPEAr: an HW/SW reconfigurable multi processor architecture

Platform-based Design

Hardware Software Co-design and SoC. Neeraj Goel IIT Delhi

Product Technical Brief S3C2412 Rev 2.2, Apr. 2006

Hardware Architecture Design of Video Compression for Multimedia Communication Systems

Over the SoC Verification Hurdles

Complexity Estimation of the H.264 Coded Video Bitstreams

Smart Bus Arbiter for QoS control in H.264 decoders

S2C K7 Prodigy Logic Module Series

System-on-Chip Design Methodology for a Statistical Coder

Video Encoding with. Multicore Processors. March 29, 2007 REAL TIME HD

MAPPING VIDEO CODECS TO HETEROGENEOUS ARCHITECTURES. Mauricio Alvarez-Mesa Techische Universität Berlin - Spin Digital MULTIPROG 2015

FPGA based High Performance CAVLC Implementation for H.264 Video Coding

ASIC Logic. Speaker: Juin-Nan Liu. Adopted from National Chiao-Tung University IP Core Design

Processor and Peripheral IP Cores for Microcontrollers in Embedded Space Applications

Lecture 5: Video Compression Standards (Part2) Tutorial 3 : Introduction to Histogram

Multimedia Standards

Efficient MPEG-2 to H.264/AVC Intra Transcoding in Transform-domain

Interconnects, Memory, GPIO

Designing Embedded Processors in FPGAs

Module Introduction. Content 15 pages 2 questions. Learning Time 25 minutes

Fundamentals of Video Compression. Video Compression

Compressed-Domain Video Processing and Transcoding

ARM Processors for Embedded Applications

5LSE0 - Mod 10 Part 1. MPEG Motion Compensation and Video Coding. MPEG Video / Temporal Prediction (1)

CONTACT: ,

CAMED: Complexity Adaptive Motion Estimation & Mode Decision for H.264 Video

Design and Implementation of High-Performance Master/Slave Memory Controller with Microcontroller Bus Architecture

Standard Codecs. Image compression to advanced video coding. Mohammed Ghanbari. 3rd Edition. The Institution of Engineering and Technology

A 4-way parallel CAVLC design for H.264/AVC 4 Kx2 K 60 fps encoder

Ch. 4: Video Compression Multimedia Systems

SYSTEMS ON CHIP (SOC) FOR EMBEDDED APPLICATIONS

Overview of SOC Architecture design

Intelop. *As new IP blocks become available, please contact the factory for the latest updated info.

Interframe coding A video scene captured as a sequence of frames can be efficiently coded by estimating and compensating for motion between frames pri

DEVELOPMENT AND VERIFICATION OF AHB2APB BRIDGE PROTOCOL USING UVM TECHNIQUE

Low power context adaptive variable length encoder in H.264

Platform-based SoC Design

Automotive Challenges Addressed by Standard and Non-Standard Based IP D&R April 2018 Meredith Lucky VP of Sales, CAST, Inc.

Chapter 6 Storage and Other I/O Topics

DaVinci Family Video Performance

Ittiam Systems (Pvt.) Ltd.,

Architecture of High-throughput Context Adaptive Variable Length Coding Decoder in AVC/H.264

An Infrastructural IP for Interactive MPEG-4 SoC Functional Verification

The SOCks Design Platform. Johannes Grad

August Issue Page 96 of 107 ISSN

Integrated Development Environment

Video Codecs. National Chiao Tung University Chun-Jen Tsai 1/5/2015

High Efficiency Video Coding: The Next Gen Codec. Matthew Goldman Senior Vice President TV Compression Technology Ericsson

10.2 Video Compression with Motion Compensation 10.4 H H.263

Welcome Back to Fundamentals of Multimedia (MR412) Fall, 2012 Chapter 10 ZHU Yongxin, Winson

Co-Design and Co-Verification using a Synchronous Language. Satnam Singh Xilinx Research Labs

System-on-Chip Architecture for Mobile Applications. Sabyasachi Dey

GigaX API for Zynq SoC

Functional modeling style for efficient SW code generation of video codec applications

Hardware-Software Implementation of MPEG-4 Video Codec

Bus AMBA. Advanced Microcontroller Bus Architecture (AMBA)

EMBEDDED SOPC DESIGN WITH NIOS II PROCESSOR AND VHDL EXAMPLES

ISSCC 2006 / SESSION 22 / LOW POWER MULTIMEDIA / 22.1

Transcription:

An H.264/AVC Main Profile Video Decoder Accelerator in a Multimedia SOC Platform Youn-Long Lin Department of Computer Science National Tsing Hua University Hsin-Chu, TAIWAN 300 ylin@cs.nthu.edu.tw 2006/08/16 MPSOC Colorado, USA

Main Points Hardwired design has excellent area, performance, power advantages If it is to be used by 1B people everyday, every bit and every cycle count It is not difficult 15 CS student-years, no background in video nor HDL-based design; neither is the professor RTL design is the easy part; Understanding algorithm and designing architecture are most critical YLLIN NTHU-CS 2

Video Coding Standards Standard MPEG-1 MPEG-2 MPEG-4 H.264 MB size 16*16 16*16(frame) 16*16 16*16 Block size 8*8 8*8 16*16, 8*8 16*16, 16*8, 8*16, 8*8, 8*4, 4*8, 4*4 Transform DCT DCT DCT/ Wavelet 4*4 int transform Entropy coding VLC VLC VLC VLC, CAVLC and CABAC ME, MC Yes Yes Yes 41 MVs per MB Pixel accuracy ½ pel ½ pel ¼ pel ¼ pel Reference frames One frame One frame One frame Multiple (5) frames Picture type I, P, B I, P, B I, P, B I, P, B Transmission rate Up to 1.5 Mbps 2-15 Mbps 64kbps~2Mbps 64kbps ~ 150Mbps YLLIN NTHU-CS 3

Get More for Less H.264 MPEG-2 YLLIN NTHU-CS 4

H.264/AVC Profiles Extended profile SP, SI slice Data partition B slice Weighted prediction Interlace CABAC Main profile Baseline profile Slice group ASO Redundant Slice I slice P slice CAVLC 8x8 transform Quantization matrix Color Sampling FREext (High) profile 8/10/12 bit sampling YLLIN NTHU-CS 5

NTHU H.264/AVC Main Profile Video Decoder Prototype Multimedia SOC Platform FPGA @ 10MHz Main Profile CIF(352x288)@ 30 fps FPGA @ 24MHz Main Profile D1 (720x480)@30fps YLLIN NTHU-CS 6

A Multimedia SOC Platform CPU Accelerator (FPGA) USB(PHY) Daughter Board ROM/ Flash Memory SDRAM FPGA VIC USB 2.0 Static memory SDRAM Controller(4-CH) High-Speed Bus JPEG Codec DMA PWM WDT TIMER APB Bridge Capture Display Controller Peripheral Bus DAI SSI SD SM UART GPIO 12C Audio Codec I2S Flash memory with SSI Video-In Flash Card Button LED CCIR601 TV/LCD YLLIN NTHU-CS 7

H.264/AVC Decoder System Diagram ARM926EJS TV Master Timer Slave SD Card Slave UART Slave AHB1 Slave H264 Master Slave SDC Slave Slave SDC Slave SDRAM LM SDRAM AHB2 YLLIN NTHU-CS 8

H.264/AVC Decoder Architecture SD Card SDRAM CPU Storage Device Input/Ref./Display Frame Display AHB MV Parser Para MC Ref idx reconstruct CABAD MBinfo Intra pred Pred Pic Rec unfilter DF CAVLD Coeff IQ/IDCT Residual DECODER YLLIN NTHU-CS 9

Hierarchical FSM in Main Controller Frame Level MB Level CABAC FSM CABAC MC FSM MC rden rd_addr rd_data Type decoder Main FSM IPRED FSM IQ/IDCT FSM IPRED IQ/IDCT PICREC FSM PICREC Main controller DF FSM DF YLLIN NTHU-CS 10

AMBA interface AHB A slave wrapper LM control register H.264 Decoder MFU VLC & TV OUT DF & MC arbiter 1 arbiter 2 SDC master wrapper 1 master wrapper 2 YLLIN NTHU-CS AHB B 11

Our Design Flow Software spec. in C & Acceleration specify User Spec. SW lib. C models, drivers API Platform spec. Embedded Software Compilation System.h Acceleration System configuration System description Co-Sim Evaluation No Software image HW/SW co-simulation Area & Timing & Power evaluation Performance constraint Yes FPGA prototyping Acceleration HW lib. HDL IPs Platform model Hardware image System Integrate HW IP Synthesizer Accelerator.v System generation System.v Integration Pin assignment & Hardware compilation Parameterized ISS FPGA Verify YLLIN NTHU-CS 12

Memory Traffic Consideration One SDRAM for All External Storage SDRAM Burst Mode Internal Storage for Compact Access & Data Reuse SDRAM Encoded Bitstream Reference Frames Currently Reconstructed Frame Display Buffer YLLIN NTHU-CS 13

Buffer Size vs Bus Traffic Frame per Sec 60 50 40 30 20 10 16MHz/TV 16MHz/LCD 24MHz/TV 24MHz/LCD 0 1 3 5 7 9 11 13 15 17 19 21 Buffer Size YLLIN NTHU-CS 14

Performance Comparison DSP Core HW Accelerated Gate Count 230K 180K 180K MHz 200 10 24 Profile Baseline Main Main Resolution QCIF (176x144) CIF (352x288) D1 (720x480) Frame Rate 15 30 30 YLLIN NTHU-CS 15

Summary An H.264/AVC main profile decoder on an ad hoc multimedia SOC platform Hardware-accelerated approach is highperformance and energy-efficient Memory traffic has major impact on performance It is not as difficult as you may think; algorithm and architecture are critical; writing Verilog is no difference from writing C Do not try to parallelize Reference Software; it is just proof of concept; not an implementation YLLIN NTHU-CS 16

Demo Video YLLIN NTHU-CS 17