ISSCC 2006 / SESSION 22 / LOW POWER MULTIMEDIA / 22.1

Similar documents
IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 42, NO. 1, JANUARY

Chapter 2 Joint MPEG-2 and H.264/AVC Decoder

ISSCC 2001 / SESSION 9 / INTEGRATED MULTIMEDIA PROCESSORS / 9.2

Hardware Architecture Design of Video Compression for Multimedia Communication Systems

RECENTLY, researches on gigabit wireless personal area

Multimedia Decoder Using the Nios II Processor

Fast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE Gaurav Hansda

Laboratoire d'informatique, de Robotique et de Microélectronique de Montpellier Montpellier Cedex 5 France

Analysis and Architecture Design of Variable Block Size Motion Estimation for H.264/AVC

Advanced Video Coding: The new H.264 video compression standard

Video Compression An Introduction

High Efficiency Data Access System Architecture for Deblocking Filter Supporting Multiple Video Coding Standards

Real-time and smooth scalable video streaming system with bitstream extractor intellectual property implementation

FPGA based High Performance CAVLC Implementation for H.264 Video Coding

IMPLEMENTATION OF DEBLOCKING FILTER ALGORITHM USING RECONFIGURABLE ARCHITECTURE

VIDEO COMPRESSION STANDARDS

A Novel Deblocking Filter Algorithm In H.264 for Real Time Implementation

BANDWIDTH-EFFICIENT ENCODER FRAMEWORK FOR H.264/AVC SCALABLE EXTENSION. Yi-Hau Chen, Tzu-Der Chuang, Yu-Jen Chen, and Liang-Gee Chen

A SCALABLE COMPUTING AND MEMORY ARCHITECTURE FOR VARIABLE BLOCK SIZE MOTION ESTIMATION ON FIELD-PROGRAMMABLE GATE ARRAYS. Theepan Moorthy and Andy Ye

Multicore SoC is coming. Scalable and Reconfigurable Stream Processor for Mobile Multimedia Systems. Source: 2007 ISSCC and IDF.

FAST SPATIAL LAYER MODE DECISION BASED ON TEMPORAL LEVELS IN H.264/AVC SCALABLE EXTENSION

Fast frame memory access method for H.264/AVC

A VLSI Architecture for H.264/AVC Variable Block Size Motion Estimation

H.264/AVC und MPEG-4 SVC - die nächsten Generationen der Videokompression

A COST-EFFICIENT RESIDUAL PREDICTION VLSI ARCHITECTURE FOR H.264/AVC SCALABLE EXTENSION

2014 Summer School on MPEG/VCEG Video. Video Coding Concept

Chapter 11.3 MPEG-2. MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications:

OVERVIEW OF IEEE 1857 VIDEO CODING STANDARD

Design of a High Speed CAVLC Encoder and Decoder with Parallel Data Path

FAST FOURIER TRANSFORM (FFT) and inverse fast

Vertex Shader Design I

Advances of MPEG Scalable Video Coding Standard

Complexity Estimation of the H.264 Coded Video Bitstreams

A high-level simulator for the H.264/AVC decoding process in multi-core systems

Introduction to Video Compression

International Journal of Emerging Technology and Advanced Engineering Website: (ISSN , Volume 2, Issue 4, April 2012)

PERFORMANCE ANALYSIS OF AVS-M AND ITS APPLICATION IN MOBILE ENVIRONMENT

Week 14. Video Compression. Ref: Fundamentals of Multimedia

STACK ROBUST FINE GRANULARITY SCALABLE VIDEO CODING

Video Encoding with. Multicore Processors. March 29, 2007 REAL TIME HD

10.2 Video Compression with Motion Compensation 10.4 H H.263

High-Throughput Parallel Architecture for H.265/HEVC Deblocking Filter *

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework

Digital Video Processing

MPEG-4: Simple Profile (SP)

THE orthogonal frequency-division multiplex (OFDM)

EE 5359 H.264 to VC 1 Transcoding

Homogeneous Transcoding of HEVC for bit rate reduction

Ch. 4: Video Compression Multimedia Systems

A Motion Vector Predictor Architecture for AVS and MPEG-2 HDTV Decoder

MPEG-2. ISO/IEC (or ITU-T H.262)

DUE to the high computational complexity and real-time

5LSE0 - Mod 10 Part 1. MPEG Motion Compensation and Video Coding. MPEG Video / Temporal Prediction (1)

A COMPARISON OF CABAC THROUGHPUT FOR HEVC/H.265 VS. AVC/H.264. Massachusetts Institute of Technology Texas Instruments

BANDWIDTH REDUCTION SCHEMES FOR MPEG-2 TO H.264 TRANSCODER DESIGN

H.264 / AVC (Advanced Video Coding)

VIDEO AND IMAGE PROCESSING USING DSP AND PFGA. Chapter 3: Video Processing

High Performance VLSI Architecture of Fractional Motion Estimation for H.264/AVC

Interframe coding A video scene captured as a sequence of frames can be efficiently coded by estimating and compensating for motion between frames pri

Design of Entropy Decoding Module in Dual-Mode Video Decoding Chip for H. 264 and AVS Based on SOPC Hong-Min Yang, Zhen-Lei Zhang, and Hai-Yan Kong

Chapter 10. Basic Video Compression Techniques Introduction to Video Compression 10.2 Video Compression with Motion Compensation

Reducing/eliminating visual artifacts in HEVC by the deblocking filter.

The S6000 Family of Processors

Scalable Multi-DM642-based MPEG-2 to H.264 Transcoder. Arvind Raman, Sriram Sethuraman Ittiam Systems (Pvt.) Ltd. Bangalore, India

Research on Transcoding of MPEG-2/H.264 Video Compression

The Design and Evaluation of Hierarchical Multilevel Parallelisms for H.264 Encoder on Multi-core. Architecture.

Welcome Back to Fundamentals of Multimedia (MR412) Fall, 2012 Chapter 10 ZHU Yongxin, Winson

High Efficiency Video Decoding on Multicore Processor

Objective: Introduction: To: Dr. K. R. Rao. From: Kaustubh V. Dhonsale (UTA id: ) Date: 04/24/2012

A Parallel Transaction-Level Model of H.264 Video Decoder

ISSCC 2003 / SESSION 14 / MICROPROCESSORS / PAPER 14.5

Complexity Reduced Mode Selection of H.264/AVC Intra Coding

A macroblock-level analysis on the dynamic behaviour of an H.264 decoder

Efficient MPEG-2 to H.264/AVC Intra Transcoding in Transform-domain

TRADITIONALLY, architectural design focuses more on

Upcoming Video Standards. Madhukar Budagavi, Ph.D. DSPS R&D Center, Dallas Texas Instruments Inc.

A Dynamic Quality-Adjustable H.264 Video Encoder for Power-Aware Video Applications

Architecture of High-throughput Context Adaptive Variable Length Coding Decoder in AVC/H.264

Deblocking Filter Algorithm with Low Complexity for H.264 Video Coding

The VC-1 and H.264 Video Compression Standards for Broadband Video Services

Validation of a Real-time AVS Encoder on FPGA

Research of UAV Video Compression System based on DM6467

IN RECENT years, multimedia application has become more

Filter-Based Dual-Voltage Architecture for Low-Power Long-Word TCAM Design

A Low Power 720p Motion Estimation Processor with 3D Stacked Memory

STUDY AND IMPLEMENTATION OF VIDEO COMPRESSION STANDARDS (H.264/AVC, DIRAC)

2D/3D Graphics Accelerator for Mobile Multimedia Applications. Ramchan Woo, Sohn, Seong-Jun Song, Young-Don

A Partial Memory Protection Scheme for Higher Effective Yield of Embedded Memory for Video Data

PREFACE...XIII ACKNOWLEDGEMENTS...XV

MPEG-2. And Scalability Support. Nimrod Peleg Update: July.2004

Video coding. Concepts and notations.

A 167-processor Computational Array for Highly-Efficient DSP and Embedded Application Processing

Compressed-Domain Video Processing and Transcoding

Cross Layer Protocol Design

EE 5359 Low Complexity H.264 encoder for mobile applications. Thejaswini Purushotham Student I.D.: Date: February 18,2010

A 50Mvertices/s Graphics Processor with Fixed-Point Programmable Vertex Shader for Mobile Applications

Professor, CSE Department, Nirma University, Ahmedabad, India

Audio and video compression

High Efficiency Video Coding (HEVC) test model HM vs. HM- 16.6: objective and subjective performance analysis

High-Performance VLSI Architecture of H.264/AVC CAVLD by Parallel Run_before Estimation Algorithm *

Transcription:

ISSCC 26 / SESSION 22 / LOW POWER MULTIMEDIA / 22.1 22.1 A 125µW, Fully Scalable MPEG-2 and H.264/AVC Video Decoder for Mobile Applications Tsu-Ming Liu 1, Ting-An Lin 2, Sheng-Zen Wang 2, Wen-Ping Lee 1, Kang-Cheng Hou 1, Jiun-Yan Yang 1 Chen-Yi Lee 1 1 National Chiao Tung University, Hsinchu, Taiwan 2 Mediatek, Hsinchu, Taiwan A single-chip MPEG-2 SP@ML and H.264/AVC BL@L4 video decoder is fabricated in a.18µm 1P6M CMOS technology with an area of 15.21mm 2. This chip contains 19.2kb and 3.55kb of embedded SRAM for storing neighboring pixels and control tags, and adopts two s for further system integration. It operates at a power-level that is about one order of magnitude less than comparable decoders. This savings in power consumption was attained by means of both throughput and bandwidth improvements while incorporating scalable features. For mobile applications, MPEG-2 and H.264/AVC video decoding of QCIF sequences at 15 frames per second is achieved at a clock frequency of 1.15MHz and requires 18µW and 125µW, respectively, at 1V supply voltage. Moreover, CIF, D1 and HD resolutions are also supported. The chip features are summarized in Fig. 22.1.1. The advent of H.264/AVC provides high compression ratio, but, there is no backward compatibility to the prevalent MPEG-x and H.26x video coding standards. MPEG-2 [1] and H.264/AVC [3] processors have been reported at ISSCC. However, these solutions used separate modules and only processed a single type of video content in each module. To support different system requirements such as DVB-H or HD-DVD, a scalable pipeline is exploited to efficiently integrate both MPEG-2 and H.264/AVC in a single chip. Figure 22.1.2 shows the system block diagram of the proposed dual-video decoder chip. The interface is exploited to access the external. Reading and writing processes are issued by the motion compensation and deblocking filter, respectively. Furthermore, to reduce the bandwidth between the external and deblocking filter, a separate data bus and display engine are utilized for on screen display (OSD) through a direct display interface. The syntax parser, entropy decoder, inverse transformation and deblocking filter for MPEG-2 and H.264/AVC standard have been tightly combined based on characteristics of pipeline scalability. To achieve low power consumption, a simple prediction circuit is interfaced to the embedded SRAM to make a better trade-off between memory cost and transmission bandwidth. Furthermore, a high-throughput motion compensation and deblocking filter is developed to reduce the clock frequency and lower the power requirements. Figure 22.1.3 shows the scalable pipeline for the dual-standard architecture. In H.264/AVC, a 4 4 sub-block is the smallest element to be processed. However, an 8 8 block size is adopted in the MPEG-2 standard. To integrate them efficiently, the buffer size is kept as 8 8 and transfers for both standards are into a 4 4 processing unit since all blocks can be considered as a super-set of a 4 4 sub-block. Compared to a macroblock level pipeline, the power consumption of the pipelined registers is reduced by 75% in MPEG-2 and 93.75% in H.264/AVC, through a clock gating scheme. Input data of the IDCT module is partitioned into even and odd components and they are processed as a 4 4 IDCT, through a recursive IDCT algorithm. The in-loop filter is defined by H.264/AVC and the post-loop filter follows the prevalent MPEG-x standard. However, the performance improvement is very small (only.4db) when applying in-loop filter as a post filter in the prevalent MPEG standard. In Fig. 22.1.3, a H.264-like algorithm for post filtering is used to retain the filtering performance and reduce the integration cost. Finally, the PSNR gains of.2db can be achieved, as compared to an un-filtered design, with an additional gate count of only 2% of the in-loop filter requirement. Figure 22.1.4 shows the proposed bandwidth scalability of the prediction circuit. H.264/AVC achieves a high compression ratio since it utilizes the neighboring pixels to obtain a reliable predictor reducing the prediction errors. However, high data correlation also leads to a design challenge in terms of transmission bandwidth and internal memory cost. In the design, a simple prediction circuit, where a 19.2kb pixel SRAM is employed to cache the pixels of upper neighbors, improves the external bandwidth. The key idea is that not all neighboring pixels should be stored in the internal memory. In certain sequences, most edges are determined as a horizontal prediction in intra-prediction or SKIP mode in the deblocking filter. There is no need to keep them for follow-up decoding procedures. The proposed prediction circuit generates a TAG signal to predict whether the pixel data of the next row of a macroblock should be kept or not; but a prediction miss may occur. Therefore, we provide a flexible solution at the architectural level where a compromise is made between external bandwidth and memory cost. Compared to the intra-prediction and deblocking filter in [3], the proposed prediction circuit saves 33% of the bandwidth and 4% of the internal memory size on average. Figure 22.1.5 shows the processing cycle breakdown of different architectural stages. Several high throughput architectures have been implemented on this chip [4]. A 1 4 decoding order in Figure 22.1.3, context switch buffer and efficient access scheduling is exploited to achieve a 39% cycle reduction in MC. A novel prediction and hybrid schedule reduce the processing cycles a further 35%. Therefore, 11.4Mpixels/s of maximal decoding capability is achieved. Compared to the initial power consumption for 66.21 and 4.43Mpixels/s rates, the savings are 33% and 59%, respectively, for real-time decoding at QCIF resolution. Figure 22.1.6 presents a comparison of power consumption with existing designs [1][2]. Under an identical design specification, the proposed techniques lead to lower system clock rate and supply voltage and thus lower power dissipation. For mobile applications, the power reduction of this chip is about one order of magnitude compared to existing decoders and could be further improved through voltage scaling. Figure 22.1.7 shows a chip micrograph of this dual-video decoder design. Acknowledgements: Authors thank Jeng-Bin Chen, Ching-Che Chung, Wen-Hsiao Peng, Wei- Chin Lee for insightful discussions on this work. Authors also thank Chip Implementation Center (CIC) for testing services. References: [1] H. Yamauchi, et al., A.8W HDTV Video Processor with Simultaneous Decoding of Two MPEG-2-MP@HL Streams and Capable of 3frames/s Reverse Playback, ISSCC Dig. Tech. Papers, pp. 372-474, Feb., 22. [2] Hae-Yong Kang, et al., MPEG4 AVC/H.264 Decoder with Scalable Bus Architecture and Dual Memory Controller, IEEE Intl Symp. on Circuits and Systems, pp. II-145 - II-148, May, 24. [3] Yu-Wen Huang, et al., A 1.3TOPS H.264/AVC Single-Chip Encoder for HDTV Applications, ISSCC Dig. Tech. Papers, pp. 128-129, Feb., 25. [4] Tsu-Ming Liu, et al., An 865µW H.264/AVC Video Decoder for Mobile Applications, IEEE Asian Solid-State Circuits Conference, pp. 31-34, Nov., 25. 26 IEEE International Solid-State Circuits Conference 1-4244-79-1/6 26 IEEE

ISSCC 26 / February 8, 26 / 8:3 AM Specification Technology Die Size Package Logic Gates Internal External Core Power Consumption Memory Max. System Clock Max. Processing Throughput MPEG-2 H.264/AVC MPEG-2 SP@ML Dual H.264/AVC BL@L4 Standard.18 m 1P6M CMOS 1.8V core, 3.3V I/O 3.9mm 3.9mm 28-pin CQFP 33.78K 22.75Kb 2 SRAMs s 1MHz 11.4 Mpixels/sec 18 W (1.15MHz@1V, QCIF@15fps) 1.4mW (16.6MHz@1.2V, D1@3fps) 125 W (1.15MHz@1V, QCIF@15fps) 12.4mW (16.6MHz@1.2V, D1@3fps) Stream Buffer Stream Input Host Processor System BUS H.264/MPEG-2 Syntax Parser Compensation CAVLC/VLC Unit I-ZZ I-Q Predicted Path Slice Pixel SRAM MB Pixel SRAM Peripherals Interface Sync. Adder 4x4/8x8 IDCT Post-loop Residual Path 1 : MPEG-2 1: H.264/AVC In/Post-loop In/Post-Loop De-blocking Filter Display Interface YUV/ CCIR656 Display Engine Figure 22.1.1: Chip summary. Figure 22.1.2: System block diagram. 4x4 BUF Compensation BUF Novel Ordering Sync. FIFO... Pixel SRAM Pixel SRAM Compensation read In/Post-Loop De-blocking Filter Display BUF Slice Pixel Memory Pipelined Level: 8x8,4x4 Even/Odd Partition Pipeline Scalability CAVLC/VLC 1-D IDCT I-ZZ I-Q Central Controller 4x4/8x8 IDCT Not filter (Skip mode) Yes Filter? 4x4,bS: H.264 in-loop 8x8, eq_cnt: MPEG-2 post-loop 4x4 Pipelined Level: 4x4/ 8x8 Triple Mode Decision (bs/eq_cnt) 4x4/ 8x8 Triple P-i-P-o Edge Filter MPEG-2 Decoding H.264/AVC Decoding SKIP bs = / eq_cnt<t3 H.264's Strong Edge Filtering bs=4 / eq_cnt>=t2 H.264/MPEG-2 Weak Edge Filtering <bs<4 / T3=<eq_cnt<T2 External Bandwidth (Mbytes/sec) Stefan@18HD 3 25 2 15 1 15kbps 45kbps 1.5Mbps 5 1 2 3 4 5 6 7 8 x 1 4 Mother & Daughter@CIF 14 15kbps 12 45kbps 1.5Mbps 1 8 6 4 5 1 15 Memory size (bits) BW,SIZE = Predictor + Deblocking Filter 18HD@3fps 3 Deblocking Filter Total BW 1 Luma SIZE 2 BW SIZE BW Luma SIZE Design 1.5k 62M 279M.5k Design 2 94M 61.44k 15.4k 94M 76.8k [3].5k 15.4k 15.9k Proposed ( x W/8) 111.33M 7.6k 34.74M 1.92k 146.7M 9.6k 1 : Bytes/sec 2 : bits (Luma) 3 : Stefan@15kbps Figure 22.1.3: Pipeline scalability with dual-standard architecture. Figure 22.1.4: Bandwidth scalability with prediction circuit. Power Consumption(mW) 12 1 8 6 4 2 3.5 3 2.5 2 1.5 Maximal decoding 1 capability.5 94(18HD) 41.5(72HD) 15.5(D1) 4.56(CIF).57(QCIF) Throughput(MPixels/sec) Figure 22.1.5: Throughput improvement of proposed design. Power consumption(mw) QCIF@15fps, 1.8V 4.43 26.95 44.14 66.21 67.36 This Work Mpixels/sec Mpixels/sec W Core power consumption(mw) 1 3 1 2 mw 1 1 uw 1 1-1 1 2 3 4 5 6 7 Decoding throughput (MPixels/sec) Figure 22.1.6: Power consumption. This work@h.264 Decoding This work@mpeg-2 Decoding H.264 Decoder CIF@1.2V,3MHz Power(uW) 25 2 15 1 5 H.264/AVC MPEG-2 Decoder 18HD@1.8V,135MHz H.264 Decoder 18HD@1.2V,13MHz MPEG-2 1.8 1.6 1.4 1.2 1 Voltage(V) 26 IEEE International Solid-State Circuits Conference 1-4244-79-1/6 26 IEEE

ISSCC 26 / SESSION 22 / LOW POWER MULTIMEDIA / 22.1 Figure 22.1.7: Chip micrograph. 26 IEEE International Solid-State Circuits Conference 1-4244-79-1/6 26 IEEE

ISSCC 26 / SESSION 22 / LOW POWER MULTIMEDIA / 22.1 Specification Dual MPEG-2 SP@ML H.264/AVC BL@L4 Technology Die Size Package Logic Gates Standard.18 m 1P6M CMOS 1.8V core, 3.3V I/O 3.9mm 3.9mm 28-pin CQFP 33.78K Internal External Memory 22.75Kb 2 SRAMs s Max. System Clock Max. Processing Throughput 1MHz 11.4 Mpixels/sec Core Power Consumption MPEG-2 H.264/AVC 18 W (1.15MHz@1V, QCIF@15fps) 1.4mW (16.6MHz@1.2V, D1@3fps) 125 W (1.15MHz@1V, QCIF@15fps) 12.4mW (16.6MHz@1.2V, D1@3fps) Figure 22.1.1: Chip summary. 26 IEEE International Solid-State Circuits Conference 1-4244-79-1/6 26 IEEE

ISSCC 26 / SESSION 22 / LOW POWER MULTIMEDIA / 22.1 Host Processor Peripherals System BUS Stream Input Stream Buffer H.264/MPEG-2 Syntax Parser Compensation CAVLC/VLC I-ZZ I-Q Predicted Path Interface Sync. Adder 4x4/8x8 IDCT MB Pixel SRAM Post-loop Residual Path 1 : MPEG-2 1: H.264/AVC In/Post-loop In/Post-Loop De-blocking Filter Display Interface YUV/ CCIR656 Display Engine Unit Slice Pixel SRAM Figure 22.1.2: System block diagram. 26 IEEE International Solid-State Circuits Conference 1-4244-79-1/6 26 IEEE

ISSCC 26 / SESSION 22 / LOW POWER MULTIMEDIA / 22.1 Compensation 4x4 BUF BUF Novel Ordering Sync. FIFO... Pixel SRAM Pixel SRAM Compensation read In/Post-Loop De-blocking Filter Display BUF Pipeline Scalability CAVLC/VLC I-ZZ I-Q 4x4/8x8 IDCT 4x4 MPEG-2 Decoding H.264/AVC Decoding Pipelined Level: 8x8,4x4 Pipelined Level: Not filter (Skip mode) SKIP Even/Odd Partition 1-D IDCT Central Controller Filter? Yes 4x4,bS: H.264 in-loop 8x8, eq_cnt: MPEG-2 post-loop 4x4/ 8x8 Triple Mode Decision (bs/eq_cnt) 4x4/ 8x8 Triple P-i-P-o Edge Filter bs = / eq_cnt<t3 H.264's Strong Edge Filtering bs=4 / eq_cnt>=t2 H.264/MPEG-2 Weak Edge Filtering <bs<4 / T3=<eq_cnt<T2 Figure 22.1.3: Pipeline scalability with dual-standard architecture. 26 IEEE International Solid-State Circuits Conference 1-4244-79-1/6 26 IEEE

ISSCC 26 / SESSION 22 / LOW POWER MULTIMEDIA / 22.1 Slice Pixel Memory External Bandwidth (Mbytes/sec) 3 Stefan@18HD 25 2 15 1 5 1 2 3 4 5 6 7 8 x 1 4 14 12 1 8 6 Mother & Daughter@CIF 4 5 1 15 BW,SIZE = Predictor + Deblocking Filter 15kbps 45kbps 1.5Mbps 15kbps 45kbps 1.5Mbps Memory size (bits) 18HD@3fps 3 Deblocking Filter Total BW 1 Luma SIZE 2 BW SIZE BW Luma SIZE Design 1.5k 62M 279M.5k Design 2 94M 61.44k 15.4k 94M 76.8k.5k [3] 15.4k 15.9k Proposed ( x W/8) 111.33M 7.6k 34.74M 1.92k 146.7M 9.6k 1 : Bytes/sec 2 : bits (Luma) 3 : Stefan@15kbps Figure 22.1.4: Bandwidth scalability with prediction circuit. 26 IEEE International Solid-State Circuits Conference 1-4244-79-1/6 26 IEEE

ISSCC 26 / SESSION 22 / LOW POWER MULTIMEDIA / 22.1 Power Consumption(mW) 12 1 8 6 4 2 Maximal decoding capability 94(18HD) 41.5(72HD) 15.5(D1) 4.56(CIF).57(QCIF) Throughput(MPixels/sec) Power consumption(mw) 3.5 3 2.5 2 1.5 1.5 QCIF@15fps, 1.8V 4.43 26.95 44.14 66.21 67.36 This Work Mpixels/sec Mpixels/sec Figure 22.1.5: Throughput improvement of proposed design. 26 IEEE International Solid-State Circuits Conference 1-4244-79-1/6 26 IEEE

ISSCC 26 / SESSION 22 / LOW POWER MULTIMEDIA / 22.1 W Core power consumption(mw) mw uw 1 3 1 2 1 1 1 This work@h.264 Decoding This work@mpeg-2 Decoding H.264 Decoder CIF@1.2V,3MHz Power(uW) 25 2 15 1 5 H.264/AVC MPEG-2 Decoder 18HD@1.8V,135MHz H.264 Decoder 18HD@1.2V,13MHz MPEG-2 1.8 1.6 1.4 1.2 1 Voltage(V) 1-1 1 2 3 4 5 6 7 Figure 22.1.6: Power consumption. Decoding throughput (MPixels/sec) 26 IEEE International Solid-State Circuits Conference 1-4244-79-1/6 26 IEEE

ISSCC 26 / SESSION 22 / LOW POWER MULTIMEDIA / 22.1 Figure 22.1.7: Chip micrograph. 26 IEEE International Solid-State Circuits Conference 1-4244-79-1/6 26 IEEE