CAMED: Complexity Adaptive Motion Estimation & Mode Decision for H.264 Video

Similar documents
Digital Video Processing

LIST OF TABLES. Table 5.1 Specification of mapping of idx to cij for zig-zag scan 46. Table 5.2 Macroblock types 46

Introduction to Video Compression

Efficient MPEG-2 to H.264/AVC Intra Transcoding in Transform-domain

STACK ROBUST FINE GRANULARITY SCALABLE VIDEO CODING

An Improved H.26L Coder Using Lagrangian Coder Control. Summary

Laboratoire d'informatique, de Robotique et de Microélectronique de Montpellier Montpellier Cedex 5 France

A COST-EFFICIENT RESIDUAL PREDICTION VLSI ARCHITECTURE FOR H.264/AVC SCALABLE EXTENSION

Advanced Video Coding: The new H.264 video compression standard

Complexity Reduced Mode Selection of H.264/AVC Intra Coding

STUDY AND IMPLEMENTATION OF VIDEO COMPRESSION STANDARDS (H.264/AVC, DIRAC)

VIDEO COMPRESSION STANDARDS

Overview: motion-compensated coding

Fast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE Gaurav Hansda

Fast Mode Decision for H.264/AVC Using Mode Prediction

A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames

H.264/AVC und MPEG-4 SVC - die nächsten Generationen der Videokompression

Upcoming Video Standards. Madhukar Budagavi, Ph.D. DSPS R&D Center, Dallas Texas Instruments Inc.

Lecture 13 Video Coding H.264 / MPEG4 AVC

Xiaoqing Zhu, Sangeun Han and Bernd Girod Information Systems Laboratory Department of Electrical Engineering Stanford University

ARTICLE IN PRESS. Signal Processing: Image Communication

THE H.264 ADVANCED VIDEO COMPRESSION STANDARD

H.264 STANDARD BASED SIDE INFORMATION GENERATION IN WYNER-ZIV CODING

COMPARATIVE ANALYSIS OF DIRAC PRO-VC-2, H.264 AVC AND AVS CHINA-P7

OVERVIEW OF IEEE 1857 VIDEO CODING STANDARD

The Scope of Picture and Video Coding Standardization

In the name of Allah. the compassionate, the merciful

Fraunhofer Institute for Telecommunications - Heinrich Hertz Institute (HHI)

Complexity/Performance Analysis of a H.264/AVC Video Encoder

2014 Summer School on MPEG/VCEG Video. Video Coding Concept

SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC

PERFORMANCE ANALYSIS OF AN H.263 VIDEO ENCODER FOR VIRAM

Introduction to Video Coding

Low complexity H.264 list decoder for enhanced quality real-time video over IP

H.264 Video Transmission with High Quality and Low Bitrate over Wireless Network

IN RECENT years, multimedia application has become more

Chapter 10. Basic Video Compression Techniques Introduction to Video Compression 10.2 Video Compression with Motion Compensation

CMPT 365 Multimedia Systems. Media Compression - Video

High Efficiency Video Coding. Li Li 2016/10/18

IBM Research Report. Inter Mode Selection for H.264/AVC Using Time-Efficient Learning-Theoretic Algorithms

Toward Optimal Pixel Decimation Patterns for Block Matching in Motion Estimation

Recent, Current and Future Developments in Video Coding

Lec 08 Video Signal Processing I

A Novel Deblocking Filter Algorithm In H.264 for Real Time Implementation

Distributed Video Coding

Region-Based Rate-Control for H.264/AVC. for Low Bit-Rate Applications

A macroblock-level analysis on the dynamic behaviour of an H.264 decoder

Video Compression MPEG-4. Market s requirements for Video compression standard

Introduction to Video Encoding

Title Adaptive Lagrange Multiplier for Low Bit Rates in H.264.

Performance and Complexity Co-evaluation of the Advanced Video Coding Standard for Cost-Effective Multimedia Communications

Coding of Coefficients of two-dimensional non-separable Adaptive Wiener Interpolation Filter

Compressed-Domain Video Processing and Transcoding

Reduced Frame Quantization in Video Coding

Understanding Sources of Inefficiency in General-Purpose Chips. Hameed, Rehan, et al. PRESENTED BY: XIAOMING GUO SIJIA HE

International Journal of Emerging Technology and Advanced Engineering Website: (ISSN , Volume 2, Issue 4, April 2012)

Frequency Band Coding Mode Selection for Key Frames of Wyner-Ziv Video Coding

Video compression with 1-D directional transforms in H.264/AVC

Scalable Extension of HEVC 한종기

EE 5359 MULTIMEDIA PROCESSING SPRING Final Report IMPLEMENTATION AND ANALYSIS OF DIRECTIONAL DISCRETE COSINE TRANSFORM IN H.

Deblocking Filter Algorithm with Low Complexity for H.264 Video Coding

High Efficient Intra Coding Algorithm for H.265/HVC

H.264 to MPEG-4 Transcoding Using Block Type Information

EE 5359 H.264 to VC 1 Transcoding

Lec 08 Video Signal Processing I

ERROR-ROBUST INTER/INTRA MACROBLOCK MODE SELECTION USING ISOLATED REGIONS

FAST SPATIAL LAYER MODE DECISION BASED ON TEMPORAL LEVELS IN H.264/AVC SCALABLE EXTENSION

The Application of H.264 Video Coding Technology in the Digital Monitoring Systems

Fast Motion Estimation on Graphics Hardware for H.264 Video Encoding

Implementation and analysis of Directional DCT in H.264

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 19, NO. 9, SEPTEMBER

Complexity Estimation of the H.264 Coded Video Bitstreams

H.264/AVC Baseline Profile to MPEG-4 Visual Simple Profile Transcoding to Reduce the Spatial Resolution

Editorial Manager(tm) for Journal of Real-Time Image Processing Manuscript Draft

Transcoding from H.264/AVC to High Efficiency Video Coding (HEVC)

Energy scalability and the RESUME scalable video codec

4G WIRELESS VIDEO COMMUNICATIONS

Multimedia Systems Video II (Video Coding) Mahdi Amiri April 2012 Sharif University of Technology

Reliable Video Broadcasting for the E-Learning Environment

Video Coding Standards. Yao Wang Polytechnic University, Brooklyn, NY11201 http: //eeweb.poly.edu/~yao

Fast Implementation of VC-1 with Modified Motion Estimation and Adaptive Block Transform

EE Low Complexity H.264 encoder for mobile applications

Video coding. Concepts and notations.

A GPU-Based DVC to H.264/AVC Transcoder

Welcome Back to Fundamentals of Multimedia (MR412) Fall, 2012 Chapter 10 ZHU Yongxin, Winson

Overview: motion estimation. Differential motion estimation

Homogeneous Transcoding of HEVC for bit rate reduction

An Efficient Motion Estimation Method for H.264-Based Video Transcoding with Arbitrary Spatial Resolution Conversion

Article, Postprint version This version is available at

Video Coding Standards

10.2 Video Compression with Motion Compensation 10.4 H H.263

A COMPARISON OF CABAC THROUGHPUT FOR HEVC/H.265 VS. AVC/H.264. Massachusetts Institute of Technology Texas Instruments

An H.264/AVC Main Profile Video Decoder Accelerator in a Multimedia SOC Platform

BANDWIDTH-EFFICIENT ENCODER FRAMEWORK FOR H.264/AVC SCALABLE EXTENSION. Yi-Hau Chen, Tzu-Der Chuang, Yu-Jen Chen, and Liang-Gee Chen

RATE DISTORTION OPTIMIZATION FOR INTERPREDICTION IN H.264/AVC VIDEO CODING

IMPLEMENTATION OF H.264 DECODER ON SANDBLASTER DSP Vaidyanathan Ramadurai, Sanjay Jinturkar, Mayan Moudgill, John Glossner

Video Codecs. National Chiao Tung University Chun-Jen Tsai 1/5/2015

ECE 417 Guest Lecture Video Compression in MPEG-1/2/4. Min-Hsuan Tsai Apr 02, 2013

New Techniques for Improved Video Coding

Chapter 11.3 MPEG-2. MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications:

Transcription:

ICASSP 6 CAMED: Complexity Adaptive Motion Estimation & Mode Decision for H.264 Video Yong Wang Prof. Shih-Fu Chang Digital Video and Multimedia (DVMM) Lab, Columbia University

Outline Complexity aware H.264 coding CAMED Rate-Distortion-Complexity video coding Complexity cost function Complexity control Experiment results Conclusions & future work 2

H.264: Emerging Video Standard More efficient motion prediction Variable block size motion compensation (as small as 4 x 4) Quarter pixel interpolation Multiple reference frames (up to 5) Decoupling of referencing order from display order Directional spatial prediction for intra coding SKIP/DIRECT Mode Improved coding efficiency Small block-size transform (4 x 4 integer transform) Lossless inverse transform Context-based adaptive binary arithmetic coding (CABAC) Switching P (SP) and switching I (SI) pictures 3

Complexity Issue In H.264 Much better quality, with much higher complexity [Schäfer 3] [Lappalainen 3] 4

Complexity Aware H.264 Coding Important for mobile video solutions H.264: Emerging coding standard Much higher quality Much higher complexity Complexity constrained video coding ( P) ( P) ( P) min D, s.., t R R, C C T T P : QP, MV, Mode,? 5

Approaches for Comp. Reduction Hardware level Hardware architecture (MMX, SIMD [Chen 4]) Low power chip (Advanced voltage scaling [ARM 3]) High capacity battery [GIT] Encoder complexity Fast motion estimation (FME) [Li 5, Tourapis 3 ] Fast mode decision (FMD) [Zhu 5, Peng 5] Fast reference frame selection [Su 4] Adaptive I-B-P GOP structure [Ray 4] SAD (Sum of Absolute Difference) complexity modeling [He 4] Decoder complexity Smart inverse DCT [Ortega ] Interpolation approximation [Wang 5] Implementation optimization (buffer reusage, assembly coding) Complexity Adaptive ME & MD (CAMED) 6

CAMED: Philosophy Input Frames P(t) - d(t) Motion Compensation++ v v + P R (t) Transform Inverse Transform P( t 1) d T + (t) Frame Buffer+De-blocking (H.264) Quantization Inverse Quantization Pre-decoded Entropy Decoding d T (t) Entropy Encoding Entropy Decoding Video Decoder Decoded Frame Motion Estimation & Mode Decision Video Encoder v v Entropy Encoding Bit Strem Combining Encoded Bitstream 7

CAMED: Highlight Develop an encoding algorithm that achieves both high video quality and low decoding complexity Addressed the most significant module (Motion Compensation) in decoders, so that the complexity reduction is efficient. Nothing need to be done in decoders, and the extra cost in encoders is trivial Full standard compliant The proposed method can be a valuable complement for hardware-level and encoder-level solutions Very promising for mobile video applications. 8

R-D-C Optimization R-D-C Lagrange optimization V * RDC,, C = MOTION V sup{ V} ( BM, ) argmin J ( V BM, ) V sup{ V} * RDC,, C = MODE M sup{ M} M sup{ M} { RD, J } MOTION V BM γ MOTIONCMOTION V BM = arg min (, ) + (, ) ( ) M ( MBQP, ) argmin J M MBQP, ( ) γ MODE MODE ( ) { RD, J } MODE M MBQP C M MBQP = arg min, +, γ MOTION = γ MODE 9

Complexity Cost Function Pixel-based interpolation operation c P ( V) V is integer MV e 6 V is subpixel b,h e6 + e 2 V is subpixel a,c,d,n = 2e6 + e 2 V is subpixel e,g,p,r 7e 6 V is subpixel j 7e6 + e 2 V is subpixel i,f,k,q G a b c H d e f g h i j k m n p q r M s N Can be further simplified counting 7-tap interpolations. 1

Complexity Cost Function (cont.) Block-based interpolation operation Mode Integer 1 6-tap 2 6-tap 7 6-tap Skip/Direct 256 512 592 16x16 256 512 592 16x8 128 256 296 8x16 128 256 296 8x8 64 128 168 8x4 32 64 84 4x8 32 64 84 4x4 16 32 52 Intra 11

Complexity Cost Function (cont.) Complexity benchmark Real time AVC decoder in Dell Axim x5 PDA Decoding Time Per Pixel (1E-7s).6.5.4.3.2.1 Dell Axim x5 w/ 524MHz Xscale CPU, Foreman, QCIF, 15fps, baseline, 2Kbps M1 M2 M3 M4 M5 M6 M7 V1 V5 V9 V13 V2 V3 V4 V6 V8 V14 V16 V7 V1 V11 V12 V15 Different Subpixel Levels 12

Complexity Cost Function (cont.) Modeling based on complexity benchmark ( ) p v M b ( n c + n c c ) c +, = M 6, v 6 2, v 2 Content-independent: the complexity is mainly decided by the arithmetic calculations Can be summarized into look-up table to reduce encoding cost Reflects the actual computing complexity, including not only interpolation, but also other cost rather than interpolation such as memory access and logic flow. l 13

Complexity Control The procedure to adjust coding parameters so that the complexity level meets the target requirement. Similar to rate control Control parameter: Lagrange multiplier γ, γ MOTION MODE Complexity modeling ( 1 MODE ) ( γ ) ln ( γ ) C = D K + K γ MODE MODE () t ( ) KD( t) KDt () C t = exp 1 Compelxity (1K Interpolations) 4 35 3 25 2 15 1 5 Mobile, 6kbps Each line is for one P frame 1 1 1 1 2 1 3 Lagrange multiplier for block mode 14

Complexity Control (cont.) Model estimation Collect complexity stat using sliding window Estimate model parameters through regression Complexity C E Observed ( γ,c) Observed Point Increased frame complexity ( γ,c) Point Buffer management γ MODE Lagrange Multiplier Use complexity buffer to simulate the complexity usage status on the decoder side Estimate of available resource, adjust the parameters, and avoid issues of buffer overflow or underflow 15

Experiment Setup Environment Software: H.264 JM 8.2 Sequences: Akiyo, Foreman, Stefan, Mobile Resolution: QCIF/CIF Profile: Main/Baseline Bit rate: 1kbps~1Mbps Block mode: all on Direct mode: temporal Motion estimation search range: 32 pixels 16

R-D-C Optimization PSNR(dB) 42 41 4 39 38 37 36 35 34 Foreman PSNR 33 H.264 32 La mda=5 31 La mda=2 3 La mda=5 29 Lamda=2 Lamda=5 28 5 1 15 Bandwidth (kbps) Remaining Complexity(%) 8 7 6 5 4 3 Foreman Complexity 2 La mda =5 La mda =2 1 La mda =5 Lamda=2 Lamda=5 5 1 15 Bandwidth (kbps) Up to 6% 6-tap interpolation can be saved while keeping the video quality almost intact (quality difference less than.2db) 17

Demonstration 18

Results: Complexity Benchmark 5. Foreman, QCIF, 15fps, baseline 48. 46. 44. PSNR (db) 42. 4. 38. 36. Beat H.264 up to 1.8dB in terms of C-D performance H.264 L1. L2. L5. L.5 L1 4K 34. 2. 22. 24. 26. 28. 3. 32. 34. 36. 38. 4. Decoding Time (s) Dell Axim x5 19

Results: Compatibility w/ FME 2

Results: Complexity Control Comple xity (K inte rpolation) 4 35 3 25 2 15 1 JM82 5K 1K 15K 2K Foreman, 1kbps 5 5 1 15 2 GOP Index 21

Results: Complexity Control CIF, 3fps, 1kbps 22

Conclusions & Future Work Proposed CAMED for complexity-efficient H.264 coding Applied rigorous R-D-C optimization framework Defined several complexity cost functions Proposed the algorithm of complexity control Performance Validated through extensive simulation experiment Future work Theoretical analysis on the relationship between the complexity level and the control parameter (i.e., the Lagrange multiplier) Considering hardware acceleration (such as MMX, SSE) More efficient complexity modeling and complexity control 23

THANK YOU! Prof. Shih-Fu Chang sfchang@ee.columbia.edu Yong Wang ywang@ee.columbia.edu