Title Adaptive Lagrange Multiplier for Low Bit Rates in H.264.

Similar documents
Fast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE Gaurav Hansda

Reduced Frame Quantization in Video Coding

ERROR-ROBUST INTER/INTRA MACROBLOCK MODE SELECTION USING ISOLATED REGIONS

Deblocking Filter Algorithm with Low Complexity for H.264 Video Coding

NEW CAVLC ENCODING ALGORITHM FOR LOSSLESS INTRA CODING IN H.264/AVC. Jin Heo, Seung-Hwan Kim, and Yo-Sung Ho

An Efficient Table Prediction Scheme for CAVLC

An Improved H.26L Coder Using Lagrangian Coder Control. Summary

A Novel Deblocking Filter Algorithm In H.264 for Real Time Implementation

An Efficient Mode Selection Algorithm for H.264

Complexity Reduced Mode Selection of H.264/AVC Intra Coding

Fast Mode Decision for H.264/AVC Using Mode Prediction

A Fast Intra/Inter Mode Decision Algorithm of H.264/AVC for Real-time Applications

ARTICLE IN PRESS. Signal Processing: Image Communication

IBM Research Report. Inter Mode Selection for H.264/AVC Using Time-Efficient Learning-Theoretic Algorithms

Performance Comparison between DWT-based and DCT-based Encoders

An Efficient Intra Prediction Algorithm for H.264/AVC High Profile

Efficient MPEG-2 to H.264/AVC Intra Transcoding in Transform-domain

Rate Distortion Optimization in Video Compression

LIST OF TABLES. Table 5.1 Specification of mapping of idx to cij for zig-zag scan 46. Table 5.2 Macroblock types 46

One-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain

A COST-EFFICIENT RESIDUAL PREDICTION VLSI ARCHITECTURE FOR H.264/AVC SCALABLE EXTENSION

STUDY AND IMPLEMENTATION OF VIDEO COMPRESSION STANDARDS (H.264/AVC, DIRAC)

Investigation of the GoP Structure for H.26L Video Streams

H.264/AVC BASED NEAR LOSSLESS INTRA CODEC USING LINE-BASED PREDICTION AND MODIFIED CABAC. Jung-Ah Choi, Jin Heo, and Yo-Sung Ho

STACK ROBUST FINE GRANULARITY SCALABLE VIDEO CODING

Region-Based Rate-Control for H.264/AVC. for Low Bit-Rate Applications

Video compression with 1-D directional transforms in H.264/AVC

A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames

Optimum Quantization Parameters for Mode Decision in Scalable Extension of H.264/AVC Video Codec

Advanced Video Coding: The new H.264 video compression standard

Block-based Watermarking Using Random Position Key

Fast Intra Prediction Algorithm for H.264/AVC Based on Quadratic and Gradient Model

Fast frame memory access method for H.264/AVC

Reduced 4x4 Block Intra Prediction Modes using Directional Similarity in H.264/AVC

Frame differencing-based segmentation for low bit rate video codec using H.264. S. Sowmyayani* and P. Arockia Jansi Rani

Homogeneous Transcoding of HEVC for bit rate reduction

EE 5359 Low Complexity H.264 encoder for mobile applications. Thejaswini Purushotham Student I.D.: Date: February 18,2010

Bit Allocation for Spatial Scalability in H.264/SVC

Fraunhofer Institute for Telecommunications - Heinrich Hertz Institute (HHI)

Performance Analysis of DIRAC PRO with H.264 Intra frame coding

Fast Intra- and Inter-Prediction Mode Decision in H.264 Advanced Video Coding

Performance and Complexity Co-evaluation of the Advanced Video Coding Standard for Cost-Effective Multimedia Communications

Rate Control for H.264 with Two-Step Quantization Parameter Determination but Single-Pass Encoding

Joint redundant motion vector and intra macroblock refreshment for video transmission

STANDARD COMPLIANT FLICKER REDUCTION METHOD WITH PSNR LOSS CONTROL

EE Low Complexity H.264 encoder for mobile applications

Comparative and performance analysis of HEVC and H.264 Intra frame coding and JPEG2000

H.264/AVC Video Watermarking Algorithm Against Recoding

BLOCK MATCHING-BASED MOTION COMPENSATION WITH ARBITRARY ACCURACY USING ADAPTIVE INTERPOLATION FILTERS

THE H.264, the newest hybrid video compression standard

EFFICIENT PU MODE DECISION AND MOTION ESTIMATION FOR H.264/AVC TO HEVC TRANSCODER

H.264 to MPEG-4 Transcoding Using Block Type Information

High Efficient Intra Coding Algorithm for H.265/HVC

Pattern based Residual Coding for H.264 Encoder *

VHDL Implementation of H.264 Video Coding Standard

SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC

On the Adoption of Multiview Video Coding in Wireless Multimedia Sensor Networks

A Novel Partial Prediction Algorithm for Fast 4x4 Intra Prediction Mode Decision in H.264/AVC

Realtime H.264 Encoding System using Fast Motion Estimation and Mode Decision

Editorial Manager(tm) for Journal of Real-Time Image Processing Manuscript Draft

Adaptation of Scalable Video Coding to Packet Loss and its Performance Analysis

Department of Electrical Engineering

Video Coding Using Spatially Varying Transform

Coding of Coefficients of two-dimensional non-separable Adaptive Wiener Interpolation Filter

Research Article A High-Throughput Hardware Architecture for the H.264/AVC Half-Pixel Motion Estimation Targeting High-Definition Videos

Vector Bank Based Multimedia Codec System-on-a-Chip (SoC) Design

CONTENT ADAPTIVE COMPLEXITY REDUCTION SCHEME FOR QUALITY/FIDELITY SCALABLE HEVC

Upcoming Video Standards. Madhukar Budagavi, Ph.D. DSPS R&D Center, Dallas Texas Instruments Inc.

ADAPTIVE JOINT H.263-CHANNEL CODING FOR MEMORYLESS BINARY CHANNELS

A deblocking filter with two separate modes in block-based video coding

A COMPARISON OF CABAC THROUGHPUT FOR HEVC/H.265 VS. AVC/H.264. Massachusetts Institute of Technology Texas Instruments

Digital Video Processing

A comparison of CABAC throughput for HEVC/H.265 VS. AVC/H.264

Transform Kernel Selection Strategy for the H.264

JPEG 2000 vs. JPEG in MPEG Encoding

International Journal of Emerging Technology and Advanced Engineering Website: (ISSN , Volume 2, Issue 4, April 2012)

SAD implementation and optimization for H.264/AVC encoder on TMS320C64 DSP

THE objective of rate control is to regulate the coded bit

PAPER Optimal Quantization Parameter Set for MPEG-4 Bit-Rate Control

COMPARISON OF HIGH EFFICIENCY VIDEO CODING (HEVC) PERFORMANCE WITH H.264 ADVANCED VIDEO CODING (AVC)

Transcoding from H.264/AVC to High Efficiency Video Coding (HEVC)

Decoded. Frame. Decoded. Frame. Warped. Frame. Warped. Frame. current frame

An Efficient Hardware Architecture for H.264 Transform and Quantization Algorithms

H.264/AVC Baseline Profile to MPEG-4 Visual Simple Profile Transcoding to Reduce the Spatial Resolution

Complexity/Performance Analysis of a H.264/AVC Video Encoder

Xin-Fu Wang et al.: Performance Comparison of AVS and H.264/AVC 311 prediction mode and four directional prediction modes are shown in Fig.1. Intra ch

High Efficiency Video Decoding on Multicore Processor

Very Low Complexity MPEG-2 to H.264 Transcoding Using Machine Learning

FAST ALGORITHM FOR H.264/AVC INTRA PREDICTION BASED ON DISCRETE WAVELET TRANSFORM

Implementation and analysis of Directional DCT in H.264

Fast intermode decision algorithm based on general and local residual complexity in H.264/ AVC

High Efficiency Video Coding (HEVC) test model HM vs. HM- 16.6: objective and subjective performance analysis

OVERVIEW OF IEEE 1857 VIDEO CODING STANDARD

Categorization for Fast Intra Prediction Mode Decision in H.264/AVC

Recent, Current and Future Developments in Video Coding

Fast Wavelet-based Macro-block Selection Algorithm for H.264 Video Codec

IMPROVED CONTEXT-ADAPTIVE ARITHMETIC CODING IN H.264/AVC

ENCODER POWER CONSUMPTION COMPARISON OF DISTRIBUTED VIDEO CODEC AND H.264/AVC IN LOW-COMPLEXITY MODE

Gravity direction-based ultra-fast intraprediction algorithm for H.264/AVC video coding

EE 5359 MULTIMEDIA PROCESSING SPRING Final Report IMPLEMENTATION AND ANALYSIS OF DIRECTIONAL DISCRETE COSINE TRANSFORM IN H.

Transcription:

Provided by the author(s) and University College Dublin Library in accordance with publisher policies. Please cite the published version when available. Title Adaptive Lagrange Multiplier for Low Bit Rates in H.264 Author(s) Ivanov, Yuri; Bleakley, Chris J. Publication date 2005-08-31 Conference details Link to online version Item record/more information 9th Irish Machine Vision and Image Processing Conference (IMVIP), Belfast, Northern Ireland, 30-31 August, 2005 http://www.cs.ucd.ie/staff/cbleakley/home/papers/adaptive_la mbda_for_imvip2005_final.pdf; http://www.cs.qub.ac.uk/imvip2005/ http://hdl.handle.net/10197/7102 Downloaded 2018-07-13T11:37:31Z The UCD community has made this article openly available. Please share how this access benefits you. Your story matters! (@ucd_oa) Some rights reserved. For more information, please see the item record link above.

ADAPTIVE LAGRANGE MULTIPLIER FOR LOW BIT RATES IN H.264 Authors: Yury V. Ivanov, Dr. C.J. Bleakley Dept. of Computer Science, University College Dublin, Belfield, Dublin 4, Ireland e-mail: {yury.ivanov, chris.bleakley}@ucd.ie ABSTRACT This paper describes a novel adaptive Lagrange multiplier algorithm based on the general Rate-Distortion model with dynamic lambda calculation for H.264 video coding. When experimentally tested with the JM 8.1a encoder on QCIF video sequences the algorithm provides 2.8% average bit rate saving with insignificant PSNR loss (around 0.13%) without computational complexity increase. Experiments indicate that the algorithm may also be adopted for higher bit rates. Keywords: H.264, Lagrange multiplier, low bit rate video coding.

ADAPTIVE LAGRANGE MULTIPLIER FOR LOW BIT RATES IN H.264 ABSTRACT This paper describes a novel adaptive Lagrange multiplier algorithm based on the general Rate- Distortion model with dynamic lambda calculation for H.264 video coding. When experimentally tested with the JM 8.1a encoder on QCIF video sequences the algorithm provides 2.8% average bit rate saving with insignificant PSNR loss (around 0.13%) without computational complexity increase. Experiments indicate that the algorithm may also be adopted for higher bit rates. Keywords: H.264, Lagrange multiplier, low bit rate video coding. Distortion and mode decision. Section 3 describes the experimental results of applying the RD technique to low bit rate video sequences. In Section 4 the adaptive Lagrange multiplier algorithm is proposed and experimental results are presented. Section 5 holds a discussion of the achieved results and, finally, conclusions are given in Section 6. 2. RATE-DISTORTION IN H.264 MODE DECISION Rate-Distortion is used in H.264 for making mode decisions. The Rate-Distortion model is based on the following Lagrangian formula [6]: min { J }, where J = D + R (1) 1. INTRODUCTION The new H.264 standard [1] was developed by the Joint Team (JVT) for video compression in applications where bandwidth or storage capacity is limited. Experimental results show that H.264 provides better coding efficiency than MPEG-4 [2] and H.263 [3] at lower bit rates. However, these advantages come at the cost of increased computational complexity. Rate-Distortion Optimization (RDO) [4] is an optional tool specified within the H.264 standard. It achieves improved compression and visual quality. Unfortunately its utilization leads to a computational complexity increase of 100-200% [5]. This paper proposes a technique, based on the general RD model for adaptive Lagrange multiplier calculation. It achieves similar compression results at low bit rates to conventional RD with insignificant drop in visual quality with almost no computational complexity increase. The proposed algorithm can be used instead of the RDO tool for encoding low bit rate video sequences. The rest of this paper is organized as follows. Section 2 explains the background theory on Rate- where D is a distortion measure (usually Sum of Absolute Differences) and R is the number of bits used. The Lagrangian rate-distortion function J is minimized for a particular value of the Lagrange multiplier. This technique allows effective evaluation of a large number of possible coding choices but, in practice, computation time is the limiting factor and some interactions between coding decisions are neglected. Thus, D, and R are approximations. In the H.264 reference encoder JM 8.1a [7], developed by the JVT, the mode decision algorithm uses equation (1). An optional RDO tool is also provided, allowing more precise estimation of J since and R are optimized in a rate-distortion sense. In general, the Lagrange multiplier can be different for the Intra and Inter mode decisions. For the Intra decision it is referred as MODE and for the Inter decision as MOTION. In the case where the RDO tool is not used, the equation for the calculation is quite simple [7]: = f (QP ) (2) MODE MOTION =

where QP is quantizer step size. Thus, MODE and MOTION are fixed for the entire encoding process. In the case where the RDO tool is used, the macroblock QP selected by the minimization routine is dependent on MODE. Therefore, MODE and MOTION are calculated differently: MODE = 0.85 2 ( QP /3) (3) MOTION= MODE (4) In basic operation (RDO off), the encoder performs mode selection based on the parameter MODE, which is calculated from the QP setting as shown in equations (2-4). In fact, the relationship between MODE and QP was derived experimentally by determining the rate-distortion optimal values of QP for a given set of MODE values [4]. This relationship has a second order dependency on the bit rate of the sequence, namely that MODE should be higher for higher bit rates. In the next Section we will investigate the effect of RDO tool when applied to the encoding of low bit rate video sequences. In this way, the practical difference between the usage of equations (2) and (3-4) can be determined. 3. RATE-DISTORTION APPLIED TO LOW BIT RATE VIDEO SEQUENCES For the purpose of RD investigation we used the reference JM 8.1a encoder. Different QCIF video sequences of 298 frames each were encoded with and without the RDO tool. The optimal encoder configuration for low bit rates is based on the experimental results from [8]: All 7 block modes in Variable Block Sizes Hadamard Transform and CABAC is on No B-frames Full search motion estimation algorithm Search Range Size = 8 QP = 28 Picture frequency is 10 frames per second The results of the experiment are given in Tables 1 and 4. It can be seen from Table 4 that bit rate savings for RD depend on the video sequence (4% in average) while the average PSNR drop is insignificant (around 0.1%). The encoding statistics given in Table 1 indicate that the RDO tool uses less Intra coding in favor of more Inter coding. It seems that, for the tested video sequences, the RDO tool alters Intra/Inter decisions so that the bit rate is reduced without impacting visual quality and that most of the bit rate savings are achieved by the use of less Intra coding. sequence Intra MBs (I-blocks) No Opt. RD Table 1. Intra/Inter MBs count for no RD, RD and optimal adaptive C' lambda algorithms. For the no RD case it is possible to vary MODE in order to force the encoder to make more Inter decisions. In this way, it may be possible to reduce the bit rate without impacting a visual quality, in a similar manner to applying the RDO tool. This idea is investigated in the next Section. 4. METHOD OF ADAPTIVE LAGRANGE MULTIPLIER CALCULATION AND SIMULATION RESULTS In the first series of experiments we chose to apply a fixed multiplier, C to the conventional (no RD) value of MODE. Thus, equation (2) was transformed to: = C f (QP) (5) MODE = MOTION Inter MBs (P-blocks) Opt. C' No RD C' RD RD Coast 318 249 288 29184 29253 29214 Foreman 839 728 767 28663 28774 28735 Hall 241 178 227 29261 29324 29225 Mother 156 144 148 29346 29358 29502 Mean difference with no RD, % -5.89-3.4-0.08 0.11 The experimental results below for video sequences with 200 frames each indicate that a bit rate reduction is achieved when 1 < C < 2. For instance, Foreman with C =1.75 shows a 5.39% bit

saving, which is even higher than that achieved using the RDO tool. The optimal value of C marked with gray in Table 2, is different for every video sequence and tends to be higher for the higher bit rate sequence. Bit rate, kbits/s sequence Orig. 0.5 1.25 1.5 1.75 2.0 JM Foreman 49.02 59.8 47.47 46.66 46.38 46.68 News 24.98 27.9 24.77 24.77 25.00 25.28 Mother 17.34 22.53 17.04 16.91 17.06 17.33 Akiyo 8.35 9.98 8.22 8.34 8.47 8.56 Table 2. Bit rate for selected video sequences encoded with fixed value of C. For the given encoding parameters, the dependency of C on bit rate can be approximated using the MSE method as the following function: ' 2 C = f ( R) = 0.0002 R + 0.023 R + 1.099 (6) where R is the average bit rate. Using equation (6) it is possible to calculate C adaptively based on the average bit rate as determined during the encoding process. In the next series of experiments the following algorithm for adaptive C' calculation was used. start encoding with initial C' ; bits sent = 0; for every frame except Intra frames { bits sent += current frame bits; if ( (Frame number % S) == 0) { calculate average bit rate; calculate new C' ; bits sent = 0; } } The experimental results are given in Table 4. The average bit rate saving around 2.8% with PSNR drop of 0.13% was achieved for the adaptive C' calculation algorithm, where initial C' =1.0 and S=5. Finally experiments with different QP were performed. The selected video sequences were encoded using the optimal adaptive C' algorithm with QP=24. Two cases were tested, one using equation (6) "as is" and one using scaling of R according to the ratio of the new bit rate (QP=24) to that of the previous version (QP=28). The results are shown in Table 3. In the case of scaled R, the percentage bit rate savings achieved were similar those obtained in the QP=28 case. sequence Orig. JM (QP=24) Bit rate, kbits/s C ' Opt. as is Opt. C ', when R ' = R / 2 Coastguard 123.02 120.98 120.85 Foreman 76.07 73.97 73.66 Silent 45.87 44.85 44.81 Mother 26.58 26.02 25.99 Table 3. Bit rate for selected video sequences encoded with different algorithms where QP=24. Since modifying the value of in equation (2) leads to the change in the balance between rate and distortion, subjective evaluation of all video sequences was also performed. In fact, the visual difference between the pictures produced by original JM8.1a and adaptive Lambda method can be hardly found (see Figure 1.) Figure 1. Image produced by JM (left) and proposed adaptive Lambda algorithm (right). 5. DISCUSSION Experimental results from Table 4 show that despite the fact that coefficients for equation (6) were calculated empirically from a limited number of video sequences, the method in general is applicable for wide range of QCIF video sequences. For the QP

values rather than the 28, method also works, as can be seen from Table 3, but equation (6) should be extended to allow for the QP change: i.e. the calculation of an average bit rate R should be dependent on QP. Evidently the technique provides improved coding performance. The mean gradient of the RD curve calculated across several sequences for the adaptive lambda algorithm is 48% better than that obtained with simple scaling of QP. The encoding statistics for the optimal version of the adaptive lambda algorithm, shown in the Table 1, indicate that usage of equation (5) instead of equation (2) leads to Intra/Inter decision percentages similar to those obtained using the RDO tool, i.e. equations (3-4). The fact that the bit rate saving of the RDO tool was only partially achieved with the adaptive lambda algorithm can be explained by noting that the RDO tool performs a J minimization routine in addition to applying equations (3-4). The adaptive lambda algorithm only relies on the tendency of the optimal MODE to be higher for higher bit rate sequences. Bit rate, kbits/s PSNR, db sequence Orig. Opt. Orig. Opt. RD ' RD ' JM C JM C Coast 65.3 63.93 63.90 40.01 39.97 39.96 Carphon 51.56 49.42 49.59 38.88 38.84 38.8 e Foreman 44.12 41.94 42.32 39.99 38.95 38.91 Table 41.9 40.14 41.13 38.50 38.46 38.46 Silent 28.15 26.83 27.56 37.92 37.88 37.91 Mean difference with orig. JM, % -4.01-2.79 - -0.1-0.13 Table 4. Bit rate and PSNR for different algorithms. It should also be noticed, that the bit rate saving of 2.8% for proposed method is achieved without computational cost increase. Usage of RD leads up to 200% computational complexity increase [5] and generally is not recommended for QCIF sequences for that reason [8]. 6. CONCLUSIONS The proposed adaptive Lagrange multiplier algorithm gives around 2.8% of average bit rate improvement for the H.264 encoder without applying the RDO tool while achieving nearly the same PSNR without computation complexity increase. A method using a fixed multiplier was also tested. However, the optimum value varies with video sequence. While nearly the same results were achieved with the fixed multiplier, the dynamic version is preferred due to its robust nature. Potential future work includes extension of the algorithm to higher bit rates (CIF) and refinement of the adaptation model based on the analysis of a greater number of test cases. 6. REFERENCES [1] Joint Team (JVT) of ISO/IEC MPEG and ITU-T VCEG. Draft ITU-T Recommendation and Final Draft International Standard of Joint Specification (ITU-T Rec. H264 ISO/IEC 14496-10 AVC), document JVT-G050d35.doc, 7 th Meeting: Pattaya, Thailand, March, 2003. [2] A. Luthra, K. Gandhi, R. Mckoen, Y. Yu, K. Panusopone, D. Baylon and L. Wang, Performance of MPEG-4 Profiles Used for Streaming and Comparison with H.26L, M7227, Motorola, Broadband Communications Sector, July 2001. [3] ITU-T Recommendation H.263, Coding for Low Bit Rate Communications, Version 2, Jan. 1998. [4] G.J. Sullivan and T. Wiegand, Rate-Distortion Optimization for Compression, IEEE Signal Processing Magazine, Vol. 15, No 6., pp. 74-90, 1998. [5] Implementation Studies Group of ISO/IEC, Main Results of the AVC Complexity Analysis, ISO/IEC JTC1/SC29/WG11, Klagenfurt, July 2002. [6] H. Everett III, Generalized Lagrange Multiplier Method for Solving Problems of Optimum Allocation of Resources, Operations Research, Vol. 11, pp. 399-417, 1963.

[7] JVT reference software JM 8.1a, on the Web: http://iphome.hhi.de/suehring/tml. [8] S. Saponara, C. Blanch, K. Denolf and J. Bormans, Data Transfer and Storage Complexity Analysis of the AVC Codec on a Tool-By-Tool Basis, ISO/IEC JCT1/SC29/WG11, M8547, Klagenfurt, 22-26 July, 2002.