FAST ALGORITHM FOR H.264/AVC INTRA PREDICTION BASED ON DISCRETE WAVELET TRANSFORM

Similar documents
Fast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE Gaurav Hansda

Performance Comparison between DWT-based and DCT-based Encoders

An Efficient Intra Prediction Algorithm for H.264/AVC High Profile

Performance Analysis of DIRAC PRO with H.264 Intra frame coding

Complexity Reduced Mode Selection of H.264/AVC Intra Coding

Fast Mode Decision for H.264/AVC Using Mode Prediction

EE 5359 MULTIMEDIA PROCESSING SPRING Final Report IMPLEMENTATION AND ANALYSIS OF DIRECTIONAL DISCRETE COSINE TRANSFORM IN H.

Optimizing the Deblocking Algorithm for. H.264 Decoder Implementation

Fast frame memory access method for H.264/AVC

Realtime H.264 Encoding System using Fast Motion Estimation and Mode Decision

Video compression with 1-D directional transforms in H.264/AVC

Scalable Video Coding

FPGA IMPLEMENTATION OF BIT PLANE ENTROPY ENCODER FOR 3 D DWT BASED VIDEO COMPRESSION

LIST OF TABLES. Table 5.1 Specification of mapping of idx to cij for zig-zag scan 46. Table 5.2 Macroblock types 46

An Efficient Mode Selection Algorithm for H.264

Implementation and analysis of Directional DCT in H.264

Title Adaptive Lagrange Multiplier for Low Bit Rates in H.264.

Adaptive Quantization for Video Compression in Frequency Domain

Reduced 4x4 Block Intra Prediction Modes using Directional Similarity in H.264/AVC

Reducing/eliminating visual artifacts in HEVC by the deblocking filter.

Reduced Frame Quantization in Video Coding

A Novel Partial Prediction Algorithm for Fast 4x4 Intra Prediction Mode Decision in H.264/AVC

Fast Intra Prediction Algorithm for H.264/AVC Based on Quadratic and Gradient Model

HEVC The Next Generation Video Coding. 1 ELEG5502 Video Coding Technology

Compression of RADARSAT Data with Block Adaptive Wavelets Abstract: 1. Introduction

High Efficiency Video Coding. Li Li 2016/10/18

Coding of Coefficients of two-dimensional non-separable Adaptive Wiener Interpolation Filter

Lecture 5: Error Resilience & Scalability

Complexity Reduction Tools for MPEG-2 to H.264 Video Transcoding

EFFICIENT PU MODE DECISION AND MOTION ESTIMATION FOR H.264/AVC TO HEVC TRANSCODER

Fast Wavelet-based Macro-block Selection Algorithm for H.264 Video Codec

Digital Image Steganography Techniques: Case Study. Karnataka, India.

Optimum Quantization Parameters for Mode Decision in Scalable Extension of H.264/AVC Video Codec

EE Low Complexity H.264 encoder for mobile applications

Wavelet-Based Video Compression Using Long-Term Memory Motion-Compensated Prediction and Context-Based Adaptive Arithmetic Coding

EFFICIENT DEISGN OF LOW AREA BASED H.264 COMPRESSOR AND DECOMPRESSOR WITH H.264 INTEGER TRANSFORM

Advanced Video Coding: The new H.264 video compression standard

Fraunhofer Institute for Telecommunications - Heinrich Hertz Institute (HHI)

Using animation to motivate motion

Motion-Compensated Wavelet Video Coding Using Adaptive Mode Selection. Fan Zhai Thrasyvoulos N. Pappas

Georgios Tziritas Computer Science Department

Interframe coding A video scene captured as a sequence of frames can be efficiently coded by estimating and compensating for motion between frames pri

DWT Based Text Localization

Video Coding Using Spatially Varying Transform

A Novel Deblocking Filter Algorithm In H.264 for Real Time Implementation

Block-based Watermarking Using Random Position Key

H.264/AVC Baseline Profile to MPEG-4 Visual Simple Profile Transcoding to Reduce the Spatial Resolution

H.264 to MPEG-4 Transcoding Using Block Type Information

White paper: Video Coding A Timeline

Standard Codecs. Image compression to advanced video coding. Mohammed Ghanbari. 3rd Edition. The Institution of Engineering and Technology

A NOVEL SCANNING SCHEME FOR DIRECTIONAL SPATIAL PREDICTION OF AVS INTRA CODING

Low-complexity video compression based on 3-D DWT and fast entropy coding

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding.

A COST-EFFICIENT RESIDUAL PREDICTION VLSI ARCHITECTURE FOR H.264/AVC SCALABLE EXTENSION

A Fast Intra/Inter Mode Decision Algorithm of H.264/AVC for Real-time Applications

H.264/AVC und MPEG-4 SVC - die nächsten Generationen der Videokompression

STUDY AND IMPLEMENTATION OF VIDEO COMPRESSION STANDARDS (H.264/AVC, DIRAC)

IBM Research Report. Inter Mode Selection for H.264/AVC Using Time-Efficient Learning-Theoretic Algorithms

Digital Video Processing

Summer Internship at STMicroelectronics, Greater Noida, India

Improved Context-Based Adaptive Binary Arithmetic Coding in MPEG-4 AVC/H.264 Video Codec

One-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain

COMPARISONS OF DCT-BASED AND DWT-BASED WATERMARKING TECHNIQUES

Multi-focus Image Fusion Using Stationary Wavelet Transform (SWT) with Principal Component Analysis (PCA)

Performance analysis of Integer DCT of different block sizes.

VHDL Implementation of H.264 Video Coding Standard

A deblocking filter with two separate modes in block-based video coding

Deblocking Filter Algorithm with Low Complexity for H.264 Video Coding

Jun Zhang, Feng Dai, Yongdong Zhang, and Chenggang Yan

Data Hiding in Video

Homogeneous Transcoding of HEVC for bit rate reduction

High Efficiency Video Coding: The Next Gen Codec. Matthew Goldman Senior Vice President TV Compression Technology Ericsson

ENTROPY-BASED IMAGE WATERMARKING USING DWT AND HVS

Video Codecs. National Chiao Tung University Chun-Jen Tsai 1/5/2015

An Improved H.26L Coder Using Lagrangian Coder Control. Summary

VIDEO AND IMAGE PROCESSING USING DSP AND PFGA. Chapter 3: Video Processing

IMAGE PROCESSING USING DISCRETE WAVELET TRANSFORM

EE 5359 Low Complexity H.264 encoder for mobile applications. Thejaswini Purushotham Student I.D.: Date: February 18,2010

FAST AND EFFICIENT SPATIAL SCALABLE IMAGE COMPRESSION USING WAVELET LOWER TREES

Video Compression Standards (II) A/Prof. Jian Zhang

Comparison of Digital Image Watermarking Algorithms. Xu Zhou Colorado School of Mines December 1, 2014

10.2 Video Compression with Motion Compensation 10.4 H H.263

An Efficient Table Prediction Scheme for CAVLC

Automatic Video Caption Detection and Extraction in the DCT Compressed Domain

Prediction Mode Based Reference Line Synthesis for Intra Prediction of Video Coding

Welcome Back to Fundamentals of Multimedia (MR412) Fall, 2012 Chapter 10 ZHU Yongxin, Winson

Efficient MPEG-2 to H.264/AVC Intra Transcoding in Transform-domain

Professor Laurence S. Dooley. School of Computing and Communications Milton Keynes, UK

Rate Distortion Optimization in Video Compression

Short Communications

Video coding. Concepts and notations.

NEW CAVLC ENCODING ALGORITHM FOR LOSSLESS INTRA CODING IN H.264/AVC. Jin Heo, Seung-Hwan Kim, and Yo-Sung Ho

Comparative and performance analysis of HEVC and H.264 Intra frame coding and JPEG2000

Xin-Fu Wang et al.: Performance Comparison of AVS and H.264/AVC 311 prediction mode and four directional prediction modes are shown in Fig.1. Intra ch

Introduction to Video Coding

Part 1 of 4. MARCH

Fast Intra- and Inter-Prediction Mode Decision in H.264 Advanced Video Coding

Optimized Progressive Coding of Stereo Images Using Discrete Wavelet Transform

Fast Implementation of VC-1 with Modified Motion Estimation and Adaptive Block Transform

Module 7 VIDEO CODING AND MOTION ESTIMATION

Transcription:

FAST ALGORITHM FOR H.264/AVC INTRA PREDICTION BASED ON DISCRETE WAVELET TRANSFORM Damián Ruíz, Oscar Patiño, David Jiménez, José Manuel Menéndez Grupo de Aplicación de Telecomunicaciones Visuales Universidad Politécnica de Madrid, Madrid, España, email: drc@gatv.ssr.upm.es Email: opg@gatv.ssr.upm.es, djb@gatv.ssr.upm.es, jmm@gatv.ssr.upm.es Abstract H.264 or MPEG-4 AVC (Advanced Video Coding) is the new world-wide accepted international Standard for video coding, approved by ITU-T and ISO. New Tools have been added to improve the coding efficiency allowing a save up above of 50%, when is compared with previous standards (H.263, MPEG-2 y MPEG-4). From April 2007 there is a new set of profiles known as all-intra. They were born as a sub-set from the High profile and have reached a high impact within broadcast industry where the highest quality video formats are demanded. The high efficiency of all-intra mode in H.264 is due to Rate Distortion Optimization (RDO) technique. RDO chooses for each macroblock (MB) the best partition mode and directional prediction. However, the computational burden becomes extremely high due to huge number of prediction-prediction modes that should be evaluated. This article shows a new algorithm for fast partition mode algorithm based on Discrete Wavelet Transform (DWT). It allows reducing the number of candidate modes to those which are strictly defined for each partition. By using the local 2D-DWT over each MB, information of the homogeneity is obtained. It is got from multiresolution analysis of the transformed coefficients in each sub-band. This way it is easier to classify quickly the optimum partition mode avoiding the exhaustive seek made by RDO. Keywords Discrete Wavelet Transform, Intra Prediction, H.264/MPEG-4 AVC, Rate Distortion Optimization, Homogeneity analysis. INTRODUCTION H.264/MPEG-4 AVC [1] uses the traditional hybrid predictive-transformational coding scheme, but new techniques that represent the estate-of-art in coding are added, as Multi- Picture Inter-Picture Prediction, Variable Block-Size Motion Compensation, Context-Adaptive Binary Arithmetic Coding, Weighted Prediction and Intra Prediction, among others. RDO used in H.264 is considered the key tool for Intra coding. It is basic for the selection of the best mode of prediction but its calculation demands a very costly computation. It is based on Lagrange cost function that demands the knowledge of both, distortion rate (D) and bit rate (R) generated during each mode of Intra partition coding (I_Mode). Next equation (1) shows the mathematical expression for Lagrange function cost, where QP is the quantization parameter applied and λ MODE is the Lagrange multiplicator: J(I MODE ) = D(I MODE QP, λ MODE ) + λ MODE R(I MODE QP, λ MODE ) (1)

Specifically, the exhaustive computation of RDO checks all the possible combinations. Thus it means that RDO has to evaluate every MB of the image 736 times when any profile of the High family has been chosen. This has caused that many research works trying to reduce the number of iterations needed in RDO meaningfully have been done. These algorithms are called Fast Intra. Most of them are based on directional information obtained from MBs in order to get a close approximation to the directional predictors defined in the standard. Several analyses in spatial domain are proposed to get that information. Within spatial domain, Pan [2] deals with a fast algorithm based on histogram analysis of the edge map in the image. It is obtained from the gradient vector calculation for every pixel within the block trough Sobel operator. Candidate modes are reduced to those close to the direction of the dominant gradient. Fangwen [3] shows an optimization of Pan s algorithm based on statistical analysis of neighbours MB. This allows calculating the border map only for the most probable modes. Yue [4] computes RDO for the 16 MB sub-blocks which, in turns, allows to estimate a reduced number of candidates in the and set of partitions. Chih- Hsien [5] defends a fast algorithm based on the symmetry of adjacent angles of directional predictors in order to reduce the number of candidate modes. Other algorithms get the directional information from MBs through analysis in the frequency domain. By analysing the transformed coefficients (DCT), Takeshi [6] obtains the energetic information of vertical and horizontal edges to determine the dominant gradient for sub-blocks. Zhenyu [7] characterizes the predominant direction of the pixel gradient within the MB through a non-normalized Haar Transform. The set of described algorithms are based on seeking candidate modes for the directional prediction despite of the possible partition mode. The algorithm that is proposed in this article shows an innovative strategy. The objective consists of obtaining the optimum partition for the MB first, then applying RDO scheme. Thus it is only necessary to check the defined modes for the selected partition while reducing drastically the number of evaluations. INTRA MODE PREDICTIONS IN H.264/AVC The new visual coding standard H.264/MPEG-4 AVC unlike previous standards (H.261, H.263, MPEG-1, MPEG-2, MPEG-4) adds the novelty of a spatial prediction stage for the Intraframe mode among the macroblocks to code and the adjacent MBs. Specifically, the standard defines the next partition-prediction combinations for the Luma component: I_: single blocks with 4 directional predictors. I_: four blocks with 9 predictor. It is only defined for High profiles by low-pass filtering the pixels used in the prediction. I_: sixteen sub-blocks with the same 9 directional predictors, this time without previous filtering. Fig. 1 shows the directional predictors for I_, I_ and respectively.

Figure 1. Directional predictors for I_, I_ and Table 1 shows the directional predictors aforementioned. Chroma components are never partitioned and always use the same directional predictors as Luma MB (I_). Partition & Mode Directional Predictor 0 V Vertical 1 H Horizontal 2 DC Mean Value 3 D Planar Mode 3 V Vertical 1 H Horizontal 2 DC Mean Value 3 DDL Diagonal Down Left 4 DDR Diagonal

Partition Mode Directional Predictor Down Right 5 VR Vertical Right 6 HD Horizontal Down 7 VL Vertical Left 8 HU Horizontal Up Table 1. Prediction Modes for H.264/AVC OPTIMUM PARTITION BASED ON LOCAL 2D-DWT A. 2D-DWT The transformation based on wavelets presents a wide range of applications in the field of images and video sequences, from border detection to pattern recognition, and especially in the image compression field where it is the basis of JPEG 2000 standard [8]. Currently, it has established itself as an alternative to the Discrete Cosine Transformation to decorrelate images, as presenting no block distortion like DCT does. Another advantage is its intrinsic spatial scalability. The bi-dimensional Discrete Wavelet Transform (2D-DWT) can be implemented as one dimensional low and high filtering, followed by a new low and high filtering in the other dimension. Figure 2. Single decomposition level of 2D-DWT Every scale or decomposition level λ provides 4 sub-bands μ, containing different directional information of the image. LH gets horizontal information, HL gets vertical information and HH gets diagonal information. Additionally, a continue component LL is obtained from the decomposition. LL sub-band can be iterated applying a new 2D-DWT to obtain a new decomposition level λ 2. Classic presentation of 2D-DWT consists of placing, left to right and up to down, the first four sub-bands LL 1, LH 1, HL 1, and HH 1, and then replacing on each new scale the LL x subband for the new four sub-bands.

B. ALGORITHM I_ modes provide good performance for low spatial complexity blocks which show a global homogeneity in the MB as a whole. Otherwise, the MBs that are not considered as homogeneous obtain a lower residue if they are coded with smaller partitions, or. The selection is made based on the overall complexity of the MBs. It is easy to consider that using I_ mode can be faster and provides a better adjustment to the gradient for any MB, so it would minimize the cost function. This statement is not true, as there are other factors that have to be taken in account as signaling or the statistical coding. The homogeneity level of the MB is the threshold used within the algorithm that presents a 3 steps scheme: 1) Three level 2D-DWT. 2D-DWT is applied using three levels of decomposition by MB. The orthogonal Haar filter of length 2 is used. A hierarchical storage structure has been used in order to analyse the value of LL2 sub-band. 2) homogeneity analysis. This analysis is made by evaluating the 2x2 sub-bands of the third scale λ 3 in the transformed domain: LL 3, LH 3, HL 3, and HH 3. Every one of the 4 transformed coefficients of each sub-band provides directional information from the (2 λ x 2 λ ) blocks of the original MB. The same interpretation is applied through HL 3, LH 3 and LL 3 sub-bands and the corresponding blocks within MB. If the analysis results in a homogeneous MB, a partition (I_) is selected as the optimum one and the algorithm finishes. Otherwise, the third step is executed. 3) homogeneity analysis. In order to obtain the local homogeneity for every one of the 4 blocks (B i ) within the MB, the second level of decomposition is evaluated. The λ 2 sub-bands, LL 2, LH 2, HV 2, and HH 2 are analysed. Thus, every transformed coefficient of this scale provides frequency information of a area (2 λ x2 λ ), that corresponds with the sub-blocks that conform each B i. Only in the case that the four blocks B i were labeled as homogeneous, the MB is processed with an optimum partition of I_; otherwise the MB would be processed as I_. Figure 4 shows the global scheme of the architecture of the proposed algorithm (detailed for 3 levels). One key element of the algorithm is the metric that performs the homogeneity evaluation. For the first two steps dealing with and MBs, the directional homogeneity index Г has been applied to every sub-band µ belonging to decomposition levels 2 and 3. Г index is calculated as the quotient between the energy of each transformed coefficient Cµλ(k,l) and the variance of that sub-band σ 2 µλ as is shown in Eq. (2): Г(µ,λ) = C µλ (k,l)/ σ 2 µλ (2) The mentioned index is calculated for the 4 sub-bands μ. Then it is compared with the threshold Th LL, Th LH, Th HL and Th HH. Each threshold is calculated empirically for the scales 2 and 3. The global homogeneity condition implies that the index Г(µ,λ) does not exceed the corresponding threshold for the sub-bands.

2D-DWT LL 1 L H V 2D-DWT LL 2 L LH 2 H HL 2 V HH 2 D 2D-DWT LL 3 L LH 3 H HL 3 V HH 3 D Analysis Analysis NO HG NO HG Partición Partition YES YES Partition FIGURE 3. FAST INTRA ALGORITHM ARCHITECTURE FOR H.264/AVC RESULTS To evaluate the proposed algorithm efficiency three different 720p high definition video sequences with a wide range of spatial complexity have been tested: EBU Test Pattern 7, Shield and New Mobile-Calendar (Figure 4). It can be seen that sequences are highly detailed and have several areas with meaningful high frequency content. Otherwise, they also present homogeneous (low frequency) areas.

Figure 3. Sequences "EBU Test Pattern 7, Shields & New Mobile Calendar The first of them, EBU Test Pattern 7, contains patterns with very different resolution and spatial orientation, which provides a good environment to test the algorithm performance on the MB at those sites. The partitions are obtained by applying the proposed algorithm (shown in Figs. 5, 6 & 7), where can be observed how the partition map is accurately adjusted to the image topology setting I_ partitions where complex textures are present, for example on the text displayed at the bottom of the image, the percentage details at the corners and high frequencies data in the left window. In the homogeneous regions can be seen that the algorithm fits perfectly, choosing partition even on areas with high detail level but low energy, as can be observed at the bottom of the high frequencies window. Figure 4. 2D-DWT partition map for EBU Test Pattern 7 On the same way, it can be seen that the sized partitions are related to intermediate critical areas where I_ is a good balanced decision between I_ and I_. Figure 5. 2D-DWT partition map for Shields Qualitatively, the partition map shows that I_ is extensively used and fits with the high complex areas. Other less complex areas shows I_ or I_ partitions. The and partitions are distributed by certain randomness in small regions where it can be considered that MB s area contains a global homogeneity at MB and Block levels. The good behavior of the algorithm is ratified both by natural and synthetic sequences.

Figure 6. 2D-DWT partition map for New Mobile Calendar DISCUSSION This article aims to propose a fast method to get the partitions of MBs for Intraframe mode predictors in H.264/AVC. The described algorithm is based on the application of 2D-DWT locally in the MBs that allows obtaining the optimum partition a priori. By getting a fast estimation of the partition mode the algorithm avoids having to evaluate all the modes as needed in RDO. The experimental results that have been obtained get well correlated results both for synthetic and natural video sequences. The intrinsic features and complexity of the image is extremely related with the content. However, the homogeneity index has provided successful results for all the sequences used as test. REFERENCES [1] ITU-T Rec. H.264 and Final Draft International Standard of Joint Video Specification (ISO/IEC 14496-10 AVC) Joint Video Team. 2003, Doc. JVT-G050. [2] F. Pan, X. Lin, S. Rahardja, et al, Fast Intra Mode Decision Algorithm for H.264/AVC Video Coding, 2004 ICIP. [3] F. Fu; X. Lin; L. Xu, Fast intra prediction algorithm in H.264-AVC, 7th ICIP, 2004, Vol. 2, Pages:1191-94 [4] Yue Yu, Wang, Limin, A fast intra mode selection method for H.264 high profile, ICASSP 2008. Page(s):681 684. [5] C.Hsia; J.Chiang; et al; Fast Intra Prediction Mode Decision Algorithm for H.264/AVC Video Coding Standard, 3rd IIH-MSP 2007, Vol. 2, Pages:535-38. [6] T. Tsukuba, I. Nagayoshi, T. Hanamura and H. Tominaga, H.264 Fast Intra- Prediction mode decision based on Frequency characteristic. [7] Zhenyu Wei; Hongliang Li; King Ngi Ngan; An Efficient Intra Mode Selection Algorithm For H.264 Based On Fast Edge Classification, International Symposium on Circuits and Systems, 2007,Pages:3630 33. [8] Michael W. Marcellin, Michael J. Gormish, Ali Bilgin, Martin P. Boliek, An Overview of JPEG2000, Proc of IEEE Data Compression Conference, pp. 523-541, 2000.