Bit-Depth Scalable Coding Using a Perfect Picture and Adaptive Neighboring Filter *

Similar documents
One-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain

Performance Comparison between DWT-based and DCT-based Encoders

Optimum Quantization Parameters for Mode Decision in Scalable Extension of H.264/AVC Video Codec

Bit Allocation for Spatial Scalability in H.264/SVC

Adaptation of Scalable Video Coding to Packet Loss and its Performance Analysis

Sergio Sanz-Rodríguez, Fernando Díaz-de-María, Mehdi Rezaei Low-complexity VBR controller for spatialcgs and temporal scalable video coding

Block-based Watermarking Using Random Position Key

A COST-EFFICIENT RESIDUAL PREDICTION VLSI ARCHITECTURE FOR H.264/AVC SCALABLE EXTENSION

An Efficient Mode Selection Algorithm for H.264

2014 Summer School on MPEG/VCEG Video. Video Coding Concept

CROSS-PLANE CHROMA ENHANCEMENT FOR SHVC INTER-LAYER PREDICTION

Upcoming Video Standards. Madhukar Budagavi, Ph.D. DSPS R&D Center, Dallas Texas Instruments Inc.

H.264/AVC BASED NEAR LOSSLESS INTRA CODEC USING LINE-BASED PREDICTION AND MODIFIED CABAC. Jung-Ah Choi, Jin Heo, and Yo-Sung Ho

Unit-level Optimization for SVC Extractor

Efficient MPEG-2 to H.264/AVC Intra Transcoding in Transform-domain

VHDL Implementation of H.264 Video Coding Standard

Fast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE Gaurav Hansda

Complexity Reduced Mode Selection of H.264/AVC Intra Coding

International Journal of Emerging Technology and Advanced Engineering Website: (ISSN , Volume 2, Issue 4, April 2012)

Multi-View Image Coding in 3-D Space Based on 3-D Reconstruction

STACK ROBUST FINE GRANULARITY SCALABLE VIDEO CODING

Reduced 4x4 Block Intra Prediction Modes using Directional Similarity in H.264/AVC

Coding of Coefficients of two-dimensional non-separable Adaptive Wiener Interpolation Filter

Reduced Frame Quantization in Video Coding

BANDWIDTH-EFFICIENT ENCODER FRAMEWORK FOR H.264/AVC SCALABLE EXTENSION. Yi-Hau Chen, Tzu-Der Chuang, Yu-Jen Chen, and Liang-Gee Chen

Fraunhofer Institute for Telecommunications - Heinrich Hertz Institute (HHI)

Scalable Video Coding

Reducing/eliminating visual artifacts in HEVC by the deblocking filter.

EE Low Complexity H.264 encoder for mobile applications

OVERVIEW OF IEEE 1857 VIDEO CODING STANDARD

Video Quality Analysis for H.264 Based on Human Visual System

Professor, CSE Department, Nirma University, Ahmedabad, India

RATE DISTORTION OPTIMIZED TONE CURVE FOR HIGH DYNAMIC RANGE COMPRESSION

[30] Dong J., Lou j. and Yu L. (2003), Improved entropy coding method, Doc. AVS Working Group (M1214), Beijing, Chaina. CHAPTER 4

No-reference perceptual quality metric for H.264/AVC encoded video. Maria Paula Queluz

Homogeneous Transcoding of HEVC for bit rate reduction

Comparative and performance analysis of HEVC and H.264 Intra frame coding and JPEG2000

ARCHITECTURES OF INCORPORATING MPEG-4 AVC INTO THREE-DIMENSIONAL WAVELET VIDEO CODING

Title Adaptive Lagrange Multiplier for Low Bit Rates in H.264.

Low-cost Multi-hypothesis Motion Compensation for Video Coding

RATE-DISTORTION EVALUATION FOR TWO-LAYER CODING SYSTEMS. Philippe Hanhart and Touradj Ebrahimi

Image and Video Quality Assessment Using Neural Network and SVM

Performance analysis of Integer DCT of different block sizes.

The Performance of MANET Routing Protocols for Scalable Video Communication

For layered video encoding, video sequence is encoded into a base layer bitstream and one (or more) enhancement layer bit-stream(s).

Frequency Band Coding Mode Selection for Key Frames of Wyner-Ziv Video Coding

BLOCK MATCHING-BASED MOTION COMPENSATION WITH ARBITRARY ACCURACY USING ADAPTIVE INTERPOLATION FILTERS

Deblocking Filter Algorithm with Low Complexity for H.264 Video Coding

signal-to-noise ratio (PSNR), 2

ERROR-ROBUST INTER/INTRA MACROBLOCK MODE SELECTION USING ISOLATED REGIONS

H.264/AVC und MPEG-4 SVC - die nächsten Generationen der Videokompression

Digital Video Processing

SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC

A NOVEL SCANNING SCHEME FOR DIRECTIONAL SPATIAL PREDICTION OF AVS INTRA CODING

High Efficiency Video Coding (HEVC) test model HM vs. HM- 16.6: objective and subjective performance analysis

Further Reduced Resolution Depth Coding for Stereoscopic 3D Video

RECOMMENDATION ITU-R BT

A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames

Compression of Stereo Images using a Huffman-Zip Scheme

A Novel Deblocking Filter Algorithm In H.264 for Real Time Implementation

ISSN: An Efficient Fully Exploiting Spatial Correlation of Compress Compound Images in Advanced Video Coding

Fast Implementation of VC-1 with Modified Motion Estimation and Adaptive Block Transform

CONTENT ADAPTIVE COMPLEXITY REDUCTION SCHEME FOR QUALITY/FIDELITY SCALABLE HEVC

An Efficient Table Prediction Scheme for CAVLC

Template based illumination compensation algorithm for multiview video coding

Adaptive Up-Sampling Method Using DCT for Spatial Scalability of Scalable Video Coding IlHong Shin and Hyun Wook Park, Senior Member, IEEE

Advanced Video Coding: The new H.264 video compression standard

An Efficient Intra Prediction Algorithm for H.264/AVC High Profile

A Novel Statistical Distortion Model Based on Mixed Laplacian and Uniform Distribution of Mpeg-4 FGS

Comparative Study of Partial Closed-loop Versus Open-loop Motion Estimation for Coding of HDTV

Optimal Bitstream Adaptation for Scalable Video Based On Two-Dimensional Rate and Quality Models

H.264/AVC Baseline Profile to MPEG-4 Visual Simple Profile Transcoding to Reduce the Spatial Resolution

CHAPTER 4 REVERSIBLE IMAGE WATERMARKING USING BIT PLANE CODING AND LIFTING WAVELET TRANSFORM

Xin-Fu Wang et al.: Performance Comparison of AVS and H.264/AVC 311 prediction mode and four directional prediction modes are shown in Fig.1. Intra ch

Sample Adaptive Offset Optimization in HEVC

Research Article Optimal Multilayer Adaptation of SVC Video over Heterogeneous Environments

Cross-Layer Optimization for Efficient Delivery of Scalable Video over WiMAX Lung-Jen Wang 1, a *, Chiung-Yun Chang 2,b and Jen-Yi Huang 3,c

Overview: motion-compensated coding

CSEP 521 Applied Algorithms Spring Lossy Image Compression

Optimizing the Deblocking Algorithm for. H.264 Decoder Implementation

Fast Intra Prediction Algorithm for H.264/AVC Based on Quadratic and Gradient Model

Improving the quality of H.264 video transmission using the Intra-Frame FEC over IEEE e networks

PERFORMANCE ANALYSIS OF INTEGER DCT OF DIFFERENT BLOCK SIZES USED IN H.264, AVS CHINA AND WMV9.

EE 5359 Low Complexity H.264 encoder for mobile applications. Thejaswini Purushotham Student I.D.: Date: February 18,2010

Scalable Extension of HEVC 한종기

MCTF and Scalability Extension of H.264/AVC and its Application to Video Transmission, Storage, and Surveillance

Compression of 3-Dimensional Medical Image Data Using Part 2 of JPEG 2000

MULTIMEDIA COMMUNICATION

A deblocking filter with two separate modes in block-based video coding

VIDEO AND IMAGE PROCESSING USING DSP AND PFGA. Chapter 3: Video Processing

Adaptive Quantization for Video Compression in Frequency Domain

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding.

An Improved H.26L Coder Using Lagrangian Coder Control. Summary

Advanced Encoding Features of the Sencore TXS Transcoder

LBP-GUIDED DEPTH IMAGE FILTER. Rui Zhong, Ruimin Hu

NEW CAVLC ENCODING ALGORITHM FOR LOSSLESS INTRA CODING IN H.264/AVC. Jin Heo, Seung-Hwan Kim, and Yo-Sung Ho

COMPARISON OF HIGH EFFICIENCY VIDEO CODING (HEVC) PERFORMANCE WITH H.264 ADVANCED VIDEO CODING (AVC)

(Invited Paper) /$ IEEE

ARTICLE IN PRESS. Signal Processing: Image Communication

STANDARD COMPLIANT FLICKER REDUCTION METHOD WITH PSNR LOSS CONTROL

Transcription:

Bit-Depth Scalable Coding Using a Perfect Picture and Adaptive Neighboring Filter * LU Feng ( 陆峰 ) ER Guihua ( 尔桂花 ) ** DAI Qionghai ( 戴琼海 ) XIAO Hongjiang ( 肖红江 ) Department of Automation Tsinghua Universit Beijing 100084 China Abstract: Bit-depth scalabilit is a new research field in the on-going scalable extension of the H.264/AVC (SVC) video coding standard. The ke point is to accuratel predict the enhancement laer whose bit depth is 10 or more from the 8 bit base laer. An improved inter-laer prediction scheme for bit-depth scalabilit was developed that ensures compatibilit with the standard and improves the encoding efficienc. The scheme uses a perfect 8 bit picture with an adaptive neighbor filter whose coefficients are optimized b minimizing the block distortion between the 8 bit reconstructed picture and the perfect picture to achieve a more precise 10 bit prediction based on the filtered picture. Double arithmetic precision is used to further improve the encoding efficienc. Experimental results show that the scheme outperforms the recent joint video team (JVT) proposal in the joint scalable video model (JSVM). Ke words: H.264; SVC; bit-depth scalabilit; inter-laer prediction Introduction The co-existence of conventional 8 bit and higher bit-depth video has stimulated the demand for bit-depth scalable coding especiall in fields such as digital arts medical image processing and high qualit computer games. Bit-depth scalable coding ensures that coded video sequences can naturall switch from 8 bit to higher bit depths. Technical difficulties occur because various video/image processing devices and different tone mapping methods ma be used to generate the input 8 bit sequence from the higher bit sequence [12] and because the 8 bit to higher bit prediction is actuall an inverse tone mapping process without prior knowledge of the method used for the initial 8 bit generation stage. Several methods have been proposed to tackle these problems based on the scalable extension of H.264/AVC (SVC) [3] such as the simplest linear scaling [4] the combinational scheme of scale factors and offset [56] and look-up table (LUT) methods [7]. Recent results have shown that the last method outperforms the others. The basic idea of the LUT method is to let both the encoder and decoder refer to the same table which assigns each possible 8 bit value a determined higher bit value. However in practice different tone mapping methods ma cause two pixels to have ver different higher bit values in the higher bit picture even though their 8 bit values in the corresponding 8 bit picture are the same. This alterable mapping between the 8 bit picture and the higher bit picture is the main cause of the prediction precision reduction for the LUT. This paper presents a perfect 8 bit picture method that produces more consistent mappings. The picture is directl down-converted from the higher bit picture b the pre-calculated LUT and is then used to precisel predict the higher bit picture with the same LUT. An adaptive neighboring filter is used to filter the reconstructed 8 bit picture. The filter coefficients are

adaptivel calculated to minimize the difference between the reconstructed 8 bit picture and the perfect picture. Unlike in the original LUT method this method does not use the reconstructed 8 bit picture but the filtered picture to predict the higher bit picture. Since the filtered picture will approximate the perfect picture the inter-laer prediction precision can be improved. 1 Improved Bit-Depth Scalable Coding Scheme 1.1 Scheme architecture The framework of the bit-depth scalable coding including the proposed inter-laer prediction scheme is shown in Fig. 1. Without loss of generalit there are assumed to be onl two bit-depth laers to be encoded: the original 10 bit video sequence as the enhancement laer and the conventional 8 bit version of the same video sequence as the base laer. The base laer encoding process is completel compatible with H.264/AVC and the generated 8 bit reconstruction picture serves as the input for the inter-laer prediction. The LUT is calculated for the 10 bit laer encoding when the 8 bit reconstruction is available. This LUT is then used to generate the perfect picture from the 10 bit input picture. Then this perfect picture and the 8 bit reconstruction are used to design an adaptive neighboring filter. After filtering the filtered picture and the LUT are used to give the final prediction for the 10 bit picture. The enhancement laer encoder then encodes the difference between the original and predicted 10 bit pictures as well as the LUT information and the quantized filter coefficients into the enhancement laer stream. The final output of the scalable encoder is a mixed stream of the 8 bit and 10 bit laers. As shown in the dashed box in Fig. 1 this scheme consists of down-converting filtering and up-converting steps. Therefore this scheme is called the DFU scheme. Fig. 1 Bit-depth scalable coding scheme 1.1.1 Perfect 8 bit picture b down-converting The value of each component (i.e. Y Cb and Cr) is limited to the range of 0 to 255 in an 8 bit pixel and the range of 0 to 1023 in a 10 bit pixel. The LUT can onl include 255 10 bit values because the numbers for the 10 bit and 8 bit values should be the same to form one-to-one mapping. Therefore most of the 10 bit values cannot directl correspond to 8 bit values. The prediction of the 10 bit pixel V( x ) from the pixel (x ) in the 8 bit picture is then as follows. If V( x ) can be found in the LUT and its corresponding 8 bit value is v then the value v is called the perfect 8 bit value of the pixel (x ) because V( x ) can be predicted b the corresponding 10 bit value of v in the LUT without error.

However as mentioned before most 10 bit V( x ) cannot be found in the LUT. In such cases v = arg min ( V( x ) LUT( v) ) (1) + { v v } is used as the perfect value where v and v + are constrained b (a) LUT( v ) and LUT( v + ) both exist in the LUT; + (b) LUT( v ) < V( x ) and LUT( v ) > V( x ) ; (c) There are no other 10 bit values between LUT( v ) and LUT( v + ) in the LUT. This perfect v provides the most precise correlation to the 10 bit V( x ) with the minimum error V( x ) LUT( v). An 8 bit picture is regarded as perfect onl if each pixel in it has the perfect v relative to the 10 bit picture for that particular LUT. With this LUT the perfect 8 bit picture provides the best prediction of the original 10 bit picture in terms of integer precision. 1.1.2 Adaptive neighboring filter An adaptive neighboring filter was then used to filter the reconstructed 8 bit picture to make the filtered picture as similar to the perfect 8 bit picture as possible. This process makes the reconstructed picture more suitable for predicting the 10 bit picture. p be a pixel in the reconstructed picture and Let x qx be the corresponding pixel in the perfect picture. The filter coefficients are { c i j} and the band is N. After filtering each pixel p x will be replaced b N N x i= N j= N i j x i j (2) pˆ = c p + + The total difference for ever pixel (x ) between the filtered 8 bit picture and the perfect picture is minimized b optimizing the MSE expression 2 arg min ( pˆ x qx ) (3) x Differentiating Eq. (3) b each cmn and setting the differentials equal to zero gives p p c = q p x+ i + j x+ m + n ij x x+ m + n x ij x mn = N... N (4) Gaussian elimination is then used to solve for the filter coefficients { ci j} from these linear equations. The solutions are alwas small fractions so the are transformed into a set of integers with the integer quantizer q cˆ = floor( c (1 << q) + 0.5) (5) i j i j The encoder will finall perform filtering b p N N 10 ˆ ˆ x = clip(( c p ij x i j) q 02 1) i= N j= N + + >> (6) The integer filter coefficients { c ˆi j} and q are later inserted into the encoded stream and transmitted to the decoder to enable the same filtering process during decoding. 1.1.3 Prediction b up-converting The up-converting processes the filtered 8 bit picture using the LUT to predict the 10 bit picture. For the up-converting each 8 bit pixel with the value v 8 has a corresponding 10 bit value v 10 that is found b simpl looking up the corresponding 10 bit value in the LUT. For example the 8 bit value 180 gives the 10 bit prediction 500 using the LUT shown in Fig. 2. Thus the 10 bit picture from the filtered 8 bit picture will be much closer to the original 10 bit picture than the one up-converted using the standard inter-laer prediction scheme [7]. Fig. 2 Correspondence of 8 bit and 10 bit pixel values for the test sequence Waves 1.2 Double floating point arithmetic for proposed scheme Section 1.1.1 described how the perfect 8 bit picture for inverse tone mapping is developed using a specific LUT. Even with this method 10 bit values which are not in the LUT still do not have a corresponding 8 bit value in the LUT so there is some prediction error regardless of how the LUT is designed. This is due to the integer precision of the LUT and the magnitude of the 10 bit values (i.e. 1024) which are 4 times as large as the 8 bit values (i.e. 256). Thus double floating point arithmetic is used to calculate the perfect picture with an additional method to predict a 10 bit pixel V( x ) when V( x ) is not in the LUT. After using Eq. (1) to

identif two 10 bit values LUT( v ) and LUT( v + ) an 8 bit integer v and its prediction LUT( v ) are obtained which are the closest to but not equal to V( x ). However an other 10 bit value between LUT( v ) and LUT( v + ) will also be predicted b LUT( v ). This conflict is corrected b calculating the perfect value using linear interpolation with double floating point precision V( x ) LUT( v ) + v= v + ( v v ) (7) + LUT( v ) LUT( v ) Thus the perfect picture becomes a double floating point picture rather than an 8 bit integer picture so that the mapping for each 10 bit value is unique. The subsequent filtering process later will also use the double floating point linear interpolation to map the filtered picture to the final 10 bit picture. The memor for storing the double floating point perfect picture is reduced b combining the downconversion and the filter coefficients calculation with the filtering and up-conversion also combined together. Thus there is no need to store the double floating point Fig. 3 Experimental results for the 4CIF Viper sequences with 8 bit base laers and 10 bit enhancement laers at 50 Hz

values as the will be immediatel used and dropped while the are available in the pipeline. This double floating point scheme is called DFU-D. 2 Experimental Results These schemes were implemented in the recent JSVM codec [8] for comparison with the latest scheme reported b HHI [9]. The test sequences include Freewa Plane Staples Waves and CapitolRecords. All of these are standard sequences with nonlinear LUTs. The encoding used 2 bit depth laers a frame rate of 50 Hz and QP = 12 17 22 27 and 32. The experimental results shown in Fig. 3 show that the average peak signal noise ratio (PSNR a ) gain with the DFU-D scheme is 0.51 db over the standard LUT scheme and the average bit rate saving is 12.23%. For the DFU scheme the PSNR a again is 0.39 db and the bit rate saving is 9.47%. Thus these schemes both increase the coding efficienc of the bit-depth scalable coding with the double precision scheme being more effective for the inter-laer prediction. In addition these schemes will have better performance when applied to sequences such as Plane which have larger error variances of mismatch during the mapping from their 8 bit to 10 bit pixel values. This mismatch is a major problem for the LUT which degrades the performance. 3 Conclusions This paper presents an inter-laer prediction scheme for bit-depth scalable coding in the H.264/AVC SVC. The LUT method is used for the down-converting filtering and up-converting processes to observabl improve the prediction efficienc. Double arithmetic is used instead of integer arithmetic to further improve the coding efficienc. Experimental results demonstrate the advantages of this scheme compared favorabl to the recent JVT proposal. References [1] Drago F Mszkowski K Annen T et al. Adaptive logarithmic mapping for displaing high contrast scenes. Computer Graphics Forum 2003 22: 419-426. [2] Mantiuk R Mszkowski K Seidel H P. A perceptual framework for contrast processing of high dnamic range images. In: Proceedings of APGV 2005: 2nd Smposium on Applied Perception in Graphics and Visualization. A Coruna Spain 2005: 87-94. [3] ITU-T and ISO/IEC JTC1. Joint draft ITU-T Rec. H.264 ISO/IEC 14496-10 / Amd.3 scalable video coding. In: JVT-X201. Geneva Switzerland 2007. [4] Gao Yonging Wu Yuwen. Bit depth scalabilit. In: JVT-V061. Marrakech Morocco 2007. [5] Liu Shan Kim W S Vetro A. Bit-depth scalable coding for high dnamic range video. In: SPIE Conference on Visual Communications and Image Processing (VCIP). San Jose USA 2008. [6] Liu Shan Vetro A Kim W S. Inter-laer prediction for SVC bit-depth scalabilit. In: JVT-X201. Geneva Switzerland 2007. [7] Segall A Kerofsk L Lei S. New results with the tone mapping SEI message. In: JVT-U041. Hangzhou China 2006. [8] Joint Video Team (JVT) of ISO/lEC MPEG & ITU-T VCEG. Joint scalable video model JSVM-12. In: JVT-Y202. Shenzhen China 2007. [9] Winken M Schwarz H Marpe D et al. CE2: SVC bit-depth scalable coding. In: JVT-X057. Geneva Switzerland 2007.