Bit Allocation for Spatial Scalability in H.264/SVC

Similar documents
SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC

One-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain

Adaptation of Scalable Video Coding to Packet Loss and its Performance Analysis

FAST SPATIAL LAYER MODE DECISION BASED ON TEMPORAL LEVELS IN H.264/AVC SCALABLE EXTENSION

Sergio Sanz-Rodríguez, Fernando Díaz-de-María, Mehdi Rezaei Low-complexity VBR controller for spatialcgs and temporal scalable video coding

CONTENT ADAPTIVE COMPLEXITY REDUCTION SCHEME FOR QUALITY/FIDELITY SCALABLE HEVC

Complexity Reduced Mode Selection of H.264/AVC Intra Coding

Scalable Video Coding

A COST-EFFICIENT RESIDUAL PREDICTION VLSI ARCHITECTURE FOR H.264/AVC SCALABLE EXTENSION

Performance Comparison between DWT-based and DCT-based Encoders

PAPER Optimal Quantization Parameter Set for MPEG-4 Bit-Rate Control

An Efficient Mode Selection Algorithm for H.264

Optimal Bitstream Adaptation for Scalable Video Based On Two-Dimensional Rate and Quality Models

Block-based Watermarking Using Random Position Key

Rate Distortion Optimization in Video Compression

Fast Mode Decision for H.264/AVC Using Mode Prediction

STACK ROBUST FINE GRANULARITY SCALABLE VIDEO CODING

A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames

Optimum Quantization Parameters for Mode Decision in Scalable Extension of H.264/AVC Video Codec

Fast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE Gaurav Hansda

Unit-level Optimization for SVC Extractor

Reduced Frame Quantization in Video Coding

Bit-Depth Scalable Coding Using a Perfect Picture and Adaptive Neighboring Filter *

ESTIMATION OF THE UTILITIES OF THE NAL UNITS IN H.264/AVC SCALABLE VIDEO BITSTREAMS. Bin Zhang, Mathias Wien and Jens-Rainer Ohm

Effective Relay Communication for Scalable Video Transmission

Research Article Optimal Multilayer Adaptation of SVC Video over Heterogeneous Environments

Title Adaptive Lagrange Multiplier for Low Bit Rates in H.264.

Fast Mode Decision Algorithm for Scalable Video Coding Based on Probabilistic Models

An Efficient Table Prediction Scheme for CAVLC

Reduced 4x4 Block Intra Prediction Modes using Directional Similarity in H.264/AVC

NEW CAVLC ENCODING ALGORITHM FOR LOSSLESS INTRA CODING IN H.264/AVC. Jin Heo, Seung-Hwan Kim, and Yo-Sung Ho

Pre- and Post-Processing for Video Compression

For layered video encoding, video sequence is encoded into a base layer bitstream and one (or more) enhancement layer bit-stream(s).

BANDWIDTH-EFFICIENT ENCODER FRAMEWORK FOR H.264/AVC SCALABLE EXTENSION. Yi-Hau Chen, Tzu-Der Chuang, Yu-Jen Chen, and Liang-Gee Chen

Video Watermarking Algorithm for H.264 Scalable Video Coding

IBM Research Report. Inter Mode Selection for H.264/AVC Using Time-Efficient Learning-Theoretic Algorithms

Quality Scalable Low Delay Video Coding using Leaky Base Layer Prediction

Quality versus Intelligibility: Evaluating the Coding Trade-offs for American Sign Language Video

VIDEO compression is generally processed in both. Joint Temporal Spatial Bit Allocation for Video Coding With Dependency

Upcoming Video Standards. Madhukar Budagavi, Ph.D. DSPS R&D Center, Dallas Texas Instruments Inc.

Fast Mode Assignment for Quality Scalable Extension of the High Efficiency Video Coding (HEVC) Standard: A Bayesian Approach

Scalable Video Coding in H.264/AVC

On the impact of the GOP size in a temporal H.264/AVC-to-SVC transcoder in Baseline and Main Profile

Fraunhofer Institute for Telecommunications - Heinrich Hertz Institute (HHI)

MCTF and Scalability Extension of H.264/AVC and its Application to Video Transmission, Storage, and Surveillance

Comparative Study of Partial Closed-loop Versus Open-loop Motion Estimation for Coding of HDTV

Pattern based Residual Coding for H.264 Encoder *

Cross-Layer Optimization for Efficient Delivery of Scalable Video over WiMAX Lung-Jen Wang 1, a *, Chiung-Yun Chang 2,b and Jen-Yi Huang 3,c

Improved Context-Based Adaptive Binary Arithmetic Coding in MPEG-4 AVC/H.264 Video Codec

ERROR-ROBUST INTER/INTRA MACROBLOCK MODE SELECTION USING ISOLATED REGIONS

Frequency Band Coding Mode Selection for Key Frames of Wyner-Ziv Video Coding

A Novel Deblocking Filter Algorithm In H.264 for Real Time Implementation

A Novel Statistical Distortion Model Based on Mixed Laplacian and Uniform Distribution of Mpeg-4 FGS

QUANTIZER DESIGN FOR EXPLOITING COMMON INFORMATION IN LAYERED CODING. Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose

Smoooth Streaming over wireless Networks Sreya Chakraborty Final Report EE-5359 under the guidance of Dr. K.R.Rao

Homogeneous Transcoding of HEVC for bit rate reduction

EE Low Complexity H.264 encoder for mobile applications

ARCHITECTURES OF INCORPORATING MPEG-4 AVC INTO THREE-DIMENSIONAL WAVELET VIDEO CODING

Quality Optimal Policy for H.264 Scalable Video Scheduling in Broadband Multimedia Wireless Networks

IMPROVED CONTEXT-ADAPTIVE ARITHMETIC CODING IN H.264/AVC

An Improved H.26L Coder Using Lagrangian Coder Control. Summary

Mode-Dependent Pixel-Based Weighted Intra Prediction for HEVC Scalable Extension

Deblocking Filter Algorithm with Low Complexity for H.264 Video Coding

Bit Allocation and Encoded View Selection for Optimal Multiview Image Representation

(Invited Paper) /$ IEEE

Region-Based Rate-Control for H.264/AVC. for Low Bit-Rate Applications

Sergio Sanz-Rodríguez, Fernando Díaz-de-María In-layer multibuffer framework for ratecontrolled scalable video coding

OVERVIEW OF IEEE 1857 VIDEO CODING STANDARD

MULTI-BUFFER BASED CONGESTION CONTROL FOR MULTICAST STREAMING OF SCALABLE VIDEO

H.264/AVC und MPEG-4 SVC - die nächsten Generationen der Videokompression

H.264/AVC BASED NEAR LOSSLESS INTRA CODEC USING LINE-BASED PREDICTION AND MODIFIED CABAC. Jung-Ah Choi, Jin Heo, and Yo-Sung Ho

EFFICIENT PU MODE DECISION AND MOTION ESTIMATION FOR H.264/AVC TO HEVC TRANSCODER

ARTICLE IN PRESS. Signal Processing: Image Communication

Coding for the Network: Scalable and Multiple description coding Marco Cagnazzo

Department of Electrical Engineering, IIT Bombay.

Context based optimal shape coding

Real-time and smooth scalable video streaming system with bitstream extractor intellectual property implementation

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework

The Performance of MANET Routing Protocols for Scalable Video Communication

EE 5359 Low Complexity H.264 encoder for mobile applications. Thejaswini Purushotham Student I.D.: Date: February 18,2010

A BACKGROUND PROPORTION ADAPTIVE LAGRANGE MULTIPLIER SELECTION METHOD FOR SURVEILLANCE VIDEO ON HEVC

Optimized architectures of CABAC codec for IA-32-, DSP- and FPGAbased

International Journal of Emerging Technology and Advanced Engineering Website: (ISSN , Volume 2, Issue 4, April 2012)

IEEE TRANSACTIONS ON BROADCASTING, VOL. 51, NO. 4, DECEMBER

Evaluation of H.264/AVC Coding Elements and New Improved Scalability/Adaptation Algorithm/Methods

CMPT 365 Multimedia Systems. Media Compression - Video

Investigation of the GoP Structure for H.26L Video Streams

Chapter 10. Basic Video Compression Techniques Introduction to Video Compression 10.2 Video Compression with Motion Compensation

Conference object, Postprint version This version is available at

High-Performance VLSI Architecture of H.264/AVC CAVLD by Parallel Run_before Estimation Algorithm *

Optimal Estimation for Error Concealment in Scalable Video Coding

JPEG 2000 vs. JPEG in MPEG Encoding

Transform Kernel Selection Strategy for the H.264

Advances of MPEG Scalable Video Coding Standard

RATE-DISTORTION EVALUATION FOR TWO-LAYER CODING SYSTEMS. Philippe Hanhart and Touradj Ebrahimi

A Dedicated Hardware Solution for the HEVC Interpolation Unit

High Efficiency Video Coding (HEVC) test model HM vs. HM- 16.6: objective and subjective performance analysis

Fast Wavelet-based Macro-block Selection Algorithm for H.264 Video Codec

Robust Wireless Delivery of Scalable Videos using Inter-layer Network Coding

H.264/AVC Baseline Profile to MPEG-4 Visual Simple Profile Transcoding to Reduce the Spatial Resolution

Transcription:

Bit Allocation for Spatial Scalability in H.264/SVC Jiaying Liu 1, Yongjin Cho 2, Zongming Guo 3, C.-C. Jay Kuo 4 Institute of Computer Science and Technology, Peking University, Beijing, P.R. China 100871 { 1 liujiaying, 3 guozongming}@icst.pku.edu.cn Department of Electrical Engineering, University of Southern California, Los Angeles, CA 90089 { 2 yongjinc, 4 cckuo}@sipi.usc.edu Abstract We propose a model-based spatial layer bit allocation algorithm for H.264/SVC in this work. The spatial scalability of H.264/SVC is achieved by a multi-layer approach, where an enhancement layer is bound by the dependency on its preceding layers. The inter-layer dependency is decoupled in our analysis by a careful examination of the signal flow in the H.264/SVC encoder. We show that the rate and the distortion (R- D) characteristics of a dependent layer can be represented by a number of independent functions with a group of pictures (GOP) as a basic coding unit. Finally, a low complexity spatial layer bit allocation scheme is developed using the proposed GOP-based R- D models. It is shown by experimental results that our proposed bit allocation algorithm can achieve the coding performance close to the optimal R-D performance of full search and is significantly improved from current reference software JSVM. I. INTRODUCTION H.264/SVC is recently standardized as a scalable extension of H.264/AVC standard [1]. A SVC video stream is equipped with great flexibility and adaptability in terms of frame rates, display resolutions and quality levels. These characteristics of SVC video offer attractive solutions for service providers who are in face of a heterogeneous service environment. However, the provision of scalability is traded-off by lower coding efficiency, which has been the major reason that prevents scalable video from its practical employment. In this work, we focus on the problem of spatial bit allocation for H.264/SVC, which is a challenging task due to inter-layer dependency. The H.264/SVC reference encoder (Joint Scalable Video Model, JSVM) specifies a bottom-up approach to produce a scalable bit stream [2]. That is, the encoding process starts from the bottom-most base layer (BL) and subsequent enhancement layers (EL s) are encoded in an ordered manner. However, the current version of JSVM does not support any encoding tool for rate control among spatial layers. Bit allocation algorithms for inter-frame dependency have been examined since MPEG. For example, Ramachandran et al. studied the dependent bit allocation problem with a trellis-based solution framework in [3]. Although it can yield the optimal solution, the complexity of the solution grows exponentially as the number of dependent frames increases. For this reason, it can be used only as a performance benchmark rather than a practical solution. Lin and Ortega [4] speeded up the scheme by encoding the source with only a few quantization steps and using interpolation to find the rate distortion value for other quantization steps. This scheme used the spline interpolation for I frames and the piecewise linear interpolation for P frames. However, the complexity of this algorithms is still high since the source video has to be encoded several times. So it can not be practically extended to solve a dependent bit allocation problem involved with multiple layers in H.264/SVC. In the latest bit allocation algorithms proposed for H.264/SVC, the property of interlayer dependency is not properly addressed in the problem formulation. Liu et al. [5] proposed a rate control algorithm for the spatial and coarse-grain SNR (CGS) scalability of H.264/SVC. The proposed algorithm operates on a fixed rate of each layer and implements an MB-layer bit allocation scheme. The spatial-layer bit allocation problem is not addressed at all. In this work, we consider the inter-layer bit allocation problem for H.264/SVC with spatial scalability using a modelbased approach. To provide a simple yet effective model of the R-D characteristics of dependent layers in a GOP, we focus on the input and the output visual signals to the EL quantizer and propose a method to decouple the influence of each layer s quantization choice on the R-D characteristics of a dependent enhancement layer. Simply speaking, the complexity of the input signal to the EL quantizer is modeled as a function of the BL quantization and, as a result, the impact of the EL quantization is successfully isolated in the proposed R-D model. The details are given in Sec. III. The proposed algorithm allows to allocate bits simultaneously while providing a trade-off between base and enhancement layers so as to improve the efficiency of spatial scalable coding. The rest of the paper is organized as follows. The spatial layer bit allocation problem is formulated in Sec. II. The dependent GOP-based R-D model is analyzed and simplified in Sec. III. The bit allocation algorithm is proposed in Sec. IV. Experimental results are given in Sec. V. Finally, concluding remarks are given in Sec. VI. II. PROBLEM FORMULATION The rate control of our proposed H.264/SVC encoder is achieved by the spatial layer bit allocation followed by the frame layer bit allocation. We first determine the bit budget allocation over spatial layers, and then assign each frame in the same spatial layer a certain amount of bit budget for its encoding. Since the frame-layer bit allocation problem has a lot of precedents in the literature, we mainly focus on the spatial layer bit allocation in this work. 978-1-4244-2295-1/08/$25.00 2008 IEEE 278 MMSP 2008

In H.264/SVC, the spatial scalability is achieved by a multilayer coding approach. That is, a video signal with a high spatial resolution is encoded in such a way that the output bit stream provides multiple layers of various spatial resolutions. When a bit budget constraint is imposed, it is essential for an encoder to efficiently distribute the bit budget to each spatial layer for the optimal coding efficiency. The rate and the distortion of a coded video stream are determined by the choice of quantization step-size Q. In the following, the problem is formulated as the spatial layer Q-decision problem. Let N be the number of spatial layers. R k (Q 1,..., Q k ) and D k (Q 1,..., Q k ) are GOP-based distortion and rate model of the k-th layer with respect to a quantization vector (Q 1,..., Q k ). Given the bit budget R total of a GOP, the bit allocation problem can be formulated as N Q = (Q 1,..., Q N) = arg min ω k D k (Q 1,, Q k ) Q k Q N s.t. R k (Q 1,..., Q k ) R total, (1) where Q = (Q 1,..., Q N ) is the selected Q vector for all spatial layers, Q is the set of all quantization candidates, and ω k is the weighting factor representing the corresponding importance of the k-th layer. Note that Q 1,, Q k 1 in the R-D function of the k-th layer indicate that the coding performance of the k-th layer is dependent upon previous coded (k 1) layers. The Lagrangian multiplier method converts the constrained optimization problem in Eq. (1) to an equivalent unconstrained optimization problem by introducing the Lagrangian cost function as Q = arg min J(Q, λ), Q k Q ( N N ) (2) J(Q, λ) = ω k D k ( ) + λ R k ( ) R total. where λ is the Lagrangian multiplier. Without loss of generality, we consider bit allocation in a simple two-layer scenario first. The solution can be easily generalized to a multi-layer scenario. Mathematically, the Lagrangian cost function can be expressed as J(Q, λ) = ω 1 D 1 (Q 1 ) + ω 2 D 2 (Q 1, Q 2 ) +λ (R 1 (Q 1 ) + R 2 (Q 1, Q 2 ) R total ). In the following discussion, we assume the equal importance of these two layers; namely, ω 1 = ω 2 = 1. To solve the dependent quantization decision problem in Eq. (3), one solution is to conduct a full search over all possible combinations of admissible quantization choices. However, since the search space grows exponentially as the number of layers increases, the complexity of full search is prohibitively large. We address the complexity issue by modeling the R-D characteristics of dependent layers in the next section. III. DISTORTION AND RATE MODELING OF DEPENDENT LAYERS Generally speaking, the R-D characteristics of a dependent layer are represented by a function consisting of quantization step sizes for the reference layer and the dependent layer. The impact of an individual quantization parameter on the R-D characteristics of the dependent layer has to be known to solve the bit allocation problem. For dependent R-D modeling, we convert the multi-variable function into a number of singlevariable functions which simplifies the solution framework to the bit allocation problem greatly. A. GOP-Based Distortion Modeling It is a challenge in distortion modeling of dependent layers; namely, to determine the impact of each individual variable on the distortion of the target layer. To achieve this goal, we attempt to analyze the processing of the input video signal in the H.264/SVC encoder. In Fig. 1, we depict the H.264/SVC encoder whose input is a CIF sequence and output is a bitstream consisting of two spatial layers, i.e., a base layer (BL) (3) EL Encoder CIF Sequence High Frequency of Original Seq. BL Distortion Q2 EL Bit Stream Down-sample Filter Up-sample Filter QCIF Sequence Low Frequency of Original Seq. Q1 BL Reconstruct BL Encoder Fig. 1. Decomposition of the H.264/SVC encoder diagram. 279

and a dependent enhancement layer (EL). We are mainly interested in the input and the output of the EL quantizer (i.e., Q 2 ) in Fig. 1. We first obtain a low frequency component of the input CIF video by the down-sampling process. The lowpass filtered signal is fed into the BL encoder to produce the BL reconstruction signal, which corresponds to a quantized version of the low-frequency video using quantization step size Q 1. The reconstructed BL is used as a basis to predict the low frequency component of the input to reduce inter-layer redundancy. Then, we use the differential signal between the original and the extrapolated BL signals as the input to the EL encoder. Because of the influence of the BL encoding on the input to the EL encoder, the differential signal depends on the BL encoding with quantization step size Q 1. EL Input Seq. - Differential Seq. (a) Akiyo (b) City BL Distortion + High Frequency of Original Input Seq. (c) Mobile (d) Football Q1 = 14 Q2 EL Bit Stream Fig. 3. Illustration of the linear relationship between the GOP complexity and Q 1. EL Distortion Q1 = 44 Fig. 2. Coding of the enhancement layer. The differential signal actually consists of two parts: 1) the high frequency component and 2) the distortion in the low frequency component due to the quantization effect in the BL, which is denoted as the BL Distortion in Fig. 2. The second part controls the coupling between BL s and EL s encodings. If the BL distortion term is much smaller than the high frequency term, such a coupling effect can be ignored. Fig. 2 illustrates the influence of different Q 1 values in the BL encoding on the EL encoding. The GOP complexity proposed in [6] can be used to capture the characteristics of the input signal into the EL quantizer. Thus, we adopt the GOP complexity estimation method given in [6] to investigate the relationship between Q 1 and the EL input differential signal. Fig. 3 shows the relationship between the estimated GOP complexity (C) of the differential input and the BL encoder step size Q 1 for various test sequences. The data in these figures indicate a linear model between C and Q 1, which can be written as C(Q 1 ) = c Q 1 + θ, (4) where c and θ are model parameters. With the input to the EL encoder characterized by Eq. (4), the output of the EL encoder is influenced by the EL quantization step size Q 2 only. Fig. 4 shows the distortion of the enhancement layer, denoted by D 2 (Q 1, Q 2 ), as a function Fig. 4. The EL distortion as a function of the complexity C and quantization step size Q 2 for the Mobile sequence. of C(Q 1 ) and Q 2. In Fig. 5, we plot the projection of the 3D plot onto the D 2 C plane. We have the following main observations from these experiments: 1) For a fixed Q 2 value, we observe a linear region between between D 2 (Q 1, Q 2 ) and C(Q 1 ) when C(Q 1 ) is small. 2) The slope in the linear region reflects the dependency between layers since it is a function of Q 1 and Q 2. 3) Distortion D 2 (Q 1, Q 2 ) becomes flat after the inflection point which is located at QP 1 QP 2 6, where QP 1 and QP 2 are quantization parameters for quantization step sizes Q 1 and Q 2, respectively. When QP 2 QP 1 = 6, we have that the corresponding quantization step size is halved, i.e., Q 1 = 1 2 Q 2. Intuitively, the flat distortion phenomenon can be explained as follows. When Q 2 is relatively small as compared with Q 1, the EL 280

distortion is primarily determined by Q 2 alone. (a) Mobile B. GOP-Based Rate Model To derive the GOP-based rate model of the enhancement layer, denoted by R 2 (Q 1, Q 2 ), we plot R 2 (Q 1, Q 2 ) as a function of R 1 (Q 1 ) for some video sequences in Fig. 7. We see a set of approximately parallel lines. In other words, the inter-layer rate dependency is relatively low. Thus, we propose the following rate model R 2 (Q 1, Q 2 ) = { r R 1 (Q 1 ) + (s r)r 1 (Q 2 ), QP 1 QP 2, s R 1 (Q 2 ), QP 1 < QP 2. (6) where s and r are the slope of the line when QP 1 = QP 2 and QP 2 fixed, respectively. The proposed rate model is plotted in Fig. 8. Fig. 5. (b) Football Illustration of distortion dependency between two consecutive layers. (a) Mobile QP1 = QP2-6 QP2 fixed D2 Q1, Q2 pi = f (Q2) D2 = m C + n C Q1 Fig. 7. (b) Football Illustration of rate dependency between two consecutive layers. Fig. 6. The proposed distortion model for dependent layers. Based on the above observation, we propose the following distortion model D 2 (Q 1, Q 2 ) = { p i C(Q 1 ) + (m p i ) C(Q 2 /2) + n, QP 1 QP 2 6, m C(Q 2 /2) + n, QP 1 > QP 2 6. (5) where p i is the slope, which is a function of Q 2, and m and n are the slope and the intercept of the line with Q 1 = 1 2 Q 2 as indicated in Fig. 6. IV. PROPOSED SPATIAL-LAYER BIT ALLOCATION ALGORITHM For the single variable rate and distortion models, we employ the models proposed by Kamaci et al. [7] of the following form: R(Q i ) = a Q α i, and D(Q i ) = b Q β i, (7) where Q i is the quantization step, a, b, α and β are model parameters. 281

QP1 = QP2 QP2 fixed encoding iterations exceed the pre-defined maximal threshold (50 iterations for each layer in our experiments). R2 Q1, Q2 Slope: r Slope: s TABLE I THE SETTING OF TWO-LAYER SPATIAL SCALABLE CODING Layer No. Spatial Resolution Temporal Resolution Initial QP 0 QCIF 15 32 1 CIF 15 32 Fig. 8. R1 Q1 The proposed rate model for dependent layers. Then, the Lagrangian cost function in Eq. (3) can be solved with a closed-form solution by applying the proposed GOPbased R-D models as given in Eqs. (5) and (6). That is, we have J(Q, λ) = b Q β 1 + cp i Q 1 + c 2 (m p i)q 2 + (mθ + n) +λ [(1 + r)a Q α 1 + (s r)a Q α 2 R total ]. (8) To optimize Eq. (8), we first take the partial derivatives with respect to Q 1 and Q 2, which yields the following two equations (bβ Q β 1 1 + cp i ) [(1 + r)aα Q α 1 1 ] λ = 0, c 2 (m p i) [(s r)aα Q α 1 2 ] λ = 0, Another equation can be derived from the bit budget constraint R GOP = R total ; namely, (9) R total = (1 + r)a Q α 1 + (s r)a Q α 2. (10) Based on Eqs. (9) and (10), we can compute Q 1 and Q 2 values that optimize the Lagrangian cost function given in Eq. (8). Finally, we can determine the encoder input parameters, QP 1 and QP 2, using the one-to-one correspondence between quantization step size Q and quantization parameter QP. V. EXPERIMENTAL RESULTS Since there is no spatial layer rate control algorithm in the current version of JSVM, the performance of the proposed algorithm is compared with that of the full search (FS) method. By testing all possible combinations of Q 1 and Q 2 for each spatial layer, an input video is iteratively encoded to find the best R-D performance at each target bit rate. The R-D curve obtained via FS provide the R-D performance bound with twolayer encoding. We also consider comparing with encoding results of reference JSVM FixedQPEncoder tool [8] based on the SVC testing conditions JVT-Q205 defined in [9]. The logarithmic search algorithm and cascading QPs method are adopted in JSVM FixedQPEncoder tool to find a proper quantization parameter. The generalized bit rate value with each trial of quantization parameter for the next trial, until the generated bit rate is within the acceptable range (±0.5% in our experiments) or the The proposed bit allocation algorithm was implemented with JSVM 9.6 in our experiment. It was tested to verify the efficiency of our bit allocation algorithm for spatial scalability, with the setting shown in Table I. Layer 0 is the base layer which is encoded without any inter-layer prediction. Layer 1 is a spatial enhancement layer by using the adaptive interlayer prediction from base layer. To compare our scheme with JSVM9.6, the initial setting of QP is the same for both schemes. We have encoded 10 GOPs from three sequences with low to high spatial complexity (Akiyo, Football and Mobile) to verify the performance of the proposed algorithm. The GOP size is set to be 16. The performance of the proposed algorithm is compared with FS and JSVM method shown in Fig. 9. We see little performance degradation of our method with respect to the optimal solution obtained by FS method at various bit rates. The maximum performance degradation observed is about 0.34 db for the Mobile sequence. Significant coding gain, around 1dB, is achieved by the proposed algorithm in comparison with JSVM. For the computational complexity issue, the proposed algorithm only need to encode four times to decide the model parameters, but the FS method and current JSVM take 52 52 and average of 40 iteration times respectively, where the complexity are extremely higher. These experimental results demonstrate the effectiveness and the robustness of the proposed algorithm for video sequences with various spatial characteristics. VI. CONCLUSION We investigated a model-based spatial layer bit allocation scheme for H.264/SVC in this work. The inter-layer dependence of the distortion and the rate function were successfully modeled. Based on these models, we proposed a bit allocation algorithm that achieved a near optimal R-D performance. ACKNOWLEDGMENT This work was supported by National Key Technology R&D Program of China under Grant 2006BAH02A10 and by National High-tech R&D Program (863 Program) of China under Grant 2006AA01Z320, with additional support by the State Scholarship Fund from China Scholarship Council. REFERENCES [1] T. Wiegand, G. J. Sullivan, J. Reichel, H. Schwarz, and M. Wien, Amendment 3 to ITU-T Rec. H.264 (2005) ISO/IEC 14496-10: 2005, Scalable Video Coding, July 2007. [2] H. Schwarz, D. Marpe, and T. Wiegand, Overview of the scalable video coding extension of the H.264/AVC standard, IEEE Transactions on Circuits and Systems for Video Technology, vol. 17, pp. 1103 1120, September 2007. 282

vol. 18, no. 1, pp. 116 121, January 2008. [6] D.-K. Kwon, Y. Cho, and C.-C. J. Kuo, A simplified rate control scheme for non-conversational H.264 video, in International Workshop on Multimedia Signal Processing, October 2007. [7] N. Kamaci, Y. Altinbasak, and R. M. Mersereau, Frame bit allocation for H.264/AVC video coder via Cauchy-density-based rate and distortion models, IEEE Transactions on Circuits and Systems for Video Technology, vol. 15, no. 8, pp. 994 1006, August 2005. [8] Joint Video Team of ITU-T VCEG and ISO/IEC MPEG. Joint Scalable Video Model software 9.6. [Online]. Available: ftp://garcon. ient.rwth-aachen.de [9] T. Wiegand, G. J. Sullivan, H. Schwarz, and M. Wien, Joint draft 10 of SVC amendment, Joint Video Team, JVT-Q205, San Jose, CA, USA, April 2007. (a) Akiyo (b) Football (c) Mobile Fig. 9. The R-D performance of the proposed bit allocation algorithm compared with the FS method and JSVM. [3] K. Ramchandran, A. Ortega, and M. Vetterli, Bit allocation for dependent quantization with applications to multiresolution and MPEG video coders, IEEE Transactions on Image Processing, vol. 3, no. 5, pp. 533 545, September 1994. [4] L.-J. Lin and A. Ortega, Bit-rate control using piecewise approximated rate-distortion characteristics, IEEE Transactions on Circuits and Systems for Video Technology, vol. 8, no. 4, pp. 446 459, August 1998. [5] Y. Liu, Z. Li, and Y. C. Soh, Rate control of H.264/AVC scalable extension, IEEE Transactions on Circuits and Systems for Video Technology, 283