FAST SPATIAL LAYER MODE DECISION BASED ON TEMPORAL LEVELS IN H.264/AVC SCALABLE EXTENSION

Similar documents
Department of Electrical Engineering, IIT Bombay.

Adaptation of Scalable Video Coding to Packet Loss and its Performance Analysis

One-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain

A COST-EFFICIENT RESIDUAL PREDICTION VLSI ARCHITECTURE FOR H.264/AVC SCALABLE EXTENSION

SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC

Unit-level Optimization for SVC Extractor

Complexity Reduced Mode Selection of H.264/AVC Intra Coding

Fast Mode Decision for H.264/AVC Using Mode Prediction

Fast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE Gaurav Hansda

Fast Mode Decision Algorithm for Scalable Video Coding Based on Probabilistic Models

BANDWIDTH-EFFICIENT ENCODER FRAMEWORK FOR H.264/AVC SCALABLE EXTENSION. Yi-Hau Chen, Tzu-Der Chuang, Yu-Jen Chen, and Liang-Gee Chen

CONTENT ADAPTIVE COMPLEXITY REDUCTION SCHEME FOR QUALITY/FIDELITY SCALABLE HEVC

STACK ROBUST FINE GRANULARITY SCALABLE VIDEO CODING

Mode-Dependent Pixel-Based Weighted Intra Prediction for HEVC Scalable Extension

Scalable Video Coding

Bit Allocation for Spatial Scalability in H.264/SVC

EFFICIENT PU MODE DECISION AND MOTION ESTIMATION FOR H.264/AVC TO HEVC TRANSCODER

Optimal Bitstream Adaptation for Scalable Video Based On Two-Dimensional Rate and Quality Models

For layered video encoding, video sequence is encoded into a base layer bitstream and one (or more) enhancement layer bit-stream(s).

Optimum Quantization Parameters for Mode Decision in Scalable Extension of H.264/AVC Video Codec

On the impact of the GOP size in a temporal H.264/AVC-to-SVC transcoder in Baseline and Main Profile

Real-time and smooth scalable video streaming system with bitstream extractor intellectual property implementation

ESTIMATION OF THE UTILITIES OF THE NAL UNITS IN H.264/AVC SCALABLE VIDEO BITSTREAMS. Bin Zhang, Mathias Wien and Jens-Rainer Ohm

Sergio Sanz-Rodríguez, Fernando Díaz-de-María, Mehdi Rezaei Low-complexity VBR controller for spatialcgs and temporal scalable video coding

Deblocking Filter Algorithm with Low Complexity for H.264 Video Coding

Recent, Current and Future Developments in Video Coding

Scalable Extension of HEVC 한종기

Professor, CSE Department, Nirma University, Ahmedabad, India

ARCHITECTURES OF INCORPORATING MPEG-4 AVC INTO THREE-DIMENSIONAL WAVELET VIDEO CODING

High Efficient Intra Coding Algorithm for H.265/HVC

Low-complexity transcoding algorithm from H.264/AVC to SVC using data mining

Optimal Estimation for Error Concealment in Scalable Video Coding

Improving Energy Efficiency of Block-Matching Motion Estimation Using Dynamic Partial Reconfiguration

BANDWIDTH REDUCTION SCHEMES FOR MPEG-2 TO H.264 TRANSCODER DESIGN

Video compression with 1-D directional transforms in H.264/AVC

A NOVEL SCANNING SCHEME FOR DIRECTIONAL SPATIAL PREDICTION OF AVS INTRA CODING

Fast Mode Assignment for Quality Scalable Extension of the High Efficiency Video Coding (HEVC) Standard: A Bayesian Approach

An Efficient Inter-Frame Coding with Intra Skip Decision in H.264/AVC

Scalable Video Coding in H.264/AVC

IBM Research Report. Inter Mode Selection for H.264/AVC Using Time-Efficient Learning-Theoretic Algorithms

Performance Comparison between DWT-based and DCT-based Encoders

A Fast Intra/Inter Mode Decision Algorithm of H.264/AVC for Real-time Applications

Cross-Layer Optimization for Efficient Delivery of Scalable Video over WiMAX Lung-Jen Wang 1, a *, Chiung-Yun Chang 2,b and Jen-Yi Huang 3,c

Video Watermarking Algorithm for H.264 Scalable Video Coding

Motion Vector Coding Algorithm Based on Adaptive Template Matching

Reduced 4x4 Block Intra Prediction Modes using Directional Similarity in H.264/AVC

Testing HEVC model HM on objective and subjective way

Homogeneous Transcoding of HEVC for bit rate reduction

(Invited Paper) /$ IEEE

ISSCC 2006 / SESSION 22 / LOW POWER MULTIMEDIA / 22.1

Advanced Video Coding: The new H.264 video compression standard

A Novel Partial Prediction Algorithm for Fast 4x4 Intra Prediction Mode Decision in H.264/AVC

Lec 10 Video Signal Processing II

Categorization for Fast Intra Prediction Mode Decision in H.264/AVC

Fast Encoding Techniques for Multiview Video Coding

High Efficiency Video Coding (HEVC) test model HM vs. HM- 16.6: objective and subjective performance analysis

Pseudo sequence based 2-D hierarchical coding structure for light-field image compression

Semi-Hierarchical Based Motion Estimation Algorithm for the Dirac Video Encoder

H.264/AVC und MPEG-4 SVC - die nächsten Generationen der Videokompression

Evaluation of H.264/AVC Coding Elements and New Improved Scalability/Adaptation Algorithm/Methods

H.264/AVC Baseline Profile to MPEG-4 Visual Simple Profile Transcoding to Reduce the Spatial Resolution

FAST MOTION ESTIMATION DISCARDING LOW-IMPACT FRACTIONAL BLOCKS. Saverio G. Blasi, Ivan Zupancic and Ebroul Izquierdo

A Novel Deblocking Filter Algorithm In H.264 for Real Time Implementation

An Efficient Intra Prediction Algorithm for H.264/AVC High Profile

Coding for the Network: Scalable and Multiple description coding Marco Cagnazzo

Fast Wavelet-based Macro-block Selection Algorithm for H.264 Video Codec

A LOW-COMPLEXITY AND LOSSLESS REFERENCE FRAME ENCODER ALGORITHM FOR VIDEO CODING

H.264 to MPEG-4 Transcoding Using Block Type Information

A VIDEO TRANSCODING USING SPATIAL RESOLUTION FILTER INTRA FRAME METHOD IN MULTIMEDIA NETWORKS

An Independent Motion and Disparity Vector Prediction Method for Multiview Video Coding

A High Quality/Low Computational Cost Technique for Block Matching Motion Estimation

implementation using GPU architecture is implemented only from the viewpoint of frame level parallel encoding [6]. However, it is obvious that the mot

Conference object, Postprint version This version is available at

2014 Summer School on MPEG/VCEG Video. Video Coding Concept

OVERVIEW OF IEEE 1857 VIDEO CODING STANDARD

STANDARD COMPLIANT FLICKER REDUCTION METHOD WITH PSNR LOSS CONTROL

A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames

EE 5359 MULTIMEDIA PROCESSING SPRING Final Report IMPLEMENTATION AND ANALYSIS OF DIRECTIONAL DISCRETE COSINE TRANSFORM IN H.

ARTICLE IN PRESS. Signal Processing: Image Communication

In the name of Allah. the compassionate, the merciful

Decoding-Assisted Inter Prediction for HEVC

Digital Video Processing

Reduced Frame Quantization in Video Coding

Fast Implementation of VC-1 with Modified Motion Estimation and Adaptive Block Transform

Advances of MPEG Scalable Video Coding Standard

Rate Distortion Optimization in Video Compression

ERROR-ROBUST INTER/INTRA MACROBLOCK MODE SELECTION USING ISOLATED REGIONS

ECE 634: Digital Video Systems Scalable coding: 3/23/17

A COMPARISON OF CABAC THROUGHPUT FOR HEVC/H.265 VS. AVC/H.264. Massachusetts Institute of Technology Texas Instruments

Fast frame memory access method for H.264/AVC

MCTF and Scalability Extension of H.264/AVC and its Application to Video Transmission, Storage, and Surveillance

Error Concealment Used for P-Frame on Video Stream over the Internet

Area Efficient SAD Architecture for Block Based Video Compression Standards

International Journal of Emerging Technology and Advanced Engineering Website: (ISSN , Volume 2, Issue 4, April 2012)

Smoooth Streaming over wireless Networks Sreya Chakraborty Final Report EE-5359 under the guidance of Dr. K.R.Rao

Fraunhofer Institute for Telecommunications - Heinrich Hertz Institute (HHI)

VIDEO COMPRESSION STANDARDS

Laboratoire d'informatique, de Robotique et de Microélectronique de Montpellier Montpellier Cedex 5 France

Implementation and analysis of Directional DCT in H.264

HEVC. Complexity Reduction Algorithm for Quality Scalability in Scalable. 1. Introduction. Abstract

Transcription:

FAST SPATIAL LAYER MODE DECISION BASED ON TEMPORAL LEVELS IN H.264/AVC SCALABLE EXTENSION Yen-Chieh Wang( 王彥傑 ), Zong-Yi Chen( 陳宗毅 ), Pao-Chi Chang( 張寶基 ) Dept. of Communication Engineering, National Central University E-mail:{yjwang, zychen, pcchang}@vaplab.ce.ncu.edu.tw ABSTRACT Scalable Video Coding (SVC) is getting popular but exhibits a problem of high computational complexity compared with H.264/AVC single layer coding. A fast mode decision algorithm that reduces the candidate modes in motion estimation can certainly reduce its computation time. We propose an adaptive temporal level mode decision algorithm in spatial scalable video coding. In spatial scalability, we utilize the spatial candidate modes table constructed based on the statistics of real data to reduce the candidate modes. In temporal scalability, we apply the base layer mode information directly at low temporal levels. The simulation results show that the proposed algorithm can reduce computation time up to 78.28% with less than 0.14 db video quality degradation compared with 9.12. 1. INTRODUCTION Multimedia applications on various devices and heterogeneous networks have been becoming popular. However, these applications usually exist different requirements, such as bandwidth constraints, CPU processing capabilities, and display resolutions, to be satisfied simultaneously. Scalable coding is one of good solutions. SVC is finalized as an extension of H.264/AVC video coding standard in 2007 [1]. A coded SVC bit-stream is composed of a base layer (BL) and several enhancement layers (ELs)[1]. The BL contains a lower resolution or quality version of each coded frame which can be adapted to some resource constrained devices, such as portable phones, PDAs, and laptops. The ELs provide a higher quality service such as various frame rates and display resolutions. The function of variable block size in H.264/AVC increases the candidate modes during motion estimation (ME) process and results in better rate-distortion (R-D) performance. However, it also increases large amounts of computation complexity accordingly. Similar to H.264/AVC, SVC calculates the R-D cost of each candidate mode and selects the one with minimum R-D cost as the best mode. Additionally, inter-layer prediction in SVC increases the prediction times of all modes. And the inherent hierarchical B Picture (HBP) structure executes ME in forward, backward, and biprediction directions. Therefore, the computation complexity of SVC is much higher than H.264/AVC. Consequently, a method which can reduce the complexity with just a little R-D loss is desirable. Recently, many kinds of fast mode decision schemes have been developed for H.264/AVC. Even though these algorithms are efficient in reducing the computational complexity with negligible quality degradation in H.264/AVC, but inter-layer information is not utilized when they apply to SVC. The spatial and temporal scalabilities in SVC are widely used in scalable coding, but there are few works which focus on the fast algorithm of these two scalabilities both. Li proposed a fast mode decision algorithm for spatial scalability in SVC [3]. In this scheme, they used the mode distribution relationship between base layer and enhancement layers. In [4], a fast mode decision algorithm had been proposed for inter-frame coding supporting spatial scalability by Li. In [5], Lim proposed a fast algorithm which use the mode history map (MHM) and motion vector predictor (MVP) to reduce and modify the candidate modes. In [6], Lin focused on the CGS and temporal scalabilities, and a fast algorithm which utilizes the features between CGS layers to reduce the computation complexity was proposed. In [7], Kim proposed a fast algorithm which uses the neighboring macroblock (MB) modes to reduce the candidate modes of current MB. In this work, we develop a fast algorithm which uses spatial features in different temporal levels. The proposed algorithm can save the encoding time significantly while maintaining the R-D performance both in spatial and temporal scalabilities. The rest of the paper is organized as follows. Section 2 introduces the inter-layer prediction, complexity analysis, and the observations of spatial scalability in different temporal levels. In Section 3, we describe the proposed fast mode decision algorithm. Section 4 shows the experimental results. And we conclude this work in Section 5.

2. COMPLEXITY ANALYSIS AND THE OBSERVATIONS IN SPATIAL SCALABILITY 2.1. Inter-layer Prediction There are three inter-layer predictions in spatial scalability [1]: Inter-layer motion prediction, Inter-layer residual prediction, and Inter-layer intra prediction. In residual prediction of 9.12, the current encoding MB in EL subtracts the corresponding reconstructed upsampled BL MB residual and executes inter prediction to get smaller EL residual. In inter-layer motion prediction, the current MB uses the upsampled BL motion vector (MV) to be the MVP for prediction. In inter-layer intra prediction, the reconstructed intra signal of corresponding MB in BL is upsampled to be the prediction signal of EL. In this work, we focus on inter-layer motion prediction. In section 2.2, we analyzed the complexity of inter-layer prediction. In the dyadic spatial scalability with the same quantization parameter (QP) in all layers, the modes size between BL and ELs should also have high probability to be dyadic. For example, when the base layer is QCIF and enhancement layer is CIF resolution, then the best mode in the EL will be 16 16 if the BL mode type is 8 8. Similarly, the best mode in EL will be 8 8 if the BL mode type is 4 4. Even though QP values are different in neighboring layers, we observed that this correlation is still very high. 2.2. Complexity Analysis In inter-layer motion prediction, there exist two extreme methods can be used. The first is ILP_1, which copies the upsampled MVs and modes of corresponding MB of BL to be the MVs and modes of EL. The second one is ILP_2, which uses upsampled MVs of corresponding MB of BL to be the predictor and then search all modes. Fig.1 and Fig.2 show the coding complexity and the rate-distortion (R-D) comparison of these two methods. Because of inter-layer prediction, all modes in EL would be tested more times than BL during ME. In Fig. 1, we can find encoding with ILP_2 increases much more computation complexity than with ILP_1 and ME process in EL occupies almost 90% of total encoding time. However, Fig. 2 shows that the R-D performance improvement encoding with ILP_2 can not be ignored. Hence, encoding with ILP_2 is necessary in general and the speed-up of EL ME is required. In this paper, we concentrate on the fast mode decision in EL. The computation time can be decreased drastically by reducing the candidate modes. If the modes of EL and BL have strong relationship, we should be able to find methods to achieve the following objectives: 1) Reducing the computation time of ME 2) Performance will be very close to the result of ILP_2 PSNR 48 43 38 33 28 23 18 (a) Inter-layer prediction with ILP_1 (b) Inter-layer prediction with ILP_2 Fig. 1: The complexity of ILP_1 and ILP_2 0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 bit-rate (kbps) Fig. 2: R-D performance of ILP_1 and ILP_2 2.3. Correlations between Base and Enhancement Layers In this section, we analyze the correlations between the spatial layers in different temporal levels. The statistical analyses are conducted based on the coding of one base layer and one enhancement layer with QCIF and CIF resolution respectively. For simplicity, the QPs in two layers are set to the same values, GOP size is set to 8 which indicates that four temporal layers are provided, and full search is employed with search range 32. 2.3.1 Spatial scalability Because of the dyadic resolutions, the sampling rate in EL is two times to BL in each direction. The MB mode in EL is strongly related to that in BL. Table 1 shows the conditional probability of MB modes in spatial layers when BL is coded with different block types. Table 1 indicates that under the specific MB mode in BL, the corresponding probabilities of each coded mode in EL. The highlight indicates the highest probability for each case. From Table 1, we can observe the MB mode size in EL tends to be larger than the corresponding MB mode size in BL, i.e. if the best mode is Block_8x8 in BL, the best mode in EL almost ranges from _SKIP to Block_8x8. And when MB ILP_1 ILP_2

Enhancement Layer MB _SKIP _16x16 Table 1: s correlation in BL and EL _16x8 MB in Base Layer Average, QP_BL = QP_EL = 25, GOP = 8 _8x16 _8x8 _8x4 _4x8 _4x4 _skip Intra _4x4 Intra _16x16 _SKIP 0.52 0.26 0.15 0.16 0.08 0.02 0.02 0 0.27 0 0.06 _16x16 0.4 0.54 0.45 0.46 0.42 0.07 0.09 0.04 0.42 0.03 0.03 _16x8 0.03 0.08 0.17 0.1 0.15 0.44 0.05 0.07 0.09 0.02 0.03 _8x16 0.03 0.06 0.1 0.13 0.14 0.06 0.41 0.08 0.07 0.04 0 _8x8 0.01 0.02 0.05 0.06 0.09 0.19 0.19 0.45 0.05 0 0 _8x4 0 0.01 0.02 0.02 0.03 0.08 0.06 0.12 0.02 0 0 _4x8 0 0.01 0.01 0.02 0.03 0.05 0.08 0.11 0.02 0 0 _4x4 0 0 0.01 0.01 0.01 0.02 0.03 0.05 0.01 0 0 _skip 0.01 0.02 0.04 0.04 0.05 0.07 0.07 0.09 0.05 0 0 Intra_4x4 0 0 0 0 0 0 0 0 0 0.03 0.03 Intra_BL 0 0 0 0 0 0 0 0 0 0.87 0.86 Intra_16x16 0 0 0 0 0 0 0 0 0 0 0 Fig. 3: Hierarchical B Picture structure and temporal distance of the reference frames 3. THE PROPOSED FAST MODE DECISION ALGORITHM By the analysis in Section 2, we propose a combined spatial and temporal fast mode decision algorithm. Fig. 4 shows the flow chart of the proposed algorithm, and Table 2 is the spatial candidate modes table. in BL is coded as _16x8/_8x16, the best mode in EL is unlikely to be _8x16/ _16x8. When BL MB is coded as Intra mode, EL MB has higher probability to be coded as Intra mode. 2.3.2 Temporal scalability Fig. 3 shows the HBP structure, T 0 and T 3 denote the lowest and the highest temporal level (TL) in this structure. The temporal distance of reference frames increase with the decrement of TLs, i.e. the distance between reference frames and the current frames in T 1 is four, and that is one in T 3. Compared with higher TLs, the lower TLs have stronger spatial layer dependency than temporal layer dependency. Thus we can utilize the spatial dependency directly in low TLs. That is to say we can save the computation complexity by separating temporal levels into low and high parts. However, according to the encoding order, the distortion of low TLs will propagate to higher TLs, so TL1 is more important than TL2 and TL3 by considering the ME references. If we want to speed up the computation time in low TLs, we must reduce the distortion of TL1 to maintain the overall performance by any means. Inter / Inter-layer Prediction Start BL is INTRA? No Is High Temporal Level? No Copy Un-upsampled Base Layer YES YES Inter Decision by Table 2 MV Search RDO for : Intra 4x4 Intra 16x16 Intra BL End Fig. 4: Flow chart of the proposed algorithm

Table 2: Spatial candidate modes table EL MB Base Layer MB MODE_SKIP MODE_16x16 MODE_16x8 MODE_8x16 BLK_8x8 BLK_8x4 BLK_4x8 BLK_4x4 BLK_SKIP MODE_SKIP MODE_16x16 MODE_16x8 MODE_8x16 BLK_8x8 BLK_8x4 BLK_4x8 BLK_4x4 BLK_SKIP In Table 2, based on the collocated MB mode in BL, EL MB only executes ME with the modes labeled by star. In our proposed algorithm, when the corresponding BL MB mode is intra coded, the candidates in EL are reduced to only three intra modes. Otherwise, we separate the TLs into low and high first. If the current encoding frames belong to low TL, we copy the BL mode as the mode in EL directly. If the TL is high, the spatial candidate modes table is used to reduce the candidate modes. By the observation in Section 2.3.2, in order to improve the video quality of low TLs in our proposed algorithm, we copy the un-upsampled mode of BL mode in EL directly. Table 3 shows the R-D performance of using upsampled and un-upsampled mode respectively in low TLs without using spatial candidate modes table and intra modes reduction. The distortion of TL1 using un-upsampled mode is less than using upsampled mode. Therefore, we use the unupsampled mode of BL mode directly to be the EL mode. By experiments, we find that the performance is better when we regard the temporal distance farther than three frames as low TLs. Table 3: Performance comparison of BL mode copy (Sequence: BUS) QP PSNR (db) UP Bit-rate (Kbps) Time (sec) PSNR (db) Un-up Bit-rate (Kbps) 4. EXPERIMENTAL RESULTS Time (sec) 15 41.839 2431.678 2571.8 41.842 2426.386 2643.56 25 34.705 817.526 1916.4 34.715 815.046 1899.3 35 28.131 244.582 1578.5 28.140 244.485 1576.53 For the experiments, the proposed algorithm is implemented with 9.12 and tested using five video sequences in two scenarios. With two spatial Table 4: Test Environments Scenario #1 Scenario #2 Layer BL EL BL EL QP 15, 20, 25, 30, 35, 40 15, 20, 25, 30, 35, 40 Resolution QCIF CIF CIF 4CIF Frame rate 15 15 30 30 Frames 150 300 Sequence BUS, MOBILE, SOCCER CITY, CREW, SOCCER 9.12 GOP : 8, 16 Full Search with Search Range = 32 Reference Frame : 1 layers, the HBP structure is employed with GOP size equaling to 8 and 16. More encoder configurations are stated in Table 4. In Table 5, the proposed scheme is compared with 9.12 in terms of d_psnr, d_bitrate, TS, and TS e, which are defined as follows. d _ PSNR = PSNRproposed PSNR bitrateproposed bitrate d _ Bitrate = 100% bitrate run _ timeproposed run _ time TS = 100% run _ time EL _ ME _ run _ timeproposed EL _ ME _ run _ time TS e = 100% EL _ ME _ run _ time In Table 5, only the results of three QPs and GOP = 8 are shown because of the limited space. The proposed scheme averagely provides 71% and 75% overall time saving with the 80% and 85% EL ME time saving in scenario one and two respectively. Moreover, the average bit-rate increase is no more than 1.26% and 0.26% with negligible PSNR degradation in scenario one and two. The results of proposed algorithm compared with ILP_1 are shown in Table 6. With the

significant improvement in R-D performance, the encoding time of our proposed algorithm is close to ILP_1. And the results with GOP = 16 and other QPs have similar performance both in rate-distortion and time saving mentioned above. In Fig. 5, we compare the total encoding time of three methods: 1) 9.12 2) the scheme use the spatial candidate modes table only 3) the proposed algorithm Obviously, the saving time does not vary acutely with the change of QP, and the executing time of method 2 is close to the time of method 3 in higher QP. There are two reasons for this situation. First, the spatial base layer is a downsampled version of the original picture and lots of residual signals tend to be quantized to zero under high QP. The detail in the picture, which is the high frequency part, is filtered because of the downsampling process and the residual signal has no significant difference between using large and small block sizes, which implies that the bitrate plays a more important role than distortion does in the mode decision cost function. Consequently the best MB mode in BL is likely to be encoded with larger MB modes. Second, the spatial candidate mode in EL is just the un-upsampled mode in low TL in the proposed algorithm. Therefore, the encoding time of method 2 and 3 becomes closer in higher QP. 5. CONCLUSIONS In this paper, we propose an adaptive temporal level mode decision algorithm in spatial scalable video coding. In spatial scalability, the spatial candidate modes table is used to reduce the candidate modes. In temporal scalability, we separate TLs into low and high to decide whether the mode of BL to be copied directly or not. The proposed algorithm can save up to 78.28% computation complexity while maintaining very good rate-distortion performance at the same time. ACKNOWLEDGEMENT This work was supported in part by the NSC, Taiwan under Grant 96-2221-E-008-016-MY2. Table 5: Performance comparisons with GOP size is 8 GOP = 8 SCENARIO #1 SCENARIO #2 Sequence QP d_psnr(db) d_bitrate(%) TS(%) TS e (%) Sequence QP d_psnr(db) d_bitrate(%) TS(%) TS e (%) BUS MOBILE SOCCER 20-0.04 2.11-66.42-73.55 20-0.02 0.69-75.11-85.55 30-0.1 2.22-72.07-82.2 CITY 30-0.04 0.45-77.01-88.5 40-0.14 0.49-76.09-87.82 40-0.03-0.22-77.97-89.19 20-0.05 1.67-72.13-78.92 20-0.04 0.52-72.9-81.23 30-0.1 1.69-75.32-85.55 CREW 30-0.06 0.08-76.23-87.19 40-0.1-0.05-77.32-88.71 40-0.04-0.14-78.28-89.09 20-0.08 1.44-65.34-73.53 20-0.04 0.63-71.75-81.14 30-0.09 0.78-72.4-82.72 SOCCER 30-0.06 0.36-75.58-86.59 40-0.1-0.49-76.72-87.82 40-0.05-0.27-77.51-88.71 sec 7000 6000 5000 4000 3000 Spatial Proposed sec 45000 40000 35000 30000 25000 20000 Spatial Proposed 2000 1000 0 15 20 25 30 35 40 QP 15000 10000 5000 0 15 20 25 30 35 40 QP (a) MOBILE in Scenario #1 (b) SOCCER in Scenario #2 Fig. 5: The encoding time histogram of three methods

Table 6: Performance comparison of ILP_1 and the proposed algorithm ILP_1 Proposed Algorithm SCENARIO #1 Sequence QP d_psnr(db) d_bitrate(%) TS(%) Sequence QP d_psnr(db) d_bitrate(%) TS(%) 15-0.27 12.71-92 15-0.02 1.76-65.54 BUS 25-0.75 15.1-89.42 BUS 25-0.07 2.13-68.87 35-1.21 8.82-87.26 35-0.14 1.61-74.47 15-0.21 10.34-93.29 15-0.03 1.44-71.18 MOBILE 25-0.59 13.72-89.63 MOBILE 25-0.07 1.73-73.22 35-1.43 6.06-88.74 35-0.13 1.08-76.7 15-0.56 13.68-90.42 15-0.05 1.56-62.88 SOCCER 25-1.04 7.17-88.64 SOCCER 25-0.09 1.19-69.05 35-0.83-1.39-87.86 35-0.1 0.36-75.04 SCENARIO #2 Sequence QP d_psnr(db) d_bitrate(%) TS(%) Sequence QP d_psnr(db) d_bitrate(%) TS(%) 15-0.15 8.52-90.2 15-0.03 0.34-74.37 CITY 25-0.66 10.43-87.82 CITY 25-0.03 0.68-76.29 35-1.29 1.7-87.37 35-0.04-0.05-77.44 15-0.25 3.9-91.67 15-0.04 0.43-70.66 CREW 25-0.43 2.59-88.85 CREW 25-0.05 0.37-75.01 35-0.77-3.75-87.77 35-0.05-0.16-77.21 15-0.23 6.68-90.08 15-0.03 0.4-70.29 SOCCER 25-0.63 4.21-88.11 SOCCER 25-0.05 0.57-73.84 35-0.78-3.54-87.62 35-0.06 0.05-76.73 REFERENCES [1] H. Schwarz, D. Marpe, and T. Wiegand, Overview of the Scalable Video Coding Extension of the H.264/AVC Standard, IEEE Transactions on Circuits and Systems for Video Technology, vol. 17, no. 9, pp. 1103-1120, Sept. 2007. [2] A. Segall and G. J. Sullivan Spatial Scalability Within the H.264/AVC Scalable Video Coding Extension, IEEE Transactions on Circuits and Systems for Video Technology, vol. 17, no. 9, pp. 1121 1135, Sep. 2007. [3] H. Li, Z. G. Li, and C. Wen, Fast Decision Algorithm for Inter-Frame Coding in Fully Scalable Video Coding, IEEE Transactions on Circuits and Systems for Video Technology, vol. 16, No. 7, pp. 889-895, July 2006. [6] H.C. Lin, W.H. Peng, H.M. Hang, and W.J. Ho, Layeradaptive mode decision and motion search for scalable video coding with the combination of coarse grain scalability(cgs) and temporal Scalability, IEEE International Conference on Image Processing, pp. 289-292, Sept. 2007. [7] S.T. Kim, Krishna Reddy Konda and C.-S. Cho, Fast Decision Algorithm for Spatial and SNR and Scalable Video Coding, IEEE International Symposium on Circuits and Systems (ISCAS 2009), pp. 872 875, May 2009. [4] H. Li, Z. G. Li, C. Wen, and L. P. Chau, Fast mode decision for spatial scalable video coding, IEEE International Symposium on Circuits and Systems, pp. 3005-3008, May 2006. [5] S. Lim, J. Yang, and B. Jeon, Fast coding mode decision for scalable video coding, International Conference on Advanced Communication Technology (ICACT), vol.3, pp.1897-1900, Feb. 2008.