ERROR-ROBUST INTER/INTRA MACROBLOCK MODE SELECTION USING ISOLATED REGIONS

Similar documents
Title Adaptive Lagrange Multiplier for Low Bit Rates in H.264.

Fast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE Gaurav Hansda

Unit-level Optimization for SVC Extractor

Optimal Estimation for Error Concealment in Scalable Video Coding

Fast Mode Decision for H.264/AVC Using Mode Prediction

Deblocking Filter Algorithm with Low Complexity for H.264 Video Coding

Reduced Frame Quantization in Video Coding

Optimum Quantization Parameters for Mode Decision in Scalable Extension of H.264/AVC Video Codec

NEW CAVLC ENCODING ALGORITHM FOR LOSSLESS INTRA CODING IN H.264/AVC. Jin Heo, Seung-Hwan Kim, and Yo-Sung Ho

Upcoming Video Standards. Madhukar Budagavi, Ph.D. DSPS R&D Center, Dallas Texas Instruments Inc.

Investigation of the GoP Structure for H.26L Video Streams

EE Low Complexity H.264 encoder for mobile applications

An Efficient Mode Selection Algorithm for H.264

Video Coding Using Spatially Varying Transform

Complexity Reduced Mode Selection of H.264/AVC Intra Coding

A Fast Intra/Inter Mode Decision Algorithm of H.264/AVC for Real-time Applications

LIST OF TABLES. Table 5.1 Specification of mapping of idx to cij for zig-zag scan 46. Table 5.2 Macroblock types 46

MULTI-BUFFER BASED CONGESTION CONTROL FOR MULTICAST STREAMING OF SCALABLE VIDEO

Joint redundant motion vector and intra macroblock refreshment for video transmission

Quality Scalable Low Delay Video Coding using Leaky Base Layer Prediction

ADAPTIVE JOINT H.263-CHANNEL CODING FOR MEMORYLESS BINARY CHANNELS

Scalable video coding with robust mode selection

Fast frame memory access method for H.264/AVC

Advanced Video Coding: The new H.264 video compression standard

Performance Comparison between DWT-based and DCT-based Encoders

Improving the quality of H.264 video transmission using the Intra-Frame FEC over IEEE e networks

A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames

Week 14. Video Compression. Ref: Fundamentals of Multimedia

Rate-Distortion Optimized Layered Coding with Unequal Error Protection for Robust Internet Video

10.2 Video Compression with Motion Compensation 10.4 H H.263

Video Coding Standards: H.261, H.263 and H.26L

ADAPTIVE PICTURE SLICING FOR DISTORTION-BASED CLASSIFICATION OF VIDEO PACKETS

H.264 to MPEG-4 Transcoding Using Block Type Information

Adaptation of Scalable Video Coding to Packet Loss and its Performance Analysis

Motion Estimation. Original. enhancement layers. Motion Compensation. Baselayer. Scan-Specific Entropy Coding. Prediction Error.

Standard-Compliant Enhancements of JVT Coded Video for Transmission over Fixed and Wireless IP

Video Coding Standards. Yao Wang Polytechnic University, Brooklyn, NY11201 http: //eeweb.poly.edu/~yao

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 19, NO. 9, SEPTEMBER

Region-Based Rate-Control for H.264/AVC. for Low Bit-Rate Applications

Chapter 10. Basic Video Compression Techniques Introduction to Video Compression 10.2 Video Compression with Motion Compensation

IST MPEG-4 Video Compliant Framework

FAST MOTION ESTIMATION WITH DUAL SEARCH WINDOW FOR STEREO 3D VIDEO ENCODING

An Improved H.26L Coder Using Lagrangian Coder Control. Summary

STACK ROBUST FINE GRANULARITY SCALABLE VIDEO CODING

ARCHITECTURES OF INCORPORATING MPEG-4 AVC INTO THREE-DIMENSIONAL WAVELET VIDEO CODING

One-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain

The Performance of MANET Routing Protocols for Scalable Video Communication

Multiple Description Coding for Video Using Motion Compensated Prediction *

Pattern based Residual Coding for H.264 Encoder *

Motion Vector Coding Algorithm Based on Adaptive Template Matching

A COST-EFFICIENT RESIDUAL PREDICTION VLSI ARCHITECTURE FOR H.264/AVC SCALABLE EXTENSION

Next-Generation 3D Formats with Depth Map Support

Efficient MPEG-2 to H.264/AVC Intra Transcoding in Transform-domain

Welcome Back to Fundamentals of Multimedia (MR412) Fall, 2012 Chapter 10 ZHU Yongxin, Winson

Laboratoire d'informatique, de Robotique et de Microélectronique de Montpellier Montpellier Cedex 5 France

VIDEO COMPRESSION STANDARDS

Video Redundancy Coding in H.263+ Stephan Wenger Technische Universität Berlin

Digital Video Processing

High Efficiency Video Decoding on Multicore Processor

Low complexity H.264 list decoder for enhanced quality real-time video over IP

An Implementation of Multiple Region-Of-Interest Models in H.264/AVC

Scalable Video Coding

Unbalanced Multiple Description Video Communication Using Path Diversity

BLOCK MATCHING-BASED MOTION COMPENSATION WITH ARBITRARY ACCURACY USING ADAPTIVE INTERPOLATION FILTERS

OVERVIEW OF IEEE 1857 VIDEO CODING STANDARD

40 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 15, NO. 1, JANUARY 2006

H.264/AVC Video Watermarking Algorithm Against Recoding

RTP Payload Format for SVC Video draft-ietf-avt-rtp-svc-13

International Journal of Emerging Technology and Advanced Engineering Website: (ISSN , Volume 2, Issue 4, April 2012)

An Efficient Table Prediction Scheme for CAVLC

THE H.264 ADVANCED VIDEO COMPRESSION STANDARD

CONTENT ADAPTIVE COMPLEXITY REDUCTION SCHEME FOR QUALITY/FIDELITY SCALABLE HEVC

Performance Analysis of DIRAC PRO with H.264 Intra frame coding

ISO/IEC JTC1/SC29/WG11 N4065 March 2001, Singapore. Deadline for formal registration Deadline for submission of coded test material.

Video Compression Standards (II) A/Prof. Jian Zhang

First-order distortion estimation for efficient video streaming at moderate to high packet loss rates

The Scope of Picture and Video Coding Standardization

Error Concealment Used for P-Frame on Video Stream over the Internet

Smoooth Streaming over wireless Networks Sreya Chakraborty Final Report EE-5359 under the guidance of Dr. K.R.Rao

Chapter 11.3 MPEG-2. MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications:

VHDL Implementation of H.264 Video Coding Standard

Advanced Encoding Features of the Sencore TXS Transcoder

Block-based Watermarking Using Random Position Key

A Novel Deblocking Filter Algorithm In H.264 for Real Time Implementation

White paper: Video Coding A Timeline

Comparative and performance analysis of HEVC and H.264 Intra frame coding and JPEG2000

over the Internet Tihao Chiang { Ya-Qin Zhang k enormous interests from both industry and academia.

Digital Image Stabilization and Its Integration with Video Encoder

Xiaoqing Zhu, Sangeun Han and Bernd Girod Information Systems Laboratory Department of Electrical Engineering Stanford University

MPEG-2. ISO/IEC (or ITU-T H.262)

Optimizing the Deblocking Algorithm for. H.264 Decoder Implementation

University of Erlangen-Nuremberg. Cauerstrasse 7/NT, D Erlangen, Germany. ffaerber stuhl

MCTF and Scalability Extension of H.264/AVC and its Application to Video Transmission, Storage, and Surveillance

A Hybrid Temporal-SNR Fine-Granular Scalability for Internet Video

RECOMMENDATION ITU-R BT.1720 *

streams cannot be decoded properly, which would severely affect the quality of reconstructed video. Therefore, error resilience is utilized to solve

Video Codecs. National Chiao Tung University Chun-Jen Tsai 1/5/2015

Comparison of Shaping and Buffering for Video Transmission

On the Adoption of Multiview Video Coding in Wireless Multimedia Sensor Networks

Error Control Techniques for Interactive Low-bit Rate Video Transmission over the Internet.

Transcription:

ERROR-ROBUST INTER/INTRA MACROBLOCK MODE SELECTION USING ISOLATED REGIONS Ye-Kui Wang 1, Miska M. Hannuksela 2 and Moncef Gabbouj 3 1 Tampere International Center for Signal Processing (TICSP), Tampere, Finland 2 Nokia Mobile Software, Tampere, Finland 3 Tampere University of Technology, Finland ABSTRACT The technique isolated regions, which is based on constraining inter and in-picture prediction jointly, was adopted into the emerging video coding standard H.4 (also know as MPEG-4 part 10 or AVC), as a tool to provide gradual random access and error resilience among other things. This paper addresses the error resiliency performance of isolated regions in the aspect of error-robust inter/intra macroblock mode selection. In addition to improved error resilience, random accessibility from non-intra pictures is provided as a side product by using isolated regions. It is shown by simulations that coding of isolated regions has competitive error resilience to loss-aware rate-distortion optimized macroblock mode selection algorithm () with low complexity, and has consistently better performance than the latter when employed simultaneously with. 1. INTRODUCTION Video encoders can help decoders to stop spatio-temporal propagation of transmission errors by coding macroblocks (MBs) in intra mode. Various types of MB mode selection algorithms exist. The algorithms can be categorized into non-adaptive and adaptive classes, and adaptive methods can be further classified to cost-function-based and ratedistortion optimized ones. The family of non-adaptive algorithms includes the circular intra refresh algorithm that scans the picture area in a pre-defined order and codes a certain number of intra MBs per picture in the pre-defined scan order. Another example of a non-adaptive algorithm is to intra code a certain number of MBs at randomly selected MB locations. Adaptive MB mode decision methods select the intra-coded MB locations so that the content of the pictures is taken into account. For example, a static background area needs not be refreshed in intra mode as often as moving objects. Cost-function-based methods, such as [1] and [2], calculate a cost for each MB with a certain function that may take into account the amount of prediction error data after motion compensation, for example. A certain number of MBs having the highest cost are coded in intra mode. Rate-distortion optimized MB mode selection algorithms use a Lagrangian cost function that linearly combines terms rate and distortion. The mode of each MB is selected so that the combined cost is minimized. An estimate of the expected distortion caused by transmission errors and losses may be taken into account in the cost function. A number of distortion estimating algorithms have been proposed and one of them, herein referred to as the loss-aware rate-distortion-optimized MB mode selection algorithm (), has been selected into H.4 reference implementation [3]. The method has significantly better performance than non-adaptive and cost-function-based algorithms. However, its computational complexity is typically multifold, which limits the use in practical implementations. This paper proposes to use the isolated regions coding technique [4] for error robust MB mode selection. Section 2 presents the fundamentals of the isolated regions technique. Coding of isolated regions for MB mode decision is described in Section 3. Simulation results with comparisons are provided in Section 4 and finally Section 5 concludes the paper.

2. FUNDAMENTALS OF ISOLATED REGIONS The isolated regions technique is based on constraining inter and in-picture prediction jointly. In-picture prediction includes intra prediction of samples and motion vector prediction within a certain picture. Inter prediction is means motion compensated prediction from previously coded pictures. An isolated region is a solid area of MBs, defining the shape of the border across which in-picture prediction is limited. In addition, when coding an isolated region, temporal prediction outside its corresponding isolated region in a reference frame should be disallowed. The corresponding isolated region of an isolated region refers to that in a previous picture from which it is evolved. A picture can contain zero or more isolated regions that do not overlap. The area of the picture that is not covered by any isolated region forms the leftover region. There is no prediction limit for the leftover region. An isolated region may start from any picture and evolve to following pictures. Generally, an isolated region ends after it evolves to be the entire picture. However, it may also ends even when it does not cover the entire picture area. The period from the picture where an isolated region starts to the picture where it ends is defined as the period of the isolated region. There maybe unlimited amount of evolving patterns for isolated regions, some of them were described in [4]. One evolving pattern, which is called clockwise box-out, was applied in the simulations for this paper. The first MB to be allocated in the isolated region is in the center of the picture, and the growth order from the first MB is defined as shown in Fig.1. The growth rate in terms of MBs per picture may be either constant or variable. 9 2 3 4 1 0 5 8 7 6 Fig.1. Growth order of box-out clockwise evolution. The isolated regions technique provides an elegant solution for many applications, such as gradual random access, error resiliency and recovery, region-of-interest coding and prioritization, picture in picture functionality, and coding of masked video scene transitions [4-7]. With gradual random access based on isolated regions, media channel switching for receivers, bit-stream switching for the server, and allowing newcomers for multicast streaming will be as easy as conventional intra picture coding with smooth bit-rate. 3. ISOLATED REGIONS IN MACROBLOCK MODE SELECTION The method to code isolated regions for robust MB mode selection is as follows. An isolated region starts from a picture and evolves non-decreasingly in size. If the background is static hence does not need to be intra updated, the isolated region may end before it evolves to cover the entire picture. Another isolated region repeats in the similar way after the previous one ends. The MBs in the isolated region of the first picture in the isolated region period are intra coded. The MBs in the isolated region in each of the later pictures may be either intra coded or inter coded with

inter prediction from the corresponding isolated regions within the same isolated region period. The above method can be used as either a non-adaptive or an adaptive MB mode selection algorithm. If the region growth rate is constant and all the MBs newly added into the isolated region in a picture are intra coded, the method is used as a non-adaptive algorithm. Otherwise, the algorithm falls into the category of adaptive MB mode selection algorithms. The encoder can select a proper constant growth rate of the isolated region according to the picture size and the assumed transmission error rate. Generally, a good growth rate is equivalent to the expected loss rate of MBs. For example, for a CIF sequence that has 396 MBs per picture, if the packet loss rate is 20%, a growth rate of about 80 MBs per picture is appropriate. However, due to the possible large differences in sequence characteristics and different coding options, a content-adaptive growth rate may perform better and is under investigation. 4. SIMULATIONS Four algorithms were compared: conventional circular intra refresh at a rate of one MB row per picture (), LA- RDO, isolated regions based method (), and a combination of and. Real-time multicast/broadcast to users with different network conditions was assumed. Therefore, the coding options were selected to have best error resiliency performance. 20 % was assumed to be the largest packet loss rate. The coded sequences were decoded after packet loss simulation under packet loss rates 0, 3, 5, 10 and 20%. In the simulations, isolated regions are coded with constant growth rate according to the clockwise box-out evolving pattern. The algorithms were implemented based on JM-1.4 [8]. Other simulation conditions are specified as follows. Bitrate and PSNR Calculation: According to the error resilient testing conditions specified in [9], coding parameters such as quantization parameter were chosen to make the resulting bitrate as close as possible to the channel bitrate, taking into account the 40 bytes of IP/UDP/RTP headers per packet. PSNR values were calculated using each and every frame in the source sequence including skipped and lost frames. To reduce the effect imposed on the overall result by the first frame (the first encoded frame have a larger average size than the average size of the entire sequence), bitrate and average PSNR value were calculated from the sixth coded frame. This method allows coding short sequences with fair results. 300-400 frames of each designated sequence were used, to ensure that at least 100 frames are coded and at least 300 frames of a sequence are used. Packet Loss Simulation: We assumed that the packet containing the parameter set is conveyed reliably (possibly out-of-band during the session setup), and therefore no error pattern was read from the error pattern file for it. At least one packet of the first frame should be received to avoid decoder crash. For that purpose, the first packet of the first frame was always received regardless of the corresponding error pattern. Multiple Decoding Runs: The coded bitstream was decoded multiple times, and each time is called a decoding run. The beginning loss position of the run with order n+1 continuously follows the ending loss position of the nth run. The number of decoding runs was selected so that there are totally at least 8000 packets. The overall average PSNR was obtained by averaging the average PSNR values of all decoding runs. Six coded sequences for each intra refresh algorithm were generated: Hall Monitor QCIF at 32 kbits/s, Foreman QCIF at 64 kbits/s, Foreman QCIF at 144 kbits/s, Paris CIF at 144 kbits/s, Paris CIF at 384 kbits/s, and Irene CIF at 384 kbits/s. The average Y-PSNR values of the coded sequences under different packet loss rates are shown in Fig.2-7.

Foreman QCIF 64 kbps 21 20 + Fig.2. MB mode selection simulation results of Foreman QCIF at 64 kbits/s. Foreman QCIF 144 kbps 21 + Fig.3. MB mode selection simulation results of Foreman QCIF at 144 kbits/s. Hall QCIF 32 kbps 32 31 30 29 + Fig.4. MB mode selection simulation results of Hall Monitor QCIF at 32 kbits/s.

Irene CIF 384 kbps 37 36 35 34 33 32 31 30 29 + Fig.5. MB mode selection simulation results of Irene CIF at 384 kbits/s. Paris CIF 144 kbps + Fig.6. MB mode selection simulation results of Paris CIF at 144 kbits/s. Paris CIF 384 kbps 31 30 29 + Fig.7. MB mode selection simulation results of Paris CIF at 384 kbits/s.

The simulation results show that outperforms significantly in all cases. Compared to, the performance of is better in three cases while worse in the left three cases. Therefore, the performances of the two methods are competitive. However, the complexity of is similar to that of, which is a fractional of s complexity. In addition, use of is not exclusive to the use of. When combined with LA- RDO, the error resiliency performance of is can be further improved. As can be seen from Fig.2-7, the combined method outperforms consistently in all the six cases. 5. CONCLUSION The error resiliency performance of the isolated regions technique in error-robust macroblock mode selection is investigated in this paper. Compared to conventional methods, the performance of the proposed method is significantly better, either in error resiliency or in computational complexity. This profit, together with many others, makes the isolated regions technique a highly valuable video coding tool. REFERENCES [1] ISO/IEC International Standard 14496-2:2001, Annex E, Features supported, by the algorithm. [2] J. Y. Liao and J. D. Villasenor, Adaptive intra update for video coding over noisy channels, in Proc. IEEE International Conference on Image Processing, Oct. 1996. [3] ITU-T Rec. H.4 ISO/IEC 14496-10 AVC Draft Text, Joint Video Team document JVT-E146d37, Oct. 2002. [4] Ye-Kui Wang and Miska M. Hannuksela, Isolated regions: motivation, problems and solutions, Joint Video Team document JVT-C072, May 2002. [5] Ye-Kui Wang and Miska M. Hannuksela, Error-robust video coding using isolated regions, Joint Video Team document JVT-C073, May 2002. [6] Ye-Kui Wang and Miska M. Hannuksela, Gradual decoder refresh using isolated regions, Joint Video Team document JVT-C074, May 2002. [7] Ye-Kui Wang and Miska M. Hannuksela, Coding of masked scene transitions using isolated regions, Joint Video Team document JVT-C075, May 2002. [8] ITU-T Rec. H.4 ISO/IEC 14496-10 AVC Joint Model, version JM-1.4, Apr. 2002. [9] Stephan Wenger, Common conditions for wire-line, low delay IP/UDP/RTP packet loss resilient testing, ITU- T VCEG document VCEG-N79r1, Sep. 2001.