University of Bristol - Explore Bristol Research. Peer reviewed version. Link to published version (if available): /ISCAS.2006.

Similar documents
Improved H.264/AVC Requantization Transcoding using Low-Complexity Interpolation Filters for 1/4-Pixel Motion Compensation

Homogeneous Transcoding of HEVC for bit rate reduction

H.264 to MPEG-4 Transcoding Using Block Type Information

H.264/AVC Baseline Profile to MPEG-4 Visual Simple Profile Transcoding to Reduce the Spatial Resolution

LIST OF TABLES. Table 5.1 Specification of mapping of idx to cij for zig-zag scan 46. Table 5.2 Macroblock types 46

Efficient MPEG-2 to H.264/AVC Intra Transcoding in Transform-domain

ERROR-ROBUST INTER/INTRA MACROBLOCK MODE SELECTION USING ISOLATED REGIONS

Complexity Reduced Mode Selection of H.264/AVC Intra Coding

Introduction to Video Encoding

Video Coding Using Spatially Varying Transform

Advanced Video Coding: The new H.264 video compression standard

EFFICIENT PU MODE DECISION AND MOTION ESTIMATION FOR H.264/AVC TO HEVC TRANSCODER

Title Adaptive Lagrange Multiplier for Low Bit Rates in H.264.

Fast Mode Decision for H.264/AVC Using Mode Prediction

Complexity Reduction Tools for MPEG-2 to H.264 Video Transcoding

Editorial Manager(tm) for Journal of Real-Time Image Processing Manuscript Draft

BLOCK MATCHING-BASED MOTION COMPENSATION WITH ARBITRARY ACCURACY USING ADAPTIVE INTERPOLATION FILTERS

NEW CAVLC ENCODING ALGORITHM FOR LOSSLESS INTRA CODING IN H.264/AVC. Jin Heo, Seung-Hwan Kim, and Yo-Sung Ho

Multidimensional Transcoding for Adaptive Video Streaming

SNR Scalable Transcoding for Video over Wireless Channels

A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames

Fast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE Gaurav Hansda

Very Low Complexity MPEG-2 to H.264 Transcoding Using Machine Learning

Introduction to Video Encoding

A COST-EFFICIENT RESIDUAL PREDICTION VLSI ARCHITECTURE FOR H.264/AVC SCALABLE EXTENSION

IMPROVED CONTEXT-ADAPTIVE ARITHMETIC CODING IN H.264/AVC

High Efficiency Video Coding: The Next Gen Codec. Matthew Goldman Senior Vice President TV Compression Technology Ericsson

In the name of Allah. the compassionate, the merciful

Homogeneous Transcoding of HEVC

Digital Image Stabilization and Its Integration with Video Encoder

COMPRESSED DOMAIN SPATIAL SCALING OF MPEG VIDEO BIT- STREAMS WITH COMPARATIVE EVALUATION

Reduced Frame Quantization in Video Coding

H.264/AVC BASED NEAR LOSSLESS INTRA CODEC USING LINE-BASED PREDICTION AND MODIFIED CABAC. Jung-Ah Choi, Jin Heo, and Yo-Sung Ho

Video Transcoding Architectures and Techniques: An Overview. IEEE Signal Processing Magazine March 2003 Present by Chen-hsiu Huang

An Efficient Mode Selection Algorithm for H.264

Rate Distortion Optimization in Video Compression

Objective: Introduction: To: Dr. K. R. Rao. From: Kaustubh V. Dhonsale (UTA id: ) Date: 04/24/2012

FAST MOTION ESTIMATION WITH DUAL SEARCH WINDOW FOR STEREO 3D VIDEO ENCODING

ABSTRACT. KEYWORD: Low complexity H.264, Machine learning, Data mining, Inter prediction. 1 INTRODUCTION

An Efficient Motion Estimation Method for H.264-Based Video Transcoding with Arbitrary Spatial Resolution Conversion

CONTENT BASED ARCHITECTURE FOR VIDEO TRANSCODING

Video Compression MPEG-4. Market s requirements for Video compression standard

Complexity Estimation of the H.264 Coded Video Bitstreams

CONTENT ADAPTIVE COMPLEXITY REDUCTION SCHEME FOR QUALITY/FIDELITY SCALABLE HEVC

Professor, CSE Department, Nirma University, Ahmedabad, India

Performance Comparison between DWT-based and DCT-based Encoders

EE 5359 H.264 to VC 1 Transcoding

FRAME-RATE UP-CONVERSION USING TRANSMITTED TRUE MOTION VECTORS

IEEE 1857 Standard Empowering Smart Video Surveillance Systems

Reducing/eliminating visual artifacts in HEVC by the deblocking filter.

Fast Intra Mode Decision in High Efficiency Video Coding

Advanced Encoding Features of the Sencore TXS Transcoder

Context based optimal shape coding

FAST MOTION ESTIMATION DISCARDING LOW-IMPACT FRACTIONAL BLOCKS. Saverio G. Blasi, Ivan Zupancic and Ebroul Izquierdo

Context-Adaptive Binary Arithmetic Coding with Precise Probability Estimation and Complexity Scalability for High- Efficiency Video Coding*

OVERVIEW OF IEEE 1857 VIDEO CODING STANDARD

Transcoding from H.264/AVC to High Efficiency Video Coding (HEVC)

A Novel Deblocking Filter Algorithm In H.264 for Real Time Implementation

Categorization for Fast Intra Prediction Mode Decision in H.264/AVC

Elsevier Editorial System(tm) for Journal of Visual Communication and Image Representation

Secure Scalable Streaming and Secure Transcoding with JPEG-2000

White paper: Video Coding A Timeline

Adaptation of Scalable Video Coding to Packet Loss and its Performance Analysis

High Efficiency Video Coding. Li Li 2016/10/18

Updates in MPEG-4 AVC (H.264) Standard to Improve Picture Quality and Usability

Deblocking Filter Algorithm with Low Complexity for H.264 Video Coding

Digital Video Processing

Interframe coding A video scene captured as a sequence of frames can be efficiently coded by estimating and compensating for motion between frames pri

MPEG-2. ISO/IEC (or ITU-T H.262)

EE 5359 MULTIMEDIA PROCESSING SPRING Final Report IMPLEMENTATION AND ANALYSIS OF DIRECTIONAL DISCRETE COSINE TRANSFORM IN H.

EE 5359 Low Complexity H.264 encoder for mobile applications. Thejaswini Purushotham Student I.D.: Date: February 18,2010

A Novel Partial Prediction Algorithm for Fast 4x4 Intra Prediction Mode Decision in H.264/AVC

Copyright Notice. Springer papers: Springer. Pre-prints are provided only for personal use. The final publication is available at link.springer.

Smoooth Streaming over wireless Networks Sreya Chakraborty Final Report EE-5359 under the guidance of Dr. K.R.Rao

Week 14. Video Compression. Ref: Fundamentals of Multimedia

Reduced 4x4 Block Intra Prediction Modes using Directional Similarity in H.264/AVC

Sample Adaptive Offset Optimization in HEVC

ADAPTIVE JOINT H.263-CHANNEL CODING FOR MEMORYLESS BINARY CHANNELS

BANDWIDTH REDUCTION SCHEMES FOR MPEG-2 TO H.264 TRANSCODER DESIGN

CODING METHOD FOR EMBEDDING AUDIO IN VIDEO STREAM. Harri Sorokin, Jari Koivusaari, Moncef Gabbouj, and Jarmo Takala

Star Diamond-Diamond Search Block Matching Motion Estimation Algorithm for H.264/AVC Video Codec

Video Codecs. National Chiao Tung University Chun-Jen Tsai 1/5/2015

TRANSCODING OF H264 BITSTREAM TO MPEG 2 BITSTREAM. Dr. K.R.Rao Supervising Professor. Dr. Zhou Wang. Dr. Soontorn Oraintara

COMPARATIVE ANALYSIS OF DIRAC PRO-VC-2, H.264 AVC AND AVS CHINA-P7

Building an Area-optimized Multi-format Video Encoder IP. Tomi Jalonen VP Sales

Implementation and analysis of Directional DCT in H.264

An Efficient Table Prediction Scheme for CAVLC

QUANTIZER DESIGN FOR EXPLOITING COMMON INFORMATION IN LAYERED CODING. Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose

Optimized Segmentation of H.264/AVC Video for HTTP Adaptive Streaming

Compression and File Formats

Fast frame memory access method for H.264/AVC

One-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain

Chapter 11.3 MPEG-2. MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications:

CMPT 365 Multimedia Systems. Media Compression - Video

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework

H.264/AVC und MPEG-4 SVC - die nächsten Generationen der Videokompression

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework

VHDL Implementation of H.264 Video Coding Standard

CISC 7610 Lecture 3 Multimedia data and data formats

Module 7 VIDEO CODING AND MOTION ESTIMATION

Transcription:

Lefol, D., Bull, D. R., & Canagarajah, C. N. (006). Mode refinement algorithm for H.64 intra frame requantization. In 006 IEEE International Symposium on Circuits and Systems, 006 (ISCAS 006) Kos, Greece. (pp. 4459-446). Institute of Electrical and Electronics Engineers (IEEE). DOI: 0.09/ISCAS.006.6969 Peer reviewed version Link to published version (if available): 0.09/ISCAS.006.6969 Link to publication record in Eplore Bristol Research PDF-document University of Bristol - Eplore Bristol Research General rights This document is made available in accordance with publisher policies. Please cite only the published version using the reference above. Full terms of use are available: http://www.bristol.ac.uk/pure/about/ebr-terms.html

Mode Refinement Algorithm for.64 Intra Frame Requantization Damien Lefol, Dave Bull and Nishan Canagarajah Department of Electrical and Electronic Engineering University of Bristol Bristol, U.K. Damien.lefol@bristol.ac.uk Abstract - The latest video coding standard H.64 has been As demonstrated in previous work [7], the FPDT recently approved and has already been adopted for numerous algorithm cannot be used with H.64 bitstream, and the applications including HD-DVD and satellite broadcast. To CPDT quality can be significantly lower than a full decode allow interconnectivity between different applications using and recode. This difference is mainly due to the large H.64, transcoding will be a key factor. When requantizing a number of new tools introduced by H.64. The compression bitstream the incoming coding decisions are usually kept unchanged to reduce the compleity, but it can have a major efficiency of this new standard is maimal only when all impact on the coding efficiency. This paper proposes a novel modes are used. When requantizing H.64 bitstream with algorithm for mode refinement of intra prediction in the case CPDT, the encoding decisions of the incoming bitstream are of requantization of H.64 intra frames. The proposed kept to reduce the compleity. This implies that the approach gives a comparable quality to a full search for a transcoded video uses sub-optimal encoding parameters. A fraction of its compleity by eploiting the statistical properties mode refinement algorithm is needed to improve the coding of the mode distribution. efficiency of the transcoder while reusing as much of the incoming information as possible. The mode refinement I. INTRODUCTION algorithm presented in this paper allows to choose the best prediction mode without having to do a full search in the The new standard H.64 [] is already successful thanks to case of intra frame requantization. The simulation results the variety of scenarios that it can cover and the high quality show it is p eqtozave a q itcloto aefull of video it can deliver even at low bitrates. Recently it has seac whi savingiup to he complity. been provisionally approved as one of the standards for HD- sectio this p wl g q v DVD and many broadcasters plan to use it to deliver req tion alorithm used The limitation of thi Alargle amount of research is ongoing in H.64 satellite video. which of the. requantzation algorithm used. The limitation of this algorithm and the proposed mode refinement algorithm are is A repamu oresac is o oi inh e64 whi described in section together with different possible tuning thought to replace MPEG- in the coming years. The toblneqaiyndcmey.suainrslsae applications using H.64 will range from multimedia content delivery on mobile handset to High Definition given in Section 4. Finally, section 5 concludes the paper. television broadcasting. To allow such diversity in the video broadcasting, it will be necessary to have means of adapting II. REQUANTIZATION FOR H.64 the video to the distribution channel. One solution would be The requantization algorithm used in this paper is a to store only the highest quality bitstream on the server side Cascaded Piel Domain Transcoder (CPDT) adapted to and to transcode the bitstream depending on the customers H.64 as described in [7]. The transcoding compleity is needs. kept low by reusing the encoding information from the Many algorithms have been developed for the incoming bitstream. This approach yields good results when requantization of video in the last decade. Some of those the transcoded bitrate is close to the original bitrate, but for algorithms, such as the Cascaded Piel Domain Transcoder large differences the quality drops. This is due to a sub- (CPDT) [] and the Fast Piel Domain Transcoder (FPDT) o [], [4], [],[4] have been hae ben used ued successfully uccesfuly in n many may practical pactcal optimal use of H.64 encoding macroblock (MB) modes. Fiueoband sngte prmtlstp applications [5], [6]. It is possible to adapt those algorithms Fgure, obtained usng the eperimental setup with some changes to the new H.64 standard. described in section 4, shows that for intra frame, in a high bitrate video, intra 44 modes (I_44) will be predominant The work is funded by CRL ROAM4G. 0-780-990-/06/$0.00 006 IEEE 4459 ISCAS 006 Authorized licensed use limited to: UNIVERSITY OF BRISTOL. Downloaded on January, 009 at 05:59 from IEEE Xplore. Restrictions apply.

whereas in a low bitrate video intra 66 (I_66) can be evolve in opposite direction. The mode refinement decision nearly as frequent. is taken at a MB level using the macroblock size (in bits), number of intra 44 and intra 66 mode in function of the bitrate mb size, as specified in equation. 00_ 90 80 compute sad_in 70 cr refine = O 60 iintra 66 MB NO 50-40 0 N size_ma 0 0 try 8. 0.4 0.6 0.8 ta 4.6.8. NO bitrate 07 keep old 44 Fig. repartition of the intra prediction mode with the bitrate t6 = t6 X 0.999 <s The increase of bitrate due to this loss of compression new_mode=66 efficiency can be as high as 60% when a very high bitrate cr_refine= needs to be transcoded to lower bitrates (Cf. figure 4). To tl6=tl6.00 compensate for this loss it is necessary to change at least some of the incoming encoding decision. The algorithm NO size > size_ma presented below allows to decide when to refine an incoming macroblock or not. I III. MODE REFINEMENT ALGORITHM keep old 44 NO s 4< sa One of the problems of mode refinement is to decide when ES to refine a macroblock or not since this could lead to poor new_mode=44 quality or higher compleity. The statistics obtained from cr_refine= bitstream requantized at different bitrates show that intra t4 = t4 X 0.999 44 becomes intra 6l6 when the bitrate decreases. However this requires knowledge of which intra 44 r become intra 66 and what should be done with the remaining intra 44. One option is to consider all intra 44 try croma but this will increase the compleity. On the other hand when the quantization parameter (QP) increases the cost of encoding intra prediction direction for 44 MB can be relatively high. The intra 44 mode predicted from its Fig. Flowchart of the threshold mode refinement algorithm neighbors as most probable is then chosen more often as it Only incoming MB encoded in 4 are concered by the will save bits thanks to the intra mode predictive encoding. mode refinement because if the encoder used a 66 mode Failing to use this most probable mode can incur an increase in the high bitrate video it is nearly sure that the same mode of 5% of the MB headers. The number of mode refinement will be kept at a lower bitrate. If the mode refinement tried needed increases with the difference of bitrate between the is not better than the original mode (using a sum of absolute input and the output of the transcoder. difference or SAD comparison) the original mode is kept. If mbtype = I_44, then The threshold t6 is then decreased and t4 increased by 0.0% _mbsize < ma_mb_size t6 I_66 refinement to reduce the chances to try a refinement for the net MB, mb_size _mb_size thus keeping the compleity as low as possible. On the other Lmb size>ma_mb_sizet - - 44rehand if the mode refinement tried is better, the new mode is else if mb_type = I_66, then kept and the corresponding threshold is modified by 0.% no mode refinement so that mode refinement is more likely for the net MB. This adaptation of the threshold is done on a macroblock Theappoachsento olv ths poblm i tousetwo basis because the amount of refinement needed is closely variable thresholds. The first one, called t6, is used to decide if a I_66 mode should be tried and the second lne otelclvdopoete.crm eieeti thehld ald 4 o I44moe.Nteta - n turn on if any luma mode refinement tried for this MB has '- kept, otherwise no mode refinement is done for the ~~~~~~~~been 4460 Authorized licensed use limited to: UNIVERSITY OF BRISTOL. Downloaded on January, 009 at 05:59 from IEEE Xplore. Restrictions apply.

chroma component of the current MB. The flowchart of our against the output bitrate wanted. The thresholds values of threshold mode refinement algorithm is given in figure. each possible QP were then optimized to reduce the number The initial value of the thresholds is a function of the of mode refinement tried but not kept. This first approach input and output quantization parameters (Cf. equation ). If was refined by applying multiplicative coefficients k, QP is smaller than QP, no mode refinement will be tried included between and, to the threshold values so as since this would be done on a noisy video and thus give to have a finer approimation of the optimum thresholds. worse results (Cf figure 7). If QP is higher than QP an Figure and 4 shows the results obtained from those initial threshold value is set depending on the difference of simulations. The comparisons are always done with the full QP (AQP). The smaller AQP is, the higher the threshold will search corresponding to 00% of compleity and 0% of be, so less mode refinements are done. The initial values of thresholds have been chosen empirically by trying to match bitrate increase. measured compleity in number of mode tried the result given by a full search process. 70 -'-refinement k = refinement k Ql > QP.4 6andt, =( 60 refinement k =0.8 () -refinement k=. wqpi LQ'~ < <Qf~ QP Qt4 = [AQP]and X6QPP t6 Y[AQP] 40 50refinementk= with, y [0,] and A\QP = QP QPI 50 -refinement variable k } 0 \ IV. RESULTS 0 The simulations were done by transcoding a video sequence, 0\ composed of three concatenated standard definition 0- sequences (SD 70576). The first 5 frames are from _ "Pedestrian", then 5 frames from "Tractor" and the last 5 o.5 bi (inb5) 5 frames are from "Toys". Those frames are separated by 5 0 frames in the original sequence so as to represent a typical f Fig. 4 Compleity comparison for different mode refinement thresholds encoding situation with an intra frame every half a second. Figure shows that doing mode refinement is highly The first sequence contains multiple occlusions, the second beneficial for the output bitrate of the transcoder. When a tracking camera and high teture and the third, comple compared to transcoding without refinement we obtain a motions and uniform areas. gain of up to 55% in terms of bitrate at similar quality. Difference of bitrate Actually the mode refinement gives result comparable to the 50 60 --full searchf -no refinement u searc. ure 4 slows the comp ieityin teirms o -refinement k = number of modes tried for the developed algorithms. It can refinement k = -refinement k = 0.8 be seen that for bitrate close to the original one, there is no. -0 40- \ O -refinement k = increase in compleity; it then increases as more MBs are At\ p ~~~~~~refinement variable kcopeiyinras 8 -refinement variable k tried for refinement. It also shows clearly that the balance 0- between compleity and quality of the mode refinement 0 algorithm can be modified gradually by using different values of k. values of k. Rate Distortion 0 n 0full 0 search _-no refinement 50 threshold refinement bitrate (in bits/sec))x07 Fig. bitrate difference for different mode refinement thresholds 45 The bitstream has been encoded at 0 frames per second with a group of picture containing only intra frames. The 40 - z original sequence is encoded in H.64 using the reference software JM8.5 at a QP of 0. This sequence is then / transcoded at different bitrates using our transcoder. This 4 simulation corresponds to an incoming video at a bitrate of 0-06 4.68 Mbps. This is rather high since no inter frames werellll used in the simulation. % 4 4.5 The mode refinement algorithm uses thresholds defined bitrate (in bits/sec) 0 empirically by fitting the statistical distribution of the modes Fg aedsoto oprsnfra nu eunea P 446 Authorized licensed use limited to: UNIVERSITY OF BRISTOL. Downloaded on January, 009 at 05:59 from IEEE Xplore. Restrictions apply.

in inter frames and linked to motion vector refinement algorithm previous developed such as [8]. measured compleity -compleity in number of mode -compleity computation 80 in 70 time measured compleity 70- -6=compleity in number of mode 60 60\: in computation time -~~~~~~~~~~~~~~~~~~~~~~~~~~~~compleity 50 - -5040-8s _L000-0 0 80 bitrate (in bits/sec) 4 Fig. 6. Compleity of the mode refinement algorithm (input at QPO) 4.5 0 From the simulation done with different multiplicative coefficients k, we decided which k to use depending on the difference between input and output bitrate. This approach, called variable k in the previous graph, is designed by threshold refinement algorithm in the rest of this paper. Figure 5 gives the rate distortion obtained. Figure 6 shows the corresponding compleity in number of modes tried and time spent on mode refinement by the transcoder. It shows that approimating the compleity of the mode refinement algorithm by the number of modes tried is quite close to the reality. Figure 6 shows clearly that when the bitrate of the output bitstream is more than half the input bitrate, the compleity of our algorithm is less than 5% of full search. Even in the case of a bitrate reduction of 85%, the compleity is still kept under 50% compared to full search. In the worst case scenario the compleity is still 5% lower for the same quality. -no refinement - -threshold refinement 40 - (in bits/sec) 07 CONCLUSION REFERENCES G ITU-T Rec. H.64 ISO, "ISO/IEC 4496-0 AVC International Standard for Advanced Video Coding," 00. A. Vetro, C. Christopoulos, and H. Sun, "Video transcoding architectures and techniques: An overview," IEEE Signal Processing Magazine, vol. 0, pp. 8-9, 00. P. A. A. Assuncao and M. Ghanbari, "Frequency-domain video dynamic bit-rate reduction of MPEG- bit /,streams," IEEE Transactions on Circuits and Systems for Video transcoder for 8 ' 6 4 _-_ // [4] _ 47%o bitratet increase ca / f, -- / - bitrate H.64 provides an efficient compression standard for video coding, but when requantizing H.64 bitstream its efficiency can be seriously limited if using non optimal macroblock mode and intra prediction mode. When the incoming coding information is reused without mode refinement, the bitrate increase can be as high as 60% compared to a full recode. The threshold mode refinement algorithm presented here provides an efficient tool to keep the quality and bitrate of the transcoded intra frames at its maimum while reducing the compleity by at least 5% and up to 95% compared to a full search approach. [] 8 -/ 0 _ V [] -full search 4 Fig. 8. Compleity of the mode refinement algorithm (input at QP0) [] Rate Distortion graph 44 & 5 '--_ 7 N< // > 4 bitrate (in bits/sec) [5] Fig. 7 Rate distortion comparison for an input sequence at QP 0 07 Technology, vol. 8, pp. 95-967, 998. J. Youn, M.-T. Sun, and J. Xin, "Video transcoder architectures for bit rate scaling of H.6 bit streams," presented at of the 999 7th International Multimedia /Proceedings Conference - ACM MULTIMEDIA '99, Oct 0-Nov 5 999, Orlando, FL, USA, 999. P. N. Tudor and 0. H. Werner, "Real-time transcoding of MPEG- video bit streams," presented at Proceedings of the 997 International Broadcasting Convention, Sep -6 997, Amsterdam, Neth, 997. [6] Figure 7 and Figure 8 shows the validity of those results for a different input bitrate. The transcoder input is now at [7] peaks on the RD curve are due to matching QPs when trying [8] 8.8 Mbps (QP 0), and the transcoder gives an output, two,. The ranging from twice to a tenth of the input bitrate. to transcode at a bitrate higher than the input one. The mode refinement approach presented here is simple and efficient. It could be easily adapted for mode refinement ACTS, "Advanced television at low bit rates and networked transmission over integrated communication systems," Online: web-site: ht p: vwvv.bbc.co.uk at/antic., 996. D. Lefol, D. Bull, N. Canagarajah, and F. Rovati, "Performance evaluation of transcoding algorithms for H.64," to be presented ICCE 006 - International Conference on Consumer ~~~~~at "in press", Las Vegas, USA, 006. Electronics, J. Youn, M.-T. Sun, and C.-W. Lin, "Motion estimation for high performance transcoding," IEEE Transactions on Consumer Electronics, vol. 44, pp. 649-658, 998. 446 Authorized licensed use limited to: UNIVERSITY OF BRISTOL. Downloaded on January, 009 at 05:59 from IEEE Xplore. Restrictions apply.