Motion Vector Coding Algorithm Based on Adaptive Template Matching


Wen Yang #1, Oscar C. Au #2, Jingjing Dai #3, Feng Zou #4, Chao Pang #5, Yu Liu 6
# Electronic and Computer Engineering, The Hong Kong University of Science and Technology, Hong Kong
1 eeyangw@ust.hk, 2 eeau@ust.hk, 3 jjdai@ust.hk, 4 fengzou@ust.hk, 5 pcece@ust.hk
MTI/ECE, Applied Science and Technology Research Institute, HK Science Park, Shatin, NT, Hong Kong
6 liuyu@astri.org

Abstract

Motion estimation, together with the corresponding motion compensation, is a core part of modern video coding standards and greatly improves compression efficiency. However, motion information takes up a considerable portion of the compressed bit stream, especially at low bit rates. In this paper, an efficient motion vector prediction algorithm is proposed to minimize the bits used for coding the motion information. First, a motion vector predictor (MVP) candidate set (CS) containing several scaled spatial and temporal predictors is defined. To increase the diversity of the predictors, one spatial predictor is adaptively changed based on the current distribution of the neighboring motion vectors. Next, an adaptive template matching technique is applied to remove ineffective predictors from the CS, so that the bits used for the MVP index are significantly reduced. Because the final MVP is chosen by a minimum motion vector difference criterion, a guessing strategy is further introduced so that, in some situations, the bits for signaling the MVP index to the decoder can be omitted entirely. Experimental results indicate that the proposed method achieves an average bit rate reduction of 5.9% compared with the H.264 standard.

I. INTRODUCTION

State-of-the-art video compression standards such as MPEG-4 and ITU JVT/H.264 [1] combine many techniques to achieve high compression efficiency.
For example, block-matching motion estimation (ME) is widely used to exploit the temporal correlation between frames and to achieve good compression by reducing this redundancy. ME methods search a restricted area of the reference frame for a good prediction of the current block while trying to keep the complexity low. The relative displacement between the best-match block and the current block is called the motion vector (MV). The corresponding reconstruction step, motion compensation (MC), rebuilds the block from the best-match block and the residue.

To perform MC at the decoder, video standards must specify how the MV is coded. H.264 allows sub-block motion estimation for higher prediction accuracy, which means that each sub-block has its own motion and needs extra bits for its own MV. As a result, a large number of bits are spent on motion information for inter blocks. When the bit rate is low, the share of bits spent on motion vector coding can reach 43%, as shown in Figure 1. There is thus a pressing need for highly efficient motion vector coding schemes.

Fig. 1. Proportion of bits for motion information

To reduce the bits needed to represent the MV information, H.264 encodes the MV predictively. For each block, H.264 constructs a motion vector predictor (MVP) as the median of three neighboring MVs,

$$ mv_{H.264} = \mathrm{median}(mv_A, mv_B, mv_C) \tag{1} $$

where $mv_A$, $mv_B$ and $mv_C$ are the MVs of the neighboring blocks A, B and C, respectively (the locations of blocks A, B and C are shown in Figure 2). The motion vector difference (MVD) between the MV of the current block and the MVP is then encoded into the bit stream. Since the MV must be coded losslessly, the coding performance depends mainly on the prediction accuracy.
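As a small illustration of Eqn. 1, the sketch below computes a median predictor and the resulting MVD. It assumes the median is taken component-wise, as in H.264, and it ignores the special cases of the standard (unavailable neighbors, directional prediction for some partitions). The names `median_mvp` and `mvd` are hypothetical helpers, not part of any reference software.

```python
from typing import Tuple

MV = Tuple[int, int]  # (x, y) displacement, e.g. in quarter-pel units (assumption)


def median_mvp(mv_a: MV, mv_b: MV, mv_c: MV) -> MV:
    """Component-wise median of the three neighboring MVs (Eqn. 1)."""
    xs = sorted((mv_a[0], mv_b[0], mv_c[0]))
    ys = sorted((mv_a[1], mv_b[1], mv_c[1]))
    return (xs[1], ys[1])


def mvd(mv: MV, mvp: MV) -> MV:
    """Motion vector difference that would be written to the bit stream."""
    return (mv[0] - mvp[0], mv[1] - mvp[1])


if __name__ == "__main__":
    predictor = median_mvp((4, 0), (5, -1), (12, 3))
    print(predictor)               # (5, 0)
    print(mvd((6, 1), predictor))  # (1, 1)
```

The closer the predictor is to the true MV, the smaller the MVD and the fewer bits it costs, which is why the rest of the paper concentrates on choosing a better predictor.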
The predictor $mv_{H.264}$ is effective in reducing the MV coding bits because it tends to be close to the MV in most cases; however, it does not always minimize the MVD. If other, more effective MVPs are considered, there is a good chance that the MV coding bits can be reduced further.

Fig. 2. Neighboring blocks of current block

In recent years, MV coding has attracted considerable attention and much research has been conducted [2-6]. In Kim and Ra's work [2], several neighboring MVP candidates were examined and the one producing the minimum bit rate in MVD coding was selected. To obtain the MVP at the decoder, the predictor indexes for the x and y components had to be transmitted. Guillaume Laroche et al. [3] improved on [2] by considering other useful predictors and by jointly predicting the x and y components of the MV. They also proposed an MV prediction criterion based on a modified rate-distortion (RD) cost optimization that covers the index cost as well as the MVD cost. The methods in [2] and [3] give the best results in terms of prediction error, but the required side information is considerable. Therefore, [4] proposed a guessing strategy that saves the index bits in situations where the decoder can derive the MVP by itself. Bongsoo Jung et al. [5] introduced a new macroblock coding mode, pooled zero vector coding, as an efficient representation when all 4×4 blocks in a macroblock (MB) have zero MVDs; in this situation only 5 bits of header information are consumed instead of 32 bits in traditional coding. Finally, in [6], S. Kamp et al. put forward a decoder-side motion estimation that eliminates the coding of the MVD and the reference index altogether. They defined a set of decoded pixels surrounding the current block as the template and performed a matching process for this template to find the MV of the current block. This process is called template matching (TM) and is discussed in detail later.

In this paper, we propose an efficient motion vector coding algorithm. First, an MVP candidate set (CS) containing several spatial and temporal predictors is defined, in which one spatial predictor is adaptively changed based on the current distribution of the neighboring MVs. Next, we apply TM to exclude ineffective predictors from the CS, so that the bits used for indexing the final MVP are significantly reduced. Adaptive template width and shape strategies are introduced to increase the TM accuracy.
The final MVP of the current block is selected from the size-reduced CS by a minimum MVD criterion. Finally, the guessing strategy is introduced so that, in some situations, the bits for signaling the MVP index to the decoder can be avoided entirely. Simulation results indicate that the proposed method achieves a bit rate reduction of 5.9% on average compared with the H.264 standard.

II. PROPOSED MOTION VECTOR PREDICTION ALGORITHM

A. Motion Vector Predictor Scaling

The MVs of neighboring blocks are usually used as MVP candidates for the current block. However, because H.264 allows multiple reference frames, different MVs may refer to different reference frames, so their temporal distances from the current frame differ. In that case, even if the neighboring blocks follow the same motion as the current block, the MV values may differ considerably. MVP candidates should therefore be scaled according to their temporal distances. Taking $mv_A$ in Eqn. 1 as an example, suppose the temporal distance between block A and its reference frame is $d_a$ and the temporal distance between the current block and its reference frame is $d_c$. The scaled predictor is then

$$ mv_{sA} = mv_A \cdot \frac{d_c}{d_a} \tag{2} $$

Eqn. 2 is likewise used to calculate $mv_{sB}$ for block B, $mv_{sC}$ for block C (B and C are identified in Fig. 2), $mv_{scol}$ for the collocated block, which is located at the same position as the current block but in the previous frame, and so on. In the proposed method every predictor is used in its scaled form.

B. Motion Vector Predictor Candidate Set

As described above, $mv_{H.264}$ considers only spatial correlation, and the nature of the median can give a wrong result when the current MV follows the minority motion among the three neighboring MVs, which often happens at object boundaries. To improve the accuracy of the MVP, other temporally and spatially related MVs should also be included in the MVP candidate set (CS).
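The temporal scaling of Eqn. 2 can be sketched as follows. The rounding to the nearest integer is an assumption made here so that the result stays on the MV grid; the paper does not state how fractional results are handled. `scale_mv` is a hypothetical helper name.

```python
from typing import Tuple

MV = Tuple[int, int]


def scale_mv(mv: MV, d_c: int, d_a: int) -> MV:
    """Scale a neighboring MV by the ratio of temporal distances (Eqn. 2).

    d_c: temporal distance between the current block and its reference frame.
    d_a: temporal distance between the neighboring block and its reference frame.
    Rounding to the nearest integer is an assumption of this sketch.
    """
    if d_a == 0:
        raise ValueError("temporal distance of the neighbor must be non-zero")
    return (round(mv[0] * d_c / d_a), round(mv[1] * d_c / d_a))


if __name__ == "__main__":
    # Neighbor A points three frames back, the current block one frame back.
    print(scale_mv((6, -3), d_c=1, d_a=3))  # (2, -1)
```

In this example a neighbor that moved 6 units over three frames is treated as moving 2 units over one frame, which is the motion the current block would be expected to share.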
In the proposed algorithm, the CS includes four predictors,

$$ CS = \{ mv_{sH.264},\ mv_{0,0},\ mv_{scol},\ mv_{snei} \} \tag{3} $$

where $mv_{sH.264}$ is the same as $mv_{H.264}$ except that the three neighboring MVs are the scaled versions $mv_{sA}$, $mv_{sB}$ and $mv_{sC}$ obtained from Eqn. 2.

$mv_{0,0}$ is the (0,0) motion vector. In many real videos, many objects, such as the background, remain stationary with MV (0,0). In addition, when the current block has a random motion, which often happens in zooming or other complex scenes, $mv_{0,0}$ is likely to be a good predictor.

$mv_{scol}$ is the scaled collocated predictor. It is included because the current block and its collocated block very often belong to the same object in most video sequences, so there is a high probability that the two blocks undergo a similar motion.

$mv_{snei}$ is the vector farthest from $mv_{sH.264}$ among the three neighboring MVs $mv_{sA}$, $mv_{sB}$ and $mv_{sC}$ [4]:

$$ mv_{snei} = \arg\max_{mv_i \in S} \lVert mv_i - mv_{sH.264} \rVert_2, \quad S = \{ mv_{sA}, mv_{sB}, mv_{sC} \} \tag{4} $$

$mv_{snei}$ thus changes adaptively with the current distribution of the neighboring MVs. When only one of the neighboring blocks belongs to the object containing the current block, $mv_{sH.264}$ follows the majority, which may have low correlation with the current MV. In such a case $mv_{snei}$ is more strongly correlated and is likely to provide a better prediction.
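The construction of the four-element candidate set (Eqns. 3 and 4) might look like the sketch below. It assumes the inputs are already scaled according to Eqn. 2 and that the median is component-wise; ties in Eqn. 4 are resolved in favor of the first neighbor, which is a choice of this sketch. All function names are hypothetical.

```python
from typing import List, Tuple

MV = Tuple[int, int]


def median_mv(a: MV, b: MV, c: MV) -> MV:
    """Component-wise median of three (already scaled) neighboring MVs."""
    return (sorted((a[0], b[0], c[0]))[1], sorted((a[1], b[1], c[1]))[1])


def farthest_neighbor(neighbors: List[MV], ref: MV) -> MV:
    """Neighbor with the largest Euclidean distance from ref (Eqn. 4).

    The squared distance is used; it has the same arg max as the L2 norm.
    """
    return max(neighbors,
               key=lambda mv: (mv[0] - ref[0]) ** 2 + (mv[1] - ref[1]) ** 2)


def build_candidate_set(mv_sa: MV, mv_sb: MV, mv_sc: MV, mv_scol: MV) -> List[MV]:
    """Return [median predictor, zero vector, collocated, farthest neighbor] (Eqn. 3)."""
    mv_sh264 = median_mv(mv_sa, mv_sb, mv_sc)
    mv_snei = farthest_neighbor([mv_sa, mv_sb, mv_sc], mv_sh264)
    return [mv_sh264, (0, 0), mv_scol, mv_snei]


if __name__ == "__main__":
    # Two neighbors move right, one moves left (e.g. at an object boundary).
    print(build_candidate_set((8, 0), (8, 1), (-4, 0), (7, 0)))
    # [(8, 0), (0, 0), (7, 0), (-4, 0)]
```

In the example the median follows the two right-moving neighbors, while the farthest neighbor keeps the minority motion (-4, 0) available in case the current block belongs to the other object.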

Note that only four predictors are included in the CS at this stage. However, because the proposed algorithm uses adaptive template matching to exclude ineffective predictors, the CS can be enlarged and its predictors refined without burdening the bit stream.

C. Adaptive Template Matching and Candidate Set Reduction

The initial CS can contain many MVPs, and the index that specifies the final predictor may require many bits. To solve this problem, the number of predictors in the CS is reduced by the adaptive template matching (ATM) technique. In our implementation, ATM retains the two better predictors in the reduced candidate set (RCS), so only 1 bit is needed for the index. The experimental results in the next section show that keeping two predictors in the RCS is a good trade-off between prediction accuracy and index bit reduction.

Template matching (TM) was originally proposed for texture synthesis, where it uses neighboring information to synthesize the required image or video. It has since also been employed in general video coding, as in [6], [7]. The basic principle of TM is shown in Figure 3 [6]: to derive a good prediction for the current block, a template region (TR) is defined around the target block. ATM must run at both the encoder and the decoder without mismatch, and at the decoder only the reconstructed pixels outside the top and left boundaries of the current block are available, so the template is usually L-shaped. The template widths $M_{left}$ and $M_{up}$ are the numbers of pixels by which the TR extends to the left of and above the target block. Usually, the sum of absolute differences (SAD) between the template of the current block and the template of the candidate block corresponding to an MVP is calculated to measure their similarity.

Fig. 3. Template matching
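The template SAD described above can be sketched as follows. Several details are assumptions of this sketch rather than statements of the paper: frames are plain 2-D lists indexed as `frame[row][col]`, the MV has integer-pel precision, the corner region is counted as part of the top strip, and the caller guarantees that all accessed pixels lie inside both frames. Setting `m_up` or `m_left` to 0 gives a left-only or top-only template.

```python
from typing import List, Tuple

Frame = List[List[int]]
MV = Tuple[int, int]


def template_coords(x: int, y: int, bw: int, bh: int,
                    m_left: int, m_up: int) -> List[Tuple[int, int]]:
    """(col, row) positions of the template around the block at (x, y)."""
    coords = []
    # Top strip, extended to the left so that the corner is covered.
    for r in range(y - m_up, y):
        for c in range(x - m_left, x + bw):
            coords.append((c, r))
    # Left strip beside the block.
    for r in range(y, y + bh):
        for c in range(x - m_left, x):
            coords.append((c, r))
    return coords


def template_sad(cur: Frame, ref: Frame, x: int, y: int, bw: int, bh: int,
                 mv: MV, m_left: int = 4, m_up: int = 4) -> int:
    """SAD between the current template and the template displaced by mv."""
    dx, dy = mv
    return sum(abs(cur[r][c] - ref[r + dy][c + dx])
               for c, r in template_coords(x, y, bw, bh, m_left, m_up))


if __name__ == "__main__":
    ref = [[7 * r + 3 * c for c in range(12)] for r in range(12)]
    cur = [[ref[r][c + 1] for c in range(11)] for r in range(12)]  # shift by one pixel
    print(template_sad(cur, ref, 4, 4, 4, 4, (1, 0), 2, 2))  # 0
    print(template_sad(cur, ref, 4, 4, 4, 4, (0, 0), 2, 2))  # 60
```

In the usage example the current frame is the reference shifted by one pixel, so the candidate (1, 0) matches the template exactly while the zero vector leaves a residual of 3 on each of the 20 template pixels.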
If a template is highly correlated with its block, it is reasonable to assume that the block associated with a well-matched template also provides a good prediction of the current block. This holds when the template belongs to the same object as the current block. However, when parts of the TR belong to another object with different motion, the template SAD may be large even for a good MVP. To avoid this situation, we propose adaptive template shape and width criteria based on the likely correlation between the template and the target block, as well as on the similarity inside the template.

The L-shaped template is divided into two parts: the top portion and the left portion. We allow the template to be the L-shaped template, the left template or the top template. The reason for using only one portion is that when a macroblock is divided into sub-blocks, the correlation between these sub-blocks is relatively low, so including pixels from another sub-block in the current template risks inaccurate matching for the current sub-block. Accordingly, the template shape selection strategy is defined as follows:

    if (MB type == P16×16)
        Current MB uses L-shape template;
    else if (MB type == P16×8)   /* with upper 16×8 sub-block and lower 16×8 sub-block */
        The upper sub-block uses L-shape template;
        The lower sub-block uses left template;
    else if (MB type == P8×16)   /* with left 8×16 sub-block and right 8×16 sub-block */
        The left sub-block uses L-shape template;
        The right sub-block uses top template;
    else if (MB type == P8×8)    /* with four 8×8 sub-blocks, some of which may be subdivided into smaller blocks of size 4×8, 8×4, 4×4 */
        The block at the upper left corner of current MB uses L-shape template;
        The blocks at the left boundary of current MB use left template;
        The blocks at the top boundary of current MB use top template;
        Other blocks use L-shape template.
        /* For these blocks, although both their top and left neighboring regions belong to other blocks, they have no choice but to use the neighboring regions as the template */

If the current block is smooth or textured, we would like the pixels in the template to have similar smoothness or texture characteristics. If the current block has an edge, we would like the template to contain an edge that extends the edge inside the block. In other words, we expect the template to be uniform with, or similar to, the current block, and to exclude non-uniform or dissimilar pixels. This is essentially an image segmentation problem. Image segmentation has been widely studied and many techniques exist in the literature, but given the limited TR, the limited computational capability and our special purpose, many image segmentation approaches are not appropriate here. For example, when an edge runs vertically across the top template (or horizontally across the left template), it extends an edge inside the current block and the pixels on both sides of the edge are present in the current block, so we want to retain the pixels on both sides of the edge in the template. In contrast, when an edge runs horizontally across the top template (or vertically across the left template), the pixels on the outer side of the edge may have relatively low correlation with the current block, and we want to remove these pixels from the TR. Although both situations involve an edge in the template, they call for quite different treatment. However, many image segmentation algorithms can only process both situations in the same way, so they do not suit our application.

In the above example, because we only want to remove the outer pixels when an edge runs roughly parallel to the target block boundary, the problem reduces to adaptively deciding the width of the template for the current block. In the proposed method, we calculate the differences inside the template to achieve this. Taking the top template as an example (shown in Figure 4), the template width is determined as follows:

Fig. 4. Different rows in template

    The largest width of a template is 4 and the smallest width is 1 (i.e., at least 1 row is included in the TR if the top template is considered);
    Calculate the difference between adjacent rows in the template:
        SAD12 = SAD between Row1 and Row2;
        SAD23 = SAD between Row2 and Row3;
        SAD34 = SAD between Row3 and Row4;
    if (SAD12 ≫ SAD23)
        The top template only includes Row1;
        /* When SAD12 is much bigger than SAD23, it is probable that Row1 belongs to one object (probably the same as the current block) while Row2 and Row3 belong to a different object, or that there is an edge between Row1 and Row2. So we do not include Rows 2-4 in the template. */
    else if (SAD23 ≫ SAD12 || SAD12 ≫ SAD34)
        The top template includes both Row1 and Row2;
        /* At this stage, Rows 1-2 have been included in the TR. When SAD23 is much larger than SAD12, it is probable that there is an edge between Row2 and Row3. When SAD12 is much bigger than SAD34, it is probable that Row1 and Row2 belong to a texture or edge region while Row3 and Row4 belong to a smooth region. In both situations, we do not include Rows 3-4 in the template. */
    else if (SAD34 ≫ SAD23)
        The top template includes Row1, Row2, Row3;
        /* At this stage, Rows 1-3 have been included in the TR. When SAD34 is much larger than SAD23, it is probable that there is an edge between Row3 and Row4. Thus we do not include Row4 in the template. */
    else
        The top template includes Row1, Row2, Row3, Row4;

We use the same strategy to determine the width of the left template.

After the shape and width of the template are determined for the current block, this adaptive template is used to measure the similarity between the template of the current block and that of the candidate block corresponding to each MVP in the CS. Since in most cases the template belongs to the same object as the current block, a candidate predictor with a small template SAD, that is, a high degree of template matching, is assumed to give a small SAD between the corresponding prediction block and the current block as well. In particular, because the adaptive template is more highly correlated with the current block than the traditional fixed template, the matching accuracy is further increased. Finally, we exclude the predictors with larger template SADs and keep the two with the smallest SADs in the RCS.

Unlike [6], which replaces ME entirely by TM to avoid the cost of the MVD and the reference index, the proposed method uses ATM only to reduce the size of the CS. Replacing ME by TM introduces some problems. First, it imposes a heavy computational load on the decoder. To reduce the computational complexity, the search area of TM in [6] was restricted to be much smaller than that of general ME, which reduced the prediction accuracy. Second, when the template and the current block belong to different objects with different motions, using TM instead of ME increases the prediction error and thus the residue entropy. Because our proposed scheme uses ATM only to remove ineffective predictors, the ME accuracy is preserved.

D. Final MVP and Index Coding

Although ATM is effective in selecting good predictors, it cannot be accurate all of the time. Therefore, instead of using ATM to choose the final MVP, we use it only to reduce the size of the CS.
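As a rough illustration of the adaptive template rules of Section II-C, the sketch below covers shape selection, width selection and the reduction of the CS to two predictors. Three points are assumptions of this sketch and not taken from the paper: the shape rules are restated in terms of whether a block touches the left and top boundaries of its macroblock (which reproduces every case of the shape pseudocode), Row1 is taken to be the row adjacent to the current block, and "much bigger" is modeled by a hypothetical factor `ratio` because the actual threshold is not given here.

```python
from typing import List, Sequence, Tuple

MV = Tuple[int, int]


def template_shape(touches_left: bool, touches_top: bool) -> str:
    """Template shape for a block, from its position inside the macroblock."""
    if touches_left and not touches_top:
        return "left"
    if touches_top and not touches_left:
        return "top"
    return "L"  # corner block, whole MB, or interior block with no better choice


def row_sad(row_a: Sequence[int], row_b: Sequence[int]) -> int:
    """Sum of absolute differences between two template rows."""
    return sum(abs(a - b) for a, b in zip(row_a, row_b))


def top_template_width(rows: Sequence[Sequence[int]], ratio: float = 2.0) -> int:
    """Number of rows (1 to 4) kept in the top template.

    rows[0] is Row1 (assumed nearest to the block), rows[3] is Row4.
    ratio is a placeholder for the paper's 'much bigger' comparison.
    """
    sad12 = row_sad(rows[0], rows[1])
    sad23 = row_sad(rows[1], rows[2])
    sad34 = row_sad(rows[2], rows[3])
    if sad12 > ratio * sad23:
        return 1
    if sad23 > ratio * sad12 or sad12 > ratio * sad34:
        return 2
    if sad34 > ratio * sad23:
        return 3
    return 4


def reduce_candidate_set(candidates: List[MV], template_sads: List[int],
                         keep: int = 2) -> List[MV]:
    """Keep the predictors with the smallest template SADs (the RCS)."""
    order = sorted(range(len(candidates)), key=lambda i: template_sads[i])
    return [candidates[i] for i in order[:keep]]


if __name__ == "__main__":
    rows = [[10] * 4, [10] * 4, [90] * 4, [90] * 4]  # edge between Row2 and Row3
    print(top_template_width(rows))                   # 2
    print(reduce_candidate_set([(8, 0), (0, 0), (7, 0), (-4, 0)], [30, 90, 12, 75]))
    # [(7, 0), (8, 0)]
```

The same width function could be applied to the left template by passing its columns instead of rows. Duplicate predictors are deliberately not removed here, since the text treats an RCS with identical entries as the case in which no index is needed.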
The final MVP $pmv_{opt}$ for the current block is determined as follows:

$$ pmv_{opt} = \arg\min_{pmv_i \in RCS} D(pmv_i - mv) \tag{5} $$

where $mv$ is the current MV and $D(\cdot)$ is the function used to measure the vector distance. The final MVD of the current block is therefore $mv - pmv_{opt}$. When all the MVPs in the RCS are identical, no index is needed; otherwise 1 bit is needed to signal the selection of the final MVP. In some situations, however, the decoder can derive the final MVP by itself through a guessing strategy [4], and then the index need not be transmitted. The guessing strategy relies on the fact that the final MVP is selected by the minimum MVD criterion. We adopt this strategy in our scheme; the encoder and decoder apply the following steps to decide whether to encode/decode the index:

Step 1: Obtain two possible MVs from the calculated/received MVD and the predictors in the RCS: $mv_1 = MVD + mv_{ATM1}$ and $mv_2 = MVD + mv_{ATM2}$;
Step 2: Assuming $mv_1$ is the true motion vector, select the best predictor for $mv_1$ according to Eqn. 5 and calculate the new MVD. If the new MVD equals the original one, set $flag_{ATM1} = 1$, otherwise $flag_{ATM1} = 0$;
Step 3: Assuming $mv_2$ is the true motion vector, repeat Step 2 to determine the value of $flag_{ATM2}$;
Step 4: If ($flag_{ATM1} = 1$ && $flag_{ATM2} = 1$) || ($flag_{ATM1} = 0$ && $flag_{ATM2} = 0$), the index of the final MVP is transmitted/decoded. Otherwise no index is transmitted/decoded, and the final predictor is derived from $flag_{ATM1}$ and $flag_{ATM2}$.

For example, when $mv_{ATM1}$, $mv_{ATM2}$ and the current MV are (1,0), (2,0) and (3,1) respectively, Eqn. 5 selects $mv_{ATM2}$ as the final MVP and the MVD is (1,1). At the decoder, following the steps above gives $flag_{ATM1} = 0$ and $flag_{ATM2} = 1$. The decoder thus identifies $mv_{ATM2}$ as the optimal MVP and no index is needed. However, when $mv_{ATM1}$, $mv_{ATM2}$ and the current MV are (1,0), (2,0) and (2,0) respectively, the same steps show that both predictors are possible MVPs of the current block, so the index must be transmitted by the encoder and decoded by the decoder.

III. EXPERIMENTAL RESULTS AND DISCUSSION

The proposed algorithm has been incorporated into version 2.2 of the H.264/AVC key technical area (KTA) reference software, which is based on H.264/AVC JM11.0. The coding efficiency of the proposed method is compared with the H.264 standard with its traditional MV coding method (the anchor). The results of the method in [3] are also listed for comparison. In addition, we implement a further method, called OnlyATM, for comparison. OnlyATM uses ATM to select the final MVP directly, without sending an index. It is implemented to reveal the accuracy of ATM and to show that the proposed scheme is a better trade-off. To ensure fairness, all three algorithms use the same predictor candidate set. The main simulation conditions are summarized in Table I. The Main profile is selected as the platform for simulation, which allows most of the latest H.264/AVC normative tools to be used.

TABLE I
SIMULATION CONDITIONS

Profile                    Main
Prediction structure       IPPP
Entropy coding             CAVLC
Quantization parameters    22, 27, 32, 37
Reference frame            4
Search range               32
RDO                        On
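Returning briefly to Section II-D, the final MVP selection of Eqn. 5 and the guessing strategy of Steps 1 to 4 can be sketched as below. This is a minimal sketch under stated assumptions: the distance D is taken to be the sum of absolute components, ties in Eqn. 5 go to the predictor with the lower index, and the RCS holds exactly two predictors. The function names are hypothetical.

```python
from typing import List, Optional, Tuple

MV = Tuple[int, int]


def add(a: MV, b: MV) -> MV:
    return (a[0] + b[0], a[1] + b[1])


def sub(a: MV, b: MV) -> MV:
    return (a[0] - b[0], a[1] - b[1])


def select_mvp(mv: MV, rcs: List[MV]) -> int:
    """Index of the predictor minimizing D(pmv_i - mv) (Eqn. 5), D = L1 norm (assumption)."""
    return min(range(len(rcs)),
               key=lambda i: abs(rcs[i][0] - mv[0]) + abs(rcs[i][1] - mv[1]))


def guess_index(mvd: MV, rcs: List[MV]) -> Optional[int]:
    """Steps 1-4: return the predictor index if it can be derived, else None."""
    flags = []
    for pmv in rcs:
        mv_k = add(mvd, pmv)                 # Step 1: hypothetical true MV
        best = rcs[select_mvp(mv_k, rcs)]    # Steps 2-3: best predictor for it
        flags.append(sub(mv_k, best) == mvd)
    if flags[0] != flags[1]:                 # Step 4: exactly one flag is set
        return flags.index(True)
    return None


def encode_mv(mv: MV, rcs: List[MV]) -> Tuple[MV, Optional[int]]:
    """Return (MVD, index bit); the index bit is None when it can be omitted."""
    k = select_mvp(mv, rcs)
    mvd = sub(mv, rcs[k])
    if rcs[0] == rcs[1] or guess_index(mvd, rcs) is not None:
        return mvd, None
    return mvd, k


def decode_mv(mvd: MV, index_bit: Optional[int], rcs: List[MV]) -> MV:
    """Recover the MV from the MVD and the index bit (if one was sent)."""
    if rcs[0] == rcs[1]:
        k = 0
    else:
        guessed = guess_index(mvd, rcs)
        k = guessed if guessed is not None else index_bit
    return add(mvd, rcs[k])


if __name__ == "__main__":
    rcs = [(1, 0), (2, 0)]
    print(encode_mv((3, 1), rcs))  # ((1, 1), None): index derived by the decoder
    print(encode_mv((2, 0), rcs))  # ((0, 0), 1): index must be sent
```

The two printed cases correspond to the two worked examples in Section II-D. Because the encoder and decoder run the same `guess_index` with the same tie-breaking rule, they always agree on whether an index bit is present.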
The test set consists of six CIF sequences of 150 frames each (Stefan has only 90 frames) and four 720P sequences of 100 frames each, with a variety of representative contents and motions. Several typical quantization parameters (QP) are tested so that the quality lies between 27 and 45 dB, which corresponds to the visual quality of most industrial applications.

Table II reports the performance of each algorithm in terms of the Bjontegaard Delta bit rate (BDBR) and the Bjontegaard Delta Peak Signal to Noise Ratio (BDPSNR) [8] relative to the anchor. All of the MV coding algorithms reduce the bit rate to some degree compared with the anchor, and the proposed method gives the largest average reduction: 4.1% for the CIF sequences and 8.5% for the 720P sequences. The method in [3] needs two extra bits on average for each MV, which reduces the gain obtained from precise MV coding; for the Night sequence its results are even worse than the anchor. In OnlyATM, when ATM selects a wrong predictor, the increase in MVD entropy may cost more bits than the index would have, so the overall bit rate of OnlyATM is higher than that of the proposed method.

The proposed method performs especially well for video sequences with fast motion and little texture, such as Foreman, Bus and Spincalendar. In fast-motion sequences, the proposed method provides a more precise MVP than the conventional median predictor; in sequences with little texture, the motion information consumes a relatively large share of the total bits, which makes the improvement in MV coding more pronounced.

Figure 5 further shows the RD curves of the MV coding schemes for several test sequences. In these figures, the efficiency of each algorithm is illustrated for every rate point.
The curves show that the proposed method mostly outperforms the other two MV coding schemes, as well as the anchor, at each QP. The difference is more obvious at low bit rates, which may be explained by the fact that the motion information takes up a larger portion of the whole bit stream at low bit rates. Other sequences give similar results.

Fig. 5. RD performance comparison for sequences

TABLE II
PERFORMANCE EVALUATION OF THE MV CODING METHODS

                          Method in [3]             OnlyATM                   Proposed
Sequence                  BDBR(%)    BDPSNR(dB)     BDBR(%)    BDPSNR(dB)     BDBR(%)    BDPSNR(dB)
CIF    Bus                -2.5       [missing]      [missing]  [missing]      [missing]  0.29
       Foreman            -0.8       [missing]      [missing]  [missing]      [missing]  0.23
       Mobile             -0.8       [missing]      [missing]  [missing]      [missing]  0.19
       Stefan             -0.3       [missing]      [missing]  [missing]      [missing]  0.16
       Table              -2.0       [missing]      [missing]  [missing]      [missing]  0.20
       Tempete            -0.2       [missing]      [missing]  [missing]      [missing]  0.16
       Average            -1.1       [missing]      [missing]  [missing]      -4.1       0.21
720P   Night              0.6        [missing]      [missing]  [missing]      [missing]  0.10
       Raven              0.0        [missing]      [missing]  [missing]      [missing]  [missing]
       Spincalendar       -5.7       [missing]      [missing]  [missing]      [missing]  0.32
       Jets               -9.0       [missing]      [missing]  [missing]      [missing]  0.57
       Average            -3.5       [missing]      [missing]  [missing]      -8.5       0.35
Total average             -2.1       [missing]      [missing]  [missing]      -5.9       0.27

To measure the template matching accuracy, we also calculate the probability that the proposed algorithm and OnlyATM select the actual optimal predictor (the predictor in the CS that minimizes the MVD). As shown in Table III, with a fixed template (width 4, L-shape) the probability is about 77% for OnlyATM and 87% for the proposed method, while with the adaptive template it rises to about 80% for OnlyATM and 89% for the proposed method. The reason for the higher probability in the proposed method is straightforward: the proposed method first uses ATM to select two predictors and then chooses the final MVP by the minimum MVD criterion, which naturally increases the precision. The data also indicate that the proposed adaptive template shape and width strategies do improve the template matching accuracy. In OnlyATM, the probability of choosing the optimal predictor depends only on the template matching accuracy, so the gain from the adaptive template is more visible in OnlyATM than in the proposed method. Although the increases in both OnlyATM and the proposed method are modest, they can yield up to 2% bit rate reduction in some video sequences.

TABLE III
TEMPLATE MATCHING ACCURACY COMPARISON

                          OnlyATM                Proposed
Sequence                  Fixed      Adaptive    Fixed      Adaptive
CIF    Bus                73.2%      75.0%       88.0%      89.4%
       Foreman            76.3%      79.2%       84.9%      86.1%
       Mobile             74.2%      75.9%       89.7%      90.6%
       Stefan             78.1%      79.8%       91.6%      92.4%
       Table              77.7%      80.2%       85.4%      86.5%
       Tempete            75.0%      76.8%       86.5%      87.4%
       Average            75.7%      77.8%       87.7%      88.7%
720P   Night              73.6%      77.1%       82.5%      84.6%
       Raven              83.8%      86.2%       90.0%      90.6%
       Spincalendar       77.2%      80.3%       88.7%      91.2%
       Jets               81.8%      84.7%       84.9%      89.0%
       Average            79.1%      82.1%       86.5%      88.8%
Total average             77.1%      79.5%       87.2%      88.8%

IV. CONCLUSION

In this paper, a novel motion vector prediction method is proposed to minimize the bits used for MV coding. First, a predictor candidate set is defined to exploit the spatial and temporal correlation in the motion fields. In particular, one spatial predictor is adaptively changed depending on the current distribution of the neighboring motion vectors. Then a template with adaptive shape and width is determined for the current block and used to select the predictors in the CS with the better-matching templates, so as to reduce the bits consumed by the predictor index. Furthermore, a guessing strategy is applied to eliminate the index bits entirely when the decoder can derive the predictor by itself. Simulation results indicate that, by using ATM, the best predictor can be selected with a probability of up to 90%. The results also demonstrate that the proposed scheme provides a significant bit rate reduction compared with the standard as well as with other methods.

ACKNOWLEDGMENT

This work has been supported by the Hong Kong Applied Science and Technology Research Institute (ASTRI) in the Future Multimedia Standards Project (ART/037).

REFERENCES

[1] Advanced Video Coding (AVC), 3rd Edition. ITU-T Recommendation H.264 and ISO/IEC (MPEG-4 Part 10), July
[2] S. D. Kim and J. B. Ra, An efficient motion vector coding scheme based on minimum bitrate prediction, IEEE Transactions on Image Processing, vol. 8, no. 8, pp , Aug
[3] G. Laroche, J. Jung, and B. Pesquet-Popescu, RD optimized coding for motion vector predictor selection, IEEE Transactions on Circuits and Systems for Video Technology, vol. 18, no. 9, pp , Sep
[4] J. J. Dai, O. C. Au, C. Pang, W. Yang, and F. Zou, Motion vector coding based on optimal predictor selection, in IEEE Pacific-Rim Conference on Multimedia, Bangkok, Dec
[5] B. Jung and B. Jeon, Pooled zero vector coding for enhanced compression of motion vectors, in Proc. IEEE Asia Pacific Conference on Circuits and Systems, 2008, pp
[6] M. E. S. Kamp and M. Wien, Decoder side motion vector derivation for inter frame video coding, in IEEE International Conference on Image Processing, 2008, pp
[7] T. Tan et al., Intra prediction by template matching, in Proc. ICIP 2006, Atlanta, GA, USA, Oct
[8] G. Bjontegaard, Calculation of average PSNR differences between RD-curves, in VCEG Contribution VCEG-M33, Austin, Apr

Decoding-Assisted Inter Prediction for HEVC

Decoding-Assisted Inter Prediction for HEVC Decoding-Assisted Inter Prediction for HEVC Yi-Sheng Chang and Yinyi Lin Department of Communication Engineering National Central University, Taiwan 32054, R.O.C. Email: yilin@ce.ncu.edu.tw Abstract In

More information

Complexity Reduced Mode Selection of H.264/AVC Intra Coding

Complexity Reduced Mode Selection of H.264/AVC Intra Coding Complexity Reduced Mode Selection of H.264/AVC Intra Coding Mohammed Golam Sarwer 1,2, Lai-Man Po 1, Jonathan Wu 2 1 Department of Electronic Engineering City University of Hong Kong Kowloon, Hong Kong

More information

Fast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE Gaurav Hansda

Fast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE Gaurav Hansda Fast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE 5359 Gaurav Hansda 1000721849 gaurav.hansda@mavs.uta.edu Outline Introduction to H.264 Current algorithms for

More information

An Optimized Template Matching Approach to Intra Coding in Video/Image Compression

An Optimized Template Matching Approach to Intra Coding in Video/Image Compression An Optimized Template Matching Approach to Intra Coding in Video/Image Compression Hui Su, Jingning Han, and Yaowu Xu Chrome Media, Google Inc., 1950 Charleston Road, Mountain View, CA 94043 ABSTRACT The

More information

A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames

A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames Ki-Kit Lai, Yui-Lam Chan, and Wan-Chi Siu Centre for Signal Processing Department of Electronic and Information Engineering

More information

Efficient MPEG-2 to H.264/AVC Intra Transcoding in Transform-domain

Efficient MPEG-2 to H.264/AVC Intra Transcoding in Transform-domain MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Efficient MPEG- to H.64/AVC Transcoding in Transform-domain Yeping Su, Jun Xin, Anthony Vetro, Huifang Sun TR005-039 May 005 Abstract In this

More information

Video Coding Using Spatially Varying Transform

Video Coding Using Spatially Varying Transform Video Coding Using Spatially Varying Transform Cixun Zhang 1, Kemal Ugur 2, Jani Lainema 2, and Moncef Gabbouj 1 1 Tampere University of Technology, Tampere, Finland {cixun.zhang,moncef.gabbouj}@tut.fi

A NOVEL SCANNING SCHEME FOR DIRECTIONAL SPATIAL PREDICTION OF AVS INTRA CODING

A NOVEL SCANNING SCHEME FOR DIRECTIONAL SPATIAL PREDICTION OF AVS INTRA CODING A NOVEL SCANNING SCHEME FOR DIRECTIONAL SPATIAL PREDICTION OF AVS INTRA CODING Md. Salah Uddin Yusuf 1, Mohiuddin Ahmad 2 Assistant Professor, Dept. of EEE, Khulna University of Engineering & Technology

Fast Mode Decision for H.264/AVC Using Mode Prediction

Fast Mode Decision for H.264/AVC Using Mode Prediction Fast Mode Decision for H.264/AVC Using Mode Prediction Song-Hak Ri and Joern Ostermann Institut fuer Informationsverarbeitung, Appelstr 9A, D-30167 Hannover, Germany ri@tnt.uni-hannover.de ostermann@tnt.uni-hannover.de

A reversible data hiding based on adaptive prediction technique and histogram shifting

A reversible data hiding based on adaptive prediction technique and histogram shifting A reversible data hiding based on adaptive prediction technique and histogram shifting Rui Liu, Rongrong Ni, Yao Zhao Institute of Information Science Beijing Jiaotong University E-mail: rrni@bjtu.edu.cn

Deblocking Filter Algorithm with Low Complexity for H.264 Video Coding

Deblocking Filter Algorithm with Low Complexity for H.264 Video Coding Deblocking Filter Algorithm with Low Complexity for H.264 Video Coding Jung-Ah Choi and Yo-Sung Ho Gwangju Institute of Science and Technology (GIST) 261 Cheomdan-gwagiro, Buk-gu, Gwangju, 500-712, Korea

International Journal of Emerging Technology and Advanced Engineering Website: (ISSN , Volume 2, Issue 4, April 2012)

International Journal of Emerging Technology and Advanced Engineering Website:   (ISSN , Volume 2, Issue 4, April 2012) A Technical Analysis Towards Digital Video Compression Rutika Joshi 1, Rajesh Rai 2, Rajesh Nema 3 1 Student, Electronics and Communication Department, NIIST College, Bhopal, 2,3 Prof., Electronics and

Advanced Video Coding: The new H.264 video compression standard

Advanced Video Coding: The new H.264 video compression standard Advanced Video Coding: The new H.264 video compression standard August 2003 1. Introduction Video compression ( video coding ), the process of compressing moving images to save storage space and transmission

IBM Research Report. Inter Mode Selection for H.264/AVC Using Time-Efficient Learning-Theoretic Algorithms

IBM Research Report. Inter Mode Selection for H.264/AVC Using Time-Efficient Learning-Theoretic Algorithms RC24748 (W0902-063) February 12, 2009 Electrical Engineering IBM Research Report Inter Mode Selection for H.264/AVC Using Time-Efficient Learning-Theoretic Algorithms Yuri Vatis Institut für Informationsverarbeitung

Reduced 4x4 Block Intra Prediction Modes using Directional Similarity in H.264/AVC

Reduced 4x4 Block Intra Prediction Modes using Directional Similarity in H.264/AVC Proceedings of the 7th WSEAS International Conference on Multimedia, Internet & Video Technologies, Beijing, China, September 15-17, 2007 198 Reduced 4x4 Block Intra Prediction Modes using Directional

High Efficient Intra Coding Algorithm for H.265/HVC

High Efficient Intra Coding Algorithm for H.265/HVC H.265/HVC における高性能符号化アルゴリズムに関する研究 宋天 1,2* 三木拓也 2 島本隆 1,2 High Efficient Intra Coding Algorithm for H.265/HVC by Tian Song 1,2*, Takuya Miki 2 and Takashi Shimamoto 1,2 Abstract This work proposes a novel

FAST SPATIAL LAYER MODE DECISION BASED ON TEMPORAL LEVELS IN H.264/AVC SCALABLE EXTENSION

FAST SPATIAL LAYER MODE DECISION BASED ON TEMPORAL LEVELS IN H.264/AVC SCALABLE EXTENSION FAST SPATIAL LAYER MODE DECISION BASED ON TEMPORAL LEVELS IN H.264/AVC SCALABLE EXTENSION Yen-Chieh Wang( 王彥傑 ), Zong-Yi Chen( 陳宗毅 ), Pao-Chi Chang( 張寶基 ) Dept. of Communication Engineering, National Central

Video compression with 1-D directional transforms in H.264/AVC

Video compression with 1-D directional transforms in H.264/AVC Video compression with 1-D directional transforms in H.264/AVC The MIT Faculty has made this article openly available. Please share how this access benefits you. Your story matters. Citation Kamisli, Fatih,

ERROR-ROBUST INTER/INTRA MACROBLOCK MODE SELECTION USING ISOLATED REGIONS

ERROR-ROBUST INTER/INTRA MACROBLOCK MODE SELECTION USING ISOLATED REGIONS ERROR-ROBUST INTER/INTRA MACROBLOCK MODE SELECTION USING ISOLATED REGIONS Ye-Kui Wang 1, Miska M. Hannuksela 2 and Moncef Gabbouj 3 1 Tampere International Center for Signal Processing (TICSP), Tampere,

A Fast Intra/Inter Mode Decision Algorithm of H.264/AVC for Real-time Applications

A Fast Intra/Inter Mode Decision Algorithm of H.264/AVC for Real-time Applications A Fast Intra/Inter Mode Decision Algorithm of H.264/AVC for Real-time Applications Bin Zhan, Baochun Hou, and Reza Sotudeh School of Electronic, Communication and Electrical Engineering University of Hertfordshire

BANDWIDTH REDUCTION SCHEMES FOR MPEG-2 TO H.264 TRANSCODER DESIGN

BANDWIDTH REDUCTION SCHEMES FOR MPEG-2 TO H.264 TRANSCODER DESIGN BANDWIDTH REDUCTION SCHEMES FOR MPEG-2 TO H.264 TRANSCODER DESIGN Xianghui Wei, Wenqi You, Guifen Tian, Yan Zhuang, Takeshi Ikenaga, Satoshi Goto Graduate School of Information, Production and Systems, Waseda

Fast Motion Estimation for Shape Coding in MPEG-4

Fast Motion Estimation for Shape Coding in MPEG-4 358 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 13, NO. 4, APRIL 2003 Fast Motion Estimation for Shape Coding in MPEG-4 Donghoon Yu, Sung Kyu Jang, and Jong Beom Ra Abstract Effective

Enhanced Hexagon with Early Termination Algorithm for Motion estimation

Enhanced Hexagon with Early Termination Algorithm for Motion estimation Volume No - 5, Issue No - 1, January, 2017 Enhanced Hexagon with Early Termination Algorithm for Motion estimation Neethu Susan Idiculay Assistant Professor, Department of Applied Electronics & Instrumentation,

STANDARD COMPLIANT FLICKER REDUCTION METHOD WITH PSNR LOSS CONTROL

STANDARD COMPLIANT FLICKER REDUCTION METHOD WITH PSNR LOSS CONTROL STANDARD COMPLIANT FLICKER REDUCTION METHOD WITH PSNR LOSS CONTROL A. Jiménez-Moreno, E. Martínez-Enríquez, F. Díaz-de-María Department of Signal Theory and Communications Universidad Carlos III, Leganés

Digital Video Processing

Digital Video Processing Video signal is basically any sequence of time varying images. In a digital video, the picture information is digitized both spatially and temporally and the resultant pixel intensities are quantized.

A Novel Deblocking Filter Algorithm In H.264 for Real Time Implementation

A Novel Deblocking Filter Algorithm In H.264 for Real Time Implementation 2009 Third International Conference on Multimedia and Ubiquitous Engineering A Novel Deblocking Filter Algorithm In H.264 for Real Time Implementation Yuan Li, Ning Han, Chen Chen Department of Automation,

Digital Image Stabilization and Its Integration with Video Encoder

Digital Image Stabilization and Its Integration with Video Encoder Digital Image Stabilization and Its Integration with Video Encoder Yu-Chun Peng, Hung-An Chang, Homer H. Chen Graduate Institute of Communication Engineering National Taiwan University Taipei, Taiwan {b889189,

LIST OF TABLES. Table 5.1 Specification of mapping of idx to cij for zig-zag scan 46. Table 5.2 Macroblock types 46

LIST OF TABLES. Table 5.1 Specification of mapping of idx to cij for zig-zag scan 46. Table 5.2 Macroblock types 46 LIST OF TABLES TABLE Table 5.1 Specification of mapping of idx to cij for zig-zag scan 46 Table 5.2 Macroblock types 46 Table 5.3 Inverse Scaling Matrix values 48 Table 5.4 Specification of QPC as function

An Efficient Inter-Frame Coding with Intra Skip Decision in H.264/AVC

An Efficient Inter-Frame Coding with Intra Skip Decision in H.264/AVC 856 IEEE Transactions on Consumer Electronics, Vol. 56, No. 2, May 2 An Efficient Inter-Frame Coding with Intra Skip Decision in H.264/AVC Myounghoon Kim, Soonhong Jung, Chang-Su Kim, and Sanghoon Sull

Reduced Frame Quantization in Video Coding

Reduced Frame Quantization in Video Coding Reduced Frame Quantization in Video Coding Tuukka Toivonen and Janne Heikkilä Machine Vision Group Infotech Oulu and Department of Electrical and Information Engineering P. O. Box 500, FIN-900 University

A High Quality/Low Computational Cost Technique for Block Matching Motion Estimation

A High Quality/Low Computational Cost Technique for Block Matching Motion Estimation A High Quality/Low Computational Cost Technique for Block Matching Motion Estimation S. López, G.M. Callicó, J.F. López and R. Sarmiento Research Institute for Applied Microelectronics (IUMA) Department

CONTENT ADAPTIVE COMPLEXITY REDUCTION SCHEME FOR QUALITY/FIDELITY SCALABLE HEVC

CONTENT ADAPTIVE COMPLEXITY REDUCTION SCHEME FOR QUALITY/FIDELITY SCALABLE HEVC CONTENT ADAPTIVE COMPLEXITY REDUCTION SCHEME FOR QUALITY/FIDELITY SCALABLE HEVC Hamid Reza Tohidypour, Mahsa T. Pourazad 1,2, and Panos Nasiopoulos 1 1 Department of Electrical & Computer Engineering,

Rate Distortion Optimization in Video Compression

Rate Distortion Optimization in Video Compression Rate Distortion Optimization in Video Compression Xue Tu Dept. of Electrical and Computer Engineering State University of New York at Stony Brook 1. Introduction From Shannon s classic rate distortion

Implementation and analysis of Directional DCT in H.264

Implementation and analysis of Directional DCT in H.264 Implementation and analysis of Directional DCT in H.264 EE 5359 Multimedia Processing Guidance: Dr K R Rao Priyadarshini Anjanappa UTA ID: 1000730236 priyadarshini.anjanappa@mavs.uta.edu Introduction A

FAST MOTION ESTIMATION DISCARDING LOW-IMPACT FRACTIONAL BLOCKS. Saverio G. Blasi, Ivan Zupancic and Ebroul Izquierdo

FAST MOTION ESTIMATION DISCARDING LOW-IMPACT FRACTIONAL BLOCKS. Saverio G. Blasi, Ivan Zupancic and Ebroul Izquierdo FAST MOTION ESTIMATION DISCARDING LOW-IMPACT FRACTIONAL BLOCKS Saverio G. Blasi, Ivan Zupancic and Ebroul Izquierdo School of Electronic Engineering and Computer Science, Queen Mary University of London

Semi-Hierarchical Based Motion Estimation Algorithm for the Dirac Video Encoder

Semi-Hierarchical Based Motion Estimation Algorithm for the Dirac Video Encoder Semi-Hierarchical Based Motion Estimation Algorithm for the Dirac Video Encoder M. TUN, K. K. LOO, J. COSMAS School of Engineering and Design Brunel University Kingston Lane, Uxbridge, UB8 3PH UNITED KINGDOM

An Efficient Mode Selection Algorithm for H.264

An Efficient Mode Selection Algorithm for H.264 An Efficient Mode Selection Algorithm for H.264 Lu Lu 1, Wenhan Wu, and Zhou Wei 3 1 South China University of Technology, Institute of Computer Science, Guangzhou 510640, China lul@scut.edu.cn South China

Fast frame memory access method for H.264/AVC

Fast frame memory access method for H.264/AVC Fast frame memory access method for H.264/AVC Tian Song 1a), Tomoyuki Kishida 2, and Takashi Shimamoto 1 1 Computer Systems Engineering, Department of Institute of Technology and Science, Graduate School

FRAME-RATE UP-CONVERSION USING TRANSMITTED TRUE MOTION VECTORS

FRAME-RATE UP-CONVERSION USING TRANSMITTED TRUE MOTION VECTORS FRAME-RATE UP-CONVERSION USING TRANSMITTED TRUE MOTION VECTORS Yen-Kuang Chen 1, Anthony Vetro 2, Huifang Sun 3, and S. Y. Kung 4 Intel Corp. 1, Mitsubishi Electric ITA 2 3, and Princeton University 1

IN RECENT years, multimedia application has become more

IN RECENT years, multimedia application has become more 578 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 17, NO. 5, MAY 2007 A Fast Algorithm and Its VLSI Architecture for Fractional Motion Estimation for H.264/MPEG-4 AVC Video Coding

Multimedia Systems Video II (Video Coding) Mahdi Amiri April 2012 Sharif University of Technology

Multimedia Systems Video II (Video Coding) Mahdi Amiri April 2012 Sharif University of Technology Course Presentation Multimedia Systems Video II (Video Coding) Mahdi Amiri April 2012 Sharif University of Technology Video Coding Correlation in Video Sequence Spatial correlation Similar pixels seem

OVERVIEW OF IEEE 1857 VIDEO CODING STANDARD

OVERVIEW OF IEEE 1857 VIDEO CODING STANDARD OVERVIEW OF IEEE 1857 VIDEO CODING STANDARD Siwei Ma, Shiqi Wang, Wen Gao {swma,sqwang, wgao}@pku.edu.cn Institute of Digital Media, Peking University ABSTRACT IEEE 1857 is a multi-part standard for multimedia

Context based optimal shape coding

Context based optimal shape coding IEEE Signal Processing Society 1999 Workshop on Multimedia Signal Processing September 13-15, 1999, Copenhagen, Denmark Electronic Proceedings 1999 IEEE Context based optimal shape coding Gerry Melnikov,

Fast Wavelet-based Macro-block Selection Algorithm for H.264 Video Codec

Fast Wavelet-based Macro-block Selection Algorithm for H.264 Video Codec Proceedings of the International MultiConference of Engineers and Computer Scientists 8 Vol I IMECS 8, 19-1 March, 8, Hong Kong Fast Wavelet-based Macro-block Selection Algorithm for H.264 Video Codec Shi-Huang

A deblocking filter with two separate modes in block-based video coding

A deblocking filter with two separate modes in block-based video coding A deblocking filter with two separate modes in block-based video coding Sung Deuk Kim Jaeyoun Yi and Jong Beom Ra Dept. of Electrical Engineering Korea Advanced Institute of Science and Technology 7- Kusongdong

An Efficient Intra Prediction Algorithm for H.264/AVC High Profile

An Efficient Intra Prediction Algorithm for H.264/AVC High Profile An Efficient Intra Prediction Algorithm for H.264/AVC High Profile Bo Shen 1 Kuo-Hsiang Cheng 2 Yun Liu 1 Ying-Hong Wang 2* 1 School of Electronic and Information Engineering, Beijing Jiaotong University

NEW CAVLC ENCODING ALGORITHM FOR LOSSLESS INTRA CODING IN H.264/AVC. Jin Heo, Seung-Hwan Kim, and Yo-Sung Ho

NEW CAVLC ENCODING ALGORITHM FOR LOSSLESS INTRA CODING IN H.264/AVC. Jin Heo, Seung-Hwan Kim, and Yo-Sung Ho NEW CAVLC ENCODING ALGORITHM FOR LOSSLESS INTRA CODING IN H.264/AVC Jin Heo, Seung-Hwan Kim, and Yo-Sung Ho Gwangju Institute of Science and Technology (GIST) 261 Cheomdan-gwagiro, Buk-gu, Gwangju, 500-712,

Unit-level Optimization for SVC Extractor

Unit-level Optimization for SVC Extractor Unit-level Optimization for SVC Extractor Chang-Ming Lee, Chia-Ying Lee, Bo-Yao Huang, and Kang-Chih Chang Department of Communications Engineering National Chung Cheng University Chiayi, Taiwan changminglee@ee.ccu.edu.tw,

EE 5359 MULTIMEDIA PROCESSING SPRING Final Report IMPLEMENTATION AND ANALYSIS OF DIRECTIONAL DISCRETE COSINE TRANSFORM IN H.

EE 5359 MULTIMEDIA PROCESSING SPRING Final Report IMPLEMENTATION AND ANALYSIS OF DIRECTIONAL DISCRETE COSINE TRANSFORM IN H. EE 5359 MULTIMEDIA PROCESSING SPRING 2011 Final Report IMPLEMENTATION AND ANALYSIS OF DIRECTIONAL DISCRETE COSINE TRANSFORM IN H.264 Under guidance of DR K R RAO DEPARTMENT OF ELECTRICAL ENGINEERING UNIVERSITY

BLOCK MATCHING-BASED MOTION COMPENSATION WITH ARBITRARY ACCURACY USING ADAPTIVE INTERPOLATION FILTERS

BLOCK MATCHING-BASED MOTION COMPENSATION WITH ARBITRARY ACCURACY USING ADAPTIVE INTERPOLATION FILTERS 4th European Signal Processing Conference (EUSIPCO ), Florence, Italy, September 4-8,, copyright by EURASIP BLOCK MATCHING-BASED MOTION COMPENSATION WITH ARBITRARY ACCURACY USING ADAPTIVE INTERPOLATION

Efficient Method for Half-Pixel Block Motion Estimation Using Block Differentials

Efficient Method for Half-Pixel Block Motion Estimation Using Block Differentials Efficient Method for Half-Pixel Block Motion Estimation Using Block Differentials Tuukka Toivonen and Janne Heikkilä Machine Vision Group Infotech Oulu and Department of Electrical and Information Engineering

An Adaptive Cross Search Algorithm for Block Matching Motion Estimation

An Adaptive Cross Search Algorithm for Block Matching Motion Estimation An Adaptive Cross Search Algorithm for Block Matching Motion Estimation Jiancong Luo', Ishfaq Ahmad' and Xzhang Luo' 1 Department of Computer Science and Engineering, University of Texas at Arlington,

ARCHITECTURES OF INCORPORATING MPEG-4 AVC INTO THREE-DIMENSIONAL WAVELET VIDEO CODING

ARCHITECTURES OF INCORPORATING MPEG-4 AVC INTO THREE-DIMENSIONAL WAVELET VIDEO CODING ARCHITECTURES OF INCORPORATING MPEG-4 AVC INTO THREE-DIMENSIONAL WAVELET VIDEO CODING ABSTRACT Xiangyang Ji *1, Jizheng Xu 2, Debin Zhao 1, Feng Wu 2 1 Institute of Computing Technology, Chinese Academy

Pattern based Residual Coding for H.264 Encoder *

Pattern based Residual Coding for H.264 Encoder * Pattern based Residual Coding for H.264 Encoder * Manoranjan Paul and Manzur Murshed Gippsland School of Information Technology, Monash University, Churchill, Vic-3842, Australia E-mail: {Manoranjan.paul,

VIDEO streaming applications over the Internet are gaining. Brief Papers

VIDEO streaming applications over the Internet are gaining. Brief Papers 412 IEEE TRANSACTIONS ON BROADCASTING, VOL. 54, NO. 3, SEPTEMBER 2008 Brief Papers Redundancy Reduction Technique for Dual-Bitstream MPEG Video Streaming With VCR Functionalities Tak-Piu Ip, Yui-Lam Chan,

Recent, Current and Future Developments in Video Coding

Recent, Current and Future Developments in Video Coding Recent, Current and Future Developments in Video Coding Jens-Rainer Ohm Inst. of Commun. Engineering Outline Recent and current activities in MPEG Video and JVT Scalable Video Coding Multiview Video Coding

One-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain

One-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain Author manuscript, published in "International Symposium on Broadband Multimedia Systems and Broadcasting, Bilbao : Spain (2009)" One-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain

ABSTRACT. KEYWORD: Low complexity H.264, Machine learning, Data mining, Inter prediction. 1 INTRODUCTION

ABSTRACT. KEYWORD: Low complexity H.264, Machine learning, Data mining, Inter prediction. 1 INTRODUCTION Low Complexity H.264 Video Encoding Paula Carrillo, Hari Kalva, and Tao Pin. Dept. of Computer Science and Technology,Tsinghua University, Beijing, China Dept. of Computer Science and Engineering, Florida

Spline-Based Motion Vector Encoding Scheme

Spline-Based Motion Vector Encoding Scheme Spline-Based Motion Vector Encoding Scheme by Parnia Farokhian A thesis submitted to the Faculty of Graduate Studies and Research in partial fulfillment of the requirements for the degree of Master of

EFFICIENT PU MODE DECISION AND MOTION ESTIMATION FOR H.264/AVC TO HEVC TRANSCODER

EFFICIENT PU MODE DECISION AND MOTION ESTIMATION FOR H.264/AVC TO HEVC TRANSCODER EFFICIENT PU MODE DECISION AND MOTION ESTIMATION FOR H.264/AVC TO HEVC TRANSCODER Zong-Yi Chen, Jiunn-Tsair Fang 2, Tsai-Ling Liao, and Pao-Chi Chang Department of Communication Engineering, National Central

A LOW-COMPLEXITY AND LOSSLESS REFERENCE FRAME ENCODER ALGORITHM FOR VIDEO CODING

A LOW-COMPLEXITY AND LOSSLESS REFERENCE FRAME ENCODER ALGORITHM FOR VIDEO CODING 2014 IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP) A LOW-COMPLEXITY AND LOSSLESS REFERENCE FRAME ENCODER ALGORITHM FOR VIDEO CODING Dieison Silveira, Guilherme Povala,

Chapter 10. Basic Video Compression Techniques Introduction to Video Compression 10.2 Video Compression with Motion Compensation

Chapter 10. Basic Video Compression Techniques Introduction to Video Compression 10.2 Video Compression with Motion Compensation Chapter 10 Basic Video Compression Techniques 10.1 Introduction to Video Compression 10.2 Video Compression with Motion Compensation 10.3 Search for Motion Vectors 10.4 H.261 10.5 H.263 10.6 Further Exploration

Module 7 VIDEO CODING AND MOTION ESTIMATION

Module 7 VIDEO CODING AND MOTION ESTIMATION Module 7 VIDEO CODING AND MOTION ESTIMATION Lesson 20 Basic Building Blocks & Temporal Redundancy Instructional Objectives At the end of this lesson, the students should be able to: 1. Name at least five

Video Quality Analysis for H.264 Based on Human Visual System

Video Quality Analysis for H.264 Based on Human Visual System IOSR Journal of Engineering (IOSRJEN) ISSN (e): 2250-3021 ISSN (p): 2278-8719 Vol. 04 Issue 08 (August. 2014) V4 PP 01-07 www.iosrjen.org Subrahmanyam.Ch 1 Dr.D.Venkata Rao 2 Dr.N.Usha Rani 3 1 (Research

For layered video encoding, video sequence is encoded into a base layer bitstream and one (or more) enhancement layer bit-stream(s).

For layered video encoding, video sequence is encoded into a base layer bitstream and one (or more) enhancement layer bit-stream(s). 3rd International Conference on Multimedia Technology(ICMT 2013) Video Standard Compliant Layered P2P Streaming Man Yau Chiu 1, Kangheng Wu 1, Zhibin Lei 1 and Dah Ming Chiu 2 Abstract. Peer-to-peer (P2P)

ISSN: An Efficient Fully Exploiting Spatial Correlation of Compress Compound Images in Advanced Video Coding

ISSN: An Efficient Fully Exploiting Spatial Correlation of Compress Compound Images in Advanced Video Coding An Efficient Fully Exploiting Spatial Correlation of Compress Compound Images in Advanced Video Coding Ali Mohsin Kaittan*1 President of the Association of scientific research and development in Iraq Abstract

Express Letters. A Simple and Efficient Search Algorithm for Block-Matching Motion Estimation. Jianhua Lu and Ming L. Liou

Express Letters. A Simple and Efficient Search Algorithm for Block-Matching Motion Estimation. Jianhua Lu and Ming L. Liou IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 7, NO. 2, APRIL 1997 429 Express Letters A Simple and Efficient Search Algorithm for Block-Matching Motion Estimation Jianhua Lu and

SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC

SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC Randa Atta, Rehab F. Abdel-Kader, and Amera Abd-AlRahem Electrical Engineering Department, Faculty of Engineering, Port

ARTICLE IN PRESS. Signal Processing: Image Communication

ARTICLE IN PRESS. Signal Processing: Image Communication Signal Processing: Image Communication 23 (2008) 571 580 Contents lists available at ScienceDirect Signal Processing: Image Communication journal homepage: www.elsevier.com/locate/image Fast sum of absolute

A COST-EFFICIENT RESIDUAL PREDICTION VLSI ARCHITECTURE FOR H.264/AVC SCALABLE EXTENSION

A COST-EFFICIENT RESIDUAL PREDICTION VLSI ARCHITECTURE FOR H.264/AVC SCALABLE EXTENSION A COST-EFFICIENT RESIDUAL PREDICTION VLSI ARCHITECTURE FOR H.264/AVC SCALABLE EXTENSION Yi-Hau Chen, Tzu-Der Chuang, Chuan-Yung Tsai, Yu-Jen Chen, and Liang-Gee Chen DSP/IC Design Lab., Graduate Institute

Fast Implementation of VC-1 with Modified Motion Estimation and Adaptive Block Transform

Fast Implementation of VC-1 with Modified Motion Estimation and Adaptive Block Transform Circuits and Systems, 2010, 1, 12-17 doi:10.4236/cs.2010.11003 Published Online July 2010 (http://www.scirp.org/journal/cs) Fast Implementation of VC-1 with Modified Motion Estimation and Adaptive Block

Affine SKIP and MERGE Modes for Video Coding

Affine SKIP and MERGE Modes for Video Coding Affine SKIP and MERGE Modes for Video Coding Huanbang Chen #1, Fan Liang #2, Sixin Lin 3 # School of Information Science and Technology, Sun Yat-sen University Guangzhou 510275, PRC 1 chhuanb@mail2.sysu.edu.cn

Optimal Estimation for Error Concealment in Scalable Video Coding

Optimal Estimation for Error Concealment in Scalable Video Coding Optimal Estimation for Error Concealment in Scalable Video Coding Rui Zhang, Shankar L. Regunathan and Kenneth Rose Department of Electrical and Computer Engineering University of California Santa Barbara,

Localized Multiple Adaptive Interpolation Filters with Single-Pass Encoding

Localized Multiple Adaptive Interpolation Filters with Single-Pass Encoding Localized Multiple Adaptive Interpolation Filters with Single-Pass Encoding Xun Guo 1, Kai Zhang 1,3, Yu-Wen Huang 2, Jicheng An 1, Chih-Ming Fu 2 and Shawmin Lei 2 1 MediaTek Inc., Beijing, China 2 MediaTek

Professor, CSE Department, Nirma University, Ahmedabad, India

Professor, CSE Department, Nirma University, Ahmedabad, India Bandwidth Optimization for Real Time Video Streaming Sarthak Trivedi 1, Priyanka Sharma 2 1 M.Tech Scholar, CSE Department, Nirma University, Ahmedabad, India 2 Professor, CSE Department, Nirma University,

Next-Generation 3D Formats with Depth Map Support

Next-Generation 3D Formats with Depth Map Support MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Next-Generation 3D Formats with Depth Map Support Chen, Y.; Vetro, A. TR2014-016 April 2014 Abstract This article reviews the most recent extensions

Block-Matching based image compression

Block-Matching based image compression IEEE Ninth International Conference on Computer and Information Technology Block-Matching based image compression Yun-Xia Liu, Yang Yang School of Information Science and Engineering, Shandong University,

Homogeneous Transcoding of HEVC for bit rate reduction

Homogeneous Transcoding of HEVC for bit rate reduction Homogeneous Transcoding of HEVC for bit rate reduction Ninad Gorey Dept. of Electrical Engineering University of Texas at Arlington Arlington 7619, United States ninad.gorey@mavs.uta.edu Dr. K. R. Rao Fellow, IEEE

10.2 Video Compression with Motion Compensation 10.4 H H.263

10.2 Video Compression with Motion Compensation 10.4 H H.263 Chapter 10 Basic Video Compression Techniques 10.1 Introduction to Video Compression 10.2 Video Compression with Motion Compensation 10.3 Search for Motion Vectors 10.4 H.261 10.5 H.263 10.6 Further Exploration

STACK ROBUST FINE GRANULARITY SCALABLE VIDEO CODING

STACK ROBUST FINE GRANULARITY SCALABLE VIDEO CODING Journal of the Chinese Institute of Engineers, Vol. 29, No. 7, pp. 1203-1214 (2006) 1203 STACK ROBUST FINE GRANULARITY SCALABLE VIDEO CODING Hsiang-Chun Huang and Tihao Chiang* ABSTRACT A novel scalable

An Efficient Table Prediction Scheme for CAVLC

An Efficient Table Prediction Scheme for CAVLC An Efficient Table Prediction Scheme for CAVLC 1. Introduction Jin Heo 1 Oryong-Dong, Buk-Gu, Gwangju, 0-712, Korea jinheo@gist.ac.kr Kwan-Jung Oh 1 Oryong-Dong, Buk-Gu, Gwangju, 0-712, Korea kjoh81@gist.ac.kr

H.264 to MPEG-4 Transcoding Using Block Type Information

H.264 to MPEG-4 Transcoding Using Block Type Information 1568963561 1 H.264 to MPEG-4 Transcoding Using Block Type Information Jae-Ho Hur and Yung-Lyul Lee Abstract In this paper, we propose a heterogeneous transcoding method of converting an H.264 video bitstream

Star Diamond-Diamond Search Block Matching Motion Estimation Algorithm for H.264/AVC Video Codec

Star Diamond-Diamond Search Block Matching Motion Estimation Algorithm for H.264/AVC Video Codec Star Diamond-Diamond Search Block Matching Motion Estimation Algorithm for H.264/AVC Video Codec Satish Kumar Sahu 1* and Dolley Shukla 2 Electronics Telecommunication Department, SSTC, SSGI, FET, Junwani,

Performance Comparison between DWT-based and DCT-based Encoders

Performance Comparison between DWT-based and DCT-based Encoders , pp.83-87 http://dx.doi.org/10.14257/astl.2014.75.19 Performance Comparison between DWT-based and DCT-based Encoders Xin Lu 1 and Xuesong Jin 2 * 1 School of Electronics and Information Engineering, Harbin

VIDEO COMPRESSION STANDARDS

VIDEO COMPRESSION STANDARDS VIDEO COMPRESSION STANDARDS Family of standards: the evolution of the coding model state of the art (and implementation technology support): H.261: videoconference x64 (1988) MPEG-1: CD storage (up to

Overview: motion-compensated coding

Overview: motion-compensated coding Overview: motion-compensated coding Motion-compensated prediction Motion-compensated hybrid coding Motion estimation by block-matching Motion estimation with sub-pixel accuracy Power spectral density of

2014 Summer School on MPEG/VCEG Video. Video Coding Concept

2014 Summer School on MPEG/VCEG Video. Video Coding Concept 2014 Summer School on MPEG/VCEG Video 1 Video Coding Concept Outline 2 Introduction Capture and representation of digital video Fundamentals of video coding Summary Outline 3 Introduction Capture and representation

FAST MOTION ESTIMATION WITH DUAL SEARCH WINDOW FOR STEREO 3D VIDEO ENCODING

FAST MOTION ESTIMATION WITH DUAL SEARCH WINDOW FOR STEREO 3D VIDEO ENCODING FAST MOTION ESTIMATION WITH DUAL SEARCH WINDOW FOR STEREO 3D VIDEO ENCODING 1 Michal Joachimiak, 2 Kemal Ugur 1 Dept. of Signal Processing, Tampere University of Technology, Tampere, Finland 2 Jani Lainema,

Title Adaptive Lagrange Multiplier for Low Bit Rates in H.264.

Title Adaptive Lagrange Multiplier for Low Bit Rates in H.264. Provided by the author(s) and University College Dublin Library in accordance with publisher policies. Please cite the published version when available. Title Adaptive Lagrange Multiplier for Low Bit Rates

H.264/AVC Baseline Profile to MPEG-4 Visual Simple Profile Transcoding to Reduce the Spatial Resolution

H.264/AVC Baseline Profile to MPEG-4 Visual Simple Profile Transcoding to Reduce the Spatial Resolution H.264/AVC Baseline Profile to MPEG-4 Visual Simple Profile Transcoding to Reduce the Spatial Resolution Jae-Ho Hur, Hyouk-Kyun Kwon, Yung-Lyul Lee Department of Internet Engineering, Sejong University,

H.264/AVC BASED NEAR LOSSLESS INTRA CODEC USING LINE-BASED PREDICTION AND MODIFIED CABAC. Jung-Ah Choi, Jin Heo, and Yo-Sung Ho

H.264/AVC BASED NEAR LOSSLESS INTRA CODEC USING LINE-BASED PREDICTION AND MODIFIED CABAC. Jung-Ah Choi, Jin Heo, and Yo-Sung Ho H.264/AVC BASED NEAR LOSSLESS INTRA CODEC USING LINE-BASED PREDICTION AND MODIFIED CABAC Jung-Ah Choi, Jin Heo, and Yo-Sung Ho Gwangju Institute of Science and Technology {jachoi, jinheo, hoyo}@gist.ac.kr

View Synthesis for Multiview Video Compression

View Synthesis for Multiview Video Compression View Synthesis for Multiview Video Compression Emin Martinian, Alexander Behrens, Jun Xin, and Anthony Vetro email:{martinian,jxin,avetro}@merl.com, behrens@tnt.uni-hannover.de Mitsubishi Electric Research

Laboratoire d'informatique, de Robotique et de Microélectronique de Montpellier Montpellier Cedex 5 France

Laboratoire d'informatique, de Robotique et de Microélectronique de Montpellier Montpellier Cedex 5 France Video Compression Zafar Javed SHAHID, Marc CHAUMONT and William PUECH Laboratoire LIRMM VOODDO project Laboratoire d'informatique, de Robotique et de Microélectronique de Montpellier LIRMM UMR 5506 Université

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding.

Midterm Report, CS 584 Multimedia Communications. Submitted by: Syed Jawwad Bukhari 2004-03-0028 About

Rate-distortion Optimized Streaming of Compressed Light Fields with Multiple Representations

Prashant Ramanathan and Bernd Girod. Department of Electrical Engineering, Stanford University, Stanford CA 945

Optimizing Motion Estimation for H.264 Encoding

Mateus Krepsky Ludwich. Federal University of Santa Catarina (UFSC), Laboratory for Software and Hardware Integration (LISHA). PO Box 476-88049-900 - Florianópolis,

Multi-View Image Coding in 3-D Space Based on 3-D Reconstruction

Yongying Gao and Hayder Radha. Department of Electrical and Computer Engineering, Michigan State University, East Lansing, MI 48823 email:

Inter-prediction methods based on linear embedding for video compression

Martin Alain, Christine Guillemot, Dominique Thoreau, Philippe Guillotel. To cite this version: Martin Alain, Christine Guillemot,

A Low Bit-Rate Video Codec Based on Two-Dimensional Mesh Motion Compensation with Adaptive Interpolation

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 11, NO. 1, JANUARY 2001, p. 111
