

IEEE TRANSACTIONS ON BROADCASTING, VOL. 57, NO. 1, MARCH 2011

Efficient Multi-Reference Frame Selection Algorithm for Hierarchical B Pictures in Multiview Video Coding

Yun Zhang, Sam Kwong, Senior Member, IEEE, Gangyi Jiang, and Hanli Wang, Member, IEEE

Abstract: Multiple reference frame prediction technology is adopted in Multiview Video Coding (MVC) to explore temporal and inter-view redundancies of multiview videos, resulting in extremely high encoding complexity by searching the best reference frame indices and the best reference directions for each macroblock (MB). In order to reduce MVC coding computations while keeping the coding efficiency and thus to advance MVC in real-time multimedia broadcasting applications, we propose a Fast Multi-reference Frame Selection Algorithm (FMFSA) for hierarchical B picture prediction structure in this paper. Due to high spatial correlations within a MB, there is a high probability for smaller MB partition modes selecting the same reference frame and direction as B16×16 does. Therefore, the reference information of latter checked MB partition modes can be directly set according to the reference information of previous examined mode. Experimental results on MVC show that the proposed FMFSA can achieve 68.34%–79.01% total encoding time reduction while the average bit rate increase and peak signal-to-noise ratio degradation are within 0.54% and 0.04 dB, respectively, for test multiview sequences with various motion properties and camera arrangements.

Index Terms: Hierarchical B picture, multi-reference frame prediction, multiview video coding.

I. INTRODUCTION

MULTIVIEW video is capable of providing viewers with a totally new stereoscopic vision and interactive viewing experience [1].
It would be useful for many new multimedia applications, such as Free-viewpoint TeleVision (FTV), Three Dimensional TeleVision (3DTV) broadcasting, immersive teleconference, virtual reality and games. With the advances in the area of 3D display technology [2], image analysis and depth image based rendering [3], many difficulties that hampered a technical application of FTV or 3DTV so far have been overcome. However, since the data volume of multiview video sequences is proportional to the number of views, it requires huge storage space, wide transmission bandwidth and large computing power. Therefore, it is essential to develop Multiview Video Coding (MVC) algorithms with high compression efficiency and low complexity for real-time video applications, such as live 3D broadcasting, remote control and interactive video communication.

Manuscript received March 22, 2010; revised August 24, 2010; accepted September 07; date of publication November 11, 2010; date of current version February 23. This work was supported in part by Hong Kong RGC General Research Fund (GRF) projects and in part by the Natural Science Foundation of China.
Y. Zhang is with the Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China, and also with the Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong (e-mail: yunzhang@cityu.edu.hk).
S. Kwong is with the Department of Computer Science, City University of Hong Kong, Hong Kong (e-mail: cssamk@cityu.edu.hk).
G. Jiang is with the Faculty of Information Science and Engineering, Ningbo University, Ningbo, China (e-mail: jianggangyi@nbu.edu.cn).
H. Wang is with the Department of Computer Science and Technology and the Key Laboratory of Embedded System and Service Computing, Ministry of Education, Tongji University, Shanghai, China (e-mail: hanliwang@tongji.edu.cn).
Color versions of one or more of the figures in this paper are available online.
Many attempts have been made to improve the compression efficiency and lower the complexity of MVC. The Moving Picture Experts Group (MPEG) has surveyed several MVC schemes, such as Group-of-GOP prediction, checkerboard decomposition and sequential view prediction [4]. Merkle et al. proposed an MVC scheme using Hierarchical B Pictures (MVC-HBP) with superior compression efficiency and temporal scalability [5]. This MVC-HBP prediction structure has been adopted into the MVC standardization draft by the Joint Video Team (JVT), which was organized by ISO/IEC MPEG and the ITU-T Video Coding Experts Group (VCEG), and is used in the reference software Joint Multiview Video Coding (JMVC). However, MVC-HBP is quite complex, and fast algorithms are needed to reduce its complexity for practical applications [6]–[8]. Peng et al. proposed a fast mode decision algorithm for MVC with dynamic early termination [6]. Li et al. and Shen et al. proposed fast Motion Estimation (ME) and Disparity Estimation (DE) to reduce MVC's complexity [7], [8].

Furthermore, MVC schemes adopt Multi-Reference Frame (MRF) prediction technology to further explore both temporal and inter-view redundancies and improve coding efficiency. However, MRF prediction increases computational complexity significantly compared with single reference frame prediction. Several methods have been proposed to reduce the complexity of MRF estimation for H.264/AVC [9]–[11]. Su and Sun proposed a fast MRF algorithm that exploits the continuity of motion vectors among different reference frames [9]. Huang et al. proposed a fast MRF algorithm that searches either the previous or every reference frame based on the result of ME from the previous frame [10]. Kuo and Lu reduced the number of reference frames based on the best reference frame selected by the B8×8 mode and the variance of their motion vectors [11]. However, these schemes are mainly proposed for P frames in mono-view H.264/AVC, so they can hardly be applied to MVC using hierarchical B pictures. In addition, both inter-view and temporal predictions are adopted in MVC, which has different statistical properties from mono-view video coding, where only temporal prediction is employed. Lin and Tang presented a fast decision algorithm to predict the direction of motion compensation prediction or disparity compensation prediction for MVC [12]. Zhu et al. proposed a fast Inter mode decision scheme, in which inter-view prediction of other variable block size modes is reduced based on B16×16 reference information while encoding inter-view views [13]. However, multi-reference frame selection of temporal views was not optimized. Zhang et al. proposed adaptive reference selection that reduces coding complexity and improves random accessibility according to inter-view and temporal correlation of multiview video sequences [14]. However, the encoding time saving ratio is still limited and unstable for multiview videos with different spatio-temporal correlations.

In this paper, we propose an efficient Fast Multi-reference Frame Selection Algorithm (FMFSA) for complexity reduction of MVC. The rest of this paper is organized as follows. The mechanism of multi-reference selection in MVC is reviewed in Section II. Then, the proposed FMFSA for MVC is presented in Section III. Experimental results and analyses are shown in Section IV. Finally, Section V concludes this paper.

II. REVIEW ON MRF SELECTION FOR HIERARCHICAL B PICTURE IN MVC

The MVC-HBP structure [5] is a hybrid of inter-view and temporal prediction and has been adopted into the MVC standardization draft because it achieves high coding efficiency by applying bi-directional prediction hierarchically, i.e., adopting hierarchical B pictures [15]. Fig. 1 shows an example of the MVC-HBP prediction structure for an 8-view sequence when the Group-Of-Picture (GOP) length is 12. As can be seen, frames in the GOP are coded as highly complex hierarchical B pictures in order to achieve high compression efficiency, where $\lfloor \cdot \rfloor$ stands for the floor operation.

Fig. 1. MVC-HBP prediction structure in JMVC.

The variable block size mode decision and MRF selection in MVC-HBP are illustrated in Fig. 2. There are two different loop levels for encoding each macroblock (MB). One is the variable block size mode decision loop, in which the best mode is selected by checking mode candidates one by one. For a MB in B slices, there are the DIRECT mode, Inter-MB modes (B16×16, B16×8, B8×16, B8×8Frext and B8×8) and Intra-MB modes (I4MB, I8MB, I16MB and PCM). Each 8×8 block of the B8×8 mode can be further sub-partitioned into smaller blocks, and its prediction modes include SubDIRECT, SubB8×4, SubB4×8 and SubB4×4. The sub-partitioned blocks within a B8×8 block have the same reference frame. The other is the MRF loop, which selects the best reference frame by checking each active reference frame and direction for each variable block size mode. While encoding one MB, the MRF selection is performed for each Inter mode.

Fig. 2. Illustration of mode decision and MRF selection in MVC-HBP.

The MRF selection loop itself contains two inner loop levels. The first is the reference direction loop over List0 (forward), List1 (backward) and the bi-directional iterative search. List0 and List1 are the memory lists storing the forward and backward reference frames, respectively. In the bi-directional iterative search, all the frames in List0 and List1 are searched for refinement. In the following sections, the forward, backward and bi-directional predictions are denoted as FWD, BWD and BI for short. The second is the loop over all active reference frames (1 to NumberReferenceFrames (NRF)) in each direction.

Finally, after comparing the Rate Distortion (RD) costs of forward, backward and bi-directional iterative prediction, the reference information with the smallest RD cost, including reference frame indices and reference direction, is stored. The MRF process in MVC is of extremely high complexity when combined with the variable block size technology, because the complex MRF selection process is required for each MB and its sub-MB partitions.
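The two loop levels reviewed in Section II can be illustrated with a minimal Python sketch. It is not JMVC code: the `rd_cost` callback, the reduced mode list and the function names are hypothetical, and the bi-directional iterative search is simplified to an enumeration of every List0/List1 pair.

```python
from itertools import product

INTER_MODES = ["B16x16", "B16x8", "B8x16", "B8x8"]  # reduced list, for illustration


def candidate_references(nrf):
    """Enumerate (direction, RefIdx0, RefIdx1) candidates for one Inter mode."""
    fwd = [("FWD", r0, None) for r0 in range(nrf)]   # List0 only
    bwd = [("BWD", None, r1) for r1 in range(nrf)]   # List1 only
    # Assumption: BI is modelled as every List0/List1 pair, not an iterative search.
    bi = [("BI", r0, r1) for r0, r1 in product(range(nrf), repeat=2)]
    return fwd + bwd + bi


def exhaustive_selection(rd_cost, nrf):
    """Outer loop over modes, inner loop over directions and reference frames."""
    best_per_mode = {}
    for mode in INTER_MODES:                                  # mode decision loop
        best = None
        for direction, r0, r1 in candidate_references(nrf):   # MRF loop
            cost = rd_cost(mode, direction, r0, r1)
            if best is None or cost < best[0]:
                best = (cost, direction, r0, r1)
        best_per_mode[mode] = best
    best_mode = min(best_per_mode, key=lambda m: best_per_mode[m][0])
    return best_mode, best_per_mode
```

In this simplified model every Inter mode repeats the full reference search, so the number of RD cost evaluations grows with the product of the number of modes and the number of reference candidates. This repeated search is the cost that the rest of the paper targets.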

III. PROPOSED FAST MULTI-REFERENCE SELECTION ALGORITHM BASED ON BLOCK CORRELATION

While encoding one MB in hierarchical B pictures, the optimal mode $m^*$, optimal reference direction $d^*$ and reference frame $r^*$ are determined by RD optimization. It can be expressed as

$$(m^*, d^*, r^*) = \arg\min_{m \in \mathcal{M},\, d,\, r} J(s, c \mid m, d, r) \qquad (1)$$

where $J$ is the RD cost, $s$ and $c$ are the current and reference block, respectively, and $\mathcal{M}$ represents the set of Inter modes. The reference frame indicator $r$ is composed of two elements: the reference index in List0 (RefIdx0) and the reference index in List1 (RefIdx1), $r = (\text{RefIdx0}, \text{RefIdx1})$. $d$ indicates the reference prediction direction, $d \in \{\text{FWD}, \text{BWD}, \text{BI}\}$. Fig. 3 shows an example of inter-view and temporal joint prediction, where Ref1 and Ref2 are reference frames in List0, and Ref3 and Ref4 are in List1. Due to similar properties and high spatial correlation of the pixels within a MB, smaller MB partition modes, e.g., B8×8 and B16×8, will probably select the same prediction direction and reference indices as B16×16 does. That is, it is of high probability that $r = r_{16\times16}$ and $d = d_{16\times16}$ for encoding modes with partition size smaller than 16×16. This probability is identified statistically as follows.

Fig. 3. Illustration of MRF selection for different MB partitions.

A. Statistical Analyses of MRF Selection

Let $A$ be the event that both the reference frame and prediction direction of the best mode are equal to the reference frame and direction of the B16×16 mode, $B_1$ be the event of selecting DIRECT, SubDIRECT, I4MB, I8MB, I16MB or PCM mode as the best mode, and $B_2$ be the event of selecting other Inter modes. The corresponding probabilities of $A$, $B_1$ and $B_2$ are denoted as $P(A)$, $P(B_1)$ and $P(B_2)$, respectively. It can be seen that $B_1$ and $B_2$ are mutually exclusive and satisfy

$$P(B_1) + P(B_2) = 1 \qquad (2)$$

When $B_1$ takes place, we can assume that an arbitrary reference frame and prediction direction are the best, because the modes of $B_1$ do not need MRF selection. In this sense, it can also be considered that $A$ always happens as $B_1$ takes place. Therefore, we have the conditional probability

$$P(A \mid B_1) = 1 \qquad (3)$$

As $B_2$ happens, the probability of selecting the reference frame and prediction direction of B16×16 as the best can be represented as the conditional probability $P(A \mid B_2)$. In Fig. 4, the statistical analyses are given for probabilities $P(A \mid B_2)$ and $P(B_1)$ by searching all reference frames, directions and MB modes. Three multiview video sequences, Breakdancers (fast motion), Ballet (moderate motion) and Doorflowers (slow motion), are analyzed. Fast ME/DE is enabled and the parameter NRF is set to 2. Eight views, including four temporal coded views (even views) and four inter-view/temporal joint coded views (odd views), are encoded. The x-axis of the figures shows the different view sections and the different frames in each view section. The y-axis is the probability $P(A \mid B_2)$ for Fig. 4(a) and the mode probability $P(B_1)$ for Fig. 4(b). It can be observed that $P(A \mid B_2)$ is as high as 75%–99% for all the frames of the test sequences when the basis Quantization Parameter (QP), denoted by bQP, is 28. In addition, Fig. 4(b) shows the percentage of MBs coded by modes in $B_1$; about 60%–99% of MBs are coded by the $B_1$ modes.

Fig. 4. Statistical analyses on probability $P(A \mid B_2)$ and mode probability $P(B_1)$ when bQP is 28: (a) probability $P(A \mid B_2)$; (b) mode probability $P(B_1)$.

TABLE I. AVERAGE PROBABILITY OF $P(A \mid B_2)$, $P(B_1)$ AND HIT RATE $P(A)$ [UNIT: %]

The average conditional probability $P(A \mid B_2)$ and average mode probability $P(B_1)$ with different bQPs are shown in Table I, where it can be seen that $P(A \mid B_2)$ and $P(B_1)$ increase as bQP increases and decrease as the motion gets faster. Generally, the average value of $P(A \mid B_2)$ is larger than 66.31% for all the test sequences and bQPs. Similar statistical results can also be found when full ME/DE search is enabled. According to conditional probability theory, we can obtain

$$P(A) = P(A \mid B_1) P(B_1) + P(A \mid B_2) P(B_2) \qquad (4)$$

Then, substituting (3) into (4), we can rewrite $P(A)$ as

$$P(A) = P(B_1) + P(A \mid B_2) P(B_2) \qquad (5)$$

Taking Breakdancers as an example, the average $P(A \mid B_2)$ and $P(B_1)$ are 66.31% and 81.54%, respectively. The hit rate $P(A)$ then follows from (5), (2) and the statistical data in Table I. Higher $P(A)$ values are obtained for larger QP and for multiview video sequences with moderate and slow motion, e.g., Ballet and Doorflowers. As shown in the last four rows of Table I (i.e., the $P(A)$ results), very few MBs, i.e., $1 - P(A)$, miss the optimal reference frame or prediction direction.

Fig. 5. Statistical analyses on $P(A \mid B_2)$ and $P(B_1)$ with different NRFs: (a) probability $P(A \mid B_2)$; (b) mode probability $P(B_1)$.

Fig. 5 shows the statistical analyses on $P(A \mid B_2)$ and $P(B_1)$ under the test condition that fast ME/DE is enabled with different NRFs and bQP equal to 28. It can be observed that the probabilities $P(A \mid B_2)$ and $P(B_1)$ are consistent for different NRFs. In other words, more computational complexity can be saved for larger NRF values. However, according to inter-view and temporal correlation analyses on multiview videos, frames in different views and at different time instants relative to the current encoding frame usually have low dependencies and do not need to be referenced by the current frame [5]. So, in the following experiments evaluating the proposed FMFSA, NRF is set to 2, which is the default setting of JMVC.
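As a worked illustration of the hit rate computation, the short Python sketch below writes A for the event that the best mode shares the reference frame and direction of B16×16, B1 for the event that a mode without MRF selection (DIRECT, SubDIRECT or an Intra mode) is best, and B2 for the remaining Inter modes. It assumes the total-probability form P(A) = P(B1) + P(A|B2)·P(B2) with P(A|B1) = 1, and it assumes the two Breakdancers averages quoted above are read as P(A|B2) = 66.31% and P(B1) = 81.54%. The function name is hypothetical.

```python
def hit_rate(p_a_given_b2, p_b1):
    """Hit rate P(A) under the assumption P(A|B1) = 1 and P(B1) + P(B2) = 1."""
    p_b2 = 1.0 - p_b1
    return p_b1 + p_a_given_b2 * p_b2


# Breakdancers averages as read above (assumed assignment of the two values).
print(f"{hit_rate(0.6631, 0.8154):.4f}")  # prints 0.9378
```

Under these assumptions roughly 94% of MBs would keep the optimal reference frame and direction, and the remaining 6% or so would be the MBs at risk of a sub-optimal choice.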
Based on the above analyses, the following two observations can be made.

1) If the B16×16 mode selects one frame as the optimal reference frame, smaller MB partition modes, such as B16×8, B8×16 and B8×8, will select the same reference frame as the optimal one in each prediction direction.
2) Smaller block partition modes will select the same prediction direction, i.e., FWD, BWD or BI, as B16×16 does.

Accordingly, a novel FMFSA is proposed for MVC as follows.

B. Proposed FMFSA and Complexity Analysis

Due to the high probability of $r = r_{16\times16}$ and $d = d_{16\times16}$, we can solve the optimization problem of (1) in the following two steps. Firstly, check the B16×16 mode with all active reference frames and directions to obtain the optimal reference frame and direction for the B16×16 mode as

$$(d^*_{16\times16}, r^*_{16\times16}) = \arg\min_{d,\, r} J(s, c \mid m = \text{B16×16}, d, r) \qquad (6)$$

Then, when checking other Inter modes, their reference frame and reference direction are set to $r^*_{16\times16}$ and $d^*_{16\times16}$ directly. That is,

$$m^* = \arg\min_{m \in \mathcal{M}} J(s, c \mid m, d^*_{16\times16}, r^*_{16\times16}) \qquad (7)$$

Therefore, the coding complexity can be significantly reduced. The proposed FMFSA algorithm is described as follows.

Step 1) Encode the current MB with DIRECT mode.
Step 2) Encode the current MB with B16×16 mode and obtain the best reference frames in List0 and List1, respectively. Save the prediction directions of the B16×16 mode for later use.
Step 3) Encode the current MB with other Inter modes, using the reference information of B16×16 that is available from Step 2.
Step 4) Encode the current MB with Intra modes.
Step 5) Store the coding parameters with the smallest RD cost and write the coded bitstream. Then go to Step 1 for the next MB.

Fig. 6. Eight views of multiview video sequences: (a) Race1 (KDDI); (b) Ballroom (MERL); (c) Exit (MERL); (d) Doorflowers (HHI); (e) Lovebird1 (ETRI); (f) Ballet (MSR); (g) Breakdancers (MSR); (h) Dog (Nagoya Univ.)

The complexity analysis of the proposed FMFSA is given below. Let $C$ be the complexity of B16×16. The complexity of DIRECT, other Inter modes (including B16×8, B8×16, B8×8Frext, B8×8, SubB8×4, SubB4×8 and SubB4×4) and Intra modes (including I4MB, I8MB, I16MB and PCM) can be represented as $\alpha C$, $\beta C$ and $\gamma C$, where $\alpha$, $\beta$ and $\gamma$ are positive multiplication factors. Therefore, the total complexity of encoding one MB via JMVC is $C_{\mathrm{JMVC}} = (1 + \alpha + \beta + \gamma) C$.

Let $C_{\mathrm{FWD}}$ and $C_{\mathrm{BWD}}$ be the complexity of multi-reference search for FWD and BWD, respectively, where $n$ reference frames are active in each memory list, i.e., NRF equals $n$. Based on our empirical experience, $C_{\mathrm{BWD}}$ can be considered approximately equal to $C_{\mathrm{FWD}}$, so only $C_{\mathrm{FWD}}$ is used, i.e., $C_{\mathrm{BWD}} \approx C_{\mathrm{FWD}}$. Let $C_{\mathrm{BI}}$ be the complexity of bi-directional iterative search, expressed through a positive multiplication factor $\lambda$ that depends on the number of iterations, the iterative search algorithm and the iterative search range. Because each Inter mode is composed of FWD/BWD multi-reference search and bi-directional iterative search, $C$ equals $C_{\mathrm{FWD}} + C_{\mathrm{BWD}} + C_{\mathrm{BI}}$. Let $P_{\mathrm{FWD}}$, $P_{\mathrm{BWD}}$ and $P_{\mathrm{BI}}$ be the probabilities of selecting FWD, BWD and BI as the best direction, where $P_{\mathrm{FWD}} + P_{\mathrm{BWD}} + P_{\mathrm{BI}} = 1$. Thus, considering that the complexities of a MB for performing ME/DE on each reference frame are almost the same, the total computational complexity of encoding a MB via FMFSA, $C_{\mathrm{FMFSA}}$, can be calculated from these quantities. The multiplication factors are obtained from complexity analyses of the MVC coding process when the search range is 96, fast ME/DE is enabled, $n$ equals 2, the number of bi-directional search iterations is 4, and the iterative search range is 8. With these factors, the total complexities of encoding one MB via the original JMVC and via FMFSA can be evaluated. For different multiview video sequences, $P_{\mathrm{BI}}$ usually ranges from 5% to 35% and is 20% on average. Thus, the total complexity of encoding one MB via FMFSA corresponds to a 70.0% complexity reduction on average.

IV. EXPERIMENTAL RESULTS AND ANALYSES

The recent H.264/AVC based MVC reference software JMVC 3.0 [16] is utilized to evaluate the proposed FMFSA algorithm. Fast ME/DE is enabled and its search range is set to 96. The number of bi-prediction iterations is 4 and the search range for iterations is 8. The maximum number of reference frames is 2 and the GOP length is 12. Eight different multiview video test sequences, including Race1, Ballroom, Exit, Lovebird1, Doorflowers, Breakdancers, Ballet and Dog, with various motion properties and camera arrangements are adopted. Fig. 6 shows eight views of these test multiview video sequences. Detailed information on the test sequences is given in Table II. Eight views of each multiview video sequence and 61 frames of each view are encoded. Four bQP values, 24, 28, 32 and 36, are used in our experiments.

TABLE II. TEST MULTIVIEW VIDEO SEQUENCES

TABLE III. RATE, PSNR AND ENCODING TIME COMPARISONS AMONG ORIGINAL JMVC, KUO'S SCHEME, ZHU'S SCHEME AND FMFSA
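Steps 1 to 5 of the FMFSA procedure in Section III-B can be sketched in Python as follows. This is a simplified illustration rather than the JMVC implementation: `rd_cost` is a hypothetical callback returning the RD cost of a mode for a given reference choice, the mode lists are reduced (sub-partition modes and B8×8Frext are omitted), and bitstream writing is left out.

```python
OTHER_INTER_MODES = ["B16x8", "B8x16", "B8x8"]   # reduced list, for illustration
INTRA_MODES = ["I4MB", "I8MB", "I16MB", "PCM"]


def fmfsa_encode_mb(rd_cost, candidates):
    """Return (cost, mode, ref_info) with the smallest RD cost for one MB.

    rd_cost(mode, ref_info) is a hypothetical callback; ref_info is None for
    modes that need no MRF selection. candidates lists every reference choice
    (direction plus reference indices) that the full search would examine.
    """
    # Step 1: DIRECT mode.
    results = [(rd_cost("DIRECT", None), "DIRECT", None)]
    # Step 2: full MRF search, but only for B16x16.
    costs_16 = [(rd_cost("B16x16", ref), ref) for ref in candidates]
    cost_16, ref_16 = min(costs_16, key=lambda item: item[0])
    results.append((cost_16, "B16x16", ref_16))
    # Step 3: other Inter modes reuse the B16x16 reference information.
    for mode in OTHER_INTER_MODES:
        results.append((rd_cost(mode, ref_16), mode, ref_16))
    # Step 4: Intra modes.
    for mode in INTRA_MODES:
        results.append((rd_cost(mode, None), mode, None))
    # Step 5: keep the coding parameters with the smallest RD cost.
    return min(results, key=lambda item: item[0])
```

In this sketch, each of the other Inter modes costs one RD evaluation instead of one per reference candidate, which is where the saving comes from. A real encoder would still run ME/DE inside that single evaluation.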
The coding parameters are consistent for the original JMVC, Kuo's scheme [11], Zhu's scheme [13] and the proposed FMFSA for a fair comparison. All video coding experiments are performed on a Dell OPTIPLEX GX620 computer with an Intel Pentium IV dual Core 3.20 GHz and 3.19 GHz CPU, 2 GB memory and the Microsoft Windows XP Professional operating system. Table III shows the encoding time, Peak Signal-to-Noise Ratio (PSNR) and bit rate comparison among the test algorithms, where the time saving ratio, PSNR difference and bit rate increment between the original JMVC encoder and the test algorithms are computed as

$$\Delta T = \frac{T_{\mathrm{JMVC}} - T_{\phi}}{T_{\mathrm{JMVC}}} \times 100\%, \quad \Delta \mathrm{PSNR} = \mathrm{PSNR}_{\phi} - \mathrm{PSNR}_{\mathrm{JMVC}}, \quad \Delta R = \frac{R_{\phi} - R_{\mathrm{JMVC}}}{R_{\mathrm{JMVC}}} \times 100\% \qquad (8)$$

where $T_{\phi}$, $\mathrm{PSNR}_{\phi}$ and $R_{\phi}$ are the total encoding time, PSNR and bit rate of algorithm $\phi$, $\phi \in \{\text{Kuo's scheme}, \text{Zhu's scheme}, \text{FMFSA}\}$, and $T_{\mathrm{JMVC}}$, $\mathrm{PSNR}_{\mathrm{JMVC}}$ and $R_{\mathrm{JMVC}}$ are the total encoding time, PSNR and bit rate of the original JMVC.

From Table III, Kuo's scheme reduces the total encoding time by 29.69%–39.70% for even views; meanwhile, the average bit rate increase and the average PSNR degradation stay within the bounds reported in Table III. For odd views, Kuo's scheme achieves a larger complexity reduction, 44.00%–56.37%; however, the bit rate increases by 6.41% on average. This large bit rate increase is due to the fact that Kuo's scheme was proposed for traditional mono-view video coding and does not take inter-view prediction into consideration. When encoding odd views with Zhu's scheme, a 45.15%–55.96% computational complexity reduction is achieved, while the bit rate increase is within 0.33% and the average PSNR degradation is within 0.01 dB. However, Zhu's scheme is proposed for complexity reduction of odd views and is not applicable to even views, i.e., the even views are encoded by the original JMVC and no complexity reduction is achieved for them.

Fig. 7. RD curves of JMVC, Kuo's scheme, Zhu's scheme and the proposed FMFSA.

Fig. 8. Encoding time saving ratio achieved by Kuo's scheme, Zhu's scheme and FMFSA.
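The three comparison measures defined by (8), namely time saving ratio, PSNR difference and bit rate increment relative to the original JMVC, can be computed with a small helper such as the one below. The sign conventions (positive time saving when the tested algorithm is faster, negative PSNR difference for a quality loss, positive bit rate increment for a larger bitstream) are assumptions, and the function name and the numbers in the usage lines are hypothetical, not values from Table III.

```python
def compare_to_jmvc(time_alg, psnr_alg, rate_alg, time_jmvc, psnr_jmvc, rate_jmvc):
    """Return (time saving [%], PSNR difference [dB], bit rate increment [%])."""
    time_saving = (time_jmvc - time_alg) / time_jmvc * 100.0
    psnr_diff = psnr_alg - psnr_jmvc
    rate_increase = (rate_alg - rate_jmvc) / rate_jmvc * 100.0
    return time_saving, psnr_diff, rate_increase


# Hypothetical measurements: seconds, dB, kbit/s.
ts, dpsnr, drate = compare_to_jmvc(250.0, 37.98, 502.0, 1000.0, 38.00, 500.0)
print(f"time saving {ts:.2f}%, PSNR {dpsnr:+.2f} dB, bit rate {drate:+.2f}%")
```

Normalizing time and bit rate by the JMVC values keeps the figures comparable across sequences with very different absolute encoding times and bit rates.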
As for the proposed FMFSA, the average bit rate increase is 0.32% for even views and 0.54% for odd views over all eight test multiview video sequences. Meanwhile, the PSNR degradation is 0.02 dB on average and within 0.04 dB for all test sequences and all views. For better observation, Fig. 7 shows the comparison of RD curves among the test algorithms. The proposed FMFSA retains almost the same RD performance as the original JMVC and Zhu's scheme, and outperforms Kuo's scheme.

As far as the coding complexity is concerned, the proposed FMFSA reduces the total encoding time by 68.34%–79.01% for all test sequences. FMFSA achieves a 73.67% complexity reduction on average, which is much more than the 41.45% average complexity reduction achieved by Kuo's scheme. For coding odd views, FMFSA achieves a 76.68% complexity reduction on average, which is 24.64% more than the complexity reduction achieved by Zhu's scheme. Fig. 8 shows the average encoding time saving ratio for even views and odd views, respectively. From the results, we obtain the following three facts: 1) The proposed FMFSA achieves 5% more complexity reduction for odd views than for even views, because the odd views originally require more coding effort. 2) The overall results indicate that FMFSA retains a reliable and consistent complexity reduction, approximately 70%, for all test sequences, even with various video contents, camera arrangements and motion properties. 3) FMFSA reduces much more computational complexity than Kuo's scheme and Zhu's scheme. FMFSA is a flexible framework focusing on MRF selection and can be integrated with other existing fast algorithms, such as fast mode decision and fast ME/DE, to further reduce MVC encoding computations.

V. CONCLUSION

This paper presents an efficient multi-reference frame selection algorithm for hierarchical B pictures by exploiting the high reference frame and direction correlation among variable block size coding modes. Experimental results show that the proposed FMFSA achieves 68.34%–79.01% total encoding time reduction as compared to the original JMVC 3.0.
Meanwhile, the average bit rate increase and PSNR degradation of FMFSA are within 0.54% and 0.04 dB, respectively, which keeps the RD performance of the original JMVC more or less intact. In the future, we will study new fast algorithms including fast mode decision and fast motion estimation based on FMFSA to further reduce the encoding computations for MVC.

REFERENCES

[1] M. Tanimoto, "Overview of free viewpoint television," Signal Process.: Image Commun., vol. 21, no. 6, pp , Jul
[2] N. Nithiyanandam, "A three-dimensional digital image display system," IEEE Trans. Broadcast., vol. BC-21, no. 4, p. 53, Dec
[3] L. Zhang and W. J. Tam, "Stereoscopic image generation based on depth images for 3DTV," IEEE Trans. Broadcast., vol. 51, no. 2, pp , Jun
[4] Survey of Algorithms Used for Multi-View Video Coding (MVC), ISO/IEC JTC1/SC29/WG11, N6909, Jan
[5] P. Merkle, A. Smolic, K. Müller, and T. Wiegand, "Efficient prediction structures for multi-view video coding," IEEE Trans. Circuits Syst. Video Technol., vol. 17, no. 11, pp , Nov
[6] Z. J. Peng, G. Y. Jiang, and M. Yu, "A fast multiview video coding algorithm based dynamic multi-threshold," in Proc. IEEE ICME'09, Jun. 2009, pp
[7] X. M. Li, D. B. Zhao, S. W. Ma, and W. Gao, "Fast disparity and motion estimation based on correlations for multiview video coding," IEEE Trans. Consumer Electron., vol. 54, no. 4, pp , Nov
[8] L. Q. Shen, Z. Liu, S. X. Liu, Z. Y. Zhang, and P. An, "Selective disparity estimation and variable size motion estimation based on motion homogeneity for multi-view coding," IEEE Trans. Broadcast., vol. 55, no. 4, pp , Dec
[9] Y. P. Su and M. T. Sun, "Fast multiple reference frame motion estimation for H.264/AVC," IEEE Trans. Circuits Syst. Video Technol., vol. 16, no. 3, pp , Mar
[10] Y. W. Huang, B. Y. Hsieh, S. Y. Chien et al., "Analysis and complexity reduction of multiple reference frames motion estimation in H.264/AVC," IEEE Trans. Circuits Syst. Video Technol., vol. 16, no. 4, pp , Apr
[11] T. Y. Kuo and H. J. Lu, "Efficient reference frame selector for H.264," IEEE Trans. Circuits Syst. Video Technol., vol. 18, no. 3, pp , Apr
[12] J. P. Lin and A. C. Tang, "A fast direction predictor of inter frame prediction for multi-view video coding," in Proc. IEEE ISCAS'09, Taipei, Taiwan, May 2009, pp
[13] W. Zhu, W. Jiang, and Y. Chen, "A fast inter mode decision for multiview video coding," in Proc. ICIECS'09, Qiangdao, China, Dec. 2009, pp
[14] Y. Zhang, G. Y. Jiang, M. Yu, and Y. S. Ho, "Adaptive multiview video coding scheme based on spatio-temporal correlation analyses," ETRI Journal, vol. 31, no. 2, pp , Apr
[15] H. Schwarz, D. Marpe, and T. Wiegand, "Hierarchical B pictures," in JVT of ISO/IEC MPEG & ITU-T VCEG, Poznan, PL, Jul. 2005, Doc. JVT-P014.
[16] Y. Chen, P. Pandit, and S. Yea, "WD1 reference software for MVC (JMVC) 3.0," in JVT of ISO/IEC MPEG & ITU-T VCEG, Busan, Korea, Oct. 2008, Doc. JVT-AC207.

Yun Zhang received the B.S. and M.S. degrees in electrical engineering from Ningbo University, Ningbo, China, in 2004 and 2007, respectively, and the Ph.D. degree in computer science from the Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China. From 2009 to 2010, he was a Visiting Scholar with the Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong. In 2010, he joined the Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, as an Assistant Researcher. His research interests are multiview video coding, video object segmentation and content based video processing.

Sam Kwong (M'93–SM'04) received the B.S. and M.S. degrees in electrical engineering from the State University of New York at Buffalo in 1983 and the University of Waterloo, Waterloo, ON, Canada, in 1985, respectively, and the Ph.D. degree from the University of Hagen, Germany. From 1985 to 1987, he was a Diagnostic Engineer with Control Data Canada. He joined Bell Northern Research Canada as a Member of Scientific Staff. In 1990, he became a Lecturer in the Department of Electronic Engineering, City University of Hong Kong, where he is currently a Professor in the Department of Computer Science. His research interests are video and image coding and evolutionary algorithms.

Gangyi Jiang received the M.S. degree from Hangzhou University, Hangzhou, China, in 1992, and the Ph.D. degree from Ajou University, Korea. In 2000, he joined the Faculty of Information Science and Engineering, Ningbo University, China, as a Professor. His research interests include digital video compression and communications, multi-view video coding and image processing.

Hanli Wang (M'08) received the B.S. and M.S. degrees in electrical engineering from Zhejiang University, Hangzhou, China, in 2001 and 2004, respectively, and the Ph.D. degree in computer science from City University of Hong Kong (CityU), Kowloon, Hong Kong. From 2007 to 2008, he was a Research Fellow with the Department of Computer Science, CityU. From 2007 to 2008, he was also a Visiting Scholar with Stanford University, Palo Alto, CA, invited by Prof. C. K. Chui. From 2008 to 2009, he was a Research Engineer with Precoad, Inc., Menlo Park, CA. From 2009 to 2010, he was an Alexander von Humboldt Research Fellow at the University of Hagen, Hagen, Germany. In 2010, he joined the Department of Computer Science & Technology, Tongji University, Shanghai, China, as a Professor. His current research interests include digital video coding, image processing, pattern recognition and video analysis.


Fast Encoding Techniques for Multiview Video Coding Fast Encoding Techniques for Multiview Video Coding S. Khattak a, R. Hamzaoui a,, S. Ahmad a, P. Frossard b a Centre for Electronic and Communications Engineering, De Montfort University, United Kingdom

More information

Deblocking Filter Algorithm with Low Complexity for H.264 Video Coding

Deblocking Filter Algorithm with Low Complexity for H.264 Video Coding Deblocking Filter Algorithm with Low Complexity for H.264 Video Coding Jung-Ah Choi and Yo-Sung Ho Gwangju Institute of Science and Technology (GIST) 261 Cheomdan-gwagiro, Buk-gu, Gwangju, 500-712, Korea

More information

WITH the improvements in high-speed networking, highcapacity

WITH the improvements in high-speed networking, highcapacity 134 IEEE TRANSACTIONS ON BROADCASTING, VOL. 62, NO. 1, MARCH 2016 A Virtual View PSNR Estimation Method for 3-D Videos Hui Yuan, Member, IEEE, Sam Kwong, Fellow, IEEE, Xu Wang, Student Member, IEEE, Yun

More information

View Synthesis for Multiview Video Compression

View Synthesis for Multiview Video Compression View Synthesis for Multiview Video Compression Emin Martinian, Alexander Behrens, Jun Xin, and Anthony Vetro email:{martinian,jxin,avetro}@merl.com, behrens@tnt.uni-hannover.de Mitsubishi Electric Research

More information

An Efficient Mode Selection Algorithm for H.264

An Efficient Mode Selection Algorithm for H.264 An Efficient Mode Selection Algorithm for H.64 Lu Lu 1, Wenhan Wu, and Zhou Wei 3 1 South China University of Technology, Institute of Computer Science, Guangzhou 510640, China lul@scut.edu.cn South China

More information

View Generation for Free Viewpoint Video System

View Generation for Free Viewpoint Video System View Generation for Free Viewpoint Video System Gangyi JIANG 1, Liangzhong FAN 2, Mei YU 1, Feng Shao 1 1 Faculty of Information Science and Engineering, Ningbo University, Ningbo, 315211, China 2 Ningbo

More information

CONSEQUENTLY with H.264 Advanced Video Coding

CONSEQUENTLY with H.264 Advanced Video Coding 1598 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 22, NO. 4, APRIL 2013 Multiview Coding Mode Decision With Hybrid Optimal Stopping Model Tiesong Zhao, Member, IEEE, Sam Kwong, Senior Member, IEEE, Hanli

More information

International Journal of Emerging Technology and Advanced Engineering Website: (ISSN , Volume 2, Issue 4, April 2012)

International Journal of Emerging Technology and Advanced Engineering Website:   (ISSN , Volume 2, Issue 4, April 2012) A Technical Analysis Towards Digital Video Compression Rutika Joshi 1, Rajesh Rai 2, Rajesh Nema 3 1 Student, Electronics and Communication Department, NIIST College, Bhopal, 2,3 Prof., Electronics and

More information

Upcoming Video Standards. Madhukar Budagavi, Ph.D. DSPS R&D Center, Dallas Texas Instruments Inc.

Upcoming Video Standards. Madhukar Budagavi, Ph.D. DSPS R&D Center, Dallas Texas Instruments Inc. Upcoming Video Standards Madhukar Budagavi, Ph.D. DSPS R&D Center, Dallas Texas Instruments Inc. Outline Brief history of Video Coding standards Scalable Video Coding (SVC) standard Multiview Video Coding

More information

FAST SPATIAL LAYER MODE DECISION BASED ON TEMPORAL LEVELS IN H.264/AVC SCALABLE EXTENSION

FAST SPATIAL LAYER MODE DECISION BASED ON TEMPORAL LEVELS IN H.264/AVC SCALABLE EXTENSION FAST SPATIAL LAYER MODE DECISION BASED ON TEMPORAL LEVELS IN H.264/AVC SCALABLE EXTENSION Yen-Chieh Wang( 王彥傑 ), Zong-Yi Chen( 陳宗毅 ), Pao-Chi Chang( 張寶基 ) Dept. of Communication Engineering, National Central

More information

THE HIGH definition (HD) and ultra HD videos have

THE HIGH definition (HD) and ultra HD videos have IEEE TRANSACTIONS ON BROADCASTING, VOL. 62, NO. 3, SEPTEMBER 2016 675 Fast Motion Estimation Based on Content Property for Low-Complexity H.265/HEVC Encoder Zhaoqing Pan, Member, IEEE, Jianjun Lei, Member,

More information

Optimum Quantization Parameters for Mode Decision in Scalable Extension of H.264/AVC Video Codec

Optimum Quantization Parameters for Mode Decision in Scalable Extension of H.264/AVC Video Codec Optimum Quantization Parameters for Mode Decision in Scalable Extension of H.264/AVC Video Codec Seung-Hwan Kim and Yo-Sung Ho Gwangju Institute of Science and Technology (GIST), 1 Oryong-dong Buk-gu,

More information

A COST-EFFICIENT RESIDUAL PREDICTION VLSI ARCHITECTURE FOR H.264/AVC SCALABLE EXTENSION

A COST-EFFICIENT RESIDUAL PREDICTION VLSI ARCHITECTURE FOR H.264/AVC SCALABLE EXTENSION A COST-EFFICIENT RESIDUAL PREDICTION VLSI ARCHITECTURE FOR H.264/AVC SCALABLE EXTENSION Yi-Hau Chen, Tzu-Der Chuang, Chuan-Yung Tsai, Yu-Jen Chen, and Liang-Gee Chen DSP/IC Design Lab., Graduate Institute

More information

Express Letters. A Simple and Efficient Search Algorithm for Block-Matching Motion Estimation. Jianhua Lu and Ming L. Liou

Express Letters. A Simple and Efficient Search Algorithm for Block-Matching Motion Estimation. Jianhua Lu and Ming L. Liou IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 7, NO. 2, APRIL 1997 429 Express Letters A Simple and Efficient Search Algorithm for Block-Matching Motion Estimation Jianhua Lu and

More information

A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames

A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames Ki-Kit Lai, Yui-Lam Chan, and Wan-Chi Siu Centre for Signal Processing Department of Electronic and Information Engineering

More information

Fast frame memory access method for H.264/AVC

Fast frame memory access method for H.264/AVC Fast frame memory access method for H.264/AVC Tian Song 1a), Tomoyuki Kishida 2, and Takashi Shimamoto 1 1 Computer Systems Engineering, Department of Institute of Technology and Science, Graduate School

More information

An Efficient Intra Prediction Algorithm for H.264/AVC High Profile

An Efficient Intra Prediction Algorithm for H.264/AVC High Profile An Efficient Intra Prediction Algorithm for H.264/AVC High Profile Bo Shen 1 Kuo-Hsiang Cheng 2 Yun Liu 1 Ying-Hong Wang 2* 1 School of Electronic and Information Engineering, Beijing Jiaotong University

More information

EE Low Complexity H.264 encoder for mobile applications

EE Low Complexity H.264 encoder for mobile applications EE 5359 Low Complexity H.264 encoder for mobile applications Thejaswini Purushotham Student I.D.: 1000-616 811 Date: February 18,2010 Objective The objective of the project is to implement a low-complexity

More information

A Novel Deblocking Filter Algorithm In H.264 for Real Time Implementation

A Novel Deblocking Filter Algorithm In H.264 for Real Time Implementation 2009 Third International Conference on Multimedia and Ubiquitous Engineering A Novel Deblocking Filter Algorithm In H.264 for Real Time Implementation Yuan Li, Ning Han, Chen Chen Department of Automation,

More information

LBP-GUIDED DEPTH IMAGE FILTER. Rui Zhong, Ruimin Hu

LBP-GUIDED DEPTH IMAGE FILTER. Rui Zhong, Ruimin Hu LBP-GUIDED DEPTH IMAGE FILTER Rui Zhong, Ruimin Hu National Engineering Research Center for Multimedia Software,School of Computer, Wuhan University,Wuhan, 430072, China zhongrui0824@126.com, hrm1964@163.com

More information

Fast Mode Decision for H.264/AVC Using Mode Prediction

Fast Mode Decision for H.264/AVC Using Mode Prediction Fast Mode Decision for H.264/AVC Using Mode Prediction Song-Hak Ri and Joern Ostermann Institut fuer Informationsverarbeitung, Appelstr 9A, D-30167 Hannover, Germany ri@tnt.uni-hannover.de ostermann@tnt.uni-hannover.de

More information

An Independent Motion and Disparity Vector Prediction Method for Multiview Video Coding

An Independent Motion and Disparity Vector Prediction Method for Multiview Video Coding Preprint Version (2011) An Independent Motion and Disparity Vector Prediction Method for Multiview Video Coding Seungchul Ryu a, Jungdong Seo a, Dong Hyun Kim a, Jin Young Lee b, Ho-Cheon Wey b, and Kwanghoon

More information

A New H.264-Based Rate Control Algorithm for Stereoscopic Video Coding

A New H.264-Based Rate Control Algorithm for Stereoscopic Video Coding A New H.264-Based ate Control Algorithm for Stereoscopic Video Coding Yi Liao, Wencheng Yang, and Gangyi Jiang International Science Index, Computer and Information Engineering waset.org/publication/161

More information

Experimental Evaluation of H.264/Multiview Video Coding over IP Networks

Experimental Evaluation of H.264/Multiview Video Coding over IP Networks ISSC 11, Trinity College Dublin, June 23-24 Experimental Evaluation of H.264/Multiview Video Coding over IP Networks Zhao Liu *, Yuansong Qiao *, Brian Lee *, Enda Fallon **, Karunakar A. K. *, Chunrong

More information

EE 5359 Low Complexity H.264 encoder for mobile applications. Thejaswini Purushotham Student I.D.: Date: February 18,2010

EE 5359 Low Complexity H.264 encoder for mobile applications. Thejaswini Purushotham Student I.D.: Date: February 18,2010 EE 5359 Low Complexity H.264 encoder for mobile applications Thejaswini Purushotham Student I.D.: 1000-616 811 Date: February 18,2010 Fig 1: Basic coding structure for H.264 /AVC for a macroblock [1] .The

More information

Extensions of H.264/AVC for Multiview Video Compression

Extensions of H.264/AVC for Multiview Video Compression MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Extensions of H.264/AVC for Multiview Video Compression Emin Martinian, Alexander Behrens, Jun Xin, Anthony Vetro, Huifang Sun TR2006-048 June

More information

High Efficient Intra Coding Algorithm for H.265/HVC

High Efficient Intra Coding Algorithm for H.265/HVC H.265/HVC における高性能符号化アルゴリズムに関する研究 宋天 1,2* 三木拓也 2 島本隆 1,2 High Efficient Intra Coding Algorithm for H.265/HVC by Tian Song 1,2*, Takuya Miki 2 and Takashi Shimamoto 1,2 Abstract This work proposes a novel

More information

Template based illumination compensation algorithm for multiview video coding

Template based illumination compensation algorithm for multiview video coding Template based illumination compensation algorithm for multiview video coding Xiaoming Li* a, Lianlian Jiang b, Siwei Ma b, Debin Zhao a, Wen Gao b a Department of Computer Science and technology, Harbin

More information

Adaptation of Scalable Video Coding to Packet Loss and its Performance Analysis

Adaptation of Scalable Video Coding to Packet Loss and its Performance Analysis Adaptation of Scalable Video Coding to Packet Loss and its Performance Analysis Euy-Doc Jang *, Jae-Gon Kim *, Truong Thang**,Jung-won Kang** *Korea Aerospace University, 100, Hanggongdae gil, Hwajeon-dong,

More information

Research Article Fast Macroblock Mode Selection Algorithm for Multiview Video Coding

Research Article Fast Macroblock Mode Selection Algorithm for Multiview Video Coding Hindawi Publishing Corporation EURASIP Journal on Image and Video Processing Volume 28, Article ID 393727, 14 pages doi:11155/28/393727 Research Article Fast Macroblock Mode Selection Algorithm for Multiview

More information

EFFICIENT PU MODE DECISION AND MOTION ESTIMATION FOR H.264/AVC TO HEVC TRANSCODER

EFFICIENT PU MODE DECISION AND MOTION ESTIMATION FOR H.264/AVC TO HEVC TRANSCODER EFFICIENT PU MODE DECISION AND MOTION ESTIMATION FOR H.264/AVC TO HEVC TRANSCODER Zong-Yi Chen, Jiunn-Tsair Fang 2, Tsai-Ling Liao, and Pao-Chi Chang Department of Communication Engineering, National Central

More information

On the Adoption of Multiview Video Coding in Wireless Multimedia Sensor Networks

On the Adoption of Multiview Video Coding in Wireless Multimedia Sensor Networks 2011 Wireless Advanced On the Adoption of Multiview Video Coding in Wireless Multimedia Sensor Networks S. Colonnese, F. Cuomo, O. Damiano, V. De Pascalis and T. Melodia University of Rome, Sapienza, DIET,

More information

Homogeneous Transcoding of HEVC for bit rate reduction

Homogeneous Transcoding of HEVC for bit rate reduction Homogeneous of HEVC for bit rate reduction Ninad Gorey Dept. of Electrical Engineering University of Texas at Arlington Arlington 7619, United States ninad.gorey@mavs.uta.edu Dr. K. R. Rao Fellow, IEEE

More information

OVERVIEW OF IEEE 1857 VIDEO CODING STANDARD

OVERVIEW OF IEEE 1857 VIDEO CODING STANDARD OVERVIEW OF IEEE 1857 VIDEO CODING STANDARD Siwei Ma, Shiqi Wang, Wen Gao {swma,sqwang, wgao}@pku.edu.cn Institute of Digital Media, Peking University ABSTRACT IEEE 1857 is a multi-part standard for multimedia

More information

IN RECENT years, multimedia application has become more

IN RECENT years, multimedia application has become more 578 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 17, NO. 5, MAY 2007 A Fast Algorithm and Its VLSI Architecture for Fractional Motion Estimation for H.264/MPEG-4 AVC Video Coding

More information

STACK ROBUST FINE GRANULARITY SCALABLE VIDEO CODING

STACK ROBUST FINE GRANULARITY SCALABLE VIDEO CODING Journal of the Chinese Institute of Engineers, Vol. 29, No. 7, pp. 1203-1214 (2006) 1203 STACK ROBUST FINE GRANULARITY SCALABLE VIDEO CODING Hsiang-Chun Huang and Tihao Chiang* ABSTRACT A novel scalable

More information

Reducing/eliminating visual artifacts in HEVC by the deblocking filter.

Reducing/eliminating visual artifacts in HEVC by the deblocking filter. 1 Reducing/eliminating visual artifacts in HEVC by the deblocking filter. EE5359 Multimedia Processing Project Proposal Spring 2014 The University of Texas at Arlington Department of Electrical Engineering

More information

Unit-level Optimization for SVC Extractor

Unit-level Optimization for SVC Extractor Unit-level Optimization for SVC Extractor Chang-Ming Lee, Chia-Ying Lee, Bo-Yao Huang, and Kang-Chih Chang Department of Communications Engineering National Chung Cheng University Chiayi, Taiwan changminglee@ee.ccu.edu.tw,

More information

FOR compressed video, due to motion prediction and

FOR compressed video, due to motion prediction and 1390 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 24, NO. 8, AUGUST 2014 Multiple Description Video Coding Based on Human Visual System Characteristics Huihui Bai, Weisi Lin, Senior

More information

Reduced Frame Quantization in Video Coding

Reduced Frame Quantization in Video Coding Reduced Frame Quantization in Video Coding Tuukka Toivonen and Janne Heikkilä Machine Vision Group Infotech Oulu and Department of Electrical and Information Engineering P. O. Box 500, FIN-900 University

More information

H.264 to MPEG-4 Transcoding Using Block Type Information

H.264 to MPEG-4 Transcoding Using Block Type Information 1568963561 1 H.264 to MPEG-4 Transcoding Using Block Type Information Jae-Ho Hur and Yung-Lyul Lee Abstract In this paper, we propose a heterogeneous transcoding method of converting an H.264 video bitstream

More information

View Synthesis for Multiview Video Compression

View Synthesis for Multiview Video Compression MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com View Synthesis for Multiview Video Compression Emin Martinian, Alexander Behrens, Jun Xin, and Anthony Vetro TR2006-035 April 2006 Abstract

More information

Cross-Layer Optimization for Efficient Delivery of Scalable Video over WiMAX Lung-Jen Wang 1, a *, Chiung-Yun Chang 2,b and Jen-Yi Huang 3,c

Cross-Layer Optimization for Efficient Delivery of Scalable Video over WiMAX Lung-Jen Wang 1, a *, Chiung-Yun Chang 2,b and Jen-Yi Huang 3,c Applied Mechanics and Materials Submitted: 2016-06-28 ISSN: 1662-7482, Vol. 855, pp 171-177 Revised: 2016-08-13 doi:10.4028/www.scientific.net/amm.855.171 Accepted: 2016-08-23 2017 Trans Tech Publications,

More information

DISPARITY-ADJUSTED 3D MULTI-VIEW VIDEO CODING WITH DYNAMIC BACKGROUND MODELLING

DISPARITY-ADJUSTED 3D MULTI-VIEW VIDEO CODING WITH DYNAMIC BACKGROUND MODELLING DISPARITY-ADJUSTED 3D MULTI-VIEW VIDEO CODING WITH DYNAMIC BACKGROUND MODELLING Manoranjan Paul and Christopher J. Evans School of Computing and Mathematics, Charles Sturt University, Australia Email:

More information

Reduced 4x4 Block Intra Prediction Modes using Directional Similarity in H.264/AVC

Reduced 4x4 Block Intra Prediction Modes using Directional Similarity in H.264/AVC Proceedings of the 7th WSEAS International Conference on Multimedia, Internet & Video Technologies, Beijing, China, September 15-17, 2007 198 Reduced 4x4 Block Intra Prediction Modes using Directional

More information

A VLSI Architecture for H.264/AVC Variable Block Size Motion Estimation

A VLSI Architecture for H.264/AVC Variable Block Size Motion Estimation Journal of Automation and Control Engineering Vol. 3, No. 1, February 20 A VLSI Architecture for H.264/AVC Variable Block Size Motion Estimation Dam. Minh Tung and Tran. Le Thang Dong Center of Electrical

More information

Novel United Buffer Rate Control Methods for Stereoscopic Video

Novel United Buffer Rate Control Methods for Stereoscopic Video JOURNAL OF SOFTWARE, VOL. 8, NO. 8, AUGUST 2013 2015 Novel United Buffer Rate Control Methods for Stereoscopic Video Yi Liao Faculty of Information Science and Engineering, Ningbo University, Ningbo, China

More information

Overview of Multiview Video Coding and Anti-Aliasing for 3D Displays

Overview of Multiview Video Coding and Anti-Aliasing for 3D Displays MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Overview of Multiview Video Coding and Anti-Aliasing for 3D Displays Anthony Vetro, Sehoon Yea, Matthias Zwicker, Wojciech Matusik, Hanspeter

More information

H.264/AVC Baseline Profile to MPEG-4 Visual Simple Profile Transcoding to Reduce the Spatial Resolution

H.264/AVC Baseline Profile to MPEG-4 Visual Simple Profile Transcoding to Reduce the Spatial Resolution H.264/AVC Baseline Profile to MPEG-4 Visual Simple Profile Transcoding to Reduce the Spatial Resolution Jae-Ho Hur, Hyouk-Kyun Kwon, Yung-Lyul Lee Department of Internet Engineering, Sejong University,

More information

BANDWIDTH-EFFICIENT ENCODER FRAMEWORK FOR H.264/AVC SCALABLE EXTENSION. Yi-Hau Chen, Tzu-Der Chuang, Yu-Jen Chen, and Liang-Gee Chen

BANDWIDTH-EFFICIENT ENCODER FRAMEWORK FOR H.264/AVC SCALABLE EXTENSION. Yi-Hau Chen, Tzu-Der Chuang, Yu-Jen Chen, and Liang-Gee Chen BANDWIDTH-EFFICIENT ENCODER FRAMEWORK FOR H.264/AVC SCALABLE EXTENSION Yi-Hau Chen, Tzu-Der Chuang, Yu-Jen Chen, and Liang-Gee Chen DSP/IC Design Lab., Graduate Institute of Electronics Engineering, National

More information

Fast Mode Decision for Depth Video Coding Using H.264/MVC *

Fast Mode Decision for Depth Video Coding Using H.264/MVC * JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 31, 1693-1710 (2015) Fast Mode Decision for Depth Video Coding Using H.264/MVC * CHIH-HUNG LU, HAN-HSUAN LIN AND CHIH-WEI TANG* Department of Communication

More information

HEVC based Stereo Video codec

HEVC based Stereo Video codec based Stereo Video B Mallik*, A Sheikh Akbari*, P Bagheri Zadeh *School of Computing, Creative Technology & Engineering, Faculty of Arts, Environment & Technology, Leeds Beckett University, U.K. b.mallik6347@student.leedsbeckett.ac.uk,

More information

Bit Allocation for Spatial Scalability in H.264/SVC

Bit Allocation for Spatial Scalability in H.264/SVC Bit Allocation for Spatial Scalability in H.264/SVC Jiaying Liu 1, Yongjin Cho 2, Zongming Guo 3, C.-C. Jay Kuo 4 Institute of Computer Science and Technology, Peking University, Beijing, P.R. China 100871

More information

Testing HEVC model HM on objective and subjective way

Testing HEVC model HM on objective and subjective way Testing HEVC model HM-16.15 on objective and subjective way Zoran M. Miličević, Jovan G. Mihajlović and Zoran S. Bojković Abstract This paper seeks to provide performance analysis for High Efficient Video

More information

Performance Comparison between DWT-based and DCT-based Encoders

Performance Comparison between DWT-based and DCT-based Encoders , pp.83-87 http://dx.doi.org/10.14257/astl.2014.75.19 Performance Comparison between DWT-based and DCT-based Encoders Xin Lu 1 and Xuesong Jin 2 * 1 School of Electronics and Information Engineering, Harbin

More information

Adaptive Up-Sampling Method Using DCT for Spatial Scalability of Scalable Video Coding IlHong Shin and Hyun Wook Park, Senior Member, IEEE

Adaptive Up-Sampling Method Using DCT for Spatial Scalability of Scalable Video Coding IlHong Shin and Hyun Wook Park, Senior Member, IEEE 206 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL 19, NO 2, FEBRUARY 2009 Adaptive Up-Sampling Method Using DCT for Spatial Scalability of Scalable Video Coding IlHong Shin and Hyun

More information

IBM Research Report. Inter Mode Selection for H.264/AVC Using Time-Efficient Learning-Theoretic Algorithms

IBM Research Report. Inter Mode Selection for H.264/AVC Using Time-Efficient Learning-Theoretic Algorithms RC24748 (W0902-063) February 12, 2009 Electrical Engineering IBM Research Report Inter Mode Selection for H.264/AVC Using Time-Efficient Learning-Theoretic Algorithms Yuri Vatis Institut für Informationsverarbeitung

More information

Efficient MPEG-2 to H.264/AVC Intra Transcoding in Transform-domain

Efficient MPEG-2 to H.264/AVC Intra Transcoding in Transform-domain MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Efficient MPEG- to H.64/AVC Transcoding in Transform-domain Yeping Su, Jun Xin, Anthony Vetro, Huifang Sun TR005-039 May 005 Abstract In this

More information

Title Adaptive Lagrange Multiplier for Low Bit Rates in H.264.

Title Adaptive Lagrange Multiplier for Low Bit Rates in H.264. Provided by the author(s) and University College Dublin Library in accordance with publisher policies. Please cite the published version when available. Title Adaptive Lagrange Multiplier for Low Bit Rates

More information

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework System Modeling and Implementation of MPEG-4 Encoder under Fine-Granular-Scalability Framework Literature Survey Embedded Software Systems Prof. B. L. Evans by Wei Li and Zhenxun Xiao March 25, 2002 Abstract

More information

Wyner Ziv-Based Multiview Video Coding Xun Guo, Yan Lu, Member, IEEE, Feng Wu, Senior Member, IEEE, Debin Zhao, and Wen Gao, Senior Member, IEEE

Wyner Ziv-Based Multiview Video Coding Xun Guo, Yan Lu, Member, IEEE, Feng Wu, Senior Member, IEEE, Debin Zhao, and Wen Gao, Senior Member, IEEE IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 18, NO. 6, JUNE 2008 713 Wyner Ziv-Based Multiview Video Coding Xun Guo, Yan Lu, Member, IEEE, Feng Wu, Senior Member, IEEE, Debin Zhao,

More information

Toward Optimal Pixel Decimation Patterns for Block Matching in Motion Estimation

Toward Optimal Pixel Decimation Patterns for Block Matching in Motion Estimation th International Conference on Advanced Computing and Communications Toward Optimal Pixel Decimation Patterns for Block Matching in Motion Estimation Avishek Saha Department of Computer Science and Engineering,

More information

Rate-distortion Optimized Streaming of Compressed Light Fields with Multiple Representations

Rate-distortion Optimized Streaming of Compressed Light Fields with Multiple Representations Rate-distortion Optimized Streaming of Compressed Light Fields with Multiple Representations Prashant Ramanathan and Bernd Girod Department of Electrical Engineering Stanford University Stanford CA 945

More information

BANDWIDTH REDUCTION SCHEMES FOR MPEG-2 TO H.264 TRANSCODER DESIGN

BANDWIDTH REDUCTION SCHEMES FOR MPEG-2 TO H.264 TRANSCODER DESIGN BANDWIDTH REDUCTION SCHEMES FOR MPEG- TO H. TRANSCODER DESIGN Xianghui Wei, Wenqi You, Guifen Tian, Yan Zhuang, Takeshi Ikenaga, Satoshi Goto Graduate School of Information, Production and Systems, Waseda

More information

A Survey on Early Determination of Zero Quantized Coefficients in Video Coding

A Survey on Early Determination of Zero Quantized Coefficients in Video Coding A Survey on Early Determination of Zero Quantized Coefficients in Video Coding S. Immanuel Alex Pandian Dr. G. Josemin Bala A. Anci Manon Mary Asst. Prof., Dept. of. ECE, Prof. & Head, Dept. of EMT PG

More information

VIDEO streaming applications over the Internet are gaining. Brief Papers

VIDEO streaming applications over the Internet are gaining. Brief Papers 412 IEEE TRANSACTIONS ON BROADCASTING, VOL. 54, NO. 3, SEPTEMBER 2008 Brief Papers Redundancy Reduction Technique for Dual-Bitstream MPEG Video Streaming With VCR Functionalities Tak-Piu Ip, Yui-Lam Chan,

More information

A NOVEL SCANNING SCHEME FOR DIRECTIONAL SPATIAL PREDICTION OF AVS INTRA CODING

A NOVEL SCANNING SCHEME FOR DIRECTIONAL SPATIAL PREDICTION OF AVS INTRA CODING A NOVEL SCANNING SCHEME FOR DIRECTIONAL SPATIAL PREDICTION OF AVS INTRA CODING Md. Salah Uddin Yusuf 1, Mohiuddin Ahmad 2 Assistant Professor, Dept. of EEE, Khulna University of Engineering & Technology

More information

Block-based Watermarking Using Random Position Key

Block-based Watermarking Using Random Position Key IJCSNS International Journal of Computer Science and Network Security, VOL.9 No.2, February 2009 83 Block-based Watermarking Using Random Position Key Won-Jei Kim, Jong-Keuk Lee, Ji-Hong Kim, and Ki-Ryong

More information

An Information Hiding Algorithm for HEVC Based on Angle Differences of Intra Prediction Mode

An Information Hiding Algorithm for HEVC Based on Angle Differences of Intra Prediction Mode An Information Hiding Algorithm for HEVC Based on Angle Differences of Intra Prediction Mode Jia-Ji Wang1, Rang-Ding Wang1*, Da-Wen Xu1, Wei Li1 CKC Software Lab, Ningbo University, Ningbo, Zhejiang 3152,

More information

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 19, NO. 9, SEPTEMBER

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 19, NO. 9, SEPTEMBER IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 19, NO. 9, SEPTEER 2009 1389 Transactions Letters Robust Video Region-of-Interest Coding Based on Leaky Prediction Qian Chen, Xiaokang

More information

A reversible data hiding based on adaptive prediction technique and histogram shifting

A reversible data hiding based on adaptive prediction technique and histogram shifting A reversible data hiding based on adaptive prediction technique and histogram shifting Rui Liu, Rongrong Ni, Yao Zhao Institute of Information Science Beijing Jiaotong University E-mail: rrni@bjtu.edu.cn

More information

H.264/AVC BASED NEAR LOSSLESS INTRA CODEC USING LINE-BASED PREDICTION AND MODIFIED CABAC. Jung-Ah Choi, Jin Heo, and Yo-Sung Ho

H.264/AVC BASED NEAR LOSSLESS INTRA CODEC USING LINE-BASED PREDICTION AND MODIFIED CABAC. Jung-Ah Choi, Jin Heo, and Yo-Sung Ho H.264/AVC BASED NEAR LOSSLESS INTRA CODEC USING LINE-BASED PREDICTION AND MODIFIED CABAC Jung-Ah Choi, Jin Heo, and Yo-Sung Ho Gwangju Institute of Science and Technology {jachoi, jinheo, hoyo}@gist.ac.kr

More information

Next-Generation 3D Formats with Depth Map Support

Next-Generation 3D Formats with Depth Map Support MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Next-Generation 3D Formats with Depth Map Support Chen, Y.; Vetro, A. TR2014-016 April 2014 Abstract This article reviews the most recent extensions

More information

Error Concealment Used for P-Frame on Video Stream over the Internet

Error Concealment Used for P-Frame on Video Stream over the Internet Error Concealment Used for P-Frame on Video Stream over the Internet MA RAN, ZHANG ZHAO-YANG, AN PING Key Laboratory of Advanced Displays and System Application, Ministry of Education School of Communication

More information

Pattern based Residual Coding for H.264 Encoder *

Pattern based Residual Coding for H.264 Encoder * Pattern based Residual Coding for H.264 Encoder * Manoranjan Paul and Manzur Murshed Gippsland School of Information Technology, Monash University, Churchill, Vic-3842, Australia E-mail: {Manoranjan.paul,

More information

Investigation of the GoP Structure for H.26L Video Streams
