LBP-GUIDED DEPTH IMAGE FILTER. Rui Zhong, Ruimin Hu
|
|
- Natalie Grant
- 5 years ago
- Views:
Transcription
1 LBP-GUIDED DEPTH IMAGE FILTER Rui Zhong, Ruimin Hu National Engineering Research Center for Multimedia Software,School of Computer, Wuhan University,Wuhan, , China ABSTRACT The multi-view video plus depth (MVD) format has been put forward for the call for proposals in free view video (FVV) and 3DTV. Since representing the 3D scene geometry, depth maps are used for synthesizing virtual views. However, compression artifacts of the depth images always lead to geometry distortions in synthesized views. By exploiting LBP features of the corresponding color samples, we propose a novel local binary pattern (LBP) guided depth filter which enables the local neighborhood samples those are in the same object of the current pixel to be filtering input. In recognition of its ability for describing the object edges, the LBP operator is used to calculate the weighted values of the local depth pixels for the depth-map filter. Furthermore, the filter is incorporated into the framework of H.264/MVC as an in-loop filter. The experimental results demonstrate that the proposed approach offers 0.45dB and 0.66dB average PSNR gains in terms of video rendering quality and depth coding efficiency, as well as significant subjective improvement in rendering views. 1. INTRODUCTION 3D TV and movies which provide people with favorable immersion and visual experiences accelerate the development of corresponding technologies. Although multi-view videos of a scene are captured at the same time to support the stereo perception, it is still difficult to generate the 3D perception at free view during interactive activities with users. Therefore, how to render the free views from the available MVD becomes a hot issue. P. Merkle illustrated the special statistic characteristics of depth image that it consists of large homogeneous regions partitioned by sharp object edges [1]. Therefore, the common quantization and block-based coding unit in hybrid coding framework, like H.264/AVC, lead to serious artifacts around the object edges in depth image. Since depth map provides the geometry information of a 3D scene, the depth errors further result in geometry distortions in synthesized views. In contrast to the depth errors in homogeneous areas, the pixel displacements around the sharp edges degrade the quality of the synthesized views more seriously [2]. So, it is essential to maintain the sharpness of object edges when coding depth images. In order to meet the backward compatibility of depth codec, K. Oh [3] incorporated an in-loop filter into H.264 framework to denoise depth images. The in-loop filter recovers the noisy pixels from neighboring pixels within the limitation of geometric closeness, photometric similarity and occurrence frequency. This method significantly improves the efficiency of depth coding and generates better synthesized views. However, the corrupted depth pixels may reduce the accuracy of the reconstructed pixels. And the lower the bitrate, the worse the filtering performance. Based on the idea of bilateral filter, Liu [4] presented a trilateral filter in which the structural similarity between color images and corresponding depth maps also contributes to the weighted values. This filter can definitely remove artifacts in decoded depth maps, as well as improve the synthesized video quality under same bitrate, but its performance is not stable if there are few similar pixels around the current pixel. To achieve the stable filtering output, the filter needs to eliminate the effect caused by the neighboring pixels with significantly different values. In this paper we propose a local binary pattern (LBP) guided depth map filter based on bilateral filter. LBP is firstly presented by Timo Ojala [5] as a robust micro-texture descriptor which describes the uniform distribution of the circularly symmetric neighboring pixels. LBP measures the structural image features such as edges, corners, spots and planes accurately. Since depth map filter selects input pixels on the basis of the similarity of the neighboring depth pixels in an object, LBP of the color image can be used to determine whether the near pixels belong to the same object of current pixels. In addition, LBP is robust to the monotonic gray scale transformation and quantization [6], the slight quantization errors in depth maps do not decrease its description accuracy. Hence, using the LBP of color images to limit the input of the filter is more robust than calculating the weighted vales depending on the luminance component of corresponding color images. The remainder of the paper is organized as follows: the encoding framework is presented in Section 2. Section 3 demonstrates the LBP-guided filter. Section 4 analyzes the experimental results. The paper is concluded in Section 5.
2 2. THE ENCODING FRAMEWORK The depth video could be coded by H.264/MVC as luminance component of the color video. Since representing the geometry information of the 3D scene, it is mainly used to synthesize the virtual view video, not to display directly. Therefore, the block-based encoding and the quantization of depth map lead to serious artifacts in rendering image, such as blocking artifacts, ringing or blurring [7]. To eliminate the artifacts in the depth reconstruction image, this paper designs a depth filter under the guidance of LBP operator which describes the color image structure accurately. Finally, the filter is added into the framework of H.264/MVC codec as an in-loop filter. The specific framework is shown in Fig LBP operator For each pixel g c of the color image, the Local Binary Pattern (LBP) operator is derived from the local circularly symmetric neighbors set containing P members on a circle of radius R, denoted as LBP P,R. The variant P controls the quantization of the angular space, and R is related to the space resolution of the operator[5]. As shown in Fig. 2, it is the location of the neighboring pixels set, where R equals 1 and P is assigned as THE LBP GUIDED DEPTH FILTER Bilateral filter is a non-linear filter that generates the filtering output by the weighted average of the neighboring input pixels. The weighted values are calculated depending on geometric closeness and photometric similarity between the input pixels and the current filtered pixel [8]. The trilateral filter is to exploit not only the photometric similarity in depth image, but also the photometric similarity in corresponding color image that has the similar structure with depth image. By doing so, the smaller weighted values are set for near outlier pixels for producing a smooth depth image while preserving the edges [4]. However, a number of dissimilarity pixels around the current pixel will result in an unstable filtering output. The pixels within an object correspond to similar intensity values in depth-image. To eliminate the unstable influence taken by dissimilarity input pixels, the proposed depth-image filter only select the local pixels within the object which consists of the current filtered pixel as filtering input. Due to the structural similarity between depth and color images, the LBP feature of color image is used to identify the pixels which are in the same object of current pixel. And then those similar depth pixels are taken as filtering input to ensure the stable filtering output. Fig. 2. The locations of the current pixel and the neighboring pixels set T represents the the derivation of the texture operator, the joint distribution of local near pixels could be expressed as, T = T (g c, g 0,, g P 1 ) (1) Where g c is the color sample value at the location of the current filtered depth pixel, and g p (p = 0, P 1) denotes the color sample values of its local circularly symmetric neighboring pixels. While the differences (g p g c ) are independent of g c, the joint distribution of gray values is transformed into joint difference distribution [9]. T T (g 0 g c,, g P 1 g c ) (2) Since the signs of the difference s (g 0 g c ) will not vary with the average luminance, we let the binary signs s (g 0 g c ) replace the gray shift. where, { 1, x 0 s (x) = (3) 0, x < 0 Fig. 1. Framework of H.264/MVC codec for a view 3.2. LBP guided filtering Except for geometric closeness and photometric similarity in depth image, the LBP-guided filter also depends on the LBP
3 operator of the corresponding color image at the same location. In local window w which is labeled in Fig. 2,the depth sample value d p (p = 0, P 1) of the neighboring pixels are weighted to get the filtered output G c for the current filtered depth pixel d c. The LBP-guided filter is finally given by equation (4). G c = 1 F c G S (d p d c ) G Q ( p c ) LBP uni P,R (g p g c ) d p (4) Where, LBPP,R uni is the uniform local binary pattern of the near color pixels in window w. In expression (5), we calculate the summed weighted value F c, the range filter kernel G S (d p d c ) with the photometric similarity (d p d c ), and the spatial kernel G Q (p c) with the geometric closeness p c. F c = G S (d p d c ) G Q ( p c ) LBPP,R uni (g p g c ) ) G S (d p d c ) = exp ( dp dc 2 σs 2 ) G Q ( p c ) = exp ( xp xc 2 + y p y c 2 (5) The neighboring pixels in local window around the depth object edge belong to different objects. In our filter, only the pixels which are in the same object of the current filtered pixel will be taken as the filtering input. As shown in Fig. 3, the LBP operator of the color image is the guidance of local pixel classification by its ability of edge description. For the purpose of describing the object edge direction accurately, the proposed filter omits the procedure of rotating the LBP operator which is used to deduce the minimum value in [5]. Therefore, the operator LBP P,R equals to binary joint distribution of T. LBP P,R = T (s (g 0 g c ),, s (g P 1 g c )) LBP P,R (g p g c ) = s (g p g c ), p [0, P 1] In Fig. 3, the LBP operator classifies the local pixels in local window w into two sets by binary labels, one of which contains the similar pixels with the current filtered pixel while the other set consists of dissimilarity pixels. However, the LBP operator is unable to justify which set the current pixel d c should belong to. In equation (7), we compare the distances from the pixel sets divided by LBP operator to the current filtered pixel. And then the LBP values of the pixels set with closer distance is assigned as 1. LBP uni { P,R (g p g c ) = σ 2 Q (6) LBP P,R (g p g c ), if sum zero sum one 1 LBP P,R (g p g c ), if sum zero < sum one (7) Where p [0, P 1], the variables sum zero and sum one represent the distances from the pixels sets partitioned by LBP operator to the current pixel respectively, as shown in equation (8): sum zero = LBP P,R (g p g c ) d p d c sum one = 1 LBP P,R (g p g c ) d p d c Where, d p (p [0, P 1]) are the near pixels around the current filtered pixel in window w, and d c denotes the current filtered pixel in depth-image. Fig. 3. Edge description of LBP operator 4. THE EXPERIMENT RESULTS The critical contribution of depth video is to assist the synthesis of virtual view, not to display directly. Therefore we evaluate the effectiveness of depth-image filter not only by the coding efficiency of depth image, but also by the quality of virtual view. The trilateral filter proposed by [4] is labeled method 1 as the reference method. Table 1. Depth coding result for ballet sequence The encoding efficiency of the depth image The coding efficiency of the depth-video is evaluated in terms of PSNR and bitrate. The proposed in-loop filter is implemented in reference software JMVC 6.0 [10]. The encoding configuration is follow the standard given by JVT [11]. We set 4 basic QPs 22, 27, 32 and 37 for depth video, and 1 basic QP 22 for color video. 100 frames are coded with 2 temporal reference pictures and 2 reference views. And the search range is ±96. (8)
4 Table 2. Depth coding result for breakdancer sequence This paper uses the sequences Ballet and Breakdancer with resolution of 1024*768 provided by Microsoft. In the two MVD sequences, view 0 and view 2 are selected as the reference views, and view 1 is the virtual view. Table. 1 and Table. 2 give the depth coding results of view 2 for t- wo sequences. And the rate-distortion (RD) curves in terms of bitrate and depth-image objective quality are shown in Fig. 4(a) and Fig. 4(b). (a) 4.2. The quality of the rendered video The software VSRS 2.3 [12] is used to render the virtual view. In the two MVD sequences, view 0 and view 2 are selected as the reference views, and view 1 is the virtual view. The PSNR value is calculated based on the difference between the virtual view image and the original image. While the virtual view image is rendered from the filtered depth image, the o- riginal image is derived from the original depth image. Table. 3, and Table. 4 illustrate the rendering results for Ballet and Breakdancer. The RD curves in terms of depth rate and rendering objective quality are shown in the Fig. 5. The subjective results are shown in Fig. 6. Table 3. Rendering result for ballet sequence Table 4. Rendering result for breakdancer sequence (b) Fig. 4. (a): Rate-distortion curves in terms of depth bitrate and depth quality (ballet). (b): Rate-distortion curves in terms of depth bitrate and depth quality (breakdancer) Analysis of the results Based on the above results, we will calculate the bitrate saving with the Bjontegaard metric [13], which computes average PSNR difference between RD-curves. Fig. 4(a) and Fig. 4(b). show that the proposed depth filter reaches better depth coding efficiency compared to the state-of-the-art approach [4]. The PSNR gains of depth image achieve up to 0.64d- B and 0.69dB, which translate to a bit rate savings of 6.55% and 10.45% for Ballet and Breakdancer. Therefore, the average PSNR gain is 0.66 db for depth coding. Fig. 5(a) and Fig. 5(b) demonstrate that the rendering quality achieves PSNR gains of 0.26dB and 0.63dB, which correspond to a bit rate savings of 8.19% and 23.58% for Ballet and Breakdancer respectively. Thus the average PSNR gain is 0.45dB for rendering video. However, the proposed filter results in a little more bitrate cost for depth video coding at the status of low bitrate. From the subjective results shown in Fig. 6, we conclude that the rendering image quality generated by our LBP-guided filter is better than method 1. Especially the hole and the blur are relieved by our method.
5 International Conference on, sept. 2010, pp [2] Y. Morvan P. Merkle and A. Smolic, The effects of multiview depth video compression on multiview rendering, Signal Processing: Image Communication, vol. 24, no. 1-2, pp. 73C88, January (a) [3] Kwan-Jung Oh, A. Vetro, and Yo-Sung Ho, Depth coding using a boundary reconstruction filter for 3-d video systems, Circuits and Systems for Video Technology, IEEE Transactions on, vol. 21, no. 3, pp , march [4] Shujie Liu, PoLin Lai, Dong Tian, and Chang Wen Chen, New depth coding techniques with utilization of corresponding video, Broadcasting, IEEE Transactions on, vol. 57, no. 2, pp , june [5] T. Ojala, M. Pietikainen, and T. Maenpaa, Multiresolution gray-scale and rotation invariant texture classification with local binary patterns, Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 24, no. 7, pp , jul (b) Fig. 5. (a): Rate-distortion curves in terms of depth bitrate and rendering quality (ballet). (b): Rate-distortion curves in terms of depth bitrate and depth quality (breakdancer). 5. CONCLUSION To solve the problem that the neighboring depth pixels with significantly different values will introduce unstable filtering output result, we made the first attempt to let the LBP operator guide the depth image filtering. Since LBP describes the structural feature of the color image accurately, it is used to select the similar near pixels which are in the same object with the current filtered pixel. The experiment results demonstrate that the proposed depth filter achieves better depth-image coding efficiency, rendering quality and subjective effect than the trilateral depth filter which is the optimal method currently. However, the bitrate cost of depth video is a little increased at low bitrate situation. This problem will be researched in our future work. 6. REFERENCES [1] P. Merkle, J.B. Singla, K. Muller, and T. Wiegand, Correlation histogram analysis of depth-enhanced 3d video coding, in Image Processing (ICIP), th IEEE [6] B. Widrow, I. Kollar, and Ming-Chang Liu, Statistical theory of quantization, Instrumentation and Measurement, IEEE Transactions on, vol. 45, no. 2, pp , apr [7] C. Dorea P. Yin P. Lai, A. Ortega and C. Gomila, Statistical theory of quantization, in Proc. of Visual Communic. and Image Proc., VCIP 09. San Jose, CA, USA,, Jan [8] C. Tomasi and R. Manduchi, Bilateral filtering for gray and color images, in Computer Vision, Sixth International Conference on, jan 1998, pp [9] Timo Ojala, Kimmo Valkealahti, Erkki Oja, and Matti Pietikinen, Texture discrimination with multidimensional distributions of signed gray-level differences, Pattern Recognition, vol. 34, no. 3, pp , [10] ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6, Draft reference software for mvc, in JVT- AE207,London, June [11] ISO/IEC JTC1/SC29/WG11, Common test conditions for multi-view video coding, in JVT- U211,Hangzhou,China, October [12] Kazuyoshi Suzuki, Reference software for view synthesis version 2.3, in m16090, June [13] G. Bjontegaard, Common test conditions for multiview video coding, in VCEG Contribution VCEG-M33, 13th VCEG Meeting, Austin, Texas, USA, April 2001.
6 (a1) (b1) (a2) (b2) (a3) (a4) (b3) (b4) (a5) (a6) (b5) (b6) Fig. 6. Subjective rendering image results for sequences Ballet(a) and Breakdancer(b). (a1) and (b1) Results by method 1. (a2) and (b2) Results by our method. (a3) To (a6) Magnified results of Ballet : (a3)(a5), method 1. (a4)(a6), our method. (b3) To (b6) Magnified results of Breakdancer : (b3)(b5), method 1. (b4)(b6), our method.
Efficient Techniques for Depth Video Compression Using Weighted Mode Filtering
1 Efficient Techniques for Depth Video Compression Using Weighted Mode Filtering Viet-Anh Nguyen, Dongbo Min, Member, IEEE, and Minh N. Do, Senior Member, IEEE Abstract This paper proposes efficient techniques
More informationFocus on visual rendering quality through content-based depth map coding
Focus on visual rendering quality through content-based depth map coding Emilie Bosc, Muriel Pressigout, Luce Morin To cite this version: Emilie Bosc, Muriel Pressigout, Luce Morin. Focus on visual rendering
More informationDepth Map Boundary Filter for Enhanced View Synthesis in 3D Video
J Sign Process Syst (2017) 88:323 331 DOI 10.1007/s11265-016-1158-x Depth Map Boundary Filter for Enhanced View Synthesis in 3D Video Yunseok Song 1 & Yo-Sung Ho 1 Received: 24 April 2016 /Accepted: 7
More informationView Synthesis Prediction for Rate-Overhead Reduction in FTV
MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com View Synthesis Prediction for Rate-Overhead Reduction in FTV Sehoon Yea, Anthony Vetro TR2008-016 June 2008 Abstract This paper proposes the
More informationCountermeasure for the Protection of Face Recognition Systems Against Mask Attacks
Countermeasure for the Protection of Face Recognition Systems Against Mask Attacks Neslihan Kose, Jean-Luc Dugelay Multimedia Department EURECOM Sophia-Antipolis, France {neslihan.kose, jean-luc.dugelay}@eurecom.fr
More informationView Synthesis for Multiview Video Compression
View Synthesis for Multiview Video Compression Emin Martinian, Alexander Behrens, Jun Xin, and Anthony Vetro email:{martinian,jxin,avetro}@merl.com, behrens@tnt.uni-hannover.de Mitsubishi Electric Research
More informationAnalysis of Depth Map Resampling Filters for Depth-based 3D Video Coding
MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Analysis of Depth Map Resampling Filters for Depth-based 3D Video Coding Graziosi, D.B.; Rodrigues, N.M.M.; de Faria, S.M.M.; Tian, D.; Vetro,
More informationDepth Estimation for View Synthesis in Multiview Video Coding
MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Depth Estimation for View Synthesis in Multiview Video Coding Serdar Ince, Emin Martinian, Sehoon Yea, Anthony Vetro TR2007-025 June 2007 Abstract
More informationView Generation for Free Viewpoint Video System
View Generation for Free Viewpoint Video System Gangyi JIANG 1, Liangzhong FAN 2, Mei YU 1, Feng Shao 1 1 Faculty of Information Science and Engineering, Ningbo University, Ningbo, 315211, China 2 Ningbo
More informationHomogeneous Transcoding of HEVC for bit rate reduction
Homogeneous of HEVC for bit rate reduction Ninad Gorey Dept. of Electrical Engineering University of Texas at Arlington Arlington 7619, United States ninad.gorey@mavs.uta.edu Dr. K. R. Rao Fellow, IEEE
More informationView Synthesis for Multiview Video Compression
MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com View Synthesis for Multiview Video Compression Emin Martinian, Alexander Behrens, Jun Xin, and Anthony Vetro TR2006-035 April 2006 Abstract
More informationDEPTH-LEVEL-ADAPTIVE VIEW SYNTHESIS FOR 3D VIDEO
DEPTH-LEVEL-ADAPTIVE VIEW SYNTHESIS FOR 3D VIDEO Ying Chen 1, Weixing Wan 2, Miska M. Hannuksela 3, Jun Zhang 2, Houqiang Li 2, and Moncef Gabbouj 1 1 Department of Signal Processing, Tampere University
More informationFast Mode Decision for Depth Video Coding Using H.264/MVC *
JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 31, 1693-1710 (2015) Fast Mode Decision for Depth Video Coding Using H.264/MVC * CHIH-HUNG LU, HAN-HSUAN LIN AND CHIH-WEI TANG* Department of Communication
More informationAn Efficient Table Prediction Scheme for CAVLC
An Efficient Table Prediction Scheme for CAVLC 1. Introduction Jin Heo 1 Oryong-Dong, Buk-Gu, Gwangju, 0-712, Korea jinheo@gist.ac.kr Kwan-Jung Oh 1 Oryong-Dong, Buk-Gu, Gwangju, 0-712, Korea kjoh81@gist.ac.kr
More informationKey-Words: - Free viewpoint video, view generation, block based disparity map, disparity refinement, rayspace.
New View Generation Method for Free-Viewpoint Video System GANGYI JIANG*, LIANGZHONG FAN, MEI YU AND FENG SHAO Faculty of Information Science and Engineering Ningbo University 315211 Ningbo CHINA jianggangyi@126.com
More informationFast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE Gaurav Hansda
Fast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE 5359 Gaurav Hansda 1000721849 gaurav.hansda@mavs.uta.edu Outline Introduction to H.264 Current algorithms for
More informationAnalysis of 3D and Multiview Extensions of the Emerging HEVC Standard
MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Analysis of 3D and Multiview Extensions of the Emerging HEVC Standard Vetro, A.; Tian, D. TR2012-068 August 2012 Abstract Standardization of
More informationBRIEF Features for Texture Segmentation
BRIEF Features for Texture Segmentation Suraya Mohammad 1, Tim Morris 2 1 Communication Technology Section, Universiti Kuala Lumpur - British Malaysian Institute, Gombak, Selangor, Malaysia 2 School of
More informationNext-Generation 3D Formats with Depth Map Support
MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Next-Generation 3D Formats with Depth Map Support Chen, Y.; Vetro, A. TR2014-016 April 2014 Abstract This article reviews the most recent extensions
More informationAn adaptive preprocessing algorithm for low bitrate video coding *
Li et al. / J Zhejiang Univ SCIENCE A 2006 7(12):2057-2062 2057 Journal of Zhejiang University SCIENCE A ISSN 1009-3095 (Print); ISSN 1862-1775 (Online) www.zju.edu.cn/jzus; www.springerlink.com E-mail:
More informationWITH the improvements in high-speed networking, highcapacity
134 IEEE TRANSACTIONS ON BROADCASTING, VOL. 62, NO. 1, MARCH 2016 A Virtual View PSNR Estimation Method for 3-D Videos Hui Yuan, Member, IEEE, Sam Kwong, Fellow, IEEE, Xu Wang, Student Member, IEEE, Yun
More informationLow-Complexity, Near-Lossless Coding of Depth Maps from Kinect-Like Depth Cameras
Low-Complexity, Near-Lossless Coding of Depth Maps from Kinect-Like Depth Cameras Sanjeev Mehrotra, Zhengyou Zhang, Qin Cai, Cha Zhang, Philip A. Chou Microsoft Research Redmond, WA, USA {sanjeevm,zhang,qincai,chazhang,pachou}@microsoft.com
More informationH.264/AVC BASED NEAR LOSSLESS INTRA CODEC USING LINE-BASED PREDICTION AND MODIFIED CABAC. Jung-Ah Choi, Jin Heo, and Yo-Sung Ho
H.264/AVC BASED NEAR LOSSLESS INTRA CODEC USING LINE-BASED PREDICTION AND MODIFIED CABAC Jung-Ah Choi, Jin Heo, and Yo-Sung Ho Gwangju Institute of Science and Technology {jachoi, jinheo, hoyo}@gist.ac.kr
More informationAn Efficient Mode Selection Algorithm for H.264
An Efficient Mode Selection Algorithm for H.64 Lu Lu 1, Wenhan Wu, and Zhou Wei 3 1 South China University of Technology, Institute of Computer Science, Guangzhou 510640, China lul@scut.edu.cn South China
More informationDeblocking Filter Algorithm with Low Complexity for H.264 Video Coding
Deblocking Filter Algorithm with Low Complexity for H.264 Video Coding Jung-Ah Choi and Yo-Sung Ho Gwangju Institute of Science and Technology (GIST) 261 Cheomdan-gwagiro, Buk-gu, Gwangju, 500-712, Korea
More informationA FRAMEWORK FOR ANALYZING TEXTURE DESCRIPTORS
A FRAMEWORK FOR ANALYZING TEXTURE DESCRIPTORS Timo Ahonen and Matti Pietikäinen Machine Vision Group, University of Oulu, PL 4500, FI-90014 Oulun yliopisto, Finland tahonen@ee.oulu.fi, mkp@ee.oulu.fi Keywords:
More informationOptimized Progressive Coding of Stereo Images Using Discrete Wavelet Transform
Optimized Progressive Coding of Stereo Images Using Discrete Wavelet Transform Torsten Palfner, Alexander Mali and Erika Müller Institute of Telecommunications and Information Technology, University of
More informationOptimizing the Deblocking Algorithm for. H.264 Decoder Implementation
Optimizing the Deblocking Algorithm for H.264 Decoder Implementation Ken Kin-Hung Lam Abstract In the emerging H.264 video coding standard, a deblocking/loop filter is required for improving the visual
More informationOverview of Multiview Video Coding and Anti-Aliasing for 3D Displays
MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Overview of Multiview Video Coding and Anti-Aliasing for 3D Displays Anthony Vetro, Sehoon Yea, Matthias Zwicker, Wojciech Matusik, Hanspeter
More informationDEPTH CODING BASED ON BINARY TREE DECOMPOSITION. Gonçalo Nuno Silva do Carmo
DEPTH CODING BASED ON BINARY TREE DECOMPOSITION Gonçalo Nuno Silva do Carmo Instituto de Telecomunicações Grupo de Imagem Instituto Superior Técnico Av. Rovisco Pais, 049-00 Lisboa, Portugal E-mail: goncalo.do.carmo@ist.utl.pt
More informationFinal report on coding algorithms for mobile 3DTV. Gerhard Tech Karsten Müller Philipp Merkle Heribert Brust Lina Jin
Final report on coding algorithms for mobile 3DTV Gerhard Tech Karsten Müller Philipp Merkle Heribert Brust Lina Jin MOBILE3DTV Project No. 216503 Final report on coding algorithms for mobile 3DTV Gerhard
More informationOn the Adoption of Multiview Video Coding in Wireless Multimedia Sensor Networks
2011 Wireless Advanced On the Adoption of Multiview Video Coding in Wireless Multimedia Sensor Networks S. Colonnese, F. Cuomo, O. Damiano, V. De Pascalis and T. Melodia University of Rome, Sapienza, DIET,
More informationMulti-View Image Coding in 3-D Space Based on 3-D Reconstruction
Multi-View Image Coding in 3-D Space Based on 3-D Reconstruction Yongying Gao and Hayder Radha Department of Electrical and Computer Engineering, Michigan State University, East Lansing, MI 48823 email:
More informationInternational Journal of Emerging Technology and Advanced Engineering Website: (ISSN , Volume 2, Issue 4, April 2012)
A Technical Analysis Towards Digital Video Compression Rutika Joshi 1, Rajesh Rai 2, Rajesh Nema 3 1 Student, Electronics and Communication Department, NIIST College, Bhopal, 2,3 Prof., Electronics and
More informationAn Efficient Image Compression Using Bit Allocation based on Psychovisual Threshold
An Efficient Image Compression Using Bit Allocation based on Psychovisual Threshold Ferda Ernawan, Zuriani binti Mustaffa and Luhur Bayuaji Faculty of Computer Systems and Software Engineering, Universiti
More informationReducing/eliminating visual artifacts in HEVC by the deblocking filter.
1 Reducing/eliminating visual artifacts in HEVC by the deblocking filter. EE5359 Multimedia Processing Project Proposal Spring 2014 The University of Texas at Arlington Department of Electrical Engineering
More informationCoding of 3D Videos based on Visual Discomfort
Coding of 3D Videos based on Visual Discomfort Dogancan Temel and Ghassan AlRegib School of Electrical and Computer Engineering, Georgia Institute of Technology Atlanta, GA, 30332-0250 USA {cantemel, alregib}@gatech.edu
More informationFAST MOTION ESTIMATION WITH DUAL SEARCH WINDOW FOR STEREO 3D VIDEO ENCODING
FAST MOTION ESTIMATION WITH DUAL SEARCH WINDOW FOR STEREO 3D VIDEO ENCODING 1 Michal Joachimiak, 2 Kemal Ugur 1 Dept. of Signal Processing, Tampere University of Technology, Tampere, Finland 2 Jani Lainema,
More informationEnhanced View Synthesis Prediction for Coding of Non-Coplanar 3D Video Sequences
Enhanced View Synthesis Prediction for Coding of Non-Coplanar 3D Video Sequences Jens Schneider, Johannes Sauer and Mathias Wien Institut für Nachrichtentechnik, RWTH Aachen University, Germany Abstract
More informationBit Allocation and Encoded View Selection for Optimal Multiview Image Representation
Bit Allocation and Encoded View Selection for Optimal Multiview Image Representation Gene Cheung #, Vladan Velisavljević o # National Institute of Informatics 2-1-2 Hitotsubashi, Chiyoda-ku, Tokyo, Japan
More informationImplementation of a Face Recognition System for Interactive TV Control System
Implementation of a Face Recognition System for Interactive TV Control System Sang-Heon Lee 1, Myoung-Kyu Sohn 1, Dong-Ju Kim 1, Byungmin Kim 1, Hyunduk Kim 1, and Chul-Ho Won 2 1 Dept. IT convergence,
More informationOVERVIEW OF IEEE 1857 VIDEO CODING STANDARD
OVERVIEW OF IEEE 1857 VIDEO CODING STANDARD Siwei Ma, Shiqi Wang, Wen Gao {swma,sqwang, wgao}@pku.edu.cn Institute of Digital Media, Peking University ABSTRACT IEEE 1857 is a multi-part standard for multimedia
More informationarxiv: v1 [cs.mm] 8 May 2018
OPTIMIZATION OF OCCLUSION-INDUCING DEPTH PIXELS IN 3-D VIDEO CODING Pan Gao Cagri Ozcinar Aljosa Smolic College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics V-SENSE
More informationHEVC based Stereo Video codec
based Stereo Video B Mallik*, A Sheikh Akbari*, P Bagheri Zadeh *School of Computing, Creative Technology & Engineering, Faculty of Arts, Environment & Technology, Leeds Beckett University, U.K. b.mallik6347@student.leedsbeckett.ac.uk,
More informationWATERMARKING FOR LIGHT FIELD RENDERING 1
ATERMARKING FOR LIGHT FIELD RENDERING 1 Alper Koz, Cevahir Çığla and A. Aydın Alatan Department of Electrical and Electronics Engineering, METU Balgat, 06531, Ankara, TURKEY. e-mail: koz@metu.edu.tr, cevahir@eee.metu.edu.tr,
More informationIBM Research Report. Inter Mode Selection for H.264/AVC Using Time-Efficient Learning-Theoretic Algorithms
RC24748 (W0902-063) February 12, 2009 Electrical Engineering IBM Research Report Inter Mode Selection for H.264/AVC Using Time-Efficient Learning-Theoretic Algorithms Yuri Vatis Institut für Informationsverarbeitung
More informationA LOW-COMPLEXITY AND LOSSLESS REFERENCE FRAME ENCODER ALGORITHM FOR VIDEO CODING
2014 IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP) A LOW-COMPLEXITY AND LOSSLESS REFERENCE FRAME ENCODER ALGORITHM FOR VIDEO CODING Dieison Silveira, Guilherme Povala,
More informationVideo compression with 1-D directional transforms in H.264/AVC
Video compression with 1-D directional transforms in H.264/AVC The MIT Faculty has made this article openly available. Please share how this access benefits you. Your story matters. Citation Kamisli, Fatih,
More informationA study of depth/texture bit-rate allocation in multi-view video plus depth compression
A study of depth/texture bit-rate allocation in multi-view video plus depth compression Emilie Bosc, Fabien Racapé, Vincent Jantet, Paul Riou, Muriel Pressigout, Luce Morin To cite this version: Emilie
More informationUpcoming Video Standards. Madhukar Budagavi, Ph.D. DSPS R&D Center, Dallas Texas Instruments Inc.
Upcoming Video Standards Madhukar Budagavi, Ph.D. DSPS R&D Center, Dallas Texas Instruments Inc. Outline Brief history of Video Coding standards Scalable Video Coding (SVC) standard Multiview Video Coding
More informationINTERNATIONAL ORGANISATION FOR STANDARISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO
INTERNATIONAL ORGANISATION FOR STANDARISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO ISO/IEC JTC1/SC29/WG11 MPEG/M15672 July 2008, Hannover,
More informationDEPTH IMAGE BASED RENDERING WITH ADVANCED TEXTURE SYNTHESIS. P. Ndjiki-Nya, M. Köppel, D. Doshkov, H. Lakshman, P. Merkle, K. Müller, and T.
DEPTH IMAGE BASED RENDERING WITH ADVANCED TEXTURE SYNTHESIS P. Ndjiki-Nya, M. Köppel, D. Doshkov, H. Lakshman, P. Merkle, K. Müller, and T. Wiegand Fraunhofer Institut for Telecommunications, Heinrich-Hertz-Institut
More informationColor Local Texture Features Based Face Recognition
Color Local Texture Features Based Face Recognition Priyanka V. Bankar Department of Electronics and Communication Engineering SKN Sinhgad College of Engineering, Korti, Pandharpur, Maharashtra, India
More informationPerformance Comparison between DWT-based and DCT-based Encoders
, pp.83-87 http://dx.doi.org/10.14257/astl.2014.75.19 Performance Comparison between DWT-based and DCT-based Encoders Xin Lu 1 and Xuesong Jin 2 * 1 School of Electronics and Information Engineering, Harbin
More informationUnit-level Optimization for SVC Extractor
Unit-level Optimization for SVC Extractor Chang-Ming Lee, Chia-Ying Lee, Bo-Yao Huang, and Kang-Chih Chang Department of Communications Engineering National Chung Cheng University Chiayi, Taiwan changminglee@ee.ccu.edu.tw,
More informationPlatelet-based coding of depth maps for the transmission of multiview images
Platelet-based coding of depth maps for the transmission of multiview images Yannick Morvan a, Peter H. N. de With a,b and Dirk Farin a a Eindhoven University of Technology, P.O. Box 513, The Netherlands;
More informationCompression-Induced Rendering Distortion Analysis for Texture/Depth Rate Allocation in 3D Video Compression
2009 Data Compression Conference Compression-Induced Rendering Distortion Analysis for Texture/Depth Rate Allocation in 3D Video Compression Yanwei Liu, Siwei Ma, Qingming Huang, Debin Zhao, Wen Gao, Nan
More informationNoise Robustness of Irregular LBP Pyramids
Noise Robustness of Irregular LBP Pyramids Christoph Körner, Ines Janusch, Walter G. Kropatsch Pattern Recognition and Image Processing (PRIP) Vienna University of Technology, Austria {christoph,ines,krw}@prip.tuwien.ac.at
More informationFast Mode Decision for H.264/AVC Using Mode Prediction
Fast Mode Decision for H.264/AVC Using Mode Prediction Song-Hak Ri and Joern Ostermann Institut fuer Informationsverarbeitung, Appelstr 9A, D-30167 Hannover, Germany ri@tnt.uni-hannover.de ostermann@tnt.uni-hannover.de
More informationImproved Geometric Warping-Based Watermarking
Improved Geometric Warping-Based Watermarking Dima Pröfrock, Mathias Schlauweg, Erika Müller Institute of Communication Engineering, Faculty of Computer Science and Electrical Engineering, University of
More informationFrequency Band Coding Mode Selection for Key Frames of Wyner-Ziv Video Coding
2009 11th IEEE International Symposium on Multimedia Frequency Band Coding Mode Selection for Key Frames of Wyner-Ziv Video Coding Ghazaleh R. Esmaili and Pamela C. Cosman Department of Electrical and
More informationAn Optimized Template Matching Approach to Intra Coding in Video/Image Compression
An Optimized Template Matching Approach to Intra Coding in Video/Image Compression Hui Su, Jingning Han, and Yaowu Xu Chrome Media, Google Inc., 1950 Charleston Road, Mountain View, CA 94043 ABSTRACT The
More informationBit Allocation for Spatial Scalability in H.264/SVC
Bit Allocation for Spatial Scalability in H.264/SVC Jiaying Liu 1, Yongjin Cho 2, Zongming Guo 3, C.-C. Jay Kuo 4 Institute of Computer Science and Technology, Peking University, Beijing, P.R. China 100871
More informationQUAD-TREE PARTITIONED COMPRESSED SENSING FOR DEPTH MAP CODING. Ying Liu, Krishna Rao Vijayanagar, and Joohee Kim
QUAD-TREE PARTITIONED COMPRESSED SENSING FOR DEPTH MAP CODING Ying Liu, Krishna Rao Vijayanagar, and Joohee Kim Department of Electrical and Computer Engineering, Illinois Institute of Technology, Chicago,
More informationExtensions of H.264/AVC for Multiview Video Compression
MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Extensions of H.264/AVC for Multiview Video Compression Emin Martinian, Alexander Behrens, Jun Xin, Anthony Vetro, Huifang Sun TR2006-048 June
More informationComputer Vision I - Filtering and Feature detection
Computer Vision I - Filtering and Feature detection Carsten Rother 30/10/2015 Computer Vision I: Basics of Image Processing Roadmap: Basics of Digital Image Processing Computer Vision I: Basics of Image
More informationDEPTH PIXEL CLUSTERING FOR CONSISTENCY TESTING OF MULTIVIEW DEPTH. Pravin Kumar Rana and Markus Flierl
DEPTH PIXEL CLUSTERING FOR CONSISTENCY TESTING OF MULTIVIEW DEPTH Pravin Kumar Rana and Markus Flierl ACCESS Linnaeus Center, School of Electrical Engineering KTH Royal Institute of Technology, Stockholm,
More informationFast Encoding Techniques for Multiview Video Coding
Fast Encoding Techniques for Multiview Video Coding S. Khattak a, R. Hamzaoui a,, S. Ahmad a, P. Frossard b a Centre for Electronic and Communications Engineering, De Montfort University, United Kingdom
More informationGraph-based representation for multiview images with complex camera configurations
Graph-based representation for multiview images with complex camera configurations Xin Su, Thomas Maugey, Christine Guillemot To cite this version: Xin Su, Thomas Maugey, Christine Guillemot. Graph-based
More informationSurvey on Extraction of Texture based Features using Local Binary Pattern
Survey on Extraction of Texture based Features using Local Binary Pattern Ch.Sudha Sree Computer Applications RVR&JC college of Engineering Guntur, India Abstract Local Binary Pattern (LBP) is one of the
More informationReduced 4x4 Block Intra Prediction Modes using Directional Similarity in H.264/AVC
Proceedings of the 7th WSEAS International Conference on Multimedia, Internet & Video Technologies, Beijing, China, September 15-17, 2007 198 Reduced 4x4 Block Intra Prediction Modes using Directional
More informationsignal-to-noise ratio (PSNR), 2
u m " The Integration in Optics, Mechanics, and Electronics of Digital Versatile Disc Systems (1/3) ---(IV) Digital Video and Audio Signal Processing ƒf NSC87-2218-E-009-036 86 8 1 --- 87 7 31 p m o This
More informationContent based Image Retrieval Using Multichannel Feature Extraction Techniques
ISSN 2395-1621 Content based Image Retrieval Using Multichannel Feature Extraction Techniques #1 Pooja P. Patil1, #2 Prof. B.H. Thombare 1 patilpoojapandit@gmail.com #1 M.E. Student, Computer Engineering
More informationEE 5359 MULTIMEDIA PROCESSING SPRING Final Report IMPLEMENTATION AND ANALYSIS OF DIRECTIONAL DISCRETE COSINE TRANSFORM IN H.
EE 5359 MULTIMEDIA PROCESSING SPRING 2011 Final Report IMPLEMENTATION AND ANALYSIS OF DIRECTIONAL DISCRETE COSINE TRANSFORM IN H.264 Under guidance of DR K R RAO DEPARTMENT OF ELECTRICAL ENGINEERING UNIVERSITY
More informationDisparity Search Range Estimation: Enforcing Temporal Consistency
MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Disparity Search Range Estimation: Enforcing Temporal Consistency Dongbo Min, Sehoon Yea, Zafer Arican, Anthony Vetro TR1-13 April 1 Abstract
More informationCompression Artifact Reduction with Adaptive Bilateral Filtering
Compression Artifact Reduction with Adaptive Bilateral Filtering Ming Zhang and Bahadir Gunturk Department of Electrical and Computer Engineering Louisiana State University Baton Rouge, LA 70803 ABSTRACT
More information5LSH0 Advanced Topics Video & Analysis
1 Multiview 3D video / Outline 2 Advanced Topics Multimedia Video (5LSH0), Module 02 3D Geometry, 3D Multiview Video Coding & Rendering Peter H.N. de With, Sveta Zinger & Y. Morvan ( p.h.n.de.with@tue.nl
More informationAn Adaptive Threshold LBP Algorithm for Face Recognition
An Adaptive Threshold LBP Algorithm for Face Recognition Xiaoping Jiang 1, Chuyu Guo 1,*, Hua Zhang 1, and Chenghua Li 1 1 College of Electronics and Information Engineering, Hubei Key Laboratory of Intelligent
More informationImage Quality Assessment based on Improved Structural SIMilarity
Image Quality Assessment based on Improved Structural SIMilarity Jinjian Wu 1, Fei Qi 2, and Guangming Shi 3 School of Electronic Engineering, Xidian University, Xi an, Shaanxi, 710071, P.R. China 1 jinjian.wu@mail.xidian.edu.cn
More informationExperimentation on the use of Chromaticity Features, Local Binary Pattern and Discrete Cosine Transform in Colour Texture Analysis
Experimentation on the use of Chromaticity Features, Local Binary Pattern and Discrete Cosine Transform in Colour Texture Analysis N.Padmapriya, Ovidiu Ghita, and Paul.F.Whelan Vision Systems Laboratory,
More informationScene Segmentation by Color and Depth Information and its Applications
Scene Segmentation by Color and Depth Information and its Applications Carlo Dal Mutto Pietro Zanuttigh Guido M. Cortelazzo Department of Information Engineering University of Padova Via Gradenigo 6/B,
More informationOptimum Quantization Parameters for Mode Decision in Scalable Extension of H.264/AVC Video Codec
Optimum Quantization Parameters for Mode Decision in Scalable Extension of H.264/AVC Video Codec Seung-Hwan Kim and Yo-Sung Ho Gwangju Institute of Science and Technology (GIST), 1 Oryong-dong Buk-gu,
More informationCoding of Coefficients of two-dimensional non-separable Adaptive Wiener Interpolation Filter
Coding of Coefficients of two-dimensional non-separable Adaptive Wiener Interpolation Filter Y. Vatis, B. Edler, I. Wassermann, D. T. Nguyen and J. Ostermann ABSTRACT Standard video compression techniques
More informationDigital Image Stabilization and Its Integration with Video Encoder
Digital Image Stabilization and Its Integration with Video Encoder Yu-Chun Peng, Hung-An Chang, Homer H. Chen Graduate Institute of Communication Engineering National Taiwan University Taipei, Taiwan {b889189,
More informationLossless Compression of Stereo Disparity Maps for 3D
2012 IEEE International Conference on Multimedia and Expo Workshops Lossless Compression of Stereo Disparity Maps for 3D Marco Zamarin, Søren Forchhammer Department of Photonics Engineering Technical University
More informationComparative Analysis of Local Binary Patterns using Feature Extraction and Classification
Advances in Computational Sciences and Technology ISSN 0973-6107 Volume 10, Number 5 (2017) pp. 1025-1036 Research India Publications http://www.ripublication.com Comparative Analysis of Local Binary Patterns
More informationCompression of Stereo Images using a Huffman-Zip Scheme
Compression of Stereo Images using a Huffman-Zip Scheme John Hamann, Vickey Yeh Department of Electrical Engineering, Stanford University Stanford, CA 94304 jhamann@stanford.edu, vickey@stanford.edu Abstract
More informationBilateral Depth-Discontinuity Filter for Novel View Synthesis
Bilateral Depth-Discontinuity Filter for Novel View Synthesis Ismaël Daribo and Hideo Saito Department of Information and Computer Science, Keio University, 3-14-1 Hiyoshi, Kohoku-ku, Yokohama 3-85, Japan
More informationFast HEVC Intra Mode Decision Based on Edge Detection and SATD Costs Classification
Fast HEVC Intra Mode Decision Based on Edge Detection and SATD Costs Classification Mohammadreza Jamali 1, Stéphane Coulombe 1, François Caron 2 1 École de technologie supérieure, Université du Québec,
More informationQuality improving techniques in DIBR for free-viewpoint video Do, Q.L.; Zinger, S.; Morvan, Y.; de With, P.H.N.
Quality improving techniques in DIBR for free-viewpoint video Do, Q.L.; Zinger, S.; Morvan, Y.; de With, P.H.N. Published in: Proceedings of the 3DTV Conference : The True Vision - Capture, Transmission
More informationOne-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain
Author manuscript, published in "International Symposium on Broadband Multimedia Systems and Broadcasting, Bilbao : Spain (2009)" One-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain
More informationNetwork Image Coding for Multicast
Network Image Coding for Multicast David Varodayan, David Chen and Bernd Girod Information Systems Laboratory, Stanford University Stanford, California, USA {varodayan, dmchen, bgirod}@stanford.edu Abstract
More informationReduced Frame Quantization in Video Coding
Reduced Frame Quantization in Video Coding Tuukka Toivonen and Janne Heikkilä Machine Vision Group Infotech Oulu and Department of Electrical and Information Engineering P. O. Box 500, FIN-900 University
More informationA reversible data hiding based on adaptive prediction technique and histogram shifting
A reversible data hiding based on adaptive prediction technique and histogram shifting Rui Liu, Rongrong Ni, Yao Zhao Institute of Information Science Beijing Jiaotong University E-mail: rrni@bjtu.edu.cn
More informationMotion Vector Coding Algorithm Based on Adaptive Template Matching
Motion Vector Coding Algorithm Based on Adaptive Template Matching Wen Yang #1, Oscar C. Au #2, Jingjing Dai #3, Feng Zou #4, Chao Pang #5,Yu Liu 6 # Electronic and Computer Engineering, The Hong Kong
More informationESTIMATION OF THE UTILITIES OF THE NAL UNITS IN H.264/AVC SCALABLE VIDEO BITSTREAMS. Bin Zhang, Mathias Wien and Jens-Rainer Ohm
19th European Signal Processing Conference (EUSIPCO 2011) Barcelona, Spain, August 29 - September 2, 2011 ESTIMATION OF THE UTILITIES OF THE NAL UNITS IN H.264/AVC SCALABLE VIDEO BITSTREAMS Bin Zhang,
More informationDEPTH LESS 3D RENDERING. Mashhour Solh and Ghassan AlRegib
DEPTH LESS 3D RENDERING Mashhour Solh and Ghassan AlRegib School of Electrical and Computer Engineering Georgia Institute of Technology { msolh,alregib } @gatech.edu ABSTRACT We propose a new view synthesis
More informationTexture Features in Facial Image Analysis
Texture Features in Facial Image Analysis Matti Pietikäinen and Abdenour Hadid Machine Vision Group Infotech Oulu and Department of Electrical and Information Engineering P.O. Box 4500, FI-90014 University
More informationA SCALABLE CODING APPROACH FOR HIGH QUALITY DEPTH IMAGE COMPRESSION
This material is published in the open archive of Mid Sweden University DIVA http://miun.diva-portal.org to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein
More informationFusion of colour and depth partitions for depth map coding
Fusion of colour and depth partitions for depth map coding M. Maceira, J.R. Morros, J. Ruiz-Hidalgo Department of Signal Theory and Communications Universitat Polite cnica de Catalunya (UPC) Barcelona,
More information