International Journal of Advanced Research in Computer Science and Software Engineering

Similar documents
International Journal of Advanced Research in Computer Science and Software Engineering

Transactions on Information and Communications Technologies vol 19, 1997 WIT Press, ISSN

Chapter 11.3 MPEG-2. MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications:

Multiview Image Compression using Algebraic Constraints

VIDEO AND IMAGE PROCESSING USING DSP AND PFGA. Chapter 3: Video Processing

Optimized Progressive Coding of Stereo Images Using Discrete Wavelet Transform

Video Compression An Introduction

Compression of Stereo Images using a Huffman-Zip Scheme

Outline Introduction MPEG-2 MPEG-4. Video Compression. Introduction to MPEG. Prof. Pratikgiri Goswami

Lecture 14, Video Coding Stereo Video Coding

A Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding.

Image Compression Algorithm and JPEG Standard

Motion Estimation for Video Coding Standards

Range Imaging Through Triangulation. Range Imaging Through Triangulation. Range Imaging Through Triangulation. Range Imaging Through Triangulation

HYBRID TRANSFORMATION TECHNIQUE FOR IMAGE COMPRESSION

Module 7 VIDEO CODING AND MOTION ESTIMATION

DISTANCE MEASUREMENT USING STEREO VISION

Efficient Block Matching Algorithm for Motion Estimation

Low-Complexity, Near-Lossless Coding of Depth Maps from Kinect-Like Depth Cameras

International Journal of Emerging Technology and Advanced Engineering Website: (ISSN , Volume 2, Issue 4, April 2012)

Georgios Tziritas Computer Science Department

Multiview Image Compression: Future Challenges and Today s Solutions

Advanced Video Coding: The new H.264 video compression standard

VIDEO COMPRESSION STANDARDS

Practice Exam Sample Solutions

ECE 417 Guest Lecture Video Compression in MPEG-1/2/4. Min-Hsuan Tsai Apr 02, 2013

Stereo Image Compression

2014 Summer School on MPEG/VCEG Video. Video Coding Concept

DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS

Automatic Video Caption Detection and Extraction in the DCT Compressed Domain

An Improved Complex Spatially Scalable ACC DCT Based Video Compression Method

Lecture 13 Video Coding H.264 / MPEG4 AVC

Context based optimal shape coding

Lecture 5: Error Resilience & Scalability

IMAGE COMPRESSION USING HYBRID QUANTIZATION METHOD IN JPEG

Digital Image Processing COSC 6380/4393

A Comparative Study of DCT, DWT & Hybrid (DCT-DWT) Transform

FPGA Implementation of Low Complexity Video Encoder using Optimized 3D-DCT

MPEG-4: Simple Profile (SP)

Reduction of Blocking artifacts in Compressed Medical Images

DTU M.SC. - COURSE EXAM Revised Edition

IMAGE COMPRESSION USING HYBRID TRANSFORM TECHNIQUE

Image Gap Interpolation for Color Images Using Discrete Cosine Transform

FRACTAL IMAGE COMPRESSION OF GRAYSCALE AND RGB IMAGES USING DCT WITH QUADTREE DECOMPOSITION AND HUFFMAN CODING. Moheb R. Girgis and Mohammed M.

Anno accademico 2006/2007. Davide Migliore

High Efficiency Video Coding. Li Li 2016/10/18

Frequency Band Coding Mode Selection for Key Frames of Wyner-Ziv Video Coding

JPEG compression of monochrome 2D-barcode images using DCT coefficient distributions

Stereo Vision Image Processing Strategy for Moving Object Detecting

CS4733 Class Notes, Computer Vision

EXAM SOLUTIONS. Image Processing and Computer Vision Course 2D1421 Monday, 13 th of March 2006,

Depth Estimation for View Synthesis in Multiview Video Coding

CODING METHOD FOR EMBEDDING AUDIO IN VIDEO STREAM. Harri Sorokin, Jari Koivusaari, Moncef Gabbouj, and Jarmo Takala

Visually Improved Image Compression by using Embedded Zero-tree Wavelet Coding

FAST MOTION ESTIMATION WITH DUAL SEARCH WINDOW FOR STEREO 3D VIDEO ENCODING

SEGMENTATION BASED CODING OF STEREOSCOPIC IMAGE SEQUENCES

Multimedia Systems Video II (Video Coding) Mahdi Amiri April 2012 Sharif University of Technology

Depth. Common Classification Tasks. Example: AlexNet. Another Example: Inception. Another Example: Inception. Depth

Complexity Reduction Tools for MPEG-2 to H.264 Video Transcoding

Performance Comparison between DWT-based and DCT-based Encoders

A NOVEL SCANNING SCHEME FOR DIRECTIONAL SPATIAL PREDICTION OF AVS INTRA CODING

A 3-D Virtual SPIHT for Scalable Very Low Bit-Rate Embedded Video Compression

Compressive Sensing for Multimedia. Communications in Wireless Sensor Networks

Digital Image Stabilization and Its Integration with Video Encoder

Professor Laurence S. Dooley. School of Computing and Communications Milton Keynes, UK

Adaptive Quantization for Video Compression in Frequency Domain

Feature Based Watermarking Algorithm by Adopting Arnold Transform

IMAGE COMPRESSION TECHNIQUES

Feature Extraction and Image Processing, 2 nd Edition. Contents. Preface

Midterm Examination CS 534: Computational Photography

Wavelet Based Image Compression Using ROI SPIHT Coding

A Novel Statistical Distortion Model Based on Mixed Laplacian and Uniform Distribution of Mpeg-4 FGS

NOVEL ALGORITHMS FOR FINDING AN OPTIMAL SCANNING PATH FOR JPEG IMAGE COMPRESSION

Video Quality Analysis for H.264 Based on Human Visual System

Compression of Light Field Images using Projective 2-D Warping method and Block matching

The Video Z-buffer: A Concept for Facilitating Monoscopic Image Compression by exploiting the 3-D Stereoscopic Depth map

Analysis of Image and Video Using Color, Texture and Shape Features for Object Identification

University of Mustansiriyah, Baghdad, Iraq

Comparative Study of Partial Closed-loop Versus Open-loop Motion Estimation for Coding of HDTV

A deblocking filter with two separate modes in block-based video coding

Rate Distortion Optimization in Video Compression

Video Compression Standards (II) A/Prof. Jian Zhang

Coding of Coefficients of two-dimensional non-separable Adaptive Wiener Interpolation Filter

Generation of Digital Watermarked Anaglyph 3D Image Using DWT

signal-to-noise ratio (PSNR), 2

View Synthesis Prediction for Rate-Overhead Reduction in FTV

Bit-Plane Decomposition Steganography Using Wavelet Compressed Video

A multiresolutional region based segmentation scheme for stereoscopic image compression

Using Shift Number Coding with Wavelet Transform for Image Compression

( ) ; For N=1: g 1. g n

PERFORMANCE ANALYSIS OF CANNY AND OTHER COMMONLY USED EDGE DETECTORS Sandeep Dhawan Director of Technology, OTTE, NEW YORK

BIG DATA-DRIVEN FAST REDUCING THE VISUAL BLOCK ARTIFACTS OF DCT COMPRESSED IMAGES FOR URBAN SURVEILLANCE SYSTEMS

Fundamentals of Digital Image Processing

ELEC Dr Reji Mathew Electrical Engineering UNSW

IMAGE COMPRESSION. Image Compression. Why? Reducing transportation times Reducing file size. A two way event - compression and decompression

Computer Vision 2. SS 18 Dr. Benjamin Guthier Professur für Bildverarbeitung. Computer Vision 2 Dr. Benjamin Guthier

International Journal of Wavelets, Multiresolution and Information Processing c World Scientific Publishing Company

[Singh*, 5(3): March, 2016] ISSN: (I2OR), Publication Impact Factor: 3.785

Three Dimensional Motion Vectorless Compression

Transcription:

Volume 2, Issue 8, August 2012 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com A Study on Block Based Vs Object-Based Stereo Image Compression T.Ramaprabha Lecturer, Dept of Computer Science Sarah Tucker College (Autonomous) Tirunelveli 627007,South India Dr.M.Mohamed Sathik Principal Sadakkathella Appa College(Autonomous) Tirunelveli-627 011,South India Abstract: The first stereo image compression algorithm was to code the sum and difference of the two images. However, the performance of this technique decreases with increased disparity values. A modification to this method is to shift one of the images horizontally to the point where the cross correlation between the imagesof stereo pair reaches its maximum value The shifted image is then subtracted from its partner image and the difference is encoded. This method, assumes that objects in the scene have similar disparity values and thus is not particularly efficient. Another approach is to translate the row blocks instead of the whole image. stereoscopic sequences can be compressed much more efficiently than the independent compression of its two image streams by exploiting, in addition to the spatial and temporal correlations that are exploited by sequence compression schemes, the high cross-stream correlations present between the streams. A disparity-based segmentation and the subsequent transmission of these segment disparities encode one image of a stereoscopic image pair at a very low coding overhead given the other image. In this paper I focused the advantage of object based stereo image compression over blocl based stereo image compression. Keywords: Stereo images, block based stereo image compression,object based stereo image compression. I. Introduction : stereo images Stereovision could be simulated by acquiring two views of a 3D scene and by presenting them separately to the left and right eyes. Stereo capture equipment play an important role in the first step of this process. Many technical and operational variations are present between the practical designs of capture equipment. A pair of stereo images is very similar to each other as they are the images of a stationary object taken from two different angles. This is why compressing both images independently is an in-efficient way of compressing stereo images. An efficient way to compress a pair of stereoscopic images is to calculate the difference between the two images; also known as disparity estimation and then compress one image independently. Fig 1. Stereo images This image is known as the reference image and can be either the right image or the left image. The reference image and the disparity vectors are then used to reconstruct the second image. A stereo image is produced by taking two cameras, separated by a distance of 6.5 centimeters which is approximately the distance between the human eyes and recording the perspectives of the right eye and the left eye using different lenses. The left image is seen through the left lens and the right image is seen through the right lens. The brain then merges the two images into one and also perceives the depth of the object. Stereo images can be produced cheaply using inexpensive digital cameras and are used widely in clinical applications, mining, metallurgy, environmental science and entertainment. 2012, IJARCSSE All Rights Reserved Page 271...

II. Block based stereo image compression The left image is taken as the reference image and the disparity vectors between the two images are estimated using the Exhaustive Block Matching Algorithm (EBMA). The reference image is transformed using two-dimensional discrete cosine transform (DCT-II) and quantized using the JPEG quantization matrix. The resulting matrix is compressed into a bit stream using arithmetic coding. The left image is taken as the reference image and is transformed using two dimensional forward discrete cosine transform (DCT-II). The resulting image matrix is quantized using the JPEG quantization matrix in (3) and then compressed using coding. The second part of the encoder involves compressing the right image. Since the two images are very similar to each other, disparity vectors between the two images are estimated. The resulting disparity vectors are compressed into a bit stream using arithmetic encoding. When decode the image, all the steps in encoding were performed in reverse order.. Left image Dct II Quantiz ation encoding Right image Block matching algorithm encoding Fig 2. Block based stereo Image encoding process decoding Inverse DCT II Inverse Quantizatio n Reconstructed Left Image Compressed bit stream for Left Compressed bit stream for right image decoding Reconstructed Right Image Fig 3. Block based stereo image Decoding process The fundamental issue is that when 3D-stereoscopy is implemented on a single display each eye gets in some sense only half the display. A user contemplating using 3D-stereoscopy must thus acquire a display (and the underlying system to support it) with twice the pixel-per-second capability of the minimal display needed for the flat application; the alternatives require choosing between a flickering image or a reduced spatial resolution image. The lower level capacities of the system s components must also be doubled. In particular, all the information captured by two cameras must be stored or transmitted or both. Doubling these capacities may be more difficult than doubling the capability of the display, inasmuch as the capability of the display can be increased by simply paying more. The most difficult system component to increase is probably the bandwidth of the transmission system, which is often subject to powerful regulatory as well as technical constraints. Nevertheless, the bandwidth must apparently be doubled to transmit 3D-stereoscopic image streams at the same spatial resolution and temporal update frequency as either flat image stream. In fact, because the two views comprising a 3D-stereoscopic image pair are nearly identical, i.e., the information content of both together is only a little more than the information content of one alone, it is possible to find representations of image pairs and streams that take up little more storage space and transmission bandwidth than the space or bandwidth that is required by either alone. III. Object based stereo image compression To reflect the true disparity between left and right frames, free-form object based matching would be more efficient than block-based techniques. This has been proved in video compression, such as MPEG-4. However, the price to pay is to encode the arbitrary shape of those objects and the complexity level will also be increased accordingly. To make a better balance, we proposed a hybrid scheme with features of both block-based and object-based coding 2012, IJARCSSE All Rights Reserved Page 272

techniques to achieve data compression for stereo image pairs. The algorithm comprises the following operations are, extract objects by contour analysis for both frames, and the objects are then matched to find those areas, which are similar in terms of their shapes; then enclose the matching object pairs by object bounding rectangles, in a similar spirit of MPEG-4. A parity based contour filling algorithm is further used to identify the interior and exterior pixels of the object, in which the shapes of the objects are identified by binary object planes. These planes are only used as an aid in the coding process rather than being encoded finally,encode the objects in terms of three groups like objects that are enclosed by closed contours, objects enclosed by contours that terminate at the image frame boundaries; and unidentified areas that are treated as the background. Fig 3. Stereo image pairs- cans Fig 4. Contour plots for cans a)contour extraction The contour is extracted by a two-step process. Firstly, the image is convolved (LoG*Image) with a Laplacian-of-Gaussian (LoG) operator, defined by: (1) where and is the standard deviation. To facilitate its application, equation (1) can be discretized in various ways. Secondly, edges are detected at the zero-crossing points (e.g., patterns such as "++--" and "--++" along both vertical and horizontal directions). In this step, the slopes of the LoG of the image along both x and y directions, denoted by Sx and Sy, are used to compute the edge strength at each zero-crossing point. An edge strength at a point ( x,y ) is defined as follows: S (x,y) = (2) The contour points are chosen using a hysteresis thresholding technique, that is the edge strength at each point along the contour is greater than Tl and at least one point on the contour has an edge strength greater than Tu, where Tl and Tu are pre-set thresholds and Tl < Tu. Generally, Tl is set sufficiently low to preserve the whole contour around the region boundary and Tu is chosen large enough to avoid spurious edges. Contour search is initiated whenever one point with a value greater than Tu is scanned. The search is conducted in both directions of the contour and the neighbouring pixels, with values greater than Tl being accepted as contour points. The search is terminated when no neighbouring pixels are found to satisfy this condition. Then all edge strength values along the detected contour are set to zethat these points will not be visited again. Contour Matching The contoured objects are matched in terms of: (i) those that are closed in as illustrated in Fig. 3(a); and (ii) those which are open, and terminated at the image boundaries as shown in Fig. 5 Fig 5. Matched contour pair Closed contour matching: For every closed-in contour, three shape attributes are computed: (i) the number of pixels representing the perimeter of the contour, ; (ii) the y co-ordinate of the centroid ; and (iii) the first invariant moment, h. is considered as a good matching attribute as we assume that the vertical stereo disparity between points in the image frames are negligible. This is because that parallel axis geometry [1] is used in obtaining all the test stereo image pairs. If and represent the x and y co-ordinates of the points along the contour, can be defined as follows: 2012, IJARCSSE All Rights Reserved Page 273

(3) Where and. (4) Every closed-in contour of the right frame is compared with that of the left frame. A contour from the left frame is accepted as an initial candidate match for the right frame contour, if the differences between each of their shape attributes fall below some pre-set thresholds At the end of this procedure, closed-in contours in right frames may have multiple candidate matches and a further step is necessary to find the best match among them. Contour blocking is a process designed to divide the matched objects into blocks in a most efficient way so that different encoding techniques can be applied to compress the stereo image pair. The basic principle is similar to MPEG-4 techniques, which include fitting an object into smallest possible rectangle, padding the reference object and produce disparity compensated predictive errors. Object Encoding The stereo image pair is encoded on an object basis by identifying three types of objects within the images. These include: objects that are bounded within closed contours, objects that have open contours and both ends of the open contour terminate at image boundaries, and the rest which fall into the background. Although there exist differences among the three types and their encoding requires individual co-ordination, the major proposed encoding techniques can be described as: (i) padding of left object bounding rectangles; (ii) disparity-based prediction for both arbitrary shaped boundary blocks and internal blocks; (iii) background encoding. RESULTS OF COMPARISION MATCHING TYPE CR PSNR BLOCK BASED 39.2 39.5 OBJECT BASED 44.5 38.6 TABLE 1.Block Vs Object based compression Chart 1. Block Vs Object based compression IV. Conclusion In object based stereo image compression, padding of left object bounding rectangles features local gradient inside the object; The disparity values and the prediction errors for each arbitrary shaped boundary block is transmitted on an object by object basis. In this way, the error blocks and disparity values can be properly identified and used to reconstruct the right objects at the decoding end. In the prediction process, the shape of the objects in the left frame are used to produce errors for right objects, and thus no bits are consumed to encode the shape information. Yet correct shape can be reconstructed at the decoding end since the lost information will be picked up by corresponding background block encoding; Internal areas are encoded adaptive to their local texture. An object-based coding scheme was presented for the coding of a channel of a stereo- scopic image sequence using motion and depth information. The segmentation part of the algorithm was interleaved with the estimation part in order to optimize the coding performance. Depth information was transmitted either in the form of dense depth maps or using a wireframe model. Furthermore, techniques were examined for the updating of the depth map and the segmentation information. Study results have shown the performance of the object based stereo image compression to be slightly better than the performance of blockbased approaches. VI. References [1]. V. E. Seferidis, \Stereo Image Coding Using Generalized Block Matching and Quad-Tree Structured Spatial Decomposition," IEEE Trans. on Image Processing, to appear. 2012, IJARCSSE All Rights Reserved Page 274

[2] D. Tzovaras, M. G. Strintzis, and H. Sahinoglou, \Evaluation of Multiresolution Techniques for Motion and Disparity Estimation," Signal Processing : Image Com- munication, vol. 6, no. 1, pp. 59{67, Mar. 1994. [3] M. Hotter, \Object-oriented analysis-synthesis coding based on moving two- dimensional objects," Signal Processing : Image Communication, vol. 2, no. 4, pp. 409{428, Dec. 1990. [4] J. L. Dugelay and D. Pele, \Motion and Disparity Analysis of a Stereoscopic Se- quence. Application to 3DTV Coding," EUSIPCO '92, pp. 1295{1298, Oct. 1992. [5] D. V. Papadimitriou, \Stereo in Model-Based Image Coding," in Int'l Workshop on Coding Techniques for Very Low Bit-rate Video (VLBV 94), (Colchester), Apr. 1994. [6] S. Panis and M. Ziegler, \Object Based Coding Using Motion and Stereo Information," in Picture Coding Symposium (PCS '94), pp. 308{312, Sep. 1994. [7]. N. Grammalidis, S. Malassiotis, D. Tzovaras, and M. G. Strintzis, \Stereo Image Sequence Coding Based on 3-D Motion Estimation and Compensation," Signal Pro- cessing : Image Communication, vol. 7, no. 1, pp. 129{145, Jan. 1995. [8] D. Tzovaras, N. Grammalidis, and M. G. Strintzis, \3-D Motion/Disparity Segmen- tation for Object- Based Image Sequence Coding," Optical Engineering, special issue on Visual Communications and Image Processing, vol. 35, no. 1, pp. 137{145, Jan. 1996. [9]. H. G. Mussman, M. Hotter, and J. Ostermann, \Object-oriented analysis-synthesis coding of moving images," Signal Processing : Image Communication, vol. 1, no. 2, pp. 117{138, Oct. 1989. 2012, IJARCSSE All Rights Reserved Page 275