Figure 1: Representation of moving images using layers Once a set of ane models has been found, similar models are grouped based in a mean-square dist
|
|
- Augustine Webster
- 5 years ago
- Views:
Transcription
1 ON THE USE OF LAYERS FOR VIDEO CODING AND OBJECT MANIPULATION Luis Torres, David Garca and Anna Mates Dept. of Signal Theory and Communications Universitat Politecnica de Catalunya Gran Capita s/n, D Barcelona, Spain fluis, albroto, Abstract Layered representations, originally introduced in [7] are important in several applications. Specically, they are very appropriate for object tracking, object manipulation and content-based scalability which are among the main functionalities of the future standard MPEG-4 [2, 3]. A robust representation of moving images based on layers has been presented in [5]. One of the most crucial stages in the formation of layers is the motion segmentation. In particular, the initial segmentation is very critical for the success of the layered system. In this context, the objective of this paper 1 is to provide some insight on the performance of layers by studying dierent initial segmentations for the construction of layers. In order to have a selfcontained paper, the rst part of the paper reviews the main concepts and results presented in [5] which includes image synthesis and object manipulation. The second part provides dierent type of initial segmentations that help to analyze the performance of the overall layered representation of moving images. Some comments on the use of layers for video coding are also introduced. 1 INTRODUCTION The basic idea behind the new standard MPEG-4 [2] is to represent the world understood as a composition of audio-visual objects, following a script that describes their spatial and temporal relationship. This type of representation should provide the possibility that the user interacts with the various audio-visual objects in the scene, in a way similar to the actions taken in everyday life. Although this content-based approach to the scene representation may be considered evident for a human being, it represents in fact a revolution in terms of video representation architecture used in the available standards, since it allows a jump in the 1 This work was supported in part by the CICYT grant of the Spanish government: TIC C05 type of functionalities that may be provided to the user. A scene represented as a composition of (more or less independent) audio - visual objects oers to the user the possibility to play with the scene content, by changing some of the objects characteristics (e.g. position, motion, texture or shape), by accessing only selected parts of scene or even by cut and pasting objects from one scene to another. Content and interaction are thus central concepts in MPEG-4 [3]. Basic concepts and several approaches to achieve different functionalities in the context of MPEG-4 may be found in [6]. An approach to represent the visual world in terms of objects (called Video Object Planes, VOP's, in MPEG-4) is the representation in layers proposed by Wang and Adelson [7]. Each layer contains three different maps: 1) the intensity map, 2) the alpha map, which denes the opacity or transparency of the layer at each point and 3) the velocity map which describes how the map should be warped over time. To clearly show the concepts behind the layered representation, Figure 1 advances our own results on the layered representation of the Mobile Calendar sequence. The representation has been able to extract four dierent layers corresponding to the ball, the train, the calendar and the background. The last image represents the reconstruction obtained from the dierent layers. In each layer all the visible information of the corresponding object is accumulated in one single extended frame. Notice that this representation allows manipulation of the content of the scene and besides each frame is dened only through the motion parameters what gives a very compact representation very useful in video coding applications Our construction of the layers follows conceptually the approach presented in [7], but using more ecient and robust techniques. The process implies two basic operations: 1) a local motion estimation and 2) a motion segmentation by ane model tting. We have developed a simple and yet robust method to nd the motion eld. The technique is presented in Section 2.
2 Figure 1: Representation of moving images using layers Once a set of ane models has been found, similar models are grouped based in a mean-square distance between the motion vectors of the pixels belonging to each of the models being compared. This gives an ecient motion segmentation strategy which is explained in Section 3. Section 4 provides some results of reconstructed sequences using this layered approach along with an example of object manipulation. Section 5 gives some insight on the performance of layers by giving results using manual and semi-manual initial segmentations. Section 6 provides a few ideas on the use of layers for video coding applications. Finally, Section 7 draws some conclusions. 2 ROBUST MOTION ESTI- MATION In order to estimate the motion eld of the scene, we estimate rst an initial approximation of the motion of some pixels located in a rectangular grid using block matching techniques. Given the position ~r of a pixel and a vector motion ~v = (v x ; v y ) jv x j; jv y j maximum displacement, the following cost function is dened: G(~r; ~v) = X ~r02w (~r) I t (~r0)? I t?1 (~r0? ~v) 2 (1) where W (~r) is a neighboorhod of ~r and I t, I t?1 are the actual and previous images. The motion vectors of the remaining pixels are interpolated from these rst ones. Due to local minima of the cost function, some motion vectors may be erroneous. To improve these motion vectors, a motion gradient is found for each pixel: MG(~r) = 1 4 X ( (1, 0) (0, 1) ~d= (-1, 0) (0, -1) ~v(~r)? ~v(~r? ~ d) 2 (2) Then for those pixels whose motion vectors are above the mean of all the gradients, the motion vectors are recalculated using a bigger W (~r). The interpolated motion vectors are rened with integer precision. The nal motion vector for each pixel of the whole image is obtained through a renement process based on one third-pixel accuracy. Please notice that if occluded areas of I t?1 are uncovered in I t, the cost function G(~r; ~v) cannot give the correct motion estimation vector. To minimize this problem, the above explained technique is applied to estimate the motion eld between I t and I t+1. This information is used to compensate for the errors of the rst motion eld (I t?1 and I t ). Figure 2 shows the motion eld obtained for the rst two images of the Flower Garden sequence. This gure shows a motion vector per pixel, an acceptable motion eld and a quite good performance in the contours. 3 ROBUST MOTION SEG- MENTATION The next step is to segment the motion vector eld by tting an ane motion model. The objective is that those regions of the image with the same ane model will be grouped. As we do not have a reliable initial
3 models obtained from an original decomposition of the image in rectangular regions of pixels. We believe that this technique represents also an important improvement with respect to the original technique presented in [7]. Figure 4 and Figure 5 show the original sequence Flower Garden and its corresponding segmentation. Figure 2: Motion eld of the Flower Garden sequence image partition, the image is initially partitioned in rectangular regions (same approach as taken in the original paper in [7]). For each of these rectangular regions, the best ane model in the minimum square sense is found through an iterative k-means clustering algorithm. Once a set of ane models have been found, similar models are grouped based in a meansquare distance between the motion vectors of the pixels belonging to each of the models being compared. Two models are grouped each time. The ordering in which all the model pairs are grouped is of paramount importance as it will x the stability and convergence of the technique. The distance between two given af- ne models a i and a j belonging respectively to the regions R i and R j, is dened as dist(a i ; a j ) = 1 n X ~r2r i;r j V ai (~r)? V aj (~r) 2 (3) where n indicates the number of pixels of R i [ R j, V ai (~r) and V aj (~r) the motion vectors represented by a i and a j in the position ~r. Once the distance between two ane models is dened, the grouping of models is done according to the following steps: 1. Find the pair (i; j) such that the distance between the corresponding models is minimum: dist(a i ; a j ) dist(a k ; a l ) 8 (k; l) 6= (i; j) 2. Group a i and a j in a single model, which gives the best estimation of the motion eld of the regions R i and R j. 3. Iterate the process while dist(a i ; a j ) is less than a prespecied threshold. A threshold of 1 pixel is a very reasonable option for most of the sequences. Using this grouping technique, the segmentation algorithm proves to be very ecient as it gives very reliable nal models at a fast convergence rate. As an example, Figure 3 has only needed four iterations to segment the motion eld, starting from a list of ane a) Initial, n = 0. b) n = 1. c) n = 3. d) Post-processed after n = 4. Figure 3: Results of the motion segmentation between frames #1 and #2 of Flower Garden after dierent iterations. White zones represent unassigned pixels 4 RESULTS The robust segmentation technique explained above provides a series of non-overlapped regions that will be used to form the nal layers. Our layer synthesis approach follows very closely to the original one presented in [7]. To prove the feasibility of our approach Figure 6 and Figure 7-b present the reconstruction of three frames of the Flower Garden sequence and the 15th frame of the Mobile Calendar sequence using the approach described above. Although it is not possible to fully appreciate in these still images the improvements of our method over the time domain, the reconstruction of the image it is believed to be of a very good quality. As a very basic example of object manipulation, Figure 7-c shows the same image with the ball removed. Please also notice the quality of the reconstructed image, specially in the location of the ball. All these sequences have been obtained using a rectangular initial partition.
4 a) #5 b) #15 c) #25 Figure 4: Original Flower Garden Sequence a) #5 b) #15 c) #25 Figure 5: Segmentation of Flower Garden a) #5 b) #15 c) #25 Figure 6: Reconstruction of Flower Garden using the proposed robust approach a) Original b) Reconstruction c) Object manipulation Figure 7: Reconstruction of Mobile Calendar and object manipulation using layers
5 a) Wall. b) Calendar. c) Ball. d) Train. Figure 8: Layers extracted from the Mobile Calendar sequence using the manual approach. Static layer not shown. 5 RESULTS USING MANUAL INITIAL SEGMENTATIONS Section 4 has shown some results of the layered representation using as an initial segmentation of the motion segmentation algorithm a decomposition of the image in rectangular regions. The rectangular shape of this rst segmentation has been chosen due to the lack of a better segmentation. This procedure has led to a quite acceptable segmentation of the video sequence in layers and to a good reconstruction of the synthesized image sequence. At this stage, one may wonder which can be the performance of the system if a better approach than the initial rectangular segmentation is selected. In order to nd a perfect representation of a moving image in layers, the rst (and easy) temptation is to manually segment all the images of the whole sequence. Although this may sound as a highly impracticable procedure for the majority of applications, it may provide a bound on the performance of the approach with respect to which other sub optimal approaches might be compared. To that end we have performed the following experiment. Using a graphic editor the Mobile Calendar sequence has been manually segmented and then the same process explained in Sections 2 and 3 has been applied to nd the motion and the corresponding segmentation. Figure 8 shows the extracted layers using this manual approach. It can be observed the almost perfect decomposition of the moving image in layers. Figure 9 shows the corresponding result using the rectangular decomposition of the rst image. It can be appreciated that the rectangular approach provides a much worse representation. Notice for instance that in Figure 9 the calendar and the wall are extracted as a single layer. Figure 12 shows the reconstruction of frames 1, 15 and 30 of the Mobile Calendar sequence using the manual approach. Although when presented as a static result it is dicult to appreciate the dierence, the reconstruction using the automatic segementation (rectangular regions) is worse than the reconstruction using the manual approach. This can be fully ap-
6 a) Wall and Calendar. b) Train. c) Ball. d) Static information. Figure 9: Layers extracted from the Mobile Calendar sequence with the rst image segmented rectangularly. preciated in Figure 10 where the PSNR between the original Mobile Calendar sequence and the reconstructed sequences using the manual segmentation (solid line) and the automatic segmentation (dashed line) are presented. It can be appreciated that the best reconstruction is provided for frame number 15, since it has been selected as reference for the construction of the layers. Similar results are found with other sequences. This experiment has shown that if an adequate segmentation is found, the decomposition of a moving image using layers is a good approach for image representation, object manipulation and video coding. At this stage the following question arises naturally: is it possible to obtain the same results by using a complete automatic segmentation? Our own results of Section 4 show that it is not possible or at least we do not know how to do it. However we can improve the results of Section 4 by performing another experiment. Instead of segmenting manually the whole sequence, what will the results be of the layered representation if we only segment manually the rst image? Figure 11 shows the layers obtained using this semiautomatic approach. Notice that it is still possible to distinguish between the wall and calendar layers, which was not the case with the totally automatic approach. Figure 13 shows the corresponding reconstructed sequence and Figure 14 presents the PSNR of the reconstructed images. For convenience the three reconstructions are shown together. The solid line represents the semiautomatic approach, the top dashed line represents the manual segmentation and the down dashed line the automatic approach. As one might be expecting, the semiautomatic approach lies half way between the corresponding fully manual and automatic approaches. These results show that some improvements may expected in the layered representation of moving images. Instead of segmenting manually the rst image and in order to have a completely automatic approach we are currently working in using an automatic spatial segmentation of the rst image. First results using this approach look promising and we hope to present them very soon.
7 a) Y component. b) U component. c) V component. Figure 10: PSNR between the original Mobile Calendar sequence and the reconstructed sequence using the manual segmentation (solid line) and the automatic segmentation (dashed line) 6 THE LAYERED REPRE- SENTATION OF MOVING IMAGES FOR VIDEO CO- DING The results of the layered approach shown above gives an idea of the potentiality of the representation for video coding applications. To encode an image sequence represented by layers the information that needs to be transmitted is: the intensity maps, the alpha maps, the motion parameters for each layer and possibly an error signal (delta map). Notice that the intensity map can be coded as a still image, although the contours of the layer need also to be transmitted what requires extra bits. Still image compression techniques such as subband, vector quantization or wavelets may be used to encode the arbitrarily shaped intensity map [1]. The shape of the layer may be encoded using a lossless modied chain coded approach or a lossy scheme such as the multigrid technique [4]. For binary alpha maps, the opacity information is already included in the layer shape. The six parameters of the ane motion information for each layer and each frame need also to be encoded. We are currently developing a new scheme to combine in a unique motion parameter set the motion parameters associated to each layer. This approach will further reduce the number of bits associated to each set of parameters of each frame. Finally, for perfect reconstruction, the error signal needs also to be sent. Our rst results show that if the motion information associated to each frame of each layer is good enough, there is no need to send the error signal, what increases the compression ratio. Although we are not ready yet to provide reliable compression results, our rst experiments look very promising what may proof that the layered approach is very suitable for video coding applications. 7 CONCLUSIONS A robust representation of images using layers has been presented in this paper. To that end, robust motion estimation and segmentation techniques have been presented. Examples of image reconstruction and object manipulation have been provided which proves the feasibility of the approach. Dierent type of segmentations have been provided which have given further insight into the performance bounds of the layered approach. It has been shown that the representation of moving images using layers is very promising for object manipulation as well as for video coding. References [1] R. J. Clarke. Digital compression of still images and video. Academic Press, [2] ISO/IEC JTC1/SC29/WG11. MPEG-4 Proposal Package Description (PPD). July [3] F. Pereira. MPEG-4: a new challenge for the representation of audio-visual information. In Picture Coding Symposium, Melbourne, Australia, March [4] P. Salembier, F. Marques, and A. Gasull. Coding of partition sequences. In L. Torres and M. Kunt, editors, Video coding: the second generation approach, pages 125{169. Kluwer Academic Publishers, [5] L. Torres, D. Garca, and A. Mates. A robust motion estimation and segmentation approach to represent moving images with layers. In International Conference on Acoustics, Speech and Signal Processing, Munich, Germany, April [6] L. Torres and M. Kunt. Video coding: the second generation approach. Kluwer Academic Publishers, Englewood Clis, [7] J. Y. A. Wang and E. H. Adelson. Representing moving images with layers. IEEE Transactions on Image Processing, 3(5):625 { 638, September 1994.
8 a) Wall. b) Calendar. c) Ball. d) Train. e) Static textures. Figure 11: Layers extracted from the Mobile Calendar sequence with the rst image segmented manually.
9 a) #1 b) #15 c) #30 Figure 12: Reconstruction of the Mobile Calendar sequence with the whole sequence segmented manually. a) #1 b) #15 c) #30 Figure 13: Reconstruction of the Mobile Calendar sequence with the rst image segmented manually. a) Y component. b) U component. c) V component. Figure 14: PSNR of the reconstructed images. The solid line represents the semiautomatic approach, the top dashed line represents the manual segmentation and the down dashed line the automatic approach
PROJECTION MODELING SIMPLIFICATION MARKER EXTRACTION DECISION. Image #k Partition #k
TEMPORAL STABILITY IN SEQUENCE SEGMENTATION USING THE WATERSHED ALGORITHM FERRAN MARQU ES Dept. of Signal Theory and Communications Universitat Politecnica de Catalunya Campus Nord - Modulo D5 C/ Gran
More informationGeorgios Tziritas Computer Science Department
New Video Coding standards MPEG-4, HEVC Georgios Tziritas Computer Science Department http://www.csd.uoc.gr/~tziritas 1 MPEG-4 : introduction Motion Picture Expert Group Publication 1998 (Intern. Standardization
More informationInitial Partitions. Region. Region Interpolation. 3. Region parametrization. Ordering. Partition Creation. Interpolated Partitions
SEGMENTATION-BASED MORPHOLOGICAL INTERPOLATION OF PARTITION SEQUENCES R. BR EMOND and F. MARQU ES Dept. of Signal Theory and Communications Universitat Politecnica de Catalunya Campus Nord -Modulo D5 C/
More informationRepresenting Moving Images with Layers. J. Y. Wang and E. H. Adelson MIT Media Lab
Representing Moving Images with Layers J. Y. Wang and E. H. Adelson MIT Media Lab Goal Represent moving images with sets of overlapping layers Layers are ordered in depth and occlude each other Velocity
More informationDistortion. Distortion Points on the convex hull Points not on the convex hull. Rate. Rate. R * = R budget. R budget.
Video Sequence Segmentation Based On Rate-Distortion Theory J.R. Morros, F. Marques, M. Pardas, P. Salembier Dept. of Signal Theory and Communications ETSETB -Universitat Politecnica de Catalunya Campus
More informationFigure 6: Hierarchical segmentation by merging. Merging intraframe. Merging intraframe. Initial partition. intraframe
IMAGE SEQUENCE ANALYSIS AND MERGING ALGORITHMS Philippe Salembier, Luis Garrido, David Garcia Universitat Politecnica de Catalunya Campus Nord, Modulo D5 C. Gran Capita, sn 08950 Barcelona, SPAIN E-mail:
More informationPhase2. Phase 1. Video Sequence. Frame Intensities. 1 Bi-ME Bi-ME Bi-ME. Motion Vectors. temporal training. Snake Images. Boundary Smoothing
CIRCULAR VITERBI BASED ADAPTIVE SYSTEM FOR AUTOMATIC VIDEO OBJECT SEGMENTATION I-Jong Lin, S.Y. Kung ijonglin@ee.princeton.edu Princeton University Abstract - Many future video standards such as MPEG-4
More informationPartition definition. Partition coding. Texture coding
IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 5, NO. 6, JUNE 1996 881 Morphological Operators for Image and Video Compression Philippe Salembier, Patrick Brigger, Josep R. Casas and Montse Pardas Abstract
More informationOutline Introduction MPEG-2 MPEG-4. Video Compression. Introduction to MPEG. Prof. Pratikgiri Goswami
to MPEG Prof. Pratikgiri Goswami Electronics & Communication Department, Shree Swami Atmanand Saraswati Institute of Technology, Surat. Outline of Topics 1 2 Coding 3 Video Object Representation Outline
More informationSpatial Scene Level Shape Error Concealment for Segmented Video
Spatial Scene Level Shape Error Concealment for Segmented Video Luis Ducla Soares 1, Fernando Pereira 2 1 Instituto Superior de Ciências do Trabalho e da Empresa Instituto de Telecomunicações, Lisboa,
More informationMotion Estimation for Video Coding Standards
Motion Estimation for Video Coding Standards Prof. Ja-Ling Wu Department of Computer Science and Information Engineering National Taiwan University Introduction of Motion Estimation The goal of video compression
More informationsignal-to-noise ratio (PSNR), 2
u m " The Integration in Optics, Mechanics, and Electronics of Digital Versatile Disc Systems (1/3) ---(IV) Digital Video and Audio Signal Processing ƒf NSC87-2218-E-009-036 86 8 1 --- 87 7 31 p m o This
More informationAn Embedded Wavelet Video Coder. Using Three-Dimensional Set. Partitioning in Hierarchical Trees. Beong-Jo Kim and William A.
An Embedded Wavelet Video Coder Using Three-Dimensional Set Partitioning in Hierarchical Trees (SPIHT) Beong-Jo Kim and William A. Pearlman Department of Electrical, Computer, and Systems Engineering Rensselaer
More information2 Algorithm Description Active contours are initialized using the output of the SUSAN edge detector [10]. Edge runs that contain a reasonable number (
Motion-Based Object Segmentation Using Active Contours Ben Galvin, Kevin Novins, and Brendan McCane Computer Science Department, University of Otago, Dunedin, New Zealand. Abstract: The segmentation of
More informationScalable Multiresolution Video Coding using Subband Decomposition
1 Scalable Multiresolution Video Coding using Subband Decomposition Ulrich Benzler Institut für Theoretische Nachrichtentechnik und Informationsverarbeitung Universität Hannover Appelstr. 9A, D 30167 Hannover
More informationIST MPEG-4 Video Compliant Framework
IST MPEG-4 Video Compliant Framework João Valentim, Paulo Nunes, Fernando Pereira Instituto de Telecomunicações, Instituto Superior Técnico, Av. Rovisco Pais, 1049-001 Lisboa, Portugal Abstract This paper
More informationSample Based Texture extraction for Model based coding
DEPARTMENT OF APPLIED PHYSICS AND ELECTRONICS UMEÅ UNIVERISTY, SWEDEN DIGITAL MEDIA LAB Sample Based Texture extraction for Model based coding Zhengrong Yao 1 Dept. Applied Physics and Electronics Umeå
More informationA MULTIPOINT VIDEOCONFERENCE RECEIVER BASED ON MPEG-4 OBJECT VIDEO. Chih-Kai Chien, Chen-Yu Tsai, and David W. Lin
A MULTIPOINT VIDEOCONFERENCE RECEIVER BASED ON MPEG-4 OBJECT VIDEO Chih-Kai Chien, Chen-Yu Tsai, and David W. Lin Dept. of Electronics Engineering and Center for Telecommunications Research National Chiao
More informationINTERNATIONAL ORGANIZATION FOR STANDARDIZATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO-IEC JTC1/SC29/WG11
INTERNATIONAL ORGANIZATION FOR STANDARDIZATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO-IEC JTC1/SC29/WG11 CODING OF MOVING PICTRES AND ASSOCIATED ADIO ISO-IEC/JTC1/SC29/WG11 MPEG 95/ July 1995
More informationVideo Compression MPEG-4. Market s requirements for Video compression standard
Video Compression MPEG-4 Catania 10/04/2008 Arcangelo Bruna Market s requirements for Video compression standard Application s dependent Set Top Boxes (High bit rate) Digital Still Cameras (High / mid
More informationBI-DIRECTIONAL AFFINE MOTION COMPENSATION USING A CONTENT-BASED, NON-CONNECTED, TRIANGULAR MESH
BI-DIRECTIONAL AFFINE MOTION COMPENSATION USING A CONTENT-BASED, NON-CONNECTED, TRIANGULAR MESH Marc Servais, Theo Vlachos and Thomas Davies University of Surrey, UK; and BBC Research and Development,
More informationMotion-Compensated Subband Coding. Patrick Waldemar, Michael Rauth and Tor A. Ramstad
Video Compression by Three-dimensional Motion-Compensated Subband Coding Patrick Waldemar, Michael Rauth and Tor A. Ramstad Department of telecommunications, The Norwegian Institute of Technology, N-7034
More informationMulti-View Image Coding in 3-D Space Based on 3-D Reconstruction
Multi-View Image Coding in 3-D Space Based on 3-D Reconstruction Yongying Gao and Hayder Radha Department of Electrical and Computer Engineering, Michigan State University, East Lansing, MI 48823 email:
More informationWavelet Transform (WT) & JPEG-2000
Chapter 8 Wavelet Transform (WT) & JPEG-2000 8.1 A Review of WT 8.1.1 Wave vs. Wavelet [castleman] 1 0-1 -2-3 -4-5 -6-7 -8 0 100 200 300 400 500 600 Figure 8.1 Sinusoidal waves (top two) and wavelets (bottom
More informationData Hiding in Video
Data Hiding in Video J. J. Chae and B. S. Manjunath Department of Electrical and Computer Engineering University of California, Santa Barbara, CA 9316-956 Email: chaejj, manj@iplab.ece.ucsb.edu Abstract
More informationEE Multimedia Signal Processing. Scope & Features. Scope & Features. Multimedia Signal Compression VI (MPEG-4, 7)
EE799 -- Multimedia Signal Processing Multimedia Signal Compression VI (MPEG-4, 7) References: 1. http://www.mpeg.org 2. http://drogo.cselt.stet.it/mpeg/ 3. T. Berahimi and M.Kunt, Visual data compression
More informationObject classes. recall (%)
Using Genetic Algorithms to Improve the Accuracy of Object Detection Victor Ciesielski and Mengjie Zhang Department of Computer Science, Royal Melbourne Institute of Technology GPO Box 2476V, Melbourne
More informationThe Video Z-buffer: A Concept for Facilitating Monoscopic Image Compression by exploiting the 3-D Stereoscopic Depth map
The Video Z-buffer: A Concept for Facilitating Monoscopic Image Compression by exploiting the 3-D Stereoscopic Depth map Sriram Sethuraman 1 and M. W. Siegel 2 1 David Sarnoff Research Center, Princeton,
More informationTECHNICAL RESEARCH REPORT
TECHNICAL RESEARCH REPORT An Advanced Image Coding Algorithm that Utilizes Shape- Adaptive DCT for Providing Access to Content by R. Haridasan CSHCN T.R. 97-5 (ISR T.R. 97-16) The Center for Satellite
More informationARTIFICIAL INTELLIGENCE LABORATORY. A.I. Memo No November, K.P. Horn
MASSACHUSETTS INSTITUTE OF TECHNOLOGY ARTIFICIAL INTELLIGENCE LABORATORY A.I. Memo No. 1584 November, 1996 Edge and Mean Based Image Compression Ujjaval Y. Desai, Marcelo M. Mizuki, Ichiro Masaki, and
More informationStorage System. Distributor. Network. Drive. Drive. Storage System. Controller. Controller. Disk. Disk
HRaid: a Flexible Storage-system Simulator Toni Cortes Jesus Labarta Universitat Politecnica de Catalunya - Barcelona ftoni, jesusg@ac.upc.es - http://www.ac.upc.es/hpc Abstract Clusters of workstations
More informationEfficient Image Compression of Medical Images Using the Wavelet Transform and Fuzzy c-means Clustering on Regions of Interest.
Efficient Image Compression of Medical Images Using the Wavelet Transform and Fuzzy c-means Clustering on Regions of Interest. D.A. Karras, S.A. Karkanis and D. E. Maroulis University of Piraeus, Dept.
More informationBluray (
Bluray (http://www.blu-ray.com/faq) MPEG-2 - enhanced for HD, also used for playback of DVDs and HDTV recordings MPEG-4 AVC - part of the MPEG-4 standard also known as H.264 (High Profile and Main Profile)
More informationAn Embedded Wavelet Video. Set Partitioning in Hierarchical. Beong-Jo Kim and William A. Pearlman
An Embedded Wavelet Video Coder Using Three-Dimensional Set Partitioning in Hierarchical Trees (SPIHT) 1 Beong-Jo Kim and William A. Pearlman Department of Electrical, Computer, and Systems Engineering
More informationTENTH WORLD CONGRESS ON THE THEORY OF MACHINE AND MECHANISMS Oulu, Finland, June 20-24, 1999 Finding Innitesimal Motions of Objects in Assemblies Usin
TENTH WORLD CONGRESS ON THE THEORY OF MACHINE AND MECHANISMS Oulu, Finland, June 20-24, 1999 Finding Innitesimal Motions of Objects in Assemblies Using Grassmann-Cayley Algebra E. Staetti, L. Ros, and
More informationReview and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding.
Project Title: Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding. Midterm Report CS 584 Multimedia Communications Submitted by: Syed Jawwad Bukhari 2004-03-0028 About
More informationOptimized Progressive Coding of Stereo Images Using Discrete Wavelet Transform
Optimized Progressive Coding of Stereo Images Using Discrete Wavelet Transform Torsten Palfner, Alexander Mali and Erika Müller Institute of Telecommunications and Information Technology, University of
More informationR-D points Predicted R-D. R-D points Predicted R-D. Distortion (MSE) Distortion (MSE)
A SCENE ADAPTIVE BITRATE CONTROL METHOD IN MPEG VIDEO CODING Myeong-jin Lee, Soon-kak Kwon, and Jae-kyoon Kim Department of Electrical Engineering, KAIST 373-1 Kusong-dong Yusong-gu, Taejon, Korea ABSTRACT
More informationCompression of RADARSAT Data with Block Adaptive Wavelets Abstract: 1. Introduction
Compression of RADARSAT Data with Block Adaptive Wavelets Ian Cumming and Jing Wang Department of Electrical and Computer Engineering The University of British Columbia 2356 Main Mall, Vancouver, BC, Canada
More informationBy Bonemap Extending Isadora s 3D Particles and 3D Model Particles
Extending Isadora s 3D Particles and 3D Model Particles By Bonemap 2018 Page 1 v.1 Extending Isadora s 3D Particles and 3D Model Particles with media instances By Bonemap 2018 One of the more difficult
More informationThanks for slides preparation of Dr. Shawmin Lei, Sharp Labs of America And, Mei-Yun Hsu February Material Sources
An Overview of MPEG4 Thanks for slides preparation of Dr. Shawmin Lei, Sharp Labs of America And, Mei-Yun Hsu February 1999 1 Material Sources The MPEG-4 Tutuorial, San Jose, March 1998 MPEG-4: Context
More informationFlow Estimation. Min Bai. February 8, University of Toronto. Min Bai (UofT) Flow Estimation February 8, / 47
Flow Estimation Min Bai University of Toronto February 8, 2016 Min Bai (UofT) Flow Estimation February 8, 2016 1 / 47 Outline Optical Flow - Continued Min Bai (UofT) Flow Estimation February 8, 2016 2
More informationUniversity of Erlangen-Nuremberg. Cauerstrasse 7, Erlangen, Germany. Abstract
Motion-Based Analysis and Segmentation of Image Sequences using 3-D Scene Models Eckehard Steinbach, Peter Eisert, and Bernd Girod Telecommunications Laboratory, University of Erlangen-Nuremberg Cauerstrasse
More informationBMVC 1996 doi: /c.10.41
On the use of the 1D Boolean model for the description of binary textures M Petrou, M Arrigo and J A Vons Dept. of Electronic and Electrical Engineering, University of Surrey, Guildford GU2 5XH, United
More informationVideo Coding Standards
Based on: Y. Wang, J. Ostermann, and Y.-Q. Zhang, Video Processing and Communications, Prentice Hall, 2002. Video Coding Standards Yao Wang Polytechnic University, Brooklyn, NY11201 http://eeweb.poly.edu/~yao
More informationBit-Plane Decomposition Steganography Using Wavelet Compressed Video
Bit-Plane Decomposition Steganography Using Wavelet Compressed Video Tomonori Furuta, Hideki Noda, Michiharu Niimi, Eiji Kawaguchi Kyushu Institute of Technology, Dept. of Electrical, Electronic and Computer
More informationCluster quality 15. Running time 0.7. Distance between estimated and true means Running time [s]
Fast, single-pass K-means algorithms Fredrik Farnstrom Computer Science and Engineering Lund Institute of Technology, Sweden arnstrom@ucsd.edu James Lewis Computer Science and Engineering University of
More informationEgemen Tanin, Tahsin M. Kurc, Cevdet Aykanat, Bulent Ozguc. Abstract. Direct Volume Rendering (DVR) is a powerful technique for
Comparison of Two Image-Space Subdivision Algorithms for Direct Volume Rendering on Distributed-Memory Multicomputers Egemen Tanin, Tahsin M. Kurc, Cevdet Aykanat, Bulent Ozguc Dept. of Computer Eng. and
More informationChapter 11.3 MPEG-2. MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications:
Chapter 11.3 MPEG-2 MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications: Simple, Main, SNR scalable, Spatially scalable, High, 4:2:2,
More informationHowever, m pq is just an approximation of M pq. As it was pointed out by Lin [2], more precise approximation can be obtained by exact integration of t
FAST CALCULATION OF GEOMETRIC MOMENTS OF BINARY IMAGES Jan Flusser Institute of Information Theory and Automation Academy of Sciences of the Czech Republic Pod vodarenskou vez 4, 82 08 Prague 8, Czech
More informationA PROPOSAL FOR DEPENDENT OPTIMIZATION IN SCALABLE REGION-BASED CODING SYSTEMS
A PROPOSAL FOR DEPENDENT OPTIMIZATION IN SCALABLE REGION-BASED CODING SYSTEMS Josep Ramon Morros and Ferrun Murquks Dept. of Signal Theory and Communications ETSETB - Universitat Politkcnica de Catalunya
More informationImage Segmentation Techniques for Object-Based Coding
Image Techniques for Object-Based Coding Junaid Ahmed, Joseph Bosworth, and Scott T. Acton The Oklahoma Imaging Laboratory School of Electrical and Computer Engineering Oklahoma State University {ajunaid,bosworj,sacton}@okstate.edu
More informationImplication of variable code block size in JPEG 2000 and its VLSI implementation
Implication of variable code block size in JPEG 2000 and its VLSI implementation Ping-Sing Tsai a, Tinku Acharya b,c a Dept. of Computer Science, Univ. of Texas Pan American, 1201 W. Univ. Dr., Edinburg,
More informationLecture 5: Compression I. This Week s Schedule
Lecture 5: Compression I Reading: book chapter 6, section 3 &5 chapter 7, section 1, 2, 3, 4, 8 Today: This Week s Schedule The concept behind compression Rate distortion theory Image compression via DCT
More informationMotion Estimation. Original. enhancement layers. Motion Compensation. Baselayer. Scan-Specific Entropy Coding. Prediction Error.
ON VIDEO SNR SCALABILITY Lisimachos P. Kondi, Faisal Ishtiaq and Aggelos K. Katsaggelos Northwestern University Dept. of Electrical and Computer Engineering 2145 Sheridan Road Evanston, IL 60208 E-Mail:
More informationFrequency Band Coding Mode Selection for Key Frames of Wyner-Ziv Video Coding
2009 11th IEEE International Symposium on Multimedia Frequency Band Coding Mode Selection for Key Frames of Wyner-Ziv Video Coding Ghazaleh R. Esmaili and Pamela C. Cosman Department of Electrical and
More informationWhat is Computer Vision?
Perceptual Grouping in Computer Vision Gérard Medioni University of Southern California What is Computer Vision? Computer Vision Attempt to emulate Human Visual System Perceive visual stimuli with cameras
More information2 F. ZANOGUERA ET AL. are no longer valid. 2. Although some of the techniques proposed can directly be applied to the segmentation of 3D images, no pr
A SEGMENTATION PYRAMID FOR THE INTERACTIVE SEGMENTATION OF 3-D IMAGES AND VIDEO SEQUENCES F. ZANOGUERA, B. MARCOTEGUI and F. MEYER Centre de Morphologie Mathematique - Ecole des Mines de Paris 35, rue
More informationAdaptive Quantization for Video Compression in Frequency Domain
Adaptive Quantization for Video Compression in Frequency Domain *Aree A. Mohammed and **Alan A. Abdulla * Computer Science Department ** Mathematic Department University of Sulaimani P.O.Box: 334 Sulaimani
More informationCompression of Stereo Images using a Huffman-Zip Scheme
Compression of Stereo Images using a Huffman-Zip Scheme John Hamann, Vickey Yeh Department of Electrical Engineering, Stanford University Stanford, CA 94304 jhamann@stanford.edu, vickey@stanford.edu Abstract
More informationMPEG-4 AUTHORING TOOL FOR THE COMPOSITION OF 3D AUDIOVISUAL SCENES
MPEG-4 AUTHORING TOOL FOR THE COMPOSITION OF 3D AUDIOVISUAL SCENES P. Daras I. Kompatsiaris T. Raptis M. G. Strintzis Informatics and Telematics Institute 1,Kyvernidou str. 546 39 Thessaloniki, GREECE
More informationModule 7 VIDEO CODING AND MOTION ESTIMATION
Module 7 VIDEO CODING AND MOTION ESTIMATION Lesson 20 Basic Building Blocks & Temporal Redundancy Instructional Objectives At the end of this lesson, the students should be able to: 1. Name at least five
More informationEfficient Method for Half-Pixel Block Motion Estimation Using Block Differentials
Efficient Method for Half-Pixel Block Motion Estimation Using Block Differentials Tuukka Toivonen and Janne Heikkilä Machine Vision Group Infotech Oulu and Department of Electrical and Information Engineering
More informationANALYSIS OF SPIHT ALGORITHM FOR SATELLITE IMAGE COMPRESSION
ANALYSIS OF SPIHT ALGORITHM FOR SATELLITE IMAGE COMPRESSION K Nagamani (1) and AG Ananth (2) (1) Assistant Professor, R V College of Engineering, Bangalore-560059. knmsm_03@yahoo.com (2) Professor, R V
More informationClassifier C-Net. 2D Projected Images of 3D Objects. 2D Projected Images of 3D Objects. Model I. Model II
Advances in Neural Information Processing Systems 7. (99) The MIT Press, Cambridge, MA. pp.949-96 Unsupervised Classication of 3D Objects from D Views Satoshi Suzuki Hiroshi Ando ATR Human Information
More informationProceedings of the 6th Int. Conf. on Computer Analysis of Images and Patterns. Direct Obstacle Detection and Motion. from Spatio-Temporal Derivatives
Proceedings of the 6th Int. Conf. on Computer Analysis of Images and Patterns CAIP'95, pp. 874-879, Prague, Czech Republic, Sep 1995 Direct Obstacle Detection and Motion from Spatio-Temporal Derivatives
More informationJPEG IMAGE CODING WITH ADAPTIVE QUANTIZATION
JPEG IMAGE CODING WITH ADAPTIVE QUANTIZATION Julio Pons 1, Miguel Mateo 1, Josep Prades 2, Román Garcia 1 Universidad Politécnica de Valencia Spain 1 {jpons,mimateo,roman}@disca.upv.es 2 jprades@dcom.upv.es
More informationUnsupervised Moving Object Edge Segmentation Using Dual-Tree Complex Wavelet Transform
International Journal of Natural and Engineering Sciences 2 (3): 69-74, 2008 ISSN: 307-49, www.nobelonline.net Unsupervised Moving Object Edge Segmentation Using Dual-Tree Complex Wavelet Transform Turgay
More informationJPEG 2000 vs. JPEG in MPEG Encoding
JPEG 2000 vs. JPEG in MPEG Encoding V.G. Ruiz, M.F. López, I. García and E.M.T. Hendrix Dept. Computer Architecture and Electronics University of Almería. 04120 Almería. Spain. E-mail: vruiz@ual.es, mflopez@ace.ual.es,
More information3.1. Solution for white Gaussian noise
Low complexity M-hypotheses detection: M vectors case Mohammed Nae and Ahmed H. Tewk Dept. of Electrical Engineering University of Minnesota, Minneapolis, MN 55455 mnae,tewk@ece.umn.edu Abstract Low complexity
More informationVideo Compression Standards (II) A/Prof. Jian Zhang
Video Compression Standards (II) A/Prof. Jian Zhang NICTA & CSE UNSW COMP9519 Multimedia Systems S2 2009 jzhang@cse.unsw.edu.au Tutorial 2 : Image/video Coding Techniques Basic Transform coding Tutorial
More informationFRAGILE WATERMARKING USING SUBBAND CODING
ICCVG 2002 Zakopane, 25-29 Sept. 2002 Roger ŚWIERCZYŃSKI Institute of Electronics and Telecommunication Poznań University of Technology roger@et.put.poznan.pl FRAGILE WATERMARKING USING SUBBAND CODING
More informationFinal Exam Assigned: 11/21/02 Due: 12/05/02 at 2:30pm
6.801/6.866 Machine Vision Final Exam Assigned: 11/21/02 Due: 12/05/02 at 2:30pm Problem 1 Line Fitting through Segmentation (Matlab) a) Write a Matlab function to generate noisy line segment data with
More information15 Data Compression 2014/9/21. Objectives After studying this chapter, the student should be able to: 15-1 LOSSLESS COMPRESSION
15 Data Compression Data compression implies sending or storing a smaller number of bits. Although many methods are used for this purpose, in general these methods can be divided into two broad categories:
More informationMultiframe Blocking-Artifact Reduction for Transform-Coded Video
276 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 12, NO. 4, APRIL 2002 Multiframe Blocking-Artifact Reduction for Transform-Coded Video Bahadir K. Gunturk, Yucel Altunbasak, and
More informationVIDEO CODING. The Second Generation Approach
VIDEO CODING The Second Generation Approach VIDEO CODING The Second Generation Approach EDITED BY luis Torres Department of Signal Theory and Communications, Universitat Politecnica de Catalunya Barcelona,
More informationReduced Frame Quantization in Video Coding
Reduced Frame Quantization in Video Coding Tuukka Toivonen and Janne Heikkilä Machine Vision Group Infotech Oulu and Department of Electrical and Information Engineering P. O. Box 500, FIN-900 University
More informationMotion Estimation. There are three main types (or applications) of motion estimation:
Members: D91922016 朱威達 R93922010 林聖凱 R93922044 謝俊瑋 Motion Estimation There are three main types (or applications) of motion estimation: Parametric motion (image alignment) The main idea of parametric motion
More informationFilter Banks with Variable System Delay. Georgia Institute of Technology. Abstract
A General Formulation for Modulated Perfect Reconstruction Filter Banks with Variable System Delay Gerald Schuller and Mark J T Smith Digital Signal Processing Laboratory School of Electrical Engineering
More informationA Novel Image Super-resolution Reconstruction Algorithm based on Modified Sparse Representation
, pp.162-167 http://dx.doi.org/10.14257/astl.2016.138.33 A Novel Image Super-resolution Reconstruction Algorithm based on Modified Sparse Representation Liqiang Hu, Chaofeng He Shijiazhuang Tiedao University,
More informationDigital Video Processing
Video signal is basically any sequence of time varying images. In a digital video, the picture information is digitized both spatially and temporally and the resultant pixel intensities are quantized.
More informationA Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm
International Journal of Engineering Research and General Science Volume 3, Issue 4, July-August, 15 ISSN 91-2730 A Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm
More informationUsing Local Trajectory Optimizers To Speed Up Global. Christopher G. Atkeson. Department of Brain and Cognitive Sciences and
Using Local Trajectory Optimizers To Speed Up Global Optimization In Dynamic Programming Christopher G. Atkeson Department of Brain and Cognitive Sciences and the Articial Intelligence Laboratory Massachusetts
More informationFast Color-Embedded Video Coding. with SPIHT. Beong-Jo Kim and William A. Pearlman. Rensselaer Polytechnic Institute, Troy, NY 12180, U.S.A.
Fast Color-Embedded Video Coding with SPIHT Beong-Jo Kim and William A. Pearlman Electrical, Computer and Systems Engineering Dept. Rensselaer Polytechnic Institute, Troy, NY 12180, U.S.A. Tel: (518) 276-6982,
More information( ) ; For N=1: g 1. g n
L. Yaroslavsky Course 51.7211 Digital Image Processing: Applications Lect. 4. Principles of signal and image coding. General principles General digitization. Epsilon-entropy (rate distortion function).
More informationAlgorithms, Probability, and Computing Special Assignment 1 HS17
Institute of Theoretical Computer Science Mohsen Ghaffari, Angelika Steger, David Steurer, Emo Welzl, eter Widmayer Algorithms, robability, and Computing Special Assignment 1 HS17 The solution is due on
More informationFraunhofer Institute for Telecommunications - Heinrich Hertz Institute (HHI)
Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG (ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6) 9 th Meeting: 2-5 September 2003, San Diego Document: JVT-I032d1 Filename: JVT-I032d5.doc Title: Status:
More informationJPEG 2000 compression
14.9 JPEG and MPEG image compression 31 14.9.2 JPEG 2000 compression DCT compression basis for JPEG wavelet compression basis for JPEG 2000 JPEG 2000 new international standard for still image compression
More informationDifferential Compression and Optimal Caching Methods for Content-Based Image Search Systems
Differential Compression and Optimal Caching Methods for Content-Based Image Search Systems Di Zhong a, Shih-Fu Chang a, John R. Smith b a Department of Electrical Engineering, Columbia University, NY,
More informationNeTra-V: Towards an Object-based Video Representation
Proc. of SPIE, Storage and Retrieval for Image and Video Databases VI, vol. 3312, pp 202-213, 1998 NeTra-V: Towards an Object-based Video Representation Yining Deng, Debargha Mukherjee and B. S. Manjunath
More informationIT Digital Image ProcessingVII Semester - Question Bank
UNIT I DIGITAL IMAGE FUNDAMENTALS PART A Elements of Digital Image processing (DIP) systems 1. What is a pixel? 2. Define Digital Image 3. What are the steps involved in DIP? 4. List the categories of
More informationTechniques. IDSIA, Istituto Dalle Molle di Studi sull'intelligenza Articiale. Phone: Fax:
Incorporating Learning in Motion Planning Techniques Luca Maria Gambardella and Marc Haex IDSIA, Istituto Dalle Molle di Studi sull'intelligenza Articiale Corso Elvezia 36 - CH - 6900 Lugano Phone: +41
More informationDigital Image Stabilization and Its Integration with Video Encoder
Digital Image Stabilization and Its Integration with Video Encoder Yu-Chun Peng, Hung-An Chang, Homer H. Chen Graduate Institute of Communication Engineering National Taiwan University Taipei, Taiwan {b889189,
More informationUniversity of Bristol - Explore Bristol Research. Link to published version (if available): /s
Anantrasirichai, N., Canagarajah, C. N., Redmill, D. W., Akbari, A. S., & Bull, D. (2011). Colour volumetric compression for realistic view synthesis applications. Multimedia Tools and Applications, 53(1),
More informationCMPT 365 Multimedia Systems. Media Compression - Video Coding Standards
CMPT 365 Multimedia Systems Media Compression - Video Coding Standards Spring 2017 Edited from slides by Dr. Jiangchuan Liu CMPT365 Multimedia Systems 1 Video Coding Standards H.264/AVC CMPT365 Multimedia
More informationInteroperable Content-based Access of Multimedia in Digital Libraries
Interoperable Content-based Access of Multimedia in Digital Libraries John R. Smith IBM T. J. Watson Research Center 30 Saw Mill River Road Hawthorne, NY 10532 USA ABSTRACT Recent academic and commercial
More informationDATA hiding [1] and watermarking in digital images
14 IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, VOL. 6, NO. 1, MARCH 2011 Data Hiding in Motion Vectors of Compressed Video Based on Their Associated Prediction Error Hussein A. Aly, Member,
More informationRhinoCFD Tutorial. Flow Past a Sphere
RhinoCFD Tutorial Flow Past a Sphere RhinoCFD Ocial document produced by CHAM September 26, 2017 Introduction Flow Past a Sphere This tutorial will describe a simple calculation of ow around a sphere and
More informationIEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 6, NO. 11, NOVEMBER
IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 6, NO. 11, NOVEMBER 1997 1487 A Video Compression Scheme with Optimal Bit Allocation Among Segmentation, Motion, and Residual Error Guido M. Schuster, Member,
More informationA Hierarchical Statistical Framework for the Segmentation of Deformable Objects in Image Sequences Charles Kervrann and Fabrice Heitz IRISA / INRIA -
A hierarchical statistical framework for the segmentation of deformable objects in image sequences Charles Kervrann and Fabrice Heitz IRISA/INRIA, Campus Universitaire de Beaulieu, 35042 Rennes Cedex,
More information