Gwenaëlle MARQUANT, Stéphane PATEUX and Claude LABIT IRISA/INRIA - Campus Universitaire de Beaulieu RENNES Cedex - France

Size: px
Start display at page:

Download "Gwenaëlle MARQUANT, Stéphane PATEUX and Claude LABIT IRISA/INRIA - Campus Universitaire de Beaulieu RENNES Cedex - France"

Transcription

1 Mesh-Based Scalable Video Coding with Rate-Distortion Optimization Gwenaëlle MARQUANT, Stéphane PATEUX and Claude LABIT IRISA/INRIA - Campus Universitaire de Beaulieu RENNES Cedex - France ABSTRACT In this paper, we present a mesh-based motion estimation scheme for image sequence. Nodal motion vectors optimization is performed by using a multi-resolution differential method. Because our final aim is mesh tracking throughout a video sequence with optimized reconstruction, neither backward tracking nor forward tracking is well suited. One motivation of our work is to take advantage of both forward tracking (which enables tracking) and backward tracking (for its efficiency) in a backward in forward method. For the optimization of the nodal motion vectors, we also propose a novel approach with multi-resolution and several hierarchy levels, which, in addition, makes it possible scalable representation. This is achieved with a progressive representation defined according to a rate distortion criterion. Results are presented to illustrate the proposed methods. Keywords: Object-based video coding, Mesh-based motion tracking, Rate/distortion optimization, Mixed hierarchical/multiresolution motion estimation, Scalability. 1. INTRODUCTION Recent development in video coding research deals with the use of a time-varying mesh for video representation and associated mesh-based model for motion description. New functionalities of video application, such as search and editing, but also transmission or storage ability, require a flexible framework of video representation. Mesh-based representation has been more and more employed for these purposes. Various approaches have emerged recently in order to deal with hierarchical and/or adaptive and/or progressive meshes ( 1{3 ). In addition, such mesh-based motion estimation schemes allow to characterize local deformations between two video frames more accurately, that is, without blocking artifacts. Moreover, transmitted bit-rates have to be reduced as much as possible in order to adapt to the network available bandwidth. For this purpose, most previous works in inter-frame coding deal with adaptive node sampling according to image content ( 1 ). Nevertheless, in the proposed approaches, no optimization of a compromise between distortion and bitrate is considered: the representation coding cost is often stated but not taken into account as a constraint. Compared to these methods, our paper presents a motion estimation scheme for image sequences using a triangular mesh and nodal motion vectors optimization, further improved by a backward in forward method. Our approach proposes for considering a coupled hierarchical/multi-resolution representation of 2D meshes. In addition, this representation is adaptive in such a way that its criterion optimizes both the coding cost and the tracked image rendering. Even if our major aim is mesh-based motion estimation, we impose several constraints: motion estimation accuracy, compression efficiency, suitability for a high level content-based representation, quality scalability, progressive representation. Further author information: G. Marquant: gwenaelle.marquant@irisa.fr S. Pateux: stephane.pateux@irisa.fr C. Labit: claude.labit@irisa.fr

2 This paper is organized as follows. In section 2, we present the mesh and the constraints that are requested. Then, in section 3, motion estimation with the backward in forward solution is explained. Section 4 reports the optimization of motion vectors. Section 5 explains how to get a scalable mesh representation. Finally, the last section presents experimental results and concluding remarks. 2. MESH MODEL DEFINITION Image rendering coding performances improvement needs obviously a data simplification while assuming a maximum subjective quality by mean of an accurate motion estimation. For several reasons (reported in 4 ), the most adapted mesh seems to be an adaptive hierarchical regular mesh: it combines easy coding of a regular triangulation with heterogeneity of irregular mesh nodes Mesh model The mesh we chose follows from previous works and is motivated by several constraints we are requested: motion description accuracy, low bit cost, scalability by making use of an embedded representation and adaptive location of nodes points that places more motion vectors over complex motion areas. Thanks to lots of discussions in literature, all these reasons lead us to choosing a regular triangular mesh because of its lower coding cost. Practically, the first image is coded using a quincunx triangular mesh on which an affine interpolation is achieved. In addition, such a mesh is hierarchical and adaptive. This hierarchy is adaptively refined by regular splitting according to a rate/distortion criterion, while enabling a simple topological embedding of various levels triangles (cf Fig.1 and 6). a b c Figure 1. (a)initial mesh, (b) first level of balanced mesh before deformation due to motion estimation, (c) Final adaptive mesh before deformation due to motion estimation Scalability We want our method to be scalable because it seems to be very useful to enable access to a given precision level. Furthermore, this level can be adapted to the image content, to the desired representation, to the admissible level with respect to the available network bandwidth, or to a chosen optimization criterion (rate/distortion for instance). With this scalability aim in view, we are able to increase representation quality of the motion estimation across mesh levels by locally adapting data density. Node points sampling rate is adapted to the local motion complexity of the image. In other words, our method makes it possible to control the motion vector density, control the mesh quality and low complexity by the use of the coarse-to-fine approach Triangulation splitting criterion In our approach, the hierarchical adaptive mesh is obtained by successive triangles splittings according to a simple criterion. Various solutions have already been studied to increase node density in critical areas: for more details, reader can report to 4. However, none of these techniques guarantees to obtain optimal results in the rate/distortion sense. Logically, we decided to split a triangle at level i if several successive splittings up to level i max lead to a significant fall of the squared error. As a result, this criterion takes into account the squared errors magnitudes SE and it can be considered according to a relative (rate-variation / distortion-variation) criterion for adaptive mesh reconstruction.

3 3. THE BACKWARD IN FORWARD SOLUTION 3.1. Classical motion estimation techniques for meshes Two methods allow to predict deformations of a mesh throughout a video sequence: forward or backward mesh tracking. On the one hand, forward tracking requires the mesh at time t?1 and looks for its matching position in frame t. On the other hand, backward tracking requires initial position of the mesh at t and looks for its matching position in frame t?1. The choice between backward or forward node tracking deserves to look at the pros and cons about each method. A lot of mesh based motion estimation methods have been widely tested: various block matching methods ( 5 ), differential methods in forward tracking ( 1 ) and backward tracking ( 6 ). Thanks to its easy implementation, the forward node tracking is the most widely used method in order to predict an image. However, this tracking scheme suffers from a few drawbacks: the estimation aim is to optimize the reconstruction at time t?1 even though we want to reconstruct t. Alternately, backward tracking optimizes reconstruction at t but involves knowing the mesh at time t. Therefore, in backward no tracking is feasible while keeping the same structure, unless all the sequence is available and the last frame can be considered as the one with the initial mesh, which is backward tracked back to the first frame. For all these reasons, we propose to improve the forward tracking by analyzing at time t?1 (as in the forward scheme), while simulating the analysis at t (backward) The backward in forward In forward tracking, we aim at minimizing: E F = P = P (x;y)2 t?1 (x;y)2 t?1 (I(x; y; t?1)? I(x +dx; y +dy; t)) 2? F (x; y; t? 1) 2 (1) where denotes the image domain definition or the object mask (cf.4.2) and I(x +dx; y +dy; t) denotes the intensity evaluated at the pixel (x; y) displaced by the motion vector (dx; dy). Alternately, backward tracking aims at reconstructing the current image t from the previous one t?1, and this tracking method supposes that mesh (t) is known. In backward, we aim at minimizing: P E B = = P (x 0 ;y 0 )2 t (I(x 0 ; y 0 ; t)? I(x 0?dx; y 0?dy; t?1)) 2 (x 0 ;y 0 )2 t Equations (1) and (2) can also be written in the continuous case: E B = ZZ? B (x 0 ; y 0 ; t) 2 (2)? B (x 0 ; y 0 ; t) 2 dst S t and E F = ZZ S t?1? F (x; y; t? 1) 2 dst?1 where ds t?1 and ds t represent the surface element, that is to say one pixel. Moreover, F (x; y; t) = B (x 0?dx; y 0?dy; t?1). This implies that the integration over S t can be accomplished over ds t?1 by a change of variables, as ZZ? B 2 ZZ? (x; y; t) dst = F (x 0 ; y 0 2 ; t?1) J(x 0 ; y 0 )ds t?1 (3) S t S t?1 where J(x 0 ; y 0 ) is the Jacobian of the affine transformation between the two related triangles and is defined as J(x 0 ; y 0 )

4 Equation (3) looks like the computation of the error in a forward mapping, excepted here the Jacobian which is easy to compute. Finally, equation (3) can be written in the discrete form: E B = X (x;y)2 t?1? F (x; y; t? 1) 2 J(x 0 ; y 0 ) The Jacobian describes the infinitesimal area ratio of the deformed triangle to the original at point (x; y). Consequently, it can be shown that J(x 0 ; y 0 ) measures the ratio between post- and pre-deformations triangles. To conclude, nodal tracking algorithms are developed by minimizing the prediction errors measured over the original triangle at time (t?1). Wang and Lee 7 used a similar change of variable with the use of a mapping function into a master element between the two patches to match. However, they concluded that the two Jacobians were equal, and for this reason did not use further the area ratio. Furthermore, we propose to integrate at time (t?1) rather than at an other triangle that may imply sampling problems Affine warping 4. OPTIMIZATION OF MOTION VECTORS To get the optimum node positions for the grids of the current and the previous frames, we have to perform a spatial transformation between patches of these two grids in order to predict as good as possible the current frame. The matching transformation is applied independently for each patch. Several warpings are commonly used: bilinear between quadrangles, affine between triangles,... However, some assumptions about the mapping transform can be pointed out as the fact that the interiors of the patches will be mapped to the corresponding interiors in the prediction frame using a geometrical transform. Such assumptions guarantee that the resulting prediction frame, reconstructed by transforming the patches in the previous image to the patches in the current one, will be spatially continuous, which is of great importance in the forward matching process. This transform involves finding the spatially corresponding position of each pixel from an image to another one. In this way, the resulting coordinates are generally not integers and a final bilinear interpolation is used in order to get the pixel value. In our study, we use a triangular mesh which implies affine warpings between triangular patches Motion estimation method Scalable transmission and representation of video is actually of major importance, in order to be adapted to the available bandwidth of the network. In this way, we propose to use several hierarchical levels for motion estimation. In order to minimize equation (3), we use a Gauss Newton algorithm. Moreover, a multi-resolution strategy is used. This implementation is optimized over a Burt and Adelson multi-resolution pyramid (Fig.2) from coarse to fine level, according to the hierarchical levels. In this manner, large displacements are taken into account and mesh overlappings are reduced by motion estimation only on triangles with a sufficient area at the processed resolution. So, our solution enables to get the optimal motion adapted to the received hierarchical level, and not to the complete sent hierarchy. level L motion optimization low pass filtering downsampling projection motion optimization projection level 0 motion optimization Figure 2. multi-resolution strategy Fig.3 shows various ways that can be performed for motion estimation : by mean of multi-resolution over the image, for a given mesh ( 6 ) : method (a) in figure 3; by mean of hierarchical meshes over the image, at full image resolution : method (b) in figure 3;

5 by mean of mixed hierarchical/multi-resolution solutions over the image : method (c) and (d) in figure 3. Nevertheless, we can notice that solutions (c) and (d) have different functionalities: (c) makes it possible to have available full resolution image reconstruction for each hierarchy level and consequently adaptive reconstruction is feasible; during the motion estimation process, (d) takes better into account the triangle area (due to the hierarchical level) according to the resolution level. However, this solution only provides a full resolution image reconstruction for the last hierarchy level which does not make possible scalable transmission. HIERARCHY b c RESOLUTION H0R0 (full resolution) a H1R0 d H2R0 H0R1 H1R1 H0R2 Figure 3. Combination of multi-resolution and hierarchy levels 4.3. Definition of the optimization problem For each position (x; y) in images I 1 and I 2 at time t 1 and t 2 we associate grey level values I 1 (x 1 ; y 1 ) and I 2 (x 2 ; y 2 ). If we consider the basic initial assumption that the brightness of a particular moving point is constant in time, so, if dx and dy represent the motion of a point (x; y) between I 1 and I 2, the reconstructed image b I2 satisfies : 8(x; y) b I2 (x; y) = I 1 (x + dx; y + dy) (4) We introduce a mesh-based motion estimation scheme for image sequence : given the motion of the node points, we are able to reconstruct the image. The major problem consists in estimating as well as possible the motion vectors between frames in order to encode and reconstruct an image given the previous one. Because there is a mutual influence between mesh nodes, we can express the displacements as a weighted sum over the displacements of the nodes j : dx(x; y) = P j w j (x; y)dx j and dy(x; y) = P j w j(x; y)dy j. The functions w j represents the barycentric coordinate of the j th vertex. It is a function of all possible positions (x; y) in the image support. It can be pointed out that: w j (x; y) = = 0 If (x; y) 62 node j surrounding hexagon 6= 0 Else In this manner, considering that functions have limited supports, the optimization constraint of the j th node value only depends on a limited number of coefficients in the surrounding hexagon around this node. If w j and w i relative supports do not intersect, then w j w i = 0. Consequently, the system to solve is a sparse matrix and can be quickly processed with iterative techniques. In order to optimize the problem, we have to minimize the error E(x; y) = I 2 (x; y)? b I2 (x; y) over the whole image, that is min? X (x;y)2i 2 E 2 (x; y) {z } E The Gauss Newton algorithm leads to the following system to solve : (5)

6 j = 0,, X I 2 (x; y)? I2 b (x; y) {z } (x;y)2 I 2 V ariation to linearize X (x;y)2 I 2 X?! 5 x I 1 (x + dx; y + dy) w j (x; y) = 0?!5 x I 1 (x + dx; y + dy) w j (x; y) w l (x; y) l 5 x I 1 (x + dx; y + dy) dx l + 5 y I 1 (x + dx; y + dy) dy l = X (x;y)2 I 2 w j (x; y) 5 x I 1 (x + dx; y + dy) I 2 (x; y)? b I k 2 (x; y) (6) Furthermore, on both sides of these equations, we can add the jacobian term J(x; y) introduced by the backward in forward method. Such an hierarchical/multi-resolution optimization leads to significant quality improvements: multi-resolution takes into account large displacements, differential approach outperforms block matching methods and the Jacobian approach also improves usual backward or forward tracking methods. Furthermore, this can be achieved by a fast iterative solving of the large sparse linear system above mentioned (equation 6). It can be pointed out that our error minimization only handles pixels inside the video object mask. This implies that motion estimation is object-based. Additionally, a constraint region ( 8,1 ) is defined for each node to avoid mesh overlappings. Unlike other existing solutions ( 1,3 ), we propose the node locations at coarser levels not to be updated, even though refreshment is performed at finer hierarchy levels to increase motion accuracy. So, our solution enables to get the optimal motion adapted to the received hierarchical level, and not to the complete hierarchy sent. In this way, successive mesh levels in the hierarchy are not geometrically embedded but topologically embedded. Implementations show that this solution allows to outperform mono-resolution and/or single level motion estimation and is adapted to progressive/scalable transmission. 5. OPTIMUM RECONSTRUCTION IN THE RATE DISTORTION SENSE In order to control this quality scalability, we also propose to take the rate-distortion criterion into account (details in 4 ). This is of key importance while transmitted bit-rates have to be reduced as much as possible in order to adapt to the network capabilities. Therefore, this additional representation scheme authorizes to reconstruct the video sequence as best suited anytime, for a given rate or distortion Adaptive mesh Thanks to lots of discussions in literature ( 4,3 ), we have chosen a regular triangular mesh because of its lower coding cost. This mesh is then adaptively refined by regular triangles splitting according to an error criterion, in an embedded pyramid way. This criterion in inter frame tracking is the companion piece to the one we have introduced in intra-frame coding ( 4 ). This method allows control of the node density in high motion areas, control of the mesh quality and low complexity. As regards the adaptive concept, we suggest to split a triangle if its iterative division - thus more motion vectors in this area - leads to a significant fall of its squared error between the considered level and the finest level, that is, less errors in motion estimation. Thus, the adequate criterion is a compromise between this and the splitting coding cost. Classically, in a rate/distortion approach, a Lagrangian multipliers technique is used. It amounts to minimize a R + D criterion for a given, where R is the bit rate and D is the distortion 9. The best suited is then searched to reach a desired bit rate or quality. Our method is not Lagrangian multipliers-based and avoids the -search step. We propose an iterative method by using at each step the splitting that maximizes the criterion jdj R. This technique can be interpreted as a Lagrangian minimization scheme, with a progressive rise of (in the Lagrangian minimization, we want R + D < 0, that is jdj R > 1, thus a sorting according to a criterion jdj R ). Consequently, each representation level is defined by the list of successive triangles to split. Therefore, each representation level contains all size triangles. Given this triangles divisions sorting, the coder is able to adapt to the desired quality or to the available capabilities of the network. For a given representation level, the progressive coding can be improved by interlacing the structure informations and nodes motion informations: firstly, nodes values in the initial grid are transmitted, then, for each division information, new created nodes values are transmitted. Therefore, mesh triangles are sorted in a specific way that authorizes to reconstruct the image at best anytime, for a given rate or for a desired quality.

7 5.2. Adaptive warping The image is reconstructed once the mesh is positioned. It can be pointed out that when we have a triangle that keeps close to finer level triangles (for instance (c) in figure 1, with a regular embedded mesh), the affine used interpolation ensures C 0 continuity between triangles by using fictitious triangles in the larger triangle to interpolate continuously displacements with respect to the smaller triangles that stand opposite to it (cf figure 4). With regard to new nodes in following hierarchy levels, a b Figure 4. (a) Original hierarchical adaptive mesh. (b) Original hierarchical adaptive mesh and the added fictitious triangles (dotted lines) : this ensures C 0 continuity across different hierarchy levels triangles. new nodes in hierarchy h + 1 located in the middle of an arc between two nodes in hierarchy h : its initial motion (in hierarchy h + 1) is set equal to the mean of its two parent nodes in hierarchy h. Obviously, in a rate/distortion context, the transmitted information must be as relevant as possible. That is the reason why an adaptive mesh is gradually transmitted thanks to a triangles sorting. This triangles sorting is obtained by comparing the influence of splitting one triangle into four smaller ones to others feasible splittings (cf subsection 5.1). However, when one triangle has been chosen for division, its six children nodes can move. It involves that non-split neighbor triangles are influenced and put out of shape. Figure 5 shows that affine mapping for the original ABC triangle must be performed with four simultaneous smaller ones triangles Age, efg, Bef and fgc. B B e f C A C A g C Figure 5. (on the left) Original triangle ABC. (on the right) Original triangle ABC influenced by neighbor smaller triangles : in order to reconstruct the image, triangle ABC has to be split into 4 smaller fictitious ones : Aeg, efg, Bef and fgc. Figure 6 illustrates the embedded point of view : when one triangle is divided into children triangles (whom are adjusted to better predict motion in this area), triangles A eg, efg, B ef and fgc are said topologically embedded in ABC even if they are not geometrically embedded. 6. EXPERIMENTAL RESULTS This section shows the experimental results we obtained for all the methods we have just presented, where an initial uniform triangular mesh is used. We measured the peak signal-to-noise ratio of warped frames with respect to the original frames The backward in forward method We compare the performances of forward and backward in forward tracking. The visual tracking results are demonstrated in figure 7 which represents the original Lena frame (considered as frame at time t? 1) and an artificial warping of Lena

8 e A B g C f Figure 6. (on the left) Original mesh with hierarchy 0. (in the middle) Adaptive mesh and the influence area when the central triangle has been split. (on the right) Adaptive mesh : Original mesh hierarchy 0 with children triangles with hierarchy 1 for one triangle. Fictitious triangles (dotted lines) ensure C 0 continuity across different hierarchy levels triangles. forward backward-forward mono multi mono multi artificial-lena Suzie Table 1. Mesh tracking results for artificial warped Lena frame and Suzie (frames 51-52). Each entry gives the PSNR (db) of a rendered video frame, using either mono-resolution or multi-resolution (three levels) motion estimation, for the two different estimation schemes. The entries are the results obtained at the finest full-detail level. (considered as frame at time t) generated with a non-piecewise affine motion field. The tracked meshes are overlaid on the original frame at time t?1 and the motion compensated frame at time t. The results show that backward in forward tracking is better than forward tracking in case there is complex motion in the video sequence, and at least equal if motion is simple. In the same way, this scheme is improved by making use of multi-resolution, especially to deal with large motion.to evaluate the performance of the multi-resolution versus mono-resolution motion estimation, and forward versus backward in forward, we measured the peak signal-to-noise ratio of warped frames with respect to the original frames. Different PSNR results for artificial-warped-lena and frames from the Suzie sequence are summarized in Table 1. Figure 7. Tracked meshes are superimposed on the original frame at time t? 1 and the motion compensated frame at time t. On the left: image Lena with an artificial warping, on the right: images Suzie In our experiments, we do not ensure special conditions for node points on border areas. In other words, we consider that if a node point lies on a corner or an edge of an image, it does not mean necessarily that this node has to stay at the same place or to move along the edge in the following frame: objects can appear and disappear in an image (zoom, background movement,...). Since the reconstructed image can not always be predicted near borders, we build our mesh slightly smaller than the full frame dimension. Moreover, we compute the PSNR on a smaller area (10 pixels smaller) than the full mesh (because it can move from the image border and uncover areas).

9 Coast-guard Table-tennis H0R0 = full resolution, no hierarchy a = multi-resolution, no hierarchy b = hierarchical, full image resolution c = mix a+b Table 2. Mesh tracking results for Coast-guard (frames ) and Table-tennis (frames 1-2). Each entry gives the PSNR (db) of a rendered video frame, using different triangle sizes (172 or 43 pixels large). Each row corresponds with various approaches pointed out in Fig.3. A maximum of 3 hierarchical levels and 3 resolution levels are used. H0R0 means no hierarchy and full resolution Mixed hierarchical/multi-resolution approach Different mixed hierarchical/multi-resolution approaches and results are summarized in Fig.3 and Table 2. First of all, we can notice that multi-resolution improves accuracy in mesh-based motion estimation. However, it is all the more pronounced as motion range is large (see Table 2). Similarly, making use of several hierarchy levels increases performances since it allows to refine motion estimation in high activity areas. Let us notice that, for all these methods, the number of triangles can be reduced in case of quite rigid objects without significant quality loss. Consequently, dealing with large triangles means few node points to encode, which is of major interest given our scalability constraints. Whereas most classical methods would use much more triangles to stay into the object, Table 2 reports good results using only 10 triangles for coast-guard, results that are not really improved (0:24dB) by successive hierarchical levels. Furthermore, we notice that obtained results with triangles of side of 172 pixels after 3 successive divisions (thus 43 pixels large)(raws a,b,c and column 172 in table 2) are better than those obtained from initial triangles with side if 43 pixels (raw H0R0, column 43): it is due to a better initialization transmitted to successive hierarchical levels. Various tests have reported that solution (c) provides the best results compared to (a) and (b). As for methods (c) and (d), method (c) provides motion estimations and reconstructions at various levels of hierarchy with full image resolution, thus (c) allows progressive data reconstruction unlike method (d). In the same way, these schemes are slightly improved by making use of backward in forward. Finally, using jointly multi-resolution and hierarchy appears very efficient and accurate even if the corresponding performances mainly depend on the object internal properties: rigidity and motion amplitude. Finally, the motion compensation can be transmitted at various levels of details according to the available adaptive representation thanks to progressive rate distortion splittings and sortings. 7. CONCLUSION All things considered, our motion estimation is obtained from a multi-resolution and hierarchical method while taking the rate-distortion criterion into account. A novel adapted tracking scheme backward in forward has been presented to take advantage of both forward and backward tracking. Results obtained show the relevance of the proposed approach. Such an implementation works for the constraints we are requested. Particularly, progressive/scalable transmission is now feasible, adapted to the available bandwidth of the network. REFERENCES 1. P. van Beek, A. Tekalp, N. Zhuang, I. Celasun, and M. Xia, Hierarchical 2D mesh representation, tracking and compression for object-based video, IEEE Trans. on Circ. and Syst. for Video Tech. 9, p. 353, march special issue. 2. C. L. B. Jordan, T. Ebrahimi, and M. Kunt, Progressive mesh-based coding of arbitrary-shaped video objects, in SPIE VCIP, pp , january P. Lechat, N. Laurent, and H. Sanson, Scalable image coding with fine granularity based on hierarchical mesh, in SPIE VCIP, january G. Marquant, S. Pateux, and C. Labit, Mesh-based scalable image coding with rate-distortion optimization, in Image and Video Communications and Processing 2000, (San Jose, USA), Jan J. Nieweglowski, T. G. Campbell, and P. Haavisto, A novel video coding scheme based on temporal prediction using digital image warping, in IEEE Transactions on Consumer Electronics, vol. 39, pp , August 1993.

10 6. P. Lechat, M. Ropert, and H. Sanson, Hierarchical mesh-based motion estimation using a differential approach and application to video coding, in EUSIPCO 98, vol. 4, pp , sept Y. Wang and O. Lee, Use of two-dimensional deformable mesh structures for video coding, part i-the synthesis problem: mesh-based function approximation and mapping, in IEEE Transactions on circuits and systems for video technology, vol. 6, pp , december N. Zhuang, P. van Beek, I. Celasun, and A. M. Tekalp, Hierarchical 2D content-based mesh tracking for video object, in SPIE VCIP, pp , january D. Tzovaras, S. Vachtsevanos, and M. Strintzis, Optimization of wireframe model adaptation and motion estimation in a rate-distortion framework, International journal of Imaging systems and technology 9, pp , 1998.

Tracking of video objects using a backward projection technique

Tracking of video objects using a backward projection technique Tracking of video objects using a backward projection technique Stéphane Pateux IRISA/INRIA, Temics Project Campus Universitaire de Beaulieu 35042 Rennes Cedex, FRANCE ABSTRACT In this paper, we present

More information

A Low Bit-Rate Video Codec Based on Two-Dimensional Mesh Motion Compensation with Adaptive Interpolation

A Low Bit-Rate Video Codec Based on Two-Dimensional Mesh Motion Compensation with Adaptive Interpolation IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 11, NO. 1, JANUARY 2001 111 A Low Bit-Rate Video Codec Based on Two-Dimensional Mesh Motion Compensation with Adaptive Interpolation

More information

Mesh Based Interpolative Coding (MBIC)

Mesh Based Interpolative Coding (MBIC) Mesh Based Interpolative Coding (MBIC) Eckhart Baum, Joachim Speidel Institut für Nachrichtenübertragung, University of Stuttgart An alternative method to H.6 encoding of moving images at bit rates below

More information

Compression of Light Field Images using Projective 2-D Warping method and Block matching

Compression of Light Field Images using Projective 2-D Warping method and Block matching Compression of Light Field Images using Projective 2-D Warping method and Block matching A project Report for EE 398A Anand Kamat Tarcar Electrical Engineering Stanford University, CA (anandkt@stanford.edu)

More information

SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC

SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC Randa Atta, Rehab F. Abdel-Kader, and Amera Abd-AlRahem Electrical Engineering Department, Faculty of Engineering, Port

More information

Context based optimal shape coding

Context based optimal shape coding IEEE Signal Processing Society 1999 Workshop on Multimedia Signal Processing September 13-15, 1999, Copenhagen, Denmark Electronic Proceedings 1999 IEEE Context based optimal shape coding Gerry Melnikov,

More information

Data Partitioning. Figure 1-31: Communication Topologies. Regular Partitions

Data Partitioning. Figure 1-31: Communication Topologies. Regular Partitions Data In single-program multiple-data (SPMD) parallel programs, global data is partitioned, with a portion of the data assigned to each processing node. Issues relevant to choosing a partitioning strategy

More information

Motion Estimation for Video Coding Standards

Motion Estimation for Video Coding Standards Motion Estimation for Video Coding Standards Prof. Ja-Ling Wu Department of Computer Science and Information Engineering National Taiwan University Introduction of Motion Estimation The goal of video compression

More information

BI-DIRECTIONAL AFFINE MOTION COMPENSATION USING A CONTENT-BASED, NON-CONNECTED, TRIANGULAR MESH

BI-DIRECTIONAL AFFINE MOTION COMPENSATION USING A CONTENT-BASED, NON-CONNECTED, TRIANGULAR MESH BI-DIRECTIONAL AFFINE MOTION COMPENSATION USING A CONTENT-BASED, NON-CONNECTED, TRIANGULAR MESH Marc Servais, Theo Vlachos and Thomas Davies University of Surrey, UK; and BBC Research and Development,

More information

Fast image warping using adaptive partial matching

Fast image warping using adaptive partial matching Fast image warping using adaptive partial matching Dong-Keun Lim Samsung Electronics Company Limited Digital Media Research and Development Center 416 Maetan-3Dong Paldal-Gu, Suwon Kyungi-Do, 442-742 Korea

More information

Fine grain scalable video coding using 3D wavelets and active meshes

Fine grain scalable video coding using 3D wavelets and active meshes Fine grain scalable video coding using 3D wavelets and active meshes Nathalie Cammas a,stéphane Pateux b a France Telecom RD,4 rue du Clos Courtel, Cesson-Sévigné, France b IRISA, Campus de Beaulieu, Rennes,

More information

Comparative Study of Partial Closed-loop Versus Open-loop Motion Estimation for Coding of HDTV

Comparative Study of Partial Closed-loop Versus Open-loop Motion Estimation for Coding of HDTV Comparative Study of Partial Closed-loop Versus Open-loop Motion Estimation for Coding of HDTV Jeffrey S. McVeigh 1 and Siu-Wai Wu 2 1 Carnegie Mellon University Department of Electrical and Computer Engineering

More information

SIGNAL COMPRESSION. 9. Lossy image compression: SPIHT and S+P

SIGNAL COMPRESSION. 9. Lossy image compression: SPIHT and S+P SIGNAL COMPRESSION 9. Lossy image compression: SPIHT and S+P 9.1 SPIHT embedded coder 9.2 The reversible multiresolution transform S+P 9.3 Error resilience in embedded coding 178 9.1 Embedded Tree-Based

More information

Modified SPIHT Image Coder For Wireless Communication

Modified SPIHT Image Coder For Wireless Communication Modified SPIHT Image Coder For Wireless Communication M. B. I. REAZ, M. AKTER, F. MOHD-YASIN Faculty of Engineering Multimedia University 63100 Cyberjaya, Selangor Malaysia Abstract: - The Set Partitioning

More information

A New Configuration of Adaptive Arithmetic Model for Video Coding with 3D SPIHT

A New Configuration of Adaptive Arithmetic Model for Video Coding with 3D SPIHT A New Configuration of Adaptive Arithmetic Model for Video Coding with 3D SPIHT Wai Chong Chia, Li-Minn Ang, and Kah Phooi Seng Abstract The 3D Set Partitioning In Hierarchical Trees (SPIHT) is a video

More information

ELEC Dr Reji Mathew Electrical Engineering UNSW

ELEC Dr Reji Mathew Electrical Engineering UNSW ELEC 4622 Dr Reji Mathew Electrical Engineering UNSW Review of Motion Modelling and Estimation Introduction to Motion Modelling & Estimation Forward Motion Backward Motion Block Motion Estimation Motion

More information

VIDEO OBJECT SEGMENTATION BY EXTENDED RECURSIVE-SHORTEST-SPANNING-TREE METHOD. Ertem Tuncel and Levent Onural

VIDEO OBJECT SEGMENTATION BY EXTENDED RECURSIVE-SHORTEST-SPANNING-TREE METHOD. Ertem Tuncel and Levent Onural VIDEO OBJECT SEGMENTATION BY EXTENDED RECURSIVE-SHORTEST-SPANNING-TREE METHOD Ertem Tuncel and Levent Onural Electrical and Electronics Engineering Department, Bilkent University, TR-06533, Ankara, Turkey

More information

A 3-D Virtual SPIHT for Scalable Very Low Bit-Rate Embedded Video Compression

A 3-D Virtual SPIHT for Scalable Very Low Bit-Rate Embedded Video Compression A 3-D Virtual SPIHT for Scalable Very Low Bit-Rate Embedded Video Compression Habibollah Danyali and Alfred Mertins University of Wollongong School of Electrical, Computer and Telecommunications Engineering

More information

Multiresolution Meshes. COS 526 Tom Funkhouser, Fall 2016 Slides by Guskov, Praun, Sweldens, etc.

Multiresolution Meshes. COS 526 Tom Funkhouser, Fall 2016 Slides by Guskov, Praun, Sweldens, etc. Multiresolution Meshes COS 526 Tom Funkhouser, Fall 2016 Slides by Guskov, Praun, Sweldens, etc. Motivation Huge meshes are difficult to render store transmit edit Multiresolution Meshes! [Guskov et al.]

More information

Dense Motion Field Reduction for Motion Estimation

Dense Motion Field Reduction for Motion Estimation Dense Motion Field Reduction for Motion Estimation Aaron Deever Center for Applied Mathematics Cornell University Ithaca, NY 14853 adeever@cam.cornell.edu Sheila S. Hemami School of Electrical Engineering

More information

Coding for the Network: Scalable and Multiple description coding Marco Cagnazzo

Coding for the Network: Scalable and Multiple description coding Marco Cagnazzo Coding for the Network: Scalable and Multiple description coding Marco Cagnazzo Overview Examples and motivations Scalable coding for network transmission Techniques for multiple description coding 2 27/05/2013

More information

Overview: motion estimation. Differential motion estimation

Overview: motion estimation. Differential motion estimation Overview: motion estimation Differential methods Fast algorithms for Sub-pel accuracy Rate-constrained motion estimation Bernd Girod: EE368b Image Video Compression Motion Estimation no. 1 Differential

More information

Local Image Registration: An Adaptive Filtering Framework

Local Image Registration: An Adaptive Filtering Framework Local Image Registration: An Adaptive Filtering Framework Gulcin Caner a,a.murattekalp a,b, Gaurav Sharma a and Wendi Heinzelman a a Electrical and Computer Engineering Dept.,University of Rochester, Rochester,

More information

MOTION ESTIMATION WITH THE REDUNDANT WAVELET TRANSFORM.*

MOTION ESTIMATION WITH THE REDUNDANT WAVELET TRANSFORM.* MOTION ESTIMATION WITH THE REDUNDANT WAVELET TRANSFORM.* R. DeVore A. Petukhov R. Sharpley Department of Mathematics University of South Carolina Columbia, SC 29208 Abstract We present a fast method for

More information

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding.

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding. Project Title: Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding. Midterm Report CS 584 Multimedia Communications Submitted by: Syed Jawwad Bukhari 2004-03-0028 About

More information

Optimized Progressive Coding of Stereo Images Using Discrete Wavelet Transform

Optimized Progressive Coding of Stereo Images Using Discrete Wavelet Transform Optimized Progressive Coding of Stereo Images Using Discrete Wavelet Transform Torsten Palfner, Alexander Mali and Erika Müller Institute of Telecommunications and Information Technology, University of

More information

Depth Estimation for View Synthesis in Multiview Video Coding

Depth Estimation for View Synthesis in Multiview Video Coding MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Depth Estimation for View Synthesis in Multiview Video Coding Serdar Ince, Emin Martinian, Sehoon Yea, Anthony Vetro TR2007-025 June 2007 Abstract

More information

arxiv: v1 [math.na] 20 Sep 2016

arxiv: v1 [math.na] 20 Sep 2016 arxiv:1609.06236v1 [math.na] 20 Sep 2016 A Local Mesh Modification Strategy for Interface Problems with Application to Shape and Topology Optimization P. Gangl 1,2 and U. Langer 3 1 Doctoral Program Comp.

More information

Express Letters. A Simple and Efficient Search Algorithm for Block-Matching Motion Estimation. Jianhua Lu and Ming L. Liou

Express Letters. A Simple and Efficient Search Algorithm for Block-Matching Motion Estimation. Jianhua Lu and Ming L. Liou IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 7, NO. 2, APRIL 1997 429 Express Letters A Simple and Efficient Search Algorithm for Block-Matching Motion Estimation Jianhua Lu and

More information

5.7. Fractal compression Overview

5.7. Fractal compression Overview 5.7. Fractal compression Overview 1. Introduction 2. Principles 3. Encoding 4. Decoding 5. Example 6. Evaluation 7. Comparison 8. Literature References 1 Introduction (1) - General Use of self-similarities

More information

EECS 556 Image Processing W 09. Interpolation. Interpolation techniques B splines

EECS 556 Image Processing W 09. Interpolation. Interpolation techniques B splines EECS 556 Image Processing W 09 Interpolation Interpolation techniques B splines What is image processing? Image processing is the application of 2D signal processing methods to images Image representation

More information

Image Interpolation using Collaborative Filtering

Image Interpolation using Collaborative Filtering Image Interpolation using Collaborative Filtering 1,2 Qiang Guo, 1,2,3 Caiming Zhang *1 School of Computer Science and Technology, Shandong Economic University, Jinan, 250014, China, qguo2010@gmail.com

More information

OPTIMIZED MULTIPLE DESCRIPTION SCALAR QUANTIZATION BASED 3D MESH CODING

OPTIMIZED MULTIPLE DESCRIPTION SCALAR QUANTIZATION BASED 3D MESH CODING OPTIMIZED MULTIPLE DESCRIPTION SCALAR QUANTIZATION BASED 3D MESH CODING M. Oguz Bici 1, Gozde Bozdagi Akar 1, Andrey Norkin 2 and Atanas Gotchev 2 1 Middle East Technical University, Ankara, Turkey 2 Department

More information

CSC Computer Graphics

CSC Computer Graphics // CSC. Computer Graphics Lecture Kasun@dscs.sjp.ac.lk Department of Computer Science University of Sri Jayewardanepura Polygon Filling Scan-Line Polygon Fill Algorithm Span Flood-Fill Algorithm Inside-outside

More information

Coding of Coefficients of two-dimensional non-separable Adaptive Wiener Interpolation Filter

Coding of Coefficients of two-dimensional non-separable Adaptive Wiener Interpolation Filter Coding of Coefficients of two-dimensional non-separable Adaptive Wiener Interpolation Filter Y. Vatis, B. Edler, I. Wassermann, D. T. Nguyen and J. Ostermann ABSTRACT Standard video compression techniques

More information

Finite Element Method. Chapter 7. Practical considerations in FEM modeling

Finite Element Method. Chapter 7. Practical considerations in FEM modeling Finite Element Method Chapter 7 Practical considerations in FEM modeling Finite Element Modeling General Consideration The following are some of the difficult tasks (or decisions) that face the engineer

More information

Lecture 2 September 3

Lecture 2 September 3 EE 381V: Large Scale Optimization Fall 2012 Lecture 2 September 3 Lecturer: Caramanis & Sanghavi Scribe: Hongbo Si, Qiaoyang Ye 2.1 Overview of the last Lecture The focus of the last lecture was to give

More information

A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames

A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames Ki-Kit Lai, Yui-Lam Chan, and Wan-Chi Siu Centre for Signal Processing Department of Electronic and Information Engineering

More information

Using temporal seeding to constrain the disparity search range in stereo matching

Using temporal seeding to constrain the disparity search range in stereo matching Using temporal seeding to constrain the disparity search range in stereo matching Thulani Ndhlovu Mobile Intelligent Autonomous Systems CSIR South Africa Email: tndhlovu@csir.co.za Fred Nicolls Department

More information

Video Coding Using Spatially Varying Transform

Video Coding Using Spatially Varying Transform Video Coding Using Spatially Varying Transform Cixun Zhang 1, Kemal Ugur 2, Jani Lainema 2, and Moncef Gabbouj 1 1 Tampere University of Technology, Tampere, Finland {cixun.zhang,moncef.gabbouj}@tut.fi

More information

Video Alignment. Literature Survey. Spring 2005 Prof. Brian Evans Multidimensional Digital Signal Processing Project The University of Texas at Austin

Video Alignment. Literature Survey. Spring 2005 Prof. Brian Evans Multidimensional Digital Signal Processing Project The University of Texas at Austin Literature Survey Spring 2005 Prof. Brian Evans Multidimensional Digital Signal Processing Project The University of Texas at Austin Omer Shakil Abstract This literature survey compares various methods

More information

FRAME-RATE UP-CONVERSION USING TRANSMITTED TRUE MOTION VECTORS

FRAME-RATE UP-CONVERSION USING TRANSMITTED TRUE MOTION VECTORS FRAME-RATE UP-CONVERSION USING TRANSMITTED TRUE MOTION VECTORS Yen-Kuang Chen 1, Anthony Vetro 2, Huifang Sun 3, and S. Y. Kung 4 Intel Corp. 1, Mitsubishi Electric ITA 2 3, and Princeton University 1

More information

Fast and Effective Interpolation Using Median Filter

Fast and Effective Interpolation Using Median Filter Fast and Effective Interpolation Using Median Filter Jian Zhang 1, *, Siwei Ma 2, Yongbing Zhang 1, and Debin Zhao 1 1 Department of Computer Science, Harbin Institute of Technology, Harbin 150001, P.R.

More information

Overview: motion-compensated coding

Overview: motion-compensated coding Overview: motion-compensated coding Motion-compensated prediction Motion-compensated hybrid coding Motion estimation by block-matching Motion estimation with sub-pixel accuracy Power spectral density of

More information

Video Transcoding Architectures and Techniques: An Overview. IEEE Signal Processing Magazine March 2003 Present by Chen-hsiu Huang

Video Transcoding Architectures and Techniques: An Overview. IEEE Signal Processing Magazine March 2003 Present by Chen-hsiu Huang Video Transcoding Architectures and Techniques: An Overview IEEE Signal Processing Magazine March 2003 Present by Chen-hsiu Huang Outline Background & Introduction Bit-rate Reduction Spatial Resolution

More information

A Novel Statistical Distortion Model Based on Mixed Laplacian and Uniform Distribution of Mpeg-4 FGS

A Novel Statistical Distortion Model Based on Mixed Laplacian and Uniform Distribution of Mpeg-4 FGS A Novel Statistical Distortion Model Based on Mixed Laplacian and Uniform Distribution of Mpeg-4 FGS Xie Li and Wenjun Zhang Institute of Image Communication and Information Processing, Shanghai Jiaotong

More information

Dynamic Obstacle Detection Based on Background Compensation in Robot s Movement Space

Dynamic Obstacle Detection Based on Background Compensation in Robot s Movement Space MATEC Web of Conferences 95 83 (7) DOI:.5/ matecconf/79583 ICMME 6 Dynamic Obstacle Detection Based on Background Compensation in Robot s Movement Space Tao Ni Qidong Li Le Sun and Lingtao Huang School

More information

Scanning Real World Objects without Worries 3D Reconstruction

Scanning Real World Objects without Worries 3D Reconstruction Scanning Real World Objects without Worries 3D Reconstruction 1. Overview Feng Li 308262 Kuan Tian 308263 This document is written for the 3D reconstruction part in the course Scanning real world objects

More information

Moving Object Segmentation Method Based on Motion Information Classification by X-means and Spatial Region Segmentation

Moving Object Segmentation Method Based on Motion Information Classification by X-means and Spatial Region Segmentation IJCSNS International Journal of Computer Science and Network Security, VOL.13 No.11, November 2013 1 Moving Object Segmentation Method Based on Motion Information Classification by X-means and Spatial

More information

Redundancy and Correlation: Temporal

Redundancy and Correlation: Temporal Redundancy and Correlation: Temporal Mother and Daughter CIF 352 x 288 Frame 60 Frame 61 Time Copyright 2007 by Lina J. Karam 1 Motion Estimation and Compensation Video is a sequence of frames (images)

More information

Block-Matching based image compression

Block-Matching based image compression IEEE Ninth International Conference on Computer and Information Technology Block-Matching based image compression Yun-Xia Liu, Yang Yang School of Information Science and Engineering, Shandong University,

More information

DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS

DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS Television services in Europe currently broadcast video at a frame rate of 25 Hz. Each frame consists of two interlaced fields, giving a field rate of 50

More information

Module 7 VIDEO CODING AND MOTION ESTIMATION

Module 7 VIDEO CODING AND MOTION ESTIMATION Module 7 VIDEO CODING AND MOTION ESTIMATION Lesson 20 Basic Building Blocks & Temporal Redundancy Instructional Objectives At the end of this lesson, the students should be able to: 1. Name at least five

More information

Subdivision Of Triangular Terrain Mesh Breckon, Chenney, Hobbs, Hoppe, Watts

Subdivision Of Triangular Terrain Mesh Breckon, Chenney, Hobbs, Hoppe, Watts Subdivision Of Triangular Terrain Mesh Breckon, Chenney, Hobbs, Hoppe, Watts MSc Computer Games and Entertainment Maths & Graphics II 2013 Lecturer(s): FFL (with Gareth Edwards) Fractal Terrain Based on

More information

coding of various parts showing different features, the possibility of rotation or of hiding covering parts of the object's surface to gain an insight

coding of various parts showing different features, the possibility of rotation or of hiding covering parts of the object's surface to gain an insight Three-Dimensional Object Reconstruction from Layered Spatial Data Michael Dangl and Robert Sablatnig Vienna University of Technology, Institute of Computer Aided Automation, Pattern Recognition and Image

More information

Reduced Frame Quantization in Video Coding

Reduced Frame Quantization in Video Coding Reduced Frame Quantization in Video Coding Tuukka Toivonen and Janne Heikkilä Machine Vision Group Infotech Oulu and Department of Electrical and Information Engineering P. O. Box 500, FIN-900 University

More information

3D Mesh Sequence Compression Using Thin-plate Spline based Prediction

3D Mesh Sequence Compression Using Thin-plate Spline based Prediction Appl. Math. Inf. Sci. 10, No. 4, 1603-1608 (2016) 1603 Applied Mathematics & Information Sciences An International Journal http://dx.doi.org/10.18576/amis/100440 3D Mesh Sequence Compression Using Thin-plate

More information

One-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain

One-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain Author manuscript, published in "International Symposium on Broadband Multimedia Systems and Broadcasting, Bilbao : Spain (2009)" One-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain

More information

IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 6, NO. 11, NOVEMBER

IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 6, NO. 11, NOVEMBER IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 6, NO. 11, NOVEMBER 1997 1487 A Video Compression Scheme with Optimal Bit Allocation Among Segmentation, Motion, and Residual Error Guido M. Schuster, Member,

More information

Feature Tracking and Optical Flow

Feature Tracking and Optical Flow Feature Tracking and Optical Flow Prof. D. Stricker Doz. G. Bleser Many slides adapted from James Hays, Derek Hoeim, Lana Lazebnik, Silvio Saverse, who 1 in turn adapted slides from Steve Seitz, Rick Szeliski,

More information

Model-Aided Coding: A New Approach to Incorporate Facial Animation into Motion-Compensated Video Coding

Model-Aided Coding: A New Approach to Incorporate Facial Animation into Motion-Compensated Video Coding 344 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 10, NO. 3, APRIL 2000 Model-Aided Coding: A New Approach to Incorporate Facial Animation into Motion-Compensated Video Coding Peter

More information

Bit-Plane Decomposition Steganography Using Wavelet Compressed Video

Bit-Plane Decomposition Steganography Using Wavelet Compressed Video Bit-Plane Decomposition Steganography Using Wavelet Compressed Video Tomonori Furuta, Hideki Noda, Michiharu Niimi, Eiji Kawaguchi Kyushu Institute of Technology, Dept. of Electrical, Electronic and Computer

More information

A Novel Image Super-resolution Reconstruction Algorithm based on Modified Sparse Representation

A Novel Image Super-resolution Reconstruction Algorithm based on Modified Sparse Representation , pp.162-167 http://dx.doi.org/10.14257/astl.2016.138.33 A Novel Image Super-resolution Reconstruction Algorithm based on Modified Sparse Representation Liqiang Hu, Chaofeng He Shijiazhuang Tiedao University,

More information

Stereo Image Compression

Stereo Image Compression Stereo Image Compression Deepa P. Sundar, Debabrata Sengupta, Divya Elayakumar {deepaps, dsgupta, divyae}@stanford.edu Electrical Engineering, Stanford University, CA. Abstract In this report we describe

More information

Clustering Part 4 DBSCAN

Clustering Part 4 DBSCAN Clustering Part 4 Dr. Sanjay Ranka Professor Computer and Information Science and Engineering University of Florida, Gainesville DBSCAN DBSCAN is a density based clustering algorithm Density = number of

More information

Image Coding with Active Appearance Models

Image Coding with Active Appearance Models Image Coding with Active Appearance Models Simon Baker, Iain Matthews, and Jeff Schneider CMU-RI-TR-03-13 The Robotics Institute Carnegie Mellon University Abstract Image coding is the task of representing

More information

EUSIPCO A SPACE-VARIANT CUBIC-SPLINE INTERPOLATION

EUSIPCO A SPACE-VARIANT CUBIC-SPLINE INTERPOLATION EUSIPCO 213 1569744341 A SPACE-VARIAN CUBIC-SPLINE INERPOLAION Jianxing Jiang, Shaohua Hong, Lin Wang Department of Communication Engineering, Xiamen University, Xiamen, Fujian, 3615, P.R. China. ABSRAC

More information

ANALYSIS OF SPIHT ALGORITHM FOR SATELLITE IMAGE COMPRESSION

ANALYSIS OF SPIHT ALGORITHM FOR SATELLITE IMAGE COMPRESSION ANALYSIS OF SPIHT ALGORITHM FOR SATELLITE IMAGE COMPRESSION K Nagamani (1) and AG Ananth (2) (1) Assistant Professor, R V College of Engineering, Bangalore-560059. knmsm_03@yahoo.com (2) Professor, R V

More information

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework System Modeling and Implementation of MPEG-4 Encoder under Fine-Granular-Scalability Framework Final Report Embedded Software Systems Prof. B. L. Evans by Wei Li and Zhenxun Xiao May 8, 2002 Abstract Stream

More information

Wavelet Based Image Compression Using ROI SPIHT Coding

Wavelet Based Image Compression Using ROI SPIHT Coding International Journal of Information & Computation Technology. ISSN 0974-2255 Volume 1, Number 2 (2011), pp. 69-76 International Research Publications House http://www.irphouse.com Wavelet Based Image

More information

A new predictive image compression scheme using histogram analysis and pattern matching

A new predictive image compression scheme using histogram analysis and pattern matching University of Wollongong Research Online University of Wollongong in Dubai - Papers University of Wollongong in Dubai 00 A new predictive image compression scheme using histogram analysis and pattern matching

More information

2014 Summer School on MPEG/VCEG Video. Video Coding Concept

2014 Summer School on MPEG/VCEG Video. Video Coding Concept 2014 Summer School on MPEG/VCEG Video 1 Video Coding Concept Outline 2 Introduction Capture and representation of digital video Fundamentals of video coding Summary Outline 3 Introduction Capture and representation

More information

Compression of Stereo Images using a Huffman-Zip Scheme

Compression of Stereo Images using a Huffman-Zip Scheme Compression of Stereo Images using a Huffman-Zip Scheme John Hamann, Vickey Yeh Department of Electrical Engineering, Stanford University Stanford, CA 94304 jhamann@stanford.edu, vickey@stanford.edu Abstract

More information

CODING METHOD FOR EMBEDDING AUDIO IN VIDEO STREAM. Harri Sorokin, Jari Koivusaari, Moncef Gabbouj, and Jarmo Takala

CODING METHOD FOR EMBEDDING AUDIO IN VIDEO STREAM. Harri Sorokin, Jari Koivusaari, Moncef Gabbouj, and Jarmo Takala CODING METHOD FOR EMBEDDING AUDIO IN VIDEO STREAM Harri Sorokin, Jari Koivusaari, Moncef Gabbouj, and Jarmo Takala Tampere University of Technology Korkeakoulunkatu 1, 720 Tampere, Finland ABSTRACT In

More information

Chapter 7 Practical Considerations in Modeling. Chapter 7 Practical Considerations in Modeling

Chapter 7 Practical Considerations in Modeling. Chapter 7 Practical Considerations in Modeling CIVL 7/8117 1/43 Chapter 7 Learning Objectives To present concepts that should be considered when modeling for a situation by the finite element method, such as aspect ratio, symmetry, natural subdivisions,

More information

Motion Vector Estimation Search using Hexagon-Diamond Pattern for Video Sequences, Grid Point and Block-Based

Motion Vector Estimation Search using Hexagon-Diamond Pattern for Video Sequences, Grid Point and Block-Based Motion Vector Estimation Search using Hexagon-Diamond Pattern for Video Sequences, Grid Point and Block-Based S. S. S. Ranjit, S. K. Subramaniam, S. I. Md Salim Faculty of Electronics and Computer Engineering,

More information

CHAPTER 9 INPAINTING USING SPARSE REPRESENTATION AND INVERSE DCT

CHAPTER 9 INPAINTING USING SPARSE REPRESENTATION AND INVERSE DCT CHAPTER 9 INPAINTING USING SPARSE REPRESENTATION AND INVERSE DCT 9.1 Introduction In the previous chapters the inpainting was considered as an iterative algorithm. PDE based method uses iterations to converge

More information

Quality versus Intelligibility: Evaluating the Coding Trade-offs for American Sign Language Video

Quality versus Intelligibility: Evaluating the Coding Trade-offs for American Sign Language Video Quality versus Intelligibility: Evaluating the Coding Trade-offs for American Sign Language Video Frank Ciaramello, Jung Ko, Sheila Hemami School of Electrical and Computer Engineering Cornell University,

More information

Efficient Method for Half-Pixel Block Motion Estimation Using Block Differentials

Efficient Method for Half-Pixel Block Motion Estimation Using Block Differentials Efficient Method for Half-Pixel Block Motion Estimation Using Block Differentials Tuukka Toivonen and Janne Heikkilä Machine Vision Group Infotech Oulu and Department of Electrical and Information Engineering

More information

BLOCK MATCHING-BASED MOTION COMPENSATION WITH ARBITRARY ACCURACY USING ADAPTIVE INTERPOLATION FILTERS

BLOCK MATCHING-BASED MOTION COMPENSATION WITH ARBITRARY ACCURACY USING ADAPTIVE INTERPOLATION FILTERS 4th European Signal Processing Conference (EUSIPCO ), Florence, Italy, September 4-8,, copyright by EURASIP BLOCK MATCHING-BASED MOTION COMPENSATION WITH ARBITRARY ACCURACY USING ADAPTIVE INTERPOLATION

More information

Guidelines for proper use of Plate elements

Guidelines for proper use of Plate elements Guidelines for proper use of Plate elements In structural analysis using finite element method, the analysis model is created by dividing the entire structure into finite elements. This procedure is known

More information

A New Technique of Extraction of Edge Detection Using Digital Image Processing

A New Technique of Extraction of Edge Detection Using Digital Image Processing International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) A New Technique of Extraction of Edge Detection Using Digital Image Processing Balaji S.C.K 1 1, Asst Professor S.V.I.T Abstract:

More information

Reconstruction of complete 3D object model from multi-view range images.

Reconstruction of complete 3D object model from multi-view range images. Header for SPIE use Reconstruction of complete 3D object model from multi-view range images. Yi-Ping Hung *, Chu-Song Chen, Ing-Bor Hsieh, Chiou-Shann Fuh Institute of Information Science, Academia Sinica,

More information

An Efficient Mode Selection Algorithm for H.264

An Efficient Mode Selection Algorithm for H.264 An Efficient Mode Selection Algorithm for H.64 Lu Lu 1, Wenhan Wu, and Zhou Wei 3 1 South China University of Technology, Institute of Computer Science, Guangzhou 510640, China lul@scut.edu.cn South China

More information

University of Florida CISE department Gator Engineering. Clustering Part 4

University of Florida CISE department Gator Engineering. Clustering Part 4 Clustering Part 4 Dr. Sanjay Ranka Professor Computer and Information Science and Engineering University of Florida, Gainesville DBSCAN DBSCAN is a density based clustering algorithm Density = number of

More information

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 9, NO. 7, OCTOBER

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 9, NO. 7, OCTOBER IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 9, NO. 7, OCTOBER 1999 1115 Low Bit-Rate Video Coding with Implicit Multiscale Segmentation Seung Chul Yoon, Krishna Ratakonda, and

More information

Chapter 1. Introduction

Chapter 1. Introduction Introduction 1 Chapter 1. Introduction We live in a three-dimensional world. Inevitably, any application that analyzes or visualizes this world relies on three-dimensional data. Inherent characteristics

More information

Semi-Hierarchical Based Motion Estimation Algorithm for the Dirac Video Encoder

Semi-Hierarchical Based Motion Estimation Algorithm for the Dirac Video Encoder Semi-Hierarchical Based Motion Estimation Algorithm for the Dirac Video Encoder M. TUN, K. K. LOO, J. COSMAS School of Engineering and Design Brunel University Kingston Lane, Uxbridge, UB8 3PH UNITED KINGDOM

More information

Dense Image-based Motion Estimation Algorithms & Optical Flow

Dense Image-based Motion Estimation Algorithms & Optical Flow Dense mage-based Motion Estimation Algorithms & Optical Flow Video A video is a sequence of frames captured at different times The video data is a function of v time (t) v space (x,y) ntroduction to motion

More information

Reduction of Blocking artifacts in Compressed Medical Images

Reduction of Blocking artifacts in Compressed Medical Images ISSN 1746-7659, England, UK Journal of Information and Computing Science Vol. 8, No. 2, 2013, pp. 096-102 Reduction of Blocking artifacts in Compressed Medical Images Jagroop Singh 1, Sukhwinder Singh

More information

Multi-View Image Coding in 3-D Space Based on 3-D Reconstruction

Multi-View Image Coding in 3-D Space Based on 3-D Reconstruction Multi-View Image Coding in 3-D Space Based on 3-D Reconstruction Yongying Gao and Hayder Radha Department of Electrical and Computer Engineering, Michigan State University, East Lansing, MI 48823 email:

More information

*Faculty of Electrical Engineering, University of Belgrade, Serbia and Montenegro ** Technical University of Lodz, Institute of Electronics, Poland

*Faculty of Electrical Engineering, University of Belgrade, Serbia and Montenegro ** Technical University of Lodz, Institute of Electronics, Poland Terrain-adaptive infrared line-scan coding: A new image segmentation scheme by D. Milovanovic *, A. Marincic * and B. Wiecek ** *Faculty of Electrical Engineering, University of Belgrade, Serbia and Montenegro

More information

Predictive Interpolation for Registration

Predictive Interpolation for Registration Predictive Interpolation for Registration D.G. Bailey Institute of Information Sciences and Technology, Massey University, Private bag 11222, Palmerston North D.G.Bailey@massey.ac.nz Abstract Predictive

More information

Image Inpainting by Hyperbolic Selection of Pixels for Two Dimensional Bicubic Interpolations

Image Inpainting by Hyperbolic Selection of Pixels for Two Dimensional Bicubic Interpolations Image Inpainting by Hyperbolic Selection of Pixels for Two Dimensional Bicubic Interpolations Mehran Motmaen motmaen73@gmail.com Majid Mohrekesh mmohrekesh@yahoo.com Mojtaba Akbari mojtaba.akbari@ec.iut.ac.ir

More information

MOTION COMPENSATION IN BLOCK DCT CODING BASED ON PERSPECTIVE WARPING

MOTION COMPENSATION IN BLOCK DCT CODING BASED ON PERSPECTIVE WARPING MOTION COMPENSATION IN BLOCK DCT CODING BASED ON PERSPECTIVE WARPING L. Capodiferro*, S. Puledda*, G. Jacovitti** * Fondazione Ugo Bordoni c/o ISPT, Viale Europa 190, 00149 Rome, Italy Tel: +39-6-54802132;

More information

Sample Based Texture extraction for Model based coding

Sample Based Texture extraction for Model based coding DEPARTMENT OF APPLIED PHYSICS AND ELECTRONICS UMEÅ UNIVERISTY, SWEDEN DIGITAL MEDIA LAB Sample Based Texture extraction for Model based coding Zhengrong Yao 1 Dept. Applied Physics and Electronics Umeå

More information

Image Error Concealment Based on Watermarking

Image Error Concealment Based on Watermarking Image Error Concealment Based on Watermarking Shinfeng D. Lin, Shih-Chieh Shie and Jie-Wei Chen Department of Computer Science and Information Engineering,National Dong Hwa Universuty, Hualien, Taiwan,

More information

Finite Element Analysis Prof. Dr. B. N. Rao Department of Civil Engineering Indian Institute of Technology, Madras. Lecture - 36

Finite Element Analysis Prof. Dr. B. N. Rao Department of Civil Engineering Indian Institute of Technology, Madras. Lecture - 36 Finite Element Analysis Prof. Dr. B. N. Rao Department of Civil Engineering Indian Institute of Technology, Madras Lecture - 36 In last class, we have derived element equations for two d elasticity problems

More information

Space Filling Curves and Hierarchical Basis. Klaus Speer

Space Filling Curves and Hierarchical Basis. Klaus Speer Space Filling Curves and Hierarchical Basis Klaus Speer Abstract Real world phenomena can be best described using differential equations. After linearisation we have to deal with huge linear systems of

More information

Optimal Estimation for Error Concealment in Scalable Video Coding

Optimal Estimation for Error Concealment in Scalable Video Coding Optimal Estimation for Error Concealment in Scalable Video Coding Rui Zhang, Shankar L. Regunathan and Kenneth Rose Department of Electrical and Computer Engineering University of California Santa Barbara,

More information