MCTF and Scalability Extension of H.264/AVC and its Application to Video Transmission, Storage, and Surveillance

Size: px
Start display at page:

Download "MCTF and Scalability Extension of H.264/AVC and its Application to Video Transmission, Storage, and Surveillance"

Transcription

1 MCTF and Scalability Extension of H.264/AVC and its Application to Video Transmission, Storage, and Surveillance Ralf Schäfer, Heiko Schwarz, Detlev Marpe, Thomas Schierl, and Thomas Wiegand * Fraunhofer Institute for Telecommunications Heinrich Hertz Institute (HHI), Image Processing Department, Einsteinufer 37, Berlin, Germany ABSTRACT The extension of H.264/AVC hybrid video coding towards scalable video coding (SVC) using motion-compensated temporal filtering (MCTF) is presented. Utilizing the lifting approach to implement MCTF, the motion compensation features of H.264/AVC can be re-used for the MCTF step and extended in a straightforward way for the MCTF step. The MCTF extension of H.264/AVC is also incorporated into a video codec that provides SNR, spatial, and (similar to hybrid video coding) temporal scalability. The paper provides a description of these techniques and presents experimental results that validate their efficiency. In addition applications of SVC to video transmission and video surveillance are described. Keywords: Scalable video coding, video coding standards, H.264, MPEG-4 AVC 1. INTRODUCTION The scalable extension of H.264/AVC as proposed in [1] has been chosen to be the starting point [2] of MPEG s Scalable Video Coding (SVC) standardization project in October In January 2005, MPEG and the Video Coding Experts Group (VCEG) of the ITU-T agreed to jointly finalize the SVC project as an Amendment of their H.264/AVC standard [3][4], and the scalable extension of H.264/AVC was selected as the first Working Draft [5]. The Working Draft provides a specification of the bit-stream syntax and the decoding process, the reference encoding process is described in the Joint Scalable Video Model (JSVM0) [6]. The corresponding JSVM 0 software [7] can be downloaded at the following web site: The basic design idea of the JSVM is to extend the hybrid video coding approach of H.264/AVC towards motioncompensated temporal filtering (MCTF) [8][9] by using a lifting framework [10]. Because lifting is invertible, any motion compensation (MC) technique can be incorporated into the and steps of the filter bank. By using the highly efficient motion model of H.264/AVC in conjunction with a block-adaptive switching between the Haar and the 5/3 spline wavelet, both the and the step are similar to MC techniques in the generalized B slices [11][ 12] of H.264/AVC. Furthermore, the open-loop structure of a temporal subband representation offers the possibility to efficiently incorporate SNR and spatial scalability. SNR scalability is basically achieved by residual quantization with very little changes to H.264/AVC, and it is also being extended to fine granular SNR scalability. For spatial scalability, a combination of MCTF and over-sampled pyramid decomposition is proposed, which requires some additional mechanisms to convey bit rate from lower resolution to higher resolution layers. However, for each layer, the macroblock-based structure of H.264/AVC can be maintained as will be shown. Because of the similarities in MC, the approach to temporal scalability of H.264/AVC is maintained. Motivated by these facts, we have investigated the possibility of a simple but yet efficient extension of H.264/AVC hybrid video coding towards MCTF and scalability. The next section describes the MCTF approach. Sec. 3 shows how H.264/AVC is extended towards MCTF and Sec. 4 describes the scalability extensions. Experimental results are provided in Sec. 5. Example applications to video transmission and video surveillance are described in Sec. 6. * {schaefer, hschwarz, marpe, schierl, wiegand}@hhi.fraunhofer.de

2 2. MOTION-COMPENSATED TEMPORAL FILTERING In this section, we briefly review MCTF for the case of a 2-tap filter. The generic lifting scheme consists of three steps: polyphase operation,, and. Figure 1 shows a two-channel filter bank with "P" representing the step and "U" representing the step. Figure 1: Lifting representation of an analysis-synthesis filter bank. The input signal S k to the analysis filter bank corresponds to a video picture sampled at time instant k. The polyphase decomposition splits the set of pictures into two sets of pictures with even (2k) and odd indices (2k+1). The picture S 2k+1 is predicted using MC, i.e., spatial shift alignment of picture S 2k towards S 2k+1 yielding P(S 2k ) and the residual H k = S 2k+1 - P(S 2k ) (1) The difference between S 2k+1 and P(S 2k ) is then again motion-compensated, i.e., spatially shift-aligned towards S 2k and divided by 2 yielding U(S 2k+1 - P(S 2k )). When the two MC operators in P( ) and U( ) are linear and invertible against each other such that U(P(s)) = s/2, then the following applies L k = S 2k + U(S 2k+1 - P(S 2k )) = ½ S 2k + U(S 2k+1 ) (2) If the MC operators in P(s) and U(s) do not incur a spatial displacement of s, the signals H k and L k represent high-pass and low-pass bands, respectively, in a known way. Otherwise, H k and L k can be viewed as high-pass and low-pass bands, respectively, but with MC that spatially aligns S 2k and S 2k+1 towards each other. It is easy to see that if H k and L k are not changed (quantized) that the operation of the synthesis filter bank inverts the,, and polyphase decomposition steps and S k is perfectly reconstructed. It should be noted that, when the step is removed, the presented structure is the same as an open-loop version of a hybrid video codec. 3. MCTF EXTENSION OF H.264/AVC H.264/AVC is a hybrid video codec specifying for macroblocks either motion-compensated [13] or intra [3]. Both s are followed by residual coding [14][4]. When lifting is used to implement an MCTF, the and steps are separate mechanisms. Hence, we use the MC of H.264/AVC for the step and a similar technique for the step which is derived from the step. Thus the step also consists of block-based H.264/AVC MC, but with a bit-depth expansion by 1 compared to the step. 3.1 Motion Compensation in H.264/AVC One of the reasons for the improved coding efficiency of H.264/AVC [3] compared to previous standards [15][16][17] is because it permits variable block size MC and multiple reference pictures for MC [18][19]. H.264/AVC specifies block sizes that are signaled through macroblock types for block sizes 16x16, 16x8, 8x16, and 8x8 luma samples. When macroblock type specifies 8x8 blocks, each of these can be split again to 8x4, 4x8, or 4x4 blocks through the submacroblock type. For all blocks one or two motion vectors per block can be signaled for MC, corresponding to predictive or bi-predictive MC, respectively. For all blocks smaller than 8x8 samples, the reference picture applies that is chosen for the 8x8 block that contains them. All other blocks can freely choose between the reference pictures and signal the reference picture index together with each motion vector.

3 3.2 Extension Towards MCTF Using the Update Step We now explain how the in H.264/AVC is combined with the corresponding step. For that, we apply a notation for video samples where s[l, k] is a video sample at spatial location l = (x, y) at time instant k. Let the locations within a block B be noted as l B. The and operators for the temporal decomposition using the lifting representation of the Haar wavelet for MCTF and block B are given by P U ( s[ l, 2k] ) = s[ l + m P0, 2k 2rP0 ], l B ( h[ l k] ) = 1 h[ l + m, k + r ], l B Haar (3) Haar, 2 U0 U0 (4) This Haar wavelet corresponds in the step exactly to predictive coding in H.264/AVC using the motion vector m P0 and the reference picture index r P0. The step also consists of block-based MC, but with a bit-depth expansion by 1 compared to the step. The algorithm for the derivation of the motion vector m U0 and the reference picture index r U0 is given below. For the 5/3 wavelet, the and operators are given by P 1 ( [ l, 2k] ) = ( s[ l + m P0,2k 2rP0 ] + s[ l + m P1, 2k P1]), l B 2 1 ( [ l k] ) = ( h[ l + m, k + r ] + h[ l + m, k 1 ]), l B s (5) 5 / 3 r U 5/ 3, 4 U0 U0 U1 ru1 h (6) Again, the step is exactly the same as bi-predictive MC in H.264/AVC. Note that the utilizes two lists of indices to reference pictures. These lists are named list 0 (m P0 and r P0 ) and list 1 (m P1 and r P1 ) and may contain the same or different reference pictures. The derivation of motion vectors and reference picture indices in the step works as follows. The design goals of the algorithm are to derive a set of H.264/AVC motion vectors and reference picture indices for the step that can be the input to the H.264/AVC motion compensation process without having to change it while achieving highest coding efficiency. The algorithm determines for each 4x4 luma block B 4x4 in the picture U(H k ) the motion vectors and reference picture indices. For each block B 4x4, all motion vectors m P0 and m P1 are evaluated that point into this block. Those m P0 and m P1 are selected that use the maximum number of samples as a reference out of the block B 4x4 and the motion vectors are given as m U0 = m P0 and m U1 = m P1. The reference indices r U0 and r U1 are specifying those pictures into which MC is conducted using m P0 and m P1, respectively. When no motion vectors m P0 exist that point into B 4x4 or when not more than ¾ of the samples of B 4x4 are used as reference for MC using any m P0, the step using m U0 is omitted for B 4x4. The same conditions apply to m P1 and m U1 as well. After processing all blocks B 4x4, the determined m U0, m U1, r U0, and r U1 are formatted into H.264/AVC syntax and syntax limitations are applied if necessary. Please note that the above algorithm is simultaneously applied at coder and decoder so that each step is exactly specified. No side information is transmitted for the step. The H.264/AVC de-blocking filter [21] is applied to the low-pass pictures that are reconstructed in the steps. 3.3 Intra Coding in MCTF When MC does not work, e.g. for scene cuts or uncovered background, the incorporation of intra coding modes increases coding efficiency. For the intra macroblock, the corresponding or step is skipped and the original macroblock samples are placed into the high-pass pictures H k and coded using the intra coding tools of H.264/AVC. Note, that these intra samples are set to zero before they are used for MC in the steps. 3.4 Temporal Coding Structure The temporal coding structure of MCTF is changed relative to hybrid video coding in that not only high-pass pictures H k ( residuals) are resulting from the step but also low-pass pictures L k are resulting from the step. Typically, a group of N 0 input pictures is partitioned into two sets of pictures with one set containing N A (0 < N A < N 0 ) input pictures and the other set containing N B = N 0 N A input pictures. The pictures of the first set are labeled as pictures A k and the pictures of the second set are labeled as pictures B k. The decomposition is performed in a way that the high-pass pictures H k are spatially shift-aligned with pictures B k and the low-pass pictures L k are spatially shift-aligned with pictures A k. Note, that for generating the high-pass pictures in the step, only the input pictures A k can be used as reference pictures for predicting an input picture B k. Figure 2 provides examples for temporal decompositions.

4 (a) (b) (c) Figure 2: Temporal decomposition of input pictures into low and high-pass pictures: a) N A =N B, b) N A =2N B, c) 2N A =N B. For groups of N 0 > 2 pictures it is in general advantageous to apply a multi-channel decomposition instead of a twochannel decomposition. Therefore, the presented two-channel decomposition is iteratively applied to the set the lowpass pictures until a single low-pass picture is obtained or a given number of decomposition stages is performed. In Figure 3(a), a general dyadic temporal decomposition for a group of 16 pictures (GOP) is illustrated. The picture 0 refers to the last (low-pass) picture of the previous GOP, which is used as additional reference picture in the steps, but not altered in the steps. For clarity, only the and steps using directly neighboring reference pictures are illustrated. coding order H H H H H H H H L L L L L L L L L H H H H L L L L L H L L L L H group of pictures (GOP) H L coding order H H H H H H H H L L L L L L L L L H H H H L L L L L GOP border sub-partition border partition border sub-partition border GOP border (a) (b) Figure 3: Dyadic temporal decomposition of a group of 16 pictures: a) without delay constraints, b) with a maximum structural encoding-decoding delay of 4 pictures at the highest temporal resolution. H L L L L Structural delay of 4 pictures C 0 = 3 C 0 = 3 G 0 = 8 G 0 = 8 The delay introduced by each decomposition stage is coupled with the use of reference pictures that are displayed later than the predicted or d picture. Therefore, if the reference pictures for the step are in the past relative to the predicted picture and the step is omitted, no additional delay is introduced. This allows controlling the delay for large groups of pictures or in other words, the low-pass pictures of groups of pictures which correspond to the maximum tolerated delay can be transmitted using motion-compensated as in hybrid video coding. For that, in the steps, the low-pass picture of the previous GOP that is obtained after performing all n decomposition stages is used as additional reference picture for motion-compensated of the current group of pictures. The motion-compensated is only performed inside the GOP. Figure 3(b) shows an example for the dyadic decomposition of a group of 16 pictures with a maximum structural encoding-decoding delay of 4 pictures at the highest temporal resolution. In order to meet the delay constraints, the GOP is partitioned into subgroups, and neither backward steps nor steps are allowed across the partition boundaries. For more details, the reader is referred to [6]. 3.5 Impact on H.264/AVC Syntax, Decoding, and Encoding With the exception of some high-level syntax elements for arbitrarily controlling the reference list for the steps, the syntax of H.264/AVC is not affected by the MCTF extension since all data for the step are derived from data that are already present in the bit-stream. Moreover, the concept of generalized B pictures in H.264/AVC allows the H H L

5 hierarchical temporal decomposition but without the step. Later, we will refer to this concept as hierarchical B pictures. The decoding process of H.264/AVC needs to be extended for the step by the following: The derivation process for the motion vectors in the step must be specified. The motion compensation for the step requires a bit-depth expansion by one bit. In theory this bit-depth expansion happens with every level of the decomposition hierarchy. However, we have found that disallowing any further bit-depth expansion by clipping the low-pass pictures after the steps does not affect performance. We have employed Lagrangian methods for the encoding process similar to those in H.264/AVC [20]. However, the open-loop characteristic of the analysis filter bank and the hierarchical temporal decomposition make a straightforward re-use of these techniques difficult. Due to the steps, the encoding process needs to be operated in reverse order of the decoding process. The Lagrangian costs used for determining the coding modes are based on original reference pictures and do thus only present an even less accurate estimate of the actual costs compared to the approach in [20]. These normalization factors of the analysis-synthesis filter bank are taken into account during quantization. In general, the low-pass pictures are coded with the highest fidelity, since they are employed for motion-compensated of all other pictures. The quantization parameter differences from one decomposition level to the next are determined based on the number of samples for which, bi-, or intra coding is employed. 4. SCALABILITY EXTENSION OF H.264/AVC 4.1 Temporal Scalability The temporal decomposition as described in Sec. 3.4 permits temporal scalability in a similar way as in hybrid video coding. The scalability is achieved by removing those bit-stream parts that correspond to pictures that are not reference pictures for the remaining pictures. 4.2 SNR Scalability For the SNR base layer, H.264/AVC-compatible transform coding is used. The high-pass pictures contain intra or residual macroblocks as in hybrid video coding. For the residual macroblocks, the coding as in H.264/AVC including transformation and quantization is employed. The intra macroblocks are coded using the intra coding modes of H.264/AVC. For each macroblock, the coded block pattern (CBP), and conditioned on CBP the corresponding residual blocks are transmitted together with the macroblocks modes, intra modes, reference picture indices and motion vectors using the B or P slice syntax of H.264/AVC. Low-pass pictures are either coded independently of each other as H.264/AVC intra pictures or are inter coded as H.264/AVC inter pictures. On top of the SNR base layer, SNR enhancement layers are coded. For that, the quantization error between the SNR base layer and the original subband pictures is re-quantized exactly using the same methods as for the base layer but with a finer quantization step size, i.e., a lower value of the quantization parameter. This enhancement layer together with the base layer can be considered to be the base layer for another enhancement layer and the same methods can then be applied again. The reasons why this simple approach to SNR scalability shows good coding efficiency in case enhancement layers are removed is the open-loop encoding method and the temporal decomposition as presented above. In a simple version, the transform coefficient levels of the SNR enhancement layers are transmitted using the residual syntax of H.264/AVC. With this approach only coarse grains of scalable SNR layers can be efficiently represented as factors of 2 in bit-rate. In order to support fine granular SNR scalability, we have introduced so-called progressive refinement slices (for details, see [1][5][6]). Each NAL unit for a progressive refinement slice represents a refinement signals that corresponds to a bisection of the quantization step size. These signals are represented in a way that only a single inverse transform has to be performed for each transform block at the decoder side. The progressive refinement NAL units can be truncated at any arbitrary point, so that the quality of the SNR base layer can be improved in a fine granular way. Therefore, the coding order of transform coefficient levels has been modified. Instead of scanning the transform coefficients macroblock by macroblock as it is done in normal slices, the transform coefficient blocks are scanned in several paths, and in each path only a few coding symbols for a transform coefficient block are coded. With exception of the modified coding order, the CABAC entropy coding [22] as specified in H.264/AVC is re-used.

6 4.3 Spatial Scalability We consider spatial scalable coding of video at multiple resolutions (e.g. QCIF, CIF, and 4CIF) with a factor of 2 in horizontal and vertical resolution. We have represented the video signal using an oversampled pyramid and code the various spatial resolutions independently of each other. From this experiment we have found that it clearly depends on the chosen bit rates in conjunction with the sequence characteristics to what extent the coding efficiency of a spatial layer (e.g. the 4CIF layer) is affected by the presence of additional lower spatial resolution layers (e.g. QCIF and CIF layers). We have also found that it would be efficient to allow the encoder to freely choose which dependencies between the spatial resolution layers need to be exploited through switchable mechanisms. For that, the following techniques turned out to provide gains and are described below: Prediction of a macroblock using the up-sampled lower resolution signal Prediction of motion vectors using the up-sampled lower resolution motion vectors Prediction of the residual signal using the up-sampled residual signal of the lower resolution layer For of motion vectors using the up-sampled lower resolution signal, we introduced two additional macroblock modes that utilize motion information of the lower resolution layer. The macroblock partitioning is obtained by up-sampling the partitioning of the corresponding 8x8 block of the lower resolution layer. For the obtained macroblock partitions, the same reference picture indices as for the corresponding sub-macroblock partition of the base layer block are used; and the associated motion vectors are scaled by a factor of 2. While for the first of these macroblock modes no additional motion information is coded, for the second one, a quarter-sample motion vector refinement is transmitted for each motion vector. Additionally, our approach includes the possibility to use a scaled motion vector of the lower resolution as motion vector predictor for the conventional motion-compensated macroblock modes. A flag that is transmitted with each motion vector difference indicates whether the motion vector predictor is build by conventional spatial or by the corresponding scaled base layer motion vector. In order to also incorporate the possibility of exploiting the residual information coded in the lower resolution layer, an additional flag is transmitted for each macroblock, which signals the application of residual signal from the lower resolution layer. If the flag is true, the base layer residual signals is block-wise up-sampled using a bi-linear filter with constant border extension and used as for the residual signal of the current layer, so that only the corresponding difference signal is coded. In order to enable the inter-layer of low-pass signals, we introduced an additional intra macroblock mode. In that coding mode, the signal is generated by up-sampling the reconstruction signal of the lower resolution layer using the 6-tap filter which is defined in H.264/AVC for the purpose of half-sample interpolation. The residual is transmitted using the H.264/AVC residual coding. For this process it is required in general that the base layer signal is decoded by the computationally complex operations of inverse MCTF and de-blocking. We have found that this problem can be circumvented by restricting the from up-sampled decoded pictures to those parts of the lower layer picture that are coded with intra macroblocks. For that, the intra signal is directly obtained by de-blocking and up-sampling the corresponding 8x8 luma block inside the corresponding lower layer highpass picture. With the proposed changes, the decoding complexity is significantly reduced, since the inverse MCTF is only required for the spatial layer that is actually decoded. Note, that all described inter-layer techniques can also be applied when the base layer has the same spatial resolution as the current layer, i.e., for coarse grain SNR scalability with optimized motion parameters for each SNR layer. In this case, the up-sampling operations are simply discarded. 4.4 Combined Scalability The basic coding scheme for achieving a wide range of combined spatio-temporal and quality scalability can be classified as layered video codec. The coding structure depends on the scalability space that is required by the application. In Figure 4, a block diagram for a typical scenario with 2 spatial layers is depicted. In each layer, an independent MCTF with layer-specific motion parameters is employed. This hierarchical structure provides a temporal scalable representation of a sequence of input pictures that is also suitable for efficiently

7 incorporating spatial and quality scalability. The redundancy between different layers is exploited by different interlayer concepts that include mechanisms for motion parameters as well as texture data. A base representation of the input pictures of each layer is obtained by transform coding similar to that of H.264/AVC, the corresponding NAL units (NAL Network Abstraction Layer) contain motion information and texture data. These base representations determine the minimum bit-rate at which a spatio-temporal resolution can be decoded. The NAL units of the base representation of the lowest layer are compatible with standard H.264/AVC. The reconstruction quality of a layer can be improved by an additional coding of so-called progressive refinement slices. These NAL units represent refinements of the texture data (intra and residual data). Since the coding symbols of progressive refinement slices are ordered similar to a coarse-to-fine description, these NAL units can be arbitrarily truncated in order to support fine granular quality scalability (FGS) or flexible bit-rate adaptation. Enhancement layer encoder (spatial enhancement layer) Progressive refinement texture coding Input video Spat ial decimation Hierarchical motion-compensated texture motion Inter-layer motion and texture Base layer coding Progressive refinement texture coding Multiplex AVC compatible base layer bit-stream Scalable bit-stream Hierarchical motion-compensated texture motion Base layer coding AVC compatible encoder Figure 4: Codec structure example for the scalable extension of H.264/AVC. Bit-streams for a reduced spatial and/or temporal resolution can be simply obtained by discarding NAL units (or network packets) from a global bit-stream that are not needed for decoding the spatio-temporal target resolution. NAL units that correspond to progressive refinement slices can also be arbitrarily truncated in order to further reduce the bitrate and the associated reconstruction quality 4.5 Impact on H.264/AVC Syntax, Decoding, Encoding Scalability requires some high-level syntax support to allow the efficient removal of parts of the bit-stream. The packetbased access unit and NAL unit concept of H.264/AVC is well suited to provide such support. We have introduced additional NAL unit types to indicate the presence of enhancement layers such as an SNR or spatial enhancement layer. Moreover, enhancement slice types need to be specified to facilitate the inter-layer allowing the inclusion of data from one or more lower layers into the current layer such as reconstructed samples, motion vectors, or decoded residual samples. For the decoding of SNR enhancement layers, an additional process for importing motion data as well as intra and residual signals of the subordinate layer is required. The decoding process for spatial enhancement layers needs to be extended by a process for importing and up-sampling of the reconstructed lower resolution signal, a process for importing and up-sampling of residual signals of the lower resolution layer, a process for importing and scaling motion information of the lower resolution layer. In addition, the parsing process needs to be adapted to the modified and newly introduced syntax elements, and the motion vector decoding needs to be extended by the motion vector from the base layer. The encoding process must consider the entire range of rate-distortion points that the decoder may choose to decode. The encoder optimization therefore needs to carefully trade-off mainly motion data and residuals. Since several SNR layers of a spatio-temporal resolution employ a single motion vector field, the trade-off between motion and residual data needs to be adjusted for the entire supported bit-rate range.

8 5. EXPERIMENTAL RESULTS 5.1 Results for the MCTF Extension For evaluating the coding efficiency of the both single-layer MCTF extension and hierarchical B pictures, we compared it to a closed-loop H.264/AVC coder using a similar degree of encoder optimizations [12]. In Figure 5, diagrams with rate distortion curves for the sequences Mobile and Football are depicted. For the H.264/AVC reference, only the first picture is encoded as IDR picture, all following pictures are coded as P and B pictures. Five reference pictures are used, and the rate-distortion curves have been obtained by varying the quantization parameter (QP), where the QP for B pictures was increased by 2 in comparison to the QP for I and P pictures. The GOP size of the MCTF extension was set to 32 pictures and two reference pictures have been used. CABAC was used as entropy coding method for all encoders. (a) Mobile CIF 30Hz H.264/AVC MCTF extension Hierarchical B pictures bit rate [kbit/s] (b) Football CIF 30Hz H.264/AVC MCTF extension Hierarchical B pictures bit rate [kbit/s] Figure 5: Coding efficiency of the MCTF extension in comparison to a closed-loop H.264/AVC coder for the sequences Mobile (a) and Football (b). For the Mobile sequence, which shows smooth motion, the coding efficiency in comparison H.264/AVC with IBBPBBP... coding is improved by both the MCTF extension and hierarchical B pictures. For the latter, the coding gains are the result of the changed temporal decomposition structure together with the cascading of quantization parameters. The usage of the step further increases the coding efficiency and reduces the PSNR fluctuations inside a group of pictures as illustrated in Figure 6. The Football sequence is characterized by strong local motion. For this sequence, the coding efficiency of both the MCTF extension and the hierarchical B picture approach is similar to that of the H.264/AVC reference with the IPPPP temporal coding structure. Mobile CIF 384 kbit/s MCTF extension Hierarchical B pictures frame number Figure 6: PSNR fluctuations of both the MCTF extension and hierarchical B pictures for Mobile coded at 384 kbit/s.

9 5.2 Results for the Scalability Extension In Figure 7, the coding efficiency of the coarse grain SNR-scalable MCTF extension (without the usage of progressive refinement slices) is compared to the coding efficiency of the single-layer MCTF extensions and the H.264/AVC references. While all rate-distortion points for the H.264/AVC reference and the single layer MCTF coder represent different bit-streams, all rate-distortion points for the scalable codec have been obtained by selecting and decoding NAL units of a single scalable bit-stream Mobile CIF 30Hz H.264/AVC MCTF extension SNR-scalable MCTF extension bit rate [kbit/s] Football CIF 30Hz H.264/AVC MCTF extension SNR-scalable MCTF extension bit rate [kbit/s] (a) (b) Figure 7: Coding efficiency of the SNR-scalable (coarse grain) MCTF extension for the sequences Mobile (a) and Football (b). As it can be seen in the rate-distortion plots, the coding efficiency of the SNR-scalable MCTF extension is 0.2 to 1.6 db worse than that of the single-layer version. This coding efficiency loss is related to the fact that for the SNR-scalable codec, a single motion field is used for all rate points, while for the single layer version, the trade-off between motion and residual data is optimized for each bit-rate point. This also explains why the PSNR losses for the Football sequence are larger, since this sequence is characterized by strong local motion. Sequence: Crew Sequence: Soccer QCIF15 CIF7.5 CIF15 CIF30 4CIF15 4CIF30 4CIF60 Layered Coding H.264/AVC High Profile (a) bit-rate [kbit/s] (b) bit-rate [kbit/s] Figure 8: Coding efficiency of the spatio-temporal-snr scalable MCTF extension for the sequences Crew (a) and Soccer (b). QCIF15 CIF7.5 CIF15 CIF30 4CIF15 4CIF30 4CIF60 Layered Coding H.264/AVC High Profile The coding efficiency of the MCTF extension for the support of a large degree of combined spatio-temporal-snr scalability was evaluated for the sequences Crew and Soccer and the results are depicted in Figure 8. For this experiment, the progressive refinement slices have been used for providing an entire range of extractable bit-rates for each supported spatio-temporal resolution. The diagrams contain also the case of layered coding, in which only 6 spatiotemporal-snr points are provided, and no fine grain SNR scalability is supported. For further information, results obtained by H.264/AVC High Profile have been additionally plotted in the diagrams (green squares). These results are provided for the spatial and temporal resolutions and bit rates that are supported by the layered coding. However, in contrast to the combined scalability and layered coding cases, the classical IBBPBBP... coding temporal structure is

10 used and some of the differences can be accounted to that. Additionally, the green squares case is also non-scalable. Note that the scalability extension of H.264/AVC also builds on top of High Profile and the optimization approach and the other parameters including motion search range are similar for all three cases following [20]. While for the sequence Soccer the results of layered coding and H.264/AVC are similar, for the Crew sequence, H.264/AVC High Profile provides additional improvements indicating that further work is needed towards an efficient scalable representation. 6. APPLICATIONS OF SVC 6.1 Video Transmission The hierarchical scalable video coding approach in general allows the split of the bit-stream into a base layer and various enhancement layers. That allows the transmission of these partial bit-streams via different channels or in different network streams. In IP networks the different quality classes can be assigned to one or even more consumed network streams, thus a possible billing depending on network streams is guaranteed. Such network types could be DVB-H or MBMS of 3GPP, with different terminal capability classes. The layered coding approach has further benefits. If the network supports transmission of certain network streams to certain devices only, not even all streams have to be received by all terminals. In MBMS or DVB-H networks it should be possible to send out, e.g., the base layer of a scalable H.264/AVC stream to the low performance device only and guarantee that all layers of the stream reach the high performance terminal only. Such an example is shown in Figure 9. By using the scalable H.264/AVC stream instead of simulcasting, the backbone could be drastically eased. Figure 9: Scalable H.264/AVC video transmission in wireless broadcast networks. Another benefit can be achieved, if the protection of the scalable H.264/AVC layers is treated in different ways. Forward error correction (FEC) is often used to protect data sent out via wireless broadcast channels. Unequal error or erasure protection (UEP/UXP) schemes [23] can be used to ensure an error free transmission of important layers like, e.g., the base layer of a scalable H.264/AVC stream. UEP/UXP can be used on top of the already existing channel FEC. Such a scheme can guarantee a basic video quality over a large range of channel error rates. Such a scenario is shown in Figure 10. Figure 10: Unequal erasure protection with scalable H.264/AVC quality layers. 6.2 Video Surveillance A very promising application of scalable coding is video surveillance. Typically videos from many cameras have to be stored and viewed on diverse displays, which may have different spatial and temporal resolutions (Figure 11). Examples are split screen display of many scenes on one monitor or viewing of scenes from dedicated cameras on mobile devices

11 such as video phones or PDAs. For such applications scalable coding is most attractive, because no transcoding or format conversion is required. Decoding of the lower resolutions videos for split-screen display also saves computing power, therefore many videos can be decoded and displayed with the computing power required for one full resolution video. Figure 11: General networked surveillance scenario. In general, the requirements for surveillance applications can be summarized as follows: Simultaneous transmission and storage of different spatial and temporal resolutions in one bit stream, in order to feed different displays Fine granularity scalability for feeding different transmission links of varying capacity Decoding of lower resolutions should be less complex for split screen display Multiple adaptation of the bit stream should be possible for erosion storage The latter requirement arises from the necessity to store a huge amount of data delivered by the surveillance cameras. By using scalable coding it becomes possible to delete higher resolution layers of stored scenes after certain expiration times and to keep just a lower resolution copy for the archive. This allows a much more flexible usage of storage capacity without the necessity for re-encoding and copying. Typically the full resolution video is kept for 1-3 days, a medium quality (reduced temporal or spatial resolution) video is kept for one week and a low quality (reduced temporal and spatial resolution) video is kept for long time archiving. This functionality results in the following requirements for a file format, which are mostly fulfilled by the MPEG-4 file format [24]: random access to the different layers must be possible without parsing the bit stream an appropriate hierarchical storage structure a hinting track (e.g. RTP) to support streaming applications The prototype of a bit-stream splitter in connection with a real time decoder based on the current reference model for SVC has been developed at the HHI. This prototype allows real time extraction and decoding of partial bit streams from a CIF video. A live demonstration of the system will be given during the presentation of the paper. 7. CONCLUSIONS The single-layer MCTF extension of H.264/AVC does provide for some sequences advantages in coding efficiency up to 0.5 db in terms of objective PSNR measures. Subjectively, a reduction of quality fluctuation is achieved by the step. The extension for coarse grain SNR scalability in conjunction with MCTF is quite efficient in a straightforward way. The methods used in spatial scalability work sequence dependent and are subject to future refinements. Furthermore, fine granular SNR scalability is provided by introducing enhancement NAL units that contain a refinement signal for a subband picture in a coarse-to-fine representation and can be truncated at any arbitrary point. The simulation results show that the coding efficiency of this approach is only slightly worse than that of the layered representation, while it provides a much larger set of decodable spatio-temporal-rate points.

12 ACKNOWLEDGEMENT The authors thank Tobias Hinz and Heiner Kirchhoffer for their help in implementing some features of the coder. REFERENCES 1. H. Schwarz, T. Hinz, H. Kirchhoffer, D. Marpe, and T. Wiegand, "Technical Description of the HHI proposal for SVC CE1," ISO/IEC JTC1/SC29/WG11, Document M11244, Palma de Mallorca, Spain, Oct ISO/IEC JTC1/SC29/WG11, Scalable Video Model 3.0, ISO/IEC JTC1/SC29/WG11, Document N6716, Palma de Mallorca, Spain, Oct ITU-T Recommendation H.264 & ISO/IEC AVC, "Advanced Video Coding for Generic Audiovisual Services," (version 1: 2003, versions 2: 2004) version 3: T. Wiegand, G. J. Sullivan, G. Bjøntegaard, and A. Luthra, "Overview of the H.264/AVC video coding standard," IEEE Trans. CSVT, vol. 13, no. 7, pp , July Joint Video Team of ITU-T VCEG and ISO/IEC MPEG, Scalable Video Coding Working Draft 1, Joint Video Team, Document JVT-N020, Jan Joint Video Team of ITU-T VCEG and ISO/IEC MPEG, Joint Scalable Video Model JSVM0, Joint Video Team, Document JVT-N021, Jan Joint Video Team of ITU-T VCEG and ISO/IEC MPEG, JSVM 0 software, Joint Video Team, Doc. JVT- N022r1, Jan J.-R. Ohm, Complexity and delay analysis of MCTF interframe wavelet structures, ISO/IEC JTC1/SC29/WG11, Document M8520, Jul M. Flierl, Video Coding with Lifted Wavelet Transforms and Frame-Adaptive Motion Compensation, Proc. of VLBV, Sep W. Sweldens, A custom-design construction of bi-orthogonal wavelets, J. Appl. Comp. Harm. Anal., vol. 3, pp , M. Flierl, T. Wiegand, and B. Girod, "A Locally Optimal Design Algorithm for Block-Based Multi-Hypothesis Motion-Compensated Prediction", in Data Compression Conference, Snowbird, USA, Mar M. Flierl and B. Girod. Generalized B Pictures and the Draft JVT/H.264 Video Compression Standard, IEEE Trans. CSVT, vol. 13, pp , Jul T. Wedi, "Motion Compensation in H.264/AVC," IEEE Trans. CSVT, vol. 13, pp , Jul H. Malvar, A. Hallapuro, M. Karczewicz, and L. Kerofsky: Low-Complexity Transform and Quantization in H.264/AVC, IEEE Trans. CSVT, vol. 13, pp , Jul ITU-T and ISO/IEC JTC 1, Generic coding of moving pictures and associated audio information Part 2: Video, ITU-T Recommendation H.262 ISO/IEC (MPEG-2), Nov ITU-T, Video coding for low bit rate communication, ITU-T Recommendation H.263; version 1, Nov. 1995; version 2, Jan. 1998; version 3, Nov ISO/IEC JTC1, Coding of audio-visual objects Part 2: Visual, ISO/IEC (MPEG-4 visual version 1), April 1999; Amendment 1 (version 2), February, 2000; Amendment 4 (streaming profile), January, T. Wiegand, X. Zhang, and B. Girod, Long-Term Memory Motion-Compensated Prediction, IEEE Transactions on Circuits and Systems for Video Technology, vol. 9, pp , Feb T. Wiegand and B. Girod, Multi-frame Motion-Compensated Prediction for Video Transmission, Kluwer Academic Publishers, Sep T. Wiegand, H. Schwarz, A. Joch, F. Kossentini, and G. J. Sullivan: Rate-Constrained Coder Control and Comparison of Video Coding Standards, IEEE Trans. CSVT, vol. 13, pp , Jul P. List, A. Joch, J. Lainema, G. Bjøntegaard, M. Karczewicz: Adaptive Deblocking Filter, IEEE Trans. CSVT, vol. 13, pp , Jul D. Marpe, H. Schwarz, and T. Wiegand: Context-Adaptive Binary Arithmetic Coding in the H.264/AVC Video Compression Standard,, IEEE Trans. CSVT, vol. 13, pp , Jul G. Liebl, M. Wagner, J. Pandel, W. Weng, An RTP Payload Format for Erasure-Resilient Transmission of Progressive Multimedia Streams, Document draft-ietf-avt-uxp-07.txt, IETF, Oct ISO/IEC JTC1/SC 29/WG11 N COR 1 + AMD 1, ISO/IEC :2000(E), Information technology Coding of audio-visual objects Part 1: Systems.

Optimum Quantization Parameters for Mode Decision in Scalable Extension of H.264/AVC Video Codec

Optimum Quantization Parameters for Mode Decision in Scalable Extension of H.264/AVC Video Codec Optimum Quantization Parameters for Mode Decision in Scalable Extension of H.264/AVC Video Codec Seung-Hwan Kim and Yo-Sung Ho Gwangju Institute of Science and Technology (GIST), 1 Oryong-dong Buk-gu,

More information

Fraunhofer Institute for Telecommunications - Heinrich Hertz Institute (HHI)

Fraunhofer Institute for Telecommunications - Heinrich Hertz Institute (HHI) Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG (ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6) 9 th Meeting: 2-5 September 2003, San Diego Document: JVT-I032d1 Filename: JVT-I032d5.doc Title: Status:

More information

(Invited Paper) /$ IEEE

(Invited Paper) /$ IEEE IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 17, NO. 9, SEPTEMBER 2007 1103 Overview of the Scalable Video Coding Extension of the H.264/AVC Standard Heiko Schwarz, Detlev Marpe,

More information

H.264/AVC und MPEG-4 SVC - die nächsten Generationen der Videokompression

H.264/AVC und MPEG-4 SVC - die nächsten Generationen der Videokompression Fraunhofer Institut für Nachrichtentechnik Heinrich-Hertz-Institut Ralf Schäfer schaefer@hhi.de http://bs.hhi.de H.264/AVC und MPEG-4 SVC - die nächsten Generationen der Videokompression Introduction H.264/AVC:

More information

Adaptation of Scalable Video Coding to Packet Loss and its Performance Analysis

Adaptation of Scalable Video Coding to Packet Loss and its Performance Analysis Adaptation of Scalable Video Coding to Packet Loss and its Performance Analysis Euy-Doc Jang *, Jae-Gon Kim *, Truong Thang**,Jung-won Kang** *Korea Aerospace University, 100, Hanggongdae gil, Hwajeon-dong,

More information

Scalable Video Coding

Scalable Video Coding 1 Scalable Video Coding Z. Shahid, M. Chaumont and W. Puech LIRMM / UMR 5506 CNRS / Universite Montpellier II France 1. Introduction With the evolution of Internet to heterogeneous networks both in terms

More information

Scalable Video Coding in H.264/AVC

Scalable Video Coding in H.264/AVC Scalable Video Coding in H.264/AVC 1. Introduction Potentials and Applications 2. Scalability Extension of H.264/AVC 2.1Scalability Operation and High-Level Syntax 2.2Temporal Scalability 2.3SNR/Fidelity/Quality

More information

BANDWIDTH-EFFICIENT ENCODER FRAMEWORK FOR H.264/AVC SCALABLE EXTENSION. Yi-Hau Chen, Tzu-Der Chuang, Yu-Jen Chen, and Liang-Gee Chen

BANDWIDTH-EFFICIENT ENCODER FRAMEWORK FOR H.264/AVC SCALABLE EXTENSION. Yi-Hau Chen, Tzu-Der Chuang, Yu-Jen Chen, and Liang-Gee Chen BANDWIDTH-EFFICIENT ENCODER FRAMEWORK FOR H.264/AVC SCALABLE EXTENSION Yi-Hau Chen, Tzu-Der Chuang, Yu-Jen Chen, and Liang-Gee Chen DSP/IC Design Lab., Graduate Institute of Electronics Engineering, National

More information

Performance Comparison between DWT-based and DCT-based Encoders

Performance Comparison between DWT-based and DCT-based Encoders , pp.83-87 http://dx.doi.org/10.14257/astl.2014.75.19 Performance Comparison between DWT-based and DCT-based Encoders Xin Lu 1 and Xuesong Jin 2 * 1 School of Electronics and Information Engineering, Harbin

More information

ADVANCES IN VIDEO COMPRESSION

ADVANCES IN VIDEO COMPRESSION ADVANCES IN VIDEO COMPRESSION Jens-Rainer Ohm Chair and Institute of Communications Engineering, RWTH Aachen University Melatener Str. 23, 52074 Aachen, Germany phone: + (49) 2-80-27671, fax: + (49) 2-80-22196,

More information

An Improved H.26L Coder Using Lagrangian Coder Control. Summary

An Improved H.26L Coder Using Lagrangian Coder Control. Summary UIT - Secteur de la normalisation des télécommunications ITU - Telecommunication Standardization Sector UIT - Sector de Normalización de las Telecomunicaciones Study Period 2001-2004 Commission d' études

More information

Advances of MPEG Scalable Video Coding Standard

Advances of MPEG Scalable Video Coding Standard Advances of MPEG Scalable Video Coding Standard Wen-Hsiao Peng, Chia-Yang Tsai, Tihao Chiang, and Hsueh-Ming Hang National Chiao-Tung University 1001 Ta-Hsueh Rd., HsinChu 30010, Taiwan pawn@mail.si2lab.org,

More information

ARCHITECTURES OF INCORPORATING MPEG-4 AVC INTO THREE-DIMENSIONAL WAVELET VIDEO CODING

ARCHITECTURES OF INCORPORATING MPEG-4 AVC INTO THREE-DIMENSIONAL WAVELET VIDEO CODING ARCHITECTURES OF INCORPORATING MPEG-4 AVC INTO THREE-DIMENSIONAL WAVELET VIDEO CODING ABSTRACT Xiangyang Ji *1, Jizheng Xu 2, Debin Zhao 1, Feng Wu 2 1 Institute of Computing Technology, Chinese Academy

More information

Smoooth Streaming over wireless Networks Sreya Chakraborty Final Report EE-5359 under the guidance of Dr. K.R.Rao

Smoooth Streaming over wireless Networks Sreya Chakraborty Final Report EE-5359 under the guidance of Dr. K.R.Rao Smoooth Streaming over wireless Networks Sreya Chakraborty Final Report EE-5359 under the guidance of Dr. K.R.Rao 28th April 2011 LIST OF ACRONYMS AND ABBREVIATIONS AVC: Advanced Video Coding DVD: Digital

More information

Fast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE Gaurav Hansda

Fast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE Gaurav Hansda Fast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE 5359 Gaurav Hansda 1000721849 gaurav.hansda@mavs.uta.edu Outline Introduction to H.264 Current algorithms for

More information

A COST-EFFICIENT RESIDUAL PREDICTION VLSI ARCHITECTURE FOR H.264/AVC SCALABLE EXTENSION

A COST-EFFICIENT RESIDUAL PREDICTION VLSI ARCHITECTURE FOR H.264/AVC SCALABLE EXTENSION A COST-EFFICIENT RESIDUAL PREDICTION VLSI ARCHITECTURE FOR H.264/AVC SCALABLE EXTENSION Yi-Hau Chen, Tzu-Der Chuang, Chuan-Yung Tsai, Yu-Jen Chen, and Liang-Gee Chen DSP/IC Design Lab., Graduate Institute

More information

International Journal of Emerging Technology and Advanced Engineering Website: (ISSN , Volume 2, Issue 4, April 2012)

International Journal of Emerging Technology and Advanced Engineering Website:   (ISSN , Volume 2, Issue 4, April 2012) A Technical Analysis Towards Digital Video Compression Rutika Joshi 1, Rajesh Rai 2, Rajesh Nema 3 1 Student, Electronics and Communication Department, NIIST College, Bhopal, 2,3 Prof., Electronics and

More information

EE Low Complexity H.264 encoder for mobile applications

EE Low Complexity H.264 encoder for mobile applications EE 5359 Low Complexity H.264 encoder for mobile applications Thejaswini Purushotham Student I.D.: 1000-616 811 Date: February 18,2010 Objective The objective of the project is to implement a low-complexity

More information

Upcoming Video Standards. Madhukar Budagavi, Ph.D. DSPS R&D Center, Dallas Texas Instruments Inc.

Upcoming Video Standards. Madhukar Budagavi, Ph.D. DSPS R&D Center, Dallas Texas Instruments Inc. Upcoming Video Standards Madhukar Budagavi, Ph.D. DSPS R&D Center, Dallas Texas Instruments Inc. Outline Brief history of Video Coding standards Scalable Video Coding (SVC) standard Multiview Video Coding

More information

Advanced Video Coding: The new H.264 video compression standard

Advanced Video Coding: The new H.264 video compression standard Advanced Video Coding: The new H.264 video compression standard August 2003 1. Introduction Video compression ( video coding ), the process of compressing moving images to save storage space and transmission

More information

SCALABLE HYBRID VIDEO CODERS WITH DOUBLE MOTION COMPENSATION

SCALABLE HYBRID VIDEO CODERS WITH DOUBLE MOTION COMPENSATION SCALABLE HYBRID VIDEO CODERS WITH DOUBLE MOTION COMPENSATION Marek Domański, Łukasz Błaszak, Sławomir Maćkowiak, Adam Łuczak Poznań University of Technology, Institute of Electronics and Telecommunications,

More information

Department of Electrical Engineering, IIT Bombay.

Department of Electrical Engineering, IIT Bombay. Scalable Video Coding Prof V M Gadre Prof. V. M. Gadre Department of Electrical Engineering, IIT Bombay. Scalable Video Coding Video streaming over internet is gaining g more and more popularity due to

More information

Wavelet-Based Video Compression Using Long-Term Memory Motion-Compensated Prediction and Context-Based Adaptive Arithmetic Coding

Wavelet-Based Video Compression Using Long-Term Memory Motion-Compensated Prediction and Context-Based Adaptive Arithmetic Coding Wavelet-Based Video Compression Using Long-Term Memory Motion-Compensated Prediction and Context-Based Adaptive Arithmetic Coding Detlev Marpe 1, Thomas Wiegand 1, and Hans L. Cycon 2 1 Image Processing

More information

CONTENT ADAPTIVE COMPLEXITY REDUCTION SCHEME FOR QUALITY/FIDELITY SCALABLE HEVC

CONTENT ADAPTIVE COMPLEXITY REDUCTION SCHEME FOR QUALITY/FIDELITY SCALABLE HEVC CONTENT ADAPTIVE COMPLEXITY REDUCTION SCHEME FOR QUALITY/FIDELITY SCALABLE HEVC Hamid Reza Tohidypour, Mahsa T. Pourazad 1,2, and Panos Nasiopoulos 1 1 Department of Electrical & Computer Engineering,

More information

Recent, Current and Future Developments in Video Coding

Recent, Current and Future Developments in Video Coding Recent, Current and Future Developments in Video Coding Jens-Rainer Ohm Inst. of Commun. Engineering Outline Recent and current activities in MPEG Video and JVT Scalable Video Coding Multiview Video Coding

More information

Reduced Frame Quantization in Video Coding

Reduced Frame Quantization in Video Coding Reduced Frame Quantization in Video Coding Tuukka Toivonen and Janne Heikkilä Machine Vision Group Infotech Oulu and Department of Electrical and Information Engineering P. O. Box 500, FIN-900 University

More information

New Techniques for Improved Video Coding

New Techniques for Improved Video Coding New Techniques for Improved Video Coding Thomas Wiegand Fraunhofer Institute for Telecommunications Heinrich Hertz Institute Berlin, Germany wiegand@hhi.de Outline Inter-frame Encoder Optimization Texture

More information

One-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain

One-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain Author manuscript, published in "International Symposium on Broadband Multimedia Systems and Broadcasting, Bilbao : Spain (2009)" One-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain

More information

ERROR-ROBUST INTER/INTRA MACROBLOCK MODE SELECTION USING ISOLATED REGIONS

ERROR-ROBUST INTER/INTRA MACROBLOCK MODE SELECTION USING ISOLATED REGIONS ERROR-ROBUST INTER/INTRA MACROBLOCK MODE SELECTION USING ISOLATED REGIONS Ye-Kui Wang 1, Miska M. Hannuksela 2 and Moncef Gabbouj 3 1 Tampere International Center for Signal Processing (TICSP), Tampere,

More information

Bit Allocation for Spatial Scalability in H.264/SVC

Bit Allocation for Spatial Scalability in H.264/SVC Bit Allocation for Spatial Scalability in H.264/SVC Jiaying Liu 1, Yongjin Cho 2, Zongming Guo 3, C.-C. Jay Kuo 4 Institute of Computer Science and Technology, Peking University, Beijing, P.R. China 100871

More information

H.264 / AVC (Advanced Video Coding)

H.264 / AVC (Advanced Video Coding) H.264 / AVC (Advanced Video Coding) 2014-2016 Josef Pelikán CGG MFF UK Praha pepca@cgg.mff.cuni.cz http://cgg.mff.cuni.cz/~pepca/ H.264/AVC 2016 Josef Pelikán, http://cgg.mff.cuni.cz/~pepca 1 / 20 Context

More information

Video Coding Using Spatially Varying Transform

Video Coding Using Spatially Varying Transform Video Coding Using Spatially Varying Transform Cixun Zhang 1, Kemal Ugur 2, Jani Lainema 2, and Moncef Gabbouj 1 1 Tampere University of Technology, Tampere, Finland {cixun.zhang,moncef.gabbouj}@tut.fi

More information

High Efficiency Video Coding (HEVC) test model HM vs. HM- 16.6: objective and subjective performance analysis

High Efficiency Video Coding (HEVC) test model HM vs. HM- 16.6: objective and subjective performance analysis High Efficiency Video Coding (HEVC) test model HM-16.12 vs. HM- 16.6: objective and subjective performance analysis ZORAN MILICEVIC (1), ZORAN BOJKOVIC (2) 1 Department of Telecommunication and IT GS of

More information

Unit-level Optimization for SVC Extractor

Unit-level Optimization for SVC Extractor Unit-level Optimization for SVC Extractor Chang-Ming Lee, Chia-Ying Lee, Bo-Yao Huang, and Kang-Chih Chang Department of Communications Engineering National Chung Cheng University Chiayi, Taiwan changminglee@ee.ccu.edu.tw,

More information

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO ISO/IEC JTC1/SC29/WG11 MPEG2011/N12559 February 2012,

More information

IMPROVED CONTEXT-ADAPTIVE ARITHMETIC CODING IN H.264/AVC

IMPROVED CONTEXT-ADAPTIVE ARITHMETIC CODING IN H.264/AVC 17th European Signal Processing Conference (EUSIPCO 2009) Glasgow, Scotland, August 24-28, 2009 IMPROVED CONTEXT-ADAPTIVE ARITHMETIC CODING IN H.264/AVC Damian Karwowski, Marek Domański Poznań University

More information

Coding for the Network: Scalable and Multiple description coding Marco Cagnazzo

Coding for the Network: Scalable and Multiple description coding Marco Cagnazzo Coding for the Network: Scalable and Multiple description coding Marco Cagnazzo Overview Examples and motivations Scalable coding for network transmission Techniques for multiple description coding 2 27/05/2013

More information

Homogeneous Transcoding of HEVC for bit rate reduction

Homogeneous Transcoding of HEVC for bit rate reduction Homogeneous of HEVC for bit rate reduction Ninad Gorey Dept. of Electrical Engineering University of Texas at Arlington Arlington 7619, United States ninad.gorey@mavs.uta.edu Dr. K. R. Rao Fellow, IEEE

More information

FAST SPATIAL LAYER MODE DECISION BASED ON TEMPORAL LEVELS IN H.264/AVC SCALABLE EXTENSION

FAST SPATIAL LAYER MODE DECISION BASED ON TEMPORAL LEVELS IN H.264/AVC SCALABLE EXTENSION FAST SPATIAL LAYER MODE DECISION BASED ON TEMPORAL LEVELS IN H.264/AVC SCALABLE EXTENSION Yen-Chieh Wang( 王彥傑 ), Zong-Yi Chen( 陳宗毅 ), Pao-Chi Chang( 張寶基 ) Dept. of Communication Engineering, National Central

More information

Fine grain scalable video coding using 3D wavelets and active meshes

Fine grain scalable video coding using 3D wavelets and active meshes Fine grain scalable video coding using 3D wavelets and active meshes Nathalie Cammas a,stéphane Pateux b a France Telecom RD,4 rue du Clos Courtel, Cesson-Sévigné, France b IRISA, Campus de Beaulieu, Rennes,

More information

ECE 634: Digital Video Systems Scalable coding: 3/23/17

ECE 634: Digital Video Systems Scalable coding: 3/23/17 ECE 634: Digital Video Systems Scalable coding: 3/23/17 Professor Amy Reibman MSEE 356 reibman@purdue.edu hip://engineering.purdue.edu/~reibman/ece634/index.html Scalability Outline IntroducNon: Heterogeneous

More information

University of Brescia (Italy) Signals & Communications Laboratory SVC CE1: STool - a native spatially scalable approach to SVC.

University of Brescia (Italy) Signals & Communications Laboratory SVC CE1: STool - a native spatially scalable approach to SVC. INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO ISO/IEC JTC1/SC29/WG11 MPEG2004/M11368 October 2004,

More information

Optimizing the Deblocking Algorithm for. H.264 Decoder Implementation

Optimizing the Deblocking Algorithm for. H.264 Decoder Implementation Optimizing the Deblocking Algorithm for H.264 Decoder Implementation Ken Kin-Hung Lam Abstract In the emerging H.264 video coding standard, a deblocking/loop filter is required for improving the visual

More information

OVERVIEW OF IEEE 1857 VIDEO CODING STANDARD

OVERVIEW OF IEEE 1857 VIDEO CODING STANDARD OVERVIEW OF IEEE 1857 VIDEO CODING STANDARD Siwei Ma, Shiqi Wang, Wen Gao {swma,sqwang, wgao}@pku.edu.cn Institute of Digital Media, Peking University ABSTRACT IEEE 1857 is a multi-part standard for multimedia

More information

VIDEO COMPRESSION STANDARDS

VIDEO COMPRESSION STANDARDS VIDEO COMPRESSION STANDARDS Family of standards: the evolution of the coding model state of the art (and implementation technology support): H.261: videoconference x64 (1988) MPEG-1: CD storage (up to

More information

In the name of Allah. the compassionate, the merciful

In the name of Allah. the compassionate, the merciful In the name of Allah the compassionate, the merciful Digital Video Systems S. Kasaei Room: CE 315 Department of Computer Engineering Sharif University of Technology E-Mail: skasaei@sharif.edu Webpage:

More information

Quality Scalable Low Delay Video Coding using Leaky Base Layer Prediction

Quality Scalable Low Delay Video Coding using Leaky Base Layer Prediction Quality Scalale Low Delay Video Coding using Leaky Base Layer Prediction Steffen Kamp and Mathias Wien Institute of Communications Engineering RWTH Aachen University, 52056 Aachen, Germany E-mail: kamp@ient.rwth-aachen.de

More information

AVC VIDEO CODERS WITH SPATIAL AND TEMPORAL SCALABILITY. Marek Domański, Łukasz Błaszak, Sławomir Maćkowiak,

AVC VIDEO CODERS WITH SPATIAL AND TEMPORAL SCALABILITY. Marek Domański, Łukasz Błaszak, Sławomir Maćkowiak, AVC VDEO CODERS WTH SATAL AND TEMORAL SCALABLTY Marek Domański, Łukasz Błaszak, Sławomir Maćkowiak, oznań University of Technology, nstitute of Electronics and Telecommunication, oznań, oland E-mail: {

More information

H.264/AVC BASED NEAR LOSSLESS INTRA CODEC USING LINE-BASED PREDICTION AND MODIFIED CABAC. Jung-Ah Choi, Jin Heo, and Yo-Sung Ho

H.264/AVC BASED NEAR LOSSLESS INTRA CODEC USING LINE-BASED PREDICTION AND MODIFIED CABAC. Jung-Ah Choi, Jin Heo, and Yo-Sung Ho H.264/AVC BASED NEAR LOSSLESS INTRA CODEC USING LINE-BASED PREDICTION AND MODIFIED CABAC Jung-Ah Choi, Jin Heo, and Yo-Sung Ho Gwangju Institute of Science and Technology {jachoi, jinheo, hoyo}@gist.ac.kr

More information

An Efficient Table Prediction Scheme for CAVLC

An Efficient Table Prediction Scheme for CAVLC An Efficient Table Prediction Scheme for CAVLC 1. Introduction Jin Heo 1 Oryong-Dong, Buk-Gu, Gwangju, 0-712, Korea jinheo@gist.ac.kr Kwan-Jung Oh 1 Oryong-Dong, Buk-Gu, Gwangju, 0-712, Korea kjoh81@gist.ac.kr

More information

A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames

A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames Ki-Kit Lai, Yui-Lam Chan, and Wan-Chi Siu Centre for Signal Processing Department of Electronic and Information Engineering

More information

Transcoding from H.264/AVC to High Efficiency Video Coding (HEVC)

Transcoding from H.264/AVC to High Efficiency Video Coding (HEVC) EE5359 PROJECT PROPOSAL Transcoding from H.264/AVC to High Efficiency Video Coding (HEVC) Shantanu Kulkarni UTA ID: 1000789943 Transcoding from H.264/AVC to HEVC Objective: To discuss and implement H.265

More information

On the impact of the GOP size in a temporal H.264/AVC-to-SVC transcoder in Baseline and Main Profile

On the impact of the GOP size in a temporal H.264/AVC-to-SVC transcoder in Baseline and Main Profile biblio.ugent.be The UGent Institutional Repository is the electronic archiving and dissemination platform for all UGent research publications. Ghent University has implemented a mandate stipulating that

More information

Introduction to Video Coding

Introduction to Video Coding Introduction to Video Coding o Motivation & Fundamentals o Principles of Video Coding o Coding Standards Special Thanks to Hans L. Cycon from FHTW Berlin for providing first-hand knowledge and much of

More information

Scalable Extension of HEVC 한종기

Scalable Extension of HEVC 한종기 Scalable Extension of HEVC 한종기 Contents 0. Overview for Scalable Extension of HEVC 1. Requirements and Test Points 2. Coding Gain/Efficiency 3. Complexity 4. System Level Considerations 5. Related Contributions

More information

STACK ROBUST FINE GRANULARITY SCALABLE VIDEO CODING

STACK ROBUST FINE GRANULARITY SCALABLE VIDEO CODING Journal of the Chinese Institute of Engineers, Vol. 29, No. 7, pp. 1203-1214 (2006) 1203 STACK ROBUST FINE GRANULARITY SCALABLE VIDEO CODING Hsiang-Chun Huang and Tihao Chiang* ABSTRACT A novel scalable

More information

Video Compression Standards (II) A/Prof. Jian Zhang

Video Compression Standards (II) A/Prof. Jian Zhang Video Compression Standards (II) A/Prof. Jian Zhang NICTA & CSE UNSW COMP9519 Multimedia Systems S2 2009 jzhang@cse.unsw.edu.au Tutorial 2 : Image/video Coding Techniques Basic Transform coding Tutorial

More information

THE H.264 ADVANCED VIDEO COMPRESSION STANDARD

THE H.264 ADVANCED VIDEO COMPRESSION STANDARD THE H.264 ADVANCED VIDEO COMPRESSION STANDARD Second Edition Iain E. Richardson Vcodex Limited, UK WILEY A John Wiley and Sons, Ltd., Publication About the Author Preface Glossary List of Figures List

More information

Chapter 11.3 MPEG-2. MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications:

Chapter 11.3 MPEG-2. MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications: Chapter 11.3 MPEG-2 MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications: Simple, Main, SNR scalable, Spatially scalable, High, 4:2:2,

More information

JPEG 2000 vs. JPEG in MPEG Encoding

JPEG 2000 vs. JPEG in MPEG Encoding JPEG 2000 vs. JPEG in MPEG Encoding V.G. Ruiz, M.F. López, I. García and E.M.T. Hendrix Dept. Computer Architecture and Electronics University of Almería. 04120 Almería. Spain. E-mail: vruiz@ual.es, mflopez@ace.ual.es,

More information

Sergio Sanz-Rodríguez, Fernando Díaz-de-María, Mehdi Rezaei Low-complexity VBR controller for spatialcgs and temporal scalable video coding

Sergio Sanz-Rodríguez, Fernando Díaz-de-María, Mehdi Rezaei Low-complexity VBR controller for spatialcgs and temporal scalable video coding Sergio Sanz-Rodríguez, Fernando Díaz-de-María, Mehdi Rezaei Low-complexity VBR controller for spatialcgs and temporal scalable video coding Conference obect, Postprint This version is available at http://dx.doi.org/10.14279/depositonce-5786.

More information

Pyramid Coding and Subband Coding

Pyramid Coding and Subband Coding Pyramid Coding and Subband Coding Predictive pyramids Transform pyramids Subband coding Perfect reconstruction filter banks Quadrature mirror filter banks Octave band splitting Transform coding as a special

More information

Rate Distortion Optimization in Video Compression

Rate Distortion Optimization in Video Compression Rate Distortion Optimization in Video Compression Xue Tu Dept. of Electrical and Computer Engineering State University of New York at Stony Brook 1. Introduction From Shannon s classic rate distortion

More information

The Scope of Picture and Video Coding Standardization

The Scope of Picture and Video Coding Standardization H.120 H.261 Video Coding Standards MPEG-1 and MPEG-2/H.262 H.263 MPEG-4 H.264 / MPEG-4 AVC Thomas Wiegand: Digital Image Communication Video Coding Standards 1 The Scope of Picture and Video Coding Standardization

More information

SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC

SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC Randa Atta, Rehab F. Abdel-Kader, and Amera Abd-AlRahem Electrical Engineering Department, Faculty of Engineering, Port

More information

Digital Video Processing

Digital Video Processing Video signal is basically any sequence of time varying images. In a digital video, the picture information is digitized both spatially and temporally and the resultant pixel intensities are quantized.

More information

IBM Research Report. Inter Mode Selection for H.264/AVC Using Time-Efficient Learning-Theoretic Algorithms

IBM Research Report. Inter Mode Selection for H.264/AVC Using Time-Efficient Learning-Theoretic Algorithms RC24748 (W0902-063) February 12, 2009 Electrical Engineering IBM Research Report Inter Mode Selection for H.264/AVC Using Time-Efficient Learning-Theoretic Algorithms Yuri Vatis Institut für Informationsverarbeitung

More information

Week 14. Video Compression. Ref: Fundamentals of Multimedia

Week 14. Video Compression. Ref: Fundamentals of Multimedia Week 14 Video Compression Ref: Fundamentals of Multimedia Last lecture review Prediction from the previous frame is called forward prediction Prediction from the next frame is called forward prediction

More information

FPGA IMPLEMENTATION OF BIT PLANE ENTROPY ENCODER FOR 3 D DWT BASED VIDEO COMPRESSION

FPGA IMPLEMENTATION OF BIT PLANE ENTROPY ENCODER FOR 3 D DWT BASED VIDEO COMPRESSION FPGA IMPLEMENTATION OF BIT PLANE ENTROPY ENCODER FOR 3 D DWT BASED VIDEO COMPRESSION 1 GOPIKA G NAIR, 2 SABI S. 1 M. Tech. Scholar (Embedded Systems), ECE department, SBCE, Pattoor, Kerala, India, Email:

More information

Coding of Coefficients of two-dimensional non-separable Adaptive Wiener Interpolation Filter

Coding of Coefficients of two-dimensional non-separable Adaptive Wiener Interpolation Filter Coding of Coefficients of two-dimensional non-separable Adaptive Wiener Interpolation Filter Y. Vatis, B. Edler, I. Wassermann, D. T. Nguyen and J. Ostermann ABSTRACT Standard video compression techniques

More information

VHDL Implementation of H.264 Video Coding Standard

VHDL Implementation of H.264 Video Coding Standard International Journal of Reconfigurable and Embedded Systems (IJRES) Vol. 1, No. 3, November 2012, pp. 95~102 ISSN: 2089-4864 95 VHDL Implementation of H.264 Video Coding Standard Jignesh Patel*, Haresh

More information

BLOCK MATCHING-BASED MOTION COMPENSATION WITH ARBITRARY ACCURACY USING ADAPTIVE INTERPOLATION FILTERS

BLOCK MATCHING-BASED MOTION COMPENSATION WITH ARBITRARY ACCURACY USING ADAPTIVE INTERPOLATION FILTERS 4th European Signal Processing Conference (EUSIPCO ), Florence, Italy, September 4-8,, copyright by EURASIP BLOCK MATCHING-BASED MOTION COMPENSATION WITH ARBITRARY ACCURACY USING ADAPTIVE INTERPOLATION

More information

Pyramid Coding and Subband Coding

Pyramid Coding and Subband Coding Pyramid Coding and Subband Coding! Predictive pyramids! Transform pyramids! Subband coding! Perfect reconstruction filter banks! Quadrature mirror filter banks! Octave band splitting! Transform coding

More information

Video Quality Analysis for H.264 Based on Human Visual System

Video Quality Analysis for H.264 Based on Human Visual System IOSR Journal of Engineering (IOSRJEN) ISSN (e): 2250-3021 ISSN (p): 2278-8719 Vol. 04 Issue 08 (August. 2014) V4 PP 01-07 www.iosrjen.org Subrahmanyam.Ch 1 Dr.D.Venkata Rao 2 Dr.N.Usha Rani 3 1 (Research

More information

Fast Mode Decision for H.264/AVC Using Mode Prediction

Fast Mode Decision for H.264/AVC Using Mode Prediction Fast Mode Decision for H.264/AVC Using Mode Prediction Song-Hak Ri and Joern Ostermann Institut fuer Informationsverarbeitung, Appelstr 9A, D-30167 Hannover, Germany ri@tnt.uni-hannover.de ostermann@tnt.uni-hannover.de

More information

NEW CAVLC ENCODING ALGORITHM FOR LOSSLESS INTRA CODING IN H.264/AVC. Jin Heo, Seung-Hwan Kim, and Yo-Sung Ho

NEW CAVLC ENCODING ALGORITHM FOR LOSSLESS INTRA CODING IN H.264/AVC. Jin Heo, Seung-Hwan Kim, and Yo-Sung Ho NEW CAVLC ENCODING ALGORITHM FOR LOSSLESS INTRA CODING IN H.264/AVC Jin Heo, Seung-Hwan Kim, and Yo-Sung Ho Gwangju Institute of Science and Technology (GIST) 261 Cheomdan-gwagiro, Buk-gu, Gwangju, 500-712,

More information

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework System Modeling and Implementation of MPEG-4 Encoder under Fine-Granular-Scalability Framework Literature Survey Embedded Software Systems Prof. B. L. Evans by Wei Li and Zhenxun Xiao March 25, 2002 Abstract

More information

A Novel Deblocking Filter Algorithm In H.264 for Real Time Implementation

A Novel Deblocking Filter Algorithm In H.264 for Real Time Implementation 2009 Third International Conference on Multimedia and Ubiquitous Engineering A Novel Deblocking Filter Algorithm In H.264 for Real Time Implementation Yuan Li, Ning Han, Chen Chen Department of Automation,

More information

OPTIMIZATION OF LOW DELAY WAVELET VIDEO CODECS

OPTIMIZATION OF LOW DELAY WAVELET VIDEO CODECS OPTIMIZATION OF LOW DELAY WAVELET VIDEO CODECS Andrzej Popławski, Marek Domański 2 Uniwersity of Zielona Góra, Institute of Computer Engineering and Electronics, Poland 2 Poznań University of Technology,

More information

An Efficient Mode Selection Algorithm for H.264

An Efficient Mode Selection Algorithm for H.264 An Efficient Mode Selection Algorithm for H.64 Lu Lu 1, Wenhan Wu, and Zhou Wei 3 1 South China University of Technology, Institute of Computer Science, Guangzhou 510640, China lul@scut.edu.cn South China

More information

A COMPARISON OF CABAC THROUGHPUT FOR HEVC/H.265 VS. AVC/H.264. Massachusetts Institute of Technology Texas Instruments

A COMPARISON OF CABAC THROUGHPUT FOR HEVC/H.265 VS. AVC/H.264. Massachusetts Institute of Technology Texas Instruments 2013 IEEE Workshop on Signal Processing Systems A COMPARISON OF CABAC THROUGHPUT FOR HEVC/H.265 VS. AVC/H.264 Vivienne Sze, Madhukar Budagavi Massachusetts Institute of Technology Texas Instruments ABSTRACT

More information

ESTIMATION OF THE UTILITIES OF THE NAL UNITS IN H.264/AVC SCALABLE VIDEO BITSTREAMS. Bin Zhang, Mathias Wien and Jens-Rainer Ohm

ESTIMATION OF THE UTILITIES OF THE NAL UNITS IN H.264/AVC SCALABLE VIDEO BITSTREAMS. Bin Zhang, Mathias Wien and Jens-Rainer Ohm 19th European Signal Processing Conference (EUSIPCO 2011) Barcelona, Spain, August 29 - September 2, 2011 ESTIMATION OF THE UTILITIES OF THE NAL UNITS IN H.264/AVC SCALABLE VIDEO BITSTREAMS Bin Zhang,

More information

A macroblock-level analysis on the dynamic behaviour of an H.264 decoder

A macroblock-level analysis on the dynamic behaviour of an H.264 decoder A macroblock-level analysis on the dynamic behaviour of an H.264 decoder Florian H. Seitner, Ralf M. Schreier, Member, IEEE Michael Bleyer, Margrit Gelautz, Member, IEEE Abstract This work targets the

More information

Objective: Introduction: To: Dr. K. R. Rao. From: Kaustubh V. Dhonsale (UTA id: ) Date: 04/24/2012

Objective: Introduction: To: Dr. K. R. Rao. From: Kaustubh V. Dhonsale (UTA id: ) Date: 04/24/2012 To: Dr. K. R. Rao From: Kaustubh V. Dhonsale (UTA id: - 1000699333) Date: 04/24/2012 Subject: EE-5359: Class project interim report Proposed project topic: Overview, implementation and comparison of Audio

More information

EE 5359 Low Complexity H.264 encoder for mobile applications. Thejaswini Purushotham Student I.D.: Date: February 18,2010

EE 5359 Low Complexity H.264 encoder for mobile applications. Thejaswini Purushotham Student I.D.: Date: February 18,2010 EE 5359 Low Complexity H.264 encoder for mobile applications Thejaswini Purushotham Student I.D.: 1000-616 811 Date: February 18,2010 Fig 1: Basic coding structure for H.264 /AVC for a macroblock [1] .The

More information

Deblocking Filter Algorithm with Low Complexity for H.264 Video Coding

Deblocking Filter Algorithm with Low Complexity for H.264 Video Coding Deblocking Filter Algorithm with Low Complexity for H.264 Video Coding Jung-Ah Choi and Yo-Sung Ho Gwangju Institute of Science and Technology (GIST) 261 Cheomdan-gwagiro, Buk-gu, Gwangju, 500-712, Korea

More information

Video Transcoding Architectures and Techniques: An Overview. IEEE Signal Processing Magazine March 2003 Present by Chen-hsiu Huang

Video Transcoding Architectures and Techniques: An Overview. IEEE Signal Processing Magazine March 2003 Present by Chen-hsiu Huang Video Transcoding Architectures and Techniques: An Overview IEEE Signal Processing Magazine March 2003 Present by Chen-hsiu Huang Outline Background & Introduction Bit-rate Reduction Spatial Resolution

More information

Professor, CSE Department, Nirma University, Ahmedabad, India

Professor, CSE Department, Nirma University, Ahmedabad, India Bandwidth Optimization for Real Time Video Streaming Sarthak Trivedi 1, Priyanka Sharma 2 1 M.Tech Scholar, CSE Department, Nirma University, Ahmedabad, India 2 Professor, CSE Department, Nirma University,

More information

Comparative and performance analysis of HEVC and H.264 Intra frame coding and JPEG2000

Comparative and performance analysis of HEVC and H.264 Intra frame coding and JPEG2000 Comparative and performance analysis of HEVC and H.264 Intra frame coding and JPEG2000 EE5359 Multimedia Processing Project Proposal Spring 2013 The University of Texas at Arlington Department of Electrical

More information

Video Compression An Introduction

Video Compression An Introduction Video Compression An Introduction The increasing demand to incorporate video data into telecommunications services, the corporate environment, the entertainment industry, and even at home has made digital

More information

Advanced Encoding Features of the Sencore TXS Transcoder

Advanced Encoding Features of the Sencore TXS Transcoder Advanced Encoding Features of the Sencore TXS Transcoder White Paper November 2011 Page 1 (11) www.sencore.com 1.605.978.4600 Revision 1.0 Document Revision History Date Version Description Author 11/7/2011

More information

Lecture 5: Error Resilience & Scalability

Lecture 5: Error Resilience & Scalability Lecture 5: Error Resilience & Scalability Dr Reji Mathew A/Prof. Jian Zhang NICTA & CSE UNSW COMP9519 Multimedia Systems S 010 jzhang@cse.unsw.edu.au Outline Error Resilience Scalability Including slides

More information

Evaluation of H.264/AVC Coding Elements and New Improved Scalability/Adaptation Algorithm/Methods

Evaluation of H.264/AVC Coding Elements and New Improved Scalability/Adaptation Algorithm/Methods IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-issn: 2278-2834,p- ISSN: 2278-8735.Volume 9, Issue 6, Ver. III (Nov - Dec. 2014), PP 105-119 Evaluation of H.264/AVC Coding Elements

More information

Comparative Study of Partial Closed-loop Versus Open-loop Motion Estimation for Coding of HDTV

Comparative Study of Partial Closed-loop Versus Open-loop Motion Estimation for Coding of HDTV Comparative Study of Partial Closed-loop Versus Open-loop Motion Estimation for Coding of HDTV Jeffrey S. McVeigh 1 and Siu-Wai Wu 2 1 Carnegie Mellon University Department of Electrical and Computer Engineering

More information

Video Coding Standards. Yao Wang Polytechnic University, Brooklyn, NY11201 http: //eeweb.poly.edu/~yao

Video Coding Standards. Yao Wang Polytechnic University, Brooklyn, NY11201 http: //eeweb.poly.edu/~yao Video Coding Standards Yao Wang Polytechnic University, Brooklyn, NY11201 http: //eeweb.poly.edu/~yao Outline Overview of Standards and Their Applications ITU-T Standards for Audio-Visual Communications

More information

Development and optimization of coding algorithms for mobile 3DTV. Gerhard Tech Heribert Brust Karsten Müller Anil Aksay Done Bugdayci

Development and optimization of coding algorithms for mobile 3DTV. Gerhard Tech Heribert Brust Karsten Müller Anil Aksay Done Bugdayci Development and optimization of coding algorithms for mobile 3DTV Gerhard Tech Heribert Brust Karsten Müller Anil Aksay Done Bugdayci Project No. 216503 Development and optimization of coding algorithms

More information

IN the early 1980 s, video compression made the leap from

IN the early 1980 s, video compression made the leap from 70 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 9, NO. 1, FEBRUARY 1999 Long-Term Memory Motion-Compensated Prediction Thomas Wiegand, Xiaozheng Zhang, and Bernd Girod, Fellow,

More information

DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS

DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS Television services in Europe currently broadcast video at a frame rate of 25 Hz. Each frame consists of two interlaced fields, giving a field rate of 50

More information

Multimedia Standards

Multimedia Standards Multimedia Standards SS 2017 Lecture 5 Prof. Dr.-Ing. Karlheinz Brandenburg Karlheinz.Brandenburg@tu-ilmenau.de Contact: Dipl.-Inf. Thomas Köllmer thomas.koellmer@tu-ilmenau.de 1 Organisational issues

More information

Improved Context-Based Adaptive Binary Arithmetic Coding in MPEG-4 AVC/H.264 Video Codec

Improved Context-Based Adaptive Binary Arithmetic Coding in MPEG-4 AVC/H.264 Video Codec Improved Context-Based Adaptive Binary Arithmetic Coding in MPEG-4 AVC/H.264 Video Codec Abstract. An improved Context-based Adaptive Binary Arithmetic Coding (CABAC) is presented for application in compression

More information