Yui-Lam CHAN and Wan-Chi SIU

Similar documents
Variable Temporal-Length 3-D Discrete Cosine Transform Coding

Image Compression Algorithm and JPEG Standard

A Very Low Bit Rate Image Compressor Using Transformed Classified Vector Quantization

Digital Video Processing

VIDEO streaming applications over the Internet are gaining. Brief Papers

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding.

signal-to-noise ratio (PSNR), 2

MOTION estimation is one of the major techniques for

Comparative Study of Partial Closed-loop Versus Open-loop Motion Estimation for Coding of HDTV

Lecture 5: Compression I. This Week s Schedule

EE 5359 MULTIMEDIA PROCESSING SPRING Final Report IMPLEMENTATION AND ANALYSIS OF DIRECTIONAL DISCRETE COSINE TRANSFORM IN H.

IMAGE COMPRESSION USING HYBRID QUANTIZATION METHOD IN JPEG

Implementation and analysis of Directional DCT in H.264

MRT based Fixed Block size Transform Coding

Video Compression An Introduction

MRT based Adaptive Transform Coder with Classified Vector Quantization (MATC-CVQ)

Performance analysis of Integer DCT of different block sizes.

Interframe coding A video scene captured as a sequence of frames can be efficiently coded by estimating and compensating for motion between frames pri

Image Quality Assessment Techniques: An Overview

Motion Estimation. Original. enhancement layers. Motion Compensation. Baselayer. Scan-Specific Entropy Coding. Prediction Error.

Rate Distortion Optimization in Video Compression

Stereo Image Compression

2014 Summer School on MPEG/VCEG Video. Video Coding Concept

A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames

International Journal of Advanced Research in Computer Science and Software Engineering

CSEP 521 Applied Algorithms Spring Lossy Image Compression

Outline Introduction MPEG-2 MPEG-4. Video Compression. Introduction to MPEG. Prof. Pratikgiri Goswami

Three Dimensional Motion Vectorless Compression

DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS

A new predictive image compression scheme using histogram analysis and pattern matching

ISSN (ONLINE): , VOLUME-3, ISSUE-1,

JPEG compression of monochrome 2D-barcode images using DCT coefficient distributions

IMAGE COMPRESSION. Image Compression. Why? Reducing transportation times Reducing file size. A two way event - compression and decompression

Multimedia Communications. Transform Coding

Multiresolution motion compensation coding for video compression

Interactive Progressive Encoding System For Transmission of Complex Images

CODING METHOD FOR EMBEDDING AUDIO IN VIDEO STREAM. Harri Sorokin, Jari Koivusaari, Moncef Gabbouj, and Jarmo Takala

Automatic Video Caption Detection and Extraction in the DCT Compressed Domain

Features. Sequential encoding. Progressive encoding. Hierarchical encoding. Lossless encoding using a different strategy

Compression of Stereo Images using a Huffman-Zip Scheme

An Intraframe Coding by Modified DCT Transform using H.264 Standard

Frequency Band Coding Mode Selection for Key Frames of Wyner-Ziv Video Coding

CMPT 365 Multimedia Systems. Media Compression - Image

An Optimized Template Matching Approach to Intra Coding in Video/Image Compression

An Embedded Wavelet Video Coder. Using Three-Dimensional Set. Partitioning in Hierarchical Trees. Beong-Jo Kim and William A.

An Embedded Wavelet Video Coder Using Three-Dimensional Set Partitioning in Hierarchical Trees (SPIHT)

Partial Video Encryption Using Random Permutation Based on Modification on Dct Based Transformation

Video compression with 1-D directional transforms in H.264/AVC

A Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm

Image Error Concealment Based on Watermarking

A Low Bit-Rate Video Codec Based on Two-Dimensional Mesh Motion Compensation with Adaptive Interpolation

PERFORMANCE ANALYSIS OF INTEGER DCT OF DIFFERENT BLOCK SIZES USED IN H.264, AVS CHINA AND WMV9.

Digital Image Steganography Techniques: Case Study. Karnataka, India.

Adaptive Quantization for Video Compression in Frequency Domain

Image Compression using Haar Wavelet Transform and Huffman Coding

Cross Layer Protocol Design

1740 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 19, NO. 7, JULY /$ IEEE

A Novel Statistical Distortion Model Based on Mixed Laplacian and Uniform Distribution of Mpeg-4 FGS

Mesh Based Interpolative Coding (MBIC)

Lecture 6: Compression II. This Week s Schedule

An Embedded Wavelet Video. Set Partitioning in Hierarchical. Beong-Jo Kim and William A. Pearlman

Digital Image Representation Image Compression

Introduction to Video Coding

Transforms for the Motion Compensation Residual

Data Hiding in Video

AUDIOVISUAL COMMUNICATION

A Robust Video Hash Scheme Based on. 2D-DCT Temporal Maximum Occurrence

Digital Image Stabilization and Its Integration with Video Encoder

CHAPTER 4 REVERSIBLE IMAGE WATERMARKING USING BIT PLANE CODING AND LIFTING WAVELET TRANSFORM

Image Compression - An Overview Jagroop Singh 1

QUANTIZER DESIGN FOR EXPLOITING COMMON INFORMATION IN LAYERED CODING. Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose

Locating 1-D Bar Codes in DCT-Domain

Lecture 3 Image and Video (MPEG) Coding

The new Hybrid approach to protect MPEG-2 video header

A Miniature-Based Image Retrieval System

An Improved Complex Spatially Scalable ACC DCT Based Video Compression Method

MPEG-4: Simple Profile (SP)

Intra-Mode Indexed Nonuniform Quantization Parameter Matrices in AVC/H.264

Lecture 5: Error Resilience & Scalability

06/12/2017. Image compression. Image compression. Image compression. Image compression. Coding redundancy: image 1 has four gray levels

Multiview Image Compression: Future Challenges and Today s Solutions

Digital Image Processing

Video Compression System for Online Usage Using DCT 1 S.B. Midhun Kumar, 2 Mr.A.Jayakumar M.E 1 UG Student, 2 Associate Professor

Robust Image Watermarking based on Discrete Wavelet Transform, Discrete Cosine Transform & Singular Value Decomposition

Transform Kernel Selection Strategy for the H.264

Chapter 11.3 MPEG-2. MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications:

JPEG IMAGE CODING WITH ADAPTIVE QUANTIZATION

Reduction of Blocking artifacts in Compressed Medical Images

Compression II: Images (JPEG)

Fast Wavelet-based Macro-block Selection Algorithm for H.264 Video Codec

EFFICIENT DEISGN OF LOW AREA BASED H.264 COMPRESSOR AND DECOMPRESSOR WITH H.264 INTEGER TRANSFORM

Laboratoire d'informatique, de Robotique et de Microélectronique de Montpellier Montpellier Cedex 5 France

Modified SPIHT Image Coder For Wireless Communication

A Comparison of Still-Image Compression Standards Using Different Image Quality Metrics and Proposed Methods for Improving Lossy Image Quality

FPGA Implementation of Low Complexity Video Encoder using Optimized 3D-DCT

FRACTAL IMAGE COMPRESSION OF GRAYSCALE AND RGB IMAGES USING DCT WITH QUADTREE DECOMPOSITION AND HUFFMAN CODING. Moheb R. Girgis and Mohammed M.

On the Selection of Image Compression Algorithms

Express Letters. A Simple and Efficient Search Algorithm for Block-Matching Motion Estimation. Jianhua Lu and Ming L. Liou

Digital Image Processing

Video Stabilization, Camera Motion Pattern Recognition and Motion Tracking Using Spatiotemporal Regularity Flow

Transcription:

A NEW ADAPTIVE INTERFRAME TRANSFORM CODING USING DIRECTIONAL CLASSIFICATION Yui-Lam CHAN and Wan-Chi SIU Department of Electronic Engineering Hong Kong Polytechnic Hung Hom, Kowloon, Hong Kong ABSTRACT Interframe transform coding is affected not only by the statistics of spatial details within a frame, but also by the variation of the amount of movement and other temporal activities in different regions of the image sequence. Therefore, adaptive techniques have to be used in order to achieve good image quality. In this paper, we propose a new version of the adaptive interframe coding method, namely directional classification, which is based on image sequence statistics. Blocks with different perceptual features such as edges and high motion activity are categorized to different classes. Then, a new adaptive quantization, associated with appropriate scanning and huffman coding, are employed based on the classification map. Coding tests using computer simulation show this technique is indeed very efficient. 1. INTRODUCTION Intra/interframe 2-dimensional Discrete Cosine Transform (DCT) is most popularly used for digital video coding system. But, the intra/interframe 2D- DCT is not compatible with the quick search mode of current consumer VCR. Besides, periodical refreshing ( periodical encoding a full frame of video using intramode) is needed to prevent from propagating errors occurring over the channel and initial turning on of the receiver. Whereas, interframe transform coding technique takes the advantage of frame-to-frame correlation as well as spatial image redundancies[l-31 without the drawbacks of intra/interframe 2-dimensional DCT. In [4], adaptive 3-dimensional DCT coding was reported to be comparable with intra/interframe 2-dimensional DCT in certain image sequences. Among different 2-dimensional image data compres sion techniques, the adaptive transform has been generally recognized as an efficient means and has been considered to have good potentials. Chen and Smith [5] classified the blocks according to the summation of ac energy, four equally populated classes have been developed. Gimlett [6] proposed using the magnitude sum of the ac coefficient. These classification concepts can be extended to the three dimensional space. Due to the image sequence property, it is not often that a 3D block will have simultaneously high spatial and temporal activities. Thus, the ac energy is not a good activity measure for coding purposes. But, in classified VQ[7-91, adaptive coding is introduced by classifying the block based on horizontal, vertical and diagonal energy. In this paper, based on the directional property, a newly selected set of frequency coefficients is used to indicate the spatial and temporal activity. Classification techniques may then be used to categorize each block according to the selected set of frequency coefficients. Then, bit assignment and quantization are employed as shown in figure 1. Based on the directional property, different classes have different coefficient distributions. Hence, a new adaptive scanning method is introduced to reduce the bit rate. At last, different huffman code tables which reflect the statistics of different classes are used for the sake of further compression. This adaptive coding technique utilizes the directional property of blocks and a significant improvement can be achieved as shown below. This paper describes the coding algorithm and also presents the results obtained by our proposed adaptive interframe transform coding using directional classification. The contents of the following sections are as follows. Section 2 describes the three dimensional discrete cosine transform. The new proposed classification scheme is presented in section 3, while the new proposed scanning method is described in section 4. Experimental results obtained by computer simulations are discussed in section 5. Finally, a conclusion is given in section 6. 0-8186-6950-0/94 $4.00 0 1994 IEEE 977

- Directional Classification A Variance --+ Matrix 4 Huffman code b table Bit allocation Matrix v- + v Adaptive ZigZag scanning c Adaptive Quantization coder + Zl -+ Input video b 3-Dimensional DCT runlength coding coding --b I Figure 1: Block diagram of adaptive interframe transforms coding system 2. THREE DIMENSIONAL DISCRETE COSINE TRANSFORM The two-dimensional discrete cosine transform is considered as an effective way to decorrelate still images in the transform domain. The following paragraphs present a summary of the extension of transform coding concepts to three dimensional space. Since the picture is presented at high frame rate, there is considerable correlation between successive frames. Hence a three-dimensional discrete cosine transformation of a block is employed to reduce temporal correlation as well as spatial correlation. Each image sequence is divided into blocks of 8x8~8 pixels and a three-dimensional discrete cosine trans form (3D-DCT) is carried out on each block. The forward transform is defined as follows: 1 7?? X(., "> w) = - y T;: C(4C(")C(W).(k7i 4 64 k=oj=o i=o (2k + 1). (2j + 1)" cos T cos - r 7 16 16 (2i + 1)w cos T 16 where t(k, j, i) is the pixel value in the image sequence for C(n) = for n=o, 1 otherwise 3. DIRECTIONAL CLASSIFICATION The transformed blocks are assigned to suitable classes according to their activity level. Within each class, bits are allocated to individual transform coefficients according to the variance matrix of the class [lo]. The conventional classification is based on the summation of the ac energy(e), defined as follows: E = 7 7 7 u=o~=ow=o C[X(U, v, W)I2 - [X(O,O, ON2 Based on the summation of the ac energy(e), four equally populated classes have been developed. But large distortion is observed in high activity blocks(i.e. blocks with large values of E). It is due to this fact that the ac energy cannot discriminate whether the block contains high motion activity, detail vertical edge or detail horizontal edge in high activity block, even it shows that the block contains high frequency coefficients. In the following scheme, we define some new parameters which can reflect the directionality of the block such as high motion activity, detail vertical edge and horizontal edge. A newly selected set of transform coefficients should be used to identify the type of the blocks. Let us define, as shown below, C,, c h and Ct to reflect the extent of the horizontal, vertical and temporal energy respectively. And also, eight classes are defined as shown in table 1 whereas T,, Th, Tt are the 978

Table 1: Directional Classification Table I 12.. Class 0 is the low activity block which is in the still background. Class 1, 2 and 3 are the blocks containing detail vertical edge, horizontal edge and high motion activity respectively. Class 4 represents high activity in spatial domain. Class 5 reflects the block which has both vertical edge and high motion activity. Class 6 reflects horizontal edge and high motion activity in same block. Class 7 is the block which has simultaneously high spatial details and high temporal activity. 4. ADAPTIVE SCANNING AND HUFFMAN CODING Since the distributions of the quantized coefficients are different in different classes. For example, in class 1 and 2, most coefficients are concentrated in the first frame, whereas, in class 3, most coefficients are concentrated in the temporal direction. For efficient use of the run length coding, adaptive scanning is used [4]. The 3-D block is realized by coding each block with the two scanning stages and the scanning chosen is the one with the lowest bit rate. The first spatial, second temporal and first temporal, second spatial scanning are described as follows: Figure 2: New scanning sequence 1. First spatial, second temporal - the zig-zag scanning of the first frame, followed by the zig-zag scanning of the successive frames. This scanning method is suitably used in still area block. However, in video coding without classification, this scanning method is more suitable. 2. First temporal, second spatial - scanning the corresponding coefficients of successive frames from low to high frequencies. For spatial scanning, it follows the zig-zag scanning. This scanning method is suitably used in moving area block. In our proposed coding scheme, the sequences of 0perations:first spatial, second temporal and first temporal, second spatial, are suitably used in classes 0, 1, 2, 4 and 3, 5, 6 respectively. So, the scanning method is based on the classification map without any extra computation. But, in class 7, since it contains both horizontal, vertical and temporal energy, the above two scanning methods are not suitable. A new scanning sequence is proposed as shown in figure 2. The number in figure 2 is the scanning order of quantized coefficients. For simplicity of illustration, a 4x4~4 block is used. The scanning sequence is: 979

According to the data in the scanning sequence, two kinds of symbols can be generated for the sake of further compression. These two symbols are coded in zerorunlength method. Based on these symbols in different classes, a different huffman code table is generated. Each symbol is then replaced by its corresponding code. 5. RESULT A series of computer simulations have been conducted to evaluate the performance of the adaptive interframe transform coding. These include the 32 frames image sequence football. The sequence contains translation, zooming-out and both slow and fast paning. A peak signal-tenoise ratio(psnr) measure of image fidelity is used to quantitatively evaluate the performance of the interframe coding systems. The PSNR of an qriginal image z(kl j, i) and its reconstruction image 2 (k, j, i) over spatial coordinates j, i and frame coordinate k is defined as : where 2552 PSNR = 10 log - MSE. 7 7 7 Figure 3 provides a comparison on peak signal-tonoise ratio(psnr) between the 3D-DCT without classification and different adaptive 3D-DCT algorithms. The PSNR performance of adaptive coding based on 151 is only slightly better than that of 3D-DCT without classification (about 0.19dB). This is due to the fact that this classification only bases on the ac energy, but no consideration of the directional energy of the block. To resolve this problem, ch, C, and Ct have been previously defined to identify the directional property. Graph (3) makes use of this property and show that it can achieve the improvement on the PSNR (about 0.2dB over graph (2)). Another advantage of the new classification scheme is that it can achieve the improvement in compression ratio with adaptive scanning, but -- on Odl 0.47 0.5r obl 471 0.87 An bi tdpixel (1) - 3D-DCT without classification (2) - classification based on [5] (3) - proposed classification with adaptive quantization (4) - proposed classification with adaptive quantization, scanning and huffman coding Figure 3: Plot of PSNR against bits/pixel no overhead is required. Based on the classification map, when Ct > Tt, (class 3, 5, 6) we can use first temporal, second spatial scanning method and in class 7, the new scanning method(see Fig. 2) is employed. In classification based on the ac energy, only the magnitude of ac energy is calculated, it has no advantage for using adaptive scanning in high activity class. This is due to the fact that the large ac energy blocks do not know the actual distribution of coefficients. Graph (4) shows the performance of classification with adaptive quantization, scanning and huffman coding. The PSNR performance of this adaptive coding is 0.65dB better than that of 3D-DCT without classification, and 0.45dB than that of classification based on the ac energy. 980

6. CONCLUSION A new classification scheme for adaptive 3D-DCT coding which takes the advantage of directionality for different styles of blocks has been described. This classification scheme is based on the horizontal, vertical and temporal energy. Adaptive quantization, scanning and huffman coding method is used based on the classification map. A conventional adaptive coding for image can extend into three dimensional space for video coding. However, the bit allocation map for high activity cannot correctly reflect the energy distribution of horizontal, vertical or temporal direction. So, the PSNR performance of the new classification scheme is better than that of the conventional classification scheme with only a little more overhead information (& bits/pixel). Besides, the computation can also save for calculating Ch, C, and Ct comparing with calculating the total ac energy- In summary, the use of the new simple classification minimizes the inner class differences and improves the objective image quality. Obviously, the classification is simple and matched with the more important block characteristics. [6] J.I.Gimlett, Use of activity classes in adaptive transform image coding, IEEE Transactions on communications, Vol-23, July 1975, pp.785-786. [7] B.RAMAMURTH1 and A.Gersho, Classified Vector quantization of images, IEEE Transactions on communications, vol-34, November 1986, pp.1105-1115. [8] K. Aizawa, H.Harashima and H. Miyakawa, Adaptive discrete cosine transform coding with vector quantization for color images, ICASSP 86, intl. Conf. on Acoust. Speech, Signal process, April,1986,pp. 985-988 [9] H. Holzlwimmer, A. v. Brandt, and W. Tengler, A 64 Kbit/s motion compensated transform coder using vector quantization with scene adaptive codebook, ICC 87, Intl. Conf. on communications, June 1987,pp.151-156. [lo] ASegall, Bit allocation and encoding for vector sources, IEEE transactions on information theory, Vol-22, Mar,1976, pp.162-169. 7. REFERENCES [l] J.A.Roese and W.K.Pratt, Theoretical performance methods for interframe transform and hybrid transform/dpcm coders, Proceedings SPIE, Val-87, August 1976,~~. 172-179. [2] J.A.Roese and W.K.Pratt, Interframe cosine Transform image codin, IEEE Transactions on communications, Vol-25, November 1977, pp. 1329-1338. [3] T. RAJ NATARAJ AN, On interframe transform coding, IEEE Transactions on communications, Voi-25, November 1977, pp. 1329-1338. [4] Olivier Chantelou, Christian Remus, Adaptive transform coding of HDTV picture, International workshop on signal processing of HDTV, L Aguila, Italy, Elsevier Science Publishers, North Holland, pp.231-238, Feb-March, 1988. [5] W.H. Chen and C.H.Smith, Adaptive coding of monochrome and color images, IEEE transactions on communications, Vol-25, November, 1977,pp.1285-1292. 981