ISSN: An Efficient Fully Exploiting Spatial Correlation of Compress Compound Images in Advanced Video Coding

Similar documents
Mixed Raster Content for Compound Image Compression

Enhanced Hybrid Compound Image Compression Algorithm Combining Block and Layer-based Segmentation

Mixed Raster Content for Compound Image Compression

946 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 19, NO. 4, APRIL /$ IEEE

2014 Summer School on MPEG/VCEG Video. Video Coding Concept

USING H.264/AVC-INTRA FOR DCT BASED SEGMENTATION DRIVEN COMPOUND IMAGE COMPRESSION

Video Compression An Introduction

Modified SPIHT Image Coder For Wireless Communication

An Optimized Template Matching Approach to Intra Coding in Video/Image Compression

Frequency Band Coding Mode Selection for Key Frames of Wyner-Ziv Video Coding

Compression of Stereo Images using a Huffman-Zip Scheme

Block-Matching based image compression

Comparative and performance analysis of HEVC and H.264 Intra frame coding and JPEG2000

Multilayer Document Compression Algorithm

A new predictive image compression scheme using histogram analysis and pattern matching

Histogram Based Block Classification Scheme of Compound Images: A Hybrid Extension

Video Quality Analysis for H.264 Based on Human Visual System

H.264/AVC BASED NEAR LOSSLESS INTRA CODEC USING LINE-BASED PREDICTION AND MODIFIED CABAC. Jung-Ah Choi, Jin Heo, and Yo-Sung Ho

Improved Context-Based Adaptive Binary Arithmetic Coding in MPEG-4 AVC/H.264 Video Codec

EE 5359 MULTIMEDIA PROCESSING SPRING Final Report IMPLEMENTATION AND ANALYSIS OF DIRECTIONAL DISCRETE COSINE TRANSFORM IN H.

Digital Video Processing

Reducing/eliminating visual artifacts in HEVC by the deblocking filter.

Coding of Coefficients of two-dimensional non-separable Adaptive Wiener Interpolation Filter

Outline Introduction MPEG-2 MPEG-4. Video Compression. Introduction to MPEG. Prof. Pratikgiri Goswami

Fast Mode Decision for H.264/AVC Using Mode Prediction

Performance Comparison between DWT-based and DCT-based Encoders

JPEG 2000 vs. JPEG in MPEG Encoding

Implementation and analysis of Directional DCT in H.264

Advanced Video Coding: The new H.264 video compression standard

International Journal of Emerging Technology and Advanced Engineering Website: (ISSN , Volume 2, Issue 4, April 2012)

An Improved H.26L Coder Using Lagrangian Coder Control. Summary

Pre- and Post-Processing for Video Compression

IMAGE COMPRESSION. Image Compression. Why? Reducing transportation times Reducing file size. A two way event - compression and decompression

Complexity Reduced Mode Selection of H.264/AVC Intra Coding

Context based optimal shape coding

Adaptive Quantization for Video Compression in Frequency Domain

Intra-Mode Indexed Nonuniform Quantization Parameter Matrices in AVC/H.264

OVERVIEW OF IEEE 1857 VIDEO CODING STANDARD

Optimization of Bit Rate in Medical Image Compression

A NOVEL SCANNING SCHEME FOR DIRECTIONAL SPATIAL PREDICTION OF AVS INTRA CODING

IMPROVED CONTEXT-ADAPTIVE ARITHMETIC CODING IN H.264/AVC

A Novel Statistical Distortion Model Based on Mixed Laplacian and Uniform Distribution of Mpeg-4 FGS

EE Low Complexity H.264 encoder for mobile applications

IBM Research Report. Inter Mode Selection for H.264/AVC Using Time-Efficient Learning-Theoretic Algorithms

An Efficient Mode Selection Algorithm for H.264

CROSS-PLANE CHROMA ENHANCEMENT FOR SHVC INTER-LAYER PREDICTION

Multi-View Image Coding in 3-D Space Based on 3-D Reconstruction

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding.

SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC

A COMPARISON OF CABAC THROUGHPUT FOR HEVC/H.265 VS. AVC/H.264. Massachusetts Institute of Technology Texas Instruments

NEW CAVLC ENCODING ALGORITHM FOR LOSSLESS INTRA CODING IN H.264/AVC. Jin Heo, Seung-Hwan Kim, and Yo-Sung Ho

CONTENT ADAPTIVE COMPLEXITY REDUCTION SCHEME FOR QUALITY/FIDELITY SCALABLE HEVC

Fast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE Gaurav Hansda

Efficient MPEG-2 to H.264/AVC Intra Transcoding in Transform-domain

H.264/AVC und MPEG-4 SVC - die nächsten Generationen der Videokompression

Image Compression Algorithm and JPEG Standard

Lossless and Lossy Minimal Redundancy Pyramidal Decomposition for Scalable Image Compression Technique

CSEP 521 Applied Algorithms Spring Lossy Image Compression

Week 14. Video Compression. Ref: Fundamentals of Multimedia

EXPLORING ON STEGANOGRAPHY FOR LOW BIT RATE WAVELET BASED CODER IN IMAGE RETRIEVAL SYSTEM

Objective: Introduction: To: Dr. K. R. Rao. From: Kaustubh V. Dhonsale (UTA id: ) Date: 04/24/2012

CS 335 Graphics and Multimedia. Image Compression

Image Compression for Mobile Devices using Prediction and Direct Coding Approach

Context-Adaptive Binary Arithmetic Coding with Precise Probability Estimation and Complexity Scalability for High- Efficiency Video Coding*

EE 5359 Low Complexity H.264 encoder for mobile applications. Thejaswini Purushotham Student I.D.: Date: February 18,2010

Professor, CSE Department, Nirma University, Ahmedabad, India

High Efficiency Video Coding. Li Li 2016/10/18

FPGA IMPLEMENTATION OF BIT PLANE ENTROPY ENCODER FOR 3 D DWT BASED VIDEO COMPRESSION

Decoding-Assisted Inter Prediction for HEVC

Chapter 10. Basic Video Compression Techniques Introduction to Video Compression 10.2 Video Compression with Motion Compensation

signal-to-noise ratio (PSNR), 2

Chapter 11.3 MPEG-2. MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications:

Digital Image Representation Image Compression

Rate Distortion Optimization in Video Compression

LIST OF TABLES. Table 5.1 Specification of mapping of idx to cij for zig-zag scan 46. Table 5.2 Macroblock types 46

A LOW-COMPLEXITY AND LOSSLESS REFERENCE FRAME ENCODER ALGORITHM FOR VIDEO CODING

ABSTRACT. KEYWORD: Low complexity H.264, Machine learning, Data mining, Inter prediction. 1 INTRODUCTION

Optimum Quantization Parameters for Mode Decision in Scalable Extension of H.264/AVC Video Codec

Reduced 4x4 Block Intra Prediction Modes using Directional Similarity in H.264/AVC

Rate-distortion Optimized Streaming of Compressed Light Fields with Multiple Representations

Video Coding Using Spatially Varying Transform

JOINT INTER-INTRA PREDICTION BASED ON MODE-VARIANT AND EDGE-DIRECTED WEIGHTING APPROACHES IN VIDEO CODING

Optimal Estimation for Error Concealment in Scalable Video Coding

CHAPTER 6. 6 Huffman Coding Based Image Compression Using Complex Wavelet Transform. 6.3 Wavelet Transform based compression technique 106

A Comparison of Still-Image Compression Standards Using Different Image Quality Metrics and Proposed Methods for Improving Lossy Image Quality

Key words: B- Spline filters, filter banks, sub band coding, Pre processing, Image Averaging IJSER

Bit-Plane Decomposition Steganography Using Wavelet Compressed Video

VHDL Implementation of H.264 Video Coding Standard

The Standardization process

Xin-Fu Wang et al.: Performance Comparison of AVS and H.264/AVC 311 prediction mode and four directional prediction modes are shown in Fig.1. Intra ch

Rate-distortion Optimized Streaming of Compressed Light Fields with Multiple Representations

Modeling and Simulation of H.26L Encoder. Literature Survey. For. EE382C Embedded Software Systems. Prof. B.L. Evans

An Implementation of Multiple Region-Of-Interest Models in H.264/AVC

QUANTIZER DESIGN FOR EXPLOITING COMMON INFORMATION IN LAYERED CODING. Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose

STUDY AND IMPLEMENTATION OF VIDEO COMPRESSION STANDARDS (H.264/AVC, DIRAC)

A Novel Deblocking Filter Algorithm In H.264 for Real Time Implementation

Efficient Dictionary Based Video Coding with Reduced Side Information

CODING METHOD FOR EMBEDDING AUDIO IN VIDEO STREAM. Harri Sorokin, Jari Koivusaari, Moncef Gabbouj, and Jarmo Takala

Low-Complexity, Near-Lossless Coding of Depth Maps from Kinect-Like Depth Cameras

CONTENT ADAPTIVE SCREEN IMAGE SCALING

Transcription:

An Efficient Fully Exploiting Spatial Correlation of Compress Compound Images in Advanced Video Coding Ali Mohsin Kaittan*1 President of the Association of scientific research and development in Iraq Abstract The Mixer Raster Content ITU document compression standard (T.44) specifies a multi-layer multi-resolution representation of a compound document. The compound images consists of text, graphics and natural images, which present strong anisotropic featured. It makes existing image coding standards inefficient on compressing them. It is expected that higher compression can be achieved if more efficient compression standards are used to compress each layer. This paper proposes a novel coding scheme based on the H.264 intra fra me coding. Two new intra modes are proposed to better exploit spatial correlations in compound images. The first one is Residual Scalar Quantization mode, where intra-predicted residues are directly quantized entropy coded. The second is base colors and index map mode that can be viewed as an adaptive vector quantization. These two new methods as well as previous intra modes in H.264 are selected by the rate distortion optimization method in each block of image. The result is an unrivaled performance for compressing compound documents and natural images as demonstrated by our experiments. Keywords: H.264/Advanced Video Coding, Joint Photographic Expert Group-2000, Image Coding, Compound Document. ---------------------------------------------------------------------------------IJMTARC-------------------------------------------------------------------------- 1. Introduction The Mixed Raster Content, ITU document compression standard specifies a multi layer multi resolution representation of a compound document. In this paper we present a basic 3-layer Mixed Raster Coding codec that uses the H.264/Advanced Video Coding operating in INTRA mode to encode the binary Mask layer. The main objective is not to propose a new layer separation or a data-filling algorithm, but to show that Maximal Ratio Combining coding based on H.264/Advanced Video Coding and Joint Bi-level Image Expert Group2 can achieve better compression rates than schemes that use other state-of the-art still image coders. They usually are complicated and consist of text, graphics, and natural images. When they are displayed in remote client s wireless projectors or thin clients, how to efficiently compress then has become a prevalent and critical problem in many applications. Well recognized that the state-of the-art image coding standards (Joint Photographic Expert Group, JPEG2000 and intra frame coding of H.264) are all designed for natural images. They generate a compact description in frequency domain if the correlation of samples can be modeled by an isotropic autocorrelation function. The compound images cannot be modeled as isotropic absolutely. In addition for the compression of pure text images such as binary documents, they are already mature algorithms such as JBIG and JBIG2. But they are not good at handling natural images. Therefore some schemes have been reported to efficiently compress compound images. They can be categorized into layer based and block based approaches. The layer based approaches are mainly proposed to efficiently compress compound images. They segment an image into foreground layer, background layer and mask. The colors of text are presented in the foreground layer and the position in the mask. Both foreground and background layers are compressed by the approaches similar to those for natural images. This paper takes H.264 intra-frame coding as a benchmark to develop a coding scheme for compound images because the modebased design in H.264 provides an easy way to incorporate proposed techniques and efficiently do mode selection. At the same time we consider the advantages reported by previous studies on compound image compression. Our main contribution is to develop a comprehensive and systematic coding scheme by fully taking the properties of compound images into account. 2. Related Work Compound documents are examples of raster images that contain a mix of text and pictorial contents. When 1

compressing text it is important to preserve the edges and shape of characters accurately to facilitate reading. An example of compound image is shown in Fig.1. for compound images direct quantization and entropy coding of residues can bring big coding gain on anisotropic parts. Because of the complicated shape of text, the simple directional intra prediction form boundary samples in H.264 are not efficient some time. It not only enables us not to consider directional intra prediction but also has high coding efficiency. Fig.2.1. Example of Compound Image. The rest of this paper is organized as follows. Section 2 gives a brief overview of the proposed coding scheme. Two new intra modes and the mode selection are discussed in detail in section 3. The experimental results presented and excellent performance of the proposed scheme in Section 4. Finally the conclusion is drawn in section 5. 2.2 Methodology of Compression Scheme The main feature of two new intra modes is to fully exploit the correlation among samples without transformation techniques. First of all the strong anisotropic of compound images, transforms are not efficient under the isotropic assumption. Non transform inter coding has been proposed in considering the inefficiency of the transform coding on marginally correlated samples. However 2 Fig.2. 2.1 the block diagram of the proposed scheme. The block diagram of the proposed scheme for coding compound images is depicted in Figure 2. There are three possible paths for coding each block. The path 1 is the existing method in H.264 for intra block coding. The path 2 is the proposed Residual Scalar Quantization method. After the intra prediction the residues are quantized and entropy coded. The path 3 is the proposed BCIM method. It does not need intra prediction. The input block is converted to base colors and an index map to compress. The methods can be applied to different block size from 16x16, 8x8 to 4x4. The mode selection is completed by the rate distortion optimization algorithm similar to the reconstructed samples will update the reference buffer as prediction for neighboring block coding.

3. Proposed Compression Scheme In this paper the proposed Residual Scalar Quantization mode, Binary Coded Image Map mode and mode selection are discussed in this section. Step 1: The directional intra prediction method is similar to that used in H.264. The newest video coding standard the H.264/Advanced Video Coding has been well explained in the literature many papers have illustrated its performance showing many comparative results against coders such as Moving Picture Expert Group-2. H.264/Advanced Video Coding is a video compression standard and it was not conceived to be applied as a still image compression tool. Nevertheless the many coding advances brought into H.264/Advanced Video Coding, not only set a new benchmark for video compression, but they also makes it a formidable compressor for still images. One of the components of these advances is the intra-frame macro block prediction method. Thus in Residual Scalar Quantization mode, the boundary samples are directly used for prediction without any low-pass filtering. After prediction the residues are directly quantized. It is a core part in the Residual Scalar Quantization mode. It is a sort of dead-zone plus uniform threshold quantization and can be described by the function C(r) for the residual signal r. map are compressed with context adaptive arithmetic coder. The YUV component of base color is first quantized. The context and the remapping are used to exploit the similar patterns to enhance the compression. Compound Image Compression: The above explained text block coding procedure is incorporated into H.264 intra coding as a new mode to compress compound images with both picture content and text content efficiently. H.264/Advanced Video Coding intra coding is adopted in the existing scheme for its high coding efficiently on natural images. Further the mode selection algorithm is adopted based on natural images. Further the mode selection algorithm is adopted based on Rate Distortion Optimization in H.264/Advanced Video Coding reference software to distinguish the text blocks. The Rate Distortion Optimization -based mode selection can balance the quality of picture content and text content in terms of mean square errors. It can also select the mode adaptively according to the target bit rate. All modes in proposed scheme can be categorized into two categories those are Spatial Domain and Frequency Domain. There is a flag in bit stream to distinguish them. The whole structure of all modes in the proposed scheme is depicted in the below figure. C(r) = sign(r) x max(0,floor( r /S + 1 Z + f)) (1) S is the quantization step and f is the adaptive rounding offset. Parameter Z control the dead zone that is equal to 2S(Z-f). in order to maintain equal expected values before quantization and after inverse quantization, the rounding offset f is adaptively updated within a block as F i=f i-1 + c x( r - R(C(r) /s, i=1,2,, N-1). (2) C is a positive weighting factor and f o is set as zero at the beginning of each block. N is the pixel number in that block R(x) is the inverse quantization function of x and gives as R(x) =sign(x) x s x ( x - 1 + Z). (3) Step 2: The block after local quantization is further quantized to several base colors. Note that the pixels in the same group are quantized to the same color. The number of base colors of block depends on the content. Instead of quantizing each block to a fixed number if colors, the allowed maximum distortion is set to a constant value e.g. q 2 /4. Where q is the quantization step used in H.264/Advanced Video Coding intra coding. Thus the number of base colors of a 16x16 macro block may vary for 1 to 8. A Tree Structure Vector Quantization method is used. Where each pixel is treated as a vector. The maximum distortion is the criterion to split a tree in Tree Structure Vector Quantization. Both base color and the index 3 Fig.3.1. The structure of all modes in the proposed scheme.

FD indicated all original intra modes in H.264. SD indicates the proposed RSQ and BCIM modes. For 4x4 and 8x8 blocks, eight directions are used for prediction in the RSQ mode. The Binary Coded Image Map mode is only applied on the luminance component for such size of blocks. The 16x16 blocks the BCIM mode is applied to the luminance component or three color component jointly clustered, there is only an index map. It would save more bits. For the Residual Scalar Quantization mode, three prediction directions are available to get the whole residual block on luminance. But the mode selection between the direct quantization and the transform coding on residues is performed on 4x4 sub blocks which is consistent with the transform block size. In addition the flag that indicated FD or SD will be coded using the flags of neighboring blocks as contexts. 4. Simulation Results The proposed coding scheme for compound images is implemented into the H.264/AVC reference software JM14.0. In that only intra frame coding is used. Qp in H.264 is set from 47 to 7 with an decreasing step of %. The de-blocking filter is disabled because of the anisotropic feature of compound images Fig.4. 1 Different Types of testing images we get these results in our experiment. All results of PSNR for the luminance vs overall bit rate curves are depicted in figure 4. The curves of H.264 with the Residual Scalar Quantization mode are marked by Residual Scalar Quantization. The results of H.264 cooperated with the proposed modes Residual Scalar Quantization and Binary Coded Image Map are marked by proposed. Three schemes are selected for compression Joint Photographic Expert Group2000, H.264 intra-frame coding and the scheme proposed in marked by Ding. We only compress the luminance plane by all bits. The result of Joint Photographic Expert Group2000 should be better than that all three components are compressed. We can observe that the proposed scheme outperforms Residual Scalar Quantization and Binary Coded Image Map modes. The similar results are observed in different images. 4

[2] Ding, W., Lu, Y. and Wu,F. (2007) Enable Efficient Compound Image Compression in H.264/AVC Intra Coding, IEEE International Conference on Image Processing, Vol. 2, Pp. II - 337 - II 340. [3] Fan, J. (2008) Text extraction via an edge-bounded averaging and a parametric character model, Proc. Of the SPIE Document Recognition and Retrieval X, Santa Clara, CA, USA, Vol. 5010, Pp. 8-19. [4] Zaghetto, A. and de Queiroz, (2007) Using H.264/AVC-Intra for Segmentation-Driven Compound Document Coding, IEEE ICIP 2007, Pp. II-333 II-336. [5] R. L. de Queiroz, R. Buckley, M. Xu, Mixed raster content (MRC) model for compound image compression, Proc. SPIE, Visual Communications and Image Processing, Vol. 3653, pp. 1106-1117, Jan 2009. [6] R. L. de Queiroz, Compressing Compound Documents, in The Document and Image Compression Handbook, edited by M. Barni, Marcel- Dekker, 2005. Fig.4.2 Experimental results and comparisons 5. Conclusion [7] D. Mukherjee, N. Memon, A. Said,, JPEG-matched MRC Compression of Compound Documents, Proc. IEEE Intl. Conf. on Image Processing, ICIP, Vol. 3, Thessaloniki, Greece, pp. 434-437, Oct. 2007. This paper proposes how to extend the H.264 intra-frame coding for compound images to meet the increasing requirements in various applications. The two new intra modes (RSQ and BCIM) are proposed to fully exploit spatial correlations of samples in compound images. Simulation results demonstrate the performance of H.264 intra frame coding is improved even more than 10db on compressing compound images by integrating our proposed two modes. Meantime it keeps the original performance in compressing natural images. References [1] De Queiroz, R.L. (2011) Compressing Compound Documents, Document and Image Compression Handbook, edited by M. Barni, Marcel-Dekker. Kaittan Ali Mohsin Received technical computer engineering from Baghdad-Iraq Al- Rafidain University. He was a President of the Association of scientific research and development in Iraq. Currently, he is studying his M.E on Computer Science & Engineering at University of engineering Osmania University Hyderabad in AP 5