Contribution of CIWaM in JPEG2000 Quantization for Color Images

Similar documents
IMAGE COMPRESSION ALGORITHM BASED ON HILBERT SCANNING OF EMBEDDED QUADTREES: AN INTRODUCTION OF THE Hi-SET CODER

Modified SPIHT Image Coder For Wireless Communication

CHAPTER 4 REVERSIBLE IMAGE WATERMARKING USING BIT PLANE CODING AND LIFTING WAVELET TRANSFORM

Compression of RADARSAT Data with Block Adaptive Wavelets Abstract: 1. Introduction

Optimized Progressive Coding of Stereo Images Using Discrete Wavelet Transform

FAST AND EFFICIENT SPATIAL SCALABLE IMAGE COMPRESSION USING WAVELET LOWER TREES

Features. Sequential encoding. Progressive encoding. Hierarchical encoding. Lossless encoding using a different strategy

MRT based Fixed Block size Transform Coding

SIGNAL COMPRESSION. 9. Lossy image compression: SPIHT and S+P

QUANTIZER DESIGN FOR EXPLOITING COMMON INFORMATION IN LAYERED CODING. Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose

A Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm

Fingerprint Image Compression

CSEP 521 Applied Algorithms Spring Lossy Image Compression

JPEG Baseline JPEG Pros and Cons (as compared to JPEG2000) Advantages. Disadvantages

A ROBUST WATERMARKING SCHEME BASED ON EDGE DETECTION AND CONTRAST SENSITIVITY FUNCTION

Wavelet Transform (WT) & JPEG-2000

ERICSSON RESEARCH Media Lab. Signal Processing Laboratory Swiss Federal Institute of Technology, Lausanne

ANALYSIS OF SPIHT ALGORITHM FOR SATELLITE IMAGE COMPRESSION

Wavelet Based Image Compression Using ROI SPIHT Coding

Bit-Plane Decomposition Steganography Using Wavelet Compressed Video

A Robust Wavelet-Based Watermarking Algorithm Using Edge Detection

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding.

A Comparison of Still-Image Compression Standards Using Different Image Quality Metrics and Proposed Methods for Improving Lossy Image Quality

DCT Based, Lossy Still Image Compression

International Journal of Advanced Research in Computer Science and Software Engineering

Image Compression for Mobile Devices using Prediction and Direct Coding Approach

Key words: B- Spline filters, filter banks, sub band coding, Pre processing, Image Averaging IJSER

AUDIOVISUAL COMMUNICATION

OPTIMIZED QUANTIZATION OF WAVELET SUBBANDS FOR HIGH QUALITY REAL-TIME TEXTURE COMPRESSION. Bob Andries, Jan Lemeire, Adrian Munteanu

Image coding based on multiband wavelet and adaptive quad-tree partition

Compression of Stereo Images using a Huffman-Zip Scheme

MRT based Adaptive Transform Coder with Classified Vector Quantization (MATC-CVQ)

EFFICIENT METHODS FOR ENCODING REGIONS OF INTEREST IN THE UPCOMING JPEG2000 STILL IMAGE CODING STANDARD

Performance Comparison between DWT-based and DCT-based Encoders

Image Compression Algorithms using Wavelets: a review

JPEG Joint Photographic Experts Group ISO/IEC JTC1/SC29/WG1 Still image compression standard Features

ISSN (ONLINE): , VOLUME-3, ISSUE-1,

THREE DESCRIPTIONS OF SCALAR QUANTIZATION SYSTEM FOR EFFICIENT DATA TRANSMISSION

Reconstruction PSNR [db]

Wavelet Image Coding Measurement based on System Visual Human

SURVEILLANCE VIDEO FOR MOBILE DEVICES

THE bit-depth resolution of remote sensing images is often

Improved Qualitative Color Image Steganography Based on DWT

Comparative Study of Partial Closed-loop Versus Open-loop Motion Estimation for Coding of HDTV

Jyoti S. Pawadshetty*, Dr.J.W.Bakal** *(ME (IT)-II, PIIT New Panvel.) ** (Principal, SSJCOE Dombivali.)

Advanced Video Coding: The new H.264 video compression standard

A Study of Image Compression Based Transmission Algorithm Using SPIHT for Low Bit Rate Application

Low-Memory Packetized SPIHT Image Compression

signal-to-noise ratio (PSNR), 2

CHAPTER 6. 6 Huffman Coding Based Image Compression Using Complex Wavelet Transform. 6.3 Wavelet Transform based compression technique 106

Lecture 5: Compression I. This Week s Schedule

ISSN: An Efficient Fully Exploiting Spatial Correlation of Compress Compound Images in Advanced Video Coding

DCT-BASED IMAGE COMPRESSION USING WAVELET-BASED ALGORITHM WITH EFFICIENT DEBLOCKING FILTER

Comparative Analysis of 2-Level and 4-Level DWT for Watermarking and Tampering Detection

Error resilience capabilities (cont d R=0.5 bit/pixel, ber=0.001

A Very Low Bit Rate Image Compressor Using Transformed Classified Vector Quantization

An embedded and efficient low-complexity hierarchical image coder

JPEG2000: The New Still Picture Compression Standard

Image Denoising Based on Hybrid Fourier and Neighborhood Wavelet Coefficients Jun Cheng, Songli Lei

On a Probabilistic Approach to Rate Control for Optimal Color Image Compression and Video Transmission

Metamorphosis of High Capacity Steganography Schemes

Digital Image Steganography Techniques: Case Study. Karnataka, India.

Lossless and Lossy Minimal Redundancy Pyramidal Decomposition for Scalable Image Compression Technique

The Standardization process

Image Gap Interpolation for Color Images Using Discrete Cosine Transform

JPEG 2000 Still Image Data Compression

Embedded Rate Scalable Wavelet-Based Image Coding Algorithm with RPSWS

ELL 788 Computational Perception & Cognition July November 2015

EXPLORING ON STEGANOGRAPHY FOR LOW BIT RATE WAVELET BASED CODER IN IMAGE RETRIEVAL SYSTEM

Analysis of Image Compression Approaches Using Wavelet Transform and Kohonen's Network

JPEG 2000 compression

Reversible Wavelets for Embedded Image Compression. Sri Rama Prasanna Pavani Electrical and Computer Engineering, CU Boulder

An Embedded Wavelet Video Coder. Using Three-Dimensional Set. Partitioning in Hierarchical Trees. Beong-Jo Kim and William A.

An Embedded Wavelet Video. Set Partitioning in Hierarchical. Beong-Jo Kim and William A. Pearlman

Image Wavelet Coding Systems: Part II of Set Partition Coding and Image Wavelet Coding Systems

IMAGE COMPRESSION USING HYBRID QUANTIZATION METHOD IN JPEG

A Comparative Study of DCT, DWT & Hybrid (DCT-DWT) Transform

Image Compression Using BPD with De Based Multi- Level Thresholding

JPEG IMAGE CODING WITH ADAPTIVE QUANTIZATION

Robust Image Watermarking based on DCT-DWT- SVD Method

Keywords - DWT, Lifting Scheme, DWT Processor.

AN MSE APPROACH FOR TRAINING AND CODING STEERED MIXTURES OF EXPERTS

Comparative Analysis of Image Compression Algorithms for Deep Space Communications Systems

DIGITAL IMAGE WATERMARKING BASED ON A RELATION BETWEEN SPATIAL AND FREQUENCY DOMAINS

A Comparative Study of Image Compression Techniques

Optimization of Bit Rate in Medical Image Compression

An Embedded Wavelet Video Coder Using Three-Dimensional Set Partitioning in Hierarchical Trees (SPIHT)

JPEG 2000 Implementation Guide

Image Compression Algorithm for Different Wavelet Codes

Overcompressing JPEG images with Evolution Algorithms

CHAPTER-4 WATERMARKING OF GRAY IMAGES

Robust Image Watermarking based on Discrete Wavelet Transform, Discrete Cosine Transform & Singular Value Decomposition

REGION-BASED SPIHT CODING AND MULTIRESOLUTION DECODING OF IMAGE SEQUENCES

International Journal of Research in Computer and Communication Technology, Vol 4, Issue 11, November- 2015

Video Compression An Introduction

DIGITAL IMAGE PROCESSING WRITTEN REPORT ADAPTIVE IMAGE COMPRESSION TECHNIQUES FOR WIRELESS MULTIMEDIA APPLICATIONS

DISCRETE COSINE TRANSFORM BASED IMAGE COMPRESSION Aniket S. Dhavale 1, Ganesh B. Gadekar 2, Mahesh S. Bhagat 3, Vitthal B.

A combined fractal and wavelet image compression approach

HYBRID TRANSFORMATION TECHNIQUE FOR IMAGE COMPRESSION

Image Enhancement Techniques for Fingerprint Identification

Transcription:

Contribution of CIWaM in JPEG2000 Quantization for Color Images Jaime Moreno, Xavier Otazu and Maria Vanrell Universitat Autònoma de Barcelona, Barcelona, Spain ABSTRACT: The aim of this work is to explain how to apply perceptual concepts to define a perceptual pre-quantizer and to improve JPEG2000 compressor. The approach consists in quantizing wavelet transform coefficients using some of the human visual system behavior properties. Noise is fatal to image compression performance, because it can be both annoying for the observer and consumes excessive bandwidth when the imagery is transmitted. Perceptual prequantization reduces unperceivable details and thus improve both visual impression and transmission properties. The comparison between JPEG2000 without and with perceptual pre-quantization shows that the latter is not favorable in PSNR, but the recovered image is more compressed at the same or even better visual quality measured with a weighted PSNR. Perceptual criteria were taken from the CIWaM (Chromatic Induction Wavelet Model). 1 INTRODUCTION: Digital image compression has been a research topic for many years and a number of image compression standards has been created for different applications. The JPEG2000 [2] is intended to provide ratedistortion and subjective image quality performance superior to existing standards, as well as to supply functionality. However JPEG2000 does not provide the most relevant characteristics of the human visual system, since for removing information in order to compress the image mainly information theory criteria are applied. This information removal introduces artifacts to the image that are visible at high compression rates, because of many pixels with high perceptual significance have been discarded. Hence it is necessary an advanced model that removes information according to perceptual criteria, preserving the pixels with high perceptual relevance regardless of the numerical information. The Chromatic Induction Wavelet Model (CIWaM [4, 5]) presents some perceptual concepts that can be suitable for it. Both CIWaM and JPEG2000 use wavelet transform. CIWaM uses it in order to generate an approximation to how every pixel is perceived from a certain distance taking into account the value of its neighboring pixels. By contrast, JPEG2000 applies a perceptual criteria for all coefficients in a certain spatial frequency independently of the values of its surrounding ones. In other words, JPEG2000 performs a global transformation of wavelet coefficients, while CIWaM performs a local one. CIWaM attenuates the details that the human visual system is not able to perceive, enhances those that are perceptually relevant and produces an approximation of the image that the brain visual cortex perceives. At long distances, as Figure 1d depicts, the lack of information does not produce the well-known compression artifacts, rather it is presented as a softened version, where the details with high perceptual value remain (for example, some edges). 132 2 JPEG2000 QUANTIZATION REVIEW: 2.1 DEAD-ZONE UNIFORM SCALAR QUANTIZER: A uniform scalar quantizer is a function that maps each element in a subset of the real line to a particular value, which ensures that more zeros result [3]. In this way all thresholds are uniformly spaced by step size, except for the interval containing zero, which is called the dead-zone and extends from to +, thus a dead-zone means that the quantization range about 0 is 2. For each spatial frequency s, a basic quantizer step size s is used to quantize all the coefficients in that spatial frequency according to Equation 1. y q = sign(y) (1) s where y is the input to the quantizer or original wavelet coefficient value, sign(y) denotes the sign of y and q is the resulting quantized index.

The inverse quantizer or the reconstructed ŷ is given by the Equation 2, wherein δ is a parameter often set to place the reconstruction value at the centroid of the quantization interval and varies form 0 to 1. (q + δ) s, q > 0 ŷ = (q δ) s, q < 0 (2) 0, q =0 The International Organization for Standardization[2] recommends to adopt the mid-point reconstruction value, setting δ = 0.5. Experience indicates that some small improvements can be obtained by selecting a slightly smaller value, as Pearlman and Said suggest [6] δ = 0.375, especially for higher frequency subbands. It is important to realize that when < y <, the quantizer level and reconstruction value are both 0. For a spatial frequency, there may be many coefficients usually those of higher frequencies, that are set to 0. The array of quantizer levels q is further encoded losslessly. 2.2 JPEG2000 GLOBAL VISUAL FREQUENCY WEIGHTING: In JPEG2000, only one set of weights is chosen and applied to wavelet coefficients according to a particular viewing condition (100, 200 or 400 dpi s) with fixed visual weighting. This viewing condition may be truncated depending on the stages of embedding, in other words at low bit rates, the quality of the compressed image is poor and the detailed features of the image are not available since at a relatively large distance the low frequencies are perceptually more important. 3 CHROMATIC INDUCTION WAVELET MODEL: In order to explain the Chromatic assimilation/contrast phenomena as a unique perceptual process, Otazu et al. propose a low-level Chromatic induction model [4], which combines three important stimulus properties: spatial frequency, spatial orientation and surround contrast. Thereby the input image I is separated into different spatial frequency and orientation components using a multiresolution wavelet decomposition. Thus every single transformed coefficient is weighted using the response of the extended contrast sensitivity function (e- CSF), hence a perceptual Chromatic image I ρ is recovered. The e-csf is an extension of the perceptual CSF considering both spatial surround information and observation distance. Particulary the e-csf value decreases when the surround contrast increases and vice versa. Image I can be decomposed into a set of wavelet planes ω of different spatial frequencies, where each wavelet plane contains details at different spatial resolutions and it is described by: n I = ωs o + c n (3) s=1 o=v,h,d where n is the number of wavelet planes. The term c n is the residual plane and the index o represents the spatial orientation either vertical, horizontal or diagonal. The perceptual image I ρ recovered from the wavelet planes can be written as: n I ρ = C (ṡ, z ctr (s, o)) ωs o + c n. (4) s=1 o=v,h,d The term C (ṡ, z ctr (s, o)) is the e-csf weighting function, that tries to emulate some perceptual properties of human visual system, has a shape similar to the CSF [5]. Figure 1 shows three CIWaM images of Lena, which were calculated for a 19 inch monitor with 1280 pixels of horizontal resolution, at 30, 100 and 200 centimeters of distance. 4 PERCEPTUAL LOCAL WEIGHTING: In order to compare the JPEG2000 effectiveness and to get each bit-plane, some transformed coefficients of the Original Image or I org are selected such that I org 2 thr bpl+1, where bpl is the desired bit-plane and thr is the maximum threshold of I org, expressed as follows: 133

(a) Original (b) 30 cm. (c) 100 cm. (d) 200 cm. Fig. 1 CIWaM images of Lena. thr = log2 max Iorg (i,j) (5) (i,j) This process is applied for the three components of a opponent color space, i.e. Intensity, Red-Green and Blue-Yellow, thus this selected coefficients are inverse transformed in order to create a new Source of Image Data and to separate the original one in bit-planes. To obtain wavelet coefficients of I a Forward Transformation with the 9/7 filter fast wavelet transform is first applied on the source image data. Then, the perceptual quantized coefficients Q, calculated from a known viewing distance d as follows: n o Q= sign(ωso ) s=1 o=v,h,d C (s, zctr (s, o)) ωs s + sign(cn ) cn n (6) This expression is similar to Equation 1, but introduces perceptual criteria to each coefficient contrary to the classical Global Visual Frequency Weighting. A normalized quantization step size = 1/128 is used, namely the range between the minimal and maximal values at Iρ is divided into 128 intervals. Finally, the perceptual quantized coefficients are entropy coded, before forming the output code stream or bitstream. At the decoder, the code stream is, first, entropy second, dequantized using Equation 2 with a decoded in order to reconstruct the perceptual quantized coefficients Q, normalized quantization step size = 1/128 and δ = 3/8. Finally, an inverse discrete wavelet transform is applied to ρ, thus providing the reconstructed perceived image data. recover I 5 EXPERIMENTAL RESULTS: The software used to obtain a JPEG2000 compression was JJ2000, developed by Cannon Research, École Polytechnique Fédérale de Lausanne and Ericsson, available at http://jj2000.epfl.ch. The Perceptual Local Weighting in JPEG2000 was tested on all the color images of the Miscellaneous volume of the University of Southern California Image Data Base, available at http://sipi.usc.edu/database/index.html. The data sets were eight 256 256 pixel images and eight 512 512 pixel images, but only visual results of the well-known images Lena and Baboon are depicted, which are 24-bit color images and 512 512 of resolution. The CIWaM images were calculated for a 19 inch monitor with 1280 pixels of horizontal resolution at 50 centimeters of viewing distance. In order to measure the distortion between the original image f (i, j) and the reconstructed image fˆ(i, j) The Peak Signal to Noise Ratio was employed, however PSNR does not calculate perceptual quality measures. Therefore, it is necessary to weight each PSNR term by means of its local activity factor, taking into account the local variance of the neighbors of the studied wavelet coefficients, thus defining a weighted PSNR or wpsnr [1]. The wpsnr increases with increasing variance and vice versa as: 134

2 Gmax wpsnr = 10 log 10 wmse where G max is the maximum possible intensity value in f (i, j) (M N size) and weighted MSE (wmse) is defined as: wmse = 1 N M 2 f (i, j) ˆf (i, j) (8) NM 1+Var(i, j) i=1 j=1 CIWaM used as a method of pre-quantization, achieves better compression ratios with the same threshold, reaching better results at the highest bit-planes, since CIWaM reduces unperceivable coefficients. Figure 2a shows the contribution of CIWaM in the JPEG2000 compression ratio, for example at the eighth bit-plane, CIWaM diminishes 1.2423 bits per pixel less than without it, namely in a 512 512 pixel color image, CIWaM estimates that 39.75KB of information is perceptually irrelevant at 50 centimeters. The comparison between compression ratio and image quality is depicted by the Figure 2b, which shows that the reconstructed images pre-quantized by CIWaM (continuous function with heavy stars) has less PSNR but higher wpsnr (continuous function with heavy asterisks) than the ones quantized just by a scalar way (continuous function with heavy dots), i.e. even if the reconstructed image has a lower objective quality, this image could have a higher perceptual quality. (7) (a) Contribution of CIWaM in the JPEG2000 compression ratio (b) JPEG2000-CIWaM 27.57dB. by Bit-plane. Fig. 2 Examples of reconstructed images of Lena compressed at 0.9 bpp. Figure 3 depicts examples of reconstructed images compressed at 0.9, by means of JPEG2000 without (a) and with perceptual pre-quantization (b). Also this figures demonstrate that the CIWaM subjective quality is higher than the objective one. The Figure 4 shows examples of reconstructed images of Baboon compressed at 0.59 and 0.54 bits per pixel by means of JPEG2000 without (a) and with perceptual pre-quantization (b). PSNR in 4a is 26.18dB and in 4b is 26.15dB but wpsnr is equal to 34.08 decibels, namely the reconstructed image pre-quantized by CIWaM is perceptually better than the one just quantized by a Scalar Quantizer, since the latter has more compression artifacts, showing for example that the Baboon s eye is softer and better defined, saving additionally 1.6 KB of information. CONCLUSIONS AND FUTURE WORK: This work proposes the incorporation of a pre-quantization step to JPEG2000 using CIWaM. In order to measure the effectiveness of the perceptual quantization a performance analysis is done using 135

(a) JPEG2000 31.19dB. (b) JPEG2000-CIWaM 27.57dB. Fig. 3 Examples of reconstructed images of Lena compressed at 0.9 bpp. (a) JPEG2000 compressed at 0.59 bpp. (b) JPEG2000-CIWaM compressed at 0.54 bpp. Fig. 4 Examples of reconstructed images of Baboon. the PSNR and wpsnr measured between reconstructed and original images. Unlike PSNR, wpsnr uses not only a single coefficient but also its neighbors as well as its psycho-visual properties. The experimental results show that a CIWaM Quantization improves the compression and image perceptual quality and impacts, on the average,with about 20 per cent. One of the future tasks is the use of a threshold based on the e-csf properties, namely a threshold based on the perceptual importance of a coefficient, regardless of its numerical value. REFERENCES: [1] M. Bertini, R. Cucchiara, A. Del Bimbo, and A. Prati. An integrated framework for semantic annotation and adaptation. Multimedia Tools and Applications, 26(3):345 363, August 2005. [2] Martin Boliek, Charilaos Christopoulos, and Eric Majani. Information Technology: JPEG2000 Image Coding System. ISO/IEC JTC1/SC29 WG1, JPEG 2000, JPEG 2000 Part I final committee draft version 1.0 edition, April 2000. [3] Michael W. Marcellin, Margaret A. Lepley, Ali Bilgin, Thomas J. Flohr, Troy T. Chinen, and James H. Kasner. An overview of quantizartion of JPEG2000. Signal Processing: Image Communication, 17(1):73 84, Jan. 2002. [4] X. Otazu, C. A. Parraga, and M. Vanrell. A unified model for chromatic induction. Under review in Journal of Vision., 2009. [5] X. Otazu, M. Vanrell, and C.A. Parraga. Multiresolution wavelet framework models brightness induction effects. Vision Research, 48:733 751, 2007. [6] W. A. Pearlman and A. Said. Image wavelet coding systems: Part II of set partition coding and image wavelet coding systems. Foundations and Trends in Signal Processing, 2(3):181 246, 2008. 136