An Efficient Image Compression Using Bit Allocation based on Psychovisual Threshold

Similar documents
A New Psychovisual Threshold Technique in Image Processing Applications

Compression of Stereo Images using a Huffman-Zip Scheme

A Comparison of Still-Image Compression Standards Using Different Image Quality Metrics and Proposed Methods for Improving Lossy Image Quality

Introduction ti to JPEG

Multimedia Systems Image III (Image Compression, JPEG) Mahdi Amiri April 2011 Sharif University of Technology

A COMPRESSION TECHNIQUES IN DIGITAL IMAGE PROCESSING - REVIEW

Image Compression Algorithm and JPEG Standard

MRT based Fixed Block size Transform Coding

Features. Sequential encoding. Progressive encoding. Hierarchical encoding. Lossless encoding using a different strategy

HYBRID TRANSFORMATION TECHNIQUE FOR IMAGE COMPRESSION

Block-based Watermarking Using Random Position Key

Image Error Concealment Based on Watermarking

Optimized Progressive Coding of Stereo Images Using Discrete Wavelet Transform

Enhancing the Image Compression Rate Using Steganography

Comparative and performance analysis of HEVC and H.264 Intra frame coding and JPEG2000

JPEG Compression Using MATLAB

Topic 5 Image Compression

Adaptive Quantization for Video Compression in Frequency Domain

DIGITAL IMAGE WATERMARKING BASED ON A RELATION BETWEEN SPATIAL AND FREQUENCY DOMAINS

Implementation and analysis of Directional DCT in H.264

EE 5359 MULTIMEDIA PROCESSING SPRING Final Report IMPLEMENTATION AND ANALYSIS OF DIRECTIONAL DISCRETE COSINE TRANSFORM IN H.

A Novel Deblocking Filter Algorithm In H.264 for Real Time Implementation

IMAGE COMPRESSION. October 7, ICSY Lab, University of Kaiserslautern, Germany

Robust Image Watermarking based on DCT-DWT- SVD Method

Lecture 8 JPEG Compression (Part 3)

IMAGE COMPRESSION USING HYBRID QUANTIZATION METHOD IN JPEG

CHAPTER 6. 6 Huffman Coding Based Image Compression Using Complex Wavelet Transform. 6.3 Wavelet Transform based compression technique 106

Optimizing the Deblocking Algorithm for. H.264 Decoder Implementation

A Novel Approach for Deblocking JPEG Images

FRACTAL IMAGE COMPRESSION OF GRAYSCALE AND RGB IMAGES USING DCT WITH QUADTREE DECOMPOSITION AND HUFFMAN CODING. Moheb R. Girgis and Mohammed M.

Multimedia Communications. Transform Coding

A Reversible Data Hiding Scheme for BTC- Compressed Images

A NEW ROBUST IMAGE WATERMARKING SCHEME BASED ON DWT WITH SVD

Video compression with 1-D directional transforms in H.264/AVC

Performance Comparison between DWT-based and DCT-based Encoders

A Robust Digital Watermarking Scheme using BTC-PF in Wavelet Domain

Optimization of Bit Rate in Medical Image Compression

BLIND MEASUREMENT OF BLOCKING ARTIFACTS IN IMAGES Zhou Wang, Alan C. Bovik, and Brian L. Evans. (

JPEG. Wikipedia: Felis_silvestris_silvestris.jpg, Michael Gäbler CC BY 3.0

Department of Electronics and Communication KMP College of Engineering, Perumbavoor, Kerala, India 1 2

2014 Summer School on MPEG/VCEG Video. Video Coding Concept

Compression II: Images (JPEG)

CHAPTER 4 REVERSIBLE IMAGE WATERMARKING USING BIT PLANE CODING AND LIFTING WAVELET TRANSFORM

VC 12/13 T16 Video Compression

Partial Video Encryption Using Random Permutation Based on Modification on Dct Based Transformation

DEEP LEARNING OF COMPRESSED SENSING OPERATORS WITH STRUCTURAL SIMILARITY (SSIM) LOSS

An Optimized Template Matching Approach to Intra Coding in Video/Image Compression

Learning based face hallucination techniques: A survey

JPEG compression of monochrome 2D-barcode images using DCT coefficient distributions

Comparison of Wavelet Based Watermarking Techniques for Various Attacks

Deblocking Filter Algorithm with Low Complexity for H.264 Video Coding

EVALUATING THE SHORTCOMINGS OF IMAGE COMPRESSION TECHNIQUES

Compression of RADARSAT Data with Block Adaptive Wavelets Abstract: 1. Introduction

Digital Image Representation Image Compression

Rate Distortion Optimization in Video Compression

University of Mustansiriyah, Baghdad, Iraq

JPEG decoding using end of block markers to concurrently partition channels on a GPU. Patrick Chieppe (u ) Supervisor: Dr.

PROBABILISTIC MEASURE OF COLOUR IMAGE PROCESSING FIDELITY

Video Compression An Introduction

ISSN (ONLINE): , VOLUME-3, ISSUE-1,

Overcompressing JPEG images with Evolution Algorithms

CMPT 365 Multimedia Systems. Media Compression - Image

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding.

REVIEW ON IMAGE COMPRESSION TECHNIQUES AND ADVANTAGES OF IMAGE COMPRESSION

Wavelet Based Image Compression Using ROI SPIHT Coding

An introduction to JPEG compression using MATLAB

A Robust Watermarking Algorithm For JPEG Images

DigiPoints Volume 1. Student Workbook. Module 8 Digital Compression

Lecture 5: Compression I. This Week s Schedule

SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC

A Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm

Image Gap Interpolation for Color Images Using Discrete Cosine Transform

Compression of 3-Dimensional Medical Image Data Using Part 2 of JPEG 2000

AUDIOVISUAL COMMUNICATION

Data Hiding in Video

INF5063: Programming heterogeneous multi-core processors. September 17, 2010

Reducing/eliminating visual artifacts in HEVC by the deblocking filter.

Professor Laurence S. Dooley. School of Computing and Communications Milton Keynes, UK

Digital Image Watermarking Scheme Based on LWT and DCT

Interactive Progressive Encoding System For Transmission of Complex Images

LETTER Improvement of JPEG Compression Efficiency Using Information Hiding and Image Restoration

Reduced Frame Quantization in Video Coding

Image/video compression: howto? Aline ROUMY INRIA Rennes

Homogeneous Transcoding of HEVC for bit rate reduction

CISC 7610 Lecture 3 Multimedia data and data formats

Vidhya.N.S. Murthy Student I.D Project report for Multimedia Processing course (EE5359) under Dr. K.R. Rao

Image Segmentation Techniques for Object-Based Coding

LBP-GUIDED DEPTH IMAGE FILTER. Rui Zhong, Ruimin Hu

Compression Part 2 Lossy Image Compression (JPEG) Norm Zeck

EE 5359 Low Complexity H.264 encoder for mobile applications. Thejaswini Purushotham Student I.D.: Date: February 18,2010

CS 260: Seminar in Computer Science: Multimedia Networking

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding

VIDEO SIGNALS. Lossless coding

Image Authentication and Recovery Scheme Based on Watermarking Technique

SIGNAL COMPRESSION. 9. Lossy image compression: SPIHT and S+P

A New Watermarking Algorithm for Scanned Grey PDF Files Using Robust Logo and Hash Function

Wireless Communication

A NEW ENTROPY ENCODING ALGORITHM FOR IMAGE COMPRESSION USING DCT

Digital Watermarking with Copyright Authentication for Image Communication

( ) ; For N=1: g 1. g n

Transcription:

An Efficient Image Compression Using Bit Allocation based on Psychovisual Threshold Ferda Ernawan, Zuriani binti Mustaffa and Luhur Bayuaji Faculty of Computer Systems and Software Engineering, Universiti Malaysia Pahang Lebuhraya Tun Razak 26300 Gambang Kuantan, Malaysia E-mail: ferda1902@gmail.com or ferda@ump.edu.my Abstract One of the main part of compression is a quantization process which give a significant effect to the compression performance. However, compression based on the quantization produces blocking effect or artifact. This research proposes a novel bit allocation strategy which assigning an optimal budget of bits in compression. The bit allocation is proposed to replace the role of the quantization process in compression. The principle of psychovisual threshold is adopted to develop bit allocation strategy in the compression. This quantitative research measures the optimal bit of signals and manages the quality level. The experimental results show the efficiency of the proposed bit allocation strategy, and that the proposed bit allocation can achieve the almost same compression rate performance while can significantly produces high quality texture. When compared to JPEG compression, the compression using bit allocation achieves bit rate savings of up to 4%. The quality output provides minimum errors of artifact. The quality reconstruction improvement is up to 14% and the error reconstruction is reduced by up to 37%. Key Words: Bit allocation, Quantization table, Psychovisual threshold, Image compression 1. Introduction An requires large bits to store the data, bits redundancy need to be exploited. In general, the bit allocation process is based on rate-distortion and aims to minimize the distortion subject to a constraint on the available bitrate [1]. Bit allocation provides a critical role in compression and it prescribes bit budget for a compressed file. The bit allocation algorithms perform to realize the target bit rates or quality levels (usually represented by quantization) for different regions [2]. In this paper, the author consider the bit allocation problem represented by quantization process in compression. The quantization process under regular 8 8 block produces interchange block on the overall visual quality caused by sub-block coding. The bit allocation strategies have been studied extensively in compression and video coding. The bit allocation strategies are more frequently implemented in ROI based coding [3], multi-view coding [4]-[8], lossy set compression [9] and video compression [10][11].

This paper proposes bit allocation strategy in compression based on psychovisual threshold. The assigning bit on the signals will influence to the quality of compressed. The uniform bit will cause large quality deviation in compression [2]. By assigning more bits on low frequency order and fewer bits on high frequency order, this strategy can keep the quality stability and avoid too serious degradation. 2. Psychovisual Threshold on 256 256 DCT This research proposes bit allocation strategy based on psychovisual threshold to overcome artifact in compression. The psychovisual prescribes the noticeable difference of the compressed from the original at each frequency order [12]. The psychovisual threshold has been implemented on processing application such as watermarking [13] and compression [14]-[21]. The psychovisual threshold on 256 256 DCT [12] for each frequency order on the grayscale is shown in Fig. 1. Fig. 1. Psychovisual error threshold on 256 256 DCT for 40 real grayscale s. E (s) represents the noticeable difference of compressed from the original by absolute reconstruction error (ARE) on each frequency order. 3. Experimental Setup This experiment uses 80 RGB s to develop a set of bit allocation based on psychovisual threshold in compression. Originally, the s have an size of 512 512 raw. First, an is divided by 256 256 block. Each block is transformed by two-dimensional Discrete Cosine Transform (DCT). The signals on each regular block of 256 256 DCT are listed as a traversing array in a zigzag pattern. A linear array of signals is divided into local block of 8 coefficients. Each local block of

8 coefficients is classified into peak signals and non-peak signals. The local peak signals are identified by finding a maximum of absolute local coefficients and the remains are the nonpeak signals. The peak signals and non-peak signals are separately encoded. The peak signals are rounded into integer values. While, the non-peak signals are masked by assigning bit allocation into an optimal bits for each local block. The directive to reduce bits of signals will leave an effect on the quality. The effect of assigning bit on signals is measured by the reconstruction error per pixel. The assigning non-peak signals is increased by one bit at a time and the psychovisual threshold is set as a reference to the maximum reconstruction error for the effect of assigning bit allocation. The author proposes the assigning bit allocation on the non-peak signals as follows: C(x) M round 2 n (3.1) P M is the frequency masking, x is the local frequency block, C represents non-peak signals, P is the peak signal of local blocks and n represents the number of assigned bits. The inverse of non-peak coding is given as follows: M C(x) P 2 n (3.2) M is masking result of frequency non-peak signals, P is the same local peak signal and n represents budget of bit on the local blocks. The decoded non-peak signals are closer towards the original signals. The budget of bit allocation on each block based on psychovisual threshold 256 256 DCT is listed in Table 1. Table 1. The budget of bit allocation for the local blocks of 8 coefficients on 256 256 DCT Block No Frequency Order Bit 1 1-3 9 2-4 4-7 8 5-16 8-15 7 17-49 16-27 6 50-170 28-51 5 171-479 52-87 4 480-1097 88-131 3 1098-1831 132-170 2 1832-2977 171-217 1 2978-8191 218-507 0 The budget of bit signals is investigated on the across frequency order. This bit budget on signals can provide an optimal balance between quality and compression rate. The peak signals generalize extreme value of AC coefficients

distribution. The extreme distributions of the peak AC coefficients are shown in Fig. 2. Fig. 2. The extreme distribution of peak AC coefficients for 40 real s. Referring to Fig. 2, a red curve represents generalize of extreme distribution estimation at each frequency signal. These results are used to calculate the probability density function (pdf). For non-peak AC coefficients, the normal distribution is implemented to estimate the probability density function after assigning bit allocation. The use of the probability density functions is to make simplify computation in the encoding process. The Huffman tree does not perform repeatedly to compute the probability frequency distribution when the new input s come to be processed. The probability density function is utilized to estimate the probability distribution in Huffman coding rapidly. Three popular quality metrics, absolute reconstruction error (ARE), peak signal to noise ratio (PSNR) and structural similarity (SSIM) index are adopted in the experiment. 4. Experimental Results and Discussion The comparison of the quality from compression using bit allocation and JPEG compression are listed in Table 2. The average bit length of Huffman code on the peak and non-peak AC coefficients are listed in Table 3. Table 2. The average quality from compression using bit allocation and JPEG compression for 40 real s and 40 graphical s Method Bit Allocation JPEG Compression ARE 2.7523 3.1761 4.3724 4.0503 PSNR 38.5525 38.936 33.0548 34.1590 SSIM 0.9865 0.9854 0.9803 0.9838

Table 3. The average bit length of Huffman code from compression using bit allocation and JPEG compression for 40 real s and 40 graphical s Method Bit Allocation JPEG Compression DC Coefficients 16 16 5.7468 5.6722 AC Coefficients 2.7943 2.8314 2.8680 2.9653 Since there are only four DC coefficients under regular 256 256 DCT, they are maintained as the original DC coefficients. The DC coefficients do not give a significant effect to the average bit rate under regular 256 256 DCT, while they give most a crucial impact in the quality. The maximum possible value on the DCT DC coefficient is coded as 16 bits. Therefore, the average bit length of DC coefficients as listed in Table 3 is estimated by 16 bits on each DC coefficient. Otherwise, JPEG compression has 4096 DC coefficients under regular 8 8 DCT which give an effect to the average bit rate. The comparison of the proposed bit allocation and JPEG compression in terms of compression rate and total bit size is shown in Table 4. Table 4. The average compression rate and total bit size from the compressed using a set of bit allocation and JPEG quantization tables for 40 real s and 40 graphical s Method Bit Allocation JPEG Compression Compression rate 2.8628 2.8252 2.7463 2.6599 Total bit size 89.423 kb 90.611 kb 93.215 kb 96.244 kb The implementation of bit allocation in compression provides high quality than JPEG compression. The proposed bit allocation strategy produces high quality caused by the number of peak AC coefficients on low frequency order. The peak signals on low frequency order contains a most significant impact to the quality. The sample Lena s is selected from 80 s in order to analyse the bit allocation strategy in compression. The comparison of visual output from samples Lena, JPEG compression and compression using local bit allocation are shown in Fig. 3. In order to observe the visual quality, Lena right eye are zoomed in to 200% as depicted in Fig. 4.

(a) (b) (c) (d) Fig. 3. The comparison of visual outputs among original Lena (a), original Lena cropped (b), 8 8 JPEG quantization table (c) and compression using local bit allocation based on psychovisual threshold (d). Fig. 4. The comparison of visual outputs between 8 8 JPEG quantization table (left) and compression using local bit allocation (right) zoomed in to 200%. The JPEG compression always produces artifact s and blur in output when the visual is zoomed in to 200%. The proposed bit allocation strategy is able to overcome the artifact in the visual output. The visual quality produces significantly texture and closely to the original Lena. 5. Conclusion This research has proposed the bit allocation strategy in compression to overcome the artifact. The bit allocation strategy has been designed to replace the main role of quantization process in compression. A quantitative experiment has been conducted by measured the noticeable difference of compressed from the original based on budget of bit allocation. The proposed a set of bit allocation on the across frequency is believed that provides an optimal budget of bits in signals and maintains the finer visual quality. The experimental results show that bit allocation strategy in compression produces higher visual quality at a minimum bit rates than JPEG compression. The visual quality produces rich texture pixels and less artifact than JPEG compression.

6. Acknowledgments The authors would like to express a very special thanks to Universiti Malaysia Pahang, for providing financial support to this research project. References [1] Hachicha, W., Kaaniche, M., Beghdadi, A., Cheikh, F. A., Efficient inter-view bit allocation methods for stereo coding. IEEE Transactions on Multimedia, 17 (2015), 765-777. [2] Zhu, Y., Yuan, J., A bit allocation optimization method for ROI based compression with stable quality. 2014 International Conference on Pattern Recognition, 2014, pp. 849-854. [3] Subramanian, P., Alamelu, N. R. and Aramudhan M., Bit rate control schemes for ROI video coding. International Journal of Computer Theory and Engineering, 3 (2011), 628-631. [4] Shao, F., Jiang, G., Lin, W., Yu, M. and Dai, Q., Joint bit allocation and rate control for coding multi-view video plus depth based 3D video. IEEE Transactions on Multimedia, 15 (2013), 1843 1454. [5] Cheung, G., Velisavljevic, V. and Ortega, A., On dependent bit allocation for multiview coding with depth--based rendering. IEEE Transactions on Image Processing, 20 (2011), 3179 3194. [6] Rajaei, B., Maugey, T., Pourreza, H. and Frossard, P., Rate-distortion analysis of multiview coding in a DIBR framework. Annals Telecommunications, 68 (2013), 627 640. [7] Gelman, A., Dragotti, P. L. and Velisavljevic, V., Multiview coding using depth layers and an optimized bit allocation. IEEE Transactions on Image Processing, 21 (2012), 4092 4105. [8] Hachicha, W., Kaaniche, M., Beghdadi, A., Cheikh, F. A., Rate distortion optimal bit allocation for stereo coding. International Conference on Image Processing (ICIP 2014), 2014, pp. 5661-5665. [9] Cheng, H. and Lerner, C., Bit allocation for lossy set compression. IEEE Pacific Rim Conference on Communications, Computers and Signal Processing (PACRIM 2015), 2015, pp. 52-57. [10] Wiegand, T., Sullivan, G. J., Bjontegaard, G. and Luthra, A., Overview of the H. 264/AVC video coding standard. IEEE Transactions on Circuits and Systems for Video Technology, 13 (2003), 560 576. [11] Gao, W., Kwong, S., Yuan, H. and Wang, X., DCT coefficient distribution modeling and

quality dependency analysis based frame-level bit allocation for HEVC. IEEE Transactions on Circuits and Systems for Video Technology, 26 (2016), 139-153. [12] Abu, N. A. and Ernawan, F., A novel psychovisual threshold on large DCT for compression. The Scientific World Journal, (2015), 001-011. [13] Abu, N. A., Ernawan, F., Suryana, N. and Sahib, S., Image watermarking using psychovisual threshold over the edge. Information and Communication Technology, 2013, pp. 519-527. [14] Ernawan, F., Abu, N. A. and Suryana, N., TMT quantization table generation based on psychovisual threshold for compression. IEEE International Conference of Information and Communication Technology (ICoICT), 2013, pp. 202-207. [15] Ernawan, F., Abu N. A. and Suryana, N., Adaptive tchebichef moment transform compression using psychovisual model. Journal of Computer Science, 9 (2013), 716-725. [16] Ernawan, F., Abu, N. A. and Suryana, N., An adaptive JPEG compression using psychovisual model. Advanced Science Letters, 20 (2014), 026-031. [17] Ernawan, F., Abu, N. A. and Suryana, N., An optimal tchebichef moment quantization using psychovisual threshold for compression. Advanced Science Letters, 20 (2014), 070-074. [18] Ernawan, F., Abu, N. A. and Suryana, N., Integrating a smooth psychovisual threshold into an adaptive JPEG compression. Journal of Computers, 9 (2014), 644-653. [19] Ernawan, F., Abu, N. A. and Suryana, N., A psychovisual threshold for generating quantization process in tchebichef moment compression. Journal of Computers, 9 (2014), 702-710. [20] Abu, N. A., Ernawan, F. and Sahib, S., Psychovisual model on discrete orthonormal transform. International Conference on Mathematical Sciences and Statistics (ICMSS2013), 2013, pp.309-314. [21] Abu, N. A., Ernawan F. and Suryana, N., A generic psychovisual error threshold for the quantization table generation on JPEG compression. IEEE 9th International Colloquium on Signal Processing and its Applications (CSPA), 2013, pp. 039-043. *Corresponding author: Ferda Ernawan, Ph.D. Faculty of Computer Systems and Software Engineering, Universiti Malaysia Pahang, Lebuhraya Tun Razak 26300 Gambang Kuantan, Malaysia E-mail: ferda1902@gmail.com or ferda@ump.edu.my