CHAPTER 9 INPAINTING USING SPARSE REPRESENTATION AND INVERSE DCT

Similar documents
MRT based Fixed Block size Transform Coding

Compressive Sensing for Multimedia. Communications in Wireless Sensor Networks

Image Inpainting Using Sparsity of the Transform Domain

Chapter 4 Face Recognition Using Orthogonal Transforms

COMPRESSIVE VIDEO SAMPLING

ISSN (ONLINE): , VOLUME-3, ISSUE-1,

HYBRID TRANSFORMATION TECHNIQUE FOR IMAGE COMPRESSION

Video Compression An Introduction

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding.

4. Image Retrieval using Transformed Image Content

DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS

TERM PAPER ON The Compressive Sensing Based on Biorthogonal Wavelet Basis

RECONSTRUCTION ALGORITHMS FOR COMPRESSIVE VIDEO SENSING USING BASIS PURSUIT

An Intuitive Explanation of Fourier Theory

Compression of Stereo Images using a Huffman-Zip Scheme

Image Transformation Techniques Dr. Rajeev Srivastava Dept. of Computer Engineering, ITBHU, Varanasi

Optimised corrections for finite-difference modelling in two dimensions

A Comparative Study of DCT, DWT & Hybrid (DCT-DWT) Transform

CHAPTER 3 DISPARITY AND DEPTH MAP COMPUTATION

P257 Transform-domain Sparsity Regularization in Reconstruction of Channelized Facies

Multimedia Systems Image III (Image Compression, JPEG) Mahdi Amiri April 2011 Sharif University of Technology

A Parallel Reconfigurable Architecture for DCT of Lengths N=32/16/8

Computer Vision 2. SS 18 Dr. Benjamin Guthier Professur für Bildverarbeitung. Computer Vision 2 Dr. Benjamin Guthier

Haar Wavelet Image Compression

DCT image denoising: a simple and effective image denoising algorithm

SOME CONCEPTS IN DISCRETE COSINE TRANSFORMS ~ Jennie G. Abraham Fall 2009, EE5355

Statistical Image Compression using Fast Fourier Coefficients

Compression of Light Field Images using Projective 2-D Warping method and Block matching

FACE RECOGNITION USING INDEPENDENT COMPONENT

Optimizing the Deblocking Algorithm for. H.264 Decoder Implementation

CSEP 521 Applied Algorithms Spring Lossy Image Compression

Detecting Burnscar from Hyperspectral Imagery via Sparse Representation with Low-Rank Interference

CHAPTER 6. 6 Huffman Coding Based Image Compression Using Complex Wavelet Transform. 6.3 Wavelet Transform based compression technique 106

Problem Set 4. Assigned: March 23, 2006 Due: April 17, (6.882) Belief Propagation for Segmentation

Structurally Random Matrices

Redundant Data Elimination for Image Compression and Internet Transmission using MATLAB

CSE237A: Final Project Mid-Report Image Enhancement for portable platforms Rohit Sunkam Ramanujam Soha Dalal

2D and 3D Far-Field Radiation Patterns Reconstruction Based on Compressive Sensing

CHAPTER 6 A SECURE FAST 2D-DISCRETE FRACTIONAL FOURIER TRANSFORM BASED MEDICAL IMAGE COMPRESSION USING SPIHT ALGORITHM WITH HUFFMAN ENCODER

Face identification system using MATLAB

CHAPTER 7. Page No. 7 Conclusions and Future Scope Conclusions Future Scope 123

Automatic Logo Detection and Removal

ECE 417 Guest Lecture Video Compression in MPEG-1/2/4. Min-Hsuan Tsai Apr 02, 2013

Image processing in frequency Domain

A Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm

Schedule for Rest of Semester

A Very Low Bit Rate Image Compressor Using Transformed Classified Vector Quantization

Sparse Reconstruction / Compressive Sensing

Compression of RADARSAT Data with Block Adaptive Wavelets Abstract: 1. Introduction

A deblocking filter with two separate modes in block-based video coding

Overcompressing JPEG images with Evolution Algorithms

Image Compression using Discrete Wavelet Transform Preston Dye ME 535 6/2/18

CoE4TN3 Image Processing. Wavelet and Multiresolution Processing. Image Pyramids. Image pyramids. Introduction. Multiresolution.

DEEP LEARNING OF COMPRESSED SENSING OPERATORS WITH STRUCTURAL SIMILARITY (SSIM) LOSS

Comparative Study of Partial Closed-loop Versus Open-loop Motion Estimation for Coding of HDTV

BLIND MEASUREMENT OF BLOCKING ARTIFACTS IN IMAGES Zhou Wang, Alan C. Bovik, and Brian L. Evans. (

Image Compression Algorithm and JPEG Standard

Copyright Detection System for Videos Using TIRI-DCT Algorithm

Two Dimensional Microwave Imaging Using a Divide and Unite Algorithm

IN THIS PAPER we consider the solution of ill-posed

Image Segmentation Techniques for Object-Based Coding

SD 575 Image Processing

G009 Scale and Direction-guided Interpolation of Aliased Seismic Data in the Curvelet Domain

Tomographic reconstruction: the challenge of dark information. S. Roux

Digital Image Processing Lectures 11 & 12

A Relationship between the Robust Statistics Theory and Sparse Compressive Sensed Signals Reconstruction

AUDIOVISUAL COMMUNICATION

Bilevel Sparse Coding

A Novel Approach for Deblocking JPEG Images

Limited view X-ray CT for dimensional analysis

Computational Photography Denoising

The interpolation of a 3-D data set by a pair of 2-D filters

Compressive Sensing Based Image Reconstruction using Wavelet Transform

Context based optimal shape coding

Image Reconstruction from Multiple Sparse Representations

CHAPTER 3 DIFFERENT DOMAINS OF WATERMARKING. domain. In spatial domain the watermark bits directly added to the pixels of the cover

Deep Learning of Compressed Sensing Operators with Structural Similarity Loss

Rate Distortion Optimization in Video Compression

Adaptive Quantization for Video Compression in Frequency Domain

Steganography: Hiding Data In Plain Sight. Ryan Gibson

Optimal Sampling Geometries for TV-Norm Reconstruction of fmri Data

SUMMARY. In combination with compressive sensing, a successful reconstruction

ECE 8201: Low-dimensional Signal Models for High-dimensional Data Analysis

Artifacts and Textured Region Detection

Lecture 5: Compression I. This Week s Schedule

Detecting Salient Contours Using Orientation Energy Distribution. Part I: Thresholding Based on. Response Distribution

ECE 533 Digital Image Processing- Fall Group Project Embedded Image coding using zero-trees of Wavelet Transform

Image Enhancement Techniques for Fingerprint Identification

Illustration of SIR for CETB

The Benefit of Tree Sparsity in Accelerated MRI

Final Review. Image Processing CSE 166 Lecture 18

Measurements and Bits: Compressed Sensing meets Information Theory. Dror Baron ECE Department Rice University dsp.rice.edu/cs

Biometrics Technology: Image Processing & Pattern Recognition (by Dr. Dickson Tong)

Separate CT-Reconstruction for Orientation and Position Adaptive Wavelet Denoising

ADVANCED RECONSTRUCTION TECHNIQUES IN MRI - 2

Implementation and analysis of Directional DCT in H.264

Meshless physical simulation of semiconductor devices using a wavelet-based nodes generator

Non-Parametric Bayesian Dictionary Learning for Sparse Image Representations

CHAPTER 3 WAVELET DECOMPOSITION USING HAAR WAVELET

SECURE SEMI-FRAGILE WATERMARKING FOR IMAGE AUTHENTICATION

Transcription:

CHAPTER 9 INPAINTING USING SPARSE REPRESENTATION AND INVERSE DCT 9.1 Introduction In the previous chapters the inpainting was considered as an iterative algorithm. PDE based method uses iterations to converge to the solution and the Exemplar based method uses iterations to propagate the information from the boundary to the interior of the inpainting area. The exemplar based methods incur additional cost as they adopt an exhaustive search looking for a best matching patch. All the inpainting algorithms depend on the information present in the boundary. Hence the error caused near the boundary is propagated into the interior regions of inpainting. These problems are avoided by the method proposed in this chapter, thus addressing the last research objective. 9.2 Compressive Sensing Compressive sensing is a data acquisition technique that aims to reduce a number of measurements required for representation of signals/images by exploiting its compressibility or sparsity. Most of the signals/images will be sparse in some transform domain. A signal is said to be sparse signal if that can be compactly expressed as a linear combination of a small number of elementary signals[37]. The sparse representation theory states that sparse signals can be exactly approximated from a small number of elementary signals. The sparse representation of the signals can be 123

achieved by exploiting its compressibility or diversity, where only a small fraction of the coefficients are zero. The discrete cosine transform (DCT) represents an image as a sum of sinusoids of varying magnitudes and frequencies. The DCT has the property that, for a given image, most of the visually significant information about the image is concentrated only in a few coefficients. This ability of DCT is called energy compaction. This helps to discard the coefficients that are relatively small without introducing any visual distortion in the reconstructed image. This property of DCT is generally used in compression but in this chapter, it is used to inpaint the images. (a) (b) Fig 9.1: (a) image compression, (b) reconstruction; X - Input image, Ψ Transform matrix, S sparse transform coefficients, S= X, Ψ, Φ stable measurement matrix that ensures that the salient information in any K-sparse or compressible signal is not damaged by the dimensionality reduction from X M x N to Y m x n, Y- compressed image (ΦX) 124

Compressive sensing predicts that sparse signals and images can be reconstructed from what was previously believed to be incomplete information. Instead of taking the whole data about the images/signals it takes only the data where the coefficient value is high. The process of compressive sensing is shown in Fig 9.1 Hence, any image/signal can be expressed as, X = ΨS, where S= X, Ψ = Ψ T X and where T denotes the transpose operation. Clearly, X and S are equivalent representations of the same signal/image, with X in the time domain and S in the Ψ domain. Relating the above terms with the proposed method: X is the input image, Ψ is the transform matrix which can be thought of as DCT transform coefficient matrix, Φ is the random selection in CS which can be the pixels that we leave during downsampling, S is the sparse transform coefficients in CS; in this case it can be the transform matrix that remains after thresholding and Y is the downsampled image. The final step of getting back the image in CS is X= ΨS; here it is the zero padded transform matrix passing through inverse discrete cosine transform. Inpainting could be considered as the reconstruction step of compressive sensing where the image has to be predicted from few coefficients. Super resolution[32] image is a higher resolution image extracted from one or more low resolution image. This can be thought of as an inpainting process where the new pixels in higher resolution are considered as unknown pixels. In all these cases the sparse coefficients in the transform domain could be calculated with the known image pixels and these coefficients are used to predict the unknown values by applying suitable sized inverse DCT. The theory was derived from Compressive sensing developed by Donoho [40], [41]. More theory on sparse representation could be found in [42], [43]. Inpainting in Sparse domain is considered as denoising task in [44], [45]. These methods create 125

dictionaries as in [46] to perform the denoising. In [47] a Bayesian framework is used with an iterative EM algorithm to inpaint the image. 9.3 Inpainting Using Inverse DCT (IDCT) The two dimensional DCT transform for an image is described in chapter 3.2.3. The transformation kernel can be represented in a matrix form φ, which remains the same for any image of the same size. The DCT coefficients can be represented as transformed matrix A. For a square image a, it can be calculated using the kernel as given in Eq 9.1. [A]=[φ].[a].[φ T ] (9.1) where [.] represents the matrix and φ T is the transpose of φ. Similarly the image could be reconstructed from the coefficients using Eq. 9.2 [a]=[φ T ].[A].[φ] (9.2) The first element in the transformed matrix represents the DC element and entries with increasing vertical and horizontal index values represent higher vertical and horizontal frequencies. The values on the upper left correspond to the low frequencies and those on the bottom right correspond to the high frequencies. For most images, all the significant values are populated on the top left and the values at the bottom right corner are negligible. The images could be reconstructed from few significant values in a visually plausible manner. Conversely the insignificant values added in the high frequency range could be used to reconstruct an enlarged image. This principle is being utilized in this chapter for inpainting and producing super resolution images. The block diagram for inpainting is shown in Fig 9.2. 126

Fig 9.2: Block diagram showing the process of inpainting For inpainting an image, the mask is accepted from the user and the mask pixels are eliminated which results in reduced size of the image. The DCT coefficients are constructed for the reduced image (m x n) using DCT kernel of same size (m x n) and zeros are added in the high frequency area to make the transformed matrix size equal to that of the original size (M x N). The image is reconstructed from the coefficients using the Inverse DCT kernel of size (M x N). For irregular aperiodic masks the elimination of the mask pixels creates distortion. To avoid this, the lines containing the mask pixels are eliminated. This enables the image to be in regular shape. For color images, the algorithm is applied separately to each of the three color component. The RGB components of the image are separated and after processing each component separately, the results are combined to get back the inpainted color image. For producing the super resolution images, the DCT coefficients are obtained from the low resolution image and zeros are padded to the right and bottom ends of the transformed matrix to convert the image to the required resolution. An Inverse DCT kernel for the updated size is used and the image is reconstructed using Eq 9.2 127

9.4 Experimental Analysis The effectiveness of the DCT in energy compaction is analyzed at various levels. A percentage of the DCT coefficients of the images are neglected from the lower right part of the matrix and the images are reconstructed with lesser coefficients. The mean square error between the actual image and the reconstructed image is calculated. The error values versus various percentages of unknown DCT coefficients for few images are plotted in Fig 9.3. The images used for the analysis is given in Table 9.4. Error 180 160 140 120 100 80 60 40 20 0 50 75 80 90 % of unknown co-efficients DCT_1 DCT_2 DCT_3 DCT_4 DCT_5 DCT_6 Fig 9.3 Effect of unknown DCT coefficients in image reconstruction Image quality of the reconstructed image decreases as the number of known coefficients reduces. Fig 9.4 shows the comparison between the percentage of information known and the error value for an image. It can be seen that when the known information is less than 25% the error value increases tremendously. 128

percentage information Error value 50000 40000 30000 20000 10000 percentage information 0 97.6794.684.2680.7164.1236.6517.14 Percentage of information known Fig 9.4 performance of Image reconstruction using reduced DCT coefficients The performance of the system in producing super resolution images is measured by down sampling images and reconstructing back using suitable sized IDCT kernel. The enlargement is done in various levels. At each level the image is enlarged to twice the size of the reconstructed image of the previous level. The reconstruction up to two levels shows good results, beyond which the reconstructed images shows ringing effect. These are demonstrated in Fig. 9.5. The result of eliminating consecutive rows and columns and reconstructing the image with remaining portions of the image is given in Table 9.1. The percentage of the known pixels used to reconstruct and the mean square error calculated from the actual image is also tabulated. 129

(a) (b) Fig 9.5: (a),(b) 256 x 256 image constructed after one and 3 levels respectively For the inpainting application masks in the form of periodic, regular grids and aperiodic irregular grids are considered for various images. The result of inpainting by removing the grids for gray scale images are given in Table 9.2. The results of inpainting the periodic mask area in color images are listed in Table 9.3. The reconstructed images are clearly visible without much distortion but they appear darker than the original image when the inpainting area increases. This effect is noticeable in color images. 130

Table 9.1 Image reconstruction after ignoring varied number of rows and columns Before inpainting After inpainting % of coefficients known Mean Square error 64% 616.04 36.7% 2.26e+03 17% 4.27e+04 131

Table 9.2 Inpainting with various mask on gray scale images Image with grids Reconstructed image % of known coefficients Mean Square error 80% 716.09 64% 699.68 84% 1.2e+03 36.7% 2.3e+03 132

94.6% 134.96 84.6% 180.94 89% 359.05 133

Table 9.3 Inpainting of color images with periodic mask. Original Image Image with grids After reconstruction % known coefficients 64.13 % per color channel 17.14 % per color channel 17.14 % per color channel 36.19 % per color channel 134

Table 9.4 Input images used for experimenting DCT compaction. DCT_1 DCT_2 DCT_3 DCT_4 DCT_5 DCT_6 9.5 Conclusion DCT makes an image sparse in the transform domain. The transformed image consists of few significant values with which the image could be reconstructed. Through experimentation it is found that with 50% of the coefficients the images were reconstructed in a visually plausible way. Mild distortions are produced in certain 135

images when reconstructed with 30% of the coefficients. Visually noticeable distortions are produced when reconstructed with coefficients less than 25%. Introducing more insignificant values and reconstructing a larger sized image is explored in this chapter. This concept is used for super resolution and inpainting. The image is transformed into its sparse representation with the known coefficients and required amount of insignificant values are added to it. The image is then reconstructed using Inverse DCT. Due to periodic nature of DCT, the image pixels at periodic intervals are reconstructed in a better way. The aperiodic reconstruction is converted into periodic form by losing the information in a periodic form. This leads to additional loss of information. When the aperiodic mask becomes thicker, the quality of image reconstruction drops. The overall brightness of the reconstructed image reduces with reduced number of known coefficients which is predominant in color images. This is because the amplitude of the DC coefficient is to be shared with the newly added pixels. The method discussed in this chapter is a non iterative process and obtains the result with few matrix multiplication operations. This makes the algorithm as the fastest method suggested in the thesis. The inpainted value does not depend on the boundary, but on the entire set of known pixels. The dependency of the boundary pixels are eliminated which addresses the final research objective. 136