Image Restoration and Background Separation Using Sparse Representation Framework

Similar documents
Patch-Based Color Image Denoising using efficient Pixel-Wise Weighting Techniques

Department of Electronics and Communication KMP College of Engineering, Perumbavoor, Kerala, India 1 2

Image Denoising Using Sparse Representations

BSIK-SVD: A DICTIONARY-LEARNING ALGORITHM FOR BLOCK-SPARSE REPRESENTATIONS. Yongqin Zhang, Jiaying Liu, Mading Li, Zongming Guo

Image Deblurring Using Adaptive Sparse Domain Selection and Adaptive Regularization

INCOHERENT DICTIONARY LEARNING FOR SPARSE REPRESENTATION BASED IMAGE DENOISING

Single Image Interpolation via Adaptive Non-Local Sparsity-Based Modeling

An Optimized Pixel-Wise Weighting Approach For Patch-Based Image Denoising

Expected Patch Log Likelihood with a Sparse Prior

Discriminative sparse model and dictionary learning for object category recognition

Adaptive Reconstruction Methods for Low-Dose Computed Tomography

Image Restoration Using DNN

Image denoising with patch based PCA: local versus global

IMAGE DENOISING USING NL-MEANS VIA SMOOTH PATCH ORDERING

Non-Parametric Bayesian Dictionary Learning for Sparse Image Representations

INPAINTING COLOR IMAGES IN LEARNED DICTIONARY. Bijenička cesta 54, 10002, Zagreb, Croatia

Robust Face Recognition via Sparse Representation

SUPPLEMENTARY MATERIAL

Image Denoising via Group Sparse Eigenvectors of Graph Laplacian

Compressive Sensing for Multimedia. Communications in Wireless Sensor Networks

IMAGE RESTORATION VIA EFFICIENT GAUSSIAN MIXTURE MODEL LEARNING

ISSN: (Online) Volume 3, Issue 6, June 2015 International Journal of Advance Research in Computer Science and Management Studies

IMA Preprint Series # 2281

Block Matching and 3D Filtering for Image Denoising

IMAGE DE-NOISING USING DEEP NEURAL NETWORK

A Novel Image Super-resolution Reconstruction Algorithm based on Modified Sparse Representation

IMA Preprint Series # 2211

Restoration of Images Corrupted by Mixed Gaussian Impulse Noise with Weighted Encoding

SPARSE CODES FOR NATURAL IMAGES. Davide Scaramuzza Autonomous Systems Lab (EPFL)

Sparse & Redundant Representations and Their Applications in Signal and Image Processing

PATCH-DISAGREEMENT AS A WAY TO IMPROVE K-SVD DENOISING

Hyperspectral Data Classification via Sparse Representation in Homotopy

Image Processing Via Pixel Permutations

DEEP LEARNING OF COMPRESSED SENSING OPERATORS WITH STRUCTURAL SIMILARITY (SSIM) LOSS

Sparse learned representations for image restoration

Improved Non-Local Means Algorithm Based on Dimensionality Reduction

PRINCIPAL COMPONENT ANALYSIS IMAGE DENOISING USING LOCAL PIXEL GROUPING

Learning based face hallucination techniques: A survey

Image Quality Assessment Techniques: An Overview

Image Super-Resolution Using Compressed Sensing Based on Learning Sub Dictionary

Learning Splines for Sparse Tomographic Reconstruction. Elham Sakhaee and Alireza Entezari University of Florida

Efficient Imaging Algorithms on Many-Core Platforms

A Learned Dictionary Model for Texture Classification

Solving Inverse Problems with Piecewise Linear Estimators: From Gaussian Mixture Models to Structured Sparsity

A Joint Histogram - 2D Correlation Measure for Incomplete Image Similarity

COMPARISON BETWEEN K_SVD AND OTHER FILTERING TECHNIQUE

Signal Reconstruction from Sparse Representations: An Introdu. Sensing

A Comparative Analysis of Noise Reduction Filters in Images Mandeep kaur 1, Deepinder kaur 2

Structural Similarity Based Image Quality Assessment Using Full Reference Method

An Improved Approach For Mixed Noise Removal In Color Images

Segmentation and Enhancement of Latent Fingerprints: A Coarse to Fine Ridge Structure Dictionary. Kai Cao January 16, 2014

Patch-based Image Denoising: Probability Distribution Estimation vs. Sparsity Prior

Single Image Super-Resolution via Sparse Representation over Directionally Structured Dictionaries Based on the Patch Gradient Phase Angle

Structural Similarity Optimized Wiener Filter: A Way to Fight Image Noise

Virtual Training Samples and CRC based Test Sample Reconstruction and Face Recognition Experiments Wei HUANG and Li-ming MIAO

BM3D Image Denoising with Shape-Adaptive Principal Component Analysis

Optimizing the Deblocking Algorithm for. H.264 Decoder Implementation

SHIP WAKE DETECTION FOR SAR IMAGES WITH COMPLEX BACKGROUNDS BASED ON MORPHOLOGICAL DICTIONARY LEARNING

Tomographic Image Reconstruction in Noisy and Limited Data Settings.

Image Super-Resolution Reconstruction Based On L 1/2 Sparsity

Generalized Tree-Based Wavelet Transform and Applications to Patch-Based Image Processing

Deep Learning of Compressed Sensing Operators with Structural Similarity Loss

CHAPTER 6 COUNTER PROPAGATION NEURAL NETWORK FOR IMAGE RESTORATION

IMAGE SUPER-RESOLUTION BASED ON DICTIONARY LEARNING AND ANCHORED NEIGHBORHOOD REGRESSION WITH MUTUAL INCOHERENCE

Detecting Burnscar from Hyperspectral Imagery via Sparse Representation with Low-Rank Interference

Image Denoising Via Learned Dictionaries and Sparse representation

Stimulus Synthesis for Efficient Evaluation and Refinement of Perceptual Image Quality Metrics

Sparse Component Analysis (SCA) in Random-valued and Salt and Pepper Noise Removal

SSIM Image Quality Metric for Denoised Images

Effectiveness of Sparse Features: An Application of Sparse PCA

QUATERNION-BASED SPARSE REPRESENTATION OF COLOR IMAGE. Licheng Yu, Yi Xu, Hongteng Xu, Hao Zhang

A New Orthogonalization of Locality Preserving Projection and Applications

DCT image denoising: a simple and effective image denoising algorithm

A Patch Prior for Dense 3D Reconstruction in Man-Made Environments

Image Interpolation using Collaborative Filtering

Compressed Sensing Algorithm for Real-Time Doppler Ultrasound Image Reconstruction

IMA Preprint Series # 2168

Single-Image Super-Resolution Using Multihypothesis Prediction

Robust Multimodal Dictionary Learning

Non-local Means for Stereo Image Denoising Using Structural Similarity

Synthesis and Analysis Sparse Representation Models for Image Restoration. Shuhang Gu 顾舒航. Dept. of Computing The Hong Kong Polytechnic University

Sparse Coding based Frequency Adaptive Loop Filtering for Video Coding

Solving Optimization Problems That Employ Structural Similarity As The Fidelity Measure

On Single Image Scale-Up using Sparse-Representation

Image denoising in the wavelet domain using Improved Neigh-shrink

Inverse Problems and Machine Learning

Augmented Coupled Dictionary Learning for Image Super-Resolution

Author(s): Title: Journal: ISSN: Year: 2014 Pages: Volume: 25 Issue: 5

Image Denoising Based on Hybrid Fourier and Neighborhood Wavelet Coefficients Jun Cheng, Songli Lei

ADVANCED RECONSTRUCTION FOR ELECTRON MICROSCOPY

When Sparsity Meets Low-Rankness: Transform Learning With Non-Local Low-Rank Constraint For Image Restoration

Structure-adaptive Image Denoising with 3D Collaborative Filtering

WEINER FILTER AND SUB-BLOCK DECOMPOSITION BASED IMAGE RESTORATION FOR MEDICAL APPLICATIONS

Efficient Implementation of the K-SVD Algorithm and the Batch-OMP Method

LEARNING OVERCOMPLETE SPARSIFYING TRANSFORMS WITH BLOCK COSPARSITY

DCT-BASED IMAGE QUALITY ASSESSMENT FOR MOBILE SYSTEM. Jeoong Sung Park and Tokunbo Ogunfunmi

An ICA based Approach for Complex Color Scene Text Binarization

P257 Transform-domain Sparsity Regularization in Reconstruction of Channelized Facies

ELEG Compressive Sensing and Sparse Signal Representations

A COMPARATIVE STUDY OF DIFFERENT SUPER- RESOLUTION ALGORITHMS

Transcription:

Image Restoration and Background Separation Using Sparse Representation Framework Liu, Shikun Abstract In this paper, we introduce patch-based PCA denoising and k-svd dictionary learning method for the tasks of image restoration and background separation. We carry out an empirical evaluation of the performance of the algorithm in terms of quantity and quality. The results show that both methods appear to be competitive with the state-of-the-art image restoration and background subtraction algorithms, despite its simplicity, especially for high resolution images. Keywords: Dictionary learning, Background separation, Sparse representation, Principal component analysis, k-svd algorithm 1

Contents 1 Introduction 3 2 Dictionary Learning 3 2.1 Predefined Dictionaries................................ 4 2.2 Learned Dictionaries.................................. 4 2.2.1 Principal Component Analysis........................ 4 2.2.2 K-SVD Algorithm................................ 6 3 Experiments and Performance Evaluation 6 3.1 Image Restoration.................................... 6 3.2 Background Subtraction................................ 9 4 Conclusions 10 Acknowledgements 10 References 11 2

1 Introduction Olshausen & Field [1] first introduced the concept of sparse representations in 1997. After that, it becomes the focus of much research in the area of machine learning and signal processing, leading some of the famous state-of-the-art face recognition [2], handwriting recognition [3] algorithms. Since it has been proven that images can be sparsely represented by structural primitives, sparse coding has been widely studied to solve the inverse problems in restoration, denoising, and background forground detection applications [4] using 0 norm and 1 norm minimization technique. The linear decomposition of a signal using a few atoms of a dictionary for either predefined or learned one has can be over-complete. Unlike decomposition based on principal component analysis and its variants, these model do not impose to be orthogonal, which offers more flexibility to adapt the natural representation of the data. It directly follows that dictionary learning becomes essential and critical to achieve state-of-the-art results. Both predefined and learned dictionaries are introduced in the paper, however we mainly focus on learned dictionaries in this paper. Specifically, we used K-SVD and PCA method to build the dictionaries and conduct image restoration and background separation tasks. The performance achieve superior result in both quantitative and empirical analysis. 2 Dictionary Learning Dictionary learning problem is to find a dictionary that approximates elements of a signal class using as few atoms as possible. Let a dictionary Φ = [φ 1, φ 2,, φ N ] defined a P N matrix whose columns are 2 norm atoms φ i, and a sparse coefficient vector α which most its elements are negligible. Suppose that P sampled signal x k is the linear combination of N components with possible Gaussian noise, it gives Y = N x k + ε, σ 2 = Var[ε] <. ε k=1 We need to solve the following constrained optimization problem, min α 1,α 2,,α N N α k 0 such that Y k=1 N Φα k < τ where α 0 is a pseudo-norm representing the number of nonzero elements in the vector. The constraint in the optimization problem accounts for the presence of noise and model 3 k=1

imperfection. Therefore, τ is predefined sufficiently small constant for noise compensation. If there is no noise, then the linear combination model would give a exact τ = 0, then the inequality constraint can be substituted by a equality constraint. 2.1 Predefined Dictionaries The signal transformation has been around as long as half a century ago. It became tremendously popular with the introduction of Fast Fourier Transform. Discrete Cosine/Sine transform is the result of assuming an anti-symmetric extension of the signal. Hence, it produces a smooth and continuous representation on image recovery. We include visual representations of two well-studied dictionaries below. (a) Orthogonal Vectors (b) Discrete Cosine Transform Figure 1: Top 100 8 8 Patch Predefined Dictionaries 2.2 Learned Dictionaries However, learning the dictionary instead of using predefined ones have shown to dramatically improve the performance of signal reconstruction [4]. Some recent results [5] on dictionary learning accessing the whole training set to minimize the cost function under some constraint. This paper uses the similar idea with low computational cost. 2.2.1 Principal Component Analysis In the past few years, it has been shown preferable to denoise the image patch-wise instead of each pixel individually using a over-complete dictionaries which learned from noisy image. 4

Performing PCA [6] may achieve a good performance due to the patch redundancy property of natural images. By selecting the highest variance, PCA retrieves the most dominant pattern in the image. However, this method will have serious limitations. The first is PCA cannot represent rare patches in the images, since they contribute weakly in the total variance and they usually not sparse in the basis. Second, the natural properties of PCA will have few dominant feature spaces which truly represent the original image while the remaining feature spaces resembling noises. (a) Noisy Barbara Test Image (b) Top 16 Feature Spaces (c) Least 16 Feature Spaces Figure 2: Feature Spaces Visualization From a Noisy Test Image In order to overcome those problems, we propose a sliding square window technique with size W W which conduct PCA on patches with a step = W 2. Since the sliding windows can be over-lapped, it gives a higher chance to provide true noise-free candidates. The final results are the uniform average of those candidates for each pixel. We adopt this idea into color image which apply the PCA to each color channel. Then, we build our desired dictionary as follows. Algorithm 1 Patch-based PCA on Multichannel data 1: Input: y : input image, W : size of sliding window, k : top k features space 2: [D 1, D 2, D 3 ] size(y) 3: for each channel in y do 4: for i = 1 : W : D 1 do 5: for j = 1 : W : D 2 do 6: P i im(i : i + W 1, j : j + W 1) 7: D P mean(p) 8: λ i eig(cov(d)) 9: return Top k features space corresponding to k largest λ i s 5

2.2.2 K-SVD Algorithm K-SVD algorithm [7] which uses the core idea of K-Means is an iterative method which constantly update the atoms in the predefined dictionary. Since K-SVD is flexible and is able to work with any pursuit method, a lot of works [8, 9] have applied involving natural data and prove its superior performance. In this paper, we use orthogonal matching pursuit for updating atoms in each iteration. Algorithm 2 K-SVD Method 1: Input: Y : input data, Φ : a predefined dictionary 2: Initialize a predefined dictionary Φ 3: Use OMP to solve Φ = arg min α1,,α k Y N k=1 Φα k < τ 4: for Update Φ do 5: Remove an atom φ i 6: Compute the approximation error E i = Y j i φ i α i 7: Restrict E k to represent only data points which were actually using φ i 8: Compute the SVD of E k 9: Replace φ i with the first column of SVD. 10: Return: The dictionary Φ which gives the sparsest representation of data Y Even though K-SVD performs well for in application, it is rather computationally expensive when we have a very large dimension of the dictionary. 3 Experiments and Performance Evaluation 3.1 Image Restoration For the image restoration task, both gray-scale and color images are evaluated. We provide two qualitative evaluation measurements, Peak Signal to Noise Ratio (PSNR), and Structural SIMilarity (SSIM) [10] which are defined as PSNR = 10 log 10 255 2 MSE, SSIM = (2µ x µ y + C 1 )(2σ x y C 2 ) (µ 2 x + µ2 y + C 1)(σ 2 x + σ2 y + C 2). where MSE is mean square estimation of the input data, c 1 = (0.01L) 2, c 2 = (0.03L) 2, L = 255 to stabilize the division with weak denominator, µ x, µ y, σ x, σ y and σ x y are local means, standard deviations and cross-covariance for input data. The SSIM will be range in 0 to 1 which the estimated image has a similar structure to the original image if the SSIM is close to 1; the PSNR is used in quality measurement with the higher value the better quality of reconstructed data. 6

We apply Wiener filter, along with 2 different window-size PCA, and k-svd with blocksize 16 dictionary method to compare our results. The following gives the most corrupted images we test with Gaussian noise σ = 50. (a) Original Image (d) PCA with 16 16 Window (b) Noisy Image, σ = 50 (e) PCA with 32 32 Window (c) Wiener Filter (f) K-SVD with 16 16 Window Figure 3: Gray-Scale Test Image Restoration Visualization (a) Original Image (b) Noisy Image, σ = 50 7 (c) Wiener Filter

(d) PCA with 16 16 Window (e) PCA with 32 32 Window (f) K-SVD with 16 16 Window Figure 4: Color Test Image Restoration Visualization We further test two sets of image, gray-scaled Barbara and color Lena using three levels of Gaussian noise with σ = 10, 20, 50 to conclude our evaluation using PSNR and SSIM measurements we mentioned above. The following table gives the complete results in the form of [PSNR SSIM] in each block. Noise Wiener PCA16 PCA32 KSVD σ = 10 Barbara Lena 28.15 0.7147 28.14 0.9611 28.05 0.7935 29.41 0.9733 28.77 0.8186 31.77 0.9828 27.92 0.7841 29.69 0.9727 33.91 0.9275 33.96 0.9895 σ = 20 Barbara Lena 27.00 0.4753 22.11 0.8657 26.23 0.7336 27.95 0.9644 25.85 0.6312 28.20 0.9638 26.14 0.6820 28.31 0.9640 30.16 0.8670 31.04 0.9806 σ = 50 Barbara Lena 14.15 0.1958 14.12 0.5402 22.40 0.5014 25.09 0.9420 18.31 0.3064 21.39 0.8546 20.24 0.3430 23.23 0.8964 24.68 0.6875 26.74 0.9573 Table 1: Image Restoration Quantitative Evaluation In Terms Of PSNR and SSIM From the table, we can see K-SVD gives the best result in all methods we tested above, especially for highly corrupted images. Also, PCA does a decent performance for lowcorrupted images and show to be similar on different window sizes. However, if the images were over-damaged, the Wiener filter surpass the PCA method even though it washes out the images. 8

3.2 Background Subtraction For background subtraction task, we use K-SVD (which shown to be best from the last task) to build our desired over-complete dictionary for background image. After that, we use orthogonal matching pursuit to find coefficient matrix which represents input foreground image. Since foreground has objects which background does not have, it is reasonable to have the high-sparsity coefficients representing the foreground objects. We use this idea to apply an unsupervised classifier K-means method to classify the coefficient data into two groups. We then call the smaller size of group foreground objects. We use PETS 2001 datasets to create our background and foreground dataset. We crop the dataset into 200 200 color image, and we use a overlapped 10 10 and 5 5 patch to build the dictionary. We apply 30 dictionary size with 20 sparsity for patch size 10 and 20 dictionary size with 10 sparsity for patch size 5. The followings are our results. (a) Background Image (b) Foreground Image 1 (c) Detected Objects 1 (10 10) (d) Detected Objects 1(5 5) (e) Background Image (f) Foreground Image 2 (g) Detected Objects 2 (10 10) (h) Detected Objects 2 (10 10) Figure 5: Visualization of Background and Foreground Objects Using K-Means Clustering From above, we can observe the human has been detected perfectly in patch size 10 and have some noises in patch size 5. The car object loses some details probably because the similar color between the car window and road. 9

4 Conclusions In this paper, we applied PCA and K-SVD dictionary learning methods in a sparse representation framework to conduct tasks of image restoration and background subtraction. In image restoration task, we successfully recovered both gray-scale and color damaged images in the level of structure and accuracy. We have shown that K-SVD gives the best performance especially for a highly-damaged image with Gaussian noise level of σ 20. In background subtraction task, we used an unsupervised learning K-Means clustering to classify coefficient vector based on OMP algorithm. The algorithm has given superior performance when applied a large patch size but remained some noises dealing with a smaller one. In order to solve and improve these problems, we will continue to discover and design a more powerful algorithm which would achieve better results in the future. Acknowledgements I would love to express my great gratitude to my project advisor Prof. Constantino Lagoa who introduced and guided me through the concept of convex and sparse optimization. Without his endless support and persistent help, this project would not have been completed. 10

References [1] Olshausen, Bruno A., and David J. Field. "Sparse coding with an overcomplete basis set: A strategy employed by V1?." Vision research 37.23 (1997): 3311-3325. [2] Wright, John, et al. "Robust face recognition via sparse representation." Pattern Analysis and Machine Intelligence, IEEE Transactions on 31.2 (2009): 210-227. [3] Poultney, Christopher, Sumit Chopra, and Yann L. Cun. "Efficient learning of sparse representations with an energy-based model." Advances in neural information processing systems. 2006. [4] Elad, Michael, and Michal Aharon. "Image denoising via sparse and redundant representations over learned dictionaries." Image Processing, IEEE Transactions on 15.12 (2006): 3736-3745. [5] Lee, Sung, et al. "Bio-STEER: A Semantic Web workflow tool for Grid computing in the life sciences." Future Generation Computer Systems 23.3 (2007): 497-509. [6] Jolliffe, Ian. Principal component analysis. John Wiley & Sons, Ltd, 2002. [7] Aharon, Michal, Michael Elad, and Alfred Bruckstein. "K-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation." Signal Processing, IEEE Transactions on 54.11 (2006): 4311-4322. [8] Elad, Michael, and Michal Aharon. "Image denoising via sparse and redundant representations over learned dictionaries." Image Processing, IEEE Transactions on 15.12 (2006): 3736-3745. [9] Mairal, Julien, Michael Elad, and Guillermo Sapiro. "Sparse representation for color image restoration." Image Processing, IEEE Transactions on 17.1 (2008): 53-69. [10] Wang Zhou, Bovik, Alan C., Sheikh, Hamid R., and Simoncelli, Eero P. "Image Qualifty Assessment: From Error Visibility to Structural Similarity."" IEEE Transactions on Image Processing, Volume 13, Issue 4, pp. 600612, April 2004 11