DEEP LEARNING OF COMPRESSED SENSING OPERATORS WITH STRUCTURAL SIMILARITY (SSIM) LOSS

Similar documents
Deep Learning of Compressed Sensing Operators with Structural Similarity Loss

Image Quality Assessment Techniques: An Overview

Image Quality Assessment based on Improved Structural SIMilarity

A Comparison of Still-Image Compression Standards Using Different Image Quality Metrics and Proposed Methods for Improving Lossy Image Quality

SSIM Image Quality Metric for Denoised Images

Compressive Sensing for Multimedia. Communications in Wireless Sensor Networks

Novel Lossy Compression Algorithms with Stacked Autoencoders

Image Restoration Using DNN

Bilevel Sparse Coding

Image Error Concealment Based on Watermarking

EE 5359 Multimedia project

No-reference perceptual quality metric for H.264/AVC encoded video. Maria Paula Queluz

CHAPTER 3 DISPARITY AND DEPTH MAP COMPUTATION

THREE DESCRIPTIONS OF SCALAR QUANTIZATION SYSTEM FOR EFFICIENT DATA TRANSMISSION

Learning based face hallucination techniques: A survey

Compressed Sensing Algorithm for Real-Time Doppler Ultrasound Image Reconstruction

CHAPTER-5 WATERMARKING OF COLOR IMAGES

Image and Video Quality Assessment Using Neural Network and SVM

TERM PAPER ON The Compressive Sensing Based on Biorthogonal Wavelet Basis

5.7. Fractal compression Overview

Image Restoration and Background Separation Using Sparse Representation Framework

Collaborative Sparsity and Compressive MRI

A New Psychovisual Threshold Technique in Image Processing Applications

Robust Image Watermarking based on DCT-DWT- SVD Method

Features. Sequential encoding. Progressive encoding. Hierarchical encoding. Lossless encoding using a different strategy

Today. Motivation. Motivation. Image gradient. Image gradient. Computational Photography

Efficient Color Image Quality Assessment Using Gradient Magnitude Similarity Deviation

A COMPARATIVE STUDY OF QUALITY AND CONTENT-BASED SPATIAL POOLING STRATEGIES IN IMAGE QUALITY ASSESSMENT. Dogancan Temel and Ghassan AlRegib

A Reversible Data Hiding Scheme for BTC- Compressed Images

An Efficient Image Compression Using Bit Allocation based on Psychovisual Threshold

Blind Compressed Sensing Using Sparsifying Transforms

PROBABILISTIC MEASURE OF COLOUR IMAGE PROCESSING FIDELITY

SVD FILTER BASED MULTISCALE APPROACH FOR IMAGE QUALITY ASSESSMENT. Ashirbani Saha, Gaurav Bhatnagar and Q.M. Jonathan Wu

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding.

Haar Wavelet Image Compression

DCT-BASED IMAGE QUALITY ASSESSMENT FOR MOBILE SYSTEM. Jeoong Sung Park and Tokunbo Ogunfunmi

A Novel Approach for Deblocking JPEG Images

A DEEP LEARNING APPROACH TO BLOCK-BASED COMPRESSED SENSING OF IMAGES

CS 195-5: Machine Learning Problem Set 5

Iterative CT Reconstruction Using Curvelet-Based Regularization

VIDEO DENOISING BASED ON ADAPTIVE TEMPORAL AVERAGING

MULTIMEDIA PROCESSING-EE A UNIVERSAL IMAGE QUALITY INDEX and SSIM comparison

A Novel Multi-Frame Color Images Super-Resolution Framework based on Deep Convolutional Neural Network. Zhe Li, Shu Li, Jianmin Wang and Hongyang Wang

Neural style transfer

Comparison of Image Compression Behaviour for WhatsApp, Viber and LINE

COMPRESSIVE VIDEO SAMPLING

EFFICIENT PERCEPTUAL, SELECTIVE,

MULTI-SCALE STRUCTURAL SIMILARITY FOR IMAGE QUALITY ASSESSMENT. (Invited Paper)

Robust image recovery via total-variation minimization

Peripheral drift illusion

Image Inpainting Using Sparsity of the Transform Domain

Patch-Based Color Image Denoising using efficient Pixel-Wise Weighting Techniques

WAVELET USE FOR IMAGE RESTORATION

Image Quality Assessment: From Error Visibility to Structural Similarity. Zhou Wang

New Approach of Estimating PSNR-B For Deblocked

Image Restoration with Deep Generative Models

Structural Similarity Sparse Coding

CS 6501: Deep Learning for Computer Graphics. Training Neural Networks II. Connelly Barnes

Autoencoders, denoising autoencoders, and learning deep networks

CHAPTER 9 INPAINTING USING SPARSE REPRESENTATION AND INVERSE DCT

A COMPRESSION TECHNIQUES IN DIGITAL IMAGE PROCESSING - REVIEW

Coding of 3D Videos based on Visual Discomfort

MULTIMEDIA COMMUNICATION

Detecting Burnscar from Hyperspectral Imagery via Sparse Representation with Low-Rank Interference

MRT based Adaptive Transform Coder with Classified Vector Quantization (MATC-CVQ)

Technical Recommendation S. 10/07: Source Encoding of High Definition Mobile TV Services

A Full Reference Based Objective Image Quality Assessment

Image Deblurring Using Adaptive Sparse Domain Selection and Adaptive Regularization

Multi Focus Image Fusion Using Joint Sparse Representation

CHAPTER-4 WATERMARKING OF GRAY IMAGES

Signal Reconstruction from Sparse Representations: An Introdu. Sensing

Deep Generative Models Variational Autoencoders

3. MODIFIED BLOCK TRUNCATION CODING 3.1 MODIFIED BTC METHODS FOR IMPROVED CONTRAST

Calibrated Image Acquisition for Multi-view 3D Reconstruction

Quality Measurements of Lossy Image Steganography Based on H-AMBTC Technique Using Hadamard Transform Domain

Structural Similarity Based Image Quality Assessment Using Full Reference Method

ANALYSIS OF DIFFERENT DOMAIN WATERMARKING TECHNIQUES

Performance of Quality Metrics for Compressed Medical Images Through Mean Opinion Score Prediction

CHAPTER 4 REVERSIBLE IMAGE WATERMARKING USING BIT PLANE CODING AND LIFTING WAVELET TRANSFORM

Super-Resolution (SR) image re-construction is the process of combining the information from multiple

Ripplet: a New Transform for Feature Extraction and Image Representation

CS443: Digital Imaging and Multimedia Binary Image Analysis. Spring 2008 Ahmed Elgammal Dept. of Computer Science Rutgers University

Lecture 5: Compression I. This Week s Schedule

MIXDES Methods of 3D Images Quality Assesment

An ICA based Approach for Complex Color Scene Text Binarization

Guided Image Super-Resolution: A New Technique for Photogeometric Super-Resolution in Hybrid 3-D Range Imaging

Structural Similarity Based Image Quality Assessment

One Network to Solve Them All Solving Linear Inverse Problems using Deep Projection Models

A new predictive image compression scheme using histogram analysis and pattern matching

arxiv: v1 [stat.ml] 14 Jan 2017

Objective Quality Assessment of Screen Content Images by Structure Information

OPTIMIZED QUALITY EVALUATION APPROACH OF TONED MAPPED IMAGES BASED ON OBJECTIVE QUALITY ASSESSMENT

REDUCED ORDER MODELING IN MULTISPECTRAL PHOTOACOUSTIC TOMOGRAPHY

Reduction of Blocking artifacts in Compressed Medical Images

A Model Based Neuron Detection Approach Using Sparse Location Priors

A Digital Video Watermarking Algorithm Based on LSB and DCT

Adaptive compressed image sensing based on wavelet-trees

ISSN: An Efficient Fully Exploiting Spatial Correlation of Compress Compound Images in Advanced Video Coding

SCALED WAVELET TRANSFORM VIDEO WATERMARKING METHOD USING HYBRID TECHNIQUE: SWT-SVD-DCT

Learning a Manifold as an Atlas Supplementary Material

Transcription:

DEEP LEARNING OF COMPRESSED SENSING OPERATORS WITH STRUCTURAL SIMILARITY (SSIM) LOSS

ABSTRACT Compressed sensing (CS) is a signal processing framework for efficiently reconstructing a signal from a small number of measurements, obtained by linear projections of the signal. In this paper we present an end-to-end deep learning approach for CS, in which a fullyconnected network performs both the linear sensing and non-linear reconstruction stages. During the training phase, the sensing matrix and the non-linear reconstruction operator are jointly optimized using Structural similarity index (SSIM) as loss rather than the standard Mean Squared Error () loss. We compare the proposed approach with state-of-the-art in terms of reconstruction quality under both losses, i.e. SSIM score and score.

INTRODUCTION Compressed sensing (CS) is a mathematical framework that defines the conditions and tools for the recovery of a signal from a small number of its linear projections (i.e. measurements). In the CS framework, the measurement device acquires the signal in the linear projections domain, and the full signal is reconstructed by convex optimization techniques.

INTRODUCTION In this project we address the CS problem by using a novel loss function, the SSIM loss. Our approach is based on a deep neural network, which simultaneously learns the linear sensing matrix and the non-linear reconstruction operator under the SSIM loss. During training, the proposed network jointly optimizes both the linear sensing matrix and the non-linear reconstruction operator.

COMPRESSES SENSING Given a signal x R N, an M N sensing matrix Φ (such that M N) and a measurements vector y = Φx, the goal of CS is to recover the signal from its measurements. The sensing rate is defined by R = M, and since R 1 the recovery of x is not possible N in the general case.

STRUCTURAL SIMILARITY INDEX (SSIM) The Structural SIMilarity (SSIM) index is a method for measuring the similarity between two images. The SSIM index can be viewed as a quality measure of one of the images being compared, provided the other image is regarded as of perfect quality. The difference with respect to other techniques such as Mean Squared Error () or Peak Signal-to-Noise Ratio (PSNR) is that these approaches estimate absolute errors; on the other hand, SSIM is a perception-based model that considers image degradation as perceived change in structural information, while also incorporating important perceptual phenomena, including both luminance masking and contrast masking terms.

THE PROPOSED APPROACH We propose an end-to-end deep learning solution for CS, which jointly optimizes the sensing matrix Φ and the non-linear reconstruction operator, which is parameterized by a coefficients matrix W. The proposed method provides a solution to the following joint optimization problem: Φ, W = argmin 1 Φ, W N L N W Φx i, x i i=1 where x N i i=1 is the collection of N signals. The loss function L, measures the distance between the input signal and the reconstructed one, provided by the reconstruction operator N W, whose input is the compressed samples, denoted by Φx i. N

THE PROPOSED APPROACH The fully-connected network includes the following layers: An input layer with N nodes. A compressed sensing fully-connected layer with NR nodes, R 1 (its weights form the sensing matrix). K 1 reconstruction layers with NB nodes in each layer, where B 1,2. Each layer is followed by a sigmoid activation unit. An output layer with N nodes.

FULLY- CONNECTED NETWORK SCHEME

DATASET We trained the proposed architecture on CIFAR10. The CIFAR10 dataset contains 60,000 color images of 32 32 = 1024 pixels, drawn from 10 different classes. This dataset is divided into training and test sets, containing 50,000 and 10,000 images respectively. Since our proposed network expects grayscale images, for training on CIFAR10 we used only the 1 st channel for each dataset image. Moreover, we enlarged the training set to 200,000 samples by rotating the original training set images by 90, 180 and 270 degrees.

TRAINING WITH SSIM AS LOSS FUNCTION The SSIM loss function was used with 8 8 window size. We tested 2 different weighting functions for the SSIM loss. The first function is the uniform weighting function, i.e. W x, y 1 The second function is: W x, y = log 1 + σ x 2 1 + σ 2 y Please read in paper about σ x 2, σ y 2, definitions.

PERFORMANCE EVALUATION Sensing Rate No. of Measurements 0.125 128 2 1 0.0625 64 1 2 B K Training with SSIM loss W x, y = log 1 + σ x 2 1 + σ y 2 Training with Training with SSIM loss W x, y 1 SSIM score 0.885392 0. 892905 0.868903 score 0.005893 0. 003517 0.005757 SSIM score 0.760333 0. 793642 0.719304 score 0.010316 0. 006469 0.011553

RECONSTRUCTION RESULTS Original image Original image SSIM(1) stands for weight function W x, y = log 1 + σ x 2 SSIM(2) stands for weight function W x, y 1 1 + σ 2 y

RECONSTRUCTION RESULTS Original image Original image SSIM(1) stands for weight function W x, y = log 1 + σ x 2 SSIM(2) stands for weight function W x, y 1 1 + σ 2 y

FUTURE WORK Since calculating SSIM score is more complicated than calculating score for a given image, our approach can be further improved by combining the SSIM as loss function with block-based compressed sensing approach. Learning a deep neural network for blocks reconstruction under the SSIM loss would be significantly faster than learning a network for full-image reconstruction in cases where there are many patches per image. Another possible future work direction is to expand the SSIM loss function to multi-scale SSIM loss function.