Image and Video Quality Assessment Using Neural Network and SVM

TSINGHUA SCIENCE AND TECHNOLOGY
ISSN 1007-0214 18/19 pp112-116
Volume 13, Number 1, February 2008

DING Wenrui, TONG Yubing, ZHANG Qishan, YANG Dongkai

School of Electronic and Information Engineering, Beihang University, Beijing 100083, China

Abstract: An image and video quality assessment method was developed using a neural network and support vector machines (SVM), with the peak signal to noise ratio (PSNR) and the structural similarity (SSIM) index used to describe image quality. The neural network was used to obtain the mapping functions between the objective quality assessment indexes and the subjective quality assessment. The SVM was used to classify the images into different types, which were assessed using different mapping functions. Video quality was assessed based on the quality of each frame in the video sequence, with various weights to describe motion and scene changes in the video. The number of isolated points in the correlations between the subjective and objective quality assessments of images and video was reduced by this method. Simulation results show that the method accurately assesses image quality. The monotonicity of the method for images is 6.94% higher than with the PSNR method, and the root mean square error is at least 35.90% lower than with the PSNR.

Key words: image quality assessment; support vector machines (SVM); neural network; isolated points

Introduction

More and more digital images and videos are being captured, transmitted, and used in computer systems worldwide. Most images and video sequences are compressed with losses before transmission, so methods are needed to evaluate image quality. Because the quality of the reconstructed images directly reflects the performance of the compression method, much attention has been paid to objective quality assessments that reflect the image and video quality, with the main task being to reduce the offset between the subjective and objective assessments.
Images and video are ultimately assessed by the human visual system (HVS), but the HVS is far more complicated than anything the peak signal to noise ratio (PSNR) captures, and some physiological and psychophysical characteristics of the HVS are not yet well understood. PSNR does not involve much characteristic information concerning the image, so distinct differences occur between the PSNR and the perceived image quality. The structural similarity (SSIM) method assesses the structural similarity between the original signal and the reconstructed signal [1-3]. Here, the images are first classified using a neural network (NN) and support vector machines (SVM). The PSNR and SSIM are used as indexes describing the image and video sequence quality. Motion and scene changes are described with the mean absolute difference (MAD). Results using the image database of the University of Texas and common video sequences show that the method can accurately assess quality [4].

Received: 2006-09-05; revised: 2007-09-07
To whom correspondence should be addressed. E-mail: ding@buaa.edu.cn; Tel: 86-10-82317392

1 PSNR and SSIM Analysis

PSNR is defined as

$$\mathrm{PSNR} = 10 \lg \frac{255^2 MN}{\sum_{i=1}^{M} \sum_{j=1}^{N} [a(i,j) - \hat{a}(i,j)]^2} \quad (1)$$

where $M$ represents the number of rows in the image, $N$ represents the number of columns in the image,

$a(i,j)$ is the pixel value of the original image, and $\hat{a}(i,j)$ is the pixel value of the distorted image. PSNR is the most common index for assessing signal quality and is valid for most images, but it has some weaknesses. For the simplest case, if the following is true,

$$|a(i,j) - a_1(i,j)| = |a(i,j) - a_2(i,j)| \quad (2)$$

then

$$a_1(i,j) = a_2(i,j) \quad (3)$$

or

$$a_1(i,j) + a_2(i,j) = 2a(i,j) \quad (4)$$

where $a_1(i,j)$ and $a_2(i,j)$ are two distorted images with the same PSNR, although $a_1(i,j)$ and $a_2(i,j)$ may not be equal and may even look quite different. Such error needs to be reduced so that PSNR can be more effective for assessing quality. This analysis focuses on how to reduce the worst case scenarios.

SSIM provides a good approximation of the image distortion by measuring changes in structural information, under the assumption that the HVS is highly adapted to extract structural information from the viewing field. If $x$ and $y$ represent images, then the SSIM is defined as

$$\mathrm{SSIM}(x,y) = [l(x,y)]^{\alpha} [c(x,y)]^{\beta} [s(x,y)]^{\gamma} \quad (5)$$

where $l(x,y)$, $c(x,y)$, and $s(x,y)$ are the luminance, contrast, and structure comparison functions with $\alpha, \beta, \gamma > 0$. $\mathrm{SSIM}(x,y)$ is then a quality assessment index. In reality, the mapping functions between $\mathrm{SSIM}(x,y)$ and $l(x,y)$, $c(x,y)$, and $s(x,y)$ are nonlinear and probably quite different from Eq. (5). As a result, there are points whose objective and subjective assessments differ, which limits the performance of this image quality assessment method. If the assessments of these points were modified and there were fewer such points, the objective results would be more closely related to the subjective assessments. For this reason, SSIM and PSNR are used together here to get a better quality assessment [5,6].

2 Quality Assessments Using Neural Network and SVM

The neural network and SVM methods were combined to set up a new method with two image quality assessment indexes, PSNR and SSIM.
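To make the two indexes concrete, the following minimal sketch computes the PSNR of Eq. (1) and a simplified SSIM in the form of Eq. (5), and builds two distortions that satisfy Eq. (2) at every pixel. Several parts are assumptions rather than the authors' implementation: the function names `psnr` and `global_ssim` are hypothetical, the SSIM is evaluated once over the whole image instead of over local windows, the comparison functions and stabilizing constants follow the usual definitions from the SSIM paper [6], and the test image is synthetic.

```python
import numpy as np


def psnr(a, a_hat, peak=255.0):
    """PSNR in dB as in Eq. (1): 10 * lg(peak^2 * M * N / sum of squared errors)."""
    a = np.asarray(a, dtype=np.float64)
    a_hat = np.asarray(a_hat, dtype=np.float64)
    sse = np.sum((a - a_hat) ** 2)
    if sse == 0.0:
        return float("inf")  # identical images: no distortion
    return float(10.0 * np.log10(peak ** 2 * a.size / sse))


def global_ssim(x, y, alpha=1.0, beta=1.0, gamma=1.0, peak=255.0):
    """Eq. (5) evaluated once over the whole image (assumption: no sliding window)."""
    x = np.asarray(x, dtype=np.float64)
    y = np.asarray(y, dtype=np.float64)
    c1 = (0.01 * peak) ** 2  # stabilizing constants as commonly used with SSIM [6]
    c2 = (0.03 * peak) ** 2
    c3 = c2 / 2.0
    mx, my = x.mean(), y.mean()
    sx, sy = x.std(), y.std()
    sxy = np.mean((x - mx) * (y - my))
    lum = (2 * mx * my + c1) / (mx ** 2 + my ** 2 + c1)  # l(x, y)
    con = (2 * sx * sy + c2) / (sx ** 2 + sy ** 2 + c2)  # c(x, y)
    struct = (sxy + c3) / (sx * sy + c3)                 # s(x, y)
    return float(lum ** alpha * con ** beta * struct ** gamma)


# Hypothetical 64x64 test image: a diagonal ramp with values in [64, 190].
rows, cols = np.indices((64, 64))
a = 64.0 + rows + cols

# Two distortions with |a - a_k| = 10 at every pixel, so Eq. (2) holds everywhere.
a1 = a + 10.0                            # uniform brightness shift
a2 = a + 10.0 * (-1.0) ** (rows + cols)  # +/-10 checkerboard pattern

print(psnr(a, a1), psnr(a, a2))                # the two PSNR values coincide
print(global_ssim(a, a1), global_ssim(a, a2))  # the simplified SSIM separates them
```

At each pixel the two distorted images either agree, as in Eq. (3), or are mirrored about the original, as in Eq. (4), which is why the PSNR cannot tell them apart while a structure-aware index can.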
The flow chart is shown in Fig. 1.

Fig. 1 Image quality assessment method

The image quality assessment is divided into training and testing parts [7]. In the training part, the isolated points were analyzed and the mapping functions between the objective and subjective quality assessments were obtained using the neural network. In the testing part, the isolated points were identified and the testing images were then assessed using the new method.

2.1 Isolated points analysis and mapping functions for assessing image quality

Definition 1 In the correlation of the subjective and objective assessments, if the offset between the assessments is larger than a threshold, $V_T$, then the image corresponding to that point in the curve is defined as an isolated point.

1st evaluation (space distance):

$$|\mathrm{MOS} - \mathrm{EVA}_{\mathrm{objective}}| > V_T \quad (6)$$

2nd evaluation (slope ratio):

$$\mathrm{MOS} / \mathrm{EVA}_{\mathrm{objective}} > V_T, \quad \text{or} \quad \mathrm{MOS} / \mathrm{EVA}_{\mathrm{objective}} < 1 / V_T \quad (7)$$

where $V_T$ is the threshold, which may be derived from tests, MOS is the subjective quality assessment (mean opinion score), and $\mathrm{EVA}_{\mathrm{objective}}$ is the objective quality assessment. In this section, the 2nd evaluation method is used, with the correlation between the subjective and objective assessments shown in Fig. 2. The isolated points are denoted by + in Fig. 2. From the above definition, the training data set was divided into two subsets, a normal set and an abnormal set.
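The two evaluation rules of Definition 1 can be sketched as a small labeling step that splits a training set into normal and abnormal subsets. This is only an illustration under stated assumptions: the names `isolated_mask` and `split_training_set` are hypothetical, the scores and the (PSNR, SSIM) feature values are made up, MOS and the objective score are assumed to be on the same positive scale, and the threshold is assumed to be greater than 1 for the slope-ratio rule.

```python
import numpy as np


def isolated_mask(mos, eva_objective, v_t, rule="slope"):
    """Boolean mask of isolated points per Eq. (6) ("distance") or Eq. (7) ("slope")."""
    mos = np.asarray(mos, dtype=np.float64)
    eva = np.asarray(eva_objective, dtype=np.float64)
    if rule == "distance":
        return np.abs(mos - eva) > v_t
    if rule == "slope":
        ratio = mos / eva  # assumes eva_objective is strictly positive
        return (ratio > v_t) | (ratio < 1.0 / v_t)
    raise ValueError("rule must be 'distance' or 'slope'")


def split_training_set(features, mos, eva_objective, v_t, rule="slope"):
    """Split samples into normal and abnormal (isolated) subsets."""
    features = np.asarray(features, dtype=np.float64)
    mos = np.asarray(mos, dtype=np.float64)
    mask = isolated_mask(mos, eva_objective, v_t, rule)
    normal = (features[~mask], mos[~mask])
    abnormal = (features[mask], mos[mask])
    return normal, abnormal, mask


# Hypothetical scores, both already scaled to (0, 1].
mos = np.array([0.90, 0.50, 0.20, 0.80])
eva = np.array([0.85, 0.55, 0.60, 0.40])
# Hypothetical (PSNR in dB, SSIM) feature pairs for the same four images.
features = np.array([[35.0, 0.98], [30.0, 0.95], [33.0, 0.97], [24.0, 0.80]])

normal, abnormal, mask = split_training_set(features, mos, eva, v_t=1.5)
print(mask)  # True marks an isolated point under the slope-ratio rule
```

Raising the threshold shrinks the abnormal subset and lowering it enlarges the subset, so the threshold gives direct control over how many points are treated as isolated.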

Fig. 2 Correlations of subjective and objective assessments

Each training subset is then used to train a backpropagation (BP) neural network to obtain the PSNR-MOS and SSIM-MOS mappings. The number of isolated points can also be controlled by adjusting the threshold in Eq. (7). Different kinds of images can be assessed by using different mappings according to the structure in Fig. 3.

Fig. 3 Mappings for various kinds of image quality assessments

PSNR and SSIM were combined as a weighted sum before using the BP neural network,

$$\mathrm{eva\_predict} = p \cdot \mathrm{PSNR} + s \cdot \mathrm{SSIM} \quad (8)$$

where $p$ and $s$ are weights derived from experiments. For normal samples, $p = 0.4$ and $s = 0.6$. For abnormal samples, $p = 0.4$ and $s = 0.6$. The BP-NN then uses the training data to obtain the final mapping functions by adjusting the weights among the network nodes.

2.2 Isolated points predictions with the SVM classifier

The isolated points were defined using the subjective quality assessment, MOS. In practice, subjective assessments are often not available when image quality has to be assessed. Therefore, isolated points were identified using an SVM classifier, which classifies samples into two kinds without using MOS. The SVM uses a kernel function to map the data in the input space to a high-dimensional feature space where the data become linearly separable. The SVM determines a generalized optimal classifying plane of the form

$$y = f(x; a) = \sum_{i=1}^{N} (a_i - a_i^*)(x_i \cdot x) + b = \sum_{i=1}^{N} (a_i - a_i^*) K(x, x_i) + b \quad (9)$$

where $K(x, x_i)$ is a kernel function satisfying Mercer's condition, $b$ is a constant, and $a_i$ and $a_i^*$ are the optimal solutions of a quadratic programming (QP) problem [8,9]. Here, the SVM classifier was trained on the results of the isolated points analysis. The image to be assessed was then classified into one of the kinds of images, each with a different classification value. A result of +1 means that the image quality is assessed in the normal way. A result of -1 means that the image quality would be assessed abnormally, so that an isolated point would be generated. The SVM classifier maximizes the distance between these two kinds.

3 Video Quality Assessment

Video quality can be assessed using a single image quality assessment with motion information included. Since the video display speed is fixed, the video motion information mainly refers to the changes between frames. The mean absolute difference (MAD) between two consecutive video images was used to describe the motion degree [10]. Since the luminance and chroma play different roles in the HVS, in the YUV space the luminance and chroma contribute differently to the final value of MAD, given by

$$\mathrm{MAD}_i = 0.7\,\mathrm{MAD}_Y(i) + 0.15\,\mathrm{MAD}_U(i) + 0.15\,\mathrm{MAD}_V(i) \quad (10)$$

where $\mathrm{MAD}_Y(i)$ is the MAD value of component Y, $\mathrm{MAD}_U(i)$ is the MAD value of component U, and $\mathrm{MAD}_V(i)$ is the MAD value of component V. The degree of motion or scene change at frame $i$, for $i = 1, \ldots, n$, is then measured by the ratio of the MAD value of the neighboring frame to $\mathrm{MAD}_i$. This degree is used to describe the motion in the video. For example, for Quickchange.yuv and Slowchange.yuv with the 4:2:0 video format, the scene changes and motion in Quickchange.yuv were more severe than in Slowchange.yuv. The video motion degree is shown in Fig. 4, where the horizontal axis is the number of frames and the vertical axis is the change degree.

Fig. 4 Degree of motion for two sample videos

The video sequence quality, output_video, is then assessed as a combination of the frame-level outputs weighted by the motion degree over the $n$ frames, where $i$ is the sequence number of the frame.

4 Results

The correlation between the subjective and objective quality assessments using the present method is shown in Fig. 5. The results in Fig. 5 show that the number of isolated points is lower in the output-MOS curve using the present method than in the PSNR-MOS or SSIM-MOS curves.

Fig. 5 Correlation of subjective and objective quality assessment

Table 1 lists the statistical data from the Video Quality Experts Group (VQEG) final report [5] used to evaluate the performance of image quality assessment methods, with two methods to describe the isolated points. $r_p$ is the linear correlation coefficient between the objective assessment value and MOS, and provides an evaluation of the prediction monotonicity. The outlier ratio evaluates the prediction consistency of a method. The root mean square error (RMSE) evaluates the prediction accuracy.
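The three statistics just described can be sketched in a few lines. The function names are hypothetical and the sample scores are made up. The outlier rule used here, a fixed absolute-error threshold, is an assumption, since the text does not state the exact criterion. The last line only restates the relative change between the two RMSE values reported for the image test (0.3053 for PSNR and 0.1952 for the present method).

```python
import numpy as np


def pearson_r(objective, mos):
    """Linear (Pearson) correlation coefficient r_p between objective scores and MOS."""
    return float(np.corrcoef(objective, mos)[0, 1])


def rmse(objective, mos):
    """Root mean square error between objective scores and MOS."""
    err = np.asarray(objective, dtype=np.float64) - np.asarray(mos, dtype=np.float64)
    return float(np.sqrt(np.mean(err ** 2)))


def outlier_ratio(objective, mos, threshold):
    """Fraction of samples whose absolute error exceeds `threshold` (assumed rule)."""
    err = np.asarray(objective, dtype=np.float64) - np.asarray(mos, dtype=np.float64)
    return float(np.mean(np.abs(err) > threshold))


def relative_change_percent(value, reference):
    """Signed change of `value` relative to `reference`, in percent."""
    return 100.0 * (value - reference) / reference


# Hypothetical objective scores and MOS values on a common [0, 1] scale.
objective = np.array([0.20, 0.40, 0.60, 0.80])
mos = np.array([0.25, 0.35, 0.65, 0.75])

print(pearson_r(objective, mos))
print(rmse(objective, mos))
print(outlier_ratio(objective, mos, threshold=0.04))

# Reported RMSE values: PSNR 0.3053, present method 0.1952.
print(relative_change_percent(0.1952, 0.3053))  # negative, i.e. the RMSE decreases
```

A higher $r_p$ and a lower outlier ratio and RMSE all indicate better agreement with the subjective scores, so an improvement in RMSE shows up as a negative relative change.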
Table 1 Statistical data for image quality assessment

| Method | $r_p$ | Outlier | RMSE |
|---|---|---|---|
| PSNR | 0.8979 | 0 | 0.3053 |
| SSIM | 0.9601 | 0.3947 | 0.4860 |
| Present method | 0.9640 | 0 | 0.1952 |

With the same level of prediction consistency, $r_p$ (monotonicity) of the present method is 7.42% higher than that of the PSNR method, and the RMSE is 36.06% lower than that of the PSNR. The objective assessment of the distorted image in Fig. 6 derived from the present method was 0.63, with the PSNR being 28.7 dB and the SSIM being 0.98. The highest possible score is 1, which corresponds to no loss between the two images. The quality of Fig. 6b is much worse than that of Fig. 6a, but Fig. 6b is still distinguishable. Therefore, the results of the present method are closer to the subjective quality assessment.

A total of 70 quarter common intermediate format (QCIF) video sequences and 25 common intermediate format (CIF) video sequences were used for the video

quality assessment. Table 2 lists $r_p$, the outlier ratio, and RMSE for the three methods. The results show that $r_p$ of the present method is 10.47% higher than with the PSNR and the RMSE is 10.48% lower than with the PSNR, with the consistency being far better than with the PSNR method.

Fig. 6 Original and distorted images

Table 2 Statistical data for the video quality assessment

| Method | $r_p$ | Outlier | RMSE |
|---|---|---|---|
| PSNR | 0.6559 | 0.6000 | 0.4912 |
| SSIM | 0.7075 | 0.2421 | 0.3238 |
| Present method | 0.7246 | 0.0316 | 0.3864 |

5 Conclusions

The concept of isolated points was developed to better define image quality. The isolated points analysis was used to develop a neural network and SVM based image quality assessment method using PSNR and SSIM as two image quality indexes. Tests show that the method more accurately reflects image quality and reduces the number of isolated points in the performance curve. To get more accurate assessments, more HVS characteristics should be analyzed to develop high-performance quality assessment methods [11,12].

References

[1] Wang Z, Bovik A C. A universal image quality index. IEEE Signal Processing Letters, 2002, 9(3): 81-84.
[2] Tong Yubing, Yang Dongkai, Zhang Qishan. Video quality assessment methods overview. Journal of CAD & Graphics, 2006, 18(5): 735-741. (in Chinese)
[3] ITU-R Recommendation BT.500-10. Methodology for the subjective assessment of the quality of television pictures. 2000.
[4] UTexas image database and common video sequences. http://live.ece.utexas.edu/index.htm. University of Texas, USA, 2005.
[5] Final report from the Video Quality Experts Group on the validation of objective models of video quality assessment. http://www.vqeg.org. 2004.
[6] Zhou Wang, Bovik A C. Image quality assessment: From error visibility to structural similarity. IEEE Transactions on Image Processing, 2004, 13(4): 600-612.
[7] Zhou Wang, Bovik A C. Structural approaches to image quality assessment. In: Handbook of Image and Video Processing, 2nd Ed. New York: Academic Press, 2005.
[8] Zhang Li, Zhou Weida, Jiao Licheng. Wavelet support vector machine. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 2004, 34(1): 34-39.
[9] Vapnik V N. An overview of statistical learning theory. IEEE Transactions on Neural Networks, 1999, 10(5): 988-999.
[10] Li Guiling, Wang Nannan, Zhang Qiang. Study of moving image quality evaluation based on MPEG-2 system. Journal of Tianjin University, 2001, 34(5): 573-576. (in Chinese)
[11] Lee S, Pattichis M S, Bovik A C. Foveated video quality assessment. IEEE Transactions on Multimedia, 2002, 4(1): 129-132.
[12] Wang Nannan. Objective quality evaluation of digital video. In: The 2000 IEEE APCCAS. Tianjin, China, 2000: 791-794.