An Efficient Saliency Based Lossless Video Compression Based On Block-By-Block Basis Method

Similar documents
Video Compression Using Block By Block Basis Salience Detection

Video Compression System for Online Usage Using DCT 1 S.B. Midhun Kumar, 2 Mr.A.Jayakumar M.E 1 UG Student, 2 Associate Professor

Quality versus Intelligibility: Evaluating the Coding Trade-offs for American Sign Language Video

Fast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE Gaurav Hansda

SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC

Optimizing the Deblocking Algorithm for. H.264 Decoder Implementation

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding.

New Approach of Estimating PSNR-B For Deblocked

2014 Summer School on MPEG/VCEG Video. Video Coding Concept

Express Letters. A Simple and Efficient Search Algorithm for Block-Matching Motion Estimation. Jianhua Lu and Ming L. Liou

International Journal of Emerging Technology and Advanced Engineering Website: (ISSN , Volume 2, Issue 4, April 2012)

A COMPRESSION TECHNIQUES IN DIGITAL IMAGE PROCESSING - REVIEW

Implementation and analysis of Directional DCT in H.264

Comparative and performance analysis of HEVC and H.264 Intra frame coding and JPEG2000

EE 5359 MULTIMEDIA PROCESSING SPRING Final Report IMPLEMENTATION AND ANALYSIS OF DIRECTIONAL DISCRETE COSINE TRANSFORM IN H.

Complexity Reduced Mode Selection of H.264/AVC Intra Coding

Rate Distortion Optimization in Video Compression

Image Quality Assessment Techniques: An Overview

Reduced Frame Quantization in Video Coding

Homogeneous Transcoding of HEVC for bit rate reduction

Design & Implementation of Saliency Detection Model in H.264 Standard

Video Compression An Introduction

FOR compressed video, due to motion prediction and

CONTENT BASED IMAGE COMPRESSION TECHNIQUES: A SURVEY

MULTI-SCALE STRUCTURAL SIMILARITY FOR IMAGE QUALITY ASSESSMENT. (Invited Paper)

Image Segmentation Techniques for Object-Based Coding

Image Error Concealment Based on Watermarking

Efficient Image Compression of Medical Images Using the Wavelet Transform and Fuzzy c-means Clustering on Regions of Interest.

Rate-distortion Optimized Streaming of Compressed Light Fields with Multiple Representations

Reducing/eliminating visual artifacts in HEVC by the deblocking filter.

Video Compression Method for On-Board Systems of Construction Robots

International Journal of Advanced Research in Computer Science and Software Engineering

Video Quality Analysis for H.264 Based on Human Visual System

A Novel Approach to Saliency Detection Model and Its Applications in Image Compression

Intra-Mode Indexed Nonuniform Quantization Parameter Matrices in AVC/H.264

Content Based Image Retrieval Using Color Quantizes, EDBTC and LBP Features

Enhanced Hexagon with Early Termination Algorithm for Motion estimation

Frequency Band Coding Mode Selection for Key Frames of Wyner-Ziv Video Coding

QUANTIZER DESIGN FOR EXPLOITING COMMON INFORMATION IN LAYERED CODING. Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose

Rate-distortion Optimized Streaming of Compressed Light Fields with Multiple Representations

STUDY AND IMPLEMENTATION OF VIDEO COMPRESSION STANDARDS (H.264/AVC, DIRAC)

Quality Estimation of Video Transmitted over an Additive WGN Channel based on Digital Watermarking and Wavelet Transform

Adaptive Quantization for Video Compression in Frequency Domain

DCT Based, Lossy Still Image Compression

Digital Video Processing

PERCEPTUALLY-FRIENDLY RATE DISTORTION OPTIMIZATION IN HIGH EFFICIENCY VIDEO CODING. Sima Valizadeh, Panos Nasiopoulos and Rabab Ward

Coding of 3D Videos based on Visual Discomfort

Lecture 5: Compression I. This Week s Schedule

Image and Video Watermarking

No-reference perceptual quality metric for H.264/AVC encoded video. Maria Paula Queluz

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 19, NO. 9, SEPTEMBER

Module 7 VIDEO CODING AND MOTION ESTIMATION

A Comparison of Still-Image Compression Standards Using Different Image Quality Metrics and Proposed Methods for Improving Lossy Image Quality

Compression of 3-Dimensional Medical Image Data Using Part 2 of JPEG 2000

15 Data Compression 2014/9/21. Objectives After studying this chapter, the student should be able to: 15-1 LOSSLESS COMPRESSION

EXPLORING ON STEGANOGRAPHY FOR LOW BIT RATE WAVELET BASED CODER IN IMAGE RETRIEVAL SYSTEM

Context based optimal shape coding

A Low Bit-Rate Video Codec Based on Two-Dimensional Mesh Motion Compensation with Adaptive Interpolation

Mesh Based Interpolative Coding (MBIC)

Video Compression MPEG-4. Market s requirements for Video compression standard

CHAPTER 6. 6 Huffman Coding Based Image Compression Using Complex Wavelet Transform. 6.3 Wavelet Transform based compression technique 106

Multiframe Blocking-Artifact Reduction for Transform-Coded Video

Scalable Compression and Transmission of Large, Three- Dimensional Materials Microstructures

Professor Laurence S. Dooley. School of Computing and Communications Milton Keynes, UK

Compression of VQM Features for Low Bit-Rate Video Quality Monitoring

Multimedia Systems Video II (Video Coding) Mahdi Amiri April 2012 Sharif University of Technology

The Vehicle Logo Location System based on saliency model

Fast frame memory access method for H.264/AVC

An Efficient Mode Selection Algorithm for H.264

A High Quality/Low Computational Cost Technique for Block Matching Motion Estimation

A deblocking filter with two separate modes in block-based video coding

Performance Comparison between DWT-based and DCT-based Encoders

VIDEO COMPRESSION STANDARDS

FRAME-RATE UP-CONVERSION USING TRANSMITTED TRUE MOTION VECTORS

Using animation to motivate motion

ON QUALITY ASSESSMENT IN THE DELIVERY OF MOBILE VIDEO

IBM Research Report. Inter Mode Selection for H.264/AVC Using Time-Efficient Learning-Theoretic Algorithms

Advanced Video Coding: The new H.264 video compression standard

ERROR-ROBUST INTER/INTRA MACROBLOCK MODE SELECTION USING ISOLATED REGIONS

Comparison of EBCOT Technique Using HAAR Wavelet and Hadamard Transform

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding

A Novel Deblocking Filter Algorithm In H.264 for Real Time Implementation

Reconstruction PSNR [db]

Saliency-Based Fidelity Adaptation Preprocessing for Video Coding

Block-Matching based image compression

Keywords Macro block, Motion compensation, Threshold, Adaptive threshold, Psnr

A LOW-COMPLEXITY AND LOSSLESS REFERENCE FRAME ENCODER ALGORITHM FOR VIDEO CODING

A Novel Video Enhancement Based on Color Consistency and Piecewise Tone Mapping

VIDEO DENOISING BASED ON ADAPTIVE TEMPORAL AVERAGING

Professor, CSE Department, Nirma University, Ahmedabad, India

Reduction of Blocking artifacts in Compressed Medical Images

Part 1 of 4. MARCH

FPGA IMPLEMENTATION OF BIT PLANE ENTROPY ENCODER FOR 3 D DWT BASED VIDEO COMPRESSION

SIGNAL COMPRESSION. 9. Lossy image compression: SPIHT and S+P

Perceptual Video Quality Measurement Based on Generalized Priority Model

Hybrid Video Compression Using Selective Keyframe Identification and Patch-Based Super-Resolution

REDUCTION OF CODING ARTIFACTS IN LOW-BIT-RATE VIDEO CODING. Robert L. Stevenson. usually degrade edge information in the original image.

REVIEW ON IMAGE COMPRESSION TECHNIQUES AND ADVANTAGES OF IMAGE COMPRESSION

Human Visual System-based Unequal Error Protection for Robust Video Coding

Compression of RADARSAT Data with Block Adaptive Wavelets Abstract: 1. Introduction

Transcription:

An Efficient Saliency Based Lossless Video Compression Based On Block-By-Block Basis Method Ms. P.MUTHUSELVI, M.E(CSE), V.P.M.M Engineering College for Women, Krishnankoil, Virudhungar(dt),Tamil Nadu Sukirthanagarajan@gmail.com ABSTRACT This dissertation explores the concept of visual saliency a measure of propensity for drawing visual attention and presents a variety of novel methods for utilization of visual saliency in video compression and transmission. While the regionof-interest (ROI)-based video coding, ROI parts of the frame are encoded with higher quality than non-roi parts. It was reducing salient coding artifacts in non-roi parts of the frame in order to keep user s attention on ROI. Unfortunately, the salient coding of non-roi compression parts is uneasy to find the block-by-block basis. The proposed method aims at reducing salient coding I. INTRODUCTION Lossy image and video encoders are known to produce undesirable compression artifacts at low bit rates [1], [2]. Blocking artifacts are the most common form of compression artifacts in blockbased video compression. When coarse quantization is combined with motion-compensated prediction, blocking artifacts propagate into subsequent frames and accumulate, causing structured high-frequency noise or motion-compensated edge artifacts that may not be located at block boundaries, and so cannot be attenuated by de-blocking filters that mostly operate on block boundaries [2]. Such visual artifacts may become very severe and attention-grabbing (salient), especially in low-textured regions. artifacts in non-roi parts by optimizing the saliency-related Lagrange parameter, possibly on a block-by-block basis. Experimental results indicate that the proposed method is able to improve visual quality of encoded video relative to conventional rate distortion optimized block-byblock basis video coding, as well as two state-ofthe art perceptual video coding methods. Key words : ROI-based video coding, attentiongrabbing coding artifacts, non-roi parts, salient coding artifact Recently, region-of-interest (ROI) coding of video using computational models of visual attention [3] has been recognized as a promising approach to achieve high-performance video compression [4] [7]. The idea behind most of these methods is to encode an area around the predicted attention grabbing (salient) regions with higher quality compared to other less visually important regions. Such a spatial prioritization is supported by the fact that only a small region of 2 5 of visual angle around the center of gaze is perceived with high spatial resolution due to the highly non-uniform distribution of photoreceptors on the human retina [4]. ROI-based processing can also be employed in the context of video transmission to combat the effects of transmission channel errors. For instance, ROI parts of the frame can be protected heavily (e.g., by using 48

stronger channel codes) than non-roi parts of the frame, so that in the case of channel errors or losses, important parts of the frame can still be decoded correctly. In this case, also, ROI could be detected either based on direct eye-tracking measurement or based on visual saliency models. II. RELATED WORK A. The IKN Saliency Model Among the existing bottom-up computational models of visual attention, the Itti- Koch-Niebur (IKN) model [3] is one of the most well-known and widely used. In this model, the visual saliency is predicted by analyzing the input image through a number of pre-attentive independent feature channels, each locally sensitive to a specific low-level visual attribute, such as local opponent color contrast, intensity contrast, and orientation contrast. More specifically, nine spatial scales are created using dyadic Gaussian pyramids, which progressively low-pass filter and down-sample the input image, yielding an image-size-reduction factor ranging from 1:1 (scale zero) to 1:256 (scale eight) in eight octaves [3]. B. Rate-Distortion Optimization (RDO) The H.264/AVC video coding standard supports various block coding modes such as INTER16 16, INTER16 8, INTER8 16, INTER8 8, INTRA16 16, INTRA4 4, etc. [11]. The rate-distortion optimization (RDO) process proposed in H.264/AVC minimizes the following Lagrangian cost function for coding mode selection of each macro block (MB) [8]: J ψ Q, λ R = D MSE ψ Q + λ R R ψ Q (1) where Q is the quantization step size, D MSE (ψ Q) and R(ψ Q) are, respectively, the Mean Squared Error (MSE) and bit rate for coding the current MB in the coding mode ψ with quantization step size Q, and λ R is the Lagrange multiplier, which quantifies the tradeoff between the rate and distortion [13]. III. PROPOSED METHODOLOGY The Saliency-Aware Video Compression system is a new tool to give the solution for security of templates as well as authentication of the individual users. In this paper, video compression coding is acting as keys method attempts to reduce attention-grabbing coding artifacts and keep viewers attention in areas where the quality is highest. The method allows visual saliency to increase in high quality parts of the frame, and allows saliency to reduce in non-roi parts. The overview of the proposed work is shown in Fig.1. This work consists of two phases; the first one is a video feature extraction and second one is a saliency compression phase. Input Video Gradient Algorithm Lossless data compression algorithms Figure 1: Proposed Architecture Diagram A. Video Feature Extraction Feature Extraction ROI Selection Block-by-Block basis separation Block-by-Block basis separation Video summarization is a compact representation of a video sequence. It is useful for various video applications such as video browsing and retrieval systems. A video summarization can be a preview sequence which can be a collection of key frames which is a set of chosen frames of a video. Key-frame-based video summarization may lose the spatio-temporal properties and audio content in the original video sequence; it is the simplest and the most common method. When temporal order is maintained in selecting the key frames, users can locate specific video segments of interest by choosing a particular key frame using a browsing tool. Key 49

frames are also effective in representing visual content of a video sequence for retrieval purposes. Video indexes may be constructed based on visual features of key frames, and queries may be directed at key frames using image retrieval techniques. Video frames reduce the amount of data required in video indexing and provides framework for dealing with the video content. B. ROI-Based Video Coding After frame feature extraction conversational video, Region of Interest parts are detected using the direct frame difference and skintone information. After detecting ROI parts, several coding parameters including quantization parameter (QP), coding modes, the number of reference frames, the accuracy of motion vectors, and the search range for motion estimation, were adaptively adjusted at the macro block (MB) level according to the relative importance of each MB and a given target bit rate. In ROI-based video coding, more bits are typically assigned to ROI parts of the frame compared to non- ROI parts. There are two main aspects that differentiate existing ROI coding methods: (1) the way in which ROI is decided, and (2) the way in which bits are allocated to ROI vs. non-roi. The proposed method differs from the existing work in the area in both aspects. First, to detect ROI, a novel saliency estimation method is employed. It consists of a convex approximation to the spatial Itti-Koch- Niebur (IKN) saliency combined with a global motion compensated temporal saliency estimate. The new saliency estimate is computationally more efficient, yet competitive in terms of accuracy with the IKN model for video, which is known to be highly accurate in terms of gaze prediction on video. Second, the bit allocation procedure in the proposed method takes into account the way in which saliency is affected by compression. We propose a model for this process and employ it in the RDO procedure to control the way in which saliency is allowed to change after compression. C. Block-by-block Basis Separation using conjugate gradient algorithm The Block-by-block basis Conjugate Gradient (BCG) Method is an iterative method for finding the minimum of a quadratic cost function. The objective of the method here, is the minimization of a cost function. The BCG method originates from the so-called conjugate direction (CD) method. The main idea in the CD method is to obtain a set of linearly independent direction vectors which are conjugate with respect to R so that the vector that minimizes the objective function can be expressed as a linear combination of these vectors. The weights of the linear combination are calculated by imposing R- conjugation between the direction vectors. The set of the R-conjugate direction vectors are generated by a set of M linearly independent vectors. In the Conjugate Gradient (CG) method these M vectors are the successive gradients of the cost function, obtained as the method progresses. The main task is to derive the block-by-block updating the weight coefficient vector, based on the basic CG algorithm. Initially, it is essential to focus on obtaining the block update recursion of the autocorrelation and the cross correlation vectors by using the autocorrelation method of data windowing method. D. Video Compression using Lossless data compression algorithms The video compression process on the audiovisual content should produce a kind of lossless video compression that is imperceptible to the end user of the audiovisual content while simultaneously maintaining the benefits of keeping all of the information of the source and also the benefits of compression during the production process. Lossless compression offers the best of both worlds, with mathematically identical content to an uncompressed file while simultaneously enjoying the benefits of a smaller file size in increased file storage versus uncompressed files, and the benefits of faster transportation of the compressed file. Lossless means that the output from the de-compressor is bit-for-bit identical with the original input to the compressor. The decompressed video stream should be completely identical to original. In a non-video context, there exist a number of examples of lossless formats. 50

The importance of lossless compression has an incontrovertible value to the archival community. Wavelet compression is one of the most effective methods of image compression. The algorithm is based on multi-resolution analysis. Like the traditional JPEG compression, wavelet compression algorithm presents an image as sets of real coefficients. Most of the wavelet coefficients of a typical image are nearly zero, and the image thus is well-approximated with a small number of large wavelet coefficients. The advantage of wavelet compression is that, in contrast to JPEG, wavelet algorithm does not divide image into blocks, but analyze the whole image. The characteristic of wavelet compression allows the process to get best compression ratio, while maintaining the quality. The two options have been proposed: Moving and storing uncompressed files or streams which are absolutely identical to the original, but may require large amounts of bandwidth and storage space; Compressing moving image files or bitstreams in order to reduce storage requirements and transmission times V. EXPERIMENTAL RESULTS The experiment was run in a quiet room with 15 participants (14 male, 1 female, aged between 18 and 30). All participants had normal or corrected to normal vision. A 22-inch Dell monitor with brightness 300 cd/m 2 and resolution 1680 1050 pixels was used in our experiments. The brightness and contrast of the monitor were set to 75%. The actual height of the displayed videos on the screen was 185 millimeters. The results for the comparison are shown in Table I. In Table I we show the number of responses that showed preference for the foveated just noticeabledifference (FJND) model [9] method vs. the proposed method. Using the EWPSNR metric given by [10], we can now define an equivalent eye-tracking-weighted PSNR as follows EWPSNR = 10log 10 2552 EMMSE (2) Table 1: Comparing Various Eye-Tracking- Weighted Method EWPSNR PNSR SSIM FJND +0.40 0.15 0.003 Proposed method +1.55 0.21 0.002 2 1.5 1 0.5 0 Figure 2: Comparison Chart VI. CONCLUSION Error concealment in loss-corrupted streaming video is a challenging under-determined problem. The proposed saliency estimation method uses the spatial component from the convex approximation mentioned above, but improves temporal saliency estimation via global motion compensation. Overall, this method is not convex, but is more accurate than the IKN saliency model on certain sequences with camera motion. This method uses motion vectors to accomplish global motion compensation and subsequently temporal saliency estimation, so it can be very attractive for video compression applications. In our proposed saliency-aware video compression method, we combined a saliency distortion term with the conventional MSE distortion metric. As a possible future work, the conventional MSE distortion metric can be replaced by a more perceptually-relevant distortion metric such as SSIM to achieve even better result REFERENCES EWPSNR PNSR SSIM FJND [1] M. A. Robertson and R. L. Stevenson, DCT quantization noise in compressed images, IEEE 51

Trans. Circuits Syst. Video Technol., vol. 15, no. 1, pp. 27 38, Jan. 2005. [2] A. Leontaris, P. C. Cosman, and A. R. Reibman, Quality evaluation of motion-compensated edge artifacts in compressed video, IEEE Trans. Image Process., vol. 16, no. 11, pp. 943 956, Apr. 2007. [3] L. Itti, C. Koch, and E. Niebur, A model of saliency-based visual attention for rapid scene analysis, IEEE Trans. Pattern Anal. Mach. Intell., vol. 20, no. 11, pp. 1254 1259, Nov. 1998. [4] L. Itti, Automatic foveation for video compression using a neurobiological model of visual attention, IEEE Trans. Image Process., vol. 13, no. 10, pp. 1304 1318, Oct. 2004. [8] I. E. Richardson, The H.264 Advanced Video Compression Standard. New York, NY, USA: Wiley, 2010. [9] Z. Chen and C. Guillemot, Perceptually-friendly H.264/AVC video coding based on foveated justnoticeable-distortion model, IEEE Trans. Circuits Syst. Video Technol., vol. 20, no. 6, p. 806 819, Jun. 2010. [10] V. A. Mateescu, H. Hadizadeh, and I. V. Baji c, Evaluation of several visual saliency models in terms of gaze prediction accuracy on video, in Proc. IEEE Globecom Workshops, Dec. 2012, pp. 1304 1308. [5] Z. Chen, N. K. Ngan, and W. Lin, Perceptual video coding: Challenges and approaches, in Proc. IEEE ICME, Jul. 2010, pp. 784 789. [6] Z. Li, S. Qin, and L. Itti, Visual attention guided bit allocation in video compression, Image and Vision Computing, vol. 29, no. 1, pp. 1 14, 2011. [7] Y. Liu, Z. Li, and Y. C. Soh, Region-of-interest based resource allocation for conversational video communications of H.264/AVC, IEEE Trans. Circuits Syst. Video Technol., vol. 18, no. 1, pp. 134 139, Jan. 2008. 52