So, what is data compression, and why do we need it?

Similar documents
ECE 499/599 Data Compression & Information Theory. Thinh Nguyen Oregon State University

15 Data Compression 2014/9/21. Objectives After studying this chapter, the student should be able to: 15-1 LOSSLESS COMPRESSION

IMAGE COMPRESSION. Image Compression. Why? Reducing transportation times Reducing file size. A two way event - compression and decompression

Topic 5 Image Compression

Image compression. Stefano Ferrari. Università degli Studi di Milano Methods for Image Processing. academic year

Multimedia Communications ECE 728 (Data Compression)

A COMPRESSION TECHNIQUES IN DIGITAL IMAGE PROCESSING - REVIEW

Compression; Error detection & correction

REVIEW ON IMAGE COMPRESSION TECHNIQUES AND ADVANTAGES OF IMAGE COMPRESSION

Medical Image Compression using DCT and DWT Techniques

Data Compression. Media Signal Processing, Presentation 2. Presented By: Jahanzeb Farooq Michael Osadebey

IMAGE PROCESSING (RRY025) LECTURE 13 IMAGE COMPRESSION - I

IMAGE COMPRESSION TECHNIQUES

Compression Part 2 Lossy Image Compression (JPEG) Norm Zeck

DigiPoints Volume 1. Student Workbook. Module 8 Digital Compression

CHAPTER 6. 6 Huffman Coding Based Image Compression Using Complex Wavelet Transform. 6.3 Wavelet Transform based compression technique 106

Compression; Error detection & correction

Lossless Compression Algorithms

Multimedia Networking ECE 599

AUDIO COMPRESSION USING WAVELET TRANSFORM

CHAPTER 4 REVERSIBLE IMAGE WATERMARKING USING BIT PLANE CODING AND LIFTING WAVELET TRANSFORM

IMAGE COMPRESSION. Chapter - 5 : (Basic)

Statistical Image Compression using Fast Fourier Coefficients

1.Define image compression. Explain about the redundancies in a digital image.

Keywords Data compression, Lossless data compression technique, Huffman Coding, Arithmetic coding etc.

ISSN (ONLINE): , VOLUME-3, ISSUE-1,

Lecture 5: Compression I. This Week s Schedule

Digital Image Processing

Department of electronics and telecommunication, J.D.I.E.T.Yavatmal, India 2

7.5 Dictionary-based Coding

Chapter 1. Digital Data Representation and Communication. Part 2

Fundamentals of Video Compression. Video Compression

Image coding and compression

CS 335 Graphics and Multimedia. Image Compression

VC 12/13 T16 Video Compression

DCT Based, Lossy Still Image Compression

GUJARAT TECHNOLOGICAL UNIVERSITY

A Research Paper on Lossless Data Compression Techniques

Image Coding and Compression

NOVEL TECHNIQUE FOR IMPROVING THE METRICS OF JPEG COMPRESSION SYSTEM

Data and information. Image Codning and Compression. Image compression and decompression. Definitions. Images can contain three types of redundancy

IMAGE COMPRESSION- I. Week VIII Feb /25/2003 Image Compression-I 1

Data Compression Techniques for Big Data

Image Compression Algorithm and JPEG Standard

FRACTAL IMAGE COMPRESSION OF GRAYSCALE AND RGB IMAGES USING DCT WITH QUADTREE DECOMPOSITION AND HUFFMAN CODING. Moheb R. Girgis and Mohammed M.

University of Mustansiriyah, Baghdad, Iraq

David Rappaport School of Computing Queen s University CANADA. Copyright, 1996 Dale Carnegie & Associates, Inc.

AN OPTIMIZED LOSSLESS IMAGE COMPRESSION TECHNIQUE IN IMAGE PROCESSING

CS/COE 1501

VIDEO SIGNALS. Lossless coding

Robert Matthew Buckley. Nova Southeastern University. Dr. Laszlo. MCIS625 On Line. Module 2 Graphics File Format Essay

Lecture Information Multimedia Video Coding & Architectures

CoE4TN4 Image Processing. Chapter 8 Image Compression

A Novel Image Compression Technique using Simple Arithmetic Addition

2-D SIGNAL PROCESSING FOR IMAGE COMPRESSION S. Venkatesan, Vibhuti Narain Rai

Data Compression. An overview of Compression. Multimedia Systems and Applications. Binary Image Compression. Binary Image Compression

IMAGE COMPRESSION USING HYBRID QUANTIZATION METHOD IN JPEG

Fingerprint Image Compression

Optimized Compression and Decompression Software

A Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm

Image Compression Using K-Space Transformation Technique

Hybrid Image Compression Using DWT, DCT and Huffman Coding. Techniques

2. Lossless compression is called compression a) Temporal b) Reversible c) Permanent d) Irreversible

ISSN Vol.06,Issue.10, November-2014, Pages:

Introduction to Compression. Norm Zeck

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding

Optimization of Bit Rate in Medical Image Compression

Image Compression for Mobile Devices using Prediction and Direct Coding Approach

CS/COE 1501

EE67I Multimedia Communication Systems Lecture 4

Multimedia Communications. Transform Coding

Lecture 8 JPEG Compression (Part 3)

Compression. storage medium/ communications network. For the purpose of this lecture, we observe the following constraints:

WIRE/WIRELESS SENSOR NETWORKS USING K-RLE ALGORITHM FOR A LOW POWER DATA COMPRESSION

Lecture Information. Mod 01 Part 1: The Need for Compression. Why Digital Signal Coding? (1)

Novel Lossy Compression Algorithms with Stacked Autoencoders

G64PMM - Lecture 3.2. Analogue vs Digital. Analogue Media. Graphics & Still Image Representation

Welcome Back to Fundamentals of Multimedia (MR412) Fall, 2012 Lecture 10 (Chapter 7) ZHU Yongxin, Winson

Digital Image Processing

Affable Compression through Lossless Column-Oriented Huffman Coding Technique

A QUAD-TREE DECOMPOSITION APPROACH TO CARTOON IMAGE COMPRESSION. Yi-Chen Tsai, Ming-Sui Lee, Meiyin Shen and C.-C. Jay Kuo

Redundant Data Elimination for Image Compression and Internet Transmission using MATLAB

AN ANALYTICAL STUDY OF LOSSY COMPRESSION TECHINIQUES ON CONTINUOUS TONE GRAPHICAL IMAGES

Introduction to Computer Science (I1100) Data Storage

Noise Reduction in Data Communication Using Compression Technique

Lossless Image Compression having Compression Ratio Higher than JPEG

Dictionary Based Compression for Images

Lecture 12: Compression

Lecture 6 Review of Lossless Coding (II)

Compression of 3-Dimensional Medical Image Data Using Part 2 of JPEG 2000

CSEP 521 Applied Algorithms Spring Lossy Image Compression

Elementary Computing CSC 100. M. Cheng, Computer Science

Compression of Stereo Images using a Huffman-Zip Scheme

Digital Image Processing (EI424)

Proposed Lossy Compression Method Using RLE and Segmentation

SSIM based image quality assessment for vector quantization based lossy image compression using LZW coding

Image and Video Coding I: Fundamentals

Information Technology Department, PCCOE-Pimpri Chinchwad, College of Engineering, Pune, Maharashtra, India 2

Data Compression Fundamentals

An Advanced Text Encryption & Compression System Based on ASCII Values & Arithmetic Encoding to Improve Data Security

Transcription:

In the last decade we have been witnessing a revolution in the way we communicate 2 The major contributors in this revolution are: Internet; The explosive development of mobile communications; and The ever-increasing importance of video communication. Data compression is one of the enabling technologies for each of these aspects of the multimedia revolution. So, what is data compression, and why do we need it?

3 Data Compression Data Compression is the process of converting an input data stream (the source stream or the original raw data) into another data stream (the output, the bit stream, or the compressed stream) that has a smaller size. Data Compression is the art or science of representing information in a compact form. Compression reduces the size of a file: To save space when storing it. To save time when transmitting it. Most files have lots of redundancy.

4 Data Compression Information is something that adds to people's knowledge; Information is not visible without some medium being a carrier. Data is the logical media often carried by some physical media such as a CD or a communication channel. Hence data can be viewed as a basic form of some factual information. The data before any compression process are called the source data, or the source for short. Examples of factual information may be classified broadly as text, audio, image and video.

5 Source Data File Text Image Graphic Sound.Txt,.Tex,.Doc,.Bmp,.Xbm,.Jpg,.Gif,.Png,.Fig,.Wav, Source Binary File Compression (Encoding) Compressed Binary File

6 Encoding and Decoding There are two components in compression process: Encoding algorithm: Generate a "compressed" representation C(M). Decoding algorithm: Reconstruct original message or some approximation M'. Message: Binary data M we want to compress.

7 Data compression methods Data compression is about storing and sending a smaller number of bits. There re two major categories for methods to compress data: lossless methods; and lossy methods

8 Lossless Compression Methods In lossless methods, original data and the data after compression and decompression are exactly the same. Because, in these methods, the compression and decompression algorithms are exact inverses of each other. Redundant data is removed in compression and added during decompression. Lossless methods are used when we can t afford to lose any data: legal and medical documents, computer programs. Consider the sentences Do not send money and Do now send money. Compression ratio typically no better than 4:1 for lossless compression on many kinds of files. Lossless: M=M

9 Lossless Compression Methods Statistical Techniques Huffman coding Arithmetic coding Golomb coding Dictionary techniques LZW, LZ77 Sequitur Burrows-Wheeler Method Standards - Morse code, Braille, Unix compress, gzip, zip, bzip, GIF, JBIG, Lossless JPEG

10 Lossy Compression Methods Used for compressing Audio Video Still images, medical images, photographs Loss of information is acceptable because our eyes and ears cannot distinguish subtle changes, so lossy data is acceptable. These methods are cheaper, less time and space. Compression ratios of 10:1 often yield quite high fidelity results.

11 Lossy Compression Methods Several methods: JPEG: compress pictures and graphics MPEG: compress video MP3: compress audio Vector Quantization Wavelets Block transforms

12 Measures of Performance A compression algorithm can be evaluated in a number of different ways: Measure the relative complexity of the algorithm, The memory required to implement the algorithm, How fast the algorithm performs on a given machine, The amount of compression, and How closely the reconstruction resembles the original.

13 Measures of Performance Compression Ratio: the ratio of the number of bits required to represent the data before compression to the number of bits required to represent the data after compression. Compression Ratio = size of the output stream size of the input stream o Note Values greater than 1 imply an output stream bigger than the input stream (negative compression). The compression ratio can also be called bpb (bit per bit).

14 Measures of Performance Saving percentage: We can also represent the compression ratio as a percentage of the size of the original data. Saving Percentage = 1 compression ratio 100. Compression factor: is the inverse of the compression ratio. Compression Factor = size of the input stream size of the output stream Note In this case, values greater than 1 indicate compression and values less than 1 imply expansion.

15 Measures of Performance Example: Suppose storing an image made up of a square array of 256 256 pixels requires 65536 bytes. the image is compressed into 16384 byte. Compression Ratio = 16384 65536 = 1 4 Saving Percentage = 1 1 4 100 = 75% Compression Factor = 65536 16384 = 4

16 Compression System Model The Compression System Model consists of two parts: The Compressor (Encoding), and The Decompressor (Decoding). The Compressor consists of: Preprocessing stage, and Encoding stage. The decompressor consists of a Decoding stage, followed by Post processing stage.

17 Compression System Model Input file Preprocessing Encoding Compressed file Compression Compressed file Decoding postprocessing Decoded image Decompression Compression System Model

18 The Compressor The compressor can be further broken down into stages: Preprocessing:- o Data reduction. o Mapping process. Encoding process:- Quantization stage Coding Input Image Data Reduction Mapping Quantization Coding Compr essed file Preprocessing Encoding

19 The Decompressor The decompressor can be further broken down into stages: Decoding process:- o Decoding stage. o Inverse mapping. Postprocessing Compr essed file Decoding Inverse mapping Postprocessing Decod ed image

20 fundamentals The term data compression refers to the process of reducing the amount of data required to represent a given quantity of information Data Information Various amount of data can be used to represent the same information Data might contain elements that provide no relevant information : data redundancy Data redundancy is a central issue in image compression. It is not an abstract concept but mathematically quantifiable entity

21 Data redundancy Let n1 and n2 denote the number of information carrying units in two data sets that represent the same information The relative redundancy RD is define as : R D = 1 1 C R where C R, commonly called the compression ratio, is C R = n 1 n 2

22 Data redundancy If n1 = n2, CR=1 and RD=0 If n1 >> n2, CR and RD=1 If n1 << n2, CR 0 and RD no redundancy high redundancy undesirable A compression ration of 10 (10:1) means that the first data set has 10 information carrying units (say, bits) for every 1 unit in the second (compressed) data set. So, RD=0.9, indicating that 90% of its data redundant. In Image compression, 3 basic redundancy can be identified Coding Redundancy Interpixel Redundancy Psychovisual Redundancy

23 Fidelity Criteria The key in image compression algorithm is to determine the minimal data required retaining the necessary information. This is achieved by tacking advantage of the redundancy that exists in images. To determine exactly what information is important and to be able to measure image fidelity, we need to define an image fidelity criterion. Note The information required is application specific, With lossless schemes, there is no need for a fidelity criterion.

24 Fidelity Criteria Divided into two classes: 1. Objective fidelity criteria: This fidelity provides us with equations that can be used to measure the amount of error in the reconstructed (decompressed) image. Commonly used objective measures are: The root-mean-square error (RMSE), The root-mean-square signal-to-noise ratio (SNRRMS), and The peak signal-to-noise ratio (SNRPEAK). The error between an original, uncompressed pixel value and the reconstructed (decompressed) pixel value is: Error r, c = g r, c I r, c I r, c : the original image. g r, c : the decompressed image. r, c: row & column

25 Fidelity Criteria The total error in an (N * N) decompressed image is: N 1 N 1 Total Error = [g r, c I r, c ] r=0 c=0 The root-mean-square error (RMSE), RMSE = N 1 N 1 1 N 2 r=0 c=0 [g r, c I r, c ] Notes The smaller the value of the error metrics, the better the compressed image represents the original images. Alternately, with the signal-to-noise (SNR) metrics, a larger number implies a better image. The SNR metrics consider the decompressed image g(r,c) to be signal and the error to be noise.

26 Fidelity Criteria The root-mean-square signal-to-noise ratio (SNRRMS) SNR RMS = N 1 r=0 N 1 c=0 [g r, c ] 2 N 1 [g r, c I r, c ] 2 N 1 r=0 c=0 The peak signal-to-noise ratio (SNRPEAK). L 1 2 SNR PEAK = 10 log 10 1 N 2 N 1 r=0 N 1 c=0 [g r, c I r, c ] 2 Where L : the number of gray levels (e.g., for 8 bits L =256).

27 Fidelity Criteria 2. Subjective fidelity criteria (Viewed by Human): these criteria require the definition of a qualitative scale to assess image quality. This scale can then be used by human test subjects to determine image fidelity. In order to provide unbiased results, evaluation with subjective measures requires careful selection of the test subjects and carefully designed evaluation experiments. The subjective measures are better method for comparison of compression algorithms, if the goal is to achieve high-quality images as defined by visual perception.

28

29

30

31 Introduction to DATA COMPRESSION (4 th Edition) Khalid Sayood Morgan Kaufmann, 2012 (ISBN 978-0-12-415796-5) Excellent coverage of all compression algorithms and formats