Compression; Error detection & correction

Similar documents
Compression; Error detection & correction

15 Data Compression 2014/9/21. Objectives After studying this chapter, the student should be able to: 15-1 LOSSLESS COMPRESSION

DigiPoints Volume 1. Student Workbook. Module 8 Digital Compression

IMAGE COMPRESSION. Image Compression. Why? Reducing transportation times Reducing file size. A two way event - compression and decompression

Audio Compression. Audio Compression. Absolute Threshold. CD quality audio:

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved.

Fundamentals of Video Compression. Video Compression

Data Storage. Slides derived from those available on the web site of the book: Computer Science: An Overview, 11 th Edition, by J.

Digital Communication Prof. Bikash Kumar Dey Department of Electrical Engineering Indian Institute of Technology, Bombay

5: Music Compression. Music Coding. Mark Handley

UNDERSTANDING MUSIC & VIDEO FORMATS

The Gullibility of Human Senses

Lossy compression CSCI 470: Web Science Keith Vertanen Copyright 2013

CSCD 443/533 Advanced Networks Fall 2017

So, what is data compression, and why do we need it?

Lossy compression. CSCI 470: Web Science Keith Vertanen

Image, video and audio coding concepts. Roadmap. Rationale. Stefan Alfredsson. (based on material by Johan Garcia)

Lecture Information Multimedia Video Coding & Architectures

Introduction to LAN/WAN. Application Layer 4

Compression Part 2 Lossy Image Compression (JPEG) Norm Zeck

Data Representation. Reminders. Sound What is sound? Interpreting bits to give them meaning. Part 4: Media - Sound, Video, Compression

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.

Lecture Information. Mod 01 Part 1: The Need for Compression. Why Digital Signal Coding? (1)

Lecture 16 Perceptual Audio Coding

Elementary Computing CSC 100. M. Cheng, Computer Science

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding

Data Compression. Media Signal Processing, Presentation 2. Presented By: Jahanzeb Farooq Michael Osadebey

Networking Applications

ELL 788 Computational Perception & Cognition July November 2015

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.

Mpeg 1 layer 3 (mp3) general overview

Lecture 12: Compression

Introduction to Computer Science (I1100) Data Storage

CS106B Handout 34 Autumn 2012 November 12 th, 2012 Data Compression and Huffman Encoding

Bits and Bit Patterns

CS 335 Graphics and Multimedia. Image Compression

Figure 1. Generic Encoder. Window. Spectral Analysis. Psychoacoustic Model. Quantize. Pack Data into Frames. Additional Coding.

Lecture #3: Digital Music and Sound

Principles of Audio Coding

What is multimedia? Multimedia. Continuous media. Most common media types. Continuous media processing. Interactivity. What is multimedia?

Audio Coding and MP3

CS 074 The Digital World. Digital Audio

Professor Laurence S. Dooley. School of Computing and Communications Milton Keynes, UK

End-to-End Data. Presentation Formatting. Difficulties. Outline Formatting Compression

JPEG. Table of Contents. Page 1 of 4

NUMBERS AND DATA REPRESENTATION. Introduction to Computer Engineering 2015 Spring by Euiseong Seo

Multimedia. What is multimedia? Media types. Interchange formats. + Text +Graphics +Audio +Image +Video. Petri Vuorimaa 1

Audio-coding standards

Fundamentals of Perceptual Audio Encoding. Craig Lewiston HST.723 Lab II 3/23/06

计算原理导论. Introduction to Computing Principles 智能与计算学部刘志磊

CISC 7610 Lecture 3 Multimedia data and data formats

CHAPTER 6 Audio compression in practice

Audio-coding standards

ITNP80: Multimedia! Sound-II!

Lecture 6: Compression II. This Week s Schedule

Index. 1. Motivation 2. Background 3. JPEG Compression The Discrete Cosine Transformation Quantization Coding 4. MPEG 5.

Audio and video compression

Bluray (

Digital Audio Basics

Lecture 5: Compression I. This Week s Schedule

3 Data Storage 3.1. Foundations of Computer Science Cengage Learning

Multimedia Technology

ITEC310 Computer Networks II

Video Compression MPEG-4. Market s requirements for Video compression standard

Introduction to Data Compression

Chapter 1. Digital Data Representation and Communication. Part 2

Image coding and compression

Lecture 19 Media Formats

Tech Note - 05 Surveillance Systems that Work! Calculating Recorded Volume Disk Space

Part 1 of 4. MARCH

Chapter 1. Data Storage Pearson Addison-Wesley. All rights reserved

Multimedia Communications. Audio coding

Bits, bytes, binary numbers, and the representation of information

Image and video processing

Topic 5 Image Compression

Lossy Coding 2 JPEG. Perceptual Image Coding. Discrete Cosine Transform JPEG. CS559 Lecture 9 JPEG, Raster Algorithms

General Computing Concepts. Coding and Representation. General Computing Concepts. Computing Concepts: Review

Megapixel Networking 101. Why Megapixel?

Multimedia Networking ECE 599

Compression. storage medium/ communications network. For the purpose of this lecture, we observe the following constraints:

ECE 417 Guest Lecture Video Compression in MPEG-1/2/4. Min-Hsuan Tsai Apr 02, 2013

Summary of Digital Information (so far):

Linked Structures Songs, Games, Movies Part IV. Fall 2013 Carola Wenk

Fundamentals of Multimedia. Lecture 5 Lossless Data Compression Variable Length Coding

Squeeze Play: The State of Ady0 Cmprshn. Scott Selfon Senior Development Lead Xbox Advanced Technology Group Microsoft

Unicode. Standard Alphanumeric Formats. Unicode Version 2.1 BCD ASCII EBCDIC

Chapter 14 MPEG Audio Compression

Multimedia Communications ECE 728 (Data Compression)

MPEG-1. Overview of MPEG-1 1 Standard. Introduction to perceptual and entropy codings

KINGS COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY ACADEMIC YEAR / ODD SEMESTER QUESTION BANK

Lossless Compression Algorithms

Digital video coding systems MPEG-1/2 Video

COMPRESSION TECHNIQUES

CS101 Lecture 12: Image Compression. What You ll Learn Today

VALLIAMMAI ENGINEERING COLLEGE

Ch. 2: Compression Basics Multimedia Systems

VIDEO SIGNALS. Lossless coding

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd

2.1 Transcoding audio files

Compression and File Formats

Transcription:

Compression; Error detection & correction compression: squeeze out redundancy to use less memory or use less network bandwidth encode the same information in fewer bits some bits carry no information some bits can be computed or inferred from others some bits don't matter to the recipient and can be dropped entirely error detection & correction: add redundancy to detect and fix up loss or damage add carefully defined, systematic redundancy with enough of the right redundancy, can detect damaged bits can correct errors Compressing English text letters do not occur equally often encode frequent letters with fewer bits, less frequent things with more bits (trades complexity against space) e.g., Morse code, Huffman code,... run-length encoding encode runs of identical things with a count e.g., World Wide Web Consortium => WWWC => W3C words do not occur equally often encode whole words, not just letters e.g., abbreviations for frequent words

Lempel-Ziv coding: adaptive compression algorithm build a dictionary of recently occurring data replace subsequent occurrences by (shorter) reference to the dictionary entry dictionary adapts as more input is seen compression adapts to properties of particular input algorithm is independent of nature of input dictionary is included in the compressed data Lempel-Ziv is the basis of PKZip, Winzip compresses Bible from 4.1 MB to 0.9 MB (typical for text) Lempel-Ziv is a lossless compression scheme compression followed by decompression reproduces the input exactly lossy compression: may do better if can discard some information commonly used for pictures, sounds, movies JPEG (Joint Photographic Experts Group) picture compression a lossy compression scheme, based on how our eyes work digitize picture into pixels discard some color information (use fewer distinct colors) eye is less sensitive to color variation than brightness discard some fine detail decompressed image is not quite as sharp as original discard some fine gradations of color and brightness use Huffman code, run-length encoding, etc., to compress resulting stream of numeric values compression is usually 10:1 to 20:1 for pictures used in web pages, digital cameras,...

MPEG (Moving Picture Experts Group) movie compression MPEG-2: lossy compression scheme, based on human perceptions uses JPEG for individual frames (spatial redundancy) adds compression of temporal redundancy look at image in blocks if a block hasn't changed, just transmit that fact, not the content if a block has moved, transmit amount of motion motion prediction (encode expected differences plus correction) separate moving parts from static background... used in DVD, high-definition TV, digital camcorders, video games rate is 3-15 Mbps depending on size, frame rate 15 Mbps ~ 2 MB/sec or 120 MB/min ~ 100x worse than MP3 3 Mbps ~ 25 MB/min; cf DVD 25 MB/min ~ 3000 MB for 2 hours regular TV is ~ 15 Mbps, HDTV ~ 60-80 Mbps see www.bbc.co.ul/rd/pubs/papers/paper_14/paper_14.shtml MP3 (MPEG Audio Layer-3) sound compression movies have sound as well as motion; this is the audio part 3 levels, with increasing compression, increasing complexity based on "perceptual noise shaping": use characteristics of the human ear to compress better: human ear can't hear some sounds (e.g., very high frequencies) human ear hears some sounds better than others louder sounds mask softer sounds break sound into different frequency bands encode each band separately encode 2 stereo channels as 1 plus difference gives about 10:1 compression over CD-quality audio 1 MB/minute instead of 10 MB/minute can trade quality against compression see http://www.oreilly.com/catalog/mp3/chapter/ch02.html

Other audio compression algorithms AAC (Advanced Audio Coding) default for iphone, ipod, itunes, etc. WMA (Windows Media Audio) Ogg Vorbis (open source)... maybe 20:1 over WAV format at high enough bit rate (e.g., 128 Kbps), all seem equally good speech coding for cell phones, Internet telephony, etc. narrower frequency range (100 Hz - 4 KHz) requires low delay uses a model of human vocal tract much higher compression than for general audio, e.g., 8 Kbps Summary of compression eliminate / reduce redundancy more frequent things encoded with fewer bits use a dictionary of encoded things, and refer to it (LZ) encode repetitions with a count not everything can be compressed something will be bigger lossless vs lossy compression lossy discards something that is not needed by recipient tradeoffs encoding time and complexity vs decoding time and complexity encoding is usually slower and more complicated (done once) parameters in lossy compressions size, speed, quality

Error detection and correction systematic use of redundancy to defend against errors some common numbers have no redundancy and thus can't detect when an error might have occurred e.g., SSN -- any 9-digit number is potentially valid if some extra data is added or if some possible values are excluded, this can be used to detect and even correct errors common examples include ATM & credit card numbers ISBN for books bar codes for products ATM card checksum invented by Peter Luhn, IBM, 1954 (patented 1960) credit card / ATM card checksum: starting at rightmost digit: multiply digit alternately by 1 or 2 if result is > 9 subtract 9 add the resulting digits sum should be divisible by 10 e.g., 12345678 is invalid 8 + (14-9) + 6 + (10-9) + 4 + 6 + 2 + 2 = 34 but 42345678 is valid 8 + (14-9) + 6 + (10-9) + 4 + 6 + 2 + 8 = 40 defends against transpositions and many single digit errors these are the most common errors

ISBN checksum checksum for 10-digit ISBN: starting at leftmost digit: multiply corresponding digit by 10, 9, 8,... down to 1 inclusive (a final X has value 10) add the resulting numbers result must be divisible by 11 e.g., 0-201-61586-X is valid 10*0 + 9*2 + 8*0 + 7*1 + 6*6 + 5*1 + 4*5 + 3*8 + 6*2 + 1*10 = 132 = 12*11 defends against transpositions and single digit errors and catches 90% of others Parity & other binary codes parity bit: use one extra bit so total number of 1-bits is even 0110100 => 01101001 0110101 => 01101010 detects any single-bit error more elaborate codes can detect and even correct errors basic idea is to add extra bits systematically so that legal values are uniformly spread out, so any small error converts a legal value into an illegal one some schemes correct random isolated errors some schemes correct bursts of errors (used in CD-ROMs) no error correcting code can detect/correct all errors a big enough error can convert one legal pattern into another one