Audio Compression. Audio Compression. Absolute Threshold. CD quality audio:

Save this PDF as:
 WORD  PNG  TXT  JPG

Size: px
Start display at page:

Download "Audio Compression. Audio Compression. Absolute Threshold. CD quality audio:"

Transcription

1 Audio Compression Audio Compression CD quality audio: Sampling rate = 44 KHz, Quantization = 16 bits/sample Bit-rate = ~700 Kb/s (1.41 Mb/s if 2 channel stereo) Telephone-quality speech Sampling rate = 8KHz Quantization = 16 bits/sample Bit rate = 128 Kb/s Absolute Threshold A tone is audible only if its power is above the absolute threshold level 1

2 Masking effect If a tone of a certain frequency and amplitude is present, the audibility threshold curve is changed Other tones or noise of similar frequency, but of much lower amplitude, are not audible Example 1 Example Hz Hz Masking Effect (Single Masker) Band n-1 Band n Band n+1 Requires fewer bits Requires more bits Masking Effect (Multiple Maskers) 2

3 Temporal Masking A loud tone of finite duration will mask a softer tone that follows it (for around 30 ms) A similar effect is verified also when the the softer tone precedes the louder tone!!! Perceptual Coding Perceptual coding tries to minimize the perceptual distortion in a transform coding scheme Basic concept: allocate more bits (more quantization levels, less error) to those channels that are most audible, fewer bits (more error) to those channels that are the least audible Needs to continuously analyze the signal to determine the current audibility threshold curve using a perceptual model Audio Coding: Main Standards MPEG (Motion Picture Expert Group) family (note: the standard only specifies the decoder!) MPEG-1 Layer 1 Layer 2 Layer 3 (MP-3) MPEG-2 Back-compatible AAC (non-back-compatible) Dolby AC3 3

4 MPEG-1 Audio Coder Layer 1 Deemed transparent at 384 Kb/s per channel Subband coding with 32 channels Input divided into groups of 12 input samples Coefficient normalization (extracts Scale Factor) For each block, chooses among 15 quantizers for perceptual quantization No entropy coding after transform coding Decoder is much simpler than encoder Intensity stereo mode Stereo effect of middle and high frequencies depends not so much on the different channel content but on the different channel amplitude Middle and upper subbands of the left and right channel are added together, and only the resulting summed samples are quantized The scale factor is sent for both channel so that amplitudes can be controlled independently during playback MPEG-1 Audio Coder (cont d) Layer 2 Transparent at 256 Kb/s per channel Improved perceptual model (more computationally intensive) Finer resolution quantizers Layer 3 (MP-3) Transparent at 96 Kb/s per channel Applies a variable-size modified DCT on the samples of each subband channel Uses non-uniform quantizers Has entopy coder (Huffman) - requires buffering! Much mode complex than Layer 1 and 2 4

5 MPEG-1 Layers 1 and 2 Audio Encoder/Decoder (Single Channel) MPEG-1 Layers 3 (MP-3) Audio Encoder/Decoder (Single Channel) Example Original sound (mono, 44.1 KHz, 16 b/s Kb/s) Subsampled by 8 (without prefiltering) Subsampled by 8 (with prefiltering) Quantized to 2 b/s Coded using MP3 (64 Kb/s) Coded using MP3 (96 Kb/s) Coded using MP3 (32 Kb/s) 5

6 Middle-side Stereo Mode Frequency ranges that would normally be coded as left and right are instead coded as Middle (left+right) and Side (left-right) Side channel can be coded with fewer bits (because the two channels are highly correlated) MPEG-2 Audio Coder Backward compatible (i.e., MPEG-1 decoders can decode a portion of MPEG-2 bit-stream): Original goal: provide theater-style surroundsound capabilities Modes of operation: mono-aural stereo three channel (left, right and center) four channel (left, right, center and rear surround) five channel (four channel + center) Full five-channel surround stereo at 640 Kb/s MPEG-2 Audio Coder (Cont d) Non-backward compatible (AAC): At 320 Kb/s judged to be equivalent to MPEG-2 at 640 Kb/s for five-channels surround-sound Can operate with any number of channels (between 1 and 48) and output bit rate (from 8 Kb/s per channel to 182 Kb/s per channel) Sampling rate can be as low as 8Khz and as high as 96 KHz per channel 6

7 Dolby AC-3 Used in movie theaters as part of the Dolby digital film system Selected for the USA Digital TV (DTV) and DVD Bit-rate: 320 Kb/s for 5.1 stereo Uses 512-point Modified DCT (can be switched to 256-point) Floating-point conversion into exponent-mantissa pairs (mantissas quantized with variable number of bits) Does not transmit bit allocation but perceptual model parameters Dolby AC-3 Encoder PCM Samples Frequency Domain Transform Transform Coefficients Block Floating- Point Exponents Bit Allocation Bitstream Packing Encoded Audio Mantissas Mantissa Quantization Quantized Mantissas References B. Haskell, A. Puri, A. Netravali, Digital Video: An Introduction to MPEG-2, Chapman & Hall, 1997, pp

Mpeg 1 layer 3 (mp3) general overview

Mpeg 1 layer 3 (mp3) general overview Mpeg 1 layer 3 (mp3) general overview 1 Digital Audio! CD Audio:! 16 bit encoding! 2 Channels (Stereo)! 44.1 khz sampling rate 2 * 44.1 khz * 16 bits = 1.41 Mb/s + Overhead (synchronization, error correction,

More information

Chapter 14 MPEG Audio Compression

Chapter 14 MPEG Audio Compression Chapter 14 MPEG Audio Compression 14.1 Psychoacoustics 14.2 MPEG Audio 14.3 Other Commercial Audio Codecs 14.4 The Future: MPEG-7 and MPEG-21 14.5 Further Exploration 1 Li & Drew c Prentice Hall 2003 14.1

More information

Figure 1. Generic Encoder. Window. Spectral Analysis. Psychoacoustic Model. Quantize. Pack Data into Frames. Additional Coding.

Figure 1. Generic Encoder. Window. Spectral Analysis. Psychoacoustic Model. Quantize. Pack Data into Frames. Additional Coding. Introduction to Digital Audio Compression B. Cavagnolo and J. Bier Berkeley Design Technology, Inc. 2107 Dwight Way, Second Floor Berkeley, CA 94704 (510) 665-1600 info@bdti.com http://www.bdti.com INTRODUCTION

More information

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding Perceptual Coding Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding Part II wrap up 6.082 Fall 2006 Perceptual Coding, Slide 1 Lossless vs.

More information

Lecture 16 Perceptual Audio Coding

Lecture 16 Perceptual Audio Coding EECS 225D Audio Signal Processing in Humans and Machines Lecture 16 Perceptual Audio Coding 2012-3-14 Professor Nelson Morgan today s lecture by John Lazzaro www.icsi.berkeley.edu/eecs225d/spr12/ Hero

More information

Audio Coding and MP3

Audio Coding and MP3 Audio Coding and MP3 contributions by: Torbjørn Ekman What is Sound? Sound waves: 20Hz - 20kHz Speed: 331.3 m/s (air) Wavelength: 165 cm - 1.65 cm 1 Analogue audio frequencies: 20Hz - 20kHz mono: x(t)

More information

Audio and video compression

Audio and video compression Audio and video compression 4.1 introduction Unlike text and images, both audio and most video signals are continuously varying analog signals. Compression algorithms associated with digitized audio and

More information

MPEG-1. Overview of MPEG-1 1 Standard. Introduction to perceptual and entropy codings

MPEG-1. Overview of MPEG-1 1 Standard. Introduction to perceptual and entropy codings MPEG-1 Overview of MPEG-1 1 Standard Introduction to perceptual and entropy codings Contents History Psychoacoustics and perceptual coding Entropy coding MPEG-1 Layer I/II Layer III (MP3) Comparison and

More information

EE482: Digital Signal Processing Applications

EE482: Digital Signal Processing Applications Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 13 Audio Signal Processing 14/04/01 http://www.ee.unlv.edu/~b1morris/ee482/

More information

2.4 Audio Compression

2.4 Audio Compression 2.4 Audio Compression 2.4.1 Pulse Code Modulation Audio signals are analog waves. The acoustic perception is determined by the frequency (pitch) and the amplitude (loudness). For storage, processing and

More information

What is multimedia? Multimedia. Continuous media. Most common media types. Continuous media processing. Interactivity. What is multimedia?

What is multimedia? Multimedia. Continuous media. Most common media types. Continuous media processing. Interactivity. What is multimedia? Multimedia What is multimedia? Media types +Text + Graphics + Audio +Image +Video Interchange formats What is multimedia? Multimedia = many media User interaction = interactivity Script = time 1 2 Most

More information

MPEG-4 General Audio Coding

MPEG-4 General Audio Coding MPEG-4 General Audio Coding Jürgen Herre Fraunhofer Institute for Integrated Circuits (IIS) Dr. Jürgen Herre, hrr@iis.fhg.de 1 General Audio Coding Solid state players, Internet audio, terrestrial and

More information

Multimedia. What is multimedia? Media types. Interchange formats. + Text +Graphics +Audio +Image +Video. Petri Vuorimaa 1

Multimedia. What is multimedia? Media types. Interchange formats. + Text +Graphics +Audio +Image +Video. Petri Vuorimaa 1 Multimedia What is multimedia? Media types + Text +Graphics +Audio +Image +Video Interchange formats Petri Vuorimaa 1 What is multimedia? Multimedia = many media User interaction = interactivity Script

More information

Parametric Coding of High-Quality Audio

Parametric Coding of High-Quality Audio Parametric Coding of High-Quality Audio Prof. Dr. Gerald Schuller Fraunhofer IDMT & Ilmenau Technical University Ilmenau, Germany 1 Waveform vs Parametric Waveform Filter-bank approach Mainly exploits

More information

MPEG-l.MPEG-2, MPEG-4

MPEG-l.MPEG-2, MPEG-4 The MPEG Handbook MPEG-l.MPEG-2, MPEG-4 Second edition John Watkinson PT ^PVTPR AMSTERDAM BOSTON HEIDELBERG LONDON. NEW YORK OXFORD PARIS SAN DIEGO SAN FRANCISCO SINGAPORE SYDNEY TOKYO Focal Press is an

More information

DSP. Presented to the IEEE Central Texas Consultants Network by Sergio Liberman

DSP. Presented to the IEEE Central Texas Consultants Network by Sergio Liberman DSP The Technology Presented to the IEEE Central Texas Consultants Network by Sergio Liberman Abstract The multimedia products that we enjoy today share a common technology backbone: Digital Signal Processing

More information

Compression Part 2 Lossy Image Compression (JPEG) Norm Zeck

Compression Part 2 Lossy Image Compression (JPEG) Norm Zeck Compression Part 2 Lossy Image Compression (JPEG) General Compression Design Elements 2 Application Application Model Encoder Model Decoder Compression Decompression Models observe that the sensors (image

More information

Compression; Error detection & correction

Compression; Error detection & correction Compression; Error detection & correction compression: squeeze out redundancy to use less memory or use less network bandwidth encode the same information in fewer bits some bits carry no information some

More information

Chapter 4: Audio Coding

Chapter 4: Audio Coding Chapter 4: Audio Coding Lossy and lossless audio compression Traditional lossless data compression methods usually don't work well on audio signals if applied directly. Many audio coders are lossy coders,

More information

DVB Audio. Leon van de Kerkhof (Philips Consumer Electronics)

DVB Audio. Leon van de Kerkhof (Philips Consumer Electronics) eon van de Kerkhof Philips onsumer Electronics Email: eon.vandekerkhof@ehv.ce.philips.com Introduction The introduction of the ompact Disc, already more than fifteen years ago, has brought high quality

More information

CSCD 443/533 Advanced Networks Fall 2017

CSCD 443/533 Advanced Networks Fall 2017 CSCD 443/533 Advanced Networks Fall 2017 Lecture 18 Compression of Video and Audio 1 Topics Compression technology Motivation Human attributes make it possible Audio Compression Video Compression Performance

More information

Audio coding for digital broadcasting

Audio coding for digital broadcasting Recommendation ITU-R BS.1196-4 (02/2015) Audio coding for digital broadcasting BS Series Broadcasting service (sound) ii Rec. ITU-R BS.1196-4 Foreword The role of the Radiocommunication Sector is to ensure

More information

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved.

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved. Compressed Audio Demystified Why Music Producers Need to Care About Compressed Audio Files Download Sales Up CD Sales Down High-Definition hasn t caught on yet Consumers don t seem to care about high fidelity

More information

For Mac and iphone. James McCartney Core Audio Engineer. Eric Allamanche Core Audio Engineer

For Mac and iphone. James McCartney Core Audio Engineer. Eric Allamanche Core Audio Engineer For Mac and iphone James McCartney Core Audio Engineer Eric Allamanche Core Audio Engineer 2 3 James McCartney Core Audio Engineer 4 Topics About audio representation formats Converting audio Processing

More information

The MPEG-4 General Audio Coder

The MPEG-4 General Audio Coder The MPEG-4 General Audio Coder Bernhard Grill Fraunhofer Institute for Integrated Circuits (IIS) grl 6/98 page 1 Outline MPEG-2 Advanced Audio Coding (AAC) MPEG-4 Extensions: Perceptual Noise Substitution

More information

Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio

Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio Dr. Jürgen Herre 11/07 Page 1 Jürgen Herre für (IIS) Erlangen, Germany Introduction: Sound Images? Humans

More information

Rich Recording Technology Technical overall description

Rich Recording Technology Technical overall description Rich Recording Technology Technical overall description Ari Koski Nokia with Windows Phones Product Engineering/Technology Multimedia/Audio/Audio technology management 1 Nokia s Rich Recording technology

More information

CHAPTER 10: SOUND AND VIDEO EDITING

CHAPTER 10: SOUND AND VIDEO EDITING CHAPTER 10: SOUND AND VIDEO EDITING What should you know 1. Edit a sound clip to meet the requirements of its intended application and audience a. trim a sound clip to remove unwanted material b. join

More information

The Gullibility of Human Senses

The Gullibility of Human Senses The Gullibility of Human Senses Three simple tricks for producing LBSC 690: Week 9 Multimedia Jimmy Lin College of Information Studies University of Maryland Monday, April 2, 2007 Images Video Audio But

More information

Modeling of an MPEG Audio Layer-3 Encoder in Ptolemy

Modeling of an MPEG Audio Layer-3 Encoder in Ptolemy Modeling of an MPEG Audio Layer-3 Encoder in Ptolemy Patrick Brown EE382C Embedded Software Systems May 10, 2000 $EVWUDFW MPEG Audio Layer-3 is a standard for the compression of high-quality digital audio.

More information

Aud-X 5.1 Help.

Aud-X 5.1 Help. Aud-X 5.1 Help www.aud-x.com Overview Aud-X has been engineered as 5.1 channel surround sound CODEC, and is dedicated for the use with Xvid/Divx based movies. It is of course also possible to compress

More information

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO ISO/IEC JTC1/SC29/WG11 N15071 February 2015, Geneva,

More information

DAB. Digital Audio Broadcasting

DAB. Digital Audio Broadcasting DAB Digital Audio Broadcasting DAB history DAB has been under development since 1981 at the Institut für Rundfunktechnik (IRT). In 1985 the first DAB demonstrations were held at the WARC-ORB in Geneva

More information

_äìé`çêé. Audio Compression Codec Specifications and Requirements. Application Note. Issue 2

_äìé`çêé. Audio Compression Codec Specifications and Requirements. Application Note. Issue 2 _äìé`çêé Audio Compression Codec Specifications and Requirements Application Note Issue 2 CSR Cambridge Science Park Milton Road Cambridge CB4 0WH United Kingdom Registered in England 3665875 Tel: +44

More information

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO ISO/IEC JTC1/SC29 WG11 N15073 February 2015, Geneva,

More information

1. Before adjusting sound quality

1. Before adjusting sound quality 1. Before adjusting sound quality Functions available when the optional 5.1 ch decoder/av matrix unit is connected The following table shows the finer audio adjustments that can be performed when the optional

More information

AUDIO AND VIDEO COMMUNICATION MEEC EXERCISES. (with abbreviated solutions) Fernando Pereira

AUDIO AND VIDEO COMMUNICATION MEEC EXERCISES. (with abbreviated solutions) Fernando Pereira AUDIO AND VIDEO COMMUNICATION MEEC EXERCISES (with abbreviated solutions) Fernando Pereira INSTITUTO SUPERIOR TÉCNICO Departamento de Engenharia Electrotécnica e de Computadores September 2014 1. Photographic

More information

MPEG-1 Bitstreams Processing for Audio Content Analysis

MPEG-1 Bitstreams Processing for Audio Content Analysis ISSC, Cork. June 5- MPEG- Bitstreams Processing for Audio Content Analysis Roman Jarina, Orla Duffner, Seán Marlow, Noel O Connor, and Noel Murphy Visual Media Processing Group Dublin City University Glasnevin,

More information

Design and Implementation of an MPEG-1 Layer III Audio Decoder KRISTER LAGERSTRÖM

Design and Implementation of an MPEG-1 Layer III Audio Decoder KRISTER LAGERSTRÖM Design and Implementation of an MPEG-1 Layer III Audio Decoder KRISTER LAGERSTRÖM Master s Thesis Computer Science and Engineering Program CHALMERS UNIVERSITY OF TECHNOLOGY Department of Computer Engineering

More information

15 Data Compression 2014/9/21. Objectives After studying this chapter, the student should be able to: 15-1 LOSSLESS COMPRESSION

15 Data Compression 2014/9/21. Objectives After studying this chapter, the student should be able to: 15-1 LOSSLESS COMPRESSION 15 Data Compression Data compression implies sending or storing a smaller number of bits. Although many methods are used for this purpose, in general these methods can be divided into two broad categories:

More information

Chapter 7 Multimedia Operating Systems

Chapter 7 Multimedia Operating Systems MODERN OPERATING SYSTEMS Third Edition ANDREW S. TANENBAUM Chapter 7 Multimedia Operating Systems Introduction To Multimedia (1) Figure 7-1. Video on demand using different local distribution technologies.

More information

Audio Processing on ARM Cortex -M4 for Automotive Applications

Audio Processing on ARM Cortex -M4 for Automotive Applications Audio Processing on ARM Cortex -M4 for Automotive Applications Introduction By Pradeep D, Ittiam Systems Pvt. Ltd. Automotive infotainment systems have become an integral part of the in-car experience.

More information

UNDERSTANDING MUSIC & VIDEO FORMATS

UNDERSTANDING MUSIC & VIDEO FORMATS ComputerFixed.co.uk Page: 1 Email: info@computerfixed.co.uk UNDERSTANDING MUSIC & VIDEO FORMATS Are you confused with all the different music and video formats available? Do you know the difference between

More information

5LSE0 - Mod 10 Part 1. MPEG Motion Compensation and Video Coding. MPEG Video / Temporal Prediction (1)

5LSE0 - Mod 10 Part 1. MPEG Motion Compensation and Video Coding. MPEG Video / Temporal Prediction (1) 1 Multimedia Video Coding & Architectures (5LSE), Module 1 MPEG-1/ Standards: Motioncompensated video coding 5LSE - Mod 1 Part 1 MPEG Motion Compensation and Video Coding Peter H.N. de With (p.h.n.de.with@tue.nl

More information

Scalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC

Scalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC Scalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC Ralf Geiger 1, Gerald Schuller 1, Jürgen Herre 2, Ralph Sperschneider 2, Thomas Sporer 1 1 Fraunhofer IIS AEMT, Ilmenau, Germany 2 Fraunhofer

More information

IMAGE COMPRESSION. Image Compression. Why? Reducing transportation times Reducing file size. A two way event - compression and decompression

IMAGE COMPRESSION. Image Compression. Why? Reducing transportation times Reducing file size. A two way event - compression and decompression IMAGE COMPRESSION Image Compression Why? Reducing transportation times Reducing file size A two way event - compression and decompression 1 Compression categories Compression = Image coding Still-image

More information

Ch. 4: Video Compression Multimedia Systems

Ch. 4: Video Compression Multimedia Systems Ch. 4: Video Compression Multimedia Systems Prof. Ben Lee (modified by Prof. Nguyen) Oregon State University School of Electrical Engineering and Computer Science 1 Outline Introduction MPEG Overview MPEG

More information

Simple Watermark for Stereo Audio Signals with Modulated High-Frequency Band Delay

Simple Watermark for Stereo Audio Signals with Modulated High-Frequency Band Delay ACOUSTICAL LETTER Simple Watermark for Stereo Audio Signals with Modulated High-Frequency Band Delay Kazuhiro Kondo and Kiyoshi Nakagawa Graduate School of Science and Engineering, Yamagata University,

More information

ISO/IEC INTERNATIONAL STANDARD. Information technology MPEG audio technologies Part 3: Unified speech and audio coding

ISO/IEC INTERNATIONAL STANDARD. Information technology MPEG audio technologies Part 3: Unified speech and audio coding INTERNATIONAL STANDARD This is a preview - click here to buy the full publication ISO/IEC 23003-3 First edition 2012-04-01 Information technology MPEG audio technologies Part 3: Unified speech and audio

More information

IMAGE COMPRESSION. October 7, ICSY Lab, University of Kaiserslautern, Germany

IMAGE COMPRESSION. October 7, ICSY Lab, University of Kaiserslautern, Germany Lossless Compression Multimedia File Formats Lossy Compression IMAGE COMPRESSION 69 Basic Encoding Steps 70 JPEG (Overview) Image preparation and coding (baseline system) 71 JPEG (Enoding) 1) select color

More information

MPEG-4 aacplus - Audio coding for today s digital media world

MPEG-4 aacplus - Audio coding for today s digital media world MPEG-4 aacplus - Audio coding for today s digital media world Whitepaper by: Gerald Moser, Coding Technologies November 2005-1 - 1. Introduction Delivering high quality digital broadcast content to consumers

More information

CHAPTER 6 Audio compression in practice

CHAPTER 6 Audio compression in practice CHAPTER 6 Audio compression in practice In earlier chapters we have seen that digital sound is simply an array of numbers, where each number is a measure of the air pressure at a particular time. This

More information

Multimedia Communications. Transform Coding

Multimedia Communications. Transform Coding Multimedia Communications Transform Coding Transform coding Transform coding: source output is transformed into components that are coded according to their characteristics If a sequence of inputs is transformed

More information

Ch 4: Multimedia. Fig.4.1 Internet Audio/Video

Ch 4: Multimedia. Fig.4.1 Internet Audio/Video Ch 4: Multimedia Recent advances in technology have changed our use of audio and video. In the past, we listened to an audio broadcast through a radio and watched a video program broadcast through a TV.

More information

Wavelet filter bank based wide-band audio coder

Wavelet filter bank based wide-band audio coder Wavelet filter bank based wide-band audio coder J. Nováček Czech Technical University, Faculty of Electrical Engineering, Technicka 2, 16627 Prague, Czech Republic novacj1@fel.cvut.cz 3317 New system for

More information

H.261. Lecture Special Topics in Signal Processing. Multimedia Communications: Coding, Systems, and Networking

H.261. Lecture Special Topics in Signal Processing. Multimedia Communications: Coding, Systems, and Networking 8-899 Special Topics in Signal Processing Multimedia Communications: Coding, Sstems, and Networking Prof. Tsuhan Chen tsuhan@ece.cmu.edu Lecture 4 H.6 H.6 ITU-T Stud Group 5, 984-99 Videophone and video

More information

Chapter 1. Data Storage Pearson Addison-Wesley. All rights reserved

Chapter 1. Data Storage Pearson Addison-Wesley. All rights reserved Chapter 1 Data Storage 2007 Pearson Addison-Wesley. All rights reserved Chapter 1: Data Storage 1.1 Bits and Their Storage 1.2 Main Memory 1.3 Mass Storage 1.4 Representing Information as Bit Patterns

More information

Chapter 11.3 MPEG-2. MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications:

Chapter 11.3 MPEG-2. MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications: Chapter 11.3 MPEG-2 MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications: Simple, Main, SNR scalable, Spatially scalable, High, 4:2:2,

More information

Interframe coding of video signals

Interframe coding of video signals Interframe coding of video signals Adaptive intra-interframe prediction Conditional replenishment Rate-distortion optimized mode selection Motion-compensated prediction Hybrid coding: combining interframe

More information

Source Coding Basics and Speech Coding. Yao Wang Polytechnic University, Brooklyn, NY11201

Source Coding Basics and Speech Coding. Yao Wang Polytechnic University, Brooklyn, NY11201 Source Coding Basics and Speech Coding Yao Wang Polytechnic University, Brooklyn, NY1121 http://eeweb.poly.edu/~yao Outline Why do we need to compress speech signals Basic components in a source coding

More information

MpegRepair Software Encoding and Repair Utility

MpegRepair Software Encoding and Repair Utility PixelTools MpegRepair Software Encoding and Repair Utility MpegRepair integrates fully featured encoding, analysis, decoding, demuxing, transcoding and stream manipulations into one powerful application.

More information

A Generic Audio Classification and Segmentation Approach for Multimedia Indexing and Retrieval

A Generic Audio Classification and Segmentation Approach for Multimedia Indexing and Retrieval A Generic Audio Classification and Segmentation Approach for Multimedia Indexing and Retrieval 1 A Generic Audio Classification and Segmentation Approach for Multimedia Indexing and Retrieval Serkan Kiranyaz,

More information

ECE 499/599 Data Compression & Information Theory. Thinh Nguyen Oregon State University

ECE 499/599 Data Compression & Information Theory. Thinh Nguyen Oregon State University ECE 499/599 Data Compression & Information Theory Thinh Nguyen Oregon State University Adminstrivia Office Hours TTh: 2-3 PM Kelley Engineering Center 3115 Class homepage http://www.eecs.orst.edu/~thinhq/teaching/ece499/spring06/spring06.html

More information

Technical PapER. between speech and audio coding. Fraunhofer Institute for Integrated Circuits IIS

Technical PapER. between speech and audio coding. Fraunhofer Institute for Integrated Circuits IIS Technical PapER Extended HE-AAC Bridging the gap between speech and audio coding One codec taking the place of two; one unified system bridging a troublesome gap. The fifth generation MPEG audio codec

More information

Improved Audio Coding Using a Psychoacoustic Model Based on a Cochlear Filter Bank

Improved Audio Coding Using a Psychoacoustic Model Based on a Cochlear Filter Bank IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 10, NO. 7, OCTOBER 2002 495 Improved Audio Coding Using a Psychoacoustic Model Based on a Cochlear Filter Bank Frank Baumgarte Abstract Perceptual

More information

AUDIO information often plays an essential role in understanding

AUDIO information often plays an essential role in understanding 1062 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 14, NO. 3, MAY 2006 A Generic Audio Classification and Segmentation Approach for Multimedia Indexing and Retrieval Serkan Kiranyaz,

More information

A NEW DCT-BASED WATERMARKING METHOD FOR COPYRIGHT PROTECTION OF DIGITAL AUDIO

A NEW DCT-BASED WATERMARKING METHOD FOR COPYRIGHT PROTECTION OF DIGITAL AUDIO International journal of computer science & information Technology (IJCSIT) Vol., No.5, October A NEW DCT-BASED WATERMARKING METHOD FOR COPYRIGHT PROTECTION OF DIGITAL AUDIO Pranab Kumar Dhar *, Mohammad

More information

CS 335 Graphics and Multimedia. Image Compression

CS 335 Graphics and Multimedia. Image Compression CS 335 Graphics and Multimedia Image Compression CCITT Image Storage and Compression Group 3: Huffman-type encoding for binary (bilevel) data: FAX Group 4: Entropy encoding without error checks of group

More information

2.1 Transcoding audio files

2.1 Transcoding audio files 2.1 Transcoding audio files 2.1.1 Introduction to Transcoding One of the basic tasks you can perform on an audio track is to convert it into another format. This process known as Transcoding, is the direct

More information

Selection tool - for selecting the range of audio you want to edit or listen to.

Selection tool - for selecting the range of audio you want to edit or listen to. Audacity Quick Guide Audacity is an easy-to-use audio editor and recorder. You can use Audacity to: Record live audio. Convert tapes and records into digital recordings or CDs. Edit sound files. Cut, copy,

More information

ROW.mp3. Colin Raffel, Jieun Oh, Isaac Wang Music 422 Final Project 3/12/2010

ROW.mp3. Colin Raffel, Jieun Oh, Isaac Wang Music 422 Final Project 3/12/2010 ROW.mp3 Colin Raffel, Jieun Oh, Isaac Wang Music 422 Final Project 3/12/2010 Motivation The realities of mp3 widespread use low quality vs. bit rate when compared to modern codecs Vision for row-mp3 backwards

More information

PHABRIX. Dolby Test & Measurement Application Notes. broadcast excellence. Overview. Dolby Metadata Detection. Dolby Metadata Analysis

PHABRIX. Dolby Test & Measurement Application Notes. broadcast excellence. Overview. Dolby Metadata Detection. Dolby Metadata Analysis PHABRIX broadcast excellence Dolby Test & Measurement Application Notes Overview The daily use of technologies such as Dolby E, Dolby Digital and Dolby Digital Plus in all areas of broadcast television

More information

Audio Systems for DTV

Audio Systems for DTV Audio Systems for TV ITU Seminar Kiev, Nov. 14, 2000 Tony Spath olby Laboratories England ts@dolby.com http://www.dolby.com on t forget the audio 2 "The cost of entertainment, at least as far as the hardware

More information

A PSYCHOACOUSTIC MODEL WITH PARTIAL SPECTRAL FLATNESS MEASURE FOR TONALITY ESTIMATION

A PSYCHOACOUSTIC MODEL WITH PARTIAL SPECTRAL FLATNESS MEASURE FOR TONALITY ESTIMATION A PSYCHOACOUSTIC MODEL WITH PARTIAL SPECTRAL FLATNESS MEASURE FOR TONALITY ESTIMATION Armin Taghipour 1, Maneesh Chandra Jaikumar 2, and Bernd Edler 1 1 International Audio Laboratories Erlangen, Am Wolfsmantel

More information

End-to-End Data. Presentation Formatting. Difficulties. Outline Formatting Compression

End-to-End Data. Presentation Formatting. Difficulties. Outline Formatting Compression End-to-End Data Outline Formatting Compression Spring 2009 CSE30264 1 Presentation Formatting Marshalling (encoding) application data into messages Unmarshalling (decoding) messages into application data

More information

Multimedia Signals and Systems Motion Picture Compression - MPEG

Multimedia Signals and Systems Motion Picture Compression - MPEG Multimedia Signals and Systems Motion Picture Compression - MPEG Kunio Takaya Electrical and Computer Engineering University of Saskatchewan March 9, 2008 MPEG video coding A simple introduction Dr. S.R.

More information

A Review of Algorithms for Perceptual Coding of Digital Audio Signals

A Review of Algorithms for Perceptual Coding of Digital Audio Signals A Review of Algorithms for Perceptual Coding of Digital Audio Signals Ted Painter, Student Member IEEE, and Andreas Spanias, Senior Member IEEE Department of Electrical Engineering, Telecommunications

More information

Lightweight Arithmetic for Mobile Multimedia Devices. IEEE Transactions on Multimedia

Lightweight Arithmetic for Mobile Multimedia Devices. IEEE Transactions on Multimedia Lightweight Arithmetic for Mobile Multimedia Devices Tsuhan Chen Carnegie Mellon University tsuhan@cmu.edu Thanks to Fang Fang and Rob Rutenbar IEEE Transactions on Multimedia EDICS Signal Processing for

More information

Anatomy of a Video Codec

Anatomy of a Video Codec Anatomy of a Video Codec The inner workings of Ogg Theora Dr. Timothy B. Terriberry Outline Introduction Video Structure Motion Compensation The DCT Transform Quantization and Coding The Loop Filter Conclusion

More information

DbsEditor v1.8 (Database Editor) QUICK REFERENCE

DbsEditor v1.8 (Database Editor) QUICK REFERENCE DbsEditor v1.8 (Database Editor) QUICK REFERENCE By S. KIRANYAZ This application is a dialog-based program, which is designed to perform several off-line tasks of the MUVIS multimedia databases. DbsEditor

More information

The Best-Performance Digital Video Recorder JPEG2000 DVR V.S M-PEG & MPEG4(H.264)

The Best-Performance Digital Video Recorder JPEG2000 DVR V.S M-PEG & MPEG4(H.264) The Best-Performance Digital Video Recorder JPEG2000 DVR V.S M-PEG & MPEG4(H.264) Many DVRs in the market But it takes brains to make the best product JPEG2000 The best picture quality in playback. Brief

More information

VideoCD Audio + Stills A solution compatible with DVD players

VideoCD Audio + Stills A solution compatible with DVD players VideoCD Audio + Stills A solution compatible with DVD players 1. INTRODUCTION This manual is a translation into English from the original Spanish document available in www.videoedicion.org and www.vcdsp.com,

More information

Module 7 VIDEO CODING AND MOTION ESTIMATION

Module 7 VIDEO CODING AND MOTION ESTIMATION Module 7 VIDEO CODING AND MOTION ESTIMATION Lesson 20 Basic Building Blocks & Temporal Redundancy Instructional Objectives At the end of this lesson, the students should be able to: 1. Name at least five

More information

ENEE408G Multimedia Signal Processing Design Project on Digital Audio Processing

ENEE408G Multimedia Signal Processing Design Project on Digital Audio Processing The Goals ENEE408G Multimedia Signal Processing Design Project on Digital Audio Processing 1. Learn the fundamentals of perceptual coding of audio and intellectual rights protection from multimedia. 2.

More information

In the first part of our project report, published

In the first part of our project report, published Editor: Harrick Vin University of Texas at Austin Multimedia Broadcasting over the Internet: Part II Video Compression Borko Furht Florida Atlantic University Raymond Westwater Future Ware Jeffrey Ice

More information

Genista Products and Solutions

Genista Products and Solutions Genista Products and Solutions All that you need to measure media quality Genista Optimacy Media Quality Assurance Workstation Genista PQoS Portable Media Quality Test Instrument Genista SDK A Software

More information

Compressor User Guide

Compressor User Guide Compressor User Guide KKApple Inc. Copyright 2014 Apple Inc. All rights reserved. Your rights to the software are governed by the accompanying software license agreement. The owner or authorized user of

More information

COMP 249 Advanced Distributed Systems Multimedia Networking. The Video Data Type Coding & Compression Basics

COMP 249 Advanced Distributed Systems Multimedia Networking. The Video Data Type Coding & Compression Basics COMP 9 Advanced Distributed Systems Multimedia Networking The Video Data Type Coding & Compression Basics Kevin Jeffay Department of Computer Science University of North Carolina at Chapel Hill jeffay@cs.unc.edu

More information

Digital Video Compression and H.261 Recommendation

Digital Video Compression and H.261 Recommendation Digital Video Compression and H.261 Recommendation Fernando Pereira Klagenfurt, Austria, October 2008 Video versus Images Still Image Services No strong temporal requirements; no realtime notion. Video

More information

JPEG IMAGE CODING WITH ADAPTIVE QUANTIZATION

JPEG IMAGE CODING WITH ADAPTIVE QUANTIZATION JPEG IMAGE CODING WITH ADAPTIVE QUANTIZATION Julio Pons 1, Miguel Mateo 1, Josep Prades 2, Román Garcia 1 Universidad Politécnica de Valencia Spain 1 {jpons,mimateo,roman}@disca.upv.es 2 jprades@dcom.upv.es

More information

JPEG Joint Photographic Experts Group ISO/IEC JTC1/SC29/WG1 Still image compression standard Features

JPEG Joint Photographic Experts Group ISO/IEC JTC1/SC29/WG1 Still image compression standard Features JPEG-2000 Joint Photographic Experts Group ISO/IEC JTC1/SC29/WG1 Still image compression standard Features Improved compression efficiency (vs. JPEG) Highly scalable embedded data streams Progressive lossy

More information

Mesh Based Interpolative Coding (MBIC)

Mesh Based Interpolative Coding (MBIC) Mesh Based Interpolative Coding (MBIC) Eckhart Baum, Joachim Speidel Institut für Nachrichtenübertragung, University of Stuttgart An alternative method to H.6 encoding of moving images at bit rates below

More information

A Robust Audio Fingerprinting Algorithm in MP3 Compressed Domain

A Robust Audio Fingerprinting Algorithm in MP3 Compressed Domain A Robust Audio Fingerprinting Algorithm in MP3 Compressed Domain Ruili Zhou, Yuesheng Zhu Abstract In this paper, a new robust audio fingerprinting algorithm in MP3 compressed domain is proposed with high

More information

Memory Access and Computational Behavior. of MP3 Encoding

Memory Access and Computational Behavior. of MP3 Encoding Memory Access and Computational Behavior of MP3 Encoding by Michael Lance Karm, B.S.E. Report Presented to the Faculty of the Graduate School of The University of Texas at Austin in Partial Fulfillment

More information

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework System Modeling and Implementation of MPEG-4 Encoder under Fine-Granular-Scalability Framework Final Report Embedded Software Systems Prof. B. L. Evans by Wei Li and Zhenxun Xiao May 8, 2002 Abstract Stream

More information

Module Introduction. Content 15 pages 2 questions. Learning Time 25 minutes

Module Introduction. Content 15 pages 2 questions. Learning Time 25 minutes Purpose The intent of this module is to introduce you to the multimedia features and functions of the i.mx31. You will learn about the Imagination PowerVR MBX- Lite hardware core, graphics rendering, video

More information

HIGH FIDELITY MULTICHANNEL AUDIO COMPRESSION. Dai Yang

HIGH FIDELITY MULTICHANNEL AUDIO COMPRESSION. Dai Yang HIGH FIDELITY MULTICHANNEL AUDIO COMPRESSION by Dai Yang A Dissertation Presented to the FACULTY OF THE GRADUATE SCHOOL UNIVERSITY OF SOUTHERN CALIFORNIA In Partial Fulfillment of the Requirements for

More information

Cisco D9054 HDTV Advanced Compression Encoder

Cisco D9054 HDTV Advanced Compression Encoder Cisco D9054 HDTV Advanced Compression Encoder Product Overview The MPEG-4 part 10 (H.264/AVC) D9054 Encoder is the right choice for any operator who wants to compress high-definition video using MPEG-4

More information

Blu-ray Disc DMP-BDT280

Blu-ray Disc DMP-BDT280 Televisions & Home Entertainment Blu-ray Disc DMP-BDT280 Blu-ray Disc Players PLAYABLE DISCS BD-ROM FULL HD 3D / BD- Video BD-RE / BD- RE DL (Ver.3) BDAV/ BDMV / / FLAC / AAC / MP3 / ALAC / BD-R / BD-R

More information

An Experimental High Fidelity Perceptual Audio Coder

An Experimental High Fidelity Perceptual Audio Coder An Experimental High Fidelity Perceptual Audio Coder Bosse Lincoln Center for Computer Research in Music and Acoustics (CCRMA) Department of Music, Stanford University Stanford, California 94305 March

More information