MPEG-1. Overview of MPEG-1 1 Standard. Introduction to perceptual and entropy codings

Similar documents
Mpeg 1 layer 3 (mp3) general overview

5: Music Compression. Music Coding. Mark Handley

Optical Storage Technology. MPEG Data Compression

ELL 788 Computational Perception & Cognition July November 2015

Modeling of an MPEG Audio Layer-3 Encoder in Ptolemy

Module 9 AUDIO CODING. Version 2 ECE IIT, Kharagpur

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding

Audio Compression. Audio Compression. Absolute Threshold. CD quality audio:

Audio-coding standards

Audio-coding standards

Fundamentals of Perceptual Audio Encoding. Craig Lewiston HST.723 Lab II 3/23/06

Audio Coding Standards

Audio Coding and MP3

CISC 7610 Lecture 3 Multimedia data and data formats

Lecture 16 Perceptual Audio Coding

EE482: Digital Signal Processing Applications

Contents. 3 Vector Quantization The VQ Advantage Formulation Optimality Conditions... 48

Multimedia Communications. Audio coding

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.

Principles of Audio Coding

Principles of MPEG audio compression

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.

Chapter 14 MPEG Audio Compression

Efficient Implementation of Transform Based Audio Coders using SIMD Paradigm and Multifunction Computations

Appendix 4. Audio coding algorithms

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved.

Lossy compression. CSCI 470: Web Science Keith Vertanen

Ch. 5: Audio Compression Multimedia Systems

DRA AUDIO CODING STANDARD

AUDIOVISUAL COMMUNICATION

2.4 Audio Compression

MPEG-4 General Audio Coding

Lossy compression CSCI 470: Web Science Keith Vertanen Copyright 2013

DAB. Digital Audio Broadcasting

Figure 1. Generic Encoder. Window. Spectral Analysis. Psychoacoustic Model. Quantize. Pack Data into Frames. Additional Coding.

Wavelet filter bank based wide-band audio coder

An adaptive wavelet-based approach for perceptual low bit rate audio coding attending to entropy-type criteria

AUDIOVISUAL COMMUNICATION

ISO/IEC INTERNATIONAL STANDARD

Lecture 7: Audio Compression & Coding

MPEG-1 Bitstreams Processing for Audio Content Analysis

An investigation of non-uniform bandwidths auditory filterbank in audio coding

/ / _ / _ / _ / / / / /_/ _/_/ _/_/ _/_/ _\ / All-American-Advanced-Audio-Codec

Design and Implementation of an MPEG-1 Layer III Audio Decoder KRISTER LAGERSTRÖM

<< WILL FILL IN THESE SECTIONS THIS WEEK to provide sufficient background>>

Module 9 AUDIO CODING. Version 2 ECE IIT, Kharagpur

PQMF Filter Bank, MPEG-1 / MPEG-2 BC Audio. Fraunhofer IDMT

HAVE YOUR CAKE AND HEAR IT TOO: A HUFFMAN CODED, BLOCK SWITCHING, STEREO PERCEPTUAL AUDIO CODER

MPEG-4 aacplus - Audio coding for today s digital media world

Compression Part 2 Lossy Image Compression (JPEG) Norm Zeck

Parametric Coding of High-Quality Audio

The following bit rates are recommended for broadcast contribution employing the most commonly used audio coding schemes:

Chapter 4: Audio Coding

What is multimedia? Multimedia. Continuous media. Most common media types. Continuous media processing. Interactivity. What is multimedia?

Audio and video compression

Digital Audio for Multimedia

New Results in Low Bit Rate Speech Coding and Bandwidth Extension

Multimedia. What is multimedia? Media types. Interchange formats. + Text +Graphics +Audio +Image +Video. Petri Vuorimaa 1

Data Compression. Audio compression

SPREAD SPECTRUM AUDIO WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL

Digital Audio Compression

CS 074 The Digital World. Digital Audio

Audio coding for digital broadcasting

CS 335 Graphics and Multimedia. Image Compression

Performance analysis of AAC audio codec and comparison of Dirac Video Codec with AVS-china. Under guidance of Dr.K.R.Rao Submitted By, ASHWINI S URS

Networking Applications

WORLD AIRLINE ENTERTAINMENT ASSOCIATION TECHNOLOGY COMMITTEE WAEA SPECIFICATION

MUSIC A Darker Phonetic Audio Coder

AN AUDIO WATERMARKING SCHEME ROBUST TO MPEG AUDIO COMPRESSION

Compression; Error detection & correction

DSP. Presented to the IEEE Central Texas Consultants Network by Sergio Liberman

Compression transparent low-level description of audio signals

ISO/IEC INTERNATIONAL STANDARD. Information technology MPEG audio technologies Part 3: Unified speech and audio coding

Packet Loss Concealment for Audio Streaming based on the GAPES and MAPES Algorithms

MP3. Panayiotis Petropoulos

ISOIIEC I I NTERNATI O NA L S TA NDA R D. Information technology

The MPEG-4 General Audio Coder

Parametric Coding of Spatial Audio

Proceedings of Meetings on Acoustics

S.K.R Engineering College, Chennai, India. 1 2

Efficient Signal Adaptive Perceptual Audio Coding

AVS-P3: Algorithm and Implementation

CSCD 443/533 Advanced Networks Fall 2017

Scalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC

Bit or Noise Allocation

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd

Convention Paper Presented at the 121st Convention 2006 October 5 8 San Francisco, CA, USA

Port of a fixed point MPEG2-AAC encoder on a ARM platform

AET 1380 Digital Audio Formats

Multimedia Systems Speech I Mahdi Amiri February 2011 Sharif University of Technology

An Experimental High Fidelity Perceptual Audio Coder

A Robust Audio Fingerprinting Algorithm in MP3 Compressed Domain

Memory Access and Computational Behavior. of MP3 Encoding

Lecture Information Multimedia Video Coding & Architectures

Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio

A Review of Algorithms for Perceptual Coding of Digital Audio Signals

Transporting audio-video. over the Internet

Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig

ENTROPY CODING OF QUANTIZED SPECTRAL COMPONENTS IN FDLP AUDIO CODEC

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd

Transcription:

MPEG-1 Overview of MPEG-1 1 Standard Introduction to perceptual and entropy codings

Contents History Psychoacoustics and perceptual coding Entropy coding MPEG-1 Layer I/II Layer III (MP3) Comparison and Audio Quality 25 September 2009 MPEG-1 1 Presentation 2

Introduction Digitizing an analog signal is (lossy( lossy) ) compression Digitizing introduces quantization noise Quantization noise imply loss of quality Linear quantization > 16 bit (98 db) inaudible noise (CD) Linear quantization 4 bit (26 db) 25 September 2009 MPEG-1 1 Presentation 3

Contents History Psychoacoustics and perceptual coding Entropy coding MPEG-1 Layer I/II Layer III (MP3) Comparison and Audio Quality 25 September 2009 MPEG-1 1 Presentation 4

History Moving Picture Expert Group (MPEG) Created in January 1988 Starts the development of MPEG-1 1 in May 1988 Publishes the MPEG-1 1 standard in November 1992 (ISO/IEC 11172-3 for audio) MPEG-1 1 standard Defines bit-stream Defines decoding functions DOES NOT define encoding techniques Inspired by MUSICAM (Masking pattern Universal Subband Integrated Coding And Multiplexing) 25 September 2009 MPEG-1 1 Presentation 5

Contents History Psychoacoustics and perceptual coding Entropy coding MPEG-1 Layer I/II Layer III (MP3) Comparison and Audio Quality 25 September 2009 MPEG-1 1 Presentation 6

Masking effect Psychoacoustics Critical bands Time domain masking (Pohlmann 2000) Frequency domain masking (Pohlmann 2000) z/bark lower boundary higher boundary bandwidth central frequency (Brandenburg) 0 0 100 100 50 1 100 200 100 150 2 200 300 100 250 3 300 400 100 350 4 400 510 110 450 5 510 630 120 570 6 630 770 140 700 7 770 920 150 840 Idealized critical bands (Painter & Spanias 2000) 25 September 2009 MPEG-1 1 Presentation 7

Perceptual Coding Dividing the different subbands of a signal Ignoring masked audio information Bits association according to masking threshold (Pohlmann 2000) Introducing inaudible quantization noise Quantization noise added according to masking threshold (Pohlmann 2000) 25 September 2009 MPEG-1 1 Presentation 8

Perceptual Coding Perceptual Encoder/Decoder (Kahrs & Brandenburg 1998) 25 September 2009 MPEG-1 1 Presentation 9

Contents History Psychoacoustics and perceptual coding Entropy coding MPEG-1 Layer I/II Layer III (MP3) Comparison and Audio Quality 25 September 2009 MPEG-1 1 Presentation 10

Entropic Coding Use information about the signal to code efficiently Entropy of a signal Example 1: {0, 2, 2, 2, 0, 0, 0, 0, 0, 2, 0, 3, 2, 2, 0, 0, 0, 3, 0, 0} 20 symbols twelve 0 (0.6), zero 1 (0), six 2 (0.3), two 3 (0.1) Entropy H = 1.30 Example 2: {1, 2, 3, 0, 2, 1, 1, 2, 3, 0, 0, 1, 0, 3, 3, 3, 2, 0, 1, 2} 20 symbols five 0 (0.25), five 1 (0.25), five 2 (0.25), five 3 (0.25) Entropy H = 2 Shannon theorem It is impossible to code with less than H bits/symbol It is possible to code with less than H+1 bits/symbol 25 September 2009 MPEG-1 1 Presentation 11

Huffman coding Entropic Coding Example 1: {0, 2, 2, 2, 0, 0, 0, 0, 0, 2, 0, 3, 2, 2, 0, 0, 0, 3, 0, 0} 20 symbols twelve 0 (0.6), zero 1 (0), six 2 (0.3), two 3 (0.1) Entropy H = 1.30 Immediate coding: 0 00 1 01 2 10 3 11 0010101000000000001000111010000000110000 Huffman coding: 0 0 1 111 2 10 3 110 010101000000100110101000011000 Efficiency: Immediate coding: 2 bits/symbol Huffman coding: 1.5 bits/symbol (statistically) 25 September 2009 MPEG-1 1 Presentation 12

Contents History Psychoacoustics and perceptual coding Entropy coding MPEG-1 Layer I/II Layer III (MP3) Comparison and Audio Quality 25 September 2009 MPEG-1 1 Presentation 13

MPEG-1 Sampling rate: 32, 44.1 and 48 khz Four modes: Mono: 1 channel Stereo: 2 channels Dual: 2 channels independent (e.g. bilingual programmes) Joint stereo: 2 channels coded together 2 perceptual models Floating point quantization (normalization) Error checking: Cyclic redundancy check (CRC) 25 September 2009 MPEG-1 1 Presentation 14

Contents History Psychoacoustics and perceptual coding Entropy coding MPEG-1 Layer I/II Layer III (MP3) Comparison and Audio Quality 25 September 2009 MPEG-1 1 Presentation 15

MPEG-1 1 Layer I From 32 to 448 kbps 32-subband polyphase filterbank Bit allocation (0-15) Max dynamic range > 120 db Linear quantization 1 frame 384 samples Example: Philips Digital Compact Cassette Example of Layer I encoder (Pohlmann 2000) Layer I frame format (Pohlmann 2000) 25 September 2009 MPEG-1 1 Presentation 16

MPEG-1 1 Layer II From 32 to 384 kbps Improvement of Layer I Improved FFT analysis Scale factor redundancy Finer quantization 1 frame 1152 samples Example: Digital Audio Broadcasting (DAB) Example of Layer II encoder (Pohlmann 2000) Layer II frame format (Pohlmann 2000) 25 September 2009 MPEG-1 1 Presentation 17

Contents History Psychoacoustics and perceptual coding Entropy coding MPEG-1 Layer I/II Layer III (MP3) Comparison and Audio Quality 25 September 2009 MPEG-1 1 Presentation 18

MPEG-1 1 Layer III (MP3) From 32 to 320 kbps Improvements: Finer psychoacoustics model Alias reduction (MDCT filters) Nonuniform quantization Entropy coding Adaptative block size Only Layer with patents Inspired by: ASPEC (audio spectral perceptual entropy coding) OCF (optimal coding in the freq. domain) Example of Layer III encoder (Pohlmann 2000) Layer III frame format (Pohlmann 2000) 25 September 2009 MPEG-1 1 Presentation 19

MPEG-1 1 Layer III (MP3) Filtering: Hybrid polyphase filter/mdct Steady-state signals: 18-point MDCT on every subband Frequency resolution: 41.67 Hz Time resolution: 24 ms Transient signals: 6-point 6 MDCT Frequency resolution: 125 Hz Time resolution: 8 ms 3 blocks modes Pre-echo echo detection Quantization : power 3/4 Entropy coding: Huffman tables Run length coding Filtering stage in Layer III encoder (Pohlmann 2000) MDCT filterbank in Layer III encoder (Pohlmann 2000) 25 September 2009 MPEG-1 1 Presentation 20

Intensity coding Joint Stereo Coding Sum of left/right channels Coding of the sum and of left/right scale factors Usually only for high-frequency subbands Efficient for redundant audio channels MS (mid/side) stereo coding Sum and difference of left/right channels Coding of the two values Stereo masking 25 September 2009 MPEG-1 1 Presentation 21

Contents History Psychoacoustics and perceptual coding Entropy coding MPEG-1 Layer I/II Layer III (MP3) Comparison and Audio Quality 25 September 2009 MPEG-1 1 Presentation 22

Audio Quality Comparison with CD-quality at 48 khz (16 bit - 1.412 Mbps) Layer I: No perceptual difference for 384 kbps (stereo) 2:1 compression Layer II: No perceptual difference for 256 kbps (stereo) 4:1 compression Layer III Increase of mean opinion score compared to Layer II at 256 kbps (stereo) for 128 kbps (stereo) 8:1 compression 25 September 2009 MPEG-1 1 Presentation 23

Comparison between Layers Layers I and II are very similar Each Layer has its defined decoder design Encoding/Decoding complexity: Layer I/II broadcasting Encoding/Decoding quality: Layer III audio storage 25 September 2009 MPEG-1 1 Presentation 24

Bibliography: Conclusion Ambikairajah,, E. & al. Auditory masking and MPEG-1 1 audio compression, Electronics & Communication Engineering Journal, 1997 Brandenburg, K. & Bosi,, M. Overview of MPEG Audio: Current and Future Standards for Low-Bit Bit-Rate Audio Coding,, Journal of the Audio Engineering Society, 1997,, Vol. Vol. 45(No. 1/2) Painter, T. & Spanias,, A. Perceptual Coding of Digital Audio,, Proceedings of IEEE, 2000,, Vol. Vol. 88(No. 4) Painter, T. & Spanias,, A. A Review of Algorithms for Perceptual Coding of Digital Audio Signals, Digital Signal Processing, 1997 Pan, D. A Tutorial on MPEG/Audio Compression,, IEEE MultiMedia,, IEEE Computer Society, 1995,, Vol. 2(2), pp. 60-74 Pan, D.Y. Digital Audio Compression,, Digital Technical Journal, 1993,, Vol. 5 Kahrs,, M. and Brandenburg, K. Applications of digital signal processing to audio and acoustics, Kluwer Academic Publishers, 1998 Mallat,, S. Traitement du Signal, Ecole Polytechnique, 2000 Pohlmann,, K.C. Principles of Digital Audio, McGraw-Hill Professional, 2000 25 September 2009 MPEG-1 1 Presentation 25