MPEG-4 General Audio Coding

Similar documents
The MPEG-4 General Audio Coder

Multimedia Communications. Audio coding

5: Music Compression. Music Coding. Mark Handley

Parametric Coding of High-Quality Audio

MPEG-4 Advanced Audio Coding

Chapter 14 MPEG Audio Compression

Audio coding for digital broadcasting

Chapter 4: Audio Coding

Efficient Implementation of Transform Based Audio Coders using SIMD Paradigm and Multifunction Computations

Audio Coding Standards

ISO/IEC INTERNATIONAL STANDARD. Information technology MPEG audio technologies Part 3: Unified speech and audio coding

ISO/IEC INTERNATIONAL STANDARD

SAOC and USAC. Spatial Audio Object Coding / Unified Speech and Audio Coding. Lecture Audio Coding WS 2013/14. Dr.-Ing.

Principles of Audio Coding

ELL 788 Computational Perception & Cognition July November 2015

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding

Scalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC

Embedded lossless audio coding using linear prediction and cascade coding

Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio

Speech and audio coding

Mpeg 1 layer 3 (mp3) general overview

Audio-coding standards

Audio-coding standards

Contents. 3 Vector Quantization The VQ Advantage Formulation Optimality Conditions... 48

Audio Compression. Audio Compression. Absolute Threshold. CD quality audio:

6MPEG-4 audio coding tools

Optical Storage Technology. MPEG Data Compression

2.4 Audio Compression

MPEG-4 Version 2 Audio Workshop: HILN - Parametric Audio Coding

MPEG-4 BSAC Technology

Audio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO

Technical PapER. between speech and audio coding. Fraunhofer Institute for Integrated Circuits IIS

Structural analysis of low latency audio coding schemes

Audio Coding and MP3

Lecture 16 Perceptual Audio Coding

MPEG-1. Overview of MPEG-1 1 Standard. Introduction to perceptual and entropy codings

ISO/IEC INTERNATIONAL STANDARD. Information technology Coding of audio-visual objects Part 3: Audio

Lecture 7: Audio Compression & Coding

S.K.R Engineering College, Chennai, India. 1 2

Perceptual Audio Coders What to listen for: Artifacts of Parametric Coding

Opus, a free, high-quality speech and audio codec

DRA AUDIO CODING STANDARD

PQMF Filter Bank, MPEG-1 / MPEG-2 BC Audio. Fraunhofer IDMT

New Results in Low Bit Rate Speech Coding and Bandwidth Extension

Appendix 4. Audio coding algorithms

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.

KINGS COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY ACADEMIC YEAR / ODD SEMESTER QUESTION BANK

Port of a Fixed Point MPEG-2 AAC Encoder on a ARM Platform

EE482: Digital Signal Processing Applications

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved.

Audio Engineering Society. Convention Paper. Presented at the 126th Convention 2009 May 7 10 Munich, Germany

3GPP TS V6.2.0 ( )

Figure 1. Generic Encoder. Window. Spectral Analysis. Psychoacoustic Model. Quantize. Pack Data into Frames. Additional Coding.

)3/)%# *4#3#7'. -0%' /CTOBER -ELBOURNE!UDIO 3UBGROUP 2EVISED 2EPORT ON #OMPLEXITY OF -0%'!!# 4OOLS

Workshop W14 - Audio Gets Smart: Semantic Audio Analysis & Metadata Standards

14th European Signal Processing Conference (EUSIPCO 2006), Florence, Italy, September 4-8, 2006, copyright by EURASIP

Bit or Noise Allocation

MPEG-4 aacplus - Audio coding for today s digital media world

Convention Paper 8654 Presented at the 132nd Convention 2012 April Budapest, Hungary

Information technology Coding of audio-visual objects Part 3: Audio

MPEG SURROUND: THE FORTHCOMING ISO STANDARD FOR SPATIAL AUDIO CODING

Ch. 5: Audio Compression Multimedia Systems

Data Compression. Audio compression

Convention Paper Presented at the 121st Convention 2006 October 5 8 San Francisco, CA, USA

HIGH FIDELITY MULTICHANNEL AUDIO COMPRESSION. Dai Yang

The Reference Model Architecture for MPEG Spatial Audio Coding

DSP. Presented to the IEEE Central Texas Consultants Network by Sergio Liberman

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd

ROW.mp3. Colin Raffel, Jieun Oh, Isaac Wang Music 422 Final Project 3/12/2010

ETSI TS V (201

Digital Audio for Multimedia

2014 Summer School on MPEG/VCEG Video. Video Coding Concept

/ / _ / _ / _ / / / / /_/ _/_/ _/_/ _/_/ _\ / All-American-Advanced-Audio-Codec

The Sensitivity Matrix

MP3. Panayiotis Petropoulos

Design and Implementation of an MPEG-1 Layer III Audio Decoder KRISTER LAGERSTRÖM

FINE-GRAIN SCALABLE AUDIO CODING BASED ON ENVELOPE RESTORATION AND THE SPIHT ALGORITHM

Principles of MPEG audio compression

_äìé`çêé. Audio Compression Codec Specifications and Requirements. Application Note. Issue 2

Fundamentals of Perceptual Audio Encoding. Craig Lewiston HST.723 Lab II 3/23/06

Digital video coding systems MPEG-1/2 Video

Week 14. Video Compression. Ref: Fundamentals of Multimedia

CISC 7610 Lecture 3 Multimedia data and data formats

Module 9 AUDIO CODING. Version 2 ECE IIT, Kharagpur

Video Coding Standards. Yao Wang Polytechnic University, Brooklyn, NY11201 http: //eeweb.poly.edu/~yao

MPEG-1 Bitstreams Processing for Audio Content Analysis

Bluray (

Audio and video compression

Performance analysis of AAC audio codec and comparison of Dirac Video Codec with AVS-china. Under guidance of Dr.K.R.Rao Submitted By, ASHWINI S URS

DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS

Modeling of an MPEG Audio Layer-3 Encoder in Ptolemy

MULTIPLEXING/DE-MULTIPLEXING DIRAC VIDEO WITH AAC AUDIO BIT STREAM ASHWINI URS. Presented to the Faculty of the Graduate School of

Features. Sequential encoding. Progressive encoding. Hierarchical encoding. Lossless encoding using a different strategy

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO

AUDIO AND VIDEO COMMUNICATION MEEC EXERCISES. (with abbreviated solutions) Fernando Pereira

CT516 Advanced Digital Communications Lecture 7: Speech Encoder

<< WILL FILL IN THESE SECTIONS THIS WEEK to provide sufficient background>>

Transcription:

MPEG-4 General Audio Coding Jürgen Herre Fraunhofer Institute for Integrated Circuits (IIS) Dr. Jürgen Herre, hrr@iis.fhg.de 1

General Audio Coding Solid state players, Internet audio, terrestrial and satellite digital audio broadcasting (HDTV),... MPEG-2 Advanced Audio Coding (AAC) MPEG-4 Extensions to AAC: Perceptual Noise Substitution (PNS) Long Term Prediction TwinVQ Coding Core Bit Slice Arithmetic Coding (BSAC) (v2) Low Delay AAC (v2) Dr. Jürgen Herre, hrr@iis.fhg.de 2

MPEG-2 AAC (1) Encoder Overview: Bitstream Output Bitstream Multiplexer Input time signal Gain Control Perceptual Model Filter Bank TNS Intensity/ Coupling Prediction M/S Scale Factors Quant. Rate/Distortion Control Noiseless Coding Dr. Jürgen Herre, hrr@iis.fhg.de 3

MPEG-2 AAC (2) Basic features: MPEG-4 General Audio Coding High frequency resolution filterbank-based coder (1024 lines MDCT with 50% overlap) 1:8 block switching (8 *128 lines MDCT) Non-uniform quantizer Noise shaping in half critical bands (scalefactor bands) Huffman coding of scalefactors and spectral coefficients Dr. Jürgen Herre, hrr@iis.fhg.de 4

MPEG-2 AAC (3) Advanced coding tools: MPEG-4 General Audio Coding Window shape adaptation Temporal noise shaping (TNS) Gain control (SRS profile only) Backward adaptive prediction Joint stereo coding tools: Mid/Side stereo (MS) per scalefactor band Intensity stereo coding between channel pairs Coupling channel(s) Other features: Flexible bitstream format for up to 48 audio channels, up to 16 Low Frequency Enhancement (LFE) channel(s) Dr. Jürgen Herre, hrr@iis.fhg.de 5

MPEG-2 AAC Performance Test Results: Broadcast quality at 320 kbit/s for 5 channels (better than MPEG-2 Layer II at 640 kbit/s) Broadcast quality at 128 kbit/s stereo Comparison to other codecs: AAC 96 kbit/sec stereo comparable to AC-3 at 160 kbit/s Layer II at 192 kbit/s Layer III at 128 kbit/s Very low bitrates (comparison within MPEG): AAC best audio coder at bitrates down to 16 kbit/s for mono and stereo Dr. Jürgen Herre, hrr@iis.fhg.de 6

Perceptual Noise Substitution Principle: Noise-like signal components are detected on a scalefactor band basis Corresponding groups of spectral coefficients are excluded from quantization/coding Instead, only a "noise substitution flag" plus total power of the substituted band is transmitted in the bitstream Decoder inserts pseudo random vectors with desired target power as spectral coefficients Highly compact representation for noise-like spectral components Dr. Jürgen Herre, hrr@iis.fhg.de 7

Demonstration: Perceptual Noise Substitution Contemporary Pop Spot: w/o PNS with PNS PNS parts only AAC coder @ 32 kbit/s, fs=48 khz, bandwidth 14 khz Note: This is not "normal" operation mode Dr. Jürgen Herre, hrr@iis.fhg.de 8

Transform-Domain Weighted Interleave VQ (1) Background: Desire audio coding at extremely low bitrates (6 kbit/s) CELP speech coders do not perform well for music MPEG-4: Transform-Domain Weighted Interleave Vector Quantization (TwinVQ) as additional coding kernel Fully integrated into MPEG-4 AAC coding system: Uses same spectral representation as AAC coder Makes use of other MPEG-4 tools (e.g. LTP, TNS, joint stereo coding) Possible core coder for MPEG-4 scalable coding Dr. Jürgen Herre, hrr@iis.fhg.de 9

Transform-Domain Weighted Interleave VQ (2) Structure: Normalization of spectral coefficients: LPC envelope (overall spectral shape) Periodic component coding (harmonic components) Bark-scale envelope coding (additional flattening) Vector Quantization (VQ) process: Interleaving of spectral coefficients into new sub-vectors Vector quantization (two sets of codebooks, weighted distortion measure allows distortion control by perceptual model) no bit/noise allocation or rate control iteration Dr. Jürgen Herre, hrr@iis.fhg.de 10

Transform-Domain Weighted Interleave VQ (3) Interleave VQ Structure: Input Vector Interleaving Sub-Vectors (Interleaved) Perceptual Weights Weigthed VQ Weigthed VQ Weigthed VQ Weigthed VQ Vector Quantization Index Values Dr. Jürgen Herre, hrr@iis.fhg.de 11

Conclusions MPEG-4 General Audio Coding: The all-round coding system among the MPEG-4 audio schemes, providing a set powerful tools Based on MPEG-2 Advanced Audio Coding kernel MPEG-4 enhancements add functionality improved coding efficiency Dr. Jürgen Herre, hrr@iis.fhg.de 12