Perceptual Audio Coders What to listen for: Artifacts of Parametric Coding

Similar documents
MPEG-4 Version 2 Audio Workshop: HILN - Parametric Audio Coding

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.

The MPEG-4 General Audio Coder

SAOC and USAC. Spatial Audio Object Coding / Unified Speech and Audio Coding. Lecture Audio Coding WS 2013/14. Dr.-Ing.

6MPEG-4 audio coding tools

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.

Parametric Coding of High-Quality Audio

MPEG-4 General Audio Coding

Lecture 16 Perceptual Audio Coding

Principles of Audio Coding

Optical Storage Technology. MPEG Data Compression

Mpeg 1 layer 3 (mp3) general overview

New Results in Low Bit Rate Speech Coding and Bandwidth Extension

2.4 Audio Compression

Lecture 7: Audio Compression & Coding

Parametric Coding of Spatial Audio

ISO/IEC INTERNATIONAL STANDARD. Information technology Coding of audio-visual objects Part 3: Audio

25*$1,6$7,21,17(51$7,21$/('(1250$/,6$7,21,62,(&-7&6&:* &2',1*2)029,1*3,&785(6$1'$8',2 &DOOIRU3URSRVDOVIRU1HZ7RROVIRU$XGLR&RGLQJ

Audio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011

Spectral modeling of musical sounds

Parametric Audio Based Decoder and Music Synthesizer for Mobile Applications

ROW.mp3. Colin Raffel, Jieun Oh, Isaac Wang Music 422 Final Project 3/12/2010

An investigation of non-uniform bandwidths auditory filterbank in audio coding

Speech and audio coding

AUDIOVISUAL COMMUNICATION

Digital Audio for Multimedia

Ch. 5: Audio Compression Multimedia Systems

DAB. Digital Audio Broadcasting

AUDIOVISUAL COMMUNICATION

Structural analysis of low latency audio coding schemes

Appendix 4. Audio coding algorithms

ISO/IEC INTERNATIONAL STANDARD. Information technology MPEG audio technologies Part 3: Unified speech and audio coding

<< WILL FILL IN THESE SECTIONS THIS WEEK to provide sufficient background>>

MPEG-4 Structured Audio Systems

The Sensitivity Matrix

Multimedia Communications. Audio coding

Audio-coding standards

Fundamentals of Perceptual Audio Encoding. Craig Lewiston HST.723 Lab II 3/23/06

MPEG-7 Audio: Tools for Semantic Audio Description and Processing

Audio-coding standards

Workshop W14 - Audio Gets Smart: Semantic Audio Analysis & Metadata Standards

1 Introduction. 2 Speech Compression

Audio Compression Using Decibel chirp Wavelet in Psycho- Acoustic Model

Audio Coding and MP3

Aalborg Universitet. Published in: Proc. of Asilomar Conference on Signals, Systems, and Computers. Publication date: 2008

Speech-Music Discrimination from MPEG-1 Bitstream

Effect of MPEG Audio Compression on HMM-based Speech Synthesis

Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio

Mahdi Amiri. February Sharif University of Technology

MPEG SURROUND: THE FORTHCOMING ISO STANDARD FOR SPATIAL AUDIO CODING

Efficient Signal Adaptive Perceptual Audio Coding

5: Music Compression. Music Coding. Mark Handley

ELL 788 Computational Perception & Cognition July November 2015

A PSYCHOACOUSTIC MODEL WITH PARTIAL SPECTRAL FLATNESS MEASURE FOR TONALITY ESTIMATION

MPEG-4 Advanced Audio Coding

Multimedia Systems Speech II Hmid R. Rabiee Mahdi Amiri February 2015 Sharif University of Technology

Technical PapER. between speech and audio coding. Fraunhofer Institute for Integrated Circuits IIS

14th European Signal Processing Conference (EUSIPCO 2006), Florence, Italy, September 4-8, 2006, copyright by EURASIP

CT516 Advanced Digital Communications Lecture 7: Speech Encoder

Signal Coding Pulse Modulation:

Perceptual Pre-weighting and Post-inverse weighting for Speech Coding

Convention Paper 8654 Presented at the 132nd Convention 2012 April Budapest, Hungary

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO

The Reference Model Architecture for MPEG Spatial Audio Coding

Speech-Coding Techniques. Chapter 3

Multimedia Systems Speech II Mahdi Amiri February 2012 Sharif University of Technology

TEMPORAL ENVELOPE CORRECTION FOR ATTACK RESTORATION IN LOW BIT-RATE AUDIO CODING

/ / _ / _ / _ / / / / /_/ _/_/ _/_/ _/_/ _\ / All-American-Advanced-Audio-Codec

EE482: Digital Signal Processing Applications

Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig

Data Compression. Audio compression

MPEG-4 aacplus - Audio coding for today s digital media world

EE Multimedia Signal Processing. Scope & Features. Scope & Features. Multimedia Signal Compression VI (MPEG-4, 7)

MPEG-1 Bitstreams Processing for Audio Content Analysis

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved.

The following bit rates are recommended for broadcast contribution employing the most commonly used audio coding schemes:

KINGS COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY ACADEMIC YEAR / ODD SEMESTER QUESTION BANK

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding

Packet Loss Concealment for Audio Streaming based on the GAPES and MAPES Algorithms

Design and Implementation of an MPEG-1 Layer III Audio Decoder KRISTER LAGERSTRÖM

ijdsp Interactive Illustrations of Speech/Audio Processing Concepts

Implementation of a MPEG 1 Layer I Audio Decoder with Variable Bit Lengths

SPREAD SPECTRUM AUDIO WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL

DSP. Presented to the IEEE Central Texas Consultants Network by Sergio Liberman

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007

Low Bitrate Coding of Spot Audio Signals for Interactive and Immersive Audio Applications

Source Coding Basics and Speech Coding. Yao Wang Polytechnic University, Brooklyn, NY11201

Scalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC

Convention Paper Presented at the 140 th Convention 2016 June 4 7, Paris, France

Audio Engineering Society. Convention Paper. Presented at the 126th Convention 2009 May 7 10 Munich, Germany

MPEG-1. Overview of MPEG-1 1 Standard. Introduction to perceptual and entropy codings

Audio and video compression

Chapter 14 MPEG Audio Compression

FINE-GRAIN SCALABLE AUDIO CODING BASED ON ENVELOPE RESTORATION AND THE SPIHT ALGORITHM

Audio Coding Standards

Audio coding for digital broadcasting

Estimating MP3PRO Encoder Parameters From Decoded Audio

Rich Recording Technology Technical overall description

Application Note PEAQ Audio Objective Testing in ClearView

Transporting audio-video. over the Internet

Transcription:

Perceptual Audio Coders What to listen for: Artifacts of Parametric Coding Heiko Purnhagen, Bernd Edler University of AES 109th Convention, Los Angeles, September 22-25, 2000 1

Introduction: Parametric Coding Audio Signal Analysis Coding Bitstream Source Model Perception Model Established coding techniques: Speech coding: Excitation + Resonances (CELP) source model extensively exploited Audio coding: Spectral Decomposition (MPEG-1/2) perception model extensively exploited waveform coding techniques 2

Introduction: Parametric Coding What is Parametric Audio Coding? Idea: use abstract representation of audio signals (musical score is more compact than waveform) decompose input signal into components select appropriate source models for components describe components by model parameters use perception models to pick relevant compon.s attractive for very low bitrate coding Sound represented by model parameters waveform approximation not necessary 3

Introduction: Parametric Coding Examples of Parametric Coders: Sinusoidal coding N ˆx(t) = i=1 a i (t) sin(ϕ i + 2π t 0 f i(τ) dτ) Extentions to sinusoidal coding: +noise, +transients MPEG-4 HVXC (parametric speech coder) MPEG-4 HILN (parametric audio coder) Q-Design QDMC (?) various approaches, ongoing development 4

Introduction: Coder Example Example: MPEG-4 Parametric Audio Coder HILN ( Harmonic and Individual Lines plus Noise ) Component models and parameters in HILN: harmonic lines: individual lines: noise: fundamental freq. & LPC spectrum frequency & amplitude [opt.: ampl. envelope, start phase] LPC spectrum Note: non-deterministic decoder behaviour (noise generator, random start phases) 5

Introduction: Coder Example Example: MPEG-4 HILN @ 6 kbit/s (f s = 16 khz) Original Signal Parameter Decoding Model Based Synthesis Dequant Harmonic Components Bitstream Demux Dequant Dequant Sinusoidal Components Noise Components Audio Signal Block diagram of HILN decoder 6

Potential Parametric Coding Artifacts Potential artifacts related to source models: limitations of source models bad decomposition (hard decisions are problematic) bad parameter estimation Potential artifacts related to perception models: quantisation (consider just noticeable differences ) selection of most relevant components is phase information irrelevant? (transients, clipping in sinusoidal synthesiser) 7

Examples of Artifacts Parametric coding: no waveform approximation difference signal meaningless original: pop music coded by parametric audio coder difference signal (original-coded) Limitations of source models: model noise with sinusoids (e.g. applause) original: white noise coded using 0 to 120 sinusoids 8

Examples of Artifacts Limitations of source models: no model for transient (percussive) components original: castanets coded using sinusoids + noise same, but with amplitude envelopes enabled 9

Examples of Artifacts Limitations of source models: specialised speech model not suitable for music original: speech coded by parametric speech coder original: pop music coded by parametric speech coder 10

Examples of Artifacts Bad signal decomposition: many sinusoids forced on harmonic grid original: orchestral music coded (harmonic component too strong) Bad signal decomposition: many tonal components modelled as noise original: pop music coded (noise component too strong) 11

Summary & Outlook Summary: Parametric Coding attractive for very low bitrate audio coding new types of artifacts (sounds synthetic?) more chances for unlucky decisions in encoder Outlook: ongoing development parametric coding is still a young technique encoders will improve... parametric encoding = auditory scene analysis? 12

further reading... Parametric Audio Coding Bibliography http://www.tnt.uni-hannover.de/ purnhage/ MPEG Audio Web Page (tutorials, test reports, etc.) http://www.tnt.uni-hannover.de/project/mpeg/audio/ 13