MPEG-4 Version 2 Audio Workshop: HILN - Parametric Audio Coding

Heiko Purnhagen
Laboratorium für Informationstechnologie
University of Hannover, Germany

Outline
- Introduction
- What is "Parametric Audio Coding"?
- HILN - MPEG-4 Parametric Audio Coding
- Audio Demonstration
- Outlook

Introduction: MPEG-4 Audio Coding
[Figure: block diagram with the blocks Audio Signal, Analysis, Coding, Bit Stream, Source Model, Perception Model]
MPEG-4 Audio Requirements:
- efficient coding (various content types / bitrates)
- other functionality (e.g. scalability)
=> Combination of coding techniques required
- utilise different source and perception models

Introduction: Speech & Audio Coding
Established coding techniques:
- Speech coding: Excitation + Resonances (CELP)
  => source model extensively exploited
- Audio coding: Spectral Decomposition (transform coding)
  => perception model extensively exploited
=> "waveform" coding techniques

What is "Parametric Audio Coding"?
Representation of audio signal x:
- physical: waveform x(t)
- abstract: musical score (compact, ambiguous)
  => promising approach for efficient audio coding
Encoding: physical repr. => abstract repr.
- but: automatic transcription to score very difficult!
Audio Coding: compact representation of audio, automatically derived from real-world signal

What is "Parametric Audio Coding"?
Some source models for audio signals:

Model Assumption        Model Parameters
(quasi-)stationary      spectral samples
physically generated    excitation + resonances
pure tones              freq. & ampl. of sinusoids
transients              amplitude envelope
noise                   noise spectrum

=> Problem: Choice of source model?
Efficiency vs. Generality (specialised model not suitable for arbitrary signals)

What is "Parametric Audio Coding"?
Parametric Audio Coding:
- combination of different source models
  => decompose audio signal into components
- utilise perception models
  => pick "most relevant" components
- Sound represented by model parameters
  => waveform approximation not necessary
Parameter Quantisation and Coding:
- quant. step size: "just noticeable differences"
- entropy coding
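The selection and quantisation steps can be illustrated with a minimal Python sketch. It is not the HILN encoder. Ranking by amplitude only stands in for a real perception model, and the step sizes (1.5 dB in amplitude, 1/32 octave in frequency), the 30 Hz reference and the function names are assumptions chosen for illustration. Entropy coding of the resulting indices is left out.

```python
import math

# Hypothetical step sizes standing in for "just noticeable differences".
AMP_STEP_DB = 1.5          # assumed amplitude step in dB
FREQ_STEP_OCT = 1.0 / 32   # assumed frequency step in octaves
FREQ_REF_HZ = 30.0         # assumed reference frequency of the log scale


def select_components(components, max_count):
    """Keep the max_count (freq_hz, amp) pairs with the largest amplitude.

    Amplitude is only a placeholder for perceptual relevance.
    """
    ranked = sorted(components, key=lambda c: c[1], reverse=True)
    return ranked[:max_count]


def quantise(freq_hz, amp):
    """Map frequency and amplitude to integer indices on log scales."""
    freq_idx = round(math.log2(freq_hz / FREQ_REF_HZ) / FREQ_STEP_OCT)
    amp_idx = round(20.0 * math.log10(amp) / AMP_STEP_DB)
    return freq_idx, amp_idx


def dequantise(freq_idx, amp_idx):
    """Invert quantise() up to the rounding error of half a step."""
    freq_hz = FREQ_REF_HZ * 2.0 ** (freq_idx * FREQ_STEP_OCT)
    amp = 10.0 ** (amp_idx * AMP_STEP_DB / 20.0)
    return freq_hz, amp


components = [(440.0, 0.5), (1000.0, 0.02), (880.0, 0.25), (3000.0, 0.1)]
kept = select_components(components, 3)
indices = [quantise(f, a) for f, a in kept]
decoded = [dequantise(fi, ai) for fi, ai in indices]
```

The decoded values differ from the originals by at most half a step on each log scale, which is the sense in which the waveform itself need not be approximated: only the parameters are.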

MPEG-4 Parametric Audio Coding
MPEG-4 Audio Version 2: HILN, "Harmonic and Individual Lines plus Noise"
- harmonic lines: fundamental freq. & LPC spectrum
- individual lines: frequency & amplitude [opt.: ampl. envelope, start phase]
- noise: LPC spectrum
=> 4 to 16 kbit/s @ 8 kHz bandwidth (typ.)
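How the three component types combine into a signal can be sketched as a toy synthesiser. This is a simplified illustration, not the normative HILN decoder. The harmonic amplitudes are passed in directly instead of being derived from an LPC spectrum, the noise is white instead of LPC-shaped, and amplitude envelopes and start phases are ignored. The function name, its parameters and the 16 kHz sampling rate are assumptions.

```python
import math
import random

SAMPLE_RATE = 16000  # Hz, assumed sampling rate for this sketch


def synthesise_frame(num_samples, fundamental_hz, harmonic_amps,
                     individual_lines, noise_gain, seed=0):
    """Return one frame as a list of float samples."""
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    frame = []
    for n in range(num_samples):
        t = n / SAMPLE_RATE
        sample = 0.0
        # Harmonic lines: partials at integer multiples of the fundamental.
        for k, amp in enumerate(harmonic_amps, start=1):
            sample += amp * math.sin(2.0 * math.pi * k * fundamental_hz * t)
        # Individual lines: sinusoids with their own frequency and amplitude.
        for freq_hz, amp in individual_lines:
            sample += amp * math.sin(2.0 * math.pi * freq_hz * t)
        # Noise: white noise scaled by a gain (no spectral shaping here).
        sample += noise_gain * rng.uniform(-1.0, 1.0)
        frame.append(sample)
    return frame


# 512 samples correspond to 32 ms at 16 kHz.
frame = synthesise_frame(512, 200.0, [0.4, 0.2, 0.1], [(1370.0, 0.05)], 0.01)
```

Because the decoder regenerates the sound from parameters, the output is not expected to match the original waveform sample by sample.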

MPEG-4 Parametric Audio Coding
[Figure: block diagram of the Parametric Audio Encoder (HILN). Blocks: Audio Signal; Model Based Decomposition; Parameter Estimation (Harmonic Components, Sinusoidal Components, Noise Components); Perception Model (selection of relevant components); Parameter Coding (Quant for each component type); Mux; Bit Stream]

MPEG-4 Parametric Audio Coding
Demo: HILN 16 kHz mono @ 6 kbit/s, Original Signal
[Figure: block diagram of the Parametric Audio Decoder (HILN). Blocks: Bit Stream; Demux; Parameter Decoding (Dequant for each component type); Synthesis (Harmonic Components, Sinusoidal Components, Noise Components); Audio Signal]

MPEG-4 Parametric Audio Coding
Additional Functionality of HILN
- Bitrate scalability:
  - base layer (major signal components)
  - enhancement layer(s) (minor signal components)
- Signal modification in decoder:
  - speed and pitch change (parameter modification)
- Error Robustness (UEP)
- Integrated Parametric Decoder (HVXC + HILN)
  Example: speech + background music
  2 kbit/s HVXC + 4 kbit/s HILN = 6 kbit/s
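The idea behind bitrate scalability can be sketched as follows: components are ordered by importance, the most important ones go into the base layer, and the rest go into enhancement layers that a decoder may or may not receive. This is an illustration under assumed names (`split_into_layers`, `decode_layers`) and an assumed amplitude ordering. It does not reproduce the HILN bit stream syntax.

```python
def split_into_layers(components, layer_sizes):
    """Split components (ordered by decreasing importance) into layers.

    layer_sizes[0] is the base layer size; the rest are enhancement layers.
    """
    layers = []
    start = 0
    for size in layer_sizes:
        layers.append(components[start:start + size])
        start += size
    return layers


def decode_layers(layers, num_received):
    """Collect the components of the first num_received layers."""
    decoded = []
    for layer in layers[:num_received]:
        decoded.extend(layer)
    return decoded


# (frequency in Hz, amplitude), already sorted by decreasing amplitude.
components = [(440.0, 0.5), (880.0, 0.25), (3000.0, 0.1), (1000.0, 0.02)]
layers = split_into_layers(components, [2, 1, 1])
base_only = decode_layers(layers, 1)
all_layers = decode_layers(layers, 3)
```

A decoder that receives only the base layer still reconstructs the major components, and each further layer adds minor ones.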

Audio Demonstration
Demo: HILN 16 kHz mono @ 6 kbit/s
Signal modification in decoder:
- pitch change +20%
- speed change +20%
Comparison of MPEG-4 coding techniques:
- original (8 kHz bandwidth)
- speech coding (CELP) 6 kbit/s
- transform coding (TwinVQ) 6 kbit/s
- parametric audio coding (HILN) 6 kbit/s
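Pitch and speed changes follow from modifying the decoded parameters before synthesis. A minimal sketch, assuming a hypothetical `Frame` record that holds only a frame duration and the line frequencies: a pitch change scales every frequency, and a speed change scales the duration over which each frame is synthesised. Noise spectra and amplitude envelopes, which a real decoder would also have to adapt, are left out.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Frame:
    """Hypothetical per-frame parameter set used only for this sketch."""
    duration_s: float
    fundamental_hz: float
    line_freqs_hz: tuple


def change_pitch(frame, factor):
    """Scale all frequencies by factor; the duration is unchanged."""
    return replace(
        frame,
        fundamental_hz=frame.fundamental_hz * factor,
        line_freqs_hz=tuple(f * factor for f in frame.line_freqs_hz),
    )


def change_speed(frame, factor):
    """Play the frame factor times faster; frequencies are unchanged."""
    return replace(frame, duration_s=frame.duration_s / factor)


frame = Frame(duration_s=0.032, fundamental_hz=200.0, line_freqs_hz=(1370.0,))
higher = change_pitch(frame, 1.2)   # pitch change +20%
faster = change_speed(frame, 1.2)   # speed change +20%
```

The two modifications are independent, which is what distinguishes them from simply resampling a waveform, where pitch and speed change together.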

Outlook
HILN Encoder Optimisation (not normative)
- improved signal segmentation (e.g. tonal / noise discrimination)
- improved parameter estimation & tracking
Combination of coding techniques
- select best coding technique for actual signal
- segmentation into audio objects (e.g. automatic segmentation of speech / music)
... but: Audio objects are transparent!

Further reading
- Official MPEG Home Page: http://www.cselt.it/mpeg/
- MPEG Audio Web Page: http://www.tnt.uni-hannover.de/project/mpeg/audio/
- Parametric Audio Coding Bibliography: http://www.tnt.uni-hannover.de/~purnhage/