Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio
|
|
- Godfrey Day
- 6 years ago
- Views:
Transcription
1 Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio Dr. Jürgen Herre 11/07 Page 1 Jürgen Herre für (IIS) Erlangen, Germany
2 Introduction: Sound Images? Humans live in a world of sound continuously listen to the sound surrounding us perceive the acoustic waves reaching the ears and interpret them (re)construct an acoustic scene In analogy to visual perception, we form a sound image... This talk: How to efficiently represent and reproduce sound images! Dr. Jürgen Herre 11/07 Page 2
3 Overview Part I Part II Part III What constitutes a sound image? Efficient coding of spatial sound images Perceptual audio coding Coding of multi-channel / surround sound Spatial Audio Coding & MPEG Surround Next-generation interactive coding / rendering of sound images: Spatial Audio Object Coding Demonstration Dr. Jürgen Herre 11/07 Page 3
4 Part I: What Constitutes A Sound Image? Dr. Jürgen Herre 11/07 Page 4
5 Spatial Sound Perception The Physics Part We are permanently listening to sound through our two sensors (ears) Ears receive a complex sound mixture of direct sound that is radiated from several sources (frequently concurrently), and many reflections from our acoustic environment ( ambient sound ) Dr. Jürgen Herre 11/07 Page 5
6 Spatial Sound Perception (2) The Perceptual Part Some Key Words Humans interpret sound to make sense of it Map it to an internal sound scene which is a perceptual correlate of the actual physical scene (not necessarily identical!) Very complex & sophisticated process - by far not fully understood! Psychoacoustics [Zwicker, Moore, Fletcher, Blauert, ] Perceptual streaming, auditory scene analysis [Bregman, ] How to form a consistent interpretation of the acoustic world from the stream of received information fragments Dr. Jürgen Herre 11/07 Page 6
7 Spatial Sound Perception (3) A Note of Caution In the following, we will look at the phenomenon of spatial audio perception in a grossly simplified, yet illustrative and very useful way Allows immediate technical application to efficient coding of spatial sound Highlight some analogies between sound images and visual images Frequency Selectivity The Human Auditory System processes sound in a frequency selective way (roughly logarithmic, ca. 1/3 octave BARK and ERB scales) Dr. Jürgen Herre 11/07 Page 7
8 Spatial Sound Perception (4) Sound Localization Complex process, mostly driven by interaural signal differences L R Depending on its position, a sound source produces differences in the ear signals Most important perceptual cues contained in the ear signals: e.g. [Blauert 1984] Interaural Level Differences (ILD) Interaural Phase/Time Differences (IPD/ITD) (relevant up to ca. 4 khz frequency) Temporal signal envelope (high frequencies) ILD, IPD/ITD determine perceived lateral position of sound sources! Dr. Jürgen Herre 11/07 Page 8
9 Spatial Sound Perception (5) In realistic closed rooms, reflections from the walls (and other objects) occur These blur the simple ILD, IPD/ITD relations by decorrelating the ear signals Decorrelated sound at the two ears represents reflections and thus provides information on the acoustic environment Distinct early reflections nearby walls Late reverberation statistical summary of acoustic environment / room properties Auditory system needs to parse sound into direct & ambient sound! Dr. Jürgen Herre 11/07 Page 9
10 Spatial Sound Perception (6) e1 [Faller 2004] Dr. Jürgen Herre 11/07 Page 10 e2 Important role of Interaural Coherence (IC): Determines spatial extent (=width) of sound event [Schuijers03] [Faller03] Perceived width of auditory event increases as coherence decreases (1 4); eventually separates into 2 distinct events Max. of normalized cross-correlation: IC = max d $ % n=#$ $ e 1 (n) " e 2 (n + d) % e 2 1 (n) " % e 2 2 (n + d) Key to source width and source/ambience perception! n=#$ $ n=#$
11 Some A V Analogies in Scene Attributes Visual Domain Auditory Domain ( auditory cues) Foreground object Sound sources ( high IC) - directional Object position - Object position ( ILD/ITD/IPD for lat. position) Object size - Object size ( IC) Background Ambience ( low IC) - non-directional Dr. Jürgen Herre 11/07 Page 11
12 Part II: Efficient Coding Of Spatial Sound Images Dr. Jürgen Herre 11/07 Page 12
13 Basics of Audio Coding Goal Predominant Approach Represent audio data as compactly as possible while maintaining sound quality (ideally: transparent coding) Concept of perceptual audio coding Optimize subjective quality rather than objective distortion metrics (e.g. MSE/SNR) Use knowledge about signal receiver: Psychoacoustics gives limits of perception Keep coding distortion below limits! No universal source model available (unlike in speech coding) Dr. Jürgen Herre 11/07 Page 13
14 Psychoacoustics 80 db L T Masking Threshold Masker 20 Threshold in quiet Inaudible Signal Masked Sound 0 0,02 0,05 0,1 0,2 0, khz f T Dr. Jürgen Herre 11/07 Page 14
15 Demonstration: The "13 db Miracle" Historic demonstration by James D. Johnston and Karlheinz Brandenburg at AT&T Bell Laboratories in 1990 using the best psychoacoustic model available at that time... Original signal Original + white noise, SNR = 13,6 db Original + noise at threshold, SNR = 13,6 db Difference signal: White noise Difference signal: Noise at threshold SNR does not adequately describe subjective sound quality! Putting psychoacoustics to work makes a huge difference! Dr. Jürgen Herre 11/07 Page 15
16 Basic Paradigm of (Monophonic) Perceptual Audio Coding audio in Analysis Filterbank Quantization & Coding Encoding of Bitstream bitstream out Perceptual Model bitstream in Decoding of Bitstream Inverse Quantization Synthesis Filterbank audio out Dr. Jürgen Herre 11/07 Page 16
17 A Real Audio Coder (MPEG-2 AAC, 1997) Bitstream Output Bitstream Multiplexer Input time signal Gain Control Perceptual Model Filter Bank TNS Intensity/ Coupling Prediction M/S Scale Factors Quant. Rate/Distortion Control Noiseless Coding Dr. Jürgen Herre 11/07 Page 17
18 The Better Spatial Sound Image: Surround Sound / Multi-Channel Audio Significantly increased spatial realism over stereo, envelopment Origins in movie sound (5.1); now also for music, broadcasting Increasingly adopted in consumers homes Dr. Jürgen Herre 11/07 Page 18 Fraunhofer Institut
19 Traditional Delivery Formats For Surround Sound Matrixed Surround (Prologic, Neo6, ) Discrete Surround (AAC, AC-3, ) Downmix of 5.1 sound into stereo signal, upmix at the receiver side Efficient in terms of transmission bandwidth (same bitrate as stereo) Backward compatible to stereo delivery Limited computation necessary Significant loss in subjective audio quality Separate transmission of each channel Significantly higher bitrate than stereo Moderate amount of computation High subjective audio quality possible Dr. Jürgen Herre 11/07 Page 19
20 A Major Step Ahead: Spatial Audio Coding Rather recent development Main Characteristics Compression efficiency: Transmits multi-channel audio at bitrates used for 2-channel stereo (or even mono) Backward compatibility: SAC multi-channel audio is coded in a backward compatible way existing infrastructures can be seamlessly upgraded to multi-channel / surround! High subjective audio quality Heavily based on exploiting perception rather than waveform coding! Dr. Jürgen Herre 11/07 Page 20
21 The Spatial Audio Coding (SAC) Concept Spatial Audio Coding = Downmix + Parametric Spatial Synthesis e.g. ICLD, ICTD/ICPD, ICC Extracts & reproduces Inter-Channel Counterparts of Inter-Aural Parameters Dr. Jürgen Herre 11/07 Page 21
22 Related Technology Generalization / Extension of Binaural Cue Coding Parametric Coding of Multi-Channel into a Mono Channel Parametric Stereo Parametric Coding of 2-Channel Stereo into a Mono Channel Matrix Surround (Dolby Prologic, Logic 7, Circle Surround ) First commercial Application (2003) MP3 Surround Backward compatible extension of stereo MP3 towards 5.1 surround sound at kbit/s Dr. Jürgen Herre 11/07 Page 22
23 Example: MP3 Surround Encoding Principle: Surround Stereo + Side Information L R C Ls Rs Compatible Downmix Lc Rc Standard MP3 Encoder MP3 Surround Bitstream Surround BCC Encoder Ancillary Data Surround Enhancement [Herre et al. 2004] Dr. Jürgen Herre 11/07 Page 23
24 Example: MP3 Surround Decoding Principle: Stereo + Side Information Surround MP3 Surround Bitstream Standard MP3 Decoder Lc Rc Advanced Surround Stereo Multi- Playback Channel (simple Decoderdecoder) Decoding L R C Ls Rs Ancillary Data Surround Enhancement [Herre et al. 2004] Dr. Jürgen Herre 11/07 Page 24
25 MPEG Spatial Audio Coding / MPEG Surround Work item Spatial Audio Coding (SAC) Technical development 3/2004-7/2006 Main contributors: Fraunhofer IIS, Agere Systems, Coding Technologies and Philips Renamed into MPEG Surround (MPEG-D) Applications: Efficient & backward compatible upgrade of audio distribution to multi-channel, e.g.: Music download service Multi-channel streaming / Internet radio Digital audio broadcasting Dr. Jürgen Herre 11/07 Page 25
26 MPEG Surround Synthesis: General Concept Decoder Structure Input 1 hybrid analysis hybri d synthesi s Output 1 Input 2 hybrid analysis Spatial synthesis hybri d synthesi s Output 2 hybri d synthesi s Output N Spatial parameters hybrid = QMF filterbank + 2nd stage non-uniform frequency resolution relating to frequ. resolution of human auditory system Same QMF filterbank as in MPEG-4 HE-AAC (AAC + SBR) Dr. Jürgen Herre 11/07 Page 26
27 MPEG Surround Synthesis: General Concept (2) Frequency selective processing: Dynamic matrix + decorrelation Many more details, please see dedicated publications! Side information rate typ kbit/s Dr. Jürgen Herre 11/07 Page 27
28 Underlying Idea: Hierarchical En/decoding Encoder Coding Decoder l f l b OTT encoder element OTT decoder element l f l b c lfe r f r b OTT encoder element OTT encoder element TTT encoder element l 0 r 0 l 0 r 0 TTT decoder element OTT decoder element OTT decoder element c lfe r f r b Spatial parameters Spatial parameters Dr. Jürgen Herre 11/07 Page 28
29 MPEG Surround Concepts Generalization Similar trees for mono-based operation ( ), other modes ( 7-2-7, ) Spatial Parameters Channel Level Differences (CLDs) Inter-Channel Correlations (ICCs) Channel Prediction Coefficients (CPCs) Prediction errors (residuals) Other Aspects Decorrelation by QMF-domain all-pass filters Several tools for handling fine temporal envelope structure (both without and with additional side information) Dr. Jürgen Herre 11/07 Page 29
30 MPEG Surround: Additional Functionalities Artistic Downmix Matrix Surround Compatibility Enhanced Matrix Mode Binaural Rendering for headphone Externally created downmixes can be used Stereo downmix can be made compatible with common matrix surround decoders MPEG Surround decoder can decode matrix surround signal (i.e. work without side info) Downmix can be generated as virtual surround, or MPEG Surround can be decoded directly into virtual surround very efficiently Rich set of attractive features for practical application Dr. Jürgen Herre 11/07 Page 30
31 MPEG Surround: Recent Verification Test MPEG Surround MPEG Surround without side-info Matrixed Surround (Dolby Prologic II) Dr. Jürgen Herre 11/07 Page 31 Music-Store test scenario: Stereo downmix coded using
32 Part III: Next Generation Interactive Coding / Rendering of Sound Images From Spatial Audio Coding to Spatial Audio Object Coding Dr. Jürgen Herre 11/07 Page 32
33 Classic MPEG-4 Interactive Scene Composition (1996ff) audiovisual objects hierarchically multiplexed downstream control / data hierarchically multiplexed upstream control / data video compositor projection plane voice sprite 2D background scene coordinate system z y x audiovisual presentation user events 3D objects audio compositor voice Hierarchy of objects person sprite 2D background scene globe furniture desk audiovisual presentation Dr. Jürgen Herre 11/07 Page 33 hypothetical viewer speaker display user input Scene is composed of multiple A/V objects and can be rendered interactively
34 MPEG-4 Object Based (De)Coding Discrete approach comes at a rather high price: Bitrate and decoding complexity grow with number of objects Structural complexity Dr. Jürgen Herre 11/07 Page 34
35 From Spatial Audio Coding (SAC) to SAOC Regular Spatial Audio Coding: Channel-oriented scheme (MPEG Surround) Chan. #1 Chan. #2 Chan. #3 SAC Encoder Downmix signal(s) SAC Decoder Chan. #1 Chan. #2 Chan. #3 Chan. #4... Side Info Chan. #4... Dr. Jürgen Herre 11/07 Page 35
36 From SAC to SAOC (2) Alternative: Object-oriented Spatial Audio Coding Obj. #1 Obj. #2 Obj. #3 Obj. #4... SAOC Encoder Downmix signal(s) Side Info SAOC Decoder obj. #1 obj. #2 obj. #3 obj. #4... Renderer Chan. #1 Chan. #2... User Interaction / Ctrl. Processes object signals instead of channel signals Mixing /rendering parameters vary according to user interaction Combined obj. decoding & rendering computationally efficient! Previous work by Faller & Baumgarte [2001ff] and Faller [2006] Dr. Jürgen Herre 11/07 Page 36
37 Real-time interactive rendering of audio objects from a mono audio downmix + SAOC side information Dr. Jürgen Herre 11/07 Page 37
38 New MPEG Standardization Activities Work on Spatial Audio Object Coding (SAOC) started Transcoding approach: SAOC + rendering info MPEG Surround SAOC Decoder Reference model and working draft recently established (10/2007) Dr. Jürgen Herre 11/07 Page 38
39 Conclusions Sound Images carry some analogy to images in the visual world Spatial Audio Coding schemes code surround sound based on perception (rather than on waveform match) Object positions are represented by perceptual spatial parameters Audio Object Texture is coded using regular mono/stereo coder Such schemes can bring surround sound into existing infrastructures High compression factor (surround sound at 64kbps and below!) Stereo / mono backward compatibility Extension towards efficient interactive, object-based scene coding / rendering (Spatial Audio Object Coding) is currently on its way Dr. Jürgen Herre 11/07 Page 39
40 Dr. Jürgen Herre 11/07 Page 40
Parametric Coding of Spatial Audio
Parametric Coding of Spatial Audio Ph.D. Thesis Christof Faller, September 24, 2004 Thesis advisor: Prof. Martin Vetterli Audiovisual Communications Laboratory, EPFL Lausanne Parametric Coding of Spatial
More informationMPEG-4 General Audio Coding
MPEG-4 General Audio Coding Jürgen Herre Fraunhofer Institute for Integrated Circuits (IIS) Dr. Jürgen Herre, hrr@iis.fhg.de 1 General Audio Coding Solid state players, Internet audio, terrestrial and
More informationThe MPEG-4 General Audio Coder
The MPEG-4 General Audio Coder Bernhard Grill Fraunhofer Institute for Integrated Circuits (IIS) grl 6/98 page 1 Outline MPEG-2 Advanced Audio Coding (AAC) MPEG-4 Extensions: Perceptual Noise Substitution
More informationMpeg 1 layer 3 (mp3) general overview
Mpeg 1 layer 3 (mp3) general overview 1 Digital Audio! CD Audio:! 16 bit encoding! 2 Channels (Stereo)! 44.1 khz sampling rate 2 * 44.1 khz * 16 bits = 1.41 Mb/s + Overhead (synchronization, error correction,
More informationPerceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding
Perceptual Coding Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding Part II wrap up 6.082 Fall 2006 Perceptual Coding, Slide 1 Lossless vs.
More informationThe Reference Model Architecture for MPEG Spatial Audio Coding
The Reference Model Architecture for MPEG J. Herre 1, H. Purnhagen 2, J. Breebaart 3, C. Faller 5, S. Disch 1, K. Kjörling 2, E. Schuijers 4, J. Hilpert 1, F. Myburg 4 1 Fraunhofer Institute for Integrated
More informationSAOC and USAC. Spatial Audio Object Coding / Unified Speech and Audio Coding. Lecture Audio Coding WS 2013/14. Dr.-Ing.
SAOC and USAC Spatial Audio Object Coding / Unified Speech and Audio Coding Lecture Audio Coding WS 2013/14 Dr.-Ing. Andreas Franck Fraunhofer Institute for Digital Media Technology IDMT, Germany SAOC
More informationMPEG SURROUND: THE FORTHCOMING ISO STANDARD FOR SPATIAL AUDIO CODING
MPEG SURROUND: THE FORTHCOMING ISO STANDARD FOR SPATIAL AUDIO CODING LARS VILLEMOES 1, JÜRGEN HERRE 2, JEROEN BREEBAART 3, GERARD HOTHO 3, SASCHA DISCH 2, HEIKO PURNHAGEN 1, AND KRISTOFER KJÖRLING 1 1
More informationPrinciples of Audio Coding
Principles of Audio Coding Topics today Introduction VOCODERS Psychoacoustics Equal-Loudness Curve Frequency Masking Temporal Masking (CSIT 410) 2 Introduction Speech compression algorithm focuses on exploiting
More informationMPEG-4 aacplus - Audio coding for today s digital media world
MPEG-4 aacplus - Audio coding for today s digital media world Whitepaper by: Gerald Moser, Coding Technologies November 2005-1 - 1. Introduction Delivering high quality digital broadcast content to consumers
More informationELL 788 Computational Perception & Cognition July November 2015
ELL 788 Computational Perception & Cognition July November 2015 Module 11 Audio Engineering: Perceptual coding Coding and decoding Signal (analog) Encoder Code (Digital) Code (Digital) Decoder Signal (analog)
More informationChapter 14 MPEG Audio Compression
Chapter 14 MPEG Audio Compression 14.1 Psychoacoustics 14.2 MPEG Audio 14.3 Other Commercial Audio Codecs 14.4 The Future: MPEG-7 and MPEG-21 14.5 Further Exploration 1 Li & Drew c Prentice Hall 2003 14.1
More informationAudio-coding standards
Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.
More informationEfficiënte audiocompressie gebaseerd op de perceptieve codering van ruimtelijk geluid
nederlands akoestisch genootschap NAG journaal nr. 184 november 2007 Efficiënte audiocompressie gebaseerd op de perceptieve codering van ruimtelijk geluid Philips Research High Tech Campus 36 M/S2 5656
More informationAudio-coding standards
Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.
More informationPerceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.
Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general
More informationParametric Coding of High-Quality Audio
Parametric Coding of High-Quality Audio Prof. Dr. Gerald Schuller Fraunhofer IDMT & Ilmenau Technical University Ilmenau, Germany 1 Waveform vs Parametric Waveform Filter-bank approach Mainly exploits
More informationEE482: Digital Signal Processing Applications
Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 13 Audio Signal Processing 14/04/01 http://www.ee.unlv.edu/~b1morris/ee482/
More informationBoth LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.
Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general
More informationINTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO
INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO ISO/IEC JTC1/SC29/WG11 N15071 February 2015, Geneva,
More informationNew Results in Low Bit Rate Speech Coding and Bandwidth Extension
Audio Engineering Society Convention Paper Presented at the 121st Convention 2006 October 5 8 San Francisco, CA, USA This convention paper has been reproduced from the author's advance manuscript, without
More informationAudio coding for digital broadcasting
Recommendation ITU-R BS.1196-4 (02/2015) Audio coding for digital broadcasting BS Series Broadcasting service (sound) ii Rec. ITU-R BS.1196-4 Foreword The role of the Radiocommunication Sector is to ensure
More informationAudio Compression. Audio Compression. Absolute Threshold. CD quality audio:
Audio Compression Audio Compression CD quality audio: Sampling rate = 44 KHz, Quantization = 16 bits/sample Bit-rate = ~700 Kb/s (1.41 Mb/s if 2 channel stereo) Telephone-quality speech Sampling rate =
More information2.4 Audio Compression
2.4 Audio Compression 2.4.1 Pulse Code Modulation Audio signals are analog waves. The acoustic perception is determined by the frequency (pitch) and the amplitude (loudness). For storage, processing and
More informationLecture 16 Perceptual Audio Coding
EECS 225D Audio Signal Processing in Humans and Machines Lecture 16 Perceptual Audio Coding 2012-3-14 Professor Nelson Morgan today s lecture by John Lazzaro www.icsi.berkeley.edu/eecs225d/spr12/ Hero
More informationJND-based spatial parameter quantization of multichannel audio signals
Gao et al. EURASIP Journal on Audio, Speech, and Music Processing (2016) 2016:13 DOI 10.1186/s13636-016-0091-z RESEARCH JND-based spatial parameter quantization of multichannel audio signals Li Gao 1,2,3,
More informationAUDIOVISUAL COMMUNICATION
AUDIOVISUAL COMMUNICATION Laboratory Session: Audio Processing and Coding The objective of this lab session is to get the students familiar with audio processing and coding, notably psychoacoustic analysis
More informationAudio Coding and MP3
Audio Coding and MP3 contributions by: Torbjørn Ekman What is Sound? Sound waves: 20Hz - 20kHz Speed: 331.3 m/s (air) Wavelength: 165 cm - 1.65 cm 1 Analogue audio frequencies: 20Hz - 20kHz mono: x(t)
More informationAUDIOVISUAL COMMUNICATION
AUDIOVISUAL COMMUNICATION Laboratory Session: Audio Processing and Coding The objective of this lab session is to get the students familiar with audio processing and coding, notably psychoacoustic analysis
More informationCompressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved.
Compressed Audio Demystified Why Music Producers Need to Care About Compressed Audio Files Download Sales Up CD Sales Down High-Definition hasn t caught on yet Consumers don t seem to care about high fidelity
More information5: Music Compression. Music Coding. Mark Handley
5: Music Compression Mark Handley Music Coding LPC-based codecs model the sound source to achieve good compression. Works well for voice. Terrible for music. What if you can t model the source? Model the
More informationIntroducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd
Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd Overview Audio Signal Processing Applications @ Dolby Audio Signal Processing Basics
More informationMultimedia Communications. Audio coding
Multimedia Communications Audio coding Introduction Lossy compression schemes can be based on source model (e.g., speech compression) or user model (audio coding) Unlike speech, audio signals can be generated
More informationISO/IEC INTERNATIONAL STANDARD. Information technology MPEG audio technologies Part 3: Unified speech and audio coding
INTERNATIONAL STANDARD This is a preview - click here to buy the full publication ISO/IEC 23003-3 First edition 2012-04-01 Information technology MPEG audio technologies Part 3: Unified speech and audio
More informationAudio Coding Standards
Audio Standards Kari Pihkala 13.2.2002 Tik-111.590 Multimedia Outline Architectural Overview MPEG-1 MPEG-2 MPEG-4 Philips PASC (DCC cassette) Sony ATRAC (MiniDisc) Dolby AC-3 Conclusions 2 Architectural
More informationIntroducing Audio Signal Processing & Audio Coding. Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd
Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories,
More informationSAOC FOR GAMING THE UPCOMING MPEG STANDARD ON PARAMETRIC OBJECT BASED AUDIO CODING
FOR GAMING THE UPCOMING MPEG STANDARD ON PARAMETRIC OBJECT BASED AUDIO CODING LEONID TERENTIEV 1, CORNELIA FALCH 1, OLIVER HELLMUTH 1, JOHANNES HILPERT 1, WERNER OOMEN 2, JONAS ENGDEGÅRD 3, HARALD MUNDT
More informationOptical Storage Technology. MPEG Data Compression
Optical Storage Technology MPEG Data Compression MPEG-1 1 Audio Standard Moving Pictures Expert Group (MPEG) was formed in 1988 to devise compression techniques for audio and video. It first devised the
More informationAppendix 4. Audio coding algorithms
Appendix 4. Audio coding algorithms 1 Introduction The main application of audio compression systems is to obtain compact digital representations of high-quality (CD-quality) wideband audio signals. Typically
More informationSpatial Audio Object Coding (SAOC) The Upcoming MPEG Standard on Parametric Object Based Audio Coding
Spatial Audio Object Coding () The Upcoming MPEG Standard on Parametric Object Based Audio Coding Jonas Engdegård 1, Barbara Resch 1, Cornelia Falch 2, Oliver Hellmuth 2, Johannes Hilpert 2, Andreas Hoelzer
More informationISO/IEC Information technology High efficiency coding and media delivery in heterogeneous environments. Part 3: 3D audio
INTERNATIONAL STANDARD ISO/IEC 23008-3 First edition 2015-10-15 Corrected version 2016-03-01 Information technology High efficiency coding and media delivery in heterogeneous environments Part 3: 3D audio
More informationConvention Paper Presented at the 124th Convention 2008 May Amsterdam, The Netherlands
Audio Engineering Society Convention Paper Presented at the 124th Convention 2008 May 17 20 Amsterdam, The Netherlands The papers at this Convention have been selected on the basis of a submitted abstract
More informationMPEG Surround binaural rendering Surround sound for mobile devices (Binaurale Wiedergabe mit MPEG Surround Surround sound für mobile Geräte)
MPEG Surround binaural rendering Surround sound for mobile devices (e Wiedergabe mit MPEG Surround Surround sound für mobile Geräte) Jan Plogsties*, Jeroen Breebaart**, Jürgen Herre*, Lars Villemoes***,
More information14th European Signal Processing Conference (EUSIPCO 2006), Florence, Italy, September 4-8, 2006, copyright by EURASIP
TRADEOFF BETWEEN COMPLEXITY AND MEMORY SIZE IN THE 3GPP ENHANCED PLUS DECODER: SPEED-CONSCIOUS AND MEMORY- CONSCIOUS DECODERS ON A 16-BIT FIXED-POINT DSP Osamu Shimada, Toshiyuki Nomura, Akihiko Sugiyama
More informationINTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO
INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO ISO/IEC JTC1/SC29 WG11 N15073 February 2015, Geneva,
More informationModule 9 AUDIO CODING. Version 2 ECE IIT, Kharagpur
Module 9 AUDIO CODING Lesson 29 Transform and Filter banks Instructional Objectives At the end of this lesson, the students should be able to: 1. Define the three layers of MPEG-1 audio coding. 2. Define
More informationBluray (
Bluray (http://www.blu-ray.com/faq) MPEG-2 - enhanced for HD, also used for playback of DVDs and HDTV recordings MPEG-4 AVC - part of the MPEG-4 standard also known as H.264 (High Profile and Main Profile)
More information1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends
Contents List of Figures List of Tables Contributing Authors xiii xxi xxiii Introduction Karlheinz Brandenburg and Mark Kahrs xxix 1 Audio quality determination based on perceptual measurement techniques
More informationChapter 4: Audio Coding
Chapter 4: Audio Coding Lossy and lossless audio compression Traditional lossless data compression methods usually don't work well on audio signals if applied directly. Many audio coders are lossy coders,
More informationMULTI-CHANNEL GOES MOBILE: MPEG SURROUND BINAURAL RENDERING
MULTI-CHANNEL GOES MOBILE: MPEG SURROUND BINAURAL RENDERING JEROEN BREEBAART 1, JÜRGEN HERRE, LARS VILLEMOES 3, CRAIG JIN 4, KRISTOFER KJÖRLING 3, JAN PLOGSTIES AND JEROEN KOPPENS 5 1 Philips Research
More informationData Compression. Audio compression
1 Data Compression Audio compression Outline Basics of Digital Audio 2 Introduction What is sound? Signal-to-Noise Ratio (SNR) Digitization Filtering Sampling and Nyquist Theorem Quantization Synthetic
More informationFor Mac and iphone. James McCartney Core Audio Engineer. Eric Allamanche Core Audio Engineer
For Mac and iphone James McCartney Core Audio Engineer Eric Allamanche Core Audio Engineer 2 3 James McCartney Core Audio Engineer 4 Topics About audio representation formats Converting audio Processing
More information6MPEG-4 audio coding tools
6MPEG-4 audio coding 6.1. Introduction to MPEG-4 audio MPEG-4 audio [58] is currently one of the most prevalent audio coding standards. It combines many different types of audio coding into one integrated
More informationChapter 5.5 Audio Programming
Chapter 5.5 Audio Programming Audio Programming Audio in games is more important than ever before 2 Programming Basic Audio Most gaming hardware has similar capabilities (on similar platforms) Mostly programming
More information3GPP TS V6.2.0 ( )
TS 26.401 V6.2.0 (2005-03) Technical Specification 3rd Generation Partnership Project; Technical Specification Group Services and System Aspects; General audio codec audio processing functions; Enhanced
More informationMPEG-4 Structured Audio Systems
MPEG-4 Structured Audio Systems Mihir Anandpara The University of Texas at Austin anandpar@ece.utexas.edu 1 Abstract The MPEG-4 standard has been proposed to provide high quality audio and video content
More informationAudio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011
Audio Fundamentals, Compression Techniques & Standards Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011 Outlines Audio Fundamentals Sampling, digitization, quantization μ-law
More informationSPREAD SPECTRUM AUDIO WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL
SPREAD SPECTRUM WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL 1 Yüksel Tokur 2 Ergun Erçelebi e-mail: tokur@gantep.edu.tr e-mail: ercelebi@gantep.edu.tr 1 Gaziantep University, MYO, 27310, Gaziantep,
More informationTechnical PapER. between speech and audio coding. Fraunhofer Institute for Integrated Circuits IIS
Technical PapER Extended HE-AAC Bridging the gap between speech and audio coding One codec taking the place of two; one unified system bridging a troublesome gap. The fifth generation MPEG audio codec
More informationEE Multimedia Signal Processing. Scope & Features. Scope & Features. Multimedia Signal Compression VI (MPEG-4, 7)
EE799 -- Multimedia Signal Processing Multimedia Signal Compression VI (MPEG-4, 7) References: 1. http://www.mpeg.org 2. http://drogo.cselt.stet.it/mpeg/ 3. T. Berahimi and M.Kunt, Visual data compression
More informationAudio Engineering Society. Convention Paper. Presented at the 123rd Convention 2007 October 5 8 New York, NY, USA
Audio Engineering Society Convention Paper Presented at the 123rd Convention 2007 October 5 8 New York, NY, USA The papers at this Convention have been selected on the basis of a submitted abstract and
More informationDSP. Presented to the IEEE Central Texas Consultants Network by Sergio Liberman
DSP The Technology Presented to the IEEE Central Texas Consultants Network by Sergio Liberman Abstract The multimedia products that we enjoy today share a common technology backbone: Digital Signal Processing
More informationTECHNICAL PAPER. Personal audio mix for broadcast programs. Fraunhofer Institute for Integrated Circuits IIS
TECHNICAL PAPER Personal audio mix for broadcast programs Finding the right balance between dialogue and ambient sound within a broadcast program is a major challenge for sound engineers and an increasing
More informationFigure 1. Generic Encoder. Window. Spectral Analysis. Psychoacoustic Model. Quantize. Pack Data into Frames. Additional Coding.
Introduction to Digital Audio Compression B. Cavagnolo and J. Bier Berkeley Design Technology, Inc. 2107 Dwight Way, Second Floor Berkeley, CA 94704 (510) 665-1600 info@bdti.com http://www.bdti.com INTRODUCTION
More informationSurrounded by High-Definition Sound
Surrounded by High-Definition Sound Dr. ChingShun Lin CSIE, NCU May 6th, 009 Introduction What is noise? Uncertain filters Introduction (Cont.) How loud is loud? (Audible: 0Hz - 0kHz) Introduction (Cont.)
More informationMPEG-1. Overview of MPEG-1 1 Standard. Introduction to perceptual and entropy codings
MPEG-1 Overview of MPEG-1 1 Standard Introduction to perceptual and entropy codings Contents History Psychoacoustics and perceptual coding Entropy coding MPEG-1 Layer I/II Layer III (MP3) Comparison and
More informationLow Bitrate Coding of Spot Audio Signals for Interactive and Immersive Audio Applications
Low Bitrate Coding of Spot Audio Signals for Interactive and Immersive Audio Applications Athanasios Mouchtaris, Christos Tzagkarakis, and Panagiotis Tsakalides Department of Computer Science, University
More informationFundamentals of Perceptual Audio Encoding. Craig Lewiston HST.723 Lab II 3/23/06
Fundamentals of Perceptual Audio Encoding Craig Lewiston HST.723 Lab II 3/23/06 Goals of Lab Introduction to fundamental principles of digital audio & perceptual audio encoding Learn the basics of psychoacoustic
More informationAudio Engineering Society. Convention Paper. Presented at the 116th Convention 2004 May 8 11 Berlin, Germany
Audio Engineering Society Convention Paper Presented at the 116th Convention 2004 May 8 11 Berlin, Germany This convention paper has been reproduced from the author's advance manuscript, without editing,
More informationDAB. Digital Audio Broadcasting
DAB Digital Audio Broadcasting DAB history DAB has been under development since 1981 at the Institut für Rundfunktechnik (IRT). In 1985 the first DAB demonstrations were held at the WARC-ORB in Geneva
More informationijdsp Interactive Illustrations of Speech/Audio Processing Concepts
ijdsp Interactive Illustrations of Speech/Audio Processing Concepts NSF Phase 3 Workshop, UCy Presentation of an Independent Study By Girish Kalyanasundaram, MS by Thesis in EE Advisor: Dr. Andreas Spanias,
More informationMP3. Panayiotis Petropoulos
MP3 By Panayiotis Petropoulos Overview Definition History MPEG standards MPEG 1 / 2 Layer III Why audio compression through Mp3 is necessary? Overview MPEG Applications Mp3 Devices Mp3PRO Conclusion Definition
More informationNetworking Applications
Networking Dr. Ayman A. Abdel-Hamid College of Computing and Information Technology Arab Academy for Science & Technology and Maritime Transport Multimedia Multimedia 1 Outline Audio and Video Services
More informationCISC 7610 Lecture 3 Multimedia data and data formats
CISC 7610 Lecture 3 Multimedia data and data formats Topics: Perceptual limits of multimedia data JPEG encoding of images MPEG encoding of audio MPEG and H.264 encoding of video Multimedia data: Perceptual
More informationMPEG-4 Advanced Audio Coding
MPEG-4 Advanced Audio Coding Peter Doliwa Abstract The goal of the MPEG-4 Audio standard is to provide a universal toolbox for transparent and efficient coding of natural audio signals for many different
More informationPreview only reaffirmed Fifth Avenue, New York, New York, 10176, US
reaffirmed 2017 551 Fifth Avenue, New York, New York, 10176, US Revision of AES55-2007 (reaffirmed 2017) AES standard for digital audio engineering - Carriage of MPEG Surround in an AES3 bitstream Published
More informationProceedings of Meetings on Acoustics
Proceedings of Meetings on Acoustics Volume 19, 213 http://acousticalsociety.org/ ICA 213 Montreal Montreal, Canada 2-7 June 213 Engineering Acoustics Session 2pEAb: Controlling Sound Quality 2pEAb1. Subjective
More informationAudio and video compression
Audio and video compression 4.1 introduction Unlike text and images, both audio and most video signals are continuously varying analog signals. Compression algorithms associated with digitized audio and
More informationISO/IEC INTERNATIONAL STANDARD
INTERNATIONAL STANDARD ISO/IEC 13818-7 Second edition 2003-08-01 Information technology Generic coding of moving pictures and associated audio information Part 7: Advanced Audio Coding (AAC) Technologies
More informationConvention Paper Presented at the 140 th Convention 2016 June 4 7, Paris, France
Audio Engineering Society Convention Paper Presented at the 140 th Convention 2016 June 4 7, Paris, France This convention paper was selected based on a submitted abstract and 750-word precis that have
More informationAdvanced Functionality & Commercial Aspects of DRM. Vice Chairman DRM Technical Committee,
Advanced Functionality & Commercial Aspects of DRM Dipl.-Ing. Alexander Zink, MBA Vice Chairman DRM Technical Committee, DRM Treasurer, Vice President DRM Association Services & Structure Up to 4 Services
More informationAUDIO MEDIA CHAPTER Background
CHAPTER 3 AUDIO MEDIA 3.1 Background It is important to understand how the various audio software is distributed in order to plan for its use. Today, there are so many audio media formats that sorting
More informationAn investigation of non-uniform bandwidths auditory filterbank in audio coding
PAGE 360 An investigation of non-uniform bandwidths auditory filterbank in audio coding Andrew Lin, Stevan Berber, Waleed Abdulla Department of Electrical and Computer Engineering University of Auckland,
More informationMPEG-4 AUTHORING TOOL FOR THE COMPOSITION OF 3D AUDIOVISUAL SCENES
MPEG-4 AUTHORING TOOL FOR THE COMPOSITION OF 3D AUDIOVISUAL SCENES P. Daras I. Kompatsiaris T. Raptis M. G. Strintzis Informatics and Telematics Institute 1,Kyvernidou str. 546 39 Thessaloniki, GREECE
More informationCh. 5: Audio Compression Multimedia Systems
Ch. 5: Audio Compression Multimedia Systems Prof. Ben Lee School of Electrical Engineering and Computer Science Oregon State University Chapter 5: Audio Compression 1 Introduction Need to code digital
More informationAUDIO SIGNAL PROCESSING FOR NEXT- GENERATION MULTIMEDIA COMMUNI CATION SYSTEMS
AUDIO SIGNAL PROCESSING FOR NEXT- GENERATION MULTIMEDIA COMMUNI CATION SYSTEMS Edited by YITENG (ARDEN) HUANG Bell Laboratories, Lucent Technologies JACOB BENESTY Universite du Quebec, INRS-EMT Kluwer
More informationDigital video coding systems MPEG-1/2 Video
Digital video coding systems MPEG-1/2 Video Introduction What is MPEG? Moving Picture Experts Group Standard body for delivery of video and audio. Part of ISO/IEC/JTC1/SC29/WG11 150 companies & research
More informationMPEG Spatial Audio Coding Multichannel Audio for Broadcasting
MPEG Spatial Coding Multichannel for Broadcasting Olaf Korte Fraunhofer IIS Broadcast Applications & Multimedia ealtime Systems www.iis.fraunhofer.de olaf.korte@iis.fraunhofer.de phone: +49-(0) 9131 /
More informationEfficient Signal Adaptive Perceptual Audio Coding
Efficient Signal Adaptive Perceptual Audio Coding MUHAMMAD TAYYAB ALI, MUHAMMAD SALEEM MIAN Department of Electrical Engineering, University of Engineering and Technology, G.T. Road Lahore, PAKISTAN. ]
More informationDesign and Implementation of an MPEG-1 Layer III Audio Decoder KRISTER LAGERSTRÖM
Design and Implementation of an MPEG-1 Layer III Audio Decoder KRISTER LAGERSTRÖM Master s Thesis Computer Science and Engineering Program CHALMERS UNIVERSITY OF TECHNOLOGY Department of Computer Engineering
More informationSqueeze Play: The State of Ady0 Cmprshn. Scott Selfon Senior Development Lead Xbox Advanced Technology Group Microsoft
Squeeze Play: The State of Ady0 Cmprshn Scott Selfon Senior Development Lead Xbox Advanced Technology Group Microsoft Agenda Why compress? The tools at present Measuring success A glimpse of the future
More informationDVB Audio. Leon van de Kerkhof (Philips Consumer Electronics)
eon van de Kerkhof Philips onsumer Electronics Email: eon.vandekerkhof@ehv.ce.philips.com Introduction The introduction of the ompact Disc, already more than fifteen years ago, has brought high quality
More informationPerceptual Audio Coders What to listen for: Artifacts of Parametric Coding
Perceptual Audio Coders What to listen for: Artifacts of Parametric Coding Heiko Purnhagen, Bernd Edler University of AES 109th Convention, Los Angeles, September 22-25, 2000 1 Introduction: Parametric
More informationAudio Engineering Society. Convention Paper
Audio Engineering Society onvention Paper Presented at the th onvention 00 May 0{ Munich, Germany This convention paper has been reproduced from the author's advance manuscript, without editing, corrections,
More informationLecture 3 Image and Video (MPEG) Coding
CS 598KN Advanced Multimedia Systems Design Lecture 3 Image and Video (MPEG) Coding Klara Nahrstedt Fall 2017 Overview JPEG Compression MPEG Basics MPEG-4 MPEG-7 JPEG COMPRESSION JPEG Compression 8x8 blocks
More informationEnhanced Audio Features for High- Definition Broadcasts and Discs. Roland Vlaicu Dolby Laboratories, Inc.
Enhanced Audio Features for High- Definition Broadcasts and Discs Roland Vlaicu Dolby Laboratories, Inc. Entertainment is Changing High definition video Flat panel televisions Plasma LCD DLP New broadcasting
More informationPHABRIX. Dolby Test & Measurement Application Notes. broadcast excellence. Overview. Dolby Metadata Detection. Dolby Metadata Analysis
PHABRIX broadcast excellence Dolby Test & Measurement Application Notes Overview The daily use of technologies such as Dolby E, Dolby Digital and Dolby Digital Plus in all areas of broadcast television
More informationEfficient Implementation of Transform Based Audio Coders using SIMD Paradigm and Multifunction Computations
Efficient Implementation of Transform Based Audio Coders using SIMD Paradigm and Multifunction Computations Luckose Poondikulam S (luckose@sasken.com), Suyog Moogi (suyog@sasken.com), Rahul Kumar, K P
More informationIMMERSIVE AUDIO WITH MPEG 3D AUDIO STATUS AND OUTLOOK
IMMERSIVE AUDIO WITH MPEG 3D AUDIO STATUS AND OUTLOOK Max Neuendorf, Jan Plogsties, Stefan Meltzer Fraunhofer Institute for Integrated Circuits (IIS) Erlangen, Germany Robert Bleidt Fraunhofer USA Digital
More informationHAVE YOUR CAKE AND HEAR IT TOO: A HUFFMAN CODED, BLOCK SWITCHING, STEREO PERCEPTUAL AUDIO CODER
HAVE YOUR CAKE AND HEAR IT TOO: A HUFFMAN CODED, BLOCK SWITCHING, STEREO PERCEPTUAL AUDIO CODER Rob Colcord, Elliot Kermit-Canfield and Blane Wilson Center for Computer Research in Music and Acoustics,
More information