Sontacchi A., Noisternig M., Majdak P., Höldrich R.

Size: px
Start display at page:

Download "Sontacchi A., Noisternig M., Majdak P., Höldrich R."

Transcription

1 $Q2EMHFWLYHRGHORI/RFDOLVDWLRQLQ %LQDXUDO6RXQG5HSURGXFWLRQ6\VWHPV Sontacchi A., Noisternig M., Majdak P., Höldrich R. $(6VW,QWHUQDWLRQDO&RQIHUHQFH -XQH 6W3HWHUVEXUJ5XVVLD

2 ,QVWLWXWHRI(OHFWURQLFXVLF DQG$FRXVWLFV University of Music and Dramatic Arts Graz, Austria KWWSLHPNXJDFDW LQFROODERUDWLRQZLWK $.*$FRXVWLFV9LHQQD$XVWULD KWWSZZZDNJFRP

3 Outline HRIR based Binaural Sound Reproduction Systems using Ambisonic Definitions Mathematical Model Results Outlook

4 Binaural Systems using Ambisonic Binaural Reproduction Systems: filtering virtual source signals with HRIRs incorporate head tracking to improve source localisation Problem: time-varying interpolation between HRIRs

5 Binaural Systems using Ambisonic Why Ambisonic? head movement: - time-invariant HRIR filter - cheap time-variant rotation matrix decoding is independent from coding number of transmit channels is independent from number of virtual sources

6 Binaural Systems using Ambisonic encode signals depending on source position rotate Ambisonic depending on head position decode Ambisonic to virtual speaker signals convolve speaker signals with HRIRs

7 Definitions localisation function localisation error localisation blur average localisation error average localisation blur localisation blur localisation error

8 Mathematical Model Intentions: classifying Binaural Reproduction Systems prediction of localisation performance Main assumption: the reference HRIRs result in optimal localisation 2D systems only

9 Mathematical Model Parameter Mathematical Model Device under test ILD ITD HRIRs ILD ITD ITDA ILDA Result Merger

10 Interaural Time Difference (ITD) 2 2 Amplitudengang Magnitude Response.3 HRIR, ϕ= HRIR for Amplitude in db x -3 x Gruppenlaufzeit Group Delay Time Amplitude Time in s x -4 FFT Phase in rad Frequenz in Hz Frequency in Hz x x 4 Phasengang Phase Response π Gruppenlaufzeit Group Delay Time in in s s GΦ GI Frequenz in Hz Frequency in Hz x 4 x Frequenz in Hz Frequency in Hz x 4 x 4

11 Interaural Time Difference (ITD) Calculation of the group delay time: impulse response (HRIR) zero phase filter over critical bands energy center of gravity per band group delay time per band Group Delay Time [s] x Group Delay Time -4 Frequency [Hz] x 4

12 Interaural Time Difference (ITD) ITD over critical bands: difference of the group delay time between ipsi- and contralateral eardrum Group Delay Time left ear right ear ITD critical bands azimuth angle

13 Interaural Level Difference (ILD) Calculation of the ILD: amplitude response (HRTF) energy-difference over critical bands between ipsi- and contralateral eardrums energy over critical bands left ear right ear ILD critical bands azimuth angle

14 Interaural Difference Angle (ITDA/ILDA) Search Algorithm: 5 x -4 ƒ,7'irurqhfulwlfdoedqg FDOFXODWHGIURPPHDVXUHG +5,5 GLVWRUWHG,7' UHIHUHQFH,7' ITD [s] Deviation Azimuth Angle [ ]

15 Merging of ITDA and ILDA ITDA: fade out above 8 Hz ILDA: fully weighted Gewichtung der ITD.8 ITDA Gewichtungsfaktor Frequenzgruppe in bark Gewichtung der ILD.8 Result ILDA Gewichtungsfaktor Frequenzgruppe in bark

16 Results () localisation function (averaged over critical bands) localisation blur (standard deviation over critical bands) Θ Θ + Θ Θ = Θ = = ), ( ) ( ) ( 2 ), ( ) ( ) ( 2 ) ( ],/',/',/' ],7',7',7' ] ] Z ] Z ] ] Z ] Z / [ ] = = Θ Θ Θ = Θ 24 2 ) ( ), ( ) ( ) ( 2 ) ( ] L L,7',/' L L / ] ] Z ] Z %O

17 Results (2) average localisation blur (averaged over the azimuth angle) / = [ /( ΘL ) ΘL ] L= 2 average localisation error averaged over azimuth angle referring to the MAA %O = 36 %O( Θ) Θ= $$ ( Θ)

18 Analysis Variations of design parameters: reference HRIRs length of HRIR ambisonic order weighting of ambisonic channels number of virtual speaker

19 HRIR Type () azimuth angle [ ] 2 azimuth angle [ ] Kemar-HRIR (#2) Propr.-HRIR (#9) filter length = 28 samples 5-5 localisation error [ ] localisation blur [ ] Kemar-HRIR (#2) Proprietary-HRIR (#9) filter length = 28 samples

20 HRIR Type (2) 6 ORFDOLVDWLRQHUURU 6.5 ORFDOLVDWLRQEOXU localisation error [ ] rd ord. KEMAR, 5 (#) 3rd ord. KEMAR, 5 (#2) 3rd ord. AKG, 5 (#9) 4th ord. KEMAR, 5 (#8) 4th ord. AKG, 5 (#23) localisation blur [multiples of MAA] rd ord. KEMAR, 5 3rd ord. KEMAR, 5 3rd ord. Propr., 5 4th ord. KEMAR, 5 4th ord. Propr., filter length [samples] filter length [samples]

21 Ambisonic Order () azimuth angle [ ] azimuth angle [ ] th ord. (#8) 3rd ord. (#) 2nd ord. (#) - localisation error [ ] localisation blur [ ] 4th ord. (#8) 3rd ord. (#) 2nd ord. (#) filter length = 28 samples

22 Ambisonic Order (2) ORFDOLVDWLRQHUURU ORFDOLVDWLRQEOXU th ord. (#8) 3rd ord. (#) 2nd ord. (#) th ord. (#8) 3rd ord. (#) 2nd ord. (#) localisation error [ ] localisation blur [multiples MAA] filter length [samples] filter length [samples]

23 Ambisonic Weighting () azimuth angle [ ] 5 azimuth angle [ ] 3 6 3rd ord., Kaiser window (#24) 4th ord., fully weighted (#8) 4th. ord. Kaiser window (#22) filter length = 28 samples 5-5 localisation error [ ] localisation blur [ ] Kaiser window, 3rd order (#24) Fully weighted, 4th order (#8) Kaiser window, 4th order (#22) length of filter = 28 samples

24 Ambisonic Weighting (2) ORFDOLVDWLRQHUURU ORFDOLVDWLRQEOXU localisation error [ ] rd ord., weighting:.4 (#3) 3rd ord., fully weighted (#) 3rd ord., Kaiser window (#24) 4th ord., fully weighted (#8) 4th ord., Kaiser window (#22) localisation blur [multiples of MAA] rd ord., weighting:.4 (#3) 3rd ord., fully weighted (#) 3rd ord., Kaiser window (#24) 4th ord., fully weighted (#8) 4th ord., Kaiser window (#22) filter length [samples] length of filter [samples]

25 Conclusion prediction of localisation is possible results correspond with ambisonic theory - ambisonic not less than 3rd order for correct localisation - weighting of channels using Kaiser-window yields better localisation - filter length up to 28 samples model was evaluated using listening tests

26 Outlook incorporate IGD (LQWHUDXUDOJURXSGHOD\) as a cue for high frequency localisation classification regarding coloration and externity extend the model for 3D applications warping tables to predict and minimise the localisation errors for further implementations

27 ,QVWLWXWHRI(OHFWURQLFXVLF DQG$FRXVWLFV 7KDQN\RX University of Music and Dramatic Arts Graz, Austria KWWSLHPNXJDFDW

Surrounded by High-Definition Sound

Surrounded by High-Definition Sound Surrounded by High-Definition Sound Dr. ChingShun Lin CSIE, NCU May 6th, 009 Introduction What is noise? Uncertain filters Introduction (Cont.) How loud is loud? (Audible: 0Hz - 0kHz) Introduction (Cont.)

More information

Evaluation of a new Ambisonic decoder for irregular loudspeaker arrays using interaural cues

Evaluation of a new Ambisonic decoder for irregular loudspeaker arrays using interaural cues 3rd International Symposium on Ambisonics & Spherical Acoustics@Lexington, Kentucky, USA, 2nd June 2011 Evaluation of a new Ambisonic decoder for irregular loudspeaker arrays using interaural cues J. Treviño

More information

Measurement of pinna flare angle and its effect on individualized head-related transfer functions

Measurement of pinna flare angle and its effect on individualized head-related transfer functions PROCEEDINGS of the 22 nd International Congress on Acoustics Free-Field Virtual Psychoacoustics and Hearing Impairment: Paper ICA2016-53 Measurement of pinna flare angle and its effect on individualized

More information

Introduction to HRTFs

Introduction to HRTFs Introduction to HRTFs http://www.umiacs.umd.edu/users/ramani ramani@umiacs.umd.edu How do we perceive sound location? Initial idea: Measure attributes of received sound at the two ears Compare sound received

More information

APPLYING EXTRAPOLATION AND INTERPOLATION METHODS TO MEASURED AND SIMULATED HRTF DATA USING SPHERICAL HARMONIC DECOMPOSITION.

APPLYING EXTRAPOLATION AND INTERPOLATION METHODS TO MEASURED AND SIMULATED HRTF DATA USING SPHERICAL HARMONIC DECOMPOSITION. APPLYING EXTRAPOLATION AND INTERPOLATION METHODS TO MEASURED AND SIMULATED HRTF DATA USING SPHERICAL HARMONIC DECOMPOSITION Martin Pollow Institute of Technical Acoustics RWTH Aachen University Neustraße

More information

The FABIAN head-related transfer function data base

The FABIAN head-related transfer function data base The FABIAN head-related transfer function data base Fabian Brinkmann, Alexander Lindau, Stefan Weinzierl TU Berlin, Audio Communication Group Einsteinufer 17c, 10587 Berlin-Germany Gunnar Geissler, Steven

More information

Horizontal plane HRTF reproduction using continuous Fourier-Bessel functions

Horizontal plane HRTF reproduction using continuous Fourier-Bessel functions Horizontal plane HRTF reproduction using continuous Fourier-Bessel functions Wen Zhang,2, Thushara D. Abhayapala,2, Rodney A. Kennedy Department of Information Engineering, Research School of Information

More information

Modeling of Pinna Related Transfer Functions (PRTF) Using the Finite Element Method (FEM)

Modeling of Pinna Related Transfer Functions (PRTF) Using the Finite Element Method (FEM) Modeling of Pinna Related Transfer Functions (PRTF) Using the Finite Element Method (FEM) Manan Joshi *1, Navarun Gupta 1, and Lawrence V. Hmurcik 1 1 University of Bridgeport, Bridgeport, CT *Corresponding

More information

Parametric Coding of Spatial Audio

Parametric Coding of Spatial Audio Parametric Coding of Spatial Audio Ph.D. Thesis Christof Faller, September 24, 2004 Thesis advisor: Prof. Martin Vetterli Audiovisual Communications Laboratory, EPFL Lausanne Parametric Coding of Spatial

More information

Validation of a 3-D Virtual Acoustic Prototyping Method For Use In Structural Design

Validation of a 3-D Virtual Acoustic Prototyping Method For Use In Structural Design Validation of a 3-D Virtual Acoustic Prototyping Method For Use In Structural Design Zachary T. Carwile Thesis submitted to the Faculty of the Virginia Polytechnic Institute and State University In partial

More information

From Shapes to Sounds: A perceptual mapping

From Shapes to Sounds: A perceptual mapping From Shapes to Sounds: A perceptual mapping Vikas Chandrakant Raykar vikas@umiacs.umd.edu Abstract In this report we present a perceptually inspired mapping to convert a simple two dimensional image consisting

More information

Auralization in Room Acoustics

Auralization in Room Acoustics Graz University of Technology Auralization in Room Acoustics Bachelor s Thesis by Marius Förster Graz University of Technology Institute of Broadband Communications Head: Univ.-Prof. Dipl.-Ing. Dr.techn.

More information

BINAURAL SOUND LOCALIZATION FOR UNTRAINED DIRECTIONS BASED ON A GAUSSIAN MIXTURE MODEL

BINAURAL SOUND LOCALIZATION FOR UNTRAINED DIRECTIONS BASED ON A GAUSSIAN MIXTURE MODEL BINAURAL SOUND LOCALIZATION FOR UNTRAINED DIRECTIONS BASED ON A GAUSSIAN MIXTURE MODEL Takanori Nishino and Kazuya Takeda Center for Information Media Studies, Nagoya University Furo-cho, Chikusa-ku, Nagoya,

More information

TRANSAURAL STEREO IN A BEAMFORMING APPROACH

TRANSAURAL STEREO IN A BEAMFORMING APPROACH Proc of the 12 th Int Conference on Digital Audio Effects (DAFx-9), Como, Italy, September 1-4, 29 TRANSAURAL STEREO IN A BEAMFORMING APPROACH Markus Guldenschuh Institute of Electronic Music and Acoustics,

More information

JND-based spatial parameter quantization of multichannel audio signals

JND-based spatial parameter quantization of multichannel audio signals Gao et al. EURASIP Journal on Audio, Speech, and Music Processing (2016) 2016:13 DOI 10.1186/s13636-016-0091-z RESEARCH JND-based spatial parameter quantization of multichannel audio signals Li Gao 1,2,3,

More information

Modeling of Pinna Related Transfer Functions (PRTF) using the Finite Element Method (FEM)

Modeling of Pinna Related Transfer Functions (PRTF) using the Finite Element Method (FEM) Modeling of Pinna Related Transfer Functions (PRTF) using the Finite Element Method (FEM) Manan Joshi Navarun Gupta, Ph. D. Lawrence Hmurcik, Ph. D. University of Bridgeport, Bridgeport, CT Objective Measure

More information

EE482: Digital Signal Processing Applications

EE482: Digital Signal Processing Applications Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 13 Audio Signal Processing 14/04/01 http://www.ee.unlv.edu/~b1morris/ee482/

More information

REAL-TIME BINAURAL AUDIO RENDERING IN THE NEAR FIELD

REAL-TIME BINAURAL AUDIO RENDERING IN THE NEAR FIELD Proceedings of the SMC 29-6th Sound and Music Computing Conference, 23-2 July 29, Porto - Portugal REAL-TIME BINAURAL AUDIO RENDERING IN THE NEAR FIELD Simone Spagnol University of Padova spagnols@dei.unipd.it

More information

A Head-Related Transfer Function Model for Real-Time Customized 3-D Sound Rendering

A Head-Related Transfer Function Model for Real-Time Customized 3-D Sound Rendering 211 Seventh International Conference on Signal Image Technology & Internet-Based Systems A Head-Related Transfer Function Model for Real-Time Customized 3-D Sound Rendering Michele Geronazzo Simone Spagnol

More information

Combining Localization Cues and Source Model Constraints for Binaural Source Separation

Combining Localization Cues and Source Model Constraints for Binaural Source Separation Combining Localization Cues and Source Model Constraints for Binaural Source Separation Ron J. Weiss, Michael I. Mandel, Daniel P. W. Ellis LabROSA, Dept. of Electrical Engineering Columbia University

More information

HEAD IN SPACE: A HEAD-TRACKING BASED BINAURAL SPATIALIZATION SYSTEM

HEAD IN SPACE: A HEAD-TRACKING BASED BINAURAL SPATIALIZATION SYSTEM HEAD IN SPACE: A HEAD-TRACKING BASED BINAURAL SPATIALIZATION SYSTEM Luca A. Ludovico, Davide A. Mauro, and Dario Pizzamiglio LIM - Laboratorio di Informatica Musicale Dipartimento di Informatica e comunicazione

More information

Dmitry N. Zotkin*, Ramani Duraiswami, Larry S. Davis

Dmitry N. Zotkin*, Ramani Duraiswami, Larry S. Davis 1 Rendering Localized Spatial Audio in a Virtual Auditory Space Dmitry N. Zotkin*, Ramani Duraiswami, Larry S. Davis {dz,ramani,lsd}@umiacs.umd.edu Phone: (301)405-1049 Fax: (301)405-6707 Perceptual Interfaces

More information

ESTIMATION OF MULTIPATH PROPAGATION DELAYS AND INTERAURAL TIME DIFFERENCES FROM 3-D HEAD SCANS. Hannes Gamper, Mark R. P. Thomas, Ivan J.

ESTIMATION OF MULTIPATH PROPAGATION DELAYS AND INTERAURAL TIME DIFFERENCES FROM 3-D HEAD SCANS. Hannes Gamper, Mark R. P. Thomas, Ivan J. ESTIMATION OF MULTIPATH PROPAGATION DELAYS AND INTERAURAL TIME DIFFERENCES FROM 3-D HEAD SCANS Hannes Gamper, Mark R. P. Thomas, Ivan J. Tashev Microsoft Research, Redmond, WA 98052, USA ABSTRACT The estimation

More information

An initial Investigation into HRTF Adaptation using PCA

An initial Investigation into HRTF Adaptation using PCA An initial Investigation into HRTF Adaptation using PCA IEM Project Thesis Josef Hölzl Supervisor: Dr. Georgios Marentakis Graz, July 212 institut für elektronische musik und akustik Abstract Recent research

More information

Distributed Signal Processing for Binaural Hearing Aids

Distributed Signal Processing for Binaural Hearing Aids Distributed Signal Processing for Binaural Hearing Aids Olivier Roy LCAV - I&C - EPFL Joint work with Martin Vetterli July 24, 2008 Outline 1 Motivations 2 Information-theoretic Analysis 3 Example: Distributed

More information

Directivity Measurement with Turntables AN54

Directivity Measurement with Turntables AN54 theta Directivity Measurement with Turntables Application Note to the KLIPPEL ANALYZER SYSTEM (Document Revision 1.5) FEATURES Polar measurement in far field CEA2034 measurement Fast, automatic measurement

More information

Keywords: Bilinear interpolation; HRIR interpolation; HRTF interpolation; Tetrahedral interpolation

Keywords: Bilinear interpolation; HRIR interpolation; HRTF interpolation; Tetrahedral interpolation International Journal of Technology (2017) 1: 186-195 ISSN 2086-9614 IJTech 2017 IMPLEMENTATION OF 3D HRTF INTERPOLATION IN SYNTHESIZING VIRTUAL 3D MOVING SOUND Hugeng Hugeng 1*, Jovan Anggara 1, Dadang

More information

CUSTOMIZABLE AUDITORY DISPLAYS. Dmitry N. Zotkin, Ramani Duraiswami, Larry S. Davis

CUSTOMIZABLE AUDITORY DISPLAYS. Dmitry N. Zotkin, Ramani Duraiswami, Larry S. Davis CUSTOMIZABLE AUDITORY DISPLAYS Dmitry N. Zotkin, Ramani Duraiswami, Larry S. Davis PIRL LAB, UMIACS, University of Maryland College Park, MD 20742 USA {dz,ramani,lsd}@umiacs.umd.edu ABSTRACT High-quality

More information

IEEE Proof Web Version

IEEE Proof Web Version IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 17, NO. 8, NOVEMBER 2009 1 Model-Based Expectation-Maximization Source Separation and Localization Michael I. Mandel, Member, IEEE, Ron

More information

Sound occlusion for virtual 3D environments

Sound occlusion for virtual 3D environments PROFESSIONSBACHELORPROJEKT - MEDIE OG SONOKOMMUNIKATION Sound occlusion for virtual 3D environments A solution for real-time filtered sound propagation for arbitrary and non-convex geometry by Janus Lynggaard

More information

Sequential Organization of Speech in Reverberant Environments by Integrating Monaural Grouping and Binaural Localization

Sequential Organization of Speech in Reverberant Environments by Integrating Monaural Grouping and Binaural Localization Technical Report OSU-CISRC-10/09-TR50 Department of Computer Science and Engineering The Ohio State University Columbus, OH 43210-1277 Ftpsite: ftp.cse.ohio-state.edu Login: anonymous Directory: pub/tech-report/2009

More information

System Identification Related Problems at SMN

System Identification Related Problems at SMN Ericsson research SeRvices, MulTimedia and Network Features System Identification Related Problems at SMN Erlendur Karlsson SysId Related Problems @ ER/SMN Ericsson External 2016-05-09 Page 1 Outline Research

More information

Extracting the frequencies of the pinna spectral notches in measured head related impulse responses

Extracting the frequencies of the pinna spectral notches in measured head related impulse responses Extracting the frequencies of the pinna spectral notches in measured head related impulse responses Vikas C. Raykar a) and Ramani Duraiswami b) Perceptual Interfaces and Reality Laboratory, Institute for

More information

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding Perceptual Coding Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding Part II wrap up 6.082 Fall 2006 Perceptual Coding, Slide 1 Lossless vs.

More information

Room Acoustics. CMSC 828D / Spring 2006 Lecture 20

Room Acoustics. CMSC 828D / Spring 2006 Lecture 20 Room Acoustics CMSC 828D / Spring 2006 Lecture 20 Lecture Plan Room acoustics basics Structure of room impulse response Characterization of room acoustics Modeling of reverberant response Basics All our

More information

SPATIAL FREQUENCY RESPONSE SURFACES (SFRS S): AN ALTERNATIVE VISUALIZATION AND INTERPOLATION TECHNIQUE FOR HEAD-RELATED TRANSFER FUNCTIONS (HRTF S)

SPATIAL FREQUENCY RESPONSE SURFACES (SFRS S): AN ALTERNATIVE VISUALIZATION AND INTERPOLATION TECHNIQUE FOR HEAD-RELATED TRANSFER FUNCTIONS (HRTF S) Cheng and Wakefield Page 1 of 13 Pre-print: AES16: Rovaniemi, Finland SPATIAL FREQUENCY RESPONSE SURFACES (SFRS S): AN ALTERNATIVE VISUALIZATION AND INTERPOLATION TECHNIQUE FOR HEAD-RELATED TRANSFER FUNCTIONS

More information

An auditory localization model based on. high frequency spectral cues. Dibyendu Nandy and Jezekiel Ben-Arie

An auditory localization model based on. high frequency spectral cues. Dibyendu Nandy and Jezekiel Ben-Arie An auditory localization model based on high frequency spectral cues Dibyendu Nandy and Jezekiel Ben-Arie Department of Electrical Engineering and Computer Science The University of Illinois at Chicago

More information

Crosstalk Cancellation with the Cell Broadband Engine

Crosstalk Cancellation with the Cell Broadband Engine Crosstalk Cancellation with the Cell Broadband Engine 9 TH SEMESTER PROJECT, AAU APPLIED SIGNAL PROCESSING AND IMPLEMENTATION (ASPI) Group 940 Thibault Beyou Francois Marot Julien Morand AALBORG UNIVERSITY

More information

Subband selection for binaural speech source localization

Subband selection for binaural speech source localization INTERSPEECH 2017 August 20 24, 2017, Stockholm, Sweden Subband selection for binaural speech source localization Girija Ramesan Karthik, Prasanta Kumar Ghosh Electrical Engineering, Indian Institute of

More information

Online Pose Classification and Walking Speed Estimation using Handheld Devices

Online Pose Classification and Walking Speed Estimation using Handheld Devices Online Pose Classification and Walking Speed Estimation using Handheld Devices Jun-geun Park MIT CSAIL Joint work with: Ami Patel (MIT EECS), Jonathan Ledlie (Nokia Research), Dorothy Curtis (MIT CSAIL),

More information

Active noise control at a moving location in a modally dense three-dimensional sound field using virtual sensing

Active noise control at a moving location in a modally dense three-dimensional sound field using virtual sensing Active noise control at a moving location in a modally dense three-dimensional sound field using virtual sensing D. J Moreau, B. S Cazzolato and A. C Zander The University of Adelaide, School of Mechanical

More information

Metrics for performance assessment of mixed-order Ambisonics spherical microphone arrays

Metrics for performance assessment of mixed-order Ambisonics spherical microphone arrays Downloaded from orbit.dtu.dk on: Oct 6, 28 Metrics for performance assessment of mixed-order Ambisonics spherical microphone arrays Favrot, Sylvain Emmanuel; Marschall, Marton Published in: Proceedings

More information

Minimally Simple Binaural Room Modelling Using a Single Feedback Delay Network

Minimally Simple Binaural Room Modelling Using a Single Feedback Delay Network Minimally Simple Binaural Room Modelling Using a Single Feedback Delay Network NATALIE AGUS 1, HANS ANDERSON 1, JER-MING CHEN 1, SIMON LUI 1, AND DORIEN HERREMANS 1,2 (natalie agus@mymail.sutd.edu.sg)

More information

A SOURCE LOCALIZATION/SEPARATION/RESPATIALIZATION SYSTEM BASED ON UNSUPERVISED CLASSIFICATION OF INTERAURAL CUES

A SOURCE LOCALIZATION/SEPARATION/RESPATIALIZATION SYSTEM BASED ON UNSUPERVISED CLASSIFICATION OF INTERAURAL CUES Proc of the 9 th Int Conference on Digital Audio Effects (DAFx-6), Montreal, Canada, September 8-, 6 A SOURCE LOCALIZATION/SEPARATION/RESPATIALIZATION SYSTEM BASED ON UNSUPERVISED CLASSIFICATION OF INTERAURAL

More information

AURALISATION OF A SYMPHONY ORCHESTRA WITH ODEON THE CHAIN FROM MUSICAL INSTRUMENTS TO THE EARDRUMS

AURALISATION OF A SYMPHONY ORCHESTRA WITH ODEON THE CHAIN FROM MUSICAL INSTRUMENTS TO THE EARDRUMS AURALISATION OF A SYMPHONY ORCHESTRA WITH ODEON THE CHAIN FROM MUSICAL INSTRUMENTS TO THE EARDRUMS Jens Holger Rindel Odeon A/S Scion DTU, Diplomvej 381, DK-2800 Kgs. Lyngby, Denmark jhr@odeon.dk Claus

More information

System Identification Related Problems at

System Identification Related Problems at media Technologies @ Ericsson research (New organization Taking Form) System Identification Related Problems at MT@ER Erlendur Karlsson, PhD 1 Outline Ericsson Publications and Blogs System Identification

More information

System Identification Related Problems at SMN

System Identification Related Problems at SMN Ericsson research SeRvices, MulTimedia and Networks System Identification Related Problems at SMN Erlendur Karlsson SysId Related Problems @ ER/SMN Ericsson External 2015-04-28 Page 1 Outline Research

More information

White Paper: An introduction to 3D acoustics

White Paper: An introduction to 3D acoustics White Paper: An introduction to 3D acoustics IRIS White Paper An introduction to 3D acoustics Page 1 Future acoustics now Two-dimensional plans were once considered the standard in architecture and engineering,

More information

arxiv: v2 [eess.as] 3 Apr 2018

arxiv: v2 [eess.as] 3 Apr 2018 IET Research Journals Minimum-Phase HRTF Modelling of Pinna Spectral Notches using Group Delay Decomposition ISSN 1751-8644 doi: 0000000000 www.ietdl.org Sandeep Reddy C 1, Rajesh M Hegde 2 1,2 Department

More information

THE HOUGH TRANSFORM FOR BINAURAL SOURCE LOCALIZATION

THE HOUGH TRANSFORM FOR BINAURAL SOURCE LOCALIZATION Proc. of the th Int. Conference on Digital Audio Effects (DAFx-9), Como, Italy, September -4, 9 THE HOUGH TRANSFORM FOR BINAURAL SOURCE LOCALIZATION Sylvain Marchand LaBRI CNRS University of Bordeaux Talence,

More information

Dolby Atmos Immersive Audio From the Cinema to the Home

Dolby Atmos Immersive Audio From the Cinema to the Home Dolby Atmos Immersive Audio From the Cinema to the Home Nick Engel Senior Director Consumer Entertainment Technology AES Melbourne Section Meeting 11 December 2017 History of Cinema 1980s - Dolby SR noise

More information

Individualized Head-Related Transfer Functions: Efficient Modeling and Estimation from Small Sets of Spatial Samples

Individualized Head-Related Transfer Functions: Efficient Modeling and Estimation from Small Sets of Spatial Samples Individualized Head-Related Transfer Functions: Efficient Modeling and Estimation from Small Sets of Spatial Samples Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy

More information

Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio

Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio Dr. Jürgen Herre 11/07 Page 1 Jürgen Herre für (IIS) Erlangen, Germany Introduction: Sound Images? Humans

More information

Model-Based Expectation Maximization Source Separation and Localization

Model-Based Expectation Maximization Source Separation and Localization 1 Model-Based Expectation Maximization Source Separation and Localization Michael I. Mandel, Student Member, IEEE, Ron J. Weiss, Student Member IEEE, Daniel P. W. Ellis, Senior Member, IEEE Abstract This

More information

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd Overview Audio Signal Processing Applications @ Dolby Audio Signal Processing Basics

More information

A fast calculation method of the head-related transfer functions for multiple source points based on the boundary element method

A fast calculation method of the head-related transfer functions for multiple source points based on the boundary element method Acoust. Sci. & Tech. 24, 5 (2003) PAPER A fast calculation method of the head-related transfer functions for multiple source points based on the boundary element method Makoto Otani and Shiro Ise Graduate

More information

Automated Loudspeaker Balloon Measurement M. Bigi, M.Jacchia, D. Ponteggia - Audiomatica

Automated Loudspeaker Balloon Measurement M. Bigi, M.Jacchia, D. Ponteggia - Audiomatica Automated Loudspeaker Balloon Measurement ALMA 2009 European Symposium Loudspeaker Design - Science & Art April 4, 2009 Introduction Summary why 3D measurements? Coordinate system Measurement of loudspeaker

More information

Implementation and evaluation of a mobile Android application for auditory stimulation of chronic tinnitus patients

Implementation and evaluation of a mobile Android application for auditory stimulation of chronic tinnitus patients Universität Ulm 89069 Ulm Germany Faculty of Engineering and Computer Science Institute of Databases and Information Systems Implementation and evaluation of a mobile Android application for auditory stimulation

More information

Accelerometer Gesture Recognition

Accelerometer Gesture Recognition Accelerometer Gesture Recognition Michael Xie xie@cs.stanford.edu David Pan napdivad@stanford.edu December 12, 2014 Abstract Our goal is to make gesture-based input for smartphones and smartwatches accurate

More information

AUDIO-VISUAL MULTIPLE ACTIVE SPEAKER LOCALISATION IN REVERBERANT ENVIRONMENTS

AUDIO-VISUAL MULTIPLE ACTIVE SPEAKER LOCALISATION IN REVERBERANT ENVIRONMENTS Proc. of the 5 th Int. Conference on Digital Effects (DAFx-2), York, UK, September 7-2, 22 AUDIO-VISUAL MULTIPLE ACTIVE SPEAKER LOCALISATION IN REVERBERANT ENVIRONMENTS Zhao Li, Thorsten Herfet Telecommunications

More information

Efficient Binaural Sound Localization for Humanoid Robots and Telepresence Applications

Efficient Binaural Sound Localization for Humanoid Robots and Telepresence Applications TECHNISCHE UNIVERSITÄT MÜNCHEN Lehrstuhl für Datenverarbeitung Efficient Binaural Sound Localization for Humanoid Robots and Telepresence Applications Fakheredine Keyrouz Vollständiger Abdruck der von

More information

Using Noise Substitution for Backwards-Compatible Audio Codec Improvement

Using Noise Substitution for Backwards-Compatible Audio Codec Improvement Using Noise Substitution for Backwards-Compatible Audio Codec Improvement Colin Raffel AES 129th Convention San Francisco, CA February 16, 2011 Outline Introduction and Motivation Coding Error Analysis

More information

are best when using more complicated parametrizations, without over-fitting the data

are best when using more complicated parametrizations, without over-fitting the data 77 Chapter 5 Separation This chapter describes the full Model-based EM Source Separation and Localization (MESSL) system. This system separates and localizes multiple sound sources from a reverberant two-channel

More information

Pyramid Coding and Subband Coding

Pyramid Coding and Subband Coding Pyramid Coding and Subband Coding Predictive pyramids Transform pyramids Subband coding Perfect reconstruction filter banks Quadrature mirror filter banks Octave band splitting Transform coding as a special

More information

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories,

More information

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects. Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general

More information

Parametric Model Of Head Related Transfer Functions Based On Systematic Movements Of Poles And Zeros With Sound Location For Pole/ zero Models

Parametric Model Of Head Related Transfer Functions Based On Systematic Movements Of Poles And Zeros With Sound Location For Pole/ zero Models University of Denver Digital Commons @ DU Electronic Theses and Dissertations Graduate Studies 1-1-2009 Parametric Model Of Head Related Transfer Functions Based On Systematic Movements Of Poles And Zeros

More information

EFFICIENT HRTF INTERPOLATION IN 3D MOVING SOUND

EFFICIENT HRTF INTERPOLATION IN 3D MOVING SOUND EFFICIENT HRTF INTERPOLATION IN 3D MOVING SOUND FÁBIO P. FREELAND, LUIZ WAGNER P. BISCAINHO AND PAULO SERGIO R. DINIZ Universidade Federal do Rio de Janeiro: LPS - PEE/COPPE & DEL/POLI Caixa Postal 68564-21945-970

More information

Auralization and Geometric acoustics ERIK MOLIN, HANNA AUTIO

Auralization and Geometric acoustics ERIK MOLIN, HANNA AUTIO Auralization and Geometric acoustics ERIK MOLIN, HANNA AUTIO Auralization what and why? For a given acoustic situation (space, sound source(s), listener position ), what sound does the listener hear? Auralization

More information

A Structural Parametric Binaural 3D Sound Implementation Using Open Hardware

A Structural Parametric Binaural 3D Sound Implementation Using Open Hardware A Structural Parametric Binaural 3D Sound Implementation Using Open Hardware Bruno Dal Bó Silva and Marcelo Götz Electrical Engineering Department Federal University of Rio Grande do Sul - UFRGS, Porto

More information

Pyramid Coding and Subband Coding

Pyramid Coding and Subband Coding Pyramid Coding and Subband Coding! Predictive pyramids! Transform pyramids! Subband coding! Perfect reconstruction filter banks! Quadrature mirror filter banks! Octave band splitting! Transform coding

More information

Offset Driver in an Open Ended Transmission Line - Acoustic and Electrical Response 7/03/09. Copyright 2009 by Martin J. King. All Rights Reserved.

Offset Driver in an Open Ended Transmission Line - Acoustic and Electrical Response 7/03/09. Copyright 2009 by Martin J. King. All Rights Reserved. Offset Driver in an Open Ended Transmission Line - Acoustic and Electrical Response 7/3/9 Software : by Martin J. King e-mail MJKing57@aol.com Copyright 29 by Martin J. King. All Rights Reserved. Line

More information

Remote Collaborative Real-Time Multimedia Experience over the Future Internet ROMEO. Grant Agreement Number: D3.6

Remote Collaborative Real-Time Multimedia Experience over the Future Internet ROMEO. Grant Agreement Number: D3.6 Ref. Ares(2013)3156899-02/10/2013 Remote Collaborative Real-Time Multimedia Experience over the Future Internet ROMEO Grant Agreement Number: 287896 D3.6 Final Report on 3D Audio - Video Rendering Algorithms

More information

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general

More information

Abstract. 1 Introduction

Abstract. 1 Introduction Holographic image-based rendering of spatial audio Ramani Duraiswami, Zhiyun Li, Dmitry N. Zotkin, Nail A. Gumerov, Elena Grassi, Larry S.Davis Perceptual Interfaces and Reality Laboratory, University

More information

A3-D AUDIO system can be used to position sounds around

A3-D AUDIO system can be used to position sounds around IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 15, NO. 8, NOVEMBER 2007 2287 A Minimax Approach for the Joint Design of Acoustic Crosstalk Cancellation Filters Harsha I. K. Rao, Student

More information

Driver in a Double Bass Reflex Enclosure - Acoustic and Electrical Response 8/14/09. Copyright 2009 by Martin J. King. All Rights Reserved.

Driver in a Double Bass Reflex Enclosure - Acoustic and Electrical Response 8/14/09. Copyright 2009 by Martin J. King. All Rights Reserved. Driver in a Double Bass Reflex Enclosure - Acoustic and Electrical Response 8/14/9 Software : by Martin J. King e-mail MJKing57@aol.com Copyright 29 by Martin J. King. All Rights Reserved. Unit and Constant

More information

Periphonic Sound Spatialization in Multi-User Virtual Environments

Periphonic Sound Spatialization in Multi-User Virtual Environments Periphonic Sound Spatialization in Multi-User Virtual Environments Florian Hollerweger corrected version, March 14 th 2006 Institute of Electronic Music and Acoustics (IEM), Center for Research in Electronic

More information

Phase Difference Reconstruction. Outline

Phase Difference Reconstruction. Outline Advanced MRI Phase Difference Reconstruction Faik Can MERAL Outline Introduction Quantitative Description Arctangent operation ATAN2 Phased-Array Multiple Coil Data Correction of Predictable Phase Errors

More information

Audio-coding standards

Audio-coding standards Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.

More information

Transaural acoustic crosstalk cancellation

Transaural acoustic crosstalk cancellation Transaural acoustic crosstalk cancellation by Alastair ibbald source C listener C A A C source C The cancellation of transaural acoustic crosstalk is an essential and critical feature of all head-related

More information

Single Particle Reconstruction Techniques

Single Particle Reconstruction Techniques T H E U N I V E R S I T Y of T E X A S S C H O O L O F H E A L T H I N F O R M A T I O N S C I E N C E S A T H O U S T O N Single Particle Reconstruction Techniques For students of HI 6001-125 Computational

More information

ELEC Dr Reji Mathew Electrical Engineering UNSW

ELEC Dr Reji Mathew Electrical Engineering UNSW ELEC 4622 Dr Reji Mathew Electrical Engineering UNSW Review of Motion Modelling and Estimation Introduction to Motion Modelling & Estimation Forward Motion Backward Motion Block Motion Estimation Motion

More information

Proposal and Evaluation of Mapping Hypermedia to 3D Sound Space

Proposal and Evaluation of Mapping Hypermedia to 3D Sound Space Proposal and Evaluation of Mapping Hypermedia to 3D Sound Space Radoslav Buranský Faculty of Mathematics, Physics and Informatics Comenuis University, Bratislava Abstract In this paper we present a new

More information

AUDIBLE AND INAUDIBLE EARLY REFLECTIONS: THRESHOLDS FOR AURALIZATION SYSTEM DESIGN

AUDIBLE AND INAUDIBLE EARLY REFLECTIONS: THRESHOLDS FOR AURALIZATION SYSTEM DESIGN AUDIBLE AND INAUDIBLE EARLY REFLECTIONS: THRESHOLDS FOR AURALIZATION SYSTEM DESIGN Durand R. Begault, Ph.D. San José State University Flight Management and Human Factors Research Division NASA Ames Research

More information

BINAURAL sound source localization (SSL) is to determine

BINAURAL sound source localization (SSL) is to determine 1 Binaural Sound Localization Based on Reverberation Weighting and Generalized Parametric Mapping Cheng Pang, Hong Liu, Jie Zhang and Xiaofei Li Abstract Binaural sound source localization is an important

More information

Image Filtering, Warping and Sampling

Image Filtering, Warping and Sampling Image Filtering, Warping and Sampling Connelly Barnes CS 4810 University of Virginia Acknowledgement: slides by Jason Lawrence, Misha Kazhdan, Allison Klein, Tom Funkhouser, Adam Finkelstein and David

More information

ROOM ACOUSTIC SIMULATION AND AURALIZATION HOW CLOSE CAN WE GET TO THE REAL ROOM?

ROOM ACOUSTIC SIMULATION AND AURALIZATION HOW CLOSE CAN WE GET TO THE REAL ROOM? WESPAC 8, The Eighth Western Pacific Acoustics Conference, Melbourne, 7-9 April 2003 (Keynote lecture) ROOM ACOUSTIC SIMULATION AND AURALIZATION HOW CLOSE CAN WE GET TO THE REAL ROOM? Manuscript Number

More information

MODIFIED IMDCT-DECODER BASED MP3 MULTICHANNEL AUDIO DECODING SYSTEM Shanmuga Raju.S 1, Karthik.R 2, Sai Pradeep.K.P 3, Varadharajan.

MODIFIED IMDCT-DECODER BASED MP3 MULTICHANNEL AUDIO DECODING SYSTEM Shanmuga Raju.S 1, Karthik.R 2, Sai Pradeep.K.P 3, Varadharajan. MODIFIED IMDCT-DECODER BASED MP3 MULTICHANNEL AUDIO DECODING SYSTEM Shanmuga Raju.S 1, Karthik.R 2, Sai Pradeep.K.P 3, Varadharajan.E 4 Assistant Professor, Dept. of ECE, Dr.NGP Institute of Technology,

More information

ENERGY-BASED BINAURAL ACOUSTIC MODELING. Natalie Agus, Hans, Anderson, Jer-Ming Chen, and Simon Lui Singapore University of Technology and Design

ENERGY-BASED BINAURAL ACOUSTIC MODELING. Natalie Agus, Hans, Anderson, Jer-Ming Chen, and Simon Lui Singapore University of Technology and Design ENERGY-BASED BINAURAL ACOUSTIC MODELING Natalie Agus, Hans, Anderson, Jer-Ming Chen, and Simon Lui Singapore University of Technology and Design Information Systems Technology and Design Technical Report

More information

ZOOM Ambisonics Player

ZOOM Ambisonics Player ZOOM Ambisonics Player Version 0 Operation Manual This document cannot be displayed properly on black-and-white displays. 08 ZOOM CORPORATION Copying or reprinting this manual in part or in whole without

More information

Optimal Positioning of a Binaural Sensor on a Humanoid Head for Sound Source Localization

Optimal Positioning of a Binaural Sensor on a Humanoid Head for Sound Source Localization Optimal Positioning of a Binaural Sensor on a Humanoid Head for Sound Source Localization Alain Skaf Patrick Danès CNRS ; LAAS ; 7 avenue du colonel Roche, F-31077 Toulouse Cedex 4, France Université de

More information

HRTF MAGNITUDE MODELING USING A NON-REGULARIZED LEAST-SQUARES FIT OF SPHERICAL HARMONICS COEFFICIENTS ON INCOMPLETE DATA

HRTF MAGNITUDE MODELING USING A NON-REGULARIZED LEAST-SQUARES FIT OF SPHERICAL HARMONICS COEFFICIENTS ON INCOMPLETE DATA HRTF MAGNITUDE MODELING USING A NON-REGULARIZED LEAST-SQUARES FIT OF SPHERICAL HARMONICS COEFFICIENTS ON INCOMPLETE DATA Jens Ahrens, Mark R. P. Thomas, Ivan Tashev Microsoft Research, One Microsoft Way,

More information

Automated Loudspeaker Balloon Response Measurement

Automated Loudspeaker Balloon Response Measurement AES San Francisco 2008 Exhibitor Seminar ES11 Saturday, October 4, 1:00 pm 2:00 pm Automated Loudspeaker Balloon Response Measurement Audiomatica SRL In this seminar we will analyze the whys and the how

More information

Speech and audio coding

Speech and audio coding Institut Mines-Telecom Speech and audio coding Marco Cagnazzo, cagnazzo@telecom-paristech.fr MN910 Advanced compression Outline Introduction Introduction Speech signal Music signal Masking Codeurs simples

More information

Online transmission of panoramic audio

Online transmission of panoramic audio Online transmission of panoramic audio SOPA project April 18, 2013 Contents 1 Introduction 2 2 Purpose of the project 2 3 Principles 4 3.1 Definition of Temporal HRTF.................. 4 3.2 Estimation

More information

I. INTRODUCTION A. Variation of the HRTF with elevation. Electronic mail:

I. INTRODUCTION A. Variation of the HRTF with elevation. Electronic mail: Approximating the head-related transfer function using simple geometric models of the head and torso V. Ralph Algazi and Richard O. Duda a) CIPIC, Center for Image Processing and Integrated Computing,

More information

Efficiënte audiocompressie gebaseerd op de perceptieve codering van ruimtelijk geluid

Efficiënte audiocompressie gebaseerd op de perceptieve codering van ruimtelijk geluid nederlands akoestisch genootschap NAG journaal nr. 184 november 2007 Efficiënte audiocompressie gebaseerd op de perceptieve codering van ruimtelijk geluid Philips Research High Tech Campus 36 M/S2 5656

More information

Automatic Selection of Compiler Options Using Non-parametric Inferential Statistics

Automatic Selection of Compiler Options Using Non-parametric Inferential Statistics Automatic Selection of Compiler Options Using Non-parametric Inferential Statistics Masayo Haneda Peter M.W. Knijnenburg Harry A.G. Wijshoff LIACS, Leiden University Motivation An optimal compiler optimization

More information

User manual HUA-503B

User manual HUA-503B User manual HUA-503B a) HUA-503(Bon conduction amplifier) + HUH-01(Bon conduction headphone) b) Charging cable, Audio connecting cable, Portable pouch c) Product pictures 1. Product configuration AMPHUA

More information