AUDIO SIGNAL PROCESSING FOR NEXT-GENERATION MULTIMEDIA COMMUNICATION SYSTEMS

Edited by
YITENG (ARDEN) HUANG, Bell Laboratories, Lucent Technologies
JACOB BENESTY, Université du Québec, INRS-EMT

Kluwer Academic Publishers, Boston/Dordrecht/London

Contents

Preface
Contributing Authors

1 Introduction
Yiteng (Arden) Huang, Jacob Benesty
Multimedia Communications; Challenges and Opportunities; Organization of the Book

Part I Speech Acquisition and Enhancement

2 Differential Microphone Arrays
Gary W. Elko
Introduction; Differential Microphone Arrays; Array Directional Gain; Optimal Arrays for Isotropic Fields; 4.1 Maximum Directional Gain; 4.2 Maximum Directivity Index for Differential Microphones; 4.3 Maximum Front-to-Back Ratio; 4.4 Minimum Peak Directional Response; 4.5 Beamwidth; Design Examples; 5.1 First-Order Designs; 5.2 Second-Order Designs; 5.3 Third-Order Designs; 5.4 Higher-Order Designs; Sensitivity to Microphone Mismatch and Noise; Conclusions

3 Spherical Microphone Arrays for 3D Sound Recording
Jens Meyer, Gary W. Elko
Introduction; Fundamental Concept; The Eigenbeamformer; Discrete Orthonormality; The Eigenbeams; The Modal Coefficients; Modal-Beamformer; Combining Unit; Steering Unit; Robustness Measure; Beampattern Design; Arbitrary Beampattern Design; Optimum Beampattern Design; Measurements; Summary; Appendix A

4 Subband Noise Reduction Methods for Speech Enhancement
Eric J. Diethorn
Introduction; Wiener Filtering; Speech Enhancement by Short-Time Spectral Modification; Short-Time Fourier Analysis and Synthesis; Short-Time Wiener Filter; Power Subtraction; Magnitude Subtraction; Parametric Wiener Filtering; Review and Discussion; Averaging Techniques for Envelope Estimation; Moving Average; Single-Pole Recursion; Two-Sided Single-Pole Recursion; Nonlinear Data Processing; Example Implementation; Subband Filter Bank Architecture; A-Posteriori-SNR Voice Activity Detector; Example; Conclusion

Part II Acoustic Echo Cancellation

5 Adaptive Algorithms for MIMO Acoustic Echo Cancellation
Jacob Benesty, Tomas Gänsler, Yiteng (Arden) Huang, Markus Rupp
Introduction; Normal Equations and Identification of a MIMO System; Normal Equations; The Nonuniqueness Problem; The Impulse Response Tail Effect; Some Different Solutions for Decorrelation; The Classical and Factorized Multichannel RLS; The Multichannel Fast RLS; The Multichannel LMS Algorithm; Classical Derivation; Improved Version; The Multichannel APA; The Straightforward Multichannel APA; The Improved Two-Channel APA; The Improved Multichannel APA; The Multichannel Exponentiated Gradient Algorithm; The Multichannel Frequency-domain Adaptive Algorithm; Conclusions

6 Double-Talk Detectors for Acoustic Echo Cancelers
Tomas Gänsler, Jacob Benesty
Introduction; Basics of AEC and DTD; AEC Notations; The Generic DTD; A Suggestion to Performance Evaluation of DTDs; Double-Talk Detection Algorithms; The Geigel Algorithm; The Cross-Correlation Method; The Normalized Cross-Correlation Method; The Coherence Method; The Normalized Cross-correlation Matrix; The Two-Path Model; DTD Combinations with Robust Statistics; Comparison of DTDs by Means of the ROC; Discussion

7 The WinEC: A Real-Time Hands-Free Stereo Communication System
Tomas Gänsler, Volker Fischer, Eric J. Diethorn, Jacob Benesty
Introduction; 1.1 Signal Model; System Description; The Audio Module; The Network Module; The Echo Canceler Module; Algorithms of the Echo Canceler Module; Adaptive Filter Algorithm; Residual Echo and Noise Suppression; Masking Threshold for Residual Echo in Noise; Analysis of Echo Suppression Requirements; Noise and Residual Echo Suppression; Simulations; Real-Time Tests with Different Modes of Operation; 6.1 Point-to-Point Communication; Multi-Point Communication; Transatlantic Teleconference in Stereo; Discussion

Part III Sound Source Tracking and Separation

8 Time Delay Estimation
Jingdong Chen, Yiteng (Arden) Huang, Jacob Benesty
Introduction; Signal Models; Ideal Propagation Model; Multipath Model; Reverberant Model; Generalized Cross-Correlation Method; The Multichannel Cross-Correlation Algorithm; Spatial Prediction Technique; Time Delay Estimation Using Spatial Prediction; Other Information from the Spatial Correlation Matrix; Adaptive Eigenvalue Decomposition Algorithm; Adaptive Multichannel Time Delay Estimation; Principle; Time-Domain Multichannel LMS Approach; Frequency-Domain Adaptive Algorithms; Experiments; Experimental Setup; Performance Measure; Experimental Results; Conclusions

9 Source Localization
Yiteng (Arden) Huang, Jacob Benesty, Gary W. Elko
Introduction; Source Localization Problem; Measurement Model and Cramér-Rao Lower Bound for Source Localization; Maximum Likelihood Estimator; Least Squares Estimators; The Least Squares Error Criteria; Spherical Intersection (SX) Estimator; Spherical Interpolation (SI) Estimator; Linear-Correction Least Squares Estimator; Example System Implementation; Source Localization Examples; Conclusions

10 Blind Source Separation for Convolutive Mixtures: A Unified Treatment
Herbert Büchner, Robert Aichner, Walter Kellermann
Introduction; Generic Block Time-Domain BSS Algorithm; Matrix Notation for Convolutive Mixtures; Cost Function and Algorithm Derivation; Equivariance Property and Natural Gradient; Special Cases and Links to Known Time-Domain Algorithms; Generic Frequency-Domain BSS Algorithm; General Frequency-Domain Formulation; Natural Gradient in the Frequency Domain; Special Cases and Links to Known Frequency-Domain Algorithms; Weighting Function; Off-line Implementation; On-line Implementation; Block-on-line Implementation; Experiments and Results; Conclusions

Part IV Audio Coding and Realistic Sound ...

11 Audio Coding
Gerald Schuller
Introduction; Psycho-Acoustics; Filter Banks; 3.1 Polyphase Formulation; 3.2 Modulated Filter Banks; 3.3 Block Switching; Current and Basic Coder Structures; Stereo Coding; Low Delay Audio Coding; Conclusions

12 Sound Field Synthesis
Sascha Spors, Heinz Teutsch, Achim Kuntz, Rudolf Rabenstein
Introduction; Rendering of Sound Fields with Wave Field Synthesis; Physical Foundation of Wave Field Synthesis; Wave Field Synthesis Based Sound Reproduction; Model-based and Data-Based Rendering; Data-Based Rendering; Model-Based Rendering; Hybrid Approach; Wave Field Analysis; Loudspeaker and Listening Room Compensation; Listening Room Compensation; Loudspeaker Compensation; Description of a Sound Field Transmission System; 6.1 Acquisition of Source Signals; Sound Stage Reproduction Using Wave Field Synthesis; Summary

13 Virtual Spatial Sound
Carlos Avendano
Introduction; 1.1 Scope; Spatial Hearing; Interaural Coordinate System; Interaural Differences; Spectral Cues; Distance Cues; Dynamic Cues; Acoustics of Spatial Sound; The HRTF; Room Acoustics; Virtual Spatial Sound Systems; HRTF Measurement; HRTF Modelling; Virtual Spatial Sound Rendering; Conclusions

Index


Design and Implementation of Small Microphone Arrays

Design and Implementation of Small Microphone Arrays Design and Implementation of Small Microphone Arrays for Acoustic and Speech Signal Processing Jingdong Chen and Jacob Benesty Northwestern Polytechnical University 127 Youyi West Road, Xi an, China jingdongchen@ieee.org

More information

1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends

1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends Contents List of Figures List of Tables Contributing Authors xiii xxi xxiii Introduction Karlheinz Brandenburg and Mark Kahrs xxix 1 Audio quality determination based on perceptual measurement techniques

More information

Optimum Array Processing

Optimum Array Processing Optimum Array Processing Part IV of Detection, Estimation, and Modulation Theory Harry L. Van Trees WILEY- INTERSCIENCE A JOHN WILEY & SONS, INC., PUBLICATION Preface xix 1 Introduction 1 1.1 Array Processing

More information

REAL-TIME DIGITAL SIGNAL PROCESSING

REAL-TIME DIGITAL SIGNAL PROCESSING REAL-TIME DIGITAL SIGNAL PROCESSING FUNDAMENTALS, IMPLEMENTATIONS AND APPLICATIONS Third Edition Sen M. Kuo Northern Illinois University, USA Bob H. Lee Ittiam Systems, Inc., USA Wenshun Tian Sonus Networks,

More information

Audio-coding standards

Audio-coding standards Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.

More information

Surrounded by High-Definition Sound

Surrounded by High-Definition Sound Surrounded by High-Definition Sound Dr. ChingShun Lin CSIE, NCU May 6th, 009 Introduction What is noise? Uncertain filters Introduction (Cont.) How loud is loud? (Audible: 0Hz - 0kHz) Introduction (Cont.)

More information

EE482: Digital Signal Processing Applications

EE482: Digital Signal Processing Applications Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 13 Audio Signal Processing 14/04/01 http://www.ee.unlv.edu/~b1morris/ee482/

More information

Digital Image Processing

Digital Image Processing Digital Image Processing Third Edition Rafael C. Gonzalez University of Tennessee Richard E. Woods MedData Interactive PEARSON Prentice Hall Pearson Education International Contents Preface xv Acknowledgments

More information

Audio-coding standards

Audio-coding standards Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.

More information

Numerical Robustness. The implementation of adaptive filtering algorithms on a digital computer, which inevitably operates using finite word-lengths,

Numerical Robustness. The implementation of adaptive filtering algorithms on a digital computer, which inevitably operates using finite word-lengths, 1. Introduction Adaptive filtering techniques are used in a wide range of applications, including echo cancellation, adaptive equalization, adaptive noise cancellation, and adaptive beamforming. These

More information

Adaptive Filters Algorithms (Part 2)

Adaptive Filters Algorithms (Part 2) Adaptive Filters Algorithms (Part 2) Gerhard Schmidt Christian-Albrechts-Universität zu Kiel Faculty of Engineering Electrical Engineering and Information Technology Digital Signal Processing and System

More information

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects. Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general

More information

Multichannel Affine and Fast Affine Projection Algorithms for Active Noise Control and Acoustic Equalization Systems

Multichannel Affine and Fast Affine Projection Algorithms for Active Noise Control and Acoustic Equalization Systems 54 IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 11, NO. 1, JANUARY 2003 Multichannel Affine and Fast Affine Projection Algorithms for Active Noise Control and Acoustic Equalization Systems Martin

More information

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general

More information

Optimized Variable Step Size Normalized LMS Adaptive Algorithm for Echo Cancellation

Optimized Variable Step Size Normalized LMS Adaptive Algorithm for Echo Cancellation International Journal on Recent and Innovation Trends in Computing and Communication ISSN: 3-869 Optimized Variable Step Size Normalized LMS Adaptive Algorithm for Echo Cancellation Deman Kosale, H.R.

More information

System Identification Related Problems at

System Identification Related Problems at media Technologies @ Ericsson research (New organization Taking Form) System Identification Related Problems at MT@ER Erlendur Karlsson, PhD 1 Outline Ericsson Publications and Blogs System Identification

More information

MODIFIED IMDCT-DECODER BASED MP3 MULTICHANNEL AUDIO DECODING SYSTEM Shanmuga Raju.S 1, Karthik.R 2, Sai Pradeep.K.P 3, Varadharajan.

MODIFIED IMDCT-DECODER BASED MP3 MULTICHANNEL AUDIO DECODING SYSTEM Shanmuga Raju.S 1, Karthik.R 2, Sai Pradeep.K.P 3, Varadharajan. MODIFIED IMDCT-DECODER BASED MP3 MULTICHANNEL AUDIO DECODING SYSTEM Shanmuga Raju.S 1, Karthik.R 2, Sai Pradeep.K.P 3, Varadharajan.E 4 Assistant Professor, Dept. of ECE, Dr.NGP Institute of Technology,

More information

SSL for Circular Arrays of Mics

SSL for Circular Arrays of Mics SSL for Circular Arrays of Mics Yong Rui, Dinei Florêncio, Warren Lam, and Jinyan Su Microsoft Research ABSTRACT Circular arrays are of particular interest for a number of scenarios, particularly because

More information

Audio Coding Standards

Audio Coding Standards Audio Standards Kari Pihkala 13.2.2002 Tik-111.590 Multimedia Outline Architectural Overview MPEG-1 MPEG-2 MPEG-4 Philips PASC (DCC cassette) Sony ATRAC (MiniDisc) Dolby AC-3 Conclusions 2 Architectural

More information

IMAGE ANALYSIS, CLASSIFICATION, and CHANGE DETECTION in REMOTE SENSING

IMAGE ANALYSIS, CLASSIFICATION, and CHANGE DETECTION in REMOTE SENSING SECOND EDITION IMAGE ANALYSIS, CLASSIFICATION, and CHANGE DETECTION in REMOTE SENSING ith Algorithms for ENVI/IDL Morton J. Canty с*' Q\ CRC Press Taylor &. Francis Group Boca Raton London New York CRC

More information

Appendix A Auxiliary MATLAB Functions

Appendix A Auxiliary MATLAB Functions Appendix A Auxiliary MATLAB Functions Listing A.1 Function for computing the joint diagonalization. 1 function [X,D]=jeig(A,B,srtstr); L=chol(B,'lower'); 3 G=inv(L); 4 C=G*A*G'; 5 [Q,D]=schur(C); 6 X=G'*Q;

More information

Epipolar Geometry in Stereo, Motion and Object Recognition

Epipolar Geometry in Stereo, Motion and Object Recognition Epipolar Geometry in Stereo, Motion and Object Recognition A Unified Approach by GangXu Department of Computer Science, Ritsumeikan University, Kusatsu, Japan and Zhengyou Zhang INRIA Sophia-Antipolis,

More information

Adaptive Filtering using Steepest Descent and LMS Algorithm

Adaptive Filtering using Steepest Descent and LMS Algorithm IJSTE - International Journal of Science Technology & Engineering Volume 2 Issue 4 October 2015 ISSN (online): 2349-784X Adaptive Filtering using Steepest Descent and LMS Algorithm Akash Sawant Mukesh

More information

Digital Signal Processing with Field Programmable Gate Arrays

Digital Signal Processing with Field Programmable Gate Arrays Uwe Meyer-Baese Digital Signal Processing with Field Programmable Gate Arrays Third Edition With 359 Figures and 98 Tables Book with CD-ROM ei Springer Contents Preface Preface to Second Edition Preface

More information

Adaptive System Identification and Signal Processing Algorithms

Adaptive System Identification and Signal Processing Algorithms Adaptive System Identification and Signal Processing Algorithms edited by N. Kalouptsidis University of Athens S. Theodoridis University of Patras Prentice Hall New York London Toronto Sydney Tokyo Singapore

More information

Chapter 5.5 Audio Programming

Chapter 5.5 Audio Programming Chapter 5.5 Audio Programming Audio Programming Audio in games is more important than ever before 2 Programming Basic Audio Most gaming hardware has similar capabilities (on similar platforms) Mostly programming

More information

Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio

Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio Dr. Jürgen Herre 11/07 Page 1 Jürgen Herre für (IIS) Erlangen, Germany Introduction: Sound Images? Humans

More information

Optical Storage Technology. MPEG Data Compression

Optical Storage Technology. MPEG Data Compression Optical Storage Technology MPEG Data Compression MPEG-1 1 Audio Standard Moving Pictures Expert Group (MPEG) was formed in 1988 to devise compression techniques for audio and video. It first devised the

More information

System Identification Related Problems at SMN

System Identification Related Problems at SMN Ericsson research SeRvices, MulTimedia and Network Features System Identification Related Problems at SMN Erlendur Karlsson SysId Related Problems @ ER/SMN Ericsson External 2016-05-09 Page 1 Outline Research

More information

3.5 Filtering with the 2D Fourier Transform Basic Low Pass and High Pass Filtering using 2D DFT Other Low Pass Filters

3.5 Filtering with the 2D Fourier Transform Basic Low Pass and High Pass Filtering using 2D DFT Other Low Pass Filters Contents Part I Decomposition and Recovery. Images 1 Filter Banks... 3 1.1 Introduction... 3 1.2 Filter Banks and Multirate Systems... 4 1.2.1 Discrete Fourier Transforms... 5 1.2.2 Modulated Filter Banks...

More information

Telecommunications Engineering Course Descriptions

Telecommunications Engineering Course Descriptions Telecommunications Engineering Course Descriptions Electrical Engineering Courses EE 5305 Radio Frequency Engineering (3 semester hours) Introduction to generation, transmission, and radiation of electromagnetic

More information

System Identification

System Identification System Identification D R. T A R E K A. T U T U N J I A D V A N C E D M O D E L I N G A N D S I M U L A T I O N M E C H A T R O N I C S E N G I N E E R I N G D E P A R T M E N T P H I L A D E L P H I A

More information

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories,

More information

Metrics for performance assessment of mixed-order Ambisonics spherical microphone arrays

Metrics for performance assessment of mixed-order Ambisonics spherical microphone arrays Downloaded from orbit.dtu.dk on: Oct 6, 28 Metrics for performance assessment of mixed-order Ambisonics spherical microphone arrays Favrot, Sylvain Emmanuel; Marschall, Marton Published in: Proceedings

More information

Spherical Microphone Arrays

Spherical Microphone Arrays Spherical Microphone Arrays Acoustic Wave Equation Helmholtz Equation Assuming the solutions of wave equation are time harmonic waves of frequency ω satisfies the homogeneous Helmholtz equation: Boundary

More information

System Identification Related Problems at SMN

System Identification Related Problems at SMN Ericsson research SeRvices, MulTimedia and Networks System Identification Related Problems at SMN Erlendur Karlsson SysId Related Problems @ ER/SMN Ericsson External 2015-04-28 Page 1 Outline Research

More information

Application of Linux Audio in Hearing Aid Research

Application of Linux Audio in Hearing Aid Research Application of Linux Audio in Hearing Aid Research Giso Grimm 1 Tobias Herzke 2 Volker Hohmann 2 1 Universität Oldenburg, Oldenburg, Germany 2 HörTech ggmbh, Oldenburg, Germany Linux Audio Conference,

More information

ANALYSIS OF GEOPHYSICAL POTENTIAL FIELDS A Digital Signal Processing Approach

ANALYSIS OF GEOPHYSICAL POTENTIAL FIELDS A Digital Signal Processing Approach ADVANCES IN EXPLORATION GEOPHYSICS 5 ANALYSIS OF GEOPHYSICAL POTENTIAL FIELDS A Digital Signal Processing Approach PRABHAKAR S. NAIDU Indian Institute of Science, Bangalore 560012, India AND M.P. MATHEW

More information

Passive Differential Matched-field Depth Estimation of Moving Acoustic Sources

Passive Differential Matched-field Depth Estimation of Moving Acoustic Sources Lincoln Laboratory ASAP-2001 Workshop Passive Differential Matched-field Depth Estimation of Moving Acoustic Sources Shawn Kraut and Jeffrey Krolik Duke University Department of Electrical and Computer

More information

A Wavelet Tour of Signal Processing The Sparse Way

A Wavelet Tour of Signal Processing The Sparse Way A Wavelet Tour of Signal Processing The Sparse Way Stephane Mallat with contributions from Gabriel Peyre AMSTERDAM BOSTON HEIDELBERG LONDON NEWYORK OXFORD PARIS SAN DIEGO SAN FRANCISCO SINGAPORE SYDNEY»TOKYO

More information

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd Overview Audio Signal Processing Applications @ Dolby Audio Signal Processing Basics

More information

Inverse Structure for Active Noise Control and Combined Active Noise Control/Sound Reproduction Systems

Inverse Structure for Active Noise Control and Combined Active Noise Control/Sound Reproduction Systems IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 9, NO. 2, FEBRUARY 2001 141 Inverse Structure for Active Noise Control and Combined Active Noise Control/Sound Reproduction Systems Martin Bouchard,

More information

A NEW DCT-BASED WATERMARKING METHOD FOR COPYRIGHT PROTECTION OF DIGITAL AUDIO

A NEW DCT-BASED WATERMARKING METHOD FOR COPYRIGHT PROTECTION OF DIGITAL AUDIO International journal of computer science & information Technology (IJCSIT) Vol., No.5, October A NEW DCT-BASED WATERMARKING METHOD FOR COPYRIGHT PROTECTION OF DIGITAL AUDIO Pranab Kumar Dhar *, Mohammad

More information

Fundamentals of Digital Image Processing

Fundamentals of Digital Image Processing \L\.6 Gw.i Fundamentals of Digital Image Processing A Practical Approach with Examples in Matlab Chris Solomon School of Physical Sciences, University of Kent, Canterbury, UK Toby Breckon School of Engineering,

More information

APPLYING EXTRAPOLATION AND INTERPOLATION METHODS TO MEASURED AND SIMULATED HRTF DATA USING SPHERICAL HARMONIC DECOMPOSITION.

APPLYING EXTRAPOLATION AND INTERPOLATION METHODS TO MEASURED AND SIMULATED HRTF DATA USING SPHERICAL HARMONIC DECOMPOSITION. APPLYING EXTRAPOLATION AND INTERPOLATION METHODS TO MEASURED AND SIMULATED HRTF DATA USING SPHERICAL HARMONIC DECOMPOSITION Martin Pollow Institute of Technical Acoustics RWTH Aachen University Neustraße

More information

Mpeg 1 layer 3 (mp3) general overview

Mpeg 1 layer 3 (mp3) general overview Mpeg 1 layer 3 (mp3) general overview 1 Digital Audio! CD Audio:! 16 bit encoding! 2 Channels (Stereo)! 44.1 khz sampling rate 2 * 44.1 khz * 16 bits = 1.41 Mb/s + Overhead (synchronization, error correction,

More information

THE PERFORMANCE of automatic speech recognition

THE PERFORMANCE of automatic speech recognition IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 14, NO. 6, NOVEMBER 2006 2109 Subband Likelihood-Maximizing Beamforming for Speech Recognition in Reverberant Environments Michael L. Seltzer,

More information

Real-Time Semi-Blind Speech Extraction with Speaker Direction Tracking on Kinect

Real-Time Semi-Blind Speech Extraction with Speaker Direction Tracking on Kinect Real-Time Semi-Blind Speech Extraction with Speaker Direction Tracking on Kinect Yuji Onuma, Noriyoshi Kamado, Hiroshi Saruwatari, Kiyohiro Shikano Nara Institute of Science and Technology, Graduate School

More information

On the Minimum l p. 193 On the Strong Uniqueness of Highly Sparse Representations from Redundant Dictionaries

On the Minimum l p. 193 On the Strong Uniqueness of Highly Sparse Representations from Redundant Dictionaries Theory and Fundamentals A FastICA Algorithm for Non-negative Independent Component Analysis p. 1 Blind Source Separation by Adaptive Estimation of Score Function Difference p. 9 Exploiting Spatiotemporal

More information

Parametric Coding of High-Quality Audio

Parametric Coding of High-Quality Audio Parametric Coding of High-Quality Audio Prof. Dr. Gerald Schuller Fraunhofer IDMT & Ilmenau Technical University Ilmenau, Germany 1 Waveform vs Parametric Waveform Filter-bank approach Mainly exploits

More information

Image Analysis, Classification and Change Detection in Remote Sensing

Image Analysis, Classification and Change Detection in Remote Sensing Image Analysis, Classification and Change Detection in Remote Sensing WITH ALGORITHMS FOR ENVI/IDL Morton J. Canty Taylor &. Francis Taylor & Francis Group Boca Raton London New York CRC is an imprint

More information

Hardware Implementation for the Echo Canceller System based Subband Technique using TMS320C6713 DSP Kit

Hardware Implementation for the Echo Canceller System based Subband Technique using TMS320C6713 DSP Kit Hardware Implementation for the Echo Canceller System based Subband Technique using TMS3C6713 DSP Kit Mahmod. A. Al Zubaidy Ninevah University Mosul, Iraq Sura Z. Thanoon (MSE student) School of Electronics

More information

Image denoising in the wavelet domain using Improved Neigh-shrink

Image denoising in the wavelet domain using Improved Neigh-shrink Image denoising in the wavelet domain using Improved Neigh-shrink Rahim Kamran 1, Mehdi Nasri, Hossein Nezamabadi-pour 3, Saeid Saryazdi 4 1 Rahimkamran008@gmail.com nasri_me@yahoo.com 3 nezam@uk.ac.ir

More information

AFMG. EASE Seminar September 17 th to 21 st 2018, Berlin, Germany. Agenda. Software-Engineering Research Development

AFMG. EASE Seminar September 17 th to 21 st 2018, Berlin, Germany. Agenda. Software-Engineering Research Development EASE Seminar September 17 th to 21 st 2018, Berlin, Instructors: Emad Yacoub Hanna Language: English Hours: 09:00-17:00 (please be there at 08:45) EASE Seminars are split into two levels with Level 1 (entry

More information

Room Acoustics. CMSC 828D / Spring 2006 Lecture 20

Room Acoustics. CMSC 828D / Spring 2006 Lecture 20 Room Acoustics CMSC 828D / Spring 2006 Lecture 20 Lecture Plan Room acoustics basics Structure of room impulse response Characterization of room acoustics Modeling of reverberant response Basics All our

More information

BINAURAL SOUND LOCALIZATION FOR UNTRAINED DIRECTIONS BASED ON A GAUSSIAN MIXTURE MODEL

BINAURAL SOUND LOCALIZATION FOR UNTRAINED DIRECTIONS BASED ON A GAUSSIAN MIXTURE MODEL BINAURAL SOUND LOCALIZATION FOR UNTRAINED DIRECTIONS BASED ON A GAUSSIAN MIXTURE MODEL Takanori Nishino and Kazuya Takeda Center for Information Media Studies, Nagoya University Furo-cho, Chikusa-ku, Nagoya,

More information

Dr Andrew Abel University of Stirling, Scotland

Dr Andrew Abel University of Stirling, Scotland Dr Andrew Abel University of Stirling, Scotland University of Stirling - Scotland Cognitive Signal Image and Control Processing Research (COSIPRA) Cognitive Computation neurobiology, cognitive psychology

More information

INTRODUCTION. Model: Deconvolve a 2-D field of random numbers with a simple dip filter, leading to a plane-wave model.

INTRODUCTION. Model: Deconvolve a 2-D field of random numbers with a simple dip filter, leading to a plane-wave model. Stanford Exploration Project, Report 105, September 5, 2000, pages 109 123 Short Note Test case for PEF estimation with sparse data II Morgan Brown, Jon Claerbout, and Sergey Fomel 1 INTRODUCTION The two-stage

More information

CLASSIFICATION AND CHANGE DETECTION

CLASSIFICATION AND CHANGE DETECTION IMAGE ANALYSIS, CLASSIFICATION AND CHANGE DETECTION IN REMOTE SENSING With Algorithms for ENVI/IDL and Python THIRD EDITION Morton J. Canty CRC Press Taylor & Francis Group Boca Raton London NewYork CRC

More information

Parametric Coding of Spatial Audio

Parametric Coding of Spatial Audio Parametric Coding of Spatial Audio Ph.D. Thesis Christof Faller, September 24, 2004 Thesis advisor: Prof. Martin Vetterli Audiovisual Communications Laboratory, EPFL Lausanne Parametric Coding of Spatial

More information

A GEOMETRICAL APPROACH TO ROOM COMPENSATION FOR SOUND FIELD RENDERING APPLICATIONS

A GEOMETRICAL APPROACH TO ROOM COMPENSATION FOR SOUND FIELD RENDERING APPLICATIONS A GEOMETRICAL APPROACH TO ROOM COMPENSATION FOR SOUND FIELD RENDERING APPLICATIONS A. Canclini, D. Marković, L. Bianchi, F. Antonacci, A. Sarti, S. Tubaro Dipartimento di Elettronica, Informazione e Bioingegneria

More information

Modeling of Pinna Related Transfer Functions (PRTF) using the Finite Element Method (FEM)

Modeling of Pinna Related Transfer Functions (PRTF) using the Finite Element Method (FEM) Modeling of Pinna Related Transfer Functions (PRTF) using the Finite Element Method (FEM) Manan Joshi Navarun Gupta, Ph. D. Lawrence Hmurcik, Ph. D. University of Bridgeport, Bridgeport, CT Objective Measure

More information

Multichannel Recursive-Least-Squares Algorithms and Fast-Transversal-Filter Algorithms for Active Noise Control and Sound Reproduction Systems

Multichannel Recursive-Least-Squares Algorithms and Fast-Transversal-Filter Algorithms for Active Noise Control and Sound Reproduction Systems 606 IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL 8, NO 5, SEPTEMBER 2000 Multichannel Recursive-Least-Squares Algorithms and Fast-Transversal-Filter Algorithms for Active Noise Control and Sound

More information

EASE Seminar Entry Level & Advanced Level

EASE Seminar Entry Level & Advanced Level EASE Seminar Entry Level & Advanced Level This is a general overview of our regular EASE Trainings. Please be aware that this document contains information on both levels we offer. Make sure which one

More information

MEDICAL IMAGE ANALYSIS

MEDICAL IMAGE ANALYSIS SECOND EDITION MEDICAL IMAGE ANALYSIS ATAM P. DHAWAN g, A B IEEE Engineering in Medicine and Biology Society, Sponsor IEEE Press Series in Biomedical Engineering Metin Akay, Series Editor +IEEE IEEE PRESS

More information

All MSEE students are required to take the following two core courses: Linear systems Probability and Random Processes

All MSEE students are required to take the following two core courses: Linear systems Probability and Random Processes MSEE Curriculum All MSEE students are required to take the following two core courses: 3531-571 Linear systems 3531-507 Probability and Random Processes The course requirements for students majoring in

More information

Appendix 4. Audio coding algorithms

Appendix 4. Audio coding algorithms Appendix 4. Audio coding algorithms 1 Introduction The main application of audio compression systems is to obtain compact digital representations of high-quality (CD-quality) wideband audio signals. Typically

More information

Outline 7/2/201011/6/

Outline Pattern recognition in computer vision Background on the development of SIFT SIFT algorithm and some of its variations Computational considerations (SURF) Potential improvement Summary 01 2 Pattern

Contents. 3 Vector Quantization The VQ Advantage Formulation Optimality Conditions... 48

Contents Part I Prelude 1 Introduction... 3 1.1 Audio Coding... 4 1.2 Basic Idea... 6 1.3 Perceptual Irrelevance... 8 1.4 Statistical Redundancy... 9 1.5 Data Modeling... 9 1.6 Resolution Challenge...

NOVEL TECHNIQUES AND ARCHITECTURES FOR ADAPTIVE BEAMFORMING

By THUA VAN HO, B.A.Sc, M.A.Sc A Thesis Submitted to the School of Graduate Studies in Partial Fulfillment of the Requirements for the Degree

Standard Codecs. Image compression to advanced video coding. Mohammed Ghanbari. 3rd Edition. The Institution of Engineering and Technology

Standard Codecs Image compression to advanced video coding 3rd Edition Mohammed Ghanbari The Institution of Engineering and Technology Contents Preface to first edition Preface to second edition Preface

Contents. I Basics 1. Copyright by SIAM. Unauthorized reproduction of this article is prohibited.

page v Preface xiii I Basics 1 1 Optimization Models 3 1.1 Introduction... 3 1.2 Optimization: An Informal Introduction... 4 1.3 Linear Equations... 7 1.4 Linear Optimization... 10 Exercises... 12 1.5

Horizontal plane HRTF reproduction using continuous Fourier-Bessel functions

Wen Zhang,2, Thushara D. Abhayapala,2, Rodney A. Kennedy Department of Information Engineering, Research School of Information

Distributed Signal Processing for Binaural Hearing Aids

Olivier Roy LCAV - I&C - EPFL Joint work with Martin Vetterli July 24, 2008 Outline 1 Motivations 2 Information-theoretic Analysis 3 Example: Distributed

Digital Sound Ming C. Lin & Zhimin Ren

Department of Computer Science University of North Carolina http://gamma.cs.unc.edu/sound How can it be done? Foley artists manually make and record the sound from

COMPUTER AND ROBOT VISION

VOLUME COMPUTER AND ROBOT VISION Robert M. Haralick University of Washington Linda G. Shapiro University of Washington ADDISON-WESLEY PUBLISHING COMPANY Reading, Massachusetts Menlo Park, California

Robust Adaptive CRLS-GSC Algorithm for DOA Mismatch in Microphone Array

P. Mowlaee Begzade Mahale Department of Electrical Engineering Amirkabir University of Technology Tehran, Iran 15875-4413 P Mowlaee@ieee.org,

Reverberation design based on acoustic parameters for reflective audio-spot system with parametric and dynamic loudspeaker

PROCEEDINGS of the 22nd International Congress on Acoustics Signal Processing Acoustics: Paper ICA 2016-310 Reverberation design based on acoustic parameters for reflective audio-spot system with parametric

DSP-CIS. Part-IV : Filter Banks & Subband Systems. Chapter-10 : Filter Bank Preliminaries. Marc Moonen

DSP-CIS Part-IV Filter Banks & Subband Systems Chapter-10 Filter Bank Preliminaries Marc Moonen Dept. E.E./ESAT-STADIUS, KU Leuven marc.moonen@esat.kuleuven.be www.esat.kuleuven.be/stadius/ Part-III Filter

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved.

Compressed Audio Demystified Why Music Producers Need to Care About Compressed Audio Files Download Sales Up CD Sales Down High-Definition hasn't caught on yet Consumers don't seem to care about high fidelity

Image Transformation Techniques Dr. Rajeev Srivastava Dept. of Computer Engineering, ITBHU, Varanasi

1. Introduction The choice of a particular transform in a given application depends on the amount of

New Results in Low Bit Rate Speech Coding and Bandwidth Extension

Audio Engineering Society Convention Paper Presented at the 121st Convention 2006 October 5-8 San Francisco, CA, USA This convention paper has been reproduced from the author's advance manuscript, without

Squeeze Play: The State of Ady0 Cmprshn. Scott Selfon Senior Development Lead Xbox Advanced Technology Group Microsoft

Agenda Why compress? The tools at present Measuring success A glimpse of the future

Contents. I The Basic Framework for Stationary Problems 1

page v Preface xiii I The Basic Framework for Stationary Problems 1 1 Some model PDEs 3 1.1 Laplace's equation; elliptic BVPs... 3 1.1.1 Physical experiments modeled by Laplace's equation... 5 1.2 Other

Image Processing, Analysis and Machine Vision

Milan Sonka PhD University of Iowa Iowa City, USA Vaclav Hlavac PhD Czech Technical University Prague, Czech Republic and Roger Boyle DPhil, MBCS, CEng University

KINGS COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY ACADEMIC YEAR / ODD SEMESTER QUESTION BANK

KINGS COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY ACADEMIC YEAR 2011-2012 / ODD SEMESTER QUESTION BANK SUB.CODE / NAME YEAR / SEM : IT1301 INFORMATION CODING TECHNIQUES : III / V UNIT -

5: Music Compression. Music Coding. Mark Handley

5: Music Compression Mark Handley Music Coding LPC-based codecs model the sound source to achieve good compression. Works well for voice. Terrible for music. What if you can't model the source? Model the

Modelling, Auralization and Acoustic Virtual Reality ERIK MOLIN

Overview Auralization Overview & motivation Audio sources Room models Receiver modelling Auralization what and why? For a given space, sound

SYDE 575: Introduction to Image Processing

Image Enhancement and Restoration in Spatial Domain Chapter 3 Spatial Filtering Recall 2D discrete convolution g[m, n] = f[m, n] * h[m, n] = Σ f[i, j] h[m - i,

Interference Reduction in Reverberant Speech Separation With Visual Voice Activity Detection

1610 IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 16, NO. 6, OCTOBER 2014 Interference Reduction in Reverberant Speech Separation With Visual Voice Activity Detection Qingju Liu, Andrew J. Aubrey, Member, IEEE,

Evaluation of a new Ambisonic decoder for irregular loudspeaker arrays using interaural cues

3rd International Symposium on Ambisonics & Spherical Acoustics@Lexington, Kentucky, USA, 2nd June 2011 Evaluation of a new Ambisonic decoder for irregular loudspeaker arrays using interaural cues J. Treviño

Collaborative Sparsity and Compressive MRI

Modeling and Computation Seminar February 14, 2013 Table of Contents 1 T2 Estimation 2 Undersampling in MRI 3 Compressed Sensing 4 Model-Based Approach 5 From L1 to L0 6 Spatially Adaptive Sparsity MRI

Dietrich Paulus Joachim Hornegger. Pattern Recognition of Images and Speech in C++

Dietrich Paulus Joachim Hornegger Pattern Recognition of Images and Speech in C++ To Dorothea, Belinda, and Dominik In the text we use the following names which are protected, trademarks owned by a company

Image Denoising Based on Hybrid Fourier and Neighborhood Wavelet Coefficients Jun Cheng, Songli Lei

College of Physical and Information Science, Hunan Normal University, Changsha, China Hunan Art Professional

Hybrid Speech Synthesis

Simon King Centre for Speech Technology Research University of Edinburgh 2 What are you going to learn? Another recap of unit selection let's properly understand the Acoustic Space

Introduction to HRTFs

http://www.umiacs.umd.edu/users/ramani ramani@umiacs.umd.edu How do we perceive sound location? Initial idea: Measure attributes of received sound at the two ears Compare sound received

Spectral modeling of musical sounds

Xavier Serra Audiovisual Institute, Pompeu Fabra University http://www.iua.upf.es xserra@iua.upf.es 1. Introduction Spectral based analysis/synthesis techniques offer

LARGE SCALE LINEAR AND INTEGER OPTIMIZATION: A UNIFIED APPROACH

Richard Kipp Martin Graduate School of Business University of Chicago Kluwer Academic Publishers Boston/Dordrecht/London CONTENTS Preface

Audio Coding and MP3

contributions by: Torbjørn Ekman What is Sound? Sound waves: 20Hz - 20kHz Speed: 331.3 m/s (air) Wavelength: 165 cm - 1.65 cm 1 Analogue audio frequencies: 20Hz - 20kHz mono: x(t)