AUDIO SIGNAL PROCESSING FOR NEXT-GENERATION MULTIMEDIA COMMUNICATION SYSTEMS
- Conrad Wilkins
Edited by
YITENG (ARDEN) HUANG, Bell Laboratories, Lucent Technologies
JACOB BENESTY, Université du Québec, INRS-EMT

Kluwer Academic Publishers
Boston/Dordrecht/London
Contents

Preface
Contributing Authors

1 Introduction
  Yiteng (Arden) Huang, Jacob Benesty
    1. Multimedia Communications
    2. Challenges and Opportunities
    3. Organization of the Book

Part I  Speech Acquisition and Enhancement

2 Differential Microphone Arrays
  Gary W. Elko
    Introduction
    Differential Microphone Arrays
    Array Directional Gain
    Optimal Arrays for Isotropic Fields
      4.1 Maximum Directional Gain
      4.2 Maximum Directivity Index for Differential Microphones
      4.3 Maximum Front-to-Back Ratio
      4.4 Minimum Peak Directional Response
      4.5 Beamwidth
    Design Examples
      5.1 First-Order Designs
      5.2 Second-Order Designs
      5.3 Third-Order Designs
      5.4 Higher-Order Designs
    Sensitivity to Microphone Mismatch and Noise
    Conclusions

3 Spherical Microphone Arrays for 3D Sound Recording
  Jens Meyer, Gary W. Elko
    1. Introduction
    Fundamental Concept
    The Eigenbeamformer
    Discrete Orthonormality
    The Eigenbeams
    The Modal Coefficients
    Modal-Beamformer
    Combining Unit
    Steering Unit
    Robustness Measure
    Beampattern Design
    Arbitrary Beampattern Design
    Optimum Beampattern Design
    Measurements
    Summary
    Appendix A

4 Subband Noise Reduction Methods for Speech Enhancement
  Eric J. Diethorn
    1. Introduction
    Wiener Filtering
    Speech Enhancement by Short-Time Spectral Modification
    Short-Time Fourier Analysis and Synthesis
    Short-Time Wiener Filter
    Power Subtraction
    Magnitude Subtraction
    Parametric Wiener Filtering
    Review and Discussion
    Averaging Techniques for Envelope Estimation
    Moving Average
    Single-Pole Recursion
    Two-Sided Single-Pole Recursion
    Nonlinear Data Processing
    Example Implementation
    Subband Filter Bank Architecture
    A-Posteriori-SNR Voice Activity Detector
    Example
    Conclusion

Part II  Acoustic Echo Cancellation

5 Adaptive Algorithms for MIMO Acoustic Echo Cancellation
  Jacob Benesty, Tomas Gänsler, Yiteng (Arden) Huang, Markus Rupp
    1. Introduction
    Normal Equations and Identification of a MIMO System
      Normal Equations
      2.2 The Nonuniqueness Problem
    The Impulse Response Tail Effect
    Some Different Solutions for Decorrelation
    The Classical and Factorized Multichannel RLS
    The Multichannel Fast RLS
    The Multichannel LMS Algorithm
    Classical Derivation
    Improved Version
    The Multichannel APA
    The Straightforward Multichannel APA
    The Improved Two-Channel APA
    The Improved Multichannel APA
    The Multichannel Exponentiated Gradient Algorithm
    The Multichannel Frequency-Domain Adaptive Algorithm
    Conclusions

6 Double-Talk Detectors for Acoustic Echo Cancelers
  Tomas Gänsler, Jacob Benesty
    1. Introduction
    Basics of AEC and DTD
    AEC Notations
    The Generic DTD
    A Suggestion to Performance Evaluation of DTDs
    Double-Talk Detection Algorithms
    The Geigel Algorithm
    The Cross-Correlation Method
    The Normalized Cross-Correlation Method
    The Coherence Method
    The Normalized Cross-Correlation Matrix
    The Two-Path Model
    DTD Combinations with Robust Statistics
    Comparison of DTDs by Means of the ROC
    Discussion

7 The WinEC: A Real-Time Hands-Free Stereo Communication System
  Tomas Gänsler, Volker Fischer, Eric J. Diethorn, Jacob Benesty
    1. Introduction
      1.1 Signal Model
    System Description
    The Audio Module
    The Network Module
    The Echo Canceler Module
    Algorithms of the Echo Canceler Module
    Adaptive Filter Algorithm
    Residual Echo and Noise Suppression
    Masking Threshold for Residual Echo in Noise
    Analysis of Echo Suppression Requirements
    Noise and Residual Echo Suppression
    Simulations
    Real-Time Tests with Different Modes of Operation
      6.1 Point-to-Point Communication
      Multi-Point Communication
      Transatlantic Teleconference in Stereo
    Discussion

Part III  Sound Source Tracking and Separation

8 Time Delay Estimation
  Jingdong Chen, Yiteng (Arden) Huang, Jacob Benesty
    1. Introduction
    Signal Models
    Ideal Propagation Model
    Multipath Model
    Reverberant Model
    Generalized Cross-Correlation Method
    The Multichannel Cross-Correlation Algorithm
    Spatial Prediction Technique
    Time Delay Estimation Using Spatial Prediction
    Other Information from the Spatial Correlation Matrix
    Adaptive Eigenvalue Decomposition Algorithm
    Adaptive Multichannel Time Delay Estimation
    Principle
    Time-Domain Multichannel LMS Approach
    Frequency-Domain Adaptive Algorithms
    Experiments
    Experimental Setup
    Performance Measure
    Experimental Results
    Conclusions

9 Source Localization
  Yiteng (Arden) Huang, Jacob Benesty, Gary W. Elko
    1. Introduction
    Source Localization Problem
    Measurement Model and Cramér-Rao Lower Bound for Source Localization
    Maximum Likelihood Estimator
    Least Squares Estimators
    The Least Squares Error Criteria
    Spherical Intersection (SX) Estimator
    Spherical Interpolation (SI) Estimator
    Linear-Correction Least Squares Estimator
    Example System Implementation
    Source Localization Examples
    Conclusions

10 Blind Source Separation for Convolutive Mixtures: A Unified Treatment
  Herbert Büchner, Robert Aichner, Walter Kellermann
    1. Introduction
    Generic Block Time-Domain BSS Algorithm
    Matrix Notation for Convolutive Mixtures
    Cost Function and Algorithm Derivation
    Equivariance Property and Natural Gradient
    Special Cases and Links to Known Time-Domain Algorithms
    Generic Frequency-Domain BSS Algorithm
    General Frequency-Domain Formulation
    Natural Gradient in the Frequency Domain
    Special Cases and Links to Known Frequency-Domain Algorithms
    Weighting Function
    Off-line Implementation
    On-line Implementation
    Block-on-line Implementation
    Experiments and Results
    Conclusions

Part IV  Audio Coding and Realistic Soun...

11 Audio Coding
  Gerald Schuller
    Introduction
    Psycho-Acoustics
    Filter Banks
      3.1 Polyphase Formulation
      3.2 Modulated Filter Banks
      3.3 Block Switching
    Current and Basic Coder Structures
    Stereo Coding
    Low Delay Audio Coding
    Conclusions

12 Sound Field Synthesis
  Sascha Spors, Heinz Teutsch, Achim Kuntz, Rudolf Rabenstein
    1. Introduction
    Rendering of Sound Fields with Wave Field Synthesis
    Physical Foundation of Wave Field Synthesis
    Wave Field Synthesis Based Sound Reproduction
    Model-Based and Data-Based Rendering
    Data-Based Rendering
    Model-Based Rendering
    Hybrid Approach
    Wave Field Analysis
    Loudspeaker and Listening Room Compensation
    Listening Room Compensation
    Loudspeaker Compensation
    Description of a Sound Field Transmission System
      6.1 Acquisition of Source Signals
      Sound Stage Reproduction Using Wave Field Synthesis
    Summary

13 Virtual Spatial Sound
  Carlos Avendano
    1. Introduction
      1.1 Scope
    Spatial Hearing
    Interaural Coordinate System
    Interaural Differences
    Spectral Cues
    Distance Cues
    Dynamic Cues
    Acoustics of Spatial Sound
    The HRTF
    Room Acoustics
    Virtual Spatial Sound Systems
    HRTF Measurement
    HRTF Modelling
    Virtual Spatial Sound Rendering
    Conclusions

Index
More informationDesign and Implementation of Small Microphone Arrays
Design and Implementation of Small Microphone Arrays for Acoustic and Speech Signal Processing Jingdong Chen and Jacob Benesty Northwestern Polytechnical University 127 Youyi West Road, Xi an, China jingdongchen@ieee.org
More information1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends
Contents List of Figures List of Tables Contributing Authors xiii xxi xxiii Introduction Karlheinz Brandenburg and Mark Kahrs xxix 1 Audio quality determination based on perceptual measurement techniques
More informationOptimum Array Processing
Optimum Array Processing Part IV of Detection, Estimation, and Modulation Theory Harry L. Van Trees WILEY- INTERSCIENCE A JOHN WILEY & SONS, INC., PUBLICATION Preface xix 1 Introduction 1 1.1 Array Processing
More informationREAL-TIME DIGITAL SIGNAL PROCESSING
REAL-TIME DIGITAL SIGNAL PROCESSING FUNDAMENTALS, IMPLEMENTATIONS AND APPLICATIONS Third Edition Sen M. Kuo Northern Illinois University, USA Bob H. Lee Ittiam Systems, Inc., USA Wenshun Tian Sonus Networks,
More informationAudio-coding standards
Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.
More informationSurrounded by High-Definition Sound
Surrounded by High-Definition Sound Dr. ChingShun Lin CSIE, NCU May 6th, 009 Introduction What is noise? Uncertain filters Introduction (Cont.) How loud is loud? (Audible: 0Hz - 0kHz) Introduction (Cont.)
More informationEE482: Digital Signal Processing Applications
Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 13 Audio Signal Processing 14/04/01 http://www.ee.unlv.edu/~b1morris/ee482/
More informationDigital Image Processing
Digital Image Processing Third Edition Rafael C. Gonzalez University of Tennessee Richard E. Woods MedData Interactive PEARSON Prentice Hall Pearson Education International Contents Preface xv Acknowledgments
More informationAudio-coding standards
Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.
More informationNumerical Robustness. The implementation of adaptive filtering algorithms on a digital computer, which inevitably operates using finite word-lengths,
1. Introduction Adaptive filtering techniques are used in a wide range of applications, including echo cancellation, adaptive equalization, adaptive noise cancellation, and adaptive beamforming. These
More informationAdaptive Filters Algorithms (Part 2)
Adaptive Filters Algorithms (Part 2) Gerhard Schmidt Christian-Albrechts-Universität zu Kiel Faculty of Engineering Electrical Engineering and Information Technology Digital Signal Processing and System
More informationPerceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.
Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general
More informationMultichannel Affine and Fast Affine Projection Algorithms for Active Noise Control and Acoustic Equalization Systems
54 IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 11, NO. 1, JANUARY 2003 Multichannel Affine and Fast Affine Projection Algorithms for Active Noise Control and Acoustic Equalization Systems Martin
More informationBoth LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.
Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general
More informationOptimized Variable Step Size Normalized LMS Adaptive Algorithm for Echo Cancellation
International Journal on Recent and Innovation Trends in Computing and Communication ISSN: 3-869 Optimized Variable Step Size Normalized LMS Adaptive Algorithm for Echo Cancellation Deman Kosale, H.R.
More informationSystem Identification Related Problems at
media Technologies @ Ericsson research (New organization Taking Form) System Identification Related Problems at MT@ER Erlendur Karlsson, PhD 1 Outline Ericsson Publications and Blogs System Identification
More informationMODIFIED IMDCT-DECODER BASED MP3 MULTICHANNEL AUDIO DECODING SYSTEM Shanmuga Raju.S 1, Karthik.R 2, Sai Pradeep.K.P 3, Varadharajan.
MODIFIED IMDCT-DECODER BASED MP3 MULTICHANNEL AUDIO DECODING SYSTEM Shanmuga Raju.S 1, Karthik.R 2, Sai Pradeep.K.P 3, Varadharajan.E 4 Assistant Professor, Dept. of ECE, Dr.NGP Institute of Technology,
More informationSSL for Circular Arrays of Mics
SSL for Circular Arrays of Mics Yong Rui, Dinei Florêncio, Warren Lam, and Jinyan Su Microsoft Research ABSTRACT Circular arrays are of particular interest for a number of scenarios, particularly because
More informationAudio Coding Standards
Audio Standards Kari Pihkala 13.2.2002 Tik-111.590 Multimedia Outline Architectural Overview MPEG-1 MPEG-2 MPEG-4 Philips PASC (DCC cassette) Sony ATRAC (MiniDisc) Dolby AC-3 Conclusions 2 Architectural
More informationIMAGE ANALYSIS, CLASSIFICATION, and CHANGE DETECTION in REMOTE SENSING
SECOND EDITION IMAGE ANALYSIS, CLASSIFICATION, and CHANGE DETECTION in REMOTE SENSING ith Algorithms for ENVI/IDL Morton J. Canty с*' Q\ CRC Press Taylor &. Francis Group Boca Raton London New York CRC
More informationAppendix A Auxiliary MATLAB Functions
Appendix A Auxiliary MATLAB Functions Listing A.1 Function for computing the joint diagonalization. 1 function [X,D]=jeig(A,B,srtstr); L=chol(B,'lower'); 3 G=inv(L); 4 C=G*A*G'; 5 [Q,D]=schur(C); 6 X=G'*Q;
More informationEpipolar Geometry in Stereo, Motion and Object Recognition
Epipolar Geometry in Stereo, Motion and Object Recognition A Unified Approach by GangXu Department of Computer Science, Ritsumeikan University, Kusatsu, Japan and Zhengyou Zhang INRIA Sophia-Antipolis,
More informationAdaptive Filtering using Steepest Descent and LMS Algorithm
IJSTE - International Journal of Science Technology & Engineering Volume 2 Issue 4 October 2015 ISSN (online): 2349-784X Adaptive Filtering using Steepest Descent and LMS Algorithm Akash Sawant Mukesh
More informationDigital Signal Processing with Field Programmable Gate Arrays
Uwe Meyer-Baese Digital Signal Processing with Field Programmable Gate Arrays Third Edition With 359 Figures and 98 Tables Book with CD-ROM ei Springer Contents Preface Preface to Second Edition Preface
More informationAdaptive System Identification and Signal Processing Algorithms
Adaptive System Identification and Signal Processing Algorithms edited by N. Kalouptsidis University of Athens S. Theodoridis University of Patras Prentice Hall New York London Toronto Sydney Tokyo Singapore
More informationChapter 5.5 Audio Programming
Chapter 5.5 Audio Programming Audio Programming Audio in games is more important than ever before 2 Programming Basic Audio Most gaming hardware has similar capabilities (on similar platforms) Mostly programming
More informationEfficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio
Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio Dr. Jürgen Herre 11/07 Page 1 Jürgen Herre für (IIS) Erlangen, Germany Introduction: Sound Images? Humans
More informationOptical Storage Technology. MPEG Data Compression
Optical Storage Technology MPEG Data Compression MPEG-1 1 Audio Standard Moving Pictures Expert Group (MPEG) was formed in 1988 to devise compression techniques for audio and video. It first devised the
More informationSystem Identification Related Problems at SMN
Ericsson research SeRvices, MulTimedia and Network Features System Identification Related Problems at SMN Erlendur Karlsson SysId Related Problems @ ER/SMN Ericsson External 2016-05-09 Page 1 Outline Research
More information3.5 Filtering with the 2D Fourier Transform Basic Low Pass and High Pass Filtering using 2D DFT Other Low Pass Filters
Contents Part I Decomposition and Recovery. Images 1 Filter Banks... 3 1.1 Introduction... 3 1.2 Filter Banks and Multirate Systems... 4 1.2.1 Discrete Fourier Transforms... 5 1.2.2 Modulated Filter Banks...
More informationTelecommunications Engineering Course Descriptions
Telecommunications Engineering Course Descriptions Electrical Engineering Courses EE 5305 Radio Frequency Engineering (3 semester hours) Introduction to generation, transmission, and radiation of electromagnetic
More informationSystem Identification
System Identification D R. T A R E K A. T U T U N J I A D V A N C E D M O D E L I N G A N D S I M U L A T I O N M E C H A T R O N I C S E N G I N E E R I N G D E P A R T M E N T P H I L A D E L P H I A
More informationIntroducing Audio Signal Processing & Audio Coding. Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd
Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories,
More informationMetrics for performance assessment of mixed-order Ambisonics spherical microphone arrays
Downloaded from orbit.dtu.dk on: Oct 6, 28 Metrics for performance assessment of mixed-order Ambisonics spherical microphone arrays Favrot, Sylvain Emmanuel; Marschall, Marton Published in: Proceedings
More informationSpherical Microphone Arrays
Spherical Microphone Arrays Acoustic Wave Equation Helmholtz Equation Assuming the solutions of wave equation are time harmonic waves of frequency ω satisfies the homogeneous Helmholtz equation: Boundary
More informationSystem Identification Related Problems at SMN
Ericsson research SeRvices, MulTimedia and Networks System Identification Related Problems at SMN Erlendur Karlsson SysId Related Problems @ ER/SMN Ericsson External 2015-04-28 Page 1 Outline Research
More informationApplication of Linux Audio in Hearing Aid Research
Application of Linux Audio in Hearing Aid Research Giso Grimm 1 Tobias Herzke 2 Volker Hohmann 2 1 Universität Oldenburg, Oldenburg, Germany 2 HörTech ggmbh, Oldenburg, Germany Linux Audio Conference,
More informationANALYSIS OF GEOPHYSICAL POTENTIAL FIELDS A Digital Signal Processing Approach
ADVANCES IN EXPLORATION GEOPHYSICS 5 ANALYSIS OF GEOPHYSICAL POTENTIAL FIELDS A Digital Signal Processing Approach PRABHAKAR S. NAIDU Indian Institute of Science, Bangalore 560012, India AND M.P. MATHEW
More informationPassive Differential Matched-field Depth Estimation of Moving Acoustic Sources
Lincoln Laboratory ASAP-2001 Workshop Passive Differential Matched-field Depth Estimation of Moving Acoustic Sources Shawn Kraut and Jeffrey Krolik Duke University Department of Electrical and Computer
More informationA Wavelet Tour of Signal Processing The Sparse Way
A Wavelet Tour of Signal Processing The Sparse Way Stephane Mallat with contributions from Gabriel Peyre AMSTERDAM BOSTON HEIDELBERG LONDON NEWYORK OXFORD PARIS SAN DIEGO SAN FRANCISCO SINGAPORE SYDNEY»TOKYO
More informationIntroducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd
Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd Overview Audio Signal Processing Applications @ Dolby Audio Signal Processing Basics
More informationInverse Structure for Active Noise Control and Combined Active Noise Control/Sound Reproduction Systems
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 9, NO. 2, FEBRUARY 2001 141 Inverse Structure for Active Noise Control and Combined Active Noise Control/Sound Reproduction Systems Martin Bouchard,
More informationA NEW DCT-BASED WATERMARKING METHOD FOR COPYRIGHT PROTECTION OF DIGITAL AUDIO
International journal of computer science & information Technology (IJCSIT) Vol., No.5, October A NEW DCT-BASED WATERMARKING METHOD FOR COPYRIGHT PROTECTION OF DIGITAL AUDIO Pranab Kumar Dhar *, Mohammad
More informationFundamentals of Digital Image Processing
\L\.6 Gw.i Fundamentals of Digital Image Processing A Practical Approach with Examples in Matlab Chris Solomon School of Physical Sciences, University of Kent, Canterbury, UK Toby Breckon School of Engineering,
More informationAPPLYING EXTRAPOLATION AND INTERPOLATION METHODS TO MEASURED AND SIMULATED HRTF DATA USING SPHERICAL HARMONIC DECOMPOSITION.
APPLYING EXTRAPOLATION AND INTERPOLATION METHODS TO MEASURED AND SIMULATED HRTF DATA USING SPHERICAL HARMONIC DECOMPOSITION Martin Pollow Institute of Technical Acoustics RWTH Aachen University Neustraße
More informationMpeg 1 layer 3 (mp3) general overview
Mpeg 1 layer 3 (mp3) general overview 1 Digital Audio! CD Audio:! 16 bit encoding! 2 Channels (Stereo)! 44.1 khz sampling rate 2 * 44.1 khz * 16 bits = 1.41 Mb/s + Overhead (synchronization, error correction,
More informationTHE PERFORMANCE of automatic speech recognition
IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 14, NO. 6, NOVEMBER 2006 2109 Subband Likelihood-Maximizing Beamforming for Speech Recognition in Reverberant Environments Michael L. Seltzer,
More informationReal-Time Semi-Blind Speech Extraction with Speaker Direction Tracking on Kinect
Real-Time Semi-Blind Speech Extraction with Speaker Direction Tracking on Kinect Yuji Onuma, Noriyoshi Kamado, Hiroshi Saruwatari, Kiyohiro Shikano Nara Institute of Science and Technology, Graduate School
More informationOn the Minimum l p. 193 On the Strong Uniqueness of Highly Sparse Representations from Redundant Dictionaries
Theory and Fundamentals A FastICA Algorithm for Non-negative Independent Component Analysis p. 1 Blind Source Separation by Adaptive Estimation of Score Function Difference p. 9 Exploiting Spatiotemporal
More informationParametric Coding of High-Quality Audio
Parametric Coding of High-Quality Audio Prof. Dr. Gerald Schuller Fraunhofer IDMT & Ilmenau Technical University Ilmenau, Germany 1 Waveform vs Parametric Waveform Filter-bank approach Mainly exploits
More informationImage Analysis, Classification and Change Detection in Remote Sensing
Image Analysis, Classification and Change Detection in Remote Sensing WITH ALGORITHMS FOR ENVI/IDL Morton J. Canty Taylor &. Francis Taylor & Francis Group Boca Raton London New York CRC is an imprint
More informationHardware Implementation for the Echo Canceller System based Subband Technique using TMS320C6713 DSP Kit
Hardware Implementation for the Echo Canceller System based Subband Technique using TMS3C6713 DSP Kit Mahmod. A. Al Zubaidy Ninevah University Mosul, Iraq Sura Z. Thanoon (MSE student) School of Electronics
More informationImage denoising in the wavelet domain using Improved Neigh-shrink
Image denoising in the wavelet domain using Improved Neigh-shrink Rahim Kamran 1, Mehdi Nasri, Hossein Nezamabadi-pour 3, Saeid Saryazdi 4 1 Rahimkamran008@gmail.com nasri_me@yahoo.com 3 nezam@uk.ac.ir
More informationAFMG. EASE Seminar September 17 th to 21 st 2018, Berlin, Germany. Agenda. Software-Engineering Research Development
EASE Seminar September 17 th to 21 st 2018, Berlin, Instructors: Emad Yacoub Hanna Language: English Hours: 09:00-17:00 (please be there at 08:45) EASE Seminars are split into two levels with Level 1 (entry
More informationRoom Acoustics. CMSC 828D / Spring 2006 Lecture 20
Room Acoustics CMSC 828D / Spring 2006 Lecture 20 Lecture Plan Room acoustics basics Structure of room impulse response Characterization of room acoustics Modeling of reverberant response Basics All our
More informationBINAURAL SOUND LOCALIZATION FOR UNTRAINED DIRECTIONS BASED ON A GAUSSIAN MIXTURE MODEL
BINAURAL SOUND LOCALIZATION FOR UNTRAINED DIRECTIONS BASED ON A GAUSSIAN MIXTURE MODEL Takanori Nishino and Kazuya Takeda Center for Information Media Studies, Nagoya University Furo-cho, Chikusa-ku, Nagoya,
More informationDr Andrew Abel University of Stirling, Scotland
Dr Andrew Abel University of Stirling, Scotland University of Stirling - Scotland Cognitive Signal Image and Control Processing Research (COSIPRA) Cognitive Computation neurobiology, cognitive psychology
More informationINTRODUCTION. Model: Deconvolve a 2-D field of random numbers with a simple dip filter, leading to a plane-wave model.
Stanford Exploration Project, Report 105, September 5, 2000, pages 109 123 Short Note Test case for PEF estimation with sparse data II Morgan Brown, Jon Claerbout, and Sergey Fomel 1 INTRODUCTION The two-stage
More informationCLASSIFICATION AND CHANGE DETECTION
IMAGE ANALYSIS, CLASSIFICATION AND CHANGE DETECTION IN REMOTE SENSING With Algorithms for ENVI/IDL and Python THIRD EDITION Morton J. Canty CRC Press Taylor & Francis Group Boca Raton London NewYork CRC
More informationParametric Coding of Spatial Audio
Parametric Coding of Spatial Audio Ph.D. Thesis Christof Faller, September 24, 2004 Thesis advisor: Prof. Martin Vetterli Audiovisual Communications Laboratory, EPFL Lausanne Parametric Coding of Spatial
More informationA GEOMETRICAL APPROACH TO ROOM COMPENSATION FOR SOUND FIELD RENDERING APPLICATIONS
A GEOMETRICAL APPROACH TO ROOM COMPENSATION FOR SOUND FIELD RENDERING APPLICATIONS A. Canclini, D. Marković, L. Bianchi, F. Antonacci, A. Sarti, S. Tubaro Dipartimento di Elettronica, Informazione e Bioingegneria
More informationModeling of Pinna Related Transfer Functions (PRTF) using the Finite Element Method (FEM)
Modeling of Pinna Related Transfer Functions (PRTF) using the Finite Element Method (FEM) Manan Joshi Navarun Gupta, Ph. D. Lawrence Hmurcik, Ph. D. University of Bridgeport, Bridgeport, CT Objective Measure
More informationMultichannel Recursive-Least-Squares Algorithms and Fast-Transversal-Filter Algorithms for Active Noise Control and Sound Reproduction Systems
606 IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL 8, NO 5, SEPTEMBER 2000 Multichannel Recursive-Least-Squares Algorithms and Fast-Transversal-Filter Algorithms for Active Noise Control and Sound
More informationEASE Seminar Entry Level & Advanced Level
EASE Seminar Entry Level & Advanced Level This is a general overview of our regular EASE Trainings. Please be aware that this document contains information on both levels we offer. Make sure which one
More informationMEDICAL IMAGE ANALYSIS
SECOND EDITION MEDICAL IMAGE ANALYSIS ATAM P. DHAWAN g, A B IEEE Engineering in Medicine and Biology Society, Sponsor IEEE Press Series in Biomedical Engineering Metin Akay, Series Editor +IEEE IEEE PRESS
More informationAll MSEE students are required to take the following two core courses: Linear systems Probability and Random Processes
MSEE Curriculum All MSEE students are required to take the following two core courses: 3531-571 Linear systems 3531-507 Probability and Random Processes The course requirements for students majoring in
More informationAppendix 4. Audio coding algorithms
Appendix 4. Audio coding algorithms 1 Introduction The main application of audio compression systems is to obtain compact digital representations of high-quality (CD-quality) wideband audio signals. Typically
More informationOutline 7/2/201011/6/
Outline Pattern recognition in computer vision Background on the development of SIFT SIFT algorithm and some of its variations Computational considerations (SURF) Potential improvement Summary 01 2 Pattern
More informationContents. 3 Vector Quantization The VQ Advantage Formulation Optimality Conditions... 48
Contents Part I Prelude 1 Introduction... 3 1.1 Audio Coding... 4 1.2 Basic Idea... 6 1.3 Perceptual Irrelevance... 8 1.4 Statistical Redundancy... 9 1.5 Data Modeling... 9 1.6 Resolution Challenge...
More informationNOVEL TECHNIQUES AND ARCHITECTURES FOR ADAPTIVE BEAMFORMING
NOVEL TECHNIQUES AND ARCHITECTURES FOR ADAPTIVE BEAMFORMING By THUA VAN HO, B.A.Sc, M.A.Sc A Thesis Submitted to the School of Graduate Studies in Partial Fulfillment of the Requirements for the Degree
More informationStandard Codecs. Image compression to advanced video coding. Mohammed Ghanbari. 3rd Edition. The Institution of Engineering and Technology
Standard Codecs Image compression to advanced video coding 3rd Edition Mohammed Ghanbari The Institution of Engineering and Technology Contents Preface to first edition Preface to second edition Preface
More informationContents. I Basics 1. Copyright by SIAM. Unauthorized reproduction of this article is prohibited.
page v Preface xiii I Basics 1 1 Optimization Models 3 1.1 Introduction... 3 1.2 Optimization: An Informal Introduction... 4 1.3 Linear Equations... 7 1.4 Linear Optimization... 10 Exercises... 12 1.5
More informationHorizontal plane HRTF reproduction using continuous Fourier-Bessel functions
Horizontal plane HRTF reproduction using continuous Fourier-Bessel functions Wen Zhang,2, Thushara D. Abhayapala,2, Rodney A. Kennedy Department of Information Engineering, Research School of Information
More informationDistributed Signal Processing for Binaural Hearing Aids
Distributed Signal Processing for Binaural Hearing Aids Olivier Roy LCAV - I&C - EPFL Joint work with Martin Vetterli July 24, 2008 Outline 1 Motivations 2 Information-theoretic Analysis 3 Example: Distributed
More informationDigital Sound Ming C. Lin & Zhimin Ren
Digital Sound Ming C. Lin & Zhimin Ren Department of Computer Science University of North Carolina http://gamma.cs.unc.edu/sound How can it be done? Foley artists manually make and record the sound from
More informationCOMPUTER AND ROBOT VISION
VOLUME COMPUTER AND ROBOT VISION Robert M. Haralick University of Washington Linda G. Shapiro University of Washington T V ADDISON-WESLEY PUBLISHING COMPANY Reading, Massachusetts Menlo Park, California
More informationRobust Adaptive CRLS-GSC Algorithm for DOA Mismatch in Microphone Array
Robust Adaptive CRLS-GSC Algorithm for DOA Mismatch in Microphone Array P. Mowlaee Begzade Mahale Department of Electrical Engineering Amirkabir University of Technology Tehran, Iran 15875-4413 P Mowlaee@ieee.org,
More informationReverberation design based on acoustic parameters for reflective audio-spot system with parametric and dynamic loudspeaker
PROCEEDINGS of the 22 nd International Congress on Acoustics Signal Processing Acoustics: Paper ICA 2016-310 Reverberation design based on acoustic parameters for reflective audio-spot system with parametric
More informationDSP-CIS. Part-IV : Filter Banks & Subband Systems. Chapter-10 : Filter Bank Preliminaries. Marc Moonen
DSP-CIS Part-IV Filter Banks & Subband Systems Chapter-0 Filter Bank Preliminaries Marc Moonen Dept. E.E./ESAT-STADIUS, KU Leuven marc.moonen@esat.kuleuven.be www.esat.kuleuven.be/stadius/ Part-III Filter
More informationCompressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved.
Compressed Audio Demystified Why Music Producers Need to Care About Compressed Audio Files Download Sales Up CD Sales Down High-Definition hasn t caught on yet Consumers don t seem to care about high fidelity
More informationImage Transformation Techniques Dr. Rajeev Srivastava Dept. of Computer Engineering, ITBHU, Varanasi
Image Transformation Techniques Dr. Rajeev Srivastava Dept. of Computer Engineering, ITBHU, Varanasi 1. Introduction The choice of a particular transform in a given application depends on the amount of
More informationNew Results in Low Bit Rate Speech Coding and Bandwidth Extension
Audio Engineering Society Convention Paper Presented at the 121st Convention 2006 October 5 8 San Francisco, CA, USA This convention paper has been reproduced from the author's advance manuscript, without
More informationSqueeze Play: The State of Ady0 Cmprshn. Scott Selfon Senior Development Lead Xbox Advanced Technology Group Microsoft
Squeeze Play: The State of Ady0 Cmprshn Scott Selfon Senior Development Lead Xbox Advanced Technology Group Microsoft Agenda Why compress? The tools at present Measuring success A glimpse of the future
More informationContents. I The Basic Framework for Stationary Problems 1
page v Preface xiii I The Basic Framework for Stationary Problems 1 1 Some model PDEs 3 1.1 Laplace s equation; elliptic BVPs... 3 1.1.1 Physical experiments modeled by Laplace s equation... 5 1.2 Other
More informationImage Processing, Analysis and Machine Vision
Image Processing, Analysis and Machine Vision Milan Sonka PhD University of Iowa Iowa City, USA Vaclav Hlavac PhD Czech Technical University Prague, Czech Republic and Roger Boyle DPhil, MBCS, CEng University
KINGS COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY ACADEMIC YEAR / ODD SEMESTER QUESTION BANK
KINGS COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY ACADEMIC YEAR 2011-2012 / ODD SEMESTER QUESTION BANK SUB. CODE / NAME: IT1301 INFORMATION CODING TECHNIQUES YEAR / SEM: III / V UNIT -
5: Music Compression. Music Coding. Mark Handley
5: Music Compression Mark Handley Music Coding LPC-based codecs model the sound source to achieve good compression. Works well for voice. Terrible for music. What if you can't model the source? Model the
Modelling, Auralization and Acoustic Virtual Reality ERIK MOLIN
Modelling, Auralization and Acoustic Virtual Reality ERIK MOLIN Overview Auralization Overview & motivation Audio sources Room models Receiver modelling Auralization what and why? For a given space, sound
SYDE 575: Introduction to Image Processing
SYDE 575: Introduction to Image Processing Image Enhancement and Restoration in Spatial Domain Chapter 3 Spatial Filtering Recall 2D discrete convolution: g[m, n] = f[m, n] * h[m, n] = Σ_i Σ_j f[i, j] h[m - i,
Interference Reduction in Reverberant Speech Separation With Visual Voice Activity Detection
1610 IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 16, NO. 6, OCTOBER 2014 Interference Reduction in Reverberant Speech Separation With Visual Voice Activity Detection Qingju Liu, Andrew J. Aubrey, Member, IEEE,
Evaluation of a new Ambisonic decoder for irregular loudspeaker arrays using interaural cues
3rd International Symposium on Ambisonics & Spherical Acoustics@Lexington, Kentucky, USA, 2nd June 2011 Evaluation of a new Ambisonic decoder for irregular loudspeaker arrays using interaural cues J. Treviño
Collaborative Sparsity and Compressive MRI
Modeling and Computation Seminar February 14, 2013 Table of Contents 1 T2 Estimation 2 Undersampling in MRI 3 Compressed Sensing 4 Model-Based Approach 5 From L1 to L0 6 Spatially Adaptive Sparsity MRI
Dietrich Paulus Joachim Hornegger. Pattern Recognition of Images and Speech in C++
Dietrich Paulus Joachim Hornegger Pattern Recognition of Images and Speech in C++ To Dorothea, Belinda, and Dominik In the text we use the following names which are protected, trademarks owned by a company
Image Denoising Based on Hybrid Fourier and Neighborhood Wavelet Coefficients Jun Cheng, Songli Lei
Image Denoising Based on Hybrid Fourier and Neighborhood Wavelet Coefficients Jun Cheng, Songli Lei College of Physical and Information Science, Hunan Normal University, Changsha, China Hunan Art Professional
Hybrid Speech Synthesis
Hybrid Speech Synthesis Simon King Centre for Speech Technology Research University of Edinburgh 2 What are you going to learn? Another recap of unit selection let's properly understand the Acoustic Space
Introduction to HRTFs
Introduction to HRTFs http://www.umiacs.umd.edu/users/ramani ramani@umiacs.umd.edu How do we perceive sound location? Initial idea: Measure attributes of received sound at the two ears Compare sound received
Spectral modeling of musical sounds
Spectral modeling of musical sounds Xavier Serra Audiovisual Institute, Pompeu Fabra University http://www.iua.upf.es xserra@iua.upf.es 1. Introduction Spectral based analysis/synthesis techniques offer
LARGE SCALE LINEAR AND INTEGER OPTIMIZATION: A UNIFIED APPROACH
LARGE SCALE LINEAR AND INTEGER OPTIMIZATION: A UNIFIED APPROACH Richard Kipp Martin Graduate School of Business University of Chicago Kluwer Academic Publishers Boston/Dordrecht/London CONTENTS Preface
Audio Coding and MP3
Audio Coding and MP3 contributions by: Torbjørn Ekman What is Sound? Sound waves: 20Hz - 20kHz Speed: 331.3 m/s (air) Wavelength: 16.5 m - 1.65 cm Analogue audio frequencies: 20Hz - 20kHz mono: x(t)