Distributed Signal Processing for Binaural Hearing Aids

Similar documents
Parametric Coding of Spatial Audio

The MPEG-4 General Audio Coder

Speech and audio coding

Audio-coding standards

ELL 788 Computational Perception & Cognition July November 2015

Audio-coding standards

QUANTIZER DESIGN FOR EXPLOITING COMMON INFORMATION IN LAYERED CODING. Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose

HEARING SOLUTIONS CATALOG SUPPLEMENT

5: Music Compression. Music Coding. Mark Handley

Quality versus Intelligibility: Evaluating the Coding Trade-offs for American Sign Language Video

Image Denoising Based on Hybrid Fourier and Neighborhood Wavelet Coefficients Jun Cheng, Songli Lei

System Identification Related Problems at SMN

System Identification Related Problems at

Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio

Using Noise Substitution for Backwards-Compatible Audio Codec Improvement

Parametric Coding of High-Quality Audio

Embedded Audio & Robotic Ear

Mpeg 1 layer 3 (mp3) general overview

Product Information SAPHIRA 3

1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends

Application of Linux Audio in Hearing Aid Research

Product Information JUNA 9 7

Sensor Tasking and Control

MPEG-4 General Audio Coding

Appendix A Auxiliary MATLAB Functions

The Sensitivity Matrix

DRA AUDIO CODING STANDARD

System Identification Related Problems at SMN

Novel Lossy Compression Algorithms with Stacked Autoencoders

Product Information NANO

EE482: Digital Signal Processing Applications

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding

Coding for the Network: Scalable and Multiple description coding Marco Cagnazzo

ROW.mp3. Colin Raffel, Jieun Oh, Isaac Wang Music 422 Final Project 3/12/2010

Appendix 4. Audio coding algorithms

BETTER HEARING. making MORE PERSONAL THAN EVER

Distributed Video Coding

Measurements and Bits: Compressed Sensing meets Information Theory. Dror Baron ECE Department Rice University dsp.rice.edu/cs

Xiaoqing Zhu, Sangeun Han and Bernd Girod Information Systems Laboratory Department of Electrical Engineering Stanford University

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.

Image/video compression: howto? Aline ROUMY INRIA Rennes

Proceedings of Meetings on Acoustics

Digital Image Processing

Spatial Correlation-based Collaborative Medium Access Control in Wireless Sensor Networks

Specification Guide. Cheer

Audio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011

SAOC and USAC. Spatial Audio Object Coding / Unified Speech and Audio Coding. Lecture Audio Coding WS 2013/14. Dr.-Ing.

New Results in Low Bit Rate Speech Coding and Bandwidth Extension

Product Information PICO RITE

DSP. Presented to the IEEE Central Texas Consultants Network by Sergio Liberman

AUDIO SIGNAL PROCESSING FOR NEXT- GENERATION MULTIMEDIA COMMUNI CATION SYSTEMS

Lecture 16 Perceptual Audio Coding

Nahimic Troubleshooting Instructions and Q&A The document applies to all MSI Notebook and Vortex product which supports Nahimic.

Lecture Information. Mod 01 Part 1: The Need for Compression. Why Digital Signal Coding? (1)

VIDEO COMPRESSION STANDARDS

Lecture Information Multimedia Video Coding & Architectures

Real-Time Extraction of Video Objects: Algorithms and applications

2014 Summer School on MPEG/VCEG Video. Video Coding Concept

Fundamentals of Perceptual Audio Encoding. Craig Lewiston HST.723 Lab II 3/23/06

JND-based spatial parameter quantization of multichannel audio signals

14th European Signal Processing Conference (EUSIPCO 2006), Florence, Italy, September 4-8, 2006, copyright by EURASIP

ELEC Dr Reji Mathew Electrical Engineering UNSW

*Faculty of Electrical Engineering, University of Belgrade, Serbia and Montenegro ** Technical University of Lodz, Institute of Electronics, Poland

MULTIMODE TREE CODING OF SPEECH WITH PERCEPTUAL PRE-WEIGHTING AND POST-WEIGHTING

Specification Guide. Celebrate

Before starting the troubleshooting, make sure you have installed the latest version of audio driver and Nahimic on your notebook.

Adaptive Quantization for Video Compression in Frequency Domain

Rich Recording Technology Technical overall description

An Ultra Low-Power WOLA Filterbank Implementation in Deep Submicron Technology

Product Information Pico RITE

Computing over Multiple-Access Channels with Connections to Wireless Network Coding

Audio Compression. Audio Compression. Absolute Threshold. CD quality audio:

SupraPlus Wireless Professional Headset System. Superior comfort, sound quality, and mobility for all-day office use

2.4 Audio Compression

Differential Compression and Optimal Caching Methods for Content-Based Image Search Systems

Compression of RADARSAT Data with Block Adaptive Wavelets Abstract: 1. Introduction

Perceptual Pre-weighting and Post-inverse weighting for Speech Coding

The BroadVoice Speech Coding Algorithm. Juin-Hwey (Raymond) Chen, Ph.D. Senior Technical Director Broadcom Corporation March 22, 2010

S P R I N G HEARING SOLUTIONS

Zhitao Lu and William A. Pearlman. Rensselaer Polytechnic Institute. Abstract

ROBUST SPEECH CODING WITH EVS Anssi Rämö, Adriana Vasilache and Henri Toukomaa Nokia Techonologies, Tampere, Finland

A Flexible IIR Filtering Implementation for Audio Processing Juergen Schmidt, Technicolor R&I, Hannover

Video-Aware Wireless Networks (VAWN) Final Meeting January 23, 2014

Presents 2006 IMTC Forum ITU-T T Workshop

Chapter 5.5 Audio Programming

Coordinated Multi-Point in Mobile Communications

Image and Video Coding I: Fundamentals

Analysis of Functional MRI Timeseries Data Using Signal Processing Techniques

NEVARA 1. Product Information

CHAPTER 3. Preprocessing and Feature Extraction. Techniques

Energy Management Issue in Ad Hoc Networks

Chapter 2 Studies and Implementation of Subband Coder and Decoder of Speech Signal Using Rayleigh Distribution

Overview: motion-compensated coding

Image Quality Assessment Techniques: An Overview

Data Compression. Audio compression

Right...from the start

Energy Management Issue in Ad Hoc Networks

ISO/IEC Information technology High efficiency coding and media delivery in heterogeneous environments. Part 3: 3D audio

Perspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony

Transcription:

Distributed Signal Processing for Binaural Hearing Aids Olivier Roy LCAV - I&C - EPFL Joint work with Martin Vetterli July 24, 2008

Outline 1 Motivations 2 Information-theoretic Analysis 3 Example: Distributed Coding of Binaural Cues 4 Conclusions

Motivations (1/5) Hearing aids through the ages Source: http://www.hearingaidmuseum.com

Motivations (2/5) Deafness in disguise Source: http://beckerexhibits.wustl.edu/did

Motivations (3/5) State-of-the-art technology Types: BTE, ITE, ITC, CIC Analog vs. digital 2-3 (omni)directional microphones, 1 loudspeaker

Motivations (4/5) Ultimate goal: improve speech intelligibility Spectral shaping Beamforming Assistive listening devices (a) (b) Figure: Assistive listening devices. (a) Remote microphone. (b) Binaural hearing aids.

Motivations (5/5) Wireless collaboration Analog vs. digital Transmission method (e.g. Bluetooth) Limited communication bitrate: coding issues Gain Rate Trade-off

Information-theoretic Analysis (1/9) Recording setup N S α N α S d N d S X 1,1 X 1,2 X 2,1 X 2,2 Recorded signals (m = 1, 2) X 1,m [n] = X s 1,m [n] + Xn 1,m [n], X 2,m [n] = X s 2,m [n] + Xn 2,m [n].

Information-theoretic Analysis (2/9) Wireless collaboration X 1,1 X 1,2 X 2,1 X 2,2 Enc 1 Enc 2 Dec 1 R 1 R 2 Dec 2 Ŝ 1 Ŝ 2 Distortion criterion d(s,ŝ) (e.g. MSE, perceptual, etc.)

Information-theoretic Analysis (3/9) Source coding in a nutshell R S Enc Dec Ŝ Given: a source (signal) S and a distortion criterion d(s,ŝ) Question: for a given rate R, what is the minimum achievable distortion? Answer: the rate distortion function Assumption: unbounded coding delay and complexity

Information-theoretic Analysis (4/9) Example: the Gaussian case We observe X 1,X 2,... where X k N(0,σ 2 ) i.i.d. Rate distortion function given by D(R) = σ 2 2 2R (MSE/sample) 1 0.9 D (MSE/sample) 0.8 0.7 0.6 0.5 0.4 0.3 0.2 σ 2 = 1 0.1 0 0 1 2 3 4 5 R (bits/sample) simple 1-bit quantization 0.36σ 2, optimal = 0.25σ 2

Information-theoretic Analysis (5/9) Variations on a theme Remote source coding R S X Enc Dec Ŝ

Information-theoretic Analysis (5/9) Variations on a theme Source coding with side information at the decoder R S Enc Dec Ŝ Y

Information-theoretic Analysis (5/9) Variations on a theme Remote source coding with side information at the decoder R S X Enc Dec Ŝ Y

Information-theoretic Analysis (5/9) Variations on a theme Remote source coding with side information at the decoder SIA: R S X Enc Dec Ŝ R SIU: S X Enc Dec Dec Y Ŝ Y

Information-theoretic Analysis (6/9) What about collaborating hearing aids? Monaural perspective X 1,1 X 1,2 X 2,1 X 2,2 Enc 2 Dec 1 R 1 Ŝ 1

Information-theoretic Analysis (6/9) What about collaborating hearing aids? Monaural perspective X 1,1 X 1,2 X 2,1 X 2,2 Enc 1 R 2 Dec 2 Ŝ 2

Information-theoretic Analysis (6/9) What about collaborating hearing aids? Binaural perspective X 1,1 X 1,2 X 2,1 X 2,2 Enc 1 Enc 2 Dec 1 R 1 R 2 Dec 2 Ŝ 1 Ŝ 2

Information-theoretic Analysis (7/9) Results: Mean-square optimal gain rate trade offs 3 2.5 2 Gain (db) 1.5 1 0.5 0 0 1 2 3 4 5 6 7 8 Rate (bits/sample) x 10 3 Figure: Examples of gain rate trade offs (SIA vs. SIU)

Information-theoretic Analysis (8/9) Mean-square optimal rate allocation 35 30 25 Rate allocation (%) 20 15 10 5 0 1 2 3 4 5 6 7 8 Rate (bits/sample) Figure: Examples of rate allocations (SIA vs. SIU)

Information-theoretic Analysis (9/9) Usefulness of information-theoretic analysis Provides upper bounds to gains achieved by practical systems Suggests optimal coding architectures Multichannel Wiener filtering Scalar distributed source coding Correlation induced by recording setup can be used A priori vs. learned

Example: Distributed Coding of Binaural Cues (1/2) Binaural Cues Scene analysis Classification Source localization Voice activity detection Time-frequency representation, one value per critical band B l 1 0.8 0.6 0.4 0.2 0 0 2 4 6 8 10 12 14 16 18 f (khz)

Example: Distributed Coding of Binaural Cues (2/2) Inter channel level difference (ICLD) where p m [l] = 10log 10 p[l] = p 1 [l] p 2 [l], 1 B l X m [k] 2 for m = 1,2. k B l 15 30 20 10 f (khz) 10 5 0 10 20 30 40 50 60 0 0 2 4 6 8 t (s) 70

Example: Distributed Coding of Binaural Cues (2/2) Centralized coding p[l] [ p min [l], p max [l] ] = scalar quantizer with range p max [l] p min [l]

Example: Distributed Coding of Binaural Cues (2/2) Distributed coding Scalar quantization of p 1 [l] and p 2 [l] i 1 [l] i 2 [l] { i min [l],..., i max [l] } { } pmin [l] pmax [l] =,..., s s Modulo coding approach = index reuse 1 2 1 2 1 2 1 2 1 2 1 2 1 2 1 2 1 2 1 2 1 2 p min p max 1 2 3 4 1 2 3 4 1 2 3 4 1 2 3 4 1 2 3 4 1 2 p min p max 1 2 3 4 5 6 1 2 3 4 5 6 1 2 3 4 5 6 1 2 3 4 p min s p max p (l = 1) p (l = 5) p (l = 10)

Example: Distributed Coding of Binaural Cues (2/2) Centralized vs. distributed coding Same coding efficiency Distributed scheme takes head shadowing into account (i.e., a priori correlation) Assumption must be verified!! Application: distributed spatial audio coding original & reconstruction (KEMAR) original & reconstruction (BRIR, T 60 600 ms)

Conclusions Binaural noise reduction as a distributed source coding problem Information-theoretic analysis Distributed coding of binaural cues Take home message: correlation/structure that is known a priori is most relevant for distributed source coding

Conclusions Binaural noise reduction as a distributed source coding problem Information-theoretic analysis Distributed coding of binaural cues Take home message: correlation/structure that is known a priori is most relevant for distributed source coding Thanks for your attention!!