Conference IAFPA 2007 Plymouth, UK

Size: px
Start display at page:

Download "Conference IAFPA 2007 Plymouth, UK"

Transcription

1 Conference IAFPA 2007 Plymouth, UK Increasing Speech Intelligibility by DeNoising: What can be achieved? Jan Rademacher, Jörg Bitzer, Jörg Houpert University of Oldenburg, Medical Physics, Oldenburg, Germany University of Applied Science Oldenburg / Ostfriesland / Wilhelmshaven, Institute for Hearing Technology and Audiology, Oldenburg Germany Cube-Tec International, Bremen, Germany Slide No / 3

2 History of Cube-Tec International 1990 Founding of the engineering offices Houpert Digital Audio (HDA) in the Technology Center University Park Bremen, Germany Founding of Spectral Design (together with Steinberg Media Technologies) 2001 Founding of our international Distribution Company, in South Germany Cube Technologies GmbH Founding of the Cube-Tec Development GmbH 2005 March - Founding of the new Cube-Tec International GmbH Merger of all four companies, and transfer of assets to Cube-Tec International headquarters in Bremen. Cube-Tec Office at the Technology Park Bremen, Germany Slide No / 4

3 few words on Cube-Tec International Since 1990 we are providing solutions based on digital audio signal processing and psychoacoustics. Current developments are focused on technologies for analyzing and processing huge amounts of digital audio & video files. We are using computer rendering farms for finding relevant cues, matching algorithms, looking for similarities in media files Some of the topics mass-processing of sound and video files automatic metadata extraction metadata driven processing of media files workflow automation Slide No / 5

4 Slide No / 6 Cube-Workflow

5 Slide No / 7 What is DOBBIN?

6 FOENICS Slide No / 8

7 FOENICS real time restoration tool set for Speech Enhancement Forensic Enhancement of Intelligibility by Computer System Slide No / 9

8 back to the joint research project and the question Increasing Speech Intelligibility by DeNoising: What can be achieved? Slide No / 10

9 Motivation / Introduction It is known that state-of-the-art denoising cannot increase speech intelligibility, if the noise source is a speech-like babble noise (Bitzer et al. 2005). The Questions for the following basic research are? Is this statement also true for noises with other spectral shapes? Is it possible to predict the enhancement, if the spectral shape of the disturbance is known. Slide No / 11

10 A short review on the basics of denoising algorithms A distorted speech signal x(t) is usually modeled as summation of a speech signal s(t) and a distortion n(t) x ( t) = s( t) + n( t) This holds also for the power spectral density (PSD) X(m,l) X ( m, l) = S( m, l) + N( m, l) denoising as spectral substraction S( m, l) = X ( m, l) N( m, l) Be aware: X,S,N are power spectral densities not spectra and in that model the noise has to be uncorrelated with the speech signal Slide No / 12

11 Principle of denoising Estimate the speech signal by multipling the distored signal with a filter function X ( m, l) H ( m, l) = Sˆ( m, l) Optimal approach in the mean square sense : Wiener Filter [Bol79] Problem: successful estimation of H(m,l) depends on: Signal to Noise Ratio (SNR) Spectral shape of noise and speech Question: Is it possible to assess the achievable intelligibility enhancement without performing the denoising and listening tests? Slide No / 13

12 Principle of denoising find a weighting factor H(m,l) for each frequency m at each time instance l fulfilling X ( m, l) H ( m, l) = Sˆ( m, l) Optimal approach in the mean square sense : Wiener Filter [Bol79] Problem: successful estimation of H(m,l) depends on: Signal to Noise Ratio (SNR) Spectral shape of noise and speech Question: Is it possible to assess the achievable intelligibility enhancement without performing the denoising and listening tests? Slide No / 14

13 Prediction of enhancement Proposed solution: A new measure describing the intelligibility enhancement to be expected after denoising by a single value called 'denoising success'. Range: no enhancement: 0 total enhancement: 1 This measure will take into account the: signal to noise ratio (SNR) spectral shape of the speech signal spectral shape of the distortion psychoacoustical properties of the human hearing masking threshold of tonal and non-tonal components [Zwi82, ISO93] band importance function [ANS97] Slide No / 15

14 Calculation of achievable intelligibility enhancement - STEP-1 N(m,l) S(m,l) Estimate HWiener (m,l) N(m,l) HWiener (m,l) Ndenoise(m,l) S(m,l) N(m,l) Input to the model is: N(m,l) - the power spectral density of the noise (Noise PSD) S(m,l) - the power spectral density of the speech (Speech PSD) the long term average spectrum of a single speaker standard LTAS (long term average spectrum) for speech Slide No / 16

15 Calculation of achievable intelligibility enhancement - STEP-1 N(m,l) S(m,l) Estimate HWiener (m,l) HWiener (m,l) N(m,l) Ndenoise(m,l) S(m,l) N(m,l) The noise spectra and the speech spectra together can form the Wiener filter, that is just the power spectra of the speech - divided by the power spectral of the speech and noise. Apply this Wiener filter to the noise signal only denoised version of the original noise (denoised Noise PSD), the original noise (Noise PSD) and the original speech signal (Speech PSD) Slide No / 17

16 Calculation of achievable intelligibility enhancement - STEP-2 Masking thresholds using MPEG1 Signal model [Zwi82, ISO93] MT noise (m,l) for the distortion N(m,l) Ndenoise(m,l) Masking Threshold MTdenoise(m,l) S(m,l) N(m,l) Masking Threshold S(m,l) MTnoise(m,l) Slide No / 18

17 Calculation of achievable intelligibility enhancement - STEP-2 MT noise (m,l) for the distortion N(m,l) MT denoise (m,l) for the remaining distortion N denoise (m,l) after denoising Ndenoise(m,l) Masking Threshold MTdenoise(m,l) S(m,l) N(m,l) Masking Threshold S(m,l) MTnoise(m,l) Slide No / 19

18 Calculation of achievable intelligibility enhancement - STEP-3 MTdenoise (m,l) S(m,l) not masked? BIF S(m,l) MTnoise (m,l) S(m,l) masked? & denoising success Find all frequency indices m M, where a masked speech amplitude S(m,l) is unmasked after denoising. Slide No / 20

19 Calculation of achievable intelligibility enhancement - STEP-3 MTdenoise (m,l) S(m,l) not masked? BIF S(m,l) & denoising success MTnoise (m,l) S(m,l) masked? Find all frequency indices m M, where a masked speech amplitude S(m,l) is unmasked after denoising. The relevance of each frequency band m M with respect to intelligibility is incorporated by the band importance function (BIF). Slide No / 21

20 Calculation of achievable intelligibility enhancement - STEP Band Importance Factor (ANSI) BIF denoising success 0 200Hz 1kHz 10kHz Freq in Hz Find all frequency indices m M, where a masked speech amplitude S(m,l) is unmasked after denoising. The relevance of each frequency band m M with respect to intelligibility is incorporated by the band importance function (BIF). Slide No / 22

21 Calculation of achievable intelligibility enhancement - STEP-3 MTdenoise (m,l) S(m,l) not masked? BIF S(m,l) & denoising success MTnoise (m,l) S(m,l) masked? Find all frequency indices m M, where a masked speech amplitude S(m,l) is unmasked after denoising. The relevance of each frequency band m M with respect to intelligibility is incorporated by the band importance function (BIF). The denoising success is the sum of all BIF values belonging to M Slide No / 23

22 Complete calculation scheme N( m,l) S( m,l) estimate H m,l Wiener( ) HWiener( m,l) Ma sking N( m,l) Ndenoise( m,l) Thre shold MT m,l denoise( ) S( m,l) S( m,l) not masked? & BIF denoising success N( m,l) Ma sking Thre shold MTnoise( m,l) S( m,l) masked? How does this calculation scheme perform? Slide No / 24

23 Four different stationary noise sources Test / Noise Description OLSA speech like noise (perfect masker) White white Gaussian noise Peak peak noise with flat spectrum between 400Hz and 2400Hz Notch white noise with noise gap between 400Hz and 2400Hz 0 OLSA 0 White 0 Peak 0 Notch Hz 1kHz 10kHz Freq in Hz Hz 1kHz 10kHz Freq in Hz Hz 1kHz 10kHz Freq in Hz Hz 1kHz 10kHz Freq in Hz Slide No / 25

24 Four different stationary noise sources Test / Noise Description OLSA speech like noise (perfect masker) White white Gaussian noise Peak peak noise with flat spectrum between 400Hz and 2400Hz Notch white noise with noise gap between 400Hz and 2400Hz 0 OLSA 0 White 0 Peak 0 Notch Hz 1kHz 10kHz Freq in Hz Hz 1kHz 10kHz Freq in Hz Hz 1kHz 10kHz Freq in Hz Hz 1kHz 10kHz Freq in Hz Slide No / 26

25 Four different stationary noise sources Test / Noise Description OLSA speech like noise (perfect masker) White white Gaussian noise Peak peak noise with flat spectrum between 400Hz and 2400Hz Notch white noise with noise gap between 400Hz and 2400Hz 0 OLSA 0 White 0 Peak 0 Notch Hz 1kHz 10kHz Freq in Hz Hz 1kHz 10kHz Freq in Hz Hz 1kHz 10kHz Freq in Hz Hz 1kHz 10kHz Freq in Hz Slide No / 27

26 Four different stationary noise sources Test / Noise Description OLSA speech like noise (perfect masker) White white Gaussian noise Peak peak noise with flat spectrum between 400Hz and 2400Hz Notch white noise with noise gap between 400Hz and 2400Hz 0 OLSA 0 White 0 Peak 0 Notch Hz 1kHz 10kHz Freq in Hz Hz 1kHz 10kHz Freq in Hz Hz 1kHz 10kHz Freq in Hz Hz 1kHz 10kHz Freq in Hz Slide No / 28

27 Test Results for OLSA & White Noise Masker denoising success [01] olsa noise SNR [db] denoising success [01] white noise SNR [db] OLSA Noise Masker: Predicted enhancement of speech intelligibility is zero for all SNRs White Noise Masker: Predicted enhancement of speech intelligibility is zero for all SNRs Slide No / 29

28 Test Results for Peak & Notch Masker denoising success [01] peak noise SNR [db] denoising success [01] notch noise SNR [db] Peak Noise Masker: Intelligibility will be improved significantly at moderate SNRs Notch Noise Masker: Small improvement at low SNRs Slide No / 30

29 Listening Tests / Test-Design OLSA Test Method Oldenburger sentence test, developed by Wagener and Kollmeier at University of Oldenburg (OLSA) is a further development of the Hagerman sentence test 30 sentences, the level of the noise masker is fixed, the level for each sentence is adapted until 50% word scoring (in the mean) is adjusted Find 50% correct words in a nonsense sentence (name, verb, number, adjective, object) (Ulrich gewann fünf rote Messer = Ulrich won five red knives) The mean of the SNR of the last 20 sentences form the Speech Reception Threshold (SRT) Slide No / 31

30 Oldenburger sentence test - Example page name, verb, number, adjective, object Slide No / 32

31 Test-Design II Each test person listen to the sentences mixed with one of the four noise sources. SRT for each mix with the four noise sources Each test person listen to denoised version the sentences mixed with one of the four noise sources. SRT for each of the four denoised versions The denoiser is a standard Real-World-denoiser based on published technologies: WOLA based, FFTLen = 2048, Noise Estimator is based on (Cohen 2002), Gain-Rule is Wiener with a-priori SNR (decision directed estimation based on Ephraim and Malah 1984) Slide No / 33

32 Results of the Listening Test -10 NoAlgo SRT in db Olsa White Peak Notch The listening test was performed by five trained expert listener. See the Speech Reception Threshold for the four noise masker, with the mean values and the standard deviation. First the results for the unprocessed sound files. Slide No / 34

33 Results of the Listening Test -10 NoAlgo SRT in db Olsa White Peak Notch SRT reduces significantly, if the noise has a different color than the speech signal and is therefore, an incomplete masker to the signal speech is very quiet compared to the noise and can be close to the technical limits (bit depth). Listening condition are very stressful because of the high gain to amplify the speech signal above the hearing threshold. Slide No / 35

34 Results of the Listening Test - Denoised -10 NoAlgo DeNoised SRT in db Olsa White Peak Notch No intelligibility enhancement possible for a perfect masking noise Denoising is not useful, as long as you can't use additional information for example spatial cues by multi-microphone processing. Slide No / 36

35 Results of the Listening Test - White Noise Masker -10 NoAlgo DeNoised SRT in db Olsa White Peak Notch Small enhancement for white noise (but maybe not statistically significant) DeNoising will give a very small benefit, if at all. Slide No / 37

36 Results of the Listening Test - Peak Noise Masker -10 NoAlgo DeNoised SRT in db Olsa White Peak Notch A significant enhancement for colored peak noise DeNoising will improve speech intelligibility, to the cost of a slightly changed and maybe disturbing background noise. Speech source will sound differently. Slide No / 38

37 Results of the Listening Test - Notch Noise Masker -10 NoAlgo DeNoised SRT in db Olsa White Peak Notch Enhancement for notch noise DeNoising may improve speech intelligibility More important: the annoying noise is reduced and therefore you can amplify the signal to a higher level, which will increase SRT. (But in this experiment level change was not permitted ) Slide No / 39

38 Comparison of objective measure and listening Test The new measure 'denoising success' can coarsly predict: The lowering of the Speech Reception Threshold (Peak and Notch vs. OLSA noise) The benefit in speech intellegibility for colored noise It is only a coarse predictor (no enhancement for white noise predicted). Slide No / 40

39 Conclusions Denoising is useful if: The disturbance is colored compared to the long-term spectrum of the desired speech signal. The disturbance is so annoying that an automatic reduction is necessary in order to be able to increase the level of the speech above hearing threshold. Automatic prediction of an increase of the speech intelligibility with a denoising process is possible, but so far, only in a limited range. More work to do! Slide No / 41

40 Slide No / 42 Literature [ANS97] American National Standard, Methods for calculation of the speech intelligibility index, ANSI S [Bit05] Bitzer, Jörg and Simmer, Uwe and Holube, Inga and Schaer, Timm, "Some Experiments on Short-Time Spectral Attenuation (STSA) Algorithms and Speech Intelligibility", International Workshop on Acoustic Echo and Noise Control (IWAENC) Sept 2005, p [Bol79] Boll, Suppression of acoustic noise in speech using spectral subtraction, IEEE Trans. Acoust., Speech, Signal Process., ASSP-27, pages , 1979 [Coh02] Cohen,Berdugo: "Noise estimation by minima controlled recursive averaging for robust speech enhancement", IEEE Signal Processing Letters, vol. 9, pp. 12, Jan [EM84] Ephraim, Malah, Speech Enhancement Using a Minimum Mean- Square Error Short-Time Spectral Amplitude Estimator, IEEE Transaction on Acoustics, Speech, and Signal Processing, Dec 1984 (32), No.6, p [Hag82] Hagerman, Sentences for Testing Speech Intelligibility in Noise, Scandinavian Audiology, No. 11, 1982, p [ISO93] ISO/IEC 11172, Information technology - Coding of moving pictures and associated audio for digital storage media at up to about 1,5 Mbit/s - Part 3: audio, BSI, London, First edition, October Implementation of ISO/IEC :1993. ISBN [Wag03] Wagener, Factors Influencing Sentence Intelligibility in Noise, PhD Thesis, Oldenburg, 2003 [Wie49] Wiener, Extrapolation, interpolation and smoothing of stationary time series: With engineering applications, In Principles of Electrical Engineering Series, Cambridge, Massachusetts and New York, [Zwi82] Zwicker, Psychoakustik, Springer Verlag, Berlin, 1982

41 Abstract -1- of -2- Increasing Speech Intelligibility by DeNoising: What can be achieved? Jan Rademacher, University of Oldenburg, Medical Physics, Oldenburg, Germany Prof. Dr. Jörg Bitzer, University of Applied Science Oldenburg / Ostfriesland / Wilhelmshaven, Institute for Hearing Technology and Audiology, Oldenburg Germany Jörg Houpert, Cube-Tec International, Bremen, Germany Abstract: A lot of forensic material is of poor recording quality. A variety of disturbances is added to the desired speech signal. A typical way to increase the quality of the signal is to apply algorithms that can be called DeNoisers, which filter out the background noises. However, increasing the quality does not imply that the speech intelligibility is increased, which is the essential goal of the process. In (Bitzer et al. 2005) the authors show that no significant enhancement is possible if the background noise is a perfect masker for the desired speech signal. In their study the noise signal was computed by a mixture of 500 sentences randomly mixed by the same speaker. Therefore, the long-term noise spectrum is exactly the same as the spectrum of the desired speech signal. Finally, the speech reception threshold is computed, which is the Signal-to- Noise-Ratio (SNR) at which 50% of all words can be understood. The methodology is described in (Kollmeier and Wesselkamp 1997 and Wagener 2003). The test is called Oldenburger sentence test and is based on (Hagerman 1982). Slide No / 43

42 Abstract -2- of -2- In real world recordings, however, the masker has a different spectrum. Prediction of enhancement The differences of the long term spectrum of the desired signal and the disturbance can be used as a measure of how well a denoiser could work for this combination of signals. Of course this is a coarse prediction since stationarity has to be assumed. Our new measure is based on the so called optimal- or Wiener- filter (Boll 1979, Wiener 1949) which is well-known for denoising tasks. Based on this filter, we suggest to calculate the masking thresholds (MT) (Zwicker 1982) in a way closely related to the MPEG standard (ISO/IEC ) both for the original noise spectrum and for the noise spectrum denoised by the Wiener-filter. These MTs are then used to find the spectral components of the speech signal which are audible before and after the denoising process. Taking into account the band importance function (American National Standard 1997) reflecting the importance of each frequency band for intelligibility, the increase of audible frequency bands after denoising will be a good approximation of the forthcoming improvement of the intelligibility achieved by denoising. In order to show this behaviour subjective listening tests will be used for comparison. Conclusions: Increasing speech intelligibility is possible, if the disturbing signal is not masking all important frequencies of the desired signal. Since denoising algorithms are able to filter signal-bands individually a significant improvement in quality and intelligibility can be achieved. Furthermore, we introduced a new measure that can be used as a coarse predictor of the possible improvement. Slide No / 44

Fundamentals of Perceptual Audio Encoding. Craig Lewiston HST.723 Lab II 3/23/06

Fundamentals of Perceptual Audio Encoding. Craig Lewiston HST.723 Lab II 3/23/06 Fundamentals of Perceptual Audio Encoding Craig Lewiston HST.723 Lab II 3/23/06 Goals of Lab Introduction to fundamental principles of digital audio & perceptual audio encoding Learn the basics of psychoacoustic

More information

Audio-coding standards

Audio-coding standards Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.

More information

AUDIOVISUAL COMMUNICATION

AUDIOVISUAL COMMUNICATION AUDIOVISUAL COMMUNICATION Laboratory Session: Audio Processing and Coding The objective of this lab session is to get the students familiar with audio processing and coding, notably psychoacoustic analysis

More information

AUDIOVISUAL COMMUNICATION

AUDIOVISUAL COMMUNICATION AUDIOVISUAL COMMUNICATION Laboratory Session: Audio Processing and Coding The objective of this lab session is to get the students familiar with audio processing and coding, notably psychoacoustic analysis

More information

Wavelet filter bank based wide-band audio coder

Wavelet filter bank based wide-band audio coder Wavelet filter bank based wide-band audio coder J. Nováček Czech Technical University, Faculty of Electrical Engineering, Technicka 2, 16627 Prague, Czech Republic novacj1@fel.cvut.cz 3317 New system for

More information

Audio-coding standards

Audio-coding standards Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.

More information

SPREAD SPECTRUM AUDIO WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL

SPREAD SPECTRUM AUDIO WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL SPREAD SPECTRUM WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL 1 Yüksel Tokur 2 Ergun Erçelebi e-mail: tokur@gantep.edu.tr e-mail: ercelebi@gantep.edu.tr 1 Gaziantep University, MYO, 27310, Gaziantep,

More information

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects. Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general

More information

Distributed Signal Processing for Binaural Hearing Aids

Distributed Signal Processing for Binaural Hearing Aids Distributed Signal Processing for Binaural Hearing Aids Olivier Roy LCAV - I&C - EPFL Joint work with Martin Vetterli July 24, 2008 Outline 1 Motivations 2 Information-theoretic Analysis 3 Example: Distributed

More information

2.4 Audio Compression

2.4 Audio Compression 2.4 Audio Compression 2.4.1 Pulse Code Modulation Audio signals are analog waves. The acoustic perception is determined by the frequency (pitch) and the amplitude (loudness). For storage, processing and

More information

Lecture 16 Perceptual Audio Coding

Lecture 16 Perceptual Audio Coding EECS 225D Audio Signal Processing in Humans and Machines Lecture 16 Perceptual Audio Coding 2012-3-14 Professor Nelson Morgan today s lecture by John Lazzaro www.icsi.berkeley.edu/eecs225d/spr12/ Hero

More information

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general

More information

Optical Storage Technology. MPEG Data Compression

Optical Storage Technology. MPEG Data Compression Optical Storage Technology MPEG Data Compression MPEG-1 1 Audio Standard Moving Pictures Expert Group (MPEG) was formed in 1988 to devise compression techniques for audio and video. It first devised the

More information

1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends

1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends Contents List of Figures List of Tables Contributing Authors xiii xxi xxiii Introduction Karlheinz Brandenburg and Mark Kahrs xxix 1 Audio quality determination based on perceptual measurement techniques

More information

Appendix 4. Audio coding algorithms

Appendix 4. Audio coding algorithms Appendix 4. Audio coding algorithms 1 Introduction The main application of audio compression systems is to obtain compact digital representations of high-quality (CD-quality) wideband audio signals. Typically

More information

Parametric Coding of High-Quality Audio

Parametric Coding of High-Quality Audio Parametric Coding of High-Quality Audio Prof. Dr. Gerald Schuller Fraunhofer IDMT & Ilmenau Technical University Ilmenau, Germany 1 Waveform vs Parametric Waveform Filter-bank approach Mainly exploits

More information

EE482: Digital Signal Processing Applications

EE482: Digital Signal Processing Applications Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 13 Audio Signal Processing 14/04/01 http://www.ee.unlv.edu/~b1morris/ee482/

More information

ijdsp Interactive Illustrations of Speech/Audio Processing Concepts

ijdsp Interactive Illustrations of Speech/Audio Processing Concepts ijdsp Interactive Illustrations of Speech/Audio Processing Concepts NSF Phase 3 Workshop, UCy Presentation of an Independent Study By Girish Kalyanasundaram, MS by Thesis in EE Advisor: Dr. Andreas Spanias,

More information

Rich Recording Technology Technical overall description

Rich Recording Technology Technical overall description Rich Recording Technology Technical overall description Ari Koski Nokia with Windows Phones Product Engineering/Technology Multimedia/Audio/Audio technology management 1 Nokia s Rich Recording technology

More information

Mpeg 1 layer 3 (mp3) general overview

Mpeg 1 layer 3 (mp3) general overview Mpeg 1 layer 3 (mp3) general overview 1 Digital Audio! CD Audio:! 16 bit encoding! 2 Channels (Stereo)! 44.1 khz sampling rate 2 * 44.1 khz * 16 bits = 1.41 Mb/s + Overhead (synchronization, error correction,

More information

Principles of MPEG audio compression

Principles of MPEG audio compression Principles of MPEG audio compression Principy komprese hudebního signálu metodou MPEG Petr Kubíček Abstract The article describes briefly audio data compression. Focus of the article is a MPEG standard,

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 213 http://acousticalsociety.org/ ICA 213 Montreal Montreal, Canada 2-7 June 213 Engineering Acoustics Session 2pEAb: Controlling Sound Quality 2pEAb1. Subjective

More information

Image Denoising Based on Hybrid Fourier and Neighborhood Wavelet Coefficients Jun Cheng, Songli Lei

Image Denoising Based on Hybrid Fourier and Neighborhood Wavelet Coefficients Jun Cheng, Songli Lei Image Denoising Based on Hybrid Fourier and Neighborhood Wavelet Coefficients Jun Cheng, Songli Lei College of Physical and Information Science, Hunan Normal University, Changsha, China Hunan Art Professional

More information

Application of Linux Audio in Hearing Aid Research

Application of Linux Audio in Hearing Aid Research Application of Linux Audio in Hearing Aid Research Giso Grimm 1 Tobias Herzke 2 Volker Hohmann 2 1 Universität Oldenburg, Oldenburg, Germany 2 HörTech ggmbh, Oldenburg, Germany Linux Audio Conference,

More information

DRA AUDIO CODING STANDARD

DRA AUDIO CODING STANDARD Applied Mechanics and Materials Online: 2013-06-27 ISSN: 1662-7482, Vol. 330, pp 981-984 doi:10.4028/www.scientific.net/amm.330.981 2013 Trans Tech Publications, Switzerland DRA AUDIO CODING STANDARD Wenhua

More information

Using Noise Substitution for Backwards-Compatible Audio Codec Improvement

Using Noise Substitution for Backwards-Compatible Audio Codec Improvement Using Noise Substitution for Backwards-Compatible Audio Codec Improvement Colin Raffel AES 129th Convention San Francisco, CA February 16, 2011 Outline Introduction and Motivation Coding Error Analysis

More information

Principles of Audio Coding

Principles of Audio Coding Principles of Audio Coding Topics today Introduction VOCODERS Psychoacoustics Equal-Loudness Curve Frequency Masking Temporal Masking (CSIT 410) 2 Introduction Speech compression algorithm focuses on exploiting

More information

New Results in Low Bit Rate Speech Coding and Bandwidth Extension

New Results in Low Bit Rate Speech Coding and Bandwidth Extension Audio Engineering Society Convention Paper Presented at the 121st Convention 2006 October 5 8 San Francisco, CA, USA This convention paper has been reproduced from the author's advance manuscript, without

More information

Audio Compression Using Decibel chirp Wavelet in Psycho- Acoustic Model

Audio Compression Using Decibel chirp Wavelet in Psycho- Acoustic Model Audio Compression Using Decibel chirp Wavelet in Psycho- Acoustic Model 1 M. Chinna Rao M.Tech,(Ph.D) Research scholar, JNTUK,kakinada chinnarao.mortha@gmail.com 2 Dr. A.V.S.N. Murthy Professor of Mathematics,

More information

Module 9 AUDIO CODING. Version 2 ECE IIT, Kharagpur

Module 9 AUDIO CODING. Version 2 ECE IIT, Kharagpur Module 9 AUDIO CODING Lesson 32 Psychoacoustic Models Instructional Objectives At the end of this lesson, the students should be able to 1. State the basic objectives of both the psychoacoustic models.

More information

Multimedia Communications. Audio coding

Multimedia Communications. Audio coding Multimedia Communications Audio coding Introduction Lossy compression schemes can be based on source model (e.g., speech compression) or user model (audio coding) Unlike speech, audio signals can be generated

More information

Structural Similarity Optimized Wiener Filter: A Way to Fight Image Noise

Structural Similarity Optimized Wiener Filter: A Way to Fight Image Noise Structural Similarity Optimized Wiener Filter: A Way to Fight Image Noise Mahmud Hasan and Mahmoud R. El-Sakka (B) Department of Computer Science, University of Western Ontario, London, ON, Canada {mhasan62,melsakka}@uwo.ca

More information

Ch. 5: Audio Compression Multimedia Systems

Ch. 5: Audio Compression Multimedia Systems Ch. 5: Audio Compression Multimedia Systems Prof. Ben Lee School of Electrical Engineering and Computer Science Oregon State University Chapter 5: Audio Compression 1 Introduction Need to code digital

More information

Bit or Noise Allocation

Bit or Noise Allocation ISO 11172-3:1993 ANNEXES C & D 3-ANNEX C (informative) THE ENCODING PROCESS 3-C.1 Encoder 3-C.1.1 Overview For each of the Layers, an example of one suitable encoder with the corresponding flow-diagram

More information

1 Introduction. 2 Speech Compression

1 Introduction. 2 Speech Compression Abstract In this paper, the effect of MPEG audio compression on HMM-based speech synthesis is studied. Speech signals are encoded with various compression rates and analyzed using the GlottHMM vocoder.

More information

Objective Models for Perceptual Vacuum Cleaner Noise

Objective Models for Perceptual Vacuum Cleaner Noise 2th WSEAS International Conference on SYSTEMS, Heraklion, Greece, July 22-24, 0 Objective Models for Perceptual Vacuum Cleaner Noise C. Lee *, Y. Cho **, S. Lee *, J. Park *, D. Hwang **, C. Song ** and

More information

5: Music Compression. Music Coding. Mark Handley

5: Music Compression. Music Coding. Mark Handley 5: Music Compression Mark Handley Music Coding LPC-based codecs model the sound source to achieve good compression. Works well for voice. Terrible for music. What if you can t model the source? Model the

More information

I D I A P R E S E A R C H R E P O R T. October submitted for publication

I D I A P R E S E A R C H R E P O R T. October submitted for publication R E S E A R C H R E P O R T I D I A P Temporal Masking for Bit-rate Reduction in Audio Codec Based on Frequency Domain Linear Prediction Sriram Ganapathy a b Petr Motlicek a Hynek Hermansky a b Harinath

More information

Dr Andrew Abel University of Stirling, Scotland

Dr Andrew Abel University of Stirling, Scotland Dr Andrew Abel University of Stirling, Scotland University of Stirling - Scotland Cognitive Signal Image and Control Processing Research (COSIPRA) Cognitive Computation neurobiology, cognitive psychology

More information

Morset Sound Development. User s Guide

Morset Sound Development. User s Guide Morset Sound Development WinMLS 2000 for Microsoft Windows 95/98/ME/NT/2000/XP User s Guide Design: L. H. Morset, A. Goldberg, P. Svensson, +++. Programming: L. H. Morset, J. R. Mathiassen, J. T. Kommandantvold,+++.

More information

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved.

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved. Compressed Audio Demystified Why Music Producers Need to Care About Compressed Audio Files Download Sales Up CD Sales Down High-Definition hasn t caught on yet Consumers don t seem to care about high fidelity

More information

Image denoising in the wavelet domain using Improved Neigh-shrink

Image denoising in the wavelet domain using Improved Neigh-shrink Image denoising in the wavelet domain using Improved Neigh-shrink Rahim Kamran 1, Mehdi Nasri, Hossein Nezamabadi-pour 3, Saeid Saryazdi 4 1 Rahimkamran008@gmail.com nasri_me@yahoo.com 3 nezam@uk.ac.ir

More information

TECHNICAL PAPER. Personal audio mix for broadcast programs. Fraunhofer Institute for Integrated Circuits IIS

TECHNICAL PAPER. Personal audio mix for broadcast programs. Fraunhofer Institute for Integrated Circuits IIS TECHNICAL PAPER Personal audio mix for broadcast programs Finding the right balance between dialogue and ambient sound within a broadcast program is a major challenge for sound engineers and an increasing

More information

Effect of MPEG Audio Compression on HMM-based Speech Synthesis

Effect of MPEG Audio Compression on HMM-based Speech Synthesis Effect of MPEG Audio Compression on HMM-based Speech Synthesis Bajibabu Bollepalli 1, Tuomo Raitio 2, Paavo Alku 2 1 Department of Speech, Music and Hearing, KTH, Stockholm, Sweden 2 Department of Signal

More information

A PSYCHOACOUSTIC MODEL WITH PARTIAL SPECTRAL FLATNESS MEASURE FOR TONALITY ESTIMATION

A PSYCHOACOUSTIC MODEL WITH PARTIAL SPECTRAL FLATNESS MEASURE FOR TONALITY ESTIMATION A PSYCHOACOUSTIC MODEL WITH PARTIAL SPECTRAL FLATNESS MEASURE FOR TONALITY ESTIMATION Armin Taghipour 1, Maneesh Chandra Jaikumar 2, and Bernd Edler 1 1 International Audio Laboratories Erlangen, Am Wolfsmantel

More information

Clustering & Dimensionality Reduction. 273A Intro Machine Learning

Clustering & Dimensionality Reduction. 273A Intro Machine Learning Clustering & Dimensionality Reduction 273A Intro Machine Learning What is Unsupervised Learning? In supervised learning we were given attributes & targets (e.g. class labels). In unsupervised learning

More information

RECOMMENDATION ITU-R BS Procedure for the performance test of automated query-by-humming systems

RECOMMENDATION ITU-R BS Procedure for the performance test of automated query-by-humming systems Rec. ITU-R BS.1693 1 RECOMMENDATION ITU-R BS.1693 Procedure for the performance test of automated query-by-humming systems (Question ITU-R 8/6) (2004) The ITU Radiocommunication Assembly, considering a)

More information

Adaptive Quantization for Video Compression in Frequency Domain

Adaptive Quantization for Video Compression in Frequency Domain Adaptive Quantization for Video Compression in Frequency Domain *Aree A. Mohammed and **Alan A. Abdulla * Computer Science Department ** Mathematic Department University of Sulaimani P.O.Box: 334 Sulaimani

More information

MPEG-1. Overview of MPEG-1 1 Standard. Introduction to perceptual and entropy codings

MPEG-1. Overview of MPEG-1 1 Standard. Introduction to perceptual and entropy codings MPEG-1 Overview of MPEG-1 1 Standard Introduction to perceptual and entropy codings Contents History Psychoacoustics and perceptual coding Entropy coding MPEG-1 Layer I/II Layer III (MP3) Comparison and

More information

Module 9 AUDIO CODING. Version 2 ECE IIT, Kharagpur

Module 9 AUDIO CODING. Version 2 ECE IIT, Kharagpur Module 9 AUDIO CODING Lesson 29 Transform and Filter banks Instructional Objectives At the end of this lesson, the students should be able to: 1. Define the three layers of MPEG-1 audio coding. 2. Define

More information

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO ISO/IEC JTC1/SC29/WG11 N15071 February 2015, Geneva,

More information

Efficient Signal Adaptive Perceptual Audio Coding

Efficient Signal Adaptive Perceptual Audio Coding Efficient Signal Adaptive Perceptual Audio Coding MUHAMMAD TAYYAB ALI, MUHAMMAD SALEEM MIAN Department of Electrical Engineering, University of Engineering and Technology, G.T. Road Lahore, PAKISTAN. ]

More information

PJP-EC200 Setup Procedure Yamaha Corporation

PJP-EC200 Setup Procedure Yamaha Corporation Yamaha Corporation Contents Components... 3... 4 Installations and Connections... 4 Setup... 4 Adjustment... 7... 9 Installations and Connections... 9 Setup... 9 Adjustment... 12 2 Components This document

More information

What is multimedia? Multimedia. Continuous media. Most common media types. Continuous media processing. Interactivity. What is multimedia?

What is multimedia? Multimedia. Continuous media. Most common media types. Continuous media processing. Interactivity. What is multimedia? Multimedia What is multimedia? Media types +Text + Graphics + Audio +Image +Video Interchange formats What is multimedia? Multimedia = many media User interaction = interactivity Script = time 1 2 Most

More information

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding Perceptual Coding Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding Part II wrap up 6.082 Fall 2006 Perceptual Coding, Slide 1 Lossless vs.

More information

Adaptive Doppler centroid estimation algorithm of airborne SAR

Adaptive Doppler centroid estimation algorithm of airborne SAR Adaptive Doppler centroid estimation algorithm of airborne SAR Jian Yang 1,2a), Chang Liu 1, and Yanfei Wang 1 1 Institute of Electronics, Chinese Academy of Sciences 19 North Sihuan Road, Haidian, Beijing

More information

The Sensitivity Matrix

The Sensitivity Matrix The Sensitivity Matrix Integrating Perception into the Flexcode Project Jan Plasberg Flexcode Seminar Lannion June 6, 2007 SIP - Sound and Image Processing Lab, EE, KTH Stockholm 1 SIP - Sound and Image

More information

Multimedia. What is multimedia? Media types. Interchange formats. + Text +Graphics +Audio +Image +Video. Petri Vuorimaa 1

Multimedia. What is multimedia? Media types. Interchange formats. + Text +Graphics +Audio +Image +Video. Petri Vuorimaa 1 Multimedia What is multimedia? Media types + Text +Graphics +Audio +Image +Video Interchange formats Petri Vuorimaa 1 What is multimedia? Multimedia = many media User interaction = interactivity Script

More information

AUDIO SIGNAL PROCESSING FOR NEXT- GENERATION MULTIMEDIA COMMUNI CATION SYSTEMS

AUDIO SIGNAL PROCESSING FOR NEXT- GENERATION MULTIMEDIA COMMUNI CATION SYSTEMS AUDIO SIGNAL PROCESSING FOR NEXT- GENERATION MULTIMEDIA COMMUNI CATION SYSTEMS Edited by YITENG (ARDEN) HUANG Bell Laboratories, Lucent Technologies JACOB BENESTY Universite du Quebec, INRS-EMT Kluwer

More information

Downloaded 09/01/14 to Redistribution subject to SEG license or copyright; see Terms of Use at

Downloaded 09/01/14 to Redistribution subject to SEG license or copyright; see Terms of Use at Random Noise Suppression Using Normalized Convolution Filter Fangyu Li*, Bo Zhang, Kurt J. Marfurt, The University of Oklahoma; Isaac Hall, Star Geophysics Inc.; Summary Random noise in seismic data hampers

More information

MPEG-4 Version 2 Audio Workshop: HILN - Parametric Audio Coding

MPEG-4 Version 2 Audio Workshop: HILN - Parametric Audio Coding MPEG-4 Version 2 Audio Workshop: HILN - Parametric Audio Coding Heiko Purnhagen Laboratorium für Informationstechnologie University of Hannover, Germany Outline Introduction What is "Parametric Audio Coding"?

More information

Sonic Studio Mastering EQ Table of Contents

Sonic Studio Mastering EQ Table of Contents Sonic Studio Mastering EQ Table of Contents 1.0 Sonic Studio Mastering EQ... 3 1.1 Sonic Studio Mastering EQ Audio Unit Plug-in...4 1.1.1 Overview... 4 1.1.2 Operation... 4 1.1.2.1 Mastering EQ Visualizer...5

More information

[N569] Wavelet speech enhancement based on voiced/unvoiced decision

[N569] Wavelet speech enhancement based on voiced/unvoiced decision The 32nd International Congress and Exposition on Noise Control Engineering Jeju International Convention Center, Seogwipo, Korea, August 25-28, 2003 [N569] Wavelet speech enhancement based on voiced/unvoiced

More information

Perceptual Pre-weighting and Post-inverse weighting for Speech Coding

Perceptual Pre-weighting and Post-inverse weighting for Speech Coding Perceptual Pre-weighting and Post-inverse weighting for Speech Coding Niranjan Shetty and Jerry D. Gibson Department of Electrical and Computer Engineering University of California, Santa Barbara, CA,

More information

The MPEG-4 General Audio Coder

The MPEG-4 General Audio Coder The MPEG-4 General Audio Coder Bernhard Grill Fraunhofer Institute for Integrated Circuits (IIS) grl 6/98 page 1 Outline MPEG-2 Advanced Audio Coding (AAC) MPEG-4 Extensions: Perceptual Noise Substitution

More information

Robust speech recognition using features based on zero crossings with peak amplitudes

Robust speech recognition using features based on zero crossings with peak amplitudes Robust speech recognition using features based on zero crossings with peak amplitudes Author Gajic, Bojana, Paliwal, Kuldip Published 200 Conference Title Proceedings of the 200 IEEE International Conference

More information

Scalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC

Scalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC Scalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC Ralf Geiger 1, Gerald Schuller 1, Jürgen Herre 2, Ralph Sperschneider 2, Thomas Sporer 1 1 Fraunhofer IIS AEMT, Ilmenau, Germany 2 Fraunhofer

More information

WAVELET USE FOR IMAGE RESTORATION

WAVELET USE FOR IMAGE RESTORATION WAVELET USE FOR IMAGE RESTORATION Jiří PTÁČEK and Aleš PROCHÁZKA 1 Institute of Chemical Technology, Prague Department of Computing and Control Engineering Technicka 5, 166 28 Prague 6, Czech Republic

More information

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd Overview Audio Signal Processing Applications @ Dolby Audio Signal Processing Basics

More information

Parametric Coding of Spatial Audio

Parametric Coding of Spatial Audio Parametric Coding of Spatial Audio Ph.D. Thesis Christof Faller, September 24, 2004 Thesis advisor: Prof. Martin Vetterli Audiovisual Communications Laboratory, EPFL Lausanne Parametric Coding of Spatial

More information

IVM 4. Audio Presets Details SST 4. AKG Acoustics 2008

IVM 4. Audio Presets Details SST 4. AKG Acoustics 2008 IVM 4 Audio Presets Details SST 4 AKG Acoustics 2008 2 COMPRESSOR All COMPRESSOR Presets have one GOAL: Create a precise and well-balanced monitoring level by maintaining all sound nuances and a natural

More information

ELEC Dr Reji Mathew Electrical Engineering UNSW

ELEC Dr Reji Mathew Electrical Engineering UNSW ELEC 4622 Dr Reji Mathew Electrical Engineering UNSW Review of Motion Modelling and Estimation Introduction to Motion Modelling & Estimation Forward Motion Backward Motion Block Motion Estimation Motion

More information

Chapter 2 Studies and Implementation of Subband Coder and Decoder of Speech Signal Using Rayleigh Distribution

Chapter 2 Studies and Implementation of Subband Coder and Decoder of Speech Signal Using Rayleigh Distribution Chapter 2 Studies and Implementation of Subband Coder and Decoder of Speech Signal Using Rayleigh Distribution Sangita Roy, Dola B. Gupta, Sheli Sinha Chaudhuri and P. K. Banerjee Abstract In the last

More information

VDSL in terms of reach and capacity

VDSL in terms of reach and capacity ANSI TE.4/97-38 () Standards project: TE.4: VDSL Title : Zipper performance when mixing ADSL and VDSL in terms of reach and capacity Source : Contact: Telia Research AB Daniel Bengtsson, Petra Deutgen,

More information

An improved adaptive gain equalizer for noise reduction with low speech distortion

An improved adaptive gain equalizer for noise reduction with low speech distortion Borgh et al. EURASIP Journal on Audio, Speech, and Music Processing 2, 2:7 http://asmp.eurasipjournals.com/content/2//7 RESEARCH Open Access An improved adaptive gain equalizer for noise reduction with

More information

Audio Watermarking Based on PCM Technique

Audio Watermarking Based on PCM Technique Audio Watermarking Based on PCM Technique Ranjeeta Yadav Department of ECE SGIT, Ghaziabad, INDIA Sachin Yadav Department of CSE SGIT, Ghaziabad, INDIA Jyotsna Singh Department of ECE NSIT, New Delhi,

More information

Overview: motion-compensated coding

Overview: motion-compensated coding Overview: motion-compensated coding Motion-compensated prediction Motion-compensated hybrid coding Motion estimation by block-matching Motion estimation with sub-pixel accuracy Power spectral density of

More information

EECS 556 Image Processing W 09. Image enhancement. Smoothing and noise removal Sharpening filters

EECS 556 Image Processing W 09. Image enhancement. Smoothing and noise removal Sharpening filters EECS 556 Image Processing W 09 Image enhancement Smoothing and noise removal Sharpening filters What is image processing? Image processing is the application of 2D signal processing methods to images Image

More information

COMMITTEE T1 TELECOMMUNICATIONS Working Group T1E1.4 (DSL Access) Baltimore; August 25-26, 1999

COMMITTEE T1 TELECOMMUNICATIONS Working Group T1E1.4 (DSL Access) Baltimore; August 25-26, 1999 COMMITTEE T1 TELECOMMUNICATIONS Working Group T1E1. (DSL Access) Baltimore; August 5-6, 1999 T1E1./99-371 CONTRIBUTION TITLE: SOURCE: PROJECT: ISDN, HDSL, and HDSL Adaptation and SNR with Short-Term Stationary

More information

Replacing ECMs with Premium MEMS Microphones

Replacing ECMs with Premium MEMS Microphones Replacing ECMs with Premium MEMS Microphones About this document Scope and purpose This document provides reasons to replace electret condenser microphones (ECM) with premium MEMS microphones. Intended

More information

Audio Noise Removal Literature review

Audio Noise Removal Literature review Audio Noise Removal Literature review 1 Srashti Badkul 2 Brajlata Chourasiya Department of Electronics & Communication, GGITS, Jabalpur ABSTRACT: In this paper different audio denoising techniques are

More information

MKE 600. Perfect Sound.

MKE 600. Perfect Sound. MKE 600 Perfect Sound. Perfect Pictures. Your Reliable Companion for Every Shoot: The New MKE 600 Wherever the magic moment occurs: The success of your work depends on achieving the perfect interplay of

More information

Mic Setup. Motiv Android. Features. The ShurePlus MOTIV App for Android

Mic Setup. Motiv Android. Features. The ShurePlus MOTIV App for Android Motiv Android The ShurePlus MOTIV App for Android General Description The ShurePlus MOTIV recording app controls microphone settings and provides instant recording. Set your record settings in the app.

More information

(t x) domain, pattern-based multiple separation

(t x) domain, pattern-based multiple separation Stanford Exploration Project, Report 103, April 27, 2000, pages 63?? (t x) domain, pattern-based multiple separation Robert G. Clapp and Morgan Brown 1 ABSTRACT Pattern-based signal/noise separation is

More information

Quick Setup The MVi is compatible with most devices that have a USB or Lightning connector. Touch Panel Interface Controls. MVi. Overview.

Quick Setup The MVi is compatible with most devices that have a USB or Lightning connector. Touch Panel Interface Controls. MVi. Overview. MVi Digital Audio Interface General Description The Shure MVi is a compact digital audio interface used to connect a microphone, guitar, or other instrument to a computer or mobile device. Audio and power

More information

A Short Narrative on the Scope of Work Involved in Data Conditioning and Seismic Reservoir Characterization

A Short Narrative on the Scope of Work Involved in Data Conditioning and Seismic Reservoir Characterization A Short Narrative on the Scope of Work Involved in Data Conditioning and Seismic Reservoir Characterization March 18, 1999 M. Turhan (Tury) Taner, Ph.D. Chief Geophysicist Rock Solid Images 2600 South

More information

Research on the New Image De-Noising Methodology Based on Neural Network and HMM-Hidden Markov Models

Research on the New Image De-Noising Methodology Based on Neural Network and HMM-Hidden Markov Models Research on the New Image De-Noising Methodology Based on Neural Network and HMM-Hidden Markov Models Wenzhun Huang 1, a and Xinxin Xie 1, b 1 School of Information Engineering, Xijing University, Xi an

More information

/ / _ / _ / _ / / / / /_/ _/_/ _/_/ _/_/ _\ / All-American-Advanced-Audio-Codec

/ / _ / _ / _ / / / / /_/ _/_/ _/_/ _/_/ _\ / All-American-Advanced-Audio-Codec / / _ / _ / _ / / / / /_/ _/_/ _/_/ _/_/ _\ / All-American-Advanced-Audio-Codec () **Z ** **=Z ** **= ==== == **= ==== \"\" === ==== \"\"\" ==== \"\"\"\" Tim O Brien Colin Sullivan Jennifer Hsu Mayank

More information

<< WILL FILL IN THESE SECTIONS THIS WEEK to provide sufficient background>>

<< WILL FILL IN THESE SECTIONS THIS WEEK to provide sufficient background>> THE GSS CODEC MUSIC 422 FINAL PROJECT Greg Sell, Song Hui Chon, Scott Cannon March 6, 2005 Audio files at: ccrma.stanford.edu/~gsell/422final/wavfiles.tar Code at: ccrma.stanford.edu/~gsell/422final/codefiles.tar

More information

CS 074 The Digital World. Digital Audio

CS 074 The Digital World. Digital Audio CS 074 The Digital World Digital Audio 1 Digital Audio Waves Hearing Analog Recording of Waves Pulse Code Modulation and Digital Recording CDs, Wave Files, MP3s Editing Wave Files with BinEd 2 Waves A

More information

Licensees Name: New Gullen Range Wind Farm Pty Ltd. Address of licenced premise: Gullen Range Wind Farm 250 Storriers Lane Banniester NSW 2580

Licensees Name: New Gullen Range Wind Farm Pty Ltd. Address of licenced premise: Gullen Range Wind Farm 250 Storriers Lane Banniester NSW 2580 New Gullen Range Wind Farm Pty Ltd ACN 167 4 211 Correspondence Address: Level 21, 1 York Street Sydney, NSW 2000 info@gullenrangewindfarm.com Toll Free: 1800 9711 Description: The following report was

More information

Perspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony

Perspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony Perspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony Nobuhiko Kitawaki University of Tsukuba 1-1-1, Tennoudai, Tsukuba-shi, 305-8573 Japan. E-mail: kitawaki@cs.tsukuba.ac.jp

More information

ROW.mp3. Colin Raffel, Jieun Oh, Isaac Wang Music 422 Final Project 3/12/2010

ROW.mp3. Colin Raffel, Jieun Oh, Isaac Wang Music 422 Final Project 3/12/2010 ROW.mp3 Colin Raffel, Jieun Oh, Isaac Wang Music 422 Final Project 3/12/2010 Motivation The realities of mp3 widespread use low quality vs. bit rate when compared to modern codecs Vision for row-mp3 backwards

More information

Compliant Baffle for Large Telescope Daylight Imaging. Stacie Williams Air Force Research Laboratory ABSTRACT

Compliant Baffle for Large Telescope Daylight Imaging. Stacie Williams Air Force Research Laboratory ABSTRACT Compliant Baffle for Large Telescope Daylight Imaging Steven Griffin, Andrew Whiting, Shawn Haar The Boeing Company Stacie Williams Air Force Research Laboratory ABSTRACT With the recent interest in daylight

More information

Compression; Error detection & correction

Compression; Error detection & correction Compression; Error detection & correction compression: squeeze out redundancy to use less memory or use less network bandwidth encode the same information in fewer bits some bits carry no information some

More information

BootEQ mkii M A N U A L

BootEQ mkii M A N U A L BootEQ mkii M A N U A L Content Chapter 1: Introduction 5 1.1. License... 5 1.2. Installation... 6 1.3. Overarching topics... 6 1.4. Credits... 7 Chapter 2: Reference 9 2.1. Overview... 9 2.2. Quick reference...

More information

ET4254 Communications and Networking 1

ET4254 Communications and Networking 1 Topic 2 Aims:- Communications System Model and Concepts Protocols and Architecture Analog and Digital Signal Concepts Frequency Spectrum and Bandwidth 1 A Communications Model 2 Communications Tasks Transmission

More information

Detecting Salient Contours Using Orientation Energy Distribution. Part I: Thresholding Based on. Response Distribution

Detecting Salient Contours Using Orientation Energy Distribution. Part I: Thresholding Based on. Response Distribution Detecting Salient Contours Using Orientation Energy Distribution The Problem: How Does the Visual System Detect Salient Contours? CPSC 636 Slide12, Spring 212 Yoonsuck Choe Co-work with S. Sarma and H.-C.

More information

BETTER HEARING. making MORE PERSONAL THAN EVER

BETTER HEARING. making MORE PERSONAL THAN EVER hearing solutions product catalog supplement may 12 making BETTER HEARING MORE PERSOL THAN EVER Patients come to you seeking a hearing solution that fits them perfectly. With our newest products, you now

More information

Speech and audio coding

Speech and audio coding Institut Mines-Telecom Speech and audio coding Marco Cagnazzo, cagnazzo@telecom-paristech.fr MN910 Advanced compression Outline Introduction Introduction Speech signal Music signal Masking Codeurs simples

More information