Perceptual audio coding schemes based on adaptive signal processing tools

Size: px
Start display at page:

Download "Perceptual audio coding schemes based on adaptive signal processing tools"

Transcription

1 Perceptual audio coding schemes based on adaptive signal processing tools Fernando A. Marengo Rodriguez, Sergio A. Castells, and Gonzalo D. Sad Citation: Proc. Mtgs. Acoust. 28, (2016); View online: View Table of Contents: Published by the Acoustical Society of America Articles you may be interested in Analysis of lightweight acoustic reflectors Proceedings of Meetings on Acoustics 28, (2017); / The acoustics of the concert hall Auditorio Juan Victoria from San Juan, Argentina Proceedings of Meetings on Acoustics 28, (2017); / Signal-dependent spatial audio reproduction based on playback-setup-defined beamformers Proceedings of Meetings on Acoustics 28, (2017); / Cross-frequency coupling and phase synchronization in nonlinear acoustics Proceedings of Meetings on Acoustics 28, (2016); / How long is a vocal tract? Comparison of acoustic impedance spectrometry with magnetic resonance imaging Proceedings of Meetings on Acoustics 28, (2017); /

2 Volume nd International Congress on Acoustics Acoustics for the 21 st Century Buenos Aires, Argentina September 2016 Signal Processing in Acoustics: Paper ICA Perceptual audio coding schemes based on adaptive signal processing tools Fernando A. Marengo Rodriguez, Sergio A. Castells, and Gonzalo D. Sad National University of Rosario, Rosario, Santa Fe, Argentina; In this paper, new perceptual audio coding schemes based on adaptive processing tools are proposed. They rely on both the empirical mode decomposition (EMD) and the ensemble empirical mode decomposition (EEMD) methods. In comparison with other perceptual coding schemes, the one presented here is simpler since physically meaningful components of the input signal are detected, then their local extrema are extracted and Golomb-Rice encoding of the extracted samples is performed. The proposed scheme is assessed in terms of compression ratio and perceptual quality for various tracks from the European Broadcasting Union Sound Quality Assessment Material (EBU-SQAM) compact disc. The obtained results are compared with those corresponding to other perceptual audio coding methodologies. Published by the Acoustical Society of America 2017 Acoustical Society of America [DOI: / ] Proceedings of Meetings on Acoustics, Vol. 28, (2017) Page 1

3 1. INTRODUCTION In order to optimize the use of data transmission channels as well as the storage capacity in hard disk drives, different lossless and perceptual (lossy) audio coding schemes were developed. The formers 1-3 allow to reduce the input audio file size without introducing distortion. Typical compact disc (CD) audio files in PCM format can be packed into one half or one sixth of their original sizes, depending on certain characteristics of the input data, such as the dynamic range and spectral content 4. Perceptual audio encoders 5-8 allow higher compression gains to be obtained at a cost of higher complexity of the encoder. In both types of encoders, it is crucial that the decoder be of low complexity, so as to allow low-cost portable devices to play the previously encoded audio file in real time. In this paper, new encoding schemes based on adaptive analysis techniques are proposed, and their performances are quantitatively analyzed and compared with previous techniques using musical tracks from the European Broadcasting Union Sound Quality Assessment Material (EBU-SQAM) CD 9. This document is organized as follows. The adaptive tools used for our system are described in Section 2, and the proposed encoder and decoder are outlined in Section 3. The criterion used for selecting the audio files to be tested is detailed in Section 4, and numerical results are summarized in Section 5. Conclusions are drawn in Section ADAPTIVE TOOLS FOR THE ENCODER Unlike other encoding schemes, our method relies on adaptive signal decomposition tools. They are: 1) the empirical mode decomposition (EMD) method and 2) the ensemble empirical mode decomposition (EEMD) 13 algorithm. These tools allow to decompose any one dimensional (1D) sequence into a reduced set of zero mean amplitude and frequency modulated (AM-FM) signals, each usually related with a physical phenomenon underlying the system under study 12. In the following, these tools will be briefly described. A. THE EMPIRICAL MODE DECOMPOSITION METHOD In the EMD method, details are extracted from the input data progressively, from the finest temporal resolution up to the coarsest one, by means of a sifting process. Intuitively, the input data ( ) (where is the time index) is seen as an addition of zero mean oscillatory detail functions ( ), each of which is added to slower temporal variations. Each detail is extracted as follows. 1. Local maxima (minima) are computed from the input signal ( ) and then interpolated, resulting the upper (lower) envelope ( ) ( ( )). 2. The local mean ( ) ( ) ( ) is computed, and the first order detail function ( ) ( ) ( ) is determined. 3. Step 1 is performed, using the first residue ( ) ( ) ( ) as the input sequence. The output sequences in this process are the second order detail function ( ), and the second residue ( ) ( ) ( ). 4. This sifting process continues iteratively from steps 1 through 3, using the residues ( ) ( ) ( ) as input sequences, and giving ( ) ( ) ( ) as outputs. This process is performed until the residue ( ) ( ) ( ) has no more local extrema and hence no more details to extract. At this point, the input signal is decomposed as Proceedings of Meetings on Acoustics, Vol. 28, (2017) Page 2

4 ( ) ( ) ( ) (1) ( ) is known as intrinsic mode function or IMF, and ( ) is the final residue, also denoted as ( ) for simplicity. It has to be stressed that each detail function at the end of step 2 may have nonzero mean, in which case it has to be iterated from step 1 to step 2 for subtracting such mean. This process is performed until the corresponding local mean is sufficiently small 10, 11. Since each IMF depends solely on the input signal, the EMD algorithm is characterized for being fully data-driven, adaptive and always gives a small amount of IMFs which may be described as AM-FM signals, i.e., ( ) ( ) ( ), where ( ) is the instantaneous ( ) amplitude and ( ) is the instantaneous frequency of the -th IMF ( ) 12. The EMD method has been used extensively for robust data analysis 12, and also for data compression in 2D 14 and 1D 15 including audio signals The present paper introduces further improvements on the method introduced in 16. B. THE ENSEMBLE EMPIRICAL MODE DECOMPOSITION METHOD This technique, also known by its acronym, EEMD 13, consists of multiple applications of the EMD algorithm to the input signal contaminated by different realizations of finite power white Gaussian noise (WGN). Such noise adds uniform spectral content to the input sequence, allowing the EMD to work as a dyadic filter bank 19, 20 and to obtain a set of IMFs more concentrated in some specific spectral bands. (More precisely, EMD works as a dyadic filter bank only for WGN, since each IMF is mainly concentrated in one octave band. For other class of input signals, the spectral band of each IMF could be very difficult to predict, unless the input is contaminated with finite power WGN.) Finally, the homologous IMFs ( ) ( ) ( ) are ensemble averaged over all the L realizations, resulting a set of average IMFs ( ) ( ) given by ( ) ( ) (2) One drawback of the EEMD method is that it does not fulfill completeness since Eq. (1) is not satisfied. However, this problem is minimized by reducing the power of the WGN added in this method 21, which is viable if the input spectrum is more concentrated at low frequencies 13. This is true for many classes of audio signals. An additional benefit for this case is the need of less number of total realizations, which increases the speed of the EEMD algorithm. Also, an important advantage of EEMD over EMD is that each resulting IMF does not contain information regarding two or more different physical phenomena, also known in the literature as mode mixing. This result is a consequence of adding uniform spectral density via WGN. This advantage is useful for the proposed encoder herein, since mode mixing is related with redundancies in two or more IMFs. Proceedings of Meetings on Acoustics, Vol. 28, (2017) Page 3

5 3. PROPOSED ENCODER/DECODER The proposed encoder and decoder are explained in the following subsections. A. ENCODER The encoder processes the input audio file (PCM data stored in a digital WAV file) on a frame by frame basis, according to the block diagram illustrated in Fig. 1. Figure 1. Block diagram for the proposed encoder. The encoder detects the sampling rate of the input file, the number of channels and the number of bits per sample. Then, the input data stream is segmented in frames of fixed length (4096 samples for the present case). For each frame, data are processed via either EMD or EEMD, resulting a set of IMFs. According to some numerical tests, some IMFs may not be relevant, since they carry little information of the input signal 22. For such reason, a two stage filtering process is applied to the IMFs, allowing to detect the relevant ones (see Fig. 2). In such filter, the correlation coefficient between the input sequence and the k-th IMF is determined, and those IMFs with correlation lower than a given threshold ( ) (with ) are ignored 22. Figure 2. Diagram for detecting the relevant IMFs. The second stage of the filter considers a masking model which allows to detect and remove perceptually irrelevant information 23, with little distortion for the human auditory system. In order to detect and remove masked components for each critical band, the spreading function utilized in ISO/IEC MPEG Psychoacoustic Model 2 (see 24, pp. 187) is applied. Then, the intensities of the most relevant IMFs (each associated with the corresponding masking curve in a given critical band) are added up according to 24, pp. 192, setting the parameter.33. The IMFs for an experimental sequence are illustrated in Fig. 3(a), and the relevant ones after the two stage filtering process are depicted in Fig. 3(b). The proposed filter removes a great amount of information (7 IMFs), allowing more efficient data compression. The relevant IMFs resulting from the previous step are represented via the corresponding local extrema, which is equivalent to its critical sampling rate locally (see Fig. 4). The interpolation of such extrema allows to reconstruct each IMF with low error The abscissas and ordinates of the previously mentioned local extrema are encoded separately. The former are differentiated, resulting a set of smaller numbers. Since each IMF is oscillating around zero, the ordinates are sign alternating and represented by their absolute values ( ). Just one additional bit is added (at the beginning of the frame), stating which is the sign of the first ordinate in the corresponding IMF. The set of absolute ordinates is subtracted from their median (i.e., ( ) ( ) ), so as to obtain a data set more symmetrical Proceedings of Meetings on Acoustics, Vol. 28, (2017) Page 4

6 around zero. Both sets corresponding to the abscissas and ordinates are Golomb-Rice encoded separately 25 and finally multiplexed with each other. (a) (b) Figure 3. (a) Set of IMFs for an audio input signal. (b) Resulting IMFs after the filtering process. Proceedings of Meetings on Acoustics, Vol. 28, (2017) Page 5

7 Figure 4. IMF and its local extrema for a specific frame. B. DECODER The decoder is represented in Fig. 5 and works as follows. First, a demultiplexer splits the encoded data stream into the data sets corresponding to the abscissas and ordinates. Then, each of these data sets are Golomb-Rice decoded, resulting the differentiated abscissas and processed ordinates ( ) ( ). The abscissas are recovered via cumulative sum, i.e.,, and the ordinates are determined after addition of the corresponding median and sign shifting, according to the sign of the first ordinate. The resulting local extrema ( ) are then interpolated via piecewise 3rd order cubic Hermite interpolating polynomial (PCHIP), giving the corresponding IMF. This process is performed for all the relevant IMFs, which are then added altogether, giving the decoded signal. Figure 5. Block diagram for the decoder. It has to be stressed that the encoder and above all the decoder are quite simple. The simplicity of the decoder is crucial in order to allow low-cost portable devices to play the encoded audio file in real time. 4. PERFORMANCE ANALYSIS The EMD/EEMD based audio coding scheme was tested with PCM coded WAV audio files in the EBU-SQAM CD 9. Our aim is to test a variety of audio signals according to the following parameters recommended in 26 : Proceedings of Meetings on Acoustics, Vol. 28, (2017) Page 6

8 - Transients (pre-echo sensitive, smearing of noise in temporal domain), - Tonal structure (noise sensitive, roughness), - Natural speech (distortion sensitive, smearing of attacks), - Complex sound (stresses the device under test), - High bandwidth (stresses the device under test, loss of high frequencies, high frequency noise). Therefore, the selected files in the EBU-SQAM CD for our tests were the following: - Castanets (file 27.wav), - Clarinet (file 16.wav), - Female speech (file 49.wav), - Soprano (file 44.wav), - Glockenspiel (file 35.wav). Each encoded audio file was evaluated in terms of the compression ratio (CR), which is the ratio between the input and the output file sizes. The perceived audio quality of the encoded data was measured via the objective difference grade (ODG), an objectively measured parameter according to the algorithm specified in ITUR BS The ODG ranges from 0 to -4, depending on the impairment produced by data compression (imperceptible for ODG = 0, perceptible, but not annoying for ODG = -1, slightly annoying for ODG = -2, annoying for ODG = -3, and very annoying for ODG = -4). The results were compared with those obtained using the following perceptual audio coding schemes: 1) OGG Vorbis 8 and 2) the audio coding standard ISO/IEC MPEG Layer 3 or MP RESULTS The numerical values for both the CR and the ODG associated with the selected audio files are shown in Table 1. For the EEMD algorithm, 100 realizations were performed, and different values for the WGN power were used. For better compression, the value of was determined according to the audio file under analysis (see Table 1). Table 1. Compression ratio (CR) and ODG for the audio files processed by different encoding algorithms. Input file Parameter EMD EEMD MP3 VBR OGG VBR MP3 64k Castanets (Note 1) Clarinet (Note 1) Female speech (Note 2) Soprano (Note 2) Glockenspiel (Note 1) Note 1: = 0.01 in EEMD. Note 2: = 0.05 in EEMD Proceedings of Meetings on Acoustics, Vol. 28, (2017) Page 7

9 For clarinet, soprano and glockenspiel audio files, the highest compression is achieved by the EMD based encoder, while for castanets and female speech audio files the highest compression is achieved by the MP3 64k encoder (see values in bold in Table 1). Besides, the EEMD based approach gives almost as much compression as the EMD algorithm, but with higher fidelity (the ODG is less negative). Such fidelity improvement illustrates the advantage of the EEMD method for providing IMFs with better spectral concentration, i.e., less mode mixing. Finally, it is observed that in most cases the fidelity (ODG) of the encoded audio files with the proposed method is not very different from the one corresponding to previously developed techniques. For instance, the difference between the ODG produced by the EMD based encoder and by the MP3 64k encoder is -0.1 for castanets, -0.9 for clarinet, for female speech, for soprano and for glockenspiel. This issue is currently under study for further improvements. 6. CONCLUDING REMARKS The EMD/EEMD audio encoding scheme was presented and tested with well-known audio files and compared with other existing encoding algorithms. This encoder is simple and provides higher compression for some cases. Further improvements regarding fidelity and speed are under development. Its advantages are its simplicity and flexibility, since the decoder is the same regardless of the algorithm (EMD or EEMD) used in the encoder. This statement is crucial for low-cost portable devices that perform audio decoding and playing. ACKNOWLEDGMENTS The authors are very grateful to Professor Federico Miyara for having inspired this work. REFERENCES 1 xiph.org Foundation, FLAC - Free lossless audio codec (2014). 2 M. T. Ashland, Monkey s Audio (2013). 3 M. Hans and R. W. Schafer, Lossless compression of digital audio, IEEE Signal Processing Magazine, Vol 18(4), pp (2001). 4 F. A. Marengo Rodriguez, E. A. Roveri, J. M. Rodríguez Guerrero and M. Treffiló, Análisis comparativo de codificadores de audio sin pérdidas y una herramienta gráfica para su selección y predicción de su desempeño, Mecánica Computacional, Vol 30 (41), Acoustics and Mechanical Vibrations (B), pp (2011). 5 O. Bonello, Tecnología de radiodifusión para la década del 90, Revista telegráfica electrónica, pp 293 (1990). 6 O. Bonello, AUDICOM - Un invento argentino, Coordenadas, Vol 85, pp 4-8 (2010). 7 ISO/IEC, ISO/IEC : Information technology Coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbit/s Part 3: Audio (1993). 8 xiph.org Foundation, Vorbis audio compression (2016). 9 European Broadcasting Union, Sound Quality Assessment Material, Recordings for subjective tests Users Handbook for the EBU-SQAM Compact Disc (2008). 10 N. E. Huang, Z. Shen, S. R. Long, M. C. Wu, H. H. Shih, Q. Zheng, N. Yen, C. C. Tung and H. H. Liu, The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis, Proc. of the Royal Soc. of London (A), Vol 454 (1971), pp (1998). 11 G. Rilling, P. Flandrin, and P. Gonçalves, On empirical mode decomposition and its algorithms, Proc. of IEEE- EURASIP Workshop on Nonlinear Signal and Image Processing NSIP-03, Grado, Italy (2003). 12 N. E. Huang and S. S. P. Shen, The Hilbert-Huang Transform and Its Applications (Interdisciplinary Mathematical Sciences), World Scientific Publishing Company (2005). Proceedings of Meetings on Acoustics, Vol. 28, (2017) Page 8

10 13 Z. Wu and N. E. Huang, Ensemble Empirical Mode Decomposition: a Noise-Assisted Data Analysis Method, Advances in Adaptive Data Analysis, Vol 1 (1), pp 1-41 (2009). 14 A.Linderhed, 2-D empirical mode decompositions - in the spirit of image compression, Proceeding of SPIE, Wavelet and Independent Component Analysis Applications IXI, Orlando, USA, Vol 4738, pp 1-8 (2002). 15 C. C. Ho, Empirical Mode Decomposition Based Novel Data Compression Algorithm for Wireless Data Transmission in Machine Health Monitoring, Master s Thesis, City University of Hong Kong (2009). 16 F. A. Marengo Rodriguez and F. Miyara, Representación de Señales de Audio con Descomposición Empírica de Modos y Submuestreo Adaptativo, Primeras Jornadas Regionales de Acústica, Rosario, Argentina, number A056R. In CD-ROM (2009). 17 K. Khaldi, A. O. Boudraa, M. Turki, I. Samaali and T. Chonavel, Audio encoding based on the empirical mode decomposition, EUSIPCO 09, Glasgow, United Kingdom (2009). 18 K. Khaldi, A. O. Boudraa, B. Torresani and T. Chonavel, HHT - based audio coding, Signal, image and video processing, Vol 7 (2), pp 1-9 (2013). 19 Z. Wu and N. E. Huang, A study of the characteristics of white noise using the empirical mode decomposition method, Proc. of the Royal Society of London(A), Vol 460 (2046), pp (2004). 20 P. Flandrin, G. Rilling and P. Gonçalves, Empirical mode decomposition as a filter bank, IEEE, Signal Processing Letters, Vol 11 (2), pp (2004). 21 M. E.Torres, M. A. Colominas, G. Schlotthauer and P. A. Flandrin, A complete ensemble empirical mode decomposition with adaptive noise, ICASSP, Prague, Czech Republic, pp (2011). 22 Z. K. Peng, P. W. Tse and F. L. Chu, A comparison study of improved Hilbert-Huang transform and wavelet transform: Application to fault diagnosis for rolling bearing, Mechanical Systems and Signal Processing, Vol 19 (5), pp (2005). 23 E. Zwicker and H. Fastl, Psychoacoustics: Facts and Models, Springer-Verlag, Berlin (Germany), 3rd edition (2007). 24 M. Bosi and R. Goldberg, Introduction to Digital Audio Coding and Standards, ser. Kluwer international series in engineering and computer science, Power electronics and power systems. Springer US (2003). 25 D.Salomon, Data Compression: The Complete Reference, Springer-Verlag, New York (USA), 3rd edition, (2004). 26 ITU, Method for objective measurements of perceived audio quality, Recommendation ITUR BS (2001). Proceedings of Meetings on Acoustics, Vol. 28, (2017) Page 9

Perceptual audio coding schemes based on adaptive signal processing tools

Perceptual audio coding schemes based on adaptive signal processing tools Biomedical Acoustics: Paper ICA2016-728 Perceptual audio coding schemes based on adaptive signal processing tools Fernando A. Marengo Rodriguez (a), Sergio A. Castells (b), Gonzalo D. Sad (c) (a) National

More information

Empirical Mode Decomposition Based Denoising by Customized Thresholding

Empirical Mode Decomposition Based Denoising by Customized Thresholding Vol:11, No:5, 17 Empirical Mode Decomposition Based Denoising by Customized Thresholding Wahiba Mohguen, Raïs El hadi Bekka International Science Index, Electronics and Communication Engineering Vol:11,

More information

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved.

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved. Compressed Audio Demystified Why Music Producers Need to Care About Compressed Audio Files Download Sales Up CD Sales Down High-Definition hasn t caught on yet Consumers don t seem to care about high fidelity

More information

USING LINEAR PREDICTION TO MITIGATE END EFFECTS IN EMPIRICAL MODE DECOMPOSITION. Steven Sandoval, Matthew Bredin, and Phillip L.

USING LINEAR PREDICTION TO MITIGATE END EFFECTS IN EMPIRICAL MODE DECOMPOSITION. Steven Sandoval, Matthew Bredin, and Phillip L. USING LINEAR PREDICTION TO MITIGATE END EFFECTS IN EMPIRICAL MODE DECOMPOSITION Steven Sandoval, Matthew Bredin, and Phillip L. De Leon New Mexico State University Klipsch School of Electrical and Computer

More information

Audio Watermarking using Colour Image Based on EMD and DCT

Audio Watermarking using Colour Image Based on EMD and DCT Audio Watermarking using Colour Image Based on EMD and Suhail Yoosuf 1, Ann Mary Alex 2 P. G. Scholar, Department of Electronics and Communication, Mar Baselios College of Engineering and Technology, Trivandrum,

More information

Application Note PEAQ Audio Objective Testing in ClearView

Application Note PEAQ Audio Objective Testing in ClearView 1566 La Pradera Dr Campbell, CA 95008 www.videoclarity.com 408-379-6952 Application Note PEAQ Audio Objective Testing in ClearView Video Clarity, Inc. Version 1.0 A Video Clarity Application Note page

More information

An Improved Images Watermarking Scheme Using FABEMD Decomposition and DCT

An Improved Images Watermarking Scheme Using FABEMD Decomposition and DCT An Improved Images Watermarking Scheme Using FABEMD Decomposition and DCT Noura Aherrahrou and Hamid Tairi University Sidi Mohamed Ben Abdellah, Faculty of Sciences, Dhar El mahraz, LIIAN, Department of

More information

Mpeg 1 layer 3 (mp3) general overview

Mpeg 1 layer 3 (mp3) general overview Mpeg 1 layer 3 (mp3) general overview 1 Digital Audio! CD Audio:! 16 bit encoding! 2 Channels (Stereo)! 44.1 khz sampling rate 2 * 44.1 khz * 16 bits = 1.41 Mb/s + Overhead (synchronization, error correction,

More information

Interpolation artifacts and bidimensional ensemble empirical mode decomposition

Interpolation artifacts and bidimensional ensemble empirical mode decomposition Interpolation artifacts and bidimensional ensemble empirical mode decomposition Jiajun Han* University of Alberta, Edmonton, Alberta, Canada, hjiajun@ualberta.ca Mirko van der Baan University of Alberta,

More information

Audio encoding based on the empirical mode decomposition

Audio encoding based on the empirical mode decomposition Audio encoding based on the empirical mode decomposition Kais Khaldi, Abdel-Ouahab Boudraa, Monia Turki, Thierry Chonavel, Imen Samaali To cite this version: Kais Khaldi, Abdel-Ouahab Boudraa, Monia Turki,

More information

Wavelet filter bank based wide-band audio coder

Wavelet filter bank based wide-band audio coder Wavelet filter bank based wide-band audio coder J. Nováček Czech Technical University, Faculty of Electrical Engineering, Technicka 2, 16627 Prague, Czech Republic novacj1@fel.cvut.cz 3317 New system for

More information

Scalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC

Scalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC Scalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC Ralf Geiger 1, Gerald Schuller 1, Jürgen Herre 2, Ralph Sperschneider 2, Thomas Sporer 1 1 Fraunhofer IIS AEMT, Ilmenau, Germany 2 Fraunhofer

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 213 http://acousticalsociety.org/ ICA 213 Montreal Montreal, Canada 2-7 June 213 Engineering Acoustics Session 2pEAb: Controlling Sound Quality 2pEAb1. Subjective

More information

Equivalent Effect Function and Fast Intrinsic Mode Decomposition

Equivalent Effect Function and Fast Intrinsic Mode Decomposition Equivalent Effect Function and Fast Intrinsic Mode Decomposition Louis Yu Lu E-mail: louisyulu@gmail.com Abstract: The Equivalent Effect Function (EEF) is defined as having the identical integral values

More information

Audio-coding standards

Audio-coding standards Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.

More information

Separation of Surface Roughness Profile from Raw Contour based on Empirical Mode Decomposition Shoubin LIU 1, a*, Hui ZHANG 2, b

Separation of Surface Roughness Profile from Raw Contour based on Empirical Mode Decomposition Shoubin LIU 1, a*, Hui ZHANG 2, b International Conference on Advances in Mechanical Engineering and Industrial Informatics (AMEII 2015) Separation of Surface Roughness Profile from Raw Contour based on Empirical Mode Decomposition Shoubin

More information

pyeemd Documentation Release Perttu Luukko

pyeemd Documentation Release Perttu Luukko pyeemd Documentation Release 1.3.1 Perttu Luukko August 10, 2016 Contents 1 Contents: 3 1.1 Installing pyeemd............................................ 3 1.2 Tutorial..................................................

More information

Adaptive Boundary Effect Processing For Empirical Mode Decomposition Using Template Matching

Adaptive Boundary Effect Processing For Empirical Mode Decomposition Using Template Matching Appl. Math. Inf. Sci. 7, No. 1L, 61-66 (2013) 61 Applied Mathematics & Information Sciences An International Journal Adaptive Boundary Effect Processing For Empirical Mode Decomposition Using Template

More information

Empirical Mode Decomposition: Improvement and Application

Empirical Mode Decomposition: Improvement and Application Empirical Mode Decomposition: Improvement and Application Peel, M.C. 1, G.G.S. Pegram 2 and T.A. McMahon 1 1 Department of Civil and Environmental Engineering, The University of Melbourne, Victoria 2 Civil

More information

Audio Watermarking using Empirical Mode Decomposition

Audio Watermarking using Empirical Mode Decomposition Audio Watermarking using Empirical Mode Decomposition Charulata P. Talele 1, Dr A. M. Patil 2 1ME Student, Electronics and Telecommunication Department J. T. Mahajan College of Engineering, Faizpur, Maharashtra,

More information

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 SUBJECTIVE AND OBJECTIVE QUALITY EVALUATION FOR AUDIO WATERMARKING BASED ON SINUSOIDAL AMPLITUDE MODULATION PACS: 43.10.Pr, 43.60.Ek

More information

Efficient Signal Adaptive Perceptual Audio Coding

Efficient Signal Adaptive Perceptual Audio Coding Efficient Signal Adaptive Perceptual Audio Coding MUHAMMAD TAYYAB ALI, MUHAMMAD SALEEM MIAN Department of Electrical Engineering, University of Engineering and Technology, G.T. Road Lahore, PAKISTAN. ]

More information

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general

More information

Lecture 16 Perceptual Audio Coding

Lecture 16 Perceptual Audio Coding EECS 225D Audio Signal Processing in Humans and Machines Lecture 16 Perceptual Audio Coding 2012-3-14 Professor Nelson Morgan today s lecture by John Lazzaro www.icsi.berkeley.edu/eecs225d/spr12/ Hero

More information

Audio-coding standards

Audio-coding standards Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.

More information

EE482: Digital Signal Processing Applications

EE482: Digital Signal Processing Applications Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 13 Audio Signal Processing 14/04/01 http://www.ee.unlv.edu/~b1morris/ee482/

More information

/ / _ / _ / _ / / / / /_/ _/_/ _/_/ _/_/ _\ / All-American-Advanced-Audio-Codec

/ / _ / _ / _ / / / / /_/ _/_/ _/_/ _/_/ _\ / All-American-Advanced-Audio-Codec / / _ / _ / _ / / / / /_/ _/_/ _/_/ _/_/ _\ / All-American-Advanced-Audio-Codec () **Z ** **=Z ** **= ==== == **= ==== \"\" === ==== \"\"\" ==== \"\"\"\" Tim O Brien Colin Sullivan Jennifer Hsu Mayank

More information

MPEG-1. Overview of MPEG-1 1 Standard. Introduction to perceptual and entropy codings

MPEG-1. Overview of MPEG-1 1 Standard. Introduction to perceptual and entropy codings MPEG-1 Overview of MPEG-1 1 Standard Introduction to perceptual and entropy codings Contents History Psychoacoustics and perceptual coding Entropy coding MPEG-1 Layer I/II Layer III (MP3) Comparison and

More information

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects. Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general

More information

2.4 Audio Compression

2.4 Audio Compression 2.4 Audio Compression 2.4.1 Pulse Code Modulation Audio signals are analog waves. The acoustic perception is determined by the frequency (pitch) and the amplitude (loudness). For storage, processing and

More information

Parametric Coding of High-Quality Audio

Parametric Coding of High-Quality Audio Parametric Coding of High-Quality Audio Prof. Dr. Gerald Schuller Fraunhofer IDMT & Ilmenau Technical University Ilmenau, Germany 1 Waveform vs Parametric Waveform Filter-bank approach Mainly exploits

More information

5: Music Compression. Music Coding. Mark Handley

5: Music Compression. Music Coding. Mark Handley 5: Music Compression Mark Handley Music Coding LPC-based codecs model the sound source to achieve good compression. Works well for voice. Terrible for music. What if you can t model the source? Model the

More information

HHT-based audio coding

HHT-based audio coding HHT-based audio coding Kais Khaldi, Abdel-Ouahab Boudraa, Bruno Torrésani, Thierry Chonavel To cite this version: Kais Khaldi, Abdel-Ouahab Boudraa, Bruno Torrésani, Thierry Chonavel. HHT-based audio coding.

More information

Multimedia Communications. Audio coding

Multimedia Communications. Audio coding Multimedia Communications Audio coding Introduction Lossy compression schemes can be based on source model (e.g., speech compression) or user model (audio coding) Unlike speech, audio signals can be generated

More information

Module 9 AUDIO CODING. Version 2 ECE IIT, Kharagpur

Module 9 AUDIO CODING. Version 2 ECE IIT, Kharagpur Module 9 AUDIO CODING Lesson 29 Transform and Filter banks Instructional Objectives At the end of this lesson, the students should be able to: 1. Define the three layers of MPEG-1 audio coding. 2. Define

More information

A Novel Audio Watermarking Algorithm Based On Reduced Singular Value Decomposition

A Novel Audio Watermarking Algorithm Based On Reduced Singular Value Decomposition A Novel Audio Watermarking Algorithm Based On Reduced Singular Value Decomposition Jian Wang 1, Ron Healy 2, Joe Timoney 3 Computer Science Department NUI Maynooth, Co. Kildare, Ireland jwang@cs.nuim.ie

More information

Fault Diagnosis of Wind Turbine Based on ELMD and FCM

Fault Diagnosis of Wind Turbine Based on ELMD and FCM Send Orders for Reprints to reprints@benthamscience.ae 76 The Open Mechanical Engineering Journal, 24, 8, 76-72 Fault Diagnosis of Wind Turbine Based on ELMD and FCM Open Access Xianjin Luo * and Xiumei

More information

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding Perceptual Coding Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding Part II wrap up 6.082 Fall 2006 Perceptual Coding, Slide 1 Lossless vs.

More information

DAB. Digital Audio Broadcasting

DAB. Digital Audio Broadcasting DAB Digital Audio Broadcasting DAB history DAB has been under development since 1981 at the Institut für Rundfunktechnik (IRT). In 1985 the first DAB demonstrations were held at the WARC-ORB in Geneva

More information

Squeeze Play: The State of Ady0 Cmprshn. Scott Selfon Senior Development Lead Xbox Advanced Technology Group Microsoft

Squeeze Play: The State of Ady0 Cmprshn. Scott Selfon Senior Development Lead Xbox Advanced Technology Group Microsoft Squeeze Play: The State of Ady0 Cmprshn Scott Selfon Senior Development Lead Xbox Advanced Technology Group Microsoft Agenda Why compress? The tools at present Measuring success A glimpse of the future

More information

Rolling element bearings fault diagnosis based on CEEMD and SVM

Rolling element bearings fault diagnosis based on CEEMD and SVM Rolling element bearings fault diagnosis based on CEEMD and SVM Tao-tao Zhou 1, Xian-ming Zhu 2, Yan Liu 3, Wei-cai Peng 4 National Key Laboratory on Ship Vibration and Noise, China Ship Development and

More information

Perceptually motivated Sub-band Decomposition for FDLP Audio Coding

Perceptually motivated Sub-band Decomposition for FDLP Audio Coding Perceptually motivated Sub-band Decomposition for FD Audio Coding Petr Motlicek 1, Sriram Ganapathy 13, Hynek Hermansky 13, Harinath Garudadri 4, and Marios Athineos 5 1 IDIAP Research Institute, Martigny,

More information

Appendix 4. Audio coding algorithms

Appendix 4. Audio coding algorithms Appendix 4. Audio coding algorithms 1 Introduction The main application of audio compression systems is to obtain compact digital representations of high-quality (CD-quality) wideband audio signals. Typically

More information

ELL 788 Computational Perception & Cognition July November 2015

ELL 788 Computational Perception & Cognition July November 2015 ELL 788 Computational Perception & Cognition July November 2015 Module 11 Audio Engineering: Perceptual coding Coding and decoding Signal (analog) Encoder Code (Digital) Code (Digital) Decoder Signal (analog)

More information

Audio Compression. Audio Compression. Absolute Threshold. CD quality audio:

Audio Compression. Audio Compression. Absolute Threshold. CD quality audio: Audio Compression Audio Compression CD quality audio: Sampling rate = 44 KHz, Quantization = 16 bits/sample Bit-rate = ~700 Kb/s (1.41 Mb/s if 2 channel stereo) Telephone-quality speech Sampling rate =

More information

CHAPTER 6 Audio compression in practice

CHAPTER 6 Audio compression in practice CHAPTER 6 Audio compression in practice In earlier chapters we have seen that digital sound is simply an array of numbers, where each number is a measure of the air pressure at a particular time. This

More information

Figure 1. Generic Encoder. Window. Spectral Analysis. Psychoacoustic Model. Quantize. Pack Data into Frames. Additional Coding.

Figure 1. Generic Encoder. Window. Spectral Analysis. Psychoacoustic Model. Quantize. Pack Data into Frames. Additional Coding. Introduction to Digital Audio Compression B. Cavagnolo and J. Bier Berkeley Design Technology, Inc. 2107 Dwight Way, Second Floor Berkeley, CA 94704 (510) 665-1600 info@bdti.com http://www.bdti.com INTRODUCTION

More information

Context based optimal shape coding

Context based optimal shape coding IEEE Signal Processing Society 1999 Workshop on Multimedia Signal Processing September 13-15, 1999, Copenhagen, Denmark Electronic Proceedings 1999 IEEE Context based optimal shape coding Gerry Melnikov,

More information

WAVELET USE FOR IMAGE RESTORATION

WAVELET USE FOR IMAGE RESTORATION WAVELET USE FOR IMAGE RESTORATION Jiří PTÁČEK and Aleš PROCHÁZKA 1 Institute of Chemical Technology, Prague Department of Computing and Control Engineering Technicka 5, 166 28 Prague 6, Czech Republic

More information

New Results in Low Bit Rate Speech Coding and Bandwidth Extension

New Results in Low Bit Rate Speech Coding and Bandwidth Extension Audio Engineering Society Convention Paper Presented at the 121st Convention 2006 October 5 8 San Francisco, CA, USA This convention paper has been reproduced from the author's advance manuscript, without

More information

A PSYCHOACOUSTIC MODEL WITH PARTIAL SPECTRAL FLATNESS MEASURE FOR TONALITY ESTIMATION

A PSYCHOACOUSTIC MODEL WITH PARTIAL SPECTRAL FLATNESS MEASURE FOR TONALITY ESTIMATION A PSYCHOACOUSTIC MODEL WITH PARTIAL SPECTRAL FLATNESS MEASURE FOR TONALITY ESTIMATION Armin Taghipour 1, Maneesh Chandra Jaikumar 2, and Bernd Edler 1 1 International Audio Laboratories Erlangen, Am Wolfsmantel

More information

Performance Degradation Assessment and Fault Diagnosis of Bearing Based on EMD and PCA-SOM

Performance Degradation Assessment and Fault Diagnosis of Bearing Based on EMD and PCA-SOM Performance Degradation Assessment and Fault Diagnosis of Bearing Based on EMD and PCA-SOM Lu Chen and Yuan Hang PERFORMANCE DEGRADATION ASSESSMENT AND FAULT DIAGNOSIS OF BEARING BASED ON EMD AND PCA-SOM.

More information

Image and Video Coding I: Fundamentals

Image and Video Coding I: Fundamentals Image and Video Coding I: Fundamentals Thomas Wiegand Technische Universität Berlin T. Wiegand (TU Berlin) Image and Video Coding Organization Vorlesung: Donnerstag 10:15-11:45 Raum EN-368 Material: http://www.ic.tu-berlin.de/menue/studium_und_lehre/

More information

Blind Measurement of Blocking Artifact in Images

Blind Measurement of Blocking Artifact in Images The University of Texas at Austin Department of Electrical and Computer Engineering EE 38K: Multidimensional Digital Signal Processing Course Project Final Report Blind Measurement of Blocking Artifact

More information

Digital Image Processing

Digital Image Processing Digital Image Processing Fundamentals of Image Compression DR TANIA STATHAKI READER (ASSOCIATE PROFFESOR) IN SIGNAL PROCESSING IMPERIAL COLLEGE LONDON Compression New techniques have led to the development

More information

MPEG-4 aacplus - Audio coding for today s digital media world

MPEG-4 aacplus - Audio coding for today s digital media world MPEG-4 aacplus - Audio coding for today s digital media world Whitepaper by: Gerald Moser, Coding Technologies November 2005-1 - 1. Introduction Delivering high quality digital broadcast content to consumers

More information

SPREAD SPECTRUM AUDIO WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL

SPREAD SPECTRUM AUDIO WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL SPREAD SPECTRUM WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL 1 Yüksel Tokur 2 Ergun Erçelebi e-mail: tokur@gantep.edu.tr e-mail: ercelebi@gantep.edu.tr 1 Gaziantep University, MYO, 27310, Gaziantep,

More information

AUDIOVISUAL COMMUNICATION

AUDIOVISUAL COMMUNICATION AUDIOVISUAL COMMUNICATION Laboratory Session: Audio Processing and Coding The objective of this lab session is to get the students familiar with audio processing and coding, notably psychoacoustic analysis

More information

An investigation of non-uniform bandwidths auditory filterbank in audio coding

An investigation of non-uniform bandwidths auditory filterbank in audio coding PAGE 360 An investigation of non-uniform bandwidths auditory filterbank in audio coding Andrew Lin, Stevan Berber, Waleed Abdulla Department of Electrical and Computer Engineering University of Auckland,

More information

Data Hiding in Video

Data Hiding in Video Data Hiding in Video J. J. Chae and B. S. Manjunath Department of Electrical and Computer Engineering University of California, Santa Barbara, CA 9316-956 Email: chaejj, manj@iplab.ece.ucsb.edu Abstract

More information

FPGA Implementation of HHT for Feature Extraction of Signals

FPGA Implementation of HHT for Feature Extraction of Signals IJSTE - International Journal of Science Technology & Engineering Volume 3 Issue 01 July 2016 ISSN (online): 2349-784X FPGA Implementation of HHT for Feature Extraction of Signals Mahesh Baban Shinde Manish

More information

Audio Compression Using Decibel chirp Wavelet in Psycho- Acoustic Model

Audio Compression Using Decibel chirp Wavelet in Psycho- Acoustic Model Audio Compression Using Decibel chirp Wavelet in Psycho- Acoustic Model 1 M. Chinna Rao M.Tech,(Ph.D) Research scholar, JNTUK,kakinada chinnarao.mortha@gmail.com 2 Dr. A.V.S.N. Murthy Professor of Mathematics,

More information

A Modified Spline Interpolation Method for Function Reconstruction from Its Zero-Crossings

A Modified Spline Interpolation Method for Function Reconstruction from Its Zero-Crossings Scientific Papers, University of Latvia, 2010. Vol. 756 Computer Science and Information Technologies 207 220 P. A Modified Spline Interpolation Method for Function Reconstruction from Its Zero-Crossings

More information

An Enhanced Video Stabilization Based On Emd Filtering And Spectral Analysis

An Enhanced Video Stabilization Based On Emd Filtering And Spectral Analysis IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 19, Issue 5, Ver. III (Sep.- Oct. 2017), PP 23-30 www.iosrjournals.org An Enhanced Video Stabilization Based

More information

A LOW-COMPLEXITY AND LOSSLESS REFERENCE FRAME ENCODER ALGORITHM FOR VIDEO CODING

A LOW-COMPLEXITY AND LOSSLESS REFERENCE FRAME ENCODER ALGORITHM FOR VIDEO CODING 2014 IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP) A LOW-COMPLEXITY AND LOSSLESS REFERENCE FRAME ENCODER ALGORITHM FOR VIDEO CODING Dieison Silveira, Guilherme Povala,

More information

Principles of Audio Coding

Principles of Audio Coding Principles of Audio Coding Topics today Introduction VOCODERS Psychoacoustics Equal-Loudness Curve Frequency Masking Temporal Masking (CSIT 410) 2 Introduction Speech compression algorithm focuses on exploiting

More information

SUMMARY INTRODUCTION METHOD. Review of VMD theory

SUMMARY INTRODUCTION METHOD. Review of VMD theory Bin Lyu*, The University of Olahoma; Fangyu Li, The University of Georgia; Jie Qi, Tao Zhao, and Kurt J. Marfurt, The University of Olahoma SUMMARY The coherence attribute is a powerful tool to delineate

More information

CISC 7610 Lecture 3 Multimedia data and data formats

CISC 7610 Lecture 3 Multimedia data and data formats CISC 7610 Lecture 3 Multimedia data and data formats Topics: Perceptual limits of multimedia data JPEG encoding of images MPEG encoding of audio MPEG and H.264 encoding of video Multimedia data: Perceptual

More information

TEMPORAL ENVELOPE CORRECTION FOR ATTACK RESTORATION IN LOW BIT-RATE AUDIO CODING

TEMPORAL ENVELOPE CORRECTION FOR ATTACK RESTORATION IN LOW BIT-RATE AUDIO CODING 7th European Signal Processing Conference (EUSIPCO 29) Glasgow, Scotland, August 24-28, 29 TEMPORAL ENVELOPE CORRECTION FOR ATTACK RESTORATION IN LOW BIT-RATE AUDIO CODING Imen Samaali 2, Monia Turki-Hadj

More information

DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS

DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS Television services in Europe currently broadcast video at a frame rate of 25 Hz. Each frame consists of two interlaced fields, giving a field rate of 50

More information

Optimized Progressive Coding of Stereo Images Using Discrete Wavelet Transform

Optimized Progressive Coding of Stereo Images Using Discrete Wavelet Transform Optimized Progressive Coding of Stereo Images Using Discrete Wavelet Transform Torsten Palfner, Alexander Mali and Erika Müller Institute of Telecommunications and Information Technology, University of

More information

I D I A P R E S E A R C H R E P O R T. October submitted for publication

I D I A P R E S E A R C H R E P O R T. October submitted for publication R E S E A R C H R E P O R T I D I A P Temporal Masking for Bit-rate Reduction in Audio Codec Based on Frequency Domain Linear Prediction Sriram Ganapathy a b Petr Motlicek a Hynek Hermansky a b Harinath

More information

Cluster EMD and its Statistical Application

Cluster EMD and its Statistical Application Cluster EMD and its Statistical Application Donghoh Kim and Heeseok Oh Sejong University and Seoul National University November 10, 2007 1/27 Contents 1. Multi-scale Concept 2. Decomposition 3. Cluster

More information

Fundamentals of Perceptual Audio Encoding. Craig Lewiston HST.723 Lab II 3/23/06

Fundamentals of Perceptual Audio Encoding. Craig Lewiston HST.723 Lab II 3/23/06 Fundamentals of Perceptual Audio Encoding Craig Lewiston HST.723 Lab II 3/23/06 Goals of Lab Introduction to fundamental principles of digital audio & perceptual audio encoding Learn the basics of psychoacoustic

More information

A NEW DCT-BASED WATERMARKING METHOD FOR COPYRIGHT PROTECTION OF DIGITAL AUDIO

A NEW DCT-BASED WATERMARKING METHOD FOR COPYRIGHT PROTECTION OF DIGITAL AUDIO International journal of computer science & information Technology (IJCSIT) Vol., No.5, October A NEW DCT-BASED WATERMARKING METHOD FOR COPYRIGHT PROTECTION OF DIGITAL AUDIO Pranab Kumar Dhar *, Mohammad

More information

Coding for the Network: Scalable and Multiple description coding Marco Cagnazzo

Coding for the Network: Scalable and Multiple description coding Marco Cagnazzo Coding for the Network: Scalable and Multiple description coding Marco Cagnazzo Overview Examples and motivations Scalable coding for network transmission Techniques for multiple description coding 2 27/05/2013

More information

An adaptive wavelet-based approach for perceptual low bit rate audio coding attending to entropy-type criteria

An adaptive wavelet-based approach for perceptual low bit rate audio coding attending to entropy-type criteria An adaptive wavelet-based approach for perceptual low bit rate audio coding attending to entropy-type criteria N. RUIZ REYES 1, M. ROSA ZURERA 2, F. LOPEZ FERRERAS 2, D. MARTINEZ MUÑOZ 1 1 Departamento

More information

1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends

1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends Contents List of Figures List of Tables Contributing Authors xiii xxi xxiii Introduction Karlheinz Brandenburg and Mark Kahrs xxix 1 Audio quality determination based on perceptual measurement techniques

More information

Networking Applications

Networking Applications Networking Dr. Ayman A. Abdel-Hamid College of Computing and Information Technology Arab Academy for Science & Technology and Maritime Transport Multimedia Multimedia 1 Outline Audio and Video Services

More information

Optical Storage Technology. MPEG Data Compression

Optical Storage Technology. MPEG Data Compression Optical Storage Technology MPEG Data Compression MPEG-1 1 Audio Standard Moving Pictures Expert Group (MPEG) was formed in 1988 to devise compression techniques for audio and video. It first devised the

More information

Key words: B- Spline filters, filter banks, sub band coding, Pre processing, Image Averaging IJSER

Key words: B- Spline filters, filter banks, sub band coding, Pre processing, Image Averaging IJSER International Journal of Scientific & Engineering Research, Volume 7, Issue 9, September-2016 470 Analyzing Low Bit Rate Image Compression Using Filters and Pre Filtering PNV ABHISHEK 1, U VINOD KUMAR

More information

Audio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011

Audio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011 Audio Fundamentals, Compression Techniques & Standards Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011 Outlines Audio Fundamentals Sampling, digitization, quantization μ-law

More information

DRA AUDIO CODING STANDARD

DRA AUDIO CODING STANDARD Applied Mechanics and Materials Online: 2013-06-27 ISSN: 1662-7482, Vol. 330, pp 981-984 doi:10.4028/www.scientific.net/amm.330.981 2013 Trans Tech Publications, Switzerland DRA AUDIO CODING STANDARD Wenhua

More information

University of Mustansiriyah, Baghdad, Iraq

University of Mustansiriyah, Baghdad, Iraq Volume 5, Issue 9, September 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Audio Compression

More information

Perspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony

Perspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony Perspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony Nobuhiko Kitawaki University of Tsukuba 1-1-1, Tennoudai, Tsukuba-shi, 305-8573 Japan. E-mail: kitawaki@cs.tsukuba.ac.jp

More information

Compression; Error detection & correction

Compression; Error detection & correction Compression; Error detection & correction compression: squeeze out redundancy to use less memory or use less network bandwidth encode the same information in fewer bits some bits carry no information some

More information

Performance analysis of AAC audio codec and comparison of Dirac Video Codec with AVS-china. Under guidance of Dr.K.R.Rao Submitted By, ASHWINI S URS

Performance analysis of AAC audio codec and comparison of Dirac Video Codec with AVS-china. Under guidance of Dr.K.R.Rao Submitted By, ASHWINI S URS Performance analysis of AAC audio codec and comparison of Dirac Video Codec with AVS-china Under guidance of Dr.K.R.Rao Submitted By, ASHWINI S URS Outline Overview of Dirac Overview of AVS-china Overview

More information

AN AUDIO WATERMARKING SCHEME ROBUST TO MPEG AUDIO COMPRESSION

AN AUDIO WATERMARKING SCHEME ROBUST TO MPEG AUDIO COMPRESSION AN AUDIO WATERMARKING SCHEME ROBUST TO MPEG AUDIO COMPRESSION Won-Gyum Kim, *Jong Chan Lee and Won Don Lee Dept. of Computer Science, ChungNam Nat l Univ., Daeduk Science Town, Taejon, Korea *Dept. of

More information

IJSRD - International Journal for Scientific Research & Development Vol. 5, Issue 05, 2017 ISSN (online):

IJSRD - International Journal for Scientific Research & Development Vol. 5, Issue 05, 2017 ISSN (online): IJSRD - International Journal for Scientific Research & Development Vol. 5, Issue 05, 2017 ISSN (online): 2321-0613 Audio Watermarking using Empirical Mode of Decomposition Mr. A. A. Dhanorkar 1 Dr. S.

More information

Chapter 14 MPEG Audio Compression

Chapter 14 MPEG Audio Compression Chapter 14 MPEG Audio Compression 14.1 Psychoacoustics 14.2 MPEG Audio 14.3 Other Commercial Audio Codecs 14.4 The Future: MPEG-7 and MPEG-21 14.5 Further Exploration 1 Li & Drew c Prentice Hall 2003 14.1

More information

Adaptive Quantization for Video Compression in Frequency Domain

Adaptive Quantization for Video Compression in Frequency Domain Adaptive Quantization for Video Compression in Frequency Domain *Aree A. Mohammed and **Alan A. Abdulla * Computer Science Department ** Mathematic Department University of Sulaimani P.O.Box: 334 Sulaimani

More information

Ensemble Empirical Mode Decomposition Parameters Optimization for Spectral Distance Measurement in Hyperspectral Remote Sensing Data

Ensemble Empirical Mode Decomposition Parameters Optimization for Spectral Distance Measurement in Hyperspectral Remote Sensing Data Remote Sens. 24, 6, 269-283; doi:.339/rs63269 Article OPEN ACCESS remote sensing ISSN 272-4292 www.mdpi.com/journal/remotesensing Ensemble Empirical Mode Decomposition Parameters Optimization for Spectral

More information

Audio and video compression

Audio and video compression Audio and video compression 4.1 introduction Unlike text and images, both audio and most video signals are continuously varying analog signals. Compression algorithms associated with digitized audio and

More information

The following bit rates are recommended for broadcast contribution employing the most commonly used audio coding schemes:

The following bit rates are recommended for broadcast contribution employing the most commonly used audio coding schemes: Page 1 of 8 1. SCOPE This Operational Practice sets out guidelines for minimising the various artefacts that may distort audio signals when low bit-rate coding schemes are employed to convey contribution

More information

Hydraulic pump fault diagnosis with compressed signals based on stagewise orthogonal matching pursuit

Hydraulic pump fault diagnosis with compressed signals based on stagewise orthogonal matching pursuit Hydraulic pump fault diagnosis with compressed signals based on stagewise orthogonal matching pursuit Zihan Chen 1, Chen Lu 2, Hang Yuan 3 School of Reliability and Systems Engineering, Beihang University,

More information

Lossless Predictive Compression of Medical Images*

Lossless Predictive Compression of Medical Images* SERBIAN JOURNAL OF ELECTRICAL ENGINEERING Vol. 8, No. 1, February 2011, 27-36 UDK: 004.92.032.2:616-7 Lossless Predictive Compression of Medical Images* Aleksej Avramović 1, Slavica Savić 1 Abstract: Among

More information

A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames

A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames Ki-Kit Lai, Yui-Lam Chan, and Wan-Chi Siu Centre for Signal Processing Department of Electronic and Information Engineering

More information

Audio Coding Standards

Audio Coding Standards Audio Standards Kari Pihkala 13.2.2002 Tik-111.590 Multimedia Outline Architectural Overview MPEG-1 MPEG-2 MPEG-4 Philips PASC (DCC cassette) Sony ATRAC (MiniDisc) Dolby AC-3 Conclusions 2 Architectural

More information

Parametric Coding of Spatial Audio

Parametric Coding of Spatial Audio Parametric Coding of Spatial Audio Ph.D. Thesis Christof Faller, September 24, 2004 Thesis advisor: Prof. Martin Vetterli Audiovisual Communications Laboratory, EPFL Lausanne Parametric Coding of Spatial

More information

A Novel Statistical Distortion Model Based on Mixed Laplacian and Uniform Distribution of Mpeg-4 FGS

A Novel Statistical Distortion Model Based on Mixed Laplacian and Uniform Distribution of Mpeg-4 FGS A Novel Statistical Distortion Model Based on Mixed Laplacian and Uniform Distribution of Mpeg-4 FGS Xie Li and Wenjun Zhang Institute of Image Communication and Information Processing, Shanghai Jiaotong

More information