Audio Compression Using Decibel chirp Wavelet in Psycho- Acoustic Model
|
|
- Olivia Shelton
- 5 years ago
- Views:
Transcription
1 Audio Compression Using Decibel chirp Wavelet in Psycho- Acoustic Model 1 M. Chinna Rao M.Tech,(Ph.D) Research scholar, JNTUK,kakinada chinnarao.mortha@gmail.com 2 Dr. A.V.S.N. Murthy Professor of Mathematics, Srinivasa Institute Of Engg & Tech avsnmurthy2005@gmail.com 3 Dr.Ch.Satyanarayana Assoc. Professor of CSE, JNTUK,kakinada Abstract: The advanced technologies, transfer of data over the internet and multimedia era have become one of the basic needs in communication world. So that the design of multimedia workstations along with high quality of audio transmission and storage is a big challenge. In this paper we present a new design of a psycho-acoustic model by using Decibelchirp for audio coding following the model used in the standard MPEG-1 audio layer 3. The goal is to achieve transparent coding of audio and speech signals at the lowest possible data rates. This architecture is based on appropriate wavelet packet decomposition instead of a short term Fourier transformation. It s is important characteristic is to propose an analysis of the frequency bands that come closer to the critical bands of the ear. This study shows the best performance of the existing. Keywords-Audio Compression,MPEG-1,Psycho-acoustic model,wavelet Packet Decomposition, FFT, Frequency bands, Critical bands. 1. INTRODUCTION Audio compression has become one of the basic technologies of the multimedia age. The change in the telecommunication infrastructure, in recent years, from circuit switched to packet switched systems has also reflected on the way that speech and audio signals are carried in present systems. In many applications, such as the design of multimedia workstations and high quality audio transmission and storage, the goal is to achieve transparent coding of audio and speech signals at the lowest possible data rates. In other words, bandwidth cost money, therefore, the transmission and storage of information becomes costly. The MPEG/Audio is a standard for both transmitting and recording compressed ratio. Compression refers to a variety of ways to reduce the size of a data file. Audio compression is form of data compression. To obtain this different methods for compression have been designed; from the simplest one that consists to make a banal under sampling, to the most advanced that takes account of the sensitivity of the human ear. The MPEG algorithm achieves compression by exploiting the perceptual limitation of the human ear. Audio compression algorithms are used to obtain compact digital representations of highfidelity audio signals for the purpose of IJCSIET-ISSUE2-VOLUME2-SERIES1 Page 1
2 efficient transmission. The main objective in audio coding is to represent the signal with a minimum number of bits while achieving transparent signal reproduction. The majority of MPEG1coders apply a psychoacoustic model for coding applications using filter bank to approximate the frequency selectivity of the human auditory system [1]. The ear is an organ of large sensibility, presenting a high resolution and a great dynamic of the signal [10]. Most MP3 coders use the Fourier Transform whose fundamental roles are limited only to the spectral properties of the signal. In this paper we present a new design for modelling auditory masking is based on wavelet packet decomposition. Wavelet [2] unlike FFT [2], whose basic functions are sinusoids, is based on small waves, called wavelets, of varying frequency and limited duration. Its most important characteristics are conceived to analyze temporal and spectral properties of non-stationary signals [4] such as audio. A new architecture of a psycho-acoustic model based on wavelets decomposition can be applied to the audio coders in sub-bands approximating the critical bands. Models of human auditory pathway are commonly used as a part of audio coding systems or algorithms for objective evaluation of sound quality. In order to properly simulate the nonlinear, signal-dependent behavior of the cochlea, physiologically accurate models were developed, such as the Munich model [10] which was concepted by Zwicker and as for the Cambridge [11] one by Moore. Our study is made up of fourth parts. The first part will focus on the proposed psychoacoustic model. The second part compares Munich and Cambridge with new one. The third part deals with a comparison between the compressions ratios of the proposed coders (FFT and wavelet). Finally, a sound quality evaluation will be carried out to conclude this work. II. RELATED WORK Many existing psycho-acoustic models [10][11], employ an FFT-based transform to derive a spectral decomposition of the audio signal into uniform sub-bands with equal bandwidths. Currently the filter banks use the frequency domain, which gives only localization in frequency. This can be detrimental to some instruments, which are very localized in time. Wavelets, however, give both localization in time and frequency. Therefore, using filter banks that utilize wavelets might allow for better audio compression. The cochlear filter bank is based on a novel structure that supports the time-frequency resolution necessary to simulate psychophysical data closely related to cochlear spectral decomposition properties. It will be shown that this filter bank is able to closely mimic the spectral and temporal properties of frequency decomposition of the human peripheral auditory system. III.DECIBELCHIRP WITH PSYCHO-ACOUSTIC MODEL Psycho-acoustic studies show that each frequency takes place in the cochlea, along the basilar membrane. Therefore, the cochlea can be viewed from a signalprocessing perspective as a bank of highly overlapping band-pass filters [9]. The efficient algorithm of the new design is based on Wavelet packet decomposition is applied on small frames. In order to compare our work to some proposed models using various subdivisions of filter bank (28, 29 sub-bands), an approximation of the critical bands is adopted using 26 sub bands as shown in Figure 2. We choose this subdivision because it represents a good approximation of the real critical band (25 sub-bands) [8]. Then, we determine tonal and non tonal components. This step begins with the determination of the local maxima, followed by extracting the tonal components IJCSIET-ISSUE2-VOLUME2-SERIES1 Page 2
3 (sinusoidal) and non tonal components (noise) in every bandwidth of critical bands. The selective suppression of tonal and non tonal components of masking is a procedure used to reduce the number of maskers taken into account for the calculation of the global masking threshold [1]. The remaining tonal and non tonal components are those which are above the hearing absolute threshold. Individual masking threshold takes into account the masking threshold for each remaining component. Lastly, global masking threshold is calculated by the whole of tonal and non tonal components which are deduced from the spectrum of the wavelet packet decomposition. In our psycho-acoustic model, wavelet packet decomposition is developed for khz sampling frequency. The new model analyzes the input signal. The main important step is that wavelet packet decomposition is designed to closely mimic the critical bands in a psycho-acoustic model. The wavelet filters are usually chosen with many taps and a narrow transition band. Hence, aliasing between bands is minimized. The calculation of the spectrum of the input signal was carried out using decomposition of wavelets packets whose connections are selected in such a way that sub-bands correspond to the best possible one to the critical bands. The wavelet packet decomposition induces an organization of information according to a frequential segmentation as shown in Figure 2. Note: Fig 1 and Fig 2 have been taken from fundamental base papers along with mathematical formulae. Fig 2. The proposed wavelet decompostion Fig 1. The psycho acoustic model using wavelet decompostion In fact, on each level of the tree, the information corresponding to the entire signal is divided into frequency bands. IV. EXISTING MORLET FILTER BANKS Morlet wavelet is the mother wavelet to decompose the audible spectrum for the IJCSIET-ISSUE2-VOLUME2-SERIES1 Page 3
4 cochlear filters bank. In the time domain, the mother wavelet is a high-frequency vibration whose amplitude decreases when the time varies from zero to infinity. The Morlet wavelet transform has proved to be beneficial for many types of wave signal processing and has shown a good performance in tasks like audio coding [2]. The Morlet wavelet shape is not simply analytical, but also very resolute in time and frequency, regular, locally periodic and with non compact support. It is formed by complex values of the shape of a complex sinus modulated by a Gaussian envelope [5]. It is characterized by narrow frequency response, which offers a higher spectral resolution. This wavelet is defined as: is the Gaussian envelope, fo is the center frequency of the mother wavelet and B is the equivalent rectangular bandwidth of the cochlea [6]. The peripheral auditory system is equivalent to a band pass filter bank [11] whose respective bands would have the critical bandwidth. To approximate these critical bands, we adopted in our work two models called Munich and Cambridge [8]. In fact, there is a similarity between the widths of bands of these models and those of the ear. Those models are defined by two functions [8]: Considering a constant surface of elementary paving in time frequency domain, we can prove that [15]: Which F is the central frequency of the band (in KHz). Consequently, the Morlet Munich and Cambridge wavelet are given by: A mother wavelet must have finite energy [14] : Our cochlea is characterized by a number of critical bands whose frequency distribution varies logarithmically. When comparing those filter banks, we remark that Munich and Cambridge filter bank is more adequate to the cochlear filter bank than the new model. In fact, its bandwidths grow faster than those of Cambridge. V. RATIO OF SOUND COMPRESSION EVALUATION WITH DECIBELCHIRP IJCSIET-ISSUE2-VOLUME2-SERIES1 Page 4
5 In order to evaluate Morlet Munich and Cambridge wavelet compression, we used for various bitrates between 64 kbits/s and 160 Kbits/s some types of sound such as Soul, Slow, Rock, Arabic music and voice. The evaluation is based on the compression ratio defined as the quotient between the size of the original file and MP3 file. Table1 contains the type of the sound files used for the test, the average of compression ratio (Comp), their capacity (Cp), their duration (t) and the compression ratio calculated for each bit rate. The weakest compression ratio is given for the flow of 160 Kbits/s and the highest is given for the flow of 64 Kbits/s. However, the weaker the bit rate, the higher the compression ratio is and the less intelligible its quality becomes. Based on the results obtained in table1, the Morlet Munich filters bank takes account of the masking phenomenon and takes account of the critical bands. The second part of our evaluation will focus on comparing the compression ratio obtained from the coder based on wavelet analysis and the coder based on an FFT analysis versus using a bit rate of 128 kbits/s which gives the best compromise between compression ratio and sound quality [10]. It reveals that sound compression using Morlet Munich wavelet is the best one. In fact, their average compression ratios present an improvement of 13.59% and 3.2% in comparison to the FFT and Decibelchirp coders. models. Those figures show that high frequencies are canceled for compression. We remark that the Munich model is quiet better than the Decibelchirp. In fact, for Munich model, high frequencies are canceled from Hz, as for the Decibelchirp they are canceled from 18,500 Hz. Fig. 4. Spectrum of original signal and compressed signal for Munich model Sound FET Cambridge Munich Decibelchirp Voice Soul Arabic Comp Rock Table 1.Comparison beteen FFT and Wavelet Compression This new psychoacoustic model has been implemented in reference coder based on the standard MPEG-1 audio layer 3 [1]. Figures 5 and 6 shows the spectrum of the original and compressed signal using the two Fig.5.Spectrum of original signal and compressed signal for Decibelchirp model IJCSIET-ISSUE2-VOLUME2-SERIES1 Page 5
6 VI. EVALUATION OF SOUND QUALITY Approval and intelligibility are the two most significant criteria to consider subjective quality. For this purpose, we invited several subjects to hear some mp3 files resulting from the coder based on FFT analysis, then on Munich and Decibelchirp Morlet wavelet transforms. The protocol of evaluation consists in listening to the compressed sound file. Then, the possibility is offered to the listeners to listen to it as long as they wish. The listeners are 24: 12 men and 12 women just heard on a five-point opinion scale, ranging from bad to excellent. The ratings are assigned integer scores ranging from 1 for bad to 5 for excellent. This comparison, based on results obtained from the statistic card, shows that Morlet Munich coder gives satisfactory results especially for Slow, Soul and Arabic sound. We noticed that for those sound files our coder received a significant number of notes ranging between 3 and 4. Consequently; it gives the best statistics for quality in comparison to the FFT and Cambridge coders. packet decomposition offers a good performance. In fact, it gives the best compression ratio and sound quality in comparison to the FFT and Cambridge coders. A high selectivity was noticed and can lead to some interesting perspectives on audio coding using Decibelchirp of psychoacoustic model. REFERENCES [1] ISO /IEC (F), Information technology - Coding of moving picture and associated audio for digital storage media at up to about 1.5Mbits/s Part3: Audio, [2] G. Hernandez, M. Mendoza, B. Reusch, and L. Salinas, Shiftability and filter bank design using Morlet wavelet, Computer Science Society, Nov. 2004, pp [3] D. Sinha and A. Tewfik, Low Bitrate Transparent Audio Compression using adapted wavelets, Proc. IEEE Trans.Sig, Dec. 1993, pp [4] B. Carnero and A. Drygajlo, Perceptual speech coding and enhancement using frame-synchronized fast wavelet packet transform algorithm, Proc. IEEE Transaction on speech and audio processing, vol 47, NO 6, June [5] I. Daubechies, Ten lectures on wavelet, SIAM, Philadelphia, PA, [6] J. O. SmithIII and J. S. Abel, Bark and ERB Bilinear transforms, Proc. IEEE Transactions on Speech and Audio Processing, [7] G. Davidson, L. Fielder, and M. Antill, High-quality audio transform coding at 128 kbits/s, Proc. IEEE ICASSP, vol.2, 1990, pp Figure. 6. Sound quality histogram VII.CONCLUSION This paper studies audio compression using new psychoacoustic model with Decibelchirp for wavelet transforms. Our goal is to identify the best model that can be applied to the audio coders in sub-bands approximating the critical bands and finally to improve the psycho-acoustic model. This study shows that the Munich Morlet wavelet [8] F. Baumgarte, A computationally efficient cochlear filter bank for perceptual audio coding storage, Proc. IEEE ICASSP, vol.5, 2001, pp [9] W. W. Chang and C. T. Wang, Auditory patterns, Proc. IEEE Transactions on speech and audio processing, vol. 4, NO 2, March [10] E. Zwicker and E. Terhardt, Analytical expressions for critical band rate and critical bandwidth as a function of frequency, J. Acoust. Soc. Am, vol. 68, NO 5, Nov [11] C. Wang and Y. C. Tong, An improved critical-band transform processor for speech applications, Circuits and Systems, May 2004, pp IJCSIET-ISSUE2-VOLUME2-SERIES1 Page 6
Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.
Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general
More informationAudio-coding standards
Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.
More informationBoth LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.
Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general
More informationAudio-coding standards
Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.
More informationMpeg 1 layer 3 (mp3) general overview
Mpeg 1 layer 3 (mp3) general overview 1 Digital Audio! CD Audio:! 16 bit encoding! 2 Channels (Stereo)! 44.1 khz sampling rate 2 * 44.1 khz * 16 bits = 1.41 Mb/s + Overhead (synchronization, error correction,
More informationPrinciples of Audio Coding
Principles of Audio Coding Topics today Introduction VOCODERS Psychoacoustics Equal-Loudness Curve Frequency Masking Temporal Masking (CSIT 410) 2 Introduction Speech compression algorithm focuses on exploiting
More informationFundamentals of Perceptual Audio Encoding. Craig Lewiston HST.723 Lab II 3/23/06
Fundamentals of Perceptual Audio Encoding Craig Lewiston HST.723 Lab II 3/23/06 Goals of Lab Introduction to fundamental principles of digital audio & perceptual audio encoding Learn the basics of psychoacoustic
More informationAUDIOVISUAL COMMUNICATION
AUDIOVISUAL COMMUNICATION Laboratory Session: Audio Processing and Coding The objective of this lab session is to get the students familiar with audio processing and coding, notably psychoacoustic analysis
More informationImproved Audio Coding Using a Psychoacoustic Model Based on a Cochlear Filter Bank
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 10, NO. 7, OCTOBER 2002 495 Improved Audio Coding Using a Psychoacoustic Model Based on a Cochlear Filter Bank Frank Baumgarte Abstract Perceptual
More informationChapter 14 MPEG Audio Compression
Chapter 14 MPEG Audio Compression 14.1 Psychoacoustics 14.2 MPEG Audio 14.3 Other Commercial Audio Codecs 14.4 The Future: MPEG-7 and MPEG-21 14.5 Further Exploration 1 Li & Drew c Prentice Hall 2003 14.1
More informationMultimedia Communications. Audio coding
Multimedia Communications Audio coding Introduction Lossy compression schemes can be based on source model (e.g., speech compression) or user model (audio coding) Unlike speech, audio signals can be generated
More informationAppendix 4. Audio coding algorithms
Appendix 4. Audio coding algorithms 1 Introduction The main application of audio compression systems is to obtain compact digital representations of high-quality (CD-quality) wideband audio signals. Typically
More informationSPREAD SPECTRUM AUDIO WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL
SPREAD SPECTRUM WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL 1 Yüksel Tokur 2 Ergun Erçelebi e-mail: tokur@gantep.edu.tr e-mail: ercelebi@gantep.edu.tr 1 Gaziantep University, MYO, 27310, Gaziantep,
More informationAUDIOVISUAL COMMUNICATION
AUDIOVISUAL COMMUNICATION Laboratory Session: Audio Processing and Coding The objective of this lab session is to get the students familiar with audio processing and coding, notably psychoacoustic analysis
More information2.4 Audio Compression
2.4 Audio Compression 2.4.1 Pulse Code Modulation Audio signals are analog waves. The acoustic perception is determined by the frequency (pitch) and the amplitude (loudness). For storage, processing and
More informationWavelet filter bank based wide-band audio coder
Wavelet filter bank based wide-band audio coder J. Nováček Czech Technical University, Faculty of Electrical Engineering, Technicka 2, 16627 Prague, Czech Republic novacj1@fel.cvut.cz 3317 New system for
More informationLecture 16 Perceptual Audio Coding
EECS 225D Audio Signal Processing in Humans and Machines Lecture 16 Perceptual Audio Coding 2012-3-14 Professor Nelson Morgan today s lecture by John Lazzaro www.icsi.berkeley.edu/eecs225d/spr12/ Hero
More informationEE482: Digital Signal Processing Applications
Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 13 Audio Signal Processing 14/04/01 http://www.ee.unlv.edu/~b1morris/ee482/
More informationCompression transparent low-level description of audio signals
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 25 Compression transparent low-level description of audio signals Jason
More informationAn investigation of non-uniform bandwidths auditory filterbank in audio coding
PAGE 360 An investigation of non-uniform bandwidths auditory filterbank in audio coding Andrew Lin, Stevan Berber, Waleed Abdulla Department of Electrical and Computer Engineering University of Auckland,
More information5: Music Compression. Music Coding. Mark Handley
5: Music Compression Mark Handley Music Coding LPC-based codecs model the sound source to achieve good compression. Works well for voice. Terrible for music. What if you can t model the source? Model the
More informationELL 788 Computational Perception & Cognition July November 2015
ELL 788 Computational Perception & Cognition July November 2015 Module 11 Audio Engineering: Perceptual coding Coding and decoding Signal (analog) Encoder Code (Digital) Code (Digital) Decoder Signal (analog)
More informationDAB. Digital Audio Broadcasting
DAB Digital Audio Broadcasting DAB history DAB has been under development since 1981 at the Institut für Rundfunktechnik (IRT). In 1985 the first DAB demonstrations were held at the WARC-ORB in Geneva
More informationPerceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding
Perceptual Coding Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding Part II wrap up 6.082 Fall 2006 Perceptual Coding, Slide 1 Lossless vs.
More informationParametric Coding of High-Quality Audio
Parametric Coding of High-Quality Audio Prof. Dr. Gerald Schuller Fraunhofer IDMT & Ilmenau Technical University Ilmenau, Germany 1 Waveform vs Parametric Waveform Filter-bank approach Mainly exploits
More informationProceedings of Meetings on Acoustics
Proceedings of Meetings on Acoustics Volume 19, 213 http://acousticalsociety.org/ ICA 213 Montreal Montreal, Canada 2-7 June 213 Engineering Acoustics Session 2pEAb: Controlling Sound Quality 2pEAb1. Subjective
More informationModeling of an MPEG Audio Layer-3 Encoder in Ptolemy
Modeling of an MPEG Audio Layer-3 Encoder in Ptolemy Patrick Brown EE382C Embedded Software Systems May 10, 2000 $EVWUDFW MPEG Audio Layer-3 is a standard for the compression of high-quality digital audio.
More informationOptical Storage Technology. MPEG Data Compression
Optical Storage Technology MPEG Data Compression MPEG-1 1 Audio Standard Moving Pictures Expert Group (MPEG) was formed in 1988 to devise compression techniques for audio and video. It first devised the
More informationEfficient Signal Adaptive Perceptual Audio Coding
Efficient Signal Adaptive Perceptual Audio Coding MUHAMMAD TAYYAB ALI, MUHAMMAD SALEEM MIAN Department of Electrical Engineering, University of Engineering and Technology, G.T. Road Lahore, PAKISTAN. ]
More informationNew Results in Low Bit Rate Speech Coding and Bandwidth Extension
Audio Engineering Society Convention Paper Presented at the 121st Convention 2006 October 5 8 San Francisco, CA, USA This convention paper has been reproduced from the author's advance manuscript, without
More informationBit or Noise Allocation
ISO 11172-3:1993 ANNEXES C & D 3-ANNEX C (informative) THE ENCODING PROCESS 3-C.1 Encoder 3-C.1.1 Overview For each of the Layers, an example of one suitable encoder with the corresponding flow-diagram
More informationImage Transformation Techniques Dr. Rajeev Srivastava Dept. of Computer Engineering, ITBHU, Varanasi
Image Transformation Techniques Dr. Rajeev Srivastava Dept. of Computer Engineering, ITBHU, Varanasi 1. Introduction The choice of a particular transform in a given application depends on the amount of
More informationAudio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011
Audio Fundamentals, Compression Techniques & Standards Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011 Outlines Audio Fundamentals Sampling, digitization, quantization μ-law
More informationAn adaptive wavelet-based approach for perceptual low bit rate audio coding attending to entropy-type criteria
An adaptive wavelet-based approach for perceptual low bit rate audio coding attending to entropy-type criteria N. RUIZ REYES 1, M. ROSA ZURERA 2, F. LOPEZ FERRERAS 2, D. MARTINEZ MUÑOZ 1 1 Departamento
More informationA NEW DCT-BASED WATERMARKING METHOD FOR COPYRIGHT PROTECTION OF DIGITAL AUDIO
International journal of computer science & information Technology (IJCSIT) Vol., No.5, October A NEW DCT-BASED WATERMARKING METHOD FOR COPYRIGHT PROTECTION OF DIGITAL AUDIO Pranab Kumar Dhar *, Mohammad
More informationWolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig
Multimedia Databases Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.de 6 Audio Retrieval 6 Audio Retrieval 6.1 Basics of
More information<< WILL FILL IN THESE SECTIONS THIS WEEK to provide sufficient background>>
THE GSS CODEC MUSIC 422 FINAL PROJECT Greg Sell, Song Hui Chon, Scott Cannon March 6, 2005 Audio files at: ccrma.stanford.edu/~gsell/422final/wavfiles.tar Code at: ccrma.stanford.edu/~gsell/422final/codefiles.tar
More informationModule 9 AUDIO CODING. Version 2 ECE IIT, Kharagpur
Module 9 AUDIO CODING Lesson 29 Transform and Filter banks Instructional Objectives At the end of this lesson, the students should be able to: 1. Define the three layers of MPEG-1 audio coding. 2. Define
More informationDSP. Presented to the IEEE Central Texas Consultants Network by Sergio Liberman
DSP The Technology Presented to the IEEE Central Texas Consultants Network by Sergio Liberman Abstract The multimedia products that we enjoy today share a common technology backbone: Digital Signal Processing
More informationijdsp Interactive Illustrations of Speech/Audio Processing Concepts
ijdsp Interactive Illustrations of Speech/Audio Processing Concepts NSF Phase 3 Workshop, UCy Presentation of an Independent Study By Girish Kalyanasundaram, MS by Thesis in EE Advisor: Dr. Andreas Spanias,
More informationFigure 1. Generic Encoder. Window. Spectral Analysis. Psychoacoustic Model. Quantize. Pack Data into Frames. Additional Coding.
Introduction to Digital Audio Compression B. Cavagnolo and J. Bier Berkeley Design Technology, Inc. 2107 Dwight Way, Second Floor Berkeley, CA 94704 (510) 665-1600 info@bdti.com http://www.bdti.com INTRODUCTION
More informationPerceptual Audio Coders What to listen for: Artifacts of Parametric Coding
Perceptual Audio Coders What to listen for: Artifacts of Parametric Coding Heiko Purnhagen, Bernd Edler University of AES 109th Convention, Los Angeles, September 22-25, 2000 1 Introduction: Parametric
More informationAudio Coding and MP3
Audio Coding and MP3 contributions by: Torbjørn Ekman What is Sound? Sound waves: 20Hz - 20kHz Speed: 331.3 m/s (air) Wavelength: 165 cm - 1.65 cm 1 Analogue audio frequencies: 20Hz - 20kHz mono: x(t)
More informationA PSYCHOACOUSTIC MODEL WITH PARTIAL SPECTRAL FLATNESS MEASURE FOR TONALITY ESTIMATION
A PSYCHOACOUSTIC MODEL WITH PARTIAL SPECTRAL FLATNESS MEASURE FOR TONALITY ESTIMATION Armin Taghipour 1, Maneesh Chandra Jaikumar 2, and Bernd Edler 1 1 International Audio Laboratories Erlangen, Am Wolfsmantel
More informationZhitao Lu and William A. Pearlman. Rensselaer Polytechnic Institute. Abstract
An Ecient, Low-Complexity Audio Coder Delivering Multiple Levels of Quality for Interactive Applications Zhitao Lu and William A. Pearlman Electrical,Computer and Systems Engineering Department Rensselaer
More informationModule 9 AUDIO CODING. Version 2 ECE IIT, Kharagpur
Module 9 AUDIO CODING Lesson 32 Psychoacoustic Models Instructional Objectives At the end of this lesson, the students should be able to 1. State the basic objectives of both the psychoacoustic models.
More information1 Introduction. 2 Speech Compression
Abstract In this paper, the effect of MPEG audio compression on HMM-based speech synthesis is studied. Speech signals are encoded with various compression rates and analyzed using the GlottHMM vocoder.
More informationIntroducing Audio Signal Processing & Audio Coding. Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd
Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories,
More information3 Sound / Audio. CS 5513 Multimedia Systems Spring 2009 LECTURE. Imran Ihsan Principal Design Consultant
LECTURE 3 Sound / Audio CS 5513 Multimedia Systems Spring 2009 Imran Ihsan Principal Design Consultant OPUSVII www.opuseven.com Faculty of Engineering & Applied Sciences 1. The Nature of Sound Sound is
More information19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007
19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 SUBJECTIVE AND OBJECTIVE QUALITY EVALUATION FOR AUDIO WATERMARKING BASED ON SINUSOIDAL AMPLITUDE MODULATION PACS: 43.10.Pr, 43.60.Ek
More informationAudio Compression. Audio Compression. Absolute Threshold. CD quality audio:
Audio Compression Audio Compression CD quality audio: Sampling rate = 44 KHz, Quantization = 16 bits/sample Bit-rate = ~700 Kb/s (1.41 Mb/s if 2 channel stereo) Telephone-quality speech Sampling rate =
More informationWavelet Transform (WT) & JPEG-2000
Chapter 8 Wavelet Transform (WT) & JPEG-2000 8.1 A Review of WT 8.1.1 Wave vs. Wavelet [castleman] 1 0-1 -2-3 -4-5 -6-7 -8 0 100 200 300 400 500 600 Figure 8.1 Sinusoidal waves (top two) and wavelets (bottom
More informationPrinciples of MPEG audio compression
Principles of MPEG audio compression Principy komprese hudebního signálu metodou MPEG Petr Kubíček Abstract The article describes briefly audio data compression. Focus of the article is a MPEG standard,
More informationS.K.R Engineering College, Chennai, India. 1 2
Implementation of AAC Encoder for Audio Broadcasting A.Parkavi 1, T.Kalpalatha Reddy 2. 1 PG Scholar, 2 Dean 1,2 Department of Electronics and Communication Engineering S.K.R Engineering College, Chennai,
More informationITNP80: Multimedia! Sound-II!
Sound compression (I) Compression of sound data requires different techniques from those for graphical data Requirements are less stringent than for video data rate for CD-quality audio is much less than
More informationIntroducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd
Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd Overview Audio Signal Processing Applications @ Dolby Audio Signal Processing Basics
More informationMPEG-4 aacplus - Audio coding for today s digital media world
MPEG-4 aacplus - Audio coding for today s digital media world Whitepaper by: Gerald Moser, Coding Technologies November 2005-1 - 1. Introduction Delivering high quality digital broadcast content to consumers
More informationEffect of MPEG Audio Compression on HMM-based Speech Synthesis
Effect of MPEG Audio Compression on HMM-based Speech Synthesis Bajibabu Bollepalli 1, Tuomo Raitio 2, Paavo Alku 2 1 Department of Speech, Music and Hearing, KTH, Stockholm, Sweden 2 Department of Signal
More informationTHE EFFECT OF EARLY REFLECTIONS ON PERCEIVED TIMBRE ANALYZED WITH AN AUDITORY MODEL. Tapio Lokki, Ville Pulkki, and Lauri Savioja
Proceedings of the 22 International Conference on Auditory Display, Kyoto, Japan, July 2-, 22 THE EFFECT OF EARLY REFLECTIONS ON PERCEIVED TIMBRE ANALYZED WITH AN AUDITORY MODEL Tapio Lokki, Ville Pulkki,
More information1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends
Contents List of Figures List of Tables Contributing Authors xiii xxi xxiii Introduction Karlheinz Brandenburg and Mark Kahrs xxix 1 Audio quality determination based on perceptual measurement techniques
More informationPerceptually motivated Sub-band Decomposition for FDLP Audio Coding
Perceptually motivated Sub-band Decomposition for FD Audio Coding Petr Motlicek 1, Sriram Ganapathy 13, Hynek Hermansky 13, Harinath Garudadri 4, and Marios Athineos 5 1 IDIAP Research Institute, Martigny,
More informationSubjective Audiovisual Quality in Mobile Environment
Vienna University of Technology Faculty of Electrical Engineering and Information Technology Institute of Communications and Radio-Frequency Engineering Master of Science Thesis Subjective Audiovisual
More informationOpen Systems Interconnection Model
OPEN SYSTEMS INTERCONNECTION AND TCP/IP PROTOCOL SUITE TCP/IP P Open Systems Interconnection Model An ISO standard that covers all aspects of network communications A layered framework consisting of seven
More informationMPEG-4 Version 2 Audio Workshop: HILN - Parametric Audio Coding
MPEG-4 Version 2 Audio Workshop: HILN - Parametric Audio Coding Heiko Purnhagen Laboratorium für Informationstechnologie University of Hannover, Germany Outline Introduction What is "Parametric Audio Coding"?
More informationA Detailed look of Audio Steganography Techniques using LSB and Genetic Algorithm Approach
www.ijcsi.org 402 A Detailed look of Audio Steganography Techniques using LSB and Genetic Algorithm Approach Gunjan Nehru 1, Puja Dhar 2 1 Department of Information Technology, IEC-Group of Institutions
More informationPort of a Fixed Point MPEG-2 AAC Encoder on a ARM Platform
Port of a Fixed Point MPEG-2 AAC Encoder on a ARM Platform by Romain Pagniez romain@felinewave.com A Dissertation submitted in partial fulfillment of the requirements for the Degree of Master of Science
More informationA Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm
International Journal of Engineering Research and General Science Volume 3, Issue 4, July-August, 15 ISSN 91-2730 A Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm
More informationLecture 5: Error Resilience & Scalability
Lecture 5: Error Resilience & Scalability Dr Reji Mathew A/Prof. Jian Zhang NICTA & CSE UNSW COMP9519 Multimedia Systems S 010 jzhang@cse.unsw.edu.au Outline Error Resilience Scalability Including slides
More informationPerceptual Coding of Digital Audio
Perceptual Coding of Digital Audio TED PAINTER, STUDENT MEMBER, IEEE AND ANDREAS SPANIAS, SENIOR MEMBER, IEEE During the last decade, CD-quality digital audio has essentially replaced analog audio. Emerging
More informationNetworking Applications
Networking Dr. Ayman A. Abdel-Hamid College of Computing and Information Technology Arab Academy for Science & Technology and Maritime Transport Multimedia Multimedia 1 Outline Audio and Video Services
More informationTransporting audio-video. over the Internet
Transporting audio-video over the Internet Key requirements Bit rate requirements Audio requirements Video requirements Delay requirements Jitter Inter-media synchronization On compression... TCP, UDP
More informationI D I A P R E S E A R C H R E P O R T. October submitted for publication
R E S E A R C H R E P O R T I D I A P Temporal Masking for Bit-rate Reduction in Audio Codec Based on Frequency Domain Linear Prediction Sriram Ganapathy a b Petr Motlicek a Hynek Hermansky a b Harinath
More informationData Compression. Audio compression
1 Data Compression Audio compression Outline Basics of Digital Audio 2 Introduction What is sound? Signal-to-Noise Ratio (SNR) Digitization Filtering Sampling and Nyquist Theorem Quantization Synthetic
More informationLOW COMPLEXITY SUBBAND ANALYSIS USING QUADRATURE MIRROR FILTERS
LOW COMPLEXITY SUBBAND ANALYSIS USING QUADRATURE MIRROR FILTERS Aditya Chopra, William Reid National Instruments Austin, TX, USA Brian L. Evans The University of Texas at Austin Electrical and Computer
More information08 Sound. Multimedia Systems. Nature of Sound, Store Audio, Sound Editing, MIDI
Multimedia Systems 08 Sound Nature of Sound, Store Audio, Sound Editing, MIDI Imran Ihsan Assistant Professor, Department of Computer Science Air University, Islamabad, Pakistan www.imranihsan.com Lectures
More informationROW.mp3. Colin Raffel, Jieun Oh, Isaac Wang Music 422 Final Project 3/12/2010
ROW.mp3 Colin Raffel, Jieun Oh, Isaac Wang Music 422 Final Project 3/12/2010 Motivation The realities of mp3 widespread use low quality vs. bit rate when compared to modern codecs Vision for row-mp3 backwards
More informationIntelligibility and Representation of Timbre in Primary Auditory Cortex
Intelligibility and Representation of Timbre in Primary Auditory Cortex Powen Ru Taishih Chi Didier Depireux David Klein Nina Kowalski & Acoustic Research Electrical and Computer Engineering, College Park
More informationCompressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved.
Compressed Audio Demystified Why Music Producers Need to Care About Compressed Audio Files Download Sales Up CD Sales Down High-Definition hasn t caught on yet Consumers don t seem to care about high fidelity
More informationEfficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio
Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio Dr. Jürgen Herre 11/07 Page 1 Jürgen Herre für (IIS) Erlangen, Germany Introduction: Sound Images? Humans
More informationAudio and video compression
Audio and video compression 4.1 introduction Unlike text and images, both audio and most video signals are continuously varying analog signals. Compression algorithms associated with digitized audio and
More informationPerspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony
Perspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony Nobuhiko Kitawaki University of Tsukuba 1-1-1, Tennoudai, Tsukuba-shi, 305-8573 Japan. E-mail: kitawaki@cs.tsukuba.ac.jp
More informationIntroduction to LAN/WAN. Application Layer 4
Introduction to LAN/WAN Application Layer 4 Multimedia Multimedia: Audio + video Human ear: 20Hz 20kHz, Dogs hear higher freqs DAC converts audio waves to digital E.g PCM uses 8-bit samples 8000 times
More informationMPEG-1. Overview of MPEG-1 1 Standard. Introduction to perceptual and entropy codings
MPEG-1 Overview of MPEG-1 1 Standard Introduction to perceptual and entropy codings Contents History Psychoacoustics and perceptual coding Entropy coding MPEG-1 Layer I/II Layer III (MP3) Comparison and
More information/ / _ / _ / _ / / / / /_/ _/_/ _/_/ _/_/ _\ / All-American-Advanced-Audio-Codec
/ / _ / _ / _ / / / / /_/ _/_/ _/_/ _/_/ _\ / All-American-Advanced-Audio-Codec () **Z ** **=Z ** **= ==== == **= ==== \"\" === ==== \"\"\" ==== \"\"\"\" Tim O Brien Colin Sullivan Jennifer Hsu Mayank
More informationMODIFIED IMDCT-DECODER BASED MP3 MULTICHANNEL AUDIO DECODING SYSTEM Shanmuga Raju.S 1, Karthik.R 2, Sai Pradeep.K.P 3, Varadharajan.
MODIFIED IMDCT-DECODER BASED MP3 MULTICHANNEL AUDIO DECODING SYSTEM Shanmuga Raju.S 1, Karthik.R 2, Sai Pradeep.K.P 3, Varadharajan.E 4 Assistant Professor, Dept. of ECE, Dr.NGP Institute of Technology,
More informationAUDIO COMPRESSION USING WAVELET TRANSFORM
AUDIO COMPRESSION USING WAVELET TRANSFORM Swapnil T. Dumbre Department of electronics, Amrutvahini College of Engineering,Sangamner,India Sheetal S. Gundal Department of electronics, Amrutvahini College
More informationThe MPEG-4 General Audio Coder
The MPEG-4 General Audio Coder Bernhard Grill Fraunhofer Institute for Integrated Circuits (IIS) grl 6/98 page 1 Outline MPEG-2 Advanced Audio Coding (AAC) MPEG-4 Extensions: Perceptual Noise Substitution
More informationAN AUDIO WATERMARKING SCHEME ROBUST TO MPEG AUDIO COMPRESSION
AN AUDIO WATERMARKING SCHEME ROBUST TO MPEG AUDIO COMPRESSION Won-Gyum Kim, *Jong Chan Lee and Won Don Lee Dept. of Computer Science, ChungNam Nat l Univ., Daeduk Science Town, Taejon, Korea *Dept. of
More informationBluray (
Bluray (http://www.blu-ray.com/faq) MPEG-2 - enhanced for HD, also used for playback of DVDs and HDTV recordings MPEG-4 AVC - part of the MPEG-4 standard also known as H.264 (High Profile and Main Profile)
More informationScalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC
Scalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC Ralf Geiger 1, Gerald Schuller 1, Jürgen Herre 2, Ralph Sperschneider 2, Thomas Sporer 1 1 Fraunhofer IIS AEMT, Ilmenau, Germany 2 Fraunhofer
More informationInternational Journal of Modern Trends in Engineering and Research e-issn No.: , Date: 2-4 July, 2015
International Journal of Modern Trends in Engineering and Research www.ijmter.com e-issn No.:2349-9745, Date: 2-4 July, 2015 Denoising of Speech using Wavelets Snehal S. Laghate 1, Prof. Sanjivani S. Bhabad
More informationThe following bit rates are recommended for broadcast contribution employing the most commonly used audio coding schemes:
Page 1 of 8 1. SCOPE This Operational Practice sets out guidelines for minimising the various artefacts that may distort audio signals when low bit-rate coding schemes are employed to convey contribution
More informationMPEG-1 Bitstreams Processing for Audio Content Analysis
ISSC, Cork. June 5- MPEG- Bitstreams Processing for Audio Content Analysis Roman Jarina, Orla Duffner, Seán Marlow, Noel O Connor, and Noel Murphy Visual Media Processing Group Dublin City University Glasnevin,
More informationMP3. Panayiotis Petropoulos
MP3 By Panayiotis Petropoulos Overview Definition History MPEG standards MPEG 1 / 2 Layer III Why audio compression through Mp3 is necessary? Overview MPEG Applications Mp3 Devices Mp3PRO Conclusion Definition
More informationFilterbanks and transforms
Filterbanks and transforms Sources: Zölzer, Digital audio signal processing, Wiley & Sons. Saramäki, Multirate signal processing, TUT course. Filterbanks! Introduction! Critical sampling, half-band filter!
More informationSpeech-Music Discrimination from MPEG-1 Bitstream
Speech-Music Discrimination from MPEG-1 Bitstream ROMAN JARINA, NOEL MURPHY, NOEL O CONNOR, SEÁN MARLOW Centre for Digital Video Processing / RINCE Dublin City University, Dublin 9 IRELAND jarinar@eeng.dcu.ie
More informationDigital Media. Daniel Fuller ITEC 2110
Digital Media Daniel Fuller ITEC 2110 Daily Question: Digital Audio What values contribute to the file size of a digital audio file? Email answer to DFullerDailyQuestion@gmail.com Subject Line: ITEC2110-09
More informationSpeech and audio coding
Institut Mines-Telecom Speech and audio coding Marco Cagnazzo, cagnazzo@telecom-paristech.fr MN910 Advanced compression Outline Introduction Introduction Speech signal Music signal Masking Codeurs simples
More informationUniversity of Pennsylvania Department of Electrical and Systems Engineering Digital Audio Basics
University of Pennsylvania Department of Electrical and Systems Engineering Digital Audio Basics ESE250 Spring 2013 Lab 7: Psychoacoustic Compression Friday, February 22, 2013 For Lab Session: Thursday,
More informationAudio Watermarking using Empirical Mode Decomposition
Audio Watermarking using Empirical Mode Decomposition Charulata P. Talele 1, Dr A. M. Patil 2 1ME Student, Electronics and Telecommunication Department J. T. Mahajan College of Engineering, Faizpur, Maharashtra,
More information