LOSSLESS AUDIO COMPRESSION USING INTEGER MODIFIED DISCRETE COSINE TRANSFORM. Yoshikazu Yokotani and Soontorn Oraintara
|
|
- Dayna Phelps
- 5 years ago
- Views:
Transcription
1 3 ISPACS Awaji Island 3 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS 3) Awaji Island, Japan, December 7, 3 C14 LSSLESS AUDI CMPRESSIN USING INTEGER MDIFIED DISCRETE CSINE TRANSFRM Yoshikazu Yokotani and Soontorn raintara Department of Electrical Engineering, University of Texas at Arlington, 416 Yates St., Arlington, TX, , USA Phone: , Fax: yoshi@msp.uta.edu, oraintar@uta.edu ASTRACT Recently, an MPEG2 AAC [1] based lossless audio codec with the Integer MDCT (IntMDCT) was proposed [2]. The IntMDCT was constructed by lifting scheme [3] to hold the perfect reconstruction(pr). In this paper, we will evaluate the IntMDCT implemented by fixedpoint arithmetic with quantized lifting coefficients in the MPEG2 AAC based lossless audio coding. The results indicate that there exists a tradeoff between computational complexity of the IntMDCT and coding efficiency when it is applied in the MPEG2 AAC based lossless audio coding scheme and one can reduce the computational complexity of the IntMDCT while a certain level of coding efficiency is maintained in the scheme. 1. INTRDUCTIN Lossless audio coding has received attentions from audio researchers and engineers in recent years. Following the demands of network operating companies and service providers, the MPEG committee started working to define lossless audio coding technology last year [4]. Recently, a lossless audio codec by using the integer modified discrete cosine Transform (IntMDCT), implemented by the lifting scheme, was proposed [5, 2]. The IntMDCT is an approximated version of the MDCT employed in the current audio coding standards, and is a reversible process. In [5, 5], the IntMDCT is employed in the MPEGAudio layeriii (MP3) coder and decoder by factorizing the 3 and 9point DCT s used in the 12 and 36 point MDCT s to a set of Given rotation angles and approximating them by lifting. In [2], similar idea but different in size is applied. The 48point MDCT is approximated. Since it is very difficult to factorize a large orthogonal matrix into Given rotation angles, the 512point discrete Fourier transform (DFT) is used in the fast structure. In this paper, the IntFFT proposed in [] is used in the implementation of our coder since it is adjustable for different levels of complexity which may be useful in different situations. Since it inherits the properties of the MDCT and is a reversible process, it is a natural choice for a lossless audio coder. The Int MDCT can be used to implement a lossless audio codec in two ways. First, it can be used to replace the conventional MDCT directly. Since it maps integers to integers with reversibility preserved, one can simply apply an entropy coder at the output of the transform. The resulting codec is called a nonscalable codec. Second, the conventional (lossy) audio codec using MDCT is taken to be the base layer. An enhancement layer is created by applying the IntMDCT to the audio signal. The difference between the outputs of the two versions of the transform serve as the refining detail which complements the base layer bit stream. Since the base layer is compatible with the MPEG AAC standard and the enhancement layer can be an addon component, it is call a scalable codec [6]. The key idea of approximating an orthogonal transform with irrational coefficients with reversibility preserved is to factorize the matrix into a number of Givens rotation angles, and then further factorize them into lower and upper triangular matrices. Each of these triangular matrices can be implemented using a two port network with only one branch called lifting whose corresponding coefficient can be truncated [3]. As long as the forward and inverse lifting coefficients are approximated properly and simultaneously, the input of the forward transform can be perfectly reconstructed (PR) at the output of its inverse. Such technique has been applied into many existing transforms including dyadic wavelet transform [7], the discrete cosine transform (DCT) [8, 9], the discrete Fourier transform [] and the MDCT [5]. In this paper, the IntMDCT is implemented by fixedpoint arithmetic with the quantized lifting coefficients and evaluated in lossless audio coding with experimental results when it is used in the nonscalable and the scalable codecs. This paper is organized as follows: a fast structure of the IntMDCT is described in section 2. Section 3 shows the computational complexity of the IntMDCT, and the structures of the nonscalable and scalable codecs are described in section 4 with the coding results. In section 5, the future work is described. Finally, the conclusions are stated. 2. A FAST AND MULTIPLIERLESS STRUCTURE F THE INTMDCT A fast structure of the IntMDCT can be constructed from a fast structure of the MDCT whose Givens rotation angles are factorized via the lifting scheme with a set of butterfly coefficient pairs and, which are cosine and sine of a rotational angle, respectively. This is formulated in (1). (1) where is a Givens rotation matrix. Now, we introduce two parameters, lifting coefficients and, and these coefficients are calculated from butterfly coefficients and by (2) (3)
2 ?? P V,,? Q * L Q x() indow operation x(n/21) sinθsw(n/21) x(n/2) cosθsw(n/21) x(n1) x(n) cosθsw() cosθsw() sinθsw() PreFFT twiddle operation cosθpr() sinθpr() sinθpr() j cosθpr() N/2pt FFT Re Im PostFFT twiddle operation sinθpo() sinθpo() cosθpo() cosθpo() 1 X() X(1) cosθsw(n/21) x(3n/21) x(3n/2) sinθsw(n/21) sinθpr(n/21) cosθpr(n/21) sinθpr(n/21) cosθpr(n/21) j θ fft1 θ fft3 cosθpo(n/21) Re sinθpo(n/21) sinθpo(n/21) 1 Im cosθpo(n/21) X(N2) X(N1) sinθsw() x(2n1) Figure 1: The fast MDCT Implementation via the FFT J J! #%$&' ()+*,./ $K6 &' ()+L./ $K6 &' ()+L./ 21354&7681:9;4<6>=?A@ ED C $:GFHFIF =? (3)+* and (4) 1354& M 1 9 4N >6 6 M 1 9 4>6<P ED (3)+Q (3)+L (5) 21354&76 M 1R9;4N6N= S@ 1:9;4>6<=?A@ :M 1R9;4>6 P T@ = ED UGV (6) (3)+Q (3)+L Factorization of the = point MDCT can be separately done in the following four blocks: The = point sine window operation, point FFT, and = the = point prefft twiddle operation, the ( point postfft twiddle operation. Figure 1 depicts Q a fast structure of the MDCT which is composed of the four blocks with rotational angles defined below: The = point sine window operation: YX [Z;&\^] = 9E= Z_@ (7) The = point prefft twiddle operation: I`Ha [Z;&b ] (8) = The =Ae point split radix FFT: %fgfih 5$&j ] $ =Ae (9) fgfih 5$&j Rk ] $ =Ae () The postfft twiddle operation: `Hl [Zc&\ Z ] = (11) ZK where FiFIF ( $: and FHFHF (. For the inverse MDCT, it can be simply implemented by applying the rotational angles with the opposite signs. Here, the equation (7) is obtained by the method reported in [2], and (8) and (11) are calculated from the fast DCTIV structure described in [11]. (9) and () are the twiddle factors of the =Ae point splitradix and (3)+Q (3)+Q V 13nm& mo FFT coefficients of an input signal where FHFHF ( defined by (5) and (6). Now, a fast IntMDCT can be constructed by calculating the lifting coefficients and for all the rotations as well as the following x matrix with a rotational angle p : Given a rotational angle, a pair of the lifting coefficients and is computed by the equations (2) and (3). Here, it should be noted that these equations have to be changed with a different range of rotational angles to keep the absolute values of the lifting coefficients and less than or equal to as reported in []. The lifting coefficients obtained from the factorization are generally irrational and must be approximated by finiteprecision numbers. Moreover, if the coefficients are represented in dyadic number, the IntMDCT can be implemented by additions and shifts so no multipliers are necessary. Now, let = be the number of bits to quantize these lifting coefficients. The value of = is directly related to the accuracy of the transform. Since this is a highly
3 ? complex structure, there is no simple mathematical expression that relates the accuracy of the transform to the order of quantization. Figure 2 shows the absolute values of MDCT spectrum and Int MDCT spectrum with = and gq when the 1kHz sine wave in the SQAM [12] is the input. The size of the transforms is. hen = and gq, the IntMDCT spectrum looks close to the MDCT spectrum compared to the case of =. 3. CMPUTATINAL CMPLEXITY F THE INTMDCT The parameter = relates to computational complexity as well as the accuracy of the IntMDCT since the lower the value of = is the less the number of additions and shifts are required. However, it is difficult to quantitatively evaluate how much the computational complexity can be saved by decreasing the value. Moreover, it cannot be used to see the advantage of using the lifting scheme rather than the butterfly structure in terms of computational complexity. Now, let us call the FxpMDCT as the case where the MDCT is constructed by fixedpoint arithmetic and butterfly structure is used to implement a Givens rotation as it is in the conventional MDCT. In this section, the computational complexity is discussed with the numbers of real multiplications and real additions needed to perform the FxpMDCT, and IntMDCT. Figure 3?shows the number of real additions and shifts to calculate the point FxpMDCT and IntMDCT. The same value of = is used to represent butterfly coefficients in the FxpMDCT and lifting coefficients in the Int MDCT. From the figure,it can be observed that the IntMDCT has percent less computational complexity than the Fxp at most P MDCT. Moreover, reducing the value of = from 15 to 12 save about 28 percent of the computational complexity. 4. IMPLEMENTATIN F THE INTMDCT IN LSSLESS AUDI CDING As proposed in [2, 6], two different types of lossless audio codecs can be constructed. ne is a nonscalable codec where the Int MDCT is directly followed by an entropy coder. Another is a scalable codec where the base layer stream is compatible with the MPEG AAC standard and the enhancement layer is the residual to complement a losslessly compressed audio signal [6] Nonscalable codec Figure 5 depicts a structure of the nonscalable codec implemented with the IntMDCT. In this codec, the psychoacoustic model defined in the MPEG2 AAC is used to switch a window shape for the IntMDCT. After the IntMDCT, the coefficients are compressed by a contextbased arithmetic coder. A context model is a probability model of the current coding symbol constructed from the neighbor symbols. The model can be used in an entropy coding scheme so that the compression ratio can be improved by the conditional entropy. For lossless audio coding application, this encoding scheme has been applied to a linear prediction error in time domain [13]. Table 1 shows comparison of the bit rates of the nonscalable codec with the value of = and iq with the bit rates of the lossless AAC [2] and a linear predictionbased codec, FLAC (Free Lossless Audio Coder) [14]. The test audio files are chosen from the SQAM CD [12]. As mentioned in [2], in each SQAM audio file, an audio signal is preceded and followed by zero frames. To make a fair comparison, these zero frames are omitted. It can be observed that the coding efficiencies when a value of = The number of shifts The number of additions 1.8 x IntMDCT FxpMDCT x IntMDCT FxpMDCT (a) (b) Figure 3: Comparison of computational complexities of the Fxp MDCT and the IntMDCT (a)the number of additions (b)the number of shifts and iq are almost the same. In the case of =, the coding efficiency is degraded by at most q bits/sample. This is because decreasing a value of = introduces a more approximation error observed in the Figure 2 into the IntMDCT spectrum and the randomness pushes up the bit rate even though it contributes to reduction of computational complexity of the IntMDCT. These results implies the following: The optimum value of = could be around to maintain the same coding efficiency compared to the case of = gq. Thus, for the nonscalable codec, a high value in = is desirable to keep a certain level of coding efficiency Scalable codec A lossless scalable codec can be realized by combining the MPEG2 AAC perceptual audio coder with a residual of the quantized Int MDCT coefficients. Figure 6 and 7 illustrate the MPEG2 AAC based lossless scalable codec [6]. Here, it can be seen that the base layer stream is compatible with the bit stream defined by the MPEG2 AAC standard. Figure 4 shows bit rates of both base layer
4 (a) (b) (c) Figure 2: Absolute values of (a)mdct spectrum, (b)intmdct spectrum with = (d)intmdct spectrum with = iq (d), (c)intmdct spectrum with =, and Table 1: Comparison of the bit rates (bits/sample) of the nonscalable codec Test the nonscalable codec Lossless FLAC audios A value of = AAC [2] [14] Piano Soprano rchestra Pop and enhancement layer of the scalable codec with the value of = and gq for the four test audios used in Table 1. Ideally, bit rates of the nonscalable codec and the scalable codec are the same since both tests use the same set of audio files. However, it is observed that bit rates of the scalable code are higher than those obtained of the nonscalable codec. Moreover, when the bit rate of the base layer is equal to r k kbps, the total bit rate increases at most q bits/sample. This may be explained as follow: Increasing a bit rate of the base layer cannot make the approximation error of the IntMDCT in the enhancement layer smaller and the bit rate of the enhancement layer doesn t decrease much. As a result, the total bit rate will increase. In addition, similar to the result in Table 1, when a value of =, the coding efficiency is similar to the one obtained with = gq in the scalable codec. 5. CNCLUSINS In this paper, we implemented the IntMDCT by fixedpoint arithmetic with quantized lifting coefficients and evaluated the computational complexity as well as coding efficiencies when the Int MDCT is applied in the MPEG2 AAC based lossless audio coding scheme. These results indicate that there exists a tradeoff between
5 the computational complexity and coding efficiency for lossless in this simulation. audio coding and it is balanced when = 6. REFERENCES [1] IS/IEC JTC1/SC29/G11 (MPEG). International standard IS/IEC Generic coding of moving pictures and associated audio : Advanced audio coding [2] R. Geiger, T. Sporer, J. Koller, and K. randenburg. Audio coding based on integer transform. 111st AES convention Preprint 5471, 1. [3] I. Daubechies and. Sweldens. Factoring wavelet transforms into lifting steps. Technical report, ell Laboratories, Lucent Technologies, [4] IS/IEC JTC1/SC29/G11 (MPEG). MPEG meeting dcoument N [5] T. Krishnan and S. raintara. A fast and lossless forward and inverse structure for the MDCT in MPEG audio coding. Proc. of the International Symposium on Circuits and Systems, May 2. [6] R. Geiger, J.Herre, J. Koller, and K. randenburg. IntMDCT A link between perceptual and lossless audio coding. Proc. IEEE International Conf. on Acoustics, Speech, and Signal Processing, 2: , 2. [7] A. R. Calderbank et al. Lossless image compression using integer to integer wavelet transforms. Proc. IEEE International Conf. on Image Processing, 1: , [8] T. D. Tran. The indct: Fast multiplierless approximation of the DCT. IEEE Singal Processing Letters, 7(6): , June 1. [9] S. C. Chan and P. M. Yiu. Multiplierless discrete sinusoidal and lapped transforms using sumofpowersoftwo (sopot) coefficients. IEEE International Symposium on Circuits and Systems, 2:13 16, May 1. [] S. raintara, Y. J. Chen, and T. Nguyen. Integer fast Fourier transform. IEEE Trans. on Signal Processing, 5:67 618, March 2. [11] H. S. Malvar. Signal Processing with Lapped Transforms. Archtech House, [12] SQAM(Sound Quality Assessment Material). CD European roadcasting Union, [13] T. Qiu. Lossless audio coding based on high order context modeling. IEEE Fourth orkshop on Multimedia Signal Processing, pages , 1. [14] Josh Coalson. FLAC : Free Lossless Audio Codec.
6 The total bit rate of the scalable codec(bits/sample) The total bit rate of the scalable codec(bits/sample) The total bit rate of the scalable codec(bits/sample) (a) The total bit rate of the scalable codec(bits/sample) (b) (c) (d) Figure 4: Comparison of the bit rates (bits/sample) of the scalable codec (a)sqam6(piano), (b)sqam61(soprano), (c)sqam65(rchestra), and (d)sqam7(pop) Legend Signal in float Signal in integer Control Psychoacoustic model PCM audio Int MDCT Contextbased Coding Contextbased Decoding Int IMDCT Encoder Decoder Figure 5: A Structure of the nonscalable Codec
7 Psychoacoustic Model block switching MPEG2 AAC Encoder 16bit PCM Audio MDCT Q Huffman Coding ase layer Q 1 Integer MDCT Contextbased Coding Enhancement layer Lossless Encoding Scheme Figure 6: A structure of the lossless scalable encoder MPEG2 AAC Decoder ase layer Huffman Decoding Q 1 IMDCT Reconstructed Audio with lossy compression process Enhancement layer Contextbased Decoding Lossless Decoding Scheme Integer IMDCT Reconstructed Audio with lossless compression process Figure 7: A structure of the lossless scalable decoder
Scalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC
Scalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC Ralf Geiger 1, Gerald Schuller 1, Jürgen Herre 2, Ralph Sperschneider 2, Thomas Sporer 1 1 Fraunhofer IIS AEMT, Ilmenau, Germany 2 Fraunhofer
More informationAudio-coding standards
Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.
More informationAudio-coding standards
Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.
More informationAn efficient multiplierless approximation of the fast Fourier transform using sum-of-powers-of-two (SOPOT) coefficients
Title An efficient multiplierless approximation of the fast Fourier transm using sum-of-powers-of-two (SOPOT) coefficients Author(s) Chan, SC; Yiu, PM Citation Ieee Signal Processing Letters, 2002, v.
More informationModule 9 AUDIO CODING. Version 2 ECE IIT, Kharagpur
Module 9 AUDIO CODING Lesson 29 Transform and Filter banks Instructional Objectives At the end of this lesson, the students should be able to: 1. Define the three layers of MPEG-1 audio coding. 2. Define
More informationCompressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved.
Compressed Audio Demystified Why Music Producers Need to Care About Compressed Audio Files Download Sales Up CD Sales Down High-Definition hasn t caught on yet Consumers don t seem to care about high fidelity
More informationPerceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding
Perceptual Coding Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding Part II wrap up 6.082 Fall 2006 Perceptual Coding, Slide 1 Lossless vs.
More informationOptical Storage Technology. MPEG Data Compression
Optical Storage Technology MPEG Data Compression MPEG-1 1 Audio Standard Moving Pictures Expert Group (MPEG) was formed in 1988 to devise compression techniques for audio and video. It first devised the
More informationModeling of an MPEG Audio Layer-3 Encoder in Ptolemy
Modeling of an MPEG Audio Layer-3 Encoder in Ptolemy Patrick Brown EE382C Embedded Software Systems May 10, 2000 $EVWUDFW MPEG Audio Layer-3 is a standard for the compression of high-quality digital audio.
More informationA Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm
International Journal of Engineering Research and General Science Volume 3, Issue 4, July-August, 15 ISSN 91-2730 A Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm
More informationEE482: Digital Signal Processing Applications
Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 13 Audio Signal Processing 14/04/01 http://www.ee.unlv.edu/~b1morris/ee482/
More informationImage Compression for Mobile Devices using Prediction and Direct Coding Approach
Image Compression for Mobile Devices using Prediction and Direct Coding Approach Joshua Rajah Devadason M.E. scholar, CIT Coimbatore, India Mr. T. Ramraj Assistant Professor, CIT Coimbatore, India Abstract
More informationCS 335 Graphics and Multimedia. Image Compression
CS 335 Graphics and Multimedia Image Compression CCITT Image Storage and Compression Group 3: Huffman-type encoding for binary (bilevel) data: FAX Group 4: Entropy encoding without error checks of group
More informationELL 788 Computational Perception & Cognition July November 2015
ELL 788 Computational Perception & Cognition July November 2015 Module 11 Audio Engineering: Perceptual coding Coding and decoding Signal (analog) Encoder Code (Digital) Code (Digital) Decoder Signal (analog)
More informationPERFORMANCE ANALYSIS OF INTEGER DCT OF DIFFERENT BLOCK SIZES USED IN H.264, AVS CHINA AND WMV9.
EE 5359: MULTIMEDIA PROCESSING PROJECT PERFORMANCE ANALYSIS OF INTEGER DCT OF DIFFERENT BLOCK SIZES USED IN H.264, AVS CHINA AND WMV9. Guided by Dr. K.R. Rao Presented by: Suvinda Mudigere Srikantaiah
More informationVideo Compression Method for On-Board Systems of Construction Robots
Video Compression Method for On-Board Systems of Construction Robots Andrei Petukhov, Michael Rachkov Moscow State Industrial University Department of Automatics, Informatics and Control Systems ul. Avtozavodskaya,
More informationA Parallel Reconfigurable Architecture for DCT of Lengths N=32/16/8
Page20 A Parallel Reconfigurable Architecture for DCT of Lengths N=32/16/8 ABSTRACT: Parthiban K G* & Sabin.A.B ** * Professor, M.P. Nachimuthu M. Jaganathan Engineering College, Erode, India ** PG Scholar,
More informationReversible Wavelets for Embedded Image Compression. Sri Rama Prasanna Pavani Electrical and Computer Engineering, CU Boulder
Reversible Wavelets for Embedded Image Compression Sri Rama Prasanna Pavani Electrical and Computer Engineering, CU Boulder pavani@colorado.edu APPM 7400 - Wavelets and Imaging Prof. Gregory Beylkin -
More informationPerformance analysis of Integer DCT of different block sizes.
Performance analysis of Integer DCT of different block sizes. Aim: To investigate performance analysis of integer DCT of different block sizes. Abstract: Discrete cosine transform (DCT) has been serving
More information5: Music Compression. Music Coding. Mark Handley
5: Music Compression Mark Handley Music Coding LPC-based codecs model the sound source to achieve good compression. Works well for voice. Terrible for music. What if you can t model the source? Model the
More informationCISC 7610 Lecture 3 Multimedia data and data formats
CISC 7610 Lecture 3 Multimedia data and data formats Topics: Perceptual limits of multimedia data JPEG encoding of images MPEG encoding of audio MPEG and H.264 encoding of video Multimedia data: Perceptual
More informationTHE TRANSFORM AND DATA COMPRESSION HANDBOOK
THE TRANSFORM AND DATA COMPRESSION HANDBOOK Edited by K.R. RAO University of Texas at Arlington AND RC. YIP McMaster University CRC Press Boca Raton London New York Washington, D.C. Contents 1 Karhunen-Loeve
More informationContents. 3 Vector Quantization The VQ Advantage Formulation Optimality Conditions... 48
Contents Part I Prelude 1 Introduction... 3 1.1 Audio Coding... 4 1.2 Basic Idea... 6 1.3 Perceptual Irrelevance... 8 1.4 Statistical Redundancy... 9 1.5 Data Modeling... 9 1.6 Resolution Challenge...
More informationNew Results in Low Bit Rate Speech Coding and Bandwidth Extension
Audio Engineering Society Convention Paper Presented at the 121st Convention 2006 October 5 8 San Francisco, CA, USA This convention paper has been reproduced from the author's advance manuscript, without
More informationPrinciples of Audio Coding
Principles of Audio Coding Topics today Introduction VOCODERS Psychoacoustics Equal-Loudness Curve Frequency Masking Temporal Masking (CSIT 410) 2 Introduction Speech compression algorithm focuses on exploiting
More informationIndex. 1. Motivation 2. Background 3. JPEG Compression The Discrete Cosine Transformation Quantization Coding 4. MPEG 5.
Index 1. Motivation 2. Background 3. JPEG Compression The Discrete Cosine Transformation Quantization Coding 4. MPEG 5. Literature Lossy Compression Motivation To meet a given target bit-rate for storage
More informationReview and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding.
Project Title: Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding. Midterm Report CS 584 Multimedia Communications Submitted by: Syed Jawwad Bukhari 2004-03-0028 About
More informationChapter 14 MPEG Audio Compression
Chapter 14 MPEG Audio Compression 14.1 Psychoacoustics 14.2 MPEG Audio 14.3 Other Commercial Audio Codecs 14.4 The Future: MPEG-7 and MPEG-21 14.5 Further Exploration 1 Li & Drew c Prentice Hall 2003 14.1
More informationJPEG: An Image Compression System. Nimrod Peleg update: Nov. 2003
JPEG: An Image Compression System Nimrod Peleg update: Nov. 2003 Basic Structure Source Image Data Reconstructed Image Data Encoder Compressed Data Decoder Encoder Structure Source Image Data Compressed
More informationLecture 16 Perceptual Audio Coding
EECS 225D Audio Signal Processing in Humans and Machines Lecture 16 Perceptual Audio Coding 2012-3-14 Professor Nelson Morgan today s lecture by John Lazzaro www.icsi.berkeley.edu/eecs225d/spr12/ Hero
More informationAppendix 4. Audio coding algorithms
Appendix 4. Audio coding algorithms 1 Introduction The main application of audio compression systems is to obtain compact digital representations of high-quality (CD-quality) wideband audio signals. Typically
More informationMultimedia Communications. Transform Coding
Multimedia Communications Transform Coding Transform coding Transform coding: source output is transformed into components that are coded according to their characteristics If a sequence of inputs is transformed
More informationInternational Journal of Wavelets, Multiresolution and Information Processing c World Scientific Publishing Company
International Journal of Wavelets, Multiresolution and Information Processing c World Scientific Publishing Company IMAGE MIRRORING AND ROTATION IN THE WAVELET DOMAIN THEJU JACOB Electrical Engineering
More informationAudio Compression. Audio Compression. Absolute Threshold. CD quality audio:
Audio Compression Audio Compression CD quality audio: Sampling rate = 44 KHz, Quantization = 16 bits/sample Bit-rate = ~700 Kb/s (1.41 Mb/s if 2 channel stereo) Telephone-quality speech Sampling rate =
More informationInteractive Progressive Encoding System For Transmission of Complex Images
Interactive Progressive Encoding System For Transmission of Complex Images Borko Furht 1, Yingli Wang 1, and Joe Celli 2 1 NSF Multimedia Laboratory Florida Atlantic University, Boca Raton, Florida 33431
More informationCODING METHOD FOR EMBEDDING AUDIO IN VIDEO STREAM. Harri Sorokin, Jari Koivusaari, Moncef Gabbouj, and Jarmo Takala
CODING METHOD FOR EMBEDDING AUDIO IN VIDEO STREAM Harri Sorokin, Jari Koivusaari, Moncef Gabbouj, and Jarmo Takala Tampere University of Technology Korkeakoulunkatu 1, 720 Tampere, Finland ABSTRACT In
More informationSPREAD SPECTRUM AUDIO WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL
SPREAD SPECTRUM WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL 1 Yüksel Tokur 2 Ergun Erçelebi e-mail: tokur@gantep.edu.tr e-mail: ercelebi@gantep.edu.tr 1 Gaziantep University, MYO, 27310, Gaziantep,
More informationsignal-to-noise ratio (PSNR), 2
u m " The Integration in Optics, Mechanics, and Electronics of Digital Versatile Disc Systems (1/3) ---(IV) Digital Video and Audio Signal Processing ƒf NSC87-2218-E-009-036 86 8 1 --- 87 7 31 p m o This
More informationFILE CONVERSION AFTERMATH: ANALYSIS OF AUDIO FILE STRUCTURE FORMAT
FILE CONVERSION AFTERMATH: ANALYSIS OF AUDIO FILE STRUCTURE FORMAT Abstract JENNIFER L. SANTOS 1 JASMIN D. NIGUIDULA Technological innovation has brought a massive leap in data processing. As information
More informationCompression of RADARSAT Data with Block Adaptive Wavelets Abstract: 1. Introduction
Compression of RADARSAT Data with Block Adaptive Wavelets Ian Cumming and Jing Wang Department of Electrical and Computer Engineering The University of British Columbia 2356 Main Mall, Vancouver, BC, Canada
More informationCSCD 443/533 Advanced Networks Fall 2017
CSCD 443/533 Advanced Networks Fall 2017 Lecture 18 Compression of Video and Audio 1 Topics Compression technology Motivation Human attributes make it possible Audio Compression Video Compression Performance
More informationFundamentals of Video Compression. Video Compression
Fundamentals of Video Compression Introduction to Digital Video Basic Compression Techniques Still Image Compression Techniques - JPEG Video Compression Introduction to Digital Video Video is a stream
More informationCompression Part 2 Lossy Image Compression (JPEG) Norm Zeck
Compression Part 2 Lossy Image Compression (JPEG) General Compression Design Elements 2 Application Application Model Encoder Model Decoder Compression Decompression Models observe that the sensors (image
More informationMPEG-4 ALS International Standard for Lossless Audio Coding
MPEG-4 ALS International Standard for Lossless Audio Coding Takehiro Moriya, Noboru Harada, Yutaka Kamamoto, and Hiroshi Sekigawa Abstract This article explains the technologies and applications of lossless
More informationCHAPTER 6 Audio compression in practice
CHAPTER 6 Audio compression in practice In earlier chapters we have seen that digital sound is simply an array of numbers, where each number is a measure of the air pressure at a particular time. This
More informationMpeg 1 layer 3 (mp3) general overview
Mpeg 1 layer 3 (mp3) general overview 1 Digital Audio! CD Audio:! 16 bit encoding! 2 Channels (Stereo)! 44.1 khz sampling rate 2 * 44.1 khz * 16 bits = 1.41 Mb/s + Overhead (synchronization, error correction,
More informationFigure 1. Generic Encoder. Window. Spectral Analysis. Psychoacoustic Model. Quantize. Pack Data into Frames. Additional Coding.
Introduction to Digital Audio Compression B. Cavagnolo and J. Bier Berkeley Design Technology, Inc. 2107 Dwight Way, Second Floor Berkeley, CA 94704 (510) 665-1600 info@bdti.com http://www.bdti.com INTRODUCTION
More informationIMAGE COMPRESSION. October 7, ICSY Lab, University of Kaiserslautern, Germany
Lossless Compression Multimedia File Formats Lossy Compression IMAGE COMPRESSION 69 Basic Encoding Steps 70 JPEG (Overview) Image preparation and coding (baseline system) 71 JPEG (Enoding) 1) select color
More informationPartial Video Encryption Using Random Permutation Based on Modification on Dct Based Transformation
International Refereed Journal of Engineering and Science (IRJES) ISSN (Online) 2319-183X, (Print) 2319-1821 Volume 2, Issue 6 (June 2013), PP. 54-58 Partial Video Encryption Using Random Permutation Based
More informationVideo Compression An Introduction
Video Compression An Introduction The increasing demand to incorporate video data into telecommunications services, the corporate environment, the entertainment industry, and even at home has made digital
More informationMPEG-4 Audio Lossless Coding
Audio Engineering Society Convention Paper Presented at the 116th Convention 2004 May 8 11 Berlin, Germany This convention paper has been reproduced from the author s advance manuscript, without editing,
More informationLecture 12 Video Coding Cascade Transforms H264, Wavelets
Lecture 12 Video Coding Cascade Transforms H264, Wavelets H.264 features different block sizes, including a so-called macro block, which can be seen in following picture: (Aus: Al Bovik, Ed., "The Essential
More informationDIGITAL IMAGE PROCESSING WRITTEN REPORT ADAPTIVE IMAGE COMPRESSION TECHNIQUES FOR WIRELESS MULTIMEDIA APPLICATIONS
DIGITAL IMAGE PROCESSING WRITTEN REPORT ADAPTIVE IMAGE COMPRESSION TECHNIQUES FOR WIRELESS MULTIMEDIA APPLICATIONS SUBMITTED BY: NAVEEN MATHEW FRANCIS #105249595 INTRODUCTION The advent of new technologies
More informationWireless Communication
Wireless Communication Systems @CS.NCTU Lecture 6: Image Instructor: Kate Ching-Ju Lin ( 林靖茹 ) Chap. 9 of Fundamentals of Multimedia Some reference from http://media.ee.ntu.edu.tw/courses/dvt/15f/ 1 Outline
More informationInternational Journal of Emerging Technology and Advanced Engineering Website: (ISSN , Volume 2, Issue 4, April 2012)
A Technical Analysis Towards Digital Video Compression Rutika Joshi 1, Rajesh Rai 2, Rajesh Nema 3 1 Student, Electronics and Communication Department, NIIST College, Bhopal, 2,3 Prof., Electronics and
More informationMPEG-1. Overview of MPEG-1 1 Standard. Introduction to perceptual and entropy codings
MPEG-1 Overview of MPEG-1 1 Standard Introduction to perceptual and entropy codings Contents History Psychoacoustics and perceptual coding Entropy coding MPEG-1 Layer I/II Layer III (MP3) Comparison and
More informationCHAPTER 4 REVERSIBLE IMAGE WATERMARKING USING BIT PLANE CODING AND LIFTING WAVELET TRANSFORM
74 CHAPTER 4 REVERSIBLE IMAGE WATERMARKING USING BIT PLANE CODING AND LIFTING WAVELET TRANSFORM Many data embedding methods use procedures that in which the original image is distorted by quite a small
More informationWhat is multimedia? Multimedia. Continuous media. Most common media types. Continuous media processing. Interactivity. What is multimedia?
Multimedia What is multimedia? Media types +Text + Graphics + Audio +Image +Video Interchange formats What is multimedia? Multimedia = many media User interaction = interactivity Script = time 1 2 Most
More informationA review of lossless audio compression standards and algorithms
A review of lossless audio compression standards and algorithms Fathiah Abdul Muin, Teddy Surya Gunawan, Mira Kartiwi, and Elsheikh M. A. Elsheikh Citation: AIP Conference Proceedings 1883, 020006 (2017);
More informationUsing Shift Number Coding with Wavelet Transform for Image Compression
ISSN 1746-7659, England, UK Journal of Information and Computing Science Vol. 4, No. 3, 2009, pp. 311-320 Using Shift Number Coding with Wavelet Transform for Image Compression Mohammed Mustafa Siddeq
More informationWavelet Based Image Compression Using ROI SPIHT Coding
International Journal of Information & Computation Technology. ISSN 0974-2255 Volume 1, Number 2 (2011), pp. 69-76 International Research Publications House http://www.irphouse.com Wavelet Based Image
More informationParallel-computing approach for FFT implementation on digital signal processor (DSP)
Parallel-computing approach for FFT implementation on digital signal processor (DSP) Yi-Pin Hsu and Shin-Yu Lin Abstract An efficient parallel form in digital signal processor can improve the algorithm
More information15 Data Compression 2014/9/21. Objectives After studying this chapter, the student should be able to: 15-1 LOSSLESS COMPRESSION
15 Data Compression Data compression implies sending or storing a smaller number of bits. Although many methods are used for this purpose, in general these methods can be divided into two broad categories:
More informationPacket Loss Concealment for Audio Streaming based on the GAPES and MAPES Algorithms
26 IEEE 24th Convention of Electrical and Electronics Engineers in Israel Packet Loss Concealment for Audio Streaming based on the GAPES and MAPES Algorithms Hadas Ofir and David Malah Department of Electrical
More informationIMAGE COMPRESSION. Image Compression. Why? Reducing transportation times Reducing file size. A two way event - compression and decompression
IMAGE COMPRESSION Image Compression Why? Reducing transportation times Reducing file size A two way event - compression and decompression 1 Compression categories Compression = Image coding Still-image
More informationAudio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011
Audio Fundamentals, Compression Techniques & Standards Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011 Outlines Audio Fundamentals Sampling, digitization, quantization μ-law
More informationDCT Coefficients Compression Using Embedded Zerotree Algorithm
DCT Coefficients Compression Using Embedded Zerotree Algorithm Dr. Tawfiq A. Abbas and Asa'ad. Hashim Abstract: The goal of compression algorithms is to gain best compression ratio with acceptable visual
More informationTopic 5 Image Compression
Topic 5 Image Compression Introduction Data Compression: The process of reducing the amount of data required to represent a given quantity of information. Purpose of Image Compression: the reduction of
More informationAudio Coding and MP3
Audio Coding and MP3 contributions by: Torbjørn Ekman What is Sound? Sound waves: 20Hz - 20kHz Speed: 331.3 m/s (air) Wavelength: 165 cm - 1.65 cm 1 Analogue audio frequencies: 20Hz - 20kHz mono: x(t)
More informationImage Compression Algorithm and JPEG Standard
International Journal of Scientific and Research Publications, Volume 7, Issue 12, December 2017 150 Image Compression Algorithm and JPEG Standard Suman Kunwar sumn2u@gmail.com Summary. The interest in
More informationEFFICIENT METHODS FOR ENCODING REGIONS OF INTEREST IN THE UPCOMING JPEG2000 STILL IMAGE CODING STANDARD
EFFICIENT METHODS FOR ENCODING REGIONS OF INTEREST IN THE UPCOMING JPEG2000 STILL IMAGE CODING STANDARD Charilaos Christopoulos, Joel Askelöf and Mathias Larsson Ericsson Research Corporate Unit Ericsson
More informationCHAPTER 5 AUDIO WATERMARKING SCHEME INHERENTLY ROBUST TO MP3 COMPRESSION
CHAPTER 5 AUDIO WATERMARKING SCHEME INHERENTLY ROBUST TO MP3 COMPRESSION In chapter 4, SVD based watermarking schemes are proposed which met the requirement of imperceptibility, having high payload and
More informationREAL-TIME DIGITAL SIGNAL PROCESSING
REAL-TIME DIGITAL SIGNAL PROCESSING FUNDAMENTALS, IMPLEMENTATIONS AND APPLICATIONS Third Edition Sen M. Kuo Northern Illinois University, USA Bob H. Lee Ittiam Systems, Inc., USA Wenshun Tian Sonus Networks,
More informationAN ANALYTICAL STUDY OF LOSSY COMPRESSION TECHINIQUES ON CONTINUOUS TONE GRAPHICAL IMAGES
AN ANALYTICAL STUDY OF LOSSY COMPRESSION TECHINIQUES ON CONTINUOUS TONE GRAPHICAL IMAGES Dr.S.Narayanan Computer Centre, Alagappa University, Karaikudi-South (India) ABSTRACT The programs using complex
More informationMultimedia. What is multimedia? Media types. Interchange formats. + Text +Graphics +Audio +Image +Video. Petri Vuorimaa 1
Multimedia What is multimedia? Media types + Text +Graphics +Audio +Image +Video Interchange formats Petri Vuorimaa 1 What is multimedia? Multimedia = many media User interaction = interactivity Script
More informationPerceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.
Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general
More informationIntroducing Audio Signal Processing & Audio Coding. Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd
Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories,
More informationOverview. Videos are everywhere. But can take up large amounts of resources. Exploit redundancy to reduce file size
Overview Videos are everywhere But can take up large amounts of resources Disk space Memory Network bandwidth Exploit redundancy to reduce file size Spatial Temporal General lossless compression Huffman
More informationImage Compression using Discrete Wavelet Transform Preston Dye ME 535 6/2/18
Image Compression using Discrete Wavelet Transform Preston Dye ME 535 6/2/18 Introduction Social media is an essential part of an American lifestyle. Latest polls show that roughly 80 percent of the US
More informationJPEG 2000 compression
14.9 JPEG and MPEG image compression 31 14.9.2 JPEG 2000 compression DCT compression basis for JPEG wavelet compression basis for JPEG 2000 JPEG 2000 new international standard for still image compression
More informationSIGNAL COMPRESSION. 9. Lossy image compression: SPIHT and S+P
SIGNAL COMPRESSION 9. Lossy image compression: SPIHT and S+P 9.1 SPIHT embedded coder 9.2 The reversible multiresolution transform S+P 9.3 Error resilience in embedded coding 178 9.1 Embedded Tree-Based
More informationIntroducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd
Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd Overview Audio Signal Processing Applications @ Dolby Audio Signal Processing Basics
More informationEmbedded lossless audio coding using linear prediction and cascade coding
University of Wollongong Research Online University of Wollongong Thesis Collection 1954-2016 University of Wollongong Thesis Collections 2005 Embedded lossless audio coding using linear prediction and
More informationf. ws V r.» ««w V... V, 'V. v...
M. SV V 'Vy' i*-- V.J ". -. '. j 1. vv f. ws. v wn V r.» ««w V... V, 'V. v... --
More informationBoth LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.
Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general
More informationFinal Review. Image Processing CSE 166 Lecture 18
Final Review Image Processing CSE 166 Lecture 18 Topics covered Basis vectors Matrix based transforms Wavelet transform Image compression Image watermarking Morphological image processing Segmentation
More informationHaar Wavelet Image Compression
Math 57 Haar Wavelet Image Compression. Preliminaries Haar wavelet compression is an efficient way to perform both lossless and lossy image compression. It relies on averaging and differencing the values
More information3. Lifting Scheme of Wavelet Transform
3. Lifting Scheme of Wavelet Transform 3. Introduction The Wim Sweldens 76 developed the lifting scheme for the construction of biorthogonal wavelets. The main feature of the lifting scheme is that all
More informationCh. 5: Audio Compression Multimedia Systems
Ch. 5: Audio Compression Multimedia Systems Prof. Ben Lee School of Electrical Engineering and Computer Science Oregon State University Chapter 5: Audio Compression 1 Introduction Need to code digital
More informationAn adaptive wavelet-based approach for perceptual low bit rate audio coding attending to entropy-type criteria
An adaptive wavelet-based approach for perceptual low bit rate audio coding attending to entropy-type criteria N. RUIZ REYES 1, M. ROSA ZURERA 2, F. LOPEZ FERRERAS 2, D. MARTINEZ MUÑOZ 1 1 Departamento
More informationImplementation of FPGA Based MP3 player using Invers Modified Discrete Cosine Transform
Implementation of FPGA Based MP3 player using Invers Modified Discrete Cosine Transform Mr. Sanket Shinde Universal college of engineering, Kaman Email-Id:sanketsanket01@gmail.com Mr. Vinay Vyas Universal
More informationLecture 10 Video Coding Cascade Transforms H264, Wavelets
Lecture 10 Video Coding Cascade Transforms H264, Wavelets H.264 features different block sizes, including a so-called macro block, which can be seen in following picture: (Aus: Al Bovik, Ed., "The Essential
More informationVHDL Implementation of Multiplierless, High Performance DWT Filter Bank
VHDL Implementation of Multiplierless, High Performance DWT Filter Bank Mr. M.M. Aswale 1, Prof. Ms. R.B Patil 2,Member ISTE Abstract The JPEG 2000 image coding standard employs the biorthogonal 9/7 wavelet
More informationJPEG: An Image Compression System
JPEG: An Image Compression System ISO/IEC DIS 10918-1 ITU-T Recommendation T.81 http://www.jpeg.org/ Nimrod Peleg update: April 2007 Basic Structure Source Image Data Reconstructed Image Data Encoder Compressed
More informationDRA AUDIO CODING STANDARD
Applied Mechanics and Materials Online: 2013-06-27 ISSN: 1662-7482, Vol. 330, pp 981-984 doi:10.4028/www.scientific.net/amm.330.981 2013 Trans Tech Publications, Switzerland DRA AUDIO CODING STANDARD Wenhua
More informationAbstract. Literature Survey. Introduction. A.Radix-2/8 FFT algorithm for length qx2 m DFTs
Implementation of Split Radix algorithm for length 6 m DFT using VLSI J.Nancy, PG Scholar,PSNA College of Engineering and Technology; S.Bharath,Assistant Professor,PSNA College of Engineering and Technology;J.Wilson,Assistant
More informationWavelet Transform (WT) & JPEG-2000
Chapter 8 Wavelet Transform (WT) & JPEG-2000 8.1 A Review of WT 8.1.1 Wave vs. Wavelet [castleman] 1 0-1 -2-3 -4-5 -6-7 -8 0 100 200 300 400 500 600 Figure 8.1 Sinusoidal waves (top two) and wavelets (bottom
More informationJPEG Joint Photographic Experts Group ISO/IEC JTC1/SC29/WG1 Still image compression standard Features
JPEG-2000 Joint Photographic Experts Group ISO/IEC JTC1/SC29/WG1 Still image compression standard Features Improved compression efficiency (vs. JPEG) Highly scalable embedded data streams Progressive lossy
More informationMRT based Fixed Block size Transform Coding
3 MRT based Fixed Block size Transform Coding Contents 3.1 Transform Coding..64 3.1.1 Transform Selection...65 3.1.2 Sub-image size selection... 66 3.1.3 Bit Allocation.....67 3.2 Transform coding using
More informationParametric Coding of High-Quality Audio
Parametric Coding of High-Quality Audio Prof. Dr. Gerald Schuller Fraunhofer IDMT & Ilmenau Technical University Ilmenau, Germany 1 Waveform vs Parametric Waveform Filter-bank approach Mainly exploits
More information