DRA AUDIO CODING STANDARD
|
|
- Ralph Bradford
- 5 years ago
- Views:
Transcription
1 Applied Mechanics and Materials Online: ISSN: , Vol. 330, pp doi: / Trans Tech Publications, Switzerland DRA AUDIO CODING STANDARD Wenhua Ma 1, a Yuanzhe Ma 2, b Yu-Li You 3, c 1 School of Informatics Guangdong University of Foreign Studies.2 Baiyundadao 2 Department of Biomedical, South China University of Technology.381.Wushanlu 3 Provincial Key Lab for Digital Audio Technology, Digital Rise Technology Co. Ltd. 6th Floor, Bldg. 2, Science and Tech Park South China University of Technology a myz122@yahoo.com.cn, b reika2009@yeah.net, c yuliyou@usa.com Keywords: Audio coding, standard, listening test, adaptive transform coder. Abstract. China s DRA audio coding standard is shown to be a barebone transform coder, utilizing transient-localized MDCT for improved pre-echo suppression and statistic allocation of codebooks for high entropy coding efficiency. A signal path of up to 24 bits is provided throughout the codec so that highest audio quality can be delivered if bit rate suffices. Results of five ITU-R BS.1116 compliant subjective listening tests are presented. Introduction There have been extensive standardization activities in audio coding in the past twenty years. MPEG-1 is the first international standard for perceptual high quality audio coding [1]. It is essentially a subband coder that deploys a 32-band QMF (quadrature mirror filter bank). Its Layer 3 adds switched MDCT (modified discrete cosine transform) to the subband signals output from the QMF for increased frequency resolution. MPEG-1 was extended by MPEG-2 BC (backward compatible) to provide for lower sample rates and multichannel surround sounds [1]. MPEG-2 AAC abandons backward compatibility with MPEG-1 in order to achieve significant improvement in coding efficiency [1]. AAC uses an MDCT that switches between 1024 and 128 spectral lines. AAC is carried over into MPEG-4 AAC with the addition of more coding tools, such as Perceptual Noise Shaping and Long Term Prediction, and coder configurations [1]. Dolby AC-3 is probably the most commercially successful audio coding standard [1]. It uses MDCT that switches between 128 and 256 spectral lines. WMA (Windows Media Audio), offered by Microsoft, uses MDCT that switches between 64, 128, 256, 512, 1024, and 2048 spectral lines [2]. Vorbis, an open source codec offered by the Xiph.Org Foundation, uses MDCT that switches between 256 and 1024 spectral lines [3]. Modern audio codecs have apparently converged to MDCT as the tool for time-frequency analysis. DRA algorithm [4], adopted as China s national standard for its electronics industry, uses transient-localized MDCT that provides improved pre-echo suppression with small bit and computation overheads. It uses statistic allocation of Huffman codebooks to enhance the coding efficiency of Huffman codes. Its quantization unit and Huffman codebooks are designed in such a way that signal path of up to 24 bits is provided throughout the codec so that highest audio quality can be delivered if bit rate allows. Although simple, DRA standard delivers state-of art coding efficiency as is shown by the five ITU-R BS.1116 compliant subjective listening tests. All rights reserved. No part of contents of this paper may be reproduced or transmitted in any form or by any means without the written permission of Trans Tech Publications, (ID: , Pennsylvania State University, University Park, USA-05/03/16,23:28:38)
2 982 Materials Engineering and Automatic Control II The Algorithm As shown in Figure 1, DRA audio encoder is a simple, essentially bare-bone, adaptive transform coder. Its major components are described as follows. Transient-Localized MDCT. An MDCT that switches between 128 and 1024 spectral lines is used to provide time-frequency analysis. In order to improve its capability of pre-echo suppression, a special brief window is introduced, whose effective size is reduced from 256 to 160 samples. In particular, the brief window is nonzero only within the central 160 samples, with zeros for the first 48 and the last 48 samples of the window [5]. In order to switch to/from this brief window from/to the long (2048 samples) and short (256 samples) windows, a few more transitional windows are introduced due to the perfect reconstruction conditions [1]. All the windows are based on the sine window [1] This brief window is applied only to the block of samples containing a transient, while the short and/or the appropriate transition windows are applied to the quasi-stationary samples in the remainder of the transient frame. Some example window sequences are shown in Figure 2. (a) is an example for the conventional approach. (b) shows that a transient occurs in the first block of the frame, so a brief window is deployed for this block. (c) shows that a transient occurs in the third block. (d) shows that two transients occur in the third and sixth blocks, so two brief windows are placed, respectively. (e) shows that a transient occurs in the last block. Since the effective size of this brief window is reduced from 256 to 160, better pre-echo suppression is achieved for two reasons. The first is that a finer temporal resolution is deployed to transient samples and high bit rates associated with transients are constrained to fewer samples. The second reason is that the spread of quantization noise is reduced. The spread of quantization noise for the short window is 256 samples, but this is reduced to only 160 samples for the brief window. For a typical sample rate of 48 khz, they amount to 5.33 and 3.3 ms, respectively. Given that significant premasking tends to last about 1-2 ms [1], the spread of quantization noise that may be audible is reduced from to ms, a significant reduction. Linear Scalar Quantization. Linear scalar quantization is used to quantize MDCT spectra lines. A group of spectral lines, referred to as a quantization unit, boxed in the frequency domain by the critical bands and in the time domain by the MDCT blocks that are statistically similar, share a quantization step size, which is logarithmically quantized with a step size of 0.2 db. When the quantization step size is one, the maximum allowed quantization index is ±(2 23 1) and the Huffman codebooks are designed to accommodate this. Consequently, a signal path of up to 24 bits is provided throughout the codec so that audio quality far exceeds the perceptual capability of the human ear can be delivered if bit rate suffices.
3 Applied Mechanics and Materials Vol Statistic Allocation of Codebooks. With the conventional approach to codebook allocation, all the spectral lines within a quantization unit share a Huffman codebook. The codebook assigned to such a unit is the smallest one that can accommodate the largest quantization index within the unit. Consequently, a fixed quantization step size means all the quantization indexes within the unit are fixed, and so is the Huffman codebook. There is no other option. Since the quantization indexes within a quantization unit do not necessarily share the same statistic properties, the traditional approach does not provide a good match, if any, between the statistic properties of the Huffman codebooks and those of the quantization indexes. This motivates a statistic adaptive approach to codebook assignment, whose steps are outlined as follows: 1. The quantization indexes are grouped into granules of four, the smallest codebook that can accommodate the largest quantization index in the granule is assigned to the granule. 2. Segment the indexes of these codebook into large segments based on their local statistic properties. 3. Select the largest codebook within each segment as the codebook for that segment. The advantage of this approach is illustrated in Figure3. Since the largest quantization index falls into quantization unit d, so a large codebook is assigned using previous methods, which is obviously not a good match because most of the indexes in the unit are much smaller. Using the DRA approach, however, the largest quantization index is segmented into segment C, so share a codebook with other large quantization indexes. Also, all quantization indexes in segment D are small, so a small codebook is selected. This obviously results in fewer bits for coding the quantization indexes. Other Components. At low bit rates, DRA algorithm may deploy joint channel coding. While the implementation for sum/difference coding is regular, the joint intensity coding is a little different. Instead of joining stereo pairs, it joins all channels into the left channel, thereby providing significant bit rate reduction when surround sounds are involved. While perceptual model and global bit allocation are necessary components of DRA encoder, they are not part of DRA decoder and there is little, if any, restriction on their implementation, so they are not stipulated in the DRA standard and are thereby not discussed here. Subjective Listening Tests During its standardization process, DRA went through five rounds of ITU-R BS.1116 [1] compliant subjective listening test. The results of them are shown in Table 1. Table 1Scores for ITU-R BS compliant subjective listening tests. Lab Date Stereo 128[kbps] [kbps] [kbps] NTICRT 08/ SLDST 10/ SLDST 01/ SLDST 07/ SLDST 08/
4 984 Materials Engineering and Automatic Control II All these tests specified the bit rate in such a way that it is the upper limit absolutely not to be exceeded in any frame. For example, if the sample rate is 48 khz, the bit rate of 128kbps translates into 2730 bits per frame because a DRA frame consists of 1024 samples. No frame can use more than 2730 bits and no bit reservoir is allowed. The first test was conducted by National Testing and Inspection Center for Radio and TV Products of China (NTICRT) in August Ten stereo sound tracks selected mostly from SQAM CD [6] and five 5.1 surround sound tracks were used in the test. The test subjects were all expert listeners consisting of conductors, musicians, recording engineers, and audio engineers. The other four tests were all performed by the State Lab for DTV System Testing (SLDST) under the State Administration for Radio, Film, and TV of China. Other than a few Chinese sound tracks, most of the test materials were selected from the SQAM CD [6] and a pool of surround sound tracks used by EBU and MPEG, including Pitch pipe, Harpsichord, and Elloit1. The last test, though still conducted by SLDST, was actually ordered and supervised by China Central TV (CCTV) as part of its DTV technology evaluation program. CCTV was only interested in surround sounds, so DRA was tested at 384kbps and 320 kbps. This test was conducted in comparison with two major international codecs, DRA came out as the clear winner. Conclusion DRA audio coding standard was shown to be essentially a bare-bone transform coder that uses transient-localized MDCT for improved pre-echo suppression and statistic allocation of codebooks for better entropy coding efficiency. Its quantizer and Huffman codebooks are designed in such a way that a signal path of up to 24 bits is provided throughout the codec so that highest audio quality can be delivered if bit rate suffices. Its coding efficiency has been evaluated by five ITU-R BS compliant subjective listening tests. Acknowledgements Supported by 2012 Guangdong Science and Technology Plan for Commercialization of Advanced and New Technologies under contract 2012B References [1]T. Painter and A. Spanias, Perceptual coding of digital audio, Proceedings of the IEEE, vol. 88, no. 4, pp , April [2] Wikipedia, Windows Media Audio, org/wiki/windows Media Audio, October [3] Vorbis I specification, Xiph.org Foundation, [4] Yu-Li You, Weixiong Zhang, Mao Xu, and Subin Zhang, Electronics Industry Standard: Multichannel Digital Audio Coding Technology, SJ/T , Ministry of Information Industry, People s Republic of China, [5] Yu-Li You and Wenhua Ma, Temporal transient localization for enhanced pre-echo suppression, submitted to IEEE International Conference on Acoustics, Speech, and Signal Processing, [6] EBU, Sound Quality Assessment Material Recordings for Subjective Tests, Tech. 3253, April 1988.
5 Materials Engineering and Automatic Control II / DRA Audio Coding Standard /
5: Music Compression. Music Coding. Mark Handley
5: Music Compression Mark Handley Music Coding LPC-based codecs model the sound source to achieve good compression. Works well for voice. Terrible for music. What if you can t model the source? Model the
More informationAudio Compression. Audio Compression. Absolute Threshold. CD quality audio:
Audio Compression Audio Compression CD quality audio: Sampling rate = 44 KHz, Quantization = 16 bits/sample Bit-rate = ~700 Kb/s (1.41 Mb/s if 2 channel stereo) Telephone-quality speech Sampling rate =
More informationAudio Coding Standards
Audio Standards Kari Pihkala 13.2.2002 Tik-111.590 Multimedia Outline Architectural Overview MPEG-1 MPEG-2 MPEG-4 Philips PASC (DCC cassette) Sony ATRAC (MiniDisc) Dolby AC-3 Conclusions 2 Architectural
More informationOptical Storage Technology. MPEG Data Compression
Optical Storage Technology MPEG Data Compression MPEG-1 1 Audio Standard Moving Pictures Expert Group (MPEG) was formed in 1988 to devise compression techniques for audio and video. It first devised the
More informationMultimedia Communications. Audio coding
Multimedia Communications Audio coding Introduction Lossy compression schemes can be based on source model (e.g., speech compression) or user model (audio coding) Unlike speech, audio signals can be generated
More informationMpeg 1 layer 3 (mp3) general overview
Mpeg 1 layer 3 (mp3) general overview 1 Digital Audio! CD Audio:! 16 bit encoding! 2 Channels (Stereo)! 44.1 khz sampling rate 2 * 44.1 khz * 16 bits = 1.41 Mb/s + Overhead (synchronization, error correction,
More informationContents. 3 Vector Quantization The VQ Advantage Formulation Optimality Conditions... 48
Contents Part I Prelude 1 Introduction... 3 1.1 Audio Coding... 4 1.2 Basic Idea... 6 1.3 Perceptual Irrelevance... 8 1.4 Statistical Redundancy... 9 1.5 Data Modeling... 9 1.6 Resolution Challenge...
More informationMPEG-1. Overview of MPEG-1 1 Standard. Introduction to perceptual and entropy codings
MPEG-1 Overview of MPEG-1 1 Standard Introduction to perceptual and entropy codings Contents History Psychoacoustics and perceptual coding Entropy coding MPEG-1 Layer I/II Layer III (MP3) Comparison and
More informationAudio-coding standards
Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.
More informationLecture 16 Perceptual Audio Coding
EECS 225D Audio Signal Processing in Humans and Machines Lecture 16 Perceptual Audio Coding 2012-3-14 Professor Nelson Morgan today s lecture by John Lazzaro www.icsi.berkeley.edu/eecs225d/spr12/ Hero
More informationAudio-coding standards
Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.
More informationMPEG-4 General Audio Coding
MPEG-4 General Audio Coding Jürgen Herre Fraunhofer Institute for Integrated Circuits (IIS) Dr. Jürgen Herre, hrr@iis.fhg.de 1 General Audio Coding Solid state players, Internet audio, terrestrial and
More informationELL 788 Computational Perception & Cognition July November 2015
ELL 788 Computational Perception & Cognition July November 2015 Module 11 Audio Engineering: Perceptual coding Coding and decoding Signal (analog) Encoder Code (Digital) Code (Digital) Decoder Signal (analog)
More informationAudio Coding and MP3
Audio Coding and MP3 contributions by: Torbjørn Ekman What is Sound? Sound waves: 20Hz - 20kHz Speed: 331.3 m/s (air) Wavelength: 165 cm - 1.65 cm 1 Analogue audio frequencies: 20Hz - 20kHz mono: x(t)
More informationFigure 1. Generic Encoder. Window. Spectral Analysis. Psychoacoustic Model. Quantize. Pack Data into Frames. Additional Coding.
Introduction to Digital Audio Compression B. Cavagnolo and J. Bier Berkeley Design Technology, Inc. 2107 Dwight Way, Second Floor Berkeley, CA 94704 (510) 665-1600 info@bdti.com http://www.bdti.com INTRODUCTION
More informationPerceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding
Perceptual Coding Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding Part II wrap up 6.082 Fall 2006 Perceptual Coding, Slide 1 Lossless vs.
More informationThe MPEG-4 General Audio Coder
The MPEG-4 General Audio Coder Bernhard Grill Fraunhofer Institute for Integrated Circuits (IIS) grl 6/98 page 1 Outline MPEG-2 Advanced Audio Coding (AAC) MPEG-4 Extensions: Perceptual Noise Substitution
More informationAppendix 4. Audio coding algorithms
Appendix 4. Audio coding algorithms 1 Introduction The main application of audio compression systems is to obtain compact digital representations of high-quality (CD-quality) wideband audio signals. Typically
More information<< WILL FILL IN THESE SECTIONS THIS WEEK to provide sufficient background>>
THE GSS CODEC MUSIC 422 FINAL PROJECT Greg Sell, Song Hui Chon, Scott Cannon March 6, 2005 Audio files at: ccrma.stanford.edu/~gsell/422final/wavfiles.tar Code at: ccrma.stanford.edu/~gsell/422final/codefiles.tar
More informationPrinciples of Audio Coding
Principles of Audio Coding Topics today Introduction VOCODERS Psychoacoustics Equal-Loudness Curve Frequency Masking Temporal Masking (CSIT 410) 2 Introduction Speech compression algorithm focuses on exploiting
More informationPerceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.
Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general
More informationFundamentals of Perceptual Audio Encoding. Craig Lewiston HST.723 Lab II 3/23/06
Fundamentals of Perceptual Audio Encoding Craig Lewiston HST.723 Lab II 3/23/06 Goals of Lab Introduction to fundamental principles of digital audio & perceptual audio encoding Learn the basics of psychoacoustic
More informationChapter 4: Audio Coding
Chapter 4: Audio Coding Lossy and lossless audio compression Traditional lossless data compression methods usually don't work well on audio signals if applied directly. Many audio coders are lossy coders,
More informationScalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC
Scalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC Ralf Geiger 1, Gerald Schuller 1, Jürgen Herre 2, Ralph Sperschneider 2, Thomas Sporer 1 1 Fraunhofer IIS AEMT, Ilmenau, Germany 2 Fraunhofer
More informationBoth LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.
Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general
More informationWavelet filter bank based wide-band audio coder
Wavelet filter bank based wide-band audio coder J. Nováček Czech Technical University, Faculty of Electrical Engineering, Technicka 2, 16627 Prague, Czech Republic novacj1@fel.cvut.cz 3317 New system for
More informationMPEG-4 aacplus - Audio coding for today s digital media world
MPEG-4 aacplus - Audio coding for today s digital media world Whitepaper by: Gerald Moser, Coding Technologies November 2005-1 - 1. Introduction Delivering high quality digital broadcast content to consumers
More informationEE482: Digital Signal Processing Applications
Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 13 Audio Signal Processing 14/04/01 http://www.ee.unlv.edu/~b1morris/ee482/
More informationModule 9 AUDIO CODING. Version 2 ECE IIT, Kharagpur
Module 9 AUDIO CODING Lesson 29 Transform and Filter banks Instructional Objectives At the end of this lesson, the students should be able to: 1. Define the three layers of MPEG-1 audio coding. 2. Define
More informationChapter 14 MPEG Audio Compression
Chapter 14 MPEG Audio Compression 14.1 Psychoacoustics 14.2 MPEG Audio 14.3 Other Commercial Audio Codecs 14.4 The Future: MPEG-7 and MPEG-21 14.5 Further Exploration 1 Li & Drew c Prentice Hall 2003 14.1
More information/ / _ / _ / _ / / / / /_/ _/_/ _/_/ _/_/ _\ / All-American-Advanced-Audio-Codec
/ / _ / _ / _ / / / / /_/ _/_/ _/_/ _/_/ _\ / All-American-Advanced-Audio-Codec () **Z ** **=Z ** **= ==== == **= ==== \"\" === ==== \"\"\" ==== \"\"\"\" Tim O Brien Colin Sullivan Jennifer Hsu Mayank
More informationCompressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved.
Compressed Audio Demystified Why Music Producers Need to Care About Compressed Audio Files Download Sales Up CD Sales Down High-Definition hasn t caught on yet Consumers don t seem to care about high fidelity
More informationApplication Note PEAQ Audio Objective Testing in ClearView
1566 La Pradera Dr Campbell, CA 95008 www.videoclarity.com 408-379-6952 Application Note PEAQ Audio Objective Testing in ClearView Video Clarity, Inc. Version 1.0 A Video Clarity Application Note page
More informationS.K.R Engineering College, Chennai, India. 1 2
Implementation of AAC Encoder for Audio Broadcasting A.Parkavi 1, T.Kalpalatha Reddy 2. 1 PG Scholar, 2 Dean 1,2 Department of Electronics and Communication Engineering S.K.R Engineering College, Chennai,
More information2.4 Audio Compression
2.4 Audio Compression 2.4.1 Pulse Code Modulation Audio signals are analog waves. The acoustic perception is determined by the frequency (pitch) and the amplitude (loudness). For storage, processing and
More informationThe RTP Encapsulation based on Frame Type Method for AVS Video
Applied Mechanics and Materials Online: 2012-12-27 ISSN: 1662-7482, Vols. 263-266, pp 1803-1808 doi:10.4028/www.scientific.net/amm.263-266.1803 2013 Trans Tech Publications, Switzerland The RTP Encapsulation
More informationNew Results in Low Bit Rate Speech Coding and Bandwidth Extension
Audio Engineering Society Convention Paper Presented at the 121st Convention 2006 October 5 8 San Francisco, CA, USA This convention paper has been reproduced from the author's advance manuscript, without
More informationINTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO
INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO ISO/IEC JTC1/SC29 WG11 N15073 February 2015, Geneva,
More informationEfficient Signal Adaptive Perceptual Audio Coding
Efficient Signal Adaptive Perceptual Audio Coding MUHAMMAD TAYYAB ALI, MUHAMMAD SALEEM MIAN Department of Electrical Engineering, University of Engineering and Technology, G.T. Road Lahore, PAKISTAN. ]
More informationROW.mp3. Colin Raffel, Jieun Oh, Isaac Wang Music 422 Final Project 3/12/2010
ROW.mp3 Colin Raffel, Jieun Oh, Isaac Wang Music 422 Final Project 3/12/2010 Motivation The realities of mp3 widespread use low quality vs. bit rate when compared to modern codecs Vision for row-mp3 backwards
More informationCT516 Advanced Digital Communications Lecture 7: Speech Encoder
CT516 Advanced Digital Communications Lecture 7: Speech Encoder Yash M. Vasavada Associate Professor, DA-IICT, Gandhinagar 2nd February 2017 Yash M. Vasavada (DA-IICT) CT516: Adv. Digital Comm. 2nd February
More informationParametric Coding of High-Quality Audio
Parametric Coding of High-Quality Audio Prof. Dr. Gerald Schuller Fraunhofer IDMT & Ilmenau Technical University Ilmenau, Germany 1 Waveform vs Parametric Waveform Filter-bank approach Mainly exploits
More informationHAVE YOUR CAKE AND HEAR IT TOO: A HUFFMAN CODED, BLOCK SWITCHING, STEREO PERCEPTUAL AUDIO CODER
HAVE YOUR CAKE AND HEAR IT TOO: A HUFFMAN CODED, BLOCK SWITCHING, STEREO PERCEPTUAL AUDIO CODER Rob Colcord, Elliot Kermit-Canfield and Blane Wilson Center for Computer Research in Music and Acoustics,
More informationIntroducing Audio Signal Processing & Audio Coding. Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd
Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories,
More informationISO/IEC INTERNATIONAL STANDARD. Information technology MPEG audio technologies Part 3: Unified speech and audio coding
INTERNATIONAL STANDARD This is a preview - click here to buy the full publication ISO/IEC 23003-3 First edition 2012-04-01 Information technology MPEG audio technologies Part 3: Unified speech and audio
More informationI D I A P R E S E A R C H R E P O R T. October submitted for publication
R E S E A R C H R E P O R T I D I A P Temporal Masking for Bit-rate Reduction in Audio Codec Based on Frequency Domain Linear Prediction Sriram Ganapathy a b Petr Motlicek a Hynek Hermansky a b Harinath
More informationPerceptually motivated Sub-band Decomposition for FDLP Audio Coding
Perceptually motivated Sub-band Decomposition for FD Audio Coding Petr Motlicek 1, Sriram Ganapathy 13, Hynek Hermansky 13, Harinath Garudadri 4, and Marios Athineos 5 1 IDIAP Research Institute, Martigny,
More informationAudio coding for digital broadcasting
Recommendation ITU-R BS.1196-4 (02/2015) Audio coding for digital broadcasting BS Series Broadcasting service (sound) ii Rec. ITU-R BS.1196-4 Foreword The role of the Radiocommunication Sector is to ensure
More informationData Compression. Audio compression
1 Data Compression Audio compression Outline Basics of Digital Audio 2 Introduction What is sound? Signal-to-Noise Ratio (SNR) Digitization Filtering Sampling and Nyquist Theorem Quantization Synthetic
More informationSpeech and audio coding
Institut Mines-Telecom Speech and audio coding Marco Cagnazzo, cagnazzo@telecom-paristech.fr MN910 Advanced compression Outline Introduction Introduction Speech signal Music signal Masking Codeurs simples
More informationIntroducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd
Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd Overview Audio Signal Processing Applications @ Dolby Audio Signal Processing Basics
More informationFor Mac and iphone. James McCartney Core Audio Engineer. Eric Allamanche Core Audio Engineer
For Mac and iphone James McCartney Core Audio Engineer Eric Allamanche Core Audio Engineer 2 3 James McCartney Core Audio Engineer 4 Topics About audio representation formats Converting audio Processing
More informationSPREAD SPECTRUM AUDIO WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL
SPREAD SPECTRUM WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL 1 Yüksel Tokur 2 Ergun Erçelebi e-mail: tokur@gantep.edu.tr e-mail: ercelebi@gantep.edu.tr 1 Gaziantep University, MYO, 27310, Gaziantep,
More informationMUSIC A Darker Phonetic Audio Coder
MUSIC 422 - A Darker Phonetic Audio Coder Prateek Murgai and Orchisama Das Abstract In this project we develop an audio coder that tries to improve the quality of the audio at 128kbps per channel by employing
More informationPQMF Filter Bank, MPEG-1 / MPEG-2 BC Audio. Fraunhofer IDMT
PQMF Filter Bank, MPEG-1 / MPEG-2 BC Audio The Basic Paradigm of T/F Domain Audio Coding Digital Audio Input Filter Bank Bit or Noise Allocation Quantized Samples Bitstream Formatting Encoded Bitstream
More informationENTROPY CODING OF QUANTIZED SPECTRAL COMPONENTS IN FDLP AUDIO CODEC
RESEARCH REPORT IDIAP ENTROPY CODING OF QUANTIZED SPECTRAL COMPONENTS IN FDLP AUDIO CODEC Petr Motlicek Sriram Ganapathy Hynek Hermansky Idiap-RR-71-2008 NOVEMBER 2008 Centre du Parc, Rue Marconi 19, P.O.
More informationPrinciples of MPEG audio compression
Principles of MPEG audio compression Principy komprese hudebního signálu metodou MPEG Petr Kubíček Abstract The article describes briefly audio data compression. Focus of the article is a MPEG standard,
More informationAUDIOVISUAL COMMUNICATION
AUDIOVISUAL COMMUNICATION Laboratory Session: Audio Processing and Coding The objective of this lab session is to get the students familiar with audio processing and coding, notably psychoacoustic analysis
More informationEfficient Implementation of Transform Based Audio Coders using SIMD Paradigm and Multifunction Computations
Efficient Implementation of Transform Based Audio Coders using SIMD Paradigm and Multifunction Computations Luckose Poondikulam S (luckose@sasken.com), Suyog Moogi (suyog@sasken.com), Rahul Kumar, K P
More informationSimple Watermark for Stereo Audio Signals with Modulated High-Frequency Band Delay
ACOUSTICAL LETTER Simple Watermark for Stereo Audio Signals with Modulated High-Frequency Band Delay Kazuhiro Kondo and Kiyoshi Nakagawa Graduate School of Science and Engineering, Yamagata University,
More informationModeling of an MPEG Audio Layer-3 Encoder in Ptolemy
Modeling of an MPEG Audio Layer-3 Encoder in Ptolemy Patrick Brown EE382C Embedded Software Systems May 10, 2000 $EVWUDFW MPEG Audio Layer-3 is a standard for the compression of high-quality digital audio.
More information6MPEG-4 audio coding tools
6MPEG-4 audio coding 6.1. Introduction to MPEG-4 audio MPEG-4 audio [58] is currently one of the most prevalent audio coding standards. It combines many different types of audio coding into one integrated
More informationINTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO
INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO ISO/IEC JTC1/SC29/WG11 N15071 February 2015, Geneva,
More informationSAOC and USAC. Spatial Audio Object Coding / Unified Speech and Audio Coding. Lecture Audio Coding WS 2013/14. Dr.-Ing.
SAOC and USAC Spatial Audio Object Coding / Unified Speech and Audio Coding Lecture Audio Coding WS 2013/14 Dr.-Ing. Andreas Franck Fraunhofer Institute for Digital Media Technology IDMT, Germany SAOC
More informationAUDIOVISUAL COMMUNICATION
AUDIOVISUAL COMMUNICATION Laboratory Session: Audio Processing and Coding The objective of this lab session is to get the students familiar with audio processing and coding, notably psychoacoustic analysis
More informationCISC 7610 Lecture 3 Multimedia data and data formats
CISC 7610 Lecture 3 Multimedia data and data formats Topics: Perceptual limits of multimedia data JPEG encoding of images MPEG encoding of audio MPEG and H.264 encoding of video Multimedia data: Perceptual
More informationMPEG-l.MPEG-2, MPEG-4
The MPEG Handbook MPEG-l.MPEG-2, MPEG-4 Second edition John Watkinson PT ^PVTPR AMSTERDAM BOSTON HEIDELBERG LONDON. NEW YORK OXFORD PARIS SAN DIEGO SAN FRANCISCO SINGAPORE SYDNEY TOKYO Focal Press is an
More informationSqueeze Play: The State of Ady0 Cmprshn. Scott Selfon Senior Development Lead Xbox Advanced Technology Group Microsoft
Squeeze Play: The State of Ady0 Cmprshn Scott Selfon Senior Development Lead Xbox Advanced Technology Group Microsoft Agenda Why compress? The tools at present Measuring success A glimpse of the future
More informationCompression; Error detection & correction
Compression; Error detection & correction compression: squeeze out redundancy to use less memory or use less network bandwidth encode the same information in fewer bits some bits carry no information some
More information2014 Summer School on MPEG/VCEG Video. Video Coding Concept
2014 Summer School on MPEG/VCEG Video 1 Video Coding Concept Outline 2 Introduction Capture and representation of digital video Fundamentals of video coding Summary Outline 3 Introduction Capture and representation
More informationThe following bit rates are recommended for broadcast contribution employing the most commonly used audio coding schemes:
Page 1 of 8 1. SCOPE This Operational Practice sets out guidelines for minimising the various artefacts that may distort audio signals when low bit-rate coding schemes are employed to convey contribution
More informationDSP. Presented to the IEEE Central Texas Consultants Network by Sergio Liberman
DSP The Technology Presented to the IEEE Central Texas Consultants Network by Sergio Liberman Abstract The multimedia products that we enjoy today share a common technology backbone: Digital Signal Processing
More informationAVS-P3: Algorithm and Implementation
858 JOURNAL OF SOFTWARE, VOL. 5, NO. 8, AUGUST 2 AVS-P3: Algorithm and Implementation Zhang Tao, Dong Yuxi and Zhang Wen School of Electronic and Information Engineering, Tianjin University, Tianjin, China
More informationShape Optimization Design of Gravity Buttress of Arch Dam Based on Asynchronous Particle Swarm Optimization Method. Lei Xu
Applied Mechanics and Materials Submitted: 2014-08-26 ISSN: 1662-7482, Vol. 662, pp 160-163 Accepted: 2014-08-31 doi:10.4028/www.scientific.net/amm.662.160 Online: 2014-10-01 2014 Trans Tech Publications,
More informationsignal-to-noise ratio (PSNR), 2
u m " The Integration in Optics, Mechanics, and Electronics of Digital Versatile Disc Systems (1/3) ---(IV) Digital Video and Audio Signal Processing ƒf NSC87-2218-E-009-036 86 8 1 --- 87 7 31 p m o This
More informationDesign and Implementation of MP3 Player Based on FPGA Dezheng Sun
Applied Mechanics and Materials Online: 2013-10-31 ISSN: 1662-7482, Vol. 443, pp 746-749 doi:10.4028/www.scientific.net/amm.443.746 2014 Trans Tech Publications, Switzerland Design and Implementation of
More informationKINGS COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY ACADEMIC YEAR / ODD SEMESTER QUESTION BANK
KINGS COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY ACADEMIC YEAR 2011-2012 / ODD SEMESTER QUESTION BANK SUB.CODE / NAME YEAR / SEM : IT1301 INFORMATION CODING TECHNIQUES : III / V UNIT -
More informationAudio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011
Audio Fundamentals, Compression Techniques & Standards Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011 Outlines Audio Fundamentals Sampling, digitization, quantization μ-law
More informationEmbedded lossless audio coding using linear prediction and cascade coding
University of Wollongong Research Online University of Wollongong Thesis Collection 1954-2016 University of Wollongong Thesis Collections 2005 Embedded lossless audio coding using linear prediction and
More informationAudio and video compression
Audio and video compression 4.1 introduction Unlike text and images, both audio and most video signals are continuously varying analog signals. Compression algorithms associated with digitized audio and
More informationSubjective and Objective Assessment of Perceived Audio Quality of Current Digital Audio Broadcasting Systems and Web-Casting Applications
Subjective and Objective Assessment of Perceived Audio Quality of Current Digital Audio Broadcasting Systems and Web-Casting Applications Peter Počta {pocta@fel.uniza.sk} Department of Telecommunications
More informationSource Coding Basics and Speech Coding. Yao Wang Polytechnic University, Brooklyn, NY11201
Source Coding Basics and Speech Coding Yao Wang Polytechnic University, Brooklyn, NY1121 http://eeweb.poly.edu/~yao Outline Why do we need to compress speech signals Basic components in a source coding
More informationEBU Tech Doc Three parts of Tech Doc 3344
EBU Tech Doc 3344 Andrew Mason BBC Research & Development Chairman of EBU Strategic Programme Group Future Audio and Radio Three parts of Tech Doc 3344 1 -35-30 -25-23 -20-18 -15-12 -10-5 -1 0-35 -30-25
More informationMotion Estimation. Original. enhancement layers. Motion Compensation. Baselayer. Scan-Specific Entropy Coding. Prediction Error.
ON VIDEO SNR SCALABILITY Lisimachos P. Kondi, Faisal Ishtiaq and Aggelos K. Katsaggelos Northwestern University Dept. of Electrical and Computer Engineering 2145 Sheridan Road Evanston, IL 60208 E-Mail:
More informationThe Analysis and Research of IPTV Set-top Box System. Fangyan Bai 1, Qi Sun 2
Applied Mechanics and Materials Online: 2012-12-13 ISSN: 1662-7482, Vols. 256-259, pp 2898-2901 doi:10.4028/www.scientific.net/amm.256-259.2898 2013 Trans Tech Publications, Switzerland The Analysis and
More informationPerceptual Pre-weighting and Post-inverse weighting for Speech Coding
Perceptual Pre-weighting and Post-inverse weighting for Speech Coding Niranjan Shetty and Jerry D. Gibson Department of Electrical and Computer Engineering University of California, Santa Barbara, CA,
More informationMP3. Panayiotis Petropoulos
MP3 By Panayiotis Petropoulos Overview Definition History MPEG standards MPEG 1 / 2 Layer III Why audio compression through Mp3 is necessary? Overview MPEG Applications Mp3 Devices Mp3PRO Conclusion Definition
More informationCh. 5: Audio Compression Multimedia Systems
Ch. 5: Audio Compression Multimedia Systems Prof. Ben Lee School of Electrical Engineering and Computer Science Oregon State University Chapter 5: Audio Compression 1 Introduction Need to code digital
More informationCSCD 443/533 Advanced Networks Fall 2017
CSCD 443/533 Advanced Networks Fall 2017 Lecture 18 Compression of Video and Audio 1 Topics Compression technology Motivation Human attributes make it possible Audio Compression Video Compression Performance
More informationSpeech-Coding Techniques. Chapter 3
Speech-Coding Techniques Chapter 3 Introduction Efficient speech-coding techniques Advantages for VoIP Digital streams of ones and zeros The lower the bandwidth, the lower the quality RTP payload types
More informationA PSYCHOACOUSTIC MODEL WITH PARTIAL SPECTRAL FLATNESS MEASURE FOR TONALITY ESTIMATION
A PSYCHOACOUSTIC MODEL WITH PARTIAL SPECTRAL FLATNESS MEASURE FOR TONALITY ESTIMATION Armin Taghipour 1, Maneesh Chandra Jaikumar 2, and Bernd Edler 1 1 International Audio Laboratories Erlangen, Am Wolfsmantel
More informationFINE-GRAIN SCALABLE AUDIO CODING BASED ON ENVELOPE RESTORATION AND THE SPIHT ALGORITHM
FINE-GRAIN SCALABLE AUDIO CODING BASED ON ENVELOPE RESTORATION AND THE SPIHT ALGORITHM Heiko Hansen, Stefan Strahl Carl von Ossietzky University Oldenburg Department of Physics D-6111 Oldenburg, Germany
More informationISO/IEC INTERNATIONAL STANDARD
INTERNATIONAL STANDARD ISO/IEC 13818-7 Second edition 2003-08-01 Information technology Generic coding of moving pictures and associated audio information Part 7: Advanced Audio Coding (AAC) Technologies
More informationPerformance analysis of AAC audio codec and comparison of Dirac Video Codec with AVS-china. Under guidance of Dr.K.R.Rao Submitted By, ASHWINI S URS
Performance analysis of AAC audio codec and comparison of Dirac Video Codec with AVS-china Under guidance of Dr.K.R.Rao Submitted By, ASHWINI S URS Outline Overview of Dirac Overview of AVS-china Overview
More informationAudio Engineering Society. Convention Paper. Presented at the 126th Convention 2009 May 7 10 Munich, Germany
Audio Engineering Society Convention Paper Presented at the 126th Convention 2009 May 7 10 Munich, Germany 7712 The papers at this Convention have been selected on the basis of a submitted abstract and
More informationDigiPoints Volume 1. Student Workbook. Module 8 Digital Compression
Digital Compression Page 8.1 DigiPoints Volume 1 Module 8 Digital Compression Summary This module describes the techniques by which digital signals are compressed in order to make it possible to carry
More informationANALYSIS OF SPIHT ALGORITHM FOR SATELLITE IMAGE COMPRESSION
ANALYSIS OF SPIHT ALGORITHM FOR SATELLITE IMAGE COMPRESSION K Nagamani (1) and AG Ananth (2) (1) Assistant Professor, R V College of Engineering, Bangalore-560059. knmsm_03@yahoo.com (2) Professor, R V
More informationTechnical PapER. between speech and audio coding. Fraunhofer Institute for Integrated Circuits IIS
Technical PapER Extended HE-AAC Bridging the gap between speech and audio coding One codec taking the place of two; one unified system bridging a troublesome gap. The fifth generation MPEG audio codec
More informationStructural analysis of low latency audio coding schemes
Structural analysis of low latency audio coding schemes Manfred Lutzky, Markus Schnell, Markus Schmidt and Ralf Geiger Fraunhofer Institute for Integrated Circuits IIS, Am Wolfsmantel 33, 91058 Erlangen,
More information19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007
19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 SUBJECTIVE AND OBJECTIVE QUALITY EVALUATION FOR AUDIO WATERMARKING BASED ON SINUSOIDAL AMPLITUDE MODULATION PACS: 43.10.Pr, 43.60.Ek
More information