Principles of MPEG audio compression
|
|
- Hillary Martin
- 5 years ago
- Views:
Transcription
1 Principles of MPEG audio compression Principy komprese hudebního signálu metodou MPEG Petr Kubíček Abstract The article describes briefly audio data compression. Focus of the article is a MPEG standard, which define world wide compatibility of compressed data. There are descriptions of principles audio data compression (masking effects), compatibilities and compress ratios. Finally there are a lot of examples where are MPEG standards used. There is focus for real time applications which are working in the world at present time. INTRODUCTION TO AUDIO MPEG The article is based on standard ISO/IEC (ISO/IEC 11172:1993 Information technology-coding of moving pictures and associated audio for digital storage media at up to about 1,5Mbit/s). Actually it is standard for MPEG-1 compression. It consist from five parts: Systems, Video, Audio, Conformance testing, Software simulation. Next paragraphs are used only Part 3: Audio. In fact, there are several standards for MPEG compression. Later, in 1996, was set up other standard for MPEG2 standard. It is ISO/IEC 13818:1996 Information technology-generic coding of moving pictures and associated audio information. Figure 1 1: ISO standards of MPEG 1 Ing. Petr Kubíček, Department of Telecommunications, Brno University of Technology, Purkyňova 118, Brno, tel.: 05/ fax.: 05/ kubicekp@utko.fee.vutbr.cz
2 Figure 1 shows similarity of MPEG-1 and MPEG-2. More over MPEG-2 is built for low bit rates and for more audio channels compare to MPEG-1. Low bit rate is a demand by industry of electronics (for example radio connection). On the other hand high bit rate is caused with modern trends in house acoustic (home video, surround sound). The MPEG-1 coding algorithms have mainly been designed for high quality audio compression (CD-like quality) of mono- or stereophonic signals. When the MPEG-1 standard was established, it became obvious that two extensions of the standard would open new applications for digital audio coding: Better quality for low bit rates. Many applications don't need CD-like quality, but lowest possible amount of data. Although it's possible to generate bitstreams down to 32 kbit/sec with MPEG-1, they don't offer the best possible quality, because they were designed for high quality audio. Coding of up to 5 audio channels (and an optional low frequency enhancement channel) to support multichannel surround sound. LAYERS The MPEG group chose to recommend three compression methods and named them Audio Level I, II and III. Level I (sometimes named Layer I) is the simplest, a subband-coder with a psychoacoustic model. Layer II adds more advanced bit allocation techniques and greater accuracy. Layer III adds a hybrid filter bank and non-uniform quantization. Layer I, II and III gives increasing quality/compression ratios with increasing complexity and demand on processing power. The reason for recommending three methods was based on testers, whose felt that none of coders was 100% transparent to all material and partly that the best coder (layer III) was so heavy in computing that it would seriously impact the acceptance of the standard. LAYER COMPATIBILITY The specifications say, that a valid Layer III decoder shall be able to decode any Layer I, II or III MPEG audio stream. A Layer II decoder shall be able to decode Layer I and Layer II MPEG audio stream. In order to ensure the backward compatibility, an "old" MPEG-1 decoder must be able to decode the basic stereo signal of the MPEG-2 multi-channel signal, that are the L0 and R0 signals as defined in MPEG-1. This may be achieved by sending a suitable
3 mix of the multiple channels as L0 and R0, or by sending L and R as L0 and R0. The rest of the information is transmitted in the ancillary data channel, while other ancillary data may still be transmitted. AUDIO COMPRESSION Exists two alternatives how to simplify audio signal data stream. Either is sampled less often (less than times per second) or is sampled with less resolution (less than 16 bits per sample). If is demanded quality, sample frequency cannot be changed. Humans can hear sounds with frequencies from 20 Hz to 20 khz approximately (depends on person`s sence). According to the Nyquist theorem (equation 1) sample must be taken at least two times the highest frequency as is asked for producing. Allowing to imperfect filters, a 44.1 khz sampling rate is a fair minimum. So is either set out to prove the Nyquist theorem is wrong or go to work on reducing resolution. Fs > 2 * Fmax (1) Now, the real reason for using 16 bits is to get a good signal-to-noise (s/n) ratio. The noise is quantization noise from the digitizing process. For each added bit is signalto-noise ratio better about 6 db. To the ear, 6 db corresponds to a doubling of the sound level. CD audio quality achieves about 90 db s/n. This matches the dynamic range of the ear fairly well. That is no noise for listeners coming from the system. If samples has only eight bits as sampling resolution, the noise floor in recording is increased. Now it is noticeable for humans. It is possible to hear it in silent moments in the music between words or sentences. But in loud passages is not noticed it. This is the masking effect and it is the key of MPEG audio coding. In develop a coding method, the basis had to be the human ear. Unfortunately this is not a perfect device for acoustic reception but is the best that is known. Advantage was taken of one of the humane ear's shortcomings: its non-linear and adaptive threshold of hearing. MASKING EFFECT The threshold of hearing is the level below which a sound is not heard. It varies with frequency and, of course, between individuals. Whether a person hears a sound or not depends on the frequency of the sound and whether its amplitude is above or below that persons hearing threshold at that frequency.
4 The threshold of hearing is also adaptive, being constantly changed by the sounds heard. For example, an ordinary conversation in a room is perfectly audible under normal conditions. However, the same conversation in the vicinity of a loud noise, such as an aircraft passing low overhead, is impossible to hear due to the distortions introduced to the hearing thresholds of the individuals concerned. When the aircraft has gone the hearing thresholds return to normal. Sounds that are inaudible due to dynamic adaptation of the hearing threshold are said to be "masked". This effect is universal but is of particular relevance in music. An orchestra instrument playing fortissimo will, to a greater or lesser extent, make the sound of some other instruments inaudible to the human ear. When the music is recorded, however, all the frequencies go on the medium because the response of the recording device is flat, i.e. it is not dynamically adaptive. When the recording is played the masked instruments will not be audible to the listener, so might as well not be there. A linear recording, as used on CD, is inefficient in this respect. To make the most use of a recording medium the parts of the medium that contain inaudible data can better be used for audible data. In this way the amount of the recording medium needed to contain the music can be considerably reduced without any loss of audio quality. Because it is the main aspect of the MPEG encoding process to understand, next paragraph explains masking effect in an other angle of sight, on concrete signal. When there is a strong tone with a frequency of 1000 Hz nearby a tone with 1100 Hz, but 18 db lower, second tone is not heard anymore. It is completely masked by the first 1000 Hz tone. As a matter of fact, any relatively weak sound near a strong sound is masked. So the further you get from a sound the less masking effect it has. The masking effect means possibility to raise the noise floor around a strong sound, because the noise is masked by stronger sound. Raising the noise is the same as using less bits. And using less bits is the same as compression.
5 Figure 2: Masking effect The change of the hearing threshold so that a weak tone cannot be heard in presence of a strong tone is what we call masking. In the figure 2 the dotted line represents the unmodified hearing threshold. The area below the masking thresholds is where we find the inaudible information. MPEG AUDIO ENCODER A perceptual subband audio encoder constantly analyses the incoming audio signal and determines the masking curve, the threshold under which additional noise will not be audible by the human auditory system. MPEG audio coder in first step divide the frequency audio spectrum (from 20 Hz to 20 khz) into 32 subbands. Each subband holds a little slice of the audio spectrum. Later the coder calculates the masking effect of the sound in each subband. It means, if tone of 1000 Hz is in eigth subband, it will have the effect through entire subband too. The coder find appropriate masking threshold. So that acceptable s/n ratio is difference between level of strong tone and threshold looked by coder. This difference point to appropriate number of bit resolution. In addition, there are masking effects on near bands. The effect decrease with the distance from basic band (now No. 8). In real-life situation you have sounds in most bands and the masking effects are additive. In addition, the coder considers the sensitivity of the ear for various frequencies. The ear is a lot sensitive in the high and low frequencies. Peak sensitivity is around 2-4 khz, the same region that human voice occupies.
6 The information used in a quantizer in each subband is transmitted along with the coded subband samples. The decoder can then decode the bit stream without knowing how the encoder determined this information. This allows for encoders of different quality and complexity, and for future improvements of encoders. In MPEG Layer I and II each subband is 625 Hz wide (for Fs=44.1kHz). It would be better, if the sub-bands are narrower in the low frequency range and wider in the high frequency range. To do that it is necessary to use more complex filters (Layer III). To keep the filters simple was chosen to add FFT in parallel with filtering and use the spectral components from the FFT as additional information to the coder. This way gives higher resolution in the low frequencies where the ear is more sensitive. Masking effect also occurs before and after strong sound (pre- and post- masking effect). The brain needs some processing time. Premasking is only about 2 to 5 ms, postmasking can bi up till 100 ms. Other bit reduction techniques involve considering tonal and non tonal components of the sound. For the stereo signal is a lot of redundancy between channels. The last step before formatting is Huffman coding (Layer III only). For Layer II the coder works on 23 ms of sound (1152 samples) at a time and for Layer I it is 8.7 ms (because 384 samples only). It should be a problem. COMPRESS RATIO (BIT RATES) The calculations are done on a basis of a psychoacoustic model, which is the heart of MPEG audio coding. Of course, the encoder does not know what the actual sound pressure is at the listener's premises, but this is for practical purposes not necessary either. Experiments show that the coding, at a given bit rate, gives the same impression of quality for a wide range of sound pressures. One may add that the quantizing noise at low sound pressures and modest compression ratios is much too low to be noticed even by trained ears. Layer I is the simplest layer which is suitable for consumer use. Bit rates range from 32 kbit/s (mono) up to 448 kbit/s (stereo). Depending on the complexity of the encoder, a high (near CD) audio quality requires a bit rate of about kb/s per stereo program. The complexity of the decoder is low, the encoder complexity is about 1.5 to 3 times as high.
7 LAYER BIT RATE FROM [kbps] TO [kbps] LAYER I LAYER II LAYER III Table 1: Bit rates for every layer Layer II offers more compression than Layer I. Bit rates range from kb/s for mono, and from kb/s for stereo. Depending on the complexity of the encoder, a high (near CD) audio quality requires a bit rate of about kb/s per stereo program. The complexity of the decoder is about 25 % higher than for Layer I decoder, the encoder complexity is about 2 to 4 times higher. APPLICATIONS There are two audio coding schemes were the origin for three Layers: MUSICAM and ASPEC. Whereas the ASPEC coding scheme offered the best audio quality, especially at low bit rates or below 64 kbit/sec/channel, MUSICAM had some advantages with respect to implementation complexity and decoding delay. In order to have a generic coding system applicable for various demands, it was decided to establish a standard with three Layers: Layer II is nearly identical with the MUSICAM coding scheme, and Layer I is a simplified version of Layer II resp. MUSICAM. Nevertheless Layer III is mainly based on ASPEC and offers the best audio quality amongst the MPEG audio compression schemes. Layer II has numerous applications in both consumer and professional audio, such as audio broadcasting, television, telecommunications and multi-media. Layer III extends MPEG-1 applications into narrow band ISDN telecommu-nications and certain specialist areas of professional audio. PCM MPEG coded data A / D ENCODER CD Phone microphone Computer data storage Computer Figure 3: Applications for MPEG-1: encoding Satellite dish
8 Audio above the adapted hearing threshold is coded and anything below it is ignored. Using such a coding scheme reduces the amount of a medium required to make an audio recording by a factor of six. MPEG coded data PCM Data storage Phone DECODER D / A Satellite dish CD Computer speaker Figure4: Applications for MPEG-1: decoding Products with MPEG Layer I coding : Digital Compact Disk (DCC) (PASC), Solid State Audio (theme parks, traffic), Disk storage and editing, CD-i Full Motion Video. Products with MPEG Layer II coding : CD-i Full Motion Video, Video CD, Solid State Audio (theme parks, traffic), Disk storage and editing, Digital Audio Broadcast, Digital Versatile Disk (DVD), Cable and satellite radio (e.g. DMX), Cable and satellite TV (e.g. Digital Video Broadcast, Digital Satellite System), Contribution, distribution and emission links (e.g. ISDN, microwave radio), Movie sound tracks. Products with MPEG Layer III coding : ISDN. Literature [ 1 ] ISO/IEC Office, International Standard ISO/IEC , Information technology - Coding of moving pictures and associated audio for digital storage media at up to about 1,5 Mbit / s - Part 3: Audio. Switzerland, Geneve, [ 2 ] MPEG - FAQ: Multimedia compression - version 4.1., [ 3 ] Cornet, O.; Dejeux, I.; Tchong, C.: Etude du Codage MPEG et Conception d un Codeur bas debit. ESIEE, Paris, 1996.
Mpeg 1 layer 3 (mp3) general overview
Mpeg 1 layer 3 (mp3) general overview 1 Digital Audio! CD Audio:! 16 bit encoding! 2 Channels (Stereo)! 44.1 khz sampling rate 2 * 44.1 khz * 16 bits = 1.41 Mb/s + Overhead (synchronization, error correction,
More informationAudio-coding standards
Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.
More informationPerceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.
Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general
More informationAudio-coding standards
Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.
More informationBoth LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.
Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general
More information5: Music Compression. Music Coding. Mark Handley
5: Music Compression Mark Handley Music Coding LPC-based codecs model the sound source to achieve good compression. Works well for voice. Terrible for music. What if you can t model the source? Model the
More informationPrinciples of Audio Coding
Principles of Audio Coding Topics today Introduction VOCODERS Psychoacoustics Equal-Loudness Curve Frequency Masking Temporal Masking (CSIT 410) 2 Introduction Speech compression algorithm focuses on exploiting
More informationChapter 14 MPEG Audio Compression
Chapter 14 MPEG Audio Compression 14.1 Psychoacoustics 14.2 MPEG Audio 14.3 Other Commercial Audio Codecs 14.4 The Future: MPEG-7 and MPEG-21 14.5 Further Exploration 1 Li & Drew c Prentice Hall 2003 14.1
More informationMultimedia Communications. Audio coding
Multimedia Communications Audio coding Introduction Lossy compression schemes can be based on source model (e.g., speech compression) or user model (audio coding) Unlike speech, audio signals can be generated
More informationAudio Compression. Audio Compression. Absolute Threshold. CD quality audio:
Audio Compression Audio Compression CD quality audio: Sampling rate = 44 KHz, Quantization = 16 bits/sample Bit-rate = ~700 Kb/s (1.41 Mb/s if 2 channel stereo) Telephone-quality speech Sampling rate =
More informationMPEG-1. Overview of MPEG-1 1 Standard. Introduction to perceptual and entropy codings
MPEG-1 Overview of MPEG-1 1 Standard Introduction to perceptual and entropy codings Contents History Psychoacoustics and perceptual coding Entropy coding MPEG-1 Layer I/II Layer III (MP3) Comparison and
More informationPerceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding
Perceptual Coding Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding Part II wrap up 6.082 Fall 2006 Perceptual Coding, Slide 1 Lossless vs.
More information2.4 Audio Compression
2.4 Audio Compression 2.4.1 Pulse Code Modulation Audio signals are analog waves. The acoustic perception is determined by the frequency (pitch) and the amplitude (loudness). For storage, processing and
More informationEE482: Digital Signal Processing Applications
Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 13 Audio Signal Processing 14/04/01 http://www.ee.unlv.edu/~b1morris/ee482/
More informationAppendix 4. Audio coding algorithms
Appendix 4. Audio coding algorithms 1 Introduction The main application of audio compression systems is to obtain compact digital representations of high-quality (CD-quality) wideband audio signals. Typically
More informationOptical Storage Technology. MPEG Data Compression
Optical Storage Technology MPEG Data Compression MPEG-1 1 Audio Standard Moving Pictures Expert Group (MPEG) was formed in 1988 to devise compression techniques for audio and video. It first devised the
More informationAudio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011
Audio Fundamentals, Compression Techniques & Standards Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011 Outlines Audio Fundamentals Sampling, digitization, quantization μ-law
More informationFigure 1. Generic Encoder. Window. Spectral Analysis. Psychoacoustic Model. Quantize. Pack Data into Frames. Additional Coding.
Introduction to Digital Audio Compression B. Cavagnolo and J. Bier Berkeley Design Technology, Inc. 2107 Dwight Way, Second Floor Berkeley, CA 94704 (510) 665-1600 info@bdti.com http://www.bdti.com INTRODUCTION
More informationAudio and video compression
Audio and video compression 4.1 introduction Unlike text and images, both audio and most video signals are continuously varying analog signals. Compression algorithms associated with digitized audio and
More informationLecture 16 Perceptual Audio Coding
EECS 225D Audio Signal Processing in Humans and Machines Lecture 16 Perceptual Audio Coding 2012-3-14 Professor Nelson Morgan today s lecture by John Lazzaro www.icsi.berkeley.edu/eecs225d/spr12/ Hero
More informationIntroducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd
Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd Overview Audio Signal Processing Applications @ Dolby Audio Signal Processing Basics
More informationFundamentals of Perceptual Audio Encoding. Craig Lewiston HST.723 Lab II 3/23/06
Fundamentals of Perceptual Audio Encoding Craig Lewiston HST.723 Lab II 3/23/06 Goals of Lab Introduction to fundamental principles of digital audio & perceptual audio encoding Learn the basics of psychoacoustic
More informationParametric Coding of High-Quality Audio
Parametric Coding of High-Quality Audio Prof. Dr. Gerald Schuller Fraunhofer IDMT & Ilmenau Technical University Ilmenau, Germany 1 Waveform vs Parametric Waveform Filter-bank approach Mainly exploits
More informationCh. 5: Audio Compression Multimedia Systems
Ch. 5: Audio Compression Multimedia Systems Prof. Ben Lee School of Electrical Engineering and Computer Science Oregon State University Chapter 5: Audio Compression 1 Introduction Need to code digital
More informationWhat is multimedia? Multimedia. Continuous media. Most common media types. Continuous media processing. Interactivity. What is multimedia?
Multimedia What is multimedia? Media types +Text + Graphics + Audio +Image +Video Interchange formats What is multimedia? Multimedia = many media User interaction = interactivity Script = time 1 2 Most
More informationIntroducing Audio Signal Processing & Audio Coding. Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd
Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories,
More informationCISC 7610 Lecture 3 Multimedia data and data formats
CISC 7610 Lecture 3 Multimedia data and data formats Topics: Perceptual limits of multimedia data JPEG encoding of images MPEG encoding of audio MPEG and H.264 encoding of video Multimedia data: Perceptual
More informationMultimedia. What is multimedia? Media types. Interchange formats. + Text +Graphics +Audio +Image +Video. Petri Vuorimaa 1
Multimedia What is multimedia? Media types + Text +Graphics +Audio +Image +Video Interchange formats Petri Vuorimaa 1 What is multimedia? Multimedia = many media User interaction = interactivity Script
More informationCHAPTER 10: SOUND AND VIDEO EDITING
CHAPTER 10: SOUND AND VIDEO EDITING What should you know 1. Edit a sound clip to meet the requirements of its intended application and audience a. trim a sound clip to remove unwanted material b. join
More informationELL 788 Computational Perception & Cognition July November 2015
ELL 788 Computational Perception & Cognition July November 2015 Module 11 Audio Engineering: Perceptual coding Coding and decoding Signal (analog) Encoder Code (Digital) Code (Digital) Decoder Signal (analog)
More informationAudio Coding and MP3
Audio Coding and MP3 contributions by: Torbjørn Ekman What is Sound? Sound waves: 20Hz - 20kHz Speed: 331.3 m/s (air) Wavelength: 165 cm - 1.65 cm 1 Analogue audio frequencies: 20Hz - 20kHz mono: x(t)
More informationNetworking Applications
Networking Dr. Ayman A. Abdel-Hamid College of Computing and Information Technology Arab Academy for Science & Technology and Maritime Transport Multimedia Multimedia 1 Outline Audio and Video Services
More informationData Compression. Audio compression
1 Data Compression Audio compression Outline Basics of Digital Audio 2 Introduction What is sound? Signal-to-Noise Ratio (SNR) Digitization Filtering Sampling and Nyquist Theorem Quantization Synthetic
More informationModule 9 AUDIO CODING. Version 2 ECE IIT, Kharagpur
Module 9 AUDIO CODING Lesson 29 Transform and Filter banks Instructional Objectives At the end of this lesson, the students should be able to: 1. Define the three layers of MPEG-1 audio coding. 2. Define
More informationDVB Audio. Leon van de Kerkhof (Philips Consumer Electronics)
eon van de Kerkhof Philips onsumer Electronics Email: eon.vandekerkhof@ehv.ce.philips.com Introduction The introduction of the ompact Disc, already more than fifteen years ago, has brought high quality
More informationThe following bit rates are recommended for broadcast contribution employing the most commonly used audio coding schemes:
Page 1 of 8 1. SCOPE This Operational Practice sets out guidelines for minimising the various artefacts that may distort audio signals when low bit-rate coding schemes are employed to convey contribution
More informationAUDIOVISUAL COMMUNICATION
AUDIOVISUAL COMMUNICATION Laboratory Session: Audio Processing and Coding The objective of this lab session is to get the students familiar with audio processing and coding, notably psychoacoustic analysis
More informationDSP. Presented to the IEEE Central Texas Consultants Network by Sergio Liberman
DSP The Technology Presented to the IEEE Central Texas Consultants Network by Sergio Liberman Abstract The multimedia products that we enjoy today share a common technology backbone: Digital Signal Processing
More information1. Before adjusting sound quality
1. Before adjusting sound quality Functions available when the optional 5.1 ch decoder/av matrix unit is connected The following table shows the finer audio adjustments that can be performed when the optional
More informationMPEG-4 aacplus - Audio coding for today s digital media world
MPEG-4 aacplus - Audio coding for today s digital media world Whitepaper by: Gerald Moser, Coding Technologies November 2005-1 - 1. Introduction Delivering high quality digital broadcast content to consumers
More informationCSCD 443/533 Advanced Networks Fall 2017
CSCD 443/533 Advanced Networks Fall 2017 Lecture 18 Compression of Video and Audio 1 Topics Compression technology Motivation Human attributes make it possible Audio Compression Video Compression Performance
More informationAUDIOVISUAL COMMUNICATION
AUDIOVISUAL COMMUNICATION Laboratory Session: Audio Processing and Coding The objective of this lab session is to get the students familiar with audio processing and coding, notably psychoacoustic analysis
More informationDigital Media. Daniel Fuller ITEC 2110
Digital Media Daniel Fuller ITEC 2110 Daily Question: Digital Audio What values contribute to the file size of a digital audio file? Email answer to DFullerDailyQuestion@gmail.com Subject Line: ITEC2110-09
More informationDAB. Digital Audio Broadcasting
DAB Digital Audio Broadcasting DAB history DAB has been under development since 1981 at the Institut für Rundfunktechnik (IRT). In 1985 the first DAB demonstrations were held at the WARC-ORB in Geneva
More informationWavelet filter bank based wide-band audio coder
Wavelet filter bank based wide-band audio coder J. Nováček Czech Technical University, Faculty of Electrical Engineering, Technicka 2, 16627 Prague, Czech Republic novacj1@fel.cvut.cz 3317 New system for
More informationLossy compression. CSCI 470: Web Science Keith Vertanen
Lossy compression CSCI 470: Web Science Keith Vertanen Digital audio Overview Sampling rate Quan5za5on MPEG audio layer 3 (MP3) JPEG s5ll images Color space conversion, downsampling Discrete Cosine Transform
More informationMPEG-4 General Audio Coding
MPEG-4 General Audio Coding Jürgen Herre Fraunhofer Institute for Integrated Circuits (IIS) Dr. Jürgen Herre, hrr@iis.fhg.de 1 General Audio Coding Solid state players, Internet audio, terrestrial and
More informationVideoCD Audio + Stills A solution compatible with DVD players
VideoCD Audio + Stills A solution compatible with DVD players 1. INTRODUCTION This manual is a translation into English from the original Spanish document available in www.videoedicion.org and www.vcdsp.com,
More informationSpeech and audio coding
Institut Mines-Telecom Speech and audio coding Marco Cagnazzo, cagnazzo@telecom-paristech.fr MN910 Advanced compression Outline Introduction Introduction Speech signal Music signal Masking Codeurs simples
More informationAn adaptive wavelet-based approach for perceptual low bit rate audio coding attending to entropy-type criteria
An adaptive wavelet-based approach for perceptual low bit rate audio coding attending to entropy-type criteria N. RUIZ REYES 1, M. ROSA ZURERA 2, F. LOPEZ FERRERAS 2, D. MARTINEZ MUÑOZ 1 1 Departamento
More informationNew Results in Low Bit Rate Speech Coding and Bandwidth Extension
Audio Engineering Society Convention Paper Presented at the 121st Convention 2006 October 5 8 San Francisco, CA, USA This convention paper has been reproduced from the author's advance manuscript, without
More informationHAVE YOUR CAKE AND HEAR IT TOO: A HUFFMAN CODED, BLOCK SWITCHING, STEREO PERCEPTUAL AUDIO CODER
HAVE YOUR CAKE AND HEAR IT TOO: A HUFFMAN CODED, BLOCK SWITCHING, STEREO PERCEPTUAL AUDIO CODER Rob Colcord, Elliot Kermit-Canfield and Blane Wilson Center for Computer Research in Music and Acoustics,
More informationCompressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved.
Compressed Audio Demystified Why Music Producers Need to Care About Compressed Audio Files Download Sales Up CD Sales Down High-Definition hasn t caught on yet Consumers don t seem to care about high fidelity
More informationMP3. Panayiotis Petropoulos
MP3 By Panayiotis Petropoulos Overview Definition History MPEG standards MPEG 1 / 2 Layer III Why audio compression through Mp3 is necessary? Overview MPEG Applications Mp3 Devices Mp3PRO Conclusion Definition
More informationISO/IEC INTERNATIONAL STANDARD. Information technology MPEG audio technologies Part 3: Unified speech and audio coding
INTERNATIONAL STANDARD This is a preview - click here to buy the full publication ISO/IEC 23003-3 First edition 2012-04-01 Information technology MPEG audio technologies Part 3: Unified speech and audio
More informationTransporting audio-video. over the Internet
Transporting audio-video over the Internet Key requirements Bit rate requirements Audio requirements Video requirements Delay requirements Jitter Inter-media synchronization On compression... TCP, UDP
More informationSPREAD SPECTRUM AUDIO WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL
SPREAD SPECTRUM WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL 1 Yüksel Tokur 2 Ergun Erçelebi e-mail: tokur@gantep.edu.tr e-mail: ercelebi@gantep.edu.tr 1 Gaziantep University, MYO, 27310, Gaziantep,
More informationDRA AUDIO CODING STANDARD
Applied Mechanics and Materials Online: 2013-06-27 ISSN: 1662-7482, Vol. 330, pp 981-984 doi:10.4028/www.scientific.net/amm.330.981 2013 Trans Tech Publications, Switzerland DRA AUDIO CODING STANDARD Wenhua
More information/ / _ / _ / _ / / / / /_/ _/_/ _/_/ _/_/ _\ / All-American-Advanced-Audio-Codec
/ / _ / _ / _ / / / / /_/ _/_/ _/_/ _/_/ _\ / All-American-Advanced-Audio-Codec () **Z ** **=Z ** **= ==== == **= ==== \"\" === ==== \"\"\" ==== \"\"\"\" Tim O Brien Colin Sullivan Jennifer Hsu Mayank
More informationMPEG-1 Bitstreams Processing for Audio Content Analysis
ISSC, Cork. June 5- MPEG- Bitstreams Processing for Audio Content Analysis Roman Jarina, Orla Duffner, Seán Marlow, Noel O Connor, and Noel Murphy Visual Media Processing Group Dublin City University Glasnevin,
More informationBit or Noise Allocation
ISO 11172-3:1993 ANNEXES C & D 3-ANNEX C (informative) THE ENCODING PROCESS 3-C.1 Encoder 3-C.1.1 Overview For each of the Layers, an example of one suitable encoder with the corresponding flow-diagram
More informationPort of a Fixed Point MPEG-2 AAC Encoder on a ARM Platform
Port of a Fixed Point MPEG-2 AAC Encoder on a ARM Platform by Romain Pagniez romain@felinewave.com A Dissertation submitted in partial fulfillment of the requirements for the Degree of Master of Science
More informationImplementation of a MPEG 1 Layer I Audio Decoder with Variable Bit Lengths
Implementation of a MPEG 1 Layer I Audio Decoder with Variable Bit Lengths A thesis submitted in fulfilment of the requirements of the degree of Master of Engineering 23 September 2008 Damian O Callaghan
More informationCS 074 The Digital World. Digital Audio
CS 074 The Digital World Digital Audio 1 Digital Audio Waves Hearing Analog Recording of Waves Pulse Code Modulation and Digital Recording CDs, Wave Files, MP3s Editing Wave Files with BinEd 2 Waves A
More informationWolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig
Multimedia Databases Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.de 6 Audio Retrieval 6 Audio Retrieval 6.1 Basics of
More informationProceedings of Meetings on Acoustics
Proceedings of Meetings on Acoustics Volume 19, 213 http://acousticalsociety.org/ ICA 213 Montreal Montreal, Canada 2-7 June 213 Engineering Acoustics Session 2pEAb: Controlling Sound Quality 2pEAb1. Subjective
More informationThe MPEG-4 General Audio Coder
The MPEG-4 General Audio Coder Bernhard Grill Fraunhofer Institute for Integrated Circuits (IIS) grl 6/98 page 1 Outline MPEG-2 Advanced Audio Coding (AAC) MPEG-4 Extensions: Perceptual Noise Substitution
More information_APP B_549_10/31/06. Appendix B. Producing for Multimedia and the Web
1-59863-307-4_APP B_549_10/31/06 Appendix B Producing for Multimedia and the Web In addition to enabling regular music production, SONAR includes a number of features to help you create music for multimedia
More information3 Sound / Audio. CS 5513 Multimedia Systems Spring 2009 LECTURE. Imran Ihsan Principal Design Consultant
LECTURE 3 Sound / Audio CS 5513 Multimedia Systems Spring 2009 Imran Ihsan Principal Design Consultant OPUSVII www.opuseven.com Faculty of Engineering & Applied Sciences 1. The Nature of Sound Sound is
More informationRich Recording Technology Technical overall description
Rich Recording Technology Technical overall description Ari Koski Nokia with Windows Phones Product Engineering/Technology Multimedia/Audio/Audio technology management 1 Nokia s Rich Recording technology
More informationFor Mac and iphone. James McCartney Core Audio Engineer. Eric Allamanche Core Audio Engineer
For Mac and iphone James McCartney Core Audio Engineer Eric Allamanche Core Audio Engineer 2 3 James McCartney Core Audio Engineer 4 Topics About audio representation formats Converting audio Processing
More informationSpeech-Music Discrimination from MPEG-1 Bitstream
Speech-Music Discrimination from MPEG-1 Bitstream ROMAN JARINA, NOEL MURPHY, NOEL O CONNOR, SEÁN MARLOW Centre for Digital Video Processing / RINCE Dublin City University, Dublin 9 IRELAND jarinar@eeng.dcu.ie
More informationITEC310 Computer Networks II
ITEC310 Computer Networks II Chapter 29 Multimedia Department of Information Technology Eastern Mediterranean University 2/75 Objectives After completing this chapter you should be able to do the following:
More informationMultimedia Systems Speech I Mahdi Amiri February 2011 Sharif University of Technology
Course Presentation Multimedia Systems Speech I Mahdi Amiri February 2011 Sharif University of Technology Sound Sound is a sequence of waves of pressure which propagates through compressible media such
More informationITNP80: Multimedia! Sound-II!
Sound compression (I) Compression of sound data requires different techniques from those for graphical data Requirements are less stringent than for video data rate for CD-quality audio is much less than
More informationModeling of an MPEG Audio Layer-3 Encoder in Ptolemy
Modeling of an MPEG Audio Layer-3 Encoder in Ptolemy Patrick Brown EE382C Embedded Software Systems May 10, 2000 $EVWUDFW MPEG Audio Layer-3 is a standard for the compression of high-quality digital audio.
More informationStarlink 9003T1 T1/E1 Digital Transmission
Starlink 9003T1 T1/E1 Digital Transmission A Digital Audio via T1/E1 C ombining Moseley s unparalleled reputation for high quality RF aural Studio-Transmitter Links (STLs) with the performance and speed
More informationAET 1380 Digital Audio Formats
AET 1380 Digital Audio Formats Consumer Digital Audio Formats CDs --44.1 khz, 16 bit Television 48 khz, 16bit DVD 96 khz, 24bit How many more measurements does a DVD take? Bit Rate? Sample rate? Is it
More informationDesign and Implementation of an MPEG-1 Layer III Audio Decoder KRISTER LAGERSTRÖM
Design and Implementation of an MPEG-1 Layer III Audio Decoder KRISTER LAGERSTRÖM Master s Thesis Computer Science and Engineering Program CHALMERS UNIVERSITY OF TECHNOLOGY Department of Computer Engineering
More informationPQMF Filter Bank, MPEG-1 / MPEG-2 BC Audio. Fraunhofer IDMT
PQMF Filter Bank, MPEG-1 / MPEG-2 BC Audio The Basic Paradigm of T/F Domain Audio Coding Digital Audio Input Filter Bank Bit or Noise Allocation Quantized Samples Bitstream Formatting Encoded Bitstream
More information14th European Signal Processing Conference (EUSIPCO 2006), Florence, Italy, September 4-8, 2006, copyright by EURASIP
TRADEOFF BETWEEN COMPLEXITY AND MEMORY SIZE IN THE 3GPP ENHANCED PLUS DECODER: SPEED-CONSCIOUS AND MEMORY- CONSCIOUS DECODERS ON A 16-BIT FIXED-POINT DSP Osamu Shimada, Toshiyuki Nomura, Akihiko Sugiyama
More informationCOS 116 The Computational Universe Laboratory 4: Digital Sound and Music
COS 116 The Computational Universe Laboratory 4: Digital Sound and Music In this lab you will learn about digital representations of sound and music, especially focusing on the role played by frequency
More informationBluray (
Bluray (http://www.blu-ray.com/faq) MPEG-2 - enhanced for HD, also used for playback of DVDs and HDTV recordings MPEG-4 AVC - part of the MPEG-4 standard also known as H.264 (High Profile and Main Profile)
More informationLecture Information Multimedia Video Coding & Architectures
Multimedia Video Coding & Architectures (5LSE0), Module 01 Introduction to coding aspects 1 Lecture Information Lecturer Prof.dr.ir. Peter H.N. de With Faculty Electrical Engineering, University Technology
More informationBest-in-class audio recording
Best-in-class audio recording Philips Voice Tracer range 2013 New Philips Voice Tracer range Best-in-class audio recording Only the perfect combination of audio quality & ease of use delivers the best
More informationCompression transparent low-level description of audio signals
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 25 Compression transparent low-level description of audio signals Jason
More informationISOIIEC I I NTERNATI O NA L S TA NDA R D. Information technology
This is a preview click here to buy the full publication NTERNAT O NA L S TA NDA R D SOEC 723 First edition 993080 nformation technology Coding of moving pictures and associated audio for digital storage
More informationUNDERSTANDING MUSIC & VIDEO FORMATS
ComputerFixed.co.uk Page: 1 Email: info@computerfixed.co.uk UNDERSTANDING MUSIC & VIDEO FORMATS Are you confused with all the different music and video formats available? Do you know the difference between
More informationAudio Coding Standards
Audio Standards Kari Pihkala 13.2.2002 Tik-111.590 Multimedia Outline Architectural Overview MPEG-1 MPEG-2 MPEG-4 Philips PASC (DCC cassette) Sony ATRAC (MiniDisc) Dolby AC-3 Conclusions 2 Architectural
More informationFundamental of Digital Media Design. Introduction to Audio
Fundamental of Digital Media Design Introduction to Audio by Noraniza Samat Faculty of Computer Systems & Software Engineering noraniza@ump.edu.my OER Fundamental of Digital Media Design by Noraniza Samat
More informationDigital Recording and Playback
Digital Recording and Playback Digital recording is discrete a sound is stored as a set of discrete values that correspond to the amplitude of the analog wave at particular times Source: http://www.cycling74.com/docs/max5/tutorials/msp-tut/mspdigitalaudio.html
More informationLossy compression CSCI 470: Web Science Keith Vertanen Copyright 2013
Lossy compression CSCI 470: Web Science Keith Vertanen Copyright 2013 Digital audio Overview Sampling rate Quan5za5on MPEG audio layer 3 (MP3) JPEG s5ll images Color space conversion, downsampling Discrete
More informationParametric Coding of Spatial Audio
Parametric Coding of Spatial Audio Ph.D. Thesis Christof Faller, September 24, 2004 Thesis advisor: Prof. Martin Vetterli Audiovisual Communications Laboratory, EPFL Lausanne Parametric Coding of Spatial
More informationINTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO
INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO ISO/IEC JTC1/SC29/WG11 N15071 February 2015, Geneva,
More informationCHAPTER 6 Audio compression in practice
CHAPTER 6 Audio compression in practice In earlier chapters we have seen that digital sound is simply an array of numbers, where each number is a measure of the air pressure at a particular time. This
More informationSqueeze Play: The State of Ady0 Cmprshn. Scott Selfon Senior Development Lead Xbox Advanced Technology Group Microsoft
Squeeze Play: The State of Ady0 Cmprshn Scott Selfon Senior Development Lead Xbox Advanced Technology Group Microsoft Agenda Why compress? The tools at present Measuring success A glimpse of the future
More informationEfficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio
Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio Dr. Jürgen Herre 11/07 Page 1 Jürgen Herre für (IIS) Erlangen, Germany Introduction: Sound Images? Humans
More informationISO/IEC INTERNATIONAL STANDARD
INTERNATIONAL STANDARD ISO/IEC 13818-7 Second edition 2003-08-01 Information technology Generic coding of moving pictures and associated audio information Part 7: Advanced Audio Coding (AAC) Technologies
More informationExport Audio Mixdown
26 Introduction The function in Cubase Essential allows you to mix down audio from the program to a file on your hard disk. You always mix down an output bus. For example, if you have set up a stereo mix
More informationDigital Image Processing
Digital Image Processing Fundamentals of Image Compression DR TANIA STATHAKI READER (ASSOCIATE PROFFESOR) IN SIGNAL PROCESSING IMPERIAL COLLEGE LONDON Compression New techniques have led to the development
More information