Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd

Size: px
Start display at page:

Download "Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd"

Transcription

1 Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 1

2 Overview Audio Signal Processing Applications Audio Signal Processing Basics Sampling What is an audio signal? Signal Processing Domains Case Study 1 Headphone Virtualisation Frequency Response FIR filtering Computational Complexity Case Study 2 Perceptual Audio Coding Psychoacoustics Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 2

3 Audio Signal Processing Applications Cinema Delivering channel based audio Distribute movies to multiple screens in a multiplex Cinemas use speaker arrays rather than single speakers so processing required to fill the arrays from single channel feeds Rendering object based audio Dolby Atmos Cinema soundtrack is express as individual objects and locations - in every cinema the movie is renderer for that specific cinema s speaker locations Speaker equalisation & protection Process the audio sent to each speaker to compensate for the frequency response of the speaker cones. Ensure that audio sent to the speakers doesn t over driver the speaker cones, which would damage them. Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 3

4 Audio Signal Processing Applications Broadcast / Home Theatre Compression of Audio for DVD / Blu-ray Disc Perceptual audio coding (case study later) Multi-channel audio coding Multiple languages Multiple playback formats (stereo / 5.1 / etc) Broadcast end-to-end Capture, coding, transmission, playback AV Receivers (AVRs), Set Top Boxes (STBs) Games consoles Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 4

5 Audio Signal Processing Applications Personal Audio Devices Mobile phones (feature phones & smart phones) Tablets Music players PCs Same issues as Home Theatre, but usually more limited acoustic hardware (i.e. cheap speakers) Headphone playback is a big use case (case study later) Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 5

6 Audio Signal Processing Applications Voice Processing Many of the same basic challenges but because speech has some specifically different characteristics from general audio, different solutions exist Speech coders use different approaches than audio codecs What makes a good codec is measured differently The transmission bandwidths used for the data is much more limited Conferencing & Telephony Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 6

7 Audio Signal Processing Basics Sampling Digital signals have samples which are discrete in time and magnitude Process of converting a continuous signal to the digital domain is Sampling Two key questions when sampling are: How often to sample & how precisely? Analogue to Digital Converter (ADC) Digital Signal Processing Digital to Analogue Converter (DAC) Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 7

8 Audio Signal Processing Basics Sampling Frequency Fs (how often?) Number of samples per second Nyquist rate: Greater than twice the highest frequency Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 8

9 Audio Signal Processing Basics Resolution (how precisely?) Each sample is represented by a number, how many bits should we use? Converting a continuous value to a discrete value requires quantisation. Quantisation Error Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 9

10 Audio Signal Processing Basics Resolution (how precisely?) By using more bits, we reduce the error 101 skipping all the math Each additional bit improves SNR (signal to noise ratio) by 6.01 db Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 10

11 Audio Signal Processing Basics Audio Signal Sampling Frequency Human perception 20 Hz 20,000 Hz Nyquist says Fs >= 40 khz CD Audio: 44.1 khz Blu-ray (and before that DAT): 48 khz Bit depth Range of loudness relative to human hearing Threshold of hearing 0 db Jet Engines db Busy Road (standing at the curb) 100 db Sustained exposure will cause damage 85dB 16 bits per sample gives ~ 96 db of dynamic range 24 bits per sample = 144 db Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 11

12 Audio Signal Processing Basics Audio Signal Raw data rate 48 khz, 16 bits per sample = 768 kbps / ch 3.86 GB for a 2hr movie (5.1 channels) Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 12

13 Audio Signal Processing Basics Processing domains Sampled audio i.e., Pulse Code Modulated (PCM) data is in the time domain Not everything we want to do with audio is formulated as a time domain operation E.g.: Flattening the frequency response of a speaker The Fourier Transform expresses a signal in terms of it s frequency components (sinusoids). Using it we can formulate processing in the frequency domain Whether processing is implemented in the time or the frequency domain can depend on where it is most efficient. Signal processing also has other useful transform domains which may offer advantages for specific types of processing (e.g. image coding often uses the discrete cosine transform DCT) Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 13

14 Case Study 1 HEADPHONE VIRTUALISATION Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 14

15 Headphone Virtualisation How do you get surround sound out of a pair of headphones? Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 15

16 Headphone Virtualisation Two things we need to achieve: 1. Make it sound like the audio is coming from different directions 2. Make it sound like the listener is in a room. Both can be achieved by filtering the signal using the impulse response of the room, or the head-related transfer functions. Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 16

17 Headphone Virtualisation Room impulse response By measuring how a short impulsive sound is altered by a room, the room s reflections and echoes can be characterised to create an impulse response. The impulse response can in turn be used to filter any signal, to make it sound like it was in the room. The process of filtering a signal using an impulse response is convolution: y[n] = h k x n k k= Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 17

18 Headphone Virtualisation Room impulse response How many points would be required to capture a room? (i.e. how long is the impulse response?) Limiting the impulse response to 50ms gives us 1440 points (@48kHz) Considering the computational cost 1440 * 48k > 69 MFLOPS Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 18

19 Headphone Virtualisation Computational load On a DSP chip with a single cycle MAC -> 69 MIPS On an ARM, MAC s ~ 3.5 cycles each -> ~240 MIPS 5.1 channels -> 10 filters = 2,400 MIPS Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 19

20 Headphone Virtualisation The solution? Convolution in Time domain <-> Multiplication in Frequency Domain Fourier Transform the impulse response & the signal Block based, e.g., blocks of 2048 O[N.log2(N)] -> k*22528 ~ 78,848 Operate in the Frequency domain, Complex multiplies -> 4 * > 8,192 Transform the result back to the time domain. Same as forward transform Blocks per second? 23 blocks/sec ~4 MFLOPS / filter What about the HRTFs? Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 20

21 Headphone Virtualisation Head-related Transfer Function Measured on a dummy Applied as filters Same computational arguments lead us to the need to apply these in the frequency domain. NB: we don t need to go back to the time domain between the two sets of filters Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 21

22 Case study 2 PERCEPTUAL AUDIO CODING Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 22

23 Perceptual Audio Coding How do you reduce the storage and transmission bandwidth requirements of Audio signals? Bitrates: Uncompressed : 768 kbps / ch DVD (AC3) : 448 kbps (5.1 channels) Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 23

24 Perceptual Audio Coding Audio Coding is Lossy Lossless compression: must perfectly reconstruct their source. (zip files) Lossy compression: can throw away data if it isn t needed. The reconstruction need only be good enough. Deciding which bits to throw away and what is good enough is the hard part. Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 24

25 Perceptual Audio Coding Time/Frequency analysis Quantisation Entropy coding Psychoacoustic analysis Bit allocation Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 25

26 Perceptual Audio Coding Psychoacoustics Study of sound Perception Perception implies the human experience which include physiological and psychological factors. Is at the heart of the question of which parts of an audio signal are important, or unimportant. Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 26

27 Perceptual Audio Coding Psychoacoustics Most perceptual quantities are non-linear and subjective Loudness Non-linearly related to sound pressure Scales include: sone, phon Pitch Non-linearly related to frequency Scales include: Bark, Mel, ERB Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 27

28 Perceptual Audio Coding Frequency Masking Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 28

29 Perceptual Audio Coding Temporal Masking Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 29

30 Perceptual Audio Coding Time/Frequency analysis Break the incoming signal into time blocks and transform into the frequency domain Coding is always block based The frequency representation is analysed in bins of equal perceptual bandwidth (bark) Psychoacoustic analysis Use the frequency representation of the current block to calculate the masking curve Use the frequency masking curves from previous frames to account for temporal masking Time/Frequency analysis Psychoacoustic analysis Quantisation Bit allocation Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 30

31 Perceptual Audio Coding Masking Curve Areas of the spectrum where the masking curve is above the signal energy, represent things we can t hear If we can t hear them, we shouldn t spend bits encoding them Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 31

32 Perceptual Audio Coding Bit allocation Using the masking curve, we can calculate the allowed signal to noise ratio in each of the frequency bands Knowing that allocating a bit to a quantiser improves SNR by 6 db, iterative allocate the bits available in the bit pool to band, until we either; run out of bits, or exceed the SNR requirements in all bands (any left over bits can be used to code the next frame) The bit distribution must be sent to the decoder Quantiser Quantise the frequency domain representation to send to the decoder. Time/Frequency analysis Psychoacoustic analysis Quantisation Bit allocation Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 32

33 Perceptual Audio Coding Decoding is simple Recreate the frequency representation of each frame Transform back to the time domain Additional processing can be used to enhance the reconstructed signal Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 33

34 Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories, Inc. 34

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd Overview Audio Signal Processing Applications @ Dolby Audio Signal Processing Basics

More information

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects. Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general

More information

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general

More information

Principles of Audio Coding

Principles of Audio Coding Principles of Audio Coding Topics today Introduction VOCODERS Psychoacoustics Equal-Loudness Curve Frequency Masking Temporal Masking (CSIT 410) 2 Introduction Speech compression algorithm focuses on exploiting

More information

Chapter 14 MPEG Audio Compression

Chapter 14 MPEG Audio Compression Chapter 14 MPEG Audio Compression 14.1 Psychoacoustics 14.2 MPEG Audio 14.3 Other Commercial Audio Codecs 14.4 The Future: MPEG-7 and MPEG-21 14.5 Further Exploration 1 Li & Drew c Prentice Hall 2003 14.1

More information

5: Music Compression. Music Coding. Mark Handley

5: Music Compression. Music Coding. Mark Handley 5: Music Compression Mark Handley Music Coding LPC-based codecs model the sound source to achieve good compression. Works well for voice. Terrible for music. What if you can t model the source? Model the

More information

CHAPTER 10: SOUND AND VIDEO EDITING

CHAPTER 10: SOUND AND VIDEO EDITING CHAPTER 10: SOUND AND VIDEO EDITING What should you know 1. Edit a sound clip to meet the requirements of its intended application and audience a. trim a sound clip to remove unwanted material b. join

More information

DSP. Presented to the IEEE Central Texas Consultants Network by Sergio Liberman

DSP. Presented to the IEEE Central Texas Consultants Network by Sergio Liberman DSP The Technology Presented to the IEEE Central Texas Consultants Network by Sergio Liberman Abstract The multimedia products that we enjoy today share a common technology backbone: Digital Signal Processing

More information

EE482: Digital Signal Processing Applications

EE482: Digital Signal Processing Applications Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 13 Audio Signal Processing 14/04/01 http://www.ee.unlv.edu/~b1morris/ee482/

More information

Audio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011

Audio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011 Audio Fundamentals, Compression Techniques & Standards Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011 Outlines Audio Fundamentals Sampling, digitization, quantization μ-law

More information

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding Perceptual Coding Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding Part II wrap up 6.082 Fall 2006 Perceptual Coding, Slide 1 Lossless vs.

More information

2.4 Audio Compression

2.4 Audio Compression 2.4 Audio Compression 2.4.1 Pulse Code Modulation Audio signals are analog waves. The acoustic perception is determined by the frequency (pitch) and the amplitude (loudness). For storage, processing and

More information

ELL 788 Computational Perception & Cognition July November 2015

ELL 788 Computational Perception & Cognition July November 2015 ELL 788 Computational Perception & Cognition July November 2015 Module 11 Audio Engineering: Perceptual coding Coding and decoding Signal (analog) Encoder Code (Digital) Code (Digital) Decoder Signal (analog)

More information

Audio-coding standards

Audio-coding standards Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.

More information

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved.

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved. Compressed Audio Demystified Why Music Producers Need to Care About Compressed Audio Files Download Sales Up CD Sales Down High-Definition hasn t caught on yet Consumers don t seem to care about high fidelity

More information

Enhanced Audio Features for High- Definition Broadcasts and Discs. Roland Vlaicu Dolby Laboratories, Inc.

Enhanced Audio Features for High- Definition Broadcasts and Discs. Roland Vlaicu Dolby Laboratories, Inc. Enhanced Audio Features for High- Definition Broadcasts and Discs Roland Vlaicu Dolby Laboratories, Inc. Entertainment is Changing High definition video Flat panel televisions Plasma LCD DLP New broadcasting

More information

Data Compression. Audio compression

Data Compression. Audio compression 1 Data Compression Audio compression Outline Basics of Digital Audio 2 Introduction What is sound? Signal-to-Noise Ratio (SNR) Digitization Filtering Sampling and Nyquist Theorem Quantization Synthetic

More information

Audio Coding and MP3

Audio Coding and MP3 Audio Coding and MP3 contributions by: Torbjørn Ekman What is Sound? Sound waves: 20Hz - 20kHz Speed: 331.3 m/s (air) Wavelength: 165 cm - 1.65 cm 1 Analogue audio frequencies: 20Hz - 20kHz mono: x(t)

More information

Figure 1. Generic Encoder. Window. Spectral Analysis. Psychoacoustic Model. Quantize. Pack Data into Frames. Additional Coding.

Figure 1. Generic Encoder. Window. Spectral Analysis. Psychoacoustic Model. Quantize. Pack Data into Frames. Additional Coding. Introduction to Digital Audio Compression B. Cavagnolo and J. Bier Berkeley Design Technology, Inc. 2107 Dwight Way, Second Floor Berkeley, CA 94704 (510) 665-1600 info@bdti.com http://www.bdti.com INTRODUCTION

More information

Squeeze Play: The State of Ady0 Cmprshn. Scott Selfon Senior Development Lead Xbox Advanced Technology Group Microsoft

Squeeze Play: The State of Ady0 Cmprshn. Scott Selfon Senior Development Lead Xbox Advanced Technology Group Microsoft Squeeze Play: The State of Ady0 Cmprshn Scott Selfon Senior Development Lead Xbox Advanced Technology Group Microsoft Agenda Why compress? The tools at present Measuring success A glimpse of the future

More information

The following bit rates are recommended for broadcast contribution employing the most commonly used audio coding schemes:

The following bit rates are recommended for broadcast contribution employing the most commonly used audio coding schemes: Page 1 of 8 1. SCOPE This Operational Practice sets out guidelines for minimising the various artefacts that may distort audio signals when low bit-rate coding schemes are employed to convey contribution

More information

Lecture 16 Perceptual Audio Coding

Lecture 16 Perceptual Audio Coding EECS 225D Audio Signal Processing in Humans and Machines Lecture 16 Perceptual Audio Coding 2012-3-14 Professor Nelson Morgan today s lecture by John Lazzaro www.icsi.berkeley.edu/eecs225d/spr12/ Hero

More information

Audio-coding standards

Audio-coding standards Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.

More information

AUDIOVISUAL COMMUNICATION

AUDIOVISUAL COMMUNICATION AUDIOVISUAL COMMUNICATION Laboratory Session: Audio Processing and Coding The objective of this lab session is to get the students familiar with audio processing and coding, notably psychoacoustic analysis

More information

Audio Compression. Audio Compression. Absolute Threshold. CD quality audio:

Audio Compression. Audio Compression. Absolute Threshold. CD quality audio: Audio Compression Audio Compression CD quality audio: Sampling rate = 44 KHz, Quantization = 16 bits/sample Bit-rate = ~700 Kb/s (1.41 Mb/s if 2 channel stereo) Telephone-quality speech Sampling rate =

More information

Audio and video compression

Audio and video compression Audio and video compression 4.1 introduction Unlike text and images, both audio and most video signals are continuously varying analog signals. Compression algorithms associated with digitized audio and

More information

CISC 7610 Lecture 3 Multimedia data and data formats

CISC 7610 Lecture 3 Multimedia data and data formats CISC 7610 Lecture 3 Multimedia data and data formats Topics: Perceptual limits of multimedia data JPEG encoding of images MPEG encoding of audio MPEG and H.264 encoding of video Multimedia data: Perceptual

More information

Multimedia Communications. Audio coding

Multimedia Communications. Audio coding Multimedia Communications Audio coding Introduction Lossy compression schemes can be based on source model (e.g., speech compression) or user model (audio coding) Unlike speech, audio signals can be generated

More information

CS 074 The Digital World. Digital Audio

CS 074 The Digital World. Digital Audio CS 074 The Digital World Digital Audio 1 Digital Audio Waves Hearing Analog Recording of Waves Pulse Code Modulation and Digital Recording CDs, Wave Files, MP3s Editing Wave Files with BinEd 2 Waves A

More information

AUDIOVISUAL COMMUNICATION

AUDIOVISUAL COMMUNICATION AUDIOVISUAL COMMUNICATION Laboratory Session: Audio Processing and Coding The objective of this lab session is to get the students familiar with audio processing and coding, notably psychoacoustic analysis

More information

AET 1380 Digital Audio Formats

AET 1380 Digital Audio Formats AET 1380 Digital Audio Formats Consumer Digital Audio Formats CDs --44.1 khz, 16 bit Television 48 khz, 16bit DVD 96 khz, 24bit How many more measurements does a DVD take? Bit Rate? Sample rate? Is it

More information

Implementation of a MPEG 1 Layer I Audio Decoder with Variable Bit Lengths

Implementation of a MPEG 1 Layer I Audio Decoder with Variable Bit Lengths Implementation of a MPEG 1 Layer I Audio Decoder with Variable Bit Lengths A thesis submitted in fulfilment of the requirements of the degree of Master of Engineering 23 September 2008 Damian O Callaghan

More information

Appendix 4. Audio coding algorithms

Appendix 4. Audio coding algorithms Appendix 4. Audio coding algorithms 1 Introduction The main application of audio compression systems is to obtain compact digital representations of high-quality (CD-quality) wideband audio signals. Typically

More information

Mpeg 1 layer 3 (mp3) general overview

Mpeg 1 layer 3 (mp3) general overview Mpeg 1 layer 3 (mp3) general overview 1 Digital Audio! CD Audio:! 16 bit encoding! 2 Channels (Stereo)! 44.1 khz sampling rate 2 * 44.1 khz * 16 bits = 1.41 Mb/s + Overhead (synchronization, error correction,

More information

Lecture #3: Digital Music and Sound

Lecture #3: Digital Music and Sound Lecture #3: Digital Music and Sound CS106E Spring 2018, Young In this lecture we take a look at how computers represent music and sound. One very important concept we ll come across when studying digital

More information

The MPEG-4 General Audio Coder

The MPEG-4 General Audio Coder The MPEG-4 General Audio Coder Bernhard Grill Fraunhofer Institute for Integrated Circuits (IIS) grl 6/98 page 1 Outline MPEG-2 Advanced Audio Coding (AAC) MPEG-4 Extensions: Perceptual Noise Substitution

More information

Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio

Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio Dr. Jürgen Herre 11/07 Page 1 Jürgen Herre für (IIS) Erlangen, Germany Introduction: Sound Images? Humans

More information

ITNP80: Multimedia! Sound-II!

ITNP80: Multimedia! Sound-II! Sound compression (I) Compression of sound data requires different techniques from those for graphical data Requirements are less stringent than for video data rate for CD-quality audio is much less than

More information

What is multimedia? Multimedia. Continuous media. Most common media types. Continuous media processing. Interactivity. What is multimedia?

What is multimedia? Multimedia. Continuous media. Most common media types. Continuous media processing. Interactivity. What is multimedia? Multimedia What is multimedia? Media types +Text + Graphics + Audio +Image +Video Interchange formats What is multimedia? Multimedia = many media User interaction = interactivity Script = time 1 2 Most

More information

Ch. 5: Audio Compression Multimedia Systems

Ch. 5: Audio Compression Multimedia Systems Ch. 5: Audio Compression Multimedia Systems Prof. Ben Lee School of Electrical Engineering and Computer Science Oregon State University Chapter 5: Audio Compression 1 Introduction Need to code digital

More information

Multimedia. What is multimedia? Media types. Interchange formats. + Text +Graphics +Audio +Image +Video. Petri Vuorimaa 1

Multimedia. What is multimedia? Media types. Interchange formats. + Text +Graphics +Audio +Image +Video. Petri Vuorimaa 1 Multimedia What is multimedia? Media types + Text +Graphics +Audio +Image +Video Interchange formats Petri Vuorimaa 1 What is multimedia? Multimedia = many media User interaction = interactivity Script

More information

CHAPTER 6 Audio compression in practice

CHAPTER 6 Audio compression in practice CHAPTER 6 Audio compression in practice In earlier chapters we have seen that digital sound is simply an array of numbers, where each number is a measure of the air pressure at a particular time. This

More information

ijdsp Interactive Illustrations of Speech/Audio Processing Concepts

ijdsp Interactive Illustrations of Speech/Audio Processing Concepts ijdsp Interactive Illustrations of Speech/Audio Processing Concepts NSF Phase 3 Workshop, UCy Presentation of an Independent Study By Girish Kalyanasundaram, MS by Thesis in EE Advisor: Dr. Andreas Spanias,

More information

DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS

DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS Television services in Europe currently broadcast video at a frame rate of 25 Hz. Each frame consists of two interlaced fields, giving a field rate of 50

More information

UNDERSTANDING MUSIC & VIDEO FORMATS

UNDERSTANDING MUSIC & VIDEO FORMATS ComputerFixed.co.uk Page: 1 Email: info@computerfixed.co.uk UNDERSTANDING MUSIC & VIDEO FORMATS Are you confused with all the different music and video formats available? Do you know the difference between

More information

/ / _ / _ / _ / / / / /_/ _/_/ _/_/ _/_/ _\ / All-American-Advanced-Audio-Codec

/ / _ / _ / _ / / / / /_/ _/_/ _/_/ _/_/ _\ / All-American-Advanced-Audio-Codec / / _ / _ / _ / / / / /_/ _/_/ _/_/ _/_/ _\ / All-American-Advanced-Audio-Codec () **Z ** **=Z ** **= ==== == **= ==== \"\" === ==== \"\"\" ==== \"\"\"\" Tim O Brien Colin Sullivan Jennifer Hsu Mayank

More information

Digital Media. Daniel Fuller ITEC 2110

Digital Media. Daniel Fuller ITEC 2110 Digital Media Daniel Fuller ITEC 2110 Daily Question: Digital Audio What values contribute to the file size of a digital audio file? Email answer to DFullerDailyQuestion@gmail.com Subject Line: ITEC2110-09

More information

Parametric Coding of High-Quality Audio

Parametric Coding of High-Quality Audio Parametric Coding of High-Quality Audio Prof. Dr. Gerald Schuller Fraunhofer IDMT & Ilmenau Technical University Ilmenau, Germany 1 Waveform vs Parametric Waveform Filter-bank approach Mainly exploits

More information

Wavelet filter bank based wide-band audio coder

Wavelet filter bank based wide-band audio coder Wavelet filter bank based wide-band audio coder J. Nováček Czech Technical University, Faculty of Electrical Engineering, Technicka 2, 16627 Prague, Czech Republic novacj1@fel.cvut.cz 3317 New system for

More information

KINGS COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY ACADEMIC YEAR / ODD SEMESTER QUESTION BANK

KINGS COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY ACADEMIC YEAR / ODD SEMESTER QUESTION BANK KINGS COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY ACADEMIC YEAR 2011-2012 / ODD SEMESTER QUESTION BANK SUB.CODE / NAME YEAR / SEM : IT1301 INFORMATION CODING TECHNIQUES : III / V UNIT -

More information

Aud-X 5.1 Help.

Aud-X 5.1 Help. Aud-X 5.1 Help www.aud-x.com Overview Aud-X has been engineered as 5.1 channel surround sound CODEC, and is dedicated for the use with Xvid/Divx based movies. It is of course also possible to compress

More information

For Mac and iphone. James McCartney Core Audio Engineer. Eric Allamanche Core Audio Engineer

For Mac and iphone. James McCartney Core Audio Engineer. Eric Allamanche Core Audio Engineer For Mac and iphone James McCartney Core Audio Engineer Eric Allamanche Core Audio Engineer 2 3 James McCartney Core Audio Engineer 4 Topics About audio representation formats Converting audio Processing

More information

1. Before adjusting sound quality

1. Before adjusting sound quality 1. Before adjusting sound quality Functions available when the optional 5.1 ch decoder/av matrix unit is connected The following table shows the finer audio adjustments that can be performed when the optional

More information

A Digital Audio Primer

A Digital Audio Primer Conversion of Sound Wave to Analog Signal A Digital Audio Primer Many people don t care about the technology behind their stereo system. As long as it sounds good and they can press a button and listen

More information

MPEG-l.MPEG-2, MPEG-4

MPEG-l.MPEG-2, MPEG-4 The MPEG Handbook MPEG-l.MPEG-2, MPEG-4 Second edition John Watkinson PT ^PVTPR AMSTERDAM BOSTON HEIDELBERG LONDON. NEW YORK OXFORD PARIS SAN DIEGO SAN FRANCISCO SINGAPORE SYDNEY TOKYO Focal Press is an

More information

Chapter 5.5 Audio Programming

Chapter 5.5 Audio Programming Chapter 5.5 Audio Programming Audio Programming Audio in games is more important than ever before 2 Programming Basic Audio Most gaming hardware has similar capabilities (on similar platforms) Mostly programming

More information

Principles of MPEG audio compression

Principles of MPEG audio compression Principles of MPEG audio compression Principy komprese hudebního signálu metodou MPEG Petr Kubíček Abstract The article describes briefly audio data compression. Focus of the article is a MPEG standard,

More information

Fundamental of Digital Media Design. Introduction to Audio

Fundamental of Digital Media Design. Introduction to Audio Fundamental of Digital Media Design Introduction to Audio by Noraniza Samat Faculty of Computer Systems & Software Engineering noraniza@ump.edu.my OER Fundamental of Digital Media Design by Noraniza Samat

More information

Embedding Audio into your RX Application

Embedding Audio into your RX Application Embedding Audio into your RX Application Renesas Electronics America Inc. Renesas Technology & Solution Portfolio 2 Microcontroller and Microprocessor Line-up 2010 2013 32-bit 8/16-bit 1200 DMIPS, Superscalar

More information

Bluray (

Bluray ( Bluray (http://www.blu-ray.com/faq) MPEG-2 - enhanced for HD, also used for playback of DVDs and HDTV recordings MPEG-4 AVC - part of the MPEG-4 standard also known as H.264 (High Profile and Main Profile)

More information

Parametric Coding of Spatial Audio

Parametric Coding of Spatial Audio Parametric Coding of Spatial Audio Ph.D. Thesis Christof Faller, September 24, 2004 Thesis advisor: Prof. Martin Vetterli Audiovisual Communications Laboratory, EPFL Lausanne Parametric Coding of Spatial

More information

DAB. Digital Audio Broadcasting

DAB. Digital Audio Broadcasting DAB Digital Audio Broadcasting DAB history DAB has been under development since 1981 at the Institut für Rundfunktechnik (IRT). In 1985 the first DAB demonstrations were held at the WARC-ORB in Geneva

More information

Module 9 AUDIO CODING. Version 2 ECE IIT, Kharagpur

Module 9 AUDIO CODING. Version 2 ECE IIT, Kharagpur Module 9 AUDIO CODING Lesson 29 Transform and Filter banks Instructional Objectives At the end of this lesson, the students should be able to: 1. Define the three layers of MPEG-1 audio coding. 2. Define

More information

Optical Storage Technology. MPEG Data Compression

Optical Storage Technology. MPEG Data Compression Optical Storage Technology MPEG Data Compression MPEG-1 1 Audio Standard Moving Pictures Expert Group (MPEG) was formed in 1988 to devise compression techniques for audio and video. It first devised the

More information

Speech-Coding Techniques. Chapter 3

Speech-Coding Techniques. Chapter 3 Speech-Coding Techniques Chapter 3 Introduction Efficient speech-coding techniques Advantages for VoIP Digital streams of ones and zeros The lower the bandwidth, the lower the quality RTP payload types

More information

Networking Applications

Networking Applications Networking Dr. Ayman A. Abdel-Hamid College of Computing and Information Technology Arab Academy for Science & Technology and Maritime Transport Multimedia Multimedia 1 Outline Audio and Video Services

More information

MPEG-4 General Audio Coding

MPEG-4 General Audio Coding MPEG-4 General Audio Coding Jürgen Herre Fraunhofer Institute for Integrated Circuits (IIS) Dr. Jürgen Herre, hrr@iis.fhg.de 1 General Audio Coding Solid state players, Internet audio, terrestrial and

More information

Introduction to LAN/WAN. Application Layer 4

Introduction to LAN/WAN. Application Layer 4 Introduction to LAN/WAN Application Layer 4 Multimedia Multimedia: Audio + video Human ear: 20Hz 20kHz, Dogs hear higher freqs DAC converts audio waves to digital E.g PCM uses 8-bit samples 8000 times

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 213 http://acousticalsociety.org/ ICA 213 Montreal Montreal, Canada 2-7 June 213 Engineering Acoustics Session 2pEAb: Controlling Sound Quality 2pEAb1. Subjective

More information

Port of a Fixed Point MPEG-2 AAC Encoder on a ARM Platform

Port of a Fixed Point MPEG-2 AAC Encoder on a ARM Platform Port of a Fixed Point MPEG-2 AAC Encoder on a ARM Platform by Romain Pagniez romain@felinewave.com A Dissertation submitted in partial fulfillment of the requirements for the Degree of Master of Science

More information

University of Pennsylvania Department of Electrical and Systems Engineering Digital Audio Basics

University of Pennsylvania Department of Electrical and Systems Engineering Digital Audio Basics University of Pennsylvania Department of Electrical and Systems Engineering Digital Audio Basics ESE250 Spring 2013 Lab 7: Psychoacoustic Compression Friday, February 22, 2013 For Lab Session: Thursday,

More information

3 Sound / Audio. CS 5513 Multimedia Systems Spring 2009 LECTURE. Imran Ihsan Principal Design Consultant

3 Sound / Audio. CS 5513 Multimedia Systems Spring 2009 LECTURE. Imran Ihsan Principal Design Consultant LECTURE 3 Sound / Audio CS 5513 Multimedia Systems Spring 2009 Imran Ihsan Principal Design Consultant OPUSVII www.opuseven.com Faculty of Engineering & Applied Sciences 1. The Nature of Sound Sound is

More information

Features and Benefits Integrated Twin HD Digital Tuner

Features and Benefits Integrated Twin HD Digital Tuner Blu-ray Disc Recorder with 250GB HDD & Twin HD Tuner [DMR-BW750] Blu-ray Disc Recorder with 250GB HDD & Twin HD Tuner RRP: $1,429 [GST inc.] Integrated Twin HD Digital Tuner, Record Two Programs Simultaneously

More information

CSCD 443/533 Advanced Networks Fall 2017

CSCD 443/533 Advanced Networks Fall 2017 CSCD 443/533 Advanced Networks Fall 2017 Lecture 18 Compression of Video and Audio 1 Topics Compression technology Motivation Human attributes make it possible Audio Compression Video Compression Performance

More information

Professor Laurence S. Dooley. School of Computing and Communications Milton Keynes, UK

Professor Laurence S. Dooley. School of Computing and Communications Milton Keynes, UK Professor Laurence S. Dooley School of Computing and Communications Milton Keynes, UK How many bits required? 2.4Mbytes 84Kbytes 9.8Kbytes 50Kbytes Data Information Data and information are NOT the same!

More information

Fundamentals of Perceptual Audio Encoding. Craig Lewiston HST.723 Lab II 3/23/06

Fundamentals of Perceptual Audio Encoding. Craig Lewiston HST.723 Lab II 3/23/06 Fundamentals of Perceptual Audio Encoding Craig Lewiston HST.723 Lab II 3/23/06 Goals of Lab Introduction to fundamental principles of digital audio & perceptual audio encoding Learn the basics of psychoacoustic

More information

Multimedia Technology

Multimedia Technology Multimedia Application An (usually) interactive piece of software which communicates to the user using several media e.g Text, graphics (illustrations, photos), audio (music, sounds), animation and video.

More information

_äìé`çêé. Audio Compression Codec Specifications and Requirements. Application Note. Issue 2

_äìé`çêé. Audio Compression Codec Specifications and Requirements. Application Note. Issue 2 _äìé`çêé Audio Compression Codec Specifications and Requirements Application Note Issue 2 CSR Cambridge Science Park Milton Road Cambridge CB4 0WH United Kingdom Registered in England 3665875 Tel: +44

More information

Computing in the Modern World

Computing in the Modern World Computing in the Modern World BCS-CMW-7: Data Representation Wayne Summers Marion County October 25, 2011 There are 10 kinds of people in the world: those who understand binary and those who don t. Pre-exercises

More information

Speech and audio coding

Speech and audio coding Institut Mines-Telecom Speech and audio coding Marco Cagnazzo, cagnazzo@telecom-paristech.fr MN910 Advanced compression Outline Introduction Introduction Speech signal Music signal Masking Codeurs simples

More information

HAVE YOUR CAKE AND HEAR IT TOO: A HUFFMAN CODED, BLOCK SWITCHING, STEREO PERCEPTUAL AUDIO CODER

HAVE YOUR CAKE AND HEAR IT TOO: A HUFFMAN CODED, BLOCK SWITCHING, STEREO PERCEPTUAL AUDIO CODER HAVE YOUR CAKE AND HEAR IT TOO: A HUFFMAN CODED, BLOCK SWITCHING, STEREO PERCEPTUAL AUDIO CODER Rob Colcord, Elliot Kermit-Canfield and Blane Wilson Center for Computer Research in Music and Acoustics,

More information

Contents. 3 Vector Quantization The VQ Advantage Formulation Optimality Conditions... 48

Contents. 3 Vector Quantization The VQ Advantage Formulation Optimality Conditions... 48 Contents Part I Prelude 1 Introduction... 3 1.1 Audio Coding... 4 1.2 Basic Idea... 6 1.3 Perceptual Irrelevance... 8 1.4 Statistical Redundancy... 9 1.5 Data Modeling... 9 1.6 Resolution Challenge...

More information

Multimedia Systems Speech I Mahdi Amiri February 2011 Sharif University of Technology

Multimedia Systems Speech I Mahdi Amiri February 2011 Sharif University of Technology Course Presentation Multimedia Systems Speech I Mahdi Amiri February 2011 Sharif University of Technology Sound Sound is a sequence of waves of pressure which propagates through compressible media such

More information

UHD Audio Center. Operation Manual

UHD Audio Center. Operation Manual HDACC DCT-37 II-4K UHD Audio Center Operation Manual DISCLAIMERS The information in this manual has been carefully checked and is believed to be accurate. Essence For Hi Res Audio assumes no responsibility

More information

New Results in Low Bit Rate Speech Coding and Bandwidth Extension

New Results in Low Bit Rate Speech Coding and Bandwidth Extension Audio Engineering Society Convention Paper Presented at the 121st Convention 2006 October 5 8 San Francisco, CA, USA This convention paper has been reproduced from the author's advance manuscript, without

More information

Data Representation. Reminders. Sound What is sound? Interpreting bits to give them meaning. Part 4: Media - Sound, Video, Compression

Data Representation. Reminders. Sound What is sound? Interpreting bits to give them meaning. Part 4: Media - Sound, Video, Compression Data Representation Interpreting bits to give them meaning Part 4: Media -, Video, Compression Notes for CSC 100 - The Beauty and Joy of Computing The University of North Carolina at Greensboro Reminders

More information

LIVE MUSIC PERFORMANCES OVER HIGH- SPEED IP NETWORKS

LIVE MUSIC PERFORMANCES OVER HIGH- SPEED IP NETWORKS LIVE MUSIC PERFORMANCES OVER HIGH- SPEED IP NETWORKS Stefan Karapetkov Polycom, Inc. e-mail: Stefan.Karapetkov@polycom.com ABSTRACT High-speed IP networks are creating opportunities for new kinds of real-time

More information

Export Audio Mixdown

Export Audio Mixdown 26 Introduction The function in Cubase Essential allows you to mix down audio from the program to a file on your hard disk. You always mix down an output bus. For example, if you have set up a stereo mix

More information

Tuning into a Radio Station

Tuning into a Radio Station Details on AM/FM reception Tuning into a Radio Station Tuning Automatically 1. Press TUNER on the remote controller several times to select either "AM" or "FM". 2. Press TUN MODE so that the "AUTO" indicator

More information

AUDIO MEDIA CHAPTER Background

AUDIO MEDIA CHAPTER Background CHAPTER 3 AUDIO MEDIA 3.1 Background It is important to understand how the various audio software is distributed in order to plan for its use. Today, there are so many audio media formats that sorting

More information

145W per channel, this powerhouse envelopes you in a new level of sound performance. (145W x 8ohms 1kHz 0.9% THD with 1 ch.

145W per channel, this powerhouse envelopes you in a new level of sound performance. (145W x 8ohms 1kHz 0.9% THD with 1 ch. STR-DN1030 7.2 Receiver with Airplay and Bluetooth Bullets 7.2 Channel 1015W A/V Receiver (145W x 7 @ 8 ohms 1kHz 0.09% THD 1 ch. driven) Built-in WiFi for easy access to Music Services like Pandora and

More information

2.1 Transcoding audio files

2.1 Transcoding audio files 2.1 Transcoding audio files 2.1.1 Introduction to Transcoding One of the basic tasks you can perform on an audio track is to convert it into another format. This process known as Transcoding, is the direct

More information

ITEC310 Computer Networks II

ITEC310 Computer Networks II ITEC310 Computer Networks II Chapter 29 Multimedia Department of Information Technology Eastern Mediterranean University 2/75 Objectives After completing this chapter you should be able to do the following:

More information

H6201 SMART LED TV 46" 50" SPEC SHEET PRODUCT HIGHLIGHTS. Wi-Fi Built In. key features

H6201 SMART LED TV 46 50 SPEC SHEET PRODUCT HIGHLIGHTS. Wi-Fi Built In. key features PRODUCT HIGHLIGHTS Smart TV Wi-Fi Built In Clear Motion Rate 240 sizes 46" 50" With the Samsung Smart LED TV, Smart is only the beginning. Navigate effortlessly through the Samsung Smart Hub to find your

More information

Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig

Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig Multimedia Databases Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.de 6 Audio Retrieval 6 Audio Retrieval 6.1 Basics of

More information

Compression; Error detection & correction

Compression; Error detection & correction Compression; Error detection & correction compression: squeeze out redundancy to use less memory or use less network bandwidth encode the same information in fewer bits some bits carry no information some

More information

_APP B_549_10/31/06. Appendix B. Producing for Multimedia and the Web

_APP B_549_10/31/06. Appendix B. Producing for Multimedia and the Web 1-59863-307-4_APP B_549_10/31/06 Appendix B Producing for Multimedia and the Web In addition to enabling regular music production, SONAR includes a number of features to help you create music for multimedia

More information

H6203 SMART LED TV 40" 46" 50" 55" 60" 65" SPEC SHEET PRODUCT HIGHLIGHTS. Wi-Fi Built In. key features

H6203 SMART LED TV 40 46 50 55 60 65 SPEC SHEET PRODUCT HIGHLIGHTS. Wi-Fi Built In. key features PRODUCT HIGHLIGHTS Smart TV Wi-Fi Built In Clear Motion Rate 240 sizes 40" 46" 50" 55" 60" 65" With the Samsung Smart LED TV, Smart is only the beginning. Navigate effortlessly through the Samsung Smart

More information

The Gullibility of Human Senses

The Gullibility of Human Senses The Gullibility of Human Senses Three simple tricks for producing LBSC 690: Week 9 Multimedia Jimmy Lin College of Information Studies University of Maryland Monday, April 2, 2007 Images Video Audio But

More information

1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends

1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends Contents List of Figures List of Tables Contributing Authors xiii xxi xxiii Introduction Karlheinz Brandenburg and Mark Kahrs xxix 1 Audio quality determination based on perceptual measurement techniques

More information