University of Pennsylvania Department of Electrical and Systems Engineering Digital Audio Basics

Size: px
Start display at page:

Download "University of Pennsylvania Department of Electrical and Systems Engineering Digital Audio Basics"

Transcription

1 University of Pennsylvania Department of Electrical and Systems Engineering Digital Audio Basics ESE250 Spring 2013 Lab 7: Psychoacoustic Compression Friday, February 22, 2013 For Lab Session: Thursday, February 28, 2013in Detkin Lab. Due: Wednesday, March 13, 2013by 12:00pm. Collaboration: Work in lab in teams of 2. Perform individual writeups. See course collaboration policy in the Administrative Handout. Objective: Appreciate and experience sampling rates; understand aliasing. Prelab Requirements: Download and read this entire lab assignment. Complete the prelab questions at the bottom of the document. Bring your headphones. Optional but highly encouraged, encode a track from one of your music CDs into a.wav file sampled at 44.1KHz, 16bits. 1 Deliverable: Names of all lab group members Answers to all lab questions (including prelab question) Size vs. Quality plots Handin: All labs will be turned in electronically through the Penn Blackboard website. Go to the assignment submission link and follow the instructions. Please submit all your files as one zip file. Also, remember that your writeup should be a PDF file. Exit Ticket: Play for your TA a bad quality and low quality encoded audio file. 1 Make sure you encode it from a CD, re-encoding an existing.mp3 into a.wav will not be useful for the lab. 1

2 Motivation Labs 2 through 6 introduced you to many digital audio concepts from sampling to compression. In this lab we will use what you ve learned to experiment with a simplified lossy audio encoder modeled after an mp3 encoder. The encoder operates on one frame at a time consisting of 1152 samples. Each frame is processed through 4 stages: Sampling which consists of reading the audio file (week 2 and 6). DFT where the signal is transformed into (1152 2) = 576 frequencies in the frequency domain (week 4). Quantization where frequencies are binned and using a psychoacoustic model, some frequencies are removed. This is the lossy step of the process (week 5). Lossless Compression where, finally, the remaining signal is compressed (week 3). This encoder differs slightly from a standard mp3 encoder in the quantization step. In this step, a standard mp3 encoder is faced with a difficult challenge: it has a strict limit on the number of bits it can use to describe the frame, but it must maintain as high an audio quality as it can. It iterates until it finds a balance between adhering to the psychoacoustic model and meeting the bit requirement by deciding how many frequencies to keep and how to quantize the amplitudes. Our encoder does not have a strictly enforced bit limit, its only requirement is to produce high quality audio using as few bits as it can. Also, it does not quantize the amplitudes but rather only removes frequencies based on a policy you define. Another significant difference between our encoder and the standard mp3 encoder is that a standard encoder considers what policy to use for each individual frame and each individual critical band within each frame. Our encoder operates with a global policy over all frames and all bands, this makes it feasible for you to appreciate how different policies work rather than having to deal with the tedious problem of manually coming up with a policy for each frame and each band. Lab Procedure Guide This lab is not particularly complex, but it may take some time to complete. You should keep track of time as you work through the lab and keep in mind that since many of the experiments are subjective, you don t have to exhaustively search the space for the answer. Quantization Filters During quantization, the 576 frequencies are binned into 25 critical bands as explained in lecture 6. One or more quantization filters are then applied. Based on these, some frequencies are thrown 2

3 out. As we saw in week 6, we can do this because there are frequencies that we can t hear, either because they are too soft or because they are masked by other tones. There are four filters we can independently apply: a. Limit Number of Critical Bands allows us to choose how many of the 25 critical bands to keep. If less than 25 bands are specified, it will remove the higher frequency bands starting from the 25th down (i.e. if we specify 22 bands, we will throw out the 23rd, 24th and 25th bands) b. Limit Tones Overall disregards the binning of all 576 frequencies. Instead, it keeps the n All frequencies that have the highest amplitudes. c. Limit Tones by Band operates on each band individually, and keeps the n band frequencies that have the highest amplitude in that band. d. Threshold by Band operates on each band individually, and keeps all the frequencies that are within d decibels of the loudest tone in the band. This allows the total number of frequencies kept to differ between bands based on how many loud frequencies the band has. It also approximates how knowledge of masking may be used to remove masked frequencies. File encoding For the first part of the lab, you will encode a.wav file using six different quantization policies (i.e. combinations of quantization filters) and plot the encoded file size vs. audio quality. In the second part, for each quantization policy used, you will pick an encoding at the same quality level and explore what allows the different encodings to be at the same audio quality but vary in file size. Refer to the section LabView Encoder and Decoder to learn how to use the encoder and decoder we will use for these experiments. For each policy, you will need to find four data points relating quality to size a good quality point, two intermediate points and a bad quality point. Figure 1 shows what plotting these points for one policy may look like. Good quality point: This point is defined as the smallest file size that still sounds like something you would not mind listening to. It also means that any point that yields a higher file size should have the same audio quality as this point. Bad quality point: This point is the largest file size that produces an audio quality that is unclear and maybe even unpleasant to listen to. Any point that produces a smaller file size using the same policy should also have bad audio quality. 2 intermediate quality point These are two points between the good and bad points. To measure the quality we will use the absolute grade industry standard scale in figure 2. It is quantized from 1 to 5 where 1 is the quality of the bad point and 5 is the quality of the good point. 3

4 Figure 1: File size vs. Perceived Quality plot In order to complete the lab in the alloted time, you should only encode 15 to 20 seconds of audio. You should pick one.wav file and decide which part to encode. If you don t have a.wav file, you can download one from the course documents the ESE250 schedule webpage. Also, you can multi-task and run the encoder and decoder at the same time, so while you are waiting for the encoder to encode a new file, you can listen to the results of the file encoded previously. Pick a song and a duration that you wish to encode. Make sure to keep the parameters you just chose constant as you work through the rest of this lab. Data to Record For each of the four points in each policy (including the baseline), record: Frequencies overall kept (for policy 1, also record the number of bands kept). Rememeber that this is a parameter of your filters. For example, if you are using the Limit Tones Overall filter, then you record the number of frequencies kept over all bands, whereas if you are using the Limit Tones By Band filter, you record the number of frequencies kept in a band. Number of non-zero frequencies reported over the entire snippet encoded. Size in bytes (not kilobytes) of the encoded files. Quality on the defined scale from 1 (bad) to 5 (good) of the encoded files. Note: The baseline will only have one point since we are not applying any filters. For policy 4, record the decibel threshold for each point. For policies 3 and 6 also record the number of frequencies per band kept. 4

5 Figure 2: Quality Scale. [Painter & Spanias. Proc.IEEE, 88(4):451512, 2000] Baseline Before applying any policies, you should quantify how well the encoder does when we completely skip the quantization step. Set all three filters in the encoder to Inactive, set the input file, start time and duration with the values you decided on at the beginning of the lab. Set an output file and run the encoder. Listen to the encoded snippet using the decoder. possible audio quality. The way it sounds will define the best Policy 1 First we decide how many critical bands we will keep. Set the first filter to Limit Number of Critical Bands. Leave filters 2 and 3 set to Inactive. Keep the input file, start time and duration the same as before. Try a value of 24, 23, 22 and 21 for the number of bands to keep for filter 1 to find the four quality vs. size data points defined above 2. Use the decoder to listen to the files and gauge their audio quality. Make sure you update the output file name as you explore the space so that you don t overwrite other files. 2 For this policy, you may find that even at 21 bands, the quality is not bad enough to be considered a bad point, if so, you may only report the other three points. 5

6 Policy 2 Now we start to remove frequencies from each of the remaining bands. Set the first filter to Limit Number of Critical Bands and set the number of bands to the number found for the good point in Policy 1. Set filter 2 to Limit Tones Overall and leave filter 3 set to Inactive. Keep the input file, start time and duration the same as before. Choose values between 1 and 200 for the number of frequencies over all to keep for filter 2 as you search for the four quality vs. size data points defined above 3. Use the decoder to listen to the files and gauge their audio quality. Make sure you update the output file name as you explore the space so that you don t overwrite other files. Policy 3 Here we will try a different frequency removal filter. Set the first filter to Limit Number of Critical Bands and set the number of bands to the number found for the good point in Policy 1. Set filter 2 to Limit Tones by Band and leave filter 3 set to Inactive. Keep the input file, start time and duration the same as before. Choose values between 1 and 20 for the number of frequencies over all to keep for filter 2 as you search for the four quality vs. size data points defined above. Use the decoder to listen to the files and gauge their audio quality. Make sure you update the output file name as you explore the space so that you don t overwrite other files. Policy 4 Now, instead of directly limiting the number of frequencies, we threshold the amplitude of the frequencies and remove any frequency with an amplitude below the threshold. Set the first filter to Limit Number of Critical Bands and set the number of bands to the number found for the good point in Policy 1. Set filter 2 to Threshold by Band and leave filter 3 set to Inactive. Keep the input file, start time and duration the same as before. Choose values between 1 and 10 for the number of decibels below the highest amplitude in the band at which we will still keep the frequency for filter 2 as you search for the four quality vs. size data points defined above. Use the decoder to listen to the files and gauge their audio quality. Make sure you update the output file name as you explore the space so that you don t overwrite other files. 3 for such a large range you should not do a sequential search, instead try a few points such as 200, 100, 50 and 10 and then refine your search around those points if you feel you need to 6

7 Policy 5 Finally, we will combine three frequency removal filters. Set the first filter to Limit Number of Critical Bands and set the number of bands to the number found for the good point in Policy 1. Set filter 2 to Threshold by Band and set the number of decibels to threshold to the number found for the good point in Policy 4. Set filter 3 to Limit Tones Overall. Keep the input file, start time and duration the same as before. Choose values between 1 and 200 for the number of frequencies over all to keep for filter 3 as you search for the four quality vs. size data points defined above. Use the decoder to listen to the files and gauge their audio quality. Make sure you update the output file name as you explore the space so that you don t overwrite other files. Policy 6 For the last policy, change the last filter. Set the first filter to Limit Number of Critical Bands and set the number of bands to the number found for the good point in Policy 1. Set filter 2 to Threshold by Band and set the number of decibels to threshold to the number found for the good point in Policy 4. Set filter 3 to Limit Tones By Band, keep the input file, start time and duration the same as before. Choose values between 1 and 20 for the number of frequencies over all to keep for filter 3 as you search for the four quality vs. size data points defined above. Use the decoder to listen to the files and gauge their audio quality. Make sure you update the output file name as you explore the space so that you don t overwrite other files. Frame exploration You will now analyze what makes one policy better than another by looking at how each policy decides which frequencies to keep. Focus only on the 5 good points, one for each of the policies above ignoring policy 1. Load explorer.vi, it is similar to the encoder from the previous section but instead of encoding, it allows you to explore each frame and each band while you dynamically change the filters. You can explore the encoding by changing the Frame and Band sliders. Each channel reports the number of frequencies with non-zero amplitudes in that Frame-Band pair before and after applying the filters, it also reports the percent of frequencies kept after the filter. 7

8 FOR QUESTIONS: For each policy (2 to 6): Set the input file, length and duration to the same values used in the previous section. Run the VI. Set the filters and filter parameters to exactly the same values you set for the good point of that policy. Pick one band and examine the percent of frequencies kept as you sweep the frames. Repeat the above step for a few more bands (bands 10 to 18 tend to give better results). LabView Encoder, Decoder and Explorer Encoder encoder.vi is the main encoder application. It takes a.wav file and produces a.mp250, a simplified version of a standard mp3. The VI consists of four main parts: File Settings, Filters, Status and Frequency Graphs. File Settings Here you specify the path to the input.wav file and output.mp250 file. Also you choose the part of the input file to encode by setting the duration and start time values. Filters This part defines which quantization filters to apply. You can choose the filter to use, and if it takes a parameter, define the parameter for that filter. Filter 1 is applied first than filter 2 and finally filter 3. Status When the encoder is running, this section will show information about the snippet being encoded including total length, number of frames, and at the end, the number of frequencies that have amplitudes greater than zero. As it encodes, it also shows the progress for the left and right channels. 8

9 Frequency Graphs The final section shows what the encoder is doing to each band of each frame as it encodes. It has four frequency graphs, two on the left for the left channel and two on the right for the right channel. The top graphs show the amplitudes of the frequencies in the selected critical band. The bottom graphs show what frequencies remain after applying the selected filters. As the encoder runs, you can look at different bands by dragging the Critical Band Displayed slider, you will see that lower frequency bands have less frequencies. Decoder Since we are generating.mp250 files, we need a custom decoder that can play these files. The decoder is the simplest of the three VIs, you can run it by opening decoder.vi. You specify the.mp250 file to decode and run it. It reports the status of what it is doing and once it has finished decoding, it plays the file. Explorer The explorer is very similar to the encoder but it has some differences. Unlike the encoder, explorer.vi will not produce a.mp250 file but rather it will let you explore each frame and each band of the frame to see what is happening to the frequencies. Once you ve specified an input file, start time and duration, you can run it and dynamically change the filters, sweep the frames back and forth and go to a chosen critical band. The Frequency Graphs section has more features than the corresponding section in the encoder. At the top it lets you not only choose the critical band to look at but also choose which frame. Also, every graph shows how many frequencies with non-zero amplitudes the band has. Finally, the After Filters graphs calculate what percent of frequencies were kept compared to the Before Filters graphs. Lab Writeup Guidelines Theory: Pre-lab A standard MP3 has a bitrate of 128kbits/s for stereo sound (i.e there are two channels, each with a bitrate of 64kbits/s). The MP3 encoder takes the 44K samples in one sec of an audio sample and breaks them into roughly 38 frames of 1152 samples each. Each frame is then converted into 576 double-precision floating point frequency co-efficients. At this point, the psychoacoustic models 9

10 can then be applied to eliminate imperceptible frequency components to reduce the amount of information needed to be encoded. 1. Roughly how many bits are available to encode an MP3 frame (assume 1kbit is 1000bits)? 2. If each frequency co-efficient in a frame remains as a double-precision floating point value: How many bits would be required to represent all the co-efficients in a frame (a doubleprecision floating point requires 64bits to represent it)? How does this value compare to the available number of bits for a frame calculated above? 3. If we want represent all 576 values in a frame with equal precision: How many quantization levels could we use? How many bits would be needed per frequency co-efficient? 4. If each frequency co-efficient in an encoded frame is quantized to one of 8192 values, how many bits would be needed to represent one of these co-efficients? 5. Assuming we can represent a zero frequency with no bits: How many non-zero frequency frequency co-efficients can we afford to have with the available bits in a frame if each co-efficient is quantized to one of 8192 values as in the previous question? What does this mean for our encoder? Analysis File Encoding Question 1: Report your recorded data. This is best presented in a table. You can list data that does not fit well with the table. Question 2: Plot on one graph all 6 size vs quality curves (For each policy, the curve defined by the one good, two intermediate and one bad points). Question 3: Which of the six policies produced the smallest file size for the good point? Frame Exploration Question 4: For each policy, does the percent of frequencies kept stay constant as you sweep the frame? Is this the case for the other bands examined? Use what you know about what the policy is doing to justify what you observe. 10

11 Conclusion Question 5: Given what you discovered in Frame Exploration section and what you know about psychoacoustic models, what properties of the policy answered in question 3 allow it to be the best? Further Thought This section is not part of your required assignment. Along with each week, we will offer directions and questions for further thought. Due to the nature of this course, we can only begin to glimpse the depth and richness of each of the topic areas. These questions offer some headings to contemplate further depth. These will often be open-ended questions. he encoder used for this week s lab is deliberately limited to contain the complexity for both us and you. How would you improve the encoder (e.g. improve the search algorithm, exploit temporal masking)? The encoders we have described operate online. That is they encode each frame as they see it. If you allowed the encoder to consider all frames before and during encoding, how might you be able to improve the encoding? Does the MP3 encoding model limit you? Are there additional ways you might improve encoding (achieve a superior quality at some or all target bitrates) by departing from the MP3 standard as you understand it? How would your changes impact requirements for the decoder? 11

AUDIOVISUAL COMMUNICATION

AUDIOVISUAL COMMUNICATION AUDIOVISUAL COMMUNICATION Laboratory Session: Audio Processing and Coding The objective of this lab session is to get the students familiar with audio processing and coding, notably psychoacoustic analysis

More information

AUDIOVISUAL COMMUNICATION

AUDIOVISUAL COMMUNICATION AUDIOVISUAL COMMUNICATION Laboratory Session: Audio Processing and Coding The objective of this lab session is to get the students familiar with audio processing and coding, notably psychoacoustic analysis

More information

Lecture 16 Perceptual Audio Coding

Lecture 16 Perceptual Audio Coding EECS 225D Audio Signal Processing in Humans and Machines Lecture 16 Perceptual Audio Coding 2012-3-14 Professor Nelson Morgan today s lecture by John Lazzaro www.icsi.berkeley.edu/eecs225d/spr12/ Hero

More information

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects. Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general

More information

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding Perceptual Coding Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding Part II wrap up 6.082 Fall 2006 Perceptual Coding, Slide 1 Lossless vs.

More information

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general

More information

5: Music Compression. Music Coding. Mark Handley

5: Music Compression. Music Coding. Mark Handley 5: Music Compression Mark Handley Music Coding LPC-based codecs model the sound source to achieve good compression. Works well for voice. Terrible for music. What if you can t model the source? Model the

More information

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories,

More information

CSC 101: Lab #7 Digital Audio Due Date: 5:00pm, day after lab session

CSC 101: Lab #7 Digital Audio Due Date: 5:00pm, day after lab session CSC 101: Lab #7 Digital Audio Due Date: 5:00pm, day after lab session Purpose: The purpose of this lab is to provide you with hands-on experience in digital audio manipulation techniques using the Audacity

More information

Principles of Audio Coding

Principles of Audio Coding Principles of Audio Coding Topics today Introduction VOCODERS Psychoacoustics Equal-Loudness Curve Frequency Masking Temporal Masking (CSIT 410) 2 Introduction Speech compression algorithm focuses on exploiting

More information

Fundamentals of Perceptual Audio Encoding. Craig Lewiston HST.723 Lab II 3/23/06

Fundamentals of Perceptual Audio Encoding. Craig Lewiston HST.723 Lab II 3/23/06 Fundamentals of Perceptual Audio Encoding Craig Lewiston HST.723 Lab II 3/23/06 Goals of Lab Introduction to fundamental principles of digital audio & perceptual audio encoding Learn the basics of psychoacoustic

More information

Lecture #3: Digital Music and Sound

Lecture #3: Digital Music and Sound Lecture #3: Digital Music and Sound CS106E Spring 2018, Young In this lecture we take a look at how computers represent music and sound. One very important concept we ll come across when studying digital

More information

CHAPTER 10: SOUND AND VIDEO EDITING

CHAPTER 10: SOUND AND VIDEO EDITING CHAPTER 10: SOUND AND VIDEO EDITING What should you know 1. Edit a sound clip to meet the requirements of its intended application and audience a. trim a sound clip to remove unwanted material b. join

More information

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd Overview Audio Signal Processing Applications @ Dolby Audio Signal Processing Basics

More information

ELL 788 Computational Perception & Cognition July November 2015

ELL 788 Computational Perception & Cognition July November 2015 ELL 788 Computational Perception & Cognition July November 2015 Module 11 Audio Engineering: Perceptual coding Coding and decoding Signal (analog) Encoder Code (Digital) Code (Digital) Decoder Signal (analog)

More information

ITNP80: Multimedia! Sound-II!

ITNP80: Multimedia! Sound-II! Sound compression (I) Compression of sound data requires different techniques from those for graphical data Requirements are less stringent than for video data rate for CD-quality audio is much less than

More information

2.4 Audio Compression

2.4 Audio Compression 2.4 Audio Compression 2.4.1 Pulse Code Modulation Audio signals are analog waves. The acoustic perception is determined by the frequency (pitch) and the amplitude (loudness). For storage, processing and

More information

ESE532: System-on-a-Chip Architecture. Today. Message. Project. Expect. Why MPEG Encode? MPEG Encoding Project Motion Estimation DCT Entropy Encoding

ESE532: System-on-a-Chip Architecture. Today. Message. Project. Expect. Why MPEG Encode? MPEG Encoding Project Motion Estimation DCT Entropy Encoding ESE532: System-on-a-Chip Architecture Day 16: March 20, 2017 MPEG Encoding MPEG Encoding Project Motion Estimation DCT Entropy Encoding Today Penn ESE532 Spring 2017 -- DeHon 1 Penn ESE532 Spring 2017

More information

Digital Media. Daniel Fuller ITEC 2110

Digital Media. Daniel Fuller ITEC 2110 Digital Media Daniel Fuller ITEC 2110 Daily Question: Digital Audio What values contribute to the file size of a digital audio file? Email answer to DFullerDailyQuestion@gmail.com Subject Line: ITEC2110-09

More information

Mpeg 1 layer 3 (mp3) general overview

Mpeg 1 layer 3 (mp3) general overview Mpeg 1 layer 3 (mp3) general overview 1 Digital Audio! CD Audio:! 16 bit encoding! 2 Channels (Stereo)! 44.1 khz sampling rate 2 * 44.1 khz * 16 bits = 1.41 Mb/s + Overhead (synchronization, error correction,

More information

CHAPTER 6 Audio compression in practice

CHAPTER 6 Audio compression in practice CHAPTER 6 Audio compression in practice In earlier chapters we have seen that digital sound is simply an array of numbers, where each number is a measure of the air pressure at a particular time. This

More information

Audio Compression. Audio Compression. Absolute Threshold. CD quality audio:

Audio Compression. Audio Compression. Absolute Threshold. CD quality audio: Audio Compression Audio Compression CD quality audio: Sampling rate = 44 KHz, Quantization = 16 bits/sample Bit-rate = ~700 Kb/s (1.41 Mb/s if 2 channel stereo) Telephone-quality speech Sampling rate =

More information

CISC 7610 Lecture 3 Multimedia data and data formats

CISC 7610 Lecture 3 Multimedia data and data formats CISC 7610 Lecture 3 Multimedia data and data formats Topics: Perceptual limits of multimedia data JPEG encoding of images MPEG encoding of audio MPEG and H.264 encoding of video Multimedia data: Perceptual

More information

Export Audio Mixdown

Export Audio Mixdown 26 Introduction The function in Cubase Essential allows you to mix down audio from the program to a file on your hard disk. You always mix down an output bus. For example, if you have set up a stereo mix

More information

Data Representation. Reminders. Sound What is sound? Interpreting bits to give them meaning. Part 4: Media - Sound, Video, Compression

Data Representation. Reminders. Sound What is sound? Interpreting bits to give them meaning. Part 4: Media - Sound, Video, Compression Data Representation Interpreting bits to give them meaning Part 4: Media -, Video, Compression Notes for CSC 100 - The Beauty and Joy of Computing The University of North Carolina at Greensboro Reminders

More information

Multimedia Communications. Audio coding

Multimedia Communications. Audio coding Multimedia Communications Audio coding Introduction Lossy compression schemes can be based on source model (e.g., speech compression) or user model (audio coding) Unlike speech, audio signals can be generated

More information

Audio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011

Audio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011 Audio Fundamentals, Compression Techniques & Standards Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011 Outlines Audio Fundamentals Sampling, digitization, quantization μ-law

More information

Audio-coding standards

Audio-coding standards Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.

More information

CS 074 The Digital World. Digital Audio

CS 074 The Digital World. Digital Audio CS 074 The Digital World Digital Audio 1 Digital Audio Waves Hearing Analog Recording of Waves Pulse Code Modulation and Digital Recording CDs, Wave Files, MP3s Editing Wave Files with BinEd 2 Waves A

More information

For Mac and iphone. James McCartney Core Audio Engineer. Eric Allamanche Core Audio Engineer

For Mac and iphone. James McCartney Core Audio Engineer. Eric Allamanche Core Audio Engineer For Mac and iphone James McCartney Core Audio Engineer Eric Allamanche Core Audio Engineer 2 3 James McCartney Core Audio Engineer 4 Topics About audio representation formats Converting audio Processing

More information

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved.

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved. Compressed Audio Demystified Why Music Producers Need to Care About Compressed Audio Files Download Sales Up CD Sales Down High-Definition hasn t caught on yet Consumers don t seem to care about high fidelity

More information

Audio-coding standards

Audio-coding standards Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.

More information

EECE.2160: ECE Application Programming

EECE.2160: ECE Application Programming Spring 2018 Programming Assignment #10: Instruction Decoding and File I/O Due Wednesday, 5/9/18, 11:59:59 PM (Extra credit ( 4 pts on final average), no late submissions or resubmissions) 1. Introduction

More information

Digital Recording and Playback

Digital Recording and Playback Digital Recording and Playback Digital recording is discrete a sound is stored as a set of discrete values that correspond to the amplitude of the analog wave at particular times Source: http://www.cycling74.com/docs/max5/tutorials/msp-tut/mspdigitalaudio.html

More information

COS 116 The Computational Universe Laboratory 4: Digital Sound and Music

COS 116 The Computational Universe Laboratory 4: Digital Sound and Music COS 116 The Computational Universe Laboratory 4: Digital Sound and Music In this lab you will learn about digital representations of sound and music, especially focusing on the role played by frequency

More information

FAU. How do I. Post course content? Folders

FAU. How do I. Post course content? Folders How do I Post course content? Content is made up of folders, files, links, and assessments (this will be covered on page ## or see separate documentation). It is basically anything you want to share with

More information

Project 4: ATM Design and Implementation

Project 4: ATM Design and Implementation University of Maryland CMSC414 Computer and Network Security (Spring 2015) Instructor: Dave Levin (project originally created by Jonathan Katz) Updated April 30, 9:00am: Valid user names are now at most

More information

COS 116 The Computational Universe Laboratory 4: Digital Sound and Music

COS 116 The Computational Universe Laboratory 4: Digital Sound and Music COS 116 The Computational Universe Laboratory 4: Digital Sound and Music In this lab you will learn about digital representations of sound and music, especially focusing on the role played by frequency

More information

AudioLiquid Converter User Guide

AudioLiquid Converter User Guide AudioLiquid Converter User Guide Acon Digital Media GmbH AudioLiquid Converter User Guide All rights reserved. No parts of this work may be reproduced in any form or by any means - graphic, electronic,

More information

OLIVE 4 & 4HD HI-FI MUSIC SERVER P R O D U C T O V E R V I E W

OLIVE 4 & 4HD HI-FI MUSIC SERVER P R O D U C T O V E R V I E W OLIVE 4 & 4HD HI-FI MUSIC SERVER P R O D U C T O V E R V I E W O L I V E 4 & O L I V E 4 H D P R O D U C T O V E R V I E W 2 4 Digital Music Without Compromise Olive makes the only high fi delity digital

More information

Due: March 8, 11:59pm. Project 1

Due: March 8, 11:59pm. Project 1 COS 433: Cryptography (Spring 2018) Project 1 Princeton University Due: March 8, 11:59pm Project 1 Introduction You are interning at the super secretive SLA (Three Letter Agency). The SLA has intercepted

More information

Rich Recording Technology Technical overall description

Rich Recording Technology Technical overall description Rich Recording Technology Technical overall description Ari Koski Nokia with Windows Phones Product Engineering/Technology Multimedia/Audio/Audio technology management 1 Nokia s Rich Recording technology

More information

Advanced Relational Database Management MISM Course S A3 Spring 2019 Carnegie Mellon University

Advanced Relational Database Management MISM Course S A3 Spring 2019 Carnegie Mellon University Advanced Relational Database Management MISM Course S19-95736 A3 Spring 2019 Carnegie Mellon University Instructor: Randy Trzeciak Office: HBH 1104C Office hours: By Appointment Phone: 412-268-7040 E-mail:

More information

Project 4: ATM Design and Implementation

Project 4: ATM Design and Implementation University of Maryland CMSC414 Computer and Network Security (Spring 2017) Instructor: Udaya Shankar (project originally created by Jonathan Katz) Project 4: ATM Design and Implementation Due dates May

More information

2013 Association Marketing Benchmark Report

2013 Association  Marketing Benchmark Report 2013 Association Email Marketing Benchmark Report Part I: Key Metrics 1 TABLE of CONTENTS About Informz.... 3 Introduction.... 4 Key Findings.... 5 Overall Association Metrics... 6 Results by Country of

More information

Standard File Formats

Standard File Formats Standard File Formats Introduction:... 2 Text: TXT and RTF... 4 Grapics: BMP, GIF, JPG and PNG... 5 Audio: WAV and MP3... 8 Video: AVI and MPG... 11 Page 1 Introduction You can store many different types

More information

EE482: Digital Signal Processing Applications

EE482: Digital Signal Processing Applications Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 13 Audio Signal Processing 14/04/01 http://www.ee.unlv.edu/~b1morris/ee482/

More information

<< WILL FILL IN THESE SECTIONS THIS WEEK to provide sufficient background>>

<< WILL FILL IN THESE SECTIONS THIS WEEK to provide sufficient background>> THE GSS CODEC MUSIC 422 FINAL PROJECT Greg Sell, Song Hui Chon, Scott Cannon March 6, 2005 Audio files at: ccrma.stanford.edu/~gsell/422final/wavfiles.tar Code at: ccrma.stanford.edu/~gsell/422final/codefiles.tar

More information

Introduction to Audacity

Introduction to Audacity IMC Innovate Make Create http://library.albany.edu/imc/ 518 442-3607 Introduction to Audacity NOTE: This document illustrates Audacity 2.x on the Windows operating system. Audacity is a versatile program

More information

Appendix 4. Audio coding algorithms

Appendix 4. Audio coding algorithms Appendix 4. Audio coding algorithms 1 Introduction The main application of audio compression systems is to obtain compact digital representations of high-quality (CD-quality) wideband audio signals. Typically

More information

_APP B_549_10/31/06. Appendix B. Producing for Multimedia and the Web

_APP B_549_10/31/06. Appendix B. Producing for Multimedia and the Web 1-59863-307-4_APP B_549_10/31/06 Appendix B Producing for Multimedia and the Web In addition to enabling regular music production, SONAR includes a number of features to help you create music for multimedia

More information

MPEG-1. Overview of MPEG-1 1 Standard. Introduction to perceptual and entropy codings

MPEG-1. Overview of MPEG-1 1 Standard. Introduction to perceptual and entropy codings MPEG-1 Overview of MPEG-1 1 Standard Introduction to perceptual and entropy codings Contents History Psychoacoustics and perceptual coding Entropy coding MPEG-1 Layer I/II Layer III (MP3) Comparison and

More information

APPENDIX G: Biennial Exhibition Device Survey Questions

APPENDIX G: Biennial Exhibition Device Survey Questions APPENDIX G: Biennial Exhibition Device Survey Questions STANDARD DEVICE QUESTIONS Did you use Whitney s Audio Guide to access the Museum s audio guide tour during today s visit? If so, please take a few

More information

02 - Numerical Representations

02 - Numerical Representations September 3, 2014 Todays lecture Finite length effects, continued from Lecture 1 Floating point (continued from Lecture 1) Rounding Overflow handling Example: Floating Point Audio Processing Example: MPEG-1

More information

Binary representation and data

Binary representation and data Binary representation and data Loriano Storchi loriano@storchi.org http:://www.storchi.org/ Binary representation of numbers In a positional numbering system given the base this directly defines the number

More information

Digital Communication Prof. Bikash Kumar Dey Department of Electrical Engineering Indian Institute of Technology, Bombay

Digital Communication Prof. Bikash Kumar Dey Department of Electrical Engineering Indian Institute of Technology, Bombay Digital Communication Prof. Bikash Kumar Dey Department of Electrical Engineering Indian Institute of Technology, Bombay Lecture - 26 Source Coding (Part 1) Hello everyone, we will start a new module today

More information

DAB. Digital Audio Broadcasting

DAB. Digital Audio Broadcasting DAB Digital Audio Broadcasting DAB history DAB has been under development since 1981 at the Institut für Rundfunktechnik (IRT). In 1985 the first DAB demonstrations were held at the WARC-ORB in Geneva

More information

Networking Applications

Networking Applications Networking Dr. Ayman A. Abdel-Hamid College of Computing and Information Technology Arab Academy for Science & Technology and Maritime Transport Multimedia Multimedia 1 Outline Audio and Video Services

More information

Data Structures and OO Development II

Data Structures and OO Development II CS 2606 1 Long House Ancestral Puebloan, Mesa Verde Instructor and Course Information 2 William D McQuain Email: Office: Office Hours: wmcquain@cs.vt.edu 631 McBryde Hall see course website CS 2606 Design

More information

Digital Media. Daniel Fuller ITEC 2110

Digital Media. Daniel Fuller ITEC 2110 Digital Media Daniel Fuller ITEC 2110 Daily Question: Which statement is True? 5 + 5 = 10 1 + 1 = 10 F + 1 = 10 Email answer to DFullerDailyQuestion@gmail.com Subject Line: ITEC2110-26 First, some mac

More information

Fundamentals of Operations Research. Prof. G. Srinivasan. Department of Management Studies. Indian Institute of Technology, Madras. Lecture No.

Fundamentals of Operations Research. Prof. G. Srinivasan. Department of Management Studies. Indian Institute of Technology, Madras. Lecture No. Fundamentals of Operations Research Prof. G. Srinivasan Department of Management Studies Indian Institute of Technology, Madras Lecture No. # 13 Transportation Problem, Methods for Initial Basic Feasible

More information

Lossy compression. CSCI 470: Web Science Keith Vertanen

Lossy compression. CSCI 470: Web Science Keith Vertanen Lossy compression CSCI 470: Web Science Keith Vertanen Digital audio Overview Sampling rate Quan5za5on MPEG audio layer 3 (MP3) JPEG s5ll images Color space conversion, downsampling Discrete Cosine Transform

More information

ESE532 Spring University of Pennsylvania Department of Electrical and System Engineering System-on-a-Chip Architecture

ESE532 Spring University of Pennsylvania Department of Electrical and System Engineering System-on-a-Chip Architecture University of Pennsylvania Department of Electrical and System Engineering System-on-a-Chip Architecture ESE532, Spring 2017 HW2: Profiling Wednesday, January 18 Due: Friday, January 27, 5:00pm In this

More information

Novel Lossy Compression Algorithms with Stacked Autoencoders

Novel Lossy Compression Algorithms with Stacked Autoencoders Novel Lossy Compression Algorithms with Stacked Autoencoders Anand Atreya and Daniel O Shea {aatreya, djoshea}@stanford.edu 11 December 2009 1. Introduction 1.1. Lossy compression Lossy compression is

More information

Summary of Digital Information (so far):

Summary of Digital Information (so far): CS/MA 109 Fall 2016 Wayne Snyder Department Boston University Today : Audio concluded: MP3 Data Compression introduced Next Time Data Compression Concluded Error Detection and Error Correcction Summary

More information

Exploring IX1D The Terrain Conductivity/Resistivity Modeling Software

Exploring IX1D The Terrain Conductivity/Resistivity Modeling Software Exploring IX1D The Terrain Conductivity/Resistivity Modeling Software You can bring a shortcut to the modeling program IX1D onto your desktop by right-clicking the program in your start > all programs

More information

Lossy compression CSCI 470: Web Science Keith Vertanen Copyright 2013

Lossy compression CSCI 470: Web Science Keith Vertanen Copyright 2013 Lossy compression CSCI 470: Web Science Keith Vertanen Copyright 2013 Digital audio Overview Sampling rate Quan5za5on MPEG audio layer 3 (MP3) JPEG s5ll images Color space conversion, downsampling Discrete

More information

CS/COE 1501

CS/COE 1501 CS/COE 1501 www.cs.pitt.edu/~lipschultz/cs1501/ Compression What is compression? Represent the same data using less storage space Can get more use out a disk of a given size Can get more use out of memory

More information

1 of 24 5/6/2011 2:14 PM

1 of 24 5/6/2011 2:14 PM 1 of 24 5/6/2011 2:14 PM This tutorial explains how to add links, files, zipped files, pages, and MOODLE Media. ADDING LINKS 1. Let s start with adding a link. Here is a link to a practice Prezi. Highlight

More information

Lab 2: Introduction to LabVIEW 8.5

Lab 2: Introduction to LabVIEW 8.5 Lab 2: Introduction to LabVIEW 8.5 INTRODUCTION: This lab is designed as an introduction to using LabVIEW. In this lab you will run through some tutorials to get a basic understanding of some of the LabVIEW

More information

Parametric Coding of Spatial Audio

Parametric Coding of Spatial Audio Parametric Coding of Spatial Audio Ph.D. Thesis Christof Faller, September 24, 2004 Thesis advisor: Prof. Martin Vetterli Audiovisual Communications Laboratory, EPFL Lausanne Parametric Coding of Spatial

More information

A Detailed look of Audio Steganography Techniques using LSB and Genetic Algorithm Approach

A Detailed look of Audio Steganography Techniques using LSB and Genetic Algorithm Approach www.ijcsi.org 402 A Detailed look of Audio Steganography Techniques using LSB and Genetic Algorithm Approach Gunjan Nehru 1, Puja Dhar 2 1 Department of Information Technology, IEC-Group of Institutions

More information

Homework Assignment #3

Homework Assignment #3 CS 540-2: Introduction to Artificial Intelligence Homework Assignment #3 Assigned: Monday, February 20 Due: Saturday, March 4 Hand-In Instructions This assignment includes written problems and programming

More information

CpSc 1111 Lab 4 Part a Flow Control, Branching, and Formatting

CpSc 1111 Lab 4 Part a Flow Control, Branching, and Formatting CpSc 1111 Lab 4 Part a Flow Control, Branching, and Formatting Your factors.c and multtable.c files are due by Wednesday, 11:59 pm, to be submitted on the SoC handin page at http://handin.cs.clemson.edu.

More information

MP Music & Video Player VOL. User Guide

MP Music & Video Player VOL. User Guide MP414-4 Music & Video Player 4 VOL User Guide In the box VOL slick MENU VOL Player Headphones USB Cable MP414-4 Music & Video Player 4 User Guide User Guide CD Location of controls 1 OFF ON 2 slick MENU

More information

Chapter 14 MPEG Audio Compression

Chapter 14 MPEG Audio Compression Chapter 14 MPEG Audio Compression 14.1 Psychoacoustics 14.2 MPEG Audio 14.3 Other Commercial Audio Codecs 14.4 The Future: MPEG-7 and MPEG-21 14.5 Further Exploration 1 Li & Drew c Prentice Hall 2003 14.1

More information

VS1063 ENCODER DEMONSTRATION

VS1063 ENCODER DEMONSTRATION PRELIMINARY DOCUMENT VS1063 ENCODER DEMONSTRATION VLSI Solution Audio Decoder Project Code: Project Name: All information in this document is provided as-is without warranty. Features are subject to change

More information

CSC 101: Lab #8 Digital Video Lab due date: 5:00pm, day after lab session

CSC 101: Lab #8 Digital Video Lab due date: 5:00pm, day after lab session Name: Lab Date and Time: Email Username: Partner s Name: CSC 101: Lab #8 Digital Video Lab due date: 5:00pm, day after lab session Pledged Assignment: This lab document should be considered a pledged graded

More information

What Do I Really Need to Know about itunes? COPYRIGHTED MATERIAL

What Do I Really Need to Know about itunes? COPYRIGHTED MATERIAL 1 What Do I Really Need to Know about itunes? COPYRIGHTED MATERIAL 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Managing a library of music and other media content can be a challenging task, and itunes is a powerful

More information

CS 1803 Pair Homework 4 Greedy Scheduler (Part I) Due: Wednesday, September 29th, before 6 PM Out of 100 points

CS 1803 Pair Homework 4 Greedy Scheduler (Part I) Due: Wednesday, September 29th, before 6 PM Out of 100 points CS 1803 Pair Homework 4 Greedy Scheduler (Part I) Due: Wednesday, September 29th, before 6 PM Out of 100 points Files to submit: 1. HW4.py This is a PAIR PROGRAMMING Assignment: Work with your partner!

More information

ESE 150 Lab 03: Data Compression

ESE 150 Lab 03: Data Compression LAB 03 In this lab we will do the following: 1. Use the Arduino A2D you built in Lab 1 to capture the following waveforms: square wave, sine wave, and sawtooth wave in the audio frequency range 2. Import

More information

Fundamental of Digital Media Design. Introduction to Audio

Fundamental of Digital Media Design. Introduction to Audio Fundamental of Digital Media Design Introduction to Audio by Noraniza Samat Faculty of Computer Systems & Software Engineering noraniza@ump.edu.my OER Fundamental of Digital Media Design by Noraniza Samat

More information

Analogue Ripper. What TAR does not do. Record Albums Break Them Into Tracks Send the tracks to itunes

Analogue Ripper. What TAR does not do. Record Albums Break Them Into Tracks Send the tracks to itunes Analogue Ripper Record Albums Break Them Into Tracks Send the tracks to itunes I. What the Analogue Ripper (TAR) does TAR allows you to digitally record music and sound from any analogue source onto your

More information

DSP. Presented to the IEEE Central Texas Consultants Network by Sergio Liberman

DSP. Presented to the IEEE Central Texas Consultants Network by Sergio Liberman DSP The Technology Presented to the IEEE Central Texas Consultants Network by Sergio Liberman Abstract The multimedia products that we enjoy today share a common technology backbone: Digital Signal Processing

More information

OPERATING MANUAL- MODEL : BS-101

OPERATING MANUAL- MODEL : BS-101 OPERATING MANUAL- MODEL : BS-101 Refer the attached connection diagram to install Slave units in all classrooms of your premises. Part Id is mentioned on the sticker pasted on right side of each Slave

More information

Advanced Vision Practical

Advanced Vision Practical Advanced Vision Practical Bob Fisher School of Informatics February 2018 Abstract This describes the Advanced Vision assessed practical. The main goal is to reconstruct a box from a set of 3D point clouds

More information

CS 160: Evaluation. Outline. Outline. Iterative Design. Preparing for a User Test. User Test

CS 160: Evaluation. Outline. Outline. Iterative Design. Preparing for a User Test. User Test CS 160: Evaluation Professor John Canny Spring 2006 2/15/2006 1 2/15/2006 2 Iterative Design Prototype low-fi paper, DENIM Design task analysis contextual inquiry scenarios sketching 2/15/2006 3 Evaluate

More information

Basic features. Adding audio files and tracks

Basic features. Adding audio files and tracks Audio in Pictures to Exe Introduction In the past the conventional wisdom was that you needed a separate audio editing program to produce the soundtrack for an AV sequence. However I believe that PTE (Pictures

More information

ESE 531: Digital Signal Processing

ESE 531: Digital Signal Processing ESE 531: Digital Signal Processing Lec 23: April 20, 2017 Compressive Sensing Penn ESE 531 Spring 2017 Khanna Previously! Today " DTFT, DFT, FFT practice " Compressive Sampling/Sensing Penn ESE 531 Spring

More information

OWNER S MANUAL. 1 Overview

OWNER S MANUAL. 1 Overview OWNER S MANUAL 1 Overview SurCode for Dolby Pro Logic II converts your 5.1 surround soundfiles to the Dolby Pro Logic II format by matrix-encoding the Center, Surround, and LFE tracks into the Left and

More information

CSE115 Lab 2 Fall 2016

CSE115 Lab 2 Fall 2016 DUE DATES: Monday recitations: 9:00 PM on 9/25 Wednesday recitations: 9:00 PM on 9/27 Thursday recitations: 9:00 PM on 9/28 Friday recitations: 9:00 PM on 9/29 Saturday recitations: 9:00 PM on 9/30 Ready!

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 213 http://acousticalsociety.org/ ICA 213 Montreal Montreal, Canada 2-7 June 213 Engineering Acoustics Session 2pEAb: Controlling Sound Quality 2pEAb1. Subjective

More information

CS106B Handout 34 Autumn 2012 November 12 th, 2012 Data Compression and Huffman Encoding

CS106B Handout 34 Autumn 2012 November 12 th, 2012 Data Compression and Huffman Encoding CS6B Handout 34 Autumn 22 November 2 th, 22 Data Compression and Huffman Encoding Handout written by Julie Zelenski. In the early 98s, personal computers had hard disks that were no larger than MB; today,

More information

HCI and Design SPRING 2016

HCI and Design SPRING 2016 HCI and Design SPRING 2016 Topics for today Heuristic Evaluation 10 usability heuristics How to do heuristic evaluation Project planning and proposals Usability Testing Formal usability testing in a lab

More information

ELE 201, Spring 2014 Laboratory No. 4 Compression, Error Correction, and Watermarking

ELE 201, Spring 2014 Laboratory No. 4 Compression, Error Correction, and Watermarking ELE 201, Spring 2014 Laboratory No. 4 Compression, Error Correction, and Watermarking 1 Introduction This lab focuses on the storage and protection of digital media. First, we ll take a look at ways to

More information

Compression of Stereo Images using a Huffman-Zip Scheme

Compression of Stereo Images using a Huffman-Zip Scheme Compression of Stereo Images using a Huffman-Zip Scheme John Hamann, Vickey Yeh Department of Electrical Engineering, Stanford University Stanford, CA 94304 jhamann@stanford.edu, vickey@stanford.edu Abstract

More information

Summary. I. Summary II. Download A. XLD Application B. Logchecker Plugin

Summary. I. Summary II. Download A. XLD Application B. Logchecker Plugin I. Summary II. Download A. XLD Application B. Logchecker Plugin III. Configuration IV. Ripping V. Log Analysis A. Main things to look for B. LOG CONSOLIDATION WARNING C. AccurateRip D. Errors (CDParanoia

More information

15 Data Compression 2014/9/21. Objectives After studying this chapter, the student should be able to: 15-1 LOSSLESS COMPRESSION

15 Data Compression 2014/9/21. Objectives After studying this chapter, the student should be able to: 15-1 LOSSLESS COMPRESSION 15 Data Compression Data compression implies sending or storing a smaller number of bits. Although many methods are used for this purpose, in general these methods can be divided into two broad categories:

More information

A Digital Audio Primer

A Digital Audio Primer Conversion of Sound Wave to Analog Signal A Digital Audio Primer Many people don t care about the technology behind their stereo system. As long as it sounds good and they can press a button and listen

More information

PREMIUM ATTENDANT GUIDE

PREMIUM ATTENDANT GUIDE wowforbusiness.com PREMIUM ATTENDANT GUIDE WOW! Business PAGM.U.1408.O Premium Attendant Table of Contents Product Overview...3 CommPortal Login Screen...4 Main Page...5 Schedule...6 Special Days...8 Menus

More information