Because of the good performance of vocoder and the potential

Size: px
Start display at page:

Download "Because of the good performance of vocoder and the potential"

Transcription

1 FINAL REVIEW ABOUT APPLIED FFT APPROACH IN PHASE VOCODER TO ACHIEVE TIME/PITCH SCALING Digital Audio Systems, DESC9115, 2018 Graduate Program in Audio and Acoustics Sydney School of Architecture, Design and Planning, The University of Sydney ABSTRACT Because of the good performance of vocoder and the potential value of time and pitch shifting, this paper attempts to create a phase vocoder APP through MATLAB, which is used for teaching or research in the initial stage. The main expectation of this APP can achieve sound, Window type, Spectrogram and Waveform real-time contrast and the diagram (or sound) fast switching. Later stages can be developed as a stand-alone vocoder, or as a Third-party plug-in. Hence, there are three problems in here. Firstly, what is the principle of the phase vocoder? Then, how to implement it in MATLAB? Finally, how to integrate these function into the GUI interface. The results show that GUI interface as expected, waveform, spectrogram, sound and window functions can appear simultaneously, and switching speed as expected. The drawback is that the overall design of the interface is not very intelligent, so it needs to be improved in the future. INTRODUCTION Due to the humorous voice, Talking Tom Cat became to one of the most popular APP around the world in a few years ago. Even now, the official channel of Talking Tom Cat still has more than 7 million followers on YouTube, and each new upload video has more than 2 million hits within two weeks[1]. Not only that, different rates of voice playback features are increasingly sought after by users, and many people were got attention by uploading these interesting sounds on Instagram and Facebook. The most incredible thing is a music APP Tik Tok has reached 120 million daily active users in April 2018 [2]. These cases embody the research value of audio production technology, and the key point of these technologies is time and pitch scaling. A sound can be understood as the superposition of various frequency sine waves, and the pitch is the main frequency of a sound. So, pitch shifting is changing the main frequency. The time and pitch scaling include two parts: time scaling (pitch constant) and pitch scaling (time constant). The time scaling is defined as the pitch unchanged and the speech rate is faster or slower. That also means the fundamental frequency value is almost constant. Pitch scaling means speech rate is constant, but fundamental frequency is changed. Normally, time and pitch scaling can be processed either in the time domain or frequency domain. Unfortunately, some approaches cannot separate those two effects since scaling the length of a signal can affect the pitch. But Phase vocoder is a way of scaling the length of a signal without affect the pitch[3]. Because of the good performance of vocoder and the potential value of time and pitch shifting, this paper attempts to create a phase vocoder APP through MATLAB, which is used for teaching or research in the initial stage. The main aim of this APP can achieve to demonstrate time and pitch scaling, Window type, Spectrogram and Waveform real-time contrast and the diagram (or sound) fast switching. In the future, it can be developed as a stand-alone vocoder, or as a Third-party plug-in. Therefore, there are mainly have three problems to solve, the first is to understand the principle of the phase vocoder, and then to implement it in MATLAB, and finally integrate these function into the GUI interface. 1. THEORETICAL CONSIDERATION The short time Fourier transform is a basic theory when implement phase vocoder. The algorithm and principle has been introduced in the initial review and lab report specifically. So, this final review just cites that key information Short Time Fourier Transform The Short Time Fourier Transform is proposed to carry out the partial analysis. Moreover, the prefect reconstruction can be achieved if overlapping window is unity [4]. There are many types of window function, which including the Hamming, the Hanning and Gauss window [4][5][6]. Figure 2 is an example to demonstrated how Hanning windows reconstruct a short period of DFT [4]. When take a Gaussian window it is generally called Gabor Transform [6]. However, time and frequency are two independent variables for the output of the STFT equation (shows in equation 1, t is time, and ω is radial frequency) [4]. So as it has time-frequency representation in each sampling point. Figure 1. Sum of small windows. Figure source from DAFX[2] Normally, we can use Hanning, Hamming and Gauss window optionally. In order to compare the effect of the window function on the signal, there are two kinds of window function is adopted in this phase vocoder, one is Gauss window (shows in equation 2) and the other one is Hanning window (shows in equation 3, window length L is N+1).

2 The Window Fourier Transform with Gaussian function is known as Gaboret transform which has the smallest time-frequency window. For this reason, Gabor transform is the optimal STFT[4][7][8]. We can obtain Gaboret transform equation if Substitute equation 2 into equation IMPLEMENTATION Figure 2. Phase vocoder implementation diagram Phase vocoder can extend the time domain and frequency domain by changing the phase of sound. The extension of time domain and frequency domain corresponds to the time scaling (change playback rate) and pitch scaling (change foundational frequency) [8]. The basic principle of FFT/IFFT approach is that the timefrequency representation can be regarded as a series of overlapping FFT (may include a window function). Since the FFT is reversible, adding the IFFT of vertical line can reconstruct the sound, that means reconstruct sound from time-frequency domain [4]. The signal diagram has demonstrated in figure 2. Base on this principle, there are three steps when using direct FFT/IFFT approach to implement vocoder. First step, use the FFT to compute the relationship between instantaneous frequency and amplitude of the signal. Second step, resampling the FFT blocks. Third step, perform an IFFT by take the inverse Fourier transform on each chunk and adding the resulting waveform chunks [3] Window function Comparison In step one, we can identify window function as well as compare the effect of the window function on the signal. so, we can use the Matlab built-in function gausswin() and hanning() for the signal. Figure 3. comparison between Hanning window and Gauss window Figure 3 shows that the main lobe of the Hanning window is slightly wider than the Gauss window, which is equivalent to widening the bandwidth of the analysis and decreasing the frequency resolution. From the angle of frequency resolution, Gauss window is superior to Hanning window. The Side lobe of Gauss window is higher than that of Hanning Window, so from the point of view of leakage, Gauss window is better than Hanning window[6] FFT/IFFT approach to Implement Time scaling The following figure 4 and 5 is time scaling result based on the direct FFT/IFFT approach, the function routine includes one vectors of the gentlemen sound (I am speaking from over here), window size (2048), analysis steps (512) and synthesis steps(384). It has shown the implementation result of time scaling from waveform diagram and spectrogram when we use FFT approach. Figure 4. FFT approaches implement time scaling (waveform diagram) Figure 5. FFT approaches implement time scaling (spectrogram)

3 2.3. FFT/IFFT to Implement Pitch Scaling The following figure 6 and 7 is pitch scaling result based on the direct FFT/IFFT approach, but this function is different with time-scaling function. its routine includes one vectors of the gentlemen sound (I am speaking from over here), window size (2048), pitch ratio (0.9) and synthesis steps (512). It has shown the implementation result of pitch scaling from waveform diagram and spectrogram when we use FFT approach. Pitch Ratio, normal pitch ratio and high pitch ratio. Not only that, The sound of time/pitch-scaling is audible simultaneously. Figure 6. FFT approaches implement pitch scaling (waveform diagram) Figure 8. GUI development 3. EVALUATION 3.1. Test APP Run Figure Figure 7. FFT approaches implement pitch scaling (spectrogram) 2.4. GUI development Figure 8 has demonstrated that phase vocoder APP design idea, and this GUI interface has been divided into 4 portions. Portion 1 is window function display area, which is controlled in portion 3. Once the control button in the portion 3 is clicked, the selected window function image will be displayed in portion 1. Portion 2 is waveform and spectrogram display area, which is controlled in portion 4. Once the control button in the portion 4 is clicked, the selected time/pitch scaling waveform and spectrogram will be displayed in portion 2 separately. Portion 3 is window function control area which designed with 3 buttons, including two window functions and a set of reciprocal comparisons. Portion 4 is the control area, mainly used to control the diagram in portion 2. The Control area has two groups, one use for Hanning windows function and the other one for Gauss windows function. Then, each group also includes the time-scaling and pitch-scaling two kinds of buttons, time-scaling is controlled by the playback rate, three buttons are 1.5,2,3 times the playback rate respectively; pitch-scaling three buttons are divided into low Figure 9. error when start to run figure Run the function file Pitch_shift.m, As shown in figure 9, when the app is started, the system complains about some programs, but the error does not affect the test results, so the author prepares to do a detailed error correction later. When we click OK, we will enter the app interface (as shown in figure 10).

4 3.4. Test APP Time Scaling Test Figure 13. Time scaling (rate is 1.5) with Hanning window function (left) and Gauss window function(right) Figure 10. interface of this APP 3.2. Test APP Window Function Test Figure 12 evaluated the operation of the time scaling with Hanning and Gauss window function, and the results show that it is in good condition. Same as pitch test, all sounds were as expected(all demo will be attached), and the scaled sounds sped up a lot. During the audition process, there is no way to distinguish between the two window function and the sound. Although the waveform and spectrogram of the two appear to be somewhat different Test APP Switching Time Figure 11. interface of this APP The three diagrams in Figure 11 evaluated the operation of the different window function, and the results show that it is in good condition Test APP Pitch Scaling Test Figure 14. calculation time for each button. Figure 12. Low pitch ratio of Pitch scaling with Hanning window function (left) and Gauss window function(right) Figure 12 evaluated the operation of the pitch scaling with Hanning and Gauss window function, and the results show that it is in good condition. All the scaled sounds were as expected(all demo will be attached), and the low pitch ratio sounded even more muffled. In the audition process, there is no direct difference between the two window function and the sound. The only difference between the two is the sound at the end, and the Gauss window function is slightly louder than the Hanning window function To test the speed of each operation, the ' tic ' and ' toc ' (built-in function) has been added in every function to calculate the time of operation. As shown in Figure 14, the operation time has been attached on top of each button. Strangely, the operation time of Rate (Gauss) is almost 50 times than to Rate (Hanning). One possibility is that algorithm of rate (Gauss) was used Gauss approach, its basic principle is similar to FFT algorithm, but it has an extra calculation item Gaborets[4]. Therefore, the increase in the time is likely to be calculated when the Gaborets generated. 4. CONCULSION The results show that GUI interface as expected, waveform, spectrogram, sound and window functions can appear

5 simultaneously. The switching speed also is satisfying (except for the group of Rate (Gauss)), which will bring good experience to users. The disadvantage is that the overall design of the interface is not very intelligent, so it needs to be improved in the future. 5. REFERENCES [1]"Talking Tom", YouTube, [Online]. Available: [Accessed: 07- Jun- 2018]. [2]"Tik Tok (app)", En.wikipedia.org, [Online]. Available: [Accessed: 08- Jun- 2018]. [3]"Audio time stretching and pitch scaling", En.wikipedia.org, [Online]. Available: scaling. [Accessed: 08- Jun- 2018]. [4]U. Zölzer, DAFX, 2nd ed. Chichester: Wiley, 2011, pp. Chapter 7 page [5]"Gabor transform", En.wikipedia.org, [Online]. Available: [6]"Window function", En.wikipedia.org, [Online]. Available: [7] J. Proakis and D. Manolakis, Digital signal processing, 4th ed. Harlow, Essex: Pearson, 2014, pp. chapter 4, page [8]"The Fast Fourier Transform Algorithm", YouTube, [Online]. Available:

Spectral modeling of musical sounds

Spectral modeling of musical sounds Spectral modeling of musical sounds Xavier Serra Audiovisual Institute, Pompeu Fabra University http://www.iua.upf.es xserra@iua.upf.es 1. Introduction Spectral based analysis/synthesis techniques offer

More information

REAL-TIME DIGITAL SIGNAL PROCESSING

REAL-TIME DIGITAL SIGNAL PROCESSING REAL-TIME DIGITAL SIGNAL PROCESSING FUNDAMENTALS, IMPLEMENTATIONS AND APPLICATIONS Third Edition Sen M. Kuo Northern Illinois University, USA Bob H. Lee Ittiam Systems, Inc., USA Wenshun Tian Sonus Networks,

More information

2.161 Signal Processing: Continuous and Discrete Fall 2008

2.161 Signal Processing: Continuous and Discrete Fall 2008 MIT OpenCourseWare http://ocw.mit.edu 2.161 Signal Processing: Continuous and Discrete Fall 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms. MASSACHUSETTS

More information

Rhythmic constant pitch time stretching for digital audio

Rhythmic constant pitch time stretching for digital audio Rhythmic constant pitch time stretching for digital audio Brendan TREVORROW ; University of Southern Queensland, Australia ABSTRACT Constant pitch time stretching is not uncommon in audio editing software,

More information

Two-Dimensional Fourier Processing of Rasterised Audio. Chris Pike

Two-Dimensional Fourier Processing of Rasterised Audio. Chris Pike Two-Dimensional Fourier Processing of Rasterised Audio Chris Pike cwp500@york.ac.uk June 13, 2008 Abstract A comprehensive software tool has been developed in Matlab to enable two-dimensional Fourier analysis

More information

Separation of speech mixture using time-frequency masking implemented on a DSP

Separation of speech mixture using time-frequency masking implemented on a DSP Separation of speech mixture using time-frequency masking implemented on a DSP Javier Gaztelumendi and Yoganathan Sivakumar March 13, 2017 1 Introduction This report describes the implementation of a blind

More information

MATLAB Apps for Teaching Digital Speech Processing

MATLAB Apps for Teaching Digital Speech Processing MATLAB Apps for Teaching Digital Speech Processing Lawrence Rabiner, Rutgers University Ronald Schafer, Stanford University GUI LITE 2.5 editor written by Maria d Souza and Dan Litvin MATLAB coding support

More information

An Improvement in Temporal Resolution of Seismic Data Using Logarithmic Time-frequency Transform Method

An Improvement in Temporal Resolution of Seismic Data Using Logarithmic Time-frequency Transform Method Iranian Journal of Oil & Gas Science and Technology, Vol. 4 (2015), No. 2, pp. 27-39 http://ijogst.put.ac.ir An Improvement in Temporal Resolution of Seismic Data Using Logarithmic Time-frequency Transform

More information

Reconstruction of Images Distorted by Water Waves

Reconstruction of Images Distorted by Water Waves Reconstruction of Images Distorted by Water Waves Arturo Donate and Eraldo Ribeiro Computer Vision Group Outline of the talk Introduction Analysis Background Method Experiments Conclusions Future Work

More information

COS 116 The Computational Universe Laboratory 4: Digital Sound and Music

COS 116 The Computational Universe Laboratory 4: Digital Sound and Music COS 116 The Computational Universe Laboratory 4: Digital Sound and Music In this lab you will learn about digital representations of sound and music, especially focusing on the role played by frequency

More information

COS 116 The Computational Universe Laboratory 4: Digital Sound and Music

COS 116 The Computational Universe Laboratory 4: Digital Sound and Music COS 116 The Computational Universe Laboratory 4: Digital Sound and Music In this lab you will learn about digital representations of sound and music, especially focusing on the role played by frequency

More information

Rapid Prototyping System for Teaching Real-Time Digital Signal Processing

Rapid Prototyping System for Teaching Real-Time Digital Signal Processing IEEE TRANSACTIONS ON EDUCATION, VOL. 43, NO. 1, FEBRUARY 2000 19 Rapid Prototyping System for Teaching Real-Time Digital Signal Processing Woon-Seng Gan, Member, IEEE, Yong-Kim Chong, Wilson Gong, and

More information

Chapter 6 Data Acquisition and Spectral Analysis System high-speed digitizer card for acquiring time-domain data. The digitizer is used in

Chapter 6 Data Acquisition and Spectral Analysis System high-speed digitizer card for acquiring time-domain data. The digitizer is used in Chapter 6 Data Acquisition and Spectral Analysis System 6.1 Introduction This chapter will discuss the hardware and software involved in developing the data acquisition and spectral analysis system. The

More information

Principles of Audio Coding

Principles of Audio Coding Principles of Audio Coding Topics today Introduction VOCODERS Psychoacoustics Equal-Loudness Curve Frequency Masking Temporal Masking (CSIT 410) 2 Introduction Speech compression algorithm focuses on exploiting

More information

Image Transformation Techniques Dr. Rajeev Srivastava Dept. of Computer Engineering, ITBHU, Varanasi

Image Transformation Techniques Dr. Rajeev Srivastava Dept. of Computer Engineering, ITBHU, Varanasi Image Transformation Techniques Dr. Rajeev Srivastava Dept. of Computer Engineering, ITBHU, Varanasi 1. Introduction The choice of a particular transform in a given application depends on the amount of

More information

Digital Recording and Playback

Digital Recording and Playback Digital Recording and Playback Digital recording is discrete a sound is stored as a set of discrete values that correspond to the amplitude of the analog wave at particular times Source: http://www.cycling74.com/docs/max5/tutorials/msp-tut/mspdigitalaudio.html

More information

A-DAFX: ADAPTIVE DIGITAL AUDIO EFFECTS. Verfaille V., Arfib D.

A-DAFX: ADAPTIVE DIGITAL AUDIO EFFECTS. Verfaille V., Arfib D. Proceedings of the COST G-6 Conference on Digital Audio Effects (DAFX-), Limerick, Ireland, December 6-8, A-DAFX: ADAPTIVE DIGITAL AUDIO EFFECTS Verfaille V., Arfib D. CNRS - LMA 3, chemin Joseph Aiguier

More information

Slow Motion Video. MediaTek Technology White Paper

Slow Motion Video. MediaTek Technology White Paper MediaTek Technology White Paper May 2015 Introducing Slow Motion Video MediaTek s Slow Motion Video technology allows the device user to easily capture, replay, and share a fast moving object any time.

More information

Chapter X Sampler Instrument

Chapter X Sampler Instrument Chapter X Sampler Instrument A sampler is a synthesizer that generates sound by playing recorded sounds rather than calculated waveforms. The wave file player instrument described in an earlier chapter

More information

Bluray (

Bluray ( Bluray (http://www.blu-ray.com/faq) MPEG-2 - enhanced for HD, also used for playback of DVDs and HDTV recordings MPEG-4 AVC - part of the MPEG-4 standard also known as H.264 (High Profile and Main Profile)

More information

USING AUDACITY: ROBUST, FREE, AND FULL- FEATURED SOFTWARE

USING AUDACITY: ROBUST, FREE, AND FULL- FEATURED SOFTWARE Slide 1 USING AUDACITY: ROBUST, FREE, AND FULL- FEATURED SOFTWARE This presentation contains audio; ensure your speakers are turned up. Click the forward arrow below to proceed to the next slide. Audacity

More information

Digital Media. Daniel Fuller ITEC 2110

Digital Media. Daniel Fuller ITEC 2110 Digital Media Daniel Fuller ITEC 2110 Daily Question: Digital Audio What values contribute to the file size of a digital audio file? Email answer to DFullerDailyQuestion@gmail.com Subject Line: ITEC2110-09

More information

Multimedia Systems Speech II Mahdi Amiri February 2012 Sharif University of Technology

Multimedia Systems Speech II Mahdi Amiri February 2012 Sharif University of Technology Course Presentation Multimedia Systems Speech II Mahdi Amiri February 2012 Sharif University of Technology Homework Original Sound Speech Quantization Companding parameter (µ) Compander Quantization bit

More information

A Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm

A Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm International Journal of Engineering Research and General Science Volume 3, Issue 4, July-August, 15 ISSN 91-2730 A Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm

More information

Adobe Sound Booth Tutorial

Adobe Sound Booth Tutorial Adobe Sound Booth Tutorial Recording your Voice in the Studio 1. Open Adobe Sound Booth 2. Click File>New>Empty Audio File 3. Hit the Record Button (red circle button at the bottom of the screen) 4. In

More information

Topics in Linguistic Theory: Laboratory Phonology Spring 2007

Topics in Linguistic Theory: Laboratory Phonology Spring 2007 MIT OpenCourseWare http://ocw.mit.edu 24.910 Topics in Linguistic Theory: Laboratory Phonology Spring 2007 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

Phase Vocoder Implementation with FLWT and TD-PSOLA

Phase Vocoder Implementation with FLWT and TD-PSOLA Phase Vocoder Implementation with FLWT and TD-PSOLA Terry Kong Stanford University Email: tckong@stanford.edu Abstract The following is a report written for EE 64 s final project. The goal of the project

More information

S4B Ringtone Creator Soft4Boost Help S4B Ringtone Creator www.sorentioapps.com Sorentio Systems, Ltd. All rights reserved Contact Us If you have any comments, suggestions or questions regarding S4B Ringtone

More information

Criterion E: Product Development

Criterion E: Product Development Criterion E: Product Development Techniques used to address client s requirements: Pre-Video Setup.pg. 1 Video Layout...pg. 2 Editing Video Clips pg. 3-5 Uploading my Videos to Youtube..pg. 6-7 My Youtube

More information

Mahdi Amiri. February Sharif University of Technology

Mahdi Amiri. February Sharif University of Technology Course Presentation Multimedia Systems Speech II Mahdi Amiri February 2014 Sharif University of Technology Speech Compression Road Map Based on Time Domain analysis Differential Pulse-Code Modulation (DPCM)

More information

Digital Image Processing. Image Enhancement in the Frequency Domain

Digital Image Processing. Image Enhancement in the Frequency Domain Digital Image Processing Image Enhancement in the Frequency Domain Topics Frequency Domain Enhancements Fourier Transform Convolution High Pass Filtering in Frequency Domain Low Pass Filtering in Frequency

More information

Laboratory Exercise #5

Laboratory Exercise #5 ECEN4002/5002 Spring 2003 Digital Signal Processing Laboratory Laboratory Exercise #5 Signal Synthesis Introduction Up to this point we have been developing and implementing signal processing algorithms:

More information

Multimedia Systems Speech II Hmid R. Rabiee Mahdi Amiri February 2015 Sharif University of Technology

Multimedia Systems Speech II Hmid R. Rabiee Mahdi Amiri February 2015 Sharif University of Technology Course Presentation Multimedia Systems Speech II Hmid R. Rabiee Mahdi Amiri February 25 Sharif University of Technology Speech Compression Road Map Based on Time Domain analysis Differential Pulse-Code

More information

Audio-coding standards

Audio-coding standards Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.

More information

Module 9 AUDIO CODING. Version 2 ECE IIT, Kharagpur

Module 9 AUDIO CODING. Version 2 ECE IIT, Kharagpur Module 9 AUDIO CODING Lesson 29 Transform and Filter banks Instructional Objectives At the end of this lesson, the students should be able to: 1. Define the three layers of MPEG-1 audio coding. 2. Define

More information

PHYS 5061 Lab 1: Introduction to LabVIEW

PHYS 5061 Lab 1: Introduction to LabVIEW PHYS 5061 Lab 1: Introduction to LabVIEW In this lab, you will work through chapter 1 and 2 of Essick s book to become familiar with using LabVIEW to build simple programs, called VI s in LabVIEW-speak,

More information

Tree-based Cluster Weighted Modeling: Towards A Massively Parallel Real- Time Digital Stradivarius

Tree-based Cluster Weighted Modeling: Towards A Massively Parallel Real- Time Digital Stradivarius Tree-based Cluster Weighted Modeling: Towards A Massively Parallel Real- Time Digital Stradivarius Edward S. Boyden III e@media.mit.edu Physics and Media Group MIT Media Lab 0 Ames St. Cambridge, MA 039

More information

FFT and Spectrum Analyzer

FFT and Spectrum Analyzer Objective: FFT and Spectrum Analyzer Understand the frequency domain and some of it's uses Understand the Discrete Fourier Transform Preparation: (i) PC and data acquisition system (ii)fft software Background:

More information

Audio-coding standards

Audio-coding standards Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.

More information

A Non-Iterative Approach to Frequency Estimation of a Complex Exponential in Noise by Interpolation of Fourier Coefficients

A Non-Iterative Approach to Frequency Estimation of a Complex Exponential in Noise by Interpolation of Fourier Coefficients A on-iterative Approach to Frequency Estimation of a Complex Exponential in oise by Interpolation of Fourier Coefficients Shahab Faiz Minhas* School of Electrical and Electronics Engineering University

More information

Introduction to Podcasting

Introduction to Podcasting 2011 Introduction to Podcasting Part 2: Recording Your Podcast Adam Read University College Plymouth St Mark & St John 10/Jan/2011 About this guide This is part 2 in the elearning essentials: Introduction

More information

Camtasia Studio 7 User Guide

Camtasia Studio 7 User Guide Camtasia Studio 7 User Guide TechSmith & Camtasia Studio: TechSmith Corporation released popular desktop recording tools like Snagit, Jing, and Camtasia. TechSmith also launched Screencast.com, a screencast

More information

Fundamentals of Digital Image Processing

Fundamentals of Digital Image Processing \L\.6 Gw.i Fundamentals of Digital Image Processing A Practical Approach with Examples in Matlab Chris Solomon School of Physical Sciences, University of Kent, Canterbury, UK Toby Breckon School of Engineering,

More information

Audacity tutorial. 1. Look for the Audacity icon on your computer desktop. 2. Open the program. You get the basic screen.

Audacity tutorial. 1. Look for the Audacity icon on your computer desktop. 2. Open the program. You get the basic screen. Audacity tutorial What does Audacity do? It helps you record and edit audio files. You can record a speech through a microphone into your computer, into the Audacity program, then fix up the bits that

More information

5.2. The Sine Function and the Cosine Function. Investigate A

5.2. The Sine Function and the Cosine Function. Investigate A 5.2 The Sine Function and the Cosine Function What do an oceanographer, a stock analyst, an audio engineer, and a musician playing electronic instruments have in common? They all deal with periodic patterns.

More information

Numerical Propagation Manual

Numerical Propagation Manual Numerical Propagation Manual Numerical Propagation Manual 1 Numerical Propagation Manual Version 1.2 Juan Pablo Piedrahita Quintero, Raúl Andrés Castañeda Quintero and Jorge Iván García Sucerquia Monday,

More information

1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends

1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends Contents List of Figures List of Tables Contributing Authors xiii xxi xxiii Introduction Karlheinz Brandenburg and Mark Kahrs xxix 1 Audio quality determination based on perceptual measurement techniques

More information

(Refer Slide Time: 01:25)

(Refer Slide Time: 01:25) Computer Architecture Prof. Anshul Kumar Department of Computer Science and Engineering Indian Institute of Technology, Delhi Lecture - 32 Memory Hierarchy: Virtual Memory (contd.) We have discussed virtual

More information

EE482: Digital Signal Processing Applications

EE482: Digital Signal Processing Applications Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 13 Audio Signal Processing 14/04/01 http://www.ee.unlv.edu/~b1morris/ee482/

More information

MATH STUDENT BOOK. 12th Grade Unit 4

MATH STUDENT BOOK. 12th Grade Unit 4 MATH STUDENT BOOK th Grade Unit Unit GRAPHING AND INVERSE FUNCTIONS MATH 0 GRAPHING AND INVERSE FUNCTIONS INTRODUCTION. GRAPHING 5 GRAPHING AND AMPLITUDE 5 PERIOD AND FREQUENCY VERTICAL AND HORIZONTAL

More information

Working with Windows Movie Maker

Working with Windows Movie Maker 518 442-3608 Working with Windows Movie Maker Windows Movie Maker allows you to make movies and slide shows that can be saved to your computer, put on a CD, uploaded to a Web service (such as YouTube)

More information

21M.361 Composing with Computers I (Electronic Music Composition) Spring 2008

21M.361 Composing with Computers I (Electronic Music Composition) Spring 2008 MIT OpenCourseWare http://ocw.mit.edu 21M.361 Composing with Computers I (Electronic Music Composition) Spring 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

Model parametrization strategies for Newton-based acoustic full waveform

Model parametrization strategies for Newton-based acoustic full waveform Model parametrization strategies for Newton-based acoustic full waveform inversion Amsalu Y. Anagaw, University of Alberta, Edmonton, Canada, aanagaw@ualberta.ca Summary This paper studies the effects

More information

WAVELET USE FOR IMAGE RESTORATION

WAVELET USE FOR IMAGE RESTORATION WAVELET USE FOR IMAGE RESTORATION Jiří PTÁČEK and Aleš PROCHÁZKA 1 Institute of Chemical Technology, Prague Department of Computing and Control Engineering Technicka 5, 166 28 Prague 6, Czech Republic

More information

A building structural-performance monitoring system using RFID tag with sensors

A building structural-performance monitoring system using RFID tag with sensors icccbe 21 Nottingham University Press Proceedings of the International Conference on Computing in Civil and Building Engineering W Tizani (Editor) A building structural-performance monitoring system using

More information

Software for Analysis, Processing, and Creation of Signals and Related Applications

Software for Analysis, Processing, and Creation of Signals and Related Applications TM Overview Software for Analysis, Processing, and Creation of Signals and Related Applications Technology is always evolving - isn't it time your data analysis evolved as well? Technology is always evolving

More information

(A simplified version of this document is available for applicants who had applied in previous years.)

(A simplified version of this document is available for applicants who had applied in previous years.) NYO Canada 2019 Auditions ( en français ) (A simplified version of this document is available for applicants who had applied in previous years.) Digital Audition Process Since 2013 NYO Canada has implemented

More information

DAY 28 - ARITHMETIC SEQUENCES

DAY 28 - ARITHMETIC SEQUENCES DAY 28 - ARITHMETIC SEQUENCES ARITHMETIC SEQUENCE An ARITHMETIC SEQUENCE is where the rule of the pattern is always ADDED. The rule is called the COMMON DIFFERENCE ARITHMETIC SEQUENCE You can use the following

More information

Experimental reconstruction of a highly reflecting fiber Bragg grating by using spectral regularization and inverse scattering

Experimental reconstruction of a highly reflecting fiber Bragg grating by using spectral regularization and inverse scattering 3284 J. Opt. Soc. Am. A/ Vol. 24, No. 10/ October 2007 Rosenthal et al. Experimental reconstruction of a highly reflecting fiber Bragg grating by using spectral regularization and inverse scattering Amir

More information

Fundamental of Digital Media Design. Introduction to Audio

Fundamental of Digital Media Design. Introduction to Audio Fundamental of Digital Media Design Introduction to Audio by Noraniza Samat Faculty of Computer Systems & Software Engineering noraniza@ump.edu.my OER Fundamental of Digital Media Design by Noraniza Samat

More information

Michael Moody School of Pharmacy University of London 29/39 Brunswick Square London WC1N 1AX, U.K.

Michael Moody School of Pharmacy University of London 29/39 Brunswick Square London WC1N 1AX, U.K. This material is provided for educational use only. The information in these slides including all data, images and related materials are the property of : Michael Moody School of Pharmacy University of

More information

USER GUIDE FOR PREDICTION ERROR METHOD OF ADAPTIVE FEEDBACK CANCELLER ON ios PLATFORM FOR HEARING AID APPLICATIONS

USER GUIDE FOR PREDICTION ERROR METHOD OF ADAPTIVE FEEDBACK CANCELLER ON ios PLATFORM FOR HEARING AID APPLICATIONS Page 1 of 13 USER GUIDE FOR PREDICTION ERROR METHOD OF ADAPTIVE FEEDBACK CANCELLER ON ios PLATFORM FOR HEARING AID APPLICATIONS Parth Mishra, Anshuman Ganguly, Nikhil Shankar STATISTICAL SIGNAL PROCESSING

More information

TIME-FREQUENCY SPECTRA OF MUSIC

TIME-FREQUENCY SPECTRA OF MUSIC TIME-FREQUENCY SPECTRA OF MUSIC James S. Walker Department of Mathematics University of Wisconsin Eau Claire walkerjs@uwec.edu Amanda J. Potts Department of Mathematics University of Nebraska Lincoln Abstract

More information

Lesson 12: Sine 5 = 15 3

Lesson 12: Sine 5 = 15 3 Lesson 12: Sine How did ou do on that last worksheet? Is finding the opposite side and adjacent side of an angle super-duper eas for ou now? Good, now I can show ou wh I wanted ou to learn that first.

More information

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects. Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general

More information

Project Plan of Simultaneous Co-Test of high performance DAC ADC Pairs

Project Plan of Simultaneous Co-Test of high performance DAC ADC Pairs Project Plan of Simultaneous Co-Test of high performance DAC ADC Pairs Iowa State University Senior Design Project 2012-2013 May13-28 Adviser Dr. Degang Chen Members Luke Goetzke Tao Chen Ben Magstadt

More information

Implementation of FFT Processor using Urdhva Tiryakbhyam Sutra of Vedic Mathematics

Implementation of FFT Processor using Urdhva Tiryakbhyam Sutra of Vedic Mathematics Implementation of FFT Processor using Urdhva Tiryakbhyam Sutra of Vedic Mathematics Yojana Jadhav 1, A.P. Hatkar 2 PG Student [VLSI & Embedded system], Dept. of ECE, S.V.I.T Engineering College, Chincholi,

More information

For Mac and iphone. James McCartney Core Audio Engineer. Eric Allamanche Core Audio Engineer

For Mac and iphone. James McCartney Core Audio Engineer. Eric Allamanche Core Audio Engineer For Mac and iphone James McCartney Core Audio Engineer Eric Allamanche Core Audio Engineer 2 3 James McCartney Core Audio Engineer 4 Topics About audio representation formats Converting audio Processing

More information

Blind one-microphone speech separation: A spectral learning approach

Blind one-microphone speech separation: A spectral learning approach Blind one-microphone speech separation: A spectral learning approach Francis R. Bach Computer Science University of California Berkeley, CA 94720 fbach@cs.berkeley.edu Michael I. Jordan Computer Science

More information

Assistive Technology Training Software Guide

Assistive Technology Training Software Guide Assistive Technology Training Software Guide 1 Contents 1. Getting Started 1.1 Starting a new project 4 1.2 The Notetaker window layout 5 2. Importing to the Images Pane 2.1 Importing slides (from PowerPoint)

More information

Explicit consistency constraints for STFT spectrograms and their application to phase reconstruction

Explicit consistency constraints for STFT spectrograms and their application to phase reconstruction Explicit consistency constraints for STFT spectrograms and their application to phase reconstruction Jonathan Le Roux, obutaka Ono and Shigeki Sagayama Graduate School of Information Science and Technology,

More information

Design and implementation of a Beat Detector algorithm

Design and implementation of a Beat Detector algorithm Design and implementation of a Beat Detector algorithm (ETIN80 report) Jaime Gancedo, ja1626ga-s@student.lu.se Sakif Hossain, sa5536ho-s@student.lu.se Wenpeng Song, we3543so-s@student.lu.se March 11, 2018

More information

DSP-CIS. Part-IV : Filter Banks & Subband Systems. Chapter-10 : Filter Bank Preliminaries. Marc Moonen

DSP-CIS. Part-IV : Filter Banks & Subband Systems. Chapter-10 : Filter Bank Preliminaries. Marc Moonen DSP-CIS Part-IV Filter Banks & Subband Systems Chapter-0 Filter Bank Preliminaries Marc Moonen Dept. E.E./ESAT-STADIUS, KU Leuven marc.moonen@esat.kuleuven.be www.esat.kuleuven.be/stadius/ Part-III Filter

More information

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 SUBJECTIVE AND OBJECTIVE QUALITY EVALUATION FOR AUDIO WATERMARKING BASED ON SINUSOIDAL AMPLITUDE MODULATION PACS: 43.10.Pr, 43.60.Ek

More information

An Audio Fingerprinting System for Live Version Identification using Image Processing Techniques

An Audio Fingerprinting System for Live Version Identification using Image Processing Techniques An Audio Fingerprinting System for Live Version Identification using Image Processing Techniques (Dr.?) Zafar Rafii Northwestern University EECS department Acknowledgments Work done during internship at

More information

6 Using Noise Reduction Filters

6 Using Noise Reduction Filters 6 Using Noise Reduction Filters You can add new life to old or low quality audio recordings using Adobe Audition s enhancement and restoration effects. Distracting hiss or background noise can easily be

More information

AUDACITY DIGITAL AUDIO SOFTWARE

AUDACITY DIGITAL AUDIO SOFTWARE AUDACITY DIGITAL AUDIO SOFTWARE TABLE OF CONTENTS (Select a link below to jump to that section of the document) Importing Music Into Audacity... 2 Where To Get Sound Files... 2 Change The Volume of Sound

More information

The MPC Renaissance & MPC Studio Bible - Demo Tutorial (For MPC Software 2.x)

The MPC Renaissance & MPC Studio Bible - Demo Tutorial (For MPC Software 2.x) The MPC Renaissance & MPC Studio Bible - Demo Tutorial (For MPC Software 2.x) This free tutorial is taken from MPC-Tutor s tutorial book The MPC Renaissance & MPC Studio Bible (MPC Software 2.1 Edition).

More information

Audacity: How- To. Import audio (a song or SFX) Before we start. Import song into Audacity

Audacity: How- To. Import audio (a song or SFX) Before we start. Import song into Audacity Audacity: How- To music technology training Import audio (a song or SFX) Before we start You can t import a song into Audacity directly from a CD. You need to rip the required track from the CD using a

More information

From Fourier Transform to Wavelets

From Fourier Transform to Wavelets From Fourier Transform to Wavelets Otto Seppälä April . TRANSFORMS.. BASIS FUNCTIONS... SOME POSSIBLE BASIS FUNCTION CONDITIONS... Orthogonality... Redundancy...3. Compact support.. FOURIER TRANSFORMS

More information

Digital Fundamentals

Digital Fundamentals Digital Fundamentals Tenth Edition Floyd Chapter 1 Modified by Yuttapong Jiraraksopakun Floyd, Digital Fundamentals, 10 th 2008 Pearson Education ENE, KMUTT ed 2009 Analog Quantities Most natural quantities

More information

Audacity Tutorial C. Stanley

Audacity Tutorial C. Stanley Audacity Tutorial C. Stanley Getting to Know Audacity: Silence Keys Microphone Select Editing Tools Recording Tools Cut, Copy, Paste Undo, Redo Zoom How to Record: Select external microphone. Press the

More information

Sales Manual Part II

Sales Manual Part II Sales Manual Part II In this sales manual, you ll be able to show how to make a song and create a WAV file of the song. Table of Contents Page 1. Main Features of the Sequencer 2 2. How to Demo the Sequencer

More information

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general

More information

Checklist. ORB Education Quality Teaching Resources. ORB Education Visit for the full, editable versions.

Checklist. ORB Education Quality Teaching Resources. ORB Education Visit   for the full, editable versions. Checklist 1. Data in our Lives 2. Representing Data 3. Working with Data 4. Introducing Spreadsheets 5. Basic Excel Skills 6. Google Sheets 7. Sorting and Filtering 8. Formulas 9. Plots and Trendlines

More information

Surface Wave Suppression with Joint S Transform and TT Transform

Surface Wave Suppression with Joint S Transform and TT Transform Available online at www.sciencedirect.com Procedia Earth and Planetary Science 3 ( 011 ) 46 5 011 Xian International Conference on Fine Geological Exploration and Groundwater & Gas Hazards Control in Coal

More information

Repeating Segment Detection in Songs using Audio Fingerprint Matching

Repeating Segment Detection in Songs using Audio Fingerprint Matching Repeating Segment Detection in Songs using Audio Fingerprint Matching Regunathan Radhakrishnan and Wenyu Jiang Dolby Laboratories Inc, San Francisco, USA E-mail: regu.r@dolby.com Institute for Infocomm

More information

Digital Signal Processing Lecture Notes 22 November 2010

Digital Signal Processing Lecture Notes 22 November 2010 Digital Signal Processing Lecture otes 22 ovember 2 Topics: Discrete Cosine Transform FFT Linear and Circular Convolution Rate Conversion Includes review of Fourier transforms, properties of Fourier transforms,

More information

How to change clip speed and direction

How to change clip speed and direction Adobe Premiere Pro CS6 Project 6 guide How to change clip speed and direction Editors frequently want to change clip speed or direction. With Adobe Premiere Pro CS6, you can edit a video clip so a sprinter

More information

Filterbanks and transforms

Filterbanks and transforms Filterbanks and transforms Sources: Zölzer, Digital audio signal processing, Wiley & Sons. Saramäki, Multirate signal processing, TUT course. Filterbanks! Introduction! Critical sampling, half-band filter!

More information

Hand Controlled Audio Synthesizer

Hand Controlled Audio Synthesizer Alex Sanchez & Behram Mistree 6.111 Final Project Proposal 11.3.2006 Coordinate TA: Javy Hand Controlled Audio Synthesizer 1. Proposed Functionality We propose to build a device that incorporates a video

More information

A New Technique of Extraction of Edge Detection Using Digital Image Processing

A New Technique of Extraction of Edge Detection Using Digital Image Processing International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) A New Technique of Extraction of Edge Detection Using Digital Image Processing Balaji S.C.K 1 1, Asst Professor S.V.I.T Abstract:

More information

Plane Wave Imaging Using Phased Array Arno Volker 1

Plane Wave Imaging Using Phased Array Arno Volker 1 11th European Conference on Non-Destructive Testing (ECNDT 2014), October 6-10, 2014, Prague, Czech Republic More Info at Open Access Database www.ndt.net/?id=16409 Plane Wave Imaging Using Phased Array

More information

CALCULATING SNELL S LAW

CALCULATING SNELL S LAW CALCULATING SNELL S LAW What is Snell s Law? Snell s Law defines angular relationships of sound waves crossing an interface. The calculations for Snell s Law will come into play when we are conducting

More information

Analysis of Radix- SDF Pipeline FFT Architecture in VLSI Using Chip Scope

Analysis of Radix- SDF Pipeline FFT Architecture in VLSI Using Chip Scope Analysis of Radix- SDF Pipeline FFT Architecture in VLSI Using Chip Scope G. Mohana Durga 1, D.V.R. Mohan 2 1 M.Tech Student, 2 Professor, Department of ECE, SRKR Engineering College, Bhimavaram, Andhra

More information

Implementing a Speech Recognition System on a GPU using CUDA. Presented by Omid Talakoub Astrid Yi

Implementing a Speech Recognition System on a GPU using CUDA. Presented by Omid Talakoub Astrid Yi Implementing a Speech Recognition System on a GPU using CUDA Presented by Omid Talakoub Astrid Yi Outline Background Motivation Speech recognition algorithm Implementation steps GPU implementation strategies

More information

How to edit audio tracks

How to edit audio tracks How to edit audio tracks Although at times you will use Adobe Audition to record new audio, you will often use it to edit, clean up, or add effects to an existing audio file such as an audio file recorded

More information

FAST AND ACCURATE TRANSIENT ULTRASOUND PROPAGATION AND B-MODE IMAGING SIMULATION METHODS. Yi Zhu

FAST AND ACCURATE TRANSIENT ULTRASOUND PROPAGATION AND B-MODE IMAGING SIMULATION METHODS. Yi Zhu FAST AND ACCURATE TRANSIENT ULTRASOUND PROPAGATION AND B-MODE IMAGING SIMULATION METHODS By Yi Zhu A THESIS Submitted to Michigan State University in partial fulfillment of the requirements for the degree

More information

Assignment 3: Edge Detection

Assignment 3: Edge Detection Assignment 3: Edge Detection - EE Affiliate I. INTRODUCTION This assignment looks at different techniques of detecting edges in an image. Edge detection is a fundamental tool in computer vision to analyse

More information

How to Change the Default Playback & Recording Audio Device. How to Change the Default Playback Device

How to Change the Default Playback & Recording Audio Device. How to Change the Default Playback Device How to Change the Default Playback & Recording Audio Device Sound is a very important part of our computing experience. We listen to music, do voice chat, watch movies, play games, record sound, etc. In

More information