Because of the good performance of vocoder and the potential
|
|
- Marianna Melton
- 5 years ago
- Views:
Transcription
1 FINAL REVIEW ABOUT APPLIED FFT APPROACH IN PHASE VOCODER TO ACHIEVE TIME/PITCH SCALING Digital Audio Systems, DESC9115, 2018 Graduate Program in Audio and Acoustics Sydney School of Architecture, Design and Planning, The University of Sydney ABSTRACT Because of the good performance of vocoder and the potential value of time and pitch shifting, this paper attempts to create a phase vocoder APP through MATLAB, which is used for teaching or research in the initial stage. The main expectation of this APP can achieve sound, Window type, Spectrogram and Waveform real-time contrast and the diagram (or sound) fast switching. Later stages can be developed as a stand-alone vocoder, or as a Third-party plug-in. Hence, there are three problems in here. Firstly, what is the principle of the phase vocoder? Then, how to implement it in MATLAB? Finally, how to integrate these function into the GUI interface. The results show that GUI interface as expected, waveform, spectrogram, sound and window functions can appear simultaneously, and switching speed as expected. The drawback is that the overall design of the interface is not very intelligent, so it needs to be improved in the future. INTRODUCTION Due to the humorous voice, Talking Tom Cat became to one of the most popular APP around the world in a few years ago. Even now, the official channel of Talking Tom Cat still has more than 7 million followers on YouTube, and each new upload video has more than 2 million hits within two weeks[1]. Not only that, different rates of voice playback features are increasingly sought after by users, and many people were got attention by uploading these interesting sounds on Instagram and Facebook. The most incredible thing is a music APP Tik Tok has reached 120 million daily active users in April 2018 [2]. These cases embody the research value of audio production technology, and the key point of these technologies is time and pitch scaling. A sound can be understood as the superposition of various frequency sine waves, and the pitch is the main frequency of a sound. So, pitch shifting is changing the main frequency. The time and pitch scaling include two parts: time scaling (pitch constant) and pitch scaling (time constant). The time scaling is defined as the pitch unchanged and the speech rate is faster or slower. That also means the fundamental frequency value is almost constant. Pitch scaling means speech rate is constant, but fundamental frequency is changed. Normally, time and pitch scaling can be processed either in the time domain or frequency domain. Unfortunately, some approaches cannot separate those two effects since scaling the length of a signal can affect the pitch. But Phase vocoder is a way of scaling the length of a signal without affect the pitch[3]. Because of the good performance of vocoder and the potential value of time and pitch shifting, this paper attempts to create a phase vocoder APP through MATLAB, which is used for teaching or research in the initial stage. The main aim of this APP can achieve to demonstrate time and pitch scaling, Window type, Spectrogram and Waveform real-time contrast and the diagram (or sound) fast switching. In the future, it can be developed as a stand-alone vocoder, or as a Third-party plug-in. Therefore, there are mainly have three problems to solve, the first is to understand the principle of the phase vocoder, and then to implement it in MATLAB, and finally integrate these function into the GUI interface. 1. THEORETICAL CONSIDERATION The short time Fourier transform is a basic theory when implement phase vocoder. The algorithm and principle has been introduced in the initial review and lab report specifically. So, this final review just cites that key information Short Time Fourier Transform The Short Time Fourier Transform is proposed to carry out the partial analysis. Moreover, the prefect reconstruction can be achieved if overlapping window is unity [4]. There are many types of window function, which including the Hamming, the Hanning and Gauss window [4][5][6]. Figure 2 is an example to demonstrated how Hanning windows reconstruct a short period of DFT [4]. When take a Gaussian window it is generally called Gabor Transform [6]. However, time and frequency are two independent variables for the output of the STFT equation (shows in equation 1, t is time, and ω is radial frequency) [4]. So as it has time-frequency representation in each sampling point. Figure 1. Sum of small windows. Figure source from DAFX[2] Normally, we can use Hanning, Hamming and Gauss window optionally. In order to compare the effect of the window function on the signal, there are two kinds of window function is adopted in this phase vocoder, one is Gauss window (shows in equation 2) and the other one is Hanning window (shows in equation 3, window length L is N+1).
2 The Window Fourier Transform with Gaussian function is known as Gaboret transform which has the smallest time-frequency window. For this reason, Gabor transform is the optimal STFT[4][7][8]. We can obtain Gaboret transform equation if Substitute equation 2 into equation IMPLEMENTATION Figure 2. Phase vocoder implementation diagram Phase vocoder can extend the time domain and frequency domain by changing the phase of sound. The extension of time domain and frequency domain corresponds to the time scaling (change playback rate) and pitch scaling (change foundational frequency) [8]. The basic principle of FFT/IFFT approach is that the timefrequency representation can be regarded as a series of overlapping FFT (may include a window function). Since the FFT is reversible, adding the IFFT of vertical line can reconstruct the sound, that means reconstruct sound from time-frequency domain [4]. The signal diagram has demonstrated in figure 2. Base on this principle, there are three steps when using direct FFT/IFFT approach to implement vocoder. First step, use the FFT to compute the relationship between instantaneous frequency and amplitude of the signal. Second step, resampling the FFT blocks. Third step, perform an IFFT by take the inverse Fourier transform on each chunk and adding the resulting waveform chunks [3] Window function Comparison In step one, we can identify window function as well as compare the effect of the window function on the signal. so, we can use the Matlab built-in function gausswin() and hanning() for the signal. Figure 3. comparison between Hanning window and Gauss window Figure 3 shows that the main lobe of the Hanning window is slightly wider than the Gauss window, which is equivalent to widening the bandwidth of the analysis and decreasing the frequency resolution. From the angle of frequency resolution, Gauss window is superior to Hanning window. The Side lobe of Gauss window is higher than that of Hanning Window, so from the point of view of leakage, Gauss window is better than Hanning window[6] FFT/IFFT approach to Implement Time scaling The following figure 4 and 5 is time scaling result based on the direct FFT/IFFT approach, the function routine includes one vectors of the gentlemen sound (I am speaking from over here), window size (2048), analysis steps (512) and synthesis steps(384). It has shown the implementation result of time scaling from waveform diagram and spectrogram when we use FFT approach. Figure 4. FFT approaches implement time scaling (waveform diagram) Figure 5. FFT approaches implement time scaling (spectrogram)
3 2.3. FFT/IFFT to Implement Pitch Scaling The following figure 6 and 7 is pitch scaling result based on the direct FFT/IFFT approach, but this function is different with time-scaling function. its routine includes one vectors of the gentlemen sound (I am speaking from over here), window size (2048), pitch ratio (0.9) and synthesis steps (512). It has shown the implementation result of pitch scaling from waveform diagram and spectrogram when we use FFT approach. Pitch Ratio, normal pitch ratio and high pitch ratio. Not only that, The sound of time/pitch-scaling is audible simultaneously. Figure 6. FFT approaches implement pitch scaling (waveform diagram) Figure 8. GUI development 3. EVALUATION 3.1. Test APP Run Figure Figure 7. FFT approaches implement pitch scaling (spectrogram) 2.4. GUI development Figure 8 has demonstrated that phase vocoder APP design idea, and this GUI interface has been divided into 4 portions. Portion 1 is window function display area, which is controlled in portion 3. Once the control button in the portion 3 is clicked, the selected window function image will be displayed in portion 1. Portion 2 is waveform and spectrogram display area, which is controlled in portion 4. Once the control button in the portion 4 is clicked, the selected time/pitch scaling waveform and spectrogram will be displayed in portion 2 separately. Portion 3 is window function control area which designed with 3 buttons, including two window functions and a set of reciprocal comparisons. Portion 4 is the control area, mainly used to control the diagram in portion 2. The Control area has two groups, one use for Hanning windows function and the other one for Gauss windows function. Then, each group also includes the time-scaling and pitch-scaling two kinds of buttons, time-scaling is controlled by the playback rate, three buttons are 1.5,2,3 times the playback rate respectively; pitch-scaling three buttons are divided into low Figure 9. error when start to run figure Run the function file Pitch_shift.m, As shown in figure 9, when the app is started, the system complains about some programs, but the error does not affect the test results, so the author prepares to do a detailed error correction later. When we click OK, we will enter the app interface (as shown in figure 10).
4 3.4. Test APP Time Scaling Test Figure 13. Time scaling (rate is 1.5) with Hanning window function (left) and Gauss window function(right) Figure 10. interface of this APP 3.2. Test APP Window Function Test Figure 12 evaluated the operation of the time scaling with Hanning and Gauss window function, and the results show that it is in good condition. Same as pitch test, all sounds were as expected(all demo will be attached), and the scaled sounds sped up a lot. During the audition process, there is no way to distinguish between the two window function and the sound. Although the waveform and spectrogram of the two appear to be somewhat different Test APP Switching Time Figure 11. interface of this APP The three diagrams in Figure 11 evaluated the operation of the different window function, and the results show that it is in good condition Test APP Pitch Scaling Test Figure 14. calculation time for each button. Figure 12. Low pitch ratio of Pitch scaling with Hanning window function (left) and Gauss window function(right) Figure 12 evaluated the operation of the pitch scaling with Hanning and Gauss window function, and the results show that it is in good condition. All the scaled sounds were as expected(all demo will be attached), and the low pitch ratio sounded even more muffled. In the audition process, there is no direct difference between the two window function and the sound. The only difference between the two is the sound at the end, and the Gauss window function is slightly louder than the Hanning window function To test the speed of each operation, the ' tic ' and ' toc ' (built-in function) has been added in every function to calculate the time of operation. As shown in Figure 14, the operation time has been attached on top of each button. Strangely, the operation time of Rate (Gauss) is almost 50 times than to Rate (Hanning). One possibility is that algorithm of rate (Gauss) was used Gauss approach, its basic principle is similar to FFT algorithm, but it has an extra calculation item Gaborets[4]. Therefore, the increase in the time is likely to be calculated when the Gaborets generated. 4. CONCULSION The results show that GUI interface as expected, waveform, spectrogram, sound and window functions can appear
5 simultaneously. The switching speed also is satisfying (except for the group of Rate (Gauss)), which will bring good experience to users. The disadvantage is that the overall design of the interface is not very intelligent, so it needs to be improved in the future. 5. REFERENCES [1]"Talking Tom", YouTube, [Online]. Available: [Accessed: 07- Jun- 2018]. [2]"Tik Tok (app)", En.wikipedia.org, [Online]. Available: [Accessed: 08- Jun- 2018]. [3]"Audio time stretching and pitch scaling", En.wikipedia.org, [Online]. Available: scaling. [Accessed: 08- Jun- 2018]. [4]U. Zölzer, DAFX, 2nd ed. Chichester: Wiley, 2011, pp. Chapter 7 page [5]"Gabor transform", En.wikipedia.org, [Online]. Available: [6]"Window function", En.wikipedia.org, [Online]. Available: [7] J. Proakis and D. Manolakis, Digital signal processing, 4th ed. Harlow, Essex: Pearson, 2014, pp. chapter 4, page [8]"The Fast Fourier Transform Algorithm", YouTube, [Online]. Available:
Spectral modeling of musical sounds
Spectral modeling of musical sounds Xavier Serra Audiovisual Institute, Pompeu Fabra University http://www.iua.upf.es xserra@iua.upf.es 1. Introduction Spectral based analysis/synthesis techniques offer
More informationREAL-TIME DIGITAL SIGNAL PROCESSING
REAL-TIME DIGITAL SIGNAL PROCESSING FUNDAMENTALS, IMPLEMENTATIONS AND APPLICATIONS Third Edition Sen M. Kuo Northern Illinois University, USA Bob H. Lee Ittiam Systems, Inc., USA Wenshun Tian Sonus Networks,
More information2.161 Signal Processing: Continuous and Discrete Fall 2008
MIT OpenCourseWare http://ocw.mit.edu 2.161 Signal Processing: Continuous and Discrete Fall 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms. MASSACHUSETTS
More informationRhythmic constant pitch time stretching for digital audio
Rhythmic constant pitch time stretching for digital audio Brendan TREVORROW ; University of Southern Queensland, Australia ABSTRACT Constant pitch time stretching is not uncommon in audio editing software,
More informationTwo-Dimensional Fourier Processing of Rasterised Audio. Chris Pike
Two-Dimensional Fourier Processing of Rasterised Audio Chris Pike cwp500@york.ac.uk June 13, 2008 Abstract A comprehensive software tool has been developed in Matlab to enable two-dimensional Fourier analysis
More informationSeparation of speech mixture using time-frequency masking implemented on a DSP
Separation of speech mixture using time-frequency masking implemented on a DSP Javier Gaztelumendi and Yoganathan Sivakumar March 13, 2017 1 Introduction This report describes the implementation of a blind
More informationMATLAB Apps for Teaching Digital Speech Processing
MATLAB Apps for Teaching Digital Speech Processing Lawrence Rabiner, Rutgers University Ronald Schafer, Stanford University GUI LITE 2.5 editor written by Maria d Souza and Dan Litvin MATLAB coding support
More informationAn Improvement in Temporal Resolution of Seismic Data Using Logarithmic Time-frequency Transform Method
Iranian Journal of Oil & Gas Science and Technology, Vol. 4 (2015), No. 2, pp. 27-39 http://ijogst.put.ac.ir An Improvement in Temporal Resolution of Seismic Data Using Logarithmic Time-frequency Transform
More informationReconstruction of Images Distorted by Water Waves
Reconstruction of Images Distorted by Water Waves Arturo Donate and Eraldo Ribeiro Computer Vision Group Outline of the talk Introduction Analysis Background Method Experiments Conclusions Future Work
More informationCOS 116 The Computational Universe Laboratory 4: Digital Sound and Music
COS 116 The Computational Universe Laboratory 4: Digital Sound and Music In this lab you will learn about digital representations of sound and music, especially focusing on the role played by frequency
More informationCOS 116 The Computational Universe Laboratory 4: Digital Sound and Music
COS 116 The Computational Universe Laboratory 4: Digital Sound and Music In this lab you will learn about digital representations of sound and music, especially focusing on the role played by frequency
More informationRapid Prototyping System for Teaching Real-Time Digital Signal Processing
IEEE TRANSACTIONS ON EDUCATION, VOL. 43, NO. 1, FEBRUARY 2000 19 Rapid Prototyping System for Teaching Real-Time Digital Signal Processing Woon-Seng Gan, Member, IEEE, Yong-Kim Chong, Wilson Gong, and
More informationChapter 6 Data Acquisition and Spectral Analysis System high-speed digitizer card for acquiring time-domain data. The digitizer is used in
Chapter 6 Data Acquisition and Spectral Analysis System 6.1 Introduction This chapter will discuss the hardware and software involved in developing the data acquisition and spectral analysis system. The
More informationPrinciples of Audio Coding
Principles of Audio Coding Topics today Introduction VOCODERS Psychoacoustics Equal-Loudness Curve Frequency Masking Temporal Masking (CSIT 410) 2 Introduction Speech compression algorithm focuses on exploiting
More informationImage Transformation Techniques Dr. Rajeev Srivastava Dept. of Computer Engineering, ITBHU, Varanasi
Image Transformation Techniques Dr. Rajeev Srivastava Dept. of Computer Engineering, ITBHU, Varanasi 1. Introduction The choice of a particular transform in a given application depends on the amount of
More informationDigital Recording and Playback
Digital Recording and Playback Digital recording is discrete a sound is stored as a set of discrete values that correspond to the amplitude of the analog wave at particular times Source: http://www.cycling74.com/docs/max5/tutorials/msp-tut/mspdigitalaudio.html
More informationA-DAFX: ADAPTIVE DIGITAL AUDIO EFFECTS. Verfaille V., Arfib D.
Proceedings of the COST G-6 Conference on Digital Audio Effects (DAFX-), Limerick, Ireland, December 6-8, A-DAFX: ADAPTIVE DIGITAL AUDIO EFFECTS Verfaille V., Arfib D. CNRS - LMA 3, chemin Joseph Aiguier
More informationSlow Motion Video. MediaTek Technology White Paper
MediaTek Technology White Paper May 2015 Introducing Slow Motion Video MediaTek s Slow Motion Video technology allows the device user to easily capture, replay, and share a fast moving object any time.
More informationChapter X Sampler Instrument
Chapter X Sampler Instrument A sampler is a synthesizer that generates sound by playing recorded sounds rather than calculated waveforms. The wave file player instrument described in an earlier chapter
More informationBluray (
Bluray (http://www.blu-ray.com/faq) MPEG-2 - enhanced for HD, also used for playback of DVDs and HDTV recordings MPEG-4 AVC - part of the MPEG-4 standard also known as H.264 (High Profile and Main Profile)
More informationUSING AUDACITY: ROBUST, FREE, AND FULL- FEATURED SOFTWARE
Slide 1 USING AUDACITY: ROBUST, FREE, AND FULL- FEATURED SOFTWARE This presentation contains audio; ensure your speakers are turned up. Click the forward arrow below to proceed to the next slide. Audacity
More informationDigital Media. Daniel Fuller ITEC 2110
Digital Media Daniel Fuller ITEC 2110 Daily Question: Digital Audio What values contribute to the file size of a digital audio file? Email answer to DFullerDailyQuestion@gmail.com Subject Line: ITEC2110-09
More informationMultimedia Systems Speech II Mahdi Amiri February 2012 Sharif University of Technology
Course Presentation Multimedia Systems Speech II Mahdi Amiri February 2012 Sharif University of Technology Homework Original Sound Speech Quantization Companding parameter (µ) Compander Quantization bit
More informationA Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm
International Journal of Engineering Research and General Science Volume 3, Issue 4, July-August, 15 ISSN 91-2730 A Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm
More informationAdobe Sound Booth Tutorial
Adobe Sound Booth Tutorial Recording your Voice in the Studio 1. Open Adobe Sound Booth 2. Click File>New>Empty Audio File 3. Hit the Record Button (red circle button at the bottom of the screen) 4. In
More informationTopics in Linguistic Theory: Laboratory Phonology Spring 2007
MIT OpenCourseWare http://ocw.mit.edu 24.910 Topics in Linguistic Theory: Laboratory Phonology Spring 2007 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.
More informationPhase Vocoder Implementation with FLWT and TD-PSOLA
Phase Vocoder Implementation with FLWT and TD-PSOLA Terry Kong Stanford University Email: tckong@stanford.edu Abstract The following is a report written for EE 64 s final project. The goal of the project
More informationS4B Ringtone Creator Soft4Boost Help S4B Ringtone Creator www.sorentioapps.com Sorentio Systems, Ltd. All rights reserved Contact Us If you have any comments, suggestions or questions regarding S4B Ringtone
More informationCriterion E: Product Development
Criterion E: Product Development Techniques used to address client s requirements: Pre-Video Setup.pg. 1 Video Layout...pg. 2 Editing Video Clips pg. 3-5 Uploading my Videos to Youtube..pg. 6-7 My Youtube
More informationMahdi Amiri. February Sharif University of Technology
Course Presentation Multimedia Systems Speech II Mahdi Amiri February 2014 Sharif University of Technology Speech Compression Road Map Based on Time Domain analysis Differential Pulse-Code Modulation (DPCM)
More informationDigital Image Processing. Image Enhancement in the Frequency Domain
Digital Image Processing Image Enhancement in the Frequency Domain Topics Frequency Domain Enhancements Fourier Transform Convolution High Pass Filtering in Frequency Domain Low Pass Filtering in Frequency
More informationLaboratory Exercise #5
ECEN4002/5002 Spring 2003 Digital Signal Processing Laboratory Laboratory Exercise #5 Signal Synthesis Introduction Up to this point we have been developing and implementing signal processing algorithms:
More informationMultimedia Systems Speech II Hmid R. Rabiee Mahdi Amiri February 2015 Sharif University of Technology
Course Presentation Multimedia Systems Speech II Hmid R. Rabiee Mahdi Amiri February 25 Sharif University of Technology Speech Compression Road Map Based on Time Domain analysis Differential Pulse-Code
More informationAudio-coding standards
Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.
More informationModule 9 AUDIO CODING. Version 2 ECE IIT, Kharagpur
Module 9 AUDIO CODING Lesson 29 Transform and Filter banks Instructional Objectives At the end of this lesson, the students should be able to: 1. Define the three layers of MPEG-1 audio coding. 2. Define
More informationPHYS 5061 Lab 1: Introduction to LabVIEW
PHYS 5061 Lab 1: Introduction to LabVIEW In this lab, you will work through chapter 1 and 2 of Essick s book to become familiar with using LabVIEW to build simple programs, called VI s in LabVIEW-speak,
More informationTree-based Cluster Weighted Modeling: Towards A Massively Parallel Real- Time Digital Stradivarius
Tree-based Cluster Weighted Modeling: Towards A Massively Parallel Real- Time Digital Stradivarius Edward S. Boyden III e@media.mit.edu Physics and Media Group MIT Media Lab 0 Ames St. Cambridge, MA 039
More informationFFT and Spectrum Analyzer
Objective: FFT and Spectrum Analyzer Understand the frequency domain and some of it's uses Understand the Discrete Fourier Transform Preparation: (i) PC and data acquisition system (ii)fft software Background:
More informationAudio-coding standards
Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.
More informationA Non-Iterative Approach to Frequency Estimation of a Complex Exponential in Noise by Interpolation of Fourier Coefficients
A on-iterative Approach to Frequency Estimation of a Complex Exponential in oise by Interpolation of Fourier Coefficients Shahab Faiz Minhas* School of Electrical and Electronics Engineering University
More informationIntroduction to Podcasting
2011 Introduction to Podcasting Part 2: Recording Your Podcast Adam Read University College Plymouth St Mark & St John 10/Jan/2011 About this guide This is part 2 in the elearning essentials: Introduction
More informationCamtasia Studio 7 User Guide
Camtasia Studio 7 User Guide TechSmith & Camtasia Studio: TechSmith Corporation released popular desktop recording tools like Snagit, Jing, and Camtasia. TechSmith also launched Screencast.com, a screencast
More informationFundamentals of Digital Image Processing
\L\.6 Gw.i Fundamentals of Digital Image Processing A Practical Approach with Examples in Matlab Chris Solomon School of Physical Sciences, University of Kent, Canterbury, UK Toby Breckon School of Engineering,
More informationAudacity tutorial. 1. Look for the Audacity icon on your computer desktop. 2. Open the program. You get the basic screen.
Audacity tutorial What does Audacity do? It helps you record and edit audio files. You can record a speech through a microphone into your computer, into the Audacity program, then fix up the bits that
More information5.2. The Sine Function and the Cosine Function. Investigate A
5.2 The Sine Function and the Cosine Function What do an oceanographer, a stock analyst, an audio engineer, and a musician playing electronic instruments have in common? They all deal with periodic patterns.
More informationNumerical Propagation Manual
Numerical Propagation Manual Numerical Propagation Manual 1 Numerical Propagation Manual Version 1.2 Juan Pablo Piedrahita Quintero, Raúl Andrés Castañeda Quintero and Jorge Iván García Sucerquia Monday,
More information1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends
Contents List of Figures List of Tables Contributing Authors xiii xxi xxiii Introduction Karlheinz Brandenburg and Mark Kahrs xxix 1 Audio quality determination based on perceptual measurement techniques
More information(Refer Slide Time: 01:25)
Computer Architecture Prof. Anshul Kumar Department of Computer Science and Engineering Indian Institute of Technology, Delhi Lecture - 32 Memory Hierarchy: Virtual Memory (contd.) We have discussed virtual
More informationEE482: Digital Signal Processing Applications
Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 13 Audio Signal Processing 14/04/01 http://www.ee.unlv.edu/~b1morris/ee482/
More informationMATH STUDENT BOOK. 12th Grade Unit 4
MATH STUDENT BOOK th Grade Unit Unit GRAPHING AND INVERSE FUNCTIONS MATH 0 GRAPHING AND INVERSE FUNCTIONS INTRODUCTION. GRAPHING 5 GRAPHING AND AMPLITUDE 5 PERIOD AND FREQUENCY VERTICAL AND HORIZONTAL
More informationWorking with Windows Movie Maker
518 442-3608 Working with Windows Movie Maker Windows Movie Maker allows you to make movies and slide shows that can be saved to your computer, put on a CD, uploaded to a Web service (such as YouTube)
More information21M.361 Composing with Computers I (Electronic Music Composition) Spring 2008
MIT OpenCourseWare http://ocw.mit.edu 21M.361 Composing with Computers I (Electronic Music Composition) Spring 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.
More informationModel parametrization strategies for Newton-based acoustic full waveform
Model parametrization strategies for Newton-based acoustic full waveform inversion Amsalu Y. Anagaw, University of Alberta, Edmonton, Canada, aanagaw@ualberta.ca Summary This paper studies the effects
More informationWAVELET USE FOR IMAGE RESTORATION
WAVELET USE FOR IMAGE RESTORATION Jiří PTÁČEK and Aleš PROCHÁZKA 1 Institute of Chemical Technology, Prague Department of Computing and Control Engineering Technicka 5, 166 28 Prague 6, Czech Republic
More informationA building structural-performance monitoring system using RFID tag with sensors
icccbe 21 Nottingham University Press Proceedings of the International Conference on Computing in Civil and Building Engineering W Tizani (Editor) A building structural-performance monitoring system using
More informationSoftware for Analysis, Processing, and Creation of Signals and Related Applications
TM Overview Software for Analysis, Processing, and Creation of Signals and Related Applications Technology is always evolving - isn't it time your data analysis evolved as well? Technology is always evolving
More information(A simplified version of this document is available for applicants who had applied in previous years.)
NYO Canada 2019 Auditions ( en français ) (A simplified version of this document is available for applicants who had applied in previous years.) Digital Audition Process Since 2013 NYO Canada has implemented
More informationDAY 28 - ARITHMETIC SEQUENCES
DAY 28 - ARITHMETIC SEQUENCES ARITHMETIC SEQUENCE An ARITHMETIC SEQUENCE is where the rule of the pattern is always ADDED. The rule is called the COMMON DIFFERENCE ARITHMETIC SEQUENCE You can use the following
More informationExperimental reconstruction of a highly reflecting fiber Bragg grating by using spectral regularization and inverse scattering
3284 J. Opt. Soc. Am. A/ Vol. 24, No. 10/ October 2007 Rosenthal et al. Experimental reconstruction of a highly reflecting fiber Bragg grating by using spectral regularization and inverse scattering Amir
More informationFundamental of Digital Media Design. Introduction to Audio
Fundamental of Digital Media Design Introduction to Audio by Noraniza Samat Faculty of Computer Systems & Software Engineering noraniza@ump.edu.my OER Fundamental of Digital Media Design by Noraniza Samat
More informationMichael Moody School of Pharmacy University of London 29/39 Brunswick Square London WC1N 1AX, U.K.
This material is provided for educational use only. The information in these slides including all data, images and related materials are the property of : Michael Moody School of Pharmacy University of
More informationUSER GUIDE FOR PREDICTION ERROR METHOD OF ADAPTIVE FEEDBACK CANCELLER ON ios PLATFORM FOR HEARING AID APPLICATIONS
Page 1 of 13 USER GUIDE FOR PREDICTION ERROR METHOD OF ADAPTIVE FEEDBACK CANCELLER ON ios PLATFORM FOR HEARING AID APPLICATIONS Parth Mishra, Anshuman Ganguly, Nikhil Shankar STATISTICAL SIGNAL PROCESSING
More informationTIME-FREQUENCY SPECTRA OF MUSIC
TIME-FREQUENCY SPECTRA OF MUSIC James S. Walker Department of Mathematics University of Wisconsin Eau Claire walkerjs@uwec.edu Amanda J. Potts Department of Mathematics University of Nebraska Lincoln Abstract
More informationLesson 12: Sine 5 = 15 3
Lesson 12: Sine How did ou do on that last worksheet? Is finding the opposite side and adjacent side of an angle super-duper eas for ou now? Good, now I can show ou wh I wanted ou to learn that first.
More informationPerceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.
Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general
More informationProject Plan of Simultaneous Co-Test of high performance DAC ADC Pairs
Project Plan of Simultaneous Co-Test of high performance DAC ADC Pairs Iowa State University Senior Design Project 2012-2013 May13-28 Adviser Dr. Degang Chen Members Luke Goetzke Tao Chen Ben Magstadt
More informationImplementation of FFT Processor using Urdhva Tiryakbhyam Sutra of Vedic Mathematics
Implementation of FFT Processor using Urdhva Tiryakbhyam Sutra of Vedic Mathematics Yojana Jadhav 1, A.P. Hatkar 2 PG Student [VLSI & Embedded system], Dept. of ECE, S.V.I.T Engineering College, Chincholi,
More informationFor Mac and iphone. James McCartney Core Audio Engineer. Eric Allamanche Core Audio Engineer
For Mac and iphone James McCartney Core Audio Engineer Eric Allamanche Core Audio Engineer 2 3 James McCartney Core Audio Engineer 4 Topics About audio representation formats Converting audio Processing
More informationBlind one-microphone speech separation: A spectral learning approach
Blind one-microphone speech separation: A spectral learning approach Francis R. Bach Computer Science University of California Berkeley, CA 94720 fbach@cs.berkeley.edu Michael I. Jordan Computer Science
More informationAssistive Technology Training Software Guide
Assistive Technology Training Software Guide 1 Contents 1. Getting Started 1.1 Starting a new project 4 1.2 The Notetaker window layout 5 2. Importing to the Images Pane 2.1 Importing slides (from PowerPoint)
More informationExplicit consistency constraints for STFT spectrograms and their application to phase reconstruction
Explicit consistency constraints for STFT spectrograms and their application to phase reconstruction Jonathan Le Roux, obutaka Ono and Shigeki Sagayama Graduate School of Information Science and Technology,
More informationDesign and implementation of a Beat Detector algorithm
Design and implementation of a Beat Detector algorithm (ETIN80 report) Jaime Gancedo, ja1626ga-s@student.lu.se Sakif Hossain, sa5536ho-s@student.lu.se Wenpeng Song, we3543so-s@student.lu.se March 11, 2018
More informationDSP-CIS. Part-IV : Filter Banks & Subband Systems. Chapter-10 : Filter Bank Preliminaries. Marc Moonen
DSP-CIS Part-IV Filter Banks & Subband Systems Chapter-0 Filter Bank Preliminaries Marc Moonen Dept. E.E./ESAT-STADIUS, KU Leuven marc.moonen@esat.kuleuven.be www.esat.kuleuven.be/stadius/ Part-III Filter
More information19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007
19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 SUBJECTIVE AND OBJECTIVE QUALITY EVALUATION FOR AUDIO WATERMARKING BASED ON SINUSOIDAL AMPLITUDE MODULATION PACS: 43.10.Pr, 43.60.Ek
More informationAn Audio Fingerprinting System for Live Version Identification using Image Processing Techniques
An Audio Fingerprinting System for Live Version Identification using Image Processing Techniques (Dr.?) Zafar Rafii Northwestern University EECS department Acknowledgments Work done during internship at
More information6 Using Noise Reduction Filters
6 Using Noise Reduction Filters You can add new life to old or low quality audio recordings using Adobe Audition s enhancement and restoration effects. Distracting hiss or background noise can easily be
More informationAUDACITY DIGITAL AUDIO SOFTWARE
AUDACITY DIGITAL AUDIO SOFTWARE TABLE OF CONTENTS (Select a link below to jump to that section of the document) Importing Music Into Audacity... 2 Where To Get Sound Files... 2 Change The Volume of Sound
More informationThe MPC Renaissance & MPC Studio Bible - Demo Tutorial (For MPC Software 2.x)
The MPC Renaissance & MPC Studio Bible - Demo Tutorial (For MPC Software 2.x) This free tutorial is taken from MPC-Tutor s tutorial book The MPC Renaissance & MPC Studio Bible (MPC Software 2.1 Edition).
More informationAudacity: How- To. Import audio (a song or SFX) Before we start. Import song into Audacity
Audacity: How- To music technology training Import audio (a song or SFX) Before we start You can t import a song into Audacity directly from a CD. You need to rip the required track from the CD using a
More informationFrom Fourier Transform to Wavelets
From Fourier Transform to Wavelets Otto Seppälä April . TRANSFORMS.. BASIS FUNCTIONS... SOME POSSIBLE BASIS FUNCTION CONDITIONS... Orthogonality... Redundancy...3. Compact support.. FOURIER TRANSFORMS
More informationDigital Fundamentals
Digital Fundamentals Tenth Edition Floyd Chapter 1 Modified by Yuttapong Jiraraksopakun Floyd, Digital Fundamentals, 10 th 2008 Pearson Education ENE, KMUTT ed 2009 Analog Quantities Most natural quantities
More informationAudacity Tutorial C. Stanley
Audacity Tutorial C. Stanley Getting to Know Audacity: Silence Keys Microphone Select Editing Tools Recording Tools Cut, Copy, Paste Undo, Redo Zoom How to Record: Select external microphone. Press the
More informationSales Manual Part II
Sales Manual Part II In this sales manual, you ll be able to show how to make a song and create a WAV file of the song. Table of Contents Page 1. Main Features of the Sequencer 2 2. How to Demo the Sequencer
More informationBoth LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.
Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general
More informationChecklist. ORB Education Quality Teaching Resources. ORB Education Visit for the full, editable versions.
Checklist 1. Data in our Lives 2. Representing Data 3. Working with Data 4. Introducing Spreadsheets 5. Basic Excel Skills 6. Google Sheets 7. Sorting and Filtering 8. Formulas 9. Plots and Trendlines
More informationSurface Wave Suppression with Joint S Transform and TT Transform
Available online at www.sciencedirect.com Procedia Earth and Planetary Science 3 ( 011 ) 46 5 011 Xian International Conference on Fine Geological Exploration and Groundwater & Gas Hazards Control in Coal
More informationRepeating Segment Detection in Songs using Audio Fingerprint Matching
Repeating Segment Detection in Songs using Audio Fingerprint Matching Regunathan Radhakrishnan and Wenyu Jiang Dolby Laboratories Inc, San Francisco, USA E-mail: regu.r@dolby.com Institute for Infocomm
More informationDigital Signal Processing Lecture Notes 22 November 2010
Digital Signal Processing Lecture otes 22 ovember 2 Topics: Discrete Cosine Transform FFT Linear and Circular Convolution Rate Conversion Includes review of Fourier transforms, properties of Fourier transforms,
More informationHow to change clip speed and direction
Adobe Premiere Pro CS6 Project 6 guide How to change clip speed and direction Editors frequently want to change clip speed or direction. With Adobe Premiere Pro CS6, you can edit a video clip so a sprinter
More informationFilterbanks and transforms
Filterbanks and transforms Sources: Zölzer, Digital audio signal processing, Wiley & Sons. Saramäki, Multirate signal processing, TUT course. Filterbanks! Introduction! Critical sampling, half-band filter!
More informationHand Controlled Audio Synthesizer
Alex Sanchez & Behram Mistree 6.111 Final Project Proposal 11.3.2006 Coordinate TA: Javy Hand Controlled Audio Synthesizer 1. Proposed Functionality We propose to build a device that incorporates a video
More informationA New Technique of Extraction of Edge Detection Using Digital Image Processing
International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) A New Technique of Extraction of Edge Detection Using Digital Image Processing Balaji S.C.K 1 1, Asst Professor S.V.I.T Abstract:
More informationPlane Wave Imaging Using Phased Array Arno Volker 1
11th European Conference on Non-Destructive Testing (ECNDT 2014), October 6-10, 2014, Prague, Czech Republic More Info at Open Access Database www.ndt.net/?id=16409 Plane Wave Imaging Using Phased Array
More informationCALCULATING SNELL S LAW
CALCULATING SNELL S LAW What is Snell s Law? Snell s Law defines angular relationships of sound waves crossing an interface. The calculations for Snell s Law will come into play when we are conducting
More informationAnalysis of Radix- SDF Pipeline FFT Architecture in VLSI Using Chip Scope
Analysis of Radix- SDF Pipeline FFT Architecture in VLSI Using Chip Scope G. Mohana Durga 1, D.V.R. Mohan 2 1 M.Tech Student, 2 Professor, Department of ECE, SRKR Engineering College, Bhimavaram, Andhra
More informationImplementing a Speech Recognition System on a GPU using CUDA. Presented by Omid Talakoub Astrid Yi
Implementing a Speech Recognition System on a GPU using CUDA Presented by Omid Talakoub Astrid Yi Outline Background Motivation Speech recognition algorithm Implementation steps GPU implementation strategies
More informationHow to edit audio tracks
How to edit audio tracks Although at times you will use Adobe Audition to record new audio, you will often use it to edit, clean up, or add effects to an existing audio file such as an audio file recorded
More informationFAST AND ACCURATE TRANSIENT ULTRASOUND PROPAGATION AND B-MODE IMAGING SIMULATION METHODS. Yi Zhu
FAST AND ACCURATE TRANSIENT ULTRASOUND PROPAGATION AND B-MODE IMAGING SIMULATION METHODS By Yi Zhu A THESIS Submitted to Michigan State University in partial fulfillment of the requirements for the degree
More informationAssignment 3: Edge Detection
Assignment 3: Edge Detection - EE Affiliate I. INTRODUCTION This assignment looks at different techniques of detecting edges in an image. Edge detection is a fundamental tool in computer vision to analyse
More informationHow to Change the Default Playback & Recording Audio Device. How to Change the Default Playback Device
How to Change the Default Playback & Recording Audio Device Sound is a very important part of our computing experience. We listen to music, do voice chat, watch movies, play games, record sound, etc. In
More information