Wavetable Matching of Pitched Inharmonic Instrument Tones

Size: px
Start display at page:

Download "Wavetable Matching of Pitched Inharmonic Instrument Tones"

Transcription

1 Wavetable Matching of Pitched Inharmonic Instrument Tones Clifford So and Andrew Horner Department of Computer Science Hong Kong University of Science and Technology Clear Water Bay, Kowloon, Hong Kong and Abstract Wavetable matching is the process of finding the parameters needed to resynthesize a musical instrument tone using wavetable synthesis. The most important parameters are the wavetable basis spectra. Previous wavetable matching has often assumed the original tone was harmonic or nearly harmonic. This assumption does not hold for instruments such as the plucked strings. A semi-automated process has recently been proposed to group the partials based on their normalized frequency deviations, and then match each group. However, the user must adjust the group size manually to find the best match. The method is also inefficient at matching harmonic instruments. This paper introduces a new method for hierarchically grouping partials with similar normalized frequency deviations. Ordinary wavetable matching then finds the basis spectra for individual groups. The new method is fully automatic and requires no prior knowledge about the inharmonicity of the tone. Results for 8 instrument tones show the new method improves the perceived match on harmonic tones and pitched inharmonic tones compared to ordinary wavetable matching. 0. Introduction The term wavetable synthesis is currently often used synonymously with sampling synthesis in the music industry. We will use the classical computer music meaning of wavetable synthesis throughout this paper, where an oscillator table is loaded with

2 several partials and indexed depending on the fundamental frequency. Several weighted wavetables can be summed to produce spectral changes in multiple wavetable synthesis. When the partials are disjoint (i.e. each partial only appears in only one of the wavetables), this is called group additive synthesis []-[3]. Recent work has shown how to match wavetable parameters for resynthesizing instrument tones [4]-[7]. One of the most successful of these methods, wavetable matching, picks spectral snapshots from the original tone to determine the wavetables (additive synthesis of the spectral snapshots loads the wavetables). This approach is intuitive, and guarantees an exact match at the timepoints of the selected spectral snapshots and excellent matches at neighboring points as well. Figure shows the multiple wavetable block diagram, where the output of each wavetable is scaled by an amplitude envelope and summed into the output signal. Wavetable matching occurs in the frequency domain and determines the spectra of the wavetables, called the basis spectra. Previous work on wavetable matching has used genetic algorithm (GA) optimization to determine the best time points to select the basis spectra. GAs are combinatorial optimization techniques modeled on the idea of evolving solutions through processes analogous to natural selection and breeding [8][9]. These evolutionary processes guide the parameter search using populations of potential solutions. To judge the fitness of a particular combination of basis spectra, the fitness function uses a least-squares solution to find the amplitude envelopes of each basis spectra, and an average relative amplitude error (RAE) to measure the spectral difference between the original and matched tones: RAE = N frames N frames! j = N hars! k = ( b N hars! k = # b" 2 k, j k, j ) b 2 k, j () where the N frames is the number of analysis frames, N hars is the number of partials, b k!, j is the kth partial amplitude of the synthesized tone at the jth analysis frame, and b, is the k j original tone s amplitude on the same partial and analysis frame. A relative amplitude 2

3 error of zero indicates an exact match, while an error of 0. indicates an average relative difference of 0%. Equation is not a perfect perceptual error measure, but it is reasonably accurate and computationally inexpensive. The GA uses the relative amplitude error as the fitness value of the candidate basis spectra, mixing and matching basis spectra combinations from the best individuals in the population, returning the best set of basis spectra as the solution. The cost of computing the relative amplitude error for each set of basis spectra can be held down by limiting the match points to a small number of representative test spectra (usually about 20). However, wavetable synthesis makes a basic assumption: the partials are harmonics, i.e., they have frequencies which are at integer multiples of the fundamental frequency ( f k = kf). This causes problems when modeling inharmonic instruments, such as the plucked strings. Wavetable matching generates a dynamic spectrum as close to the original as possible. However, it does not take into account the tone s inharmonicity (the amount of frequency deviation between the partials and integer multiples of the fundamental frequency). The degree of inharmonicity is dependent on the instrument [0][]. Figure 2 shows the normalized frequency deviations (NFD) for four acoustic instruments based on McAulay- Quatieri analysis [2]. We define the NFD for partial k as the following: NFD k = N frames! j= b ' f k, j % j ( & kf k, N frames! j= b k, j $ " # (2) where F is the fixed fundamental frequency (specified by the user prior to spectrum analysis), and f k, j is the actual time-varying frequency of partial k at analysis frame j. A value of 0.0 represents a +% frequency deviation. The equation weights the frequency deviation by the relative amplitude of each frame (i.e., high amplitude frames contribute more to the overall deviation than low amplitude frames). Figure 2 shows that the 3

4 plucked yangqin and zheng tones have larger and more variable frequency deviations than the trumpet and saxophone tones. Lee and Horner [3] introduced a group additive synthesis method for matching piano tones with stretched partials. Piano tones have simpler a inharmonicity than plucked string tones, because the amount of average frequency deviation increases in a fairly continuous way from partial to partial on the piano. This is unlike the zheng tone, where a partial may be very flat and its neighbor very sharp (see Figure 2). Such deviations require a special model to represent them accurately. So and Horner [4] recently introduced an inharmonic wavetable matching technique that classifies partials with similar normalized frequency deviations. The method then uses ordinary wavetable matching for each class. However, the method is only semi-automatic and requires user intervention to determine the class size. The user inputs the value by trial-and-error seeking the best match. The method is also inefficient for matching harmonic tones, using much more wavetables than ordinary wavetable matching without any significant improvement. This paper introduces a new wavetable matching technique that adapts wavetable matching to both harmonic and pitched inharmonic tones without prior knowledge about the inharmonicity of the tone. The new method uses a hierarchical grouping technique. Section describes the new method, Section 2 gives results for 8 instrument tones, and Section 3 concludes the paper.. Wavetable Matching of Pitched Inharmonic Tones A new version of wavetable matching is presented in this section. The idea is to hierarchically group the original tone s partials based on their normalized frequency deviation (NFD), and then determine the basis spectra and amplitude envelopes of individual groups using ordinary GA wavetable matching. The user specifies the number of wavetables (N tabs ), and the algorithm minimizes the relative amplitude error (RAE) and 4

5 normalized frequency deviation error (NFDE). The procedure is fully automatic and requires no intervention from the user about the inharmonicity of the tone, compared to the previous version of the algorithm [4]. Figure 3 outlines the method. First, a short-time Fourier analysis such as the phase vocoder [5][6] transforms the tone from the time domain to the frequency domain. If the tones has severe pitch changes we apply McAulay-Quatieri [2] analysis. Beauchamp [7] gives more details about these procedures. Next, a group decomposition tree of the partials is formed according to the normalized frequency deviation (NFD k ). Then, wavetables are distributed among the groups, forming a wavetable allocation tree. Ordinary GA-wavetable matching is applied to each group on each level of the tree. The result is a set of basis spectra and amplitude envelopes, along with their frequency error (NFDE) and amplitude error (RAE) for each group. The final parameters are selected from the level with the lowest overall frequency and amplitude error. Section. discusses the hierarchical grouping method for the partials. Section.2 shows how to distribute the wavetables among the various groups. Section.3 shows how to find the level with the lowest overall error, and Section.4 describes how to resynthesize the signal.. Hierarchical Grouping of the Partials A good grouping algorithm is critical to the overall procedure, because the individual frequency deviations of the partials will be replaced with a single group frequency deviation. Therefore, partials with similar deviations should be grouped together. The goodness of a grouping can be measured by its normalized frequency deviation error (NFDE), which is defined as: NFDE = N! groups i= NFDE 2 i = N groups!!( NFDk # GNFDi ) i= k" group i 2 (3) 5

6 where N groups is the total number of groups, NFDE i is the normalized frequency deviation error of group i, and GNFD i is the average NFD k for group i, which is defined as GNFD i = NFD k" group = i H i! k" group NFD i k (4) where H i is the number of partials in group i. The lower the NFDE, the better the grouping. We propose a hierarchical grouping algorithm that minimizes the NFDE for a given number of groups N groups. The steps are as follows: i) Let array T be a set of ordered pairs where the first number represents the partial number, and the second its frequency deviation NFD k, where k N hars. Sort T by the frequency deviation values in ascending order (i.e., the flatest deviation will be first in T, and the sharpest last). ii) Construct a set of edges E that connect all the adjacent ordered pairs of T, where e x = (t x, t x+ ) for x < N hars. Set the number of current existing groups c =. iii) Connected ordered pairs are in the same group. Pick the group m with the largest NFDE m. iv) Split group m into left and right groups by breaking one edge. Test all the edges in group m to find which edge gives the largest NFDE m reduction after its removal. That is, find the edge in group m that gives the largest difference: NFDE! ( NFDE + NFDE ). m left right v) c = c +. Stop the algorithm if N groups = c, otherwise go to step iii and continue. An example run of the algorithm is shown in Figures 4, 5, and 6 for a hypothetical instrument with ten partials. Figure 4 shows amplitude versus normalized frequency deviation for each partial. Figure 5 shows the sorted partials along the x-axis and the connected edges between them. The amplitude information is not shown in the graph, as it is unnecessary in the grouping algorithm (though it will be used in the wavetable allocation algorithm). Each step in Figure 5 shows the successive removal of edges: e 4 is removed first, e 7 is removed second, and so on. Figure 6 shows the corresponding 6

7 hierarchical grouping tree with the relative amplitude () of each group. When N groups =, no edge is removed and the partials form a single group {, 8, 2, 5, 9, 0, 3, 6, 7, 4}, and thus = 00%. When N groups = 2, removing e 4 results in the largest frequency error reduction and the groups {, 8, 2, 5} ( = 53.7%) and {9, 0, 3, 6, 7, 4} ( = 46.3%). When N groups = 3, removing e 7 results in the largest error reduction and breaks the second group into {9, 0, 3} ( = 2%) and {6, 7, 4} ( = 25.3%), and so on. At the bottom of the tree, with N groups = N hars = 0, no edge remains and every partial is in a group by itself..2 Allocating the Wavetables After forming the group decomposition tree, ordinary GA wavetable matching on each group in the tree determines the basis spectra and amplitude envelopes. But how to best distribute the wavetables among groups, given the total number of wavetables (N tabs )? We set the number of wavetables in group i (#wt i ) to be proportional to the relative amplitude ( i ) of group i, that is # wt! N (5) i i tabs where the relative amplitude ( i ) comes from the group decomposition tree. The number of group wavetables #wt i is rounded to the nearest integer. Equation 5 distributes the wavetables according to the relative amplitude of each group. Proportional allocation of the wavetables insures that groups with higher relative amplitudes have higher matching accuracy, helping to reduce the overall relative amplitude error (RAE) in Equation. Figure 7 shows the wavetable allocation tree for the example from Figures 4-6 with N tabs = 5. Since the maximum number of wavetables is five in this case, the wavetable allocation tree contains only five levels (with five groups in the leaf level). The single group at the root includes all five wavetables, whereas each leaf will contain only one wavetable. The groups in intermediate levels will be allocated a proportional number of wavetables. We also restrict #wt i, such that 7

8 #wt i i (6) The number of wavetables allocated in group i must be smaller than or equal to the number of partials in group i. It is unnecessary to have more wavetables than partials in a group since additive synthesis yields an exact match (a "perfect group"). Such an example occurs in the leftmost group of the forth level (N groups = 4) of the wavetable allocation tree in Figure 7. According to Equation 5, the leftmost group ( = 37.7%) has the highest relative amplitude and would contain two wavetables, while the second strongest group ( = 25.3%) would contain one wavetable. However, there is only one partial in the 37.7% group, and one wavetable is enough to perfectly resynthesize this group. The surplus wavetable is passed to the second strongest group (which contains three partials). If the second strongest group already has enough wavetables, the surplus wavetable is passed to the next strongest candidate, and so on. Each group in the wavetable allocation tree (i.e., each box in Figure 7) is then passed to a genetic algorithm for optimization of the wavetables and amplitude envelopes as shown in Figure 8. This yields the amplitude error (RAE) and frequency error (NFDE) of the group. Identical groups from different levels only require one optimization. The total amplitude error (RAE) and frequency error (NFDE) can then be calculated for each level of the tree using all the partials..3 Selecting the Best Grouping After obtaining the amplitude error (RAE) and frequency error (NFDE) for each level, we need to determine which level gives the best grouping. To do this, we plot RAE versus NFDE for all levels of the tree. Figure 9 shows a plot of the example in Figure 7. The curve starts at the lower-right corner for the root and extends to the upper-left corner of the leaf level. In this example, there are five points corresponding to the five levels of the tree. The number of points in the tree is always equal to N tabs. We need to find the point that gives the lowest amplitude and frequency error. However, the point with the lowest amplitude error may not be the point with the lowest frequency 8

9 error. The frequency error always decreases with more groups, but the amplitude error may not (in the example in Figure 9 the amplitude error decreases up to three groups, and increases with more groups). One method of reconciling these two criteria is to select the point closest to the origin. Measuring the normalized Euclidian distance to the origin can do this. We term this distance the matching quality (MAQ) of a point, and defined it as: 2 2 & NFDE # & RAE # MAQ = $! + $! (7) % maxnfde " % maxrae " where the maxnfde and maxrae are the maximum NFDE and RAE of the line. A normalized distance is preferable to an absolute distance because the NFDE and RAE are on different scales. In the example in Figure 9, the values of NFDE are ten times smaller than RAE. That means the RAE will dominate the MAQ if the absolute values are used. For the example in Figure 9, the third level point (N groups = 3) is selected as the best grouping since it has the lowest MAQ (it is a little better than the fourth level point)..4 Resynthesizing the Signal To resynthesize the signal, we synthesize each group x i (t) using the ordinary multiple wavetable synthesis and sum all the group signals to produce the final synthetic signal x(t). To synthesize the group signal x i (t), every wavetable in group i uses a common timevarying fundamental frequency g i (t) defined as: ' & b # $ k, t NFDk ( t)! k( Groupi g = ( + ) = $ +! i ( t) F vi ( t) F (8) $! ' bk, t % k( Groupi " where F is the fixed fundamental frequency (specified by the user prior to spectrum analysis), and v i (t) is the time-varying NFD of group i. v i (t) is equal to the weighted-nfd of the group's partials. The weighting ensures that the resulting NFD will not be adversely affected by low amplitude partials, which are often noisy (the same situation as 9

10 in Equation 2). We do not use the GNFD i defined in Equation 4 to calculate g i (t), because GNFD i is only a single value rather than a time-varying value. We synthesize the group signal x i (t) using multiple wavetable synthesis. # wt i! x ( t) = w ( t) y ( t (9) i j= i, j i, j ) where y i,j (t) is the time-varying j-th wavetable for group i, w i,j (t) is the amplitude envelope of the j-th wavetable for group i, and #wt i is the number of wavetables in group i. Finally, we sum the signals of each group x i (t) to produce the resynthesized signal x(t). N groups! x( t) = x i ( t) (0) i= 2. Results This section compares the results of the new method to the original wavetable matching method. Eighteen instrument tones have been tested and are categorized as either harmonic tones (overall inharmonicity %) or pitched inharmonic tones (overall inharmonicity > %). We define the inharmonicity of a tone as its normalized frequency deviation error (NFDE) when N tabs =. Seven of the eighteen instruments are harmonic. They are mainly the woodwinds, brass and bowed strings (saxophone, shakuhachi, trumpet, violin, erhu, guitar, and piano). The remaining eleven instruments are inharmonic. They are mostly plucked string and pitched percussion (pipa, liuqin, zheng, qin, yangqin, glockenspiel, bell, qing, pitched gong, ling, and cello). Non-pitched instruments, such as the cymbal, are not included because they are unsuitable for wavetable matching, and Section 2. shows the harmonic results. Section 2.2 shows the inharmonic results. Section 2.3 shows the listening test results. 0

11 2. Harmonic Results We judge the quality of the matched tone by its amplitude error (RAE) and frequency error (NFDE). Figures 0 and show the amplitude error results. The tones are listed from left to right in order of decreasing inharmonicity. For each tone in the original method, the amplitude error decreases exponentially with an increasing number of wavetables (see Figure 0). In the new method, however, the amplitude error sometimes drops more rapidly than in the original wavetable method (see Figure ). The most obvious example is the guitar, which drops to nearly zero when N tabs = 5. Another example is the shakuhachi. When N tabs is large, there are more perfect groups. Recall that if #wt i = i (the number of wavetables equals the number of partials in group i), additive synthesis generates a perfect match. With more perfect groups, the overall amplitude error drops faster. The dropping rate actually depends on the relative amplitudes of the perfect groups. Stronger perfect groups reduce the error faster (as in the guitar and shakuhachi). Figures 2 and 3 show the frequency error results. The tones are listed in the same order. For every tone in the original method, the NFDE must stay the same because there are no groups (see Figure 2). On the other hand, the new method improves the NFDE with increasing N tabs (see Figure 3). Groups allow better NFD resolution. All the frequency errors shown in Figures 2 and 3 are below %, indicating that all the tones are near-harmonic and the original method is a good enough model. Nevertheless, the new method further improves the frequency error, although the improvement may not be perceptible. To summarize the harmonic tones results, the new method slightly improves on the original wavetable method for the same amount of resources. 2.2 Inharmonic Results Figures 4 and 5 show the amplitude errors (RAE) for each tone. The tones are listed from left to right by decreasing inharmonicity. The amplitude error of the original

12 wavetable method drops exponentially in Figure 4. It reaches zero for the qing, bell and ling when N tabs = 2 = N hars (it is the same as additive synthesis). On the other hand, the amplitude error of the new method drops significantly faster in Figure 5. The bell error drops the fastest, and approaches zero when N tabs = 6. The pipa error drops the slowest, and is similar to the original method. The bell has more perfect groups and they are stronger than in the other instruments. The pipa contains perfect groups, but they are all weak. Figures 6 and 7 show the resulting frequency errors (NFDE) for each tone. The tones are listed in the same order from left to right. Figure 6 shows that the frequency errors of all the tones stay the same with the original wavetable method. This is because the original wavetable method is unable to divide the partials into groups to give better NFD resolution. On the other hand, the new method lowers the overall frequency error (see Figure 7). 2.3 Listening Test Results To further evaluate our new method versus the original method, we carry out a listening test. The qin, yangqin and violin tones are matched with 7 wavetables in our test with to 7 number of groups according to the hierarchical grouping method in Section.. 3 subjects are given the 7 synthesized tones and the real tone (filtered to 3 partials) and asked to vote for the best 2 tones out of the 7 tones that sound closest to the real tone. They can also vote for the same tone twice if it sounds distinguishably the best. They are given a high quality headphone for the test and they can listen to the tones as many times as they want for the judgement. Figure 8, 9 and 20 show the vote results for the qin, yangqin and violin respectively. Note that the group indicates the original method and the circled group indicates the error-balanced group indicated from our MAQ function. The number of votes generally conform to our MAQ function. The subjects highly prefer the tones near the MAQ group. While nearly no votes are given to the number of group. That shows a clear quality 2

13 improvement over the original method. Another interesting result is that the votes are so concentrated in the qin (Figure 8) but spread across in the violin (Figure 20). Feedbacks from the subjects are that the qin tones are very easy to hear the difference but all the violin tones sound very close. That conforms to the fact that the violin tone is a nearlyharmonic tone with inharmonicity 0.29%, further improvement to it is more difficult to be heard, contrasting to the qin having initial frequency error 5.8% be reduced to 0.7%. 3. Conclusion The original wavetable matching method cannot effectively simulate pitched inharmonic tones such as the plucked strings. To remedy this problem, a new technique has been introduced to adapt the original method to pitched inharmonic tones. The new method hierarchically separates the partials into groups based on their normalized frequency deviations. Wavetables are then allocated to each group and ordinary wavetable matching is performed on each group. The final parameters are those with the lowest combined amplitude and frequency errors. The new method requires no user intervention, unlike the previous version of the algorithm [4], and is fully automatic. Eighteen instrument tones from the harmonic and pitched inharmonic families were tested. The results show that the new method is as good as the original method for harmonic tones, and better for inharmonic tones. The harmonic matches require fewer wavetables than inharmonic matches which require more wavetables to resolve the frequency error. The new method has its limits, and cannot handle non-pitched instruments such as the cymbal or environmental sounds. Their partials are noisy and unstable, and thus unsuitable for wavetable matching. The new method is well-suited to existing music synthesis hardware running multiple wavetable synthesis in realtime. The new method expands the range of the instruments 3

14 possible in wavetable synthesis from wind and bowed string instruments, to also include plucked string and pitched percussion instruments (which are usually modelled by PCM sampling). The optimal parameters selected by the new method are small in size, and efficiently stored. 4

15 Bibliography [] P. Kleczkowski, Group Additive Synthesis, Computer Music Journal, vol. 3, no., pp (989). [2] N.-M. Cheung and A. B. Horner, Group Synthesis with Genetic Algorithms, Journal of Audio Engineering Society, vol. 44, pp (996 Mar.). [3] S. Oates and B. Eaglestone, Analytical Methods for Group Synthesis, Computer Music Journal, vol. 2, no. 2, pp (997). [4] M.-H. Serra, D. Rubine and R. Dannenberg, Analysis and Synthesis of Tones by Spectral Interpolation, Journal of Audio Engineering Society, vol. 38, pp. -28 (990 Ma.). [5] A. B. Horner, J. Beauchamp, and L. Haken, Methods for Multiple Wavetable Synthesis of Musical Instrument Tones, Journal of Audio Engineering Society, vol. 4, pp (993 May). [6] A. B. Horner and J. Beauchamp, Synthesis of Trumpet Tones Using a Wavetable and Dynamic Filter, Journal of Audio Engineering Society, vol. 43, pp (995 Oct.). [7] A. B. Horner, Computation and Memory Tradeoffs with Multiple Wavetable Interpolation, Journal of Audio Engineering Society, vol. 44, pp (996 June). [8] J. H. Holland, Adaptation in Natural and Artificial Systems. Ann Arbor: The University of Michigan Press. [9] D. Goldberg, Genetic Algorithms in Search, Optimization, and Machine Learning, Reading, MA: Addison-Wesley. [0] S. Dubnov and N. Tishby, Influence of Frequency Modulating Jitter on Higher Order Moments of Sound Residual with Applications to Synthesis and Classification, Proceedings of the 996 ICMC, pp (996). [] J. C. Brown, Frequency Rations of Spectral Components of Musical Sounds, Journal of Acoustical Society of America, vol. 99, pp (996). [2] R. McAulay and T. Quatieri, Speech Analysis/Synthesis based on a Sinusoidal Representation, IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-34, pp (986). 5

16 [3] K. Lee and A. B. Horner, Modeling Piano Tones with Group Synthesis, Journal of the Audio Engineering Society, vol. 47, pp. 0- (999 Mar.). [4] C. So and A. B. Horner, Wavetable Matching of Inharmonic String Tones, Journal of the Audio Engineering Society, vol. 50, pp (2002 Jan./Feb.). [5] J. B. Allen, Short Term Spectral Analysis, Synthesis, and Modification by Discrete Fourier transform, IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-34, pp (986). [6] M. Dolson, The Phase Vocoder: A Tutorial, Computer Music Journal, vol. 0, no. 4, pp (986). [7] J. Beauchamp, Unix Workstation Software for Analysis, Graphics, Modification, and Synthesis of Musical Sounds, presented at the 94th Convention of the Audio Engineering Society, Journal of the Audio Engineering Society (Abstracts), vol. 4, preprint no. 3479, pp. 387 (993 May). 6

17 List of Figures Figure. Figure 2. Figure 3. Figure 4. Figure 5. Figure 6. Figure 7. Figure 8. Figure 9. Multiple wavetable synthesis block diagram. Normalized frequency deviations of partials of the saxophone (F#4), trumpet (F4), zheng (C#4), and yangqin (C4). Wavetable matching algorithm for pitched inharmonic tones. Normalized frequency deviations for a hypothetical instrument with ten partials. The graph shows amplitude versus normalized frequency deviation (NFD) for each partial (the labels indicate the partial numbers). Increasing the number of groups by the successive removal of edges (for the same example shown in Figure 4). The partials are sorted by their NFD k and labelled on the x-axis. Connected edges are part of the same group. The process stops when all edges are removed, in this case, where N groups = N hars = 0. The first five levels of the hierarchical grouping tree (for the example shown in Figures 4 and 5). Each group is represented by a box with the number of partials () and the relative amplitude () of the group. One group is split at each descending level. The wavetable allocation tree for the example from Figures 4-6 with total number of wavetables (N tabs ) = 5. Each group is represented by a box with the number of partials (), relative amplitude () and the number of allocated wavetables (#wt). Genetic algorithm optimization of the wavetable basis spectra and amplitude envelopes for a group with three partials. The number of wavetables (#wt = 2) is set in the wavetable allocated tree. RAE versus NFDE for the example in Figure 7. maxnfde and maxrae are.0 and.05 respectively. The third level (N groups = 3) gives the best overall matching quality (MAQ). Figure 0. Original wavetable matching method: The plot of instrument inharmonicity versus amplitude error (RAE) on seven near-harmonic tones. Figure. New method: The plot of instrument inharmonicity versus amplitude error (RAE) on seven near-harmonic tones. Figure 2. Original wavetable matching method: The plot of instrument inharmonicity versus frequency error (NFDE) on seven near-harmonic tones. 7

18 Figure 3. New method: The plot of instrument inharmonicity versus frequency error (NFDE) on seven near-harmonic tones. Figure 4. Original wavetable matching method: The plot of instrument inharmonicity versus amplitude error (RAE) on eleven inharmonic tones. Figure 5. New method: The plot of instrument inharmonicity versus amplitude error (RAE) on eleven inharmonic tones. Figure 6. Original wavetable matching method: The plot of instrument inharmonicity versus frequency error (NFDE) on eleven inharmonic tones. Figure 7. New method: The plot of instrument inharmonicity versus frequency error (NFDE) on the eleven inharmonic tones. Figure 8. Number of votes by the subjects for the qin tone with 7-wavetable match from to 7 groups. The 6-groups is selected as the best error-balanced grouping from our MAQ function. Figure 9. Number of votes by the subjects for the yangqin tone with 7-wavetable match from to 7 groups. The 5-groups is selected as the best error-balanced grouping from our MAQ function. Figure 20. Number of votes by the subjects for the violin tone with 7-wavetable match from to 7 groups. The 5-groups is selected as the best error-balanced grouping from our MAQ function. 8

19 f ( ) t.... w ( ) w ( ) t! 2 t w N (t)....!! + x (t) w i (t) amplitude envelope of ith table at time t f ( t ) fundamental frequency at time t x (t) sample output at time t N number of wavetables Figure. Multiple wavetable synthesis block diagram. 9

20 2% Normalized Frequency Deviation (NFD) % 0% -% -2% -3% -4% -5% -6% -7% % Partials Saxophone Trumpet Zheng Yanqin Figure 2. Normalized frequency deviations of partials of the saxophone (F#4), trumpet (F4), zheng (C#4), and yangqin (C4). 20

21 Original Tone Short-time Spectrum Analysis Time-varying frequency spectra Hierarchical grouping of partials based on their NFD k Group decomposition tree Wavetable allocation based on N tabs Wavetable allocation tree For each level in the tree Ordinary GA-based wavetable matching Frequency Error (NFDE) and litude Error (RAE) No Processed all the levels? Yes The parameters with the lowest NFDE and RAE Figure 3. Wavetable matching algorithm for pitched inharmonic tones. 2

22 litude % -8% -6% -4% -2% 0% 2% 4% 6% 8% 0 Normalized Frequency Deviation (NFD) Figure 4. Normalized frequency deviations for a hypothetical instrument with ten partials. The graph shows amplitude versus normalized frequency deviation (NFD) for each partial (the labels indicate the partial numbers). 22

23 NFD k NFD k NFD k e e 3 e 4 e 5 e 6 e 7 e 8 e 9 8 e % -8% -6% -4% -2% 0% 2% 4% 6% 8% e e 2 e 3 e 5 e 6 e 7 e 8 e % -8% -6% -4% -2% 0% 2% 4% 6% 8% e % -8% -6% -4% -2% 0% 2% 4% 6% 8% 3 e e 3 e 5 e 6 e 8 e N groups = N groups = 2 N groups = 3 NFD k e N groups = 9-0% -8% -6% -4% -2% 0% 2% 4% 6% 8% NFD k N groups = N hars = 0-0% -8% -6% -4% -2% 0% 2% 4% 6% 8% Figure 5. Increasing the number of groups by the successive removal of edges (for the same example shown in Figure 4). The partials are sorted by their NFD k and labelled on the x-axis. Connected edges are part of the same group. The process stops when all edges are removed, in this case, where N groups = N hars = 0. 23

24 0 00% N groups = p 53.7% % N groups = 2 p 4 3 p53.7% p 3 3 p2.0% p N groups = % 37.7% p6.0% p2.0% p25.3% N groups = p p37.7% p6.0% p0.4% p20.6% p25.3% Figure 6. The first five levels of the hierarchical grouping tree (for the example shown in Figures 4 and 5). Each group is represented by a box with the number of partials () and the relative amplitude () of the group. One group is split at each descending level. 3 3 N groups = 4 24

25 0 00% N groups = #wt % % N groups = 2 #wt 3 #wt % 3 2.0% % N groups = 3 #wt 3 #wt #wt N groups = % 6.0% 2.0% 25.3% #wt #wt #wt #wt 2 N groups = % 6.0% 0.4% 20.6% 25.3% #wt #wt #wt #wt #wt Figure 7. The wavetable allocation tree for the example from Figures 4-6 with total number of wavetables (N tabs ) = 5. Each group is represented by a box with the number of partials (), relative amplitude () and the number of allocated wavetables (#wt). 25

26 Given: #wt 3 2.0% 2 Wavetable and its amplitude envelope GA wavetable optimization Wavetable 2 and its amplitude envelope Group spectrum with three partials Figure 8. Genetic algorithm optimization of the wavetable basis spectra and amplitude envelopes for a group with three partials. The number of wavetables (#wt = 2) is set in the wavetable allocated tree. 26

27 8% 7% 5th level maxnfde 6% litude Error (RAE) 5% 4% 3% 2% 4th level 3rd level 2nd level Root level maxrae best MAQ % 0% 0.00% 0.05% 0.0% 0.5% 0.20% 0.25% 0.30% 0.35% 0.40% 0.45% 0.50% Frequency Error (NFDE) Figure 9. RAE versus NFDE for the example in Figure 7. maxnfde and maxrae are.0 and.05 respectively. The third level (N groups = 3) gives the best overall matching quality (MAQ). 27

28 40% 35% Piano litude Error (RAE) 30% 25% 20% 5% 0% Erhu Guitar Shakuhachi Violin Trumpet Saxophone 5% 0% 0.854% 0.566% 0.528% 0.404% 0.29% 0.66% 0.30% Instrument Inharmonicity Wavetable 2 Wavetables 3 Wavetables 4 Wavetables 5 Wavetables 6 Wavetables 7 Wavetables 8 Wavetables 9 Wavetables 0 Wavetables Wavetables 2 Wavetables Figure 0. Original wavetable matching method: The plot of instrument inharmonicity versus amplitude error (RAE) on seven near-harmonic tones. 28

29 40% 35% Piano litude Error (RAE) 30% 25% 20% 5% 0% Erhu Guitar Shakuhachi Violin Trumpet Saxophone 5% 0% 0.854% 0.566% 0.545% 0.404% 0.29% 0.66% 0.30% Instrument Inharmonicity Wavetable 2 Wavetables 3 Wavetables 4 Wavetables 5 Wavetables 6 Wavetables 7 Wavetables 8 Wavetables 9 Wavetables 0 Wavetables Wavetables 2 Wavetables Figure. New method: The plot of instrument inharmonicity versus amplitude error (RAE) on seven near-harmonic tones. 29

30 .% Frequency Error (NFDE).0% 0.9% 0.8% 0.7% 0.6% 0.5% 0.4% 0.3% 0.2% Erhu Piano Guitar Shakuhachi Violin Trumpet Saxophone 0.% 0.0% 0.854% 0.566% 0.528% 0.404% 0.29% 0.66% 0.30% Instrument Inharmonicity Wavetable 2 Wavetables 3 Wavetables 4 Wavetables 5 Wavetables 6 Wavetables 7 Wavetables 8 Wavetables 9 Wavetables 0 Wavetables Wavetables 2 Wavetables Figure 2. Original wavetable matching method: The plot of instrument inharmonicity versus frequency error (NFDE) on seven near-harmonic tones. 30

31 Frequency Error (NFDE).%.0% 0.9% 0.8% 0.7% 0.6% 0.5% 0.4% 0.3% 0.2% 0.% 0.0% Erhu Piano Guitar Shakuhachi Violin Trumpet Saxophone 0.854% 0.566% 0.545% 0.404% 0.29% 0.66% 0.30% Instrument Inharmonicity Wavetable 2 Wavetables 3 Wavetables 4 Wavetables 5 Wavetables 6 Wavetables 7 Wavetables 8 Wavetables 9 Wavetables 0 Wavetables Wavetables 2 Wavetables Figure 3. New method: The plot of instrument inharmonicity versus frequency error (NFDE) on seven near-harmonic tones. 3

32 litude Error (RAE) 55% 50% 45% 40% 35% 30% 25% 20% 5% 0% 5% 0% Pipa Qin Zheng Cello Glockenspiel Yangqin Qing Bell Liuqin Gong Ling 5.2% 3.8% 3.7% 2.7% 2.6% 2.3%.8%.8%.5%.5%.4% Instrument Inharmonicity Wavetable 2 Wavetables 3 Wavetables 4 Wavetables 5 Wavetables 6 Wavetables 7 Wavetables 8 Wavetables 9 Wavetables 0 Wavetables Wavetables 2 Wavetables Figure 4. Original wavetable matching method: The plot of instrument inharmonicity versus amplitude error (RAE) on eleven inharmonic tones. 32

33 litude Error (RAE) 55% 50% 45% 40% 35% 30% 25% 20% 5% 0% 5% 0% Pipa Qin Zheng Cello Glockenspiel Yangqin Qing Bell Liuqin Gong Ling 5.2% 3.8% 3.7% 2.7% 2.6% 2.3%.8%.8%.5%.5%.4% Instrument Inharmonicity Wavetable 2 Wavetables 3 Wavetables 4 Wavetables 5 Wavetables 6 Wavetables 7 Wavetables 8 Wavetables 9 Wavetables 0 Wavetables Wavetables 2 Wavetables Figure 5. New method: The plot of instrument inharmonicity versus amplitude error (RAE) on eleven inharmonic tones. 33

34 Frequency Error (NFDE) 6% 5% 4% 3% 2% % 0% 9% 8% 7% 6% 5% 4% 3% 2% % 0% Qin Zheng Yangqin Pipa Qing Bell Cello Gong Glockenspiel Liuqin Ling 5.2% 3.8% 3.7% 2.7% 2.6% 2.3%.8%.8%.5%.5%.4% Instrument Inharmonicity Wavetable 2 Wavetables 3 Wavetables 4 Wavetables 5 Wavetables 6 Wavetables 7 Wavetables 8 Wavetables 9 Wavetables 0 Wavetables Wavetables 2 Wavetables Figure 6. Original wavetable matching method: The plot of instrument inharmonicity versus frequency error (NFDE) on eleven inharmonic tones. 34

35 Frequency Error (NFDE) 6% 5% 4% 3% 2% % 0% 9% 8% 7% 6% 5% 4% 3% 2% % 0% Qin Zheng Yangqin Pipa Qing Bell Cello Gong Glockenspiel Liuqin Ling 5.2% 3.8% 3.7% 2.7% 2.6% 2.3%.8%.8%.5%.5%.4% Instrument Inharmonicity Wavetable 2 Wavetables 3 Wavetables 4 Wavetables 5 Wavetables 6 Wavetables 7 Wavetables 8 Wavetables 9 Wavetables 0 Wavetables Wavetables 2 Wavetables Figure 7. New method: The plot of instrument inharmonicity versus frequency error (NFDE) on the eleven inharmonic tones. 35

36 4 2 Number of Votes Number of Groups Figure 8. Number of votes by the subjects for the qin tone with 7-wavetable match from to 7 groups. The 6-groups is selected as the best error-balanced grouping from our MAQ function. 36

37 4 2 Number of Votes Number of Groups Figure 9. Number of votes by the subjects for the yangqin tone with 7-wavetable match from to 7 groups. The 5-groups is selected as the best error-balanced grouping from our MAQ function. 37

38 4 2 Number of Votes Number of Groups Figure 20. Number of votes by the subjects for the violin tone with 7-wavetable match from to 7 groups. The 5-groups is selected as the best error-balanced grouping from our MAQ function. 38

Optimized Multiple Wavetable Interpolation

Optimized Multiple Wavetable Interpolation Optimized Multiple Wavetable Interpolation JONATHAN MOHR Augustana Faculty University of Alberta Camrose, Alberta CANADA http://www.augustana.ca/ mohrj / XIAOBO LI Department of Computing Science University

More information

Spectral modeling of musical sounds

Spectral modeling of musical sounds Spectral modeling of musical sounds Xavier Serra Audiovisual Institute, Pompeu Fabra University http://www.iua.upf.es xserra@iua.upf.es 1. Introduction Spectral based analysis/synthesis techniques offer

More information

The Effects of MP3 Compression on Emotional Characteristics

The Effects of MP3 Compression on Emotional Characteristics The Effects of MP3 Compression on Emotional Characteristics Ronald Mo Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong ronmo@cse.ust.hk Chung Lee

More information

1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends

1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends Contents List of Figures List of Tables Contributing Authors xiii xxi xxiii Introduction Karlheinz Brandenburg and Mark Kahrs xxix 1 Audio quality determination based on perceptual measurement techniques

More information

VL Visual Editor. Manual

VL Visual Editor. Manual VL Visual Editor Manual Important Notices The VL Visual Editor (the Visual Editor) provides an easy, intuitive way to create, edit, and save basic voices for Yamaha VL Plug-in Board. Do not operate the

More information

Hand Controlled Audio Synthesizer

Hand Controlled Audio Synthesizer Alex Sanchez & Behram Mistree 6.111 Final Project Proposal 11.3.2006 Coordinate TA: Javy Hand Controlled Audio Synthesizer 1. Proposed Functionality We propose to build a device that incorporates a video

More information

Audio Watermarking Based on PCM Technique

Audio Watermarking Based on PCM Technique Audio Watermarking Based on PCM Technique Ranjeeta Yadav Department of ECE SGIT, Ghaziabad, INDIA Sachin Yadav Department of CSE SGIT, Ghaziabad, INDIA Jyotsna Singh Department of ECE NSIT, New Delhi,

More information

Modeling the Spectral Envelope of Musical Instruments

Modeling the Spectral Envelope of Musical Instruments Modeling the Spectral Envelope of Musical Instruments Juan José Burred burred@nue.tu-berlin.de IRCAM Équipe Analyse/Synthèse Axel Röbel / Xavier Rodet Technical University of Berlin Communication Systems

More information

2.4 Audio Compression

2.4 Audio Compression 2.4 Audio Compression 2.4.1 Pulse Code Modulation Audio signals are analog waves. The acoustic perception is determined by the frequency (pitch) and the amplitude (loudness). For storage, processing and

More information

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects. Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general

More information

A-DAFX: ADAPTIVE DIGITAL AUDIO EFFECTS. Verfaille V., Arfib D.

A-DAFX: ADAPTIVE DIGITAL AUDIO EFFECTS. Verfaille V., Arfib D. Proceedings of the COST G-6 Conference on Digital Audio Effects (DAFX-), Limerick, Ireland, December 6-8, A-DAFX: ADAPTIVE DIGITAL AUDIO EFFECTS Verfaille V., Arfib D. CNRS - LMA 3, chemin Joseph Aiguier

More information

The VL v2 Visual Editor/Beta 1.0 (VL Visual Editor : VVE) is a Beta test version of the program which has been released for use free of charge.

The VL v2 Visual Editor/Beta 1.0 (VL Visual Editor : VVE) is a Beta test version of the program which has been released for use free of charge. (1) Important Notice The VL v2 Visual Editor/Beta 1.0 (VL Visual Editor : VVE) is a Beta test version of the program which has been released for use free of charge. Yamaha cannot provide technical support

More information

Physical Modeling Synthesis of Sound. Adapted from Perry R. Cook Princeton Computer Science (also Music)

Physical Modeling Synthesis of Sound. Adapted from Perry R. Cook Princeton Computer Science (also Music) Physical Modeling Synthesis of Sound Adapted from Perry R. Cook Princeton Computer Science (also Music) prc@cs.princeton.edu www.cs.princeton.edu/~prc 1 One View of Sound Sound is a waveform, it, store

More information

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general

More information

Principles of Audio Coding

Principles of Audio Coding Principles of Audio Coding Topics today Introduction VOCODERS Psychoacoustics Equal-Loudness Curve Frequency Masking Temporal Masking (CSIT 410) 2 Introduction Speech compression algorithm focuses on exploiting

More information

Convention Paper Presented at the 120th Convention 2006 May Paris, France

Convention Paper Presented at the 120th Convention 2006 May Paris, France Audio Engineering Society Convention Paper Presented at the 12th Convention 26 May 2 23 Paris, France This convention paper has been reproduced from the author s advance manuscript, without editing, corrections,

More information

Math for Liberal Arts MAT 110: Chapter 11 Notes

Math for Liberal Arts MAT 110: Chapter 11 Notes Math for Liberal Arts MAT 110: Chapter 11 Notes Math and Art David J. Gisch Mathematics and Music Sound and Music Any vibrating object produces sound. The vibrations produce a wave. Most musical sounds

More information

Using Genetic Algorithms to Improve Pattern Classification Performance

Using Genetic Algorithms to Improve Pattern Classification Performance Using Genetic Algorithms to Improve Pattern Classification Performance Eric I. Chang and Richard P. Lippmann Lincoln Laboratory, MIT Lexington, MA 021739108 Abstract Genetic algorithms were used to select

More information

PRINCIPAL COMPONENT ANALYSIS OF RASTERISED AUDIO FOR CROSS- SYNTHESIS

PRINCIPAL COMPONENT ANALYSIS OF RASTERISED AUDIO FOR CROSS- SYNTHESIS PRINCIPAL COMPONENT ANALYSIS OF RASTERISED AUDIO FOR CROSS- SYNTHESIS Jeremy J. Wells Audio Lab, Department of Electronics, University of York, YO10 5DD, UK. jjw100@ohm.york.ac.uk Gregory Aldam Department

More information

A MORPHOLOGY-BASED FILTER STRUCTURE FOR EDGE-ENHANCING SMOOTHING

A MORPHOLOGY-BASED FILTER STRUCTURE FOR EDGE-ENHANCING SMOOTHING Proceedings of the 1994 IEEE International Conference on Image Processing (ICIP-94), pp. 530-534. (Austin, Texas, 13-16 November 1994.) A MORPHOLOGY-BASED FILTER STRUCTURE FOR EDGE-ENHANCING SMOOTHING

More information

D 5.2 Time stretching modules with synchronized multimedia prototype

D 5.2 Time stretching modules with synchronized multimedia prototype The time-stretching factor is sent to the audio processing engine in order to change the analysis hop size, and the audio output frame timestamp is calculated accordingly. However, this timestamp is not

More information

Convention Paper Presented at the 120th Convention 2006 May Paris, France

Convention Paper Presented at the 120th Convention 2006 May Paris, France Audio Engineering Society Convention Paper Presented at the 12th Convention 26 May 2 23 Paris, France This convention paper has been reproduced from the author s advance manuscript, without editing, corrections,

More information

Error Analysis, Statistics and Graphing

Error Analysis, Statistics and Graphing Error Analysis, Statistics and Graphing This semester, most of labs we require us to calculate a numerical answer based on the data we obtain. A hard question to answer in most cases is how good is your

More information

Intensity Transformations and Spatial Filtering

Intensity Transformations and Spatial Filtering 77 Chapter 3 Intensity Transformations and Spatial Filtering Spatial domain refers to the image plane itself, and image processing methods in this category are based on direct manipulation of pixels in

More information

Image Compression With Haar Discrete Wavelet Transform

Image Compression With Haar Discrete Wavelet Transform Image Compression With Haar Discrete Wavelet Transform Cory Cox ME 535: Computational Techniques in Mech. Eng. Figure 1 : An example of the 2D discrete wavelet transform that is used in JPEG2000. Source:

More information

Digital Media. Daniel Fuller ITEC 2110

Digital Media. Daniel Fuller ITEC 2110 Digital Media Daniel Fuller ITEC 2110 Daily Question: Digital Audio What values contribute to the file size of a digital audio file? Email answer to DFullerDailyQuestion@gmail.com Subject Line: ITEC2110-09

More information

1. Introduction. 2. Motivation and Problem Definition. Volume 8 Issue 2, February Susmita Mohapatra

1. Introduction. 2. Motivation and Problem Definition. Volume 8 Issue 2, February Susmita Mohapatra Pattern Recall Analysis of the Hopfield Neural Network with a Genetic Algorithm Susmita Mohapatra Department of Computer Science, Utkal University, India Abstract: This paper is focused on the implementation

More information

Reverberation prediction using the Image Source Method

Reverberation prediction using the Image Source Method Reverberation prediction using Image Source Method DESC9115 Lab Report 2 Alex Tu 307174123 Introduction The image source model technique (ISM) is a commonly utilised tool in acoustics and signal processing

More information

Chapter 5: Music Synthesis Technologies

Chapter 5: Music Synthesis Technologies Chapter 5: Technologies For the presentation of sound, music synthesis is as important to multimedia system as for computer graphics to the presentation of image. In this chapter, the basic principles

More information

Genetic Model Optimization for Hausdorff Distance-Based Face Localization

Genetic Model Optimization for Hausdorff Distance-Based Face Localization c In Proc. International ECCV 2002 Workshop on Biometric Authentication, Springer, Lecture Notes in Computer Science, LNCS-2359, pp. 103 111, Copenhagen, Denmark, June 2002. Genetic Model Optimization

More information

Supplemental Notes March 10, 1999 SN79 V1.0. The Basics of MIDI

Supplemental Notes March 10, 1999 SN79 V1.0. The Basics of MIDI && Supplemental Notes March 10, 1999 SN79 V1.0 The Basics of MIDI For electronic music aficionados, MIDI (Musical Instrument Digital Interface) has opened doors to new worlds of creativity. Before MIDI,

More information

AUTOMATING THE DESIGN OF SOUND SYNTHESIS TECHNIQUES USING EVOLUTIONARY METHODS. Ricardo A. Garcia *

AUTOMATING THE DESIGN OF SOUND SYNTHESIS TECHNIQUES USING EVOLUTIONARY METHODS. Ricardo A. Garcia * AUTOMATING THE DESIGN OF SOUND SYNTHESIS TECHNIQUES USING EVOLUTIONARY METHODS Ricardo A. Garcia * MIT Media Lab Machine Listening Group 20 Ames St., E5-49, Cambridge, MA 0239 rago@media.mit.edu ABSTRACT

More information

MPEG-4 Version 2 Audio Workshop: HILN - Parametric Audio Coding

MPEG-4 Version 2 Audio Workshop: HILN - Parametric Audio Coding MPEG-4 Version 2 Audio Workshop: HILN - Parametric Audio Coding Heiko Purnhagen Laboratorium für Informationstechnologie University of Hannover, Germany Outline Introduction What is "Parametric Audio Coding"?

More information

5.2. The Sine Function and the Cosine Function. Investigate A

5.2. The Sine Function and the Cosine Function. Investigate A 5.2 The Sine Function and the Cosine Function What do an oceanographer, a stock analyst, an audio engineer, and a musician playing electronic instruments have in common? They all deal with periodic patterns.

More information

Introduction to ANSYS DesignXplorer

Introduction to ANSYS DesignXplorer Lecture 4 14. 5 Release Introduction to ANSYS DesignXplorer 1 2013 ANSYS, Inc. September 27, 2013 s are functions of different nature where the output parameters are described in terms of the input parameters

More information

Adaptive Robotics - Final Report Extending Q-Learning to Infinite Spaces

Adaptive Robotics - Final Report Extending Q-Learning to Infinite Spaces Adaptive Robotics - Final Report Extending Q-Learning to Infinite Spaces Eric Christiansen Michael Gorbach May 13, 2008 Abstract One of the drawbacks of standard reinforcement learning techniques is that

More information

Falcon Singles - Glockenspiel for Falcon

Falcon Singles - Glockenspiel for Falcon Falcon Singles - Glockenspiel for Falcon 2016 Simon Stockhausen Installation As there is no default location for 3rd party sound libraries for Falcon, you can just install the folder Glockenspiel which

More information

Perceptual Audio Coders What to listen for: Artifacts of Parametric Coding

Perceptual Audio Coders What to listen for: Artifacts of Parametric Coding Perceptual Audio Coders What to listen for: Artifacts of Parametric Coding Heiko Purnhagen, Bernd Edler University of AES 109th Convention, Los Angeles, September 22-25, 2000 1 Introduction: Parametric

More information

ITNP80: Multimedia! Sound-II!

ITNP80: Multimedia! Sound-II! Sound compression (I) Compression of sound data requires different techniques from those for graphical data Requirements are less stringent than for video data rate for CD-quality audio is much less than

More information

A GENETIC ALGORITHM APPROACH TO OPTIMAL TOPOLOGICAL DESIGN OF ALL TERMINAL NETWORKS

A GENETIC ALGORITHM APPROACH TO OPTIMAL TOPOLOGICAL DESIGN OF ALL TERMINAL NETWORKS A GENETIC ALGORITHM APPROACH TO OPTIMAL TOPOLOGICAL DESIGN OF ALL TERMINAL NETWORKS BERNA DENGIZ AND FULYA ALTIPARMAK Department of Industrial Engineering Gazi University, Ankara, TURKEY 06570 ALICE E.

More information

Network Routing Protocol using Genetic Algorithms

Network Routing Protocol using Genetic Algorithms International Journal of Electrical & Computer Sciences IJECS-IJENS Vol:0 No:02 40 Network Routing Protocol using Genetic Algorithms Gihan Nagib and Wahied G. Ali Abstract This paper aims to develop a

More information

IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, VOL., NO., MONTH YEAR 1

IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, VOL., NO., MONTH YEAR 1 IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, VOL., NO., MONTH YEAR 1 An Efficient Approach to Non-dominated Sorting for Evolutionary Multi-objective Optimization Xingyi Zhang, Ye Tian, Ran Cheng, and

More information

Chapter X Sampler Instrument

Chapter X Sampler Instrument Chapter X Sampler Instrument A sampler is a synthesizer that generates sound by playing recorded sounds rather than calculated waveforms. The wave file player instrument described in an earlier chapter

More information

DUPLICATE DETECTION AND AUDIO THUMBNAILS WITH AUDIO FINGERPRINTING

DUPLICATE DETECTION AND AUDIO THUMBNAILS WITH AUDIO FINGERPRINTING DUPLICATE DETECTION AND AUDIO THUMBNAILS WITH AUDIO FINGERPRINTING Christopher Burges, Daniel Plastina, John Platt, Erin Renshaw, and Henrique Malvar March 24 Technical Report MSR-TR-24-19 Audio fingerprinting

More information

An Initial Seed Selection Algorithm for K-means Clustering of Georeferenced Data to Improve

An Initial Seed Selection Algorithm for K-means Clustering of Georeferenced Data to Improve An Initial Seed Selection Algorithm for K-means Clustering of Georeferenced Data to Improve Replicability of Cluster Assignments for Mapping Application Fouad Khan Central European University-Environmental

More information

Estimating the Quality of Databases

Estimating the Quality of Databases Estimating the Quality of Databases Ami Motro Igor Rakov George Mason University May 1998 1 Outline: 1. Introduction 2. Simple quality estimation 3. Refined quality estimation 4. Computing the quality

More information

Room Acoustics. CMSC 828D / Spring 2006 Lecture 20

Room Acoustics. CMSC 828D / Spring 2006 Lecture 20 Room Acoustics CMSC 828D / Spring 2006 Lecture 20 Lecture Plan Room acoustics basics Structure of room impulse response Characterization of room acoustics Modeling of reverberant response Basics All our

More information

Binary Representations of Integers and the Performance of Selectorecombinative Genetic Algorithms

Binary Representations of Integers and the Performance of Selectorecombinative Genetic Algorithms Binary Representations of Integers and the Performance of Selectorecombinative Genetic Algorithms Franz Rothlauf Department of Information Systems University of Bayreuth, Germany franz.rothlauf@uni-bayreuth.de

More information

Quick Value Statistics in Presentation

Quick Value Statistics in Presentation Quick Value Statistics in Presentation Presentation can show you a simple time series and distribution for any measured value, based on all loaded measurements. In the Data Contents tree, navigate to the

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 213 http://acousticalsociety.org/ ICA 213 Montreal Montreal, Canada 2-7 June 213 Engineering Acoustics Session 2pEAb: Controlling Sound Quality 2pEAb1. Subjective

More information

A New Selection Operator - CSM in Genetic Algorithms for Solving the TSP

A New Selection Operator - CSM in Genetic Algorithms for Solving the TSP A New Selection Operator - CSM in Genetic Algorithms for Solving the TSP Wael Raef Alkhayri Fahed Al duwairi High School Aljabereyah, Kuwait Suhail Sami Owais Applied Science Private University Amman,

More information

Evaluating Classifiers

Evaluating Classifiers Evaluating Classifiers Charles Elkan elkan@cs.ucsd.edu January 18, 2011 In a real-world application of supervised learning, we have a training set of examples with labels, and a test set of examples with

More information

Visualizing diffraction of a loudspeaker enclosure

Visualizing diffraction of a loudspeaker enclosure Visualizing diffraction of a loudspeaker enclosure V. Pulkki T. Lokki Laboratory of Acoustics and Audio Signal Processing Telecommunications Software and Multimedia Laboratory Helsinki University of Technology,

More information

Genetic algorithm based on number of children and height task for multiprocessor task Scheduling

Genetic algorithm based on number of children and height task for multiprocessor task Scheduling Genetic algorithm based on number of children and height task for multiprocessor task Scheduling Marjan Abdeyazdan 1,Vahid Arjmand 2,Amir masoud Rahmani 3, Hamid Raeis ghanavati 4 1 Department of Computer

More information

Audio-coding standards

Audio-coding standards Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.

More information

MetroPro Surface Texture Parameters

MetroPro Surface Texture Parameters MetroPro Surface Texture Parameters Contents ROUGHNESS PARAMETERS...1 R a, R q, R y, R t, R p, R v, R tm, R z, H, R ku, R 3z, SR z, SR z X, SR z Y, ISO Flatness WAVINESS PARAMETERS...4 W a, W q, W y HYBRID

More information

Keywords: Binary Sort, Sorting, Efficient Algorithm, Sorting Algorithm, Sort Data.

Keywords: Binary Sort, Sorting, Efficient Algorithm, Sorting Algorithm, Sort Data. Volume 4, Issue 6, June 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com An Efficient and

More information

Aerophones in Flatland

Aerophones in Flatland Aerophones in Flatland Interactive Wave Simulation of Wind Instruments Andrew Allen Nikunj Raghuvanshi Simulation Domain Mouthpiece Spectrogram Wind Instruments Non-linear Feedback Linear mouth reed bore

More information

Non-stationary interpolation in the f-x domain

Non-stationary interpolation in the f-x domain Stanford Exploration Project, Report 129, May 6, 2007, pages 75 85 Non-stationary interpolation in the f-x domain William Curry ABSTRACT Interpolation of seismic data has previously been performed using

More information

The Encoding Complexity of Network Coding

The Encoding Complexity of Network Coding The Encoding Complexity of Network Coding Michael Langberg Alexander Sprintson Jehoshua Bruck California Institute of Technology Email: mikel,spalex,bruck @caltech.edu Abstract In the multicast network

More information

Multimedia Database Systems. Retrieval by Content

Multimedia Database Systems. Retrieval by Content Multimedia Database Systems Retrieval by Content MIR Motivation Large volumes of data world-wide are not only based on text: Satellite images (oil spill), deep space images (NASA) Medical images (X-rays,

More information

QUANTIZER DESIGN FOR EXPLOITING COMMON INFORMATION IN LAYERED CODING. Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose

QUANTIZER DESIGN FOR EXPLOITING COMMON INFORMATION IN LAYERED CODING. Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose QUANTIZER DESIGN FOR EXPLOITING COMMON INFORMATION IN LAYERED CODING Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose Department of Electrical and Computer Engineering University of California,

More information

The Journal of MacroTrends in Technology and Innovation

The Journal of MacroTrends in Technology and Innovation MACROJOURNALS The Journal of MacroTrends in Technology and Innovation Automatic Knot Adjustment Using Dolphin Echolocation Algorithm for B-Spline Curve Approximation Hasan Ali AKYÜREK*, Erkan ÜLKER**,

More information

LAB 2: DATA FILTERING AND NOISE REDUCTION

LAB 2: DATA FILTERING AND NOISE REDUCTION NAME: LAB TIME: LAB 2: DATA FILTERING AND NOISE REDUCTION In this exercise, you will use Microsoft Excel to generate several synthetic data sets based on a simplified model of daily high temperatures in

More information

K-Nearest-Neighbours with a Novel Similarity Measure for Intrusion Detection

K-Nearest-Neighbours with a Novel Similarity Measure for Intrusion Detection K-Nearest-Neighbours with a Novel Similarity Measure for Intrusion Detection Zhenghui Ma School of Computer Science The University of Birmingham Edgbaston, B15 2TT Birmingham, UK Ata Kaban School of Computer

More information

SPREAD SPECTRUM AUDIO WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL

SPREAD SPECTRUM AUDIO WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL SPREAD SPECTRUM WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL 1 Yüksel Tokur 2 Ergun Erçelebi e-mail: tokur@gantep.edu.tr e-mail: ercelebi@gantep.edu.tr 1 Gaziantep University, MYO, 27310, Gaziantep,

More information

Name Course Days/Start Time

Name Course Days/Start Time Name Course Days/Start Time Mini-Project : The Library of Functions In your previous math class, you learned to graph equations containing two variables by finding and plotting points. In this class, we

More information

Instruments. 20 Ames Street. Abstract. We present a framework for the analysis and synthesis of acoustical instruments

Instruments. 20 Ames Street. Abstract. We present a framework for the analysis and synthesis of acoustical instruments Data-driven Modeling and Synthesis of Acoustical Instruments Bernd Schoner, Chuck Cooper, Chris Douglas, Neil Gershenfeld Physics and Media Group MIT Media Laboratory Ames Street Cambridge, MA 39 schoner@media.mit.edu

More information

4.1 QUANTIZATION NOISE

4.1 QUANTIZATION NOISE DIGITAL SIGNAL PROCESSING UNIT IV FINITE WORD LENGTH EFFECTS Contents : 4.1 Quantization Noise 4.2 Fixed Point and Floating Point Number Representation 4.3 Truncation and Rounding 4.4 Quantization Noise

More information

ON THE PERFORMANCE OF SEGMENT AVERAGING OF DISCRETE COSINE TRANSFORM COEFFICIENTS ON MUSICAL INSTRUMENTS TONE RECOGNITION

ON THE PERFORMANCE OF SEGMENT AVERAGING OF DISCRETE COSINE TRANSFORM COEFFICIENTS ON MUSICAL INSTRUMENTS TONE RECOGNITION O THE PERFORMACE OF SEGMET AVERAGIG OF DISCRETE COSIE TRASFORM COEFFICIETS O MUSICAL ISTRUMETS TOE RECOGITIO Linggo Sumarno Electrical Engineering Study Program, Sanata Dharma University, Yogyakarta, Indonesia

More information

Rhythmic constant pitch time stretching for digital audio

Rhythmic constant pitch time stretching for digital audio Rhythmic constant pitch time stretching for digital audio Brendan TREVORROW ; University of Southern Queensland, Australia ABSTRACT Constant pitch time stretching is not uncommon in audio editing software,

More information

New Results in Low Bit Rate Speech Coding and Bandwidth Extension

New Results in Low Bit Rate Speech Coding and Bandwidth Extension Audio Engineering Society Convention Paper Presented at the 121st Convention 2006 October 5 8 San Francisco, CA, USA This convention paper has been reproduced from the author's advance manuscript, without

More information

CHAPTER 4 STOCK PRICE PREDICTION USING MODIFIED K-NEAREST NEIGHBOR (MKNN) ALGORITHM

CHAPTER 4 STOCK PRICE PREDICTION USING MODIFIED K-NEAREST NEIGHBOR (MKNN) ALGORITHM CHAPTER 4 STOCK PRICE PREDICTION USING MODIFIED K-NEAREST NEIGHBOR (MKNN) ALGORITHM 4.1 Introduction Nowadays money investment in stock market gains major attention because of its dynamic nature. So the

More information

<< WILL FILL IN THESE SECTIONS THIS WEEK to provide sufficient background>>

<< WILL FILL IN THESE SECTIONS THIS WEEK to provide sufficient background>> THE GSS CODEC MUSIC 422 FINAL PROJECT Greg Sell, Song Hui Chon, Scott Cannon March 6, 2005 Audio files at: ccrma.stanford.edu/~gsell/422final/wavfiles.tar Code at: ccrma.stanford.edu/~gsell/422final/codefiles.tar

More information

Incorporating Known Pathways into Gene Clustering Algorithms for Genetic Expression Data

Incorporating Known Pathways into Gene Clustering Algorithms for Genetic Expression Data Incorporating Known Pathways into Gene Clustering Algorithms for Genetic Expression Data Ryan Atallah, John Ryan, David Aeschlimann December 14, 2013 Abstract In this project, we study the problem of classifying

More information

Effect of Wavetable on The Spectral Evolution of Karplus-Strong Synthesized Guitar Notes

Effect of Wavetable on The Spectral Evolution of Karplus-Strong Synthesized Guitar Notes IOSR Journal of VLSI and Signal Processing (IOSR-JVSP) Volume 6, Issue 6, Ver. III (Nov. - Dec. 216), PP 1-11 e-issn: 2319 42, p-issn No. : 2319 4197 www.iosrjournals.org Effect of Wavetable on The Spectral

More information

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 SUBJECTIVE AND OBJECTIVE QUALITY EVALUATION FOR AUDIO WATERMARKING BASED ON SINUSOIDAL AMPLITUDE MODULATION PACS: 43.10.Pr, 43.60.Ek

More information

UNIT 1: NUMBER LINES, INTERVALS, AND SETS

UNIT 1: NUMBER LINES, INTERVALS, AND SETS ALGEBRA II CURRICULUM OUTLINE 2011-2012 OVERVIEW: 1. Numbers, Lines, Intervals and Sets 2. Algebraic Manipulation: Rational Expressions and Exponents 3. Radicals and Radical Equations 4. Function Basics

More information

Random Search Report An objective look at random search performance for 4 problem sets

Random Search Report An objective look at random search performance for 4 problem sets Random Search Report An objective look at random search performance for 4 problem sets Dudon Wai Georgia Institute of Technology CS 7641: Machine Learning Atlanta, GA dwai3@gatech.edu Abstract: This report

More information

Scythe Synthesizer for iphone

Scythe Synthesizer for iphone Scythe Synthesizer for iphone http://www.bitmaskstudio.com/ https://twitter.com/bitmaskstudio Scythe is a polyphonic wavetable synthesizer which can load custom sounds and has a Wavetable Creator tool.

More information

SmsPerformer: A Real-Time Synthesis Interface for SMS

SmsPerformer: A Real-Time Synthesis Interface for SMS SmsPerformer: A Real-Time Synthesis Interface for SMS Alex Loscos, Eduard Resina Audiovisual Institute, Pompeu Fabra University Rambla 31, 08002 Barcelona, Spain {aloscos, eduard}@iua.upf.es http://www.iua.upf.es

More information

Evolutionary Computation Algorithms for Cryptanalysis: A Study

Evolutionary Computation Algorithms for Cryptanalysis: A Study Evolutionary Computation Algorithms for Cryptanalysis: A Study Poonam Garg Information Technology and Management Dept. Institute of Management Technology Ghaziabad, India pgarg@imt.edu Abstract The cryptanalysis

More information

CMPSCI 646, Information Retrieval (Fall 2003)

CMPSCI 646, Information Retrieval (Fall 2003) CMPSCI 646, Information Retrieval (Fall 2003) Midterm exam solutions Problem CO (compression) 1. The problem of text classification can be described as follows. Given a set of classes, C = {C i }, where

More information

GENETIC ALGORITHM VERSUS PARTICLE SWARM OPTIMIZATION IN N-QUEEN PROBLEM

GENETIC ALGORITHM VERSUS PARTICLE SWARM OPTIMIZATION IN N-QUEEN PROBLEM Journal of Al-Nahrain University Vol.10(2), December, 2007, pp.172-177 Science GENETIC ALGORITHM VERSUS PARTICLE SWARM OPTIMIZATION IN N-QUEEN PROBLEM * Azhar W. Hammad, ** Dr. Ban N. Thannoon Al-Nahrain

More information

AN EVOLUTIONARY APPROACH TO DISTANCE VECTOR ROUTING

AN EVOLUTIONARY APPROACH TO DISTANCE VECTOR ROUTING International Journal of Latest Research in Science and Technology Volume 3, Issue 3: Page No. 201-205, May-June 2014 http://www.mnkjournals.com/ijlrst.htm ISSN (Online):2278-5299 AN EVOLUTIONARY APPROACH

More information

Descriptive Statistics and Graphing

Descriptive Statistics and Graphing Anatomy and Physiology Page 1 of 9 Measures of Central Tendency Descriptive Statistics and Graphing Measures of central tendency are used to find typical numbers in a data set. There are different ways

More information

Text Categorization. Foundations of Statistic Natural Language Processing The MIT Press1999

Text Categorization. Foundations of Statistic Natural Language Processing The MIT Press1999 Text Categorization Foundations of Statistic Natural Language Processing The MIT Press1999 Outline Introduction Decision Trees Maximum Entropy Modeling (optional) Perceptrons K Nearest Neighbor Classification

More information

Speech-Coding Techniques. Chapter 3

Speech-Coding Techniques. Chapter 3 Speech-Coding Techniques Chapter 3 Introduction Efficient speech-coding techniques Advantages for VoIP Digital streams of ones and zeros The lower the bandwidth, the lower the quality RTP payload types

More information

CS 498PS Audio Computing Lab. Audio Restoration. Paris Smaragdis. paris.cs.illinois.edu U NIVERSITY OF URBANA-CHAMPAIGN

CS 498PS Audio Computing Lab. Audio Restoration. Paris Smaragdis. paris.cs.illinois.edu U NIVERSITY OF URBANA-CHAMPAIGN CS 498PS Audio Computing Lab Audio Restoration Paris Smaragdis paris@illinois.edu paris.cs.illinois.edu Today s lecture Audio restoration Gap filling Click removal Clip recovery 2 Missing data approach

More information

ENHANCED RADAR IMAGING VIA SPARSITY REGULARIZED 2D LINEAR PREDICTION

ENHANCED RADAR IMAGING VIA SPARSITY REGULARIZED 2D LINEAR PREDICTION ENHANCED RADAR IMAGING VIA SPARSITY REGULARIZED 2D LINEAR PREDICTION I.Erer 1, K. Sarikaya 1,2, H.Bozkurt 1 1 Department of Electronics and Telecommunications Engineering Electrics and Electronics Faculty,

More information

Audio-coding standards

Audio-coding standards Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.

More information

Head, Dept of Electronics & Communication National Institute of Technology Karnataka, Surathkal, India

Head, Dept of Electronics & Communication National Institute of Technology Karnataka, Surathkal, India Mapping Signal Processing Algorithms to Architecture Sumam David S Head, Dept of Electronics & Communication National Institute of Technology Karnataka, Surathkal, India sumam@ieee.org Objectives At the

More information

Chapter 3 Image Registration. Chapter 3 Image Registration

Chapter 3 Image Registration. Chapter 3 Image Registration Chapter 3 Image Registration Distributed Algorithms for Introduction (1) Definition: Image Registration Input: 2 images of the same scene but taken from different perspectives Goal: Identify transformation

More information

Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig

Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig Multimedia Databases Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.de 6 Audio Retrieval 6 Audio Retrieval 6.1 Basics of

More information

Optimum Alphabetic Binary Trees T. C. Hu and J. D. Morgenthaler Department of Computer Science and Engineering, School of Engineering, University of C

Optimum Alphabetic Binary Trees T. C. Hu and J. D. Morgenthaler Department of Computer Science and Engineering, School of Engineering, University of C Optimum Alphabetic Binary Trees T. C. Hu and J. D. Morgenthaler Department of Computer Science and Engineering, School of Engineering, University of California, San Diego CA 92093{0114, USA Abstract. We

More information

Refined Measurement of Digital Image Texture Loss Peter D. Burns Burns Digital Imaging Fairport, NY USA

Refined Measurement of Digital Image Texture Loss Peter D. Burns Burns Digital Imaging Fairport, NY USA Copyright 3 SPIE Refined Measurement of Digital Image Texture Loss Peter D. Burns Burns Digital Imaging Fairport, NY USA ABSTRACT Image texture is the term given to the information-bearing fluctuations

More information

Multi-label classification using rule-based classifier systems

Multi-label classification using rule-based classifier systems Multi-label classification using rule-based classifier systems Shabnam Nazmi (PhD candidate) Department of electrical and computer engineering North Carolina A&T state university Advisor: Dr. A. Homaifar

More information

THE EFFECT OF EARLY REFLECTIONS ON PERCEIVED TIMBRE ANALYZED WITH AN AUDITORY MODEL. Tapio Lokki, Ville Pulkki, and Lauri Savioja

THE EFFECT OF EARLY REFLECTIONS ON PERCEIVED TIMBRE ANALYZED WITH AN AUDITORY MODEL. Tapio Lokki, Ville Pulkki, and Lauri Savioja Proceedings of the 22 International Conference on Auditory Display, Kyoto, Japan, July 2-, 22 THE EFFECT OF EARLY REFLECTIONS ON PERCEIVED TIMBRE ANALYZED WITH AN AUDITORY MODEL Tapio Lokki, Ville Pulkki,

More information

2386 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 52, NO. 6, JUNE 2006

2386 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 52, NO. 6, JUNE 2006 2386 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 52, NO. 6, JUNE 2006 The Encoding Complexity of Network Coding Michael Langberg, Member, IEEE, Alexander Sprintson, Member, IEEE, and Jehoshua Bruck,

More information

FACTORSYNTH user manual

FACTORSYNTH user manual FACTORSYNTH user manual www.jjburred.com - software@jjburred.com J.J. Burred, 2018-2019 factorsynth version: 1.4, 28/2/2019 Introduction Factorsynth is a Max For Live device that uses machine learning

More information