Wavetable Matching of Pitched Inharmonic Instrument Tones
|
|
- Owen Wood
- 6 years ago
- Views:
Transcription
1 Wavetable Matching of Pitched Inharmonic Instrument Tones Clifford So and Andrew Horner Department of Computer Science Hong Kong University of Science and Technology Clear Water Bay, Kowloon, Hong Kong and Abstract Wavetable matching is the process of finding the parameters needed to resynthesize a musical instrument tone using wavetable synthesis. The most important parameters are the wavetable basis spectra. Previous wavetable matching has often assumed the original tone was harmonic or nearly harmonic. This assumption does not hold for instruments such as the plucked strings. A semi-automated process has recently been proposed to group the partials based on their normalized frequency deviations, and then match each group. However, the user must adjust the group size manually to find the best match. The method is also inefficient at matching harmonic instruments. This paper introduces a new method for hierarchically grouping partials with similar normalized frequency deviations. Ordinary wavetable matching then finds the basis spectra for individual groups. The new method is fully automatic and requires no prior knowledge about the inharmonicity of the tone. Results for 8 instrument tones show the new method improves the perceived match on harmonic tones and pitched inharmonic tones compared to ordinary wavetable matching. 0. Introduction The term wavetable synthesis is currently often used synonymously with sampling synthesis in the music industry. We will use the classical computer music meaning of wavetable synthesis throughout this paper, where an oscillator table is loaded with
2 several partials and indexed depending on the fundamental frequency. Several weighted wavetables can be summed to produce spectral changes in multiple wavetable synthesis. When the partials are disjoint (i.e. each partial only appears in only one of the wavetables), this is called group additive synthesis []-[3]. Recent work has shown how to match wavetable parameters for resynthesizing instrument tones [4]-[7]. One of the most successful of these methods, wavetable matching, picks spectral snapshots from the original tone to determine the wavetables (additive synthesis of the spectral snapshots loads the wavetables). This approach is intuitive, and guarantees an exact match at the timepoints of the selected spectral snapshots and excellent matches at neighboring points as well. Figure shows the multiple wavetable block diagram, where the output of each wavetable is scaled by an amplitude envelope and summed into the output signal. Wavetable matching occurs in the frequency domain and determines the spectra of the wavetables, called the basis spectra. Previous work on wavetable matching has used genetic algorithm (GA) optimization to determine the best time points to select the basis spectra. GAs are combinatorial optimization techniques modeled on the idea of evolving solutions through processes analogous to natural selection and breeding [8][9]. These evolutionary processes guide the parameter search using populations of potential solutions. To judge the fitness of a particular combination of basis spectra, the fitness function uses a least-squares solution to find the amplitude envelopes of each basis spectra, and an average relative amplitude error (RAE) to measure the spectral difference between the original and matched tones: RAE = N frames N frames! j = N hars! k = ( b N hars! k = # b" 2 k, j k, j ) b 2 k, j () where the N frames is the number of analysis frames, N hars is the number of partials, b k!, j is the kth partial amplitude of the synthesized tone at the jth analysis frame, and b, is the k j original tone s amplitude on the same partial and analysis frame. A relative amplitude 2
3 error of zero indicates an exact match, while an error of 0. indicates an average relative difference of 0%. Equation is not a perfect perceptual error measure, but it is reasonably accurate and computationally inexpensive. The GA uses the relative amplitude error as the fitness value of the candidate basis spectra, mixing and matching basis spectra combinations from the best individuals in the population, returning the best set of basis spectra as the solution. The cost of computing the relative amplitude error for each set of basis spectra can be held down by limiting the match points to a small number of representative test spectra (usually about 20). However, wavetable synthesis makes a basic assumption: the partials are harmonics, i.e., they have frequencies which are at integer multiples of the fundamental frequency ( f k = kf). This causes problems when modeling inharmonic instruments, such as the plucked strings. Wavetable matching generates a dynamic spectrum as close to the original as possible. However, it does not take into account the tone s inharmonicity (the amount of frequency deviation between the partials and integer multiples of the fundamental frequency). The degree of inharmonicity is dependent on the instrument [0][]. Figure 2 shows the normalized frequency deviations (NFD) for four acoustic instruments based on McAulay- Quatieri analysis [2]. We define the NFD for partial k as the following: NFD k = N frames! j= b ' f k, j % j ( & kf k, N frames! j= b k, j $ " # (2) where F is the fixed fundamental frequency (specified by the user prior to spectrum analysis), and f k, j is the actual time-varying frequency of partial k at analysis frame j. A value of 0.0 represents a +% frequency deviation. The equation weights the frequency deviation by the relative amplitude of each frame (i.e., high amplitude frames contribute more to the overall deviation than low amplitude frames). Figure 2 shows that the 3
4 plucked yangqin and zheng tones have larger and more variable frequency deviations than the trumpet and saxophone tones. Lee and Horner [3] introduced a group additive synthesis method for matching piano tones with stretched partials. Piano tones have simpler a inharmonicity than plucked string tones, because the amount of average frequency deviation increases in a fairly continuous way from partial to partial on the piano. This is unlike the zheng tone, where a partial may be very flat and its neighbor very sharp (see Figure 2). Such deviations require a special model to represent them accurately. So and Horner [4] recently introduced an inharmonic wavetable matching technique that classifies partials with similar normalized frequency deviations. The method then uses ordinary wavetable matching for each class. However, the method is only semi-automatic and requires user intervention to determine the class size. The user inputs the value by trial-and-error seeking the best match. The method is also inefficient for matching harmonic tones, using much more wavetables than ordinary wavetable matching without any significant improvement. This paper introduces a new wavetable matching technique that adapts wavetable matching to both harmonic and pitched inharmonic tones without prior knowledge about the inharmonicity of the tone. The new method uses a hierarchical grouping technique. Section describes the new method, Section 2 gives results for 8 instrument tones, and Section 3 concludes the paper.. Wavetable Matching of Pitched Inharmonic Tones A new version of wavetable matching is presented in this section. The idea is to hierarchically group the original tone s partials based on their normalized frequency deviation (NFD), and then determine the basis spectra and amplitude envelopes of individual groups using ordinary GA wavetable matching. The user specifies the number of wavetables (N tabs ), and the algorithm minimizes the relative amplitude error (RAE) and 4
5 normalized frequency deviation error (NFDE). The procedure is fully automatic and requires no intervention from the user about the inharmonicity of the tone, compared to the previous version of the algorithm [4]. Figure 3 outlines the method. First, a short-time Fourier analysis such as the phase vocoder [5][6] transforms the tone from the time domain to the frequency domain. If the tones has severe pitch changes we apply McAulay-Quatieri [2] analysis. Beauchamp [7] gives more details about these procedures. Next, a group decomposition tree of the partials is formed according to the normalized frequency deviation (NFD k ). Then, wavetables are distributed among the groups, forming a wavetable allocation tree. Ordinary GA-wavetable matching is applied to each group on each level of the tree. The result is a set of basis spectra and amplitude envelopes, along with their frequency error (NFDE) and amplitude error (RAE) for each group. The final parameters are selected from the level with the lowest overall frequency and amplitude error. Section. discusses the hierarchical grouping method for the partials. Section.2 shows how to distribute the wavetables among the various groups. Section.3 shows how to find the level with the lowest overall error, and Section.4 describes how to resynthesize the signal.. Hierarchical Grouping of the Partials A good grouping algorithm is critical to the overall procedure, because the individual frequency deviations of the partials will be replaced with a single group frequency deviation. Therefore, partials with similar deviations should be grouped together. The goodness of a grouping can be measured by its normalized frequency deviation error (NFDE), which is defined as: NFDE = N! groups i= NFDE 2 i = N groups!!( NFDk # GNFDi ) i= k" group i 2 (3) 5
6 where N groups is the total number of groups, NFDE i is the normalized frequency deviation error of group i, and GNFD i is the average NFD k for group i, which is defined as GNFD i = NFD k" group = i H i! k" group NFD i k (4) where H i is the number of partials in group i. The lower the NFDE, the better the grouping. We propose a hierarchical grouping algorithm that minimizes the NFDE for a given number of groups N groups. The steps are as follows: i) Let array T be a set of ordered pairs where the first number represents the partial number, and the second its frequency deviation NFD k, where k N hars. Sort T by the frequency deviation values in ascending order (i.e., the flatest deviation will be first in T, and the sharpest last). ii) Construct a set of edges E that connect all the adjacent ordered pairs of T, where e x = (t x, t x+ ) for x < N hars. Set the number of current existing groups c =. iii) Connected ordered pairs are in the same group. Pick the group m with the largest NFDE m. iv) Split group m into left and right groups by breaking one edge. Test all the edges in group m to find which edge gives the largest NFDE m reduction after its removal. That is, find the edge in group m that gives the largest difference: NFDE! ( NFDE + NFDE ). m left right v) c = c +. Stop the algorithm if N groups = c, otherwise go to step iii and continue. An example run of the algorithm is shown in Figures 4, 5, and 6 for a hypothetical instrument with ten partials. Figure 4 shows amplitude versus normalized frequency deviation for each partial. Figure 5 shows the sorted partials along the x-axis and the connected edges between them. The amplitude information is not shown in the graph, as it is unnecessary in the grouping algorithm (though it will be used in the wavetable allocation algorithm). Each step in Figure 5 shows the successive removal of edges: e 4 is removed first, e 7 is removed second, and so on. Figure 6 shows the corresponding 6
7 hierarchical grouping tree with the relative amplitude () of each group. When N groups =, no edge is removed and the partials form a single group {, 8, 2, 5, 9, 0, 3, 6, 7, 4}, and thus = 00%. When N groups = 2, removing e 4 results in the largest frequency error reduction and the groups {, 8, 2, 5} ( = 53.7%) and {9, 0, 3, 6, 7, 4} ( = 46.3%). When N groups = 3, removing e 7 results in the largest error reduction and breaks the second group into {9, 0, 3} ( = 2%) and {6, 7, 4} ( = 25.3%), and so on. At the bottom of the tree, with N groups = N hars = 0, no edge remains and every partial is in a group by itself..2 Allocating the Wavetables After forming the group decomposition tree, ordinary GA wavetable matching on each group in the tree determines the basis spectra and amplitude envelopes. But how to best distribute the wavetables among groups, given the total number of wavetables (N tabs )? We set the number of wavetables in group i (#wt i ) to be proportional to the relative amplitude ( i ) of group i, that is # wt! N (5) i i tabs where the relative amplitude ( i ) comes from the group decomposition tree. The number of group wavetables #wt i is rounded to the nearest integer. Equation 5 distributes the wavetables according to the relative amplitude of each group. Proportional allocation of the wavetables insures that groups with higher relative amplitudes have higher matching accuracy, helping to reduce the overall relative amplitude error (RAE) in Equation. Figure 7 shows the wavetable allocation tree for the example from Figures 4-6 with N tabs = 5. Since the maximum number of wavetables is five in this case, the wavetable allocation tree contains only five levels (with five groups in the leaf level). The single group at the root includes all five wavetables, whereas each leaf will contain only one wavetable. The groups in intermediate levels will be allocated a proportional number of wavetables. We also restrict #wt i, such that 7
8 #wt i i (6) The number of wavetables allocated in group i must be smaller than or equal to the number of partials in group i. It is unnecessary to have more wavetables than partials in a group since additive synthesis yields an exact match (a "perfect group"). Such an example occurs in the leftmost group of the forth level (N groups = 4) of the wavetable allocation tree in Figure 7. According to Equation 5, the leftmost group ( = 37.7%) has the highest relative amplitude and would contain two wavetables, while the second strongest group ( = 25.3%) would contain one wavetable. However, there is only one partial in the 37.7% group, and one wavetable is enough to perfectly resynthesize this group. The surplus wavetable is passed to the second strongest group (which contains three partials). If the second strongest group already has enough wavetables, the surplus wavetable is passed to the next strongest candidate, and so on. Each group in the wavetable allocation tree (i.e., each box in Figure 7) is then passed to a genetic algorithm for optimization of the wavetables and amplitude envelopes as shown in Figure 8. This yields the amplitude error (RAE) and frequency error (NFDE) of the group. Identical groups from different levels only require one optimization. The total amplitude error (RAE) and frequency error (NFDE) can then be calculated for each level of the tree using all the partials..3 Selecting the Best Grouping After obtaining the amplitude error (RAE) and frequency error (NFDE) for each level, we need to determine which level gives the best grouping. To do this, we plot RAE versus NFDE for all levels of the tree. Figure 9 shows a plot of the example in Figure 7. The curve starts at the lower-right corner for the root and extends to the upper-left corner of the leaf level. In this example, there are five points corresponding to the five levels of the tree. The number of points in the tree is always equal to N tabs. We need to find the point that gives the lowest amplitude and frequency error. However, the point with the lowest amplitude error may not be the point with the lowest frequency 8
9 error. The frequency error always decreases with more groups, but the amplitude error may not (in the example in Figure 9 the amplitude error decreases up to three groups, and increases with more groups). One method of reconciling these two criteria is to select the point closest to the origin. Measuring the normalized Euclidian distance to the origin can do this. We term this distance the matching quality (MAQ) of a point, and defined it as: 2 2 & NFDE # & RAE # MAQ = $! + $! (7) % maxnfde " % maxrae " where the maxnfde and maxrae are the maximum NFDE and RAE of the line. A normalized distance is preferable to an absolute distance because the NFDE and RAE are on different scales. In the example in Figure 9, the values of NFDE are ten times smaller than RAE. That means the RAE will dominate the MAQ if the absolute values are used. For the example in Figure 9, the third level point (N groups = 3) is selected as the best grouping since it has the lowest MAQ (it is a little better than the fourth level point)..4 Resynthesizing the Signal To resynthesize the signal, we synthesize each group x i (t) using the ordinary multiple wavetable synthesis and sum all the group signals to produce the final synthetic signal x(t). To synthesize the group signal x i (t), every wavetable in group i uses a common timevarying fundamental frequency g i (t) defined as: ' & b # $ k, t NFDk ( t)! k( Groupi g = ( + ) = $ +! i ( t) F vi ( t) F (8) $! ' bk, t % k( Groupi " where F is the fixed fundamental frequency (specified by the user prior to spectrum analysis), and v i (t) is the time-varying NFD of group i. v i (t) is equal to the weighted-nfd of the group's partials. The weighting ensures that the resulting NFD will not be adversely affected by low amplitude partials, which are often noisy (the same situation as 9
10 in Equation 2). We do not use the GNFD i defined in Equation 4 to calculate g i (t), because GNFD i is only a single value rather than a time-varying value. We synthesize the group signal x i (t) using multiple wavetable synthesis. # wt i! x ( t) = w ( t) y ( t (9) i j= i, j i, j ) where y i,j (t) is the time-varying j-th wavetable for group i, w i,j (t) is the amplitude envelope of the j-th wavetable for group i, and #wt i is the number of wavetables in group i. Finally, we sum the signals of each group x i (t) to produce the resynthesized signal x(t). N groups! x( t) = x i ( t) (0) i= 2. Results This section compares the results of the new method to the original wavetable matching method. Eighteen instrument tones have been tested and are categorized as either harmonic tones (overall inharmonicity %) or pitched inharmonic tones (overall inharmonicity > %). We define the inharmonicity of a tone as its normalized frequency deviation error (NFDE) when N tabs =. Seven of the eighteen instruments are harmonic. They are mainly the woodwinds, brass and bowed strings (saxophone, shakuhachi, trumpet, violin, erhu, guitar, and piano). The remaining eleven instruments are inharmonic. They are mostly plucked string and pitched percussion (pipa, liuqin, zheng, qin, yangqin, glockenspiel, bell, qing, pitched gong, ling, and cello). Non-pitched instruments, such as the cymbal, are not included because they are unsuitable for wavetable matching, and Section 2. shows the harmonic results. Section 2.2 shows the inharmonic results. Section 2.3 shows the listening test results. 0
11 2. Harmonic Results We judge the quality of the matched tone by its amplitude error (RAE) and frequency error (NFDE). Figures 0 and show the amplitude error results. The tones are listed from left to right in order of decreasing inharmonicity. For each tone in the original method, the amplitude error decreases exponentially with an increasing number of wavetables (see Figure 0). In the new method, however, the amplitude error sometimes drops more rapidly than in the original wavetable method (see Figure ). The most obvious example is the guitar, which drops to nearly zero when N tabs = 5. Another example is the shakuhachi. When N tabs is large, there are more perfect groups. Recall that if #wt i = i (the number of wavetables equals the number of partials in group i), additive synthesis generates a perfect match. With more perfect groups, the overall amplitude error drops faster. The dropping rate actually depends on the relative amplitudes of the perfect groups. Stronger perfect groups reduce the error faster (as in the guitar and shakuhachi). Figures 2 and 3 show the frequency error results. The tones are listed in the same order. For every tone in the original method, the NFDE must stay the same because there are no groups (see Figure 2). On the other hand, the new method improves the NFDE with increasing N tabs (see Figure 3). Groups allow better NFD resolution. All the frequency errors shown in Figures 2 and 3 are below %, indicating that all the tones are near-harmonic and the original method is a good enough model. Nevertheless, the new method further improves the frequency error, although the improvement may not be perceptible. To summarize the harmonic tones results, the new method slightly improves on the original wavetable method for the same amount of resources. 2.2 Inharmonic Results Figures 4 and 5 show the amplitude errors (RAE) for each tone. The tones are listed from left to right by decreasing inharmonicity. The amplitude error of the original
12 wavetable method drops exponentially in Figure 4. It reaches zero for the qing, bell and ling when N tabs = 2 = N hars (it is the same as additive synthesis). On the other hand, the amplitude error of the new method drops significantly faster in Figure 5. The bell error drops the fastest, and approaches zero when N tabs = 6. The pipa error drops the slowest, and is similar to the original method. The bell has more perfect groups and they are stronger than in the other instruments. The pipa contains perfect groups, but they are all weak. Figures 6 and 7 show the resulting frequency errors (NFDE) for each tone. The tones are listed in the same order from left to right. Figure 6 shows that the frequency errors of all the tones stay the same with the original wavetable method. This is because the original wavetable method is unable to divide the partials into groups to give better NFD resolution. On the other hand, the new method lowers the overall frequency error (see Figure 7). 2.3 Listening Test Results To further evaluate our new method versus the original method, we carry out a listening test. The qin, yangqin and violin tones are matched with 7 wavetables in our test with to 7 number of groups according to the hierarchical grouping method in Section.. 3 subjects are given the 7 synthesized tones and the real tone (filtered to 3 partials) and asked to vote for the best 2 tones out of the 7 tones that sound closest to the real tone. They can also vote for the same tone twice if it sounds distinguishably the best. They are given a high quality headphone for the test and they can listen to the tones as many times as they want for the judgement. Figure 8, 9 and 20 show the vote results for the qin, yangqin and violin respectively. Note that the group indicates the original method and the circled group indicates the error-balanced group indicated from our MAQ function. The number of votes generally conform to our MAQ function. The subjects highly prefer the tones near the MAQ group. While nearly no votes are given to the number of group. That shows a clear quality 2
13 improvement over the original method. Another interesting result is that the votes are so concentrated in the qin (Figure 8) but spread across in the violin (Figure 20). Feedbacks from the subjects are that the qin tones are very easy to hear the difference but all the violin tones sound very close. That conforms to the fact that the violin tone is a nearlyharmonic tone with inharmonicity 0.29%, further improvement to it is more difficult to be heard, contrasting to the qin having initial frequency error 5.8% be reduced to 0.7%. 3. Conclusion The original wavetable matching method cannot effectively simulate pitched inharmonic tones such as the plucked strings. To remedy this problem, a new technique has been introduced to adapt the original method to pitched inharmonic tones. The new method hierarchically separates the partials into groups based on their normalized frequency deviations. Wavetables are then allocated to each group and ordinary wavetable matching is performed on each group. The final parameters are those with the lowest combined amplitude and frequency errors. The new method requires no user intervention, unlike the previous version of the algorithm [4], and is fully automatic. Eighteen instrument tones from the harmonic and pitched inharmonic families were tested. The results show that the new method is as good as the original method for harmonic tones, and better for inharmonic tones. The harmonic matches require fewer wavetables than inharmonic matches which require more wavetables to resolve the frequency error. The new method has its limits, and cannot handle non-pitched instruments such as the cymbal or environmental sounds. Their partials are noisy and unstable, and thus unsuitable for wavetable matching. The new method is well-suited to existing music synthesis hardware running multiple wavetable synthesis in realtime. The new method expands the range of the instruments 3
14 possible in wavetable synthesis from wind and bowed string instruments, to also include plucked string and pitched percussion instruments (which are usually modelled by PCM sampling). The optimal parameters selected by the new method are small in size, and efficiently stored. 4
15 Bibliography [] P. Kleczkowski, Group Additive Synthesis, Computer Music Journal, vol. 3, no., pp (989). [2] N.-M. Cheung and A. B. Horner, Group Synthesis with Genetic Algorithms, Journal of Audio Engineering Society, vol. 44, pp (996 Mar.). [3] S. Oates and B. Eaglestone, Analytical Methods for Group Synthesis, Computer Music Journal, vol. 2, no. 2, pp (997). [4] M.-H. Serra, D. Rubine and R. Dannenberg, Analysis and Synthesis of Tones by Spectral Interpolation, Journal of Audio Engineering Society, vol. 38, pp. -28 (990 Ma.). [5] A. B. Horner, J. Beauchamp, and L. Haken, Methods for Multiple Wavetable Synthesis of Musical Instrument Tones, Journal of Audio Engineering Society, vol. 4, pp (993 May). [6] A. B. Horner and J. Beauchamp, Synthesis of Trumpet Tones Using a Wavetable and Dynamic Filter, Journal of Audio Engineering Society, vol. 43, pp (995 Oct.). [7] A. B. Horner, Computation and Memory Tradeoffs with Multiple Wavetable Interpolation, Journal of Audio Engineering Society, vol. 44, pp (996 June). [8] J. H. Holland, Adaptation in Natural and Artificial Systems. Ann Arbor: The University of Michigan Press. [9] D. Goldberg, Genetic Algorithms in Search, Optimization, and Machine Learning, Reading, MA: Addison-Wesley. [0] S. Dubnov and N. Tishby, Influence of Frequency Modulating Jitter on Higher Order Moments of Sound Residual with Applications to Synthesis and Classification, Proceedings of the 996 ICMC, pp (996). [] J. C. Brown, Frequency Rations of Spectral Components of Musical Sounds, Journal of Acoustical Society of America, vol. 99, pp (996). [2] R. McAulay and T. Quatieri, Speech Analysis/Synthesis based on a Sinusoidal Representation, IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-34, pp (986). 5
16 [3] K. Lee and A. B. Horner, Modeling Piano Tones with Group Synthesis, Journal of the Audio Engineering Society, vol. 47, pp. 0- (999 Mar.). [4] C. So and A. B. Horner, Wavetable Matching of Inharmonic String Tones, Journal of the Audio Engineering Society, vol. 50, pp (2002 Jan./Feb.). [5] J. B. Allen, Short Term Spectral Analysis, Synthesis, and Modification by Discrete Fourier transform, IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-34, pp (986). [6] M. Dolson, The Phase Vocoder: A Tutorial, Computer Music Journal, vol. 0, no. 4, pp (986). [7] J. Beauchamp, Unix Workstation Software for Analysis, Graphics, Modification, and Synthesis of Musical Sounds, presented at the 94th Convention of the Audio Engineering Society, Journal of the Audio Engineering Society (Abstracts), vol. 4, preprint no. 3479, pp. 387 (993 May). 6
17 List of Figures Figure. Figure 2. Figure 3. Figure 4. Figure 5. Figure 6. Figure 7. Figure 8. Figure 9. Multiple wavetable synthesis block diagram. Normalized frequency deviations of partials of the saxophone (F#4), trumpet (F4), zheng (C#4), and yangqin (C4). Wavetable matching algorithm for pitched inharmonic tones. Normalized frequency deviations for a hypothetical instrument with ten partials. The graph shows amplitude versus normalized frequency deviation (NFD) for each partial (the labels indicate the partial numbers). Increasing the number of groups by the successive removal of edges (for the same example shown in Figure 4). The partials are sorted by their NFD k and labelled on the x-axis. Connected edges are part of the same group. The process stops when all edges are removed, in this case, where N groups = N hars = 0. The first five levels of the hierarchical grouping tree (for the example shown in Figures 4 and 5). Each group is represented by a box with the number of partials () and the relative amplitude () of the group. One group is split at each descending level. The wavetable allocation tree for the example from Figures 4-6 with total number of wavetables (N tabs ) = 5. Each group is represented by a box with the number of partials (), relative amplitude () and the number of allocated wavetables (#wt). Genetic algorithm optimization of the wavetable basis spectra and amplitude envelopes for a group with three partials. The number of wavetables (#wt = 2) is set in the wavetable allocated tree. RAE versus NFDE for the example in Figure 7. maxnfde and maxrae are.0 and.05 respectively. The third level (N groups = 3) gives the best overall matching quality (MAQ). Figure 0. Original wavetable matching method: The plot of instrument inharmonicity versus amplitude error (RAE) on seven near-harmonic tones. Figure. New method: The plot of instrument inharmonicity versus amplitude error (RAE) on seven near-harmonic tones. Figure 2. Original wavetable matching method: The plot of instrument inharmonicity versus frequency error (NFDE) on seven near-harmonic tones. 7
18 Figure 3. New method: The plot of instrument inharmonicity versus frequency error (NFDE) on seven near-harmonic tones. Figure 4. Original wavetable matching method: The plot of instrument inharmonicity versus amplitude error (RAE) on eleven inharmonic tones. Figure 5. New method: The plot of instrument inharmonicity versus amplitude error (RAE) on eleven inharmonic tones. Figure 6. Original wavetable matching method: The plot of instrument inharmonicity versus frequency error (NFDE) on eleven inharmonic tones. Figure 7. New method: The plot of instrument inharmonicity versus frequency error (NFDE) on the eleven inharmonic tones. Figure 8. Number of votes by the subjects for the qin tone with 7-wavetable match from to 7 groups. The 6-groups is selected as the best error-balanced grouping from our MAQ function. Figure 9. Number of votes by the subjects for the yangqin tone with 7-wavetable match from to 7 groups. The 5-groups is selected as the best error-balanced grouping from our MAQ function. Figure 20. Number of votes by the subjects for the violin tone with 7-wavetable match from to 7 groups. The 5-groups is selected as the best error-balanced grouping from our MAQ function. 8
19 f ( ) t.... w ( ) w ( ) t! 2 t w N (t)....!! + x (t) w i (t) amplitude envelope of ith table at time t f ( t ) fundamental frequency at time t x (t) sample output at time t N number of wavetables Figure. Multiple wavetable synthesis block diagram. 9
20 2% Normalized Frequency Deviation (NFD) % 0% -% -2% -3% -4% -5% -6% -7% % Partials Saxophone Trumpet Zheng Yanqin Figure 2. Normalized frequency deviations of partials of the saxophone (F#4), trumpet (F4), zheng (C#4), and yangqin (C4). 20
21 Original Tone Short-time Spectrum Analysis Time-varying frequency spectra Hierarchical grouping of partials based on their NFD k Group decomposition tree Wavetable allocation based on N tabs Wavetable allocation tree For each level in the tree Ordinary GA-based wavetable matching Frequency Error (NFDE) and litude Error (RAE) No Processed all the levels? Yes The parameters with the lowest NFDE and RAE Figure 3. Wavetable matching algorithm for pitched inharmonic tones. 2
22 litude % -8% -6% -4% -2% 0% 2% 4% 6% 8% 0 Normalized Frequency Deviation (NFD) Figure 4. Normalized frequency deviations for a hypothetical instrument with ten partials. The graph shows amplitude versus normalized frequency deviation (NFD) for each partial (the labels indicate the partial numbers). 22
23 NFD k NFD k NFD k e e 3 e 4 e 5 e 6 e 7 e 8 e 9 8 e % -8% -6% -4% -2% 0% 2% 4% 6% 8% e e 2 e 3 e 5 e 6 e 7 e 8 e % -8% -6% -4% -2% 0% 2% 4% 6% 8% e % -8% -6% -4% -2% 0% 2% 4% 6% 8% 3 e e 3 e 5 e 6 e 8 e N groups = N groups = 2 N groups = 3 NFD k e N groups = 9-0% -8% -6% -4% -2% 0% 2% 4% 6% 8% NFD k N groups = N hars = 0-0% -8% -6% -4% -2% 0% 2% 4% 6% 8% Figure 5. Increasing the number of groups by the successive removal of edges (for the same example shown in Figure 4). The partials are sorted by their NFD k and labelled on the x-axis. Connected edges are part of the same group. The process stops when all edges are removed, in this case, where N groups = N hars = 0. 23
24 0 00% N groups = p 53.7% % N groups = 2 p 4 3 p53.7% p 3 3 p2.0% p N groups = % 37.7% p6.0% p2.0% p25.3% N groups = p p37.7% p6.0% p0.4% p20.6% p25.3% Figure 6. The first five levels of the hierarchical grouping tree (for the example shown in Figures 4 and 5). Each group is represented by a box with the number of partials () and the relative amplitude () of the group. One group is split at each descending level. 3 3 N groups = 4 24
25 0 00% N groups = #wt % % N groups = 2 #wt 3 #wt % 3 2.0% % N groups = 3 #wt 3 #wt #wt N groups = % 6.0% 2.0% 25.3% #wt #wt #wt #wt 2 N groups = % 6.0% 0.4% 20.6% 25.3% #wt #wt #wt #wt #wt Figure 7. The wavetable allocation tree for the example from Figures 4-6 with total number of wavetables (N tabs ) = 5. Each group is represented by a box with the number of partials (), relative amplitude () and the number of allocated wavetables (#wt). 25
26 Given: #wt 3 2.0% 2 Wavetable and its amplitude envelope GA wavetable optimization Wavetable 2 and its amplitude envelope Group spectrum with three partials Figure 8. Genetic algorithm optimization of the wavetable basis spectra and amplitude envelopes for a group with three partials. The number of wavetables (#wt = 2) is set in the wavetable allocated tree. 26
27 8% 7% 5th level maxnfde 6% litude Error (RAE) 5% 4% 3% 2% 4th level 3rd level 2nd level Root level maxrae best MAQ % 0% 0.00% 0.05% 0.0% 0.5% 0.20% 0.25% 0.30% 0.35% 0.40% 0.45% 0.50% Frequency Error (NFDE) Figure 9. RAE versus NFDE for the example in Figure 7. maxnfde and maxrae are.0 and.05 respectively. The third level (N groups = 3) gives the best overall matching quality (MAQ). 27
28 40% 35% Piano litude Error (RAE) 30% 25% 20% 5% 0% Erhu Guitar Shakuhachi Violin Trumpet Saxophone 5% 0% 0.854% 0.566% 0.528% 0.404% 0.29% 0.66% 0.30% Instrument Inharmonicity Wavetable 2 Wavetables 3 Wavetables 4 Wavetables 5 Wavetables 6 Wavetables 7 Wavetables 8 Wavetables 9 Wavetables 0 Wavetables Wavetables 2 Wavetables Figure 0. Original wavetable matching method: The plot of instrument inharmonicity versus amplitude error (RAE) on seven near-harmonic tones. 28
29 40% 35% Piano litude Error (RAE) 30% 25% 20% 5% 0% Erhu Guitar Shakuhachi Violin Trumpet Saxophone 5% 0% 0.854% 0.566% 0.545% 0.404% 0.29% 0.66% 0.30% Instrument Inharmonicity Wavetable 2 Wavetables 3 Wavetables 4 Wavetables 5 Wavetables 6 Wavetables 7 Wavetables 8 Wavetables 9 Wavetables 0 Wavetables Wavetables 2 Wavetables Figure. New method: The plot of instrument inharmonicity versus amplitude error (RAE) on seven near-harmonic tones. 29
30 .% Frequency Error (NFDE).0% 0.9% 0.8% 0.7% 0.6% 0.5% 0.4% 0.3% 0.2% Erhu Piano Guitar Shakuhachi Violin Trumpet Saxophone 0.% 0.0% 0.854% 0.566% 0.528% 0.404% 0.29% 0.66% 0.30% Instrument Inharmonicity Wavetable 2 Wavetables 3 Wavetables 4 Wavetables 5 Wavetables 6 Wavetables 7 Wavetables 8 Wavetables 9 Wavetables 0 Wavetables Wavetables 2 Wavetables Figure 2. Original wavetable matching method: The plot of instrument inharmonicity versus frequency error (NFDE) on seven near-harmonic tones. 30
31 Frequency Error (NFDE).%.0% 0.9% 0.8% 0.7% 0.6% 0.5% 0.4% 0.3% 0.2% 0.% 0.0% Erhu Piano Guitar Shakuhachi Violin Trumpet Saxophone 0.854% 0.566% 0.545% 0.404% 0.29% 0.66% 0.30% Instrument Inharmonicity Wavetable 2 Wavetables 3 Wavetables 4 Wavetables 5 Wavetables 6 Wavetables 7 Wavetables 8 Wavetables 9 Wavetables 0 Wavetables Wavetables 2 Wavetables Figure 3. New method: The plot of instrument inharmonicity versus frequency error (NFDE) on seven near-harmonic tones. 3
32 litude Error (RAE) 55% 50% 45% 40% 35% 30% 25% 20% 5% 0% 5% 0% Pipa Qin Zheng Cello Glockenspiel Yangqin Qing Bell Liuqin Gong Ling 5.2% 3.8% 3.7% 2.7% 2.6% 2.3%.8%.8%.5%.5%.4% Instrument Inharmonicity Wavetable 2 Wavetables 3 Wavetables 4 Wavetables 5 Wavetables 6 Wavetables 7 Wavetables 8 Wavetables 9 Wavetables 0 Wavetables Wavetables 2 Wavetables Figure 4. Original wavetable matching method: The plot of instrument inharmonicity versus amplitude error (RAE) on eleven inharmonic tones. 32
33 litude Error (RAE) 55% 50% 45% 40% 35% 30% 25% 20% 5% 0% 5% 0% Pipa Qin Zheng Cello Glockenspiel Yangqin Qing Bell Liuqin Gong Ling 5.2% 3.8% 3.7% 2.7% 2.6% 2.3%.8%.8%.5%.5%.4% Instrument Inharmonicity Wavetable 2 Wavetables 3 Wavetables 4 Wavetables 5 Wavetables 6 Wavetables 7 Wavetables 8 Wavetables 9 Wavetables 0 Wavetables Wavetables 2 Wavetables Figure 5. New method: The plot of instrument inharmonicity versus amplitude error (RAE) on eleven inharmonic tones. 33
34 Frequency Error (NFDE) 6% 5% 4% 3% 2% % 0% 9% 8% 7% 6% 5% 4% 3% 2% % 0% Qin Zheng Yangqin Pipa Qing Bell Cello Gong Glockenspiel Liuqin Ling 5.2% 3.8% 3.7% 2.7% 2.6% 2.3%.8%.8%.5%.5%.4% Instrument Inharmonicity Wavetable 2 Wavetables 3 Wavetables 4 Wavetables 5 Wavetables 6 Wavetables 7 Wavetables 8 Wavetables 9 Wavetables 0 Wavetables Wavetables 2 Wavetables Figure 6. Original wavetable matching method: The plot of instrument inharmonicity versus frequency error (NFDE) on eleven inharmonic tones. 34
35 Frequency Error (NFDE) 6% 5% 4% 3% 2% % 0% 9% 8% 7% 6% 5% 4% 3% 2% % 0% Qin Zheng Yangqin Pipa Qing Bell Cello Gong Glockenspiel Liuqin Ling 5.2% 3.8% 3.7% 2.7% 2.6% 2.3%.8%.8%.5%.5%.4% Instrument Inharmonicity Wavetable 2 Wavetables 3 Wavetables 4 Wavetables 5 Wavetables 6 Wavetables 7 Wavetables 8 Wavetables 9 Wavetables 0 Wavetables Wavetables 2 Wavetables Figure 7. New method: The plot of instrument inharmonicity versus frequency error (NFDE) on the eleven inharmonic tones. 35
36 4 2 Number of Votes Number of Groups Figure 8. Number of votes by the subjects for the qin tone with 7-wavetable match from to 7 groups. The 6-groups is selected as the best error-balanced grouping from our MAQ function. 36
37 4 2 Number of Votes Number of Groups Figure 9. Number of votes by the subjects for the yangqin tone with 7-wavetable match from to 7 groups. The 5-groups is selected as the best error-balanced grouping from our MAQ function. 37
38 4 2 Number of Votes Number of Groups Figure 20. Number of votes by the subjects for the violin tone with 7-wavetable match from to 7 groups. The 5-groups is selected as the best error-balanced grouping from our MAQ function. 38
Optimized Multiple Wavetable Interpolation
Optimized Multiple Wavetable Interpolation JONATHAN MOHR Augustana Faculty University of Alberta Camrose, Alberta CANADA http://www.augustana.ca/ mohrj / XIAOBO LI Department of Computing Science University
More informationSpectral modeling of musical sounds
Spectral modeling of musical sounds Xavier Serra Audiovisual Institute, Pompeu Fabra University http://www.iua.upf.es xserra@iua.upf.es 1. Introduction Spectral based analysis/synthesis techniques offer
More informationThe Effects of MP3 Compression on Emotional Characteristics
The Effects of MP3 Compression on Emotional Characteristics Ronald Mo Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong ronmo@cse.ust.hk Chung Lee
More information1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends
Contents List of Figures List of Tables Contributing Authors xiii xxi xxiii Introduction Karlheinz Brandenburg and Mark Kahrs xxix 1 Audio quality determination based on perceptual measurement techniques
More informationVL Visual Editor. Manual
VL Visual Editor Manual Important Notices The VL Visual Editor (the Visual Editor) provides an easy, intuitive way to create, edit, and save basic voices for Yamaha VL Plug-in Board. Do not operate the
More informationHand Controlled Audio Synthesizer
Alex Sanchez & Behram Mistree 6.111 Final Project Proposal 11.3.2006 Coordinate TA: Javy Hand Controlled Audio Synthesizer 1. Proposed Functionality We propose to build a device that incorporates a video
More informationAudio Watermarking Based on PCM Technique
Audio Watermarking Based on PCM Technique Ranjeeta Yadav Department of ECE SGIT, Ghaziabad, INDIA Sachin Yadav Department of CSE SGIT, Ghaziabad, INDIA Jyotsna Singh Department of ECE NSIT, New Delhi,
More informationModeling the Spectral Envelope of Musical Instruments
Modeling the Spectral Envelope of Musical Instruments Juan José Burred burred@nue.tu-berlin.de IRCAM Équipe Analyse/Synthèse Axel Röbel / Xavier Rodet Technical University of Berlin Communication Systems
More information2.4 Audio Compression
2.4 Audio Compression 2.4.1 Pulse Code Modulation Audio signals are analog waves. The acoustic perception is determined by the frequency (pitch) and the amplitude (loudness). For storage, processing and
More informationPerceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.
Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general
More informationA-DAFX: ADAPTIVE DIGITAL AUDIO EFFECTS. Verfaille V., Arfib D.
Proceedings of the COST G-6 Conference on Digital Audio Effects (DAFX-), Limerick, Ireland, December 6-8, A-DAFX: ADAPTIVE DIGITAL AUDIO EFFECTS Verfaille V., Arfib D. CNRS - LMA 3, chemin Joseph Aiguier
More informationThe VL v2 Visual Editor/Beta 1.0 (VL Visual Editor : VVE) is a Beta test version of the program which has been released for use free of charge.
(1) Important Notice The VL v2 Visual Editor/Beta 1.0 (VL Visual Editor : VVE) is a Beta test version of the program which has been released for use free of charge. Yamaha cannot provide technical support
More informationPhysical Modeling Synthesis of Sound. Adapted from Perry R. Cook Princeton Computer Science (also Music)
Physical Modeling Synthesis of Sound Adapted from Perry R. Cook Princeton Computer Science (also Music) prc@cs.princeton.edu www.cs.princeton.edu/~prc 1 One View of Sound Sound is a waveform, it, store
More informationBoth LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.
Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general
More informationPrinciples of Audio Coding
Principles of Audio Coding Topics today Introduction VOCODERS Psychoacoustics Equal-Loudness Curve Frequency Masking Temporal Masking (CSIT 410) 2 Introduction Speech compression algorithm focuses on exploiting
More informationConvention Paper Presented at the 120th Convention 2006 May Paris, France
Audio Engineering Society Convention Paper Presented at the 12th Convention 26 May 2 23 Paris, France This convention paper has been reproduced from the author s advance manuscript, without editing, corrections,
More informationMath for Liberal Arts MAT 110: Chapter 11 Notes
Math for Liberal Arts MAT 110: Chapter 11 Notes Math and Art David J. Gisch Mathematics and Music Sound and Music Any vibrating object produces sound. The vibrations produce a wave. Most musical sounds
More informationUsing Genetic Algorithms to Improve Pattern Classification Performance
Using Genetic Algorithms to Improve Pattern Classification Performance Eric I. Chang and Richard P. Lippmann Lincoln Laboratory, MIT Lexington, MA 021739108 Abstract Genetic algorithms were used to select
More informationPRINCIPAL COMPONENT ANALYSIS OF RASTERISED AUDIO FOR CROSS- SYNTHESIS
PRINCIPAL COMPONENT ANALYSIS OF RASTERISED AUDIO FOR CROSS- SYNTHESIS Jeremy J. Wells Audio Lab, Department of Electronics, University of York, YO10 5DD, UK. jjw100@ohm.york.ac.uk Gregory Aldam Department
More informationA MORPHOLOGY-BASED FILTER STRUCTURE FOR EDGE-ENHANCING SMOOTHING
Proceedings of the 1994 IEEE International Conference on Image Processing (ICIP-94), pp. 530-534. (Austin, Texas, 13-16 November 1994.) A MORPHOLOGY-BASED FILTER STRUCTURE FOR EDGE-ENHANCING SMOOTHING
More informationD 5.2 Time stretching modules with synchronized multimedia prototype
The time-stretching factor is sent to the audio processing engine in order to change the analysis hop size, and the audio output frame timestamp is calculated accordingly. However, this timestamp is not
More informationConvention Paper Presented at the 120th Convention 2006 May Paris, France
Audio Engineering Society Convention Paper Presented at the 12th Convention 26 May 2 23 Paris, France This convention paper has been reproduced from the author s advance manuscript, without editing, corrections,
More informationError Analysis, Statistics and Graphing
Error Analysis, Statistics and Graphing This semester, most of labs we require us to calculate a numerical answer based on the data we obtain. A hard question to answer in most cases is how good is your
More informationIntensity Transformations and Spatial Filtering
77 Chapter 3 Intensity Transformations and Spatial Filtering Spatial domain refers to the image plane itself, and image processing methods in this category are based on direct manipulation of pixels in
More informationImage Compression With Haar Discrete Wavelet Transform
Image Compression With Haar Discrete Wavelet Transform Cory Cox ME 535: Computational Techniques in Mech. Eng. Figure 1 : An example of the 2D discrete wavelet transform that is used in JPEG2000. Source:
More informationDigital Media. Daniel Fuller ITEC 2110
Digital Media Daniel Fuller ITEC 2110 Daily Question: Digital Audio What values contribute to the file size of a digital audio file? Email answer to DFullerDailyQuestion@gmail.com Subject Line: ITEC2110-09
More information1. Introduction. 2. Motivation and Problem Definition. Volume 8 Issue 2, February Susmita Mohapatra
Pattern Recall Analysis of the Hopfield Neural Network with a Genetic Algorithm Susmita Mohapatra Department of Computer Science, Utkal University, India Abstract: This paper is focused on the implementation
More informationReverberation prediction using the Image Source Method
Reverberation prediction using Image Source Method DESC9115 Lab Report 2 Alex Tu 307174123 Introduction The image source model technique (ISM) is a commonly utilised tool in acoustics and signal processing
More informationChapter 5: Music Synthesis Technologies
Chapter 5: Technologies For the presentation of sound, music synthesis is as important to multimedia system as for computer graphics to the presentation of image. In this chapter, the basic principles
More informationGenetic Model Optimization for Hausdorff Distance-Based Face Localization
c In Proc. International ECCV 2002 Workshop on Biometric Authentication, Springer, Lecture Notes in Computer Science, LNCS-2359, pp. 103 111, Copenhagen, Denmark, June 2002. Genetic Model Optimization
More informationSupplemental Notes March 10, 1999 SN79 V1.0. The Basics of MIDI
&& Supplemental Notes March 10, 1999 SN79 V1.0 The Basics of MIDI For electronic music aficionados, MIDI (Musical Instrument Digital Interface) has opened doors to new worlds of creativity. Before MIDI,
More informationAUTOMATING THE DESIGN OF SOUND SYNTHESIS TECHNIQUES USING EVOLUTIONARY METHODS. Ricardo A. Garcia *
AUTOMATING THE DESIGN OF SOUND SYNTHESIS TECHNIQUES USING EVOLUTIONARY METHODS Ricardo A. Garcia * MIT Media Lab Machine Listening Group 20 Ames St., E5-49, Cambridge, MA 0239 rago@media.mit.edu ABSTRACT
More informationMPEG-4 Version 2 Audio Workshop: HILN - Parametric Audio Coding
MPEG-4 Version 2 Audio Workshop: HILN - Parametric Audio Coding Heiko Purnhagen Laboratorium für Informationstechnologie University of Hannover, Germany Outline Introduction What is "Parametric Audio Coding"?
More information5.2. The Sine Function and the Cosine Function. Investigate A
5.2 The Sine Function and the Cosine Function What do an oceanographer, a stock analyst, an audio engineer, and a musician playing electronic instruments have in common? They all deal with periodic patterns.
More informationIntroduction to ANSYS DesignXplorer
Lecture 4 14. 5 Release Introduction to ANSYS DesignXplorer 1 2013 ANSYS, Inc. September 27, 2013 s are functions of different nature where the output parameters are described in terms of the input parameters
More informationAdaptive Robotics - Final Report Extending Q-Learning to Infinite Spaces
Adaptive Robotics - Final Report Extending Q-Learning to Infinite Spaces Eric Christiansen Michael Gorbach May 13, 2008 Abstract One of the drawbacks of standard reinforcement learning techniques is that
More informationFalcon Singles - Glockenspiel for Falcon
Falcon Singles - Glockenspiel for Falcon 2016 Simon Stockhausen Installation As there is no default location for 3rd party sound libraries for Falcon, you can just install the folder Glockenspiel which
More informationPerceptual Audio Coders What to listen for: Artifacts of Parametric Coding
Perceptual Audio Coders What to listen for: Artifacts of Parametric Coding Heiko Purnhagen, Bernd Edler University of AES 109th Convention, Los Angeles, September 22-25, 2000 1 Introduction: Parametric
More informationITNP80: Multimedia! Sound-II!
Sound compression (I) Compression of sound data requires different techniques from those for graphical data Requirements are less stringent than for video data rate for CD-quality audio is much less than
More informationA GENETIC ALGORITHM APPROACH TO OPTIMAL TOPOLOGICAL DESIGN OF ALL TERMINAL NETWORKS
A GENETIC ALGORITHM APPROACH TO OPTIMAL TOPOLOGICAL DESIGN OF ALL TERMINAL NETWORKS BERNA DENGIZ AND FULYA ALTIPARMAK Department of Industrial Engineering Gazi University, Ankara, TURKEY 06570 ALICE E.
More informationNetwork Routing Protocol using Genetic Algorithms
International Journal of Electrical & Computer Sciences IJECS-IJENS Vol:0 No:02 40 Network Routing Protocol using Genetic Algorithms Gihan Nagib and Wahied G. Ali Abstract This paper aims to develop a
More informationIEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, VOL., NO., MONTH YEAR 1
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, VOL., NO., MONTH YEAR 1 An Efficient Approach to Non-dominated Sorting for Evolutionary Multi-objective Optimization Xingyi Zhang, Ye Tian, Ran Cheng, and
More informationChapter X Sampler Instrument
Chapter X Sampler Instrument A sampler is a synthesizer that generates sound by playing recorded sounds rather than calculated waveforms. The wave file player instrument described in an earlier chapter
More informationDUPLICATE DETECTION AND AUDIO THUMBNAILS WITH AUDIO FINGERPRINTING
DUPLICATE DETECTION AND AUDIO THUMBNAILS WITH AUDIO FINGERPRINTING Christopher Burges, Daniel Plastina, John Platt, Erin Renshaw, and Henrique Malvar March 24 Technical Report MSR-TR-24-19 Audio fingerprinting
More informationAn Initial Seed Selection Algorithm for K-means Clustering of Georeferenced Data to Improve
An Initial Seed Selection Algorithm for K-means Clustering of Georeferenced Data to Improve Replicability of Cluster Assignments for Mapping Application Fouad Khan Central European University-Environmental
More informationEstimating the Quality of Databases
Estimating the Quality of Databases Ami Motro Igor Rakov George Mason University May 1998 1 Outline: 1. Introduction 2. Simple quality estimation 3. Refined quality estimation 4. Computing the quality
More informationRoom Acoustics. CMSC 828D / Spring 2006 Lecture 20
Room Acoustics CMSC 828D / Spring 2006 Lecture 20 Lecture Plan Room acoustics basics Structure of room impulse response Characterization of room acoustics Modeling of reverberant response Basics All our
More informationBinary Representations of Integers and the Performance of Selectorecombinative Genetic Algorithms
Binary Representations of Integers and the Performance of Selectorecombinative Genetic Algorithms Franz Rothlauf Department of Information Systems University of Bayreuth, Germany franz.rothlauf@uni-bayreuth.de
More informationQuick Value Statistics in Presentation
Quick Value Statistics in Presentation Presentation can show you a simple time series and distribution for any measured value, based on all loaded measurements. In the Data Contents tree, navigate to the
More informationProceedings of Meetings on Acoustics
Proceedings of Meetings on Acoustics Volume 19, 213 http://acousticalsociety.org/ ICA 213 Montreal Montreal, Canada 2-7 June 213 Engineering Acoustics Session 2pEAb: Controlling Sound Quality 2pEAb1. Subjective
More informationA New Selection Operator - CSM in Genetic Algorithms for Solving the TSP
A New Selection Operator - CSM in Genetic Algorithms for Solving the TSP Wael Raef Alkhayri Fahed Al duwairi High School Aljabereyah, Kuwait Suhail Sami Owais Applied Science Private University Amman,
More informationEvaluating Classifiers
Evaluating Classifiers Charles Elkan elkan@cs.ucsd.edu January 18, 2011 In a real-world application of supervised learning, we have a training set of examples with labels, and a test set of examples with
More informationVisualizing diffraction of a loudspeaker enclosure
Visualizing diffraction of a loudspeaker enclosure V. Pulkki T. Lokki Laboratory of Acoustics and Audio Signal Processing Telecommunications Software and Multimedia Laboratory Helsinki University of Technology,
More informationGenetic algorithm based on number of children and height task for multiprocessor task Scheduling
Genetic algorithm based on number of children and height task for multiprocessor task Scheduling Marjan Abdeyazdan 1,Vahid Arjmand 2,Amir masoud Rahmani 3, Hamid Raeis ghanavati 4 1 Department of Computer
More informationAudio-coding standards
Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.
More informationMetroPro Surface Texture Parameters
MetroPro Surface Texture Parameters Contents ROUGHNESS PARAMETERS...1 R a, R q, R y, R t, R p, R v, R tm, R z, H, R ku, R 3z, SR z, SR z X, SR z Y, ISO Flatness WAVINESS PARAMETERS...4 W a, W q, W y HYBRID
More informationKeywords: Binary Sort, Sorting, Efficient Algorithm, Sorting Algorithm, Sort Data.
Volume 4, Issue 6, June 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com An Efficient and
More informationAerophones in Flatland
Aerophones in Flatland Interactive Wave Simulation of Wind Instruments Andrew Allen Nikunj Raghuvanshi Simulation Domain Mouthpiece Spectrogram Wind Instruments Non-linear Feedback Linear mouth reed bore
More informationNon-stationary interpolation in the f-x domain
Stanford Exploration Project, Report 129, May 6, 2007, pages 75 85 Non-stationary interpolation in the f-x domain William Curry ABSTRACT Interpolation of seismic data has previously been performed using
More informationThe Encoding Complexity of Network Coding
The Encoding Complexity of Network Coding Michael Langberg Alexander Sprintson Jehoshua Bruck California Institute of Technology Email: mikel,spalex,bruck @caltech.edu Abstract In the multicast network
More informationMultimedia Database Systems. Retrieval by Content
Multimedia Database Systems Retrieval by Content MIR Motivation Large volumes of data world-wide are not only based on text: Satellite images (oil spill), deep space images (NASA) Medical images (X-rays,
More informationQUANTIZER DESIGN FOR EXPLOITING COMMON INFORMATION IN LAYERED CODING. Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose
QUANTIZER DESIGN FOR EXPLOITING COMMON INFORMATION IN LAYERED CODING Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose Department of Electrical and Computer Engineering University of California,
More informationThe Journal of MacroTrends in Technology and Innovation
MACROJOURNALS The Journal of MacroTrends in Technology and Innovation Automatic Knot Adjustment Using Dolphin Echolocation Algorithm for B-Spline Curve Approximation Hasan Ali AKYÜREK*, Erkan ÜLKER**,
More informationLAB 2: DATA FILTERING AND NOISE REDUCTION
NAME: LAB TIME: LAB 2: DATA FILTERING AND NOISE REDUCTION In this exercise, you will use Microsoft Excel to generate several synthetic data sets based on a simplified model of daily high temperatures in
More informationK-Nearest-Neighbours with a Novel Similarity Measure for Intrusion Detection
K-Nearest-Neighbours with a Novel Similarity Measure for Intrusion Detection Zhenghui Ma School of Computer Science The University of Birmingham Edgbaston, B15 2TT Birmingham, UK Ata Kaban School of Computer
More informationSPREAD SPECTRUM AUDIO WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL
SPREAD SPECTRUM WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL 1 Yüksel Tokur 2 Ergun Erçelebi e-mail: tokur@gantep.edu.tr e-mail: ercelebi@gantep.edu.tr 1 Gaziantep University, MYO, 27310, Gaziantep,
More informationName Course Days/Start Time
Name Course Days/Start Time Mini-Project : The Library of Functions In your previous math class, you learned to graph equations containing two variables by finding and plotting points. In this class, we
More informationInstruments. 20 Ames Street. Abstract. We present a framework for the analysis and synthesis of acoustical instruments
Data-driven Modeling and Synthesis of Acoustical Instruments Bernd Schoner, Chuck Cooper, Chris Douglas, Neil Gershenfeld Physics and Media Group MIT Media Laboratory Ames Street Cambridge, MA 39 schoner@media.mit.edu
More information4.1 QUANTIZATION NOISE
DIGITAL SIGNAL PROCESSING UNIT IV FINITE WORD LENGTH EFFECTS Contents : 4.1 Quantization Noise 4.2 Fixed Point and Floating Point Number Representation 4.3 Truncation and Rounding 4.4 Quantization Noise
More informationON THE PERFORMANCE OF SEGMENT AVERAGING OF DISCRETE COSINE TRANSFORM COEFFICIENTS ON MUSICAL INSTRUMENTS TONE RECOGNITION
O THE PERFORMACE OF SEGMET AVERAGIG OF DISCRETE COSIE TRASFORM COEFFICIETS O MUSICAL ISTRUMETS TOE RECOGITIO Linggo Sumarno Electrical Engineering Study Program, Sanata Dharma University, Yogyakarta, Indonesia
More informationRhythmic constant pitch time stretching for digital audio
Rhythmic constant pitch time stretching for digital audio Brendan TREVORROW ; University of Southern Queensland, Australia ABSTRACT Constant pitch time stretching is not uncommon in audio editing software,
More informationNew Results in Low Bit Rate Speech Coding and Bandwidth Extension
Audio Engineering Society Convention Paper Presented at the 121st Convention 2006 October 5 8 San Francisco, CA, USA This convention paper has been reproduced from the author's advance manuscript, without
More informationCHAPTER 4 STOCK PRICE PREDICTION USING MODIFIED K-NEAREST NEIGHBOR (MKNN) ALGORITHM
CHAPTER 4 STOCK PRICE PREDICTION USING MODIFIED K-NEAREST NEIGHBOR (MKNN) ALGORITHM 4.1 Introduction Nowadays money investment in stock market gains major attention because of its dynamic nature. So the
More information<< WILL FILL IN THESE SECTIONS THIS WEEK to provide sufficient background>>
THE GSS CODEC MUSIC 422 FINAL PROJECT Greg Sell, Song Hui Chon, Scott Cannon March 6, 2005 Audio files at: ccrma.stanford.edu/~gsell/422final/wavfiles.tar Code at: ccrma.stanford.edu/~gsell/422final/codefiles.tar
More informationIncorporating Known Pathways into Gene Clustering Algorithms for Genetic Expression Data
Incorporating Known Pathways into Gene Clustering Algorithms for Genetic Expression Data Ryan Atallah, John Ryan, David Aeschlimann December 14, 2013 Abstract In this project, we study the problem of classifying
More informationEffect of Wavetable on The Spectral Evolution of Karplus-Strong Synthesized Guitar Notes
IOSR Journal of VLSI and Signal Processing (IOSR-JVSP) Volume 6, Issue 6, Ver. III (Nov. - Dec. 216), PP 1-11 e-issn: 2319 42, p-issn No. : 2319 4197 www.iosrjournals.org Effect of Wavetable on The Spectral
More information19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007
19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 SUBJECTIVE AND OBJECTIVE QUALITY EVALUATION FOR AUDIO WATERMARKING BASED ON SINUSOIDAL AMPLITUDE MODULATION PACS: 43.10.Pr, 43.60.Ek
More informationUNIT 1: NUMBER LINES, INTERVALS, AND SETS
ALGEBRA II CURRICULUM OUTLINE 2011-2012 OVERVIEW: 1. Numbers, Lines, Intervals and Sets 2. Algebraic Manipulation: Rational Expressions and Exponents 3. Radicals and Radical Equations 4. Function Basics
More informationRandom Search Report An objective look at random search performance for 4 problem sets
Random Search Report An objective look at random search performance for 4 problem sets Dudon Wai Georgia Institute of Technology CS 7641: Machine Learning Atlanta, GA dwai3@gatech.edu Abstract: This report
More informationScythe Synthesizer for iphone
Scythe Synthesizer for iphone http://www.bitmaskstudio.com/ https://twitter.com/bitmaskstudio Scythe is a polyphonic wavetable synthesizer which can load custom sounds and has a Wavetable Creator tool.
More informationSmsPerformer: A Real-Time Synthesis Interface for SMS
SmsPerformer: A Real-Time Synthesis Interface for SMS Alex Loscos, Eduard Resina Audiovisual Institute, Pompeu Fabra University Rambla 31, 08002 Barcelona, Spain {aloscos, eduard}@iua.upf.es http://www.iua.upf.es
More informationEvolutionary Computation Algorithms for Cryptanalysis: A Study
Evolutionary Computation Algorithms for Cryptanalysis: A Study Poonam Garg Information Technology and Management Dept. Institute of Management Technology Ghaziabad, India pgarg@imt.edu Abstract The cryptanalysis
More informationCMPSCI 646, Information Retrieval (Fall 2003)
CMPSCI 646, Information Retrieval (Fall 2003) Midterm exam solutions Problem CO (compression) 1. The problem of text classification can be described as follows. Given a set of classes, C = {C i }, where
More informationGENETIC ALGORITHM VERSUS PARTICLE SWARM OPTIMIZATION IN N-QUEEN PROBLEM
Journal of Al-Nahrain University Vol.10(2), December, 2007, pp.172-177 Science GENETIC ALGORITHM VERSUS PARTICLE SWARM OPTIMIZATION IN N-QUEEN PROBLEM * Azhar W. Hammad, ** Dr. Ban N. Thannoon Al-Nahrain
More informationAN EVOLUTIONARY APPROACH TO DISTANCE VECTOR ROUTING
International Journal of Latest Research in Science and Technology Volume 3, Issue 3: Page No. 201-205, May-June 2014 http://www.mnkjournals.com/ijlrst.htm ISSN (Online):2278-5299 AN EVOLUTIONARY APPROACH
More informationDescriptive Statistics and Graphing
Anatomy and Physiology Page 1 of 9 Measures of Central Tendency Descriptive Statistics and Graphing Measures of central tendency are used to find typical numbers in a data set. There are different ways
More informationText Categorization. Foundations of Statistic Natural Language Processing The MIT Press1999
Text Categorization Foundations of Statistic Natural Language Processing The MIT Press1999 Outline Introduction Decision Trees Maximum Entropy Modeling (optional) Perceptrons K Nearest Neighbor Classification
More informationSpeech-Coding Techniques. Chapter 3
Speech-Coding Techniques Chapter 3 Introduction Efficient speech-coding techniques Advantages for VoIP Digital streams of ones and zeros The lower the bandwidth, the lower the quality RTP payload types
More informationCS 498PS Audio Computing Lab. Audio Restoration. Paris Smaragdis. paris.cs.illinois.edu U NIVERSITY OF URBANA-CHAMPAIGN
CS 498PS Audio Computing Lab Audio Restoration Paris Smaragdis paris@illinois.edu paris.cs.illinois.edu Today s lecture Audio restoration Gap filling Click removal Clip recovery 2 Missing data approach
More informationENHANCED RADAR IMAGING VIA SPARSITY REGULARIZED 2D LINEAR PREDICTION
ENHANCED RADAR IMAGING VIA SPARSITY REGULARIZED 2D LINEAR PREDICTION I.Erer 1, K. Sarikaya 1,2, H.Bozkurt 1 1 Department of Electronics and Telecommunications Engineering Electrics and Electronics Faculty,
More informationAudio-coding standards
Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.
More informationHead, Dept of Electronics & Communication National Institute of Technology Karnataka, Surathkal, India
Mapping Signal Processing Algorithms to Architecture Sumam David S Head, Dept of Electronics & Communication National Institute of Technology Karnataka, Surathkal, India sumam@ieee.org Objectives At the
More informationChapter 3 Image Registration. Chapter 3 Image Registration
Chapter 3 Image Registration Distributed Algorithms for Introduction (1) Definition: Image Registration Input: 2 images of the same scene but taken from different perspectives Goal: Identify transformation
More informationWolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig
Multimedia Databases Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.de 6 Audio Retrieval 6 Audio Retrieval 6.1 Basics of
More informationOptimum Alphabetic Binary Trees T. C. Hu and J. D. Morgenthaler Department of Computer Science and Engineering, School of Engineering, University of C
Optimum Alphabetic Binary Trees T. C. Hu and J. D. Morgenthaler Department of Computer Science and Engineering, School of Engineering, University of California, San Diego CA 92093{0114, USA Abstract. We
More informationRefined Measurement of Digital Image Texture Loss Peter D. Burns Burns Digital Imaging Fairport, NY USA
Copyright 3 SPIE Refined Measurement of Digital Image Texture Loss Peter D. Burns Burns Digital Imaging Fairport, NY USA ABSTRACT Image texture is the term given to the information-bearing fluctuations
More informationMulti-label classification using rule-based classifier systems
Multi-label classification using rule-based classifier systems Shabnam Nazmi (PhD candidate) Department of electrical and computer engineering North Carolina A&T state university Advisor: Dr. A. Homaifar
More informationTHE EFFECT OF EARLY REFLECTIONS ON PERCEIVED TIMBRE ANALYZED WITH AN AUDITORY MODEL. Tapio Lokki, Ville Pulkki, and Lauri Savioja
Proceedings of the 22 International Conference on Auditory Display, Kyoto, Japan, July 2-, 22 THE EFFECT OF EARLY REFLECTIONS ON PERCEIVED TIMBRE ANALYZED WITH AN AUDITORY MODEL Tapio Lokki, Ville Pulkki,
More information2386 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 52, NO. 6, JUNE 2006
2386 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 52, NO. 6, JUNE 2006 The Encoding Complexity of Network Coding Michael Langberg, Member, IEEE, Alexander Sprintson, Member, IEEE, and Jehoshua Bruck,
More informationFACTORSYNTH user manual
FACTORSYNTH user manual www.jjburred.com - software@jjburred.com J.J. Burred, 2018-2019 factorsynth version: 1.4, 28/2/2019 Introduction Factorsynth is a Max For Live device that uses machine learning
More information