Research Article Fast Query-by-Singing/Humming System That Combines Linear Scaling and Quantized Dynamic Time Warping Algorithm

Size: px
Start display at page:

Download "Research Article Fast Query-by-Singing/Humming System That Combines Linear Scaling and Quantized Dynamic Time Warping Algorithm"

Transcription

1 Distributed Sensor Networks Volume 215, Article ID 17691, 1 pages Research Article Fast Query-by-Singing/Humming System That Combines Linear Scaling and Quantized Dynamic Time Warping Algorithm Gi Pyo Nam 1 and Kang Ryoung Park 2 1 Department of Electronics Engineering, Dongguk University, 26 Pil-dong 3-ga, Jung-gu, Seoul 1-715, Republic of Korea 2 Division of Electronics and Electrical Engineering, Dongguk University, 26 Pil-dong 3-ga, Jung-gu, Seoul 1-715, Republic of Korea Correspondence should be addressed to Kang Ryoung Park; parkgr@dgu.edu Received 23 January 215; Accepted 3 May 215 Academic Editor: José Molina Copyright 215 G. P. Nam and K. R. Park. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. We newly propose a query-by-singing/humming (QbSH) system considering both the preclassification and multiple classifierbased by combining linear scaling (LS) and quantized dynamic time warping (QDTW) algorithm in order to enhance both the matching accuracy and processing speed. This is appropriate for the QbSH of high speed in the huge distributed server environment. This research is novel in the following three ways. First, the processing speed of the QDTW is generally much slower thanthels.so,weperformtheqdtwmatchingonlyincasethatthematchingdistancebylsalgorithmissmaller than predetermined threshold, by which the entire processing time is reduced while the matching accuracy is maintained. Second, we use the different measurement of matching distance in LS algorithm by considering the characteristics of reference database. Third, we combine the calculated distances of LS and QDTW algorithms based on score level fusion in order to enhance the matching accuracy. The experimental results with the 29 MIR-QbSH corpus and the AFA MIDI 1 databases showed that the proposed reduced the total searching time of reference data while obtaining the higher accuracy compared to the QDTW. 1. Introduction With the widespread music content and music databases on the Internet, portable media, and smart phone, fast and accurate content-based searching systems are required. Query-by-singing/humming (QbSH) is a representatively convenient and intelligent in the field of contentbased music retrieval systems. It matches the reference music file corresponding humming queries of a user. It can be used for retrieving a music file without singer s name and song title basedonthemelodyofthemusichummed/sungbyauser. In previous researches, the various kinds of QbSH systems have been researched [1 22]. Ghias et al. proposed the of representing the pitch contour features extracted from the humming or whistle data as an up-down-repeat (UDR) string and using them for matching [2]. McNab et al. proposed the MELDEX system based on the pitch contour, interval, and duration with string matcher [3, 4]. In previous research [5], they proposed the Tuneserver representing the pitch contour as the UDR string like the of [2]. Kornstadt et al. developed the Themefinder system which has the capability of searching the theme of music in the Humdrum database of classic music of the 16th century and folk songs on the web [6, 7]. In previous research [8], they showed the retrieval using the changes of melody and the UDR string. RyynänenandKlapuriproposedtheof extracting the pitch vectors by using a fixed-size time window and matching them by using locality sensitive hashing (LSH) [9]. In another study [1], they adopted earth mover s distance (EMD) which could calculate the minimum cost between the features of humming and reference data with the changes of the weight to measure melodic similarity. In the previous research [11], they proposed the of content-based music retrieval which firstly filters out 8% unlikely candidates by using hierarchical filtering and compares the input query with the remaining candidates. Salamon and Rohrmeier proposed the two-stage retrieval for QbSH system [12]. As the first stage, the number

2 2 Distributed Sensor Networks Table 1: Summarized comparisons of the proposed to previous ones. Only by single classifier-based Only by multiple classifier-based Only by preclassification-based Considering both preclassification and multiple classifier-based (proposed ) Advantage Disadvantage Matching with single classifier to calculate the distance between input query data and reference data [2 1, 14, 15, 18, 19] High processing speed Limitation to enhance the matching accuracy only by single classifier Combining the matching scores (by two or more classifiers) [13, 16, 17] Advantage Higher matching accuracy than that by single classifier-based Disadvantage Lower processing speed The system firstly reduces the number of candidates in large amount of database by preclassification, and it calculates the matching distance with remaining candidate data [11, 12, 2] Advantage Higher matching speed by reducing the number of candidates based on preclassification Disadvantage The final matching only by the single classifier has the limitation of lower accuracy Considering both preclassification and multiple classifier-based by combining LS and QDTW algorithm Advantage Higher matching speed with higher accuracy Disadvantage Lower matching speed than that only by LS algorithm of candidates is reduced by the indexing using n- grams. And detail matching with the remaining candidates is performed with the remaining candidates based on local alignment with modified cost functions. Wang et al. proposed the QbSH system by combining the EMD and dynamic time warping (DTW) classifiers based on the weighted SUM rule [13]. The previous QbSH systems can be roughly categorized into top-down and bottom-up matching systems. As the topdown one, Wu et al. proposed recursive alignment algorithm which firstly compares two-feature data in global view and does them locally [14]. Other s of [11, 12] belongto this category. On the contrary, bottom-up locally calculates the distance between query and reference data in each position and searches the optimal path for obtaining a final matching score [2 4, 6 8, 15, 16]. For the QbSH system, DTW algorithm has been widely used for matcher. It has been widely used in speech recognition and can easily solve the time alignment problem. Since there generally has been much misalignment of time between the input humming/singing and the reference music file, the DTW algorithm is suitable for QbSH systems, but it has the limitation of high cost in computation. Jang and Gao converted the input query data into pitch vectors [15]. Using this, they measured the similarity between singing/humming and reference songs based on the calculated distance by DTW with high accuracy; however, this is also computationally demanding [23, 24]. Krishnamoorthy et al. also used DTW as distance measurement for the QbSH system on embedded platforms [19]. However, it still has the problem of high computation of the DTW and lower matching accuracy by using single classifier. Li et al. proposed multistage matching-based system to enhance the performance of QbSH system [2]. It includes three stages. First and second stages aim to reduce the number of candidates in large amount of database by usingearthmover sdistance(emd)basedontuneandprofile features, respectively. Finally, DTW calculates the matching distance with remaining candidate data. However, the final matchingonlybythesingleclassifierofthedtwhasthe limitation of lower accuracy. The linear scaling (LS) has the advantage of fast processing time, but its accuracy is relatively lower than the DTW [16]. All of these previous researches are ones only by single classifier-based or by multiple classifier-based or by preclassification-based. They do not adopt the scheme of considering both the preclassification and multiple classifier-based s. To overcome the problems of the previous researches, we newly propose a QbSH system considering both the preclassification and multiple classifier-based by combining LS and quantized DTW (QDTW) algorithm in order to enhance both the matching accuracy and the processing speed. The processing speed of the QDTW is generally much slower than the LS, although QDTW is the modified version of DTW to enhance the matching accuracy and reduce the processing time. So, we perform the QDTW matching only in case that the matching distance by LS algorithm is smaller than predeterminedthreshold,bywhichtheentireprocessingtime is reduced by higher than 3% compared to that of QDTW while the matching accuracy is maintained. We use the different measurement of matching distance in LS algorithm by considering the characteristics of reference database. In addition, we combine the calculated distances of LS and QDTW algorithms based on score level fusion in order to enhance the matching accuracy. Table 1 shows the summarized comparisons of the proposed and previous s. The rest of this paper is structured as follows. Section 2 explains the proposed QbSH system. Section 3 discusses the experimental results, and Section 4 states the conclusions of this study.

3 Distributed Sensor Networks 3 Input query humming data Pitch extraction Normalization Preclassification by LS algorithm Calculating the matching distance by LS algorithm Matching distance < threshold Yes Calculating the matching distance by QDTW algorithm No Moving the matching window to the next matching position of MIDI data Is the last part of MIDI data?? Yes Combining of the matching distance based on score level fusion No Searching the genuine MIDI file in the database Figure 1: Flowchart of the proposed. 2. Proposed 2.1. Overview of the Proposed. Figure 1 shows a flowchart of the proposed. First, pitch data are extractedfromtheuser sinputhummingfilebyusingmusical note estimation. Second, we perform the following normalization. We remove pitch values of in the extracted pitch data, since these can be regarded as the meaningless data which are obtained from the silence period of melody. In general, the melody of the input humming/singing is relatively inaccurate compared to the reference musical instrument digital interface (MIDI) data because it is hummed or sung by an amateur. So, the pitch data of the input is quite different from the MIDI file which requires the further normalization of the pitch data in both input and MIDI files as follows. After eliminating the values, the input humming data are normalized through mean shifting, average filtering, and min-max scaling [16, 17, 21]. Median andaveragefilteringgetridofthepeakedandvibratednoises, and min-max scaling adjusts the amplitude variations. With the normalized pitch data, preclassification is performed based on the calculated distance by LS algorithm in order to decide whether the QDTW algorithm should be executed. In detail, it calculates the matching distance between the input query data and the reference MIDI data in the matching window. If the matching distance is greater than a specific threshold, the QDTW algorithm does not runbecausethehummingandmididataaredifferent. Then, the matching window of the MIDI data is moved to the next matching position, and the preclassification procedure is repeated. If the matching distance is less than the threshold, the QDTW is executed in order to obtain more accurate matching score. These procedures are iterated until the matching window reaches the last part of MIDI data. If arriving at the last part of MIDI data, the final matching distance between the input humming/singing and the MIDI fileisdeterminedbycombiningthematchingdistanceof QDTWandthatofLSalgorithmbasedonscorelevelfusion. The correct MIDI file is selected based on the final matching distance Pitch Extraction and Normalization. In order to extract the pitch data, we used a voice-activity detection (VAD) algorithm [16, 17, 21]. First, the VAD algorithm estimates the voiced frames, and then pitch data as integer value is extracted by the spectrotemporal autocorrelation (STA)

4 4 Distributed Sensor Networks Input query data Time The length of reference MIDI data (matched with input query data as corresponding part).8 Time The length of reference MIDI data 1. Time The length of reference MIDI data 1.2 Time Figure 2: Example of the operation of the LS algorithm. which is based on temporal and spectral autocorrelations with the sampling of every 32 ms. However, a lot of noises are generally contained in the extracted pitch data. In addition, muted regions exist and the pitch data of input are quite different from the MIDI file since users cannot hum/sing perfectly like MIDI music. So, the extracted pitch data should be normalized to obtain an accurate matching result. In this research, we perform the procedures of removal of values, mean shifting, median filtering, average filtering, and min-max scaling for normalization Preclassification by LS Algorithm LS. The LS algorithm has been widely used in QbSH systems, since its processing complexity is very low [16]. It calculates the matching distance between input query data and reference MIDI data by changing linearly the length of input or reference data on time axis. In this research, we change the length of the reference MIDI data. Figure 2 shows the example of the LS algorithm Measuring of Matching Distance. In general, the characteristics of MIDI data are different according to the kind of reference databases. Although the 29 MIR- QbSH corpus mostly consists of children s song and folk song, theafamidi1databaseincludesmorevariouskinds of songs. So, the melodies of the 29 MIR-QbSH corpus database are usually simpler than those of the AFA MIDI 1 database. In addition, more noises are included in the AFA MIDI 1 database. So, we use the different measurement of matching distance in LS algorithm by considering the characteristics of reference database. In general, the Euclidean distance is used to measure the dissimilarity between input query data and reference MIDI data in LS algorithm as shown in ED = n i= (q i r i ) 2, (1) where q i and r i mean ith query and reference MIDI data, respectively, and n means the length of data. For lower

5 Distributed Sensor Networks 5 Calculated Dist i Absolute difference Square Abs Log Atan Figure 3: The relationship between absolute difference ( q i r i of (4))andcalculatedDist i of (4) (square, abs, log, and atan mean the 1st 4th functions of (4),resp.). processing time, the following equation can be used instead of (1): SquareED = n i= (q i r i ) 2. (2) In (2), we define the (q i r i ) 2 as Dist i like (3),andweselect one of the four functions of (4) as the Dist i according to the kind of reference database: SqaureED = n i= Dist i, (3) (q i r i ) 2 { Dist i = q i r i log ( q i r i + 1) { { arctan ( q i r i.5)+.5. Figure3 shows the relationship between absolute difference ( q i r i of (4)) andcalculateddist i of (4). The square, log, and atan have the characteristics of nonlinearity between the input and output values whereas the abs has the characteristics of linearity Matching by QDTW. As shown in Figure 1,ifthematching distance by LS algorithm is less than predetermined threshold, QDTW is executed to calculate a more accurate matching distance. In general, a difference in length exists between the MIDI and humming phrase. This problem of time alignment can be overcome by the DTW algorithm which can calculate the dissimilarity between the two patterns with insertion and deletion [16, 17, 21]. At each matching (4) position of DTW, the dissimilarity between the humming and MIDI features is calculated by Euclidean distance. In this research, we adopted QDTW which has the only difference (from the DTW) that it uses the quantized pitch value instead of the original one. Since the original pitch value has variations caused by noise, they are represented as quantized integer values in QDTW. Before matching by QDTW, we detect the zero to nonzero position (the position where the pitch value is changed from zero to nonzero) of the MIDI data and match the starting position of the humming data with each zero to nonzero position of the MIDI data. If the time interval between twozero to nonzero positions is less than the threshold, only the first zero to nonzero position is used for matching, through which we can reduce the processing time and enhance the matching accuracy Score Level Fusion of Matching Distances. Score level fusionhasbeenusedwidelytoenhancethematching accuracy, and there are a lot of s into score level fusion. In this paper, we combined the two matching distances by the LS and QDTW s based on simple fusion ssuchasmin,max,product,andsumrulesand compared the performances of each fusion. The MIN andmaxrulesselectthesmallerandgreateroneamongtwo matching distances as final matching score, respectively. The PRODUCTandSUMrulecalculatethefinalmatchingscore by multiplying and summing the two matching distances, respectively. Experimental results showed that the MIN rule showed the best performance among all s. 3. Experimental Results For experiments, we used two databases. The first database was the 29 MIR-QbSH corpus which consists of 48 reference MIDI files and 4431 singing and humming queries as wav files [22]. A total of 118 persons sing or hum 8 s per each query in various environments such as telephones and microphones. Since the 29 MIR-QbSH corpus database provides pitch vector (PV) files which included manually extracted pitch data, we used the PV files for the experiments to exclude the pitch extraction error. The second database was the audio feature analysis (AFA) MIDI 1 database, which includes 1, singing and humming files recorded by microphone, and 1 MIDI files which are made up of 84 Korean songs, 6 children s songs, and 1 pop songs. The average time length of the input singing/humming files is 12 s. We performed our experiments on a desktop computer with a 3.4 GHz CPU and 8 GB RAM. To measure the matching accuracy, the mean reciprocal rank () is used as the criterion of performance, and it has been frequently used for measuring the accuracy of QbSH system [12, 16, 17, 21]: = 1 K k i=1 1 rank i, (5) where K isthenumberofinputsinging/hummingfilesand rank i is the ranking of the correct MIDI file (corresponding

6 6 Distributed Sensor Networks Table 2: The matching accuracy of LS algorithm with PV files of 29 MIR-QbSH corpus database according to various distance measurement s of (3) and (4). Distance measurement Accuracy Top 1 (%) Top 1 (%) Top 2 (%) Square function Abs function Log function Arctan function Table 3: The matching accuracy of LS algorithm with AFA MIDI 1 database according to various distance measurement s of (3) and (4). Distance measurement Accuracy Top 1 (%) Top 1 (%) Top 2 (%) Square function Abs function Log function Arctan function to the input file), as calculated by the proposed. If all of the correct MIDI files (corresponding to the input files) are accurately measured as the 1st in rank, the calculated becomes 1, and the maximum is 1 [12, 16, 17, 21]. Top 1, Top 1, and Top 2 indicate that the rank of the MIDI file is included within rank 1, rank 1, and rank 2, respectively. As the 1st experiment, we measured the matching accuracy of the LS algorithm according to various distance measurement s of (3) and (4), as shown in Tables 2 and 3. Theresultshowedthatthecasewhichuseslog or arctan function shows better accuracy than other cases when using the AFA MIDI 1 database which has a lot of noises. However, the abs function shows the best matching accuracy when using the 29 MIR-QbSH corpus database which has fewer noises. From that, we can confirm that the linear function for distance measurement can show better performance with the database of less noise while the nonlinear function can have better accuracy with the database of larger noises. As the 2nd experiment, we measured the matching accuracy and processing time when using the LS algorithm as preclassification before performing the QDTW algorithm.basedontables2 and 3, the square function based distance measurement for LS algorithm was excluded because it had the lower matching accuracy. As shown in Tables 4 and 5, the processing time was much reduced by the proposed compared to the QDTW although the by the proposed is the same to that of the QDTW. As the 3rd experiment, we compared the processing timeandoftheoriginalqdtwandtheproposed according to the threshold for preclassification by the LS. If the matching distance by the LS isgreaterthanthethreshold,theqdtw-basedmatchingis not performed and matching window is moved to the next position for matching. If not, the QDTW-based matching is performed. If the threshold increases, the number of cases (that the matching distance by the LS is less than the threshold) increases. Consequently, the number of cases of performing the QDTW-based matching is also increased, which enhances the but increases the processing time. AsshowninFigures4 and 5, we can confirm that processing time by the proposed is much reduced compared to that of QDTW while maintaining the. By comparing Figures 4(a), 4(b),and4(c), we can confirm that the proposed using the preclassification based on abs function of (4) shows the better performance. In addition, we can also confirmthattheproposedusingthepreclassification based onarctan functionof (4) shows the better performance by comparing Figures 5(a), 5(b), and 5(c). The predetermined threshold for LS was experimentally determined considering the minimum processing time with the maintained (matching accuracy) of our. That is, as shown in Figures 4(a) 4(c) and 5(a) 5(c), the predetermined thresholds are 3.2, 1.3, 1.5, 3.9, 1.5, and 1.5, respectively. The positions of the thresholds mean that the minimum processing time is taken while the of our does not degrade. As shown in Figures 4 and 5, the thresholds are different from the dataset and the measurement s of matching distance (equation (4)) in LS algorithm. The above results of Tables 4 and 5 and Figures 4 and 5 are the cases that two matching distances by the LS and QDTW are not combined. As the last experiment, we compared the performances when combining the matching distances by LS and QDTW algorithm. Since the matching distance by the LS algorithm was already calculated for preclassification and the processing time of score fusion such as MIN, MAX, PRODUCT, and SUM rule is almost ms, the final processing time by combining two matching distances is not increased. Tables 6 and 7 show the results of fusion of two matching distances. Based on the above results of Tables 4 and 5,theabs function-based LS algorithm was used for 29 MIR-QbSH corpus database, and the arctan function-based LS algorithm was used for AFA MIDI 1 database. Tables 8 and 9 show the performance comparisons of the proposed and others with 29 MIR-QbSH corpus database and AFA MIDI 1 database, respectively. As shown in Table 8, the Top 1 and Top 2 rate of the proposed are a little lower than those of QDTW and QDTW with preclassification by LS (not combining two matching distances) in case of using the 29 MIR-QbSH corpus database. However, except for this case, the accuracies of the proposed are higher than those of other s in allthecasesasshownintables8 and 9.InmostoftheQbSH systems, the accuracy is evaluated based on the of (5) and Top 1 rate. So, we can confirm that the matching accuracy of the proposed was enhanced compared to others although the processing time of our algorithm was reduced byhigherthan3%comparedtothatofqdtw.although LS has the lowest processing time among them, it could not be used as single classifier because of poor matching accuracy.

7 Distributed Sensor Networks 7 Table 4: The performance of the s which combine LS and QDTW algorithm with PV files of 29 MIR-QbSH corpus database. Processing time & accuracy Processing time (ms) Top 1 (%) Top 1 (%) Top 2 (%) QDTW without preclassification by LS Preclassification by LS LS (abs function) + QDTW LS (log function) + QDTW LS (arctan function) + QDTW Table 5: The performance of the s which combine LS and QDTW algorithm with AFA MIDI 1 database. Processing time & accuracy Processing Time (s) Top 1 (%) Top 1 (%) Top 2 (%) QDTW without preclassification by LS Preclassification by LS LS (abs function) + QDTW LS (log function) + QDTW LS (arctan function) + QDTW Processing time (ms) Preclassification threshold by LS (a) Processing time (ms) Threshold of proposed Processing time (ms) Threshold Preclassification threshold by LS Threshold Preclassification threshold by LS (b) of proposed (c) of proposed Figure 4: The relationship between the matching accuracy and processing time with the 29 MIR-QbSH corpus database in case of using the following: (a) abs function of (4); (b) log function of (4); (c) arctan function of (4).

8 8 Distributed Sensor Networks Processing time (s) Threshold Preclassification threshold by LS (a) Processing time (s) of proposed Processing time (s) Threshold Preclassification threshold by LS Threshold Preclassification threshold by LS (b) of proposed (c) of proposed Figure 5: The relationship between the matching accuracy and processing time with the AFA MIDI 1 database in case of using the following: (a) abs function of (4); (b) log function of (4); (c) arctan function of (4). Table 6: The results of fusion of two matching distances with 29 MIR-QbSH corpus database. Fusion Accuracy Top 1 (%) Top 1 (%) Top 2 (%) MIN (proposed ) MAX PRODUCT SUM Table 7: The results of fusion of two matching distances with AFA MIDI 1 database. Fusion Accuracy Top 1 (%) Top 1 (%) Top 2 (%) MIN (proposed ) MAX PRODUCT SUM Conclusions In QbSH systems, DTW is typically adopted as a matcher. However, this is computationally expensive, and a reduction in processing time is required for real-time QbSH systems. To overcome this problem, in this paper we proposed a fast QbSH system that combines LS algorithm and QDTW algorithm. The experimental results showed that theproposedenhancedthematchingaccuracyand reduced the processing time compared to the result when the QDTW algorithm was used as single classifier.

9 Distributed Sensor Networks 9 Table 8: Performance comparison of the proposed with other single classifiers with the 29 MIR-QbSH corpus database. Processing time per query (ms) Processing time & accuracy Top 1 (%) Top 1 (%) Top 2 (%) LS (abs function) QDTW Preclassification by LS (abs function) + QDTW (not combining two matching distances) Preclassification by LS (abs function) + QDTW (combining two matching distances) (proposed ) Table 9: Performance comparison of the proposed with other single classifiers with the AFA MIDI 1 database. Processing time per query (ms) Processing time & accuracy Top 1 (%) Top 1 (%) Top 2 (%) LS (arctan function) QDTW 9, Preclassification by LS (arctan function) + QDTW (not combining two matching distances) 6, Preclassification by LS (abs function) + QDTW (combining two matching distances) (proposed ) 6, Asafuturework,wewillcomparetheperformanceofour proposed with other s for a larger database on various platforms including mobile devices. Conflict of Interests The authors declare that they have no conflict of interests. Acknowledgments This research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF- 212R1A1A238666) and in part by the Public Welfare and Safety Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT and Future Planning (NRF ). References [1] R. Typke, F. Wiering, and R. C. Veltkamp, A survey of music information retrieval systems, in Proceedings of the International Conference on Music Information Retrieval, pp , September 25. [2] A.Ghias,J.Logan,D.Chamberlin,andB.C.Smith, Queryby humming: musical information retrieval in an audio database, in Proceedings of ACM International Conference on Multimedia (MULTIMEDIA 95), pp , November [3] R.J.McNab,L.A.Smith,I.H.Witten,C.L.Henderson,andS.J. Cunningham, Towards the digital music library: tune retrieval from acoustic input, in Proceedings of the 1st ACM International Conference on Digital Libraries, pp , March [4] R.J.McNab,L.A.Smith,D.Bainbridge,andI.H.Witten, The New Zealand digital library melody index, D-Lib Magazine, vol.3,no.5,pp.4 15,1997. [5] L. Prechelt and R. Typke, An interface for melody input, ACM Transactions on Computer-Human Interaction,vol.8,no.2,pp , 21. [6] A. Kornstadt, Themefinder: a web-based melodic search tool, Computing in Musicology,vol.11,pp ,1998. [7] Themefinder, [8] S. Blackburn and D. DeRoure, A tool for content based navigation of music, in Proceedings of ACM International Conference on Multimedia, pp , [9] M. Ryynänen and A. Klapuri, Query by humming of midi and audio using locality sensitive hashing, in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing,pp ,April28. [1] R. Typke, P. Giannopoulos, R. C. Veltkamp, F. Wiering, and R. V. Oostrum, Using transportation distances for measuring melodic similarity, in Proceedings of the 4th International Conference on Music Information Retrieval (ISMIR 3),pp , Baltimore, Md, USA, October 23. [11] J.-S. R. Jang and H.-R. Lee, Hierarchical filtering for content-based music retrieval via acoustic input, in Proceedings of the ACM International Conference on Multimedia, pp.41 41, October 21. [12] J. Salamon and M. Rohrmeier, A quantitative evaluation of a two stage retrieval approach for a melodic query by example system, in Proceedings of the 1th International Society for Music Information Retrieval Conference, pp , Kobe, Japan, October 29.

10 1 Distributed Sensor Networks [13] L.Wang,S.Huang,S.Hu,J.Liang,andB.Xu, Aneffectiveand efficient for query by humming system based on multisimilarity measurement fusion, in Proceedings of the International Conference on Audio, Language and Image Processing (ICALIP 8), pp , Shanghai, China, July 28. [14] X. Wu, M. Li, J. Liu, J. Yang, and Y. Yan, A top-down approach to melody match in pitch contour for query by humming, in Proceedings of the International Symposium of Chinese Spoken Language Processing, pp , 26. [15] J.-S. R. Jang and M.-Y. Gao, A query-by-singing system based on dynamic programming, in Proceedings of the International Workshop on Intelligent Systems Resolutions, pp , 2. [16] G. P. Nam, T. T. T. Luong, H. H. Nam, K. R. Park, and S. J. Park, Intelligent query by humming system based on score level fusion of multiple classifiers, EURASIP Journal on Advances in Signal Processing, vol. 211, article 21, 11 pages, 211. [17] G. P. Nam, K. R. Park, S.-J. Park, S.-P. Lee, and M.-Y. Kim, A new query-by-humming system based on the score level fusion of two classifiers, Communication Systems,vol.25,no.6,pp ,212. [18]J.Song,S.Y.Bae,andK.Yoon, Mid-levelmusicmelody representation of polyphonic audio for query-by-humming system, in Proceedings of the International Symposium on Music Information Retrieval, pp , Paris, France, October 22. [19] P.Krishnamoorthy,R.Bhatt,A.Srinivas,andS.Kumar, Query by humming system for embedded platforms, in Proceedings of the Annual IEEE India Conference, pp. 1 5, December 21. [2] J.Li,J.Han,Z.Shi,andJ.Li, AnEfficientApproachtoHumming Transcription for Query-by-Humming System, in Proceedings of the 3rd International Congress on Image and Signal Processing (CISP 1), pp , IEEE, Yantai, China, October 21. [21] K. Kim, K. R. Park, S.-J. Park, S.-P. Lee, and M. Y. Kim, Robust query-by-singing/humming system against background noise environments, IEEE Transactions on Consumer Electronics,vol. 57, no. 2, pp , 211. [22] C.-C. Wang, J.-S. R. Jang, and W. Wang, An improved query by singing/humming system using melody and lyrics information, in Proceedings of the 11th International Society for Music Information Retrieval Conference (ISMIR 1), pp.45 5,August 21. [23] E. J. Keogh and M. J. Pazzani, Scaling up dynamic time warping to massive datasets, in Principles of Data Mining and Knowledge Discovery: Third European Conference, PKDD 99, Prague, Czech Republic, September 15 18, Proceedings,vol.174ofLecture Notes in Computer Science, pp. 1 11, Springer, Berlin, Germany, [24] A. M. Youssef, T. K. Abdel-Galil, E. F. El-Saadany, and M. M. A. Salama, Disturbance classification utilizing dynamic time warping classifier, IEEE Transactions on Power Delivery, vol. 19, no. 1, pp , 24.

11 Rotating Machinery Engineering Journal of The Scientific World Journal Distributed Sensor Networks Journal of Sensors Journal of Control Science and Engineering Advances in Civil Engineering Submit your manuscripts at Journal of Journal of Electrical and Computer Engineering Robotics VLSI Design Advances in OptoElectronics Navigation and Observation Chemical Engineering Active and Passive Electronic Components Antennas and Propagation Aerospace Engineering Modelling & Simulation in Engineering Shock and Vibration Advances in Acoustics and Vibration

Query by Singing/Humming System Based on Deep Learning

Query by Singing/Humming System Based on Deep Learning Query by Singing/Humming System Based on Deep Learning Jia-qi Sun * and Seok-Pil Lee** *Department of Computer Science, Graduate School, Sangmyung University, Seoul, Korea. ** Department of Electronic

More information

Intelligent query by humming system based on score level fusion of multiple classifiers

Intelligent query by humming system based on score level fusion of multiple classifiers RESEARCH Open Access Intelligent query by humming system based on score level fusion of multiple classifiers Gi Pyo Nam 1, Thi Thu Trang Luong 1, Hyun Ha Nam 1, Kang Ryoung Park 1* and Sung-Joo Park 2

More information

Two-layer Distance Scheme in Matching Engine for Query by Humming System

Two-layer Distance Scheme in Matching Engine for Query by Humming System Two-layer Distance Scheme in Matching Engine for Query by Humming System Feng Zhang, Yan Song, Lirong Dai, Renhua Wang University of Science and Technology of China, iflytek Speech Lab, Hefei zhangf@ustc.edu,

More information

Sumantra Dutta Roy, Preeti Rao and Rishabh Bhargava

Sumantra Dutta Roy, Preeti Rao and Rishabh Bhargava 1 OPTIMAL PARAMETER ESTIMATION AND PERFORMANCE MODELLING IN MELODIC CONTOUR-BASED QBH SYSTEMS Sumantra Dutta Roy, Preeti Rao and Rishabh Bhargava Department of Electrical Engineering, IIT Bombay, Powai,

More information

Research Article MFT-MAC: A Duty-Cycle MAC Protocol Using Multiframe Transmission for Wireless Sensor Networks

Research Article MFT-MAC: A Duty-Cycle MAC Protocol Using Multiframe Transmission for Wireless Sensor Networks Distributed Sensor Networks Volume 2013, Article ID 858765, 6 pages http://dx.doi.org/10.1155/2013/858765 Research Article MFT-MAC: A Duty-Cycle MAC Protocol Using Multiframe Transmission for Wireless

More information

Research Article Implementation of Personal Health Device Communication Protocol Applying ISO/IEEE

Research Article Implementation of Personal Health Device Communication Protocol Applying ISO/IEEE Distributed Sensor Networks, Article ID 291295, 4 pages http://dx.doi.org/10.1155/2014/291295 Research Article Implementation of Personal Health Device Communication Protocol Applying ISO/IEEE 11073-20601

More information

Research Article Regressive Structures for Computation of DST-II and Its Inverse

Research Article Regressive Structures for Computation of DST-II and Its Inverse International Scholarly Research etwork ISR Electronics Volume 01 Article ID 537469 4 pages doi:10.540/01/537469 Research Article Regressive Structures for Computation of DST-II and Its Inverse Priyanka

More information

THE IMPORTANCE OF F0 TRACKING IN QUERY-BY-SINGING-HUMMING

THE IMPORTANCE OF F0 TRACKING IN QUERY-BY-SINGING-HUMMING 15th International Society for Music Information Retrieval Conference (ISMIR 214) THE IMPORTANCE OF F TRACKING IN QUERY-BY-SINGING-HUMMING Emilio Molina, Lorenzo J. Tardón, Isabel Barbancho, Ana M. Barbancho

More information

Music Signal Spotting Retrieval by a Humming Query Using Start Frame Feature Dependent Continuous Dynamic Programming

Music Signal Spotting Retrieval by a Humming Query Using Start Frame Feature Dependent Continuous Dynamic Programming Music Signal Spotting Retrieval by a Humming Query Using Start Frame Feature Dependent Continuous Dynamic Programming Takuichi Nishimura Real World Computing Partnership / National Institute of Advanced

More information

Query By Singing/Humming System Using Segment-based Melody Matching for Music Retrieval

Query By Singing/Humming System Using Segment-based Melody Matching for Music Retrieval Query By Singing/Humming System Using Segment-based Melody Matching for Music Retrieval WEN-HSING LAI, CHI-YONG LEE Department of Computer and Communication Engineering National Kaohsiung First University

More information

Research Article Average Bandwidth Allocation Model of WFQ

Research Article Average Bandwidth Allocation Model of WFQ Modelling and Simulation in Engineering Volume 2012, Article ID 301012, 7 pages doi:10.1155/2012/301012 Research Article Average Bandwidth Allocation Model of WFQ TomášBaloghandMartinMedvecký Institute

More information

Research Article Novel Neighbor Selection Method to Improve Data Sparsity Problem in Collaborative Filtering

Research Article Novel Neighbor Selection Method to Improve Data Sparsity Problem in Collaborative Filtering Distributed Sensor Networks Volume 2013, Article ID 847965, 6 pages http://dx.doi.org/10.1155/2013/847965 Research Article Novel Neighbor Selection Method to Improve Data Sparsity Problem in Collaborative

More information

Research Article A Two-Level Cache for Distributed Information Retrieval in Search Engines

Research Article A Two-Level Cache for Distributed Information Retrieval in Search Engines The Scientific World Journal Volume 2013, Article ID 596724, 6 pages http://dx.doi.org/10.1155/2013/596724 Research Article A Two-Level Cache for Distributed Information Retrieval in Search Engines Weizhe

More information

A New Logging-based IP Traceback Approach using Data Mining Techniques

A New Logging-based IP Traceback Approach using Data Mining Techniques using Data Mining Techniques Internet & Multimedia Engineering, Konkuk University, Seoul, Republic of Korea hsriverv@gmail.com, kimsr@konuk.ac.kr Abstract IP Traceback is a way to search for sources of

More information

Research Article A Novel Metaheuristic for Travelling Salesman Problem

Research Article A Novel Metaheuristic for Travelling Salesman Problem Industrial Volume 2013, Article ID 347825, 5 pages http://dx.doi.org/10.1155/2013/347825 Research Article A Novel Metaheuristic for Travelling Salesman Problem Vahid Zharfi and Abolfazl Mirzazadeh Industrial

More information

Toward Interlinking Asian Resources Effectively: Chinese to Korean Frequency-Based Machine Translation System

Toward Interlinking Asian Resources Effectively: Chinese to Korean Frequency-Based Machine Translation System Toward Interlinking Asian Resources Effectively: Chinese to Korean Frequency-Based Machine Translation System Eun Ji Kim and Mun Yong Yi (&) Department of Knowledge Service Engineering, KAIST, Daejeon,

More information

A Robust Music Retrieval Method for Queryby-Humming

A Robust Music Retrieval Method for Queryby-Humming A Robust Music Retrieval Method for Queryby-Humming Yongwei Zhu Institute for Inforcomm Research Singapore ywzhu@i2r.a-star.edu.sg Abstract The increasing availability of digital music has created a need

More information

Research Article A Data Gathering Method Based on a Mobile Sink for Minimizing the Data Loss in Wireless Sensor Networks

Research Article A Data Gathering Method Based on a Mobile Sink for Minimizing the Data Loss in Wireless Sensor Networks Distributed Sensor Networks, Article ID 90636, 7 pages http://dx.doi.org/10.1155/014/90636 Research Article A Gathering Method Based on a Mobile Sink for Minimizing the Loss in Wireless Sensor Networks

More information

Applicability Estimation of Mobile Mapping. System for Road Management

Applicability Estimation of Mobile Mapping. System for Road Management Contemporary Engineering Sciences, Vol. 7, 2014, no. 24, 1407-1414 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ces.2014.49173 Applicability Estimation of Mobile Mapping System for Road Management

More information

Research Article Path Planning Using a Hybrid Evolutionary Algorithm Based on Tree Structure Encoding

Research Article Path Planning Using a Hybrid Evolutionary Algorithm Based on Tree Structure Encoding e Scientific World Journal, Article ID 746260, 8 pages http://dx.doi.org/10.1155/2014/746260 Research Article Path Planning Using a Hybrid Evolutionary Algorithm Based on Tree Structure Encoding Ming-Yi

More information

A Top-down Approach to Melody Match in Pitch Contour for Query by Humming

A Top-down Approach to Melody Match in Pitch Contour for Query by Humming A Top-down Approach to Melody Match in Pitch Contour for Query by Humming Xiao Wu, Ming Li, Jian Liu, Jun Yang, and Yonghong Yan Institute of Acoustics, Chinese Academy of Science. {xwu, mli, jliu, yyan}@hccl.ioa.ac.cn

More information

Robot localization method based on visual features and their geometric relationship

Robot localization method based on visual features and their geometric relationship , pp.46-50 http://dx.doi.org/10.14257/astl.2015.85.11 Robot localization method based on visual features and their geometric relationship Sangyun Lee 1, Changkyung Eem 2, and Hyunki Hong 3 1 Department

More information

Implementation of Semantic Information Retrieval. System in Mobile Environment

Implementation of Semantic Information Retrieval. System in Mobile Environment Contemporary Engineering Sciences, Vol. 9, 2016, no. 13, 603-608 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ces.2016.6447 Implementation of Semantic Information Retrieval System in Mobile

More information

(JBE Vol. 23, No. 6, November 2018) Detection of Frame Deletion Using Convolutional Neural Network. Abstract

(JBE Vol. 23, No. 6, November 2018) Detection of Frame Deletion Using Convolutional Neural Network. Abstract (JBE Vol. 23, No. 6, November 2018) (Regular Paper) 23 6, 2018 11 (JBE Vol. 23, No. 6, November 2018) https://doi.org/10.5909/jbe.2018.23.6.886 ISSN 2287-9137 (Online) ISSN 1226-7953 (Print) CNN a), a),

More information

Improved differential fault analysis on lightweight block cipher LBlock for wireless sensor networks

Improved differential fault analysis on lightweight block cipher LBlock for wireless sensor networks Jeong et al. EURASIP Journal on Wireless Communications and Networking 2013, 2013:151 RESEARCH Improved differential fault analysis on lightweight block cipher LBlock for wireless sensor networks Kitae

More information

A QUANTITATIVE EVALUATION OF A TWO STAGE RETRIEVAL APPROACH FOR A MELODIC QUERY BY EXAMPLE SYSTEM

A QUANTITATIVE EVALUATION OF A TWO STAGE RETRIEVAL APPROACH FOR A MELODIC QUERY BY EXAMPLE SYSTEM 10th International Society for Music Information Retrieval Conference (ISMIR 2009) A QUANTITATIVE EVALUATION OF A TWO STAGE RETRIEVAL APPROACH FOR A MELODIC QUERY BY EXAMPLE SYSTEM Justin Salamon Music

More information

A Study on Multi-resolution Screen based Conference Broadcasting Technology

A Study on Multi-resolution Screen based Conference Broadcasting Technology 2 : (Young-ae Kim et al.: A Study on Multi-resolution Screen based Conference Broadcasting Technology) (Special Paper) 23 2, 2018 3 (JBE Vol. 23, No. 2, March 2018) https://doi.org/10.5909/jbe.2018.23.2.253

More information

RECOMMENDATION ITU-R BS Procedure for the performance test of automated query-by-humming systems

RECOMMENDATION ITU-R BS Procedure for the performance test of automated query-by-humming systems Rec. ITU-R BS.1693 1 RECOMMENDATION ITU-R BS.1693 Procedure for the performance test of automated query-by-humming systems (Question ITU-R 8/6) (2004) The ITU Radiocommunication Assembly, considering a)

More information

DTV-BASED MELODY CUTTING FOR DTW-BASED MELODY SEARCH AND INDEXING IN QBH SYSTEMS

DTV-BASED MELODY CUTTING FOR DTW-BASED MELODY SEARCH AND INDEXING IN QBH SYSTEMS DTV-BASED MELODY CUTTING FOR DTW-BASED MELODY SEARCH AND INDEXING IN QBH SYSTEMS Bartłomiej Stasiak Institute of Information Technology, Lodz University of Technology, bartlomiej.stasiak@p.lodz.pl ABSTRACT

More information

An Efficient Provable Data Possession Scheme based on Counting Bloom Filter for Dynamic Data in the Cloud Storage

An Efficient Provable Data Possession Scheme based on Counting Bloom Filter for Dynamic Data in the Cloud Storage , pp. 9-16 http://dx.doi.org/10.14257/ijmue.2016.11.4.02 An Efficient Provable Data Possession Scheme based on Counting Bloom Filter for Dynamic Data in the Cloud Storage Eunmi Jung 1 and Junho Jeong 2

More information

Fault Diagnosis of Wind Turbine Based on ELMD and FCM

Fault Diagnosis of Wind Turbine Based on ELMD and FCM Send Orders for Reprints to reprints@benthamscience.ae 76 The Open Mechanical Engineering Journal, 24, 8, 76-72 Fault Diagnosis of Wind Turbine Based on ELMD and FCM Open Access Xianjin Luo * and Xiumei

More information

Open Access Apriori Algorithm Research Based on Map-Reduce in Cloud Computing Environments

Open Access Apriori Algorithm Research Based on Map-Reduce in Cloud Computing Environments Send Orders for Reprints to reprints@benthamscience.ae 368 The Open Automation and Control Systems Journal, 2014, 6, 368-373 Open Access Apriori Algorithm Research Based on Map-Reduce in Cloud Computing

More information

Research Article Cooperative Signaling with Soft Information Combining

Research Article Cooperative Signaling with Soft Information Combining Electrical and Computer Engineering Volume 2010, Article ID 530190, 5 pages doi:10.1155/2010/530190 Research Article Cooperative Signaling with Soft Information Combining Rui Lin, Philippa A. Martin, and

More information

Piecewise Linear Approximation Based on Taylor Series of LDPC Codes Decoding Algorithm and Implemented in FPGA

Piecewise Linear Approximation Based on Taylor Series of LDPC Codes Decoding Algorithm and Implemented in FPGA Journal of Information Hiding and Multimedia Signal Processing c 2018 ISSN 2073-4212 Ubiquitous International Volume 9, Number 3, May 2018 Piecewise Linear Approximation Based on Taylor Series of LDPC

More information

INDEXING AND RETRIEVAL OF MUSIC DOCUMENTS THROUGH PATTERN ANALYSIS AND DATA FUSION TECHNIQUES

INDEXING AND RETRIEVAL OF MUSIC DOCUMENTS THROUGH PATTERN ANALYSIS AND DATA FUSION TECHNIQUES INDEXING AND RETRIEVAL OF MUSIC DOCUMENTS THROUGH PATTERN ANALYSIS AND DATA FUSION TECHNIQUES Giovanna Neve University of Padova Department of Information Engineering Nicola Orio University of Padova Department

More information

Tendency Mining in Dynamic Association Rules Based on SVM Classifier

Tendency Mining in Dynamic Association Rules Based on SVM Classifier Send Orders for Reprints to reprints@benthamscienceae The Open Mechanical Engineering Journal, 2014, 8, 303-307 303 Open Access Tendency Mining in Dynamic Association Rules Based on SVM Classifier Zhonglin

More information

An Adaptive Threshold LBP Algorithm for Face Recognition

An Adaptive Threshold LBP Algorithm for Face Recognition An Adaptive Threshold LBP Algorithm for Face Recognition Xiaoping Jiang 1, Chuyu Guo 1,*, Hua Zhang 1, and Chenghua Li 1 1 College of Electronics and Information Engineering, Hubei Key Laboratory of Intelligent

More information

Spoken Document Retrieval (SDR) for Broadcast News in Indian Languages

Spoken Document Retrieval (SDR) for Broadcast News in Indian Languages Spoken Document Retrieval (SDR) for Broadcast News in Indian Languages Chirag Shah Dept. of CSE IIT Madras Chennai - 600036 Tamilnadu, India. chirag@speech.iitm.ernet.in A. Nayeemulla Khan Dept. of CSE

More information

Multi-Stage Rocchio Classification for Large-scale Multilabeled

Multi-Stage Rocchio Classification for Large-scale Multilabeled Multi-Stage Rocchio Classification for Large-scale Multilabeled Text data Dong-Hyun Lee Nangman Computing, 117D Garden five Tools, Munjeong-dong Songpa-gu, Seoul, Korea dhlee347@gmail.com Abstract. Large-scale

More information

CHAPTER 7 MUSIC INFORMATION RETRIEVAL

CHAPTER 7 MUSIC INFORMATION RETRIEVAL 163 CHAPTER 7 MUSIC INFORMATION RETRIEVAL Using the music and non-music components extracted, as described in chapters 5 and 6, we can design an effective Music Information Retrieval system. In this era

More information

Multimedia Databases

Multimedia Databases Multimedia Databases Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.de Previous Lecture Audio Retrieval - Low Level Audio

More information

manufacturing process.

manufacturing process. Send Orders for Reprints to reprints@benthamscience.ae The Open Automation and Control Systems Journal, 2014, 6, 203-207 203 Open Access Identifying Method for Key Quality Characteristics in Series-Parallel

More information

MIREX SYMBOLIC MELODIC SIMILARITY AND QUERY BY SINGING/HUMMING

MIREX SYMBOLIC MELODIC SIMILARITY AND QUERY BY SINGING/HUMMING MIREX SYMBOLIC MELODIC SIMILARITY AND QUERY BY SINGING/HUMMING Rainer Typke rainer.typke@musipedia.org Frans Wiering frans.wiering@cs.uu.nl Remco C. Veltkamp remco.veltkamp@cs.uu.nl Abstract This submission

More information

A Real Time GIS Approximation Approach for Multiphase Spatial Query Processing Using Hierarchical-Partitioned-Indexing Technique

A Real Time GIS Approximation Approach for Multiphase Spatial Query Processing Using Hierarchical-Partitioned-Indexing Technique International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2017 IJSRCSEIT Volume 2 Issue 6 ISSN : 2456-3307 A Real Time GIS Approximation Approach for Multiphase

More information

3D Grid Size Optimization of Automatic Space Analysis for Plant Facility Using Point Cloud Data

3D Grid Size Optimization of Automatic Space Analysis for Plant Facility Using Point Cloud Data 33 rd International Symposium on Automation and Robotics in Construction (ISARC 2016) 3D Grid Size Optimization of Automatic Space Analysis for Plant Facility Using Point Cloud Data Gyu seong Choi a, S.W.

More information

Research Article Modeling and Simulation Based on the Hybrid System of Leasing Equipment Optimal Allocation

Research Article Modeling and Simulation Based on the Hybrid System of Leasing Equipment Optimal Allocation Discrete Dynamics in Nature and Society Volume 215, Article ID 459381, 5 pages http://dxdoiorg/11155/215/459381 Research Article Modeling and Simulation Based on the Hybrid System of Leasing Equipment

More information

Fast and Robust Earth Mover s Distances

Fast and Robust Earth Mover s Distances Fast and Robust Earth Mover s Distances Ofir Pele and Michael Werman School of Computer Science and Engineering The Hebrew University of Jerusalem {ofirpele,werman}@cs.huji.ac.il Abstract We present a

More information

Open Access Research on the Prediction Model of Material Cost Based on Data Mining

Open Access Research on the Prediction Model of Material Cost Based on Data Mining Send Orders for Reprints to reprints@benthamscience.ae 1062 The Open Mechanical Engineering Journal, 2015, 9, 1062-1066 Open Access Research on the Prediction Model of Material Cost Based on Data Mining

More information

Research Article Multichannel Broadcast Based on Home Channel for Cognitive Radio Sensor Networks

Research Article Multichannel Broadcast Based on Home Channel for Cognitive Radio Sensor Networks e Scientific World Journal, Article ID 72521, 6 pages http://dx.doi.org/1.1155/214/72521 Research Article Multichannel Broadcast Based on Home Channel for Cognitive Radio Sensor Networks Fanzi Zeng, 1

More information

A Spatial Point Pattern Analysis to Recognize Fail Bit Patterns in Semiconductor Manufacturing

A Spatial Point Pattern Analysis to Recognize Fail Bit Patterns in Semiconductor Manufacturing A Spatial Point Pattern Analysis to Recognize Fail Bit Patterns in Semiconductor Manufacturing Youngji Yoo, Seung Hwan Park, Daewoong An, Sung-Shick Shick Kim, Jun-Geol Baek Abstract The yield management

More information

Monocular Vision-based Displacement Measurement System Robust to Angle and Distance Using Homography

Monocular Vision-based Displacement Measurement System Robust to Angle and Distance Using Homography 6 th International Conference on Advances in Experimental Structural Engineering 11 th International Workshop on Advanced Smart Materials and Smart Structures Technology August 1-2, 2015, University of

More information

DUPLICATE DETECTION AND AUDIO THUMBNAILS WITH AUDIO FINGERPRINTING

DUPLICATE DETECTION AND AUDIO THUMBNAILS WITH AUDIO FINGERPRINTING DUPLICATE DETECTION AND AUDIO THUMBNAILS WITH AUDIO FINGERPRINTING Christopher Burges, Daniel Plastina, John Platt, Erin Renshaw, and Henrique Malvar March 24 Technical Report MSR-TR-24-19 Audio fingerprinting

More information

Multimedia Databases. 8 Audio Retrieval. 8.1 Music Retrieval. 8.1 Statistical Features. 8.1 Music Retrieval. 8.1 Music Retrieval 12/11/2009

Multimedia Databases. 8 Audio Retrieval. 8.1 Music Retrieval. 8.1 Statistical Features. 8.1 Music Retrieval. 8.1 Music Retrieval 12/11/2009 8 Audio Retrieval Multimedia Databases Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.de 8 Audio Retrieval 8.1 Query by Humming

More information

Event Detection using Archived Smart House Sensor Data obtained using Symbolic Aggregate Approximation

Event Detection using Archived Smart House Sensor Data obtained using Symbolic Aggregate Approximation Event Detection using Archived Smart House Sensor Data obtained using Symbolic Aggregate Approximation Ayaka ONISHI 1, and Chiemi WATANABE 2 1,2 Graduate School of Humanities and Sciences, Ochanomizu University,

More information

Web Information Retrieval using WordNet

Web Information Retrieval using WordNet Web Information Retrieval using WordNet Jyotsna Gharat Asst. Professor, Xavier Institute of Engineering, Mumbai, India Jayant Gadge Asst. Professor, Thadomal Shahani Engineering College Mumbai, India ABSTRACT

More information

Preliminary Research on Distributed Cluster Monitoring of G/S Model

Preliminary Research on Distributed Cluster Monitoring of G/S Model Available online at www.sciencedirect.com Physics Procedia 25 (2012 ) 860 867 2012 International Conference on Solid State Devices and Materials Science Preliminary Research on Distributed Cluster Monitoring

More information

Electrical Engineering and Computer Science Department

Electrical Engineering and Computer Science Department Electrical Engineering and Computer Science Department Technical Report NWU-EECS-07-03 July 12th, 2006 Online Training of a Music Search Engine David Little, David Raffensperger, and Bryan Pardo. Abstract

More information

Face Recognition Using Vector Quantization Histogram and Support Vector Machine Classifier Rong-sheng LI, Fei-fei LEE *, Yan YAN and Qiu CHEN

Face Recognition Using Vector Quantization Histogram and Support Vector Machine Classifier Rong-sheng LI, Fei-fei LEE *, Yan YAN and Qiu CHEN 2016 International Conference on Artificial Intelligence: Techniques and Applications (AITA 2016) ISBN: 978-1-60595-389-2 Face Recognition Using Vector Quantization Histogram and Support Vector Machine

More information

Open Access Research on the Data Pre-Processing in the Network Abnormal Intrusion Detection

Open Access Research on the Data Pre-Processing in the Network Abnormal Intrusion Detection Send Orders for Reprints to reprints@benthamscience.ae 1228 The Open Automation and Control Systems Journal, 2014, 6, 1228-1232 Open Access Research on the Data Pre-Processing in the Network Abnormal Intrusion

More information

Research Article Secure Data Aggregation in Wireless Multimedia Sensor Networks Based on Similarity Matching

Research Article Secure Data Aggregation in Wireless Multimedia Sensor Networks Based on Similarity Matching Distributed Sensor Networks Volume 214, Article ID 494853, 6 pages http://dx.doi.org/1.1155/214/494853 Research Article Secure Data Aggregation in Wireless Multimedia Sensor Networks Based on Similarity

More information

Abstract We present a system which automatically generates a 3D face model from a single frontal image of a face. Our system consists of two component

Abstract We present a system which automatically generates a 3D face model from a single frontal image of a face. Our system consists of two component A Fully Automatic System To Model Faces From a Single Image Zicheng Liu Microsoft Research August 2003 Technical Report MSR-TR-2003-55 Microsoft Research Microsoft Corporation One Microsoft Way Redmond,

More information

Open Access Algorithm of Context Inconsistency Elimination Based on Feedback Windowing and Evidence Theory for Smart Home

Open Access Algorithm of Context Inconsistency Elimination Based on Feedback Windowing and Evidence Theory for Smart Home Send Orders for Reprints to reprints@benthamscience.ae The Open Automation and Control Systems Journal, 2014, 6, 637-648 637 Open Access Algorithm of Context Inconsistency Elimination Based on Feedback

More information

Open Access The Three-dimensional Coding Based on the Cone for XML Under Weaving Multi-documents

Open Access The Three-dimensional Coding Based on the Cone for XML Under Weaving Multi-documents Send Orders for Reprints to reprints@benthamscience.ae 676 The Open Automation and Control Systems Journal, 2014, 6, 676-683 Open Access The Three-dimensional Coding Based on the Cone for XML Under Weaving

More information

Effectiveness of HMM-Based Retrieval on Large Databases

Effectiveness of HMM-Based Retrieval on Large Databases Effectiveness of HMM-Based Retrieval on Large Databases Jonah Shifrin EECS Dept, University of Michigan ATL, Beal Avenue Ann Arbor, MI 489-2 jshifrin@umich.edu William Birmingham EECS Dept, University

More information

Texture Sensitive Image Inpainting after Object Morphing

Texture Sensitive Image Inpainting after Object Morphing Texture Sensitive Image Inpainting after Object Morphing Yin Chieh Liu and Yi-Leh Wu Department of Computer Science and Information Engineering National Taiwan University of Science and Technology, Taiwan

More information

PUBLICATIONS. Journal Papers

PUBLICATIONS. Journal Papers PUBLICATIONS Journal Papers [J1] X. Wu and L.-L. Xie, Asymptotic equipartition property of output when rate is above capacity, submitted to IEEE Transactions on Information Theory, August 2009. [J2] A.

More information

Graph Matching Iris Image Blocks with Local Binary Pattern

Graph Matching Iris Image Blocks with Local Binary Pattern Graph Matching Iris Image Blocs with Local Binary Pattern Zhenan Sun, Tieniu Tan, and Xianchao Qiu Center for Biometrics and Security Research, National Laboratory of Pattern Recognition, Institute of

More information

QUERY BY HUMMING SYSTEM USING PERSONAL HYBRID RANKING

QUERY BY HUMMING SYSTEM USING PERSONAL HYBRID RANKING QUERY BY HUMMING SYSTEM USING PERSONAL HYBRID RANKING ARPITA SHRIVASTAVA, DHVANI PANDYA, PRIYANKA TELI, YASH SAHASRABUDDHE ABSTRACT With the increasing use of smart devices, many applications have been

More information

An Approach for Real Time Moving Object Extraction based on Edge Region Determination

An Approach for Real Time Moving Object Extraction based on Edge Region Determination An Approach for Real Time Moving Object Extraction based on Edge Region Determination Sabrina Hoque Tuli Department of Computer Science and Engineering, Chittagong University of Engineering and Technology,

More information

The Study of Genetic Algorithm-based Task Scheduling for Cloud Computing

The Study of Genetic Algorithm-based Task Scheduling for Cloud Computing The Study of Genetic Algorithm-based Task Scheduling for Cloud Computing Sung Ho Jang, Tae Young Kim, Jae Kwon Kim and Jong Sik Lee School of Information Engineering Inha University #253, YongHyun-Dong,

More information

Optimized Watermarking Using Swarm-Based Bacterial Foraging

Optimized Watermarking Using Swarm-Based Bacterial Foraging Journal of Information Hiding and Multimedia Signal Processing c 2009 ISSN 2073-4212 Ubiquitous International Volume 1, Number 1, January 2010 Optimized Watermarking Using Swarm-Based Bacterial Foraging

More information

Automatic Pipeline Generation by the Sequential Segmentation and Skelton Construction of Point Cloud

Automatic Pipeline Generation by the Sequential Segmentation and Skelton Construction of Point Cloud , pp.43-47 http://dx.doi.org/10.14257/astl.2014.67.11 Automatic Pipeline Generation by the Sequential Segmentation and Skelton Construction of Point Cloud Ashok Kumar Patil, Seong Sill Park, Pavitra Holi,

More information

The MUSART Testbed for Query-By-Humming Evaluation

The MUSART Testbed for Query-By-Humming Evaluation The MUSART Testbed for Query-By-Humming Evaluation Roger B. Dannenberg, William P. Birmingham, George Tzanetakis, Colin Meek, Ning Hu, Bryan Pardo School of Computer Science Department of Electrical Engineering

More information

Distributed CoAP Handover Using Distributed Mobility Agents in Internet-of-Things Networks

Distributed CoAP Handover Using Distributed Mobility Agents in Internet-of-Things Networks J. lnf. Commun. Converg. Eng. 15(1): 37-42, Mar. 2017 Regular paper Distributed CoAP Handover Using Distributed Mobility Agents in Internet-of-Things Networks Sang-Il Choi 1 and Seok-Joo Koh 2*, Member,

More information

Research Article Image Segmentation Using Gray-Scale Morphology and Marker-Controlled Watershed Transformation

Research Article Image Segmentation Using Gray-Scale Morphology and Marker-Controlled Watershed Transformation Discrete Dynamics in Nature and Society Volume 2008, Article ID 384346, 8 pages doi:10.1155/2008/384346 Research Article Image Segmentation Using Gray-Scale Morphology and Marker-Controlled Watershed Transformation

More information

Design of Hierarchical Crossconnect WDM Networks Employing a Two-Stage Multiplexing Scheme of Waveband and Wavelength

Design of Hierarchical Crossconnect WDM Networks Employing a Two-Stage Multiplexing Scheme of Waveband and Wavelength 166 IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 20, NO. 1, JANUARY 2002 Design of Hierarchical Crossconnect WDM Networks Employing a Two-Stage Multiplexing Scheme of Waveband and Wavelength

More information

A Hybrid Approach to Parallel Connected Component Labeling Using CUDA

A Hybrid Approach to Parallel Connected Component Labeling Using CUDA International Journal of Signal Processing Systems Vol. 1, No. 2 December 2013 A Hybrid Approach to Parallel Connected Component Labeling Using CUDA Youngsung Soh, Hadi Ashraf, Yongsuk Hae, and Intaek

More information

Adaptive Aggregation Scheduling Using. Aggregation-degree Control in Sensor Network

Adaptive Aggregation Scheduling Using. Aggregation-degree Control in Sensor Network Contemporary Engineering Sciences, Vol. 7, 2014, no. 14, 725-730 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ces.2014.4681 Adaptive Aggregation Scheduling Using Aggregation-degree Control in

More information

Comprehensive analysis and evaluation of big data for main transformer equipment based on PCA and Apriority

Comprehensive analysis and evaluation of big data for main transformer equipment based on PCA and Apriority IOP Conference Series: Earth and Environmental Science PAPER OPEN ACCESS Comprehensive analysis and evaluation of big data for main transformer equipment based on PCA and Apriority To cite this article:

More information

BUILDING CORPORA OF TRANSCRIBED SPEECH FROM OPEN ACCESS SOURCES

BUILDING CORPORA OF TRANSCRIBED SPEECH FROM OPEN ACCESS SOURCES BUILDING CORPORA OF TRANSCRIBED SPEECH FROM OPEN ACCESS SOURCES O.O. Iakushkin a, G.A. Fedoseev, A.S. Shaleva, O.S. Sedova Saint Petersburg State University, 7/9 Universitetskaya nab., St. Petersburg,

More information

STUDYING OF CLASSIFYING CHINESE SMS MESSAGES

STUDYING OF CLASSIFYING CHINESE SMS MESSAGES STUDYING OF CLASSIFYING CHINESE SMS MESSAGES BASED ON BAYESIAN CLASSIFICATION 1 LI FENG, 2 LI JIGANG 1,2 Computer Science Department, DongHua University, Shanghai, China E-mail: 1 Lifeng@dhu.edu.cn, 2

More information

CROWDSOURCING A REAL-WORLD ON-LINE QUERY BY HUMMING SYSTEM

CROWDSOURCING A REAL-WORLD ON-LINE QUERY BY HUMMING SYSTEM CROWDSOURCING A REAL-WORLD ON-LINE QUERY BY HUMMING SYSTEM Arefin Huq Mark Cartwright Bryan Pardo Northwestern University EECS Department 233 Sheridan Road Evanston, IL 60208, USA mcartwright@u.northwestern.edu

More information

Research Article Cloud Platform Based on Mobile Internet Service Opportunistic Drive and Application Aware Data Mining

Research Article Cloud Platform Based on Mobile Internet Service Opportunistic Drive and Application Aware Data Mining Electrical and Computer Engineering Volume 25, Article ID 357378, 7 pages http://dx.doi.org/.55/25/357378 Research Article Cloud Platform Based on Mobile Internet Service Opportunistic Drive and Application

More information

Improvement of SURF Feature Image Registration Algorithm Based on Cluster Analysis

Improvement of SURF Feature Image Registration Algorithm Based on Cluster Analysis Sensors & Transducers 2014 by IFSA Publishing, S. L. http://www.sensorsportal.com Improvement of SURF Feature Image Registration Algorithm Based on Cluster Analysis 1 Xulin LONG, 1,* Qiang CHEN, 2 Xiaoya

More information

FSRM Feedback Algorithm based on Learning Theory

FSRM Feedback Algorithm based on Learning Theory Send Orders for Reprints to reprints@benthamscience.ae The Open Cybernetics & Systemics Journal, 2015, 9, 699-703 699 FSRM Feedback Algorithm based on Learning Theory Open Access Zhang Shui-Li *, Dong

More information

Fingerprint Mosaicking by Rolling with Sliding

Fingerprint Mosaicking by Rolling with Sliding Fingerprint Mosaicking by Rolling with Sliding Kyoungtaek Choi, Hunjae Park, Hee-seung Choi and Jaihie Kim Department of Electrical and Electronic Engineering,Yonsei University Biometrics Engineering Research

More information

Video Inter-frame Forgery Identification Based on Optical Flow Consistency

Video Inter-frame Forgery Identification Based on Optical Flow Consistency Sensors & Transducers 24 by IFSA Publishing, S. L. http://www.sensorsportal.com Video Inter-frame Forgery Identification Based on Optical Flow Consistency Qi Wang, Zhaohong Li, Zhenzhen Zhang, Qinglong

More information

Image Classification Using Wavelet Coefficients in Low-pass Bands

Image Classification Using Wavelet Coefficients in Low-pass Bands Proceedings of International Joint Conference on Neural Networks, Orlando, Florida, USA, August -7, 007 Image Classification Using Wavelet Coefficients in Low-pass Bands Weibao Zou, Member, IEEE, and Yan

More information

Towards an Integrated Approach to Music Retrieval

Towards an Integrated Approach to Music Retrieval Towards an Integrated Approach to Music Retrieval Emanuele Di Buccio 1, Ivano Masiero 1, Yosi Mass 2, Massimo Melucci 1, Riccardo Miotto 1, Nicola Orio 1, and Benjamin Sznajder 2 1 Department of Information

More information

Development of an Optimization Software System for Nonlinear Dynamics using the Equivalent Static Loads Method

Development of an Optimization Software System for Nonlinear Dynamics using the Equivalent Static Loads Method 10 th World Congress on Structural and Multidisciplinary Optimization May 19-24, 2013, Orlando, Florida, USA Development of an Optimization Software System for Nonlinear Dynamics using the Equivalent Static

More information

Tararira. version 0.1 USER'S MANUAL

Tararira. version 0.1 USER'S MANUAL version 0.1 USER'S MANUAL 1. INTRODUCTION Tararira is a software that allows music search in a local database using as a query a hummed, sung or whistled melody fragment performed by the user. To reach

More information

Implementation of Omni-directional Image Viewer Program for Effective Monitoring

Implementation of Omni-directional Image Viewer Program for Effective Monitoring 2 : (So-Yeon Jeon et al.: Implementation of Omni-directional Image Viewer Program for Effective Monitoring) (Regular Paper) 23 6, 2018 11 (JBE Vol. 23, No. 6, November 2018) https://doi.org/10.5909/jbe.2018.23.6.939

More information

Research Article Cross Beam STAP for Nonstationary Clutter Suppression in Airborne Radar

Research Article Cross Beam STAP for Nonstationary Clutter Suppression in Airborne Radar Antennas and Propagation Volume 213, Article ID 27631, 5 pages http://dx.doi.org/1.1155/213/27631 Research Article Cross Beam STAP for Nonstationary Clutter Suppression in Airborne Radar Yongliang Wang,

More information

Chapter 5.5 Audio Programming

Chapter 5.5 Audio Programming Chapter 5.5 Audio Programming Audio Programming Audio in games is more important than ever before 2 Programming Basic Audio Most gaming hardware has similar capabilities (on similar platforms) Mostly programming

More information

International Journal of Modern Engineering and Research Technology

International Journal of Modern Engineering and Research Technology Volume 4, Issue 3, July 2017 ISSN: 2348-8565 (Online) International Journal of Modern Engineering and Research Technology Website: http://www.ijmert.org Email: editor.ijmert@gmail.com A Novel Approach

More information

Fast trajectory matching using small binary images

Fast trajectory matching using small binary images Title Fast trajectory matching using small binary images Author(s) Zhuo, W; Schnieders, D; Wong, KKY Citation The 3rd International Conference on Multimedia Technology (ICMT 2013), Guangzhou, China, 29

More information

Dynamic Clustering of Data with Modified K-Means Algorithm

Dynamic Clustering of Data with Modified K-Means Algorithm 2012 International Conference on Information and Computer Networks (ICICN 2012) IPCSIT vol. 27 (2012) (2012) IACSIT Press, Singapore Dynamic Clustering of Data with Modified K-Means Algorithm Ahamed Shafeeq

More information

Dukpa Kim FIELDS OF INTEREST. Econometrics, Time Series Econometrics ACADEMIC POSITIONS

Dukpa Kim FIELDS OF INTEREST. Econometrics, Time Series Econometrics ACADEMIC POSITIONS Dukpa Kim Contact Information Department of Economics Phone: 82-2-3290-5131 Korea University Fax: 82-2-3290-2661 145 Anam-ro, Seongbuk-gu Email: dukpakim@korea.ac.kr Seoul, 02841 Korea FIELDS OF INTEREST

More information

An Improved Frequent Pattern-growth Algorithm Based on Decomposition of the Transaction Database

An Improved Frequent Pattern-growth Algorithm Based on Decomposition of the Transaction Database Algorithm Based on Decomposition of the Transaction Database 1 School of Management Science and Engineering, Shandong Normal University,Jinan, 250014,China E-mail:459132653@qq.com Fei Wei 2 School of Management

More information

Scalable Hierarchical Summarization of News Using Fidelity in MPEG-7 Description Scheme

Scalable Hierarchical Summarization of News Using Fidelity in MPEG-7 Description Scheme Scalable Hierarchical Summarization of News Using Fidelity in MPEG-7 Description Scheme Jung-Rim Kim, Seong Soo Chun, Seok-jin Oh, and Sanghoon Sull School of Electrical Engineering, Korea University,

More information