Combining The Global and Partial Information for Distance-Based Time Series Classification and Clustering

Size: px
Start display at page:

Download "Combining The Global and Partial Information for Distance-Based Time Series Classification and Clustering"

Transcription

1 Paper: Combnng The Global and Partal Informaton for Dstance-Based Tme Seres Hu Zhang, Tu Bao Ho, Mao-Song Ln, and We Huang School of Knowledge Scence, Japan Advanced Insttute of Scence and Technology, Nom, Ishkawa , Japan School of Computer Scence, Southwest Unversty of Scence and Technology, Manyang, Schuan , Chna E-mal: [Receved ; accepted ] Many tme seres representaton schemes for classfcaton and clusterng have been proposed. Most of the proposed representaton focuses on the promnent seres by consderng the global nformaton of the tme seres. The partal nformaton of tme seres that ndcates the local change of tme seres s often gnored. Recently, researches shown that the partal nformaton s also mportant for tme seres classfcaton. However, the combnaton of these two types of nformaton has not been well studed n the lteratures. Moreover, most of the proposed tme seres representaton requres predefned parameters. The classfcaton and clusterng results are consderably nfluenced by the parameter settngs, and, users often have dffculty n determnng the parameters. We attack above two problems by explotng the mult-scale property of wavelet decomposton. The man contrbutons of ths work are: (1) extractng features combnng the global nformaton and partal nformaton of tme seres (2) automatcally choosng approprate features, namely, features n an approprate wavelet decomposton scale accordng to the concentraton of wavelet coeffcents wthn ths scale. Experments performed on several benchmark tme seres datasets justfy the usefulness of the proposed approach. Keywords: Tme Seres, Haar Wavelets, Classfcaton, Clusterng, Feature Extracton 1. Introducton Tme seres data accounts for a huge amount of the data stored n fnancal, gene expresson, medcal and scence databases. Many algorthms have been proposed for mnng tme seres data [19, 28]. Tme seres classfcaton and tme seres clusterng are two mportant aspects of tme seres mnng. Tme seres classfcaton has been successfully used n varous applcatons such as medcal data analyss [12, 33], sgn language recognton [15], speech recognton [23], etc. Tme seres clusterng s a popularly used preprocessng technque n stock market analyss [9], gene expresson data analyss [13], and so on. The hgh dmensonalty, hgh correlaton between data, and hgh nose embedded n tme seres make tme seres classfcaton and clusterng challengng tasks. For effcency, most of the proposed methods classfy or cluster tme seres on the hgh level representaton of tme seres that takes the global nformaton of tme seres rather than classfyng or clusterng them drectly. The representaton ncludes Fourer Transforms [1, 27], Pecewse Lnear Representaton (PLR) [21], Pecewse Aggregate Approxmaton [17, 34], Regresson Tree Representaton [10], Haar Wavelets [3], and Symbolc Representaton [24]. Recently, Jn et al. proposed a tme seres representaton scheme based on the partal nformaton of the tme seres and showed that the partal nformaton s also mportant for tme seres classfcaton [14]. However, lttle work has done to combne these two types of nformaton n tme seres mnng lteratures. Most of the proposed representaton schemes n the lteratures requre predefned parameters and the correspondng classfcaton or clusterng algorthms are consderably nfluenced by these predefned parameters [20]. For example, when settng the number of straght lnes as the nput parameter for the PLR algorthm, the range of selecton s lmted from one to the length of raw tme seres data. If the nput parameter s one, the representaton s just a lnear regresson of the whole data set, and the classfcaton accuracy n ths case wll be lower than usng raw data for most data and algorthms. If we choose the length of raw tme seres data as the nput parameter, the represented data s actually the raw data. The selecton s not trval or easy for the users, and as a consequence, they usually have dffculty n determnng the parameters. Note that a doman-transform technque such as Fourer transform doesn t need nput parameters tself, but the later feature extracton process needs nput parameters n most cases. In ths paper we ntroduce a tme seres representaton combnng both the global nformaton and the partal nformaton of the tme seres data by Haar wavelet decomposton. Tme seres classfcaton and clusterng algorthms are performed wth the selected features of the representaton. We propose a novel non-parametrc feature extracton algorthm that extracts the approxmaton Journal of Advanced Computatonal Intellgence 1

2 Zhang, H. et al. (global nformaton) and change ampltudes (partal nformaton) of tme seres. The Eucldean dstances between the features of shfted tme seres and orgnal tme seres are smaller than those between the raw data. Hence, dstance-based classfcaton and clusterng algorthms are sutable for the features. The approprate features,.e., features wthn the approprate wavelet decomposton scale, are chosen wth respect to the concentraton of features. The approprate features are robust aganst nose embedded n the tme seres. Experments performed on several benchmark datasets demonstrate the effectveness of the proposed approach. The rest of ths paper s organzed as follows. Secton 2 brefly dscusses the related work. Secton 3 ntroduces our feature extracton algorthm and the correspondng dstance-based classfcaton and clusterng algorthms. Secton 4 contans an comparatve expermental evaluaton wth the proposed approach. Fnally, Secton 5 concludes the paper wth summarzng the man contrbutons of the work. 2. Related work A large number of wavelet-based feature extracton technques have been proposed. Chan and Fu ntroduced an algorthm for nearest neghbor queryng wth Haar wavelet coeffcents, and only frst few wavelet coeffcents were preserved for dmensonalty reducton [3, 4]. Popvanov and Mller used frst few Daubeches wavelet coeffcents nstead of Haar wavelet coeffcents for tme seres queryng [26]. Shahab et al. proposed an algorthm called TSA-tree whch queres ether the approxmaton part or detal part of Haar wavelet coeffcents n a specfc scale gven by the user [30]. Struzk and Sebes defned a new tme seres smlarty measurement on the correlaton of Haar wavelet coeffcents [31]. In the sgnal processng communty, Pter and Kamarth chose clustered wavelet coeffcents as features [25]. Tancel et al. only used approxmaton wavelet coeffcents as nput to an ART-2 type neural network [32]. Kalayc and Ozdamar suggested a method usng eght central detal coeffcents of scale m rangng from 1 to 5 as the features [16]. To date, all the proposed methods use global nformaton or partal nformaton, but no proposed algorthm combnes these two types of nformaton. Furthermore, no proposed work gves soluton for automatcally choosng approprate features for a gven dataset. We propose a method of usng entropy to choose approprate scale whch s smlar n sprt to the wavelet packet algorthm ntroduced by Cofman et al. [5], n whch the entropy s used to select the best bass for a wavelet packet. 3. Tme Seres Representaton and Feature Extracton Our basc dea s to extract features from tme seres and perform classfcaton and clusterng algorthms wth the extracted features. The mult-scale property of wavelets allows us to extract features wth the global nformaton and partal nformaton smultaneously. After obtanng the features, dstance-based classfcaton and clusterng algorthms can be appled n terms of the smlarty between the extracted features. Therefore, we decompose our task nto three sub-procedures: (1) representng the tme seres va wavelet coeffcents wthn varous scales whch contan both the global and partal nformaton of the tme seres data; (2) retrevng the features by selectng the approprate scale of the representaton; and (3) desgnng a smlarty measurng strategy, n whch most proposed smlarty models could be appled. The basc dea of Haar wavelet decomposton s ntroduced n Secton 3.1. We gve the tme seres representaton and correspondng feature extracton algorthms n Sectons 3.2 and 3.3. Secton 3.4 presents the method of nose reducton wth the features. We suggest the smlarty measure strategy and ts correspondng classfcaton and clusterng algorthms n sectons Haar Wavelet Decomposton Wavelet transform s a doman transform technque for herarchcally decomposng sequences. It allows a sequence to be descrbed n terms of an approxmaton of the orgnal sequence, plus a set of detals that range from coarse to fne. One property of wavelets s that the broad trend of the nput sequence s preserved n the approxmaton part, whereas localzed changes are kept n the detal parts. No nformaton s ganed or lost durng the decomposton process. The orgnal sgnal can be fully reconstructed from the approxmaton part and the detal parts. The Haar wavelet s the smplest and most popular wavelet proposed by Haar. The beneft of the Haar wavelet s that ts decomposton process has low computatonal complexty. Gven a tme seres wth length n, where n s an ntegral power of 2, the complexty of Haar decomposton s O(n). The Haar wavelet decomposton process needs a par of sequences assocated wth t. The sequences are called wavelet analyss flters, denoted as {h k,g k }. The Haar wavelet analyss flters h k and g k are gven by h k = [1/ 2,1/ 2] and g k = [1/ 2, 1/ 2] Haar wavelet decomposton s mplemented by a twostep process: down-samplng and a convoluton wth the down-sampled seres and wavelet analyss flters. Concrete mathematcal foundatons can be found n [2]. The length of the nput tme seres s restrcted to an nteger power of 2 n the process of wavelet decomposton. The seres wll be extended to an nteger power of 2 by paddng zeros to the end of tme seres f the length of the nput tme seres doesn t satsfy ths requrement. A tme seres X = {x 1,x 2,...,x n } can be decomposed nto an approxmaton part A 1 = {(x 1 + x 2 )/ 2,(x 3 + x 4 )/ 2,...,(x n 1 + x n )/ 2} and a detal part D 1 = {(x 1 x 2 )/ 2,(x 3 x 4 )/ 2,...,(x n 1 x n )/ 2}. The 2 Journal of Advanced Computatonal Intellgence

3 Fg. 1. An example of a tme seres and ts wavelet coeffcents to scale 2 A 1 are approxmaton coeffcents wthn scale 1 and D 1 are detal coeffcents wthn scale 1. The approxmaton coeffcents and detal coeffcents wthn a partcular scale j, A j and D j, both havng length n/2 j, can be decomposed from A j 1, the approxmaton coeffcents wthn scale j 1 recursvely. The th element of A j s calculated as a j = 1 (a 2 1 j 1 + a2 j 1 ), [1,2,...,n/2 j ].. (1) 2 The th element of D j s calculated as: d j = 1 2 (a 2 1 j 1 a2 j 1 ), [1,2,...,n/2 j ]... (2) The number of decomposng scales for X s J = log 2 (n). A J only has one element denotng the global average of X Tme Seres Representaton and Feature Extracton The concatenaton of decomposed wavelet coeffcents of a tme seres X = {x 1,x 2,...,x n } to a partcular scale k [1,2,...,J] shown n (3) s a representaton of X. An example of decomposng a tme seres to scale 2 s llustrated n Fg. 1. The X can be fully reconstructed from W k (X) wthout losng any nformaton. W k (X) = [ A k (X), D k (X),..., D 2 (X), D 1 (X)]. (3) Chan et al. proved that the Eucldean dstance s preserved through a Haar wavelet transform [3]. Assume we have two tme seres X = {x 1,x 2,...,x n } and Y = {y 1,y 2,...,y n }, the Eucldean dstance between X and Y s n Dsc( X, Y ) = (x y ) (4) =1 j The Eucldean dstance between W k (X) and W k (Y ), Dsc( W k (X), W k (Y )) s equal to Dsc( X, Y ). In ths case, f we just use W k as the features for a dstance based classfcaton or clusterng algorthm, the result should be the same wth that gotten from the orgnal tme seres. The th element of A j corresponds to the segment n the seres X startng from poston ( 1) 2 j + 1 to poston 2 j. The a j s proportonal to the average of ths segment and thus can be vewed as the approxmaton of the segment. Thus the approxmaton coeffcents wthn dfferent scales provde an understandng of the major trends n the data at a partcular level of granularty. From Eq. 2, we know that the detal coeffcents D j = {d 1 j j,d2 j,...,dn/2 } contan local changes of tme seres. Thus D j = { d 1 j, d2 j j } denote the ampltude of local changes. We defne the concatenaton of decomposed wavelet approxmaton coeffcents A k (X) and the absolute values of decomposed wavelet detal coeffcents D j (X), j = 1,2,...,k to a partcular scale k (k [1,2,...,J]) of a tme seres X as features. F k (X) = [ A k (X), D k (X),..., D 2 (X), D 1 (X) ] (5),... dn/2 j Ths defnton helps to overcome the well-known problem posed by the fact that wavelet coeffcents are senstve to shfts of seres. The Eucldean dstance Dsc( F k (X), F k (Y )) between the features of two tme seres X and Y s (a k ( X ) a k k ( Y )) 2 + ( d j ( X ) d j ( Y ) ) 2 (6) j=1 Because x y x y, we obtan Dsc( F k (X), F k (Y )) Dsc( W k (X), W k (Y )), and Dsc( W k (X), W k (Y )) = Dsc( X, Y ). If X and Y denote the orgnal tme seres and shfted tme seres respectvely, ths nequaton s stll tenable Approprate Scale Selecton Normally, for a tme seres X, the frst k, k < n coeffcents of W J (X) are taken as features after decomposng X to the scale J [3, 26]. The frst few wavelet coeffcents are correspond to the approxmaton wavelet coeffcents ncludng the global nformaton of X. However, settng the parameter k s not easy for the users and the later classfcaton or clusterng process are affected by poorly chosen parameter settngs. A non-parametrc feature extracton algorthm wthout any nput parameter s more convenent to the users. For our defnton of features, the nonparametrc feature extracton algorthm needs to fnd out whch features assocated wth a specfc scale are better than others for classfcaton and clusterng automatcally. If the energy of wavelet coeffcents wthn a scale s concentrated n a few coeffcents then just those mportant coeffcents can represent the whole, wth low error. Ths scale may gve valuable nformaton for classfcaton and clusterng. We need a functon to descrbe the en- Journal of Advanced Computatonal Intellgence 3

4 Zhang, H. et al. ergy concentraton of the wavelet coeffcents. The functon should be large when coeffcents are largely the same value, and small when all but a few coeffcents are neglgble. Shannon entropy, whch s a measure of mpurty wthn a set of nstances satsfes our requrement. It s defned below. H = p log 2 p (7) The approprate decomposng scale s defned as the scale wth the lowest entropy. The approprate features of a tme seres are defned as the wavelet coeffcents wthn an approprate decomposng scale. Approprate scale = argmn( p k log 2 p k ). (8) k here p k = F k (X) / n =1 F k (X) s the proporton between the absolute value of a coeffcent n a feature and the sum of the absolute values of a whole feature. p k s proportonal to the energy rato of each coeffcent to coeffcents wthn a feature. p k has propertes p k = 1 and p k Nose Reducton on The Approprate Features The dea of wavelet nose reducton s based on the assumpton that the ampltude of the spectra of the sgnal s as dfferent as possble from that of nose. If a sgnal has ts energy concentrated n a small number of wavelet coeffcents, these coeffcents wll be relatvely large compared to the nose, whch has ts energy spread over a large number of coeffcents. Ths allows thresholdng of the ampltude of the coeffcents to separate sgnals or remove nose. The thresholdng s based on a value τ that s used to compare all the detal coeffcents. Approprate scale, as defned n secton 3.3, should be robust to the nose because the energy of true sgnal gets concentrated n a few coeffcents and the nose remans spread out n that scale. Hence the energy of the few coeffcents s much larger than that of noses and these large coeffcents can domnate the classfcaton or clusterng process. To verfy the clam, we wll compare the classfcaton and clusterng results wth and wthout nose-reducton n the experments. Donoho and Johnstone [7] gave the threshold as τ = σ n 2log(N); here σn s the standard varaton of nose, and N s the length of the tme seres. Because we don t know the σ n of the tme seres n advance, we estmate t by the robust medan estmaton of nose method descrbed n [7]. The robust medan estmaton s the medan absolute devaton of the detal wavelet coeffcents at scale one, dvded by The wdely used hard thresholdng algorthm s a process of settng the value of detal coeffcents whose absolute values are lower than the threshold to zero [6]. The hard thresholdng algorthm for features defned n (8) s descrbed n Eq. 9. { d T hre( d j ) = j, d j > τ 0, d j τ (9) 3.5. The Smlarty Strategy and Its Correspondng Algorthms For two seres wth the same length, ther correspondng approprate scales may not be equal. We can t compare the smlarty of two sets of approprate features drectly because the meanng of each data entry s dfferent. For example, consder a tme seres X = {x 1,x 2,x 3,x 4 } wth approprate scale 1 and a tme seres Y = {y 1,y 2,y 3,y 4 } wth approprate scale 2. The features of seres X are F 1 (X) = {a 0 1 ( X ),a 1 1 ( X ), d1 1( X ), d1 1( X ) } and the features of seres Y are F 2 (Y ) = {a 1 2 ( X ), d2 1( X ), d2 2( X ), d2 3( X ) }. Comparng detal coeffcents wth approxmaton coeffcents wll nduce errors and be meanngless. To avod ths problem, we merge the dstance of features wthn dfferent approprate scales. The dstance of two features s replaced by the average of dstance computed on two features wth dfferent approprate scales. Suppose a tme seres X wth approprate scale m and another tme seres Y wth approprate scale n, the dstance between the approprate features of X and Y, Dsc( F m (X), F n (Y )), s defned as (Dsc( F m (X), F m (Y )) + Dsc( F n (X), F n (Y )))/2. We can smply use dstancebased classfcaton and clusterng algorthms for the proposed smlarty strategy NN Classfcaton Algorthm Usng The Proposed Smlarty Strategy The classfcaton algorthm s mplemented by means of a 1-nearest neghbor algorthm (1-NN), whch we call WCANN (Wavelet Classfcaton Algorthm based on 1- Nearest Neghbor). Table 1 shows an outlne of the classfcaton algorthm. The nput s S = { S 1, S 2,...} conssts of a set of labeled tme seres data (the traned tme seres datasets) and X (a new emerged testng tme seres). The output s x c, the label of X. X wll be labeled as a member of a class f and only f, the dstance between X and one nstance of the class s smaller than between other nstances. The classfcaton algorthm for nose reduced coeffcents, WCANR (Wavelet Classfcaton Algorthm wth Nose Reducton), s smlar to the WCANN algorthm. The only dfference s that the nose wthn approprate features s reduced before classfcaton Herarchcal Clusterng Usng The Proposed Smlarty Strategy Clusterng s one of the commonly used data mnng tasks for dscoverng groups and dentfyng nterestng dstrbutons and patterns n the underlyng data. Clusterng s useful n ts own rght as a descrpton tool, and also as a subtask n more complex algorthms. Herarchcal clusterng groups data objects nto a tree of clusters whch s among the best known of clusterng methods [8]. The herarchcal clusterng algorthms can be dvded nto agglomeratve and dvsve clusterng, n terms of whether the herarchcal decomposton s 4 Journal of Advanced Computatonal Intellgence

5 Table 1. The WCANN algorthm Input: S, X For each tranng example S, calculatng ts approprate scale m and correspondng approprate features F m (S ); Gven a testng nstance X, calculatng ts approprate scale n and approprate features F n (X); best-so-far = nf; for = 1 to length(s) do Calculate Dsc( F m (S ), F m (X)); Calculate Dsc( F n (S ), F n (X)); Dsc( F m (S ), F n (x)) = (Dsc( F m (S ), F m (X)) + Dsc( F n (S ), F n (X)))/2 f Dsc( F m (S ), F n (x)) < best-so-far then ponter-to-best-seres k = ; best-so-far = Dsc( F m (S ), F n (X)); end f end for return x c = the label of k; formed n a bottom-up or top-down fashon. As the computaton of decomposton s normally smple for the agglomeratve procedures, we only use agglomeratve clusterng algorthm wth the proposed smlarty strategy. Gven a set of N tme seres to be clustered, we generate a N N dstance matrx conssted of the par-dstance between each of two tme seres usng the defned smlarty strategy. The basc process of agglomeratve herarchcal clusterng s descrbed as below: 1 Assgn each tme seres to ts own cluster. For the gven N tme seres, we now have N clusters, each contanng just one tme seres. Let the dstances between the clusters equal the dstances between the tme seres they contan. 2 Fnd the closest (most smlar) par of clusters and merge them nto a sngle cluster, so that now we have one less cluster. 3 Compute dstances between the new cluster and each of the old clusters. 4 Repeat steps 2 and 3 untl all tems are clustered nto a sngle cluster of sze N. 4. Expermental Evaluaton To show the effectveness of our approach, we performed experments on fve benchmark tme seres datasets. We compared the classfcaton accuracy of our feature extracton algorthm wth other feature extracton algorthms. All the feature extracton algorthms were compared wth the one-nearest-neghbor algorthm (1-NN), evaluated by leave-one-out cross valdaton. We also compared the clusterng result of usng features to that of usng orgnal tme seres. The sngle lnkage s used for the agglomeratve herarchcal clusterng Data Descrpton We used fve datasets (CBF, CC, Trance, Gun and Realtycheck) from the UCR Tme Seres Data Mnng Archve [18]. As we need the label nformaton, we only took the classfed datasets for experments. There are sx classfed datasets n the archve. The Auslan data s a multvarate dataset wth whch we can t apply our approach drectly. All the other fve datasets are useded n our experments. As Realtycheck data only has one nstance wthn each cluster that s too smple for classfcaton, we only used t for clusterng. The man features of the used data sets are descrbed as below. Cylnder-Bell-Funnel (CBF): Contans three types of tme seres: cylnder (c), bell (b) and funnel (f). It s an artfcal data orgnal proposed by [29]. Instances are generated usng the followng functons: where c(t) = (6 + η) χ [a,b] (t) + ε(t) b(t) = (6 + η) χ [a,b] (t a)/(b a) + ε(t) f (t) = (6 + η) χ [a,b] (b t)/(b a) + ε(t) { 0, f t < a t > b χ [a,b] = 1, f a t b η and ε(t) are drawn from a standard normal dstrbuton N(0,1), a s an nteger drawn unformly from [16,32] and b a s an nteger drawn unformly from the range [32,96]. UCR Archve provdes the source code for generatng the samples. We generated 128 examples for each class wth length 128. Control Chart Tme Seres (CC): Ths data set has 100 nstances for each of the sx dfferent classes of control charts. The dataset can also be downloaded from UCI KDD Archve [11]. Trace dataset (Trace): The 4-class dataset contans 200 nstances, 50 for each class. The dmensonalty of the data s 275. Gun Pont dataset (Gun): The dataset has two classes, each contanng 100 nstances. The dmensonalty of the data s 150. Realty Check dataset (Realtycheck): Ths data set s desgned as a smple realty check for new dstance measures. The data set conssts of data from Space Shuttle telemetry, Exchange Rates and artfcal sequences. The data s a normalzed data so that the mnmum value s zero and the maxmum s one. Journal of Advanced Computatonal Intellgence 5

6 Zhang, H. et al. Table 2. The error rates (%) produced by varous feature extracton methods wth 1-NN classfcaton algorthm for all four data sets Methods CBF CC Gun Trace WCANN WCANR WC-NN DFT-NN SVD-NN NN Classfcaton results Keogh and Kasetty compared 12 dstance measurements for tme seres usng 1-NN classfcaton algorthm and shown that Eucldean s the best among the compared dstance measurements [19]. Therefore we used only Eucldean dstance for 1-NN algorthm n our experments. An unclassfed nstance s assgned to the same class as ts closest match n the tranng set. The accuracy of the classfed results s measured by error rates. We compared sx algorthms usng dfferent features extracted from the CBF, CC, Trace and Gun data sets descrbed above. WCANN and WCANR are our proposed algorthms descrbed n secton 3.5. NN s the 1-nearest neghbor algorthm that uses the raw data [19]. WC-NN s the algorthm that uses the 1-nearest neghbor algorthm wth decomposed wavelet coeffcents havng varous dmensons [4]. DFT-NN s the 1-NN algorthm usng frst few Fourer coeffcents as features [27]. SVD-NN uses Sngular Value Decomposton (SVD) to extract features wth 1-NN algorthm [22]. As WC-NN, DFT-NN and SVD-NN don t gve the soluton for choosng feature dmensonalty, we take the average error rates wth all possble feature dmensons produced by these algorthms. Table 2 gves the error rates produced by varous feature extracton algorthms wth 1-NN classfcaton algorthm. The WCANN algorthm acheves the same classfcaton accuracy as the WCANR algorthm on all the four data sets. Nose reducton upon the extracted approprate features doesn t affect the classfcaton result,.e., the approprate features are robust aganst nose n terms of classfcaton. WCANN and WCANR take the same classfcaton accuracy wth NN algorthm on CBF data and hgher accuracy on other three datasets. WCANN and WCANR algorthm outperform WC-NN, DFT-NN and SVD-NN on classfcaton accuracy for all the datasets used Clusterng Herarchcal clusterng s a good way to compare smlarty measures wth representaton, snce a dendrogram of sze N summarzes O(N 2 ) dstance calculatons. We can easly judge whch representaton measure s better by vsually observng whch representaton appears to create the most natural groupngs of the data. Fg. 2 shows Fg. 2. The dendrogram generated by agglomeratve clusterng algorthm for orgnal Realtycheck data set, sngle lnkage s used Fg. 3. The dendrogram generated by agglomeratve clusterng algorthm wth the features of Realtycheck data set, sngle lnkage s used the dendrogram produced by agglomeratve herarchcal clusterng algorthm wth the orgnal tme seres. The extracted features and nose-reduced features produce the same clusterng results and the results are shown n Fg. 3. The extracted features are not affected by the nose durng the clusterng process for the used datasets. The extracted features are robust aganst nose n terms of clusterng. The subgroup contans nstances {11, 12} and the subgroup contans nstances {13, 14} are smlar. They are successfully clustered nto one subgroup wth our defned features shown n Fg. 3 and are not n the same subgroup wth the orgnal tme seres shown n Fg Journal of Advanced Computatonal Intellgence

7 5. Conclusons We explot the mult-scale property of Haar wavelet transform to extract features combnng the global nformaton and partal nformaton together. We propose a method for automatcally choosng the approprate features by the concentraton of the feature coeffcents. We propose a dstance strategy for the approprate features wth whch dstance-based classfcaton and clusterng algorthms can be easly appled. We conduct experments on several wdely used tme seres datasets and compare the classfcaton results of our algorthms and those of other four algorthms. Our algorthms outperform other four algorthms on classfcaton accuracy. We also compare the clusterng results produced by the approprate features and the orgnal tme seres. The approprate features can create more natural groupngs of the data. The expermental results show that the approprate features always produce the same classfcaton and clusterng results wth nose-reduced approprate features. It ndcates that several mportant coeffcents wthn the approprate features domnate the classfcaton and clusterng processes. The extracted approprate features are robust aganst nose n terms of classfcaton and clusterng. Acknowledgements The authors thank Prof. Keogh for provdng the expermental data. The authors thank two anonymous revews for suggestons that markedly clarfed and mproved the paper. References: [1] R. Agrawal, C. Faloutsos, and A. Swam, Effcent Smlarty Search n Sequence Databases, In Proceedngs of the 4th Conference on Foundatons of Data Organzaton and Algorthms, pp , October [2] C. S. Burrus, R. A. Gopnath, and H. Guo, Introducton to Wavelets and Wavelet Transforms, A Prmer, Prentce Hall, Englewood Clffs, NJ, [3] F. K. Chan, A. W. Fu, and T. Y. Clement, Harr Wavelets for Effcent Smlarty Search of Tme-Seres: wth and wthout Tme Warpng, IEEE Trans. on Knowledge and Data Engneerng, 15(3): , [4] K. P. Chan and A. W. Fu, Effcent Tme Seres Matchng by Wavelets, In Proceedngs of the 15th Internaton Conference on Data Engneerng, pp , March [5] R. R. Cofman and M. V. Wckerhauser, Entropy-Based Algorthms for Best Bass Selecton, IEEE Trans. on Informaton Theory, 38(2): , [6] D. L. Donoho, De-nosng by soft-thresholdng, IEEE Trans. on Informaton Theory, 41(3): , [7] D. L. Donoho and I. M. Johnson, Ideal spatal adaptaton va wavelet shrnkage, Bometrka, 81: , [8] R. Duda, P. Hart, and D. Stork, Pattern Classfcaton, John Wley & Sons, New York, second edton, [9] M. Gavrlov, D. Anguelov, and P. Indyk, Mnng the Stock Market: Whch Measure s Best?, In Proceedngs of The 6th ACM SIGKDD Internatonal Conference on Knowledge Dscovery and Data Mnng, pp , August [10] P. Geurts, Pattern Extracton for Tme Seres Classfcaton, In Proceedngs of the Prncples of Data Mnng and Knowledge Dscovery,5th European Conference, pp , September [11] S. Hettch and S. D. Bay. The UCI KDD Archve [12] T. B. Ho, T. D. Nguyen, S. Kawasak, S. Q. Le, D. D. Nguyen, H. Yoko, and K. Takabayash, Mnng Hepatts Data wth Temporal Abstracton, In Proceedngs of the 9th ACM Internatonal Conference on Knowledge Dscovery and Data Mnng, pp , August [13] D. Jang, C. Tang, and A. Zhang, Cluster Analyss for Gene Expresson Data: A Survey, IEEE Trans. on Knowledge and Data Engneerng, 16(11): , [14] X. Jn, Y. Lu, and C. Sh, Smlarty Measure Based on Partal Informaton of Tme Seres, In Proceedngs of The 8th ACM SIGKDD Internatonal Conference on Knowledge Dscovery and Data Mnng, pp , August [15] M. W. Kadous, Learnng Comprehensble Descrptons of Multvarate Tme Seres, In Proceedngs of the 6th Internatonal Conference on Machne Learnng, pp , September [16] T. Kalayc and O. Ozdamar, Wavelet Preprocessng for Automated Neural Network Detecton of EEG Spkes, IEEE Eng. n Medcne and Bology, 14: , [17] E. Keogh, K. Chakrabart, M. Pazzan, and S. Mehrotra, Dmensonalty Reducton of Fast Smlarty Search n Large Tme Seres Databases, Journal of Knowledge and Informaton System, 3: , [18] E. Keogh and T. Folas. The UCR Tme Seres Data Mnng Archve. eamonn/tsdma/ndex.html, [19] E. Keogh and S. Kasetty, On the Need for Tme Seres Data Mnng Benchmarks: A Survey and Emprcal Demonstraton, Data Mnng and Knowledge Dscovery, 7(4): , [20] E. Keogh, S. Lonard, and C. A. Ratanamahatana, Towards Parameter-Free Data Mnng, In Proceedngs of the 10th ACM SIGKDD Internatonal Conference on Knowledge Dscovery and Data Mnng, pp , August [21] E. Keogh and M. Pazzan, An Enhanced Representaton of Tme Seres whch Allows Fast and Accurate Classfcaton, Clusterng and Relevance Feedback, In Proceedngs of the 4th Internatonal Conference of Knowledge Dscovery and Data Mnng, pp , August [22] F. Korn, H. Jagadsh, and C. Faloutsos, Effcently Supportng ad hoc Queres n Large Datasets of Tme Sequences, In Proceedngs of The ACM SIGMOD Internatonal Conference on Management of Data, pp , May [23] S. Lawrence, A. Back, A. Tso, and C. L. Gles, The Gamma MLP Usng Multple Temporal Resolutons for Improved Classfcaton, In Proceedngs of the 1997 IEEE Workshop on Neural Networks for Sgnal Processng VII, pp , Pscataway, NJ, 1997, IEEE Press. [24] J. Ln, E. Keogh, S. Lonard, and B. Chu, A Symbolc Representaton of Tme Seres, wth Implcatons for Streamng Algorthms, In Proceedngs of the 8th ACM SIGMOD Workshop on Research Issues n Data Mnng and Knowledge Dscovery, pp. 2 11, June [25] S. Ptter and S. V. Kamarth, Feature Extracton From Wavelet Coeffcents for Pattern Recognton Tasks, IEEE Trans. on Pattern Recognton and Machne Intellgence, 21(1):83 85, January [26] I. Popvanov and R. J. Mller, Smlarty Search over Tme-Seres Data Usng Wavelets, In Proceedngs of The 18th Internatonal Conference on Data Engneerng, pp , Feburary [27] D. Rafe and A. Mendelzon, Effcent Retreval of Smlar Tme Sequences Usng DFT, In Proceedngs of the 5th Internatonal Conference on Foundatons of Data Organzatons, pp , [28] J. F. Roddch and M. Splopoulou, A Survey of Temporal Knowledge Dscovery Paradgms and Methods, IEEE Trans. on Knowledge and Data Engneerng, 14(4): , [29] N. Sato, Local Feature Extracton and Its Applcaton Usng a Lbrary of Bases, PhD thess, Department of Mathematcs, Yale Unversty, [30] C. Shahab, X. Tan, and W. Zhao, TSA-Tree: A Wavelet-Based Approach to Improve the Effcency of Mult-Level Surprse and Trend Queres on Tme-Seres Data, In Proceedngs of the12th Internatonal Conference on Scentfc and Statstcal Database Management, pp , July [31] Z. R. Struzk and A. Sebes, The Harr Wavelet Transform n the Tme Seres Smlarty Paradgm, In Proceedngs of the 3rd European Conference on Prncples of Data Mnng and Knowledge Dscovery, pp , September [32] I. N. Tansel, C. Mekdec, and C. McLaughln, Detecton of Tool Falure n End Mllng wth Wavelet Transformatons and Neural Networks (WT-NN), Internatonal Journal of Machne Tolls and Manufacture, 35(8): , [33] Y. Yamada, E. Suzuk, H. Yoko, and K. Takabayash, Decson- Tree Inducton from Tme-Seres Data Based on Standard-Example Splt Test, In Proceedngs of the 20th Internatonal Conference on Machne Learnng (ICML03), pp , August [34] B. K. Y and C. Faloustos, Fast Tme Sequence Indexng for arbtrary L p norms, In Proceedngs of the 26th Internatonal Conference on Very Large Databases, pp , September Journal of Advanced Computatonal Intellgence 7

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto

More information

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton

More information

An Image Fusion Approach Based on Segmentation Region

An Image Fusion Approach Based on Segmentation Region Rong Wang, L-Qun Gao, Shu Yang, Yu-Hua Cha, and Yan-Chun Lu An Image Fuson Approach Based On Segmentaton Regon An Image Fuson Approach Based on Segmentaton Regon Rong Wang, L-Qun Gao, Shu Yang 3, Yu-Hua

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach Angle Estmaton and Correcton of Hand Wrtten, Textual and Large areas of Non-Textual Document Images: A Novel Approach D.R.Ramesh Babu Pyush M Kumat Mahesh D Dhannawat PES Insttute of Technology Research

More information

A Similarity Measure Method for Symbolization Time Series

A Similarity Measure Method for Symbolization Time Series Research Journal of Appled Scences, Engneerng and Technology 5(5): 1726-1730, 2013 ISSN: 2040-7459; e-issn: 2040-7467 Maxwell Scentfc Organzaton, 2013 Submtted: July 27, 2012 Accepted: September 03, 2012

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

Unsupervised Learning

Unsupervised Learning Pattern Recognton Lecture 8 Outlne Introducton Unsupervsed Learnng Parametrc VS Non-Parametrc Approach Mxture of Denstes Maxmum-Lkelhood Estmates Clusterng Prof. Danel Yeung School of Computer Scence and

More information

Classifying Acoustic Transient Signals Using Artificial Intelligence

Classifying Acoustic Transient Signals Using Artificial Intelligence Classfyng Acoustc Transent Sgnals Usng Artfcal Intellgence Steve Sutton, Unversty of North Carolna At Wlmngton (suttons@charter.net) Greg Huff, Unversty of North Carolna At Wlmngton (jgh7476@uncwl.edu)

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

Edge Detection in Noisy Images Using the Support Vector Machines

Edge Detection in Noisy Images Using the Support Vector Machines Edge Detecton n Nosy Images Usng the Support Vector Machnes Hlaro Gómez-Moreno, Saturnno Maldonado-Bascón, Francsco López-Ferreras Sgnal Theory and Communcatons Department. Unversty of Alcalá Crta. Madrd-Barcelona

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

CLASSIFICATION OF ULTRASONIC SIGNALS

CLASSIFICATION OF ULTRASONIC SIGNALS The 8 th Internatonal Conference of the Slovenan Socety for Non-Destructve Testng»Applcaton of Contemporary Non-Destructve Testng n Engneerng«September -3, 5, Portorož, Slovena, pp. 7-33 CLASSIFICATION

More information

Hierarchical clustering for gene expression data analysis

Hierarchical clustering for gene expression data analysis Herarchcal clusterng for gene expresson data analyss Gorgo Valentn e-mal: valentn@ds.unm.t Clusterng of Mcroarray Data. Clusterng of gene expresson profles (rows) => dscovery of co-regulated and functonally

More information

The Research of Support Vector Machine in Agricultural Data Classification

The Research of Support Vector Machine in Agricultural Data Classification The Research of Support Vector Machne n Agrcultural Data Classfcaton Le Sh, Qguo Duan, Xnmng Ma, Me Weng College of Informaton and Management Scence, HeNan Agrcultural Unversty, Zhengzhou 45000 Chna Zhengzhou

More information

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents

More information

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for

More information

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints Australan Journal of Basc and Appled Scences, 2(4): 1204-1208, 2008 ISSN 1991-8178 Sum of Lnear and Fractonal Multobjectve Programmng Problem under Fuzzy Rules Constrants 1 2 Sanjay Jan and Kalash Lachhwan

More information

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and

More information

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1 4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:

More information

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 15

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 15 CS434a/541a: Pattern Recognton Prof. Olga Veksler Lecture 15 Today New Topc: Unsupervsed Learnng Supervsed vs. unsupervsed learnng Unsupervsed learnng Net Tme: parametrc unsupervsed learnng Today: nonparametrc

More information

A Deflected Grid-based Algorithm for Clustering Analysis

A Deflected Grid-based Algorithm for Clustering Analysis A Deflected Grd-based Algorthm for Clusterng Analyss NANCY P. LIN, CHUNG-I CHANG, HAO-EN CHUEH, HUNG-JEN CHEN, WEI-HUA HAO Department of Computer Scence and Informaton Engneerng Tamkang Unversty 5 Yng-chuan

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

Hermite Splines in Lie Groups as Products of Geodesics

Hermite Splines in Lie Groups as Products of Geodesics Hermte Splnes n Le Groups as Products of Geodescs Ethan Eade Updated May 28, 2017 1 Introducton 1.1 Goal Ths document defnes a curve n the Le group G parametrzed by tme and by structural parameters n the

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

Face Recognition University at Buffalo CSE666 Lecture Slides Resources:

Face Recognition University at Buffalo CSE666 Lecture Slides Resources: Face Recognton Unversty at Buffalo CSE666 Lecture Sldes Resources: http://www.face-rec.org/algorthms/ Overvew of face recognton algorthms Correlaton - Pxel based correspondence between two face mages Structural

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

Unsupervised Learning and Clustering

Unsupervised Learning and Clustering Unsupervsed Learnng and Clusterng Why consder unlabeled samples?. Collectng and labelng large set of samples s costly Gettng recorded speech s free, labelng s tme consumng 2. Classfer could be desgned

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

Object-Based Techniques for Image Retrieval

Object-Based Techniques for Image Retrieval 54 Zhang, Gao, & Luo Chapter VII Object-Based Technques for Image Retreval Y. J. Zhang, Tsnghua Unversty, Chna Y. Y. Gao, Tsnghua Unversty, Chna Y. Luo, Tsnghua Unversty, Chna ABSTRACT To overcome the

More information

A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION

A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION 1 THE PUBLISHING HOUSE PROCEEDINGS OF THE ROMANIAN ACADEMY, Seres A, OF THE ROMANIAN ACADEMY Volume 4, Number 2/2003, pp.000-000 A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION Tudor BARBU Insttute

More information

Image Representation & Visualization Basic Imaging Algorithms Shape Representation and Analysis. outline

Image Representation & Visualization Basic Imaging Algorithms Shape Representation and Analysis. outline mage Vsualzaton mage Vsualzaton mage Representaton & Vsualzaton Basc magng Algorthms Shape Representaton and Analyss outlne mage Representaton & Vsualzaton Basc magng Algorthms Shape Representaton and

More information

Related-Mode Attacks on CTR Encryption Mode

Related-Mode Attacks on CTR Encryption Mode Internatonal Journal of Network Securty, Vol.4, No.3, PP.282 287, May 2007 282 Related-Mode Attacks on CTR Encrypton Mode Dayn Wang, Dongda Ln, and Wenlng Wu (Correspondng author: Dayn Wang) Key Laboratory

More information

X- Chart Using ANOM Approach

X- Chart Using ANOM Approach ISSN 1684-8403 Journal of Statstcs Volume 17, 010, pp. 3-3 Abstract X- Chart Usng ANOM Approach Gullapall Chakravarth 1 and Chaluvad Venkateswara Rao Control lmts for ndvdual measurements (X) chart are

More information

Lecture 5: Multilayer Perceptrons

Lecture 5: Multilayer Perceptrons Lecture 5: Multlayer Perceptrons Roger Grosse 1 Introducton So far, we ve only talked about lnear models: lnear regresson and lnear bnary classfers. We noted that there are functons that can t be represented

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

Boundary-Based Time Series Sorting

Boundary-Based Time Series Sorting JOURNAL OF ELECTRONIC SCIENCE AND TECHNOLOGY OF CHINA, VOL. 6, NO. 3, SEPTEMBER 2008 323 Boundary-Based Tme Seres Sortng Jun-Ku L, Yuan-Zhen Wang, and Ha-Bo L Abstract In many applcatons, t s desrable

More information

Collaboratively Regularized Nearest Points for Set Based Recognition

Collaboratively Regularized Nearest Points for Set Based Recognition Academc Center for Computng and Meda Studes, Kyoto Unversty Collaboratvely Regularzed Nearest Ponts for Set Based Recognton Yang Wu, Mchhko Mnoh, Masayuk Mukunok Kyoto Unversty 9/1/013 BMVC 013 @ Brstol,

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Decson surface s a hyperplane (lne n 2D) n feature space (smlar to the Perceptron) Arguably, the most mportant recent dscovery n machne learnng In a nutshell: map the data to a predetermned

More information

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems A Unfed Framework for Semantcs and Feature Based Relevance Feedback n Image Retreval Systems Ye Lu *, Chunhu Hu 2, Xngquan Zhu 3*, HongJang Zhang 2, Qang Yang * School of Computng Scence Smon Fraser Unversty

More information

UB at GeoCLEF Department of Geography Abstract

UB at GeoCLEF Department of Geography   Abstract UB at GeoCLEF 2006 Mguel E. Ruz (1), Stuart Shapro (2), June Abbas (1), Slva B. Southwck (1) and Davd Mark (3) State Unversty of New York at Buffalo (1) Department of Lbrary and Informaton Studes (2) Department

More information

Load Balancing for Hex-Cell Interconnection Network

Load Balancing for Hex-Cell Interconnection Network Int. J. Communcatons, Network and System Scences,,, - Publshed Onlne Aprl n ScRes. http://www.scrp.org/journal/jcns http://dx.do.org/./jcns.. Load Balancng for Hex-Cell Interconnecton Network Saher Manaseer,

More information

Unsupervised Learning and Clustering

Unsupervised Learning and Clustering Unsupervsed Learnng and Clusterng Supervsed vs. Unsupervsed Learnng Up to now we consdered supervsed learnng scenaro, where we are gven 1. samples 1,, n 2. class labels for all samples 1,, n Ths s also

More information

A Multiresolution Symbolic Representation of Time Series

A Multiresolution Symbolic Representation of Time Series A Multresoluton Symbolc Representaton of Tme Seres Vasleos Megalookonomou 1 Qang Wang 1 Guo L 1 Chrstos Faloutsos 2 1 Department of Computer & Informaton Scences 2 Department of Computer Scence Temple

More information

Determining the Optimal Bandwidth Based on Multi-criterion Fusion

Determining the Optimal Bandwidth Based on Multi-criterion Fusion Proceedngs of 01 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 5 (01) (01) IACSIT Press, Sngapore Determnng the Optmal Bandwdth Based on Mult-crteron Fuson Ha-L Lang 1+, Xan-Mn

More information

Smoothing Spline ANOVA for variable screening

Smoothing Spline ANOVA for variable screening Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory

More information

Using Fuzzy Logic to Enhance the Large Size Remote Sensing Images

Using Fuzzy Logic to Enhance the Large Size Remote Sensing Images Internatonal Journal of Informaton and Electroncs Engneerng Vol. 5 No. 6 November 015 Usng Fuzzy Logc to Enhance the Large Sze Remote Sensng Images Trung Nguyen Tu Huy Ngo Hoang and Thoa Vu Van Abstract

More information

Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram

Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram Shape Representaton Robust to the Sketchng Order Usng Dstance Map and Drecton Hstogram Department of Computer Scence Yonse Unversty Kwon Yun CONTENTS Revew Topc Proposed Method System Overvew Sketch Normalzaton

More information

CSCI 5417 Information Retrieval Systems Jim Martin!

CSCI 5417 Information Retrieval Systems Jim Martin! CSCI 5417 Informaton Retreval Systems Jm Martn! Lecture 11 9/29/2011 Today 9/29 Classfcaton Naïve Bayes classfcaton Ungram LM 1 Where we are... Bascs of ad hoc retreval Indexng Term weghtng/scorng Cosne

More information

Brushlet Features for Texture Image Retrieval

Brushlet Features for Texture Image Retrieval DICTA00: Dgtal Image Computng Technques and Applcatons, 1 January 00, Melbourne, Australa 1 Brushlet Features for Texture Image Retreval Chbao Chen and Kap Luk Chan Informaton System Research Lab, School

More information

SHAPE RECOGNITION METHOD BASED ON THE k-nearest NEIGHBOR RULE

SHAPE RECOGNITION METHOD BASED ON THE k-nearest NEIGHBOR RULE SHAPE RECOGNITION METHOD BASED ON THE k-nearest NEIGHBOR RULE Dorna Purcaru Faculty of Automaton, Computers and Electroncs Unersty of Craoa 13 Al. I. Cuza Street, Craoa RO-1100 ROMANIA E-mal: dpurcaru@electroncs.uc.ro

More information

2x x l. Module 3: Element Properties Lecture 4: Lagrange and Serendipity Elements

2x x l. Module 3: Element Properties Lecture 4: Lagrange and Serendipity Elements Module 3: Element Propertes Lecture : Lagrange and Serendpty Elements 5 In last lecture note, the nterpolaton functons are derved on the bass of assumed polynomal from Pascal s trangle for the fled varable.

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information

Parallel matrix-vector multiplication

Parallel matrix-vector multiplication Appendx A Parallel matrx-vector multplcaton The reduced transton matrx of the three-dmensonal cage model for gel electrophoress, descrbed n secton 3.2, becomes excessvely large for polymer lengths more

More information

Local Quaternary Patterns and Feature Local Quaternary Patterns

Local Quaternary Patterns and Feature Local Quaternary Patterns Local Quaternary Patterns and Feature Local Quaternary Patterns Jayu Gu and Chengjun Lu The Department of Computer Scence, New Jersey Insttute of Technology, Newark, NJ 0102, USA Abstract - Ths paper presents

More information

K-means and Hierarchical Clustering

K-means and Hierarchical Clustering Note to other teachers and users of these sldes. Andrew would be delghted f you found ths source materal useful n gvng your own lectures. Feel free to use these sldes verbatm, or to modfy them to ft your

More information

Face Recognition Based on SVM and 2DPCA

Face Recognition Based on SVM and 2DPCA Vol. 4, o. 3, September, 2011 Face Recognton Based on SVM and 2DPCA Tha Hoang Le, Len Bu Faculty of Informaton Technology, HCMC Unversty of Scence Faculty of Informaton Scences and Engneerng, Unversty

More information

An Accurate Evaluation of Integrals in Convex and Non convex Polygonal Domain by Twelve Node Quadrilateral Finite Element Method

An Accurate Evaluation of Integrals in Convex and Non convex Polygonal Domain by Twelve Node Quadrilateral Finite Element Method Internatonal Journal of Computatonal and Appled Mathematcs. ISSN 89-4966 Volume, Number (07), pp. 33-4 Research Inda Publcatons http://www.rpublcaton.com An Accurate Evaluaton of Integrals n Convex and

More information

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique //00 :0 AM Outlne and Readng The Greedy Method The Greedy Method Technque (secton.) Fractonal Knapsack Problem (secton..) Task Schedulng (secton..) Mnmum Spannng Trees (secton.) Change Money Problem Greedy

More information

Detection of an Object by using Principal Component Analysis

Detection of an Object by using Principal Component Analysis Detecton of an Object by usng Prncpal Component Analyss 1. G. Nagaven, 2. Dr. T. Sreenvasulu Reddy 1. M.Tech, Department of EEE, SVUCE, Trupath, Inda. 2. Assoc. Professor, Department of ECE, SVUCE, Trupath,

More information

SCALABLE AND VISUALIZATION-ORIENTED CLUSTERING FOR EXPLORATORY SPATIAL ANALYSIS

SCALABLE AND VISUALIZATION-ORIENTED CLUSTERING FOR EXPLORATORY SPATIAL ANALYSIS SCALABLE AND VISUALIZATION-ORIENTED CLUSTERING FOR EXPLORATORY SPATIAL ANALYSIS J.H.Guan, F.B.Zhu, F.L.Ban a School of Computer, Spatal Informaton & Dgtal Engneerng Center, Wuhan Unversty, Wuhan, 430079,

More information

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms Course Introducton Course Topcs Exams, abs, Proects A quc loo at a few algorthms 1 Advanced Data Structures and Algorthms Descrpton: We are gong to dscuss algorthm complexty analyss, algorthm desgn technques

More information

Face Recognition Method Based on Within-class Clustering SVM

Face Recognition Method Based on Within-class Clustering SVM Face Recognton Method Based on Wthn-class Clusterng SVM Yan Wu, Xao Yao and Yng Xa Department of Computer Scence and Engneerng Tong Unversty Shangha, Chna Abstract - A face recognton method based on Wthn-class

More information

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification Introducton to Artfcal Intellgence V22.0472-001 Fall 2009 Lecture 24: Nearest-Neghbors & Support Vector Machnes Rob Fergus Dept of Computer Scence, Courant Insttute, NYU Sldes from Danel Yeung, John DeNero

More information

BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION

BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION SHI-LIANG SUN, HONG-LEI SHI Department of Computer Scence and Technology, East Chna Normal Unversty 500 Dongchuan Road, Shangha 200241, P. R. Chna E-MAIL: slsun@cs.ecnu.edu.cn,

More information

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour 6.854 Advanced Algorthms Petar Maymounkov Problem Set 11 (November 23, 2005) Wth: Benjamn Rossman, Oren Wemann, and Pouya Kheradpour Problem 1. We reduce vertex cover to MAX-SAT wth weghts, such that the

More information

Correlative features for the classification of textural images

Correlative features for the classification of textural images Correlatve features for the classfcaton of textural mages M A Turkova 1 and A V Gadel 1, 1 Samara Natonal Research Unversty, Moskovskoe Shosse 34, Samara, Russa, 443086 Image Processng Systems Insttute

More information

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009.

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009. Farrukh Jabeen Algorthms 51 Assgnment #2 Due Date: June 15, 29. Assgnment # 2 Chapter 3 Dscrete Fourer Transforms Implement the FFT for the DFT. Descrbed n sectons 3.1 and 3.2. Delverables: 1. Concse descrpton

More information

TN348: Openlab Module - Colocalization

TN348: Openlab Module - Colocalization TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

Learning-Based Top-N Selection Query Evaluation over Relational Databases

Learning-Based Top-N Selection Query Evaluation over Relational Databases Learnng-Based Top-N Selecton Query Evaluaton over Relatonal Databases Lang Zhu *, Wey Meng ** * School of Mathematcs and Computer Scence, Hebe Unversty, Baodng, Hebe 071002, Chna, zhu@mal.hbu.edu.cn **

More information

A Clustering Algorithm for Key Frame Extraction Based on Density Peak

A Clustering Algorithm for Key Frame Extraction Based on Density Peak Journal of Computer and Communcatons, 2018, 6, 118-128 http://www.scrp.org/ournal/cc ISSN Onlne: 2327-5227 ISSN Prnt: 2327-5219 A Clusterng Algorthm for Key Frame Extracton Based on Densty Peak Hong Zhao

More information

Clustering Algorithm of Similarity Segmentation based on Point Sorting

Clustering Algorithm of Similarity Segmentation based on Point Sorting Internatonal onference on Logstcs Engneerng, Management and omputer Scence (LEMS 2015) lusterng Algorthm of Smlarty Segmentaton based on Pont Sortng Hanbng L, Yan Wang*, Lan Huang, Mngda L, Yng Sun, Hanyuan

More information

Optimal Workload-based Weighted Wavelet Synopses

Optimal Workload-based Weighted Wavelet Synopses Optmal Workload-based Weghted Wavelet Synopses Yoss Matas School of Computer Scence Tel Avv Unversty Tel Avv 69978, Israel matas@tau.ac.l Danel Urel School of Computer Scence Tel Avv Unversty Tel Avv 69978,

More information

Hybrid Non-Blind Color Image Watermarking

Hybrid Non-Blind Color Image Watermarking Hybrd Non-Blnd Color Image Watermarkng Ms C.N.Sujatha 1, Dr. P. Satyanarayana 2 1 Assocate Professor, Dept. of ECE, SNIST, Yamnampet, Ghatkesar Hyderabad-501301, Telangana 2 Professor, Dept. of ECE, AITS,

More information

Machine Learning 9. week

Machine Learning 9. week Machne Learnng 9. week Mappng Concept Radal Bass Functons (RBF) RBF Networks 1 Mappng It s probably the best scenaro for the classfcaton of two dataset s to separate them lnearly. As you see n the below

More information

APPLICATION OF MULTIVARIATE LOSS FUNCTION FOR ASSESSMENT OF THE QUALITY OF TECHNOLOGICAL PROCESS MANAGEMENT

APPLICATION OF MULTIVARIATE LOSS FUNCTION FOR ASSESSMENT OF THE QUALITY OF TECHNOLOGICAL PROCESS MANAGEMENT 3. - 5. 5., Brno, Czech Republc, EU APPLICATION OF MULTIVARIATE LOSS FUNCTION FOR ASSESSMENT OF THE QUALITY OF TECHNOLOGICAL PROCESS MANAGEMENT Abstract Josef TOŠENOVSKÝ ) Lenka MONSPORTOVÁ ) Flp TOŠENOVSKÝ

More information

Private Information Retrieval (PIR)

Private Information Retrieval (PIR) 2 Levente Buttyán Problem formulaton Alce wants to obtan nformaton from a database, but she does not want the database to learn whch nformaton she wanted e.g., Alce s an nvestor queryng a stock-market

More information

Wishing you all a Total Quality New Year!

Wishing you all a Total Quality New Year! Total Qualty Management and Sx Sgma Post Graduate Program 214-15 Sesson 4 Vnay Kumar Kalakband Assstant Professor Operatons & Systems Area 1 Wshng you all a Total Qualty New Year! Hope you acheve Sx sgma

More information

Querying by sketch geographical databases. Yu Han 1, a *

Querying by sketch geographical databases. Yu Han 1, a * 4th Internatonal Conference on Sensors, Measurement and Intellgent Materals (ICSMIM 2015) Queryng by sketch geographcal databases Yu Han 1, a * 1 Department of Basc Courses, Shenyang Insttute of Artllery,

More information

Investigating the Performance of Naïve- Bayes Classifiers and K- Nearest Neighbor Classifiers

Investigating the Performance of Naïve- Bayes Classifiers and K- Nearest Neighbor Classifiers Journal of Convergence Informaton Technology Volume 5, Number 2, Aprl 2010 Investgatng the Performance of Naïve- Bayes Classfers and K- Nearest Neghbor Classfers Mohammed J. Islam *, Q. M. Jonathan Wu,

More information

Under-Sampling Approaches for Improving Prediction of the Minority Class in an Imbalanced Dataset

Under-Sampling Approaches for Improving Prediction of the Minority Class in an Imbalanced Dataset Under-Samplng Approaches for Improvng Predcton of the Mnorty Class n an Imbalanced Dataset Show-Jane Yen and Yue-Sh Lee Department of Computer Scence and Informaton Engneerng, Mng Chuan Unversty 5 The-Mng

More information

Novel Pattern-based Fingerprint Recognition Technique Using 2D Wavelet Decomposition

Novel Pattern-based Fingerprint Recognition Technique Using 2D Wavelet Decomposition Mathematcal Methods for Informaton Scence and Economcs Novel Pattern-based Fngerprnt Recognton Technque Usng D Wavelet Decomposton TUDOR BARBU Insttute of Computer Scence of the Romanan Academy T. Codrescu,,

More information

Fuzzy Modeling of the Complexity vs. Accuracy Trade-off in a Sequential Two-Stage Multi-Classifier System

Fuzzy Modeling of the Complexity vs. Accuracy Trade-off in a Sequential Two-Stage Multi-Classifier System Fuzzy Modelng of the Complexty vs. Accuracy Trade-off n a Sequental Two-Stage Mult-Classfer System MARK LAST 1 Department of Informaton Systems Engneerng Ben-Guron Unversty of the Negev Beer-Sheva 84105

More information

THE CONDENSED FUZZY K-NEAREST NEIGHBOR RULE BASED ON SAMPLE FUZZY ENTROPY

THE CONDENSED FUZZY K-NEAREST NEIGHBOR RULE BASED ON SAMPLE FUZZY ENTROPY Proceedngs of the 20 Internatonal Conference on Machne Learnng and Cybernetcs, Guln, 0-3 July, 20 THE CONDENSED FUZZY K-NEAREST NEIGHBOR RULE BASED ON SAMPLE FUZZY ENTROPY JUN-HAI ZHAI, NA LI, MENG-YAO

More information

Fast Feature Value Searching for Face Detection

Fast Feature Value Searching for Face Detection Vol., No. 2 Computer and Informaton Scence Fast Feature Value Searchng for Face Detecton Yunyang Yan Department of Computer Engneerng Huayn Insttute of Technology Hua an 22300, Chna E-mal: areyyyke@63.com

More information

Outline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like:

Outline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like: Self-Organzng Maps (SOM) Turgay İBRİKÇİ, PhD. Outlne Introducton Structures of SOM SOM Archtecture Neghborhoods SOM Algorthm Examples Summary 1 2 Unsupervsed Hebban Learnng US Hebban Learnng, Cntd 3 A

More information

A fast algorithm for color image segmentation

A fast algorithm for color image segmentation Unersty of Wollongong Research Onlne Faculty of Informatcs - Papers (Arche) Faculty of Engneerng and Informaton Scences 006 A fast algorthm for color mage segmentaton L. Dong Unersty of Wollongong, lju@uow.edu.au

More information

An enhanced representation of time series which allows fast and accurate classification, clustering and relevance feedback

An enhanced representation of time series which allows fast and accurate classification, clustering and relevance feedback An enhanced representaton of tme seres whch allows fast and accurate classfcaton, clusterng and relevance feedback Eamonn J. Keogh and Mchael J. Pazzan Department of Informaton and Computer Scence Unversty

More information

Machine Learning. Support Vector Machines. (contains material adapted from talks by Constantin F. Aliferis & Ioannis Tsamardinos, and Martin Law)

Machine Learning. Support Vector Machines. (contains material adapted from talks by Constantin F. Aliferis & Ioannis Tsamardinos, and Martin Law) Machne Learnng Support Vector Machnes (contans materal adapted from talks by Constantn F. Alfers & Ioanns Tsamardnos, and Martn Law) Bryan Pardo, Machne Learnng: EECS 349 Fall 2014 Support Vector Machnes

More information

A New Approach For the Ranking of Fuzzy Sets With Different Heights

A New Approach For the Ranking of Fuzzy Sets With Different Heights New pproach For the ankng of Fuzzy Sets Wth Dfferent Heghts Pushpnder Sngh School of Mathematcs Computer pplcatons Thapar Unversty, Patala-7 00 Inda pushpndersnl@gmalcom STCT ankng of fuzzy sets plays

More information

Feature Reduction and Selection

Feature Reduction and Selection Feature Reducton and Selecton Dr. Shuang LIANG School of Software Engneerng TongJ Unversty Fall, 2012 Today s Topcs Introducton Problems of Dmensonalty Feature Reducton Statstc methods Prncpal Components

More information

High-Boost Mesh Filtering for 3-D Shape Enhancement

High-Boost Mesh Filtering for 3-D Shape Enhancement Hgh-Boost Mesh Flterng for 3-D Shape Enhancement Hrokazu Yagou Λ Alexander Belyaev y Damng We z Λ y z ; ; Shape Modelng Laboratory, Unversty of Azu, Azu-Wakamatsu 965-8580 Japan y Computer Graphcs Group,

More information

A CALCULATION METHOD OF DEEP WEB ENTITIES RECOGNITION

A CALCULATION METHOD OF DEEP WEB ENTITIES RECOGNITION A CALCULATION METHOD OF DEEP WEB ENTITIES RECOGNITION 1 FENG YONG, DANG XIAO-WAN, 3 XU HONG-YAN School of Informaton, Laonng Unversty, Shenyang Laonng E-mal: 1 fyxuhy@163.com, dangxaowan@163.com, 3 xuhongyan_lndx@163.com

More information