Clustering is a discovery process in data mining.

Size: px
Start display at page:

Download "Clustering is a discovery process in data mining."

Transcription

1 Cover Feature Chameleon: Herarchcal Clusterng Usng Dynamc Modelng Many advanced algorthms have dffculty dealng wth hghly varable clusters that do not follow a preconceved model. By basng ts selectons on both nterconnectvty and closeness, the Chameleon algorthm yelds accurate results for these hghly varable clusters. George Karyps Eu-Hong (Sam) Han Vpn Kumar Unversty of Mnnesota Clusterng s a dscovery process n data mnng. 1 It groups a set of data n a way that maxmzes the smlarty wthn clusters and mnmzes the smlarty between two dfferent clusters. 1,2 These dscovered clusters can help explan the characterstcs of the underlyng data dstrbuton and serve as the foundaton for other data mnng and analyss technques. Clusterng s useful n characterzng customer groups based on purchasng patterns, categorzng Web documents, 3 groupng genes and protens that have smlar functonalty, 4 groupng spatal locatons prone to earthquakes based on sesmologcal data, and so on. Most exstng clusterng algorthms fnd clusters that ft some statc model. Although effectve n some cases, these algorthms can break down that s, cluster the data ncorrectly f the user doesn t select approprate statc-model parameters. Or sometmes the model cannot adequately capture the clusters characterstcs. Most of these algorthms break down when the data contans clusters of dverse shapes, denstes, and szes. Exstng algorthms use a statc model of the clusters and do not use nformaton about the nature of ndvdual clusters as they are merged. Furthermore, one set of schemes (the CURE algorthm and related schemes) gnores the nformaton about the aggregate nterconnectvty of tems n two clusters. The other set of schemes (the Rock algorthm, group averagng method, and related schemes) gnores nformaton about the closeness of two clusters as defned by the smlarty of the closest tems across two clusters. (For more nformaton, see the Lmtatons of Tradtonal Clusterng Algorthms sdebar.) By only consderng ether nterconnectvty or closeness, these algorthms can easly select and merge the wrong par of clusters. For nstance, an algorthm that focuses only on the closeness of two clusters wll ncorrectly merge the clusters n Fgure 1a over those n Fgure 1b. Smlarly, an algorthm that focuses only on nterconnectvty wll, n Fgure 2, ncorrectly merge the dark-blue wth the red cluster rather than the green one. Here, we assume that the aggregate nterconnectvty between the tems n the dark-blue and red clusters s greater than that of the dark-blue and green clusters. However, the border ponts of the dark-blue cluster are much closer to those of the green cluster than those of the red cluster. CHAMELEON: CLUSTERING USING DYNAMIC MODELING Chameleon s a new agglomeratve herarchcal clusterng algorthm that overcomes the lmtatons of exstng clusterng algorthms. Fgure 3 (on page 70) provdes an overvew of the overall approach used by Chameleon to fnd the clusters n a data set. The Chameleon algorthm s key feature s that t accounts for both nterconnectvty and closeness n dentfyng the most smlar par of clusters. It thus avods the lmtatons dscussed earler. Furthermore, Chameleon uses a novel approach to model the degree of nterconnectvty and closeness between each par of clusters. Ths approach consders the nternal characterstcs of the clusters themselves. Thus, t does not depend on a statc, user-suppled model and can automatcally adapt to the nternal characterstcs of the merged clusters. Chameleon operates on a sparse graph n whch nodes represent data tems, and weghted edges represent smlartes among the data tems. Ths sparsegraph representaton allows Chameleon to scale to large data sets and to successfully use data sets that 68 Computer /99/$ IEEE

2 are avalable only n smlarty space and not n metrc spaces. 5 Data sets n a metrc space have a fxed number of attrbutes for each data tem, whereas data sets n a smlarty space only provde smlartes between data tems. Chameleon fnds the clusters n the data set by usng a two-phase algorthm. Durng the frst phase, Chameleon uses a graph-parttonng algorthm to cluster the data tems nto several relatvely small subclusters. Durng the second phase, t uses an algorthm to fnd the genune clusters by repeatedly combnng these subclusters. Modelng the data Gven a matrx whose entres represent the smlarty between data tems, many methods can be used to fnd a graph representaton. 2,8 In fact, modelng data tems as a graph s very common n many herarchcal clusterng algorthms. 2,8,9 Chameleon s sparse-graph representaton of the tems s based on the commonly used k-nearest-neghbor graph approach. Each vertex of the k-nearest-neghbor graph represents a data tem. An edge exsts between two vertces v and u f u s among the k most smlar ponts of v, or v s among the k most smlar ponts of u. (a) Fgure 1. Algorthms based only on closeness wll ncorrectly merge (a) the dark-blue and green clusters because these two clusters are closer together than (b) the red and cyan clusters. Fgure 2. Algorthms based only on the nterconnectvty of two clusters wll ncorrectly merge the dark-blue and red clusters rather than dark-blue and green clusters. (b) Lmtatons of Tradtonal Clusterng Algorthms Partton-based clusterng technques such as K-Means 2 and Clarans 6 attempt to break a data set nto K clusters such that the partton optmzes a gven crteron. These algorthms assume that clusters are hyper-ellpsodal and of smlar szes. They can t fnd clusters that vary n sze, as shown n Fgure A1, or concave shapes, as shown n Fgure A2. DBScan 7 (Densty-Based Spatal Clusterng of Applcatons wth Nose), a wellknown spatal clusterng algorthm, can fnd clusters of arbtrary shapes. DBScan defnes a cluster to be a maxmum set of densty-connected ponts, whch means that every core pont n a cluster must have at least a mnmum number of ponts (MnPts) wthn a gven radus (Eps). DBScan assumes that all ponts wthn genune clusters can be reached from one another by traversng a path of denstyconnected ponts and ponts across dfferent clusters cannot. DBScan can fnd arbtrarly shaped clusters f the cluster densty can be determned beforehand and (1) (2) the cluster densty s unform. Herarchcal clusterng algorthms produce a nested sequence of clusters wth a sngle, all-nclusve cluster at the top and sngle-pont clusters at the bottom. Agglomeratve herarchcal algorthms 2 start wth each data pont as a separate cluster. Each step of the algorthm nvolves mergng two clusters that are the most smlar. After each merger, the total number of clusters decreases by one. Users can repeat these steps untl they obtan the desred number of clusters or the dstance between the two closest clusters goes above a certan threshold. The many varatons of agglomeratve herarchcal algorthms 2 prmarly dffer n how they update the smlarty between exstng and merged clusters. In some herarchcal methods, each clus- Fgure A. Data sets on whch centrod and medod approaches fal: Clusters (1) of wdely dfferent szes and (2) wth concave shapes. August

3 Data set Construct a sparse graph k-nearest neghbor graph Partton the graph Merge parttons Fnal clusters Fgure 3. Chameleon uses a two-phase algorthm, whch frst parttons the data tems nto subclusters and then repeatedly combnes these subclusters to obtan the fnal clusters. Fgure 4 llustrates the 1-, 2-, and 3-nearest-neghbor graphs of a smple data set. There are several advantages to representng the tems to be clustered usng a k-nearest-neghbor graph. Data tems that are far apart are completely dsconnected, and the weghts on the edges capture the underlyng populaton densty of the space. Items n denser and sparser regons are modeled unformly, and the sparsty of the representaton leads to computatonally effcent algorthms. Because Chameleon operates on a sparse graph, each cluster s nothng more than a subgraph of the data set s orgnal sparse-graph representaton. Modelng cluster smlarty Chameleon uses a dynamc modelng framework to determne the smlarty between pars of clusters by lookng at ther relatve nterconnectvty (RI) and relatve closeness (RC). Chameleon selects pars to merge for whch both RI and RC are hgh. That s, t selects clusters that are well nterconnected as well as close together. Relatve nterconnectvty. Clusterng algorthms typcally measure the absolute nterconnectvty between clusters C and C n terms of edge cut the sum of the weght of the edges that straddle the two clusters, whch we denote EC(C, C ). Relatve nterconnectvty between clusters s ther absolute nterconnectvty normalzed wth respect to ther nternal nterconnectvtes. To get the cluster s nternal nterconnectvty, we sum the edges crossng a mn-cut bsecton that splts the cluster nto two roughly equal parts. Recent advances n graph parttonng have made t possble to effcently fnd such quanttes. 10 Thus, the relatve nterconnectvty between a par of clusters C and C s RI( C, C ) = EC( C, C ) EC( C) + EC( C ) 2 By focusng on relatve nterconnectvty, Chameleon ter s represented by a centrod or medod a data pont that s the closest to the center of the cluster and the smlarty between two clusters s measured by the smlarty between the centrods/medods. Both of these schemes fal for data n whch ponts n a gven cluster are closer to the center of another cluster than to the center of ther own cluster. Ths stuaton occurs n many natural clusters, 4,8 for example, f there s a large varaton n cluster szes, as n Fgure A1, or when cluster shapes are concave, as n Fgure A2. The sngle-lnk herarchcal method 2 measures the smlarty between two clusters by the smlarty of the closest par of data ponts belongng to dfferent clusters. Unlke the centrod/medod-based methods, ths method can fnd clusters of arbtrary shape and dfferent szes. However, t s hghly susceptble to nose, outlers, and artfacts. Researchers have proposed CURE 9 (Clusterng Usng Representatves) to remedy the drawbacks of both of these methods whle combnng ther advantages. In CURE, a cluster s represented by selectng a constant number of well-scattered ponts and shrnkng them toward the cluster s centrod, accordng to a shrnkng factor. CURE measures the smlarty between two clusters by the smlarty of the closest par of ponts belongng to dfferent clusters. Unlke centrod/medod-based methods, CURE can fnd clusters of arbtrary shapes and szes, as t represents each cluster va multple representatve ponts. Shrnkng the representatve ponts toward the centrod allows CURE to avod some of the problems assocated wth nose and outlers. However, these technques fal to account for specal characterstcs of ndvdual clusters. They can make ncorrect mergng decsons when the underlyng data does not follow the assumed model or when nose s present. In some algorthms, the smlarty between two clusters s captured by the aggregate of the smlartes (that s, the nterconnectvty) among pars of tems belongng to dfferent clusters. The ratonale for ths approach s that subclusters belongng to the same cluster wll tend to have hgh nterconnectvty. But the aggregate nterconnectvty between two clusters depends on the sze of the clusters; n general, pars of larger clusters wll have hgher nterconnectvty. Many such schemes normalze the aggregate smlarty between a par of clusters wth respect to the expected nterconnectvty of the clusters nvolved. For example, the wdely used group-average method 2 assumes fully connected clusters, and thus scales the aggregate smlarty between two clusters by n m, where n and m are the number of members n the two clusters. Rock 8 (Robust Clusterng Usng Lnks), a recently developed algorthm that operates on a derved smlarty graph, scales the aggregate nterconnectvty wth respect to a user-specfed nterconnectvty model. However, the maor lmtaton of all such schemes s that they assume a statc, usersuppled nterconnectvty model. Such models are nflexble and can easly lead to ncorrect mergng decsons when the model under- or overestmates the nterconnectvty of the data set or when dfferent clusters exhbt dfferent nterconnectvty characterstcs. Although some schemes allow the connectvty to vary for dfferent problem domans (as does Rock 8 ), t s stll the same for all clusters rrespectve of ther denstes and shapes. 70 Computer

4 can overcome the lmtatons of exstng algorthms that use statc nterconnectvty models. Relatve nterconnectvty can account for dfferences n cluster shapes as well as dfferences n the degree of nterconnectvty for dfferent clusters. Relatve closeness. Relatve closeness nvolves concepts that are analogous to those developed for relatve nterconnectvty. The absolute closeness of clusters s the average weght (as opposed to the sum of weghts for nterconnectvty) of the edges that connect vertces n C to those n C. Snce these connectons come from the k-nearest-neghbor graph, ther average strength provdes a good measure of the affnty between the data tems along the nterface layer of the two clusters. At the same tme, ths measure s tolerant of outlers and nose. To get a cluster s nternal closeness, we take the average of the edge weghts across a mn-cut bsecton that splts the cluster nto two roughly equal parts. The relatve closeness between a par of clusters s the absolute closeness normalzed wth respect to the nternal closeness of the two clusters: RC(C, C ) = SEC(C,C ) C C C SEC C C ( ) + C C SEC ( C ) + + where SEC(C ) and SEC(C ) are the average weghts of the edges that belong n the mn-cut bsector of clusters C and C, and SEC(C, C ) s the average weght of the edges that connect vertces n C and C. Terms C and C are the number of data ponts n each cluster. Ths equaton also normalzes the absolute closeness of the two clusters by the weghted average of the nternal closeness of C and C. Ths dscourages the mergng of small sparse clusters nto large dense clusters. In general, the relatve closeness between two clusters s less than one because the edges that connect vertces n dfferent clusters have a smaller weght. By focusng on the relatve closeness, Chameleon can overcome the lmtatons of exstng algorthms that look only at the absolute closeness. By lookng at the relatve closeness, Chameleon correctly merges clusters so that the resultng cluster has a unform degree of closeness between ts tems. Process The dynamc framework for modelng cluster smlarty s applcable only when each cluster contans a suffcently large number of vertces (data tems). The reason s that to compute the relatve nterconnectvty and closeness of clusters, Chameleon needs to compute each cluster s nternal nterconnectvty and closeness, nether of whch can be accurately calculated for clusters contanng a few data ponts. For ths (a) (b) (c) (d) Fgure 4. K-nearest-neghbor graphs from orgnal data n two dmensons: (a) orgnal data, (b) 1-, (c) 2-, and (d) 3-nearest neghbor graphs. reason, Chameleon has a frst phase that clusters the data tems nto several subclusters that contan a suffcent number of tems to allow dynamc modelng. In a second phase, t dscovers the genune clusters n the data set by usng the dynamc modelng framework to herarchcally merge these subclusters. Chameleon fnds the ntal subclusters usng hmets, 10 a fast, hgh-qualty graph-parttonng algorthm. hmets parttons the k-nearest-neghbor graph of the data set nto several parttons such that t mnmzes the edge cut. Snce each edge n the k-nearestneghbor graph represents the smlarty among data ponts, a parttonng that mnmzes the edge cut effectvely mnmzes the relatonshp (affnty) among data ponts across the parttons. After fndng subclusters, Chameleon swtches to an algorthm that repeatedly combnes these small subclusters, usng the relatve nterconnectvty and relatve closeness framework. There are many ways to develop an algorthm that accounts for both of these measures. Chameleon uses two dfferent schemes. User-specfed thresholds. The frst merges only those pars of clusters exceedng user-specfed thresholds for relatve nterconnectvty (T RI ) and relatve closeness (T RC ). In ths approach, Chameleon vsts each cluster C and checks for adacent clusters C wth RI and RC that exceed these thresholds. If more than one adacent cluster satsfes these condtons, then Chameleon wll merge C wth the cluster that t s most connected to that s, the par C and C wth the hghest absolute nterconnectvty. Once Chameleon chooses a partner for every cluster, t performs these mergers and repeats the entre process. Users can control the characterstcs of the desred clusters wth T RI and T RC. Functon-defned optmzaton. The second scheme mplemented n Chameleon uses a functon to combne the relatve nterconnectvty and relatve closeness. Chameleon selects cluster pars that maxmze ths functon. Snce our goal s to merge pars for whch both the relatve nterconnectvty and relatve closeness are hgh, a natural way of defnng such a functon s to take ther product. That s, select clusters that maxmze RI(C,C ) RC(C,C ). Ths formula gves equal mportance to both parameters. However, qute often we may prefer clusters that gve a hgher preference to one of these two measures. For ths reason, Chameleon selects the par of clusters that maxmzes RI(C,C ) RC(C,C ) α August

5 Fgure 5. Four data sets used n our experments: (a) DS1 has 8,000 ponts; (b) DS2 has 6,000 ponts; (c) DS3 has 10,000 ponts; and (d) DS4 has 8,000 ponts. (a) (b) (c) (d) where α s a user-specfed parameter. If α > 1, then Chameleon gves a hgher mportance to the relatve closeness, and when α < 1, t gves a hgher mportance to the relatve nterconnectvty. RESULTS AND COMPARISONS We compared Chameleon s performance aganst that of CURE 9 and DBScan 7 on four dfferent data sets. These data sets contan from 6,000 to 10,000 ponts n two dmensons; the ponts form geometrc shapes as shown n Fgure 5. These data sets represent some dffcult clusterng nstances because they contan clusters of arbtrary shape, proxmty, orentaton, and varyng denstes. They also contan sgnfcant nose and specal artfacts. Many of these results are avalable at ~han/chameleon.html. Chameleon s applcable to any data set for whch a smlarty matrx s avalable (or can be constructed). However, we evaluated t wth two-dmensonal data sets because smlar data sets have been used to evaluate other state-of-the-art algorthms. Clusters n 2D data sets are also easy to vsualze and compare. Fgure 6 shows the clusters that Chameleon found for each of the four data sets, usng the same set of parameter values. In partcular, we used k = 10 n a functon-defned optmzaton wth α = 2.0. Ponts n dfferent clusters are represented usng a combnaton of colors and glyphs. As a result, ponts that belong to the same cluster use both the same color and glyph. For example, n the DS3 clusters, there are two cyan clusters. One contans the ponts n the regon between the two crcles nsde the ellpse, and the other, the lne between the two horzontal bars and the c-shaped cluster. Two clusters are dark blue one corresponds to the upsde-down c-shaped cluster and the other, the crcle nsde the candy cane. Ther ponts use dfferent glyphs (bells and squares for the frst par, and squares and Fgure 6. Clusters dscovered by Chameleon for the four data sets. 72 Computer

6 bells for the second), so they denote dfferent clusters. The clusterng shown n Fgure 6 corresponds to the earlest pont at whch Chameleon was able to fnd the genune clusters n the data set. That s, they correspond to the earlest teraton at whch Chameleon dentfes the genune clusters and places each n a sngle cluster. Thus for a gven data set, Fgure 6 shows more than the number of genune clusters; the addtonal clusters contan outlers. Chameleon correctly dentfes the genune clusters n all four data sets. For example, n DS1, Chameleon fnds sx clusters, fve of whch are genune clusters, and the sxth (shown n brown) corresponds to outlers that connect the two ellpsod clusters. In DS3, Chameleon fnds the nne genune clusters and two clusters that contan outlers. In DS2 and DS4, Chameleon accurately fnds only the genune cluster. CURE We also evaluated CURE, whch has been shown to effectvely fnd clusters n two-dmensonal data sets. 9 CURE was able to fnd the rght clusters for DS1 and DS2, but t faled to fnd the rght clusters on the remanng two data sets. Fgure 7 shows the results obtaned by CURE for the DS3 and DS4 data sets. For each of the data sets, Fgure 7 shows two dfferent clusterng solutons contanng dfferent numbers of clusters. The clusterng soluton n Fgure 7a corresponds to the earlest pont n the process n whch CURE merges subclusters that belong to two dfferent genune clusters. As we can see from Fgure 7b, n the case of DS3, CURE makes a mstake when gong down to 25 clusters, as t merges one of the crcles nsde the ellpse wth a porton of the ellpse. Smlarly, n the case of DS4 (Fgure 7c), CURE also makes a mstake when gong down to 25 clusters by mergng the small crcular cluster wth a porton of the upsde-down Y-shaped cluster. The second clusterng soluton (Fgure 7d) corresponds to solutons that contan as many clusters as those dscovered by Chameleon. These solutons are consderably worse than the frst set of solutons, ndcatng that CURE s mergng scheme makes multple mstakes for these data sets. DBSCAN Fgure 8 shows the clusters found by DBScan for DS1 and DS2, usng dfferent values of the Eps parameter. Followng the recommendaton of DBScan s (a) (b) Fgure 7. Clusters dentfed by CURE wth shrnkng factor 0.3 and number of representatve ponts equal to 10. For DS3, CURE frst merges subclusters that belong to two dfferent subclusters at (a) 25 clusters. Wth (b) 11 clusters specfed the same number that Chameleon found CURE obtans the result shown. CURE produced these results for DS4 wth (c) 25 and (d) 8 clusters specfed. (c) (d) August

7 Fgure 8. DBScan results for DS1 wth MnPts at 4 and Eps at (a) 0.5 and (b) 0.4. (a) (b) (a) (b) (c) Fgure 9. DBScan results for DS2 wth MnPts at 4 and Eps at (a) 5.0, (b) 3.5, and (c) 3.0. developers, 7 we fxed the MnPts at 4 and vared Eps n these experments. The clusters produced for DS1 llustrate that DBScan cannot effectvely fnd clusters of dfferent densty. In Fgure 8a, DBScan puts the two red ellpses (at the top of the mage) nto the same cluster because the outler ponts satsfy the densty requrements as dctated by the Eps and MnPts parameters. Decreasng the value of Eps can separate these clusters, as shown n Fgure 8b for Eps at 0.4. Although DBScan separates the ellpses wthout fragmentng them, t now fragments the largest but lower densty cluster nto several small subclusters. Our experments have shown that DBScan exhbts smlar characterstcs on DS4. The clusters produced for DS2 llustrate that DBScan cannot effectvely fnd clusters wth varable nternal denstes. Fgure 9a through Fgure 9c show a sequence of three clusterng solutons for decreasng values of Eps. As we decrease Eps n the hope of separatng the two clusters, the natural clusters n the data set are fragmented nto several smaller clusters. DBScan fnds the correct clusters n DS3 gven the rght combnaton of Eps and MnPts. If we fx MnPts at 4, then the algorthm works fne on ths data set as long as Eps s from 5.7 to 6.1. Even though we chose to model the data usng a k- nearest-neghbor graph, t s entrely possble to use other graph representatons such as those based on mutual shared neghbors, 2,8,11 whch are sutable for partcular applcaton domans. Furthermore, dfferent domans may requre dfferent models for capturng relatve closeness and nterconnectvty. In any of these stuatons, we beleve that the two-phase framework of Chameleon would stll be hghly effectve. Our future research ncludes the verfcaton of Chameleon on dfferent applcaton domans. We also want to study the effectveness of dfferent technques for modelng data as well as cluster smlarty. Acknowledgments Ths work was partally supported by Army Research Offce contract DA/DAAG and the Army Hgh-Performance Computng Research Center under the auspces of the Department of the Army, Army Research Laboratory cooperatve agreement number DAAH /contract number DAAH04-95-C-0008, the content of whch does not necessarly reflect the government s poston or polcy, and should nfer no offcal endorsement. Ths work was also supported by an IBM Partnershp Award. Access to computng facltes was provded by the Army Hgh-Performance Computng Research Center and the Mnnesota Supercomputer Insttute. We also thank Martn Ester, Hans-Peter Kregel, Jorg Sander, and Xaowe Xu for provdng the DBScan program. References 1. M.S. Chen, J. Han, and P.S. Yu, Data Mnng: An Overvew from a Database Perspectve, IEEE Trans. Knowledge and Data Eng., Dec. 1996, pp A.K. Jan and R.C. Dubes, Algorthms for Clusterng Data, Prentce Hall, Upper Saddle Rver, N.J., D. Boley et al., Document Categorzaton and Query Generaton on the World Wde Web Usng WebACE, to be publshed n AI Revew, E.H. Han et al., Hypergraph-Based Clusterng n Hgh- Dmensonal Data Sets: A Summary of Results, Bull. Tech. 74 Computer

8 Commttee on Data Eng., Vol. 21, No. 1, 1998, pp Also a longer verson avalable as Tech. Report TR , Dept. of Computer Scence, Unv. of Mnnesota, V. Gant et al., Clusterng Large Datasets n Arbtrary Metrc Spaces, Proc. 15th Int l Conf. Data Eng., IEEE CS Press, Los Alamtos, Calf., 1999, pp R. Ng and J. Han, Effcent and Effectve Clusterng Method for Spatal Data Mnng, Proc. 20th Int l Conf. Very Large Data Bases, Morgan Kaufmann, San Francsco, 1994, pp M. Ester et al., A Densty-Based Algorthm for Dscoverng Clusters n Large Spatal Databases wth Nose, Proc. 2nd Int l Conf. Knowledge Dscovery and Data Mnng, AAAI Press, Menlo Park, Calf., 1996, pp S. Guha, R. Rastog, and K. Shm, ROCK: A Robust Clusterng Algorthm for Categorcal Attrbutes, Proc. 15th Int l Conf. Data Eng., IEEE CS Press, Los Alamtos, Calf., 1999, pp S. Guha, R. Rastog, and K. Shm, CURE: An Effcent Clusterng Algorthm for Large Databases, Proc. ACM SIGMOD Int l Conf. Management of Data, ACM Press, New York, 1998, pp G. Karyps and V. Kumar, hmetis 1.5: A Hypergraph Parttonng Package, Tech. Report, Dept. of Computer Scence, Unv. of Mnnesota, 1998; umn.edu/~karyps/mets. 11. K.C. Gowda and G. Krshna, Agglomeratve Clusterng Usng the Concept of Mutual Nearest Neghborhood, Pattern Recognton, Feb. 1978, pp George Karyps s an assstant professor of computer scence at the Unversty of Mnnesota. Hs research nterests nclude data mnng, bo-nformatcs, parallel computng, graph parttonng, and scentfc computng. Karyps has a PhD n computer scence from the Unversty of Mnnesota. He s a member of the IEEE Computer Socety and the ACM. Eu-Hong (Sam) Han s a PhD canddate at the Unversty of Mnnesota. Hs research nterests nclude data mnng and nformaton retreval. Han has an MS n computer scence from the Unversty of Texas, Austn. He s a member of the ACM. Vpn Kumar s a professor of computer scence at the Unversty of Mnnesota. Hs research nterests nclude data mnng, parallel computng, and scentfc computng. Kumar has a PhD n computer scence from the Unversty of Maryland, College Park. He s a member of the IEEE, ACM, and SIAM. Contact the authors at the Unv. of Mnnesota, Dept. of Computer Scence and Eng., EECS Bldg., 200 Unon St. SE, Mnneapols, MN 55455; {karyps, han,kumar}@cs.umn.edu. Ggabt Ethernet What s the standard technology? We wrote the book on t. Jon our member network. Fnd Out Dd You Know? Rght now, over 200 Workng Groups are draftng IEEE standards. IEEE Computer Socety

Hierarchical clustering for gene expression data analysis

Hierarchical clustering for gene expression data analysis Herarchcal clusterng for gene expresson data analyss Gorgo Valentn e-mal: valentn@ds.unm.t Clusterng of Mcroarray Data. Clusterng of gene expresson profles (rows) => dscovery of co-regulated and functonally

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

Unsupervised Learning

Unsupervised Learning Pattern Recognton Lecture 8 Outlne Introducton Unsupervsed Learnng Parametrc VS Non-Parametrc Approach Mxture of Denstes Maxmum-Lkelhood Estmates Clusterng Prof. Danel Yeung School of Computer Scence and

More information

CS 534: Computer Vision Model Fitting

CS 534: Computer Vision Model Fitting CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust

More information

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes SPH3UW Unt 7.3 Sphercal Concave Mrrors Page 1 of 1 Notes Physcs Tool box Concave Mrror If the reflectng surface takes place on the nner surface of the sphercal shape so that the centre of the mrror bulges

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

Machine Learning. Topic 6: Clustering

Machine Learning. Topic 6: Clustering Machne Learnng Topc 6: lusterng lusterng Groupng data nto (hopefully useful) sets. Thngs on the left Thngs on the rght Applcatons of lusterng Hypothess Generaton lusters mght suggest natural groups. Hypothess

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique //00 :0 AM Outlne and Readng The Greedy Method The Greedy Method Technque (secton.) Fractonal Knapsack Problem (secton..) Task Schedulng (secton..) Mnmum Spannng Trees (secton.) Change Money Problem Greedy

More information

K-means and Hierarchical Clustering

K-means and Hierarchical Clustering Note to other teachers and users of these sldes. Andrew would be delghted f you found ths source materal useful n gvng your own lectures. Feel free to use these sldes verbatm, or to modfy them to ft your

More information

Active Contours/Snakes

Active Contours/Snakes Actve Contours/Snakes Erkut Erdem Acknowledgement: The sldes are adapted from the sldes prepared by K. Grauman of Unversty of Texas at Austn Fttng: Edges vs. boundares Edges useful sgnal to ndcate occludng

More information

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices Steps for Computng the Dssmlarty, Entropy, Herfndahl-Hrschman and Accessblty (Gravty wth Competton) Indces I. Dssmlarty Index Measurement: The followng formula can be used to measure the evenness between

More information

Graph-based Clustering

Graph-based Clustering Graphbased Clusterng Transform the data nto a graph representaton ertces are the data ponts to be clustered Edges are eghted based on smlarty beteen data ponts Graph parttonng Þ Each connected component

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

Analyzing Popular Clustering Algorithms from Different Viewpoints

Analyzing Popular Clustering Algorithms from Different Viewpoints 1000-9825/2002/13(08)1382-13 2002 Journal of Software Vol.13, No.8 Analyzng Popular Clusterng Algorthms from Dfferent Vewponts QIAN We-nng, ZHOU Ao-yng (Department of Computer Scence, Fudan Unversty, Shangha

More information

Unsupervised Learning and Clustering

Unsupervised Learning and Clustering Unsupervsed Learnng and Clusterng Why consder unlabeled samples?. Collectng and labelng large set of samples s costly Gettng recorded speech s free, labelng s tme consumng 2. Classfer could be desgned

More information

Unsupervised Learning and Clustering

Unsupervised Learning and Clustering Unsupervsed Learnng and Clusterng Supervsed vs. Unsupervsed Learnng Up to now we consdered supervsed learnng scenaro, where we are gven 1. samples 1,, n 2. class labels for all samples 1,, n Ths s also

More information

Fitting: Deformable contours April 26 th, 2018

Fitting: Deformable contours April 26 th, 2018 4/6/08 Fttng: Deformable contours Aprl 6 th, 08 Yong Jae Lee UC Davs Recap so far: Groupng and Fttng Goal: move from array of pxel values (or flter outputs) to a collecton of regons, objects, and shapes.

More information

Performance Evaluation of Information Retrieval Systems

Performance Evaluation of Information Retrieval Systems Why System Evaluaton? Performance Evaluaton of Informaton Retreval Systems Many sldes n ths secton are adapted from Prof. Joydeep Ghosh (UT ECE) who n turn adapted them from Prof. Dk Lee (Unv. of Scence

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 15

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 15 CS434a/541a: Pattern Recognton Prof. Olga Veksler Lecture 15 Today New Topc: Unsupervsed Learnng Supervsed vs. unsupervsed learnng Unsupervsed learnng Net Tme: parametrc unsupervsed learnng Today: nonparametrc

More information

Bidirectional Hierarchical Clustering for Web Mining

Bidirectional Hierarchical Clustering for Web Mining Bdrectonal Herarchcal Clusterng for Web Mnng ZHONGMEI YAO & BEN CHOI Computer Scence, College of Engneerng and Scence Lousana Tech Unversty, Ruston, LA 71272, USA zya001@latech.edu, pro@bencho.org Abstract

More information

Module Management Tool in Software Development Organizations

Module Management Tool in Software Development Organizations Journal of Computer Scence (5): 8-, 7 ISSN 59-66 7 Scence Publcatons Management Tool n Software Development Organzatons Ahmad A. Al-Rababah and Mohammad A. Al-Rababah Faculty of IT, Al-Ahlyyah Amman Unversty,

More information

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour 6.854 Advanced Algorthms Petar Maymounkov Problem Set 11 (November 23, 2005) Wth: Benjamn Rossman, Oren Wemann, and Pouya Kheradpour Problem 1. We reduce vertex cover to MAX-SAT wth weghts, such that the

More information

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents

More information

The Codesign Challenge

The Codesign Challenge ECE 4530 Codesgn Challenge Fall 2007 Hardware/Software Codesgn The Codesgn Challenge Objectves In the codesgn challenge, your task s to accelerate a gven software reference mplementaton as fast as possble.

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information

A Clustering Algorithm for Key Frame Extraction Based on Density Peak

A Clustering Algorithm for Key Frame Extraction Based on Density Peak Journal of Computer and Communcatons, 2018, 6, 118-128 http://www.scrp.org/ournal/cc ISSN Onlne: 2327-5227 ISSN Prnt: 2327-5219 A Clusterng Algorthm for Key Frame Extracton Based on Densty Peak Hong Zhao

More information

APPLICATION OF A COMPUTATIONALLY EFFICIENT GEOSTATISTICAL APPROACH TO CHARACTERIZING VARIABLY SPACED WATER-TABLE DATA

APPLICATION OF A COMPUTATIONALLY EFFICIENT GEOSTATISTICAL APPROACH TO CHARACTERIZING VARIABLY SPACED WATER-TABLE DATA RFr"W/FZD JAN 2 4 1995 OST control # 1385 John J Q U ~ M Argonne Natonal Laboratory Argonne, L 60439 Tel: 708-252-5357, Fax: 708-252-3 611 APPLCATON OF A COMPUTATONALLY EFFCENT GEOSTATSTCAL APPROACH TO

More information

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr)

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr) Helsnk Unversty Of Technology, Systems Analyss Laboratory Mat-2.08 Independent research projects n appled mathematcs (3 cr) "! #$&% Antt Laukkanen 506 R ajlaukka@cc.hut.f 2 Introducton...3 2 Multattrbute

More information

Hierarchical agglomerative. Cluster Analysis. Christine Siedle Clustering 1

Hierarchical agglomerative. Cluster Analysis. Christine Siedle Clustering 1 Herarchcal agglomeratve Cluster Analyss Chrstne Sedle 19-3-2004 Clusterng 1 Classfcaton Basc (unconscous & conscous) human strategy to reduce complexty Always based Cluster analyss to fnd or confrm types

More information

An Entropy-Based Approach to Integrated Information Needs Assessment

An Entropy-Based Approach to Integrated Information Needs Assessment Dstrbuton Statement A: Approved for publc release; dstrbuton s unlmted. An Entropy-Based Approach to ntegrated nformaton Needs Assessment June 8, 2004 Wllam J. Farrell Lockheed Martn Advanced Technology

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces Range mages For many structured lght scanners, the range data forms a hghly regular pattern known as a range mage. he samplng pattern s determned by the specfc scanner. Range mage regstraton 1 Examples

More information

SCALABLE AND VISUALIZATION-ORIENTED CLUSTERING FOR EXPLORATORY SPATIAL ANALYSIS

SCALABLE AND VISUALIZATION-ORIENTED CLUSTERING FOR EXPLORATORY SPATIAL ANALYSIS SCALABLE AND VISUALIZATION-ORIENTED CLUSTERING FOR EXPLORATORY SPATIAL ANALYSIS J.H.Guan, F.B.Zhu, F.L.Ban a School of Computer, Spatal Informaton & Dgtal Engneerng Center, Wuhan Unversty, Wuhan, 430079,

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

Solving two-person zero-sum game by Matlab

Solving two-person zero-sum game by Matlab Appled Mechancs and Materals Onlne: 2011-02-02 ISSN: 1662-7482, Vols. 50-51, pp 262-265 do:10.4028/www.scentfc.net/amm.50-51.262 2011 Trans Tech Publcatons, Swtzerland Solvng two-person zero-sum game by

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1 4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:

More information

Programming in Fortran 90 : 2017/2018

Programming in Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Exercse 1 : Evaluaton of functon dependng on nput Wrte a program who evaluate the functon f (x,y) for any two user specfed values

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

Outline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like:

Outline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like: Self-Organzng Maps (SOM) Turgay İBRİKÇİ, PhD. Outlne Introducton Structures of SOM SOM Archtecture Neghborhoods SOM Algorthm Examples Summary 1 2 Unsupervsed Hebban Learnng US Hebban Learnng, Cntd 3 A

More information

Background Removal in Image indexing and Retrieval

Background Removal in Image indexing and Retrieval Background Removal n Image ndexng and Retreval Y Lu and Hong Guo Department of Electrcal and Computer Engneerng The Unversty of Mchgan-Dearborn Dearborn Mchgan 4818-1491, U.S.A. Voce: 313-593-508, Fax:

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Decson surface s a hyperplane (lne n 2D) n feature space (smlar to the Perceptron) Arguably, the most mportant recent dscovery n machne learnng In a nutshell: map the data to a predetermned

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

Lecture 5: Multilayer Perceptrons

Lecture 5: Multilayer Perceptrons Lecture 5: Multlayer Perceptrons Roger Grosse 1 Introducton So far, we ve only talked about lnear models: lnear regresson and lnear bnary classfers. We noted that there are functons that can t be represented

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

A Deflected Grid-based Algorithm for Clustering Analysis

A Deflected Grid-based Algorithm for Clustering Analysis A Deflected Grd-based Algorthm for Clusterng Analyss NANCY P. LIN, CHUNG-I CHANG, HAO-EN CHUEH, HUNG-JEN CHEN, WEI-HUA HAO Department of Computer Scence and Informaton Engneerng Tamkang Unversty 5 Yng-chuan

More information

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Proceedngs of the Wnter Smulaton Conference M E Kuhl, N M Steger, F B Armstrong, and J A Jones, eds A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Mark W Brantley Chun-Hung

More information

Simplification of 3D Meshes

Simplification of 3D Meshes Smplfcaton of 3D Meshes Addy Ngan /4/00 Outlne Motvaton Taxonomy of smplfcaton methods Hoppe et al, Mesh optmzaton Hoppe, Progressve meshes Smplfcaton of 3D Meshes 1 Motvaton Hgh detaled meshes becomng

More information

Learning-Based Top-N Selection Query Evaluation over Relational Databases

Learning-Based Top-N Selection Query Evaluation over Relational Databases Learnng-Based Top-N Selecton Query Evaluaton over Relatonal Databases Lang Zhu *, Wey Meng ** * School of Mathematcs and Computer Scence, Hebe Unversty, Baodng, Hebe 071002, Chna, zhu@mal.hbu.edu.cn **

More information

Clustering Algorithm of Similarity Segmentation based on Point Sorting

Clustering Algorithm of Similarity Segmentation based on Point Sorting Internatonal onference on Logstcs Engneerng, Management and omputer Scence (LEMS 2015) lusterng Algorthm of Smlarty Segmentaton based on Pont Sortng Hanbng L, Yan Wang*, Lan Huang, Mngda L, Yng Sun, Hanyuan

More information

Clustering. A. Bellaachia Page: 1

Clustering. A. Bellaachia Page: 1 Clusterng. Obectves.. Clusterng.... Defntons... General Applcatons.3. What s a good clusterng?. 3.4. Requrements 3 3. Data Structures 4 4. Smlarty Measures. 4 4.. Standardze data.. 5 4.. Bnary varables..

More information

Smoothing Spline ANOVA for variable screening

Smoothing Spline ANOVA for variable screening Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory

More information

Accounting for the Use of Different Length Scale Factors in x, y and z Directions

Accounting for the Use of Different Length Scale Factors in x, y and z Directions 1 Accountng for the Use of Dfferent Length Scale Factors n x, y and z Drectons Taha Soch (taha.soch@kcl.ac.uk) Imagng Scences & Bomedcal Engneerng, Kng s College London, The Rayne Insttute, St Thomas Hosptal,

More information

Sequential search. Building Java Programs Chapter 13. Sequential search. Sequential search

Sequential search. Building Java Programs Chapter 13. Sequential search. Sequential search Sequental search Buldng Java Programs Chapter 13 Searchng and Sortng sequental search: Locates a target value n an array/lst by examnng each element from start to fnsh. How many elements wll t need to

More information

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and

More information

Wishing you all a Total Quality New Year!

Wishing you all a Total Quality New Year! Total Qualty Management and Sx Sgma Post Graduate Program 214-15 Sesson 4 Vnay Kumar Kalakband Assstant Professor Operatons & Systems Area 1 Wshng you all a Total Qualty New Year! Hope you acheve Sx sgma

More information

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009.

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009. Farrukh Jabeen Algorthms 51 Assgnment #2 Due Date: June 15, 29. Assgnment # 2 Chapter 3 Dscrete Fourer Transforms Implement the FFT for the DFT. Descrbed n sectons 3.1 and 3.2. Delverables: 1. Concse descrpton

More information

Hermite Splines in Lie Groups as Products of Geodesics

Hermite Splines in Lie Groups as Products of Geodesics Hermte Splnes n Le Groups as Products of Geodescs Ethan Eade Updated May 28, 2017 1 Introducton 1.1 Goal Ths document defnes a curve n the Le group G parametrzed by tme and by structural parameters n the

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z.

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z. TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS Muradalyev AZ Azerbajan Scentfc-Research and Desgn-Prospectng Insttute of Energetc AZ1012, Ave HZardab-94 E-mal:aydn_murad@yahoocom Importance of

More information

Problem Set 3 Solutions

Problem Set 3 Solutions Introducton to Algorthms October 4, 2002 Massachusetts Insttute of Technology 6046J/18410J Professors Erk Demane and Shaf Goldwasser Handout 14 Problem Set 3 Solutons (Exercses were not to be turned n,

More information

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices Internatonal Mathematcal Forum, Vol 7, 2012, no 52, 2549-2554 An Applcaton of the Dulmage-Mendelsohn Decomposton to Sparse Null Space Bases of Full Row Rank Matrces Mostafa Khorramzadeh Department of Mathematcal

More information

Type-2 Fuzzy Non-uniform Rational B-spline Model with Type-2 Fuzzy Data

Type-2 Fuzzy Non-uniform Rational B-spline Model with Type-2 Fuzzy Data Malaysan Journal of Mathematcal Scences 11(S) Aprl : 35 46 (2017) Specal Issue: The 2nd Internatonal Conference and Workshop on Mathematcal Analyss (ICWOMA 2016) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES

More information

Survey of Cluster Analysis and its Various Aspects

Survey of Cluster Analysis and its Various Aspects Harmnder Kaur et al, Internatonal Journal of Computer Scence and Moble Computng, Vol.4 Issue.0, October- 05, pg. 353-363 Avalable Onlne at www.csmc.com Internatonal Journal of Computer Scence and Moble

More information

FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK

FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK L-qng Qu, Yong-quan Lang 2, Jng-Chen 3, 2 College of Informaton Scence and Technology, Shandong Unversty of Scence and Technology,

More information

Image Alignment CSC 767

Image Alignment CSC 767 Image Algnment CSC 767 Image algnment Image from http://graphcs.cs.cmu.edu/courses/15-463/2010_fall/ Image algnment: Applcatons Panorama sttchng Image algnment: Applcatons Recognton of object nstances

More information

Concurrent Apriori Data Mining Algorithms

Concurrent Apriori Data Mining Algorithms Concurrent Apror Data Mnng Algorthms Vassl Halatchev Department of Electrcal Engneerng and Computer Scence York Unversty, Toronto October 8, 2015 Outlne Why t s mportant Introducton to Assocaton Rule Mnng

More information

An Image Fusion Approach Based on Segmentation Region

An Image Fusion Approach Based on Segmentation Region Rong Wang, L-Qun Gao, Shu Yang, Yu-Hua Cha, and Yan-Chun Lu An Image Fuson Approach Based On Segmentaton Regon An Image Fuson Approach Based on Segmentaton Regon Rong Wang, L-Qun Gao, Shu Yang 3, Yu-Hua

More information

y and the total sum of

y and the total sum of Lnear regresson Testng for non-lnearty In analytcal chemstry, lnear regresson s commonly used n the constructon of calbraton functons requred for analytcal technques such as gas chromatography, atomc absorpton

More information

Web Mining: Clustering Web Documents A Preliminary Review

Web Mining: Clustering Web Documents A Preliminary Review Web Mnng: Clusterng Web Documents A Prelmnary Revew Khaled M. Hammouda Department of Systems Desgn Engneerng Unversty of Waterloo Waterloo, Ontaro, Canada 2L 3G1 hammouda@pam.uwaterloo.ca February 26,

More information

Kent State University CS 4/ Design and Analysis of Algorithms. Dept. of Math & Computer Science LECT-16. Dynamic Programming

Kent State University CS 4/ Design and Analysis of Algorithms. Dept. of Math & Computer Science LECT-16. Dynamic Programming CS 4/560 Desgn and Analyss of Algorthms Kent State Unversty Dept. of Math & Computer Scence LECT-6 Dynamc Programmng 2 Dynamc Programmng Dynamc Programmng, lke the dvde-and-conquer method, solves problems

More information

BRDPHHC: A Balance RDF Data Partitioning Algorithm based on Hybrid Hierarchical Clustering

BRDPHHC: A Balance RDF Data Partitioning Algorithm based on Hybrid Hierarchical Clustering 015 IEEE 17th Internatonal Conference on Hgh Performance Computng and Communcatons (HPCC), 015 IEEE 7th Internatonal Symposum on Cyberspace Safety and Securty (CSS), and 015 IEEE 1th Internatonal Conf

More information

Face Recognition University at Buffalo CSE666 Lecture Slides Resources:

Face Recognition University at Buffalo CSE666 Lecture Slides Resources: Face Recognton Unversty at Buffalo CSE666 Lecture Sldes Resources: http://www.face-rec.org/algorthms/ Overvew of face recognton algorthms Correlaton - Pxel based correspondence between two face mages Structural

More information

Mathematics 256 a course in differential equations for engineering students

Mathematics 256 a course in differential equations for engineering students Mathematcs 56 a course n dfferental equatons for engneerng students Chapter 5. More effcent methods of numercal soluton Euler s method s qute neffcent. Because the error s essentally proportonal to the

More information

MOTION PANORAMA CONSTRUCTION FROM STREAMING VIDEO FOR POWER- CONSTRAINED MOBILE MULTIMEDIA ENVIRONMENTS XUNYU PAN

MOTION PANORAMA CONSTRUCTION FROM STREAMING VIDEO FOR POWER- CONSTRAINED MOBILE MULTIMEDIA ENVIRONMENTS XUNYU PAN MOTION PANORAMA CONSTRUCTION FROM STREAMING VIDEO FOR POWER- CONSTRAINED MOBILE MULTIMEDIA ENVIRONMENTS by XUNYU PAN (Under the Drecton of Suchendra M. Bhandarkar) ABSTRACT In modern tmes, more and more

More information

Keyword-based Document Clustering

Keyword-based Document Clustering Keyword-based ocument lusterng Seung-Shk Kang School of omputer Scence Kookmn Unversty & AIrc hungnung-dong Songbuk-gu Seoul 36-72 Korea sskang@kookmn.ac.kr Abstract ocument clusterng s an aggregaton of

More information

Query Clustering Using a Hybrid Query Similarity Measure

Query Clustering Using a Hybrid Query Similarity Measure Query clusterng usng a hybrd query smlarty measure Fu. L., Goh, D.H., & Foo, S. (2004). WSEAS Transacton on Computers, 3(3), 700-705. Query Clusterng Usng a Hybrd Query Smlarty Measure Ln Fu, Don Hoe-Lan

More information

Parallel matrix-vector multiplication

Parallel matrix-vector multiplication Appendx A Parallel matrx-vector multplcaton The reduced transton matrx of the three-dmensonal cage model for gel electrophoress, descrbed n secton 3.2, becomes excessvely large for polymer lengths more

More information

APPLIED MACHINE LEARNING

APPLIED MACHINE LEARNING Methods for Clusterng K-means, Soft K-means DBSCAN 1 Objectves Learn basc technques for data clusterng K-means and soft K-means, GMM (next lecture) DBSCAN Understand the ssues and major challenges n clusterng

More information

Proper Choice of Data Used for the Estimation of Datum Transformation Parameters

Proper Choice of Data Used for the Estimation of Datum Transformation Parameters Proper Choce of Data Used for the Estmaton of Datum Transformaton Parameters Hakan S. KUTOGLU, Turkey Key words: Coordnate systems; transformaton; estmaton, relablty. SUMMARY Advances n technologes and

More information

Clustering algorithms and validity measures

Clustering algorithms and validity measures Clusterng algorthms and valdty measures M. Hald, Y. Batstas, M. Vazrganns Department of Informatcs Athens Unversty of Economcs & Busness Emal: {mhal, yanns, mvazrg}@aueb.gr Abstract Clusterng ams at dscoverng

More information

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

CS246: Mining Massive Datasets Jure Leskovec, Stanford University CS46: Mnng Massve Datasets Jure Leskovec, Stanford Unversty http://cs46.stanford.edu /19/013 Jure Leskovec, Stanford CS46: Mnng Massve Datasets, http://cs46.stanford.edu Perceptron: y = sgn( x Ho to fnd

More information

Vanishing Hull. Jinhui Hu, Suya You, Ulrich Neumann University of Southern California {jinhuihu,suyay,

Vanishing Hull. Jinhui Hu, Suya You, Ulrich Neumann University of Southern California {jinhuihu,suyay, Vanshng Hull Jnhu Hu Suya You Ulrch Neumann Unversty of Southern Calforna {jnhuhusuyay uneumann}@graphcs.usc.edu Abstract Vanshng ponts are valuable n many vson tasks such as orentaton estmaton pose recovery

More information

Available online at Available online at Advanced in Control Engineering and Information Science

Available online at   Available online at   Advanced in Control Engineering and Information Science Avalable onlne at wwwscencedrectcom Avalable onlne at wwwscencedrectcom Proceda Proceda Engneerng Engneerng 00 (2011) 15000 000 (2011) 1642 1646 Proceda Engneerng wwwelsevercom/locate/proceda Advanced

More information

Computer Animation and Visualisation. Lecture 4. Rigging / Skinning

Computer Animation and Visualisation. Lecture 4. Rigging / Skinning Computer Anmaton and Vsualsaton Lecture 4. Rggng / Sknnng Taku Komura Overvew Sknnng / Rggng Background knowledge Lnear Blendng How to decde weghts? Example-based Method Anatomcal models Sknnng Assume

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

Object-Based Techniques for Image Retrieval

Object-Based Techniques for Image Retrieval 54 Zhang, Gao, & Luo Chapter VII Object-Based Technques for Image Retreval Y. J. Zhang, Tsnghua Unversty, Chna Y. Y. Gao, Tsnghua Unversty, Chna Y. Luo, Tsnghua Unversty, Chna ABSTRACT To overcome the

More information

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 An Iteratve Soluton Approach to Process Plant Layout usng Mxed

More information

Fitting & Matching. Lecture 4 Prof. Bregler. Slides from: S. Lazebnik, S. Seitz, M. Pollefeys, A. Effros.

Fitting & Matching. Lecture 4 Prof. Bregler. Slides from: S. Lazebnik, S. Seitz, M. Pollefeys, A. Effros. Fttng & Matchng Lecture 4 Prof. Bregler Sldes from: S. Lazebnk, S. Setz, M. Pollefeys, A. Effros. How do we buld panorama? We need to match (algn) mages Matchng wth Features Detect feature ponts n both

More information

Simulation Based Analysis of FAST TCP using OMNET++

Simulation Based Analysis of FAST TCP using OMNET++ Smulaton Based Analyss of FAST TCP usng OMNET++ Umar ul Hassan 04030038@lums.edu.pk Md Term Report CS678 Topcs n Internet Research Sprng, 2006 Introducton Internet traffc s doublng roughly every 3 months

More information

Image Representation & Visualization Basic Imaging Algorithms Shape Representation and Analysis. outline

Image Representation & Visualization Basic Imaging Algorithms Shape Representation and Analysis. outline mage Vsualzaton mage Vsualzaton mage Representaton & Vsualzaton Basc magng Algorthms Shape Representaton and Analyss outlne mage Representaton & Vsualzaton Basc magng Algorthms Shape Representaton and

More information

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 A mathematcal programmng approach to the analyss, desgn and

More information

Multi-stable Perception. Necker Cube

Multi-stable Perception. Necker Cube Mult-stable Percepton Necker Cube Spnnng dancer lluson, Nobuuk Kaahara Fttng and Algnment Computer Vson Szelsk 6.1 James Has Acknowledgment: Man sldes from Derek Hoem, Lana Lazebnk, and Grauman&Lebe 2008

More information

Lecture #15 Lecture Notes

Lecture #15 Lecture Notes Lecture #15 Lecture Notes The ocean water column s very much a 3-D spatal entt and we need to represent that structure n an economcal way to deal wth t n calculatons. We wll dscuss one way to do so, emprcal

More information