Fuzzy Clustering in Parallel Universes

Size: px
Start display at page:

Download "Fuzzy Clustering in Parallel Universes"

Transcription

1 Fuzzy Clustering in Parallel Universes Bern Wisweel an Michael R. Berthol ALTANA-Chair for Bioinformatics an Information Mining Department of Computer an Information Science, University of Konstanz Konstanz, Germany Abstract We present an extension of the fuzzy c-means algorithm, which operates simultaneously on ifferent feature spaces so-calle parallel universes an also incorporates noise etection. The metho assigns membership values of patterns to ifferent universes, which are then aopte throughout the training. This leas to better clustering results since patterns not contributing to clustering in a universe are completely or partially) ignore. The metho also uses an auxiliary universe to capture patterns that o not contribute to any of the clusters in the real universes an therefore liely represent noise. The outcome of the algorithm are clusters istribute over ifferent parallel universes, each moeling a particular, potentially overlapping, subset of the ata an a set of patterns etecte as noise. One potential target application of the propose metho is biological ata analysis where ifferent escriptors for molecules are available but none of them by itself shows global satisfactory preiction results. Key wors: Fuzzy Clustering, Objective Function, Multiple Descriptor Spaces, Noise Hanling Introuction In recent years, researchers have wore extensively in the fiel of cluster analysis, which has resulte in a wie range of fuzzy) clustering algorithms [9,]. Most of the methos assume the ata to be given in a single mostly numeric) feature space. In some applications, however, it is common to have multiple representations of the ata available. Such applications inclue biological ata analysis, in which, e. g. molecular similarity can be efine in various ways. Fingerprints are the most commonly use similarity measure. A fingerprint in aress: {wisweel,berthol}@inf.uni-onstanz.e Bern Wisweel an Michael R. Berthol). Preprint submitte to Elsevier Science 3 October 25

2 a molecular sense is usually a binary vector, whereby each bit inicates the presence or absence of a molecular feature. The similarity of two compouns can be expresse base on their bit vectors using the Tanimoto coefficient for example. Other escriptors encoe numerical features erive from 3D maps, incorporating the molecular size an shape, hyrophilic an hyrophobic regions quantification, surface charge istribution, etc. [6]. Further similarities involve the comparison of chemical graphs, inter-atomic istances, an molecular fiel escriptors. However, it has been shown that often a single escriptor fails to show satisfactory preiction results [6]. Other application omains inclue web mining where a ocument can be escribe base on its content an on anchor texts of hyperlins pointing to it [4]. 3D objects as use in CAD-catalogues, virtual reality applications, meicine an many other omains can be escribe, for instance, by various so-calle feature vectors, i. e. vector of scalars whose carinalities can easily reach couple of hunres. Feature vectors can rely on ifferent statistics of the 3D object, projection methos, volumetric representations obtaine by iscretizing the object s surface, 2D images, or topological matchings. Bustos et al. [5] provie a survey of feature-base similarity measures of 3D objects. In the following we enote these multiple representations, i. e. ifferent escriptor spaces, as Parallel Universes [4], each of which having representations of all objects of the ata set. The challenge that we are facing here is to tae avantage of the information encoe in the ifferent universes to fin clusters that resie in one or more universes each moeling one particular subset of the ata. In this paper, we evelop an extene fuzzy c-means FCM) algorithm [] with noise etection that is applicable to parallel universes, by assigning membership values from objects to universes. The optimization of the objective function is similar to the original FCM but also inclues the learning of the membership values to compute the impact of objects to universes. In the next section, we iscuss in more etail the concept of parallel universes; section 3 presents relate wor. We formulate our new objective function in section 4, introuce the clustering algorithm in section 5 an illustrate its usefulness with some numeric examples in section 6. 2 Parallel Universes We consier parallel universes to be a set of feature spaces for a given set of objects. Each object is assigne a representation in each single universe. Typically, parallel universes encoe ifferent properties of the ata an thus lea to ifferent measures of similarity. For instance, similarity of molecular com- 2

3 pouns can be base on surface charge istribution or a specific fingerprint representation.) Note, ue to these iniviual measurements they can also show ifferent structural information an therefore exhibit istinctive clustering. This property iffers from the problem setting in the so-calle Multi-View Clustering [3] where a single universe, i. e. view, suffices for learning but the aim is on bining ifferent views to improve the classification accuracy an/or accelerating the learning process. As it often causes confusion, we want to emphasize the ifference of the concept of parallel universes to feature selection methos [2], feature transformation such as principle component analysis an singular value ecomposition), an subspace clustering [3,8], whose problem efinitions soun similar at first but are very ifferent from what we iscuss here. Feature selection methos attempt to iscover attributes in a ata set that are most relevant to the tas at han. Subspace clustering is an extension of feature selection that sees to ientify ifferent subspaces, i. e. subsets of input features, for the same ataset. These algorithms become particularly useful when ealing with high-imensional ata, where often, many imensions are irrelevant an can mas existing clusters in noise. The main goal of such algorithms is therefore to uncover subsets of attributes subspaces), on which subsets of the ata are self-similar, i. e. buil subspace-clusters, whereas the clustering in parallel universes is given the efinition of semantically meaningful universes along with representations of all ata in them an the goal is to exploit this information. The objective for our problem efinition is on ientifying clusters locate in ifferent universes whereby each cluster moels a subset of the ata base on some unerlying property. Since stanar clustering techniques are not able to cope with parallel universes, one coul either restrict the analysis to a single universe at a time or efine a escriptor space comprising all universes. However, using only one particular universe omits information encoe in the other representations an the construction of a joint feature space an the erivation of an appropriate istance measure are cumbersome an require great care as it can introuce artifacts or hie an loose clusters that were apparent in a single universe. 3 Relate Wor Clustering in parallel universes is a relatively new fiel of research an was first mentione in [4]. In [], the DBSCAN algorithm is extene an applie to parallel universes. DBSCAN uses the notion of ense regions by means of core objects, i. e. objects that have a minimum number of objects in their ɛ-) neighborhoo. A cluster is then efine as a set of connecte) ense regions. The authors exten this concept in two ifferent ways: They efine an object 3

4 as a neighbor of a core object if it is in the ɛ-neighborhoo of this core object either ) in any of the representations or 2) in all of them. The cluster size is finally etermine through appropriate values of ɛ an. Case ) seems rather wea, having objects in one cluster even though they might not be similar in any of the representational feature spaces. Case 2), in comparison, is very conservative since it oes not reveal local clusters, i. e. subsets of the ata that only group in a single universe. However, the results in [] are promising. Another clustering scheme calle Collaborative fuzzy clustering is base on the FCM algorithm an was introuce in [5]. The author proposes an architecture in which objects escribe in parallel universes can be processe together with the objective of fining structures that are common to all universes. Clustering is carrie out by applying the c-means algorithm to all universes iniviually an then by exchanging information from the local clustering results base on the partitioning matrices. Note, the objective function, as introuce in [5], assumes the same number of clusters in each universe an, moreover, a global orer on the clusters which is very restrictive ue to the ranom initialization of FCM. A supervise clustering technique for parallel universes was given in [4]. It focuses on a moel for a particular minor) class of interest by constructing local neighborhoo histograms, so-calle Neighborgrams for each object of interest in each universe. The algorithm assigns a quality value to each Neighborgram an greeily inclues the best Neighborgram, no matter from which universe it stems, in the global preiction moel. Objects that are covere by this Neighborgram are finally remove from consieration in a sequential covering manner. This process is repeate until the global moel has sufficient preictive power. Although the algorithm is powerful to moel a minority class, it suffers from computational complexity on larger ata sets. Blum an Mitchell [4] introuce co-training as a semi-supervise proceure whereby two ifferent hypotheses are traine on two istinct representations an then bootstrap each other. In particular they consier the problem of classifying web pages base on the ocument itself an on anchor texts of inboun hyperlins. They require a conitional inepenence of both universes an state that each representation shoul suffice for learning if enough labele ata were available. The benefit of their strategy is that inexpensive) unlabele ata augment the expensive) labele ata by using the preiction in one universe to support the ecision maing in the other. Other relate wor inclues reinforcement clustering [8] an extensions of partitioning methos such as -Means, -Meois, an EM an hierarchical, agglomerative methos, all in [3]. 4

5 4 Objective Functions In this section, we introuce all necessary notation, review the FCM [,7] algorithm an formulate two new objective functions that are suitable to be use for parallel universes. The first one is a generic function that, similar to the stanar FCM, has no explicit noise hanling an therefore forces a cluster membership preiction for each pattern while the secon objective function also incorporates noise etection an, hence, allows patterns to not participate to any cluster. The technical etails, i. e. the erivation of the objective functions, can be foun in the appenix. In the following, we consier U, u U, parallel universe, each having representational feature vectors for all objects x u) i = x u) i,,..., x u i,a,... x u i,a u ) with A u inicating the imensionality of the u-th universe. We epict the overall number of objects as T, i T. We are intereste in ientifying K u clusters in universe u. We further assume appropriate efinitions of istance functions for each universe u) w u), xu) u) i where w = w u),,..., wu),a,... wu),a u ) enotes the -th prototype in the u-th universe. We confine ourselves to the Eucliean istance in the following. In general, there are no restrictions to the istance metrics other than ifferentiability. In particular, they o not nee to be of the same type in all universes. This is important to note, since we can use the propose algorithm in the same feature space, i. e. x u ) i = x u 2) i for some u an u 2, but ifferent istance measures in these universes. 4. Objective Function with No Noise Detection The stanar FCM algorithm relies on one feature space only an minimizes the accumulate sum of istances between patterns x i an cluster centers w, weighte by the egree of membership to which a pattern belongs to a cluster. Note that we omit the subscript u here, as we consier only one universe: T K J m = vi, m w, x i. ) i= = The coefficient m, ) is a fuzzyfication parameter, an v i, the respective value from the partition matrix, i. e. the egree to which pattern x i belongs to cluster. 5

6 This function is subject to minimization uner the constraint K i : v i, =, 2) = requiring that the coverage of any pattern i nees to accumulate to. The above objective function assumes all cluster caniates to be locate in the same feature space an is therefore not irectly applicable to parallel universes. To overcome this, we introuce a matrix z i,u ) i T, u U encoing the membership of patterns to universes. A value z i,u close to enotes a strong contribution of pattern x i to the clustering in universe u, an a smaller value, a respectively lesser egree. The new objective function is given by J m,m = T U i= u= K u z i,u = v u) i, u) w u), xu) i 3) Parameter m, ) allows analogous to m) to have impact on the fuzzyfication of z i,u : The larger m, the more equal the istribution of z i,u, giving each pattern an equal impact to all universes. A value close to will strengthen the composition of z i,u an assign high values to universes where a pattern shows goo clustering behavior an small values to those where it oes not. Note, we now have U, u U, ifferent partition matrices v u) ) i, to i T, K u assign membership egrees of objects to cluster prototypes in each universe. As in the stanar FCM algorithm, the objective function has to fulfill sie constraints. The coverage of a pattern among the partitions in each universe must accumulate to : i, u : K u = v u) i, =. 4) This is similar to the constraint of the single universe FCM in 2) an is require for each universe iniviually. Aitionally, the membership of a pattern to ifferent universes z i,u has to satisfy stanar requirements for membership egrees: it must accumulate to for each object consiering all universes an must be in the unit interval, i. e. U i : z i,u =. 5) u= 6

7 The minimization is one with respect to the parameters v u) i,, z i,u, an w u). The erivation of objective function 3) can be foun in the appenix, the final upate equations are given by A.2), A.7), an A.4). 4.2 Objective Function with Noise Detection The objective function as introuce in the previous section has one major rawbac: Patterns that o not contribute to any of the clusters in any universe still have a great impact on the cluster formation as the cluster memberships for each iniviual pattern nee to sum to one. This is not avantageous since ata sets in many real worl applications, if not all, contain outliers or noisy patterns. Particularly in the presente application omain it may happen that certain structural properties of the ata are not capture by any of the given semantically meaningful!) universes an therefore this portion of the ata appears to be noise. The ientification of these patterns is important for two reasons: First, as note above, these patterns influence the cluster formation an can lea to istorte clusters. Seconly, noise patterns may lea to insights on which properties of the unerlying ata are not well moele by any of the universe efinitions an therefore give hints as to what nees to be aresse when efining new universes or similarity measures. In orer to incorporate noise etection we nee to exten our objective function such that it also allows the explicit notion of noise. We aopt an extension introuce by Davé [7], which wors on the single universe FCM. The objective function accoring to Davé is given by: T T K J m = vi, m w, x i K + noise v i,. 6) i= = i= = This equation is similar to ) except for the last term. It serves as a noise cluster; all objects have a fixe, user-efine istance noise to this noise cluster. Objects that are not close to any cluster center w can therefore be etecte as noise. The constraint 2ust be soften: K i : v i,, 7) = requiring that the coverage of any pattern i nees to accumulate to at most the remainer to represents the membership to the noise cluster). Similar to the last term in 6), we a a new term to our new objective function 3) whose role is to localize the noise an place it in a single auxiliary 7

8 universe: T J m,m = U i= u= K u z i,u = T + noise i= v u) i, u) w u) U z i,u. u=, xu) i 8) By assigning patterns to this noise universe, we eclare them to be outliers in the ata set. The parameter noise reflects the fixe istance between a virtual cluster in the noise universe an all ata points. Hence, if the minimum istance between a ata point an any cluster in one of the universes is greater than noise, the pattern is labele as noise. The optimization splits into three parts: optimization of the partition values v u) i, for each universe; etermining the membership egrees of patterns to universes z i,u an finally the aaption of the center vectors of the cluster representatives w u). The upate equations of these parameters are given as follows. For the partition values v i,, we get v u) i, = K u u) = u) w u), xu) i w u), xu) i m. 9) Note, this equation is inepenent of the values z i,u an is therefore ientical to the upate expression in the single universe FCM. The optimization with respect to z i,u yiels z i,u = U ū= Ku v u) = i, K ū = v ū) i, u) ū) w u), xu) i w ū), xū) i + noise m, ) an finally the upate equation for the aaption of the prototype vectors w u) is of the form w u) = ) i= z i,u u u) v i, x i ) T i= z i,u u. ) v i, T 8

9 Thus, the upate of the prototypes epens not only on the partitioning value v u) i,, i. e. the egree to which pattern i belongs to cluster in universe u, but also to z i,u representing the membership egrees of patterns to the current universe of interest. Patterns with larger values z i,u will contribute more to the aaption of the prototype vectors, while patterns with a smaller egree accoringly to a lesser extent. Equippe with these upate equations, we can introuce the overall clustering scheme in the next section. 5 Clustering algorithm Similar to the stanar FCM algorithm, clustering is carrie out in an iterative manner, involving three steps: ) Upate of the partition matrices v u) i, ) 2) Upate of the membership egrees z i,u ) 3) Upate of the prototypes w u) ) More precisely, the clustering proceure is given as: ) Given: Input pattern set escribe in U parallel universes: x u) i, i T, u U. 2) Select: A set of istance metrics u),, an the number of clusters for each universe K u, u U, efine parameter m an m. 3) Initialize: Partition parameters v u) i, with ranom values an the cluster prototypes by rawing samples from the ata. Assign equal weights to all membership egrees z i,u =. U 4) Train: 5) Repeat 6) Upate partitioning values v u) i, accoring to 9) 7) Upate membership egrees z i,u accoring to ) 8) Compute prototypes w u) i using ) 9) until a termination criterion has been satisfie. The algorithm starts with a given set of universe efinitions an the specification of the istance metrics to be use. Also, the number of clusters in each universe nees to be efine in avance. The membership egrees z i,u are initialize with equal weight line 3)), thus having the same impact on 9

10 all universes. The optimization phase in line 5) to 9) is in comparison to the stanar FCM algorithm extene by the optimization of the universe membership egrees, line 7). The possibilities for the termination criterion in line 9) are manifol, as is also the case in the stanar FCM. One can stop after a certain number of iterations or use the change of the value of the objective function 3) between two successive iterations as stopping criteria. There are also more sophisticate approaches, for instance the change to the partition matrices uring optimization. Just lie the FCM algorithm, this metho suffers from the fact that the user has to specify the number of prototypes to be foun. Furthermore, our approach even requires the efinition of cluster counts per universe. There are numerous approaches to suggest the number of clusters in the case of the stanar FCM, [9,7,2] to name but a few. Although we have not yet stuie their applicability to our problem efinition we o believe that some of them can be aapte naturally to be use in our context as well. 6 Experimental Results In orer to emonstrate the propose approach, we generate synthetic ata sets with ifferent numbers of parallel universes. For simplicity an in orer to visualize the results we restricte the size of a universe to 2 imensions an generate 2 Gaussian istribute clusters per universe. We use 4 patterns to buil groupings by assigning each object to one of the universes an rawing its features in that universe accoring to the istribution of the cluster ranomly picing one of the two). The features of that object in the other universes were rawn from a uniform istribution, i. e. they liely represent noise in these universes unless they fall, by chance, into one of the clusters). Figure shows an example ata set with three universes. The top figures show only the objects that were generate to cluster in the respective universe. The bottom figures show all patterns, i. e. also the patterns that cluster in the other universes. They efine the reference clustering. In this example, when looing solely at one universe, about 2/3 of the ata oes not contribute to clustering an therefore are noise in that universe. To compare the results we applie the FCM algorithm [] to the joint feature space of all universes an set the number of esire clusters to the overall number of generate clusters. The cluster membership ecision for the single-universe FCM is base on the highest value of the partition values, i. e. the cluster to a pattern i is etermine by = arg max K {v i, }.

11 Universe Universe 2 Universe Fig.. Three universes of a synthetic ata set. The top figures show only objects that were generate within the respective universe using two clusters per universe). The bottom figures show all patterns; note that most of them i. e. the ones from the other two universes), are noise in this particular universe. For clarification we use ifferent shapes for objects that originate from ifferent universes. Fig. 2. Clustering quality for 3 ifferent ata sets. The number of universes ranges from 2 to 4 universes. Note how the cluster quality of the joint feature space rops sharply whereas the parallel universe approach seems less affecte. An overall ecline of cluster quality is to be expecte since the number of clusters to be etecte increases. When the universe information is taen into account, a cluster ecision is base on the memberships to universes z i,u an memberships to clusters v u) i,. The winning universe is etermine by ū = arg max u U {z i,u } an the corresponing cluster in ū is calculate as = arg max Kū{v ū) i, }. We use the following quality measure to evaluate the clustering outcome an

12 compare it to the reference clustering []: Q K C) = C i C C i T entropy KC i )), where K is the reference clustering, i. e. the clusters as generate, C the clustering to evaluate, an entropy K C i ) the entropy of cluster C i with respect to K. This function is if C equals K an if all clusters are completely mixe such that they all contain an equal fraction of the clusters in K or all points are preicte to be noise. Thus, the higher the value, the better the clustering. Figure 2 summarizes the quality values for 3 experiments. The number of universes ranges from 2 to 4. The left bar for each experiment in figure 2 shows the quality value when using the new objective function as introuce in section 4., i. e. with incorporating the nowlege of parallel universes but no explicit noise etection. The right bar shows the quality value when applying the stanar FCM to the joint feature space. Clearly, for this ata set, our algorithm taes avantage of the information encoe in ifferent universes an ientifies the major parts of the original clusters much better. In a secon experiment, we artificially ae 6 noise patterns in orer to test the ability of noise etection. The patterns features were rawn from a ranom istribution in all universes, hence, they liely represent noise. We then applie our new algorithm in parallel universes with an without noise etection an compare the results to the extene FCM algorithm with noise etection [7] applie to the joint feature space. The crisp cluster membership was base on the egree of membership of a pattern to the auxiliary noise cluster: Whenever this value was higher than the maximum membership to any of the real clusters, the pattern was labele as noise, i. e. max K {v i, } < K = v i,. Similarly, in case of the algorithm in parallel universes, a pattern is etecte as noise when the egree of membership to the auxiliary noise universe is higher than to any real universe, max u U {z i,u } < U u= z i,u. Figure 3 summarizes the quality values for this experiment. Clearly, when allowing the algorithm to label patterns as noise, the quality value increases. However, when applying FCM to the joint feature space right most bar), most of the ata was labele as noise. It was noticeable, that the noise etection 3% of the ata was generate ranomly such that it shoul not cluster in any universe) ecrease when there were more universes, since the number of clusters an therefore the chance to hit one of them when rawing the features of a noise object increase for this artifical ata. As a result, the ifference in quality between the clustering algorithm which allows noise etection an the clustering algorithm that forces a cluster preiction eclines The overall number of patterns is therefore 2 patterns. 2

13 Fig. 3. Results on the artificial ataset with 6 patterns being noise, i. e. not contributing to any cluster. When using our new algorithms the two left bars for each experiment) the quality values are always greater than the value for the FCM with noise cluster [7] applie to the joint feature space. when there are more universes. This effect occurs no matter how carefully the noise istance parameter noise is chosen. However, if we have only few universes, the ifference is quite obvious. Figure 4 visually emonstrates the clusters from the foregoing example as they are etermine by the fuzzy c-means algorithm in parallel universes: The top figures show the outcome when using the objective function introuce in section 4., i. e. without noise etection, an the bottom figures show the clusters when allowing noise etection section 4.2). The figures show only the patterns that are part of clusters in the respective universe; other patterns, either covere by clusters in the remaining universes or etecte as noise, are filtere out. Note how the clusters in the top figures are sprea an contain patterns that obviously o not mae much sense for this clustering. This is ue to the fact that the algorithm is not allowe to iscar such patterns as noise: each pattern must be assigne to a cluster. The bottom figures, in comparison, show the clusters as well-shape, ense regions. Patterns that in the top figures istort the clusters are not inclue here. It shows nicely that the algorithm oes not force a cluster preiction an will recognize these patterns as being noise. We chose this in of ata generation to test the ability to etect clusters that are blurre by noise. Particularly in biological ata analysis it is common to have noisy ata for which ifferent escriptors are available an each by itself exhibits only little clustering power. 3

14 Universe Universe 2 Universe Fig. 4. The top figures show the clusters as they are foun when applying the algorithm with no noise etection. The bottom figures show the clusters foun by the algorithm using noise etection. While the clusters in the top figures contain patterns that o not appear natural for this clustering, the clustering with noise etection reveals those patterns an buils up clear groupings. 7 Conclusion We consiere the problem of unsupervise clustering in parallel universes, i. e. problems where multiple representations are available for each object. We evelope an extension of the fuzzy c-means algorithm with noise etection that uses membership egrees to moel the impact of objects to the clustering in a particular universe. By incorporating these membership values into the objective function, we were able to erive upate equations which minimize the objective function with respect to these values, the partition matrices, an the prototype center vectors. In orer to moel the concept of noise, i. e. patterns that apparently are not containe in any of the clusters in any universe, we introuce an auxiliary noise universe that has one single cluster to which all objects have a fixe, pre-efine istance. Patterns that are not covere by any of the clusters are assigne a high membership to this universe an can therefore be reveale as noise. The clustering algorithm itself wors in an iterative manner similar to the stanar FCM using the above upate equations to compute a localinimum. The result are clusters locate in ifferent parallel universes, each moeling only a subset of the overall ata an ignoring ata that o not contribute to clustering in a universe. We emonstrate that the algorithm performs well on a synthetic ata set an nicely exploits the information of having ifferent universes. Further stuies will concentrate on the overlap of clusters. The propose ob- 4

15 jective function rewars clusters that only occur in one universe. Objects that cluster well in more than one universe coul possibly be ientifie when having balance membership values to the universes but very unbalance partitioning values for the cluster memberships within these particular universes. Other stuies will continue to focus on the applicability of the propose metho to real worl ata an heuristics that ajust the number of clusters per universe. Acnowlegments This wor was partially supporte by DFG Research Training Group GK-42 Explorative Analysis an Visualization of Large Information Spaces. A Appenix In orer to compute a minimum of the objective function 3) with respect to 4) an 5), we exploit a Lagrange technique to merge the constraine part of the optimization problem with the unconstraine one. Note, we sip the extra notation of the noise universe in 8); it can be seen as an aitional universe, i. e. the number of universe is U +, that has one cluster to which all patterns have a fixe istance of noise. The erivation can then be applie as follows. It leas to a new objective function F i F i = U K u z i,u u) v i, u= = U + λ K u u u= = u) w u) ) v u) i, + λ, xu) i A.) U z i,u ), u= which we minimize iniviually for each pattern x i. The parameters λ an λ u, u U, enote the Lagrange multipliers to tae 4) an 5) into account. The necessary conitions leaing to local minima of F i rea as F i z i,u =, F i v u) i, =, F i λ =, F i λ u u U, K u. =, A.2) 5

16 In the following we will erive upate equations for the z an v parameters. Evaluating the first erivative of the equations in A.2) yiels the expression F i = m K u z i,u z i,u = v u) i, u) w u), xu) i λ =, an hence z i,u = λ m ) m Ku = v u) i, u) w u), xu) i m. A.3) We can rewrite the above equation λ m ) m Ku = zi,u = v u) i, u) w u), xu) i ) m. A.4) From the erivative of F i w. r. t. λ in A.2), it follows F i λ = U U z i,u =, u= u= z i,u = A.5) which returns the normalization conition as in 5). Using the formula for z i,u in A.3) an integrating it into expression A.5) we compute U λ u= λ m m ) m ) m U u= Ku = Ku = v u) i, v u) i, u) u) w u) w u), xu) i, xu) i m m = =. A.6) We mae use of A.4) an substitute λ m in A.6). Note, we use ū as parameter inex of the sum to aress the fact that it covers all universes, whereas u enotes the current universe of interest. It follows m ) Ku = z i,u = v u) i, u) w u), xu) i ) m 6

17 U ū= Kū = v ū) i, ū) w ū), xū) i m, which can be simplifie to U = z i,u ū= Ku = Kū = ) u v i, u) w u), xu) i ) ū v i, ū) ū) w, xū) i m, an returns an immeiate upate expression for the membership z i,u of pattern i to universe u: z i,u = u) Ku U v u) = i, ū= K ū v ū) = i, ū) w u), xu) i w ū), xū) i m. A.7) Analogous to the calculations above we can erive the upate equation for value v u) i, which represents the partitioning value of pattern i to cluster in universe u. From A.2) it follows F i v u) i, = z i,u m v u) i, u) w u), xu) i λ u =, an thus λ u m z i,u v u) i, = ) m = v u) i, m z i,u u) w u) λ u u) w u), xu) i m, xu) i ) m, A.8). A.9) Zeroing the erivative of F i w. r. t. λ u will result in conition 4), ensuring that the partition values sum to, i. e. F i λ u = K u = v u) i, =. A.) We use A.8) an A.) to come up with 7

18 K u = = m z i,u = λ u m z i,u ) m λ u u) u) w K u = m,, xu) i m, xu) i u) w u). A.) Equation A.9) allows us to replace the first multiplier in A.). We will use the notation to point out that the sum in A.) consiers all partitions in a universe an to enote one particular cluster coming from A.8), = v u) i, u) w u) = v u) i, K u =, xu) i u) w u) u) w u) ) m Finally, the upate rule for v u) i, K u =, xu) m i, xu) i arises as: u) w u) m, xu) i v u) i, = K u u) = u) w u), xu) i w u), xu) i m. A.2) For the sae of completeness we also erive the upate rules for the cluster prototypes w u). We confine ourselves to the Eucliean istance here, assuming the ata is normalize 2 : u) w u), A u xu) i = a= w u),a xu) i,a, A.3) with A u the number of imensions in universe u an w u),a the value of the prototype in imension a. x u) i,a is the value of the a-th attribute of pattern i in universe u, respectively. The necessary conition for a minimum of the objective function 3) is of the form u) w J =. Using the Eucliean istance as given in A.3) we obtain 2 The erivation of the upates using other than the Eucliean istance wors in a similar manner. 8

19 T i= J m,m w u),a! = ) 2 z i,u u u) v i, w,a ) xu) i,a = w u),a T i= ) T z i,u u v i, = i= T w u),a = ) z i,u u u) v i, x i,a ) i= z i,u u u) v i, x i,a ) T i= z i,u u. A.4) v i, References [] J. C. Beze. Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New Yor, 98. [2] James C. Beze an Richar J. Hathaway. VAT: a tool for visual assessment of cluster) tenency. In Proceeings of the 22 International Joint Conference on Neural Networs IJCNN 2), pages , 22. [3] Steffen Bicel an Tobias Scheffer. Multi-view clustering. In Proceeings of the Fourth IEEE International Conference on Data Mining ICDM 4), pages 9 26, 24. [4] Avrim Blum an Tom Mitchell. Combining labele an unlabele ata with co-training. In Proceeings of the eleventh annual Conference on Computational Learning Theory COLT 98), pages 92. ACM Press, 998. [5] Benjamin Bustos, Daniel A. Keim, Dietmar Saupe, Tobias Schrec, an Dejan V. Vranić. An experimental effectiveness comparison of methos for 3D similarity search. International Journal on Digital Libraries, Special issue on Multimeia Contents an Management in Digital Libraries, 25. To appear. [6] G. Cruciani, P. Crivori, P.-A. Carrupt, an B. Testa. Molecular fiels in quantitative structure-permeation relationships: the VolSurf approach. Journal of Molecular Structure, 53:7 3, 2. [7] Rajesh N. Davé. Characterization an etection of noise in clustering. Pattern Recognition Letters, 2: , 99. [8] Jerome H. Frieman an Jacqueline J. Meulmany. Clustering objects on subsets of attributes. Journal of the Royal Statistical Society, 664), 24. [9] Davi J. Han, Heii Mannila, an Pahraic Smyth. Principles of Data Mining. MIT Press, 2. [] Fran Höppner, Fran Klawoon, Ruolf Kruse, an Thomas Runler. Fuzzy Cluster Analysis. John Wiley, Chichester, Englan,

20 [] Karin Kailing, Hans-Peter Kriegel, Alexey Pryahin, an Matthias Schubert. Clustering multi-represente objects with noise. In PAKDD, pages , 24. [2] Huan Liu an Hiroshi Motoa. Feature Selection for Knowlege Discovery & Data Mining. Kluwer Acaemic Publishers, 998. [3] Lance Parsons, Ehtesham Haque, an Huan Liu. Subspace clustering for high imensional ata: a review. SIGKDD Explor. Newsl., 6):9 5, 24. [4] Davi E. Patterson an Michael R. Berthol. Clustering in parallel universes. In Proceeings of the 2 IEEE Conference in Systems, Man an Cybernetics. IEEE Press, 2. [5] Witol Perycz. Collaborative fuzzy clustering. Pattern Recognition Letters, 234): , 22. [6] Ansgar Schuffenhauer, Valerie J. Gillet, an Peter Willett. Similarity searching in files of three-imensional chemical structures: Analysis of the bioster atabase using two-imensional fingerprints an molecular fiel escriptors. Journal of Chemical Information an Computer Sciences, 42):295 37, 2. [7] N. B. Venateswarlu an P. S. V. S. K. Raju. Fast ISODATA clustering algorithms. Pattern Recognition, 253): , 992. [8] Jiong Wang, Hua-Jun Zeng, Zheng Chen, Hongjun Lu, Li Tao, an Wei-Ying Ma. ReCoM: Reinforcement clustering of multi-type interrelate ata objects. In In Proceeings of the 26th annual international ACM SIGIR conference on research an evelopment in information retrieval SIGIR 3), pages , 23. [9] R. R. Yager an D. P. Filev. Approximate clustering via the mountain metho. IEEE Trans. Systems Man Cybernet., 248): , August

Skyline Community Search in Multi-valued Networks

Skyline Community Search in Multi-valued Networks Syline Community Search in Multi-value Networs Rong-Hua Li Beijing Institute of Technology Beijing, China lironghuascut@gmail.com Jeffrey Xu Yu Chinese University of Hong Kong Hong Kong, China yu@se.cuh.eu.h

More information

Non-homogeneous Generalization in Privacy Preserving Data Publishing

Non-homogeneous Generalization in Privacy Preserving Data Publishing Non-homogeneous Generalization in Privacy Preserving Data Publishing W. K. Wong, Nios Mamoulis an Davi W. Cheung Department of Computer Science, The University of Hong Kong Pofulam Roa, Hong Kong {wwong2,nios,cheung}@cs.hu.h

More information

Particle Swarm Optimization Based on Smoothing Approach for Solving a Class of Bi-Level Multiobjective Programming Problem

Particle Swarm Optimization Based on Smoothing Approach for Solving a Class of Bi-Level Multiobjective Programming Problem BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 17, No 3 Sofia 017 Print ISSN: 1311-970; Online ISSN: 1314-4081 DOI: 10.1515/cait-017-0030 Particle Swarm Optimization Base

More information

Online Appendix to: Generalizing Database Forensics

Online Appendix to: Generalizing Database Forensics Online Appenix to: Generalizing Database Forensics KYRIACOS E. PAVLOU an RICHARD T. SNODGRASS, University of Arizona This appenix presents a step-by-step iscussion of the forensic analysis protocol that

More information

Classifying Facial Expression with Radial Basis Function Networks, using Gradient Descent and K-means

Classifying Facial Expression with Radial Basis Function Networks, using Gradient Descent and K-means Classifying Facial Expression with Raial Basis Function Networks, using Graient Descent an K-means Neil Allrin Department of Computer Science University of California, San Diego La Jolla, CA 9237 nallrin@cs.ucs.eu

More information

Feature Extraction and Rule Classification Algorithm of Digital Mammography based on Rough Set Theory

Feature Extraction and Rule Classification Algorithm of Digital Mammography based on Rough Set Theory Feature Extraction an Rule Classification Algorithm of Digital Mammography base on Rough Set Theory Aboul Ella Hassanien Jafar M. H. Ali. Kuwait University, Faculty of Aministrative Science, Quantitative

More information

The Reconstruction of Graphs. Dhananjay P. Mehendale Sir Parashurambhau College, Tilak Road, Pune , India. Abstract

The Reconstruction of Graphs. Dhananjay P. Mehendale Sir Parashurambhau College, Tilak Road, Pune , India. Abstract The Reconstruction of Graphs Dhananay P. Mehenale Sir Parashurambhau College, Tila Roa, Pune-4030, Inia. Abstract In this paper we iscuss reconstruction problems for graphs. We evelop some new ieas lie

More information

Rough Set Approach for Classification of Breast Cancer Mammogram Images

Rough Set Approach for Classification of Breast Cancer Mammogram Images Rough Set Approach for Classification of Breast Cancer Mammogram Images Aboul Ella Hassanien Jafar M. H. Ali. Kuwait University, Faculty of Aministrative Science, Quantitative Methos an Information Systems

More information

A PSO Optimized Layered Approach for Parametric Clustering on Weather Dataset

A PSO Optimized Layered Approach for Parametric Clustering on Weather Dataset Vol.3, Issue.1, Jan-Feb. 013 pp-504-508 ISSN: 49-6645 A PSO Optimize Layere Approach for Parametric Clustering on Weather Dataset Shikha Verma, 1 Kiran Jyoti 1 Stuent, Guru Nanak Dev Engineering College

More information

Cluster Center Initialization Method for K-means Algorithm Over Data Sets with Two Clusters

Cluster Center Initialization Method for K-means Algorithm Over Data Sets with Two Clusters Available online at www.scienceirect.com Proceia Engineering 4 (011 ) 34 38 011 International Conference on Avances in Engineering Cluster Center Initialization Metho for K-means Algorithm Over Data Sets

More information

Generalized Edge Coloring for Channel Assignment in Wireless Networks

Generalized Edge Coloring for Channel Assignment in Wireless Networks TR-IIS-05-021 Generalize Ege Coloring for Channel Assignment in Wireless Networks Chun-Chen Hsu, Pangfeng Liu, Da-Wei Wang, Jan-Jan Wu December 2005 Technical Report No. TR-IIS-05-021 http://www.iis.sinica.eu.tw/lib/techreport/tr2005/tr05.html

More information

Image Segmentation using K-means clustering and Thresholding

Image Segmentation using K-means clustering and Thresholding Image Segmentation using Kmeans clustering an Thresholing Preeti Panwar 1, Girhar Gopal 2, Rakesh Kumar 3 1M.Tech Stuent, Department of Computer Science & Applications, Kurukshetra University, Kurukshetra,

More information

Generalized Edge Coloring for Channel Assignment in Wireless Networks

Generalized Edge Coloring for Channel Assignment in Wireless Networks Generalize Ege Coloring for Channel Assignment in Wireless Networks Chun-Chen Hsu Institute of Information Science Acaemia Sinica Taipei, Taiwan Da-wei Wang Jan-Jan Wu Institute of Information Science

More information

Multilevel Linear Dimensionality Reduction using Hypergraphs for Data Analysis

Multilevel Linear Dimensionality Reduction using Hypergraphs for Data Analysis Multilevel Linear Dimensionality Reuction using Hypergraphs for Data Analysis Haw-ren Fang Department of Computer Science an Engineering University of Minnesota; Minneapolis, MN 55455 hrfang@csumneu ABSTRACT

More information

NEW METHOD FOR FINDING A REFERENCE POINT IN FINGERPRINT IMAGES WITH THE USE OF THE IPAN99 ALGORITHM 1. INTRODUCTION 2.

NEW METHOD FOR FINDING A REFERENCE POINT IN FINGERPRINT IMAGES WITH THE USE OF THE IPAN99 ALGORITHM 1. INTRODUCTION 2. JOURNAL OF MEDICAL INFORMATICS & TECHNOLOGIES Vol. 13/009, ISSN 164-6037 Krzysztof WRÓBEL, Rafał DOROZ * fingerprint, reference point, IPAN99 NEW METHOD FOR FINDING A REFERENCE POINT IN FINGERPRINT IMAGES

More information

Study of Network Optimization Method Based on ACL

Study of Network Optimization Method Based on ACL Available online at www.scienceirect.com Proceia Engineering 5 (20) 3959 3963 Avance in Control Engineering an Information Science Stuy of Network Optimization Metho Base on ACL Liu Zhian * Department

More information

APPLYING GENETIC ALGORITHM IN QUERY IMPROVEMENT PROBLEM. Abdelmgeid A. Aly

APPLYING GENETIC ALGORITHM IN QUERY IMPROVEMENT PROBLEM. Abdelmgeid A. Aly International Journal "Information Technologies an Knowlege" Vol. / 2007 309 [Project MINERVAEUROPE] Project MINERVAEUROPE: Ministerial Network for Valorising Activities in igitalisation -

More information

filtering LETTER An Improved Neighbor Selection Algorithm in Collaborative Taek-Hun KIM a), Student Member and Sung-Bong YANG b), Nonmember

filtering LETTER An Improved Neighbor Selection Algorithm in Collaborative Taek-Hun KIM a), Student Member and Sung-Bong YANG b), Nonmember 107 IEICE TRANS INF & SYST, VOLE88 D, NO5 MAY 005 LETTER An Improve Neighbor Selection Algorithm in Collaborative Filtering Taek-Hun KIM a), Stuent Member an Sung-Bong YANG b), Nonmember SUMMARY Nowaays,

More information

Coupling the User Interfaces of a Multiuser Program

Coupling the User Interfaces of a Multiuser Program Coupling the User Interfaces of a Multiuser Program PRASUN DEWAN University of North Carolina at Chapel Hill RAJIV CHOUDHARY Intel Corporation We have evelope a new moel for coupling the user-interfaces

More information

A Neural Network Model Based on Graph Matching and Annealing :Application to Hand-Written Digits Recognition

A Neural Network Model Based on Graph Matching and Annealing :Application to Hand-Written Digits Recognition ITERATIOAL JOURAL OF MATHEMATICS AD COMPUTERS I SIMULATIO A eural etwork Moel Base on Graph Matching an Annealing :Application to Han-Written Digits Recognition Kyunghee Lee Abstract We present a neural

More information

Learning Subproblem Complexities in Distributed Branch and Bound

Learning Subproblem Complexities in Distributed Branch and Bound Learning Subproblem Complexities in Distribute Branch an Boun Lars Otten Department of Computer Science University of California, Irvine lotten@ics.uci.eu Rina Dechter Department of Computer Science University

More information

Design of Policy-Aware Differentially Private Algorithms

Design of Policy-Aware Differentially Private Algorithms Design of Policy-Aware Differentially Private Algorithms Samuel Haney Due University Durham, NC, USA shaney@cs.ue.eu Ashwin Machanavajjhala Due University Durham, NC, USA ashwin@cs.ue.eu Bolin Ding Microsoft

More information

Shift-map Image Registration

Shift-map Image Registration Shift-map Image Registration Svärm, Linus; Stranmark, Petter Unpublishe: 2010-01-01 Link to publication Citation for publishe version (APA): Svärm, L., & Stranmark, P. (2010). Shift-map Image Registration.

More information

1 Surprises in high dimensions

1 Surprises in high dimensions 1 Surprises in high imensions Our intuition about space is base on two an three imensions an can often be misleaing in high imensions. It is instructive to analyze the shape an properties of some basic

More information

Random Clustering for Multiple Sampling Units to Speed Up Run-time Sample Generation

Random Clustering for Multiple Sampling Units to Speed Up Run-time Sample Generation DEIM Forum 2018 I4-4 Abstract Ranom Clustering for Multiple Sampling Units to Spee Up Run-time Sample Generation uzuru OKAJIMA an Koichi MARUAMA NEC Solution Innovators, Lt. 1-18-7 Shinkiba, Koto-ku, Tokyo,

More information

6 Gradient Descent. 6.1 Functions

6 Gradient Descent. 6.1 Functions 6 Graient Descent In this topic we will iscuss optimizing over general functions f. Typically the function is efine f : R! R; that is its omain is multi-imensional (in this case -imensional) an output

More information

Transient analysis of wave propagation in 3D soil by using the scaled boundary finite element method

Transient analysis of wave propagation in 3D soil by using the scaled boundary finite element method Southern Cross University epublications@scu 23r Australasian Conference on the Mechanics of Structures an Materials 214 Transient analysis of wave propagation in 3D soil by using the scale bounary finite

More information

THE BAYESIAN RECEIVER OPERATING CHARACTERISTIC CURVE AN EFFECTIVE APPROACH TO EVALUATE THE IDS PERFORMANCE

THE BAYESIAN RECEIVER OPERATING CHARACTERISTIC CURVE AN EFFECTIVE APPROACH TO EVALUATE THE IDS PERFORMANCE БСУ Международна конференция - 2 THE BAYESIAN RECEIVER OPERATING CHARACTERISTIC CURVE AN EFFECTIVE APPROACH TO EVALUATE THE IDS PERFORMANCE Evgeniya Nikolova, Veselina Jecheva Burgas Free University Abstract:

More information

An Algorithm for Building an Enterprise Network Topology Using Widespread Data Sources

An Algorithm for Building an Enterprise Network Topology Using Widespread Data Sources An Algorithm for Builing an Enterprise Network Topology Using Wiesprea Data Sources Anton Anreev, Iurii Bogoiavlenskii Petrozavosk State University Petrozavosk, Russia {anreev, ybgv}@cs.petrsu.ru Abstract

More information

Using Vector and Raster-Based Techniques in Categorical Map Generalization

Using Vector and Raster-Based Techniques in Categorical Map Generalization Thir ICA Workshop on Progress in Automate Map Generalization, Ottawa, 12-14 August 1999 1 Using Vector an Raster-Base Techniques in Categorical Map Generalization Beat Peter an Robert Weibel Department

More information

Fast Fractal Image Compression using PSO Based Optimization Techniques

Fast Fractal Image Compression using PSO Based Optimization Techniques Fast Fractal Compression using PSO Base Optimization Techniques A.Krishnamoorthy Visiting faculty Department Of ECE University College of Engineering panruti rishpci89@gmail.com S.Buvaneswari Visiting

More information

A FUZZY FRAMEWORK FOR SEGMENTATION, FEATURE MATCHING AND RETRIEVAL OF BRAIN MR IMAGES

A FUZZY FRAMEWORK FOR SEGMENTATION, FEATURE MATCHING AND RETRIEVAL OF BRAIN MR IMAGES A FUZZY FRAMEWORK FOR SEGMENTATION, FEATURE MATCHING AND RETRIEVAL OF BRAIN MR IMAGES Archana.S 1 an Srihar.S 2 1 Department of Information Science an Technology, College of Engineering, Guiny archana.santhira@gmail.com

More information

d 3 d 4 d d d d d d d d d d d 1 d d d d d d

d 3 d 4 d d d d d d d d d d d 1 d d d d d d Proceeings of the IASTED International Conference Software Engineering an Applications (SEA') October 6-, 1, Scottsale, Arizona, USA AN OBJECT-ORIENTED APPROACH FOR MANAGING A NETWORK OF DATABASES Shu-Ching

More information

WLAN Indoor Positioning Based on Euclidean Distances and Fuzzy Logic

WLAN Indoor Positioning Based on Euclidean Distances and Fuzzy Logic WLAN Inoor Positioning Base on Eucliean Distances an Fuzzy Logic Anreas TEUBER, Bern EISSFELLER Institute of Geoesy an Navigation, University FAF, Munich, Germany, e-mail: (anreas.teuber, bern.eissfeller)@unibw.e

More information

Solution Representation for Job Shop Scheduling Problems in Ant Colony Optimisation

Solution Representation for Job Shop Scheduling Problems in Ant Colony Optimisation Solution Representation for Job Shop Scheuling Problems in Ant Colony Optimisation James Montgomery, Carole Faya 2, an Sana Petrovic 2 Faculty of Information & Communication Technologies, Swinburne University

More information

Learning Polynomial Functions. by Feature Construction

Learning Polynomial Functions. by Feature Construction I Proceeings of the Eighth International Workshop on Machine Learning Chicago, Illinois, June 27-29 1991 Learning Polynomial Functions by Feature Construction Richar S. Sutton GTE Laboratories Incorporate

More information

A Convex Clustering-based Regularizer for Image Segmentation

A Convex Clustering-based Regularizer for Image Segmentation Vision, Moeling, an Visualization (2015) D. Bommes, T. Ritschel an T. Schultz (Es.) A Convex Clustering-base Regularizer for Image Segmentation Benjamin Hell (TU Braunschweig), Marcus Magnor (TU Braunschweig)

More information

MORA: a Movement-Based Routing Algorithm for Vehicle Ad Hoc Networks

MORA: a Movement-Based Routing Algorithm for Vehicle Ad Hoc Networks : a Movement-Base Routing Algorithm for Vehicle A Hoc Networks Fabrizio Granelli, Senior Member, Giulia Boato, Member, an Dzmitry Kliazovich, Stuent Member Abstract Recent interest in car-to-car communications

More information

A Plane Tracker for AEC-automation Applications

A Plane Tracker for AEC-automation Applications A Plane Tracker for AEC-automation Applications Chen Feng *, an Vineet R. Kamat Department of Civil an Environmental Engineering, University of Michigan, Ann Arbor, USA * Corresponing author (cforrest@umich.eu)

More information

Multi-camera tracking algorithm study based on information fusion

Multi-camera tracking algorithm study based on information fusion International Conference on Avance Electronic Science an Technolog (AEST 016) Multi-camera tracking algorithm stu base on information fusion a Guoqiang Wang, Shangfu Li an Xue Wen School of Electronic

More information

Clustering using Particle Swarm Optimization. Nuria Gómez Blas, Octavio López Tolic

Clustering using Particle Swarm Optimization. Nuria Gómez Blas, Octavio López Tolic 24 International Journal Information Theories an Applications, Vol. 23, Number 1, (c) 2016 Clustering using Particle Swarm Optimization Nuria Gómez Blas, Octavio López Tolic Abstract: Data clustering has

More information

A fast embedded selection approach for color texture classification using degraded LBP

A fast embedded selection approach for color texture classification using degraded LBP A fast embee selection approach for color texture classification using egrae A. Porebski, N. Vanenbroucke an D. Hama Laboratoire LISIC - EA 4491 - Université u Littoral Côte Opale - 50, rue Ferinan Buisson

More information

Detecting Overlapping Communities from Local Spectral Subspaces

Detecting Overlapping Communities from Local Spectral Subspaces Detecting Overlapping Communities from Local Spectral Subspaces Kun He, Yiwei Sun Huazhong University of Science an Technology Wuhan 430074, China Email: {brooklet60, yiweisun}@hust.eu.cn Davi Binel, John

More information

New Geometric Interpretation and Analytic Solution for Quadrilateral Reconstruction

New Geometric Interpretation and Analytic Solution for Quadrilateral Reconstruction New Geometric Interpretation an Analytic Solution for uarilateral Reconstruction Joo-Haeng Lee Convergence Technology Research Lab ETRI Daejeon, 305 777, KOREA Abstract A new geometric framework, calle

More information

Research Article Research on Law s Mask Texture Analysis System Reliability

Research Article Research on Law s Mask Texture Analysis System Reliability Research Journal of Applie Sciences, Engineering an Technology 7(19): 4002-4007, 2014 DOI:10.19026/rjaset.7.761 ISSN: 2040-7459; e-issn: 2040-7467 2014 Maxwell Scientific Publication Corp. Submitte: November

More information

Lecture 1 September 4, 2013

Lecture 1 September 4, 2013 CS 84r: Incentives an Information in Networks Fall 013 Prof. Yaron Singer Lecture 1 September 4, 013 Scribe: Bo Waggoner 1 Overview In this course we will try to evelop a mathematical unerstaning for the

More information

Data Mining: Concepts and Techniques. Chapter 7. Cluster Analysis. Examples of Clustering Applications. What is Cluster Analysis?

Data Mining: Concepts and Techniques. Chapter 7. Cluster Analysis. Examples of Clustering Applications. What is Cluster Analysis? Data Mining: Concepts an Techniques Chapter Jiawei Han Department of Computer Science University of Illinois at Urbana-Champaign www.cs.uiuc.eu/~hanj Jiawei Han an Micheline Kamber, All rights reserve

More information

AnyTraffic Labeled Routing

AnyTraffic Labeled Routing AnyTraffic Labele Routing Dimitri Papaimitriou 1, Pero Peroso 2, Davie Careglio 2 1 Alcatel-Lucent Bell, Antwerp, Belgium Email: imitri.papaimitriou@alcatel-lucent.com 2 Universitat Politècnica e Catalunya,

More information

Research Article Inviscid Uniform Shear Flow past a Smooth Concave Body

Research Article Inviscid Uniform Shear Flow past a Smooth Concave Body International Engineering Mathematics Volume 04, Article ID 46593, 7 pages http://x.oi.org/0.55/04/46593 Research Article Invisci Uniform Shear Flow past a Smooth Concave Boy Abullah Mura Department of

More information

Optimal Distributed P2P Streaming under Node Degree Bounds

Optimal Distributed P2P Streaming under Node Degree Bounds Optimal Distribute P2P Streaming uner Noe Degree Bouns Shaoquan Zhang, Ziyu Shao, Minghua Chen, an Libin Jiang Department of Information Engineering, The Chinese University of Hong Kong Department of EECS,

More information

New Version of Davies-Bouldin Index for Clustering Validation Based on Cylindrical Distance

New Version of Davies-Bouldin Index for Clustering Validation Based on Cylindrical Distance New Version of Davies-Boulin Inex for lustering Valiation Base on ylinrical Distance Juan arlos Roas Thomas Faculta e Informática Universia omplutense e Mari Mari, España correoroas@gmail.com Abstract

More information

Queueing Model and Optimization of Packet Dropping in Real-Time Wireless Sensor Networks

Queueing Model and Optimization of Packet Dropping in Real-Time Wireless Sensor Networks Queueing Moel an Optimization of Packet Dropping in Real-Time Wireless Sensor Networks Marc Aoun, Antonios Argyriou, Philips Research, Einhoven, 66AE, The Netherlans Department of Computer an Communication

More information

Short-term prediction of photovoltaic power based on GWPA - BP neural network model

Short-term prediction of photovoltaic power based on GWPA - BP neural network model Short-term preiction of photovoltaic power base on GWPA - BP neural networ moel Jian Di an Shanshan Meng School of orth China Electric Power University, Baoing. China Abstract In recent years, ue to China's

More information

Classification and clustering methods for documents. by probabilistic latent semantic indexing model

Classification and clustering methods for documents. by probabilistic latent semantic indexing model A Short Course at amang University aipei, aiwan, R.O.C., March 7-9, 2006 Classification an clustering methos for ocuments by probabilistic latent semantic inexing moel Shigeichi Hirasawa Department of

More information

Almost Disjunct Codes in Large Scale Multihop Wireless Network Media Access Control

Almost Disjunct Codes in Large Scale Multihop Wireless Network Media Access Control Almost Disjunct Coes in Large Scale Multihop Wireless Network Meia Access Control D. Charles Engelhart Anan Sivasubramaniam Penn. State University University Park PA 682 engelhar,anan @cse.psu.eu Abstract

More information

Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed.

Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed. Preface Here are my online notes for my Calculus I course that I teach here at Lamar University. Despite the fact that these are my class notes, they shoul be accessible to anyone wanting to learn Calculus

More information

Improving Spatial Reuse of IEEE Based Ad Hoc Networks

Improving Spatial Reuse of IEEE Based Ad Hoc Networks mproving Spatial Reuse of EEE 82.11 Base A Hoc Networks Fengji Ye, Su Yi an Biplab Sikar ECSE Department, Rensselaer Polytechnic nstitute Troy, NY 1218 Abstract n this paper, we evaluate an suggest methos

More information

DEVELOPMENT OF DamageCALC APPLICATION FOR AUTOMATIC CALCULATION OF THE DAMAGE INDICATOR

DEVELOPMENT OF DamageCALC APPLICATION FOR AUTOMATIC CALCULATION OF THE DAMAGE INDICATOR Mechanical Testing an Diagnosis ISSN 2247 9635, 2012 (II), Volume 4, 28-36 DEVELOPMENT OF DamageCALC APPLICATION FOR AUTOMATIC CALCULATION OF THE DAMAGE INDICATOR Valentina GOLUBOVIĆ-BUGARSKI, Branislav

More information

A Duality Based Approach for Realtime TV-L 1 Optical Flow

A Duality Based Approach for Realtime TV-L 1 Optical Flow A Duality Base Approach for Realtime TV-L 1 Optical Flow C. Zach 1, T. Pock 2, an H. Bischof 2 1 VRVis Research Center 2 Institute for Computer Graphics an Vision, TU Graz Abstract. Variational methos

More information

Improving Performance of Sparse Matrix-Vector Multiplication

Improving Performance of Sparse Matrix-Vector Multiplication Improving Performance of Sparse Matrix-Vector Multiplication Ali Pınar Michael T. Heath Department of Computer Science an Center of Simulation of Avance Rockets University of Illinois at Urbana-Champaign

More information

State Indexed Policy Search by Dynamic Programming. Abstract. 1. Introduction. 2. System parameterization. Charles DuHadway

State Indexed Policy Search by Dynamic Programming. Abstract. 1. Introduction. 2. System parameterization. Charles DuHadway State Inexe Policy Search by Dynamic Programming Charles DuHaway Yi Gu 5435537 503372 December 4, 2007 Abstract We consier the reinforcement learning problem of simultaneous trajectory-following an obstacle

More information

A Cost Model For Nearest Neighbor Search. High-Dimensional Data Space

A Cost Model For Nearest Neighbor Search. High-Dimensional Data Space A Cost Moel For Nearest Neighbor Search in High-Dimensional Data Space Stefan Berchtol University of Munich Germany berchtol@informatikuni-muenchene Daniel A Keim University of Munich Germany keim@informatikuni-muenchene

More information

AN INVESTIGATION OF FOCUSING AND ANGULAR TECHNIQUES FOR VOLUMETRIC IMAGES BY USING THE 2D CIRCULAR ULTRASONIC PHASED ARRAY

AN INVESTIGATION OF FOCUSING AND ANGULAR TECHNIQUES FOR VOLUMETRIC IMAGES BY USING THE 2D CIRCULAR ULTRASONIC PHASED ARRAY AN INVESTIGATION OF FOCUSING AND ANGULAR TECHNIQUES FOR VOLUMETRIC IMAGES BY USING THE D CIRCULAR ULTRASONIC PHASED ARRAY S. Monal Lonon South Bank University; Engineering an Design 103 Borough Roa, Lonon

More information

Throughput Characterization of Node-based Scheduling in Multihop Wireless Networks: A Novel Application of the Gallai-Edmonds Structure Theorem

Throughput Characterization of Node-based Scheduling in Multihop Wireless Networks: A Novel Application of the Gallai-Edmonds Structure Theorem Throughput Characterization of Noe-base Scheuling in Multihop Wireless Networks: A Novel Application of the Gallai-Emons Structure Theorem Bo Ji an Yu Sang Dept. of Computer an Information Sciences Temple

More information

Open Access Adaptive Image Enhancement Algorithm with Complex Background

Open Access Adaptive Image Enhancement Algorithm with Complex Background Sen Orers for Reprints to reprints@benthamscience.ae 594 The Open Cybernetics & Systemics Journal, 205, 9, 594-600 Open Access Aaptive Image Enhancement Algorithm with Complex Bacgroun Zhang Pai * epartment

More information

Unknown Radial Distortion Centers in Multiple View Geometry Problems

Unknown Radial Distortion Centers in Multiple View Geometry Problems Unknown Raial Distortion Centers in Multiple View Geometry Problems José Henrique Brito 1,2, Rolan Angst 3, Kevin Köser 3, Christopher Zach 4, Pero Branco 2, Manuel João Ferreira 2, Marc Pollefeys 3 1

More information

Characterizing Decoding Robustness under Parametric Channel Uncertainty

Characterizing Decoding Robustness under Parametric Channel Uncertainty Characterizing Decoing Robustness uner Parametric Channel Uncertainty Jay D. Wierer, Wahee U. Bajwa, Nigel Boston, an Robert D. Nowak Abstract This paper characterizes the robustness of ecoing uner parametric

More information

Comparison of Methods for Increasing the Performance of a DUA Computation

Comparison of Methods for Increasing the Performance of a DUA Computation Comparison of Methos for Increasing the Performance of a DUA Computation Michael Behrisch, Daniel Krajzewicz, Peter Wagner an Yun-Pang Wang Institute of Transportation Systems, German Aerospace Center,

More information

Image compression predicated on recurrent iterated function systems

Image compression predicated on recurrent iterated function systems 2n International Conference on Mathematics & Statistics 16-19 June, 2008, Athens, Greece Image compression preicate on recurrent iterate function systems Chol-Hui Yun *, Metzler W. a an Barski M. a * Faculty

More information

Blind Data Classification using Hyper-Dimensional Convex Polytopes

Blind Data Classification using Hyper-Dimensional Convex Polytopes Blin Data Classification using Hyper-Dimensional Convex Polytopes Brent T. McBrie an Gilbert L. Peterson Department of Electrical an Computer Engineering Air Force Institute of Technology 9 Hobson Way

More information

Handling missing values in kernel methods with application to microbiology data

Handling missing values in kernel methods with application to microbiology data an Machine Learning. Bruges (Belgium), 24-26 April 2013, i6oc.com publ., ISBN 978-2-87419-081-0. Available from http://www.i6oc.com/en/livre/?gcoi=28001100131010. Hanling missing values in kernel methos

More information

Tight Wavelet Frame Decomposition and Its Application in Image Processing

Tight Wavelet Frame Decomposition and Its Application in Image Processing ITB J. Sci. Vol. 40 A, No., 008, 151-165 151 Tight Wavelet Frame Decomposition an Its Application in Image Processing Mahmu Yunus 1, & Henra Gunawan 1 1 Analysis an Geometry Group, FMIPA ITB, Banung Department

More information

arxiv: v1 [cs.cr] 22 Apr 2015 ABSTRACT

arxiv: v1 [cs.cr] 22 Apr 2015 ABSTRACT Differentially Private -Means Clustering arxiv:10.0v1 [cs.cr] Apr 01 ABSTRACT Dong Su #, Jianneng Cao, Ninghui Li #, Elisa Bertino #, Hongxia Jin There are two broa approaches for ifferentially private

More information

EXACT SIMULATION OF A BOOLEAN MODEL

EXACT SIMULATION OF A BOOLEAN MODEL Original Research Paper oi:10.5566/ias.v32.p101-105 EXACT SIMULATION OF A BOOLEAN MODEL CHRISTIAN LANTUÉJOULB MinesParisTech 35 rue Saint-Honoré 77305 Fontainebleau France e-mail: christian.lantuejoul@mines-paristech.fr

More information

Kinematic Analysis of a Family of 3R Manipulators

Kinematic Analysis of a Family of 3R Manipulators Kinematic Analysis of a Family of R Manipulators Maher Baili, Philippe Wenger an Damien Chablat Institut e Recherche en Communications et Cybernétique e Nantes, UMR C.N.R.S. 6597 1, rue e la Noë, BP 92101,

More information

Bayesian localization microscopy reveals nanoscale podosome dynamics

Bayesian localization microscopy reveals nanoscale podosome dynamics Nature Methos Bayesian localization microscopy reveals nanoscale poosome ynamics Susan Cox, Ewar Rosten, James Monypenny, Tijana Jovanovic-Talisman, Dylan T Burnette, Jennifer Lippincott-Schwartz, Gareth

More information

A Spectral Clustering Approach to Optimally Combining Numerical Vectors with a Modular Network

A Spectral Clustering Approach to Optimally Combining Numerical Vectors with a Modular Network A Spectral Clustering Approach to Optimally Combining Numerical Vectors with a Moular Networ Motoi Shiga Bioinformatics Center Kyoto University Goasho Uji 6-, Japan shiga@uicryotouacjp Ichigau Taigawa

More information

A Revised Simplex Search Procedure for Stochastic Simulation Response Surface Optimization

A Revised Simplex Search Procedure for Stochastic Simulation Response Surface Optimization 272 INFORMS Journal on Computing 0899-1499 100 1204-0272 $05.00 Vol. 12, No. 4, Fall 2000 2000 INFORMS A Revise Simplex Search Proceure for Stochastic Simulation Response Surface Optimization DAVID G.

More information

2-connected graphs with small 2-connected dominating sets

2-connected graphs with small 2-connected dominating sets 2-connecte graphs with small 2-connecte ominating sets Yair Caro, Raphael Yuster 1 Department of Mathematics, University of Haifa at Oranim, Tivon 36006, Israel Abstract Let G be a 2-connecte graph. A

More information

Modifying ROC Curves to Incorporate Predicted Probabilities

Modifying ROC Curves to Incorporate Predicted Probabilities Moifying ROC Curves to Incorporate Preicte Probabilities Cèsar Ferri DSIC, Universitat Politècnica e València Peter Flach Department of Computer Science, University of Bristol José Hernánez-Orallo DSIC,

More information

Cellular Ants: Combining Ant-Based Clustering with Cellular Automata

Cellular Ants: Combining Ant-Based Clustering with Cellular Automata Cellular Ants: Combining Ant-Base Clustering with Cellular Automata Anrew Vane Moere & Justin James Clayen Key Centre of Design Computing & Cognition, University of Syney, Australia {anrew,justin}@arch.usy.eu.au

More information

CS 106 Winter 2016 Craig S. Kaplan. Module 01 Processing Recap. Topics

CS 106 Winter 2016 Craig S. Kaplan. Module 01 Processing Recap. Topics CS 106 Winter 2016 Craig S. Kaplan Moule 01 Processing Recap Topics The basic parts of speech in a Processing program Scope Review of syntax for classes an objects Reaings Your CS 105 notes Learning Processing,

More information

A Novel Density Based Clustering Algorithm by Incorporating Mahalanobis Distance

A Novel Density Based Clustering Algorithm by Incorporating Mahalanobis Distance Receive: November 20, 2017 121 A Novel Density Base Clustering Algorithm by Incorporating Mahalanobis Distance Margaret Sangeetha 1 * Velumani Paikkaramu 2 Rajakumar Thankappan Chellan 3 1 Department of

More information

Robust Camera Calibration for an Autonomous Underwater Vehicle

Robust Camera Calibration for an Autonomous Underwater Vehicle obust Camera Calibration for an Autonomous Unerwater Vehicle Matthew Bryant, Davi Wettergreen *, Samer Aballah, Alexaner Zelinsky obotic Systems Laboratory Department of Engineering, FEIT Department of

More information

A Classification of 3R Orthogonal Manipulators by the Topology of their Workspace

A Classification of 3R Orthogonal Manipulators by the Topology of their Workspace A Classification of R Orthogonal Manipulators by the Topology of their Workspace Maher aili, Philippe Wenger an Damien Chablat Institut e Recherche en Communications et Cybernétique e Nantes, UMR C.N.R.S.

More information

Bends, Jogs, And Wiggles for Railroad Tracks and Vehicle Guide Ways

Bends, Jogs, And Wiggles for Railroad Tracks and Vehicle Guide Ways Ben, Jogs, An Wiggles for Railroa Tracks an Vehicle Guie Ways Louis T. Klauer Jr., PhD, PE. Work Soft 833 Galer Dr. Newtown Square, PA 19073 lklauer@wsof.com Preprint, June 4, 00 Copyright 00 by Louis

More information

On the Role of Multiply Sectioned Bayesian Networks to Cooperative Multiagent Systems

On the Role of Multiply Sectioned Bayesian Networks to Cooperative Multiagent Systems On the Role of Multiply Sectione Bayesian Networks to Cooperative Multiagent Systems Y. Xiang University of Guelph, Canaa, yxiang@cis.uoguelph.ca V. Lesser University of Massachusetts at Amherst, USA,

More information

Learning convex bodies is hard

Learning convex bodies is hard Learning convex boies is har Navin Goyal Microsoft Research Inia navingo@microsoftcom Luis Raemacher Georgia Tech lraemac@ccgatecheu Abstract We show that learning a convex boy in R, given ranom samples

More information

Offloading Cellular Traffic through Opportunistic Communications: Analysis and Optimization

Offloading Cellular Traffic through Opportunistic Communications: Analysis and Optimization 1 Offloaing Cellular Traffic through Opportunistic Communications: Analysis an Optimization Vincenzo Sciancalepore, Domenico Giustiniano, Albert Banchs, Anreea Picu arxiv:1405.3548v1 [cs.ni] 14 May 24

More information

Shift-map Image Registration

Shift-map Image Registration Shift-map Image Registration Linus Svärm Petter Stranmark Centre for Mathematical Sciences, Lun University {linus,petter}@maths.lth.se Abstract Shift-map image processing is a new framework base on energy

More information

Enabling Rollback Support in IT Change Management Systems

Enabling Rollback Support in IT Change Management Systems Enabling Rollback Support in IT Change Management Systems Guilherme Sperb Machao, Fábio Fabian Daitx, Weverton Luis a Costa Coreiro, Cristiano Bonato Both, Luciano Paschoal Gaspary, Lisanro Zambeneetti

More information

Optimal Oblivious Path Selection on the Mesh

Optimal Oblivious Path Selection on the Mesh Optimal Oblivious Path Selection on the Mesh Costas Busch Malik Magon-Ismail Jing Xi Department of Computer Science Rensselaer Polytechnic Institute Troy, NY 280, USA {buschc,magon,xij2}@cs.rpi.eu Abstract

More information

NET Institute*

NET Institute* NET Institute* www.netinst.org Working Paper #08-24 October 2008 Computer Virus Propagation in a Network Organization: The Interplay between Social an Technological Networks Hsing Kenny Cheng an Hong Guo

More information

MANJUSHA K.*, ANAND KUMAR M., SOMAN K. P.

MANJUSHA K.*, ANAND KUMAR M., SOMAN K. P. Journal of Engineering Science an echnology Vol. 13, No. 1 (2018) 141-157 School of Engineering, aylor s University IMPLEMENAION OF REJECION SRAEGIES INSIDE MALAYALAM CHARACER RECOGNIION SYSEM BASED ON

More information

A Stochastic Process on the Hypercube with Applications to Peer to Peer Networks

A Stochastic Process on the Hypercube with Applications to Peer to Peer Networks A Stochastic Process on the Hypercube with Applications to Peer to Peer Networs [Extene Abstract] Micah Aler Department of Computer Science, University of Massachusetts, Amherst, MA 0003 460, USA micah@cs.umass.eu

More information

FTSM FAST TAKAGI-SUGENO FUZZY MODELING. Manfred Männle

FTSM FAST TAKAGI-SUGENO FUZZY MODELING. Manfred Männle FTS FAST TAKAGI-SUGENO FUZZY ODELING anfre ännle Institute for Computer Design an Fault Tolerance University of Karlsruhe, D-7628 Karlsruhe, Germany anfre.aennle@informatik.uni-karlsruhe.e Abstract Takagi-Sugeno

More information

Preamble. Singly linked lists. Collaboration policy and academic integrity. Getting help

Preamble. Singly linked lists. Collaboration policy and academic integrity. Getting help CS2110 Spring 2016 Assignment A. Linke Lists Due on the CMS by: See the CMS 1 Preamble Linke Lists This assignment begins our iscussions of structures. In this assignment, you will implement a structure

More information

Software Reliability Modeling and Cost Estimation Incorporating Testing-Effort and Efficiency

Software Reliability Modeling and Cost Estimation Incorporating Testing-Effort and Efficiency Software Reliability Moeling an Cost Estimation Incorporating esting-effort an Efficiency Chin-Yu Huang, Jung-Hua Lo, Sy-Yen Kuo, an Michael R. Lyu -+ Department of Electrical Engineering Computer Science

More information

Object Recognition Using Colour, Shape and Affine Invariant Ratios

Object Recognition Using Colour, Shape and Affine Invariant Ratios Object Recognition Using Colour, Shape an Affine Invariant Ratios Paul A. Walcott Centre for Information Engineering City University, Lonon EC1V 0HB, Englan P.A.Walcott@city.ac.uk Abstract This paper escribes

More information

BIJECTIONS FOR PLANAR MAPS WITH BOUNDARIES

BIJECTIONS FOR PLANAR MAPS WITH BOUNDARIES BIJECTIONS FOR PLANAR MAPS WITH BOUNDARIES OLIVIER BERNARDI AND ÉRIC FUSY Abstract. We present bijections for planar maps with bounaries. In particular, we obtain bijections for triangulations an quarangulations

More information