A Comparison of Pattern-Based Spectral Clustering Algorithms in Directed Weighted Network

Size: px
Start display at page:

Download "A Comparison of Pattern-Based Spectral Clustering Algorithms in Directed Weighted Network"

Transcription

1 A Comparison of Pattern-Based Spectral Clustering Algorithms in Directed Weighted Network Sumuya Borjigin 1. School of Economics and Management, Inner Mongolia University, No.235 West College Road, Hohhot, Inner Mongolia, P.R.C., ; 2. Academy of Mathematics and Systems Sciences, Chinese Academy of Sciences No.55 Zhongguancun East Road, Haidian District, Beijing City, P.R.C., Abstract. Pattern-based spectral clustering algorithms in directed weighted network have important application values in many domains, including computer science, electronic commerce, economics and finance. In this paper, we compare several of the best-known algorithms from the point of view of clustering quality over some existing benchmark data sets. Experimental results show that, it is necessary to propose a more common pattern-based spectral clustering algorithm in directed weighted network. Keywords: Pattern-based cluster structure, Spectral clustering, Comparison 1 Introduction Networks have become ubiquitous as data from many different disciplines can be naturally mapped to network structures, such as technological networks, biological networks and social networks. A cluster in a directed weighted network can be considered as a set of nodes that share common or similar characteristics. The clusters in directed weighted networks can be partitioned into two categories, density-based clusters and pattern-based clusters. Density-based clusters in directed weighted network are groups of nodes that follow the traditional clustering definition based on edge density characteristics, which is defined as a group of nodes with more intra-cluster edges than inter-cluster edges. Although density-based clusters in directed weighted network represents the most common and well-studied clustering definition in both directed and undirected networks, it cannot capture more sophisticated clustering structures, than the classical well-cohesive groups, where edge density may not represent the major clustering criterion. Another category of clusters are pattern-based clusters, where the nodes of a directed weighted network can be naturally clustered together according to similar connectivity patterns that may exist and are not captured completely applying only density criteria. They represent structures

2 2 Sumuya Borjigin with interesting connectivity properties in directed weighted networks. Revealing the underlying patter-based cluster structure of directed weighted networks has become a crucial and interdisciplinary topic with a plethora of relevant application domains, including computer science, electronic commerce, economics and finance. There are different types of pattern-based cluster structures in directed weighted networks, two of interesting and commonly used patterns for clustering directed weighted networks are the cases of citation-based clustering and flow-based clustering [1]. Spectral clustering method either has advantages of effectiveness and easy implementation or an optimization framework for the modularity function. For this reason, spectral clustering method has been used to detect pattern-based clustering structure of directed weighted networks. Fan Chung considered Laplacians for directed graphs and examined their eigenvalues and introduced a notion of a circulation in a directed graph and its connection with the Rayleigh quotient, then defined a Cheeger constant and established the Cheeger inequality for directed graphs. These relations can be used to deal with various problems that often arise in the study of non-reversible Markov chains including bounding the rate of convergence and deriving comparison theorems [2]. David Gleich examined the generalization of the Laplacian of a graph due to Fan Chung. He showed that Fan Chungs generalization reduces to examining one particular symmetrization of the adjacency matrix for a directed graph. From this result, the directed Cheeger bounds trivially follow. Additionally, David Gleich implemented and examined the benefits of directed hierarchical spectral clustering empirically on a data set from Wikipedia. Finally, David Gleich examined a set of competing heuristic methods on the same data set [3]. Perrault-Joncas and Meila considered the problem of embedding directed graphs in Euclidean space while retaining directional information. He modeled the observedgraph as a sample from a manifold endowed with a vector field, and designed an algorithm that separates and recovers the features of this process: the geometry of the manifold, the data density and the vector field. The algorithm was motivated by their analysis of Laplacian-type operators and their continuous limit as generators of diffusions on a manifold. They illustrated the recovery algorithm on both artificially constructed and real data [4]. Bauer considered the normalized Laplace operator for directed graphs with positive and negative edge weights. This generalization of the normalized Laplace operator for undirected graphs was used to characterize directed acyclic graphs. Moreover, they identified certain structural properties of the underlying graph with extremal eigenvalues of the normalized Laplace operator. Bauer proved comparison theorems that establish a relationship between the eigenvalues of directed graphs and certain undirected graphs. This relationship was used to derive eigenvalue estimates for directed graphs. Finally Bauer introduced the concept of neighborhood graphs for directed graphs and use it to obtain further eigenvalue estimates [5]. Given a directed graph in which some of the nodes were labeled, Zhou, Schokopf and Hofmann investigated the question of how to exploit the link structure of the graph to infer the labels of the remaining unlabeled nodes. To that extent

3 Sumuya Borjigin 3 they propose a regularization framework for functions defined over nodes of a directed graph that forces the classification function to change slowly on densely linked subgraphs. A powerful, yet computationally simple classification algorithm was derived within the proposed framework. The experimental evaluation on real-world web classification problems demonstrated encouraging results that validate the approach [6]. Pentney and Meila [7], Huang, Zhu and Schuurmans [8], Meila and Pentney [9], Zhou and Burges [10], Chen, Liu and Tang [11], Mirzal and Furukawa [12], Mantrach et. al. [13], Li and Zhang [14], Satuluri and Parthasarathy [15], Long, Wu and Zhang [16], Rosvall and Bergstrom [17] and Lai, Lu and Nardini [18] also studied the pattern-based spectral clustering in the directed weighted network. In spite of the large number of papers on spectral clustering method for pattern-based cluster structures in directed weighted network, so far no systematic comparison among the existing algorithms has been published. This is why we set out to do the comparison here. We compare performance of the existing pattern-based spectral clustering algorithms in directed weighted network on several kinds of benchmark data sets. We hope to be able to find out each of these spectral clustering is suit for detecting what kinds of patterns, fail to discover what kinds of patterns, how the informative eigenvectors are chosen and ties between the cluster number with eigenvalues of the spectral clustering matrices. The paper is organized as follows. In Section 2, pattern-based spectral clustering algorithms in directed weighted network are given. In section 3, the data sets which contain pattern-based cluster structures are introduced. Section 4 is the experiments and Section 5 is conclusions. 2 Pattern-based Spectral Clustering Algorithms in Directed Weighted Network Similar to classical spectral clustering algorithms in undirected network, the pattern-based spectral clustering algorithms in directed weighted network mentioned here can be thought of as composed of the following stages: The first stage: Construct spectral clustering matrix based on asymmetric similarity matrix; The second stage: Map the original data set according to the informative eigenvectors of the spectral clustering matrix; The third stage: Cluster the mapped data set by a simple clustering algorithm. Main difference of the pattern-based spectral clustering algorithms in directed weighted network proposed in related works is the spectral clustering matrix constructed on the first stage. Different kinds of spectral clustering matrices lead to different kinds of partition results. We compare performance of 9 kinds of pattern-based spectral clustering algorithms in directed weighted network listed in Table 1. Most of the pattern-based spectral clustering algorithms in directed weighted network we selected are the

4 4 Sumuya Borjigin ones given in the related articles. For the integrity of our comparison, we also present some pattern-based spectral clustering algorithms in directed weighted network. Table 1. Pattern-based spectral clustering algorithms in directed weighted network Algorithm Reference Spectral clustering matrix 1 FC-1 [2] eye(size(a,1)) (Z A F + F A Z)/2 2 FC-2 [this paper] π (Z A F + F A Z)/2 3 FC-3 [this paper] (Z A F + F A Z)/2 4 FC-Com [2] π (π A + A π)/2 5 Random Walk [15] (π A + A π)/2 6 Bibliometric [15] (A A + A A)/2 7 Degree-Discounted [15] (F A F A F + F A F A F)/2 8 Asymmetric A 9 FC-Com+Bibliometric [this paper] π (π A + A π)/2 + (A A + A A)/2 In the Table 1, A is the asymmetric similarity matrix, diagonal elements of π are sum of the corresponding columns of A, for nonzero elements of π, Z = π 1/2, F = π 1/2. 3 data sets In this section, 9 kinds of pattern-based spectral clustering algorithms in directed weighted network listed in Table 1 were compared in 3 citation-based benchmark data sets and 3 flow-based benchmark data sets, the data sets were shown in figure 1 to figure 6, respectively. Fig. 1. Citation-based cluster: Cite-4 [1], [2], [15]. The data set in figure 1 forms two citation-based clusters. The most interesting point in this case is that the nodes of the graph that are clustered together

5 Sumuya Borjigin 5 Fig. 2. Citation-based cluster: Cite-8 [1], [2], [15]. Fig. 3. Citation-based cluster: Cite-Long [16].

6 6 Sumuya Borjigin do not have an edge between them. In this case, the data set consists of 2 clusters, The nodes 1 and 3 form a cluster, the nodes 2 and 4 form a cluster. Their similarity emanates from the co-citation event. Respectively the two nodes of the right cluster are pointed by the same group of nodes. Let us consider the case of a citation network where nodes correspond to scientific papers and a directed edge from paper 1 to paper 2 implies that the first paper cites the latter. Although papers 1 and 3 do not share an edge, they form a natural cluster since they both cite papers 2 and 4 and it is probable that they belong on the same scientific topic [1]. A similar example of citation-based cluster structure appears in the second data set in figure 2. In this case, the data set consists of 3 clusters. The nodes 4 and 5 form a cluster, since they have out-links to the same nodes 1, 2 and 3, while at the same time having in-links from the same group of nodes 6, 7 and 8. This structure constitutes a common situation in the context of directed graphs. In this case the clustering features correspond to the common neighbors in the graph and thus the nodes are clustered together if they share common neighbors [1]. The undirected version of the data set Cite-Long was constructed in the reference [16]. Here, we add direction to the data set to construct a directed network which contains citation-based cluster structure. The data set contains 4 clusters, nodes 1, 2, 3 and 4 form a cluster, nodes 5, 6, 7 and 8 for a cluster, nodes 9, 10, 11 and 12 form a cluster, nodes 13, 14, 15 and 16 form a cluster. Fig. 4. Flow-based cluster: Flow-16 [1], [17], [18].

7 Sumuya Borjigin 7 Fig. 5. Flow-based cluster: Flow-Small [1], [17], [18]. The main characteristicof the networks in figure 4 to 6 is that the edges form patterns of flow among nodes. In other words, the local interactions in the network combined with the edge directionality, induce a flow of information among the entities and therefore the clustering structure depends on how information flows. Then, a cluster or community in the network corresponds to a group of nodes where the flow is larger as compared to the flow outside the group. Assuming a user that conducts random walk on the graph, a flow-based community is a group of nodes where a random surfer is more likely to be trapped inside instead of moving out of the group [1]. The data set in figure 4 consists of 16 nodes, which can be partitioned into 4 clusters, and each cluster forms a circle structure. Nodes 1 4 form a cluster, nodes5 8formacluster,nodes9 12formacluster,nodes13 16formacluster. The data sets in figure 5 and in figure 6 are all consist of 36 nodes, which can be partitioned into 6 clusters, and each cluster forms a circle structure. Nodes 1 6 form a cluster, nodes 7 12 form a cluster, nodes form a cluster, nodes form a cluster, nodes form a cluster, nodes form a cluster. The difference between data sets Flow-Small and Flow-Large are the weights of the edges, weights of Flow-Large is significantly larger then corresponding weights of Flow-Small. 4 Experiments 4.1 Experimental Results Pattern-based cluster structures detection results in directed weighted networks were shown in the Table 2. In the Table 2, if the algorithm successfully reveals

8 8 Sumuya Borjigin Fig. 6. Flow-based cluster: Flow-Large [1], [17], [18]. corresponding pattern-based cluster structure, we mark, else we mark. The numbers corresponding to stand for informative eigenvetors. Table 2. Pattern-based cluster structures detection results in directed weighted networks algorithms & data sets Cite-4 Cite-8 Cite-Long Flow-16 Flow-Small Flow-Large 1 FC-1 (1:4) (1:6) 2 FC-2 3 FC-3 4 FC-Com (7:8) (1:4) (1:6) 5 Random-Walk (8) (13:16) 6 Bibliometric (3:4) (6:8) (13:16) 7 Degree-Discounted 8 Asymmetric (3:4) 9 FC-Com+Bibliometric (2&4) (6:8) (1:4) From Table 2 we find that spectral clustering method based on the matrices Bibliometric, Asymmetric and FC-Com+Bibliometric can successfully detect citation-based cluster structure of the data set Cite-4, spectral clustering method based on the matrices FC-Com, Random-Walk, Bibliometric and FC- Com+Bibliometric can successfully detect citation-based cluster structure of the data set Cite-8, spectral clustering method can be used to detect the citationbased cluster structure of the data set Cite-Long only based on the spectral clustering matrix Bibliometric. Spectral clustering method based on the matri-

9 Sumuya Borjigin 9 ces FC-1, FC-Com, Random-Walk and FC-Com+Bibliometric can successfully detect flow-based cluster structure of the data set Flow-16. Spectral clustering method based on the matrices FC-1 and FC-Com can successfully detect flowbased cluster structure of the data set Flow-Small. Spectral clustering method is incapable of detecting flow-based structure of the data set Flow-Large based on all the 9 kinds of spectral clustering matrices. 4.2 Discussions For the purpose of compare performance of the existing pattern-based spectral clustering method, we did experiments in two categories of benchmark data sets, the first category contains citation-based cluster structure, the second category contains flow-based cluster structure. We get the following conclusions: (1) None of the spectral clustering matrix can be used to detect patternbased cluster structure of all the benchmark data sets. FC-1 can be used to detect flow-based cluster structure of the data sets Flow-16 and Flow-Small, Bibliometric can be used to detect citation-based cluster structure of all the 3 data sets which contain citation-based cluster structure. FC-Com, Random-Walk and FC-Com+Bibliometric succeed in discovering both kinds of pattern-based cluster structures in some of the data sets simultaneously. Performance of the algorithms in two data sets Flow-Small and Flow-Large, which have similar flowbased cluster structure different edge weights, is different. (2) Informative eigenvectors of the spectral clustering methods mentioned above were selected manually. For the classical spectral clustering methods in undirected network, informative eigenvectors consist of the smallest k eigenvectors of the normalized Laplacians or the largest k eigenvectors of normalized similarity matrix or transition probability matrix, where k is the cluster number. But for the spectral clustering methods in directed weighted network, informative eigenvectors not always consist of the k largest or smallest eigenvectors. (3) Mechanism of the pattern-based spectral clustering methods in directed weighted network is not so clear compared to spectral clustering methods in undirected network. For the spectral clustering methods in undirected network, optimization theory or lumpability theorem are used to explain why the original data sets are mapped to high dimensional space in ideal case, matrix perturbation theory is used to explain why the algorithms extended to general case. Researchers made effort to discover mechanism of the spectral clustering methods in directed weighted network, however they haven t solve mechanism of pattern-based spectral clustering in directed weighted network. 5 Conclusions Based on above discussions, we conclude that it is need to construct a more common spectral clustering matrix, which can be used to detect pattern-based cluster structure of most of the data sets, for given cluster number, informative eigenvectors can be selected automatically. Moreover, finding mechanism of

10 10 Sumuya Borjigin the pattern-based spectral clustering methods in directed weighted network is another interesting topic. Acknowledgments. This work was partly supported by National Natural Science Foundation of China (Grant No ) and Natural Science Foundation of Inner Mongolia (Grant No. 2014BS0706). The authors also gratefully acknowledge the helpful comments and suggestions of the reviewers, which have improved the presentation. References 1. Malliaros, F., Vazirgiannis, M.: Clustering and Community Detection in Directed Networks: A Survey. Physics Reports, 533, (2013) 2. Chung, F.: Laplacians and the Cheeger Inequality for Directed Graphs. Annals of Combinatorics, 9, 1 19 (2005) 3. Gleich, D.: Hierarchical Directed Spectral Graph Partitioning. Information Networks, Stanford University, Final Project (2005) 4. Perrault-Joncas, D., Meila, M.: Directed Graph Embedding: an Algorithm Based on Continuous Limits of Laplacian Type Operators. In: Advances in Neural Information Processing Systems, pp.1 9. MIT Press, Cambridge (2011) 5. Bauer, F.: Normalized Graph Laplacians for Directed Graphs, Linear Algebra and its Applications, 436, (2012) 6. Zhou, D., Schokopf, B., Hofmann, T.: Semi-Supervised Learning on Directed Graphs. In: Advances in Neural Information Processing Systems, pp.1 8. MIT Press, Cambridge (2005) 7. Pentney, W., Meila, M., Spectral Clustering of Biological Sequence Data. In: Proceedings of the 20th National Conference on Artificial intelligence, pp AAAI Press, Pittsburgh (2005) 8. Huang, J., Zhu, T., Schuurmans, D., Web Communities Identification from Random Walks. In: Joint European Conference on Machine Learning and European Conference on Principles and Practice of Knowledge Discovery in Databases, pp Springer Verlag, Heidelberg (2006) 9. Meila, M., Pentney, W., Clustering by Weighted Cuts in Directed Graphs. In: Proceedings of the 2007 SIAM International Conference on Data Mining, pp SIAM, Philadelphia (2007) 10. Zhou, D., Burges, C., Spectral Clustering and Transductive Learning with Multiple Views. In: Proceedings of the 24th International Conference on Machine learning, pp ACM Press, New York (2007) 11. Chen, M., Liu, J., Tang, X., Clustering via Ravndom Walk Hitting Time on Directed Graphs. In: Proceedings of the 20th National Conference on Artificial intelligence, pp AAAI Press, Pittsburgh (2008) 12. Mirzal, A., Furukawa, M., Eigenvectors for Clustering: Unipartite, Bipartite, and Directed Graph Cases. In: 2010 International Conference on Electronics and Information Engineering, pp IEEE Press, Piscataway (2010) 13. Mantrach, A., Zeebroeck, N., Francq, P., Shimbo, M., Bersini, H., Saerens, M.: Semi-Supervised Classification and Betweenness Computation on Large, Sparse, Directed Graphs. Pattern Recognition, 44, (2011)

11 Sumuya Borjigin Li, Y., Zhang, Z.: Digraph Laplacian and the Degree of Asymmetry. Internet Mathematics, 8, (2012) 15. Satuluri, V., Parthasarathy, S.: Symmetrizations for Clustering Directed Graphs. In: Proceedings of the 14th International Conference on Extending Database Technology, pp ACM Press, New York (2011) 16. Long, B., Wu, X., Zhang, Z.: Community Learning by Graph Approximation. In: Seventh IEEE International Conference on Data Mining, pp IEEE Press, Piscataway (2007) 17. Rosvall, M., Bergstrom, C.: Maps of Random Walks on Complex Networks Reveal Community Structure. Proceedings of the National Academy of the Sciences of the United States of America, 105, (2008) 18. Lai, D., Lu, H., Nardini, C.: Finding Communities in Directed Networks by PageRank Random Walk Induced Network Embedding, Physica A: Statistical Mechanics and its Applications, 389, (2010)

arxiv: v1 [cs.si] 5 Aug 2013

arxiv: v1 [cs.si] 5 Aug 2013 Clustering and Community Detection in Directed Networks: A Survey Fragkiskos D. Malliaros a,, Michalis Vazirgiannis a,b a Computer Science Laboratory, École Polytechnique, 91120 Palaiseau, France b Department

More information

Relative Constraints as Features

Relative Constraints as Features Relative Constraints as Features Piotr Lasek 1 and Krzysztof Lasek 2 1 Chair of Computer Science, University of Rzeszow, ul. Prof. Pigonia 1, 35-510 Rzeszow, Poland, lasek@ur.edu.pl 2 Institute of Computer

More information

Social Network Analysis

Social Network Analysis Social Network Analysis Mathematics of Networks Manar Mohaisen Department of EEC Engineering Adjacency matrix Network types Edge list Adjacency list Graph representation 2 Adjacency matrix Adjacency matrix

More information

ROBERTO BATTITI, MAURO BRUNATO. The LION Way: Machine Learning plus Intelligent Optimization. LIONlab, University of Trento, Italy, Apr 2015

ROBERTO BATTITI, MAURO BRUNATO. The LION Way: Machine Learning plus Intelligent Optimization. LIONlab, University of Trento, Italy, Apr 2015 ROBERTO BATTITI, MAURO BRUNATO. The LION Way: Machine Learning plus Intelligent Optimization. LIONlab, University of Trento, Italy, Apr 2015 http://intelligentoptimization.org/lionbook Roberto Battiti

More information

My favorite application using eigenvalues: partitioning and community detection in social networks

My favorite application using eigenvalues: partitioning and community detection in social networks My favorite application using eigenvalues: partitioning and community detection in social networks Will Hobbs February 17, 2013 Abstract Social networks are often organized into families, friendship groups,

More information

CSCI-B609: A Theorist s Toolkit, Fall 2016 Sept. 6, Firstly let s consider a real world problem: community detection.

CSCI-B609: A Theorist s Toolkit, Fall 2016 Sept. 6, Firstly let s consider a real world problem: community detection. CSCI-B609: A Theorist s Toolkit, Fall 016 Sept. 6, 016 Lecture 03: The Sparsest Cut Problem and Cheeger s Inequality Lecturer: Yuan Zhou Scribe: Xuan Dong We will continue studying the spectral graph theory

More information

Social-Network Graphs

Social-Network Graphs Social-Network Graphs Mining Social Networks Facebook, Google+, Twitter Email Networks, Collaboration Networks Identify communities Similar to clustering Communities usually overlap Identify similarities

More information

Clustering. SC4/SM4 Data Mining and Machine Learning, Hilary Term 2017 Dino Sejdinovic

Clustering. SC4/SM4 Data Mining and Machine Learning, Hilary Term 2017 Dino Sejdinovic Clustering SC4/SM4 Data Mining and Machine Learning, Hilary Term 2017 Dino Sejdinovic Clustering is one of the fundamental and ubiquitous tasks in exploratory data analysis a first intuition about the

More information

Alessandro Del Ponte, Weijia Ran PAD 637 Week 3 Summary January 31, Wasserman and Faust, Chapter 3: Notation for Social Network Data

Alessandro Del Ponte, Weijia Ran PAD 637 Week 3 Summary January 31, Wasserman and Faust, Chapter 3: Notation for Social Network Data Wasserman and Faust, Chapter 3: Notation for Social Network Data Three different network notational schemes Graph theoretic: the most useful for centrality and prestige methods, cohesive subgroup ideas,

More information

Mining Social Network Graphs

Mining Social Network Graphs Mining Social Network Graphs Analysis of Large Graphs: Community Detection Rafael Ferreira da Silva rafsilva@isi.edu http://rafaelsilva.com Note to other teachers and users of these slides: We would be

More information

An Improved KNN Classification Algorithm based on Sampling

An Improved KNN Classification Algorithm based on Sampling International Conference on Advances in Materials, Machinery, Electrical Engineering (AMMEE 017) An Improved KNN Classification Algorithm based on Sampling Zhiwei Cheng1, a, Caisen Chen1, b, Xuehuan Qiu1,

More information

Spectral Clustering X I AO ZE N G + E L HA M TA BA S SI CS E CL A S S P R ESENTATION MA RCH 1 6,

Spectral Clustering X I AO ZE N G + E L HA M TA BA S SI CS E CL A S S P R ESENTATION MA RCH 1 6, Spectral Clustering XIAO ZENG + ELHAM TABASSI CSE 902 CLASS PRESENTATION MARCH 16, 2017 1 Presentation based on 1. Von Luxburg, Ulrike. "A tutorial on spectral clustering." Statistics and computing 17.4

More information

The Un-normalized Graph p-laplacian based Semi-supervised Learning Method and Speech Recognition Problem

The Un-normalized Graph p-laplacian based Semi-supervised Learning Method and Speech Recognition Problem Int. J. Advance Soft Compu. Appl, Vol. 9, No. 1, March 2017 ISSN 2074-8523 The Un-normalized Graph p-laplacian based Semi-supervised Learning Method and Speech Recognition Problem Loc Tran 1 and Linh Tran

More information

Introduction to Machine Learning

Introduction to Machine Learning Introduction to Machine Learning Clustering Varun Chandola Computer Science & Engineering State University of New York at Buffalo Buffalo, NY, USA chandola@buffalo.edu Chandola@UB CSE 474/574 1 / 19 Outline

More information

Network community detection with edge classifiers trained on LFR graphs

Network community detection with edge classifiers trained on LFR graphs Network community detection with edge classifiers trained on LFR graphs Twan van Laarhoven and Elena Marchiori Department of Computer Science, Radboud University Nijmegen, The Netherlands Abstract. Graphs

More information

Clustering. Informal goal. General types of clustering. Applications: Clustering in information search and analysis. Example applications in search

Clustering. Informal goal. General types of clustering. Applications: Clustering in information search and analysis. Example applications in search Informal goal Clustering Given set of objects and measure of similarity between them, group similar objects together What mean by similar? What is good grouping? Computation time / quality tradeoff 1 2

More information

SGN (4 cr) Chapter 11

SGN (4 cr) Chapter 11 SGN-41006 (4 cr) Chapter 11 Clustering Jussi Tohka & Jari Niemi Department of Signal Processing Tampere University of Technology February 25, 2014 J. Tohka & J. Niemi (TUT-SGN) SGN-41006 (4 cr) Chapter

More information

Mining Web Data. Lijun Zhang

Mining Web Data. Lijun Zhang Mining Web Data Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Introduction Web Crawling and Resource Discovery Search Engine Indexing and Query Processing Ranking Algorithms Recommender Systems

More information

Community Detection. Community

Community Detection. Community Community Detection Community In social sciences: Community is formed by individuals such that those within a group interact with each other more frequently than with those outside the group a.k.a. group,

More information

Semi-supervised Data Representation via Affinity Graph Learning

Semi-supervised Data Representation via Affinity Graph Learning 1 Semi-supervised Data Representation via Affinity Graph Learning Weiya Ren 1 1 College of Information System and Management, National University of Defense Technology, Changsha, Hunan, P.R China, 410073

More information

The Randomized Shortest Path model in a nutshell

The Randomized Shortest Path model in a nutshell Panzacchi M, Van Moorter B, Strand O, Saerens M, Kivimäki I, Cassady St.Clair C., Herfindal I, Boitani L. (2015) Predicting the continuum between corridors and barriers to animal movements using Step Selection

More information

A Graph Clustering Algorithm Based on Minimum and Normalized Cut

A Graph Clustering Algorithm Based on Minimum and Normalized Cut A Graph Clustering Algorithm Based on Minimum and Normalized Cut Jiabing Wang 1, Hong Peng 1, Jingsong Hu 1, and Chuangxin Yang 1, 1 School of Computer Science and Engineering, South China University of

More information

An ICA-Based Multivariate Discretization Algorithm

An ICA-Based Multivariate Discretization Algorithm An ICA-Based Multivariate Discretization Algorithm Ye Kang 1,2, Shanshan Wang 1,2, Xiaoyan Liu 1, Hokyin Lai 1, Huaiqing Wang 1, and Baiqi Miao 2 1 Department of Information Systems, City University of

More information

A Note on Vertex Arboricity of Toroidal Graphs without 7-Cycles 1

A Note on Vertex Arboricity of Toroidal Graphs without 7-Cycles 1 International Mathematical Forum, Vol. 11, 016, no. 14, 679-686 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.1988/imf.016.667 A Note on Vertex Arboricity of Toroidal Graphs without 7-Cycles 1 Haihui

More information

Time Series Clustering Ensemble Algorithm Based on Locality Preserving Projection

Time Series Clustering Ensemble Algorithm Based on Locality Preserving Projection Based on Locality Preserving Projection 2 Information & Technology College, Hebei University of Economics & Business, 05006 Shijiazhuang, China E-mail: 92475577@qq.com Xiaoqing Weng Information & Technology

More information

EFFICIENT ATTRIBUTE REDUCTION ALGORITHM

EFFICIENT ATTRIBUTE REDUCTION ALGORITHM EFFICIENT ATTRIBUTE REDUCTION ALGORITHM Zhongzhi Shi, Shaohui Liu, Zheng Zheng Institute Of Computing Technology,Chinese Academy of Sciences, Beijing, China Abstract: Key words: Efficiency of algorithms

More information

Locality Preserving Projections (LPP) Abstract

Locality Preserving Projections (LPP) Abstract Locality Preserving Projections (LPP) Xiaofei He Partha Niyogi Computer Science Department Computer Science Department The University of Chicago The University of Chicago Chicago, IL 60615 Chicago, IL

More information

Extracting Information from Complex Networks

Extracting Information from Complex Networks Extracting Information from Complex Networks 1 Complex Networks Networks that arise from modeling complex systems: relationships Social networks Biological networks Distinguish from random networks uniform

More information

Centralities (4) By: Ralucca Gera, NPS. Excellence Through Knowledge

Centralities (4) By: Ralucca Gera, NPS. Excellence Through Knowledge Centralities (4) By: Ralucca Gera, NPS Excellence Through Knowledge Some slide from last week that we didn t talk about in class: 2 PageRank algorithm Eigenvector centrality: i s Rank score is the sum

More information

Dynamically Motivated Models for Multiplex Networks 1

Dynamically Motivated Models for Multiplex Networks 1 Introduction Dynamically Motivated Models for Multiplex Networks 1 Daryl DeFord Dartmouth College Department of Mathematics Santa Fe institute Inference on Networks: Algorithms, Phase Transitions, New

More information

Visual Representations for Machine Learning

Visual Representations for Machine Learning Visual Representations for Machine Learning Spectral Clustering and Channel Representations Lecture 1 Spectral Clustering: introduction and confusion Michael Felsberg Klas Nordberg The Spectral Clustering

More information

SPECTRAL SPARSIFICATION IN SPECTRAL CLUSTERING

SPECTRAL SPARSIFICATION IN SPECTRAL CLUSTERING SPECTRAL SPARSIFICATION IN SPECTRAL CLUSTERING Alireza Chakeri, Hamidreza Farhidzadeh, Lawrence O. Hall Department of Computer Science and Engineering College of Engineering University of South Florida

More information

Loopy Belief Propagation

Loopy Belief Propagation Loopy Belief Propagation Research Exam Kristin Branson September 29, 2003 Loopy Belief Propagation p.1/73 Problem Formalization Reasoning about any real-world problem requires assumptions about the structure

More information

An Introduction to Graph Theory

An Introduction to Graph Theory An Introduction to Graph Theory CIS008-2 Logic and Foundations of Mathematics David Goodwin david.goodwin@perisic.com 12:00, Friday 17 th February 2012 Outline 1 Graphs 2 Paths and cycles 3 Graphs and

More information

Clustering Object-Oriented Software Systems using Spectral Graph Partitioning

Clustering Object-Oriented Software Systems using Spectral Graph Partitioning Clustering Object-Oriented Software Systems using Spectral Graph Partitioning Spiros Xanthos University of Illinois at Urbana-Champaign 0 North Goodwin Urbana, IL 680 xanthos@cs.uiuc.edu Abstract In this

More information

The Gene Modular Detection of Random Boolean Networks by Dynamic Characteristics Analysis

The Gene Modular Detection of Random Boolean Networks by Dynamic Characteristics Analysis Journal of Materials, Processing and Design (2017) Vol. 1, Number 1 Clausius Scientific Press, Canada The Gene Modular Detection of Random Boolean Networks by Dynamic Characteristics Analysis Xueyi Bai1,a,

More information

TRANSDUCTIVE LINK SPAM DETECTION

TRANSDUCTIVE LINK SPAM DETECTION TRANSDUCTIVE LINK SPAM DETECTION Denny Zhou Microsoft Research http://research.microsoft.com/~denzho Joint work with Chris Burges and Tao Tao Presenter: Krysta Svore Link spam detection problem Classification

More information

A Study of Graph Spectra for Comparing Graphs

A Study of Graph Spectra for Comparing Graphs A Study of Graph Spectra for Comparing Graphs Ping Zhu and Richard C. Wilson Computer Science Department University of York, UK Abstract The spectrum of a graph has been widely used in graph theory to

More information

Overview Citation. ML Introduction. Overview Schedule. ML Intro Dataset. Introduction to Semi-Supervised Learning Review 10/4/2010

Overview Citation. ML Introduction. Overview Schedule. ML Intro Dataset. Introduction to Semi-Supervised Learning Review 10/4/2010 INFORMATICS SEMINAR SEPT. 27 & OCT. 4, 2010 Introduction to Semi-Supervised Learning Review 2 Overview Citation X. Zhu and A.B. Goldberg, Introduction to Semi- Supervised Learning, Morgan & Claypool Publishers,

More information

Graph Theory Problem Ideas

Graph Theory Problem Ideas Graph Theory Problem Ideas April 15, 017 Note: Please let me know if you have a problem that you would like me to add to the list! 1 Classification Given a degree sequence d 1,...,d n, let N d1,...,d n

More information

Graph drawing in spectral layout

Graph drawing in spectral layout Graph drawing in spectral layout Maureen Gallagher Colleen Tygh John Urschel Ludmil Zikatanov Beginning: July 8, 203; Today is: October 2, 203 Introduction Our research focuses on the use of spectral graph

More information

A Construction of Cospectral Graphs

A Construction of Cospectral Graphs Project Number: MA-PRC-3271 A Construction of Cospectral Graphs A Major Qualifying Project Report submitted to the Faculty of the WORCESTER POLYTECHNIC INSTITUTE in partial fulfillment of the requirements

More information

Data fusion and multi-cue data matching using diffusion maps

Data fusion and multi-cue data matching using diffusion maps Data fusion and multi-cue data matching using diffusion maps Stéphane Lafon Collaborators: Raphy Coifman, Andreas Glaser, Yosi Keller, Steven Zucker (Yale University) Part of this work was supported by

More information

PageRank and related algorithms

PageRank and related algorithms PageRank and related algorithms PageRank and HITS Jacob Kogan Department of Mathematics and Statistics University of Maryland, Baltimore County Baltimore, Maryland 21250 kogan@umbc.edu May 15, 2006 Basic

More information

APPROXIMATE SPECTRAL LEARNING USING NYSTROM METHOD. Aleksandar Trokicić

APPROXIMATE SPECTRAL LEARNING USING NYSTROM METHOD. Aleksandar Trokicić FACTA UNIVERSITATIS (NIŠ) Ser. Math. Inform. Vol. 31, No 2 (2016), 569 578 APPROXIMATE SPECTRAL LEARNING USING NYSTROM METHOD Aleksandar Trokicić Abstract. Constrained clustering algorithms as an input

More information

Jure Leskovec, Cornell/Stanford University. Joint work with Kevin Lang, Anirban Dasgupta and Michael Mahoney, Yahoo! Research

Jure Leskovec, Cornell/Stanford University. Joint work with Kevin Lang, Anirban Dasgupta and Michael Mahoney, Yahoo! Research Jure Leskovec, Cornell/Stanford University Joint work with Kevin Lang, Anirban Dasgupta and Michael Mahoney, Yahoo! Research Network: an interaction graph: Nodes represent entities Edges represent interaction

More information

A Tutorial on the Probabilistic Framework for Centrality Analysis, Community Detection and Clustering

A Tutorial on the Probabilistic Framework for Centrality Analysis, Community Detection and Clustering A Tutorial on the Probabilistic Framework for Centrality Analysis, Community Detection and Clustering 1 Cheng-Shang Chang All rights reserved Abstract Analysis of networked data has received a lot of attention

More information

Research on Community Structure in Bus Transport Networks

Research on Community Structure in Bus Transport Networks Commun. Theor. Phys. (Beijing, China) 52 (2009) pp. 1025 1030 c Chinese Physical Society and IOP Publishing Ltd Vol. 52, No. 6, December 15, 2009 Research on Community Structure in Bus Transport Networks

More information

A ew Algorithm for Community Identification in Linked Data

A ew Algorithm for Community Identification in Linked Data A ew Algorithm for Community Identification in Linked Data Nacim Fateh Chikhi, Bernard Rothenburger, Nathalie Aussenac-Gilles Institut de Recherche en Informatique de Toulouse 118, route de Narbonne 31062

More information

Collaborative filtering based on a random walk model on a graph

Collaborative filtering based on a random walk model on a graph Collaborative filtering based on a random walk model on a graph Marco Saerens, Francois Fouss, Alain Pirotte, Luh Yen, Pierre Dupont (UCL) Jean-Michel Renders (Xerox Research Europe) Some recent methods:

More information

Introduction aux Systèmes Collaboratifs Multi-Agents

Introduction aux Systèmes Collaboratifs Multi-Agents M1 EEAII - Découverte de la Recherche (ViRob) Introduction aux Systèmes Collaboratifs Multi-Agents UPJV, Département EEA Fabio MORBIDI Laboratoire MIS Équipe Perception et Robotique E-mail: fabio.morbidi@u-picardie.fr

More information

Detecting and Analyzing Communities in Social Network Graphs for Targeted Marketing

Detecting and Analyzing Communities in Social Network Graphs for Targeted Marketing Detecting and Analyzing Communities in Social Network Graphs for Targeted Marketing Gautam Bhat, Rajeev Kumar Singh Department of Computer Science and Engineering Shiv Nadar University Gautam Buddh Nagar,

More information

Lecture 9 - Matrix Multiplication Equivalences and Spectral Graph Theory 1

Lecture 9 - Matrix Multiplication Equivalences and Spectral Graph Theory 1 CME 305: Discrete Mathematics and Algorithms Instructor: Professor Aaron Sidford (sidford@stanfordedu) February 6, 2018 Lecture 9 - Matrix Multiplication Equivalences and Spectral Graph Theory 1 In the

More information

Locality Preserving Projections (LPP) Abstract

Locality Preserving Projections (LPP) Abstract Locality Preserving Projections (LPP) Xiaofei He Partha Niyogi Computer Science Department Computer Science Department The University of Chicago The University of Chicago Chicago, IL 60615 Chicago, IL

More information

Semi supervised clustering for Text Clustering

Semi supervised clustering for Text Clustering Semi supervised clustering for Text Clustering N.Saranya 1 Assistant Professor, Department of Computer Science and Engineering, Sri Eshwar College of Engineering, Coimbatore 1 ABSTRACT: Based on clustering

More information

Spectral Methods for Network Community Detection and Graph Partitioning

Spectral Methods for Network Community Detection and Graph Partitioning Spectral Methods for Network Community Detection and Graph Partitioning M. E. J. Newman Department of Physics, University of Michigan Presenters: Yunqi Guo Xueyin Yu Yuanqi Li 1 Outline: Community Detection

More information

SGT Toolbox A Playground for Spectral Graph Theory

SGT Toolbox A Playground for Spectral Graph Theory Distributed Computing SGT Toolbox A Playground for Spectral Graph Theory Master s Thesis Michael König mikoenig@student.ethz.ch Distributed Computing Group Computer Engineering and Networks Laboratory

More information

Spectral Clustering on Handwritten Digits Database

Spectral Clustering on Handwritten Digits Database October 6, 2015 Spectral Clustering on Handwritten Digits Database Danielle dmiddle1@math.umd.edu Advisor: Kasso Okoudjou kasso@umd.edu Department of Mathematics University of Maryland- College Park Advance

More information

C-NBC: Neighborhood-Based Clustering with Constraints

C-NBC: Neighborhood-Based Clustering with Constraints C-NBC: Neighborhood-Based Clustering with Constraints Piotr Lasek Chair of Computer Science, University of Rzeszów ul. Prof. St. Pigonia 1, 35-310 Rzeszów, Poland lasek@ur.edu.pl Abstract. Clustering is

More information

On the Balanced Case of the Brualdi-Shen Conjecture on 4-Cycle Decompositions of Eulerian Bipartite Tournaments

On the Balanced Case of the Brualdi-Shen Conjecture on 4-Cycle Decompositions of Eulerian Bipartite Tournaments Electronic Journal of Graph Theory and Applications 3 (2) (2015), 191 196 On the Balanced Case of the Brualdi-Shen Conjecture on 4-Cycle Decompositions of Eulerian Bipartite Tournaments Rafael Del Valle

More information

Cluster Analysis. Mu-Chun Su. Department of Computer Science and Information Engineering National Central University 2003/3/11 1

Cluster Analysis. Mu-Chun Su. Department of Computer Science and Information Engineering National Central University 2003/3/11 1 Cluster Analysis Mu-Chun Su Department of Computer Science and Information Engineering National Central University 2003/3/11 1 Introduction Cluster analysis is the formal study of algorithms and methods

More information

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

CS246: Mining Massive Datasets Jure Leskovec, Stanford University CS246: Mining Massive Datasets Jure Leskovec, Stanford University http://cs246.stanford.edu HITS (Hypertext Induced Topic Selection) Is a measure of importance of pages or documents, similar to PageRank

More information

Explore Co-clustering on Job Applications. Qingyun Wan SUNet ID:qywan

Explore Co-clustering on Job Applications. Qingyun Wan SUNet ID:qywan Explore Co-clustering on Job Applications Qingyun Wan SUNet ID:qywan 1 Introduction In the job marketplace, the supply side represents the job postings posted by job posters and the demand side presents

More information

Clustering Algorithms for general similarity measures

Clustering Algorithms for general similarity measures Types of general clustering methods Clustering Algorithms for general similarity measures general similarity measure: specified by object X object similarity matrix 1 constructive algorithms agglomerative

More information

On Reduct Construction Algorithms

On Reduct Construction Algorithms 1 On Reduct Construction Algorithms Yiyu Yao 1, Yan Zhao 1 and Jue Wang 2 1 Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 {yyao, yanzhao}@cs.uregina.ca 2 Laboratory

More information

Ranking on Data Manifolds

Ranking on Data Manifolds Ranking on Data Manifolds Dengyong Zhou, Jason Weston, Arthur Gretton, Olivier Bousquet, and Bernhard Schölkopf Max Planck Institute for Biological Cybernetics, 72076 Tuebingen, Germany {firstname.secondname

More information

SOMSN: An Effective Self Organizing Map for Clustering of Social Networks

SOMSN: An Effective Self Organizing Map for Clustering of Social Networks SOMSN: An Effective Self Organizing Map for Clustering of Social Networks Fatemeh Ghaemmaghami Research Scholar, CSE and IT Dept. Shiraz University, Shiraz, Iran Reza Manouchehri Sarhadi Research Scholar,

More information

Graph Theory Review. January 30, Network Science Analytics Graph Theory Review 1

Graph Theory Review. January 30, Network Science Analytics Graph Theory Review 1 Graph Theory Review Gonzalo Mateos Dept. of ECE and Goergen Institute for Data Science University of Rochester gmateosb@ece.rochester.edu http://www.ece.rochester.edu/~gmateosb/ January 30, 2018 Network

More information

Spectral Clustering and Community Detection in Labeled Graphs

Spectral Clustering and Community Detection in Labeled Graphs Spectral Clustering and Community Detection in Labeled Graphs Brandon Fain, Stavros Sintos, Nisarg Raval Machine Learning (CompSci 571D / STA 561D) December 7, 2015 {btfain, nisarg, ssintos} at cs.duke.edu

More information

Community Detection: Comparison of State of the Art Algorithms

Community Detection: Comparison of State of the Art Algorithms Community Detection: Comparison of State of the Art Algorithms Josiane Mothe IRIT, UMR5505 CNRS & ESPE, Univ. de Toulouse Toulouse, France e-mail: josiane.mothe@irit.fr Karen Mkhitaryan Institute for Informatics

More information

Dynamic Clustering of Data with Modified K-Means Algorithm

Dynamic Clustering of Data with Modified K-Means Algorithm 2012 International Conference on Information and Computer Networks (ICICN 2012) IPCSIT vol. 27 (2012) (2012) IACSIT Press, Singapore Dynamic Clustering of Data with Modified K-Means Algorithm Ahamed Shafeeq

More information

Datasets Size: Effect on Clustering Results

Datasets Size: Effect on Clustering Results 1 Datasets Size: Effect on Clustering Results Adeleke Ajiboye 1, Ruzaini Abdullah Arshah 2, Hongwu Qin 3 Faculty of Computer Systems and Software Engineering Universiti Malaysia Pahang 1 {ajibraheem@live.com}

More information

COMP 4601 Hubs and Authorities

COMP 4601 Hubs and Authorities COMP 4601 Hubs and Authorities 1 Motivation PageRank gives a way to compute the value of a page given its position and connectivity w.r.t. the rest of the Web. Is it the only algorithm: No! It s just one

More information

Types of general clustering methods. Clustering Algorithms for general similarity measures. Similarity between clusters

Types of general clustering methods. Clustering Algorithms for general similarity measures. Similarity between clusters Types of general clustering methods Clustering Algorithms for general similarity measures agglomerative versus divisive algorithms agglomerative = bottom-up build up clusters from single objects divisive

More information

Using PageRank in Feature Selection

Using PageRank in Feature Selection Using PageRank in Feature Selection Dino Ienco, Rosa Meo, and Marco Botta Dipartimento di Informatica, Università di Torino, Italy {ienco,meo,botta}@di.unito.it Abstract. Feature selection is an important

More information

A Connection between Network Coding and. Convolutional Codes

A Connection between Network Coding and. Convolutional Codes A Connection between Network Coding and 1 Convolutional Codes Christina Fragouli, Emina Soljanin christina.fragouli@epfl.ch, emina@lucent.com Abstract The min-cut, max-flow theorem states that a source

More information

A Local Algorithm for Structure-Preserving Graph Cut

A Local Algorithm for Structure-Preserving Graph Cut A Local Algorithm for Structure-Preserving Graph Cut Presenter: Dawei Zhou Dawei Zhou* (ASU) Si Zhang (ASU) M. Yigit Yildirim (ASU) Scott Alcorn (Early Warning) Hanghang Tong (ASU) Hasan Davulcu (ASU)

More information

On the Relationships between Zero Forcing Numbers and Certain Graph Coverings

On the Relationships between Zero Forcing Numbers and Certain Graph Coverings On the Relationships between Zero Forcing Numbers and Certain Graph Coverings Fatemeh Alinaghipour Taklimi, Shaun Fallat 1,, Karen Meagher 2 Department of Mathematics and Statistics, University of Regina,

More information

MATH 567: Mathematical Techniques in Data

MATH 567: Mathematical Techniques in Data Supervised and unsupervised learning Supervised learning problems: MATH 567: Mathematical Techniques in Data (X, Y ) P (X, Y ). Data Science Clustering I is labelled (input/output) with joint density We

More information

Graph projection techniques for Self-Organizing Maps

Graph projection techniques for Self-Organizing Maps Graph projection techniques for Self-Organizing Maps Georg Pölzlbauer 1, Andreas Rauber 1, Michael Dittenbach 2 1- Vienna University of Technology - Department of Software Technology Favoritenstr. 9 11

More information

Web consists of web pages and hyperlinks between pages. A page receiving many links from other pages may be a hint of the authority of the page

Web consists of web pages and hyperlinks between pages. A page receiving many links from other pages may be a hint of the authority of the page Link Analysis Links Web consists of web pages and hyperlinks between pages A page receiving many links from other pages may be a hint of the authority of the page Links are also popular in some other information

More information

Emotion Classification

Emotion Classification Emotion Classification Shai Savir 038052395 Gil Sadeh 026511469 1. Abstract Automated facial expression recognition has received increased attention over the past two decades. Facial expressions convey

More information

Non Overlapping Communities

Non Overlapping Communities Non Overlapping Communities Davide Mottin, Konstantina Lazaridou HassoPlattner Institute Graph Mining course Winter Semester 2016 Acknowledgements Most of this lecture is taken from: http://web.stanford.edu/class/cs224w/slides

More information

An Edge-Swap Heuristic for Finding Dense Spanning Trees

An Edge-Swap Heuristic for Finding Dense Spanning Trees Theory and Applications of Graphs Volume 3 Issue 1 Article 1 2016 An Edge-Swap Heuristic for Finding Dense Spanning Trees Mustafa Ozen Bogazici University, mustafa.ozen@boun.edu.tr Hua Wang Georgia Southern

More information

Concept Tree Based Clustering Visualization with Shaded Similarity Matrices

Concept Tree Based Clustering Visualization with Shaded Similarity Matrices Syracuse University SURFACE School of Information Studies: Faculty Scholarship School of Information Studies (ischool) 12-2002 Concept Tree Based Clustering Visualization with Shaded Similarity Matrices

More information

CS224W: Analysis of Networks Jure Leskovec, Stanford University

CS224W: Analysis of Networks Jure Leskovec, Stanford University CS224W: Analysis of Networks Jure Leskovec, Stanford University http://cs224w.stanford.edu 11/13/17 Jure Leskovec, Stanford CS224W: Analysis of Networks, http://cs224w.stanford.edu 2 Observations Models

More information

Matching Algorithms. Proof. If a bipartite graph has a perfect matching, then it is easy to see that the right hand side is a necessary condition.

Matching Algorithms. Proof. If a bipartite graph has a perfect matching, then it is easy to see that the right hand side is a necessary condition. 18.433 Combinatorial Optimization Matching Algorithms September 9,14,16 Lecturer: Santosh Vempala Given a graph G = (V, E), a matching M is a set of edges with the property that no two of the edges have

More information

A Novel Approach for Minimum Spanning Tree Based Clustering Algorithm

A Novel Approach for Minimum Spanning Tree Based Clustering Algorithm IJCSES International Journal of Computer Sciences and Engineering Systems, Vol. 5, No. 2, April 2011 CSES International 2011 ISSN 0973-4406 A Novel Approach for Minimum Spanning Tree Based Clustering Algorithm

More information

A Fuzzy C-means Clustering Algorithm Based on Pseudo-nearest-neighbor Intervals for Incomplete Data

A Fuzzy C-means Clustering Algorithm Based on Pseudo-nearest-neighbor Intervals for Incomplete Data Journal of Computational Information Systems 11: 6 (2015) 2139 2146 Available at http://www.jofcis.com A Fuzzy C-means Clustering Algorithm Based on Pseudo-nearest-neighbor Intervals for Incomplete Data

More information

A TIGHT BOUND ON THE LENGTH OF ODD CYCLES IN THE INCOMPATIBILITY GRAPH OF A NON-C1P MATRIX

A TIGHT BOUND ON THE LENGTH OF ODD CYCLES IN THE INCOMPATIBILITY GRAPH OF A NON-C1P MATRIX A TIGHT BOUND ON THE LENGTH OF ODD CYCLES IN THE INCOMPATIBILITY GRAPH OF A NON-C1P MATRIX MEHRNOUSH MALEKESMAEILI, CEDRIC CHAUVE, AND TAMON STEPHEN Abstract. A binary matrix has the consecutive ones property

More information

A new edge selection heuristic for computing the Tutte polynomial of an undirected graph.

A new edge selection heuristic for computing the Tutte polynomial of an undirected graph. FPSAC 2012, Nagoya, Japan DMTCS proc. (subm.), by the authors, 1 12 A new edge selection heuristic for computing the Tutte polynomial of an undirected graph. Michael Monagan 1 1 Department of Mathematics,

More information

Clustering via Random Walk Hitting Time on Directed Graphs

Clustering via Random Walk Hitting Time on Directed Graphs Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence (8) Clustering via Random Walk Hitting Time on Directed Graphs Mo Chen Jianzhuang Liu Xiaoou Tang, Dept. of Information Engineering

More information

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University http://cs224w.stanford.edu How to organize the Web? First try: Human curated Web directories Yahoo, DMOZ, LookSmart Second

More information

Confidence Regions and Averaging for Trees

Confidence Regions and Averaging for Trees Confidence Regions and Averaging for Trees 1 Susan Holmes Statistics Department, Stanford and INRA- Biométrie, Montpellier,France susan@stat.stanford.edu http://www-stat.stanford.edu/~susan/ Joint Work

More information

Principal Coordinate Clustering

Principal Coordinate Clustering Principal Coordinate Clustering Ali Sekmen, Akram Aldroubi, Ahmet Bugra Koku, Keaton Hamm Department of Computer Science, Tennessee State University Department of Mathematics, Vanderbilt University Department

More information

A Data Classification Algorithm of Internet of Things Based on Neural Network

A Data Classification Algorithm of Internet of Things Based on Neural Network A Data Classification Algorithm of Internet of Things Based on Neural Network https://doi.org/10.3991/ijoe.v13i09.7587 Zhenjun Li Hunan Radio and TV University, Hunan, China 278060389@qq.com Abstract To

More information

Advanced visualization techniques for Self-Organizing Maps with graph-based methods

Advanced visualization techniques for Self-Organizing Maps with graph-based methods Advanced visualization techniques for Self-Organizing Maps with graph-based methods Georg Pölzlbauer 1, Andreas Rauber 1, and Michael Dittenbach 2 1 Department of Software Technology Vienna University

More information

Using PageRank in Feature Selection

Using PageRank in Feature Selection Using PageRank in Feature Selection Dino Ienco, Rosa Meo, and Marco Botta Dipartimento di Informatica, Università di Torino, Italy fienco,meo,bottag@di.unito.it Abstract. Feature selection is an important

More information

AN APPROACH FOR LOAD BALANCING FOR SIMULATION IN HETEROGENEOUS DISTRIBUTED SYSTEMS USING SIMULATION DATA MINING

AN APPROACH FOR LOAD BALANCING FOR SIMULATION IN HETEROGENEOUS DISTRIBUTED SYSTEMS USING SIMULATION DATA MINING AN APPROACH FOR LOAD BALANCING FOR SIMULATION IN HETEROGENEOUS DISTRIBUTED SYSTEMS USING SIMULATION DATA MINING Irina Bernst, Patrick Bouillon, Jörg Frochte *, Christof Kaufmann Dept. of Electrical Engineering

More information

Outlier Detection Using Unsupervised and Semi-Supervised Technique on High Dimensional Data

Outlier Detection Using Unsupervised and Semi-Supervised Technique on High Dimensional Data Outlier Detection Using Unsupervised and Semi-Supervised Technique on High Dimensional Data Ms. Gayatri Attarde 1, Prof. Aarti Deshpande 2 M. E Student, Department of Computer Engineering, GHRCCEM, University

More information