A Novel Approach for Minimum Spanning Tree Based Clustering Algorithm

Size: px
Start display at page:

Download "A Novel Approach for Minimum Spanning Tree Based Clustering Algorithm"

Transcription

1 IJCSES International Journal of Computer Sciences and Engineering Systems, Vol. 5, No. 2, April 2011 CSES International 2011 ISSN A Novel Approach for Minimum Spanning Tree Based Clustering Algorithm 1 Bhaskar Adepu and 2 Kiran Kumar Bejjanki 1 Department of MCA, Kakatiya Institute of Technology & Science, Warangal, Andhra Pradesh, INDIA, bhaskar_adepu@yahoo.com; 2 kiran_b_kumar@yahoo.com Abstract: Clustering analysis has been an emerging research issue in Data Mining due to its variety of applications. In the recent years, it has become an essential tool for Gene expression analysis. Many clustering algorithms have been proposed so far. However each algorithm has its own advantages and disadvantages and cannot work for all real situations. The Minimum Spanning Tree (MST) based clustering algorithms have been widely used due to their ability to detect clusters with irregular boundaries. In this paper we present a clustering algorithm that is inspired by MST. In this algorithm, we propose a new method for construction of MST which reduces the time complexity compared with traditional MST construction methods. Key words: Clustering, Minimum Spanning Tree, Partitioning, Dissimilarity Matrix. 1. INTRODUCTION Clustering is the process of grouping the data objects into classes or clusters, so that objects within a cluster have high similarly in comparison to one another but are very dissimilar to objects in other clusters. Usually, the common properties are quantitatively evaluated by some measures of the optimality such as minimum intra-cluster distance or maximum inter-cluster distance. Clustering plays an important role in various fields including Pattern Recognition, Image Processing, Biological Data Analysis, Micro Aggregation, Mobile Communication, Medicine and Economics. Clustering is used to explore the hidden structure of modern large databases and many algorithms have been proposed in the literature. Because of the huge variety of the problems and data distribution, different techniques, such as hierarchical, partition, density and model based algorithms have been developed and no techniques are completely satisfactory for all the cases. With the recent advances of micro array technology, there has been tremendous growth of the micro array data. Identifying co-regulated genes to organize them into meaningful groups has been an important research in bioinformatics. Therefore, clustering analysis has become an essential and valuable Manuscript received May 25, 2010 Manuscript revised December 15, 2010 tool in micro array or gene expression data analysis [1]. Given a set of N data points, a minimum spanning tree is a spanning tree that connects all the data points either by a direct edge or by a path and has minimum total weight. The total weight is the sum of the weights of all the edges of the spanning tree. In MST based clustering algorithms, the weight for each edge is usually computed as Euclidean distance between the points connecting that edge. Minimum Spanning Tree (MST) based clustering algorithms allows us to overcome many of the problems faced by the classical clustering algorithms. Due to their ability to detect clusters with irregular boundaries, MST based clustering algorithms have been widely used in practice. Initially, Zhan[2] proposed MST based clustering algorithms. Later MST based clustering algorithms has been extensively studied in the fields of biological data analysis [3], image processing, pattern recognition [4] and outlier detection [5], [6]. Usually, MST based clustering algorithms[2] consists of three steps: (1) A minimum spanning tree is constructed (typically in quadratic time) using either the Prim s algorithm or the Kruskal s algorithm (2) The inconsistent edges are removed to get a set of connected components(clusters) and (3) Step (2) is repeated until some terminating condition is satisfied.

2 70 IJCSES International Journal of Computer Sciences and Engineering Systems, Vol. 5, No. 2, April 2011 In this paper, we proposed a new method for construction of MST which is based on partitioning technique. Our algorithm has no specific requirement of prior knowledge of some parameters like number of clusters required and the dimensionality of the datasets etc. The rest of the paper is organized as follows. In section 2, we introduce the necessary concepts of MST and review of existing work on MST-based clustering algorithms. We next present MST construction method and proposed algorithm in section 3. Finally, conclusions are made in section RELATED WORK 2.1. MST-based Clustering Algorithms After MST being constructed, the next step is to define an edge inconsistency measure so as to partition the tree into clusters. The simple edge inconsistency measure is the removal of longest edge candidates from the MST. So that k number of clusters are formed by removing ( k-1) inconsistent edges from the MST. The number of clusters k is given as an input parameter in many algorithms. The definition of the inconsistent edges and the development of the terminating condition are two major issues that have to be addressed in all MST-based clustering algorithms. In Zahn s original work [2], the inconsistent edges are those edges whose weights are significantly larger than the average weight of the nearby edges in the tree. The performance of this clustering algorithm is affected by the size of the nearby neighborhood. Five group clustering is shown in Figure 1. Figure 1: MST Representation of Five Group Clustering There exist other spanning tree based clustering algorithms that maximize or minimize the degrees of link of the vertices [7], which is computationally expensive. Grygorash et al. [9] proposed two MSTbased clustering algorithms called the Hierarchical Euclidean Distance based MST clustering algorithm (HEMST) and the Maximum Standard Deviation Reduction clustering algorithm (MSDR) respectively. As stated in [3] that MST based clustering algorithm does not depend on the detailed geometric structure of a cluster and therefore, it can overcome many of the problems faced by many clustering algorithms. The other graphical structures such as Relative Neighborhood Graph (RNG), Gabriel Graph (GG), and Delaunay Triangulation (DT) have also been used for cluster analysis. The relationship among these graphs can be seen as MST RNG GG DT [10]. In Density-oriented approach, Chowdbury and Murthy s MST based clustering technique[11] assumes that the boundary between any two clusters must belong to a valley region i.e., where the density of data points is the lowest compared to the neighboring regions and the inconsistency measure is based on finding such valley regions. Laszlo and Mukherjee present an MST-based clustering algorithm [12] that puts a constraint on the minimum cluster size rather than on the number of clusters. This algorithm is developed for micro aggregation problem, where the number of clusters in the data set can be figured out by the constraints of the problem itself. Vathy-Fogarassy et al. suggest three new cutting criteria for the MST-based clustering [4]. Their goal is to decrease the number of heuristically defined parameters of existing algorithms so as to decrease the influence of the user on the clustering results. Recently, Wang et al. [8] proposed a new approach called Divide and Conquer method to facilitate efficient MST-based clustering by using the idea of the Reverse Delete algorithm. 3. PROPOSED METHOD Our algorithm mainly consists of the following steps: 1. Representation of n-dimensional data points in the form of Dissimilarity Matrix (Object-by-Object Structure). 2. Construction of Spanning Tree (ST) using this Dissimilarity Matrix (DM). 3. Construction of MST from ST. 4. Generating Clusters using the MST Dissimilarity Matrix Representation Generally in most of the clustering algorithms data points can be represented as Data Matrix or Dissimilarity Matrix representation. In our method

3 A Novel Approach for Minimum Spanning Tree Based Clustering Algorithm 71 we represented data points in the form of Dissimilarity Matrix. It contains the distance values between the data points represented as lower or upper triangular matrix. The distance calculation measure we used is Euclidean distance ( ) d( i,) j = xi x j + xi x j + + xin x jn (1) where i, j are n-dimensional data points. Consider the sample data about the students as shown in Table 1. Table 1 Sample Data StudentID Age Marks (iii) Select an edge e such that only any one end point of e is in ST and dist(e) 0 (iv) Add edge e to the ST. The sample spanning tree for the above data by randomly selecting an edge {1, 2} using the above procedure is shown in Table 3 and Fig. 2. Edge Table 3 Spanning Tree Distance/Weight {1, 2} 6.0 {2, 3} 10.3 {1, 4} {4, 5} {1, 6} {6, 7} {1, 8} {1, 9} 9.06 {5, 10} The DM for the above sample data is shown in Table 2 by using Eq. (1). Table 2 Dissimilarity Matrix Construction of Spanning Tree Randomly choose one edge and add it to the ST. (ii) Repeat the following steps until number of edges in ST=N-1 where N is the number of data points. Figure 2: Spanning Tree 3.3. Construction of MST - Proposed Algorithm The basic idea of our proposed algorithm is as follows: Repeat 1. Select the longest distance edge e from the ST. 2. Remove an edge e from the ST, then the vertices in the ST are partitioned into two sets P1, P2. 3. Find an edge E such that the following conditions are satisfied. dist ( E ) < dist ( e ).

4 72 IJCSES International Journal of Computer Sciences and Engineering Systems, Vol. 5, No. 2, April 2011 (ii) One of the end points of E should be in one partition and the other end point should be in another partition. 4. if (edge E found) then Add edge E to the ST 5. else if (edge E not found) then Add edge e to the MST. Until ( number_of_edges in the MST = N-1); For example in the above ST the longest edge e ={6, 7} whose distance = By removing this edge from ST, vertices (data points) are partitioned into two sets P1 = {1, 2, 3, 4, 5, 6, 8, 9, 10} and P2 = {7}. Next we can find many edges satisfying the above two conditions from the DM. Those are {1-7, 2-7, 3-7, 4-7, 5-7, 8-7, 9-7, 10-7}. Select the minimum distance edge from these set of edges and add it to the MST. The final MST generated from the above process is depicted in Fig. 3. Figure 3: Minimum Spanning Tree 3.4. Generating Clusters using the MST Calculate the Mean(M), Standard Deviation(SD) of the edge weights in the MST (ii) Calculate Threshold(λ) = M + SD (iii) for each edge e MST if weight of e (w e ) λ remove e from MST end if end for This gives us disjoint sub trees {T 1, T 2, T 3 }. Each of the sub trees T i is a cluster. For the above MST Mean = , Standard Deviation = , Threshold = The Clusters formed are: Cluster1: {1, 2, 3, 4, 5, 6, 8, 9}, Cluster2: {7}, Cluster3: {10} 4. CONCLUSIONS In this paper, we have presented a new approach for the construction of minimum spanning tree, which takes less time compared to classical minimum spanning tree algorithms. Unlike the other algorithms such as k-means, our algorithm does not require any prior parametric values, like, number of clusters, initial cluster seeds etc. We have done experiments on some synthetic data sets namely Students, Employees data. Experimental results demonstrate that the proposed algorithm performs better than the k-means. REFERENCES [1] Daxin Jiang, Chun Tang and Aidong Zhang, Cluster Analysis for Gene Expression Data: A Survey, IEEE Transactions on Knowledge and Data Engineering, 16, 2004, [2] C. T. Zahn, Graph-Theriotical Methods for Detecting and Describing Getalt Clusters, IEEE Trans. Computers, 20(1), 1971, [3] Y. Xu, V. Olman and D. Xu, Clustering Gene Expression Data using a Graph-Theriotic Approach: An Application of Minimum Spanning Trees, Bioinformatics, 18(4), 2002, [4] A. Vathy-Fogarassy, A. Kiss and 1. Abonyi, Hybrid Minimal Spanning Tree and Mixture of Gaussians based Clustering Algorithm, Foundations of Information and Knowledge Systems, Springer, 2006, [5] J. Lin, D. Ye, C. Chen, and M. Gao, Minimum Spanning Tree-Based Spatial Outlier Mining and Its Applications, Lecture Notes in Computer Science, Springer-Verlag, Vol. 5009/2008, 2008,pp [6] M. F. Jiang, S. S. Tseng, and C. M. Su, Two-Phase Clustering Process for Outliers Detection, Pattern Recognition Letters, 22, 2001, [7] N. Paivinen, Clustering with a Minimum Spanning Tree of Scale- free-like Structure, Pattern Recognition Letters, Elsevier, 26(7), 2005, [8] Xiaochun Wang, Xiali Wang and D. Mitchell Wilkes, A Divide-and-conquer Approach for Minimum Spanning Tree-based Clustering, IEEE Transactions on Knowledge and Data Engg., 21, [9] O. Gryorash, Y. Zhou ands Z, Jorgenssn, Minimum Spanning tree-based Clustering Algorithms, Proc.

5 A Novel Approach for Minimum Spanning Tree Based Clustering Algorithm 73 IEEE Int l Conf. Tools with Artificial Intelligence, 2006, pp [10] A. K. Jain, Algorithms for Clustering Data, New Jersey: Prentice Hall, Englewood Cliffs, [11] N. Chowdhury and C.A. Murthy, Minimum Spanning Tree-Based Clustering Technique: Relationship with Bayes Classifier, Recognition, 30(11), 1997, Pattern [12] M. Laszlo and S. Mukherjee, Minimum Spanning Tree Partitioning Algorithm for Microaggregation, IEEE Trans. on Knowledge and Data Engg., 17(7), 2005,

Enhancing Clustering Results In Hierarchical Approach By Mvs Measures

Enhancing Clustering Results In Hierarchical Approach By Mvs Measures International Journal of Engineering Research and Development e-issn: 2278-067X, p-issn: 2278-800X, www.ijerd.com Volume 10, Issue 6 (June 2014), PP.25-30 Enhancing Clustering Results In Hierarchical Approach

More information

REDUCING RUNTIME VALUES IN MINIMUM SPANNING TREE BASED CLUSTERING BY VISUAL ACCESS TENDENCY

REDUCING RUNTIME VALUES IN MINIMUM SPANNING TREE BASED CLUSTERING BY VISUAL ACCESS TENDENCY REDUCING RUNTIME VALUES IN MINIMUM SPANNING TREE BASED CLUSTERING BY VISUAL ACCESS TENDENCY Dr. B. Eswara Reddy 1 and K. Rajendra Prasad 2 1 Associate Professor, Head of CSE Department, JNTUA College of

More information

Manuscript Click here to download Manuscript: Interval neutrosophic MST clustering algorithm and its an application to taxonomy.

Manuscript Click here to download Manuscript: Interval neutrosophic MST clustering algorithm and its an application to taxonomy. Manuscript Click here to download Manuscript: Interval neutrosophic MST clustering algorithm and its an application to taxonomy.pdf 0 0 0 0 0 INTERVAL NEUTROSOPHIC MST CLUSTERING ALGORITHM AND ITS AN APPLICATION

More information

Accelerating Unique Strategy for Centroid Priming in K-Means Clustering

Accelerating Unique Strategy for Centroid Priming in K-Means Clustering IJIRST International Journal for Innovative Research in Science & Technology Volume 3 Issue 07 December 2016 ISSN (online): 2349-6010 Accelerating Unique Strategy for Centroid Priming in K-Means Clustering

More information

Triclustering in Gene Expression Data Analysis: A Selected Survey

Triclustering in Gene Expression Data Analysis: A Selected Survey Triclustering in Gene Expression Data Analysis: A Selected Survey P. Mahanta, H. A. Ahmed Dept of Comp Sc and Engg Tezpur University Napaam -784028, India Email: priyakshi@tezu.ernet.in, hasin@tezu.ernet.in

More information

Dynamic Clustering of Data with Modified K-Means Algorithm

Dynamic Clustering of Data with Modified K-Means Algorithm 2012 International Conference on Information and Computer Networks (ICICN 2012) IPCSIT vol. 27 (2012) (2012) IACSIT Press, Singapore Dynamic Clustering of Data with Modified K-Means Algorithm Ahamed Shafeeq

More information

Efficiency of k-means and K-Medoids Algorithms for Clustering Arbitrary Data Points

Efficiency of k-means and K-Medoids Algorithms for Clustering Arbitrary Data Points Efficiency of k-means and K-Medoids Algorithms for Clustering Arbitrary Data Points Dr. T. VELMURUGAN Associate professor, PG and Research Department of Computer Science, D.G.Vaishnav College, Chennai-600106,

More information

International Journal of Advanced Research in Computer Science and Software Engineering

International Journal of Advanced Research in Computer Science and Software Engineering Volume 3, Issue 3, March 2013 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Special Issue:

More information

Outlier Detection Using Unsupervised and Semi-Supervised Technique on High Dimensional Data

Outlier Detection Using Unsupervised and Semi-Supervised Technique on High Dimensional Data Outlier Detection Using Unsupervised and Semi-Supervised Technique on High Dimensional Data Ms. Gayatri Attarde 1, Prof. Aarti Deshpande 2 M. E Student, Department of Computer Engineering, GHRCCEM, University

More information

Flexible-Hybrid Sequential Floating Search in Statistical Feature Selection

Flexible-Hybrid Sequential Floating Search in Statistical Feature Selection Flexible-Hybrid Sequential Floating Search in Statistical Feature Selection Petr Somol 1,2, Jana Novovičová 1,2, and Pavel Pudil 2,1 1 Dept. of Pattern Recognition, Institute of Information Theory and

More information

A Review of K-mean Algorithm

A Review of K-mean Algorithm A Review of K-mean Algorithm Jyoti Yadav #1, Monika Sharma *2 1 PG Student, CSE Department, M.D.U Rohtak, Haryana, India 2 Assistant Professor, IT Department, M.D.U Rohtak, Haryana, India Abstract Cluster

More information

Enhancing K-means Clustering Algorithm with Improved Initial Center

Enhancing K-means Clustering Algorithm with Improved Initial Center Enhancing K-means Clustering Algorithm with Improved Initial Center Madhu Yedla #1, Srinivasa Rao Pathakota #2, T M Srinivasa #3 # Department of Computer Science and Engineering, National Institute of

More information

A Naïve Soft Computing based Approach for Gene Expression Data Analysis

A Naïve Soft Computing based Approach for Gene Expression Data Analysis Available online at www.sciencedirect.com Procedia Engineering 38 (2012 ) 2124 2128 International Conference on Modeling Optimization and Computing (ICMOC-2012) A Naïve Soft Computing based Approach for

More information

Analyzing Outlier Detection Techniques with Hybrid Method

Analyzing Outlier Detection Techniques with Hybrid Method Analyzing Outlier Detection Techniques with Hybrid Method Shruti Aggarwal Assistant Professor Department of Computer Science and Engineering Sri Guru Granth Sahib World University. (SGGSWU) Fatehgarh Sahib,

More information

International Journal of Advance Research in Computer Science and Management Studies

International Journal of Advance Research in Computer Science and Management Studies Volume 2, Issue 11, November 2014 ISSN: 2321 7782 (Online) International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online

More information

CS 2750 Machine Learning. Lecture 19. Clustering. CS 2750 Machine Learning. Clustering. Groups together similar instances in the data sample

CS 2750 Machine Learning. Lecture 19. Clustering. CS 2750 Machine Learning. Clustering. Groups together similar instances in the data sample Lecture 9 Clustering Milos Hauskrecht milos@cs.pitt.edu 539 Sennott Square Clustering Groups together similar instances in the data sample Basic clustering problem: distribute data into k different groups

More information

Improving Latent Fingerprint Matching Performance by Orientation Field Estimation using Localized Dictionaries

Improving Latent Fingerprint Matching Performance by Orientation Field Estimation using Localized Dictionaries Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 11, November 2014,

More information

International Journal of Advanced Research in Computer Science and Software Engineering

International Journal of Advanced Research in Computer Science and Software Engineering Volume 2, Issue 10, October 2012 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com A New Efficient

More information

Clustering Algorithms for Data Stream

Clustering Algorithms for Data Stream Clustering Algorithms for Data Stream Karishma Nadhe 1, Prof. P. M. Chawan 2 1Student, Dept of CS & IT, VJTI Mumbai, Maharashtra, India 2Professor, Dept of CS & IT, VJTI Mumbai, Maharashtra, India Abstract:

More information

ISSN: (Online) Volume 4, Issue 1, January 2016 International Journal of Advance Research in Computer Science and Management Studies

ISSN: (Online) Volume 4, Issue 1, January 2016 International Journal of Advance Research in Computer Science and Management Studies ISSN: 2321-7782 (Online) Volume 4, Issue 1, January 2016 International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online

More information

FACE RECOGNITION FROM A SINGLE SAMPLE USING RLOG FILTER AND MANIFOLD ANALYSIS

FACE RECOGNITION FROM A SINGLE SAMPLE USING RLOG FILTER AND MANIFOLD ANALYSIS FACE RECOGNITION FROM A SINGLE SAMPLE USING RLOG FILTER AND MANIFOLD ANALYSIS Jaya Susan Edith. S 1 and A.Usha Ruby 2 1 Department of Computer Science and Engineering,CSI College of Engineering, 2 Research

More information

A NOVEL ALGORITHM FOR MINIMUM SPANNING CLUSTERING TREE

A NOVEL ALGORITHM FOR MINIMUM SPANNING CLUSTERING TREE A NOVEL ALGORITHM FOR MINIMUM SPANNING CLUSTERING TREE 1 S.JOHN PETER, 2 S.P.VICTOR 1. Assistant Professor 2. Associate Professor Department of Computer Science and Research Center St. Xavier s College,

More information

SOMSN: An Effective Self Organizing Map for Clustering of Social Networks

SOMSN: An Effective Self Organizing Map for Clustering of Social Networks SOMSN: An Effective Self Organizing Map for Clustering of Social Networks Fatemeh Ghaemmaghami Research Scholar, CSE and IT Dept. Shiraz University, Shiraz, Iran Reza Manouchehri Sarhadi Research Scholar,

More information

[Raghuvanshi* et al., 5(8): August, 2016] ISSN: IC Value: 3.00 Impact Factor: 4.116

[Raghuvanshi* et al., 5(8): August, 2016] ISSN: IC Value: 3.00 Impact Factor: 4.116 IJESRT INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY A SURVEY ON DOCUMENT CLUSTERING APPROACH FOR COMPUTER FORENSIC ANALYSIS Monika Raghuvanshi*, Rahul Patel Acropolise Institute

More information

AN IMPROVED K-MEANS CLUSTERING ALGORITHM FOR IMAGE SEGMENTATION

AN IMPROVED K-MEANS CLUSTERING ALGORITHM FOR IMAGE SEGMENTATION AN IMPROVED K-MEANS CLUSTERING ALGORITHM FOR IMAGE SEGMENTATION WILLIAM ROBSON SCHWARTZ University of Maryland, Department of Computer Science College Park, MD, USA, 20742-327, schwartz@cs.umd.edu RICARDO

More information

Unsupervised learning on Color Images

Unsupervised learning on Color Images Unsupervised learning on Color Images Sindhuja Vakkalagadda 1, Prasanthi Dhavala 2 1 Computer Science and Systems Engineering, Andhra University, AP, India 2 Computer Science and Systems Engineering, Andhra

More information

DS504/CS586: Big Data Analytics Big Data Clustering II

DS504/CS586: Big Data Analytics Big Data Clustering II Welcome to DS504/CS586: Big Data Analytics Big Data Clustering II Prof. Yanhua Li Time: 6pm 8:50pm Thu Location: AK 232 Fall 2016 More Discussions, Limitations v Center based clustering K-means BFR algorithm

More information

Improving the Efficiency of Fast Using Semantic Similarity Algorithm

Improving the Efficiency of Fast Using Semantic Similarity Algorithm International Journal of Scientific and Research Publications, Volume 4, Issue 1, January 2014 1 Improving the Efficiency of Fast Using Semantic Similarity Algorithm D.KARTHIKA 1, S. DIVAKAR 2 Final year

More information

Fast Approximate Minimum Spanning Tree Algorithm Based on K-Means

Fast Approximate Minimum Spanning Tree Algorithm Based on K-Means Fast Approximate Minimum Spanning Tree Algorithm Based on K-Means Caiming Zhong 1,2,3, Mikko Malinen 2, Duoqian Miao 1,andPasiFränti 2 1 Department of Computer Science and Technology, Tongji University,

More information

A FAST CLUSTERING-BASED FEATURE SUBSET SELECTION ALGORITHM

A FAST CLUSTERING-BASED FEATURE SUBSET SELECTION ALGORITHM A FAST CLUSTERING-BASED FEATURE SUBSET SELECTION ALGORITHM Akshay S. Agrawal 1, Prof. Sachin Bojewar 2 1 P.G. Scholar, Department of Computer Engg., ARMIET, Sapgaon, (India) 2 Associate Professor, VIT,

More information

A Review: Content Base Image Mining Technique for Image Retrieval Using Hybrid Clustering

A Review: Content Base Image Mining Technique for Image Retrieval Using Hybrid Clustering A Review: Content Base Image Mining Technique for Image Retrieval Using Hybrid Clustering Gurpreet Kaur M-Tech Student, Department of Computer Engineering, Yadawindra College of Engineering, Talwandi Sabo,

More information

I. INTRODUCTION II. RELATED WORK.

I. INTRODUCTION II. RELATED WORK. ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: A New Hybridized K-Means Clustering Based Outlier Detection Technique

More information

Keywords Clustering, Goals of clustering, clustering techniques, clustering algorithms.

Keywords Clustering, Goals of clustering, clustering techniques, clustering algorithms. Volume 3, Issue 5, May 2013 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com A Survey of Clustering

More information

Classification. Vladimir Curic. Centre for Image Analysis Swedish University of Agricultural Sciences Uppsala University

Classification. Vladimir Curic. Centre for Image Analysis Swedish University of Agricultural Sciences Uppsala University Classification Vladimir Curic Centre for Image Analysis Swedish University of Agricultural Sciences Uppsala University Outline An overview on classification Basics of classification How to choose appropriate

More information

CHAPTER 4 K-MEANS AND UCAM CLUSTERING ALGORITHM

CHAPTER 4 K-MEANS AND UCAM CLUSTERING ALGORITHM CHAPTER 4 K-MEANS AND UCAM CLUSTERING 4.1 Introduction ALGORITHM Clustering has been used in a number of applications such as engineering, biology, medicine and data mining. The most popular clustering

More information

Filtered Clustering Based on Local Outlier Factor in Data Mining

Filtered Clustering Based on Local Outlier Factor in Data Mining , pp.275-282 http://dx.doi.org/10.14257/ijdta.2016.9.5.28 Filtered Clustering Based on Local Outlier Factor in Data Mining 1 Vishal Bhatt, 2 Mradul Dhakar and 3 Brijesh Kumar Chaurasia 1,2,3 Deptt. of

More information

Text Data Pre-processing and Dimensionality Reduction Techniques for Document Clustering

Text Data Pre-processing and Dimensionality Reduction Techniques for Document Clustering Text Data Pre-processing and Dimensionality Reduction Techniques for Document Clustering A. Anil Kumar Dept of CSE Sri Sivani College of Engineering Srikakulam, India S.Chandrasekhar Dept of CSE Sri Sivani

More information

REMOVAL OF REDUNDANT AND IRRELEVANT DATA FROM TRAINING DATASETS USING SPEEDY FEATURE SELECTION METHOD

REMOVAL OF REDUNDANT AND IRRELEVANT DATA FROM TRAINING DATASETS USING SPEEDY FEATURE SELECTION METHOD Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology ISSN 2320 088X IMPACT FACTOR: 5.258 IJCSMC,

More information

CLUSTERING. CSE 634 Data Mining Prof. Anita Wasilewska TEAM 16

CLUSTERING. CSE 634 Data Mining Prof. Anita Wasilewska TEAM 16 CLUSTERING CSE 634 Data Mining Prof. Anita Wasilewska TEAM 16 1. K-medoids: REFERENCES https://www.coursera.org/learn/cluster-analysis/lecture/nj0sb/3-4-the-k-medoids-clustering-method https://anuradhasrinivas.files.wordpress.com/2013/04/lesson8-clustering.pdf

More information

6. Concluding Remarks

6. Concluding Remarks [8] K. J. Supowit, The relative neighborhood graph with an application to minimum spanning trees, Tech. Rept., Department of Computer Science, University of Illinois, Urbana-Champaign, August 1980, also

More information

A Novel Algorithm for Meta Similarity Clusters Using Minimum Spanning Tree

A Novel Algorithm for Meta Similarity Clusters Using Minimum Spanning Tree 254 IJCSNS International Journal of Computer Science and Network Security, VOL.10 No.2, February 2010 A Novel Algorithm for Meta Similarity Clusters Using Minimum Spanning Tree S.John Peter 1, S.P.Victor

More information

Including the Size of Regions in Image Segmentation by Region Based Graph

Including the Size of Regions in Image Segmentation by Region Based Graph International Journal of Emerging Engineering Research and Technology Volume 3, Issue 4, April 2015, PP 81-85 ISSN 2349-4395 (Print) & ISSN 2349-4409 (Online) Including the Size of Regions in Image Segmentation

More information

Data Mining Classification: Alternative Techniques. Lecture Notes for Chapter 4. Instance-Based Learning. Introduction to Data Mining, 2 nd Edition

Data Mining Classification: Alternative Techniques. Lecture Notes for Chapter 4. Instance-Based Learning. Introduction to Data Mining, 2 nd Edition Data Mining Classification: Alternative Techniques Lecture Notes for Chapter 4 Instance-Based Learning Introduction to Data Mining, 2 nd Edition by Tan, Steinbach, Karpatne, Kumar Instance Based Classifiers

More information

OUTLIER DETECTION FOR DYNAMIC DATA STREAMS USING WEIGHTED K-MEANS

OUTLIER DETECTION FOR DYNAMIC DATA STREAMS USING WEIGHTED K-MEANS OUTLIER DETECTION FOR DYNAMIC DATA STREAMS USING WEIGHTED K-MEANS DEEVI RADHA RANI Department of CSE, K L University, Vaddeswaram, Guntur, Andhra Pradesh, India. deevi_radharani@rediffmail.com NAVYA DHULIPALA

More information

An Unsupervised Technique for Statistical Data Analysis Using Data Mining

An Unsupervised Technique for Statistical Data Analysis Using Data Mining International Journal of Information Sciences and Application. ISSN 0974-2255 Volume 5, Number 1 (2013), pp. 11-20 International Research Publication House http://www.irphouse.com An Unsupervised Technique

More information

A Generalized Method to Solve Text-Based CAPTCHAs

A Generalized Method to Solve Text-Based CAPTCHAs A Generalized Method to Solve Text-Based CAPTCHAs Jason Ma, Bilal Badaoui, Emile Chamoun December 11, 2009 1 Abstract We present work in progress on the automated solving of text-based CAPTCHAs. Our method

More information

Obtaining Rough Set Approximation using MapReduce Technique in Data Mining

Obtaining Rough Set Approximation using MapReduce Technique in Data Mining Obtaining Rough Set Approximation using MapReduce Technique in Data Mining Varda Dhande 1, Dr. B. K. Sarkar 2 1 M.E II yr student, Dept of Computer Engg, P.V.P.I.T Collage of Engineering Pune, Maharashtra,

More information

NORMALIZATION INDEXING BASED ENHANCED GROUPING K-MEAN ALGORITHM

NORMALIZATION INDEXING BASED ENHANCED GROUPING K-MEAN ALGORITHM NORMALIZATION INDEXING BASED ENHANCED GROUPING K-MEAN ALGORITHM Saroj 1, Ms. Kavita2 1 Student of Masters of Technology, 2 Assistant Professor Department of Computer Science and Engineering JCDM college

More information

A NOVEL APPROACH TO TEST SUITE REDUCTION USING DATA MINING

A NOVEL APPROACH TO TEST SUITE REDUCTION USING DATA MINING A NOVEL APPROACH TO TEST SUITE REDUCTION USING DATA MINING KARTHEEK MUTHYALA Computer Science and Information Systems, Birla Institute of Technology and Science, Pilani, Rajasthan, India, kartheek0274@gmail.com

More information

Data Mining. Dr. Raed Ibraheem Hamed. University of Human Development, College of Science and Technology Department of Computer Science

Data Mining. Dr. Raed Ibraheem Hamed. University of Human Development, College of Science and Technology Department of Computer Science Data Mining Dr. Raed Ibraheem Hamed University of Human Development, College of Science and Technology Department of Computer Science 2016 201 Road map What is Cluster Analysis? Characteristics of Clustering

More information

Color Image Segmentation Using a Spatial K-Means Clustering Algorithm

Color Image Segmentation Using a Spatial K-Means Clustering Algorithm Color Image Segmentation Using a Spatial K-Means Clustering Algorithm Dana Elena Ilea and Paul F. Whelan Vision Systems Group School of Electronic Engineering Dublin City University Dublin 9, Ireland danailea@eeng.dcu.ie

More information

CAD SYSTEM FOR AUTOMATIC DETECTION OF BRAIN TUMOR THROUGH MRI BRAIN TUMOR DETECTION USING HPACO CHAPTER V BRAIN TUMOR DETECTION USING HPACO

CAD SYSTEM FOR AUTOMATIC DETECTION OF BRAIN TUMOR THROUGH MRI BRAIN TUMOR DETECTION USING HPACO CHAPTER V BRAIN TUMOR DETECTION USING HPACO CHAPTER V BRAIN TUMOR DETECTION USING HPACO 145 CHAPTER 5 DETECTION OF BRAIN TUMOR REGION USING HYBRID PARALLEL ANT COLONY OPTIMIZATION (HPACO) WITH FCM (FUZZY C MEANS) 5.1 PREFACE The Segmentation of

More information

Mine Blood Donors Information through Improved K- Means Clustering Bondu Venkateswarlu 1 and Prof G.S.V.Prasad Raju 2

Mine Blood Donors Information through Improved K- Means Clustering Bondu Venkateswarlu 1 and Prof G.S.V.Prasad Raju 2 Mine Blood Donors Information through Improved K- Means Clustering Bondu Venkateswarlu 1 and Prof G.S.V.Prasad Raju 2 1 Department of Computer Science and Systems Engineering, Andhra University, Visakhapatnam-

More information

Image Segmentation for Image Object Extraction

Image Segmentation for Image Object Extraction Image Segmentation for Image Object Extraction Rohit Kamble, Keshav Kaul # Computer Department, Vishwakarma Institute of Information Technology, Pune kamble.rohit@hotmail.com, kaul.keshav@gmail.com ABSTRACT

More information

Cluster Analysis. Mu-Chun Su. Department of Computer Science and Information Engineering National Central University 2003/3/11 1

Cluster Analysis. Mu-Chun Su. Department of Computer Science and Information Engineering National Central University 2003/3/11 1 Cluster Analysis Mu-Chun Su Department of Computer Science and Information Engineering National Central University 2003/3/11 1 Introduction Cluster analysis is the formal study of algorithms and methods

More information

Fast Efficient Clustering Algorithm for Balanced Data

Fast Efficient Clustering Algorithm for Balanced Data Vol. 5, No. 6, 214 Fast Efficient Clustering Algorithm for Balanced Data Adel A. Sewisy Faculty of Computer and Information, Assiut University M. H. Marghny Faculty of Computer and Information, Assiut

More information

Uniformity and Homogeneity Based Hierachical Clustering

Uniformity and Homogeneity Based Hierachical Clustering Uniformity and Homogeneity Based Hierachical Clustering Peter Bajcsy and Narendra Ahuja Becman Institute University of Illinois at Urbana-Champaign 45 N. Mathews Ave., Urbana, IL 181 E-mail: peter@stereo.ai.uiuc.edu

More information

DS504/CS586: Big Data Analytics Big Data Clustering II

DS504/CS586: Big Data Analytics Big Data Clustering II Welcome to DS504/CS586: Big Data Analytics Big Data Clustering II Prof. Yanhua Li Time: 6pm 8:50pm Thu Location: KH 116 Fall 2017 Updates: v Progress Presentation: Week 15: 11/30 v Next Week Office hours

More information

International Journal of Research in Advent Technology, Vol.7, No.3, March 2019 E-ISSN: Available online at

International Journal of Research in Advent Technology, Vol.7, No.3, March 2019 E-ISSN: Available online at Performance Evaluation of Ensemble Method Based Outlier Detection Algorithm Priya. M 1, M. Karthikeyan 2 Department of Computer and Information Science, Annamalai University, Annamalai Nagar, Tamil Nadu,

More information

A SURVEY ON CLUSTERING ALGORITHMS Ms. Kirti M. Patil 1 and Dr. Jagdish W. Bakal 2

A SURVEY ON CLUSTERING ALGORITHMS Ms. Kirti M. Patil 1 and Dr. Jagdish W. Bakal 2 Ms. Kirti M. Patil 1 and Dr. Jagdish W. Bakal 2 1 P.G. Scholar, Department of Computer Engineering, ARMIET, Mumbai University, India 2 Principal of, S.S.J.C.O.E, Mumbai University, India ABSTRACT Now a

More information

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK STUDY ON DIFFERENT SENTENCE LEVEL CLUSTERING ALGORITHMS FOR TEXT MINING RAKHI S.WAGHMARE,

More information

Unsupervised Learning and Clustering

Unsupervised Learning and Clustering Unsupervised Learning and Clustering Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr CS 551, Spring 2009 CS 551, Spring 2009 c 2009, Selim Aksoy (Bilkent University)

More information

DOI:: /ijarcsse/V7I1/0111

DOI:: /ijarcsse/V7I1/0111 Volume 7, Issue 1, January 2017 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com A Survey on

More information

DENSITY BASED AND PARTITION BASED CLUSTERING OF UNCERTAIN DATA BASED ON KL-DIVERGENCE SIMILARITY MEASURE

DENSITY BASED AND PARTITION BASED CLUSTERING OF UNCERTAIN DATA BASED ON KL-DIVERGENCE SIMILARITY MEASURE DENSITY BASED AND PARTITION BASED CLUSTERING OF UNCERTAIN DATA BASED ON KL-DIVERGENCE SIMILARITY MEASURE Sinu T S 1, Mr.Joseph George 1,2 Computer Science and Engineering, Adi Shankara Institute of Engineering

More information

MOSAIC: A Proximity Graph Approach for Agglomerative Clustering 1

MOSAIC: A Proximity Graph Approach for Agglomerative Clustering 1 MOSAIC: A Proximity Graph Approach for Agglomerative Clustering Jiyeon Choo, Rachsuda Jiamthapthaksin, Chun-sheng Chen, Oner Ulvi Celepcikay, Christian Giusti, and Christoph F. Eick Computer Science Department,

More information

CSE 5243 INTRO. TO DATA MINING

CSE 5243 INTRO. TO DATA MINING CSE 5243 INTRO. TO DATA MINING Cluster Analysis: Basic Concepts and Methods Huan Sun, CSE@The Ohio State University 09/25/2017 Slides adapted from UIUC CS412, Fall 2017, by Prof. Jiawei Han 2 Chapter 10.

More information

ISSN: (Online) Volume 3, Issue 9, September 2015 International Journal of Advance Research in Computer Science and Management Studies

ISSN: (Online) Volume 3, Issue 9, September 2015 International Journal of Advance Research in Computer Science and Management Studies ISSN: 2321-7782 (Online) Volume 3, Issue 9, September 2015 International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online

More information

Statistical Pattern Recognition

Statistical Pattern Recognition Statistical Pattern Recognition Features and Feature Selection Hamid R. Rabiee Jafar Muhammadi Spring 2012 http://ce.sharif.edu/courses/90-91/2/ce725-1/ Agenda Features and Patterns The Curse of Size and

More information

Distributed and clustering techniques for Multiprocessor Systems

Distributed and clustering techniques for Multiprocessor Systems www.ijcsi.org 199 Distributed and clustering techniques for Multiprocessor Systems Elsayed A. Sallam Associate Professor and Head of Computer and Control Engineering Department, Faculty of Engineering,

More information

APPLICATION OF MULTIPLE RANDOM CENTROID (MRC) BASED K-MEANS CLUSTERING ALGORITHM IN INSURANCE A REVIEW ARTICLE

APPLICATION OF MULTIPLE RANDOM CENTROID (MRC) BASED K-MEANS CLUSTERING ALGORITHM IN INSURANCE A REVIEW ARTICLE APPLICATION OF MULTIPLE RANDOM CENTROID (MRC) BASED K-MEANS CLUSTERING ALGORITHM IN INSURANCE A REVIEW ARTICLE Sundari NallamReddy, Samarandra Behera, Sanjeev Karadagi, Dr. Anantha Desik ABSTRACT: Tata

More information

Analysis and Extensions of Popular Clustering Algorithms

Analysis and Extensions of Popular Clustering Algorithms Analysis and Extensions of Popular Clustering Algorithms Renáta Iváncsy, Attila Babos, Csaba Legány Department of Automation and Applied Informatics and HAS-BUTE Control Research Group Budapest University

More information

A Novel Analysis of Clustering for Minimum Spanning Tree using Divide & Conquer Technique

A Novel Analysis of Clustering for Minimum Spanning Tree using Divide & Conquer Technique Global Journal of Computer Science and Technology Network, Web & Security Volume 13 Issue 14 Version 1.0 Year 2013 Type: Double Blind Peer Reviewed International Research Journal Publisher: Global Journals

More information

Mining Quantitative Association Rules on Overlapped Intervals

Mining Quantitative Association Rules on Overlapped Intervals Mining Quantitative Association Rules on Overlapped Intervals Qiang Tong 1,3, Baoping Yan 2, and Yuanchun Zhou 1,3 1 Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China {tongqiang,

More information

Finding Consistent Clusters in Data Partitions

Finding Consistent Clusters in Data Partitions Finding Consistent Clusters in Data Partitions Ana Fred Instituto de Telecomunicações Instituto Superior Técnico, Lisbon, Portugal afred@lx.it.pt Abstract. Given an arbitrary data set, to which no particular

More information

CHAPTER 7 A GRID CLUSTERING ALGORITHM

CHAPTER 7 A GRID CLUSTERING ALGORITHM CHAPTER 7 A GRID CLUSTERING ALGORITHM 7.1 Introduction The grid-based methods have widely been used over all the algorithms discussed in previous chapters due to their rapid clustering results. In this

More information

Normalization based K means Clustering Algorithm

Normalization based K means Clustering Algorithm Normalization based K means Clustering Algorithm Deepali Virmani 1,Shweta Taneja 2,Geetika Malhotra 3 1 Department of Computer Science,Bhagwan Parshuram Institute of Technology,New Delhi Email:deepalivirmani@gmail.com

More information

Performance Measure of Hard c-means,fuzzy c-means and Alternative c-means Algorithms

Performance Measure of Hard c-means,fuzzy c-means and Alternative c-means Algorithms Performance Measure of Hard c-means,fuzzy c-means and Alternative c-means Algorithms Binoda Nand Prasad*, Mohit Rathore**, Geeta Gupta***, Tarandeep Singh**** *Guru Gobind Singh Indraprastha University,

More information

A New Meta-heuristic Bat Inspired Classification Approach for Microarray Data

A New Meta-heuristic Bat Inspired Classification Approach for Microarray Data Available online at www.sciencedirect.com Procedia Technology 4 (2012 ) 802 806 C3IT-2012 A New Meta-heuristic Bat Inspired Classification Approach for Microarray Data Sashikala Mishra a, Kailash Shaw

More information

Density Based Clustering using Modified PSO based Neighbor Selection

Density Based Clustering using Modified PSO based Neighbor Selection Density Based Clustering using Modified PSO based Neighbor Selection K. Nafees Ahmed Research Scholar, Dept of Computer Science Jamal Mohamed College (Autonomous), Tiruchirappalli, India nafeesjmc@gmail.com

More information

An Enhanced K-Medoid Clustering Algorithm

An Enhanced K-Medoid Clustering Algorithm An Enhanced Clustering Algorithm Archna Kumari Science &Engineering kumara.archana14@gmail.com Pramod S. Nair Science &Engineering, pramodsnair@yahoo.com Sheetal Kumrawat Science &Engineering, sheetal2692@gmail.com

More information

Data Stream Clustering Using Micro Clusters

Data Stream Clustering Using Micro Clusters Data Stream Clustering Using Micro Clusters Ms. Jyoti.S.Pawar 1, Prof. N. M.Shahane. 2 1 PG student, Department of Computer Engineering K. K. W. I. E. E. R., Nashik Maharashtra, India 2 Assistant Professor

More information

Intro to Artificial Intelligence

Intro to Artificial Intelligence Intro to Artificial Intelligence Ahmed Sallam { Lecture 5: Machine Learning ://. } ://.. 2 Review Probabilistic inference Enumeration Approximate inference 3 Today What is machine learning? Supervised

More information

International Journal of Scientific Research & Engineering Trends Volume 4, Issue 6, Nov-Dec-2018, ISSN (Online): X

International Journal of Scientific Research & Engineering Trends Volume 4, Issue 6, Nov-Dec-2018, ISSN (Online): X Analysis about Classification Techniques on Categorical Data in Data Mining Assistant Professor P. Meena Department of Computer Science Adhiyaman Arts and Science College for Women Uthangarai, Krishnagiri,

More information

A Laplacian Based Novel Approach to Efficient Text Localization in Grayscale Images

A Laplacian Based Novel Approach to Efficient Text Localization in Grayscale Images A Laplacian Based Novel Approach to Efficient Text Localization in Grayscale Images Karthik Ram K.V & Mahantesh K Department of Electronics and Communication Engineering, SJB Institute of Technology, Bangalore,

More information

Graph-based High Level Motion Segmentation using Normalized Cuts

Graph-based High Level Motion Segmentation using Normalized Cuts Graph-based High Level Motion Segmentation using Normalized Cuts Sungju Yun, Anjin Park and Keechul Jung Abstract Motion capture devices have been utilized in producing several contents, such as movies

More information

Detection of Anomalies using Online Oversampling PCA

Detection of Anomalies using Online Oversampling PCA Detection of Anomalies using Online Oversampling PCA Miss Supriya A. Bagane, Prof. Sonali Patil Abstract Anomaly detection is the process of identifying unexpected behavior and it is an important research

More information

A NOVEL ALGORITHM FOR CENTRAL CLUSTER USING MINIMUM SPANNING TREE

A NOVEL ALGORITHM FOR CENTRAL CLUSTER USING MINIMUM SPANNING TREE A NOVEL ALGORITHM FOR CENTRAL CLUSTER USING MINIMUM SPANNING TREE S.JOHN PETER 1, S.P.VICTOR 2 1. Assistant Professor 2. Associate Professor Department of Computer Science and Research Center St. Xavier

More information

A Survey on DBSCAN Algorithm To Detect Cluster With Varied Density.

A Survey on DBSCAN Algorithm To Detect Cluster With Varied Density. A Survey on DBSCAN Algorithm To Detect Cluster With Varied Density. Amey K. Redkar, Prof. S.R. Todmal Abstract Density -based clustering methods are one of the important category of clustering methods

More information

Classification of Face Images for Gender, Age, Facial Expression, and Identity 1

Classification of Face Images for Gender, Age, Facial Expression, and Identity 1 Proc. Int. Conf. on Artificial Neural Networks (ICANN 05), Warsaw, LNCS 3696, vol. I, pp. 569-574, Springer Verlag 2005 Classification of Face Images for Gender, Age, Facial Expression, and Identity 1

More information

Statistical Pattern Recognition

Statistical Pattern Recognition Statistical Pattern Recognition Features and Feature Selection Hamid R. Rabiee Jafar Muhammadi Spring 2013 http://ce.sharif.edu/courses/91-92/2/ce725-1/ Agenda Features and Patterns The Curse of Size and

More information

Introduction to Mobile Robotics

Introduction to Mobile Robotics Introduction to Mobile Robotics Clustering Wolfram Burgard Cyrill Stachniss Giorgio Grisetti Maren Bennewitz Christian Plagemann Clustering (1) Common technique for statistical data analysis (machine learning,

More information

Lorentzian Distance Classifier for Multiple Features

Lorentzian Distance Classifier for Multiple Features Yerzhan Kerimbekov 1 and Hasan Şakir Bilge 2 1 Department of Computer Engineering, Ahmet Yesevi University, Ankara, Turkey 2 Department of Electrical-Electronics Engineering, Gazi University, Ankara, Turkey

More information

Efficient and Effective Clustering Methods for Spatial Data Mining. Raymond T. Ng, Jiawei Han

Efficient and Effective Clustering Methods for Spatial Data Mining. Raymond T. Ng, Jiawei Han Efficient and Effective Clustering Methods for Spatial Data Mining Raymond T. Ng, Jiawei Han 1 Overview Spatial Data Mining Clustering techniques CLARANS Spatial and Non-Spatial dominant CLARANS Observations

More information

A Graph Theoretic Approach to Image Database Retrieval

A Graph Theoretic Approach to Image Database Retrieval A Graph Theoretic Approach to Image Database Retrieval Selim Aksoy and Robert M. Haralick Intelligent Systems Laboratory Department of Electrical Engineering University of Washington, Seattle, WA 98195-2500

More information

Introduction to Computer Science

Introduction to Computer Science DM534 Introduction to Computer Science Clustering and Feature Spaces Richard Roettger: About Me Computer Science (Technical University of Munich and thesis at the ICSI at the University of California at

More information

IT-Dendrogram: A New Member of the In-Tree (IT) Clustering Family

IT-Dendrogram: A New Member of the In-Tree (IT) Clustering Family IT-Dendrogram: A New Member of the In-Tree (IT) Clustering Family Teng Qiu (qiutengcool@163.com) Yongjie Li (liyj@uestc.edu.cn) University of Electronic Science and Technology of China, Chengdu, China

More information

AUTOMATIC PATTERN CLASSIFICATION BY UNSUPERVISED LEARNING USING DIMENSIONALITY REDUCTION OF DATA WITH MIRRORING NEURAL NETWORKS

AUTOMATIC PATTERN CLASSIFICATION BY UNSUPERVISED LEARNING USING DIMENSIONALITY REDUCTION OF DATA WITH MIRRORING NEURAL NETWORKS AUTOMATIC PATTERN CLASSIFICATION BY UNSUPERVISED LEARNING USING DIMENSIONALITY REDUCTION OF DATA WITH MIRRORING NEURAL NETWORKS Name(s) Dasika Ratna Deepthi (1), G.R.Aditya Krishna (2) and K. Eswaran (3)

More information

Correlation Based Feature Selection with Irrelevant Feature Removal

Correlation Based Feature Selection with Irrelevant Feature Removal Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 4, April 2014,

More information

Redefining and Enhancing K-means Algorithm

Redefining and Enhancing K-means Algorithm Redefining and Enhancing K-means Algorithm Nimrat Kaur Sidhu 1, Rajneet kaur 2 Research Scholar, Department of Computer Science Engineering, SGGSWU, Fatehgarh Sahib, Punjab, India 1 Assistant Professor,

More information

A Keypoint Descriptor Inspired by Retinal Computation

A Keypoint Descriptor Inspired by Retinal Computation A Keypoint Descriptor Inspired by Retinal Computation Bongsoo Suh, Sungjoon Choi, Han Lee Stanford University {bssuh,sungjoonchoi,hanlee}@stanford.edu Abstract. The main goal of our project is to implement

More information