A Novel Adaptive Clustering Algorithm Using Validity Index in Remote Sensing Data

Similar documents
Nearest Clustering Algorithm for Satellite Image Classification in Remote Sensing Applications

SEGMENTATION OF SATELLITE IMAGES BY MODIFIED MOUNTAIN CLUSTERING

Change Detection in Remotely Sensed Images Based on Image Fusion and Fuzzy Clustering

AN IMPORTANT task in remote sensing applications is

BCD-Clustering algorithm for Breast Cancer Diagnosis

Web Based Fuzzy Clustering Analysis

Analysis of K-Means Clustering Based Image Segmentation

An Endowed Takagi-Sugeno-type Fuzzy Model for Classification Problems

TOWARDS NEW ESTIMATING INCREMENTAL DIMENSIONAL ALGORITHM (EIDA)

Engineering And Technology (affiliated to Anna University, Chennai) Tamil. Nadu, India

[Gidhane* et al., 5(7): July, 2016] ISSN: IC Value: 3.00 Impact Factor: 4.116

Spatial Information Based Image Classification Using Support Vector Machine

Keywords: clustering algorithms, unsupervised learning, cluster validity

I. INTRODUCTION II. RELATED WORK.

1. AN.Valliyappan 2. Assistant Professor/ECE VCET, Madurai VCET, Madurai

A Survey On Different Text Clustering Techniques For Patent Analysis

An Improved Fuzzy K-Medoids Clustering Algorithm with Optimized Number of Clusters

Gene Clustering & Classification

Colour Image Segmentation Using K-Means, Fuzzy C-Means and Density Based Clustering

Clustering Algorithms In Data Mining

Recognition of Changes in SAR Images Based on Gauss-Log Ratio and MRFFCM

HARD, SOFT AND FUZZY C-MEANS CLUSTERING TECHNIQUES FOR TEXT CLASSIFICATION

Using Categorical Attributes for Clustering

Implementation of Fuzzy C-Means and Possibilistic C-Means Clustering Algorithms, Cluster Tendency Analysis and Cluster Validation

DENSITY BASED AND PARTITION BASED CLUSTERING OF UNCERTAIN DATA BASED ON KL-DIVERGENCE SIMILARITY MEASURE

Rezaee proposed the V CW B index [3] which measures the compactness by computing the variance of samples within the cluster with respect to average sc

A New Approach to Discover Periodic Frequent Patterns

Texture Image Segmentation using FCM

Research and Improvement on K-means Algorithm Based on Large Data Set

International Journal of Advanced Research in Computer Science and Software Engineering

Efficient FM Algorithm for VLSI Circuit Partitioning

Analyzing Outlier Detection Techniques with Hybrid Method

An Enhanced K-Medoid Clustering Algorithm

Fuzzy clustering with volume prototypes and adaptive cluster merging

OPTIMIZATION. Optimization. Derivative-based optimization. Derivative-free optimization. Steepest descent (gradient) methods Newton s method

Remote Sensed Image Classification based on Spatial and Spectral Features using SVM

Datasets Size: Effect on Clustering Results

Analysis and Extensions of Popular Clustering Algorithms

Biometric Security System Using Palm print

Moving Object Segmentation Method Based on Motion Information Classification by X-means and Spatial Region Segmentation

AN IMPROVED K-MEANS CLUSTERING ALGORITHM FOR IMAGE SEGMENTATION

Unsupervised Learning

Matrix Inference in Fuzzy Decision Trees

Kapitel 4: Clustering

Performance Measure of Hard c-means,fuzzy c-means and Alternative c-means Algorithms

A fuzzy k-modes algorithm for clustering categorical data. Citation IEEE Transactions on Fuzzy Systems, 1999, v. 7 n. 4, p.

CS570: Introduction to Data Mining

Clustering part II 1

A Quantitative Approach for Textural Image Segmentation with Median Filter

Novel Hybrid k-d-apriori Algorithm for Web Usage Mining

Image Contrast Enhancement in Wavelet Domain

Cluster Tendency Assessment for Fuzzy Clustering of Incomplete Data

Dynamic Clustering of Data with Modified K-Means Algorithm

Clustering. Supervised vs. Unsupervised Learning

An ICA based Approach for Complex Color Scene Text Binarization

Unsupervised Learning. Presenter: Anil Sharma, PhD Scholar, IIIT-Delhi

INF4820 Algorithms for AI and NLP. Evaluating Classifiers Clustering

Data Mining: An experimental approach with WEKA on UCI Dataset

Data Mining: Concepts and Techniques. Chapter March 8, 2007 Data Mining: Concepts and Techniques 1

Randomized Response Technique in Data Mining

Unsupervised learning on Color Images

Supervised vs. Unsupervised Learning

Collaborative Rough Clustering

Fuzzy C-means Clustering with Temporal-based Membership Function

Understanding Clustering Supervising the unsupervised

An Objective Approach to Cluster Validation

K-means algorithm and its application for clustering companies listed in Zhejiang province

Image retrieval based on bag of images

Data Mining. Dr. Raed Ibraheem Hamed. University of Human Development, College of Science and Technology Department of Computer Science

NORMALIZATION INDEXING BASED ENHANCED GROUPING K-MEAN ALGORITHM

Automatic clustering based on an information-theoretic approach with application to spectral anomaly detection

Unsupervised Learning. Andrea G. B. Tettamanzi I3S Laboratory SPARKS Team

Enhanced Hemisphere Concept for Color Pixel Classification

CHAPTER 3 TUMOR DETECTION BASED ON NEURO-FUZZY TECHNIQUE

Road map. Basic concepts

On Clustering Validation Techniques

Enhancing K-means Clustering Algorithm with Improved Initial Center

Image Segmentation Using FELICM Clustering Method

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

S. Sreenivasan Research Scholar, School of Advanced Sciences, VIT University, Chennai Campus, Vandalur-Kelambakkam Road, Chennai, Tamil Nadu, India

A Remote Sensing Image Segmentation Method Based On Spectral and Texture Information

Case Study on Enhanced K-Means Algorithm for Bioinformatics Data Clustering

Methods for Intelligent Systems

Filtered Clustering Based on Local Outlier Factor in Data Mining

A Soft Clustering Algorithm Based on k-median

THE AREA UNDER THE ROC CURVE AS A CRITERION FOR CLUSTERING EVALUATION

Introduction to digital image classification

Semi-Supervised Clustering with Partial Background Information

Clustering Documents in Large Text Corpora

Community Overlapping Detection in Complex Networks

C-NBC: Neighborhood-Based Clustering with Constraints

Data Distortion for Privacy Protection in a Terrorist Analysis System

CHAPTER 7. PAPER 3: EFFICIENT HIERARCHICAL CLUSTERING OF LARGE DATA SETS USING P-TREES

CIMA Asia. Interactive Timetable Live Online

Use of line based symmetry for developing cluster validity indices

NOVEL APPROACH TO CONTENT-BASED VIDEO INDEXING AND RETRIEVAL BY USING A MEASURE OF STRUCTURAL SIMILARITY OF FRAMES. David Asatryan, Manuk Zakaryan

Normalization based K means Clustering Algorithm

K-modes Clustering Algorithm for Categorical Data

Discovery of Agricultural Patterns Using Parallel Hybrid Clustering Paradigm

Semi-supervised Data Representation via Affinity Graph Learning

Transcription:

A ovel Adaptive ing Algorithm Using Validity Index in Remote Sensing Data Dalmiya C.P 1 and Santhi 1 Research Scholar, Associate Professor 1, Department of Electronics and Communication Engineering, oorul Islam Centre for Higher Education, Kumaracoil, Thuckalay Kanyakumari dist., Tamil adu, 918, India. 1 Orcid Id: --19-977, Orcid Id: -1-8-X Abstract ing refers to a non-predefined factor that is subject to concern more on unsupervised classification methods. ing technique in remote sensing images without human interference helps iarious applications like change detection, land cover information, classification etc. In this paper a new adaptive clustering algorithm called Xie-Beni based Adaptive Fuzzy C Means ing (XB-AFCM) method is proposed and the validity indices are measured. The number of clusters is the major task in remote sensing applications that can be determined by cluster validity indices (CVIs). In recent years more number of CVIs is proposed in different papers and some common CVIs are evaluated in this paper. The proposed work is tested on experimental datasets. Finally the calculations revealed that the proposed algorithm is best suited for accurate clustering of remote sensing images for unsupervised change detection methods. Keywords: clustering, cluster validity index, xie-beni index, remote sensing, fuzzy C means. ITRODUCTIO ing aims at partitioning the data set into significant groups. ing deals with data mining process and for identifying interesting distributions and patterns [1]. ing is based on a non-predefined criterion on the other hand classification is based on predefined criteria. Based on the literature there are different types of clustering methods such as K means, Fuzzy C means, Constrained k means [], Hierarchical clustering [,1,] which produces accurate clustering results. As we know that clustering method is very sensitive to initializations, Fuzzy C Means ing [] is one of the best methods for unsupervised change detection methods. The main concern in clustering is to decide accurate number of clusters. To evaluate the clustering process and for finding optimal number of clusters, Validity Indices (CVIs) are calculated []. A better clustering algorithm and validation measures are important for grouping a data []. For high-resolution remote sensing data it is difficult to visualize the data, so a better clustering analysis [17] with optimal number of cluster is important. If the clustering parameters assigned an improper value, it may lead to wrong decisions [8]. Two criteria such as compactness and separation are proposed for a better clustering scheme [1,8]. Compactness indicates the variance of the cluster. The smaller the variance, the clustering technique is better. Separation indicates the distance between the cluster elements and if the distance is large the method is more isolated. In [] 1 commonly used CVIs discussed are applicable to remote sensing data sets. In [, 7, 1, 1, and 1] Xie-Beni index is proposed which is called compactness and separatioalidity function. It is used for evaluating Fuzzy C means clustering technique. Other validating indices used in Fuzzy clustering algorithms are I index [9], Fukuyama and Sugeno Index [7, 11], Partition Coefficient [7, 1, 1], Partition Entropy Coefficient 1], Fuzzy hyper volume [1], Partition Density [1, 1] etc. K means clustering is a partitional algorithm, which decomposes the data set into group of disjoint clusters. Partitioning Around Medoids (PAM), CLARA (ing Large Applications) and CLARAS (ing Large Applications based on Randomized Search) [1]. The mostly used CVIs for K means clustering are Root Mean Square Standard Deviation (RMSSTD), Semi- partial R-squared (SPR) and R-Squared (RS) [1]. These indices can also be applied to Hierarchical clustering techniques. Moreover other algorithms such as BIRCH [1], CURE [18] and ROCK [19] use hierarchical structure for partitioning the data points. PROPOSED CLUSTERIG ALGORITHM Xie-Beni based Adaptive Fuzzy C Means ing (XB- AFCM) Fuzzy C means clustering is developed by Dunn in 197 and the method is improved by Bezdek in 1981. This is based on minimization of the objective function J f that can be calculated by the eqn. (). From the analysis of validity indices it is noted that Xie-Beni index performs well in most of the images. Let S = {s 1, s, s... s n} be the set of data points and T = {t 1, t, t..., t c} be the set of centers. 97

Step 1: Randomly select c cluster centers. Step : Calculate the fuzzy membership U ij using eqn. (1) C j=1 s i t c )1/(m 1) ] 1 (1) U ij = [ ( s i t j Step : Compute the fuzzy centers 't j Step : Repeat step ) and ) until the minimum 'J f' value is achieved J f = i=1 C j=1 (U ij ) f s i t j () Step : Calculate the Xie-Beni validity index, which is given in eqn. (). Step : Stop the iteration when the validity index reaches the minimum value. CVIs for Fuzzy C Means ing Partition Coefficent (PCoeff) PCoeff = 1 U ij i=1 j=1 () PCoeff values are in the range [1/, 1]. The value should be closer to unity for a better clustering technique. Where -number of clusters U ij Membership function Partition Entropy Coefficient (PECoeff) Total Fuzzy Hyper Volume (TFHV) Total Fuzzy Hyper Volume is given by TFHV = i=1 V i (7) V i = 1/ i (8) Σ i = Σ f j=1 Uij(ai b j )(a i b j ) T M f j=1 U ij Where b j is the centre of cluster j. RESULTS AD DISCUSSIOS In order to evaluate the effectiveness of proposed method; experiments are carried out in different types of satellite data. There are many validity indices available for evaluating the clustering techniques, but it is evident from the statistical assessment that all the validity indices does not work accurately in different remote sensing data. The proposed method categorizes the data into accurate number of clusters. For evaluating the correctness of result, a manual evaluation of validity indices is performed for the next cluster from which the clustering operation is stopped. For the analysis of validity indices, a dataset consists of five types of satellite data are taken such as GeoEye-1 (Image 1), Quick bird (Image ), Worldview- (Image ), IKOOS (Image ) and Worldview- (Image ). The details of dataset and specifications are given in Table1. The clustering result of each image is given in Figure 1,,, and. (9) j=1 PECoeff = 1 U i=1 ij log a (U ij ) () where a is the base of the logarithm. Partition entropy coefficient values ranges from to log a. The value should be closer to for a good clustering method. Xie-Beni index (XBindex) Xie-Beni index [1] is best-suited validity index in Fuzzy C means clustering. It works well with compact clusters. i=1 j=1 a i b j min XBindex = μ ij j { b j b n () Fukuyama-Sugeno index (FSindex) For well-separated clusters FS index value will be minimum. It measures the compactness and distance between the clusters. j=1 f FSindex = i=1 U ij ( a i b j X b j b X ) () In FS index b is the meaector of A and X is a symmetric matrix. For a well-separated cluster the FS index should be as small as possible. Figure 1.a GeoEye-1 Figure.a Quick bird b. ing results b. ing results 97

Performance Evaluation In fact cluster validity indices are certain measures to evaluate the number of clusters in an image. In the proposed XB- AFCM algorithm, the best cluster is formed which includes a correct classification of areas in each image. In Figure -1, the cluster evaluation of various datasets for the proposed algorithm is illustrated. Figure.a Worldview b. ing results Table : CVI evaluation Figure.a IKOOS b. ing results Figure a.worldview- b. ing results STUDY AREA AD SPECIFICATIOS Table 1: Dataset and Specifications Dataset o. of clusters PCoeff PECoeff FSindex(-) X 1 - TFHV X 1 - Proposed 1.87.1...1.8.1..1.1.8.7.7..1.78.1.8..11.7..79.7.1.8.7.1 1.7.1.78.9..87.1.77..8.7.1.7.1.7..1.89.17.9.8.1.79..9.7.18.77.1.9.1.19.77..99..1.7..89.7.1.8...9.1.78.8..7.1.7..8.1.17.7.8.9..1.7....1 7.7...9.1.8.1. 1.8.18.77.9..8.1.7..7.81.1.7..7.81.1.7..81..1 Sensor Area Resolution Acquisition Date Geo Eye-1 Sendai, Japan. m February, 1 Quick bird Berlin, Germany - March 8, Worldview- Summer Olympic Complex, Australia IKOOS Carribean Sea, West Indies Worldview- Maracana Stadium, Brazil. m October, 9 1m December, 9 cm -.1.1.1.1.1.1.11.11 Figure : evaluation of EeoEye-1 Image 97

..1.1. Figure 7: evaluation of Quick bird Image As we can see the results, in GeoEye-1 image, clustering stops at th cluster in the proposed method A manual calculation of th cluster is carried out to clarify the results. From the manual calculation it can be seen that value of XB-AFCM s low at th cluster and increasing in the next cluster. In Quick bird image, proposed method value of th cluster is.1 and for th cluster the value is.1. Thus clustering stops at th cluster. In Worldview- image also it shows the XB-AFCM is low at th cluster than th cluster, so clustering stops at th cluster. In IKOOS and Worldview- also it can be viewed as the clustering stops at the point where XB-AFCM value becomes minimum. In terms of results and discussions it is revealed that the proposed method shows accurate number of clusters...1.1. Figure 8: evaluation of Worldview- Image COCLUSIO Iarious remote sensing applications clustering evaluation plays a vital task for measuring the compactness of the method. In this work various CVIs are evaluated for different remote sensing images. For the study, five satellite images are taken and clustering performs in each image and CVIs are noted. From the validation it is vividly revealed that clustering stops at the minimum accurate value of XB-AFCM method. From the evaluation it is conclude that the proposed Xie-Beni based Adaptive Fuzzy C Means ing (XB-AFCM) performs well and produces accurate number of clusters...1.1. 7 Figure 9: evaluation of IKOOS image..1.1. Figure 1: evaluation of Worldview- Image ACKOWLEDGEMET We express our sincere thanks to DST, ew Delhi, India for the financial support under ISPIRE fellowship scheme for the above research work. REFERECES [1] M.Halkidi, Y.Batistakis and M.Vazirgiannis, On clustering validation techniques, Journal of Intell. Info. Sys.., pp. 17-1, 1. [] A.M Lal and S.M Anouncia, Semi-supervised change detection approach combining sparse fusion and constrained k means for multi-temporal remote sensing images, The Egyp. J. Remote Sens., vol.18, no., pp. 79-88, Dec 1. [] M.Halkidi, Y.Batistakis and M.Vazirgiannis, validity methods: part 1, Sigmod Record., vol. 1, no., pp.9-, Jun. [] H.Li, S.Zhang, X.Ding, C.Zhang and P.Dale, Performance evaluation of cluster validity indices (CVIs) on multi/hyper spectral remote sensing datasets, MDPI J. Remote Sens., vol.8, no., Mar 1. 97

[] U.Fayyad, G.P-Shapiro and P.Smyth, From data mining to knowledge discovery in databases, AI Mag., vol.17, no., pp.7-, 199. [] A. Ghosh,.S Mishra and S.Ghosh, Fuzzy clustering algorithms for unsupervised change detection in remote sensing images, Info. Sci., vol.181, no., pp. 99-71, Feb.11. [7] H. Cui, M.Xie, Y. Cai, X.Huang and Y.Liu, validity index for adaptive clustering algorithms, IET Comm., vol.8, no., pp. -, Sep.1. [8] W.Wang and Y. Zhang, On fuzzy cluster validity indices, Fuzzy Sets and Sys., vol.18, no.19, pp. 9-117, Oct.7. [9] U. Maulik and I.Saha, Automatic fuzzy clustering using modified differential evolution for image classification, IEEE Trans. Geosci. Remote Sens., vol. 8, no. 9, pp. -19, Sep.1. [1] X.L Xie and G.Beni, A validity measure for fuzzy clustering, IEEE Trans. Patt. Anal. Machine Intell., vol. 1, no. 8, pp. 81-87, Aug.1991. [11].R Pal and J.C Bezdek, On cluster validity for the fuzzy c-means model, IEEE Trans. Fuzzy Sys., vol., no., pp. 7-79, Aug. 199. [1] J.C Bezdek, validity with fuzzy sets, J. Cybernetics., vol., no., pp. 8-7, 197. [1] C-H Wu, C-S Ouyang, L-W Chen and L-W Lu, A new fuzzy clustering validity index with a median factor for centroid-based clustering, IEEE Trans. Fuzzy Sys.., vol., no., pp. 71-718, Jun. 1. [1] Gath and A.B Geva, Unsupervised optimal fuzzy clustering, IEEE Trans. Patt. Anal. Machine Intell, vol.11, no.7, pp. 77-781, Jul.1989. [1] S.Saha and S.Bandyopadhyaya, Application of a new symmetry-based cluster validity index for satellite image segmentation, IEEE Geosci. Remote Sens. Letters., vol., no., pp. 1-17, Apr.8. [1] T.Zhang, R.Ramakrishnan and M.Livny, BIRCH: an efficient data clustering method for very large databases, ACM SIGMOD., vol., no., pp. 1-11, June 199. [17] S. Sharma, Applied multivariate techniques, John Wiley and Sons. 199. [18] S. Guha, R.Rastogi and K.Shim, CURE: An efficient clustering algorithm for large databases, In Proc. ACM SIGMOD Conf., pp. 7-8, 1998. [19] S. Guha, R.Rastogi and K.Shim, ROCK: A robust clustering algorithm for categorical attributes, In Proc. IEEE Conf. Data Engg., pp. 1-1, March 1999. [] V.S Sidorova, Automatic hierarchical clustering algorithm for remote sensing data, Patt. Recogn. Image Analy., vol.1, no., pp. 8-1, June 11. [1] V.S Sidorova, Detecting clusters of specified separability for multispectral data oarious hierarchical levels, Patt. Recogn. Image Analy., vol., no.1, pp. 11-1, March 1. [] V.S Sidorova, Hierarchical cluster algorithm for remote sensing data of earth, Patt. Recogn. Image Analy., vol., no., pp. 7-79, June 1. 977