Mass Classification Method in Mammogram Using Fuzzy K-Nearest Neighbour Equality

Similar documents
Medical Image Feature, Extraction, Selection And Classification

Computer-aided detection of clustered microcalcifications in digital mammograms.

Chapter 7 UNSUPERVISED LEARNING TECHNIQUES FOR MAMMOGRAM CLASSIFICATION

ISSN: (Online) Volume 3, Issue 6, June 2015 International Journal of Advance Research in Computer Science and Management Studies

Approaches For Automated Detection And Classification Of Masses In Mammograms

Mammogram Segmentation using Region based Method with Split and Merge Technique

Computer Aided Diagnosis Based on Medical Image Processing and Artificial Intelligence Methods

S.KANIMOZHI SUGUNA 1, DR.S.UMA MAHESWARI 2

CHAPTER 6 DETECTION OF MASS USING NOVEL SEGMENTATION, GLCM AND NEURAL NETWORKS

Classification of Microcalcification in Digital Mammogram using Stochastic Neighbor Embedding and KNN Classifier

A. Overview of the CAD System In this paper, the classification is performed in four steps. Initially the mammogram image is smoothened using median

Classification of Microcalcification Clusters via PSO-KNN Heuristic Parameter Selection and GLCM Features

algorithms ISSN

Diagnosis of Breast Cancer using Wavelet Entropy Features

SVM-based CBIR of Breast Masses on Mammograms

An approach for classification of malignant and benign microcalcification clusters

Available Online through

Analysis of classifier to improve Medical diagnosis for Breast Cancer Detection using Data Mining Techniques A.subasini 1

An Enhanced Mammogram Image Classification Using Fuzzy Association Rule Mining

MULTICLASS CLASSIFICATION OF MAMMOGRAM IMAGES WITH GLCM FEATURES

Evolutionary Algorithms in the Classification of Mammograms

FEATURE EXTRACTION TECHNIQUES USING SUPPORT VECTOR MACHINES IN DISEASE PREDICTION

Tumor Detection and classification of Medical MRI UsingAdvance ROIPropANN Algorithm

Global Journal of Engineering Science and Research Management

International Journal of Scientific Research & Engineering Trends Volume 4, Issue 6, Nov-Dec-2018, ISSN (Online): X

Design of a high-sensitivity classifier based on a genetic algorithm: application to computer-aided diagnosis

Fuzzy C-means Clustering with Temporal-based Membership Function

k-nn Disgnosing Breast Cancer

Breast Cancer Detection and Classification Using Ultrasound and Ultrasound Elastography Images

Classification Performance related to Intrinsic Dimensionality in Mammographic Image Analysis

Comparative Study of Clustering Algorithms using R

Content Based Medical Image Retrieval Using Fuzzy C- Means Clustering With RF

International Research Journal of Engineering and Technology (IRJET) e-issn:

Department of Computer Engineering, Punjabi University, Patiala, Punjab, India 2

Salman Ahmed.G* et al. /International Journal of Pharmacy & Technology

Automated Colon Cancer Detection Using Kernel Sparse Representation Based Classifier

Biomedical Research 2016; Special Issue: S123-S127 ISSN X

A fast breast nonlinear elastography reconstruction technique using the Veronda-Westman model

Classification of Mammographic Images Using Artificial Neural Networks

Modern Medical Image Analysis 8DC00 Exam

Available online Journal of Scientific and Engineering Research, 2019, 6(1): Research Article

CHAPTER 4 SEGMENTATION

Parameter Optimization for Mammogram Image Classification with Support Vector Machine

PERFORMANCE EVALUATION OF A-TROUS WAVELET-BASED FILTERS FOR ENHANCING DIGITAL MAMMOGRAPHY IMAGES

A COMPARATIVE STUDY OF DIMENSION REDUCTION METHODS COMBINED WITH WAVELET TRANSFORM APPLIED TO THE CLASSIFICATION OF MAMMOGRAPHIC IMAGES

A New Approach for Mammogram Image Classification Using Fractal Properties

A CAD SYSTEM FOR THE DETECTION OF MALIGNANT TUMORS IN DIGITIZED MAMMOGRAM FILMS

An Improved Modified Tracking Algorithm Hybrid with Fuzzy C Means Clustering In Digital Mammograms

1 School of Biological Science and Medical Engineering, Beihang University, Beijing, China

Automated Lesion Detection Methods for 2D and 3D Chest X-Ray Images

AUTOMATED EVALUATION OF BREAST CANCER DETECTION USING SVM CLASSIFIER

CS6375: Machine Learning Gautam Kunapuli. Mid-Term Review

Mutual Information with PSO for Feature Selection

An Approach for Breast Cancer Mass Detection in Mammograms

A STUDY OF SOME DATA MINING CLASSIFICATION TECHNIQUES

Fuzzy detection of microcalcifications in mammograms

Segmentation of the pectoral muscle edge on mammograms by tunable parametric edge detection

LITERATURE SURVEY ON DETECTION OF LUMPS IN BRAIN

FEATURE EXTRACTION FROM MAMMOGRAPHIC MASS SHAPES AND DEVELOPMENT OF A MAMMOGRAM DATABASE

A Survey on Image Segmentation Using Clustering Techniques

Early Stage Oral Cavity Cancer Detection: Anisotropic Pre-Processing and Fuzzy C-Means Segmentation

Automatic Machinery Fault Detection and Diagnosis Using Fuzzy Logic

Approaches for automated detection and classification of masses in mammograms

The MAGIC-5 CAD for nodule detection in low dose and thin slice lung CT. Piergiorgio Cerello - INFN

Pattern recognition (4)

Classification Lecture Notes cse352. Neural Networks. Professor Anita Wasilewska

Applying Supervised Learning

MIT 801. Machine Learning I. [Presented by Anna Bosman] 16 February 2018

A Review: Content Base Image Mining Technique for Image Retrieval Using Hybrid Clustering

Improving the detection of excessive activation of ciliaris muscle by clustering thermal images

Parameter Estimation of Markov Random Field Model of Image Textures

Breast Density Classification Using Local Quinary Patterns with Various Neighbourhood Topologies

A Novel Low-Complexity Framework in Ultra-Wideband Imaging for Breast Cancer Detection

Object Extraction Using Image Segmentation and Adaptive Constraint Propagation

Clustering & Classification (chapter 15)

Semi-Supervised Clustering with Partial Background Information

Investigations on Impact of Feature Normalization Techniques on Classifier s Performance in Breast Tumor Classification

k-nn CLASSIFIER FOR SKIN CANCER CLASSIFICATION

k-nearest Neighbor (knn) Sept Youn-Hee Han

A comparison of different Gabor feature extraction approaches for mass classification in mammography

Tumor Detection in Breast Ultrasound images

Image Segmentation. Shengnan Wang

A SURVEY OF IMAGE MINING TECHNIQUES AND APPLICATIONS

Image Mining: frameworks and techniques

Hybrid Approach for MRI Human Head Scans Classification using HTT based SFTA Texture Feature Extraction Technique

Differentiation of Malignant and Benign Breast Lesions Using Machine Learning Algorithms

A CRITIQUE ON IMAGE SEGMENTATION USING K-MEANS CLUSTERING ALGORITHM

A Classifier with the Function-based Decision Tree

A Comparison of wavelet and curvelet for lung cancer diagnosis with a new Cluster K-Nearest Neighbor classifier

MELANOMA DETECTION USING HYBRID CLASSIFIER

Content Based Image Retrieval system with a combination of Rough Set and Support Vector Machine

Bipartite Graph Partitioning and Content-based Image Clustering

Parameter optimization of a computer-aided diagnosis scheme for the segmentation of microcalcification clusters in mammograms

Analysis of texture patterns in medical images with an application to breast imaging


Effect of Principle Component Analysis and Support Vector Machine in Software Fault Prediction

Kuske Martyna, Rubio, Rubio Rafael, Nicolas Jacques, Marco Santiago, Romain Anne-Claude

Weighting and selection of features.

Thermographic Image Analysis Method in Detection of Canine Bone Cancer (Osteosarcoma)

CAD SYSTEM FOR AUTOMATIC DETECTION OF BRAIN TUMOR THROUGH MRI BRAIN TUMOR DETECTION USING HPACO CHAPTER V BRAIN TUMOR DETECTION USING HPACO

Transcription:

Mass Classification Method in Mammogram Using Fuzzy K-Nearest Neighbour Equality Abstract: Mass classification of objects is an important area of research and application in a variety of fields. In this paper, we present an efficient computer-aided mass classification method in digitized mammograms using Fuzzy K-Nearest Neighbour Equality (FK-NNE), which performs benign or malignant classification on region of interest that contains mass. One of the major mammographic characteristics for mass classification is texture. FK- NNE exploits this important factor to classify the mass into benign or malignant. The statistical textural features used in characterizing the masses are Haralick and Run-length features. The main aim of the method is to increase the effectiveness and efficiency of the classification process in an objective manner to reduce the numbers of falsepositive of malignancies. In this paper proposes a novel Fuzzy K-Nearest Neighbour Equality algorithm for classifying the marked regions into benign and malignant and 94.46% sensitivity, 96.81% specificity and 96.52% accuracy is achieved that is very much promising compare to the radiologist s accuracy. Keywords: Feature Classification; K-N, FK-NN; K-NNE; FK-NNE I. Introduction Breast cancer is the most common of all cancers and is the leading cause of cancer deaths in women worldwide, accounting for more than 1.6% of deaths and case fatality rates are highest in low-resource countries. In India the average age of the high risk group in India is 43-46 years unlike in the west where women aged 53-57 years are more prone to breast cancer. As there is no effective method for its prevention, the diagnosis of breast cancer in its early stage of development has become very crucial for the prevention of cancer. Computer-aided diagnosis (CAD) systems play an important role in earlier diagnosis of breast cancer. Classifiers play an important role in the implementation of intelligent system to identify the breast cancer from mammogram. The features are given as input to the classifiers to classify microcalcifications into benign and malignant. The Back Propagation Neural network is tested by the Jack Knife method [1]. Hassanien and Ali presented an enhanced rough set approach for attribute reduction and generating classification rules from digital mammogram. The classifier model was built and quadratic distances similarly; function is used for matching process. This Paper is organized as follows. Section II presents related work. Section III describes pre-processing work using new filtering techniques, new segmentation techniques and features extraction techniques. The flow diagram represents the step of the processing. After features extraction and selection, how the feature to be classify the technique that is given in Section IV. Section V presents experimental results. Finally, Section VI presents conclusion. II. Related Work It allows partial membership of an object to different classes, and also takes into account the relative importance (closeness) of each neighbour with respect to the test instance. However, as Sarkar correctly argued in [2], the Fuzzy Neural Network (FNN) algorithm has problems dealing adequately with insufficient knowledge. Cornelis, C., et al. [3] introduced vague quantifiers like some or most into the definition of upper and lower approximation. Coenen, F., [4] proposed CPAR (Classification based on Predictive Association Rules) algorithm is an extension of PRM (Predictive Rule Mining) which in turn is an extension of FOIL (First Order Inductive Learner) algorithm. Mahmoud, R., et al. [5] proposed approach is performed in two stages. In the first stage, the system separates segments of the image that may correspond to tumors using a combination of morphological operations and a region growing technique. In the second stage, segmented regions are classified as normal, benign, or malignant tissues based on different measurements. Cheng, H.D., et al. [6] discussed microcalcifications and masses are the two most important indicators of malignancy, and their automated detection is very valuable for early breast cancer diagnosis. Noel pérez et al. [7] described a novel CAD tool that combines digital image processing and artificial neural networks among others techniques to diagnose mammography Pathological Lesions (PL) (as benign or malignant tissues) on GRID environments. Erkang Cheng et al. [8] proposed using normalized Histogram Intersection (HI) as a similarity measure with the K-nearest neighbour (K-NN) classifier. Furthermore, by taking advantage of the fact that HI forms a Mercer kernel, HI is combined with Support Vector Machines (SVM), which further improves the

classification performance. A hybrid method of data mining technique is used to predict the texture features which play a vital role in classification. III. Pre-Processing Work Three Hundred and Thirty Two of mammograms are obtained from the MIAS database (ftp://peipa.essex.ac.uk) to analyze the proposed methods. In this chapter, New Filter-I and New Filter-II have been proposed and applied to enhance the mammogram images. The mammogram images are normalized using Max-Min method. The hybridization of different methods based Fuzzy approaches have been proposed for pectoral muscle region. The next work is breast border and edge detection using hybridization of Fuzzy, Canny and Gradient Edge algorithms. Using the border points as references, the mammogram images are aligned and subtracted to extract the suspicious region and background. The two methods of breast region segmentation used to mammogram images to extract the suspicious regions. In the case of pairs of images, the Fuzzy entropy and weighted Fuzzy entropy based on multi-thresholding is used to extract the suspicious region from the digital mammograms. In the last work, microcalcification (Region of Interest (ROI)) process is applied with Modified Ant Colony Optimization and Modified Water Shed Transform algorithms. The Haralick features from the textural description methods such as Surrounding Region Dependency Matrix, Spatial Grey Level Dependency Matrix, and Grey Level Difference Matrix and run length features from the texture description method Grey Level Run-Length Matrix are extracted from segmented mammogram images for further analysis. The reduced features are selected using two kinds of tolerances rough set based quick reduct, relative reduct and unsupervised Swarm Particle Optimization relative reduct algorithms from the features extracted. IV. Proposed Work However, in many pattern recognition problems, the classification of an input pattern is based on data where the respective sample sizes of each class are small and possibly not representative of the actual probability distributions, even if they are known. In these cases, many techniques rely on some notion of similarity or distance in feature space, for instance, clustering and discriminant analysis. One of the problems encountered in using the K-NN classifier is that normally each of the sample vectors is considered equally important in the assignment of the class label to the input vector. This frequently causes difficulty in those places where the sample sets overlap. The typical vectors are given as much weight as those that are truly representative of the clusters. Another difficulty is that once an input vector is assigned to a class, there is no indication of its strength of membership in that class FK-NN uses concepts from fuzzy logic to assign degree of membership to different classes while considering the distance of its K-Nearest Neighbours. Points closer to the query point contributes larger value to be assigned to the membership function of their corresponding class in comparison to far way neighbours. Class with the highest membership function value is taken as the winner. Since that time researchers have found numerous ways to utilize this theory to generalize existing techniques and to develop new algorithms in pattern recognition and decision analysis. A. Fuzzy K-Nearest Neighbour Equality (FK-NNE) Algorithm This section proposes a novel Fuzzy K-Nearest Neighbour Equality algorithm. It assigns class membership to a sample vector rather than assigning the vector to a particular class. The advantage is that no arbitrary assignments are made by the algorithm. The basis of the algorithm is to assign membership as a function of the vector s distance from its K-Nearest Neighbours and those neighbours memberships in the possible classes. The Fuzzy algorithm is similar to the crisp version in the sense that it must also search the labeled sample set for the K-Nearest Neighbours. Beyond obtaining these K samples, the procedures differ considerably. The mean distance was calculated from K-points to testing data. The minimized mean distance based the output class values are stored.

Algorithm: Fuzzy K-Nearest Neighbour Equality (FK-NNE) Input: x Vector to be classified; K- Samples; (x i, D i ), i = 1 n Output: Class of Vector x Step 1: BEGIN Step 2: Input x, of unknown classification Step 3: Set K, 1 K n Step 4: Initialize i =1 Step 5: DO UNTIL (K-nearest neighbours to x found) Step 6: Compute distance from x to x i Step 7: IF (i K) THEN Step 8: Include x i in the set of K-nearest neighbours Step 9: ELSE IF (x i closer to x than any previous nearest neighbour) THEN Step 10: Delete the farthest of the K-nearest neighbours Step 11: Include x i in the set of the K-nearest neighbours Step 12: END IF Step 13: END DO UNTIL Step 14: Initialize i = 1 Step 15: DO UNTIL (x assigned membership in all classes) K 2/( m1) ij 1 x x j j1 Step 16: Compute i ( x) K 2/( m1) 1 x x j j1 Step 17: Increment i Step 18: END DO UNTIL Step 19: FOR each class value D i DO //Fuzzy Equality Calculation Step 20: Select the fuzzy K-Nearest Neighbours to x that belongs to D i from the sample file Step 21: Compute the mean distance from these k points to d, d i D Step 22: Output the class m * with the minimized mean distance d i, among all the classes Step 23: END FOR Step 24: END Figure 1: The FK-NNE Algorithm Compared with K-NNE, FK-NNE can achieve acceptable accuracy rate. An improvement over the K-NNE classifier is the Fuzzy K-NN Equality classifier, which uses concepts from fuzzy logic to assign degree of membership to different classes while considering the mean distance of its K-Nearest Neighbours. Figure 1 shows that the Fuzzy K-Nearest Neighbourhood Equality algorithm. V. Experimental Results Table I Average classification results of classification algorithms D Methods Sensitivity Specificity Accuracy K-NN 0.8986 0.8962 125 K-NNE 311 436 534 FK-NN 084 222 342 FK-NNE 446 681 652 The results tabulated in Table I, clearly show that the different classification models discriminate malignant and benign with different accuracy. And also it also shows tht the classification accuracy achieved using FK-NNE

Value(%) Value (%) Value (%) Value(%) is much better than others. It is observed that the maximum and minimum classification accuracies are 98% and 91% with FK-NNE and K-NN classifier respectively shown in figures 2 and 3. 6 4 2 0.86 Sensitivity 8 6 4 2 0.86 Specificity 8 6 4 2 Accuracy Figure 2: Performance of Sensitivity, Specificity and Accuracy for K-NN, F-KNN, K-NNE and FK-NNE Classifiers 1 (a) (b) (c) 5 0.85 Sensitivity Specificity Accuracy Relative Measures K-NN K-NNE FK-NN FK-NNE Figure 3: Relative Performance measures for K-NN, FK-NN, K-NNE and FK-NNE and Classifiers The selected features discriminate between malignant masses and benign masses on mammogram images with 91% accuracy, 90% sensitivity and 90% specificity levels that are relatively poorer when compared to others. FK-NN yielded an accuracy of 93% for distinguishing malignant and benign masses on mammogram images. It is 2% higher than K-NN. The K-NNE classifier achieved an accuracy of 95% where 93% sensitivity and 94% specificity. It is 1% higher than FK-NN. The FK-NNE classifier achieved an accuracy of 97% where 94% sensitivity and 97% specificity. It is 2% higher than K-NNE. ROC curve is generated for the results of four classifiers and exposed in the figure 4. Figure 4: ROC Curves for K-NN, FK-NN, K-NNE and FK-NNE Classifiers Area under the ROC curve is an important criterion for classifier. The most popular summary measure of accuracy is the area under the ROC curve, often denoted as area under curve. The area under the ROC curve represents the probability of a random positive sample to receive a better score than a random negative sample. The value of area under curve ranges from 0.5 to 1.0 that indicates chance to perfect discrimination. The

AUC Value diagnostic test is more accurate when area is larger. The computed value of area under curve fo each classifier is recorded in Table II. Table II: Performance of Classification Algorithms using Area under Curve Algorithms A Z Value K-NN 125 K-NNE 634 FK-NN 452 FK-NNE 734 A Z Value 1 5 0.85 K-NN K-NNE FK-NN FK-NNE Az Value Figure 5: Areas (A z ) under ROC curves for the classifier of K-NN, FK-NN, K-NNE and FK-NNE From the Table II, the best area under curve value is 734 for FK-NNE followed by K-NNE, FK-NN and K-NN are 634, 452 and 125 respectively. Figure 5 represents the areas under ROC curves for the proposed classifiers. VI. Conclusion The early detection of breast cancer from the mammogram images is one of the most challenging tasks. Accuracy is also most important in the field of medical diagnosis using images. The performance of four classifiers namely FK-NNE, K-NNE, FK-NN and K-NN have been constructed, and these are investigated for the task of breast cancer classification using mammogram images. The proposed FK-NNE classification accuracy is higher when compared to existing K-NNE, FK-NN and K-NN classifiers. The experimental result reveals that the FK-NNE classifier achieves better classification accuracy than others. VII. References [1] Hassanien, A.E., and Ali, J.M., Enhanced Rough Sets Rule Reduction Algorithm for Classification Digital Mammography, Intelligent System jou., UK, Freund &Pettman, Vol. 13, No. 2, pp. 151-171, 2004. [2] Sarkar, M., Fuzzy-Rough nearest neighbors algorithm, Fuzzy Sets and Systems, Vol.158, pp. 2123-2152, 2007. [3] Cornelis, C., De Cock, M., Radzikowska, A.M., Vaguely Quantified Rough Sets, Proceedings of 11th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing (RSFDGrC2007), Lecture Notes in Artificial Intelligence 4482, pp. 87-94, 2007. [4] Coenen, F., The LUCS-KDD implementations of CPAR (Classification based on Predictive Association Rules), School of Electrical and Electronics Engineering. Nanyang Technological University, National Cancer Center, Singapore, 2004. [5] Mahmoud, R., et.al, Automated Detection of Tumors in Mammograms Using Two Segments for Classification, PCM 2005, Part I, LNCS 3767, pp. 910 921, 2005. [6] Cheng, H.D., et al., Approaches for automated detection and classification of masses in mammograms, Pattern Recognition 39 Elsevier, pp. 646-668, 2006. [7] Noel pérez et al., A CAD tool for mammography image analysis evolution on GRID environment, IJCSI International Journal of Computer Science Issues, Vol. 10, Issue 1, No. 2, pp. 255-259, January, 2007. [8] Erkang Cheng et al, Mammographic image classification using histogram intersection, Biomedical Imaging: From Nano. to Macro, IEEE, pp.197 200, 2010. VIII. Acknowledgments The First author would like to thank the University Grant Commission, India (F. No. 41-1361/2012(SR)) for his research.