Classification of Mammographic Images Using Artificial Neural Networks

Similar documents
Computer-aided detection of clustered microcalcifications in digital mammograms.

An Efficient Neural Network Based System for Diagnosis of Breast Cancer

CHAPTER 6 DETECTION OF MASS USING NOVEL SEGMENTATION, GLCM AND NEURAL NETWORKS

Medical Image Feature, Extraction, Selection And Classification

Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers

Week 3: Perceptron and Multi-layer Perceptron

Mass Classification Method in Mammogram Using Fuzzy K-Nearest Neighbour Equality

An Algorithm For Training Multilayer Perceptron (MLP) For Image Reconstruction Using Neural Network Without Overfitting.

Assignment # 5. Farrukh Jabeen Due Date: November 2, Neural Networks: Backpropation

COMPUTATIONAL INTELLIGENCE

Chapter 7 UNSUPERVISED LEARNING TECHNIQUES FOR MAMMOGRAM CLASSIFICATION

S.KANIMOZHI SUGUNA 1, DR.S.UMA MAHESWARI 2

Mammogram Segmentation using Region based Method with Split and Merge Technique

IN recent years, neural networks have attracted considerable attention

Neural Networks. CE-725: Statistical Pattern Recognition Sharif University of Technology Spring Soleymani

CS6220: DATA MINING TECHNIQUES

EE 589 INTRODUCTION TO ARTIFICIAL NETWORK REPORT OF THE TERM PROJECT REAL TIME ODOR RECOGNATION SYSTEM FATMA ÖZYURT SANCAR

Supervised Learning in Neural Networks (Part 2)

Automated Lesion Detection Methods for 2D and 3D Chest X-Ray Images

Classification of Microcalcification in Digital Mammogram using Stochastic Neighbor Embedding and KNN Classifier

Computer-Aided Diagnosis in Abdominal and Cardiac Radiology Using Neural Networks

Available Online through

A Massively Parallel Virtual Machine for. SIMD Architectures

4.12 Generalization. In back-propagation learning, as many training examples as possible are typically used.

Ensemble methods in machine learning. Example. Neural networks. Neural networks

CMPT 882 Week 3 Summary

Visual object classification by sparse convolutional neural networks

Artificial Neural Networks MLP, RBF & GMDH

Neural Network based textural labeling of images in multimedia applications

Diagnosis of Breast Cancer using Wavelet Entropy Features

Supervised Learning with Neural Networks. We now look at how an agent might learn to solve a general problem by seeing examples.

Neural Network Approach for Automatic Landuse Classification of Satellite Images: One-Against-Rest and Multi-Class Classifiers

A Novel Technique for Optimizing the Hidden Layer Architecture in Artificial Neural Networks N. M. Wagarachchi 1, A. S.

Classification of Microcalcification Clusters via PSO-KNN Heuristic Parameter Selection and GLCM Features

Approaches For Automated Detection And Classification Of Masses In Mammograms

Exercise: Training Simple MLP by Backpropagation. Using Netlab.

Research Article International Journals of Advanced Research in Computer Science and Software Engineering ISSN: X (Volume-7, Issue-6)

Image Compression: An Artificial Neural Network Approach

THE NEURAL NETWORKS: APPLICATION AND OPTIMIZATION APPLICATION OF LEVENBERG-MARQUARDT ALGORITHM FOR TIFINAGH CHARACTER RECOGNITION

Multicriteria Image Thresholding Based on Multiobjective Particle Swarm Optimization

A. Overview of the CAD System In this paper, the classification is performed in four steps. Initially the mammogram image is smoothened using median

Optimizing Number of Hidden Nodes for Artificial Neural Network using Competitive Learning Approach

Computer-Aided Diagnosis for Lung Diseases based on Artificial Intelligence: A Review to Comparison of Two- Ways: BP Training and PSO Optimization

Some Algebraic (n, n)-secret Image Sharing Schemes

CS 4510/9010 Applied Machine Learning. Neural Nets. Paula Matuszek Fall copyright Paula Matuszek 2016

ISSN: (Online) Volume 3, Issue 6, June 2015 International Journal of Advance Research in Computer Science and Management Studies

An Enhanced Mammogram Image Classification Using Fuzzy Association Rule Mining

Machine Learning Classifiers and Boosting

A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation

Neural Networks (Overview) Prof. Richard Zanibbi

A Texture Extraction Technique for. Cloth Pattern Identification

Machine Learning Approaches for Breast Cancer Diagnosis and their Comparison

Notes on Multilayer, Feedforward Neural Networks

Research Article Forecasting SPEI and SPI Drought Indices Using the Integrated Artificial Neural Networks

CNN Template Design Using Back Propagation Algorithm

RIMT IET, Mandi Gobindgarh Abstract - In this paper, analysis the speed of sending message in Healthcare standard 7 with the use of back

CHAPTER VI BACK PROPAGATION ALGORITHM

Linear Separability. Linear Separability. Capabilities of Threshold Neurons. Capabilities of Threshold Neurons. Capabilities of Threshold Neurons

Neuron Selectivity as a Biologically Plausible Alternative to Backpropagation

Robustness of Selective Desensitization Perceptron Against Irrelevant and Partially Relevant Features in Pattern Classification

Image Segmentation Based on. Modified Tsallis Entropy

Hybrid Approach for MRI Human Head Scans Classification using HTT based SFTA Texture Feature Extraction Technique

Tumor Detection and classification of Medical MRI UsingAdvance ROIPropANN Algorithm

Dr. Qadri Hamarsheh Supervised Learning in Neural Networks (Part 1) learning algorithm Δwkj wkj Theoretically practically

Compression, Noise Removal and Comparison in Digital Mammography using LabVIEW

Neural Networks Laboratory EE 329 A

Procedia Computer Science

Supervised Learning (contd) Linear Separation. Mausam (based on slides by UW-AI faculty)

Review on Methods of Selecting Number of Hidden Nodes in Artificial Neural Network

A Comparison of wavelet and curvelet for lung cancer diagnosis with a new Cluster K-Nearest Neighbor classifier

Machine Learning in Biology

Edge Detection for Dental X-ray Image Segmentation using Neural Network approach

Deep Learning With Noise

Comparison Machine Learning Algorithms in Abnormal Mammograms Classification

Harish Kumar N et al,int.j.computer Techology & Applications,Vol 3 (1),

Comparative Study of Instance Based Learning and Back Propagation for Classification Problems

A STUDY OF SOME DATA MINING CLASSIFICATION TECHNIQUES

MULTILAYER PERCEPTRON WITH ADAPTIVE ACTIVATION FUNCTIONS CHINMAY RANE. Presented to the Faculty of Graduate School of

Multilayer Feed-forward networks

International Research Journal of Engineering and Technology (IRJET) e-issn:

A Systematic Overview of Data Mining Algorithms

The Number of Fuzzy Subgroups of Cuboid Group

Research Article Image Segmentation Using Gray-Scale Morphology and Marker-Controlled Watershed Transformation

APPLICATION OF A MULTI- LAYER PERCEPTRON FOR MASS VALUATION OF REAL ESTATES

Performance Analysis of Data Mining Classification Techniques

Query Learning Based on Boundary Search and Gradient Computation of Trained Multilayer Perceptrons*

WorkstationOne. - a diagnostic breast imaging workstation. User Training Presentation. doc #24 (v2.0) Let MammoOne Assist You in Digital Mammography

Pattern Classification Algorithms for Face Recognition

ID~tzbutice UnRmlwd DEPARTMENT OF THE AIR FORCE AIR UNIVERSITY AIR FORCE INSTITUTE OF TECHNOLOGY. Wright-Patterson Air Force Base, Ohio DI UL NFC

Enhanced Breast Cancer Classification with Automatic Thresholding Using Support Vector Machine and Harris Corner Detection

Foveal Algorithm for the Detection of Microcalcification Clusters: a FROC Analysis

The Generalized Stability Indicator of. Fragment of the Network. II Critical Performance Event

Rough Connected Topologized. Approximation Spaces

SVM-based CBIR of Breast Masses on Mammograms

4. Feedforward neural networks. 4.1 Feedforward neural network structure

Neural Networks. By Laurence Squires

COMBINING NEURAL NETWORKS FOR SKIN DETECTION

STEREO-DISPARITY ESTIMATION USING A SUPERVISED NEURAL NETWORK

A modified and fast Perceptron learning rule and its use for Tag Recommendations in Social Bookmarking Systems

Neural Network Neurons

Transcription:

Applied Mathematical Sciences, Vol. 7, 2013, no. 89, 4415-4423 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ams.2013.35293 Classification of Mammographic Images Using Artificial Neural Networks Ayoub Arafi, Youssef Safi, Rkia Fajr and Abdelaziz Bouroumi Information Processing Laboratory, Ben M sik Faculty of Sciences Hassan II Mohammedia-Casablanca University, BP.7955 Sidi Othmane Casablanca, 20702, Morocco {a.ayoub.arafi, ysf.safi, fajr.rkia, a.bouroumi}@gmail.com Copyright c 2013 Ayoub Arafi et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Abstract We present an application of artificial neural networks to mammographic images, aimed at improving early detection of sensitivity to breast cancer. The proposed application consists of two main steps: a pretreatment step whose role is to extract the characteristics of the available mammographic images using the standard library OpenCV [1]; and a classification step based on an artificial neural network that uses these characteristics as input vectors for its training algorithm. The output of the training phase of this model is a categorization of the pretreated images into two main groups: normal and abnormal. After the training phase, the network can be used in order to label new and unseen images as normal or abnormal. We illustrate the performance of this model using a database of 322 real medical mammographic images. Mathematics Subject Classification: 68T05 Keywords: Breast Cancer, Artificial Neural Networks, Mammographic Images, Backpropagation, Classification 1 Introduction Breast cancer is currently one of the leading causes of mortality among women throughout the world. Statistically speaking, the breast cancer represents

4416 Ayoub Arafi et al. about 25% of all diagnosed cancers in women, and approximately 20% of all fatal cancers [2-4]. In Morocco, for example, some studies have shown that approximately one out of every eight women might be affected with breast cancer during her lifetime, and half of all affected women might die of this disease [5]. To reduce these high percentages, medical specialists insist on the necessity for early detection of this disease; and according to most of them mammography remains the only reliable and practical method to achieve this goal [6][7]. Mammography is a medical examination that uses low doses of X-rays for producing practical images of the internal structure of the breast. This is why mammogram-based diagnosis is highly recommended in order to increase the chances of cure for early affected patients. However, one of the main characteristics of mammographic images is their high spatial resolution, which makes the task of detection very hard for radiologists [8-10]. Hence the necessity for developing new tools that could help specialists in deciding based only on mammographic images, whether a woman has an early stage of breast cancer or not. The main goal of this paper is the proposition of such a tool in the form of an intelligent system capable of filtering and classifying digital mammographic images. The filtering process of this system transforms each mammographic image into an object vector whose components represent the image features useful for discriminating between the two categories. The classification process submits these object vectors as inputs to a multilayer perceptron that learns, using the backpropagation training algorithm, to separate them into normal and abnormal categories. After this training step, the system is tested on unseen images in order to assess its ability to recognize the category of each new image it receives as input. In the next section, we provide a more detailed description of this system and the ideas behind its two steps. In section III we present and discuss some examples of experimental results that illustrate the usefulness of the proposed system. Finally, in section IV we provide some concluding remarks and perspectives for future work. 2 Description of the proposed method 2.1 Preprocessing and feature extraction of raw images Image preprocessing is one of the subjects that have been widely studied in the area of medical image classification because the classification results may depend on the quality of the used images. This is why the first step of our system is dedicated to the problems of preprocessing and feature extraction. As an example, fig. 1 shows a filtered image, which consists in a digitized grayscale

Classification of mammographic images using ANN 4417 image with a resolution of 50 microns per pixel and a spatial resolution of 1024 1024 pixels. Figure 1: A sample example of filtered mammographic image The preprocessing problem consists in extracting from each filtered image the best vector of characteristics that can be used as an input for the classification process. For this, we apply a segmentation method that allows the partitioning of each mammographic image into two regions. This method consists in using an intensity threshold, η, as a reference to which each pixel value in the image should be compared. According to this comparison all pixels that have a gray level value greater than are coded by 1, and all others by 0. (a) (b) Figure 2: Example of a sample image: (a) before segmentation and (b) after segmentation Fig 2 shows the result of this operation for a simple image. The following pseudo code shows in more technical terms how the various steps of the segmentation process are implemented using the OpenCV library: 1. Let I be the current image and R n the set of regions that form I; 2. Let P (R i ) denotes the intensity of R i for i = 1, 2, 3,..., n; 3. For i = 1 to n do if (P (R i ) < η) then P (R i ) = 1 else P (R i ) = 0

4418 Ayoub Arafi et al. 2.2 Multilayer Perceptrons A multilayer perceptron (MLP) is a particular type of neural networks whose architecture contains an input layer, an output layer and at least one hidden layer. The size of the input layer in terms of the number of neurons is fixed by the dimensionality, p, of the data space. Indeed, each of the p units of this layer is dedicated to the perception of one component of each p-dimensional object vector presented as an input datum to the network. The size of the output layer depends on the number of classes, c. In the case of c=2, a single binary neuron is sufficient because each of its possible outputs, 0 or 1, can be associated with one of the two classes. The number and the size of hidden layers are application-dependant. In our case, only one hidden layer was sufficient and the size of this layer was heuristically determined (see section 3). MLP are generally trained using the well-know backpropagation algorithm [11], which is an optimization procedure aimed at minimizing the global error observed at the output layer of the network. This supervised learning algorithm is based on two major steps. The first one consists in forward propagating all input signals in the network, and calculating the results at the output layer. The second step consists in computing error gradients and propagating them backward in order to update the synaptic weights of all neurons that have participated in the observed error. For this, an updating rule based on the gradient descent technique is employed. More formally, the backpropagation algorithm (BP) can be described as follows: Given a training database X = {x 1, x 2, x 3,..., x n } where n is the total number of available examples and x i the object vector associated with the i th example, the local error observed at the output of the k th neuron is given by e(t) = d k (t) y k (t) (1) where d k (t) and y k (t) denote, respectively, the desired and the observed outputs of neuron k. This error represents the contribution of neuron k to the global squared error defined by E(t) = 1 c (d k (t) y k (t)) 2 (2) 2 k=1 The BP algorithm is an iterative procedure aimed at minimizing the error given by eq. (2) using the gradient descent technique. In more formal terms, the functioning of this algorithm can be described by the following pseudo-code [12]: // Backpropagation algorithm Given a labeled data set X = {x 1, x 2, x 3,..., x n } R p :

Classification of mammographic images using ANN 4419 1. Initialize the synaptic weights to small random values [ 0.5, +0.5]; 2. Randomly arrange the training data; 3. For i = 1 to n do { } (a) calculate y j (n) for j = 1 to c y j (n) = 1 1 + e ( b+ n k=1 w k x k ) (3) where m is the total inputs number of neuron j and b its bias (fixed to 1). (b) By retro-propagating the resulting errors, adjust the weights of each neuron j using the delta rule: with if j output layer or w ji (n) = w ji (n 1) + η δ j (n) y i (n) (4) δ j (n) = y j (n) (1 y j (n)) e j (n) (5) δ j (n) = y j (n) (1 y j (n)) k Next layer δ k (n) w kj (n) (6) otherwise. where 0 < η < 1 is a fixed learning rate and y i (n) the output of neuron i of the precedent layer, if it exists, or the i t h component of x otherwise. 4. Repeat steps 2 and 3 until E becomes smaller than a specified threshold, or until a maximum number of iterations is reached. Fig. 3 depicts the architecture we used in this application. The first layer of this architecture contains 1024 1024 neurons, and the last layer one single neuron. Each object vector presented at its input is a 1024 1024 dimensional vector whose components represent the intensities of the different pixels of an image. The total number of images used in the learning phase is 222, whilst 100 other images were used in the test phase. During the learning phase, the desired outputs are fixed to 0 for normal images and 1 for abnormal ones. In the test phase, when the output is below a fixed threshold, we consider that the mammographic image is normal; otherwise it is considered as abnormal.

4420 Ayoub Arafi et al. Figure 3: Architecture of the used MLP 3 Results and discussion The database of mammographic images used in our work was taken from the Mammographic Images Analysis Society [13]. It contains 322 images, digitized using a resolution of 50 microns per pixel with 8 bits. The spatial resolution of each image is fixed to 1024 1024 pixels. 208 of these images are normal and 116 are abnormal. Among the abnormal cases, 63 are malign and 51 benign. Table 1 shows how we used these data in both the learning and the test phases. Table 1: Amount of used data in the learning and testing phase. Category Learning phase Testing phase Normal 150 58 Abnormal 72 42 Moreover, given the importance of the hidden layer size, and its impact on the results of the diagnostic system, a heuristic method was used in order to experimentally determine the best value of this important parameter. This method consists in testing several architectures corresponding to different intuitively chosen values for this parameter. The results obtained for these different architectures, in both the learning and the testing phases were assessed and compared using the percentage of well classification rates for both normal and abnormal images. The results of these comparisons are summarized on Table 2. Analysis of these results shows clearly that very good results can be obtained, provided that the size of the hidden layer is appropriately tuned. In our case, the best results were obtained for the architecture with 80 hidden units, for which the percentages of well classification rates are of 95,49% and

Classification of mammographic images using ANN 4421 Table 2: Experimental results of different architectures Hidden Classification rate (in %) layer Learning phase Testing phase size Normal Abnormal Normal Abnormal 80 100 95.49 95.49 75.77 50 100 90.54 81.94 63.88 60 100 91.44 81.94 63.88 20 100 94.59 83.32 69.44 100 100 94.14 84.67 75.00 45 100 90.09 81.94 52.77 15 100 86.48 69.44 16.66 100% in the learning step and of 75,77% and 95,49% in the test phase in the case of normal images; with similar percentages in the case of abnormal images for both steps. These results show also that different architectures can give similar results, meaning that the problem of choosing the best value for the number of hidden neurons does not have a unique solution. This was the case, for example, for the two architectures with 20 and 60 hidden units. 4 Conclusion In this work, we proposed a method for breast cancer diagnosis based on artificial neural networks. This method is divided into two main parts. The first part consists in extracting the characteristics of mammographic images using special tools of an existing specialized library. The second part consists in using a multilayer perception in order to classify these images into normal and abnormal categories. The method was implemented using the C++ language and tested on real images taken from the MIAS database. The obtained results were very satisfying in both of the learning and the testing phases, with classification precision reaching 100% for many cases. These results confirm the usefulness of the neural networks-based model proposed in this work, and justify future efforts and developments in order to improve these preliminary results. For example, it would be very interesting to find more sophisticated ways for optimizing other parameters related to the preprocessing step as well as the number of hidden layers. References [1] Bradski, G. and Kaehler, A. (2008). Learning OpenCV: Computer Vision

4422 Ayoub Arafi et al. with the OpenCV Library. O Reilly, Cambridge. [2] J. Michaelson, S. Satija and R. Moore, et al. The pattern of breast cancer screening utilization and its consequences,cancer, Jan 1 2002 94(1): 37-43. [3] F. Meyer and S. Beucher, Morphological segmentation,j. Vis. Commun. Image Represent., vol. 1, pp. 2146, 1990. [4] J. Suckling et al (1994):The Mammographic Image Analysis Society Digital Mammogram Database Exerpta Medica.International Congress Series, 1069 pp375-378. [5] Association Lalla Salma de Lutte Contre le Cancer,Guide de protection precoce des cancers du sein, pp 10-13, Edition 2011. [6] S. Bawa, A thesis on Edge Based Region Growing, Department of Electronics and communication Engineering, Thapar.Institute of Engineering & Technology (Deemed University), India, June, 2006. [7] I.S. Jacobs and C.P. Bean, Fine particles, thin films and exchange anisotropy, Magnetism, vol. III, G.T. Rado and H. Suhl, Eds. New York: Academic, 1963, pp. 271-350. [8] Norum J. Breast cancer screening by mammography in Norway. Is it costeffective?,ann Oncol 1999, 10: 197-20. [9] T.Wang and N. Karayiannis, Detection of microcalcification in digitalmammograms using wavelets, IEEE Trans. Medical Imaging, 17(4): 498-509, 1998. [10] S. Lai, X. Li, and W. Bischof, On techniques for detecting circumscribed masses in mammograms,ieee Trans. Medical Imaging, 8(4): 377-386, 1989. [11] V. H. C. de Albuquerque, A. R. de Alexandria, P. C. Cortez, and J. M. R. S. Tavares, Evaluation of multilayer perceptron and self-organizing map neural network topologies applied on microstructure segmentation from metallographic images,ndt & E International, vol. 42, no. 7, pp. 644-651, Oct. 2009. [12] Y. Safi and A. Bouroumi, Prediction of Forest Fires Using Artificial Neural Networks, Applied Mathematical Sciences, vol. 7, no. 6, pp. 271-286, 2013. [13] J Suckling et al (1994): The Mammographic Image Analysis Society Digital Mammogram Database, Exerpta Medica.International Congress Series, 1069 pp. 375-378.

Classification of mammographic images using ANN 4423 Received: June 1, 2013