Available online at ScienceDirect. Procedia Computer Science 49 (2015 ) ICAC3 15

Size: px
Start display at page:

Download "Available online at ScienceDirect. Procedia Computer Science 49 (2015 ) ICAC3 15"

Transcription

1 Available online at ScienceDirect Procedia Computer Science 49 (2015 ) ICAC3 15 Microarray Classification of Cancerous Cell Using Soft Computing Technique PrachiVijayeeta a, Utsav Kar b, MadhurimaRana c, Madhabananda Das d, B.S.P Mishra e a School of Compute er Engineering,KIIT University, Odisha, India Abstract In the late 19th century, the advent of malignant tissues in the human cells has come into limelight. A lot of research works has been carried out to detect the tumorous cells and subsequently, various methodologies have been adopted to distinguish different types of cancerous cells. Gene expression based microarray technique has been used as an efficient and emerging technique in the field of cancer classification, prediction, diagnosis and many other type of research. In this paper,we have used Microarray data which is a collection of microscopic spots. Since there is an existence of huge dataset, the dimensionality reduction of the dataset is done at the first stage of our work. At the second stage, classification is performed using supervised learning methods. At the final stage, we have optimized the objective function using soft computing technique which yields the relevant results.soft computing refers to a consortium of computational methodologies that has motivated many scientific researchers to contribute their efforts in designing highly powerful intelligentt systems. We are mainly focusing on the classification of cancerous cells using two classifiers called SVM (support vector machine) and K-NN (K-nearest neighbor) and an optimization technique is being incorporated on significant parameters that results in better and relevant output Published The Authors.Published by Elsevier B.V. by This Elsevier is an B.V. open access article under the CC BY-NC-ND license ( Peer-review under responsibility of organizing committee of the 4th InternationalConference on Advances in Computing, Peer-review under responsibility of organizing committee of the 4th International Conference on Advances in Computing, Communication and Control (ICAC3 15). Communication and Control (ICAC3 15) Keywords:Microarray data, Support Vector Machine (SVM), K-NN(K- Nearest Neighbor), Soft computing. 1. Introduction With the growing importance of computational intelligence in diverse application areas, soft computing is fast gaining popularity in the world of research [12]. In the early 1990s Latif A. Zadeh a pioneer in this field coined the term Soft Computing [13] which gives a holistic view of inexactness and non-determinism of computational problems that need to be solved. The spirit of soft computing resembles a family of techniques with the potential of Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license ( Peer-review under responsibility of organizing committee of the 4th International Conference on Advances in Computing, Communication and Control (ICAC3 15) doi: /j.procs

2 Prachi Vijayeeta et al. / Procedia Computer Science 49 ( 2015 ) solving a specific class of problems for which other conventional techniques are merely found to be inadequate. Taking into consideration the perspective of soft computing, researchers have categorized the problem solving techniques into feasible components that are highly interactive over a given set of related domains. These principal components of soft computing comprises of ANN, Evolutionary search strategies including GA, ACO, Swarm optimization and etc. [18]. A microarray is a 2-D array on a small solid glass surface that holds large amount of biological data. The gene microarray [6] is a highly sophisticated chip, used in the detection of malignant tissues, proteins, peptides and carbohydrates expansion and in many other scientific researches related to biotechnology. We can simply analyse microarray as a collection of microscopic features which can be probed with the targeted ones to reveal significant gene expression data [4]. This paper entails classification of Breast cancer dataset from ( At the initial stage, dataset constitutes of high dimensional data value that stands as a great hindrance in performing high-ended computations. So we need to adopt some methodological traits that lead to the reduction of dimension of the dataset and to enhance the targeted output steadily. In our work, classification of cancerous cells has been carried out by using Principal Component Analysis (PCA) for feature selection so that the dimensionality of the dataset has been reduced to a great extent. We have taken the help of SVM (Support Vector Machine), K-NN (K- Nearest Neighboras a supervised machine learning classification tool for categorizing into tumorous and non-tumorous classes. Finally we optimize the classification cost using Cat Swarm Optimization (CSO) algorithm [2] that enables us in finding best solutions. We have organized the entire paper into five sections. The first section comprises of introduction where we have narrated the technologies in nutshell. Section 2 reveals the literature review where all related works are mentioned briefly. Section 3 describes the working process model of the entire work. Section 4 reveals the simulation s and the experimental results are represented on graphs. In section 5, the work has been concluded. 2. The State of the Art Processing Techniques Since the last two decades, the consolidation of soft computing as a collection of various synergistically complementary computational techniques has become one of the challenging areas in the field of biological and evolutionary researches. In this paper, we have chosen a list of pioneering literature survey based on intelligent techniques by concentrating on emerging field of computer science. Roy and Chakraborty [12] have introduced the fundamental concepts of soft computing, that provides a schematic view of overall architecture of soft computing techniques [13]. Vapnik et al.[9] have contributed their efforts in the field of vector learning by developing the concept of support vector machine[svm] that is mostly applied for classification of cancer metamorphosis using genotype features. Further, Simek et al. [7] have carried out experiments over SVD(singular value decomposition) and SVM (support vector machine) for selection of relevant features and extended their work by classifying the vectors into different classes. Cristianini et al. [8] have thrown light on the application of SVM for categorizing the trained data to their corresponding targeted output. Many researchers like Furey et al. [15] from university of Washington have dedicated their skills in carrying out simulations for validating malignant tissue samples using microarray dataset. They have revealed the presence of tumorous and non-tumorous cells, using distinct fluorescent dyes on the biochip. G.Panda et al. [10] have published both nationally as well as internationally numerous works on solving optimal problems using highly interesting and computationally smart enough techniques. In this paper the application of cat swarm optimization (CSO) is highly explored with simulations and analysis being conducted by taking into account different significant parameters. Lu et al. [11] have cited many experimental works at University of Illionis that have helped the researchers in studying the evolutionary strategies and there implementation in the field of computer science. Jang et al. [1] has published various articles that have guided us in determining advanced searching strategies by incorporating hybrid forms of computational techniques. Chu et al. [3] have proposed a new swarm based optimization technique called CSO (cat swarm optimization) by studying the behavior of cat s and have successfully implemented in different engineering applications.

3 68 Prachi Vijayeeta et al. / Procedia Computer Science 49 ( 2015 ) Tsi et al. [2] have extended the research on a family of CSO algorithms and have experimentally tested the functions by simulating different parameters. Mc Lachlan et al. [4] have carried out statistical data analysis of a large cancerous microarray dataset and have represented it graphically. His contributions to the classification of microarray data have been a strong backbone for the researchers both in the computational as well as in the statistical scenario. Kalyanmoy Deb et al. [18] have proposed various evolutionary algorithms based on single objectives and multi-objectives problem-solution techniques that are pedagogically sound in reviewing classical techniques as well as the advanced techniques. 3. Working Process Model The working process model is based on following four technologies: 3.1. Microarray Technology A microarray is a 2-D array on a small solid glass surface that holds large amount of biological data. The gene microarray [6] is a highly sophisticated chip, used in the detection of malignant tissues, proteins, peptides and carbohydrates expansion and in many other scientific researches related to biotechnology [4]. With the help of this biochip, it is possible to store all the observations of gene expression concerned to a single sample or to a set of samples. These set of samples are a combination of tumorous and non-tumorous tissues and we need to carry out biopsy for detecting the cancer metamorphosis. The messenger m-rna [14] is then obtained from the cells and tiny droplets containing gene fragments are allocated on the exact locations of the thin glass-slide. The probes are synthesized and are represented as spots that correspond to the features or the predictors [17]. Then a fluorescent labelling is done at the target location and finally hybridized to the probe microarray. The diameter of each probe spot is nearly 18 m which can accommodate large probes per array [7] Feature Selection The microarray dataset are very huge dimension and needs to be reduced for convenience. In order to get a global solution we need to search the m-dimensional search space very efficiently. The main aim of feature selection is to minimize the dimension of the dataset so that, the training and the testing of the data can be done in an efficient manner. Previously classical feature selection techniques were used but in our paper we have used Principal Component Analysis for decreasing the size of the dataset [5] ( PCA is basically applied in statistical data analysis using orthogonal transformation which converts correlated variables into a set of linearly uncorrelated variables from a given set of observations. These are called Principal components. The cardinality of the Principal components are never more than the number of actual number of variables. It is a powerful exploratory data analysis tool that is used for making predictive models K- Nearest Neighbor (K-NN) Classifier K-nearest neighbor is an instance based learning in which classification is done by comparing feature vectors of the different points. If a i (x) denotes the features (for i=1,2, n) then the Euclidian distance between two instances is: The classification accuracy of K-NN is improved by minimizing the distance between the points. The value of K depends on dataset and is chosen by adopting some heuristic techniques. While performing classification we consider K as a user defined constant that reduces the effect of noise in the classifier Support Vector Machines (SVM) SVM is a supervised machine learning tool that is used for analyzing the gene expression data arriving from the DNA microarray. It is a binomial classification that basically categorizes the elements of microarray data into two groups as per the various rules. For a given set of training samples, each is recognized by any one of the two classes. The SVM training algorithm designs a model that assigns the newly arriving samples to one of theeither category.

4 Prachi Vijayeeta et al. / Procedia Computer Science 49 ( 2015 ) This model represents the sample as points in the space that are separable linearly and the hyper plane computation is done by drawing the margin thereby distinguishing the training sample from the class boundary [8]. 3.4 Problem Formulation Let G be a n x m matrix where the rows vector represents the Patients (P 1, P 2, P 3. P n ) and the column vector denotes the genes (G 1, G 2, G 3,.G m ) in such a way that n<m [11]. We have considered two class vectors [9] that represent tumor and non-tumor tissues respectively. Then we need to design a classification problem with a linear kernel function (1) [9, 13] K(x i, x j ) = (x i )T (x j ) The mathematical interpretation of SVM is to correctly classify all the training data in the following manner. w x i + b 1 ify i = +1 wx i + b 1 if y i = -1 y i (wx i + b ) 1, i, x i R n and then to maximize the margin (M) i.e. the hyper plane, such that M= 2 / W The linear programming problem model takes the form: Subject to the constraint We need to predict the values of w and b and C which is a positive Cost factor. 3.5 Salient Feature of Cat Swarm Optimization It is consider that each and every cat is being placed in its own m-dimensional position. Let V = {v 1, v 2,.,v n } be the velocities in the respective dimensions and let a suitable fitness function be taken into consideration so that the corresponding fitness value is computed to represent the location of the cat. To identify whether the cat is in seeking mode or in tracing mode a flag is used. The ultimate solution at the end would lead to one of the cat s best positions. [3] CSO algorithm is designed by simulating the behavior of the cats in these two different modes. Model-1: In the first mode (i.e. Seeking mode), the situation of the cat which islooking around, seekingand restingfor the next position to move is being modeled.model-2: In the second mode (i.e. Tracing mode), the modeling is done in such a way that it traces out only to the targeted ones. Table1: Parameters Description. Parameters of Model Description Random variable(r1) Random variable (r1) belongs to [0, 1] Constant (c1) A constant which is set to 2 in the experiment. Cats Behavior: The cats behavior reveals that, it remains idle most of the time when they are awake and move very slowly in and around. When they feel the occurrence of some targeted prey, they move very speedily and try to capture it. They are very curious about all kinds of things moving around them [10]. Cat Swarm Optimization Algorithm Step1 Initialize the number of cats (N). Step 2 Distribute the cats arbitrarily into the M-dimensional search space. Step 3 Assign random values to the velocity of the cat within the limit of maximum velocity. Randomly pick up a number of cats and set them into tracing mode as per the value of Mixture Ratio(MR) and set the remaining cats to the seeking mode.

5 70 Prachi Vijayeeta et al. / Procedia Computer Science 49 ( 2015 ) Step 4 Compute the fitness value of each and every cat by calculating the fitness function based on the position of the cat. Save the best cat position in the memory (x best ) that generates the global best solution. Step 5 Then the cats are moved to other locations depending on their flag value. If the k th cat is in the seeking mode then applying it to the seeking mode process otherwise place it in the tracing mode process. Step 5 Repeat this process by picking up other cats and set them in tracing mode as for the value of mixture ratio (MR).In the meanwhile, set the other cats into seeking mode. Step 6 Continue this process until the termination condition is encountered.if not, repeat step 3 to step7.step 7 Finish. 4. Experimental Results and Simulations Based upon the input parameters we have provided, graph is plotted accordingly and is being depicted in Fig. 1(a) and Fig. 1(b) respectively. These two figure clearly shows that dimensionality of the huge dataset is reduced significantly after using PCA. The graph reveals the reduced number of components by eliminating the unworthy elements of the data set. Fig.1. PCA result graphs. Fig. 2.Graphical representation of Classification results.

6 Prachi Vijayeeta et al. / Procedia Computer Science 49 ( 2015 ) Table 2: Parameters Setting of Cat Swarm Optimization. Parameters Value SMP 8.0 SRD 0.5 CDC 0.8 MR 0.7 C1 2.0 R1 0.1 Numberof Cats(N) 20 Number of Iterations 60 Fig. 3. The Graphical View of the optimization result. Table 3: Experimental results. K-NN SVM Re substitution Loss Cross validation Loss Classification Cost 2.304e e+02

7 72 Prachi Vijayeeta et al. / Procedia Computer Science 49 ( 2015 ) In our work, we have conducted experiments based on 3 different requirements. At the first stage we have designed the PCA code for reducing the components of the dataset. At the second stage, the squeezed data matrix obtained is being trained using supervised learning (SVM and K-NN) for further classification. Here Fig. 2(a) is the graphical view of SVM classification result and Fig. 2(b) is the result of KNN classifier. In our Experiment we have also checked the accuracy of the classifiers by using cross validation and re substitution methods. In Table 3 the experimental values are shown. Ultimately, the final classification cost which is the global best solution is being obtained. In Fig. 3(a) is obtained after applying CSO with SVM and Fig. 3(b) is the graphical view of optimized result after SVM classification. While conducting the experiment we have taken the number of cats (N) as 20 and the corresponding parameters are assigned values as for the Table 2. The targeted output is represented in tabular form and its diagrammatic view is represented on the graph. 5. Conclusion Although there are many searching strategies for evaluating an optimal solution, but we have applied cat swarm optimization (CSO) technique to compute the classification cost. The simulations carried out for dimension reducibility have resulted in a graph with a very feasible targeted output. The classification analysis using SVM and KNN, supervised learning tool has also resulted in a valid solution. From the experimental result it is seen that re substitution loss and cross validation loss for SVM is less as compared to KNN i.e for SVM it is 5%,!3% respectively and for KNN it is 47% and 63%. So, from the accuracy point of view SVM can classify more accurately than SVM. Classification cost of SVM CSO is lesser than KNN SVM classification cost. So it has been found that the classification cost of CSO SVM is better and it can classify more accurately. In our future work we planned to implement other soft computing technique and other classifier to yield better result and accuracy. References 1. Jang JSR, Sun CT and Mizutani E,Neuro-Fuzzy and Soft Computing: A Computational Approach to Learning and Machine Intelligence, Prentice Hall. 2. Tsai PW. andistanda, V.Review on Cat Swarm Optimization Algorithms.3rd International Conference on Consumer Electronics. Communications and Networks (CECNet) Chu SC and Tsai PW. Computational Intelligence Based On the Behavior of Cats. International Journal of InnovativeComputing. Information and Control ICIC International Volume 3, McLachlan G and Jones LBT.Classification of Microarray Data-Recent Statistical Approaches, Wiley Publication. 5. Jolliffe IT.Principal Component Analysis, Springer Series in Statistical Theory and Method Schena M. Microarray analysis. Wiley Blackwell Publication. 7. Simek K.Fujarewicz K and Jarzab B. Using SVD and SVM method for selection, classification, clustering and modeling of DNA microarray data. Eng. APP. Artf. Intell Cristianini N. and Taylor JS. An Introduction to Support Vector Machines and other kernel-based learning methods. Cambridge University Press Guyon I, Weston J, Barnhill S andvapnik V. Gene Selection for cancer classification using SVM. Machine Learning Pradhan P. M and Panda G. Solving multi-objective problems using cat swarm optimization. Expert System andapplication Ying L and Jiawei H, Cancer Classification Using Gene Expression Data. Information System Vol Roy S and ChakrabortyU.Introduction to Soft Computing: Neuro-Fuzzy and Genetic Algorithms, Pearson publication. 13. Zadeh L.A. Applied Soft Computing, vol 1, No.1, Huerta EB, Duval B and Hao JK.A Hybrid GA/SVM Approach for Gene Selection and Classification of Microarray Data. Evolutionary Workshops LNCS 3907,

8 Prachi Vijayeeta et al. / Procedia Computer Science 49 ( 2015 ) Furey TS, Cristianini N, Duffy N,Bednarski DW, Schummer M and Haussler D. Support Vector Machine Classification and validation of cancer tissue samples using Microarray Expression Data. Oxford Journals of Bioinformatics Ho SY, Hsieh CH, Chen HM and Huang HL.Interpretable gene expression classifier with an accurate and compact fuzzy rule base for microarray data analysis. BioSystems Vol Sarhan AM.Cancer Classification Based On Microarray Gene Expression Data Using DCT and ANN.Journal oftheoretical and Applied Information Technology. 2009, Deb K..Multiobjective optimization using evolutionary algorithms. John Wiley and Sons

A Naïve Soft Computing based Approach for Gene Expression Data Analysis

A Naïve Soft Computing based Approach for Gene Expression Data Analysis Available online at www.sciencedirect.com Procedia Engineering 38 (2012 ) 2124 2128 International Conference on Modeling Optimization and Computing (ICMOC-2012) A Naïve Soft Computing based Approach for

More information

A New Meta-heuristic Bat Inspired Classification Approach for Microarray Data

A New Meta-heuristic Bat Inspired Classification Approach for Microarray Data Available online at www.sciencedirect.com Procedia Technology 4 (2012 ) 802 806 C3IT-2012 A New Meta-heuristic Bat Inspired Classification Approach for Microarray Data Sashikala Mishra a, Kailash Shaw

More information

Data mining with Support Vector Machine

Data mining with Support Vector Machine Data mining with Support Vector Machine Ms. Arti Patle IES, IPS Academy Indore (M.P.) artipatle@gmail.com Mr. Deepak Singh Chouhan IES, IPS Academy Indore (M.P.) deepak.schouhan@yahoo.com Abstract: Machine

More information

Dimension reduction : PCA and Clustering

Dimension reduction : PCA and Clustering Dimension reduction : PCA and Clustering By Hanne Jarmer Slides by Christopher Workman Center for Biological Sequence Analysis DTU The DNA Array Analysis Pipeline Array design Probe design Question Experimental

More information

Available online at ScienceDirect. Procedia Computer Science 89 (2016 )

Available online at   ScienceDirect. Procedia Computer Science 89 (2016 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 89 (2016 ) 778 784 Twelfth International Multi-Conference on Information Processing-2016 (IMCIP-2016) Color Image Compression

More information

CT79 SOFT COMPUTING ALCCS-FEB 2014

CT79 SOFT COMPUTING ALCCS-FEB 2014 Q.1 a. Define Union, Intersection and complement operations of Fuzzy sets. For fuzzy sets A and B Figure Fuzzy sets A & B The union of two fuzzy sets A and B is a fuzzy set C, written as C=AUB or C=A OR

More information

CANCER PREDICTION USING PATTERN CLASSIFICATION OF MICROARRAY DATA. By: Sudhir Madhav Rao &Vinod Jayakumar Instructor: Dr.

CANCER PREDICTION USING PATTERN CLASSIFICATION OF MICROARRAY DATA. By: Sudhir Madhav Rao &Vinod Jayakumar Instructor: Dr. CANCER PREDICTION USING PATTERN CLASSIFICATION OF MICROARRAY DATA By: Sudhir Madhav Rao &Vinod Jayakumar Instructor: Dr. Michael Nechyba 1. Abstract The objective of this project is to apply well known

More information

Chemometrics. Description of Pirouette Algorithms. Technical Note. Abstract

Chemometrics. Description of Pirouette Algorithms. Technical Note. Abstract 19-1214 Chemometrics Technical Note Description of Pirouette Algorithms Abstract This discussion introduces the three analysis realms available in Pirouette and briefly describes each of the algorithms

More information

SVM Classification in -Arrays

SVM Classification in -Arrays SVM Classification in -Arrays SVM classification and validation of cancer tissue samples using microarray expression data Furey et al, 2000 Special Topics in Bioinformatics, SS10 A. Regl, 7055213 What

More information

Classification by Support Vector Machines

Classification by Support Vector Machines Classification by Support Vector Machines Florian Markowetz Max-Planck-Institute for Molecular Genetics Computational Molecular Biology Berlin Practical DNA Microarray Analysis 2003 1 Overview I II III

More information

Classification by Support Vector Machines

Classification by Support Vector Machines Classification by Support Vector Machines Florian Markowetz Max-Planck-Institute for Molecular Genetics Computational Molecular Biology Berlin Practical DNA Microarray Analysis 2003 1 Overview I II III

More information

An Efficient QBIR system using Adaptive segmentation and multiple features

An Efficient QBIR system using Adaptive segmentation and multiple features Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 87 (2016 ) 134 139 2016 International Conference on Computational Science An Efficient QBIR system using Adaptive segmentation

More information

AN IMPROVED HYBRIDIZED K- MEANS CLUSTERING ALGORITHM (IHKMCA) FOR HIGHDIMENSIONAL DATASET & IT S PERFORMANCE ANALYSIS

AN IMPROVED HYBRIDIZED K- MEANS CLUSTERING ALGORITHM (IHKMCA) FOR HIGHDIMENSIONAL DATASET & IT S PERFORMANCE ANALYSIS AN IMPROVED HYBRIDIZED K- MEANS CLUSTERING ALGORITHM (IHKMCA) FOR HIGHDIMENSIONAL DATASET & IT S PERFORMANCE ANALYSIS H.S Behera Department of Computer Science and Engineering, Veer Surendra Sai University

More information

Naïve Bayes for text classification

Naïve Bayes for text classification Road Map Basic concepts Decision tree induction Evaluation of classifiers Rule induction Classification using association rules Naïve Bayesian classification Naïve Bayes for text classification Support

More information

Goal-oriented Schema in Biological Database Design

Goal-oriented Schema in Biological Database Design Goal-oriented Schema in Biological Database Design Ping Chen Department of Computer Science University of Helsinki Helsinki, Finland 00014 EMAIL: pchen@cs.helsinki.fi Abstract In this paper, I reviewed

More information

ScienceDirect. An Advanced Multi Class Instance Selection based Support Vector Machine for Text Classification

ScienceDirect. An Advanced Multi Class Instance Selection based Support Vector Machine for Text Classification Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 57 (2015 ) 1124 1130 3rd International Conference on Recent Trends in Computing 2015 (ICRTC-2015) An Advanced Multi Class

More information

Applying Supervised Learning

Applying Supervised Learning Applying Supervised Learning When to Consider Supervised Learning A supervised learning algorithm takes a known set of input data (the training set) and known responses to the data (output), and trains

More information

Exploratory data analysis for microarrays

Exploratory data analysis for microarrays Exploratory data analysis for microarrays Jörg Rahnenführer Computational Biology and Applied Algorithmics Max Planck Institute for Informatics D-66123 Saarbrücken Germany NGFN - Courses in Practical DNA

More information

MICROARRAY IMAGE SEGMENTATION USING CLUSTERING METHODS

MICROARRAY IMAGE SEGMENTATION USING CLUSTERING METHODS Mathematical and Computational Applications, Vol. 5, No. 2, pp. 240-247, 200. Association for Scientific Research MICROARRAY IMAGE SEGMENTATION USING CLUSTERING METHODS Volkan Uslan and Đhsan Ömür Bucak

More information

Available Online through

Available Online through Available Online through www.ijptonline.com ISSN: 0975-766X CODEN: IJPTFI Research Article ANALYSIS OF CT LIVER IMAGES FOR TUMOUR DIAGNOSIS BASED ON CLUSTERING TECHNIQUE AND TEXTURE FEATURES M.Krithika

More information

Available online at ScienceDirect. Procedia Computer Science 35 (2014 )

Available online at  ScienceDirect. Procedia Computer Science 35 (2014 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 35 (2014 ) 388 396 18 th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems

More information

Classifying Brain Anomalies Using PCA And SVM Rosy Kumari 1, Rishi Kumar Soni 2

Classifying Brain Anomalies Using PCA And SVM Rosy Kumari 1, Rishi Kumar Soni 2 International Journal of scientific research and management (IJSRM) Volume 2 Issue 5 Pages 935-939 2014 Website: www.ijsrm.in ISSN (e): 2321-3418 Classifying Brain Anomalies Using PCA And SVM Rosy Kumari

More information

Machine Learning in Biology

Machine Learning in Biology Università degli studi di Padova Machine Learning in Biology Luca Silvestrin (Dottorando, XXIII ciclo) Supervised learning Contents Class-conditional probability density Linear and quadratic discriminant

More information

Review of feature selection techniques in bioinformatics by Yvan Saeys, Iñaki Inza and Pedro Larrañaga.

Review of feature selection techniques in bioinformatics by Yvan Saeys, Iñaki Inza and Pedro Larrañaga. Americo Pereira, Jan Otto Review of feature selection techniques in bioinformatics by Yvan Saeys, Iñaki Inza and Pedro Larrañaga. ABSTRACT In this paper we want to explain what feature selection is and

More information

Robot Learning. There are generally three types of robot learning: Learning from data. Learning by demonstration. Reinforcement learning

Robot Learning. There are generally three types of robot learning: Learning from data. Learning by demonstration. Reinforcement learning Robot Learning 1 General Pipeline 1. Data acquisition (e.g., from 3D sensors) 2. Feature extraction and representation construction 3. Robot learning: e.g., classification (recognition) or clustering (knowledge

More information

A STUDY OF SOME DATA MINING CLASSIFICATION TECHNIQUES

A STUDY OF SOME DATA MINING CLASSIFICATION TECHNIQUES A STUDY OF SOME DATA MINING CLASSIFICATION TECHNIQUES Narsaiah Putta Assistant professor Department of CSE, VASAVI College of Engineering, Hyderabad, Telangana, India Abstract Abstract An Classification

More information

Using a genetic algorithm for editing k-nearest neighbor classifiers

Using a genetic algorithm for editing k-nearest neighbor classifiers Using a genetic algorithm for editing k-nearest neighbor classifiers R. Gil-Pita 1 and X. Yao 23 1 Teoría de la Señal y Comunicaciones, Universidad de Alcalá, Madrid (SPAIN) 2 Computer Sciences Department,

More information

Chapter 7 UNSUPERVISED LEARNING TECHNIQUES FOR MAMMOGRAM CLASSIFICATION

Chapter 7 UNSUPERVISED LEARNING TECHNIQUES FOR MAMMOGRAM CLASSIFICATION UNSUPERVISED LEARNING TECHNIQUES FOR MAMMOGRAM CLASSIFICATION Supervised and unsupervised learning are the two prominent machine learning algorithms used in pattern recognition and classification. In this

More information

CHAPTER 6 HYBRID AI BASED IMAGE CLASSIFICATION TECHNIQUES

CHAPTER 6 HYBRID AI BASED IMAGE CLASSIFICATION TECHNIQUES CHAPTER 6 HYBRID AI BASED IMAGE CLASSIFICATION TECHNIQUES 6.1 INTRODUCTION The exploration of applications of ANN for image classification has yielded satisfactory results. But, the scope for improving

More information

Redefining and Enhancing K-means Algorithm

Redefining and Enhancing K-means Algorithm Redefining and Enhancing K-means Algorithm Nimrat Kaur Sidhu 1, Rajneet kaur 2 Research Scholar, Department of Computer Science Engineering, SGGSWU, Fatehgarh Sahib, Punjab, India 1 Assistant Professor,

More information

Biclustering for Microarray Data: A Short and Comprehensive Tutorial

Biclustering for Microarray Data: A Short and Comprehensive Tutorial Biclustering for Microarray Data: A Short and Comprehensive Tutorial 1 Arabinda Panda, 2 Satchidananda Dehuri 1 Department of Computer Science, Modern Engineering & Management Studies, Balasore 2 Department

More information

Classification by Support Vector Machines

Classification by Support Vector Machines Classification by Support Vector Machines Florian Markowetz Max-Planck-Institute for Molecular Genetics Computational Molecular Biology Berlin Practical DNA Microarray Analysis 2003 1 Overview I II III

More information

FEATURE SELECTION TECHNIQUES

FEATURE SELECTION TECHNIQUES CHAPTER-2 FEATURE SELECTION TECHNIQUES 2.1. INTRODUCTION Dimensionality reduction through the choice of an appropriate feature subset selection, results in multiple uses including performance upgrading,

More information

PROBLEM FORMULATION AND RESEARCH METHODOLOGY

PROBLEM FORMULATION AND RESEARCH METHODOLOGY PROBLEM FORMULATION AND RESEARCH METHODOLOGY ON THE SOFT COMPUTING BASED APPROACHES FOR OBJECT DETECTION AND TRACKING IN VIDEOS CHAPTER 3 PROBLEM FORMULATION AND RESEARCH METHODOLOGY The foregoing chapter

More information

Mass Classification Method in Mammogram Using Fuzzy K-Nearest Neighbour Equality

Mass Classification Method in Mammogram Using Fuzzy K-Nearest Neighbour Equality Mass Classification Method in Mammogram Using Fuzzy K-Nearest Neighbour Equality Abstract: Mass classification of objects is an important area of research and application in a variety of fields. In this

More information

ScienceDirect. Reducing Semantic Gap in Video Retrieval with Fusion: A survey

ScienceDirect. Reducing Semantic Gap in Video Retrieval with Fusion: A survey Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 50 (2015 ) 496 502 Reducing Semantic Gap in Video Retrieval with Fusion: A survey D.Sudha a, J.Priyadarshini b * a School

More information

Application of Support Vector Machine In Bioinformatics

Application of Support Vector Machine In Bioinformatics Application of Support Vector Machine In Bioinformatics V. K. Jayaraman Scientific and Engineering Computing Group CDAC, Pune jayaramanv@cdac.in Arun Gupta Computational Biology Group AbhyudayaTech, Indore

More information

DYNAMIC CORE BASED CLUSTERING OF GENE EXPRESSION DATA. Received April 2013; revised October 2013

DYNAMIC CORE BASED CLUSTERING OF GENE EXPRESSION DATA. Received April 2013; revised October 2013 International Journal of Innovative Computing, Information and Control ICIC International c 2014 ISSN 1349-4198 Volume 10, Number 3, June 2014 pp. 1051-1069 DYNAMIC CORE BASED CLUSTERING OF GENE EXPRESSION

More information

Rule extraction from support vector machines

Rule extraction from support vector machines Rule extraction from support vector machines Haydemar Núñez 1,3 Cecilio Angulo 1,2 Andreu Català 1,2 1 Dept. of Systems Engineering, Polytechnical University of Catalonia Avda. Victor Balaguer s/n E-08800

More information

Available online at ScienceDirect. Procedia CIRP 44 (2016 )

Available online at  ScienceDirect. Procedia CIRP 44 (2016 ) Available online at www.sciencedirect.com ScienceDirect Procedia CIRP 44 (2016 ) 102 107 6th CIRP Conference on Assembly Technologies and Systems (CATS) Worker skills and equipment optimization in assembly

More information

Hamming Distance based Binary PSO for Feature Selection and Classification from high dimensional Gene Expression Data

Hamming Distance based Binary PSO for Feature Selection and Classification from high dimensional Gene Expression Data Hamming Distance based Binary PSO for Feature Selection and Classification from high dimensional Gene Expression Data Haider Banka and Suresh Dara Department of Computer Science and Engineering Indian

More information

Data Cleaning and Prototyping Using K-Means to Enhance Classification Accuracy

Data Cleaning and Prototyping Using K-Means to Enhance Classification Accuracy Data Cleaning and Prototyping Using K-Means to Enhance Classification Accuracy Lutfi Fanani 1 and Nurizal Dwi Priandani 2 1 Department of Computer Science, Brawijaya University, Malang, Indonesia. 2 Department

More information

CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION

CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION 6.1 INTRODUCTION Fuzzy logic based computational techniques are becoming increasingly important in the medical image analysis arena. The significant

More information

Performance Degradation Assessment and Fault Diagnosis of Bearing Based on EMD and PCA-SOM

Performance Degradation Assessment and Fault Diagnosis of Bearing Based on EMD and PCA-SOM Performance Degradation Assessment and Fault Diagnosis of Bearing Based on EMD and PCA-SOM Lu Chen and Yuan Hang PERFORMANCE DEGRADATION ASSESSMENT AND FAULT DIAGNOSIS OF BEARING BASED ON EMD AND PCA-SOM.

More information

String Vector based KNN for Text Categorization

String Vector based KNN for Text Categorization 458 String Vector based KNN for Text Categorization Taeho Jo Department of Computer and Information Communication Engineering Hongik University Sejong, South Korea tjo018@hongik.ac.kr Abstract This research

More information

Feature Selection. CE-725: Statistical Pattern Recognition Sharif University of Technology Spring Soleymani

Feature Selection. CE-725: Statistical Pattern Recognition Sharif University of Technology Spring Soleymani Feature Selection CE-725: Statistical Pattern Recognition Sharif University of Technology Spring 2013 Soleymani Outline Dimensionality reduction Feature selection vs. feature extraction Filter univariate

More information

Available online at ScienceDirect. Procedia Computer Science 89 (2016 )

Available online at  ScienceDirect. Procedia Computer Science 89 (2016 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 89 (2016 ) 341 348 Twelfth International Multi-Conference on Information Processing-2016 (IMCIP-2016) Parallel Approach

More information

A PSO-based Generic Classifier Design and Weka Implementation Study

A PSO-based Generic Classifier Design and Weka Implementation Study International Forum on Mechanical, Control and Automation (IFMCA 16) A PSO-based Generic Classifier Design and Weka Implementation Study Hui HU1, a Xiaodong MAO1, b Qin XI1, c 1 School of Economics and

More information

Estimating Error-Dimensionality Relationship for Gene Expression Based Cancer Classification

Estimating Error-Dimensionality Relationship for Gene Expression Based Cancer Classification 1 Estimating Error-Dimensionality Relationship for Gene Expression Based Cancer Classification Feng Chu and Lipo Wang School of Electrical and Electronic Engineering Nanyang Technological niversity Singapore

More information

Taxonomically Clustering Organisms Based on the Profiles of Gene Sequences Using PCA

Taxonomically Clustering Organisms Based on the Profiles of Gene Sequences Using PCA Journal of Computer Science 2 (3): 292-296, 2006 ISSN 1549-3636 2006 Science Publications Taxonomically Clustering Organisms Based on the Profiles of Gene Sequences Using PCA 1 E.Ramaraj and 2 M.Punithavalli

More information

Available online at ScienceDirect. Procedia Computer Science 96 (2016 )

Available online at  ScienceDirect. Procedia Computer Science 96 (2016 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 96 (2016 ) 946 950 20th International Conference on Knowledge Based and Intelligent Information and Engineering Systems

More information

CHAPTER 4 DETECTION OF DISEASES IN PLANT LEAF USING IMAGE SEGMENTATION

CHAPTER 4 DETECTION OF DISEASES IN PLANT LEAF USING IMAGE SEGMENTATION CHAPTER 4 DETECTION OF DISEASES IN PLANT LEAF USING IMAGE SEGMENTATION 4.1. Introduction Indian economy is highly dependent of agricultural productivity. Therefore, in field of agriculture, detection of

More information

CLUSTERING IN BIOINFORMATICS

CLUSTERING IN BIOINFORMATICS CLUSTERING IN BIOINFORMATICS CSE/BIMM/BENG 8 MAY 4, 0 OVERVIEW Define the clustering problem Motivation: gene expression and microarrays Types of clustering Clustering algorithms Other applications of

More information

Preprocessing of Stream Data using Attribute Selection based on Survival of the Fittest

Preprocessing of Stream Data using Attribute Selection based on Survival of the Fittest Preprocessing of Stream Data using Attribute Selection based on Survival of the Fittest Bhakti V. Gavali 1, Prof. Vivekanand Reddy 2 1 Department of Computer Science and Engineering, Visvesvaraya Technological

More information

Class dependent feature weighting and K-nearest neighbor classification

Class dependent feature weighting and K-nearest neighbor classification Class dependent feature weighting and K-nearest neighbor classification Elena Marchiori Institute for Computing and Information Sciences, Radboud University Nijmegen, The Netherlands elenam@cs.ru.nl Abstract.

More information

A Genetic Algorithm for Graph Matching using Graph Node Characteristics 1 2

A Genetic Algorithm for Graph Matching using Graph Node Characteristics 1 2 Chapter 5 A Genetic Algorithm for Graph Matching using Graph Node Characteristics 1 2 Graph Matching has attracted the exploration of applying new computing paradigms because of the large number of applications

More information

A Survey And Comparative Analysis Of Data

A Survey And Comparative Analysis Of Data A Survey And Comparative Analysis Of Data Mining Techniques For Network Intrusion Detection Systems In Information Security, intrusion detection is the act of detecting actions that attempt to In 11th

More information

9/29/13. Outline Data mining tasks. Clustering algorithms. Applications of clustering in biology

9/29/13. Outline Data mining tasks. Clustering algorithms. Applications of clustering in biology 9/9/ I9 Introduction to Bioinformatics, Clustering algorithms Yuzhen Ye (yye@indiana.edu) School of Informatics & Computing, IUB Outline Data mining tasks Predictive tasks vs descriptive tasks Example

More information

A Classifier with the Function-based Decision Tree

A Classifier with the Function-based Decision Tree A Classifier with the Function-based Decision Tree Been-Chian Chien and Jung-Yi Lin Institute of Information Engineering I-Shou University, Kaohsiung 84008, Taiwan, R.O.C E-mail: cbc@isu.edu.tw, m893310m@isu.edu.tw

More information

Available online at ScienceDirect. Procedia Computer Science 98 (2016 )

Available online at  ScienceDirect. Procedia Computer Science 98 (2016 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 98 (016 ) 548 553 The 3rd International Symposium on Emerging Information, Communication and Networks (EICN 016) Finding

More information

Available online at ScienceDirect. Procedia Computer Science 37 (2014 )

Available online at   ScienceDirect. Procedia Computer Science 37 (2014 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 37 (2014 ) 176 180 The 5th International Conference on Emerging Ubiquitous Systems and Pervasive Networks (EUSPN-2014)

More information

Available online at ScienceDirect. Procedia Computer Science 79 (2016 )

Available online at  ScienceDirect. Procedia Computer Science 79 (2016 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 79 (2016 ) 483 489 7th International Conference on Communication, Computing and Virtualization 2016 Novel Content Based

More information

Available online at ScienceDirect. Procedia Computer Science 58 (2015 )

Available online at  ScienceDirect. Procedia Computer Science 58 (2015 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 58 (2015 ) 552 557 Second International Symposium on Computer Vision and the Internet (VisionNet 15) Fingerprint Recognition

More information

ScienceDirect. Analogy between immune system and sensor replacement using mobile robots on wireless sensor networks

ScienceDirect. Analogy between immune system and sensor replacement using mobile robots on wireless sensor networks Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 35 (2014 ) 1352 1359 18 th International Conference in Knowledge Based and Intelligent Information & Engineering Systems

More information

Statistical Pattern Recognition

Statistical Pattern Recognition Statistical Pattern Recognition Features and Feature Selection Hamid R. Rabiee Jafar Muhammadi Spring 2014 http://ce.sharif.edu/courses/92-93/2/ce725-2/ Agenda Features and Patterns The Curse of Size and

More information

Clustering Techniques

Clustering Techniques Clustering Techniques Bioinformatics: Issues and Algorithms CSE 308-408 Fall 2007 Lecture 16 Lopresti Fall 2007 Lecture 16-1 - Administrative notes Your final project / paper proposal is due on Friday,

More information

Use of Multi-category Proximal SVM for Data Set Reduction

Use of Multi-category Proximal SVM for Data Set Reduction Use of Multi-category Proximal SVM for Data Set Reduction S.V.N Vishwanathan and M Narasimha Murty Department of Computer Science and Automation, Indian Institute of Science, Bangalore 560 012, India Abstract.

More information

SVM CLASSIFICATION AND ANALYSIS OF MARGIN DISTANCE ON MICROARRAY DATA. A Thesis. Presented to. The Graduate Faculty of The University of Akron

SVM CLASSIFICATION AND ANALYSIS OF MARGIN DISTANCE ON MICROARRAY DATA. A Thesis. Presented to. The Graduate Faculty of The University of Akron SVM CLASSIFICATION AND ANALYSIS OF MARGIN DISTANCE ON MICROARRAY DATA A Thesis Presented to The Graduate Faculty of The University of Akron In Partial Fulfillment of the Requirements for the Degree Master

More information

Gene Expression Based Classification using Iterative Transductive Support Vector Machine

Gene Expression Based Classification using Iterative Transductive Support Vector Machine Gene Expression Based Classification using Iterative Transductive Support Vector Machine Hossein Tajari and Hamid Beigy Abstract Support Vector Machine (SVM) is a powerful and flexible learning machine.

More information

The Anatomical Equivalence Class Formulation and its Application to Shape-based Computational Neuroanatomy

The Anatomical Equivalence Class Formulation and its Application to Shape-based Computational Neuroanatomy The Anatomical Equivalence Class Formulation and its Application to Shape-based Computational Neuroanatomy Sokratis K. Makrogiannis, PhD From post-doctoral research at SBIA lab, Department of Radiology,

More information

Double Self-Organizing Maps to Cluster Gene Expression Data

Double Self-Organizing Maps to Cluster Gene Expression Data Double Self-Organizing Maps to Cluster Gene Expression Data Dali Wang, Habtom Ressom, Mohamad Musavi, Cristian Domnisoru University of Maine, Department of Electrical & Computer Engineering, Intelligent

More information

MRI Classification and Segmentation of Cervical Cancer to Find the Area of Tumor

MRI Classification and Segmentation of Cervical Cancer to Find the Area of Tumor MRI Classification and Segmentation of Cervical Cancer to Find the Area of Tumor S. S. Dhumal 1, S. S. Agrawal 2 1 Department of EnTC, University of Pune, India Abstract Automated classification and detection

More information

CHAPTER 6 ORTHOGONAL PARTICLE SWARM OPTIMIZATION

CHAPTER 6 ORTHOGONAL PARTICLE SWARM OPTIMIZATION 131 CHAPTER 6 ORTHOGONAL PARTICLE SWARM OPTIMIZATION 6.1 INTRODUCTION The Orthogonal arrays are helpful in guiding the heuristic algorithms to obtain a good solution when applied to NP-hard problems. This

More information

An Intelligent Clustering Algorithm for High Dimensional and Highly Overlapped Photo-Thermal Infrared Imaging Data

An Intelligent Clustering Algorithm for High Dimensional and Highly Overlapped Photo-Thermal Infrared Imaging Data An Intelligent Clustering Algorithm for High Dimensional and Highly Overlapped Photo-Thermal Infrared Imaging Data Nian Zhang and Lara Thompson Department of Electrical and Computer Engineering, University

More information

Support Vector Machines for visualization and dimensionality reduction

Support Vector Machines for visualization and dimensionality reduction Support Vector Machines for visualization and dimensionality reduction Tomasz Maszczyk and W lodzis law Duch Department of Informatics, Nicolaus Copernicus University, Toruń, Poland tmaszczyk@is.umk.pl;google:w.duch

More information

Facial Expression Recognition using Principal Component Analysis with Singular Value Decomposition

Facial Expression Recognition using Principal Component Analysis with Singular Value Decomposition ISSN: 2321-7782 (Online) Volume 1, Issue 6, November 2013 International Journal of Advance Research in Computer Science and Management Studies Research Paper Available online at: www.ijarcsms.com Facial

More information

Image Processing Techniques for Brain Tumor Extraction from MRI Images using SVM Classifier

Image Processing Techniques for Brain Tumor Extraction from MRI Images using SVM Classifier Image Processing Techniques for Brain Tumor Extraction from MRI Images using SVM Classifier Mr. Ajaj Khan M. Tech (CSE) Scholar Central India Institute of Technology Indore, India ajajkhan72@gmail.com

More information

Feature Selection in Knowledge Discovery

Feature Selection in Knowledge Discovery Feature Selection in Knowledge Discovery Susana Vieira Technical University of Lisbon, Instituto Superior Técnico Department of Mechanical Engineering, Center of Intelligent Systems, IDMEC-LAETA Av. Rovisco

More information

Multiple Sequence Alignment Using Reconfigurable Computing

Multiple Sequence Alignment Using Reconfigurable Computing Multiple Sequence Alignment Using Reconfigurable Computing Carlos R. Erig Lima, Heitor S. Lopes, Maiko R. Moroz, and Ramon M. Menezes Bioinformatics Laboratory, Federal University of Technology Paraná

More information

Fuzzy C-means with Bi-dimensional Empirical Mode Decomposition for Segmentation of Microarray Image

Fuzzy C-means with Bi-dimensional Empirical Mode Decomposition for Segmentation of Microarray Image www.ijcsi.org 316 Fuzzy C-means with Bi-dimensional Empirical Mode Decomposition for Segmentation of Microarray Image J.Harikiran 1, D.RamaKrishna 2, M.L.Phanendra 3, Dr.P.V.Lakshmi 4, Dr.R.Kiran Kumar

More information

Available online at ScienceDirect. Procedia Computer Science 93 (2016 )

Available online at   ScienceDirect. Procedia Computer Science 93 (2016 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 93 (2016 ) 396 402 6th International Conference On Advances In Computing & Communications, ICACC 2016, 6-8 September 2016,

More information

!" # $ % & $ '$ !" #$% & '( "88, A *? #&( ) .!" # " $%&' ( ) * +, -.# / $ Stepwise linear discriminate analysis

! # $ % & $ '$ ! #$% & '( 88, A *? #&( )   .! #  $%&' ( ) * +, -.# / $ Stepwise linear discriminate analysis 1393/1/31: 1393/4/12: 1393 "# /292 / " # $ % & $ '$ 2 1 #&( ). " # " $%&' ( ) * +, -.# / $ :. @4 44. " @4 34) > =; < #) 8 # #" 8" DNA (#5." 6245 123" # : " ( (, $%&' H > / 5 (G (.# F3" (AE ( C D)* (>"

More information

Noise-based Feature Perturbation as a Selection Method for Microarray Data

Noise-based Feature Perturbation as a Selection Method for Microarray Data Noise-based Feature Perturbation as a Selection Method for Microarray Data Li Chen 1, Dmitry B. Goldgof 1, Lawrence O. Hall 1, and Steven A. Eschrich 2 1 Department of Computer Science and Engineering

More information

Keyword Extraction by KNN considering Similarity among Features

Keyword Extraction by KNN considering Similarity among Features 64 Int'l Conf. on Advances in Big Data Analytics ABDA'15 Keyword Extraction by KNN considering Similarity among Features Taeho Jo Department of Computer and Information Engineering, Inha University, Incheon,

More information

Face Recognition using Eigenfaces SMAI Course Project

Face Recognition using Eigenfaces SMAI Course Project Face Recognition using Eigenfaces SMAI Course Project Satarupa Guha IIIT Hyderabad 201307566 satarupa.guha@research.iiit.ac.in Ayushi Dalmia IIIT Hyderabad 201307565 ayushi.dalmia@research.iiit.ac.in Abstract

More information

The Curse of Dimensionality

The Curse of Dimensionality The Curse of Dimensionality ACAS 2002 p1/66 Curse of Dimensionality The basic idea of the curse of dimensionality is that high dimensional data is difficult to work with for several reasons: Adding more

More information

Encoding Words into String Vectors for Word Categorization

Encoding Words into String Vectors for Word Categorization Int'l Conf. Artificial Intelligence ICAI'16 271 Encoding Words into String Vectors for Word Categorization Taeho Jo Department of Computer and Information Communication Engineering, Hongik University,

More information

Clustering analysis of gene expression data

Clustering analysis of gene expression data Clustering analysis of gene expression data Chapter 11 in Jonathan Pevsner, Bioinformatics and Functional Genomics, 3 rd edition (Chapter 9 in 2 nd edition) Human T cell expression data The matrix contains

More information

Clustering of Data with Mixed Attributes based on Unified Similarity Metric

Clustering of Data with Mixed Attributes based on Unified Similarity Metric Clustering of Data with Mixed Attributes based on Unified Similarity Metric M.Soundaryadevi 1, Dr.L.S.Jayashree 2 Dept of CSE, RVS College of Engineering and Technology, Coimbatore, Tamilnadu, India 1

More information

Support Vector Machines: Brief Overview" November 2011 CPSC 352

Support Vector Machines: Brief Overview November 2011 CPSC 352 Support Vector Machines: Brief Overview" Outline Microarray Example Support Vector Machines (SVMs) Software: libsvm A Baseball Example with libsvm Classifying Cancer Tissue: The ALL/AML Dataset Golub et

More information

Comparison of different preprocessing techniques and feature selection algorithms in cancer datasets

Comparison of different preprocessing techniques and feature selection algorithms in cancer datasets Comparison of different preprocessing techniques and feature selection algorithms in cancer datasets Konstantinos Sechidis School of Computer Science University of Manchester sechidik@cs.man.ac.uk Abstract

More information

Contents Machine Learning concepts 4 Learning Algorithm 4 Predictive Model (Model) 4 Model, Classification 4 Model, Regression 4 Representation

Contents Machine Learning concepts 4 Learning Algorithm 4 Predictive Model (Model) 4 Model, Classification 4 Model, Regression 4 Representation Contents Machine Learning concepts 4 Learning Algorithm 4 Predictive Model (Model) 4 Model, Classification 4 Model, Regression 4 Representation Learning 4 Supervised Learning 4 Unsupervised Learning 4

More information

Available online Journal of Scientific and Engineering Research, 2019, 6(1): Research Article

Available online   Journal of Scientific and Engineering Research, 2019, 6(1): Research Article Available online www.jsaer.com, 2019, 6(1):193-197 Research Article ISSN: 2394-2630 CODEN(USA): JSERBR An Enhanced Application of Fuzzy C-Mean Algorithm in Image Segmentation Process BAAH Barida 1, ITUMA

More information

REPRESENTATION OF BIG DATA BY DIMENSION REDUCTION

REPRESENTATION OF BIG DATA BY DIMENSION REDUCTION Fundamental Journal of Mathematics and Mathematical Sciences Vol. 4, Issue 1, 2015, Pages 23-34 This paper is available online at http://www.frdint.com/ Published online November 29, 2015 REPRESENTATION

More information

Table of Contents. Recognition of Facial Gestures... 1 Attila Fazekas

Table of Contents. Recognition of Facial Gestures... 1 Attila Fazekas Table of Contents Recognition of Facial Gestures...................................... 1 Attila Fazekas II Recognition of Facial Gestures Attila Fazekas University of Debrecen, Institute of Informatics

More information

Tumor Detection and classification of Medical MRI UsingAdvance ROIPropANN Algorithm

Tumor Detection and classification of Medical MRI UsingAdvance ROIPropANN Algorithm International Journal of Engineering Research and Advanced Technology (IJERAT) DOI:http://dx.doi.org/10.31695/IJERAT.2018.3273 E-ISSN : 2454-6135 Volume.4, Issue 6 June -2018 Tumor Detection and classification

More information

Research Article Feature Selection and Parameter Optimization of Support Vector Machines Based on Modified Cat Swarm Optimization

Research Article Feature Selection and Parameter Optimization of Support Vector Machines Based on Modified Cat Swarm Optimization Hindawi Publishing Corporation International Journal of Distributed Sensor Networks Volume 2015, Article ID 365869, 9 pages http://dx.doi.org/10.1155/2015/365869 Research Article Feature Selection and

More information

Hybrid Feature Selection for Modeling Intrusion Detection Systems

Hybrid Feature Selection for Modeling Intrusion Detection Systems Hybrid Feature Selection for Modeling Intrusion Detection Systems Srilatha Chebrolu, Ajith Abraham and Johnson P Thomas Department of Computer Science, Oklahoma State University, USA ajith.abraham@ieee.org,

More information

Giri Narasimhan. CAP 5510: Introduction to Bioinformatics. ECS 254; Phone: x3748

Giri Narasimhan. CAP 5510: Introduction to Bioinformatics. ECS 254; Phone: x3748 CAP 5510: Introduction to Bioinformatics Giri Narasimhan ECS 254; Phone: x3748 giri@cis.fiu.edu www.cis.fiu.edu/~giri/teach/bioinfs07.html 3/3/08 CAP5510 1 Gene g Probe 1 Probe 2 Probe N 3/3/08 CAP5510

More information

Features: representation, normalization, selection. Chapter e-9

Features: representation, normalization, selection. Chapter e-9 Features: representation, normalization, selection Chapter e-9 1 Features Distinguish between instances (e.g. an image that you need to classify), and the features you create for an instance. Features

More information