A proposal of hierarchization method based on data distribution for multi class classification

Tatsuya Kusuda, 1 Shinya Watanabe, 2 Jianming Shi 2 and Paul Horton 3

1 Graduate School of Information and Electronic Engineering, Muroran Institute of Technology
2 Department of Information and Electronic Engineering, Muroran Institute of Technology
3 National Institute of Advanced Industrial Science and Technology, Computational Biology Research Center (AIST CBRC)

This paper proposes a new hierarchical method for multi-class classification using binary classifiers. Unlike existing extension methods such as One-Against-One (OAO) and One-Against-All (OAA), the proposed method builds a hierarchical classification structure from the distribution of each class in the given training data. It can therefore be expected to classify more accurately than the existing methods, and to be particularly effective for classes with few samples. Because the hierarchy is formed on the basis of similarities among classes, the relative features of each class can also be inferred from the classification structure. The effectiveness of the proposed method is discussed through examples from the UCI repository, in comparison with k-nn and OAO.

1. Introduction

Classification methods include the Support Vector Machine (SVM) 1)-4) and the k-nearest neighbor algorithm (k-nn). SVM is a two-class classifier 1), so it is extended to multi-class problems by methods such as One-Against-One (OAO) and One-Against-All (OAA) 5), 6).
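
The abstract contrasts OAO and OAA with a hierarchy of binary classifiers. As a rough illustration of how the three schemes differ in size, the sketch below counts the binary classifiers each one trains for K classes. The OAO and OAA counts are the standard ones. The hierarchy count assumes a binary tree in which every internal node is one classifier and every leaf is a single class; that is an assumption about the proposed structure, not something the text states. The function name `classifier_counts` is hypothetical.

```python
def classifier_counts(num_classes: int) -> dict:
    """Number of binary classifiers trained by each multi-class scheme."""
    if num_classes < 2:
        raise ValueError("need at least two classes")
    return {
        # One-Against-One: one classifier per unordered pair of classes.
        "OAO": num_classes * (num_classes - 1) // 2,
        # One-Against-All: one classifier per class (class vs. the rest).
        "OAA": num_classes,
        # Assumed binary hierarchy: a binary tree with K leaves has K - 1
        # internal nodes, each holding one classifier.
        "hierarchy": num_classes - 1,
    }


if __name__ == "__main__":
    for k in (3, 6, 11):
        print(k, classifier_counts(k))
```

Under this assumption a sample passes through at most K - 1 classifiers on its way down the tree, whereas OAO consults every pairwise classifier before voting.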

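The proposed method is evaluated on data sets from the UCI repository 7) and compared with k-nn and with SVM extended by OAO.

Fig. 1 Conceptual diagram of proposal method (labels in the figure: hierarchy 1 to 3, classifier 1 to 5, class)

Step1: ...
Step2: ... (Section 2.2) ...
Step3: ... Step2 ... SVM ...
Step4: ... Step3 ... Step2 ...

Step1: ...
Step2: ... SVM ...
Step3: ... OAO ... SVM ...

The grouping step uses k-means with k=2, which divides the data into 2 clusters.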
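
The passage above names k-means with k=2 as the grouping tool for building the hierarchy. The sketch below shows one way such a recursive grouping could look. It is a deliberate simplification and not the authors' procedure: it clusters one mean vector per class instead of the individual samples, it ignores the parameters α and β, and the names `two_means` and `build_hierarchy` are hypothetical.

```python
import random


def centroid(points):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(points)
    return [sum(p[d] for p in points) / n for d in range(len(points[0]))]


def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def two_means(points, n_iter=20, seed=0):
    """Lloyd's algorithm with k=2; returns a 0/1 label for every point."""
    rng = random.Random(seed)
    centers = rng.sample(points, 2)
    labels = [0] * len(points)
    for _ in range(n_iter):
        # Assignment step: each point goes to its nearest center.
        labels = [0 if sq_dist(p, centers[0]) <= sq_dist(p, centers[1]) else 1
                  for p in points]
        # Update step: move each center to the mean of its members.
        for c in (0, 1):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = centroid(members)
    return labels


def build_hierarchy(class_means):
    """Recursively split classes into two groups; a leaf is a class label."""
    names = sorted(class_means)
    if len(names) == 1:
        return names[0]
    if len(names) == 2:
        return (names[0], names[1])
    assign = two_means([class_means[n] for n in names])
    left = [n for n, a in zip(names, assign) if a == 0]
    right = [n for n, a in zip(names, assign) if a == 1]
    if not left or not right:
        # Degenerate clustering: fall back to an arbitrary split.
        left, right = names[:1], names[1:]
    return (build_hierarchy({n: class_means[n] for n in left}),
            build_hierarchy({n: class_means[n] for n in right}))


if __name__ == "__main__":
    means = {"a": [0.0, 0.0], "b": [0.0, 1.0],
             "c": [10.0, 10.0], "d": [10.0, 11.0]}
    print(build_hierarchy(means))
```

Each tuple in the returned structure corresponds to one binary classifier that would be trained to separate its left group from its right group.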

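The grouping is based on 2-means 8).

Step1: 2-means ... 2 ...
Step2: ... Step1 ...
Step3: ... α% ...
Step4: ... N_{G_j} ...
Step5: ... C ... G ... Balance ... Accuracy ...

$$ \mathrm{Eval} = \beta \cdot \mathrm{Balance} + (1 - \beta) \cdot \mathrm{Accuracy} \tag{1} $$

$$ \mathrm{Balance} = \sum_{j=1}^{G} \bigl( \dots \bigr) \tag{2} $$

$$ \mathrm{Accuracy} = \dots \tag{3} $$

The summand of Eq. (2) involves N_{G_j}, G and C.

Step6: ... Step1 to Step5 ...
Step7: ... Step1 to Step6 ...
Step8: ... Eq. (1) ...

β is set to 0.3. α is the parameter used in Step3.

3. ... UCI ... k-nn ... SVM ... OAO

3.1 ...

Six data sets from the UCI repository 7) are used: Iris, Wine, Heart Disease, Glass, Vowel and Car Evaluation. ... α ... β ... Balance ... Eq. (1) ... 2-means ... Car Evaluation ... α ... 0.2 ... Table 1 lists the six UCI data sets.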
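
Equation (1) above combines Balance and Accuracy into one score with the weight β, which the text sets to 0.3. The sketch below evaluates that weighted sum for given inputs. It treats Balance and Accuracy as numbers supplied by the caller, because their own definitions, Eqs. (2) and (3), are not legible here. The function name `evaluation` and the example inputs are hypothetical.

```python
def evaluation(balance: float, accuracy: float, beta: float = 0.3) -> float:
    """Eq. (1): Eval = beta * Balance + (1 - beta) * Accuracy."""
    if not 0.0 <= beta <= 1.0:
        raise ValueError("beta must lie in [0, 1]")
    return beta * balance + (1.0 - beta) * accuracy


if __name__ == "__main__":
    # Illustrative inputs only; they are not taken from the experiments.
    print(evaluation(balance=0.5, accuracy=0.9))
```

With β = 0.3 the accuracy term carries 70% of the weight, so two candidate groupings with similar accuracy are separated mainly by their Balance term.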

Table 1 The characteristics of the used data

Dataset          Number of data   Features   Classes
Iris             ...              ...        ...
Wine             ...              ...        ...
Heart Disease    ...              ...        ...
Glass            ...              ...        ...
Vowel            ...              ...        ...
Car Evaluation   ...              ...        ...

Table 2 Used parameters

Parameter                                  Values
Determination of group parameter α         ...
Weight of evaluation formula parameter β   0.3
Number of partitions in cross validation   10
Number of clustering                       30

Table 3 The results of predictive accuracy (10-fold Cross-Validation)

Dataset          Weighted k-nn   OAO      Proposed method
Iris             95.33%          96.00%   97.33%
Wine             93.26%          97.19%   97.75%
Heart Disease    55.56%          55.22%   56.57%
Glass            65.42%          66.36%   70.56%
Vowel            92.02%          99.60%   95.04%
Car Evaluation   88.60%          99.36%   99.07%

In Table 3 the proposed method gives the highest accuracy on Iris, Wine, Heart Disease and Glass, while OAO is higher on Vowel and Car Evaluation.

Table 4 The predictive accuracy of each class in Glass

Id    Number of data   Weighted k-nn   OAO      Proposed method
...   ...              ...             71.43%   84.29%
...   ...              ...             76.32%   71.05%
...   ...              ...             11.76%   17.65%
...   ...              ...             ...      46.15%
...   ...              ...             22.22%   44.44%
...   ...              ...             79.31%   86.21%
...   ...              ...             66.36%   70.56%

The last row of Table 4 matches the overall Glass accuracy of OAO and the proposed method in Table 3.

Table 5 The effects of α value for predictive accuracy

                                          Proposed method (α values)
Dataset          Weighted k-nn   OAO      ...      ...      ...      ...      ...
Iris             94.00%          94.67%   96.00%   95.33%   95.33%   96.00%   96.67%
Wine             93.26%          97.19%   97.75%   97.19%   97.75%   98.31%   97.19%
Heart Disease    54.88%          54.88%   54.88%   54.88%   55.22%   54.88%   54.88%
Glass            65.42%          64.95%   64.49%   ...      64.95%   67.76%   64.95%
Vowel            92.02%          99.60%   95.45%   95.15%   95.05%   95.75%   99.60%
Car Evaluation   82.81%          99.71%   99.42%   99.54%   99.71%   99.71%   99.71%
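
The accuracies reported in Table 3 can be compared mechanically. The sketch below copies the Table 3 values into a dictionary and reports, for each data set, which method scored highest and by how much the proposed method differs from OAO. The variable and function names are hypothetical, and the code only restates the reported numbers; it does not test whether the differences are statistically significant.

```python
# Predictive accuracy in percent, copied from Table 3 (10-fold cross-validation).
RESULTS = {
    "Iris": {"Weighted k-nn": 95.33, "OAO": 96.00, "Proposed method": 97.33},
    "Wine": {"Weighted k-nn": 93.26, "OAO": 97.19, "Proposed method": 97.75},
    "Heart Disease": {"Weighted k-nn": 55.56, "OAO": 55.22, "Proposed method": 56.57},
    "Glass": {"Weighted k-nn": 65.42, "OAO": 66.36, "Proposed method": 70.56},
    "Vowel": {"Weighted k-nn": 92.02, "OAO": 99.60, "Proposed method": 95.04},
    "Car Evaluation": {"Weighted k-nn": 88.60, "OAO": 99.36, "Proposed method": 99.07},
}


def best_method(scores: dict) -> str:
    """Name of the method with the highest accuracy for one data set."""
    return max(scores, key=scores.get)


def gain_over_oao(scores: dict) -> float:
    """Proposed-method accuracy minus OAO accuracy, in percentage points."""
    return round(scores["Proposed method"] - scores["OAO"], 2)


WINNERS = {name: best_method(scores) for name, scores in RESULTS.items()}

if __name__ == "__main__":
    for name, scores in RESULTS.items():
        print(f"{name}: best = {WINNERS[name]}, "
              f"proposed - OAO = {gain_over_oao(scores):+.2f}")
```

The largest gap in favour of the proposed method is on Glass, and the largest gap in favour of OAO is on Vowel.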

Tables 6 and 7 list the classes and attributes of the Glass data from UCI, and Tables 8 and 9 those of the Heart Disease data.

Table 6 Classes of Glass data

id   Details of classes
1    building windows, float processed
2    building windows, non float processed
3    vehicle windows, float processed
5    containers
6    tableware
7    headlamps

Table 7 The attributes of Glass data

id   Features name   Details of features
1    RI              refractive index
2    Na              Sodium
3    Mg              Magnesium
4    Al              Aluminum
5    Si              Silicon
6    K               Potassium
7    Ca              Calcium
8    Ba              Barium
9    Fe              Iron

Fig. 2 Hierarchy diagram of proposal technique (Glass)

Table 8 Classes of Heart Disease data

Classes id   Details of classes
0            person who does not have heart disease
1 to 4       person who has heart disease (the value is the degree)

Table 9 The attributes of Heart Disease data

Features id   Features name   Details of features
1             age             age in years
2             sex             sex (1 = male; 0 = female)
3             cp              chest pain type
4             trestbps        resting blood pressure
5             chol            serum cholesterol in mg/dl
6             fbs             fasting blood sugar > 120 mg/dl
7             restecg         resting electrocardiographic results
8             thalach         maximum heart rate achieved
9             exang           exercise induced angina
10            oldpeak         ST depression induced by exercise relative to rest
11            slope           the slope of the peak exercise ST segment
12            ca              number of major vessels (0-3) colored by fluoroscopy
13            thal            3 = normal; 6 = fixed defect; 7 = reversible defect

Fig. 3 Hierarchy diagram of proposal technique (Heart Disease 0.2)

Fig. 4 Hierarchy diagram of proposal technique (Heart Disease 0.4)

References

1) G. Nalbantov, P.J.F. Groenen and J.C. Bioch: A majorization approach to linear support vector machines with different hinge errors, Advances in Data Analysis and Classification, Vol. 2, pp. ...
2) J.A.K. Suykens and J. Vandewalle: Least squares support vector machine classifiers, Neural Processing Letters, Vol. 9, No. 3, pp. ...
3) M. Doumpos, C. Zopounidis and V. Golfinopoulou: Additive Support Vector Machines for Pattern Classification, IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, Vol. 37, No. 3, pp. ...
4) K. Fukumizu: Special statistical properties of neural network learning, Proc. NOLTA 97, pp. ...
5) Jonathan Milgram, Mohamed Cheriet and Robert Sabourin: One Against One or One Against All: Which One is Better for Handwriting Recognition with SVMs?, Ecole de Technologie Superieure, Montreal, Canada, ...
6) Chih-Wei Hsu and Chih-Jen Lin: A comparison of methods for multiclass support vector machines, IEEE Transactions on Neural Networks, Vol. 13, No. 2, pp. ...
7) C.L. Blake and C.J. Merz: UCI repository of machine learning databases, University of California, Department of Information and Computer Science, 1998.
8) ..., 1988.
9) ...
10) P. Baldi et al.: Assessing the accuracy of prediction algorithms for classification: an overview, Bioinformatics, Vol. 16, No. 5, pp. ...


More information

Fuzzy Modeling using Vector Quantization with Supervised Learning

Fuzzy Modeling using Vector Quantization with Supervised Learning Fuzzy Modeling using Vector Quantization with Supervised Learning Hirofumi Miyajima, Noritaka Shigei, and Hiromi Miyajima Abstract It is known that learning methods of fuzzy modeling using vector quantization

More information

Nearest Cluster Classifier

Nearest Cluster Classifier Nearest Cluster Classifier Hamid Parvin, Moslem Mohamadi, Sajad Parvin, Zahra Rezaei, and Behrouz Minaei Nourabad Mamasani Branch, Islamic Azad University, Nourabad Mamasani, Iran hamidparvin@mamasaniiau.ac.ir,

More information

Nearest Cluster Classifier

Nearest Cluster Classifier Nearest Cluster Classifier Hamid Parvin, Moslem Mohamadi, Sajad Parvin, Zahra Rezaei, Behrouz Minaei Nourabad Mamasani Branch Islamic Azad University Nourabad Mamasani, Iran hamidparvin@mamasaniiau.ac.ir,

More information

LEARNING WEIGHTS OF FUZZY RULES BY USING GRAVITATIONAL SEARCH ALGORITHM

LEARNING WEIGHTS OF FUZZY RULES BY USING GRAVITATIONAL SEARCH ALGORITHM International Journal of Innovative Computing, Information and Control ICIC International c 2013 ISSN 1349-4198 Volume 9, Number 4, April 2013 pp. 1593 1601 LEARNING WEIGHTS OF FUZZY RULES BY USING GRAVITATIONAL

More information

Cost-Conscious Comparison of Supervised Learning Algorithms over Multiple Data Sets

Cost-Conscious Comparison of Supervised Learning Algorithms over Multiple Data Sets Cost-Conscious Comparison of Supervised Learning Algorithms over Multiple Data Sets Mehmet Aydın Ulaş, Olcay Taner Yıldız, Ethem Alpaydın Technical Report, FBE/CMPE-01/2008-04 Institute of Graduate Studies

More information

An Empirical Comparison of Ensemble Methods Based on Classification Trees. Mounir Hamza and Denis Larocque. Department of Quantitative Methods

An Empirical Comparison of Ensemble Methods Based on Classification Trees. Mounir Hamza and Denis Larocque. Department of Quantitative Methods An Empirical Comparison of Ensemble Methods Based on Classification Trees Mounir Hamza and Denis Larocque Department of Quantitative Methods HEC Montreal Canada Mounir Hamza and Denis Larocque 1 June 2005

More information

Intro to Artificial Intelligence

Intro to Artificial Intelligence Intro to Artificial Intelligence Ahmed Sallam { Lecture 5: Machine Learning ://. } ://.. 2 Review Probabilistic inference Enumeration Approximate inference 3 Today What is machine learning? Supervised

More information

Distance Learning and Attribute Importance Analysis by Linear Regression on Idealized Distance Functions

Distance Learning and Attribute Importance Analysis by Linear Regression on Idealized Distance Functions Wright State University CORE Scholar Browse all Theses and Dissertations Theses and Dissertations 2017 Distance Learning and Attribute Importance Analysis by Linear Regression on Idealized Distance Functions

More information

Reihe Informatik 10/2001. Efficient Feature Subset Selection for Support Vector Machines. Matthias Heiler, Daniel Cremers, Christoph Schnörr

Reihe Informatik 10/2001. Efficient Feature Subset Selection for Support Vector Machines. Matthias Heiler, Daniel Cremers, Christoph Schnörr Computer Vision, Graphics, and Pattern Recognition Group Department of Mathematics and Computer Science University of Mannheim D-68131 Mannheim, Germany Reihe Informatik 10/2001 Efficient Feature Subset

More information

Wrapper Feature Selection using Discrete Cuckoo Optimization Algorithm Abstract S.J. Mousavirad and H. Ebrahimpour-Komleh* 1 Department of Computer and Electrical Engineering, University of Kashan, Kashan,

More information

A Comparative Study of Selected Classification Algorithms of Data Mining

A Comparative Study of Selected Classification Algorithms of Data Mining Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 6, June 2015, pg.220

More information

Application of Clustering as a Data Mining Tool in Bp systolic diastolic

Application of Clustering as a Data Mining Tool in Bp systolic diastolic Application of Clustering as a Data Mining Tool in Bp systolic diastolic Assist. Proffer Dr. Zeki S. Tywofik Department of Computer, Dijlah University College (DUC),Baghdad, Iraq. Assist. Lecture. Ali

More information

Decision Jungles: Compact and Rich Models for Classification Supplementary Material

Decision Jungles: Compact and Rich Models for Classification Supplementary Material Decision Jungles: Compact and Rich Models for Classification Supplementary Material Jamie Shotton Toby Sharp Pushmeet Kohli Sebastian Nowozin John Winn Antonio Criminisi Microsoft Research, Cambridge,

More information

Data Mining and Knowledge Discovery. Data Mining and Knowledge Discovery

Data Mining and Knowledge Discovery. Data Mining and Knowledge Discovery The Computer Science Seminars University of Colorado at Denver Data Mining and Knowledge Discovery The Computer Science Seminars, University of Colorado at Denver Data Mining and Knowledge Discovery Knowledge

More information

Value Difference Metrics for Continuously Valued Attributes

Value Difference Metrics for Continuously Valued Attributes Proceedings of the International Conference on Artificial Intelligence, Expert Systems and Neural Networks (AIE 96), pp. 11-14, 1996. Value Difference Metrics for Continuously Valued Attributes D. Randall

More information

Class Strength Prediction Method for Associative Classification

Class Strength Prediction Method for Associative Classification Class Strength Prediction Method for Associative Classification Suzan Ayyat Joan Lu Fadi Thabtah Department of Informatics Huddersfield University Department of Informatics Huddersfield University Ebusiness

More information

ECG782: Multidimensional Digital Signal Processing

ECG782: Multidimensional Digital Signal Processing ECG782: Multidimensional Digital Signal Processing Object Recognition http://www.ee.unlv.edu/~b1morris/ecg782/ 2 Outline Knowledge Representation Statistical Pattern Recognition Neural Networks Boosting

More information

Practice EXAM: SPRING 2012 CS 6375 INSTRUCTOR: VIBHAV GOGATE

Practice EXAM: SPRING 2012 CS 6375 INSTRUCTOR: VIBHAV GOGATE Practice EXAM: SPRING 0 CS 6375 INSTRUCTOR: VIBHAV GOGATE The exam is closed book. You are allowed four pages of double sided cheat sheets. Answer the questions in the spaces provided on the question sheets.

More information

CHAPTER 3 RESEARCH METHODOLOGY

CHAPTER 3 RESEARCH METHODOLOGY CHAPTER 3 RESEARCH METHODOLOGY 3.1 Introduction This chapter discusses the methodology that is used in this study. The first section describes the steps involve, follows by dataset representation. The

More information

Computational Statistics The basics of maximum likelihood estimation, Bayesian estimation, object recognitions

Computational Statistics The basics of maximum likelihood estimation, Bayesian estimation, object recognitions Computational Statistics The basics of maximum likelihood estimation, Bayesian estimation, object recognitions Thomas Giraud Simon Chabot October 12, 2013 Contents 1 Discriminant analysis 3 1.1 Main idea................................

More information

Fingerprint Based Gender Classification Using Block-Based DCT

Fingerprint Based Gender Classification Using Block-Based DCT Fingerprint Based Gender Classification Using Block-Based DCT Akhil Anjikar 1, Suchita Tarare 2, M. M. Goswami 3 Dept. of IT, Rajiv Gandhi College of Engineering & Research, RTM Nagpur University, Nagpur,

More information

Some Thoughts on Machine Learning Software Design

Some Thoughts on Machine Learning Software Design Support Vector Machines 1 Some Thoughts on Machine Learning Software Design Chih-Jen Lin Department of Computer Science National Taiwan University Talk at University of Southampton, February 6, 2004 Support

More information

Assignment 1: CS Machine Learning

Assignment 1: CS Machine Learning Assignment 1: CS7641 - Machine Learning Saad Khan September 18, 2015 1 Introduction I intend to apply supervised learning algorithms to classify the quality of wine samples as being of high or low quality

More information