Automatic Paroxysmal Atrial Fibrillation Based on not Fibrillating ECGs


Schattauer GmbH

E. Ros, S. Mota, F. J. Toro, A. F. Díaz, F. J. Fernández
Department of Architecture and Computer Technology, University of Granada, Spain

Summary

Objectives: This paper describes an automatic algorithm for Paroxysmal Atrial Fibrillation (PAF) detection, based on parameters extracted from ECG traces that contain no atrial fibrillation episode. The modular automatic classification algorithm for PAF diagnosis is developed and evaluated with different parameter configurations.

Methods: The database used in this study was provided by PhysioBank for the Computers in Cardiology Challenge. Each ECG file in this database was translated into a 48-parameter vector. The modular classification algorithm used for PAF diagnosis was based on the K nearest neighbours. Several configuration options were evaluated to optimize the classification performance.

Results: Different configurations of the proposed modular classification algorithm were tested. The uniparametric approach achieved a top classification rate of 76%. A multiparametric approach was configured using the 5 parameters with the highest discrimination power, and a top classification rate of 80% was achieved; different functions to typify the parameters were tested. Finally, two automatic parametric scanning strategies, the Forward and Backward methods, were adopted. These approaches achieved a top classification rate of 92%.

Conclusions: A modular classification algorithm based on the K nearest neighbours was designed. The classification performance of the algorithm was evaluated using different parameter configurations, typification functions and numbers of neighbours (K). The automatic parametric scanning techniques achieved much better results than the previously tested configurations.
Keywords: Paroxysmal Atrial Fibrillation, automatic diagnosis, ECG signal processing

Methods Inf Med 2004; 43:

1. Introduction

The automatic diagnosis of patients who suffer from PAF by analysing ECG records that do not contain explicit PAF episodes is a difficult task. An international initiative that addressed this problem concluded recently [1, 2], but the results were not definitive, so the problem remains open. The development of an automatic algorithm for PAF detection consists of two stages:

Parameter characterization of PAF. This stage is described in [3], which defines a parameter set to be used in PAF detection applications.

Automatic classification algorithm set-up. In this stage the classification strategy and the parameter selection that optimise its performance are defined.

The present paper describes an automatic classification algorithm that discriminates between PAF patients and healthy subjects (Section 2). The algorithm takes as inputs parameters extracted from ECG traces that do not contain explicit fibrillation episodes. Section 3 studies how the performance is affected by different algorithm set-ups, and Section 4 discusses the results.

2. Methodology

As described in [3], each ECG record is translated into a 48-component vector (P_1, ..., P_48). The characterization parameters described in [3] are low-level parameters (P wave amplitude, P wave width, etc.). Each parameter represents a different physical characteristic and the ranges are very diverse. To make a multiparametric distance scheme possible as the classification kernel, all these parameters must be typified: for instance, the average (M_i) of each parameter is calculated over the whole database and then each parameter is divided by its corresponding M_i, i.e. p_i = P_i / M_i. A modular classification algorithm based on the K nearest neighbours has been used for this application.
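The mean-based typification described above can be illustrated with a minimal sketch. It assumes that every record is a list of floats of equal length and that no parameter has a zero average; the function name `typify_by_mean` and the sample values are hypothetical and are not taken from [3].

```python
from statistics import mean


def typify_by_mean(vectors):
    """Divide every parameter by its average over the whole database.

    Implements p_i = P_i / M_i. Assumes no parameter has a zero average.
    """
    n_params = len(vectors[0])
    # M_i: average of parameter i over all records.
    means = [mean(v[i] for v in vectors) for i in range(n_params)]
    return [[v[i] / means[i] for i in range(n_params)] for v in vectors]


# Hypothetical records with two parameters of very different ranges.
raw = [[0.10, 80.0], [0.20, 120.0], [0.30, 100.0]]
typified = typify_by_mean(raw)
```

After this step every parameter has an average of 1, so parameters with large raw values no longer dominate the Euclidean distance only because of their units.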
The labelled vectors work as the references of the classification system. For each new unlabelled vector, the Euclidean distances to the labelled vectors are calculated. The labels of the K closest neighbours are consulted, and the final label is obtained through a voting scheme as the label of the majority of the K neighbours. The scheme of the complete classification system is shown in Figure 1. The classification algorithm is therefore modular: new parameters can be added easily, since only the dimension considered in the Euclidean distance calculation has to be modified. This modularity enables the parameter scanning techniques described in the next section.

3. Results

Due to the small size of the training database (25 patients and 25 healthy subjects), the classification rate is evaluated in 50 cycles with the leave-one-out method. In each cycle, one vector is selected as the test element and classified according to the scheme described above, with the other 49 labelled vectors as classification references. In each cycle the classification result updates one of four counters: True_Positive (TP), True_Negative (TN), False_Positive (FP) and False_Negative (FN). The final classification rate (CR), sensitivity (SE) and specificity (SP) are calculated from these counters, which accumulate the classification results of the 50 cycles. All the values of CR, SE and SP reported in the following tables are expressed in %.

A more appropriate study of the classification performance requires a specific test data set. In fact, a test database is available on the web [1]. Any registered user can send a file of labels obtained by a classification algorithm for the test database, and the classification rate is returned automatically as the only result. The number of accesses is very restricted and the labels of the test records are kept secret. For this reason, no specific test classification result is reported in the Discussion section. It must be remarked, however, that the test results of the proposed approach are much lower (below 70%) than the classification results reported in the next sections. These large disparities seem to be caused by significant statistical differences between the training vector database and the test vector database, as the top-scoring authors in [1] also concluded. We expected to overcome these disparities using the normalized parameters proposed in [3], but this was not the case.

A multifactorial statistical study has been carried out with the Analysis of Variance method. It shows that the algorithm performance depends significantly on several factors: the number of neighbours (K), the typification function and the number of parameters considered (the dimension of the input vectors). The influence of these factors is studied below.

Uniparametric Classifier

Table 1 summarizes the classification results of the best parameters for different numbers of neighbours (K).
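The leave-one-out evaluation and the four counters described in this section can be sketched as follows. This is an illustrative reading of the text, not the authors' code: the function names, the label convention (1 for PAF, 0 for healthy) and the toy one-parameter data are assumptions, and the vectors are assumed to be typified already.

```python
import math
from collections import Counter


def knn_label(query, references, k):
    """Return the majority label among the k references closest to query."""
    ranked = sorted(references, key=lambda ref: math.dist(query, ref[0]))
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


def leave_one_out(vectors, labels, k):
    """Classify each vector against all the others and accumulate counters."""
    tp = tn = fp = fn = 0
    for i, (vector, truth) in enumerate(zip(vectors, labels)):
        # Every other labelled vector acts as a classification reference.
        references = [
            (v, l) for j, (v, l) in enumerate(zip(vectors, labels)) if j != i
        ]
        predicted = knn_label(vector, references, k)
        if truth == 1 and predicted == 1:
            tp += 1
        elif truth == 1:
            fn += 1
        elif predicted == 1:
            fp += 1
        else:
            tn += 1
    total = tp + tn + fp + fn
    return {
        "TP": tp, "TN": tn, "FP": fp, "FN": fn,
        "CR": 100.0 * (tp + tn) / total,                   # classification rate
        "SE": 100.0 * tp / (tp + fn) if tp + fn else 0.0,  # sensitivity
        "SP": 100.0 * tn / (tn + fp) if tn + fp else 0.0,  # specificity
    }


# Hypothetical one-parameter vectors: 1 = PAF patient, 0 = healthy subject.
vectors = [[1.0], [1.1], [1.2], [3.0], [3.1], [3.2], [2.0]]
labels = [1, 1, 1, 0, 0, 0, 0]
result = leave_one_out(vectors, labels, k=1)
```

In this toy set the healthy record at 2.0 lies closer to the PAF group, so it is counted as a false positive and lowers SP while SE stays at 100%. Using an odd K with two classes avoids ties in the vote.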
The best results are obtained with 9 neighbours; this value depends on the degree of overlap of the data.

Fig. 1 Classification algorithm scheme.

Table 1 Classification results obtained by the best uniparametric classifier, varying the number of neighbours (K). The parameters are described in [3].

Multiparametric Classifier

To implement a multiparametric classification scheme, the input vector must be typified so that the parameters become dimensionless values. We have studied three typification functions:

A. The average (M_i) of each parameter (P_i) is calculated. Then each parameter is typified: p_i = P_i / M_i.

B. The standard deviation (SD_i) of each parameter is calculated. Then each parameter is typified as follows: p_i = (P_i - M_i) / SD_i.

C. We define a gain term β_i = α C_i, where α is a scale factor (with possible values 1, 10 and 100) and C_i is a discrimination power estimator calculated for each parameter by using it in a uniparametric classification scheme (C_i is the classification rate (%) of each parameter in Table 1). The typification is then given by the expression p_i = β_i P_i / M_i. In this way each dimension of the parametric space is deformed according to the discrimination power (C_i) of its corresponding parameter.

To evaluate how the classification performance changes with the typification mode, we have calculated the classification results using the 5 parameters with the greatest individual discrimination power (C_i). Table 2 summarizes the classification results obtained with the different typification functions. Typification modes A and C are almost equivalent (for values of α from 1 to 100). This is because the discrimination powers of the different parameters, which are used as weights, are similar.

Table 2 Performance variation due to the typification mode. Results obtained using a multiparametric scheme with the best 5 parameters and different numbers of neighbours in the voting step.

Automatic Configuration: Parameter Scanning

The parameter space has been scanned sequentially in order to maximize the classification performance while reducing the number of parameters on which it is based. Two well-known methods have been used for this sequential scanning task [4]:

Forward Method. In the first step a uniparametric classifier is used, trying all the parameters, and the parameter that leads to the best classification performance is selected. In the second step, the classifier is used with the selected parameter plus a new one, trying every parameter not yet selected, and the second parameter that leads to the best result is also selected. In this way the parameter set on which the classifier is based grows by one parameter per cycle until all the parameters are considered. The performance curve rises as the first parameters are added and falls when too many parameters are taken into account. The best parameter set is finally selected, i.e. the minimum parameter set with the maximum classification performance.

Backward Method. In the first step all the parameters are used in the classifier. In the second step one of the parameters is dropped, trying each parameter individually, and the parameter set that reaches the best classification is kept. In this way, a different parameter is dropped in each cycle.
The final parameter set is again the one that maximizes the classification performance with a minimum size.
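The two scanning strategies can be sketched as greedy searches over parameter indices. This is a minimal sketch under stated assumptions: `score` stands for any function that maps a list of parameter indices to a classification performance (for example, a leave-one-out classification rate), and the toy `gains` table is hypothetical and only serves to make the example runnable.

```python
def forward_selection(n_params, score):
    """Grow the parameter set one index at a time, keeping the best addition."""
    selected, history = [], []
    remaining = list(range(n_params))
    while remaining:
        best = max(remaining, key=lambda p: score(selected + [p]))
        selected = selected + [best]
        remaining.remove(best)
        history.append((list(selected), score(selected)))
    top = max(s for _, s in history)
    # The first entry that reaches the top score is also the smallest set.
    return next((subset, s) for subset, s in history if s == top)


def backward_elimination(n_params, score):
    """Start from all parameters and drop the least useful one per cycle."""
    current = list(range(n_params))
    history = [(list(current), score(current))]
    while len(current) > 1:
        drop = max(current, key=lambda p: score([q for q in current if q != p]))
        current = [q for q in current if q != drop]
        history.append((list(current), score(current)))
    top = max(s for _, s in history)
    # Among the sets that reach the top score, keep the smallest one.
    return min((h for h in history if h[1] == top), key=lambda h: len(h[0]))


# Hypothetical score: parameters 0 and 2 help, parameters 1 and 3 hurt.
gains = {0: 3, 1: -1, 2: 2, 3: -1}


def toy_score(subset):
    return sum(gains[p] for p in subset)


forward_best = forward_selection(4, toy_score)
backward_best = backward_elimination(4, toy_score)
```

Both searches evaluate only a small fraction of all possible subsets (each cycle fixes one decision), which is why they are fast but not exhaustive.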

After defining the classification performance (CP), either simply as the classification rate (CR) or as a more sophisticated function, these two methods can maximize it. Tables 3 and 4 show the results of applying the Forward and Backward methods to maximize the CR using a minimum number of parameters in the classification process.

Table 3 Forward Method: obtains the maximum classification rate with a minimum number of parameters by growing the parameter set.

Table 4 Backward Method: obtains the maximum classification rate with a minimum number of parameters by reducing the parameter set.

The CP can also be defined as a function of the CR, the sensitivity (SE) and the specificity (SP), using the following expression:

CP = CR + SE / (SE + SP)    (1)

In this way, the classification rate is maximized, and the ratio SE/(SE+SP) is maximized for the cases with equivalent CR values. The second component always lies between 0 and 1, so it is significant only between cases with equivalent CR values. This target function takes into account the importance of sensitivity in this application, given that the ECG is a non-invasive method: other, more specific tests can be applied to the subjects that this algorithm indicates as PAF patients. Tables 5 and 6 show the results of applying the Forward and Backward methods to maximize the CP defined in expression (1) using a minimum number of parameters in the classification process.

Table 5 Forward Method: obtains the maximum CP with a minimum number of parameters by growing the parameter set.

4. Discussion

The uniparametric approach reaches top CR values of 76%. A multiparametric approach has been configured using the 5 parameters with the highest discrimination power, reaching top CR values of 80% and testing different typification functions. Finally, two automatic parametric scanning strategies are adopted: the Forward and Backward methods. Tables 3 to 6 show the results obtained with these approaches, with a top CR of 92%. In Tables 5 and 6 it is observed that the SE values reached with expression (1) are higher than the ones obtained using the CR alone as CP. With both CP functions the best results are obtained using a single closest neighbour configuration, with a top score of CR = 92% and SE = 96% when the CP is defined by expression (1). As expected, the definition of CP in expression (1) leads to better SE levels, but it also interferes with the obtained CR scores, because it leads the search through different areas of the parameter space.
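The behaviour of the sensitivity-aware target function can be checked with a short sketch. The expression itself did not survive in this transcription, so the form CP = CR + SE/(SE+SP) used here is an assumption reconstructed from the textual description (CR first, the ratio SE/(SE+SP) as a tie-breaker between 0 and 1); the function name and the input values are illustrative only.

```python
def classification_performance(cr, se, sp):
    """Assumed target function: CR plus a sensitivity ratio in [0, 1].

    cr, se and sp are percentages. The ratio term only separates
    configurations whose CR values are equivalent.
    """
    ratio = se / (se + sp) if (se + sp) > 0 else 0.0
    return cr + ratio


# Illustrative values: two configurations with the same CR, one with a higher CR.
same_cr_high_se = classification_performance(92.0, 96.0, 88.0)
same_cr_low_se = classification_performance(92.0, 88.0, 96.0)
higher_cr = classification_performance(94.0, 90.0, 98.0)
```

With 50 records the CR moves in steps of 2%, so under this assumed form a difference in CR always outweighs the ratio term, and the ratio favours the more sensitive configuration only when the CR values coincide.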

Table 6 Backward Method: obtains the maximum CP with a minimum number of parameters by reducing the parameter set.

5. Conclusion

Using a parameter set that characterizes PAF pathology with acceptable efficiency [3], a modular classification algorithm based on the K nearest neighbours has been designed. The classification performance of the algorithm has been evaluated using different parameter configurations, typification functions and numbers of neighbours (K). The automatic parametric scanning techniques reach much better results than the previously tested configurations. It is important to remark that the parametric scanning processes with the Forward and Backward methods are not exhaustive, and therefore new scanning techniques may improve the reported results.

References

Goldberger AL, Amaral LAN, Glass L, Hausdorff JM, Ivanov PCh, Mark RG, Mietus JE, Moody GB, Peng CK, Stanley HE. PhysioBank, PhysioToolkit, and PhysioNet: Components of a New Research Resource for Complex Physiologic Signals. Circulation 2000; 101 (23): e215-e220 [Circulation Electronic Pages; circ.ahajournals.org/cgi/content/full/101/23/e215].

3. Mota S, Ros E, Fernández FJ, Díaz AF, Prieto A. ECG Parameter Characterization of Paroxysmal Atrial Fibrillation. BSI 2002.

4. Narendra PM, Fukunaga K. A branch and bound algorithm for feature subset selection. IEEE Trans on Comp 1977; 26:

Correspondence to:
Eduardo Ros
Departamento de Arquitectura y Tecnología de Computadores
E.T.S.I. Informática
Universidad de Granada, Spain
C/Periodista Daniel Saucedo, s/n
Granada, Spain
eduardo@atc.ugr.es


More information




