Data Parallelism and the Support Vector Machine

Similar documents
Data Mining Practical Machine Learning Tools and Techniques. Slides for Chapter 6 of Data Mining by I. H. Witten and E. Frank

All lecture slides will be available at CSC2515_Winter15.html

CS 229 Midterm Review

Classification: Linear Discriminant Functions

Robot Learning. There are generally three types of robot learning: Learning from data. Learning by demonstration. Reinforcement learning

Use of Multi-category Proximal SVM for Data Set Reduction

Bagging for One-Class Learning

ECG782: Multidimensional Digital Signal Processing

SoftDoubleMinOver: A Simple Procedure for Maximum Margin Classification

CPSC 340: Machine Learning and Data Mining. More Linear Classifiers Fall 2017

Ensembles. An ensemble is a set of classifiers whose combined results give the final decision. test feature vector

Using Analytic QP and Sparseness to Speed Training of Support Vector Machines

Bagging and Boosting Algorithms for Support Vector Machine Classifiers

Preface to the Second Edition. Preface to the First Edition. 1 Introduction 1

Using Analytic QP and Sparseness to Speed Training of Support Vector Machines

Naïve Bayes for text classification

A Two-phase Distributed Training Algorithm for Linear SVM in WSN

ORT EP R RCH A ESE R P A IDI! " #$$% &' (# $!"

Contents. Preface to the Second Edition

.. Spring 2017 CSC 566 Advanced Data Mining Alexander Dekhtyar..

Training Data Selection for Support Vector Machines

Generating the Reduced Set by Systematic Sampling

The design of the data preprocessing using AHP in automatic meter reading system

Hsiaochun Hsu Date: 12/12/15. Support Vector Machine With Data Reduction

Supervised vs. Unsupervised Learning. Supervised vs. Unsupervised Learning. Supervised vs. Unsupervised Learning. Supervised vs. Unsupervised Learning

Non-Bayesian Classifiers Part II: Linear Discriminants and Support Vector Machines

SUPPORT VECTOR MACHINES

Tour-Based Mode Choice Modeling: Using An Ensemble of (Un-) Conditional Data-Mining Classifiers

Classification Lecture Notes cse352. Neural Networks. Professor Anita Wasilewska

Well Analysis: Program psvm_welllogs

Large Margin Classification Using the Perceptron Algorithm

Kernel Methods & Support Vector Machines

Unlabeled Data Classification by Support Vector Machines

More on Learning. Neural Nets Support Vectors Machines Unsupervised Learning (Clustering) K-Means Expectation-Maximization

Hierarchical Local Clustering for Constraint Reduction in Rank-Optimizing Linear Programs

Network Lasso: Clustering and Optimization in Large Graphs

Data Mining: Concepts and Techniques. Chapter 9 Classification: Support Vector Machines. Support Vector Machines (SVMs)

SUPPORT VECTOR MACHINES

Content-based image and video analysis. Machine learning

Support Vector Machines: Brief Overview" November 2011 CPSC 352

Support Vector Machines

Introduction to Machine Learning

Instance-based Learning

Optimal Separating Hyperplane and the Support Vector Machine. Volker Tresp Summer 2018

Introduction to object recognition. Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and others

9.1. K-means Clustering

Random projection for non-gaussian mixture models

Support Vector Machines

CLASSIFICATION FOR SCALING METHODS IN DATA MINING

EFFICIENT FPGA MAPPING OF GILBERT S ALGORITHM FOR SVM TRAINING ON LARGE-SCALE CLASSIFICATION PROBLEMS

Efficient Learning from Massive Spatial-Temporal Data through Selective Support Vector Propagation

Support vector machines. Dominik Wisniewski Wojciech Wawrzyniak

Generalized version of the support vector machine for binary classification problems: supporting hyperplane machine.

Linear Models. Lecture Outline: Numeric Prediction: Linear Regression. Linear Classification. The Perceptron. Support Vector Machines

Chakra Chennubhotla and David Koes

PARALLEL CLASSIFICATION ALGORITHMS

IMAGE ANALYSIS, CLASSIFICATION, and CHANGE DETECTION in REMOTE SENSING

Statistics 202: Statistical Aspects of Data Mining

Chap.12 Kernel methods [Book, Chap.7]

Neetha Das Prof. Andy Khong

Perceptron Learning Algorithm (PLA)

Efficient Tuning of SVM Hyperparameters Using Radius/Margin Bound and Iterative Algorithms

Support Vector Machines

Module 4. Non-linear machine learning econometrics: Support Vector Machine

Improved DAG SVM: A New Method for Multi-Class SVM Classification

Support Vector Machines.

CS570: Introduction to Data Mining

Community edition(open-source) Enterprise edition

Introduction to Support Vector Machines

Data-driven Kernels for Support Vector Machines

Lecture 7: Support Vector Machine

CS229 Lecture notes. Raphael John Lamarre Townshend

Fast Learning for Statistical Face Detection

Table of Contents. Recognition of Facial Gestures... 1 Attila Fazekas

Lab 2: Support vector machines

Classification by Support Vector Machines

Support Vector Machines

Data Analysis 3. Support Vector Machines. Jan Platoš October 30, 2017

Dynamic Ensemble Construction via Heuristic Optimization

SVM Classification in Multiclass Letter Recognition System

Generative and discriminative classification techniques

arxiv: v2 [cs.lg] 11 Sep 2015

Experimenting with Multi-Class Semi-Supervised Support Vector Machines and High-Dimensional Datasets

9. Support Vector Machines. The linearly separable case: hard-margin SVMs. The linearly separable case: hard-margin SVMs. Learning objectives

Constrained optimization

Incremental SVM and Visualization Tools for Biomedical

Support Vector Machines (a brief introduction) Adrian Bevan.

Distribution-free Predictive Approaches

Support Vector Machines for Face Recognition

Support vector machines

Introduction to Mobile Robotics

SVM in Oracle Database 10g: Removing the Barriers to Widespread Adoption of Support Vector Machines

Neural Networks. CE-725: Statistical Pattern Recognition Sharif University of Technology Spring Soleymani

NEAREST-INSTANCE-CENTROID-ESTIMATION LINEAR DISCRIMINANT ANALYSIS (NICE LDA) Rishabh Singh, Kan Li (Member, IEEE) and Jose C. Principe (Fellow, IEEE)

CS5670: Computer Vision

The Curse of Dimensionality

Fast Support Vector Machine Classification of Very Large Datasets

6.867 Machine Learning

Volumetric Classification: Program psvm3d

Support Vector Machines

Transcription:

Data Parallelism and the Support Vector Machine Solomon Gibbs he support vector machine is a common algorithm for pattern classification. However, many of the most popular implementations are not suitable for parallelization in a distributed-data sense. We discuss current serial techniques and several alternative methods already in literature as well as their potential suitability to the Matlab-MPI system. he support vector machine has gained popularity because it produces a classifier with maximum generality, can be adapted to produce non-linear classifiers, and has mitigating factors for the curse of dimensionality. However, most SVM algorithms are serial in nature. We believe that a parallel implementation could be used to handle very large datasets efficiently.

Linear Support Vector Machine Canonical problem: he maximal margin classifier. w 2 Maximize w subject to the constraint that all points are correctly classified. otice that the circled points are sufficient to define the hyperplane. hese are the support vectors.

Decomposition echniques he above problem leads to the quadratic program: minimize w( Λ) = Λ 1+ 1 2Λ DΛ subject to Λ y = 0, 0 Λ C1 where Λ is the vector of multipliers, D is defined as[dij] = yiy jk(xi, x j), and C is a positive constant. It is easy to see that the problem quickly becomes intractable for large datasets. (D grows with the square of the number of training points) Various authors [1, 2] have proposed decomposition techniques that solve a series of smaller programming problems and converge to a global solution. Λ y Λ = y = Λ y minimize w(λ) = -Λ D D = D (1- D Λ ) + 1 + Λ 1 2 D Λ Λ subject to Λ y + Λ y = 0, 0 Λ C1 And the last two terms of w( Λ) are held constant, and can be neglected. D D 1 2 Λ D Λ

Matlab-MPI A message-passing protocol in Matlab allows workstations to pass data and tasks among one another Sender Shared file system Variable save Data file load Variable create Lock file detect Receiver waits until it detects the existence of the lock file. Receiver deletes the data and lock file, after it loads the variable from the data file.

Incremental Proximal SVM An alternate formulation[4] of the maximal margin problem allows an incremental update algorithm: w γ y altering the constraints in the optimization problem, a proximal classifier that can be solved by a system of linear equations is formed. his algorithm can be further modified to add or remove data from the training set, while updating the classifier. veit has proposed a cascade of updating nodes, each computing an update increment in parallel.[5]

Ensemble Methods he above algorithms focused on a single 2-way classifier. Using ensemble methods, there are a variety of possible ways to combine SVM sub-units. M-Way classifier rain each unit on the entire dataset, but with different classes to separate. Mixture of Experts[6] rain each unit on a random subset of the data, then use weighted combination to classify. Consensus methods (voting, least squared error, maximum likelihood) rain units using different criteria, then combine results using above methods.

Conclusions and Future Work Serial support vector machine algorithms have been developed that are capable of handling very large datasets. Many of the most popular methods are unsuitable for use in a data-parallel environment. A number of algorithms for making use of support vector concepts in a data-parallelizing have also been proposed. A data-parallel classifier based on the SVM is desirable because of the accuracy and flexibility of the SVM and the speed up provided by the parallel environment, without the need for specialized parallel hardware. More work is needed to determine what methods will produce a computationally efficient and theoretically sound classifier for use in a high-latency message passing environment.

References 1. C. urges, A utorial on Support Vector Machines for Pattern Recognition, Data Mining and Knowledge Discovery 2:121-167, 1998 2. E. Osuna and R. Freund and F. Girosi, Improved raining Algorithm for Support Vector Machines, SP 1997 3. J. Platt, Sequential Minimal Optimization: A fast algorithm for training support vector machines, 1998 4. G. Fung and O. Mangasarian, Incremental Support Vector Machine Classification, SIAM 2001 5. A. veit and H. Engum, Parallelization of the Incremental Proximal Support Vector Machine Classifier Using a Heap-based ree opology, 2003 6. R. Collobert and S. engio and Y. engio, A Parallel Mixture of SVMs for Very large Scale Problems, Advances in eural Information Processing Systems, 2002