Package kernelfactory

Similar documents
Package nodeharvest. June 12, 2015

Package rpst. June 6, 2017

Package misclassglm. September 3, 2016

Package ebmc. August 29, 2017

Package missforest. February 15, 2013

Package gbts. February 27, 2017

Package EnsembleBase

Package obliquerf. February 20, Index 8. Extract variable importance measure

Package BayesLogit. May 6, 2012

Package subsemble. February 20, 2015

Package naivebayes. R topics documented: January 3, Type Package. Title High Performance Implementation of the Naive Bayes Algorithm

Package nonlinearicp

Nonparametric Classification Methods

Package randomforest.ddr

Package GWRM. R topics documented: July 31, Type Package

Package RaPKod. February 5, 2018

Package CALIBERrfimpute

Package AnDE. R topics documented: February 19, 2015

Using Machine Learning to Identify Security Issues in Open-Source Libraries. Asankhaya Sharma Yaqin Zhou SourceClear

Package ridge. R topics documented: February 15, Title Ridge Regression with automatic selection of the penalty parameter. Version 2.

Package woebinning. December 15, 2017

Package svmpath. R topics documented: August 30, Title The SVM Path Algorithm Date Version Author Trevor Hastie

Package rferns. November 15, 2017

Package EFS. R topics documented:

Package sbrl. R topics documented: July 29, 2016

Package IPMRF. August 9, 2017

Package strat. November 23, 2016

Package clustmixtype

Package RAMP. May 25, 2017

Package logicfs. R topics documented:

The Effects of Outliers on Support Vector Machines

Package caretensemble

Package C50. December 1, 2017

Package arulescba. April 24, 2018

Package RLT. August 20, 2018

Package maboost. R topics documented: February 20, 2015

Package sensory. R topics documented: February 23, Type Package

Package assortnet. January 18, 2016

Package preprocomb. June 26, 2016

Package fasttextr. May 12, 2017

Nonparametric Approaches to Regression

Package FastKNN. February 19, 2015

Package datasets.load

Package logisticpca. R topics documented: August 29, 2016

Package glinternet. June 15, 2018

Package cclust. August 29, 2013

Classification of Hand-Written Numeric Digits

Package desirability

Package LVGP. November 14, 2018

Package fastdummies. January 8, 2018

Package MatchIt. April 18, 2017

Package gensemble. R topics documented: February 19, 2015

Package blocksdesign

Package milr. June 8, 2017

2 cubist.default. Fit a Cubist model

Package bife. May 7, 2017

Package rknn. June 9, 2015

Package cattonum. R topics documented: May 2, Type Package Version Title Encode Categorical Features

Package kofnga. November 24, 2015

Package cwm. R topics documented: February 19, 2015

Package clusterrepro

Package optimus. March 24, 2017

Package Combine. R topics documented: September 4, Type Package Title Game-Theoretic Probability Combination Version 1.

Business Club. Decision Trees

Package flam. April 6, 2018

Package RSKC. R topics documented: August 28, Type Package Title Robust Sparse K-Means Version Date Author Yumi Kondo

Package mrm. December 27, 2016

Package StatMeasures

Package smbinning. December 1, 2017

Package FisherEM. February 19, 2015

Package randomglm. R topics documented: February 20, 2015

Package PrivateLR. March 20, 2018

Package JOUSBoost. July 12, 2017

Package ranger. November 10, 2015

Package glassomix. May 30, 2013

Package ordinalforest

Package kamila. August 29, 2016

Package rocc. February 20, 2015

Package RPEnsemble. October 7, 2017

Package glmnetutils. August 1, 2017

Package maxent. July 2, 2014

Package Cubist. December 2, 2017

Package rgcvpack. February 20, Index 6. Fitting Thin Plate Smoothing Spline. Fit thin plate splines of any order with user specified knots

Package bisect. April 16, 2018

Package madsim. December 7, 2016

Package neural. R topics documented: February 20, 2015

Package kerasformula

Project Presentation. Pattern Recognition. Under the guidance of Prof. Sumeet Agar wal

Package apastyle. March 29, 2017

Package lazydata. December 4, 2016

Classifier Inspired Scaling for Training Set Selection

Package subtype. February 20, 2015

Lecture 06 Decision Trees I

Package ICEbox. July 13, 2017

Package DPBBM. September 29, 2016

Package metaheur. June 30, 2016

Package SAENET. June 4, 2015

Package CatEncoders. March 8, 2017

Chapter 8 The C 4.5*stat algorithm

SOFTWARE DEFECT PREDICTION USING IMPROVED SUPPORT VECTOR MACHINE CLASSIFIER

Transcription:

Package kernelfactory September 29, 2015 Type Package Title Kernel Factory: An Ensemble of Kernel Machines Version 0.3.0 Date 2015-09-29 Imports randomforest, AUC, genalg, kernlab, stats Author Michel Ballings, Dirk Van den Poel Maintainer Michel Ballings <Michel.Ballings@GMail.com> Binary classification based on an ensemble of kernel machines (``Ballings, M. and Van den Poel, D. (2013), Kernel Factory: An Ensemble of Kernel Machines. Expert Systems With Applications, 40(8), 2904-2913''). Kernel factory is an ensemble method where each base classifier (random forest) is fit on the kernel matrix of a subset of the training data. License GPL (>= 2) NeedsCompilation no Repository CRAN Date/Publication 2015-09-29 17:33:15 R topics documented: Credit............................................ 2 kernelfactory........................................ 2 kfnews........................................... 4 predict.kernelfactory.................................... 5 Index 7 1

2 kernelfactory Credit Credit approval (Frank and Asuncion, 2010) Credit contains credit card applications. The dataset has a good mix of continuous and categorical features. Format A data frame with 653 observations, 15 predictors and a binary criterion variable called Response Details All observations with missing values are deleted. Source Frank, A. and Asuncion, A. (2010). UCI Machine Learning Repository [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science. The original dataset can be downloaded at http://archive.ics.uci.edu/ml/datasets/credit+approval str(credit) table(credit$response) kernelfactory Binary classification with Kernel Factory kernelfactory implements an ensemble method for kernel machines (Ballings and Van den Poel, 2013).

kernelfactory 3 kernelfactory(x = NULL, y = NULL, cp = 1, rp = round(log(nrow(x), 10)), method = "burn", ntree = 500, filter = 0.01, popsize = rp * cp * 7, iters = 80, mutationchance = 1/(rp * cp), elitism = max(1, round((rp * cp) * 0.05)), oversample = TRUE) Arguments x y cp rp Value method ntree filter popsize iters A data frame of predictors (numeric, integer or factor). Categorical variables need to be factors. Indicator values should not be too imbalanced because this might produce constants in the subsetting process. A factor containing the response vector. Only {0,1} is allowed. The number of column partitions. The number of row partitions. Can be one of the following: POLynomial kernel function (pol), LINear kernel function (lin), Radial Basis kernel Function rbf), random choice (random=pol, lin, rbf) (random), burn- in choice of best function (burn=pol, lin, rbf ) (burn). Use random or burn if you don t know in advance which kernel function is best. Number of trees in the Random Forest base classifiers. either NULL (deactivate) or a percentage denoting the minimum class size of dummy predictors. This parameter is used to remove near constants. For example if nrow(xtrain)=100, and filter=0.01 then all dummy predictors with any class size equal to 1 will be removed. Set this higher (e.g., 0.05 or 0.10) in case of errors. Population size of the genetic algorithm. Number of generations of the genetic algorithm. mutationchance Mutationchance of the genetic algorithm. elitism oversample Elitism parameter of the genetic algorithm. Oversample the smallest class. This helps avoid problems related to the subsetting procedure (e.g., if rp is too high). An object of class kernelfactory, which is a list with the following elements: trn trnlst rbfstre rbfmtrx rsltskf cpr rpr cntr Training data set. List of training partitions. List of used kernel functions. List of augmented kernel matrices. List of models. Number of column partitions. Number of row partitions. Number of partitions.

4 kfnews wghts nmdtrn rngs constants Weights of the ensemble members. Vector indicating the numeric (and integer) features. Ranges of numeric predictors. To exclude from newdata. Author(s) Authors: Michel Ballings and Dirk Van den Poel, Maintainer: <Michel.Ballings@GMail.com> Ballings, M. and Van den Poel, D. (2013), Kernel Factory: An Ensemble of Kernel Machines. Expert Systems With Applications, 40(8), 2904-2913. See Also predict.kernelfactory #Credit Approval data available at UCI Machine Learning Repository #take subset (for the purpose of a quick example) and train and test Credit <- Credit[1:100,] train.ind <- sample(nrow(credit),round(0.5*nrow(credit))) #Train Kernel Factory on training data kfmodel <- kernelfactory(x=credit[train.ind,names(credit)!= "Response"], y=credit[train.ind,"response"], method=random) #Deploy Kernel Factory to predict response for test data #predictedresponse <- predict(kfmodel, newdata=credit[-train.ind,names(credit)!= "Response"]) kfnews Display the NEWS file kfnews shows the NEWS file of the kernelfactory package. kfnews() Value None.

predict.kernelfactory 5 Author(s) Authors: Michel Ballings and Dirk Van den Poel, Maintainer: <Michel.Ballings@GMail.com> Ballings, M. and Van den Poel, D. (2013), Kernel Factory: An Ensemble of Kernel Machines. Expert Systems With Applications, 40(8), 2904-2913. See Also kernelfactory, predict.kernelfactory kfnews() predict.kernelfactory Predict method for kernelfactory objects Prediction of new data using kernelfactory. ## S3 method for class kernelfactory predict(object, newdata = NULL, predict.all = FALSE,...) Arguments Value object newdata predict.all An object of class kernelfactory, as created by the function kernelfactory A data frame with the same predictors as in the training data. TRUE or FALSE. If TRUE and rp and cp are 1 then the individual predictions of the random forest are returned. If TRUE and any of rp and cp or bigger than 1 then the predictions of all the members are returned.... Not used currently. A vector containing the response probabilities. Author(s) Authors: Michel Ballings and Dirk Van den Poel, Maintainer: <Michel.Ballings@GMail.com>

6 predict.kernelfactory Ballings, M. and Van den Poel, D. (2013), Kernel Factory: An Ensemble of Kernel Machines. Expert Systems With Applications, 40(8), 2904-2913. See Also kernelfactory #Credit Approval data available at UCI Machine Learning Repository #take subset (for the purpose of a quick example) and train and test Credit <- Credit[1:100,] train.ind <- sample(nrow(credit),round(0.5*nrow(credit))) #Train Kernel Factory on training data kfmodel <- kernelfactory(x=credit[train.ind,names(credit)!= "Response"], y=credit[train.ind,"response"], method=random) #Deploy Kernel Factory to predict response for test data predictedresponse <- predict(kfmodel, newdata=credit[-train.ind,names(credit)!= "Response"])

Index Topic classification kernelfactory, 2 predict.kernelfactory, 5 Topic datasets Credit, 2 Credit, 2 kernelfactory, 2, 5, 6 kfnews, 4 predict.kernelfactory, 4, 5, 5 7