Package msda. February 20, 2015

Similar documents
Package PUlasso. April 7, 2018

Package glinternet. June 15, 2018

Package TANDEM. R topics documented: June 15, Type Package

Package BigTSP. February 19, 2015

Package RAMP. May 25, 2017

Package KERE. August 28, 2015

Package hiernet. March 18, 2018

Package SSLASSO. August 28, 2018

Package penalizedlda

Package enpls. May 14, 2018

Package SAM. March 12, 2019

Package listdtr. November 29, 2016

Package flam. April 6, 2018

Package parcor. February 20, 2015

Package nsprcomp. August 29, 2016

Package svmpath. R topics documented: August 30, Title The SVM Path Algorithm Date Version Author Trevor Hastie

Package kdetrees. February 20, 2015

Package slam. February 15, 2013

Package ordinalnet. December 5, 2017

Package stapler. November 27, 2017

Package narray. January 28, 2018

Package mfbvar. December 28, Type Package Title Mixed-Frequency Bayesian VAR Models Version Date

Package logisticpca. R topics documented: August 29, 2016

Package TVsMiss. April 5, 2018

Package DRaWR. February 5, 2016

Package slam. December 1, 2016

Package PrivateLR. March 20, 2018

Package picasso. December 18, 2015

Package vennlasso. January 25, 2019

Lasso. November 14, 2017

Package catenary. May 4, 2018

Package FWDselect. December 19, 2015

Package cwm. R topics documented: February 19, 2015

Package glmnetutils. August 1, 2017

Package lol. R topics documented: December 13, Type Package Title Lots Of Lasso Version Date Author Yinyin Yuan

Package glassomix. May 30, 2013

Package spark. July 21, 2017

Package obliquerf. February 20, Index 8. Extract variable importance measure

Package madsim. December 7, 2016

Package LINselect. June 9, 2018

Package bestnormalize

Package clime. April 9, 2010

Package esabcv. May 29, 2015

Package quadprogxt. February 4, 2018

Package rgcvpack. February 20, Index 6. Fitting Thin Plate Smoothing Spline. Fit thin plate splines of any order with user specified knots

Package kdtools. April 26, 2018

Package IntNMF. R topics documented: July 19, 2018

Package oasis. February 21, 2018

Package sparsehessianfd

Package mgc. April 13, 2018

Package Rglpk. May 18, 2017

Package EBglmnet. January 30, 2016

Package cgh. R topics documented: February 19, 2015

Package hsiccca. February 20, 2015

Package MTLR. March 9, 2019

Package BigVAR. R topics documented: April 3, Type Package

Package SAENET. June 4, 2015

Package sgd. August 29, 2016

Package gppm. July 5, 2018

The grplasso Package

Package orloca. April 21, Type Package Depends methods, png, ucminf Suggests grdevices, graphics, knitr VignetteBuilder knitr

Package glmpath. R topics documented: January 28, Version 0.98 Date

Package clustmixtype

Package stochprofml. February 20, 2015

Package batchmeans. R topics documented: July 4, Version Date

Package ridge. R topics documented: February 15, Title Ridge Regression with automatic selection of the penalty parameter. Version 2.

Package nscancor. February 15, 2018

Package mixsqp. November 14, 2018

Package naivebayes. R topics documented: January 3, Type Package. Title High Performance Implementation of the Naive Bayes Algorithm

Package rknn. June 9, 2015

Package palmtree. January 16, 2018

Package apastyle. March 29, 2017

Package flsa. February 19, 2015

Package nonlinearicp

Package CVR. March 22, 2017

Package CMPControl. February 19, 2015

Package gbts. February 27, 2017

Package rpst. June 6, 2017

Package distance.sample.size

Package zebu. R topics documented: October 24, 2017

Package biglars. February 19, 2015

Package scoop. September 16, 2011

Package ICSShiny. April 1, 2018

Package SeleMix. R topics documented: November 22, 2016

Package superbiclust

Package kirby21.base

Package bisect. April 16, 2018

Package milr. June 8, 2017

Package GWRM. R topics documented: July 31, Type Package

Package NNLM. June 18, 2018

Package optimsimplex

Package graphframes. June 21, 2018

Package pandar. April 30, 2018

Package cgraph. March 11, 2019

Package datasets.load

Package ssa. July 24, 2016

STENO Introductory R-Workshop: Loading a Data Set Tommi Suvitaival, Steno Diabetes Center June 11, 2015

Package ConvergenceClubs

Package mvprobit. November 2, 2015

Package subsemble. February 20, 2015

Transcription:

Title Multi-Class Sparse Discriminant Analysis Version 1.0.2 Date 2014-09-26 Package msda February 20, 2015 Author Maintainer Yi Yang <yiyang@umn.edu> Depends Matri, MASS Efficient procedures for computing a new Multi-Class Sparse Discriminant Analysis method that estimates all discriminant directions simultaneously. LazyData yes License GPL-2 URL https://github.com/emeryyi/msda NeedsCompilation yes Repository CRAN Date/Publication 2015-02-20 09:16:53 R topics documented: cv.msda........................................... 2 GDS1615.......................................... 3 msda............................................. 4 plot.msda.......................................... 6 predict.msda......................................... 7 Inde 9 1

2 cv.msda cv.msda Cross-validation for msda Does k-fold cross-validation for msda, returns a value for lambda. cv.msda(, y, nfolds = 5, lambda = NULL, lambda.opt = "min",...) Arguments y nfolds lambda lambda.opt matri of predictors, of dimension N p; each row is an observation vector. response variable. This argument should be a factor for classification. number of folds - default is 5. Although nfolds can be as large as the sample size (leave-one-out CV), it is not recommended for large datasets. Smallest value allowable is nfolds=3. optional user-supplied lambda sequence; default is NULL, and msda chooses its own sequence. If choose "min", the smallest lambda that gives minimum cross validation error cvm will be returned. If choose "ma", the largest lambda that gives minimum cross validation error cvm will be returned.... other arguments that can be passed to msda. Details The function runs msda nfolds+1 times; the first to get the lambda sequence, and then the remainder to compute the fit with each of the folds omitted. The average error and standard deviation over the folds are computed. Value an object of class cv.msda is returned, which is a list with the ingredients of the cross-validation fit. lambda cvm cvsd lambda.min lambda.1se msda.fit the values of lambda used in the fits. the mean cross-validated error - a vector of length length(lambda). estimate of standard error of cvm. the optimal value of lambda that gives minimum cross validation error cvm. the largest value of lambda such that error is within 1 standard error of the minimum. a fitted msda object for the full data.

GDS1615 3 Author(s) Maintainer: Yi Yang <yiyang@umn.edu> See Also msda Eamples <-GDS1615$ y<-gds1615$y obj.cv<-cv.msda(=,y=y,nfolds=5,lambda.opt="ma") lambda.min<-obj.cv$lambda.min id.min<-which(obj.cv$lambda==lambda.min) pred<-predict(obj.cv$msda.fit,)[,id.min] GDS1615 GDS1615 data introduced in Burczynski et al. (2012). The dataset is a subset of the dataset available on Gene Epression Omnibus with the accession number GDS1615. The original dataset contains 22283 gene epression levels and the disease states of the observed subjects. In Mai, Yang and Zou, the dimension of the original dataset was first reduced to 127 by F-test screening. Value This data frame contains the following: y gene epression levels. Disease state that is coded as 1,2,3. 1: normal; 2: ulcerative colitis; 3: Crohn s disease.

4 msda M. E. Burczynski, R. L Peterson, N. C. Twine, K. A. Zuberek, B. J. Brodeur, L. Casciotti, V. Maganti, P. S. Reddy, A. Strahs, F. Immermann, W. Spinelli, U. Schwertschlag, A. M. Slager, M. M. Cotreau, and A. J. Dorner. (2012), "Molecular classification of crohn s disease and ulcerative colitis patients using transcriptional profiles in peripheral blood mononuclear cells". Journal of Molecular Diagnostics, 8:51 61. Eamples msda Fits a regularization path for Multi-Class Sparse Discriminant Analysis Fits a regularization path for Multi-Class Sparse Discriminant Analysis at a sequence of regularization parameters lambda. msda(, y, nlambda = 100, lambda.factor = ifelse((nobs - nclass) <= nvars, 0.2, 0.001), lambda = NULL, dfma = nobs, pma = min(dfma * 2 + 20, nvars), pf = rep(1, nvars), eps = 1e-04, mait = 1e+06, sml = 1e-06, verbose = FALSE, perturb = NULL) Arguments y matri of predictors, of dimension N p; each row is an observation vector. response variable. This argument should be a factor for classification. nlambda the number of lambda values - default is 100. lambda.factor The factor for getting the minimal lambda in lambda sequence, where min(lambda) = lambda.factor * ma(lambda). ma(lambda) is the smallest value of lambda for which all coefficients are zero. The default depends on the relationship between N (the number of rows in the matri of predictors) and p (the number of predictors). If N > p, the default is 0.0001, close to zero. If N < p, the default is 0.2. A very small value of lambda.factor will lead to a saturated fit. It takes no effect if there is user-defined lambda sequence.

msda 5 lambda dfma pma pf eps mait sml verbose perturb a user supplied lambda sequence. Typically, by leaving this option unspecified users can have the program compute its own lambda sequence based on nlambda and lambda.factor. Supplying a value of lambda overrides this. It is better to supply a decreasing sequence of lambda values than a single (small) value, if not, the program will sort user-defined lambda sequence in decreasing order automatically. limit the maimum number of variables in the model. Useful for very large p, if a partial path is desired. Default is n. limit the maimum number of variables ever to be nonzero. For eample once β enters the model, no matter how many times it eits or re-enters model through the path, it will be counted only once. Default is min(dfma*1.2,p). L1 penalty factor of length p. Separate L1 penalty weights can be applied to each coefficient of θ to allow differential L1 shrinkage. Can be 0 for some variables, which implies no L1 shrinkage, and results in that variable always being included in the model. Default is 1 for all variables (and implicitly infinity for variables listed in eclude). convergence threshold for coordinate descent. Each inner coordinate descent loop continues until the relative change in any coefficient. Defaults value is 1e-8. maimum number of outer-loop iterations allowed at fied lambda value. Default is 1e6. If models do not converge, consider increasing mait. whether to print out computation progress. The default is FALSE. a scalar number. If it is specified, the number will be added to each diagonal element of the sigma matri as perturbation. The default is NULL. Details Value Note that for computing speed reason, if models are not converging or running slow, consider increasing eps and sml, or decreasing nlambda, or increasing lambda.factor before increasing mait. Users can also reduce dfma to limit the maimum number of variables in the model. An object with S3 class msda. theta df obj dim lambda a list of length(lambda) for fitted coefficients theta, each one corresponding to one lambda value, each stored as a sparse matri (dgcmatri class, the standard class for sparse numeric matrices in the Matri package.). To convert it into normal type matri use as.matri(). the number of nonzero coefficients for each value of lambda. the fitted value of the objective function for each value of lambda. dimension of each coefficient matri at each lambda. the actual sequence of lambda values used. matri of predictors.

6 plot.msda y npasses jerr sigma delta mu prior call response variable. total number of iterations (the most inner loop) summed over all lambda values error flag, for warnings and errors, 0 if no error. estimated sigma matri. estimated delta matri. delta[k] = mu[k]-mu[1]. estimated mu vector. prior probability that y belong to class k, estimated by mean(y that belong to k). the call that produced this object Author(s) Maintainer: Yi Yang <yiyang@umn.edu> See Also cv.msda, predict.msda Eamples <-GDS1615$ y<-gds1615$y obj <- msda( =, y = y) plot.msda Plot coefficients from a "msda" object Produces a coefficient profile plot of the coefficient paths for a fitted msda object. ## S3 method for class msda plot(, var = c("norm", "lambda"),...)

predict.msda 7 Arguments var fitted msda model the variable on the X-ais. The option "norm" plots the coefficients against the L1-norm of the coefficients, and the option "lambda" plots the coefficient against the log-lambda sequence.... other graphical parameters to plot Details A coefficient profile plot is produced. Author(s) Maintainer: Yi Yang <yiyang@umn.edu> Eamples <-GDS1615$ y<-gds1615$y obj <- msda( =, y = y) plot(obj) predict.msda make predictions from a "msda" object. This functions predicts class labels from a fitted msda object. ## S3 method for class msda predict(object, new,...)

8 predict.msda Arguments Value object new fitted msda model object. matri of new values for at which predictions are to be made. NOTE: new must be a matri, predict function does not accept a vector or other formats of new.... Not used. Other arguments to predict. predicted class label(s) at the entire sequence of the penalty parameter lambda used to create the model. Author(s) Maintainer: Yi Yang <yiyang@umn.edu> See Also msda Eamples <-GDS1615$ y<-gds1615$y obj <- msda( =, y = y) pred<-predict(obj,)

Inde Topic classification cv.msda, 2 msda, 4 plot.msda, 6 predict.msda, 7 Topic datasets GDS1615, 3 Topic models cv.msda, 2 msda, 4 plot.msda, 6 predict.msda, 7 cv.msda, 2, 2 GDS1615, 3 msda, 2, 3, 4, 5 8 plot.msda, 6 predict.msda, 7 (GDS1615), 3 y (GDS1615), 3 9