Package MMIX. R topics documented: February 15, Type Package. Title Model selection uncertainty and model mixing. Version 1.2.

Size: px
Start display at page:

Download "Package MMIX. R topics documented: February 15, Type Package. Title Model selection uncertainty and model mixing. Version 1.2."

Transcription

1 Package MMIX February 15, 2013 Type Package Title Model selection uncertainty and model mixing Version 1.2 Date Author Marie Morfin and David Makowski Maintainer Description Implement different types of model mixing and model selection methods for linear or logistic models. License GPL (>= 2) LazyLoad yes Repository CRAN Date/Publication :10:51 NeedsCompilation no R topics documented: MMIX-package arms bmabic bootfreq fullmodel pmsecv print.mmixclass stepsel varselec Index 20 1

2 2 MMIX-package MMIX-package Model selection uncertainty and model mixing Description Details Implement different types of model mixing and model selection methods for linear or logistic models. Package: MMIX Type: Package Version: Date: License: GPL (>= 2) LazyLoad: yes This package implements three model mixing methods, two model selection procedures and three functions for model evaluation. Author(s) Marie Morfin and David Makowski Maintainer: Who to complain to References Barbottin A., Makowski D., Le Bail M., Jeuffroy M.-H., Bouchard C., Barrier C. (2008) Comparison of models and indicators for categorizing soft wheat fields according to their grain protein contents, European Journal of Agronomy 29, Buckland, S.T., Burnham, K.P. and Augustin, N.H. (1997) Model selection: an integral part of inference, Biometrics 53, Burnham, K.P. and Anderson, D.R. (2002) Model selection and multimodel inference: a practical information-theoretic approach, Springer-Verlag, New York. Casagrande, M., David, C., Valantin-Morison, M., Makowski, D. and Jeuffroy, M.-H. (2009) Factors limiting protein content of organic winter wheat in south- eastern France: a mixed-model approach, Agronomy for Sustainable Development, In press. Chatfield, C. (1995) Model uncertainty, data mining and statistical inference, Journal of the Royal Statistical Society /A 158, Efron, B. (1979) Bootstrap methods : another look at the jackknife, American Statistician 7, Efron, B. and Tibshirani, R.J. (1993) An introduction to the bootstrap, Chapman & Hall. Hammersley, J.M. and Handscomb, D.C. (1964) Monte Carlo Methods, Chapman & Hall, London & New York.

3 MMIX-package 3 Hoeting, JA., Madigan, D., Raftery, A.E. and Volinsky, C.T. (1999), Bayesian model averaging: a tutorial, Statistical Science, 14, Miller A. (2002) Subset selection in regression, 2nd edition Chapman & Hall/CRC, New York. Mooney, C.Z. and Duval, R.D. (1993) Bootstrapping : a nonparametric approach to statistical inference, Sage Publications, London. Prost, L., Makowski, D. and Jeuffroy, M.-H. (2006) Comparison of stepwise selection and Bayesian model averaging for yield gap analysis, Ecological Modelling 219, Raftery, A.E., Madigan D. and Hoeting J.A. (1997) Bayesian model averaging for linear regression models, Journal of the American Statistical Association 92, Sing, T., Sander, O., Beerenwinkel, N. and Lengauer, T. (2005) ROCR: visualizing classifier performance, Bioinformatics applications note 21, Viallefont, V., Raftery A.E. and Richardson, S. (2001) Variable selection and Bayesian model averaging in case-control studies, Statistics in medicine 20, Yuan, Z. and Ghosh, D. (2008) Combining Multiple Biomarker Models in Logistic Regression, Biometrics 64, Yuan, Z. and Yang, Y. (2005) Combining Linear Regression Models: When and How?, Journal of the American Statistical Association 100, Examples ###Data #Explanatory variables X1<-c(-0.2,-2.4,-0.7,1.2,0.0,-1.1,-2.1,-0.3,2.0,-1.7,1.4,-1.3,-3.4,0.4,-1.3, -4.8) X2<- c(-3, 2, 1, -2, -2, -4, 0, 1, 1, -1, -1, -4, 0, 2, 0, -4) X3<-c(2,1,0,-2,1,-2, 0, -1, -4, 1, -3, -3, -3, -1, 0, 2) ##Linear model Y1<- c(8.7, 6, 9.1, 10.4, 7.6,10.4, 7.9, 11.9, 18, 10.5, 16.5, 8.8, 7.7, 13.5, 8.2, 0.8) data1<-data.frame(y1,x1,x2,x3) ##Logistic model Y2<-c(1, 1, 0, 1, 0, 1, 0, 1, 0, 1, 1, 1, 0, 0, 0, 1) data2<-data.frame(y2,x1,x2,x3) ##bootfreq #Stepwise selection method bootstep1<-bootfreq(data=data1,family=gaussian("identity"),nboot=50,method=2, criterion="bic",trace=0) bootstep1 summary(bootstep1) plot(bootstep1) bootstep2<-bootfreq(data=data2,family=binomial("logit"),method=2, criterion="bic",nboot=20,trace=0) bootstep2 summary(bootstep2) plot(bootstep2)

4 4 arms ##ARMS method armsresult1<-arms(data=data1,family=gaussian("identity"),nbest=5,nsample=10, criterion="both",weight="aic") armsresult1 summary(armsresult1) plot(armsresult1) armsresult2<-arms(data=data2,family=binomial("logit"),nbest=5,nsample=10, criterion="both",weight="aic") armsresult2 summary(armsresult2) plot(armsresult2) ##Root Mean Square Error by cross-validation #Stepwise selection with BIC pmsestepbic<-pmsecv(data=data1,method=2,np=1,random=false,direction="both", criterion="bic",trace=0) pmsestepbic #BMA pmsebma<-pmsecv(data=data1,method=3,np=1,random=false) pmsebma ##Area Under ROC Curve by cross-validation aucstepbic<-auccv(data=data2,method=2,np=1,random=false,direction="both", criterion="bic",trace=0) aucstepbic aucbma<-auccv(data=data2,method=3,np=1,random=false) aucbma arms Adaptative Regression by Mixing with model Screening Description Apply ARMS for linear and logistic models. Usage arms(data, family, nsample = 20, nbest = 20, criterion= "both", weight = "aic", maxvar=10) Arguments data a data frame including the response variable (first column) and the explanatory variables. All the variables must be numeric and the response variable value must be 0 or 1 for the logistic model.

5 arms 5 family nsample nbest criterion weight maxvar a description of the error distribution (gaussian("identity") or binomial("logit")). number of sample permutations number of models selected at the screening step selection criterion used at the screening step. criterion="aic", "bic" or "both". "both" means that nbest models are selected according to AIC, and nbest according to "bic" (between nbest and 2*nbest models may be selected). model weight type, weight = "likeli" or "aic" (the default). See details. maximum number of explanatory variables to include in the model. See varselec. Details Value A set of models including all possible combinations of at most maxvar variables is defined. The models are weighted as follows: Step 1. The sample is splitted in two parts. The first one is used to fit the models. Step 2. Each model is fitted by least squares or maximum likelihood. Step 3. A set of models is selected according to AIC and BIC. Step 4. Weights are computed for the selected models using the second sample. Two weight types can be used, "likeli" or "aic". At each iteration, the sample is permuted. The final weight of each model is the mean across all the permutations. The coefficients are calculated on the whole sample using the averaged model weights. arms returns an object of class "MMIXclass". A data frame with the main results is printed with the function "print" and a graphic with the weights of the explanatory variables is obtained with the function "plot". An arms object is a list including the following components: coef pne0 fitted.values label modweights allcoef a named vector of coefficients. a vector containing the probability of each explanatory variable to be different from zero. the fitted values, obtained by transforming the linear predictors by the inverse of the link function. a list of the explanatory variables used by each model. the weights of all models. matrix with one row per model and one column per coefficient giving the ML estimate of each coefficient for each model. Warning This function does not accept or treat missing values. Author(s) Marie Morfin and David Makowski

6 6 bmabic References Yuan, Zheng and Ghosh, Debashis (2008) Combining Multiple Biomarkers Models in Logistic Regression, Biometrics 64, Yuan, Zheng and Yang, Yuhong (2005) Combining Linear Regression Models : When and How?, Journal of the American Statistical Association 100, See Also fullmodel, bmabic, mixaic, stepsel, bootfreq Examples ##Data #Explanatory variables X1<-c(-0.2,-2.4,-0.7,1.2,0.0,-1.1,-2.1,-0.3,2.0,-1.7,1.4,-1.3,-3.4,0.4,-1.3, -4.8) X2<- c(-3, 2, 1, -2, -2, -4, 0, 1, 1, -1, -1, -4, 0, 2, 0, -4) X3<-c(2,1,0,-2,1,-2, 0, -1, -4, 1, -3, -3, -3, -1, 0, 2) #Linear model Y1<- c(8.7, 6, 9.1, 10.4, 7.6,10.4, 7.9, 11.9, 18, 10.5, 16.5, 8.8, 7.7, 13.5, 8.2, 0.8) data1<-data.frame(y1,x1,x2,x3) #Logistic model Y2<-c(1, 1, 0, 1, 0, 1, 0, 1, 0, 1, 1, 1, 0, 0, 0, 1) data2<-data.frame(y2,x1,x2,x3) #arms method armsresult1<-arms(data=data1,family=gaussian( identity ),nbest=5,nsample=10, criterion="both",weight="aic") armsresult1 summary(armsresult1) plot(armsresult1) armsresult2<-arms(data=data2,family=binomial("logit"),nbest=5,nsample=10, criterion="both",weight="aic") armsresult2 summary(armsresult2) plot(armsresult2) bmabic Model mixing methods for linear or logistic regression Description bmabic is a Bayesian Model Averaging method based on the bic criterion, and mixaic is a model mixing method based on the aic criterion.

7 bmabic 7 Usage bmabic(data, family, maxvar=10) mixaic(data, family, maxvar=10) Arguments data family maxvar a data frame including the response variable (first column) and the explanatory variables. All the variables must be numeric and the response variable value must be 0 or 1 for the logistic model. a description of the error distribution (gaussian("identity") or binomial("logit")). maximum number of explanatory variables to include in the model. See varselec. Details Value A set of models including all possible combinations of at most maxvar variables is defined. The models are weighted using aic (mixaic) or bic (bmabic). Computation time may be very long if maxvar is set to a high value. When the number of explanatory variables in data is higher than maxvar, this number is reduced to at most maxvar variables by stepwise selection ("forward", "aic"). bmabic and mixaic return an object of class "MMIXclass". A data frame with the main results is printed when calling the function "print", the detailed results are obtained with the function "summary" and a graphic of the weights of the explanatory variables is plotted with the function "plot". A bmabic or mixaic object is a list including the following components : coef pne0 fitted.values BestModels sd label modweights allcoef a named vector of coefficients. a vector containing the probability of each explanatory variable to be different from zero. the fitted values, obtained by transforming the linear predictors by the inverse of the link function. the best three models selected by AIC (mixaic) or BIC (bmabic). standard deviation of the estimated parameter values. a list of the explanatory variables used by each model. the weights of all models. matrix with one row per model and one column per coefficient giving the ML estimate of each coefficient for each model. Warning These functions do not accept or treat missing values. Author(s) Marie Morfin and David Makowski

8 8 bmabic References Barbottin A., Makowski D., Le Bail M., Jeuffroy M.-H., Bouchard C., Barrier C. (2008) Comparison of models and indicators for categorizing soft wheat fields according to their grain protein contents, European Journal of Agronomy 29, Burnham, K.P. and Anderson, D.R. (2002) Model selection and multimodel inference: a practical information-theoretic approach, Springer-Verlag, New York. Casagrande, M., David, C., Valantin-Morison, M., Makowski, D. and Jeuffroy, M.-H. (2009) Factors limiting protein content of organic winter wheat in south- eastern France: a mixed-model approach, Agronomy for Sustainable Development, In press. Hoeting, J.A., Madigan, D., Raftery, A.E. and Volinsky, C.T. (1999) Bayesian model averaging: a tutorial, Statistical Science 14, Prost, L., Makowski, D. and Jeuffroy, M.-H. (2006) Comparison of stepwise selection and Bayesian model averaging for yield gap analysis, Ecological Modelling 219, Raftery, A.E., Madigan D. and Hoeting, J.A. (1997) Bayesian model averaging for linear regression models, Journal of the American Statistical Association 92, Raftery, A.E., Painter, I.S. and Volinsky, C.T. (2005) BMA : An R package for Bayesian Model Averaging, R News 5, 2-8. Viallefont, V., Raftery, A.E. and Richardson, S. (2001) Variable selection and Bayesian model averaging in case-control studies, Statistics in medicine 20, See Also family, varselec Examples ##Data #Explanatory variables X1<-c(-0.2,-2.4,-0.7,1.2,0.0,-1.1,-2.1,-0.3,2.0,-1.7,1.4,-1.3,-3.4,0.4,-1.3, -4.8) X2<- c(-3, 2, 1, -2, -2, -4, 0, 1, 1, -1, -1, -4, 0, 2, 0, -4) X3<-c(2,1,0,-2,1,-2, 0, -1, -4, 1, -3, -3, -3, -1, 0, 2) #Linear model Y1<- c(8.7, 6, 9.1, 10.4, 7.6,10.4, 7.9, 11.9, 18, 10.5, 16.5, 8.8, 7.7, 13.5, 8.2, 0.8) data1<-data.frame(y1,x1,x2,x3) #Logistic model Y2<-c(1, 1, 0, 1, 0, 1, 0, 1, 0, 1, 1, 1, 0, 0, 0, 1) data2<-data.frame(y2,x1,x2,x3) ##bmabic method bmaresult1<-bmabic(data=data1,family=gaussian("identity")) bmaresult1 summary(bmaresult1) plot(bmaresult1)

9 bootfreq 9 bmaresult2<-bmabic(data=data2,family=binomial("logit")) bmaresult2 summary(bmaresult2) plot(bmaresult2) ##mixaic method mixresult1<-mixaic(data=data1,family=gaussian("identity")) mixresult1 summary(mixresult1) plot(mixresult1) mixresult2<-mixaic(data=data2,family=binomial("logit")) mixresult2 summary(mixresult2) plot(mixresult2) bootfreq Stability of selection and mixing methods Description This function analyses the stability of the stepwise selection and mixing methods using a bootstrap procedure. Usage bootfreq(data, family, nboot = 100, method = 1, file = NULL,...) Arguments data family nboot method file a data frame including the response variable (first column) and the explanatory variables. All the variables must be numeric and the response variable value must be 0 or 1 for the logistic model. a description of the error distribution (gaussian("identity") or binomial("logit")). number of bootstrap samples drawn. the statistical method used to estimate the model parameters. method = 1 for fullmodel, method = 2 for stepsel, method = 3 for bmabic, method = 4 for mixaic, method = 5 for arms. the path of the file where the results are stored during the run. If file = NULL no file is created.... the specific arguments of the called method.

10 10 bootfreq Details nboot samples are generated with replacement from the original dataset. For each sample, the parameters are estimated using the chosen method. The frequency of selection of a variable is the part of the samples for which the estimated value of its coefficient is different from zero. Selection frequency is an indicator of the stability of the method. Frequencies close to 0 or 1 indicates that the corresponding variables was almost never or always selected in the bootstrap samples. Value bootfreq returns an object of class "classmmix". A data frame with the main results is printed with the function "print", and a graphic with the weights of the explanatory variables is plotted with the function "plot". A bootfreq object is a list including the following components: frequency coef mean sd pne0 frequency of selection of each variable across the bootstrap samples estimated parameter values across the bootstrap samples; a matrix with nsample rows mean of the estimated parameter values across the bootstrap samples standard deviation of the estimated parameter values across the bootstrap samples mean values of the variable weights computed using a model mixing method. Pne0 = frequency if method = 1 or 2. Warning This function does not accept or treat missing values. Author(s) Marie Morfin and David Makowski <makowski@grignon.inra.fr> References Buckland, S.T., Burnham, K.P. and Augustin, N.H. (1997) Model selection: an integral part of inference, Biometrics 53, Chatfield, C. (1995) Model uncertainty, data mining and statistical inference, Journal of the Royal Statistical Society /A 158, Efron, B. (1979) Bootstrap methods : another look at the jackknife, American Statistician 7, Efron, B. and Tibshirani, R.J. (1993) An introduction to the bootstrap, Chapman & Hall. Hammersley, J.M. and Handscomb, D.C. (1964) Monte Carlo Methods, Chapman & Hall. Miller A. (2002) Subset selection in regression, 2nd edition Chapman & Hall/CRC, New York. Mooney, C.Z. and Duval, R.D. (1993) Bootstrapping: a nonparametric approach to statistical inference, Sage Publications, London. Prost, L., Makowski, D. and Jeuffroy, M.-H. (2006) Comparison of stepwise selection and Bayesian model averaging for yield gap analysis, Ecological Modelling 219,

11 fullmodel 11 See Also fullmodel, stepsel, bmabic, mixaic, arms Examples ##Data #Explanatory variables X1<-c(-0.2,-2.4,-0.7,1.2,0.0,-1.1,-2.1,-0.3,2.0,-1.7,1.4,-1.3,-3.4,0.4,-1.3, -4.8) X2<- c(-3, 2, 1, -2, -2, -4, 0, 1, 1, -1, -1, -4, 0, 2, 0, -4) X3<-c(2,1,0,-2,1,-2, 0, -1, -4, 1, -3, -3, -3, -1, 0, 2) #Linear model Y1<- c(8.7, 6, 9.1, 10.4, 7.6,10.4, 7.9, 11.9, 18, 10.5, 16.5, 8.8, 7.7, 13.5, 8.2, 0.8) data1<-data.frame(y1,x1,x2,x3) #Logistic model Y2<-c(1, 1, 0, 1, 0, 1, 0, 1, 0, 1, 1, 1, 0, 0, 0, 1) data2<-data.frame(y2,x1,x2,x3) ##Analysis of a stepwise selection bootstep1<-bootfreq(data=data1,family=gaussian("identity"),nboot=50,method=2, criterion="bic",trace=0) bootstep1 summary(bootstep1) plot(bootstep1) bootstep2<-bootfreq(data=data2,family=binomial("logit"),method=2, criterion="bic",nboot=20,trace=0) bootstep2 summary(bootstep2) plot(bootstep2) fullmodel Linear and logistic regressions Description Linear or logistic model without any variable selection. Usage fullmodel(data, family)

12 12 fullmodel Arguments data family a data frame including the response variable (first column) and the explanatory variables. All the variables must be numeric and the response variable value must be 0 or 1 for the logistic model. a description of the error distribution (gaussian("identity") or binomial("logit")). Value fullmodel returns an object of class "MMIXclass". A data frame with the main results is printed when calling the function "print", and the detailed results are obtained with the function "summary". A fullmodel object is a list including the following components: cv coef aic bic fitted.values logical. Was the IWLS algorithm judged to have converged? a named vector of coefficients estimated by least squares or maximum likelihood. Akaike Information Criterion, minus twice the maximized log-likelihood plus twice the number of coefficients. Bayesian information criterion, minus twice the maximized log-likelihood plus the logarithm of the number of observation multiplied by the number of coefficients the fitted values, obtained by transforming the linear predictors by the inverse of the link function. Author(s) Marie Morfin and David Makowski References Venables, W.N. and Ripley, B.D. (2002) Modern Applied Statistics with S., Springer, New York. See Also family, arms, bmabic, mixaic, stepsel, bootfreq Examples ###Data #Explanatory variables X1<-c(-0.2,-2.4,-0.7,1.2,0.0,-1.1,-2.1,-0.3,2.0,-1.7,1.4,-1.3,-3.4,0.4,-1.3, -4.8) X2<- c(-3, 2, 1, -2, -2, -4, 0, 1, 1, -1, -1, -4, 0, 2, 0, -4) X3<-c(2,1,0,-2,1,-2, 0, -1, -4, 1, -3, -3, -3, -1, 0, 2) ##Linear model Y1<- c(8.7, 6, 9.1, 10.4, 7.6,10.4, 7.9, 11.9, 18, 10.5, 16.5, 8.8, 7.7, 13.5, 8.2, 0.8) data1<-data.frame(y1,x1,x2,x3)

13 pmsecv 13 ##Logistic model Y2<-c(1, 1, 0, 1, 0, 1, 0, 1, 0, 1, 1, 1, 0, 0, 0, 1) data2<-data.frame(y2,x1,x2,x3) ##fullmodel method fullresult1<-fullmodel(data=data1,family=gaussian("identity")) fullresult1 summary(fullresult1) fullresult2<-fullmodel(data=data2,family=binomial("logit")) fullresult2 summary(fullresult2) pmsecv Model performance indicators PMSE and AUC Description pmsecv and auccv calculate the Predictive Mean Square Error (PMSE) by "leave-np-out" crossvalidation and Area Under Roc Curve (AUC) by "leave-np-pair-out" cross-validation. They can be applied to models created using fullmodel, stepsel, bmabic, mixaic and arms. Usage pmsecv(data, method = 1, np, random = TRUE, npermu = 100, file = NULL,...) auccv(data, method = 1, np, random = TRUE, npermu = 100, file = NULL,...) Arguments data method np random npermu file a data frame including the response variable (first column) and the explanatory variables. All the variables must be numeric and the response variable value must be 0 or 1 for the logistic model. the statistical method used to estimate the model parameters. method = 1 for fullmodel, method = 2 for stepsel, method = 3 for bmabic, method = 4 for mixaic, method = 5 for arms. number of observations (pmsecv) or pairs of observations (auccv) left out for computing the PMSE or AUC. observations are selected at random if TRUE. random can be FALSE only if np = 1. In this case all the possible sets are selected. number of random samples of np observations if random = TRUE. the path of the file where the results are stored during the run. If file = NULL no file is created.... the specific arguments of the called method

14 14 pmsecv Details Value These two cross validation procedures are implemented to assess the accuracy of model predictions. Linear models should be evaluated using pmsecv and logistic models should be evaluated using auccv. In the auccv procedure, data are held out by pair (one data from each class, 0 and 1) at each iteration in order to calculate the corresponding AUC. The PMSE (AUC) estimated by cross validation is the mean of the PMSE (AUC) calculated for possible sets of np observations (pairs). If file is not NULL, the np predictions (first column) and the np corresponding observations (second column) are saved at each iteration. pmsecv (auccv) returns a one-row data frame including the PMSE (AUC) calculated from the whole sample in the first column and by cross validation in the other one. Warning These functions do not accept or treat the missing values. Author(s) Marie Morfin and David Makowski References Sing, T., Sander, O., Beerenwinkel, N. and Lengauer, T. (2005) ROCR: visualizing classifier performance, Bioinformatics applications note 21, Yuan, Z. and Ghosh, D. (2008) Combining Multiple Biomarker Models in Logistic Regression, Biometrics 64, Yuan, Z. and Yang, Y. (2005) Combining Linear Regression Models: When and How?, Journal of the American Statistical Association 100, See Also fullmodel, stepsel, bmabic, mixaic, arms Examples ##Data #Explanatory variables X1<-c(-0.2,-2.4,-0.7,1.2,0.0,-1.1,-2.1,-0.3,2.0,-1.7,1.4,-1.3,-3.4,0.4,-1.3, -4.8) X2<- c(-3, 2, 1, -2, -2, -4, 0, 1, 1, -1, -1, -4, 0, 2, 0, -4) X3<-c(2,1,0,-2,1,-2, 0, -1, -4, 1, -3, -3, -3, -1, 0, 2) #Linear model Y1<- c(8.7, 6, 9.1, 10.4, 7.6,10.4, 7.9, 11.9, 18, 10.5, 16.5, 8.8, 7.7, 13.5, 8.2, 0.8) data1<-data.frame(y1,x1,x2,x3) #Logistic model Y2<-c(1, 1, 0, 1, 0, 1, 0, 1, 0, 1, 1, 1, 0, 0, 0, 1)

15 print.mmixclass 15 data2<-data.frame(y2,x1,x2,x3) ##Linear models evaluated by pmsecv #for a stepwise selection pmsestepbic<-pmsecv(data=data1,method=2,np=1,random=false,direction="both", criterion="bic",trace=0) pmsestepbic #for the BMA method pmsebma<-pmsecv(data=data1,method=3,np=1,random=false) pmsebma ##Logistic models evaluated by auccv #for a stepwise selection aucstepbic<-auccv(data=data2,method=2,np=1,random=false,direction="both", criterion="bic",trace=0) aucstepbic #for the BMA method aucbma<-auccv(data=data2,method=3,np=1,random=false) aucbma print.mmixclass Summary, print and plot of MMIXclass objects Description summary, print and plot methods for MMIXclass objects. Usage ## S3 method for class MMIXclass print(x,...) ## S3 method for class MMIXclass summary(object,...) ## S3 method for class MMIXclass plot(x,...) Arguments x object of type MMIXclass. object object of type MMIXclass.... other parameters to be passed to plot.mmixclass, print.mmixclass or summary.mmixclass

16 16 stepsel Details print displays a data.frame with the main results of the function. summary displays a detailed list of the results. plot displays explanatory variable weights, i.e. the probability of each variable to be different from zero. Author(s) Marie Morfin and David Makowski <makowski@grignon.inra.fr> Examples ##Data #Explanatory variables X1<-c(-0.2,-2.4,-0.7,1.2,0.0,-1.1,-2.1,-0.3,2.0,-1.7,1.4,-1.3,-3.4,0.4,-1.3, -4.8) X2<- c(-3, 2, 1, -2, -2, -4, 0, 1, 1, -1, -1, -4, 0, 2, 0, -4) X3<-c(2,1,0,-2,1,-2, 0, -1, -4, 1, -3, -3, -3, -1, 0, 2) #Linear model Y1<- c(8.7, 6, 9.1, 10.4, 7.6,10.4, 7.9, 11.9, 18, 10.5, 16.5, 8.8, 7.7, 13.5, 8.2, 0.8) data1<-data.frame(y1,x1,x2,x3) ##bmabic method bmaresult1<-bmabic(data=data1,family=gaussian("identity")) bmaresult1 summary(bmaresult1) plot(bmaresult1) stepsel Stepwise selection Description Select and fit a model by stepwise regression, for linear and logistic models. Usage stepsel(data, family, direction = "both", criterion, trace = 0) Arguments data family direction a data frame including the response variable (first column) and the explanatory variables. All the variables must be numeric and the response variable value must be 0 or 1 for the logistic model. a description of the error distribution (gaussian( identity ) or binomial( logit )). the type of stepwise search, can be one of both (the default), backward or forward

17 stepsel 17 criterion trace selection criterion at each step of the procedure. criterion = aic for the Akaike Information Criterion, and criterion = bic for the Bayesian Information Criterion. print information during the run if trace = 1. Larger values may give more information. If trace = 0 no information is printed. Details This function uses the function step. Value stepsel returns an object of class "MMIXclass". A data frame with the main results is printed whith the function "print", and the detailed results are obtained with the function "summary". A stepsel object is a list including the following components: coef aic bic fitted.values a named vector of coefficients estimated by least squares or maximum likelihood. Akaike Information Criterion, minus twice the maximized log-likelihood plus twice the number of coefficients. Bayesian information criterion, minus twice the maximized log-likelihood plus the logartithm of the number of observation multiplied by the number of coefficients the fitted values, obtained by transforming the linear predictors by the inverse of the link function. Author(s) Marie Morfin and David Makowski References Akaike H. (1974) A new look at the statistical model identification, IEEE Transactions on Automatic Control 19, Miller A. (2002) Subset selection in regression, 2nd edition Chapman & Hall/CRC, New York. Schwarz, G. (1978) Estimating the dimension of a model, Annals of Statistics 6, Whittingham M.J., Stephens P., Bradbury R.B.. Freckleton R.P. (2006) Why do we still use stepwise modelling in ecology and behaviour?, J. Anim. Ecol. 75, See Also family, step

18 18 varselec Examples ##Data #Explanatory variables X1<-c(-0.2,-2.4,-0.7,1.2,0.0,-1.1,-2.1,-0.3,2.0,-1.7,1.4,-1.3,-3.4,0.4,-1.3, -4.8) X2<- c(-3, 2, 1, -2, -2, -4, 0, 1, 1, -1, -1, -4, 0, 2, 0, -4) X3<-c(2,1,0,-2,1,-2, 0, -1, -4, 1, -3, -3, -3, -1, 0, 2) #Linear model Y1<- c(8.7, 6, 9.1, 10.4, 7.6,10.4, 7.9, 11.9, 18, 10.5, 16.5, 8.8, 7.7, 13.5, 8.2, 0.8) data1<-data.frame(y1,x1,x2,x3) #Logistic model Y2<-c(1, 1, 0, 1, 0, 1, 0, 1, 0, 1, 1, 1, 0, 0, 0, 1) data2<-data.frame(y2,x1,x2,x3) ##stepsel method stepresult1<-stepsel(data=data1,family=gaussian("identity"),criterion="bic", direction="both") stepresult1 summary(stepresult1) stepresult2<-stepsel(data=data2,family=binomial("logit"),criterion="bic", direction="both") stepresult2 summary(stepresult2) varselec Variable selection Description Create a data frame from the original data set including a subset of variables. Usage varselec(data,family,maxvar=10,trace=0) Arguments data family maxvar trace a data frame including the response variable (first column) and the explanatory variables. All the variables must be numeric and the response variable value must be 0 or 1 for the logistic model. a description of the error distribution (gaussian("identity") or binomial("logit")). maximum number of explanatory variables to include. print information during the run if trace = 1. Larger values may give more information. If trace = 0 no information is printed.

19 varselec 19 Details Value This function implements a stepwise regression for linear and logistic models, in the direction "forward" and with the criterion "aic". The procedure stops if the model includes more than maxvar factors. The returned data frame includes no more than maxvar explanatory variables. varselec returns a data frame including the response variable and the selected explanatory variables. Author(s) See Also Marie Morfin and David Makowski bmabic, mixaic, arms Examples ##Data #Explanatory variables X1<-c(-0.2,-2.4,-0.7,1.2,0.0,-1.1,-2.1,-0.3,2.0,-1.7,1.4,-1.3,-3.4,0.4,-1.3, -4.8) X2<- c(-3, 2, 1, -2, -2, -4, 0, 1, 1, -1, -1, -4, 0, 2, 0, -4) X3<-c(2,1,0,-2,1,-2, 0, -1, -4, 1, -3, -3, -3, -1, 0, 2) #Linear model Y1<- c(8.7, 6, 9.1, 10.4, 7.6,10.4, 7.9, 11.9, 18, 10.5, 16.5, 8.8, 7.7, 13.5, 8.2, 0.8) data1<-data.frame(y1,x1,x2,x3) #Logistic model Y2<-c(1, 1, 0, 1, 0, 1, 0, 1, 0, 1, 1, 1, 0, 0, 0, 1) data2<-data.frame(y2,x1,x2,x3) ##varselec data1bis<-varselec(data=data1,family=gaussian("identity"),maxvar=2) data2bis<-varselec(data=data2,family=binomial("logit"),maxvar=2)

20 Index Topic iteration arms, 4 bootfreq, 9 pmsecv, 13 Topic models fullmodel, 11 Topic package MMIX-package, 2 Topic print print.mmixclass, 15 Topic regression arms, 4 bmabic, 6 bootfreq, 9 fullmodel, 11 pmsecv, 13 stepsel, 16 varselec, 18 Topic robust arms, 4 bmabic, 6 pmsecv, 13 step, 17 stepsel, 6, 9, 11 14, 16 summary.mmixclass (print.mmixclass), 15 varselec, 5, 7, 8, 18 arms, 4, 9, 11 14, 19 auccv (pmsecv), 13 bmabic, 6, 6, 9, 11 14, 19 bootfreq, 6, 9, 12 family, 8, 12, 17 fullmodel, 6, 9, 11, 11, 13, 14 mixaic, 6, 9, 11 14, 19 mixaic (bmabic), 6 MMIX (MMIX-package), 2 MMIX-package, 2 MMIXclass (print.mmixclass), 15 plot.mmixclass (print.mmixclass), 15 pmsecv, 13 print.mmixclass, 15 20

Package FWDselect. December 19, 2015

Package FWDselect. December 19, 2015 Title Selecting Variables in Regression Models Version 2.1.0 Date 2015-12-18 Author Marta Sestelo [aut, cre], Nora M. Villanueva [aut], Javier Roca-Pardinas [aut] Maintainer Marta Sestelo

More information

Package caic4. May 22, 2018

Package caic4. May 22, 2018 Type Package Package caic4 May 22, 2018 Title Conditional Akaike Information Criterion for 'lme4' Version 0.4 Date 2018-05-22 Author Benjamin Saefken and David Ruegamer, with contributions from Sonja Greven

More information

Model Assessment and Selection. Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer

Model Assessment and Selection. Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer Model Assessment and Selection Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer 1 Model Training data Testing data Model Testing error rate Training error

More information

Package ICsurv. February 19, 2015

Package ICsurv. February 19, 2015 Package ICsurv February 19, 2015 Type Package Title A package for semiparametric regression analysis of interval-censored data Version 1.0 Date 2014-6-9 Author Christopher S. McMahan and Lianming Wang

More information

Package rpst. June 6, 2017

Package rpst. June 6, 2017 Type Package Title Recursive Partitioning Survival Trees Version 1.0.0 Date 2017-6-6 Author Maintainer Package rpst June 6, 2017 An implementation of Recursive Partitioning Survival Trees

More information

Lecture 13: Model selection and regularization

Lecture 13: Model selection and regularization Lecture 13: Model selection and regularization Reading: Sections 6.1-6.2.1 STATS 202: Data mining and analysis October 23, 2017 1 / 17 What do we know so far In linear regression, adding predictors always

More information

Package glmmml. R topics documented: March 25, Encoding UTF-8 Version Date Title Generalized Linear Models with Clustering

Package glmmml. R topics documented: March 25, Encoding UTF-8 Version Date Title Generalized Linear Models with Clustering Encoding UTF-8 Version 1.0.3 Date 2018-03-25 Title Generalized Linear Models with Clustering Package glmmml March 25, 2018 Binomial and Poisson regression for clustered data, fixed and random effects with

More information

Package fbroc. August 29, 2016

Package fbroc. August 29, 2016 Type Package Package fbroc August 29, 2016 Title Fast Algorithms to Bootstrap Receiver Operating Characteristics Curves Version 0.4.0 Date 2016-06-21 Implements a very fast C++ algorithm to quickly bootstrap

More information

Package missforest. February 15, 2013

Package missforest. February 15, 2013 Type Package Package missforest February 15, 2013 Title Nonparametric Missing Value Imputation using Random Forest Version 1.3 Date 2012-06-26 Author Maintainer Depends randomforest The function missforest

More information

Package TVsMiss. April 5, 2018

Package TVsMiss. April 5, 2018 Type Package Title Variable Selection for Missing Data Version 0.1.1 Date 2018-04-05 Author Jiwei Zhao, Yang Yang, and Ning Yang Maintainer Yang Yang Package TVsMiss April 5, 2018

More information

Package cwm. R topics documented: February 19, 2015

Package cwm. R topics documented: February 19, 2015 Package cwm February 19, 2015 Type Package Title Cluster Weighted Models by EM algorithm Version 0.0.3 Date 2013-03-26 Author Giorgio Spedicato, Simona C. Minotti Depends R (>= 2.14), MASS Imports methods,

More information

The brlr Package. March 22, brlr... 1 lizards Index 5. Bias-reduced Logistic Regression

The brlr Package. March 22, brlr... 1 lizards Index 5. Bias-reduced Logistic Regression The brlr Package March 22, 2006 Version 0.8-8 Date 2006-03-22 Title Bias-reduced logistic regression Author David Firth URL http://www.warwick.ac.uk/go/dfirth Maintainer David Firth

More information

Package parcor. February 20, 2015

Package parcor. February 20, 2015 Type Package Package parcor February 20, 2015 Title Regularized estimation of partial correlation matrices Version 0.2-6 Date 2014-09-04 Depends MASS, glmnet, ppls, Epi, GeneNet Author, Juliane Schaefer

More information

Package freeknotsplines

Package freeknotsplines Version 1.0.1 Date 2018-05-17 Package freeknotsplines June 10, 2018 Title Algorithms for Implementing Free-Knot Splines Author , Philip Smith , Pierre Lecuyer

More information

Package cumseg. February 19, 2015

Package cumseg. February 19, 2015 Package cumseg February 19, 2015 Type Package Title Change point detection in genomic sequences Version 1.1 Date 2011-10-14 Author Vito M.R. Muggeo Maintainer Vito M.R. Muggeo Estimation

More information

Discussion Notes 3 Stepwise Regression and Model Selection

Discussion Notes 3 Stepwise Regression and Model Selection Discussion Notes 3 Stepwise Regression and Model Selection Stepwise Regression There are many different commands for doing stepwise regression. Here we introduce the command step. There are many arguments

More information

Package abe. October 30, 2017

Package abe. October 30, 2017 Package abe October 30, 2017 Type Package Title Augmented Backward Elimination Version 3.0.1 Date 2017-10-25 Author Rok Blagus [aut, cre], Sladana Babic [ctb] Maintainer Rok Blagus

More information

Package Combine. R topics documented: September 4, Type Package Title Game-Theoretic Probability Combination Version 1.

Package Combine. R topics documented: September 4, Type Package Title Game-Theoretic Probability Combination Version 1. Type Package Title Game-Theoretic Probability Combination Version 1.0 Date 2015-08-30 Package Combine September 4, 2015 Author Alaa Ali, Marta Padilla and David R. Bickel Maintainer M. Padilla

More information

Package clustvarsel. April 9, 2018

Package clustvarsel. April 9, 2018 Version 2.3.2 Date 2018-04-09 Package clustvarsel April 9, 2018 Title Variable Selection for Gaussian Model-Based Clustering Description Variable selection for Gaussian model-based clustering as implemented

More information

Package GWRM. R topics documented: July 31, Type Package

Package GWRM. R topics documented: July 31, Type Package Type Package Package GWRM July 31, 2017 Title Generalized Waring Regression Model for Count Data Version 2.1.0.3 Date 2017-07-18 Maintainer Antonio Jose Saez-Castillo Depends R (>= 3.0.0)

More information

Discriminant analysis in R QMMA

Discriminant analysis in R QMMA Discriminant analysis in R QMMA Emanuele Taufer file:///c:/users/emanuele.taufer/google%20drive/2%20corsi/5%20qmma%20-%20mim/0%20labs/l4-lda-eng.html#(1) 1/26 Default data Get the data set Default library(islr)

More information

An Introduction to the Bootstrap

An Introduction to the Bootstrap An Introduction to the Bootstrap Bradley Efron Department of Statistics Stanford University and Robert J. Tibshirani Department of Preventative Medicine and Biostatistics and Department of Statistics,

More information

Package flexcwm. May 20, 2018

Package flexcwm. May 20, 2018 Type Package Title Flexible Cluster-Weighted Modeling Version 1.8 Date 2018-05-20 Author Mazza A., Punzo A., Ingrassia S. Maintainer Angelo Mazza Package flexcwm May 20, 2018 Description

More information

Package assortnet. January 18, 2016

Package assortnet. January 18, 2016 Type Package Package assortnet January 18, 2016 Title Calculate the Assortativity Coefficient of Weighted and Binary Networks Version 0.12 Date 2016-01-18 Author Damien Farine Maintainer

More information

Package mrm. December 27, 2016

Package mrm. December 27, 2016 Type Package Package mrm December 27, 2016 Title An R Package for Conditional Maximum Likelihood Estimation in Mixed Rasch Models Version 1.1.6 Date 2016-12-23 Author David Preinerstorfer Maintainer David

More information

Package SOIL. September 20, 2017

Package SOIL. September 20, 2017 Package SOIL September 20, 2017 Type Package Title Sparsity Oriented Importance Learning Version 1.1 Date 2017-09-20 Author Chenglong Ye , Yi Yang , Yuhong Yang

More information

Package MSwM. R topics documented: February 19, Type Package Title Fitting Markov Switching Models Version 1.

Package MSwM. R topics documented: February 19, Type Package Title Fitting Markov Switching Models Version 1. Type Package Title Fitting Markov Switching Models Version 1.2 Date 2014-02-05 Package MSwM February 19, 2015 Author Josep A. Sanchez-Espigares, Alberto Lopez-Moreno Maintainer Josep A. Sanchez-Espigares

More information

Package svalues. July 15, 2018

Package svalues. July 15, 2018 Type Package Package svalues July 15, 2018 Title Measures of the Sturdiness of Regression Coefficients Version 0.1.6 Author Carlos Cinelli Maintainer Carlos Cinelli Implements

More information

Package mcemglm. November 29, 2015

Package mcemglm. November 29, 2015 Type Package Package mcemglm November 29, 2015 Title Maximum Likelihood Estimation for Generalized Linear Mixed Models Version 1.1 Date 2015-11-28 Author Felipe Acosta Archila Maintainer Maximum likelihood

More information

Package biglars. February 19, 2015

Package biglars. February 19, 2015 Package biglars February 19, 2015 Type Package Title Scalable Least-Angle Regression and Lasso Version 1.0.2 Date Tue Dec 27 15:06:08 PST 2011 Author Mark Seligman, Chris Fraley, Tim Hesterberg Maintainer

More information

Package oaxaca. January 3, Index 11

Package oaxaca. January 3, Index 11 Type Package Title Blinder-Oaxaca Decomposition Version 0.1.4 Date 2018-01-01 Package oaxaca January 3, 2018 Author Marek Hlavac Maintainer Marek Hlavac

More information

Package pampe. R topics documented: November 7, 2015

Package pampe. R topics documented: November 7, 2015 Package pampe November 7, 2015 Type Package Title Implementation of the Panel Data Approach Method for Program Evaluation Version 1.1.2 Date 2015-11-06 Author Ainhoa Vega-Bayo Maintainer Ainhoa Vega-Bayo

More information

Package edrgraphicaltools

Package edrgraphicaltools Type Package Package edrgraphicaltools September 1, 2017 Title Provides Tools for Dimension Reduction Methods Version 2.2 Date 2017-08-29 Author Raphaël Coudret, Benoît Liquet and Jérôme Saracco Maintainer

More information

Package sbf. R topics documented: February 20, Type Package Title Smooth Backfitting Version Date Author A. Arcagni, L.

Package sbf. R topics documented: February 20, Type Package Title Smooth Backfitting Version Date Author A. Arcagni, L. Type Package Title Smooth Backfitting Version 1.1.1 Date 2014-12-19 Author A. Arcagni, L. Bagnato Package sbf February 20, 2015 Maintainer Alberto Arcagni Smooth Backfitting

More information

Package SSLASSO. August 28, 2018

Package SSLASSO. August 28, 2018 Package SSLASSO August 28, 2018 Version 1.2-1 Date 2018-08-28 Title The Spike-and-Slab LASSO Author Veronika Rockova [aut,cre], Gemma Moran [aut] Maintainer Gemma Moran Description

More information

Package RAMP. May 25, 2017

Package RAMP. May 25, 2017 Type Package Package RAMP May 25, 2017 Title Regularized Generalized Linear Models with Interaction Effects Version 2.0.1 Date 2017-05-24 Author Yang Feng, Ning Hao and Hao Helen Zhang Maintainer Yang

More information

Package slp. August 29, 2016

Package slp. August 29, 2016 Version 1.0-5 Package slp August 29, 2016 Author Wesley Burr, with contributions from Karim Rahim Copyright file COPYRIGHTS Maintainer Wesley Burr Title Discrete Prolate Spheroidal

More information

Package DTRreg. August 30, 2017

Package DTRreg. August 30, 2017 Package DTRreg August 30, 2017 Type Package Title DTR Estimation and Inference via G-Estimation, Dynamic WOLS, and Q-Learning Version 1.3 Date 2017-08-30 Author Michael Wallace, Erica E M Moodie, David

More information

Package DengueRT. May 13, 2016

Package DengueRT. May 13, 2016 Type Package Package DengueRT May 13, 2016 Title Parameter Estimates and Real-Time Prediction of a Single Dengue Outbreak Version 1.0.1 Date 2016-04-19 Author Carlos Sebrango, Lizet Sanchez, Ziv Shkedy,

More information

Package rocc. February 20, 2015

Package rocc. February 20, 2015 Package rocc February 20, 2015 Title ROC based classification Version 1.2 Depends ROCR Author Martin Lauss Description Functions for a classification method based on receiver operating characteristics

More information

Package simr. April 30, 2018

Package simr. April 30, 2018 Type Package Package simr April 30, 2018 Title Power Analysis for Generalised Linear Mixed Models by Simulation Calculate power for generalised linear mixed models, using simulation. Designed to work with

More information

Package intccr. September 12, 2017

Package intccr. September 12, 2017 Type Package Package intccr September 12, 2017 Title Semiparametric Competing Risks Regression under Interval Censoring Version 0.2.0 Author Giorgos Bakoyannis , Jun Park

More information

Assessing the Quality of the Natural Cubic Spline Approximation

Assessing the Quality of the Natural Cubic Spline Approximation Assessing the Quality of the Natural Cubic Spline Approximation AHMET SEZER ANADOLU UNIVERSITY Department of Statisticss Yunus Emre Kampusu Eskisehir TURKEY ahsst12@yahoo.com Abstract: In large samples,

More information

Package mcmcse. February 15, 2013

Package mcmcse. February 15, 2013 Package mcmcse February 15, 2013 Version 1.0-1 Date 2012 Title Monte Carlo Standard Errors for MCMC Author James M. Flegal and John Hughes Maintainer James M. Flegal

More information

Package qvcalc. R topics documented: September 19, 2017

Package qvcalc. R topics documented: September 19, 2017 Package qvcalc September 19, 2017 Version 0.9-1 Date 2017-09-18 Title Quasi Variances for Factor Effects in Statistical Models Author David Firth Maintainer David Firth URL https://github.com/davidfirth/qvcalc

More information

Machine Learning (BSMC-GA 4439) Wenke Liu

Machine Learning (BSMC-GA 4439) Wenke Liu Machine Learning (BSMC-GA 4439) Wenke Liu 01-31-017 Outline Background Defining proximity Clustering methods Determining number of clusters Comparing two solutions Cluster analysis as unsupervised Learning

More information

Package robustgam. February 20, 2015

Package robustgam. February 20, 2015 Type Package Package robustgam February 20, 2015 Title Robust Estimation for Generalized Additive Models Version 0.1.7 Date 2013-5-7 Author Raymond K. W. Wong, Fang Yao and Thomas C. M. Lee Maintainer

More information

Package PTE. October 10, 2017

Package PTE. October 10, 2017 Type Package Title Personalized Treatment Evaluator Version 1.6 Date 2017-10-9 Package PTE October 10, 2017 Author Adam Kapelner, Alina Levine & Justin Bleich Maintainer Adam Kapelner

More information

Comparison of Optimization Methods for L1-regularized Logistic Regression

Comparison of Optimization Methods for L1-regularized Logistic Regression Comparison of Optimization Methods for L1-regularized Logistic Regression Aleksandar Jovanovich Department of Computer Science and Information Systems Youngstown State University Youngstown, OH 44555 aleksjovanovich@gmail.com

More information

Machine Learning (BSMC-GA 4439) Wenke Liu

Machine Learning (BSMC-GA 4439) Wenke Liu Machine Learning (BSMC-GA 4439) Wenke Liu 01-25-2018 Outline Background Defining proximity Clustering methods Determining number of clusters Other approaches Cluster analysis as unsupervised Learning Unsupervised

More information

Package ridge. R topics documented: February 15, Title Ridge Regression with automatic selection of the penalty parameter. Version 2.

Package ridge. R topics documented: February 15, Title Ridge Regression with automatic selection of the penalty parameter. Version 2. Package ridge February 15, 2013 Title Ridge Regression with automatic selection of the penalty parameter Version 2.1-2 Date 2012-25-09 Author Erika Cule Linear and logistic ridge regression for small data

More information

Linear Model Selection and Regularization. especially usefull in high dimensions p>>100.

Linear Model Selection and Regularization. especially usefull in high dimensions p>>100. Linear Model Selection and Regularization especially usefull in high dimensions p>>100. 1 Why Linear Model Regularization? Linear models are simple, BUT consider p>>n, we have more features than data records

More information

Package glinternet. June 15, 2018

Package glinternet. June 15, 2018 Type Package Package glinternet June 15, 2018 Title Learning Interactions via Hierarchical Group-Lasso Regularization Version 1.0.8 Date 2018-06-20 Author Michael Lim, Trevor Hastie Maintainer Michael

More information

Package nricens. R topics documented: May 30, Type Package

Package nricens. R topics documented: May 30, Type Package Type Package Package nricens May 30, 2018 Title NRI for Risk Prediction Models with Time to Event and Binary Response Data Version 1.6 Date 2018-5-30 Author Eisuke Inoue Depends survival Maintainer Eisuke

More information

Topics in Machine Learning-EE 5359 Model Assessment and Selection

Topics in Machine Learning-EE 5359 Model Assessment and Selection Topics in Machine Learning-EE 5359 Model Assessment and Selection Ioannis D. Schizas Electrical Engineering Department University of Texas at Arlington 1 Training and Generalization Training stage: Utilizing

More information

Package acebayes. R topics documented: November 21, Type Package

Package acebayes. R topics documented: November 21, Type Package Type Package Package acebayes November 21, 2018 Title Optimal Bayesian Experimental Design using the ACE Algorithm Version 1.5.2 Date 2018-11-21 Author Antony M. Overstall, David C. Woods & Maria Adamou

More information

Package svmpath. R topics documented: August 30, Title The SVM Path Algorithm Date Version Author Trevor Hastie

Package svmpath. R topics documented: August 30, Title The SVM Path Algorithm Date Version Author Trevor Hastie Title The SVM Path Algorithm Date 2016-08-29 Version 0.955 Author Package svmpath August 30, 2016 Computes the entire regularization path for the two-class svm classifier with essentially the same cost

More information

Package subsemble. February 20, 2015

Package subsemble. February 20, 2015 Type Package Package subsemble February 20, 2015 Title An Ensemble Method for Combining Subset-Specific Algorithm Fits Version 0.0.9 Date 2014-07-01 Author Erin LeDell, Stephanie Sapp, Mark van der Laan

More information

Bootstrap Confidence Interval of the Difference Between Two Process Capability Indices

Bootstrap Confidence Interval of the Difference Between Two Process Capability Indices Int J Adv Manuf Technol (2003) 21:249 256 Ownership and Copyright 2003 Springer-Verlag London Limited Bootstrap Confidence Interval of the Difference Between Two Process Capability Indices J.-P. Chen 1

More information

Package gbts. February 27, 2017

Package gbts. February 27, 2017 Type Package Package gbts February 27, 2017 Title Hyperparameter Search for Gradient Boosted Trees Version 1.2.0 Date 2017-02-26 Author Waley W. J. Liang Maintainer Waley W. J. Liang

More information

Package gplm. August 29, 2016

Package gplm. August 29, 2016 Type Package Title Generalized Partial Linear Models (GPLM) Version 0.7-4 Date 2016-08-28 Author Package gplm August 29, 2016 Maintainer Provides functions for estimating a generalized

More information

Package EBglmnet. January 30, 2016

Package EBglmnet. January 30, 2016 Type Package Package EBglmnet January 30, 2016 Title Empirical Bayesian Lasso and Elastic Net Methods for Generalized Linear Models Version 4.1 Date 2016-01-15 Author Anhui Huang, Dianting Liu Maintainer

More information

Package misclassglm. September 3, 2016

Package misclassglm. September 3, 2016 Type Package Package misclassglm September 3, 2016 Title Computation of Generalized Linear Models with Misclassified Covariates Using Side Information Version 0.2.0 Date 2016-09-02 Author Stephan Dlugosz

More information

Package iterativebmasurv

Package iterativebmasurv Type Package Package iterativebmasurv April 12, 2018 Title The Iterative Bayesian Model Averaging (BMA) Algorithm For Survival Analysis Version 1.36.0 Date 2008-5-19 Author Amalia Annest, University of

More information

Package GAMBoost. February 19, 2015

Package GAMBoost. February 19, 2015 Version 1.2-3 Package GAMBoost February 19, 2015 Title Generalized linear and additive models by likelihood based boosting Author Harald Binder Maintainer Harald Binder

More information

Package mtsdi. January 23, 2018

Package mtsdi. January 23, 2018 Version 0.3.5 Date 2018-01-02 Package mtsdi January 23, 2018 Author Washington Junger and Antonio Ponce de Leon Maintainer Washington Junger

More information

Package ParetoPosStable

Package ParetoPosStable Type Package Package ParetoPosStable September 2, 2015 Title Computing, Fitting and Validating the PPS Distribution Version 1.1 Date 2015-09-02 Maintainer Antonio Jose Saez-Castillo Depends

More information

Package PrivateLR. March 20, 2018

Package PrivateLR. March 20, 2018 Type Package Package PrivateLR March 20, 2018 Title Differentially Private Regularized Logistic Regression Version 1.2-22 Date 2018-03-19 Author Staal A. Vinterbo Maintainer Staal

More information

Package penalizedlda

Package penalizedlda Type Package Package penalizedlda July 11, 2015 Title Penalized Classification using Fisher's Linear Discriminant Version 1.1 Date 2015-07-09 Author Daniela Witten Maintainer Daniela Witten

More information

Package svapls. February 20, 2015

Package svapls. February 20, 2015 Package svapls February 20, 2015 Type Package Title Surrogate variable analysis using partial least squares in a gene expression study. Version 1.4 Date 2013-09-19 Author Sutirtha Chakraborty, Somnath

More information

Package truncreg. R topics documented: August 3, 2016

Package truncreg. R topics documented: August 3, 2016 Package truncreg August 3, 2016 Version 0.2-4 Date 2016-08-03 Title Truncated Gaussian Regression Models Depends R (>= 1.8.0), maxlik Suggests survival Description Estimation of models for truncated Gaussian

More information

CS249: ADVANCED DATA MINING

CS249: ADVANCED DATA MINING CS249: ADVANCED DATA MINING Classification Evaluation and Practical Issues Instructor: Yizhou Sun yzsun@cs.ucla.edu April 24, 2017 Homework 2 out Announcements Due May 3 rd (11:59pm) Course project proposal

More information

STA121: Applied Regression Analysis

STA121: Applied Regression Analysis STA121: Applied Regression Analysis Variable Selection - Chapters 8 in Dielman Artin Department of Statistical Science October 23, 2009 Outline Introduction 1 Introduction 2 3 4 Variable Selection Model

More information

Dynamic Thresholding for Image Analysis

Dynamic Thresholding for Image Analysis Dynamic Thresholding for Image Analysis Statistical Consulting Report for Edward Chan Clean Energy Research Center University of British Columbia by Libo Lu Department of Statistics University of British

More information

Package EFS. R topics documented:

Package EFS. R topics documented: Package EFS July 24, 2017 Title Tool for Ensemble Feature Selection Description Provides a function to check the importance of a feature based on a dependent classification variable. An ensemble of feature

More information

Package mixphm. July 23, 2015

Package mixphm. July 23, 2015 Type Package Title Mixtures of Proportional Hazard Models Version 0.7-2 Date 2015-07-23 Package mixphm July 23, 2015 Fits multiple variable mixtures of various parametric proportional hazard models using

More information

Package rankdist. April 8, 2018

Package rankdist. April 8, 2018 Type Package Title Distance Based Ranking Models Version 1.1.3 Date 2018-04-02 Author Zhaozhi Qian Package rankdist April 8, 2018 Maintainer Zhaozhi Qian Implements distance

More information

Package r2d2. February 20, 2015

Package r2d2. February 20, 2015 Package r2d2 February 20, 2015 Version 1.0-0 Date 2014-03-31 Title Bivariate (Two-Dimensional) Confidence Region and Frequency Distribution Author Arni Magnusson [aut], Julian Burgos [aut, cre], Gregory

More information

Package rfsa. R topics documented: January 10, Type Package

Package rfsa. R topics documented: January 10, Type Package Type Package Package rfsa January 10, 2018 Title Feasible Solution Algorithm for Finding Best Subsets and Interactions Version 0.9.1 Date 2017-12-21 Assists in statistical model building to find optimal

More information

Package XMRF. June 25, 2015

Package XMRF. June 25, 2015 Type Package Package XMRF June 25, 2015 Title Markov Random Fields for High-Throughput Genetics Data Version 1.0 Date 2015-06-12 Author Ying-Wooi Wan, Genevera I. Allen, Yulia Baker, Eunho Yang, Pradeep

More information

Package FMsmsnReg. March 30, 2016

Package FMsmsnReg. March 30, 2016 Package FMsmsnReg March 30, 2016 Type Package Title Regression Models with Finite Mixtures of Skew Heavy-Tailed Errors Version 1.0 Date 2016-03-29 Author Luis Benites Sanchez and Rocio Paola Maehara and

More information

Package treethresh. R topics documented: June 30, Version Date

Package treethresh. R topics documented: June 30, Version Date Version 0.1-11 Date 2017-06-29 Package treethresh June 30, 2017 Title Methods for Tree-Based Local Adaptive Thresholding Author Ludger Evers and Tim Heaton

More information

Package ssmn. R topics documented: August 9, Type Package. Title Skew Scale Mixtures of Normal Distributions. Version 1.1.

Package ssmn. R topics documented: August 9, Type Package. Title Skew Scale Mixtures of Normal Distributions. Version 1.1. Package ssmn August 9, 2016 Type Package Title Skew Scale Mixtures of Normal Distributions Version 1.1 Date 2016-08-08 Author Luis Benites Sanchez and Clecio da Silva Ferreira Maintainer Luis Benites Sanchez

More information

Generalized Additive Model

Generalized Additive Model Generalized Additive Model by Huimin Liu Department of Mathematics and Statistics University of Minnesota Duluth, Duluth, MN 55812 December 2008 Table of Contents Abstract... 2 Chapter 1 Introduction 1.1

More information

Package CompModSA. February 14, 2012

Package CompModSA. February 14, 2012 Package CompModSA February 14, 2012 Type Package Title Sensitivity Analysis for Complex Computer Models Version 1.3.1 Date 2011-11-05 Author Curtis Storlie , with contributions from

More information

Lecture 25: Review I

Lecture 25: Review I Lecture 25: Review I Reading: Up to chapter 5 in ISLR. STATS 202: Data mining and analysis Jonathan Taylor 1 / 18 Unsupervised learning In unsupervised learning, all the variables are on equal standing,

More information

Package extweibquant

Package extweibquant Type Package Package extweibquant February 19, 2015 Title Estimate Lower Extreme Quantile with the Censored Weibull MLE and Censored Weibull Mixture Version 1.1 Date 2014-12-03 Author Yang (Seagle) Liu

More information

Package glassomix. May 30, 2013

Package glassomix. May 30, 2013 Package glassomix May 30, 2013 Type Package Title High dimensional Mixture Graph Models selection Version 1.1 Date 2013-05-22 Author Anani Lotsi and Ernst Wit Maintainer Anani Lotsi Depends

More information

Package dglm. August 24, 2016

Package dglm. August 24, 2016 Version 1.8.3 Date 2015-10-27 Title Double Generalized Linear Models Package dglm August 24, 2016 Author Peter K Dunn and Gordon K Smyth Maintainer Robert Corty

More information

The glmmml Package. August 20, 2006

The glmmml Package. August 20, 2006 The glmmml Package August 20, 2006 Version 0.65-1 Date 2006/08/20 Title Generalized linear models with clustering A Maximum Likelihood and bootstrap approach to mixed models. License GPL version 2 or newer.

More information

Package SeleMix. R topics documented: November 22, 2016

Package SeleMix. R topics documented: November 22, 2016 Package SeleMix November 22, 2016 Type Package Title Selective Editing via Mixture Models Version 1.0.1 Date 2016-11-22 Author Ugo Guarnera, Teresa Buglielli Maintainer Teresa Buglielli

More information

Package marqlevalg. February 20, 2015

Package marqlevalg. February 20, 2015 Type Package Title An algorithm for least-squares curve fitting Version 1.1 Date 2013-03-01 Package marqlevalg February 20, 2015 Author D. Commenges , M. Prague

More information

Package sensory. R topics documented: February 23, Type Package

Package sensory. R topics documented: February 23, Type Package Package sensory February 23, 2016 Type Package Title Simultaneous Model-Based Clustering and Imputation via a Progressive Expectation-Maximization Algorithm Version 1.1 Date 2016-02-23 Author Brian C.

More information

The segmented Package

The segmented Package The segmented Package February 17, 2004 Version 0.1-4 Date 2004-02-17 Title Segmented relationships in regression models Author Vito M. R. Muggeo Maintainer Vito M. R. Muggeo

More information

Package optimus. March 24, 2017

Package optimus. March 24, 2017 Type Package Package optimus March 24, 2017 Title Model Based Diagnostics for Multivariate Cluster Analysis Version 0.1.0 Date 2017-03-24 Maintainer Mitchell Lyons Description

More information

Package influence.sem

Package influence.sem Type Package Package influence.sem April 14, 2018 Title Case Influence in Structural Equation Models Version 2.2 Date 2018-04-14 Author Massimiliano Pastore & Gianmarco Altoe' Depends lavaan Suggests tcltk

More information

Package FisherEM. February 19, 2015

Package FisherEM. February 19, 2015 Type Package Title The Fisher-EM algorithm Version 1.4 Date 2013-06-21 Author Charles Bouveyron and Camille Brunet Package FisherEM February 19, 2015 Maintainer Camille Brunet

More information

Package gee. June 29, 2015

Package gee. June 29, 2015 Title Generalized Estimation Equation Solver Version 4.13-19 Depends stats Suggests MASS Date 2015-06-29 DateNote Gee version 1998-01-27 Package gee June 29, 2015 Author Vincent J Carey. Ported to R by

More information

Package CausalGAM. October 19, 2017

Package CausalGAM. October 19, 2017 Package CausalGAM October 19, 2017 Version 0.1-4 Date 2017-10-16 Title Estimation of Causal Effects with Generalized Additive Models Author Adam Glynn , Kevin Quinn

More information

Package clusterpower

Package clusterpower Version 0.6.111 Date 2017-09-03 Package clusterpower September 5, 2017 Title Power Calculations for Cluster-Randomized and Cluster-Randomized Crossover Trials License GPL (>= 2) Imports lme4 (>= 1.0) Calculate

More information