Package TVsMiss. April 5, 2018

Similar documents
Package RAMP. May 25, 2017

Package milr. June 8, 2017

Package SSLASSO. August 28, 2018

Package ECctmc. May 1, 2018

Package robustreg. R topics documented: April 27, Version Date Title Robust Regression Functions

Package SOIL. September 20, 2017

Package spark. July 21, 2017

Package validara. October 19, 2017

Package BigVAR. R topics documented: April 3, Type Package

Package datasets.load

Package BayesCR. September 11, 2017

Package kirby21.base

Package flam. April 6, 2018

Package robets. March 6, Type Package

Package TANDEM. R topics documented: June 15, Type Package

Package fastdummies. January 8, 2018

Package MTLR. March 9, 2019

Package msgps. February 20, 2015

Package cattonum. R topics documented: May 2, Type Package Version Title Encode Categorical Features

Package intccr. September 12, 2017

Package geojsonsf. R topics documented: January 11, Type Package Title GeoJSON to Simple Feature Converter Version 1.3.

Package freeknotsplines

Package gppm. July 5, 2018

Package SAM. March 12, 2019

Package bayeslongitudinal

Package sparsenetgls

Package gtrendsr. August 4, 2018

Package PUlasso. April 7, 2018

Package strat. November 23, 2016

Package vinereg. August 10, 2018

Package vennlasso. January 25, 2019

Package rpst. June 6, 2017

Package ordinalnet. December 5, 2017

Package gtrendsr. October 19, 2017

Package bisect. April 16, 2018

Package svalues. July 15, 2018

Package bigreadr. R topics documented: August 13, Version Date Title Read Large Text Files

Package Mondrian. R topics documented: March 4, Type Package

Package repec. August 31, 2018

Package vip. June 15, 2018

Package penalizedlda

Package CompGLM. April 29, 2018

Package canvasxpress

Package kdevine. May 19, 2017

Package weco. May 4, 2018

Package OsteoBioR. November 15, 2018

Package inca. February 13, 2018

Package geogrid. August 19, 2018

Package hiernet. March 18, 2018

Package glassomix. May 30, 2013

Package ggimage. R topics documented: December 5, Title Use Image in 'ggplot2' Version 0.1.0

Package elmnnrcpp. July 21, 2018

Package clusternomics

Package fst. December 18, 2017

Package pwrab. R topics documented: June 6, Type Package Title Power Analysis for AB Testing Version 0.1.0

Package gggenes. R topics documented: November 7, Title Draw Gene Arrow Maps in 'ggplot2' Version 0.3.2

Package glinternet. June 15, 2018

Package SFtools. June 28, 2017

Package farver. November 20, 2018

Package coga. May 8, 2018

Package basictrendline

Package LINselect. June 9, 2018

Package polywog. April 20, 2018

Package sbrl. R topics documented: July 29, 2016

Package fusionclust. September 19, 2017

Package ggimage. R topics documented: November 1, Title Use Image in 'ggplot2' Version 0.0.7

Package Numero. November 24, 2018

Package EBglmnet. January 30, 2016

Package sessioninfo. June 21, 2017

Package RNAseqNet. May 16, 2017

Package CVR. March 22, 2017

Package bayescl. April 14, 2017

Package kdtools. April 26, 2018

Package nonlinearicp

Package manet. September 19, 2017

Package sankey. R topics documented: October 22, 2017

Package ConvergenceClubs

Package FMsmsnReg. March 30, 2016

Package nmslibr. April 14, 2018

Package wrswor. R topics documented: February 2, Type Package

Package ibm. August 29, 2016

Package mrm. December 27, 2016

Package epitab. July 4, 2018

Package ArCo. November 5, 2017

Package sure. September 19, 2017

Package ADMM. May 29, 2018

Package PCADSC. April 19, 2017

Package mixsqp. November 14, 2018

Package flsa. February 19, 2015

Package glmnetutils. August 1, 2017

Package robustgam. February 20, 2015

Package hypercube. December 15, 2017

Package rvinecopulib

Package bayesdp. July 10, 2018

Package orthodr. R topics documented: March 26, Type Package

Package LVGP. November 14, 2018

Package libstabler. June 1, 2017

Package TestDataImputation

Package tropicalsparse

Package rankdist. April 8, 2018

Transcription:

Type Package Title Variable Selection for Missing Data Version 0.1.1 Date 2018-04-05 Author Jiwei Zhao, Yang Yang, and Ning Yang Maintainer Yang Yang <yyang39@buffalo.edu> Package TVsMiss April 5, 2018 Use a regularization likelihood method to achieve variable selection purpose. Likelihood can be worked with penalty lasso, smoothly clipped absolute deviations (SCAD), and minimax concave penalty (MCP). Tuning parameter selection techniques include cross validation (CV), Bayesian information criterion (BIC) (low and high), stability of variable selection (svs), stability of BIC (sbic), and stability of estimation (sest). More details see Jiwei Zhao, Yang Yang, and Yang Ning (2018) <arxiv:1703.06379> ``Penalized pairwise pseudo likelihood for variable selection with nonignorable missing data.'' Statistica Sinica. License GPL (>= 2) Imports glmnet, Rcpp NeedsCompilation yes Encoding UTF-8 LazyData true RoxygenNote 6.0.1 LinkingTo Rcpp URL https://github.com/yang0117/tvsmiss BugReports https://github.com/yang0117/tvsmiss/issues Repository CRAN Date/Publication 2018-04-05 05:16:08 UTC 1

2 plot.tvsmiss R topics documented: pairdata........................................... 2 plot.tvsmiss........................................ 2 tvsmiss........................................... 3 Index 5 pairdata Generate paired dataset from original dataset Generate the paired data used for logistics regression pairdata(data1) data1 the original unpaired data pair01 <- pairdata(cbind(y,xm)) nrow(pair01) == choose(n,2) plot.tvsmiss plot solution path from the fitted "TVsMiss" object solution path is generated, the x-axis can be either in log or normal scale, the variable names of each predictors can be chosen to show or not ## S3 method for class 'TVsMiss' plot(x, label = FALSE, log = TRUE,...)

tvsmiss 3 x fitted "TVsMiss" object label If TRUE, the name of each predictor variable will be showed log If TRUE, x-axis is log scale; if FALSE, x-axis is in normal scale... graphical parameters to plot fit01 <- tvsmiss(x=xm,y=y) fit01$selection_beta fit01$beta_matrix plot(fit01) plot(fit01,x.log=true,label = FALSE) plot(fit01,x.log=true,label = TRUE) fit04 <- tvsmiss(x=xm,y=y,penalty = "SCAD",method = "BIC") fit04$selection_beta fit04$beta_matrix plot(fit04) plot(fit04,x.log = TRUE) plot(fit04,x.log = TRUE,label = TRUE) tvsmiss fit and select variable(s) for data with missing value Fit a model based on a pseudo likelihood and select variable(s) through one of multiple techniques. The regularization path is computed for lasso, SCAD, or MCP. Three steps are used to finish this the variable selection purpose: 1. remove missing and pair each observations; 2. use penalty to get lambda path and corresponding beta matrix; 3. use specific method to finish variable selection. tvsmiss(x, y, penalty = c("lasso", "MCP", "SCAD"), method = c("cv", "BIC", "BIC1", "BIC2", "sbic", "sbic1", "sbic2", "svs", "sest"), lambda = NULL, fold = 5, cv.ind = NULL, repeat_b = 20, alpha_n = 0.1, refit = F, gamma = switch(penalty, SCAD = 3.7, MCP = 3, lasso = NA), use.penalty = T)

4 tvsmiss x y penalty method lambda fold cv.ind repeat_b alpha_n refit gamma use.penalty the covariate matrix, should be in matrix format and at least two columns, each row is an observation the response variable the penalty used for regularization, can be lasso, SCAD, or MCP. The default is lasso. the variable selection method, can be cross-validation (CV), Bayesian information criterion (BIC), BIC1 and BIC2 are adapted for the consistency in the high dimension, sbic is the information stability, sbic1 and sbic2 are information stability for high dimension data, svs is the variable selection stability, sest is the estimation stability lambda path used in the regularization path. If not specified by user, the path will be generated automatically the number of folds used to divided data, will be used in CV, sbic, sbic1, sbic2, svs, and sest method a vector to indicate what fold each observations belong, useful to make reproducible research B parameter in svs method, the repeating time to calculate selection stability criteria the parameter used to take care of variables with weak effect in svs method If TRUE, refit technique will be used to get estimation, i.e., use selection variable to refit the model to get estimation the tuning parameter of the SCAD/MCP. Default is 3.7 for SCAD and 3 for MCP If TRUE, use penalty and variable selection techniques; if FALSE, just fit a logistic regression model with paired data fit01 <- tvsmiss(x=xm,y=y) fit01$selection_beta fit02 <- tvsmiss(x=xm,y=y,method = "BIC") fit02$selection_beta fit02$beta_matrix fit06 <- tvsmiss(x=xm,y=y,penalty = "SCAD",method = "svs",fold = 5) fit06$selection_beta fit06$beta_matrix

Index pairdata, 2 plot.tvsmiss, 2 tvsmiss, 3 5