Package sensory. R topics documented: February 23, Type Package

Similar documents
Package FisherEM. February 19, 2015

Package longclust. March 18, 2018

Package cwm. R topics documented: February 19, 2015

Package FPDclustering

Package mrm. December 27, 2016

Package missforest. February 15, 2013

Package glassomix. May 30, 2013

Package ssmn. R topics documented: August 9, Type Package. Title Skew Scale Mixtures of Normal Distributions. Version 1.1.

Package FMsmsnReg. March 30, 2016

Package RPMM. August 10, 2010

Package mixphm. July 23, 2015

Package pnmtrem. February 20, Index 9

Tools and methods for model-based clustering in R

Package SeleMix. R topics documented: November 22, 2016

Package flexcwm. May 20, 2018

Package StagedChoiceSplineMix

Package RPMM. September 14, 2014

Package mmtsne. July 28, 2017

Package rankdist. April 8, 2018

Package clusternomics

Package TVsMiss. April 5, 2018

Package Rambo. February 19, 2015

Package gee. June 29, 2015

Package clustmd. May 8, 2017

Package hsiccca. February 20, 2015

Package BayesCR. September 11, 2017

Package gpr. February 20, 2015

Package bisect. April 16, 2018

Package milr. June 8, 2017

Package palmtree. January 16, 2018

Package FastKNN. February 19, 2015

Package RaPKod. February 5, 2018

Package semisup. March 10, Version Title Semi-Supervised Mixture Model

Package kernelfactory

Package PrivateLR. March 20, 2018

Package LVGP. November 14, 2018

Package ICsurv. February 19, 2015

Clustering Lecture 5: Mixture Model

Package sgmcmc. September 26, Type Package

Package PCGSE. March 31, 2017

Package gibbs.met. February 19, 2015

Package subtype. February 20, 2015

Package glmmml. R topics documented: March 25, Encoding UTF-8 Version Date Title Generalized Linear Models with Clustering

Package extweibquant

CS 229 Midterm Review

Package mtsdi. January 23, 2018

Package capushe. R topics documented: April 19, Type Package

Package clustvarsel. April 9, 2018

Package rpst. June 6, 2017

Package kamila. August 29, 2016

Package SAENET. June 4, 2015

Expectation Maximization (EM) and Gaussian Mixture Models

Package CINID. February 19, 2015

Package MeanShift. R topics documented: August 29, 2016

Package AnDE. R topics documented: February 19, 2015

Package PedCNV. February 19, 2015

Package marqlevalg. February 20, 2015

Package lmesplines. R topics documented: February 20, Version

Package flam. April 6, 2018

Package Funclustering

Package GWRM. R topics documented: July 31, Type Package

Package naivebayes. R topics documented: January 3, Type Package. Title High Performance Implementation of the Naive Bayes Algorithm

Package tidylpa. March 28, 2018

Package ssa. July 24, 2016

Package RCA. R topics documented: February 29, 2016

Package robustreg. R topics documented: April 27, Version Date Title Robust Regression Functions

Package RAMP. May 25, 2017

Package fractalrock. February 19, 2015

Package vinereg. August 10, 2018

Package BayesLogit. May 6, 2012

Package NB. R topics documented: February 19, Type Package

Package dglm. August 24, 2016

Package ClustVarLV. December 14, 2016

Package DPBBM. September 29, 2016

Package CoClust. December 17, 2017

Package tidyimpute. March 5, 2018

Package sbf. R topics documented: February 20, Type Package Title Smooth Backfitting Version Date Author A. Arcagni, L.

Package datasets.load

Package nlnet. April 8, 2018

Package strat. November 23, 2016

Package spatialtaildep

Package manet. September 19, 2017

Package SSLASSO. August 28, 2018

Machine Learning and Data Mining. Clustering (1): Basics. Kalev Kask

Package msgps. February 20, 2015

Package nodeharvest. June 12, 2015

Package DCEM. September 30, 2018

Package stockr. June 14, 2018

Package RandPro. January 10, 2018

Package stochprofml. February 20, 2015

Package TANDEM. R topics documented: June 15, Type Package

Package GenSA. R topics documented: January 17, Type Package Title Generalized Simulated Annealing Version 1.1.

Package mcclust.ext. May 15, 2015

Package sbrl. R topics documented: July 29, 2016

Package gppm. July 5, 2018

Package PUlasso. April 7, 2018

Package DSBayes. February 19, 2015

Package bayescl. April 14, 2017

Package bingat. R topics documented: July 5, Type Package Title Binary Graph Analysis Tools Version 1.3 Date

Package gwrr. February 20, 2015

Transcription:

Package sensory February 23, 2016 Type Package Title Simultaneous Model-Based Clustering and Imputation via a Progressive Expectation-Maximization Algorithm Version 1.1 Date 2016-02-23 Author Brian C. Franczak, Ryan P. Browne and Paul D. McNicholas Maintainer Brian C. Franczak <bfrancza@math.mcmaster.ca> Contains the function CUUimpute() which performs modelbased clustering and imputation simultaneously. Depends Matrix, gtools, MASS, R (>= 3.2.2) NeedsCompilation no License GPL (>= 2) Repository CRAN Date/Publication 2016-02-23 23:02:20 R topics documented: CUUimpute......................................... 2 sensory........................................... 4 The Brown Bread Data................................... 5 The Iris Data with Missing Values............................. 5 The White Bread Data................................... 6 Index 7 1

2 CUUimpute CUUimpute Cluster-Wise Imputation via a PEM Algorithm Simultaneously performs model-based clustering and imputation using a Parsimonious Gaussian Mixture Model (PGMM) with CUU covariance structure. CUUimpute(x, G = 1:3, q = 1:2, epsilon = 0.01, max.iter = 10000, known = NULL, print = TRUE) Arguments x G q epsilon max.iter known print A numeric matrix. A number or vector indicating the number components to fit. A number or vector indicating the number of latent factors to fit. The tolerance value for Aitken s acceleration. The maximum number of iterations for the PEM algorithm. A warning message is displayed if the maximum is met. Optional. A vector of group memberships that must be numeric and whose length must be equal to the number of rows in x. (See Example I below) Logical indicating whether or not the iteration number and the corresponding log-likelihood value should be printed. Details The PGMM with CUU covariance structure, herein referred to as the CUU model, was developed in McNicholas and Murphy (2008) for model-based clustering. The CUU model, like the other PGMMs, arises by assuming a latent Gaussian model structure for each population. As a result, the number of free parameters in CUU model s covariance structure increases linearly as the dimension of the data increases. Browne et al. (2013) developed a Progressive Expectation-Maximization (PEM) algorithm to fit the CUU model to data with missing values. Under this parameter estimation scheme the CUU model is able to simultaneously impute and cluster a given data matrix. The CUUimpute() function fits the CUU model for a varying number of components, G, and latent factors, q, to a data matrix x. The function will run if only a data matrix is supplied however, the user is able to choose how many components and latent factors to fit, specify the tolerance limit for Aitken s acceleration, the maximum number of PEM iterations the algorithm will perform and give a vector representing the group memberships of each observation, if that information exists.

CUUimpute 3 Value allbic bic G q loglik gpar yhat u zig map class.table iclresult An array containing the Bayesian Information Criterion (BIC) values for each CUU model fitted to the data. The BIC of the best fitting CUU model. The number of components in the best fitting CUU model. The number of latent factors in the best fitting CUU model. A vector whose length is equal to the number of PEM iterations performed and whose elements contain the log-likelihood value on each iteration. A list of the model parameters corresponding to the best fitting CUU model. A data matrix with imputed values. A matrix containing the values of the latent variables. A matrix giving the probabilities of group membership for each observation. Note: these probabilties are based off the model parameters of the best fitting CUU model A vector whose length is equal to the number of observations and whose elements correspond to the group membership of each observation. A cross tabulation between the predicted and true group memberships. A list containing all of the information listed above for the best fitting CUU model chosen by the Integrated Complete Likelihood (ICL) measure. Note: allbic and bic are replaced by allicl and icl. Author(s) Brian C. Franczak, Ryan P. Browne and Paul D. McNicholas Maintainer: Brian C. Franczak <bfrancza@math.mcmaster.ca> References P.D. McNicholas and T.B. Murphy (2008). Parsimonious Gaussian Mixture Models. Statistics and Computing (18), 285-296. Browne, R.P., P.D. McNicholas, and C.J. Findlay (2013). A partial EM algorithm for clustering white breads. arxiv preprint arxiv:1302.6625. See Also sensory ### Example I data(iiris) # loads the modified Iris data head(iiris) # displays the first six rows of the modified Iris data data(iris) # loads the original Iris data # create a vectors whose elements contain the group memberships of each flower

4 sensory iris.known <- as.integer(iris[,ncol(iris)]) # fit a three component CUU models to the modified Iris data output <- CUUimpute(x=iiris,G=3,q=1,print=FALSE,known=iris.known) output summary(output) # Summarizes the results names(output) # Shows what results are available output$allbic # Gives every BIC value output$bic # Gives BIC for best fitting model output$g # Gives the number of components for the best fitting CUU model output$loglik # Gives a vector whose elements are the log-likelihood at each iteration output$gpar # Gives the model parameters for the best fitting CUU model head(output$yhat) # Displays the first 6 rows of the imputed matrix output$yhat # Gives the entire imputed matrix output$class.table # Gives a classification table between the predicted and true group memberships sensory Cluster-Wise Imputation via a PEM Algorithm Details Contains the function CUUimpute() which performs model-based clustering and imputation simultaneously. Package: sensory Type: Package Version: 1.0 Date: 2014-08-07 License: GPL (>=2) This package contains two data sets collected at Compusense Inc. in Guelph, Ontario, Canada, a modified version of Fisher s Irises, and the function CUUImpute(). Author(s) Brian C. Franczak, Ryan P. Browne and Paul D. McNicholas Maintainer: Brian C. Franczak <bfrancza@math.mcmaster.ca> References P.D. McNicholas and T.B. Murphy (2008). Parsimonious Gaussian Mixture Models. Statistics and Computing (18), 285-296. Browne, R.P., P.D. McNicholas, and C.J. Findlay (2013). A partial EM algorithm for clustering white breads. arxiv preprint arxiv:1302.6625.

The Brown Bread Data 5 See Also Details, examples, and references are given under CUUimpute The Brown Bread Data The Brown Bread Data Set The brown bread data set is composed of 570 panelists and 16 products, labelled A,...,P to protect the identity of the manufacturer. Each panelist rated a subset of 6 products using the 9-point Hedonic Scale. data(bbread) data(bbread) # Loads the brown bread data set head(bbread) # Displays the first six rows of the brown bread data set The Iris Data with Missing Values The Iris Data with Missing Values Fisher s Irises with one measurement removed per observation. For details load data(iris,package="gclus"). data(iiris) data(iiris) # Loads the modified Iris data. head(iiris) # Displays the first six rows of the modified Iris Data

6 The White Bread Data The White Bread Data The White Bread Data The white bread data is a liking study composed of 420 panelists and 12 products, labelled A,...,L to protect the identity of the manufacturer. Each panelist rated 6 white breads using the 9-point Hedonic scale. data(wbread) References Browne, R.P., P.D. McNicholas, and C.J. Findlay (2013). A partial EM algorithm for clustering white breads. arxiv prerpeint arxiv:1302.6625. data(wbread) # Loads the white bread data head(wbread) # Displays the first six rows of the white bread data

Index Topic Brown Bread Data Topic CUU Model Topic Clustering Topic Expectation Maximization (EM) Algorithm Topic Fisher s Irises The Iris Data with Missing Values, 5 Topic Imputation Topic Parsimonious Gaussian Mixture Models Topic Sensory Informed Designs Topic White Bread Data Topic datasets The Iris Data with Missing Values, 5 sensory, 3, 4 The Iris Data with Missing Values, 5 wbread (The White Bread Data), 6 bbread (The Brown Bread Data), 5, 5 iiris (The Iris Data with Missing Values), 5 7