CATLVM User Guide for Latent Transition Analysis

Size: px
Start display at page:

Download "CATLVM User Guide for Latent Transition Analysis"

Transcription

1 CATLVM User Guide for Latent Transition Analysis Dimension Term Type Definition nkase Integer scalar Number of subject nstr Integer scalar Number of subgroups nc Integer scalar Number of latent subgroups ns Integer scalar Number of latent dynamic statuses nt Integer scalar Number of time points nsi Integer scalar Number of manifest items nrs Integer vector of length nsi Number of categories for each item ncov$delta Integer scalar Number of covariates for DELTA ncov$tau Integer scalar Number of covariates for TAU Parameters for an LTA Parameter Array dimension Definition param$lta$big.rho nsi max(nrs) ns nt nc nstr Item response probabilities within a latent status at time t for each latent class and subgroup param$lta$delta nt ns nc nstr Status prevalence for each latent class and subgroup Param$lta$tau ns ns nt-1 nc nstr Transitional probability of status membership param$lta$beta$delta ncov$delta ns nstr Logistic regression coefficients to predict status membership at the initial time param$lta$beta$tau ncov$tau ns ns nt- 1 nc nstr Logistic regression coefficients to predict the transition of statuses 1

2 1. Loading the data Data should be loaded in a list. Data-list consists of following: data$is.grp (logical) True if analysis will use group variable. data$group (optional) An integer vector of group variable with length of nkase. Integer values should start from 1. If data$is.grp = TRUE, then data$group should be provided. data$lta$si Observed manifest items to measure latent statuses. It should be an array size of nkase nsi nt. The values must be integer starting from 1. Missing values should be coded as NA. data$lta$cov$delta (optional) Observed covariates to predict latent status membership at the initial time point. It should be a matrix size of nkase ncov$delta. data$lta$cov$tau (optional) Observed covariates to predict transition of latent status membership. It should be an array size of nkase ncov$tau (nt-1). 2. Model configuration Model configuration should be saved in a list. Model-list consists of following: model$lta$is.lta (logical) True if you want to fit a latent transition analysis. model$lta$ns Number of latent statuses. model$lta$nt Number of time points. model$lta$nrs Integer vector. Number of response categories for each item. The length of vector should be equal to the number of items (i.e., nsi). 2

3 3. Parameter constraints (optional) Parameter constraints should be saved in a list. Constraint-list consists of following: constraint$lta$equal$big.rho$group (logical) True if param$lta$big.rho are constrained to be equal across group variable. Default value = FALSE. constraint$lta$equal$big.rho$time (logical) True if param$lta$big.rho are constrained to be equal across time points. Default value = FALSE. constraint$lta$equal$big.rho$class (logical) True if param$lta$big.rho are constrained to be equal across latent classes (latent group variable). It will be used only when model$lca$is.lca = TRUE. Default value = FALSE. constraint$lta$big.rho Integer array of 1 or 0. The elements of constraint$lta$big.rho with one represent that the corresponding param$lta$big.rho will be freely estimated. The elements of constraint$lta$big.rho with 0 means that the corresponding param$lta$big.rho will be fixed as zero. Dimension of constraint$lta$big.rho is equal to param$lta$big.rho. Default setting is to freely estimate. constraint$lta$delta Integer array of 1 or 0. The array displays the constraint of param$lta$delta at Time 1. The elements of constraint$lta$delta with 1 represents that the corresponding param$lta$delta will be freely estimated. The elements of constraint$lta$delta with 0 means that the corresponding param$lta$delta will be fixed as zero. Dimension of constraint$lta$delta is ns nc nstr. Default setting is to freely estimate. constraint$lta$tau Integer array of 1 or 0. The elements of constraint$lta$taua with 1 represents that the corresponding param$lta$tau will be freely estimated. The elements of constraint$lta$tau with 0 means that the corresponding param$lta$tau will be 3

4 fixed as zero. Dimension of constraint$lta$tau is equal to param$lta$tau. Default setting is to freely estimate. 4. Starting values (optional) Starting-value configuration should be saved in a list. Starval-list consists of following: starval$is.random (logical, optional) True if you want to generate random starting values. Default value = TRUE. starval$param$lta (optional) Starting values provided by users. If starval$is.random = FALSE, then users should provide starting values for LTA parameters. The dimension starval$param$lta is equal to param$lta. starval$ntry (optional) Number of sets of different starting values that the program will try. This is to assure global maximum at the final solution. Default value = 10. starval$try.maxits (optional) Maximum number of iterations for each set of starting values. Program will pick the set starting values which leads to the largest likelihood after this trial iteration. Default value = Iteration (optional) Iteration configuration should be saved in a list. Iteration-list consists of following: iteration$maxits (optional) Maximum number of iterations. Default value = iteration$eps (optional) Convergence criteria. Default value = 1e-6. 4

5 An example of latent transition analysis using CATLVM This is a three-class LTA with logistic regression. Three latent classes are measured by six binary items from two subgroups. nkase = 500 nstr = 1 nc = 1 nt = 3 ns = 3 nsi = 4 nrc = c(2, 2, 2, 2) ## Items of first five subjects LTA.sItem[1:5,,],, 1 In R Console [,3] [,4] [1,] [2,] [3,] [4,] NA [5,] ,, 2 [,3] [,4] [1,] NA [2,] [3,] [4,] [5,] NA,, 3 5

6 [,3] [,4] [1,] NA [2,] [3,] [4,] [5,] ## DELTA cvariate of first five subjects LTA.covariate$delta[1:5,] [1] ## TAU cvariate of first five subjects LTA.covariate$tau[1:5,,] [1,] [2,] [3,] [4,] [5,] ############### ## LOAD DATA ## ############### data <- list() data$lta$si <- LTA.sItem ########################## ## SPECIFY AN LTA MODEL ## ########################## model <- list() model$lta$is.lta <- TRUE model$lta$ns <- 3 model$lta$nrs <- c(2, 2, 2, 2) ############################# ## RESTRICT RHO PARAMETERS ## ############################# constraint <- list() 6

7 constraint$lta$equal$big.rho$time <- TRUE ########################## ## STARTING VALUE SETUP ## ########################## starval <- list() starval$is.random <- TRUE ##################### ## RUNNING PROGRAM ## ##################### LTA.1 <- cat.lvm(data=data, model=model, starval=starval, + constraint=constraint) Trying to find the best set of starting values... SET: SET: SET: SET: SET: SET: SET: SET: SET: SET: Done. SET 3 selected. Starting main iteration e e e-07 OK, converged at 166 7

8 summary(lta.1) Length Class Mode model 3 -none- list iteration 6 -none- list param 1 -none- list constraint 1 -none- list prior 1 -none- list ######################### ## ESTIMATED PRAMETERS ## ######################### LTA.1$param $lta $lta$big.rho,, Stage.1, Time.1,, sitem sitem sitem sitem ,, Stage.2, Time.1,, sitem sitem sitem sitem ,, Stage.3, Time.1,, sitem sitem sitem sitem

9 ,, Stage.1, Time.2,, sitem sitem sitem sitem ,, Stage.2, Time.2,, sitem sitem sitem sitem ,, Stage.3, Time.2,, sitem sitem sitem sitem ,, Stage.1, Time.3,, sitem sitem sitem sitem ,, Stage.2, Time.3,, sitem sitem sitem

10 sitem ,, Stage.3, Time.3,, sitem sitem sitem sitem $lta$delta,,, Time Time Time $lta$tau,, Time.2,, Stage Stage Stage ,, Time.3,, Stage Stage Stage

11 ############################## ## INFO ABOUT EM ITERATIONS ## ############################## LTA.1$iteration $maxits [1] 2500 $eps [1] 1e-06 $niter [1] 166 $maxdiff [1] e-07 $converged [1] TRUE $llvec [1] ################################# ## INFO ABOUT FITTED LTA MODEL ## ################################# LTA.1$model $lta $lta$is.lta [1] TRUE $lta$ns [1] 3 $lta$nrs [1] $lta$nkase 11

12 [1] 50 $lta$nstr [1] 1 $lta$nsi [1] 4 $lta$nt [1] 3 $lta$nc [1] 1 $lta$ncov $lta$ncov$delta [1] 0 $lta$ncov$tau [1] 0 $lca $lca$is.lca [1] FALSE $lca$nc [1] 1 $lcpa $lcpa$is.lcpa [1] FALSE ###################################### 12

13 ## INFO ABOUT PARAMETER CONSTRAINTS ## ###################################### LTA.1$constraint $lta $lta$equal $lta$equal$big.rho $lta$equal$big.rho$group [1] FALSE $lta$equal$big.rho$time [1] TRUE $lta$equal$big.rho$class [1] FALSE $lta$big.rho,, Stage.1, Time.1,, sitem sitem sitem sitem.4 1 1,, Stage.2, Time.1,, sitem sitem sitem sitem.4 1 1,, Stage.3, Time.1,, sitem

14 sitem sitem sitem.4 1 1,, Stage.1, Time.2,, sitem sitem sitem sitem.4 1 1,, Stage.2, Time.2,, sitem sitem sitem sitem.4 1 1,, Stage.3, Time.2,, sitem sitem sitem sitem.4 1 1,, Stage.1, Time.3,, sitem sitem sitem sitem.4 1 1,, Stage.2, Time.3,, 14

15 sitem sitem sitem sitem.4 1 1,, Stage.3, Time.3,, sitem sitem sitem sitem $lta$delta,, Stage.1 1 Stage.2 1 Stage.3 1 $lta$tau,, Time.2,, Stage Stage Stage ,, Time.3,, Stage Stage Stage

16 ##################################### ## ADD DELTA COVARIATES TO THE LTA ## ##################################### data$lta$cov$delta <- LTA.covariate$delta ##################### ## RUNNING PROGRAM ## ##################### LTA.REG.1 <- cat.lvm(data=data, model=model, starval=starval, + constraint=constraint) Trying to find the best set of starting values... SET: SET: SET: SET: SET: SET: SET: SET: SET: SET: Done. SET 5 selected. Starting main iteration e e e e-07 OK, converged at 71 16

17 ################################### ## ADD TAU COVARIATES TO THE LTA ## ################################### data$lta$cov$tau <- LTA.covariate$tau ###################################################### ## FIX SOME TAUs AS ZERO TO AVOID BOUNDARY SOLUTION ## ###################################################### constraint$lta$tau <- LTA.REG.1$constraint$lta$TAU w <- LTA.REG.1$param$lta$tau < 0.1 constraint$lta$tau[w] <- 0 ##################### ## RUNNING PROGRAM ## ##################### LTA.REG.2 <- cat.lvm(data=data, model=model, starval=starval, + constraint=constraint) Trying to find the best set of starting values... SET: SET: SET: SET: SET: SET: SET: SET: SET: SET: Done. SET 9 selected. Starting main iteration

18 e e e e-07 OK, converged at 32 ######################### ## ESTIMATED PRAMETERS ## ######################### LTA.REG.2$param $lta $lta$big.rho,, Stage.1, Time.1,, sitem sitem sitem sitem ,, Stage.2, Time.1,, sitem sitem sitem sitem ,, Stage.3, Time.1,, sitem sitem sitem sitem

19 ,, Stage.1, Time.2,, sitem sitem sitem sitem ,, Stage.2, Time.2,, sitem sitem sitem sitem ,, Stage.3, Time.2,, sitem sitem sitem sitem ,, Stage.1, Time.3,, sitem sitem sitem sitem ,, Stage.2, Time.3,, sitem sitem sitem

20 sitem ,, Stage.3, Time.3,, sitem sitem sitem sitem $lta$delta,,, Time Time Time $lta$tau,, Time.2,, Stage Stage Stage ,, Time.3,, Stage Stage Stage $lta$beta $lta$beta$delta 20

21 ,,, Int cov.delta $lta$beta$tau,, Stage.1, Time.2,, Int. 0 NA NA cov.tau.1 0 NA NA,, Stage.2, Time.2,, Int. NA cov.tau.1 NA ,, Stage.3, Time.2,, Int. NA 0 NA cov.tau.1 NA 0 NA,, Stage.1, Time.3,, Int NA cov.tau NA,, Stage.2, Time.3,, Int. NA cov.tau.1 NA

22 ,, Stage.3, Time.3,, Int cov.tau ################################################# ## USE ESTIMATED PARAMETERS AS STARTING VALUES ## ################################################# starval <- list() starval$is.random <- FALSE starval$param <- LTA.REG.2$param LTA.REG.33 <- cat.lvm(data=data, model=model, starval=starval, + constraint=constraint) Starting main iteration e-07 OK, converged at 1 22

CATLVM User Guide for Latent Class-Profile Analysis

CATLVM User Guide for Latent Class-Profile Analysis CATLVM User Guide for Latent Class-Profile Analysis Dimension Term Type Definition nkase Integer scalar Number of subject nstr Integer scalar Number of subgroups nc Integer scalar Number of static latent

More information

SAS Graphics Macros for Latent Class Analysis Users Guide

SAS Graphics Macros for Latent Class Analysis Users Guide SAS Graphics Macros for Latent Class Analysis Users Guide Version 2.0.1 John Dziak The Methodology Center Stephanie Lanza The Methodology Center Copyright 2015, Penn State. All rights reserved. Please

More information

Expectation Maximization (EM) and Gaussian Mixture Models

Expectation Maximization (EM) and Gaussian Mixture Models Expectation Maximization (EM) and Gaussian Mixture Models Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer 1 2 3 4 5 6 7 8 Unsupervised Learning Motivation

More information

Package hiernet. March 18, 2018

Package hiernet. March 18, 2018 Title A Lasso for Hierarchical Interactions Version 1.7 Author Jacob Bien and Rob Tibshirani Package hiernet March 18, 2018 Fits sparse interaction models for continuous and binary responses subject to

More information

Package StagedChoiceSplineMix

Package StagedChoiceSplineMix Type Package Package StagedChoiceSplineMix August 11, 2016 Title Mixture of Two-Stage Logistic Regressions with Fixed Candidate Knots Version 1.0.0 Date 2016-08-10 Author Elizabeth Bruch ,

More information

Function Algorithms: Linear Regression, Logistic Regression

Function Algorithms: Linear Regression, Logistic Regression CS 4510/9010: Applied Machine Learning 1 Function Algorithms: Linear Regression, Logistic Regression Paula Matuszek Fall, 2016 Some of these slides originated from Andrew Moore Tutorials, at http://www.cs.cmu.edu/~awm/tutorials.html

More information

Package SMAT. January 29, 2013

Package SMAT. January 29, 2013 Package SMAT January 29, 2013 Type Package Title Scaled Multiple-phenotype Association Test Version 0.98 Date 2013-01-26 Author Lin Li, Ph.D.; Elizabeth D. Schifano, Ph.D. Maintainer Lin Li ;

More information

Package SmoothHazard

Package SmoothHazard Package SmoothHazard September 19, 2014 Title Fitting illness-death model for interval-censored data Version 1.2.3 Author Celia Touraine, Pierre Joly, Thomas A. Gerds SmoothHazard is a package for fitting

More information

Package cwm. R topics documented: February 19, 2015

Package cwm. R topics documented: February 19, 2015 Package cwm February 19, 2015 Type Package Title Cluster Weighted Models by EM algorithm Version 0.0.3 Date 2013-03-26 Author Giorgio Spedicato, Simona C. Minotti Depends R (>= 2.14), MASS Imports methods,

More information

Package gpr. February 20, 2015

Package gpr. February 20, 2015 Package gpr February 20, 2015 Version 1.1 Date 2013-08-27 Title A Minimalistic package to apply Gaussian Process in R License GPL-3 Author Maintainer ORPHANED Depends R (>= 2.13) This package provides

More information

John J. Dziak, Stephanie M. Lanza, and Xianming Tan

John J. Dziak, Stephanie M. Lanza, and Xianming Tan Appendix 2: Technical Appendix for Effect Size, Statistical Power and Sample Size Requirements for the Bootstrap Likelihood Ratio Test in Latent Class Analysis John J. Dziak, Stephanie M. Lanza, and Xianming

More information

Package StVAR. February 11, 2017

Package StVAR. February 11, 2017 Type Package Title Student's t Vector Autoregression (StVAR) Version 1.1 Date 2017-02-10 Author Niraj Poudyal Maintainer Niraj Poudyal Package StVAR February 11, 2017 Description Estimation

More information

Clustering Lecture 5: Mixture Model

Clustering Lecture 5: Mixture Model Clustering Lecture 5: Mixture Model Jing Gao SUNY Buffalo 1 Outline Basics Motivation, definition, evaluation Methods Partitional Hierarchical Density-based Mixture model Spectral methods Advanced topics

More information

Introduction to Machine Learning CMU-10701

Introduction to Machine Learning CMU-10701 Introduction to Machine Learning CMU-10701 Clustering and EM Barnabás Póczos & Aarti Singh Contents Clustering K-means Mixture of Gaussians Expectation Maximization Variational Methods 2 Clustering 3 K-

More information

Package SeleMix. R topics documented: November 22, 2016

Package SeleMix. R topics documented: November 22, 2016 Package SeleMix November 22, 2016 Type Package Title Selective Editing via Mixture Models Version 1.0.1 Date 2016-11-22 Author Ugo Guarnera, Teresa Buglielli Maintainer Teresa Buglielli

More information

Also, for all analyses, two other files are produced upon program completion.

Also, for all analyses, two other files are produced upon program completion. MIXOR for Windows Overview MIXOR is a program that provides estimates for mixed-effects ordinal (and binary) regression models. This model can be used for analysis of clustered or longitudinal (i.e., 2-level)

More information

CS 229 Midterm Review

CS 229 Midterm Review CS 229 Midterm Review Course Staff Fall 2018 11/2/2018 Outline Today: SVMs Kernels Tree Ensembles EM Algorithm / Mixture Models [ Focus on building intuition, less so on solving specific problems. Ask

More information

The glmmml Package. August 20, 2006

The glmmml Package. August 20, 2006 The glmmml Package August 20, 2006 Version 0.65-1 Date 2006/08/20 Title Generalized linear models with clustering A Maximum Likelihood and bootstrap approach to mixed models. License GPL version 2 or newer.

More information

Package mixsqp. November 14, 2018

Package mixsqp. November 14, 2018 Encoding UTF-8 Type Package Version 0.1-79 Date 2018-11-05 Package mixsqp November 14, 2018 Title Sequential Quadratic Programming for Fast Maximum-Likelihood Estimation of Mixture Proportions URL https://github.com/stephenslab/mixsqp

More information

Package PrivateLR. March 20, 2018

Package PrivateLR. March 20, 2018 Type Package Package PrivateLR March 20, 2018 Title Differentially Private Regularized Logistic Regression Version 1.2-22 Date 2018-03-19 Author Staal A. Vinterbo Maintainer Staal

More information

CHAPTER 1 INTRODUCTION

CHAPTER 1 INTRODUCTION Introduction CHAPTER 1 INTRODUCTION Mplus is a statistical modeling program that provides researchers with a flexible tool to analyze their data. Mplus offers researchers a wide choice of models, estimators,

More information

7 Control Structures, Logical Statements

7 Control Structures, Logical Statements 7 Control Structures, Logical Statements 7.1 Logical Statements 1. Logical (true or false) statements comparing scalars or matrices can be evaluated in MATLAB. Two matrices of the same size may be compared,

More information

Package HMMASE. February 4, HMMASE R package

Package HMMASE. February 4, HMMASE R package Package HMMASE February 4, 2014 Type Package Title HMMASE R package Version 1.0 Date 2014-02-04 Author Juan R. Steibel, Heng Wang, Ping-Shou Zhong Maintainer Heng Wang An R package that

More information

CHAPTER 7 EXAMPLES: MIXTURE MODELING WITH CROSS- SECTIONAL DATA

CHAPTER 7 EXAMPLES: MIXTURE MODELING WITH CROSS- SECTIONAL DATA Examples: Mixture Modeling With Cross-Sectional Data CHAPTER 7 EXAMPLES: MIXTURE MODELING WITH CROSS- SECTIONAL DATA Mixture modeling refers to modeling with categorical latent variables that represent

More information

ECE521: Week 11, Lecture March 2017: HMM learning/inference. With thanks to Russ Salakhutdinov

ECE521: Week 11, Lecture March 2017: HMM learning/inference. With thanks to Russ Salakhutdinov ECE521: Week 11, Lecture 20 27 March 2017: HMM learning/inference With thanks to Russ Salakhutdinov Examples of other perspectives Murphy 17.4 End of Russell & Norvig 15.2 (Artificial Intelligence: A Modern

More information

Clustering & Dimensionality Reduction. 273A Intro Machine Learning

Clustering & Dimensionality Reduction. 273A Intro Machine Learning Clustering & Dimensionality Reduction 273A Intro Machine Learning What is Unsupervised Learning? In supervised learning we were given attributes & targets (e.g. class labels). In unsupervised learning

More information

CSE 158. Web Mining and Recommender Systems. Midterm recap

CSE 158. Web Mining and Recommender Systems. Midterm recap CSE 158 Web Mining and Recommender Systems Midterm recap Midterm on Wednesday! 5:10 pm 6:10 pm Closed book but I ll provide a similar level of basic info as in the last page of previous midterms CSE 158

More information

Package glmmml. R topics documented: March 25, Encoding UTF-8 Version Date Title Generalized Linear Models with Clustering

Package glmmml. R topics documented: March 25, Encoding UTF-8 Version Date Title Generalized Linear Models with Clustering Encoding UTF-8 Version 1.0.3 Date 2018-03-25 Title Generalized Linear Models with Clustering Package glmmml March 25, 2018 Binomial and Poisson regression for clustered data, fixed and random effects with

More information

Package pnmtrem. February 20, Index 9

Package pnmtrem. February 20, Index 9 Type Package Package pnmtrem February 20, 2015 Title Probit-Normal Marginalized Transition Random Effects Models Version 1.3 Date 2013-05-19 Author Ozgur Asar, Ozlem Ilk Depends MASS Maintainer Ozgur Asar

More information

SAS Structural Equation Modeling 1.3 for JMP

SAS Structural Equation Modeling 1.3 for JMP SAS Structural Equation Modeling 1.3 for JMP SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2012. SAS Structural Equation Modeling 1.3 for JMP. Cary,

More information

Practice Questions for Midterm

Practice Questions for Midterm Practice Questions for Midterm - 10-605 Oct 14, 2015 (version 1) 10-605 Name: Fall 2015 Sample Questions Andrew ID: Time Limit: n/a Grade Table (for teacher use only) Question Points Score 1 6 2 6 3 15

More information

Ansoft HFSS Convergence

Ansoft HFSS Convergence Data Max Delta Matrix Parameters Ansoft HFSS Choose from the Executive Commands window to view information about the solution. If you have solved for a driven solution, the following window appears: Maxwell

More information

Example Using Missing Data 1

Example Using Missing Data 1 Ronald H. Heck and Lynn N. Tabata 1 Example Using Missing Data 1 Creating the Missing Data Variable (Miss) Here is a data set (achieve subset MANOVAmiss.sav) with the actual missing data on the outcomes.

More information

Package PUlasso. April 7, 2018

Package PUlasso. April 7, 2018 Type Package Package PUlasso April 7, 2018 Title High-Dimensional Variable Selection with Presence-Only Data Version 3.1 Date 2018-4-4 Efficient algorithm for solving PU (Positive and Unlabelled) problem

More information

Package SAENET. June 4, 2015

Package SAENET. June 4, 2015 Type Package Package SAENET June 4, 2015 Title A Stacked Autoencoder Implementation with Interface to 'neuralnet' Version 1.1 Date 2015-06-04 An implementation of a stacked sparse autoencoder for dimension

More information

Multinomial Regression and the Softmax Activation Function. Gary Cottrell!

Multinomial Regression and the Softmax Activation Function. Gary Cottrell! Multinomial Regression and the Softmax Activation Function Gary Cottrell Notation reminder We have N data points, or patterns, in the training set, with the pattern number as a superscript: {(x 1,t 1 ),

More information

A New Method of Using Polytomous Independent Variables with Many Levels for the Binary Outcome of Big Data Analysis

A New Method of Using Polytomous Independent Variables with Many Levels for the Binary Outcome of Big Data Analysis Paper 2641-2015 A New Method of Using Polytomous Independent Variables with Many Levels for the Binary Outcome of Big Data Analysis ABSTRACT John Gao, ConstantContact; Jesse Harriott, ConstantContact;

More information

S2 Text. Instructions to replicate classification results.

S2 Text. Instructions to replicate classification results. S2 Text. Instructions to replicate classification results. Machine Learning (ML) Models were implemented using WEKA software Version 3.8. The software can be free downloaded at this link: http://www.cs.waikato.ac.nz/ml/weka/downloading.html.

More information

Clustering and The Expectation-Maximization Algorithm

Clustering and The Expectation-Maximization Algorithm Clustering and The Expectation-Maximization Algorithm Unsupervised Learning Marek Petrik 3/7 Some of the figures in this presentation are taken from An Introduction to Statistical Learning, with applications

More information

The brlr Package. March 22, brlr... 1 lizards Index 5. Bias-reduced Logistic Regression

The brlr Package. March 22, brlr... 1 lizards Index 5. Bias-reduced Logistic Regression The brlr Package March 22, 2006 Version 0.8-8 Date 2006-03-22 Title Bias-reduced logistic regression Author David Firth URL http://www.warwick.ac.uk/go/dfirth Maintainer David Firth

More information

Package sensory. R topics documented: February 23, Type Package

Package sensory. R topics documented: February 23, Type Package Package sensory February 23, 2016 Type Package Title Simultaneous Model-Based Clustering and Imputation via a Progressive Expectation-Maximization Algorithm Version 1.1 Date 2016-02-23 Author Brian C.

More information

Package RAMP. May 25, 2017

Package RAMP. May 25, 2017 Type Package Package RAMP May 25, 2017 Title Regularized Generalized Linear Models with Interaction Effects Version 2.0.1 Date 2017-05-24 Author Yang Feng, Ning Hao and Hao Helen Zhang Maintainer Yang

More information

Pattern Recognition Winter Semester 2009/2010. Exemplary Solution

Pattern Recognition Winter Semester 2009/2010. Exemplary Solution Pattern Recognition Winter Semester 2009/2010 Exemplary Solution (bernhard.renard@iwr.uni-heidelberg.de) 4 Exercise The focus of this exercise will be neural networks. You will train and apply a multi-layer

More information

Package actyr. August 5, 2015

Package actyr. August 5, 2015 Type Package Package actyr August 5, 2015 Title Replication Package for Abbring, Campbell, Tilly, Yang (2014): Very Simple Markov Perfect Industry Dynamics Version 1.0 Date 2015-08-01 Author Jan Tilly

More information

STA431 Handout 9 Double Measurement Regression on the BMI Data

STA431 Handout 9 Double Measurement Regression on the BMI Data STA431 Handout 9 Double Measurement Regression on the BMI Data /********************** bmi5.sas **************************/ options linesize=79 pagesize = 500 noovp formdlim='-'; title 'BMI and Health:

More information

Using Machine Learning to Optimize Storage Systems

Using Machine Learning to Optimize Storage Systems Using Machine Learning to Optimize Storage Systems Dr. Kiran Gunnam 1 Outline 1. Overview 2. Building Flash Models using Logistic Regression. 3. Storage Object classification 4. Storage Allocation recommendation

More information

CS395T paper review. Indoor Segmentation and Support Inference from RGBD Images. Chao Jia Sep

CS395T paper review. Indoor Segmentation and Support Inference from RGBD Images. Chao Jia Sep CS395T paper review Indoor Segmentation and Support Inference from RGBD Images Chao Jia Sep 28 2012 Introduction What do we want -- Indoor scene parsing Segmentation and labeling Support relationships

More information

Last updated January 4, 2012

Last updated January 4, 2012 Last updated January 4, 2012 This document provides a description of Mplus code for implementing mixture factor analysis with four latent class components with and without covariates described in the following

More information

Package vennlasso. January 25, 2019

Package vennlasso. January 25, 2019 Type Package Package vennlasso January 25, 2019 Title Variable Selection for Heterogeneous Populations Version 0.1.5 Provides variable selection and estimation routines for models with main effects stratified

More information

Package FisherEM. February 19, 2015

Package FisherEM. February 19, 2015 Type Package Title The Fisher-EM algorithm Version 1.4 Date 2013-06-21 Author Charles Bouveyron and Camille Brunet Package FisherEM February 19, 2015 Maintainer Camille Brunet

More information

Generalized least squares (GLS) estimates of the level-2 coefficients,

Generalized least squares (GLS) estimates of the level-2 coefficients, Contents 1 Conceptual and Statistical Background for Two-Level Models...7 1.1 The general two-level model... 7 1.1.1 Level-1 model... 8 1.1.2 Level-2 model... 8 1.2 Parameter estimation... 9 1.3 Empirical

More information

Package mrm. December 27, 2016

Package mrm. December 27, 2016 Type Package Package mrm December 27, 2016 Title An R Package for Conditional Maximum Likelihood Estimation in Mixed Rasch Models Version 1.1.6 Date 2016-12-23 Author David Preinerstorfer Maintainer David

More information

CARTWARE Documentation

CARTWARE Documentation CARTWARE Documentation CARTWARE is a collection of R functions written for Classification and Regression Tree (CART) Analysis of ecological data sets. All of these functions make use of existing R functions

More information

3 Types of Gradient Descent Algorithms for Small & Large Data Sets

3 Types of Gradient Descent Algorithms for Small & Large Data Sets 3 Types of Gradient Descent Algorithms for Small & Large Data Sets Introduction Gradient Descent Algorithm (GD) is an iterative algorithm to find a Global Minimum of an objective function (cost function)

More information

Package bgeva. May 19, 2017

Package bgeva. May 19, 2017 Version 0.3-1 Package bgeva May 19, 2017 Author Giampiero Marra, Raffaella Calabrese and Silvia Angela Osmetti Maintainer Giampiero Marra Title Binary Generalized Extreme Value

More information

Package acebayes. R topics documented: November 21, Type Package

Package acebayes. R topics documented: November 21, Type Package Type Package Package acebayes November 21, 2018 Title Optimal Bayesian Experimental Design using the ACE Algorithm Version 1.5.2 Date 2018-11-21 Author Antony M. Overstall, David C. Woods & Maria Adamou

More information

Package mtsdi. January 23, 2018

Package mtsdi. January 23, 2018 Version 0.3.5 Date 2018-01-02 Package mtsdi January 23, 2018 Author Washington Junger and Antonio Ponce de Leon Maintainer Washington Junger

More information

Learning Alignments from Latent Space Structures

Learning Alignments from Latent Space Structures Learning Alignments from Latent Space Structures Ieva Kazlauskaite Department of Computer Science University of Bath, UK i.kazlauskaite@bath.ac.uk Carl Henrik Ek Faculty of Engineering University of Bristol,

More information

run ld50 /* Plot the onserved proportions and the fitted curve */ DATA SETR1 SET SETR1 PROB=X1/(X1+X2) /* Use this to create graphs in Windows */ gopt

run ld50 /* Plot the onserved proportions and the fitted curve */ DATA SETR1 SET SETR1 PROB=X1/(X1+X2) /* Use this to create graphs in Windows */ gopt /* This program is stored as bliss.sas */ /* This program uses PROC LOGISTIC in SAS to fit models with logistic, probit, and complimentary log-log link functions to the beetle mortality data collected

More information

A tutorial for blockcluster R package Version 1.01

A tutorial for blockcluster R package Version 1.01 A tutorial for blockcluster R package Version 1.01 Parmeet Singh Bhatia INRIA-Lille, parmeet.bhatia@inria.fr Contents 1 Introduction 1 2 Package details 2 2.1 cocluster function....................................

More information

Machine Learning. Topic 5: Linear Discriminants. Bryan Pardo, EECS 349 Machine Learning, 2013

Machine Learning. Topic 5: Linear Discriminants. Bryan Pardo, EECS 349 Machine Learning, 2013 Machine Learning Topic 5: Linear Discriminants Bryan Pardo, EECS 349 Machine Learning, 2013 Thanks to Mark Cartwright for his extensive contributions to these slides Thanks to Alpaydin, Bishop, and Duda/Hart/Stork

More information

Logistic Regression. May 28, Decision boundary is a property of the hypothesis and not the data set e z. g(z) = g(z) 0.

Logistic Regression. May 28, Decision boundary is a property of the hypothesis and not the data set e z. g(z) = g(z) 0. Logistic Regression May 28, 202 Logistic Regression. Decision Boundary Decision boundary is a property of the hypothesis and not the data set. sigmoid function: h (x) = g( x) = P (y = x; ) suppose predict

More information

Week 3: Perceptron and Multi-layer Perceptron

Week 3: Perceptron and Multi-layer Perceptron Week 3: Perceptron and Multi-layer Perceptron Phong Le, Willem Zuidema November 12, 2013 Last week we studied two famous biological neuron models, Fitzhugh-Nagumo model and Izhikevich model. This week,

More information

Gradient LASSO algoithm

Gradient LASSO algoithm Gradient LASSO algoithm Yongdai Kim Seoul National University, Korea jointly with Yuwon Kim University of Minnesota, USA and Jinseog Kim Statistical Research Center for Complex Systems, Korea Contents

More information

Package extweibquant

Package extweibquant Type Package Package extweibquant February 19, 2015 Title Estimate Lower Extreme Quantile with the Censored Weibull MLE and Censored Weibull Mixture Version 1.1 Date 2014-12-03 Author Yang (Seagle) Liu

More information

D-Optimal Designs. Chapter 888. Introduction. D-Optimal Design Overview

D-Optimal Designs. Chapter 888. Introduction. D-Optimal Design Overview Chapter 888 Introduction This procedure generates D-optimal designs for multi-factor experiments with both quantitative and qualitative factors. The factors can have a mixed number of levels. For example,

More information

As a reference, please find a version of the Machine Learning Process described in the diagram below.

As a reference, please find a version of the Machine Learning Process described in the diagram below. PREDICTION OVERVIEW In this experiment, two of the Project PEACH datasets will be used to predict the reaction of a user to atmospheric factors. This experiment represents the first iteration of the Machine

More information

The Mplus modelling framework

The Mplus modelling framework The Mplus modelling framework Continuous variables Categorical variables 1 Mplus syntax structure TITLE: a title for the analysis (not part of the syntax) DATA: (required) information about the data set

More information

Package vbmp. March 3, 2018

Package vbmp. March 3, 2018 Type Package Package vbmp March 3, 2018 Title Variational Bayesian Multinomial Probit Regression Version 1.47.0 Author Nicola Lama , Mark Girolami Maintainer

More information

PRI Workshop Introduction to AMOS

PRI Workshop Introduction to AMOS PRI Workshop Introduction to AMOS Krissy Zeiser Pennsylvania State University klz24@pop.psu.edu 2-pm /3/2008 Setting up the Dataset Missing values should be recoded in another program (preferably with

More information

Estimation of Item Response Models

Estimation of Item Response Models Estimation of Item Response Models Lecture #5 ICPSR Item Response Theory Workshop Lecture #5: 1of 39 The Big Picture of Estimation ESTIMATOR = Maximum Likelihood; Mplus Any questions? answers Lecture #5:

More information

Package palmtree. January 16, 2018

Package palmtree. January 16, 2018 Package palmtree January 16, 2018 Title Partially Additive (Generalized) Linear Model Trees Date 2018-01-15 Version 0.9-0 Description This is an implementation of model-based trees with global model parameters

More information

LCA Stata Plugin Users' Guide Version 1.2.1

LCA Stata Plugin Users' Guide Version 1.2.1 LCA Stata Plugin Users' Guide Version 1.2.1 Stephanie T. Lanza John J. Dziak Liying Huang Aaron T. Wagner Linda M. Collins Penn State 2018, The Pennsylvania State University Please send questions and comments

More information

Chapter 15 Mixed Models. Chapter Table of Contents. Introduction Split Plot Experiment Clustered Data References...

Chapter 15 Mixed Models. Chapter Table of Contents. Introduction Split Plot Experiment Clustered Data References... Chapter 15 Mixed Models Chapter Table of Contents Introduction...309 Split Plot Experiment...311 Clustered Data...320 References...326 308 Chapter 15. Mixed Models Chapter 15 Mixed Models Introduction

More information

c. Typically results in an intractably large set of test cases even for small programs

c. Typically results in an intractably large set of test cases even for small programs Multiple-Choice Questions: 1. True or false? Generally, in practice, developers exhaustively test software. a. True b. False 2. True or false? All real software contains bugs. a. True b. False 3. Which

More information

Chapter 3. Speech segmentation. 3.1 Preprocessing

Chapter 3. Speech segmentation. 3.1 Preprocessing , as done in this dissertation, refers to the process of determining the boundaries between phonemes in the speech signal. No higher-level lexical information is used to accomplish this. This chapter presents

More information

Didacticiel - Études de cas

Didacticiel - Études de cas Subject In some circumstances, the goal of the supervised learning is not to classify examples but rather to organize them in order to point up the most interesting individuals. For instance, in the direct

More information

Missing Data Analysis for the Employee Dataset

Missing Data Analysis for the Employee Dataset Missing Data Analysis for the Employee Dataset 67% of the observations have missing values! Modeling Setup Random Variables: Y i =(Y i1,...,y ip ) 0 =(Y i,obs, Y i,miss ) 0 R i =(R i1,...,r ip ) 0 ( 1

More information

Applied Bayesian Nonparametrics 5. Spatial Models via Gaussian Processes, not MRFs Tutorial at CVPR 2012 Erik Sudderth Brown University

Applied Bayesian Nonparametrics 5. Spatial Models via Gaussian Processes, not MRFs Tutorial at CVPR 2012 Erik Sudderth Brown University Applied Bayesian Nonparametrics 5. Spatial Models via Gaussian Processes, not MRFs Tutorial at CVPR 2012 Erik Sudderth Brown University NIPS 2008: E. Sudderth & M. Jordan, Shared Segmentation of Natural

More information

K-Means Clustering 3/3/17

K-Means Clustering 3/3/17 K-Means Clustering 3/3/17 Unsupervised Learning We have a collection of unlabeled data points. We want to find underlying structure in the data. Examples: Identify groups of similar data points. Clustering

More information

The dblcens Package. August 7, Title Compute the NPMLE of distribution from doubly censored data. d d011ch... 4.

The dblcens Package. August 7, Title Compute the NPMLE of distribution from doubly censored data. d d011ch... 4. The dblcens Package August 7, 2005 Title Compute the NPMLE of distribution from doubly censored data Version 1.1.3 Author Mai Zhou, Li Lee, Kun Chen. Description Use EM algorithm to compute the NPMLE of

More information

IBL and clustering. Relationship of IBL with CBR

IBL and clustering. Relationship of IBL with CBR IBL and clustering Distance based methods IBL and knn Clustering Distance based and hierarchical Probability-based Expectation Maximization (EM) Relationship of IBL with CBR + uses previously processed

More information

Louis Fourrier Fabien Gaie Thomas Rolf

Louis Fourrier Fabien Gaie Thomas Rolf CS 229 Stay Alert! The Ford Challenge Louis Fourrier Fabien Gaie Thomas Rolf Louis Fourrier Fabien Gaie Thomas Rolf 1. Problem description a. Goal Our final project is a recent Kaggle competition submitted

More information

Package LVGP. November 14, 2018

Package LVGP. November 14, 2018 Type Package Package LVGP November 14, 2018 Title Latent Variable Gaussian Process Modeling with Qualitative and Quantitative Input Variables Version 2.1.4 Author Siyu Tao, Yichi Zhang Maintainer Siyu

More information

Package binomlogit. February 19, 2015

Package binomlogit. February 19, 2015 Type Package Title Efficient MCMC for Binomial Logit Models Version 1.2 Date 2014-03-12 Author Agnes Fussl Maintainer Agnes Fussl Package binomlogit February 19, 2015 Description The R package

More information

Discrete Choice Modeling William Greene Stern School of Business, New York University. Lab Session 3

Discrete Choice Modeling William Greene Stern School of Business, New York University. Lab Session 3 Discrete Choice Modeling William Greene Stern School of Business, New York University Lab Session 3 Part I. iate and Multivariate This exercise uses the data file panelprobit.lpj Some preliminaries after

More information

Package bdots. March 12, 2018

Package bdots. March 12, 2018 Type Package Title Bootstrapped Differences of Time Series Version 0.1.19 Date 2018-03-05 Package bdots March 12, 2018 Author Michael Seedorff, Jacob Oleson, Grant Brown, Joseph Cavanaugh, and Bob McMurray

More information

Package milr. June 8, 2017

Package milr. June 8, 2017 Type Package Package milr June 8, 2017 Title Multiple-Instance Logistic Regression with LASSO Penalty Version 0.3.0 Date 2017-06-05 The multiple instance data set consists of many independent subjects

More information

Machine Learning and Data Mining. Clustering (1): Basics. Kalev Kask

Machine Learning and Data Mining. Clustering (1): Basics. Kalev Kask Machine Learning and Data Mining Clustering (1): Basics Kalev Kask Unsupervised learning Supervised learning Predict target value ( y ) given features ( x ) Unsupervised learning Understand patterns of

More information

Analytical model A structure and process for analyzing a dataset. For example, a decision tree is a model for the classification of a dataset.

Analytical model A structure and process for analyzing a dataset. For example, a decision tree is a model for the classification of a dataset. Glossary of data mining terms: Accuracy Accuracy is an important factor in assessing the success of data mining. When applied to data, accuracy refers to the rate of correct values in the data. When applied

More information

The biglm Package. August 25, bigglm... 1 biglm Index 5. Bounded memory linear regression

The biglm Package. August 25, bigglm... 1 biglm Index 5. Bounded memory linear regression The biglm Package August 25, 2006 Type Package Title bounded memory linear and generalized linear models Version 0.4 Date 2005-09-24 Author Thomas Lumley Maintainer Thomas Lumley

More information

The cmprsk Package. January 1, 2007

The cmprsk Package. January 1, 2007 The cmprsk Package January 1, 2007 Version 2.1-7 Date 2006-dec-24 Title Subdistribution Analysis of Competing Risks Author Bob Gray Maintainer Bob Gray

More information

Approximation methods and quadrature points in PROC NLMIXED: A simulation study using structured latent curve models

Approximation methods and quadrature points in PROC NLMIXED: A simulation study using structured latent curve models Approximation methods and quadrature points in PROC NLMIXED: A simulation study using structured latent curve models Nathan Smith ABSTRACT Shelley A. Blozis, Ph.D. University of California Davis Davis,

More information

Pseudo-Likelihood Estimation of Log-Linear by Linear Association Models to Really Large Tables Using SAS

Pseudo-Likelihood Estimation of Log-Linear by Linear Association Models to Really Large Tables Using SAS Pseudo-Likelihood Estimation of Log-Linear by Linear Association Models to Really Large Tables Using SAS Carolyn J. Anderson University of Illinois at Urbana-Champaign March 22, 2007 Abstract This document

More information

1 Training/Validation/Testing

1 Training/Validation/Testing CPSC 340 Final (Fall 2015) Name: Student Number: Please enter your information above, turn off cellphones, space yourselves out throughout the room, and wait until the official start of the exam to begin.

More information

Homework #4 Programming Assignment Due: 11:59 pm, November 4, 2018

Homework #4 Programming Assignment Due: 11:59 pm, November 4, 2018 CSCI 567, Fall 18 Haipeng Luo Homework #4 Programming Assignment Due: 11:59 pm, ovember 4, 2018 General instructions Your repository will have now a directory P4/. Please do not change the name of this

More information

Package DRaWR. February 5, 2016

Package DRaWR. February 5, 2016 Title Discriminative Random Walk with Restart Version 1.0.1 Author Charles Blatti [aut, cre] Package DRaWR February 5, 2016 Maintainer Charles Blatti We present DRaWR, a network-based

More information

Recitation Supplement: Creating a Neural Network for Classification SAS EM December 2, 2002

Recitation Supplement: Creating a Neural Network for Classification SAS EM December 2, 2002 Recitation Supplement: Creating a Neural Network for Classification SAS EM December 2, 2002 Introduction Neural networks are flexible nonlinear models that can be used for regression and classification

More information

Package VarReg. April 24, 2018

Package VarReg. April 24, 2018 Type Package Title Semi-Parametric Variance Regression Version 1.0.2 Package VarReg April 24, 2018 Maintainer Kristy Robledo Methods for fitting semi-parametric mean and variance

More information

Exponential Random Graph Models for Social Networks

Exponential Random Graph Models for Social Networks Exponential Random Graph Models for Social Networks ERGM Introduction Martina Morris Departments of Sociology, Statistics University of Washington Departments of Sociology, Statistics, and EECS, and Institute

More information