Introduction to Mplus

Similar documents
CHAPTER 1 INTRODUCTION

CHAPTER 7 EXAMPLES: MIXTURE MODELING WITH CROSS- SECTIONAL DATA

CHAPTER 11 EXAMPLES: MISSING DATA MODELING AND BAYESIAN ANALYSIS

PSY 9556B (Jan8) Design Issues and Missing Data Continued Examples of Simulations for Projects

Acknowledgments. Acronyms

Generalized least squares (GLS) estimates of the level-2 coefficients,

A Beginner's Guide to. Randall E. Schumacker. The University of Alabama. Richard G. Lomax. The Ohio State University. Routledge

Using Mplus Monte Carlo Simulations In Practice: A Note On Non-Normal Missing Data In Latent Variable Models

CHAPTER 13 EXAMPLES: SPECIAL FEATURES

CHAPTER 18 OUTPUT, SAVEDATA, AND PLOT COMMANDS

Correctly Compute Complex Samples Statistics

Study Guide. Module 1. Key Terms

The Mplus modelling framework

Handbook of Statistical Modeling for the Social and Behavioral Sciences

PSY 9556B (Feb 5) Latent Growth Modeling

Correctly Compute Complex Samples Statistics

An Introduction to Growth Curve Analysis using Structural Equation Modeling

3.6 Sample code: yrbs_data <- read.spss("yrbs07.sav",to.data.frame=true)

Monte Carlo 1. Appendix A

ANNOUNCING THE RELEASE OF LISREL VERSION BACKGROUND 2 COMBINING LISREL AND PRELIS FUNCTIONALITY 2 FIML FOR ORDINAL AND CONTINUOUS VARIABLES 3

MPLUS Analysis Examples Replication Chapter 10

Ronald H. Heck 1 EDEP 606 (F2015): Multivariate Methods rev. November 16, 2015 The University of Hawai i at Mānoa

JMP Book Descriptions

CHAPTER 5. BASIC STEPS FOR MODEL DEVELOPMENT

Analysis of Complex Survey Data with SAS

Missing Data Missing Data Methods in ML Multiple Imputation

Binary IFA-IRT Models in Mplus version 7.11

STATISTICS (STAT) Statistics (STAT) 1

Supplementary Notes on Multiple Imputation. Stephen du Toit and Gerhard Mels Scientific Software International

About Me. Mplus: A Tutorial. Today. Contacting Me. Today. Today. Hands-on training. Intermediate functions

USER S GUIDE LATENT GOLD 4.0. Innovations. Statistical. Jeroen K. Vermunt & Jay Magidson. Thinking outside the brackets! TM

Mplus: A Tutorial. About Me

Performance of Latent Growth Curve Models with Binary Variables

LISREL 10.1 RELEASE NOTES 2 1 BACKGROUND 2 2 MULTIPLE GROUP ANALYSES USING A SINGLE DATA FILE 2

PRI Workshop Introduction to AMOS

**************************************************************************************************************** * Single Wave Analyses.

CLAREMONT MCKENNA COLLEGE. Fletcher Jones Student Peer to Peer Technology Training Program. Basic Statistics using Stata

Missing Data Techniques

Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242

SAS Structural Equation Modeling 1.3 for JMP

Minitab 18 Feature List

Ludwig Fahrmeir Gerhard Tute. Statistical odelling Based on Generalized Linear Model. íecond Edition. . Springer

An introduction to SPSS

STATISTICS (STAT) 200 Level Courses. 300 Level Courses. Statistics (STAT) 1

SPSS Modules Features

STATISTICS (STAT) 200 Level Courses Registration Restrictions: STAT 250: Required Prerequisites: not Schedule Type: Mason Core: STAT 346:

Predict Outcomes and Reveal Relationships in Categorical Data

book 2014/5/6 15:21 page v #3 List of figures List of tables Preface to the second edition Preface to the first edition

Technical Support Minitab Version Student Free technical support for eligible products

Minitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D.

Missing Data and Imputation

MINITAB Release Comparison Chart Release 14, Release 13, and Student Versions

Latent Curve Models. A Structural Equation Perspective WILEY- INTERSCIENΠKENNETH A. BOLLEN

JMP 10 Student Edition Quick Guide

Scatterplot: The Bridge from Correlation to Regression

Blimp User s Guide. Version 1.0. Brian T. Keller. Craig K. Enders.

Statistical Methods for the Analysis of Repeated Measurements

Multiple Imputation for Multilevel Models with Missing Data Using Stat-JR

Missing Data Analysis for the Employee Dataset

Multiple Imputation for Missing Data. Benjamin Cooper, MPH Public Health Data & Training Center Institute for Public Health

Dealing with Categorical Data Types in a Designed Experiment

Statistical Matching using Fractional Imputation

Learn What s New. Statistical Software

Also, for all analyses, two other files are produced upon program completion.

Instructions for Using ABCalc James Alan Fox Northeastern University Updated: August 2009

Quick Start Guide Jacob Stolk PhD Simone Stolk MPH November 2018

IBM SPSS Statistics Traditional License packages and features

SAS High-Performance Analytics Products

Excel 2010 with XLSTAT

REALCOM-IMPUTE: multiple imputation using MLwin. Modified September Harvey Goldstein, Centre for Multilevel Modelling, University of Bristol

Resources for statistical assistance. Quantitative covariates and regression analysis. Methods for predicting continuous outcomes.

Supplementary Material. 4) Mplus input code to estimate the latent transition analysis model

Random Number Generation and Monte Carlo Methods

Last updated January 4, 2012

Big Data Methods. Chapter 5: Machine learning. Big Data Methods, Chapter 5, Slide 1

Estimating DCMs Using Mplus. Chapter 9 Example Data

CHAPTER 16 ANALYSIS COMMAND

Using Amos For Structural Equation Modeling In Market Research

Further Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables

Design of Experiments

Estimation of Item Response Models

Introduction. About this Document. What is SPSS. ohow to get SPSS. oopening Data

Data analysis using Microsoft Excel

STA 570 Spring Lecture 5 Tuesday, Feb 1

SOS3003 Applied data analysis for social science Lecture note Erling Berge Department of sociology and political science NTNU.

1. Estimation equations for strip transect sampling, using notation consistent with that used to

Hierarchical Generalized Linear Models

Part I, Chapters 4 & 5. Data Tables and Data Analysis Statistics and Figures

Basic Medical Statistics Course

Analytical model A structure and process for analyzing a dataset. For example, a decision tree is a model for the classification of a dataset.

Introduction to Hierarchical Linear Model. Hsueh-Sheng Wu CFDR Workshop Series January 30, 2017

Missing Data. SPIDA 2012 Part 6 Mixed Models with R:

HLM versus SEM Perspectives on Growth Curve Modeling. Hsueh-Sheng Wu CFDR Workshop Series August 3, 2015

Categorical Data in a Designed Experiment Part 2: Sizing with a Binary Response

GETTING STARTED WITH THE STUDENT EDITION OF LISREL 8.51 FOR WINDOWS

STATISTICS FOR PSYCHOLOGISTS

Missing Data Analysis with SPSS

Frequently Asked Questions Updated 2006 (TRIM version 3.51) PREPARING DATA & RUNNING TRIM

lavaan: an R package for structural equation modeling

Basic Medical Statistics Course

Transcription:

Introduction to Mplus May 12, 2010 SPONSORED BY: Research Data Centre Population and Life Course Studies PLCS Interdisciplinary Development Initiative Piotr Wilk piotr.wilk@schulich.uwo.ca

OVERVIEW Mplus modeling framework Mplus language Examples of research using Mplus

WHAT IS MPLUS? Statistical modeling program for structural equation modeling... and more Extremely flexible modeling framework: Multiple types of data formats (variables) Multiple types of statistical models (relationships) Code-based path-centric specification

MODELING FRAMEWORK

MODELING FRAMEWORK Describes structure of the (rectangles and circles) Describes relationships between variables (arrows) Acknowledges complex data structures: Multilevel and multiple population data

DATA STRUCTURE Observe variables (rectangles) Latent variables (circles) Combinations

OBSERVED OUTCOME VARIABLES Continuous (y) Categorical (u): Censored Binary Ordered categorical (ordinal) Unordered categorical (nominal) Counts Combinations: Single model (y with u)

LATANT VARIABLES Continuous latent variables (f): Continuous indicators Categorical indicators Categorical latent variables (c) Measurement model Group membership

RELATIONSHIPS: OBSERVED VARIABLES Linear regression for continuous outcomes Probit or logistic regression for binary outcomes Poisson or zero-inflated regression for count outcomes Simultaneous modeling of several related relationships: Path analysis

RELATIONSHIPS: LATANT VARIABLES Continuous latent variables: Structural equation modeling Categorical latent variables: Mixture modeling Latent class analysis Latent variable interactions

RELATIONSHIPS: ALL VARIABLES Ability to combine all types of variables and all types of relationships into a single analytical framework

MODELING POSSIBILITIES: EXAMPLES Complex survey data Multiple group analysis Multilevel modeling Mixture modeling Latent class analysis Longitudinal data analysis Modeling with missing data Monte Carlo simulations And more

COMPLEX SURVEY DATA Adjustment of standard errors: Takes into account stratification and/or nonindependence of observations Unequal probabilities of selection (sampling weights) Multilevel framework: Specify a separate model for each level of the multilevel data Both approaches can be combined

MULTILEVEL MODELING Multilevel models separate the overall variance into two sources: Within (individual-level variation) Between (group-level variation) Allows random intercepts and random slopes Random effects can be specified for any relationship

MIXTURE MODELING Modeling with categorical latent variables Represent subpopulations where population membership not known but inferred from the data

LATENT CLASS ANALYSIS A special case of mixture modeling Explains relationships among observed dependent variables Provides classification of individuals into more homogenous sub-groups

LONGITUDINAL DATA ANALYSIS Broad class of statistical methods for longitudinal data Latent growth curve analysis Resembles classic confirmatory factor analysis Multilevel modeling

MODELING WITH MISSING DATA Several options for estimating models with missing data Estimation based on two assumptions: Missing completely at random Missing at random Non-ignorable missing data modeling: Categorical outcomes as indicators of missingness Generates and analyzes multiple data sets using multiple imputation Computes bootstrapped standard errors

MONTE CARLO SIMULATIONS Extensive Monte Carlo facilities for data generation and data analysis Generates several types of data based on specified parameters Can be used for power analysis Other Monte Carlo features: Saving generated data and parameter estimates Analytical results from each replication can be saved in an external file

OTHER USEFUL FEATURES Indirect effects (specific paths) Bootstrap standard errors and confidence intervals Robust estimation of standard errors and chi-square tests for model fit And more

COMMAND STRUCTURE Mplus is a command-based program There are nine sets of Mplus commands: TITLE: DATA: VARIABLE: DEFINE: ANALYSIS: MODEL: OUTPUT: SAVEDATA: PLOT: MONTECARLO:

GENERAL RULES All commands must begin on a new line and must be followed by a colon (:) Some commands have numerous subcommands Semicolons (;) separate subcommands Individual lines of code cannot exceed 80 characters Not case sensitive (only variable names are case sensitive) Exclamation mark in front (!) serves as a comment character

TITLE COMMAND Specifies a title that will be printed on each page of the output file No limit on length

DATA COMMAND Specifies where the data file is located and the format of the data Records may be in free format or fixed format Accepts covariance or correlation matrices Data files from other statistical packages have to be converted: SAS and SPSS: fixed format ASCII file STATA: stata2mplus function

DEFINE COMMAND Allows for transformation and creation of new variables Supports a large number of transformation functions Allows for conditional statements Selection of observations

ANALYSIS COMMAND Specifies analysis type(s) and estimation procedure Many estimation options are available Some analyses require additional commands

MODEL COMMAND: OVERVIEW Specifies the parameters of the model Models are built in terms of relationships between variables: Variable RELATIONSHIP Variable

MODEL COMMAND: RELATIONSHIPS BY keyword ("measured by"): Define the latent variables ON keyword ( regressed on ): Structural path between variables WITH keyword ( correlated with ): correlation between two variables

Variances: MODEL COMMAND: PARAMETERS Variable name without brackets [Means] or thresholds [catvar$1]: Variable name inside square brackets {Scale factors}: Variable name in curly brackets

OUTPUT COMMAND Specifies optional outputs to be generated Mplus creates an output file using the extension.out (text file) Specific elements of output can be included or suppressed

SAVEDATA COMMAND Determines what to save in new text files Analysis dependent outputs Datasets Parameter estimates Latent class memberships Cook s distances or influence statistics

PLOT COMMAND Provides graphical displays of observed data and results: Histograms / scatterplots Individual observed and estimated values Sample and estimated means and proportions/probabilities Available for: Total sample By group / class Adjusted for covariates Editing and exporting of plots

DEFAULTS The command language is set up with defaults to minimize the amount of text Version specific defaults Example: Missing data Mplus assumes that there are no missing values or that FIML estimation (missing values are missing at random) Listwise deletion must be specified under the DATA command

SUMMARY: PROS Many great features not available in other packages Ability to combine various data types Path-centric specification: Relatively intuitive and easy to learn Extensions to larger models are easy to implement Commitment to development Excellent support

SUMMARY: CONS Cost: Mplus is a commercial package Annual fee: Support and updates Matrix specification is not supported No data management beyond Monte Carlo capabilities, transformations, and selection of observations

ADDITIONAL RESOURCES Technical and theoretical support: Homepage: www.statmodel.com Discussion forum: www.statmodel.com/cgibin/discus/discus.cgi Online manuals and tutorials Other websites: http://www.ats.ucla.edu/stat/mplus/

MPLUS COMMERCIAL VERSION Current version: 6.0 (new!) Base Program: 595 USD Mixture "add-on": 745 USD Multilevel "add-on": 745 USD Combination "add-on": 895 USD

MPLUS DEMO VERSION Free version of the software www.statmodel.com/demo.shtml Limit on the number of variables 2 independent variables 6 dependent variables

CONCLUSION Advantages and disadvantage of using only one program Each program has strengths and weaknesses Use the correct one for the problem at hand

EXAMPLES Path analysis (3.11) Structural equation modeling (511) Latent growth curve analysis Quadratic growth (6.9) Paralleled processes (6.13) Mixture modeling (7.1) Advanced models Latent class growth curves analysis Complier average causal effect

PATH ANALYSIS

PATH ANALYSIS TITLE: Path analysis with continuous dependent variables DATA: FILE IS ex3.11.dat; VARIABLE: NAMES ARE y1-y3 x1-x3; MODEL: y1 y2 ON x1 x2 x3; y3 ON y1 y2 x2;

STRUCTURAL EQUATION MODEL

STRUCTURAL EQUATION MODEL TITLE: SEM with continuous indicators DATA: FILE IS ex5.11.dat; VARIABLE: NAMES ARE y1-y12; MODEL: f1 BY y1-y3; f2 BY y4-y6; f3 BY y7-y9; f4 BY y10-y12; f4 ON f3; f3 ON f1 f2;

LATENT GROWTH MODEL

LATENT GROWTH MODEL TITLE: Quadratic growth model DATA: FILE IS ex6.9.dat; VARIABLE: NAMES ARE y11-y14; MODEL: i s q y11@0 y12@1 y13@2 y14@3; PLOT: Type is Plot3; Series = y11 (0) y12 (1) y13 (2) y14 (3);

LATENT GROWTH MODEL

LATENT GROWTH MODEL TITLE: Growth model for two parallel processes DATA: FILE IS ex6.13.dat; VARIABLE: NAMES ARE y11- y24; MODEL: i1 s1 y11@0 y12@1 y13@2 y14@3; i2 s2 y21@0 y22@1 y23@2 y24@3; s1 ON i2; s2 ON i1;

MIXTURE MODEL

MIXTURE MODEL TITLE: Mixture regression analysis DATA: FILE IS ex7.1.dat; VARIABLE: NAMES ARE y x1 x2; CLASSES = c (2); ANALYSIS: TYPE = MIXTURE; MODEL: %OVERALL% y ON x1 x2; c ON x1; %c#2% y ON x2; y;

GROWTH MIXTURE MODEL

COMPLIER AVERAGE CAUSAL EFFECT Outcome Compliance Covariates Treatment