Health Disparities (HD): It s just about comparing two groups

Similar documents
HDs with 1-on-1 Matching

schooling.log 7/5/2006

May 24, Emil Coman 1 Yinghui Duan 2 Daren Anderson 3

Linear regression Number of obs = 6,866 F(16, 326) = Prob > F = R-squared = Root MSE =

optimization_machine_probit_bush106.c

Week 4: Simple Linear Regression II

Panel Data 4: Fixed Effects vs Random Effects Models

Week 4: Simple Linear Regression III

Week 5: Multiple Linear Regression II

Bivariate (Simple) Regression Analysis

Week 11: Interpretation plus

Week 10: Heteroskedasticity II

range: [1,20] units: 1 unique values: 20 missing.: 0/20 percentiles: 10% 25% 50% 75% 90%

PubHlth 640 Intermediate Biostatistics Unit 2 - Regression and Correlation. Simple Linear Regression Software: Stata v 10.1

Stata Training. AGRODEP Technical Note 08. April Manuel Barron and Pia Basurto

Page 1. Notes: MB allocated to data 2. Stata running in batch mode. . do 2-simpower-varests.do. . capture log close. .

Introduction to Mixed Models: Multivariate Regression

Introduction to Hierarchical Linear Model. Hsueh-Sheng Wu CFDR Workshop Series January 30, 2017

Introduction to STATA 6.0 ECONOMICS 626

. predict mod1. graph mod1 ed, connect(l) xlabel ylabel l1(model1 predicted income) b1(years of education)

Instrumental variables, bootstrapping, and generalized linear models

An Introduction to Growth Curve Analysis using Structural Equation Modeling

Week 9: Modeling II. Marcelo Coca Perraillon. Health Services Research Methods I HSMP University of Colorado Anschutz Medical Campus

SAS Structural Equation Modeling 1.3 for JMP

Instrumental Variable Regression

Stata versions 12 & 13 Week 4 Practice Problems

Two-Stage Least Squares

Show how the LG-Syntax can be generated from a GUI model. Modify the LG-Equations to specify a different LC regression model

Study Guide. Module 1. Key Terms

CHAPTER 7 EXAMPLES: MIXTURE MODELING WITH CROSS- SECTIONAL DATA

texdoc 2.0 An update on creating LaTeX documents from within Stata Example 2

Package endogenous. October 29, 2016

Creating LaTeX and HTML documents from within Stata using texdoc and webdoc. Example 2

Heteroskedasticity and Homoskedasticity, and Homoskedasticity-Only Standard Errors

SOCY7706: Longitudinal Data Analysis Instructor: Natasha Sarkisian. Panel Data Analysis: Fixed Effects Models

INTRODUCTION TO PANEL DATA ANALYSIS

Correctly Compute Complex Samples Statistics

Correctly Compute Complex Samples Statistics

piecewise ginireg 1 Piecewise Gini Regressions in Stata Jan Ditzen 1 Shlomo Yitzhaki 2 September 8, 2017

050 0 N 03 BECABCDDDBDBCDBDBCDADDBACACBCCBAACEDEDBACBECCDDCEA

Conditional and Unconditional Regression with No Measurement Error

Introduction to DAGs Directed Acyclic Graphs

Dr. Barbara Morgan Quantitative Methods

Analysis of Complex Survey Data with SAS

Computing Murphy Topel-corrected variances in a heckprobit model with endogeneity

An introduction to SPSS

Binary IFA-IRT Models in Mplus version 7.11

Introduction to Mplus

Big Data Methods. Chapter 5: Machine learning. Big Data Methods, Chapter 5, Slide 1

050 0 N 03 BECABCDDDBDBCDBDBCDADDBACACBCCBAACEDEDBACBECCDDCEA

An Introductory Guide to Stata

International Graduate School of Genetic and Molecular Epidemiology (GAME) Computing Notes and Introduction to Stata

Laboratory for Two-Way ANOVA: Interactions

I Launching and Exiting Stata. Stata will ask you if you would like to check for updates. Update now or later, your choice.

Estimation and Comparison of Receiver Operating Characteristic Curves

Statistics Lab #7 ANOVA Part 2 & ANCOVA

Statistical Analysis of List Experiments

Stata Session 2. Tarjei Havnes. University of Oslo. Statistics Norway. ECON 4136, UiO, 2012

CHAPTER 5. BASIC STEPS FOR MODEL DEVELOPMENT

Stat 500 lab notes c Philip M. Dixon, Week 10: Autocorrelated errors

Introduction to Stata Toy Program #1 Basic Descriptives

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski

Empirical Asset Pricing

Conducting a Path Analysis With SPSS/AMOS

Linear Model Selection and Regularization. especially usefull in high dimensions p>>100.

PRI Workshop Introduction to AMOS

Using Mplus Monte Carlo Simulations In Practice: A Note On Non-Normal Missing Data In Latent Variable Models

Missing Data Part II: Multiple Imputation Richard Williams, University of Notre Dame, Last revised January 24, 2015

Chapter 1. Looking at Data-Distribution

CHAPTER 1 INTRODUCTION

BIOSTATISTICS LABORATORY PART 1: INTRODUCTION TO DATA ANALYIS WITH STATA: EXPLORING AND SUMMARIZING DATA

ZunZun.com. User-Selectable Polynomial. Sat Jan 14 09:49: local server time

Resampling Methods. Levi Waldron, CUNY School of Public Health. July 13, 2016

THE LINEAR PROBABILITY MODEL: USING LEAST SQUARES TO ESTIMATE A REGRESSION EQUATION WITH A DICHOTOMOUS DEPENDENT VARIABLE

How fixed mobile usage interact? Does FTTH deployment. influence the interaction?

Stat 5100 Handout #14.a SAS: Logistic Regression

Description Remarks and examples References Also see

Hierarchical Generalized Linear Models

After opening Stata for the first time: set scheme s1mono, permanently

ECON Introductory Econometrics Seminar 4

HLM versus SEM Perspectives on Growth Curve Modeling. Hsueh-Sheng Wu CFDR Workshop Series August 3, 2015

SAS data statements and data: /*Factor A: angle Factor B: geometry Factor C: speed*/

Cluster Randomization Create Cluster Means Dataset

Review of Stata II AERC Training Workshop Nairobi, May 2002

25 Working with categorical data and factor variables

Introduction to Statistical Analyses in SAS

Section 4 Matching Estimator

Product Catalog. AcaStat. Software

Generalized least squares (GLS) estimates of the level-2 coefficients,

An Econometric Study: The Cost of Mobile Broadband

Package MatchIt. April 18, 2017

Lab 2: OLS regression

MISSING DATA AND MULTIPLE IMPUTATION

Package mcemglm. November 29, 2015

(R) / / / / / / / / / / / / Statistics/Data Analysis

Statistical Matching using Fractional Imputation

Minitab detailed

Cut Out The Cut And Paste: SAS Macros For Presenting Statistical Output ABSTRACT INTRODUCTION

ST512. Fall Quarter, Exam 1. Directions: Answer questions as directed. Please show work. For true/false questions, circle either true or false.

Regression. Dr. G. Bharadwaja Kumar VIT Chennai

Transcription:

A review of modern methods of estimating the size of health disparities May 24, 2017 Emil Coman 1 Helen Wu 2 1 UConn Health Disparities Institute, 2 UConn Health Modern Modeling conference, May 22-24, 2017 1 Health Disparities (HD): It s just about comparing two groups Goals 1. Simplify and reposition common analytic methods 2. Compare methods to estimate HDs 3. Suggest cross-pollinations 4. Encourage disparities investigations Modern Modeling conference, May 22-24, 2017 2 1

One intuitive model of Health Disparities Lu, M. C., & Halfon, N. (2003). Racial and ethnic disparities in birth outcomes: a life-course perspective. Maternal & Child Health Journal, 7(1), 13-30. Modern Modeling conference, May 22-24, 2017 3 One intuitive model of Health Disparities HD Modern Modeling conference, May 22-24, 2017 4 2

Another intuitive model of HDs Tony Iton Framework for Health Equity, referenced in the text: Tackling Health Inequities Through Public Health Practice: Theory to Action. Edited by Richard Hofrichter and Rajiv Bhatia. Based on a project by the National Association of County and City Health Officials. Oxford University Press, 2010 (page 380). Iton, A., & Shrimali, B. P. (2016). Power, Politics, and Health: A New Public Health Practice Targeting the Root Causes of Health Equity. Maternal and Child Health Journal, 20(8), 1753-1758. doi:10.1007/s10995-016-1980-6 Modern Modeling conference, May 22-24, 2017 5 A testable model of HDs Black vs. White HD Others Anxiety Caveats: 1. Causality: can race cause health? 2. Compare the comparable 3. Control for covariates VanderWeele TJ, & Hernán MA. (2012). Causal effects and natural laws: towards a conceptualization of causal counterfactuals for non-manipulable exposures with application to the effects of race and sex. In Berzuini C, Dawid P, & Bernardinelli L (Eds.), Causality: Statistical Perspectives and Applications (pp. 101 113). West Sussex, UK John Wiley & Sons. VanderWeele, T. J., & Robinson, W. R. (2014). On the causal interpretation of race in regressions adjusting for confounding and mediating variables. Epidemiology, 25(4), 473-484. Modern Modeling conference, May 22-24, 2017 6 3

1. Independent samples t-test 2. Anova 3. Regression 4. Instrumental Variable regression 5. SEM 6. Matching methods 7. + Modern Modeling conference, May 22-24, 2017 7 1. Independent samples T- test is a 2-group model Coman, E. N., Suggs, L. S., Iordache, E., Coman, M. A., & Fifield, J. (2015). A Review of Graphical Approaches to Common Statistical Analyses. The Omnipresence of Latent Variables in Statistics International Journal of Clinical Biostatistics and Biometrics, 1(1), 1-9. Modern Modeling conference, May 22-24, 2017 8 4

1. Independent samples T-test is a 2-group model ttest y, by(binary) pwmean y, over(binary) effects 2. Anova similar for 2 groups anova y binary Allows for covariates though anova y binary c.x1 c.x2 Coman, E. N., Suggs, L. S., Iordache, E., Coman, M. A., & Fifield, J. (2015). A Review of Graphical Approaches to Common Statistical Analyses. The Omnipresence of Latent Variables in Statistics International Journal of Clinical Biostatistics and Biometrics, 1(1), 1-9. Modern Modeling conference, May 22-24, 2017 9 3. Regression X 1 reg y binary x1 x2 Binary X 2 But:? Can race cause health? VanderWeele TJ, & Hernán MA. (2012). Causal effects and natural laws: towards a conceptualization of causal counterfactuals for non-manipulable exposures with application to the effects of race and sex. In Berzuini C, Dawid P, & Bernardinelli L (Eds.), Causality: Statistical Perspectives and Applications (pp. 101 113). West Sussex, UK John Wiley & Sons. VanderWeele, T. J., & Robinson, W. R. (2014). On the causal interpretation of race in regressions adjusting for confounding and mediating variables. Epidemiology, 25(4), 473-484. Modern Modeling conference, May 22-24, 2017 10 5

4. IV regression [Cameron] Binary correlated with u Binary u Need IV->X but not IV-> ivregress 2sls y (binary = x1 x2), first Tests the IVs: estat endogenous X 1 X 2 Binary u Modern Modeling conference, May 22-24, 2017 11 5. SEM 2-group model, compare intercepts (needs centering) X 1 X 2 Binary u α Blacks sem y <- x1 x2, group(binary) ginvariant(covex) X 1 X 2 Binary Test u α Whites Can relax the equal variances assumption Modern Modeling conference, May 22-24, 2017 12 6

6. Matching options teffects psmatch (y) (binary x1 x2 ) Stage 1 X 1 X 2 Binary psmatch2 binary x1 x2, outcome (y) common ate attnd y binary x1 x2 Stage 2 Stage 3 Match in strata on Binary Strata1 αblacks Strata2 αblacks Strata3 αblacks Strata1 α Whites Modern Modeling conference, May 22-24, 2017 α Whites 13 1 2 3 Strata2 Strata3 α Whites 1. T-test 2. Anova identical Means (+/- 2 SE) of the anxiety baseline measure for White and Black women (t-test) 20 19.1 17.1 M - 2SE Mean M + 2SE 16.4 15 15.1 14.7 Allows for covariates though anova y binary c.x1 c.x2 13.0 10 White Black Modern Modeling conference, May 22-24, 2017 14 7

T-test Anova with covariates Means (+/- 2 SE) of the anxiety baseline measure for White and Black women (t-test) 20 M - 2SE 19.1 Mean M + 2SE 17.1 16.4 15 15.1 14.7 13.0 25 20 15 Means (+/- 2 SE) of the anxiety baseline measure for White and Black women(anova with covariates) M - 2SE Mean 20.7 M + 2SE 17.7 14.8 16.9 14.6 12.4 10 White Black 10 White Black Modern Modeling conference, May 22-24, 2017 15 Anova with covariates SEM with correlates 25 20 15 Means (+/- 2 SE) of the anxiety baseline measure for White and Black women(anova with covariates) M - 2SE Mean 20.7 M + 2SE 17.7 14.8 16.9 14.6 12.4 25 20 15 Means (+/- 2 SE) of the anxiety baseline measure for White and Black women (SEM correlated) 17.0 M - 2SE Mean M + 2SE 15.1 Note: variances assumed equal. 10 White Black 10 1 2 Modern Modeling conference, May 22-24, 2017 16 8

SEM with covariates [0=White; 1=Black]; variances freed sem y <- x1 x2, group(binary) ginvariant(covex) OIM Coef. Std. Err. z P> z [95% Conf. Interval] -------------+---------------------------------------------------------------- Structural y <- x1 0.0199704 2.16228 0.01 0.993-4.21802 4.257961 1 2.61248 1.673379 1.56 0.118 -.6672832 5.892244 x2 0.164855.3766247 0.44 0.662 -.5733159.9030258 1.4905938.3001652 1.63 0.102 -.0977191 1.078907 _cons 0 13.43964 8.259071 1.63 0.104-2.747845 29.62712 1 3.422483 6.895766 0.50 0.620-10.09297 16.93794 -------------+---------------------------------------------------------------- var(e.y) 0 60.78133 13.59112 39.21351 94.21166 1 84.67747 14.74046 60.19947 119.1086 ------means centered at.050 (X1 neighborhood disorder) and 22.91y (X2=age)------- Modern Modeling conference, May 22-24, 2017 17 teffects psmatch. teffects psmatch (y) (binary x1 x2 ) Treatment-effects estimation Number of obs = 106 Estimator : propensity-score matching Matches: requested = 1 Outcome model : matching min = 1 Treatment model: logit max = 1 ------------------------------------------------------------------------------ AI Robust y Coef. Std. Err. z P> z [95% Conf. Interval] -------------+---------------------------------------------------------------- ATE binary (1 vs 0) -3.169811 2.228166-1.42 0.155-7.536936 1.197313 ------------------------------------------------------------------------------ Modern Modeling conference, May 22-24, 2017 18 9

psmatch2 psmatch2 binary x1 x2, outcome (y) common ate Probit regression Number of obs = 106 LR chi2(2) = 12.08 Prob > chi2 = 0.0024 Log likelihood = -64.210562 Pseudo R2 = 0.0860 binary Coef. Std. Err. z P> z [95% Conf. Interval] -------------+---------------------------------------------------------------- x1.6428318.2054421 3.13 0.002.2401726 1.045491 x2.0468275.0366658 1.28 0.202 -.0250361.1186911 _cons -.7222502.8205248-0.88 0.379-2.330449.8859489 psmatch2: psmatch2: Common Treatment support assignment Off suppo On suppor Total -----------+----------------------+---------- Untreated 0 40 40 Treated 11 55 66 -----------+----------------------+---------- Total 11 95 106 Modern Modeling conference, May 22-24, 2017 19 psmatch2 psmatch2 binary x1 x2, outcome (y) common ate Probit regression Number of obs = 106 Variable Sample Treated Controls Difference S.E. T-stat ----------------------------+----------------------------------------------------------- y Unmatched 15.0757576 17.025-1.94924242 1.8091337-1.08 ATT 13.8 18.9272727-5.12727273 2.19360704-2.34 ATU 17.025 12.5-4.525.. ATE -4.87368421.. ----------------------------+----------------------------------------------------------- Modern Modeling conference, May 22-24, 2017 20 10

attnd attnd y binary x1 x2, comsup boot reps(100) dots logit detail Note: the common support option has been selected The region of common support is [.30252502,.89642859] The distribution of the pscore is Pr(binary) ------------------------------------------------------------- Percentiles Smallest 1%.3173758.302525 5%.3990624.3173758 10%.4397769.3197376 Obs 141 25%.4943597.3332007 Sum of Wgt. 141 50%.6010811 Mean.6028369 Largest Std. Dev..1374292 75%.7131706.854728 90%.7813184.8586264 Variance.0188868 95%.8165104.8644204 Skewness -.0160976 99%.8644204.8964286 Kurtosis 2.2178 The program is searching the nearest neighbor of each treated unit. This operation may take a while. Modern Modeling conference, May 22-24, 2017 21 attnd attnd y binary x1 x2, comsup boot reps(100) dots logit detail ATT estimation with Nearest Neighbor Matching method (random draw version) Analytical standard errors --------------------------------------------------------- n. treat. n. contr. ATT Std. Err. t --------------------------------------------------------- 85 23-0.524 2.008-0.261 --------------------------------------------------------- Modern Modeling conference, May 22-24, 2017 22 11

ivregress ivregress 2sls y (binary = x1 x2 ), first First-stage regressions ----------------------- Number of obs = 106 F( 2, 103) = 6.05 Prob > F = 0.0033 R-squared = 0.1051 Adj R-squared = 0.0878 Root MSE = 0.4652 ------------------------------------------------------------------------------ binary Coef. Std. Err. t P> t [95% Conf. Interval] -------------+---------------------------------------------------------------- x1.2176514.0672757 3.24 0.002.0842259.3510769 x2.0148822.0124655 1.19 0.235 -.0098403.0396046 _cons.2805711.281939 1.00 0.322 -.2785883.8397305 ------------------------------------------------------------------------------ Modern Modeling conference, May 22-24, 2017 23 ivregress ivregress 2sls y (binary = x1 x2 ), first Instrumental variables (2SLS) regression Number of obs = 106 Wald chi2(1) = 1.57 Prob > chi2 = 0.2098 R-squared =. Root MSE = 10.126 ------------------------------------------------------------------------------ y Coef. Std. Err. z P> z [95% Conf. Interval] -------------+---------------------------------------------------------------- binary 7.847662 6.257221 1.25 0.210-4.416267 20.11159 _cons 10.92504 4.018222 2.72 0.007 3.04947 18.80061 ------------------------------------------------------------------------------ Instrumented: binary Instruments: x1 x2 Modern Modeling conference, May 22-24, 2017 24 12

ivregress estat endogenous Tests of endogeneity Ho: variables are exogenous Durbin (score) chi2(1) = 3.51188 (p = 0.0609) Wu-Hausman F(1,103) = 3.52942 (p = 0.0631) If the test statistic is significant, the variables must b e treated as endogenous URL estat overid Tests of overidentifying restrictions: Sargan (score) chi2(1) =.872447 (p = 0.3503) Basmann chi2(1) =.854791 (p = 0.3552) A statistically significant test statistic indicates that the instruments may not be valid URL Modern Modeling conference, May 22-24, 2017 25 Comparisons Comparison of racial/ethnic (R/E) differences (92 White - 145 Black) in Anxiety baseline scores, by estimation method 5 All models used age and neighborhood stress as: covariates (regression); propensity predictors (PSM); instruments for R/E (IV). t-test (n=176) Anova & covariates (n=106) Regression (n=106) PSM teffects (n=106) PSM psmatch2 (55W + 40B) PSM attnd (85W + 23B) Some methods are better than others in Identifying health disparities. SEM is not shown, the test of HD is a Wald (or chi-squared test). 0-5 -2.4-2.0-3.1-3.2-5.1-0.5-10 Mean(-2*SE) Difference Mean(+2*SE) Modern Modeling conference, May 22-24, 2017 26 13

Conclusions email for >: comanus@gmail.com Some methods are better than others in identifying health disparities. Comparisons encourage analysts to think about the place of the variables in the analytical model. One general insight: we can trust estimates when that are somewhat consistent across methods. Modern Modeling conference, May 22-24, 2017 27 14