Nonparametric Mixed-Effects Models for Longitudinal Data

Similar documents
Analyzing Longitudinal Data Using Regression Splines

davidr Cornell University

Non-Parametric and Semi-Parametric Methods for Longitudinal Data

Nonparametric regression using kernel and spline methods

1D Regression. i.i.d. with mean 0. Univariate Linear Regression: fit by least squares. Minimize: to get. The set of all possible functions is...

Nonparametric Risk Attribution for Factor Models of Portfolios. October 3, 2017 Kellie Ottoboni

Generalized Additive Models

Nonparametric Regression Methods for Longitudinal Data Analysis

Last time... Bias-Variance decomposition. This week

Improved smoothing spline regression by combining estimates of dierent smoothness

Generalized Additive Model

Ludwig Fahrmeir Gerhard Tute. Statistical odelling Based on Generalized Linear Model. íecond Edition. . Springer

Linear Penalized Spline Model Estimation Using Ranked Set Sampling Technique

NONPARAMETRIC REGRESSION WIT MEASUREMENT ERROR: SOME RECENT PR David Ruppert Cornell University

Smoothing parameterselection forsmoothing splines: a simulation study

Nonparametric Approaches to Regression

Moving Beyond Linearity

Incorporating Geospatial Data in House Price Indexes: A Hedonic Imputation Approach with Splines. Robert J. Hill and Michael Scholz

Median and Extreme Ranked Set Sampling for penalized spline estimation

The linear mixed model: modeling hierarchical and longitudinal data

Curve fitting using linear models

Knot-Placement to Avoid Over Fitting in B-Spline Scedastic Smoothing. Hirokazu Yanagihara* and Megu Ohtaki**

P-spline ANOVA-type interaction models for spatio-temporal smoothing

Lecture 17: Smoothing splines, Local Regression, and GAMs

STAT 705 Introduction to generalized additive models

Functional Data Analysis

Nonparametric and Semiparametric Econometrics Lecture Notes for Econ 221. Yixiao Sun Department of Economics, University of California, San Diego

CH9.Generalized Additive Model

Linear penalized spline model estimation using ranked set sampling technique

Lecture 16: High-dimensional regression, non-linear regression

Soft Threshold Estimation for Varying{coecient Models 2 ations of certain basis functions (e.g. wavelets). These functions are assumed to be smooth an

Lecture 24: Generalized Additive Models Stat 704: Data Analysis I, Fall 2010

Smoothing Dissimilarities for Cluster Analysis: Binary Data and Functional Data

Generalized additive models I

Feature Subset Selection for Logistic Regression via Mixed Integer Optimization

Doubly Cyclic Smoothing Splines and Analysis of Seasonal Daily Pattern of CO2 Concentration in Antarctica

Statistics & Analysis. Fitting Generalized Additive Models with the GAM Procedure in SAS 9.2

Nonparametric and Semiparametric Linear Mixed Models

Package freeknotsplines

Lecture 27, April 24, Reading: See class website. Nonparametric regression and kernel smoothing. Structured sparse additive models (GroupSpAM)

Package ICsurv. February 19, 2015

Package lmesplines. R topics documented: February 20, Version

NONPARAMETRIC REGRESSION SPLINES FOR GENERALIZED LINEAR MODELS IN THE PRESENCE OF MEASUREMENT ERROR

100 Myung Hwan Na log-hazard function. The discussion section of Abrahamowicz, et al.(1992) contains a good review of many of the papers on the use of

What is machine learning?

Computational Physics PHYS 420

Goals of the Lecture. SOC6078 Advanced Statistics: 9. Generalized Additive Models. Limitations of the Multiple Nonparametric Models (2)

Smoothing Scatterplots Using Penalized Splines

Nonparametric Smoothing of Yield Curves

Interactive Graphics. Lecture 9: Introduction to Spline Curves. Interactive Graphics Lecture 9: Slide 1

Splines. Patrick Breheny. November 20. Introduction Regression splines (parametric) Smoothing splines (nonparametric)

A Graphical Analysis of Simultaneously Choosing the Bandwidth and Mixing Parameter for Semiparametric Regression Techniques

Additive hedonic regression models for the Austrian housing market ERES Conference, Edinburgh, June

Optimal designs for comparing curves

Linear Mixed Model Robust Regression

arxiv: v1 [stat.me] 2 Jun 2017

Estimating Curves and Derivatives with Parametric Penalized Spline Smoothing

LOESS curve fitted to a population sampled from a sine wave with uniform noise added. The LOESS curve approximates the original sine wave.

Independent Components Analysis through Product Density Estimation

Smoothing and Forecasting Mortality Rates with P-splines. Iain Currie. Data and problem. Plan of talk

Theoretical and Practical Aspects of Penalized Spline Smoothing

Smoothing-splines Mixed-effects Models in R using the sme Package: a Tutorial

Model Assessment and Selection. Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer

Missing Data Analysis for the Employee Dataset

Analysis of Panel Data. Third Edition. Cheng Hsiao University of Southern California CAMBRIDGE UNIVERSITY PRESS

Nonparametric Estimation of Distribution Function using Bezier Curve

Handling Sparse and Missing Data in Functional Data Analysis: A Functional Mixed-Effects Model Approach. Kimberly L. Ward

Latent Curve Models. A Structural Equation Perspective WILEY- INTERSCIENΠKENNETH A. BOLLEN

Lecture 13: Model selection and regularization

AN ADDITIVE BIVARIATE HIERARCHICAL MODEL FOR FUNCTIONAL DATA AND RELATED COMPUTATIONS. A Dissertation ANDREW MIDDLETON REDD

Applied Statistics : Practical 9

Splines and penalized regression

Principal component models for sparse functional data

Bayesian Time-Stratified-Petersen estimators for abundance. Sampling Protocol. Simon J. Bonner (UBC) Carl James Schwarz (SFU)

Comment. J. L. FRENCH, E. E. KAMMANN, and M. F? WAND. a: 1 = -{lly -X~-Z~U~~'+AU~Z~U). sion involves minimization of

Machine Learning and Data Mining. Clustering (1): Basics. Kalev Kask

Linear Methods for Regression and Shrinkage Methods

REPLACING MLE WITH BAYESIAN SHRINKAGE CAS ANNUAL MEETING NOVEMBER 2018 GARY G. VENTER

BASIC LOESS, PBSPLINE & SPLINE

Resampling Methods. Levi Waldron, CUNY School of Public Health. July 13, 2016

9.1 Random coefficients models Constructed data Consumer preference mapping of carrots... 10

Nonparametric Methods Recap

This is called a linear basis expansion, and h m is the mth basis function For example if X is one-dimensional: f (X) = β 0 + β 1 X + β 2 X 2, or

Package robustgam. January 7, 2013

Spline Models. Introduction to CS and NCS. Regression splines. Smoothing splines

Preface to the Second Edition. Preface to the First Edition. 1 Introduction 1

A Bayesian approach to detect time-specific group differences between nonlinear temporal curves

Assessing the Quality of the Natural Cubic Spline Approximation

Nonparametric Regression

PSY 9556B (Feb 5) Latent Growth Modeling

Penalized Spline Model-Based Estimation of the Finite Populations Total from Probability-Proportional-to-Size Samples

Lecture 7: Splines and Generalized Additive Models

Package robustgam. February 20, 2015

Package GAMBoost. February 19, 2015

ME 261: Numerical Analysis Lecture-12: Numerical Interpolation

Nonparametric Regression and Generalized Additive Models Part I

Extending the GLM. Outline. Mixed effects motivation Evaluating mixed effects methods Three methods. Conclusions. Overview

Machine Learning. Topic 4: Linear Regression Models

Multivariable Regression Modelling

CS 229 Midterm Review

Transcription:

Nonparametric Mixed-Effects Models for Longitudinal Data Zhang Jin-Ting Dept of Stat & Appl Prob National University of Sinagpore University of Seoul, South Korea, 7 p.1/26

OUTLINE The Motivating Data Various Parametric/Nonparametric ME Models Various Fitting Approaches Smoothing Parameter Selection Real Data Application Other ME Models University of Seoul, South Korea, 7 p.2/26

The Motivating Data The ACTG 388 Data (Park and Wu 4) 1 Raw Curves 1 1 8 CD4 count 6 Response CD4 cell counts Covariate Time 2 4 6 8 1 12 Week patients. Total Number of Observations University of Seoul, South Korea, 7 p.3/26

The Motivating Data Six Selected Subjects 6 Subj 8 6 Subj 11 2 4 6 8 1 12 Subj 21 6 2 4 6 8 1 12 Subj 36 6 2 4 6 8 1 12 Subj 39 6 2 4 6 8 1 12 Subj 59 6 2 4 6 8 1 12 CD4 count 2 4 6 8 1 12 Week University of Seoul, South Korea, 7 p.4/26

The Motivating Data Pointwise Means 1 Pointwise Raw Means 1 1 8 6 CD4 count 6 2 4 6 8 1 12 Week University of Seoul, South Korea, 7 p.5/26

The Motivating Data For the ACTG 388 data, the following Population-Mean ME Model is proper: GP where : measurement errors for -th measurement of -th subject : smooth fixed-effects function : smooth random-effects function of -th subject : individual function of -th subject Aim: Estimate, and ; Predict and University of Seoul, South Korea, 7 p.6/26

Various Parametric/Nonparametric ME Models Parametric Mixed-Effects Models In classical longitudinal data analysis, parametric mixed-effects models are often used. Linear Mixed-effects Models: i.i.d : Parametric fixed-effects : Parametric random-effects : Measurement errors University of Seoul, South Korea, 7 p.7/26

Various Parametric/Nonparametric ME Models Nonlinear Mixed-effects Models: i.i.d where is some known nonlinear function. See Davidian and Giltinan (1995), Vonesh and Chinchilli (1996), among others. University of Seoul, South Korea, 7 p.8/26

Various Parametric/Nonparametric ME Models Advantages: May take many covariates into account Easy to fit and analyze via EM algorithm Methodologies well developed Disadvantages: Need valid parametric assumptions May lead to misleading conclusions Not robust against model misspecification University of Seoul, South Korea, 7 p.9/26

Various Parametric/Nonparametric ME Models Nonparametric Mixed-Effects Models Recently, various nonparametric mixed-effects models are proposed. Population-Mean ME Model GP GP : nonparametric fixed-effects function : nonparametric random-effects function : Measurement error process See Zhang et al. (1998), Rice and Wu (1), Wu and Zhang (2, 6) among others University of Seoul, South Korea, 7 p.1/26

Various Parametric/Nonparametric ME Models Varying Coefficient ME Model GP GP where is the vector of some unknown smooth coefficient functions. See Wu and Zhang (6) University of Seoul, South Korea, 7 p.11/26

Various Parametric/Nonparametric ME Models Random Coefficient ME Model GP GP -th random coefficient function. is the where the University of Seoul, South Korea, 7 p.12/26

Various Parametric/Nonparametric ME Models Advantages: Flexible to fit longitudinal data Robust against model misspecification Disadvantages: Only involve a few covariates May computationally intensive Methodologies under developing See Wu and Zhang (6) and the references therein. University of Seoul, South Korea, 7 p.13/26

Various Fitting Approaches In the above nonparametric ME models, the nonparametric components such as should be fitted using some smoothing technique. The major smoothing techniques include Regression Spline Method (Eubank, 1999) and Smoothing Spline Method (Wahba, 199, Green and Silverman 1994) Penalized Spline Method (Ruppert, Wand and Carroll, 3) Local Polynomial Method (Fan and Gijbels, 1996) To adopt the above smoothing methods, we shall use the Population Mean ME model as an example. University of Seoul, South Korea, 7 p.14/26

Various Fitting Approaches respectively. the The Regression Spline Method Key Ideas: Approximate the nonparametric FE component by a regression spline, and approximate the nonaparametric RE components by regression splines and are two regression spline bases of dimensions and Then Population-Mean ME model can be approximated by where where and. This is a Standard Linear Mixed-effects (LME) Model. See Rice and Wu (1), Wu and Zhang (6, Chapter 5) for more details. University of Seoul, South Korea, 7 p.15/26

Various Fitting Approaches The Smoothing Spline Method Key Ideas: Take into account the roughness of and simultaneously via introducing roughness penalty. For example, for the cubic smoothing spline method, it is to find and to minimize the following criterion: Loglik where Loglik is the log-likelihood function evaluated at the design time points, and are the associated smoothing parameters. See Brumback and Rice (1998), Wu and Zhang (6, Chapter 6) for more details. University of Seoul, South Korea, 7 p.16/26

Various Fitting Approaches The Penalized Spline Method Key Ideas: Take into account the roughness of approximating and by regression splines penalizing the associated coefficients. That is to find following criterion: and and simultaneously via and, and to minimize the Loglik where Loglik is the log-likelihood function evaluated at the design time points and based on the regression spline approximations, and are the associated smoothing parameters. See Wu and Zhang (6, Chapter 7) for more details. University of Seoul, South Korea, 7 p.17/26

Various Fitting Approaches The Local Polynomial Method Key Ideas: At any fixed time point, the Population-Mean model can be approx- imated by a standard LME model via approximating and by polynomials of some order. The resulting LME can be fitted by the existing approaches for LME models. See Wu and Zhang (2, 6, Chapter 4) for more details. University of Seoul, South Korea, 7 p.18/26

Smoothing Parameter Selection Mixed Effects Fits Let the above methods and for the Population Mean Model, the Fixed-Effects Fits at can be expressed as be all the design time points. For where is the smoother matrix for, and the Random Effects Fits at is where the smoother matrix for all the random-effects evaluated at the design time points. University of Seoul, South Korea, 7 p.19/26

Smoothing Parameter Selection Goodness of Fit and Model Complexity Smoothing Parameter Selection attempts to trade off Goodness of Fit and Model Complexity. Goodness of Fit can be measured by the Log-likelihood: Loglik Const Cov (1) The larger the Loglik, the better Goodness of Fit of the modeling, indicating that the data are fitted very closely by the model. Model Complexity can be measured by the Trace of Smoother Matrix Model Complexity for Fixed-Effects df tr, indicating how complicate of the model is for fitting the fixed effects. Model Comlexity for Random-Effects df tr, indicating how complicate of the model is for fitting the random effects. University of Seoul, South Korea, 7 p.2/26

Smoothing Parameter Selection AIC and BIC A Criterion should trade off Goodness of Fit and Model Complexity for Fixed-effects and Random-effects. For example, for the Population Mean Model and for the regression spline method, we shall define AIC Loglik df df, BIC Loglik df df, where and are the number of knots for and. For the smoothing spline and penalized spline methods, and should be replaced by and respectively. University of Seoul, South Korea, 7 p.21/26

Real Data Application AIC and BIC for ACTG 388 Data 4.465 x (a) AIC 14 4.46 4.455 4.45 4.445 4.44 4.435 Value 4.72 x (b) BIC 14 4.7 4.68 4.66 4.64 K v =1 K v =2 K v =3 4.43 2 4 6 8 1 4.62 2 4 6 8 1 K seems a good choice University of Seoul, South Korea, 7 p.22/26

Real Data Application Overall Fits: CD4 count 1 8 6 (a) Fitted individual functions 5 1 Week CD4 count (b) Fitted mean function with ± 2 SD 45 35 3 25 15 5 1 Week x 1 4 (c) Fitted covariance function (d) Fitted correlation function 5 1 Covariance 4 3 Correlation.9.8 2 1 Week 1 5 Week 15.7 1 Week 1 5 Week 15 University of Seoul, South Korea, 7 p.23/26

Real Data Application Individual Fits: Subj 8 6 5 3 5 1 Subj 21 3 5 1 Subj 39 3 1 5 1 CD4 count 6 5 3 3 1 3 1 Subj 11 5 1 Subj 36 5 1 Subj 59 raw data population individual 5 1 Week University of Seoul, South Korea, 7 p.24/26

Other ME Models The Methodologies proposed above can be applied to other ME Models, e.g.: Semiparametric ME Models: GP Generalized Nonparametric ME Model: is known. where Other ME models.. University of Seoul, South Korea, 7 p.25/26

End of the Talk Thank You University of Seoul, South Korea, 7 p.26/26