davidr Cornell University

Similar documents
Nonparametric Mixed-Effects Models for Longitudinal Data

Non-Parametric and Semi-Parametric Methods for Longitudinal Data

NONPARAMETRIC REGRESSION WIT MEASUREMENT ERROR: SOME RECENT PR David Ruppert Cornell University

Generalized Additive Models

Generalized Additive Model

CH9.Generalized Additive Model

GAMs semi-parametric GLMs. Simon Wood Mathematical Sciences, University of Bath, U.K.

Splines. Patrick Breheny. November 20. Introduction Regression splines (parametric) Smoothing splines (nonparametric)

Nonparametric regression using kernel and spline methods

Splines and penalized regression

Statistical Methods for the Analysis of Repeated Measurements

A fast Mixed Model B-splines algorithm

arxiv: v1 [stat.me] 2 Jun 2017

Nonparametric and Semiparametric Econometrics Lecture Notes for Econ 221. Yixiao Sun Department of Economics, University of California, San Diego

1D Regression. i.i.d. with mean 0. Univariate Linear Regression: fit by least squares. Minimize: to get. The set of all possible functions is...

Goals of the Lecture. SOC6078 Advanced Statistics: 9. Generalized Additive Models. Limitations of the Multiple Nonparametric Models (2)

STAT 705 Introduction to generalized additive models

Ludwig Fahrmeir Gerhard Tute. Statistical odelling Based on Generalized Linear Model. íecond Edition. . Springer

Generalized additive models I

Independent Components Analysis through Product Density Estimation

Linear Methods for Regression and Shrinkage Methods

Analyzing Longitudinal Data Using Regression Splines

Statistics (STAT) Statistics (STAT) 1. Prerequisites: grade in C- or higher in STAT 1200 or STAT 1300 or STAT 1400

Spatially Adaptive Bayesian Penalized Regression Splines (P-splines)

What is machine learning?

Nonparametric Estimation of Distribution Function using Bezier Curve

Smoothing Scatterplots Using Penalized Splines

On Kernel Density Estimation with Univariate Application. SILOKO, Israel Uzuazor

Linear Penalized Spline Model Estimation Using Ranked Set Sampling Technique

Statistics & Analysis. A Comparison of PDLREG and GAM Procedures in Measuring Dynamic Effects

Practical 4: Mixed effect models

An Introduction to the Bootstrap

Functional Data Analysis

FMA901F: Machine Learning Lecture 3: Linear Models for Regression. Cristian Sminchisescu

Statistics & Analysis. Fitting Generalized Additive Models with the GAM Procedure in SAS 9.2

Lecture 24: Generalized Additive Models Stat 704: Data Analysis I, Fall 2010

Nonparametric Regression

Use of Extreme Value Statistics in Modeling Biometric Systems

STATISTICS (STAT) Statistics (STAT) 1

Soft Threshold Estimation for Varying{coecient Models 2 ations of certain basis functions (e.g. wavelets). These functions are assumed to be smooth an

Simple fitting of subject-specific curves for longitudinal data

Stat 8053, Fall 2013: Additive Models

Comment. J. L. FRENCH, E. E. KAMMANN, and M. F? WAND. a: 1 = -{lly -X~-Z~U~~'+AU~Z~U). sion involves minimization of

Spline Models. Introduction to CS and NCS. Regression splines. Smoothing splines

Analysis of Panel Data. Third Edition. Cheng Hsiao University of Southern California CAMBRIDGE UNIVERSITY PRESS

PSY 9556B (Feb 5) Latent Growth Modeling

Lecture 27, April 24, Reading: See class website. Nonparametric regression and kernel smoothing. Structured sparse additive models (GroupSpAM)

Free Knot Polynomial Spline Confidence Intervals

Bootstrapping Methods

The theory of the linear model 41. Theorem 2.5. Under the strong assumptions A3 and A5 and the hypothesis that

SAS PROC GLM and PROC MIXED. for Recovering Inter-Effect Information

Functional Modeling of Longitudinal Data with the SSM Procedure

Assessing the Quality of the Natural Cubic Spline Approximation

Approximate Smoothing Spline Methods for Large Data Sets in the Binary Case Dong Xiang, SAS Institute Inc. Grace Wahba, University of Wisconsin at Mad

Package lmesplines. R topics documented: February 20, Version

NONPARAMETRIC REGRESSION SPLINES FOR GENERALIZED LINEAR MODELS IN THE PRESENCE OF MEASUREMENT ERROR

Linear penalized spline model estimation using ranked set sampling technique

Last time... Bias-Variance decomposition. This week

Fitting latency models using B-splines in EPICURE for DOS

Section 4 Matching Estimator

A Method for Comparing Multiple Regression Models

Week 5: Multiple Linear Regression II

Week 4: Simple Linear Regression III

A Fast Clustering Algorithm with Application to Cosmology. Woncheol Jang

Statistical Modeling with Spline Functions Methodology and Theory

Median and Extreme Ranked Set Sampling for penalized spline estimation

Constrained Penalized Splines

A Modified Weibull Distribution

Penalized Spline Model-Based Estimation of the Finite Populations Total from Probability-Proportional-to-Size Samples

Fly wing length data Sokal and Rohlf Box 10.1 Ch13.xls. on chalk board

Generalized Additive Models

Parallel line analysis and relative potency in SoftMax Pro 7 Software

Nonparametric Regression and Generalized Additive Models Part I

ST512. Fall Quarter, Exam 1. Directions: Answer questions as directed. Please show work. For true/false questions, circle either true or false.

Smoothing Dissimilarities for Cluster Analysis: Binary Data and Functional Data

Moving Beyond Linearity

Wildfire Chances and Probabilistic Risk Assessment

Using Multivariate Adaptive Regression Splines (MARS ) to enhance Generalised Linear Models. Inna Kolyshkina PriceWaterhouseCoopers

by John Barnard and Walter T. Federer Cornell University

Statistical Modeling with Spline Functions Methodology and Theory

Notes on Simulations in SAS Studio

Multivariable Regression Modelling

Outline. Topic 16 - Other Remedies. Ridge Regression. Ridge Regression. Ridge Regression. Robust Regression. Regression Trees. Piecewise Linear Model

Estimation and Inference by the Method of Projection Minimum Distance. Òscar Jordà Sharon Kozicki U.C. Davis Bank of Canada

Introduction to Mixed Models: Multivariate Regression

Package gamm4. July 25, Index 10

Missing Data Analysis for the Employee Dataset

Introduction to Statistical Analyses in SAS

Unified Methods for Censored Longitudinal Data and Causality

CPSC 340: Machine Learning and Data Mining. Principal Component Analysis Fall 2016

Doubly Cyclic Smoothing Splines and Analysis of Seasonal Daily Pattern of CO2 Concentration in Antarctica

Inference in mixed models in R - beyond the usual asymptotic likelihood ratio test

Package RLRsim. November 4, 2016

Course Business. types of designs. " Week 4. circle back to if we have time later. ! Two new datasets for class today:

STATISTICS (STAT) 200 Level Courses Registration Restrictions: STAT 250: Required Prerequisites: not Schedule Type: Mason Core: STAT 346:

Smoothing non-stationary noise of the Nigerian Stock Exchange All-Share Index data using variable coefficient functions

Hierarchical Generalized Linear Models

Modeling Criminal Careers as Departures From a Unimodal Population Age-Crime Curve: The Case of Marijuana Use

Homework. Gaussian, Bishop 2.3 Non-parametric, Bishop 2.5 Linear regression Pod-cast lecture on-line. Next lectures:

A Bayesian approach to detect time-specific group differences between nonlinear temporal curves

Transcription:

1 NONPARAMETRIC RANDOM EFFECTS MODELS AND LIKELIHOOD RATIO TESTS Oct 11, 2002 David Ruppert Cornell University www.orie.cornell.edu/ davidr (These transparencies and preprints available link to Recent Talks and Recent Papers ) Work done jointly with Ciprian Crainiceanu, Cornell University

2 OUTLINE Smoothing can be done using standard mixed models software because Splines can be viewed as BLUPs in mixed models This random-effects spline model extends to: Semiparametric models (allows parametric submodels) Longitudinal data nested families of curves

3 EXAMPLE Rick Canfield and Chuck Henderson, Jr. at Cornell are working on effects of low-level lead exposure on IQ of children. They have a mixed model but the dose-response curve should be modeled nonparametrically.

4 EXAMPLE CONT They asked SAS is a PROC GAMMIXED would be available someday. short answer was no Then, they found Matt Wand s work and then contacted me. Now they know that GAMMIXED GLMMIXED. SAS has GAMMIXED and does not know it!

5 TESTING IN THIS FRAMEWORK In principle, likelihood ratio tests (LRTs) could be used to test for effects of interest E.g., hypothesis that a curve is linear or that an effect is zero a variance component (and possibly a fixed effect) is zero allows an elegant, unified theory

6 TESTING CONT However, the distribution theory of LRTs is complex: the null hypothesis is on the boundary of the parameter space, so standard theory suggests chi-squared mixtures as the asympototic distribution. but standard asymptotics do not apply because of correlation for the case of one variance component, we now have asymptotics that do apply

% " % assume they are iid be a spline controls the amount of shrinkage or smoothing. size of! #" will be treated as random effects letting model UNIVARIATE NONPARAMETRIC REGRESSION 7

as a spline: & & & ' ) +* +* as another spline: model the (th subject curve model the population curve & consider the nonparametric model & is th observation on (th subject NONPARAMETRIC MODELS FOR LONGITUDINAL DATA 8

effects somewhat different that usual interpretation of random this assumption can be viewed as a Bayesian model %,.- assume they are iid! #" ( /= population ) will be treated as random effects will be treated as fixed effects Recall: POPULATION CURVE 9

) 10 %,21 assume they are iid! #" ( 3= subject ) +* +* will also be treated as random effects this is a typical random effects assumption assume they are iid! #" 0 ) +* +* will be treated as random effects ) +* +* Recall: SUBJECT CURVES

4 " " %,21 subject effects are %,21 0 " ) +* +* 11 th degree polynomials no subject effects Recall: NULL HYPOTHESES OF INTEREST

12 RELATED WORK Brumback and Rice (1998) Zhang, Lin, Raz, and Sowers (1998) Lin and Zhang (1999) Rice and Wu (2001) See references at end.

5 13 < >= " % null hypothesis: ; #" % and & 6 & ( 7 8and ' model: BALANCE 1-WAY ANOVA 7 :9

BA C D H < J J J BA C D? @ 8 E G? KL under Then the LR test is equivalent to the F-test K. Note: The equivalent fixed effects hypothesis is This is the iid case if we take the subjects as observations (Self and Liang, 1987; Stram and Lee, 1994) FE G? IH H If with9fixed, then 14

N N P BA C D 8 7 7 B C D BA C D 7 B C D U P P 9? @ (Crainiceanu and Ruppert, 2002) K L where K L ; _a` VXW Y and KL ; K _a` V W Y G E G ] N K L N K L ^ R S T VXW Y Z \ and E G 8 MON K L N K L QOR S>T VXW Y[Z \ If with8fixed, then 15

f / g h C D 16 Probability 1-Way ANOVA: bb c ed ] B FE G ^ Number of levels 0.5 0 10 20 30 40 50 60 70 80 90 100 0.55 0.6 0.65 0.7 0.75 0.8 0.85 ML REML 0.9

4 H H H i 17 These theoretical results help explain their findings. is better replaced by 7 4 H for 4 kj i. found some empirical evidence that the i mixture simulated the LRT Pinheiro and Bates (2000, p. 87)

18 QQ plot for χ 2 1 versus the distributions in equation (9), K=3, 5, 20 9 20 5 6.63 3 5.70 3 0 0 3 9 Q0.99 =6.63 1-Way ANOVA: asympt. null dist of RLR, given RLRj0

19 QQ plot for χ 2 1 versus the distributions in equation (10), K=3, 5, 20 9 6.63 20 4.99 3 5 3 0 0 3 9 Q0.99 =6.63 1-Way ANOVA: asympt. null dist of LR, given LRj0

o p m 20 < q = m alternative hypothesis: < >= m L n L n null hypothesis: l model: PENALIZED SPLINES

{ L z ~ x z ~ y r d u 21 } with ] m v ^ w r u x r minimize penalized least squares: r s t notation:

% " " % 22 (Brumback, Ruppert, and Wand, 1999) w and Cov " same as BLUP in a linear mixed model with

n " J J J o " o 23 L or, if j, if new form of null: and % %

ed f / g h C D ed f / g h C D 4 o bb c ] B E G ^ i i not.5 and ] B bb c G E G ^ i ƒ not.5 Then, mean) (constant mean versus piecewise constant 20 equally spaced knots s equally spaced Example: (Crainiceanu and Ruppert, 2002) 24

f / g h C D 25 Probability P-splines:Bb c ed ] B E G ^ Number of knots 0.5 0 10 20 30 40 50 60 70 80 90 100 0.55 0.6 0.65 0.7 0.75 0.8 0.85 0.9 ML REML 0.95 1

26 ORTHOGONALIZATION one can apply Gram-Schmidt to the design matrix power functions are replaced by orthogonal polynomials Plus functions are replaced by spline basis functions that are orthogonal to polynomials The asymptotics of the LRT are changed by this reparametrization

27 Asymptotics are essentially the same as for 1-way ANOVA with 8( # levels) = ( # knots) +1 E.g., 5 levels is like 4 knots

Probability Asymptotic null probabilities that log-lr is zero P-splines Orthogonalized = 1-way ANOVA Number of knots Number of levels 0.5 0 10 20 30 40 50 60 70 80 90 100 0.5 0 10 20 30 40 50 60 70 80 90 100 0.55 0.55 0.6 0.6 0.65 0.8 0.75 0.7 Probability 0.75 0.7 0.65 0.85 0.8 0.9 ML REML 0.85 0.95 ML REML 1 0.9 28

29 Hypotheses: linear trend versus 20-knot linear spline Comparison of finite-sample and asymptotic quantiles Quantiles of the asymptotic distribution (n= ) q 0.66 q 0.95 q 0.99 q 0.995 0 1.74 Quantiles of distributions 4.20 5.32 n=50 n=100 n= 0.5:0.5 mixture

30 Comparison of LRT with other tests Reference: Crainiceanu, Ruppert, Aerts, Claeskens, and Wand (2002, in preparation) Results in next table are for testing constant mean versus general alternative piecewise constant spline, or linear spline

31 The comparisons are made with an increasing, concave, and periodic mean function, chosen so that good tests had power approximately 0.8

32 R-test is from Cantoni and Hastie (2002) F-test is as in Hastie and Tibshirani (1990) C means alternative is a piecewise constant function L means alternative is a linear spline

33 1 means estimate under alternative has DF one greater than under null ML means smoothing parameter under alternative is chosen by ML GCV means smoothing parameter under alternative is chosen by GCV

34 Test Average power Maximum power Minimum Power RLRT-C 0.8885 0.9660 0.8166 R-GCV-L 0.8737 0.9910 0.7188 R-ML-C 0.8615 0.9916 0.7022 F-ML-L 0.8569 0.8796 0.8328 R-ML-L 0.8569 0.8796 0.8328 F-ML-C 0.8534 0.9928 0.6708 F-GCV-L 0.8482 0.9946 0.6634 LRT-L 0.7561 0.8466 0.6832 F-1-C 0.7087 0.8442 0.4816 F-1-L 0.6775 0.9414 0.3012 R-1-L 0.6239 0.9126 0.1462 R-GCV-C 0.6144 0.9284 0.3392

35 Conclusions Standard asymptotics are, in general, not suitable Better asymptotics for one variance component are feasible For more than one variance component, one might need to use simulation to get p-values

36 References Brumback, B., and Rice, J., (1998), Smoothing spline models for the analsysi of nested and crossed samples of curves, JASA, 93, 944 961 Brumback, B., Ruppert, D., and Wand, M.P., (1999). Comment on Variable selection and function estimation in additive nonparametric regression using data-based prior by Shively, Kohn, and Wood, JASA, 94, 794 797. Cantoni, E., and Hastie, T.J., 2002. Degrees of freedom tests for smoothing splines, Biometrika, 89. 251 263. Crainiceanu, C. M., and Ruppert, D., (2002), Asymptotic distribution of likelihood ratio tests in linear mixed models, submitted Crainiceanu, C. M., Ruppert, D., and Vogelsang, T. J., (2002). Probability that the mle of a variance component is zero with applications to likelihood ratio Tests, manuscript

37 Crainiceanu, C. M., Ruppert, D., Aerts, M., Claeskens, G., and Wand, M., (2002), Tests of rolynomial regression against a general alternative, in preparation. Hastie, T.J., Tibshirani, R., 1990. Generalized Additive Models, London: Chapman and Hall. Lin, X., and Zhang, D. (1999), Inference in generalized additive mixed models by using smoothing splines, JRSS-B, 61, 381 400. Pinheiro, J., and Bates, D., (2000), Mixed-Effects Models in S and S-PLUS, Springer, New York. Rice, J., and Wu, C., (2001), Nonparametric mixed effects models for unequally sampled noisy curves, Biometrics, 57, 253 259. Self, S., and Liang, K., (1987). Asymptotic properties of maximum likelihood estimators and likelihood ratio tests under non-standard conditions, JASA, 82, 605 610.

38 Stram, D., Lee, J., (1994). Variance Components Testing in the Longitudinal Mixed Effects Model, Biometrics, 50, 1171 1177. Zhang, D., Lin, X., Raz, J., and Sowers, M. (1998), Semi-parametric stochastic mixed models for longitudinal data, JASA, 93, 710 719