davidr Cornell University

Size: px
Start display at page:

Download "davidr Cornell University"

Transcription

1 1 NONPARAMETRIC RANDOM EFFECTS MODELS AND LIKELIHOOD RATIO TESTS Oct 11, 2002 David Ruppert Cornell University davidr (These transparencies and preprints available link to Recent Talks and Recent Papers ) Work done jointly with Ciprian Crainiceanu, Cornell University

2 2 OUTLINE Smoothing can be done using standard mixed models software because Splines can be viewed as BLUPs in mixed models This random-effects spline model extends to: Semiparametric models (allows parametric submodels) Longitudinal data nested families of curves

3 3 EXAMPLE Rick Canfield and Chuck Henderson, Jr. at Cornell are working on effects of low-level lead exposure on IQ of children. They have a mixed model but the dose-response curve should be modeled nonparametrically.

4 4 EXAMPLE CONT They asked SAS is a PROC GAMMIXED would be available someday. short answer was no Then, they found Matt Wand s work and then contacted me. Now they know that GAMMIXED GLMMIXED. SAS has GAMMIXED and does not know it!

5 5 TESTING IN THIS FRAMEWORK In principle, likelihood ratio tests (LRTs) could be used to test for effects of interest E.g., hypothesis that a curve is linear or that an effect is zero a variance component (and possibly a fixed effect) is zero allows an elegant, unified theory

6 6 TESTING CONT However, the distribution theory of LRTs is complex: the null hypothesis is on the boundary of the parameter space, so standard theory suggests chi-squared mixtures as the asympototic distribution. but standard asymptotics do not apply because of correlation for the case of one variance component, we now have asymptotics that do apply

7 % " % assume they are iid be a spline controls the amount of shrinkage or smoothing. size of! #" will be treated as random effects letting model UNIVARIATE NONPARAMETRIC REGRESSION 7

8 as a spline: & & & ' ) +* +* as another spline: model the (th subject curve model the population curve & consider the nonparametric model & is th observation on (th subject NONPARAMETRIC MODELS FOR LONGITUDINAL DATA 8

9 effects somewhat different that usual interpretation of random this assumption can be viewed as a Bayesian model %,.- assume they are iid! #" ( /= population ) will be treated as random effects will be treated as fixed effects Recall: POPULATION CURVE 9

10 ) 10 %,21 assume they are iid! #" ( 3= subject ) +* +* will also be treated as random effects this is a typical random effects assumption assume they are iid! #" 0 ) +* +* will be treated as random effects ) +* +* Recall: SUBJECT CURVES

11 4 " " %,21 subject effects are %,21 0 " ) +* +* 11 th degree polynomials no subject effects Recall: NULL HYPOTHESES OF INTEREST

12 12 RELATED WORK Brumback and Rice (1998) Zhang, Lin, Raz, and Sowers (1998) Lin and Zhang (1999) Rice and Wu (2001) See references at end.

13 5 13 < >= " % null hypothesis: ; #" % and & 6 & ( 7 8and ' model: BALANCE 1-WAY ANOVA 7 :9

14 BA C D H < J J J BA C 8 E G? KL under Then the LR test is equivalent to the F-test K. Note: The equivalent fixed effects hypothesis is This is the iid case if we take the subjects as observations (Self and Liang, 1987; Stram and Lee, 1994) FE G? IH H If with9fixed, then 14

15 N N P BA C D B C D BA C D 7 B C D U P P (Crainiceanu and Ruppert, 2002) K L where K L ; _a` VXW Y and KL ; K _a` V W Y G E G ] N K L N K L ^ R S T VXW Y Z \ and E G 8 MON K L N K L QOR S>T VXW Y[Z \ If with8fixed, then 15

16 f / g h C D 16 Probability 1-Way ANOVA: bb c ed ] B FE G ^ Number of levels ML REML 0.9

17 4 H H H i 17 These theoretical results help explain their findings. is better replaced by 7 4 H for 4 kj i. found some empirical evidence that the i mixture simulated the LRT Pinheiro and Bates (2000, p. 87)

18 18 QQ plot for χ 2 1 versus the distributions in equation (9), K=3, 5, Q0.99 = Way ANOVA: asympt. null dist of RLR, given RLRj0

19 19 QQ plot for χ 2 1 versus the distributions in equation (10), K=3, 5, Q0.99 = Way ANOVA: asympt. null dist of LR, given LRj0

20 o p m 20 < q = m alternative hypothesis: < >= m L n L n null hypothesis: l model: PENALIZED SPLINES

21 { L z ~ x z ~ y r d u 21 } with ] m v ^ w r u x r minimize penalized least squares: r s t notation:

22 % " " % 22 (Brumback, Ruppert, and Wand, 1999) w and Cov " same as BLUP in a linear mixed model with

23 n " J J J o " o 23 L or, if j, if new form of null: and % %

24 ed f / g h C D ed f / g h C D 4 o bb c ] B E G ^ i i not.5 and ] B bb c G E G ^ i ƒ not.5 Then, mean) (constant mean versus piecewise constant 20 equally spaced knots s equally spaced Example: (Crainiceanu and Ruppert, 2002) 24

25 f / g h C D 25 Probability P-splines:Bb c ed ] B E G ^ Number of knots ML REML

26 26 ORTHOGONALIZATION one can apply Gram-Schmidt to the design matrix power functions are replaced by orthogonal polynomials Plus functions are replaced by spline basis functions that are orthogonal to polynomials The asymptotics of the LRT are changed by this reparametrization

27 27 Asymptotics are essentially the same as for 1-way ANOVA with 8( # levels) = ( # knots) +1 E.g., 5 levels is like 4 knots

28 Probability Asymptotic null probabilities that log-lr is zero P-splines Orthogonalized = 1-way ANOVA Number of knots Number of levels Probability ML REML ML REML

29 29 Hypotheses: linear trend versus 20-knot linear spline Comparison of finite-sample and asymptotic quantiles Quantiles of the asymptotic distribution (n= ) q 0.66 q 0.95 q 0.99 q Quantiles of distributions n=50 n=100 n= 0.5:0.5 mixture

30 30 Comparison of LRT with other tests Reference: Crainiceanu, Ruppert, Aerts, Claeskens, and Wand (2002, in preparation) Results in next table are for testing constant mean versus general alternative piecewise constant spline, or linear spline

31 31 The comparisons are made with an increasing, concave, and periodic mean function, chosen so that good tests had power approximately 0.8

32 32 R-test is from Cantoni and Hastie (2002) F-test is as in Hastie and Tibshirani (1990) C means alternative is a piecewise constant function L means alternative is a linear spline

33 33 1 means estimate under alternative has DF one greater than under null ML means smoothing parameter under alternative is chosen by ML GCV means smoothing parameter under alternative is chosen by GCV

34 34 Test Average power Maximum power Minimum Power RLRT-C R-GCV-L R-ML-C F-ML-L R-ML-L F-ML-C F-GCV-L LRT-L F-1-C F-1-L R-1-L R-GCV-C

35 35 Conclusions Standard asymptotics are, in general, not suitable Better asymptotics for one variance component are feasible For more than one variance component, one might need to use simulation to get p-values

36 36 References Brumback, B., and Rice, J., (1998), Smoothing spline models for the analsysi of nested and crossed samples of curves, JASA, 93, Brumback, B., Ruppert, D., and Wand, M.P., (1999). Comment on Variable selection and function estimation in additive nonparametric regression using data-based prior by Shively, Kohn, and Wood, JASA, 94, Cantoni, E., and Hastie, T.J., Degrees of freedom tests for smoothing splines, Biometrika, Crainiceanu, C. M., and Ruppert, D., (2002), Asymptotic distribution of likelihood ratio tests in linear mixed models, submitted Crainiceanu, C. M., Ruppert, D., and Vogelsang, T. J., (2002). Probability that the mle of a variance component is zero with applications to likelihood ratio Tests, manuscript

37 37 Crainiceanu, C. M., Ruppert, D., Aerts, M., Claeskens, G., and Wand, M., (2002), Tests of rolynomial regression against a general alternative, in preparation. Hastie, T.J., Tibshirani, R., Generalized Additive Models, London: Chapman and Hall. Lin, X., and Zhang, D. (1999), Inference in generalized additive mixed models by using smoothing splines, JRSS-B, 61, Pinheiro, J., and Bates, D., (2000), Mixed-Effects Models in S and S-PLUS, Springer, New York. Rice, J., and Wu, C., (2001), Nonparametric mixed effects models for unequally sampled noisy curves, Biometrics, 57, Self, S., and Liang, K., (1987). Asymptotic properties of maximum likelihood estimators and likelihood ratio tests under non-standard conditions, JASA, 82,

38 38 Stram, D., Lee, J., (1994). Variance Components Testing in the Longitudinal Mixed Effects Model, Biometrics, 50, Zhang, D., Lin, X., Raz, J., and Sowers, M. (1998), Semi-parametric stochastic mixed models for longitudinal data, JASA, 93,

Nonparametric Mixed-Effects Models for Longitudinal Data

Nonparametric Mixed-Effects Models for Longitudinal Data Nonparametric Mixed-Effects Models for Longitudinal Data Zhang Jin-Ting Dept of Stat & Appl Prob National University of Sinagpore University of Seoul, South Korea, 7 p.1/26 OUTLINE The Motivating Data

More information

Non-Parametric and Semi-Parametric Methods for Longitudinal Data

Non-Parametric and Semi-Parametric Methods for Longitudinal Data PART III Non-Parametric and Semi-Parametric Methods for Longitudinal Data CHAPTER 8 Non-parametric and semi-parametric regression methods: Introduction and overview Xihong Lin and Raymond J. Carroll Contents

More information

NONPARAMETRIC REGRESSION WIT MEASUREMENT ERROR: SOME RECENT PR David Ruppert Cornell University

NONPARAMETRIC REGRESSION WIT MEASUREMENT ERROR: SOME RECENT PR David Ruppert Cornell University NONPARAMETRIC REGRESSION WIT MEASUREMENT ERROR: SOME RECENT PR David Ruppert Cornell University www.orie.cornell.edu/ davidr (These transparencies, preprints, and references a link to Recent Talks and

More information

Generalized Additive Models

Generalized Additive Models :p Texts in Statistical Science Generalized Additive Models An Introduction with R Simon N. Wood Contents Preface XV 1 Linear Models 1 1.1 A simple linear model 2 Simple least squares estimation 3 1.1.1

More information

Generalized Additive Model

Generalized Additive Model Generalized Additive Model by Huimin Liu Department of Mathematics and Statistics University of Minnesota Duluth, Duluth, MN 55812 December 2008 Table of Contents Abstract... 2 Chapter 1 Introduction 1.1

More information

CH9.Generalized Additive Model

CH9.Generalized Additive Model CH9.Generalized Additive Model Regression Model For a response variable and predictor variables can be modeled using a mean function as follows: would be a parametric / nonparametric regression or a smoothing

More information

GAMs semi-parametric GLMs. Simon Wood Mathematical Sciences, University of Bath, U.K.

GAMs semi-parametric GLMs. Simon Wood Mathematical Sciences, University of Bath, U.K. GAMs semi-parametric GLMs Simon Wood Mathematical Sciences, University of Bath, U.K. Generalized linear models, GLM 1. A GLM models a univariate response, y i as g{e(y i )} = X i β where y i Exponential

More information

Splines. Patrick Breheny. November 20. Introduction Regression splines (parametric) Smoothing splines (nonparametric)

Splines. Patrick Breheny. November 20. Introduction Regression splines (parametric) Smoothing splines (nonparametric) Splines Patrick Breheny November 20 Patrick Breheny STA 621: Nonparametric Statistics 1/46 Introduction Introduction Problems with polynomial bases We are discussing ways to estimate the regression function

More information

Nonparametric regression using kernel and spline methods

Nonparametric regression using kernel and spline methods Nonparametric regression using kernel and spline methods Jean D. Opsomer F. Jay Breidt March 3, 016 1 The statistical model When applying nonparametric regression methods, the researcher is interested

More information

Splines and penalized regression

Splines and penalized regression Splines and penalized regression November 23 Introduction We are discussing ways to estimate the regression function f, where E(y x) = f(x) One approach is of course to assume that f has a certain shape,

More information

Statistical Methods for the Analysis of Repeated Measurements

Statistical Methods for the Analysis of Repeated Measurements Charles S. Davis Statistical Methods for the Analysis of Repeated Measurements With 20 Illustrations #j Springer Contents Preface List of Tables List of Figures v xv xxiii 1 Introduction 1 1.1 Repeated

More information

A fast Mixed Model B-splines algorithm

A fast Mixed Model B-splines algorithm A fast Mixed Model B-splines algorithm arxiv:1502.04202v1 [stat.co] 14 Feb 2015 Abstract Martin P. Boer Biometris WUR Wageningen The Netherlands martin.boer@wur.nl February 17, 2015 A fast algorithm for

More information

arxiv: v1 [stat.me] 2 Jun 2017

arxiv: v1 [stat.me] 2 Jun 2017 Inference for Penalized Spline Regression: Improving Confidence Intervals by Reducing the Penalty arxiv:1706.00865v1 [stat.me] 2 Jun 2017 Ning Dai School of Statistics University of Minnesota daixx224@umn.edu

More information

Nonparametric and Semiparametric Econometrics Lecture Notes for Econ 221. Yixiao Sun Department of Economics, University of California, San Diego

Nonparametric and Semiparametric Econometrics Lecture Notes for Econ 221. Yixiao Sun Department of Economics, University of California, San Diego Nonparametric and Semiparametric Econometrics Lecture Notes for Econ 221 Yixiao Sun Department of Economics, University of California, San Diego Winter 2007 Contents Preface ix 1 Kernel Smoothing: Density

More information

1D Regression. i.i.d. with mean 0. Univariate Linear Regression: fit by least squares. Minimize: to get. The set of all possible functions is...

1D Regression. i.i.d. with mean 0. Univariate Linear Regression: fit by least squares. Minimize: to get. The set of all possible functions is... 1D Regression i.i.d. with mean 0. Univariate Linear Regression: fit by least squares. Minimize: to get. The set of all possible functions is... 1 Non-linear problems What if the underlying function is

More information

Goals of the Lecture. SOC6078 Advanced Statistics: 9. Generalized Additive Models. Limitations of the Multiple Nonparametric Models (2)

Goals of the Lecture. SOC6078 Advanced Statistics: 9. Generalized Additive Models. Limitations of the Multiple Nonparametric Models (2) SOC6078 Advanced Statistics: 9. Generalized Additive Models Robert Andersen Department of Sociology University of Toronto Goals of the Lecture Introduce Additive Models Explain how they extend from simple

More information

STAT 705 Introduction to generalized additive models

STAT 705 Introduction to generalized additive models STAT 705 Introduction to generalized additive models Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis II 1 / 22 Generalized additive models Consider a linear

More information

Ludwig Fahrmeir Gerhard Tute. Statistical odelling Based on Generalized Linear Model. íecond Edition. . Springer

Ludwig Fahrmeir Gerhard Tute. Statistical odelling Based on Generalized Linear Model. íecond Edition. . Springer Ludwig Fahrmeir Gerhard Tute Statistical odelling Based on Generalized Linear Model íecond Edition. Springer Preface to the Second Edition Preface to the First Edition List of Examples List of Figures

More information

Generalized additive models I

Generalized additive models I I Patrick Breheny October 6 Patrick Breheny BST 764: Applied Statistical Modeling 1/18 Introduction Thus far, we have discussed nonparametric regression involving a single covariate In practice, we often

More information

Independent Components Analysis through Product Density Estimation

Independent Components Analysis through Product Density Estimation Independent Components Analysis through Product Density Estimation 'frevor Hastie and Rob Tibshirani Department of Statistics Stanford University Stanford, CA, 94305 { hastie, tibs } @stat.stanford. edu

More information

Linear Methods for Regression and Shrinkage Methods

Linear Methods for Regression and Shrinkage Methods Linear Methods for Regression and Shrinkage Methods Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer 1 Linear Regression Models Least Squares Input vectors

More information

Analyzing Longitudinal Data Using Regression Splines

Analyzing Longitudinal Data Using Regression Splines Analyzing Longitudinal Data Using Regression Splines Zhang Jin-Ting Dept of Stat & Appl Prob National University of Sinagpore August 18, 6 DSAP, NUS p.1/16 OUTLINE Motivating Longitudinal Data Parametric

More information

Statistics (STAT) Statistics (STAT) 1. Prerequisites: grade in C- or higher in STAT 1200 or STAT 1300 or STAT 1400

Statistics (STAT) Statistics (STAT) 1. Prerequisites: grade in C- or higher in STAT 1200 or STAT 1300 or STAT 1400 Statistics (STAT) 1 Statistics (STAT) STAT 1200: Introductory Statistical Reasoning Statistical concepts for critically evaluation quantitative information. Descriptive statistics, probability, estimation,

More information

Spatially Adaptive Bayesian Penalized Regression Splines (P-splines)

Spatially Adaptive Bayesian Penalized Regression Splines (P-splines) Spatially Adaptive Bayesian Penalized Regression Splines (P-splines) Veerabhadran BALADANDAYUTHAPANI, Bani K. MALLICK, and Raymond J. CARROLL In this article we study penalized regression splines (P-splines),

More information

What is machine learning?

What is machine learning? Machine learning, pattern recognition and statistical data modelling Lecture 12. The last lecture Coryn Bailer-Jones 1 What is machine learning? Data description and interpretation finding simpler relationship

More information

Nonparametric Estimation of Distribution Function using Bezier Curve

Nonparametric Estimation of Distribution Function using Bezier Curve Communications for Statistical Applications and Methods 2014, Vol. 21, No. 1, 105 114 DOI: http://dx.doi.org/10.5351/csam.2014.21.1.105 ISSN 2287-7843 Nonparametric Estimation of Distribution Function

More information

Smoothing Scatterplots Using Penalized Splines

Smoothing Scatterplots Using Penalized Splines Smoothing Scatterplots Using Penalized Splines 1 What do we mean by smoothing? Fitting a "smooth" curve to the data in a scatterplot 2 Why would we want to fit a smooth curve to the data in a scatterplot?

More information

On Kernel Density Estimation with Univariate Application. SILOKO, Israel Uzuazor

On Kernel Density Estimation with Univariate Application. SILOKO, Israel Uzuazor On Kernel Density Estimation with Univariate Application BY SILOKO, Israel Uzuazor Department of Mathematics/ICT, Edo University Iyamho, Edo State, Nigeria. A Seminar Presented at Faculty of Science, Edo

More information

Linear Penalized Spline Model Estimation Using Ranked Set Sampling Technique

Linear Penalized Spline Model Estimation Using Ranked Set Sampling Technique Linear Penalized Spline Model Estimation Using Ranked Set Sampling Technique Al Kadiri M. A. Abstract Benefits of using Ranked Set Sampling (RSS) rather than Simple Random Sampling (SRS) are indeed significant

More information

Statistics & Analysis. A Comparison of PDLREG and GAM Procedures in Measuring Dynamic Effects

Statistics & Analysis. A Comparison of PDLREG and GAM Procedures in Measuring Dynamic Effects A Comparison of PDLREG and GAM Procedures in Measuring Dynamic Effects Patralekha Bhattacharya Thinkalytics The PDLREG procedure in SAS is used to fit a finite distributed lagged model to time series data

More information

Practical 4: Mixed effect models

Practical 4: Mixed effect models Practical 4: Mixed effect models This practical is about how to fit (generalised) linear mixed effects models using the lme4 package. You may need to install it first (using either the install.packages

More information

An Introduction to the Bootstrap

An Introduction to the Bootstrap An Introduction to the Bootstrap Bradley Efron Department of Statistics Stanford University and Robert J. Tibshirani Department of Preventative Medicine and Biostatistics and Department of Statistics,

More information

Functional Data Analysis

Functional Data Analysis Functional Data Analysis Venue: Tuesday/Thursday 1:25-2:40 WN 145 Lecturer: Giles Hooker Office Hours: Wednesday 2-4 Comstock 1186 Ph: 5-1638 e-mail: gjh27 Texts and Resources Ramsay and Silverman, 2007,

More information

FMA901F: Machine Learning Lecture 3: Linear Models for Regression. Cristian Sminchisescu

FMA901F: Machine Learning Lecture 3: Linear Models for Regression. Cristian Sminchisescu FMA901F: Machine Learning Lecture 3: Linear Models for Regression Cristian Sminchisescu Machine Learning: Frequentist vs. Bayesian In the frequentist setting, we seek a fixed parameter (vector), with value(s)

More information

Statistics & Analysis. Fitting Generalized Additive Models with the GAM Procedure in SAS 9.2

Statistics & Analysis. Fitting Generalized Additive Models with the GAM Procedure in SAS 9.2 Fitting Generalized Additive Models with the GAM Procedure in SAS 9.2 Weijie Cai, SAS Institute Inc., Cary NC July 1, 2008 ABSTRACT Generalized additive models are useful in finding predictor-response

More information

Lecture 24: Generalized Additive Models Stat 704: Data Analysis I, Fall 2010

Lecture 24: Generalized Additive Models Stat 704: Data Analysis I, Fall 2010 Lecture 24: Generalized Additive Models Stat 704: Data Analysis I, Fall 2010 Tim Hanson, Ph.D. University of South Carolina T. Hanson (USC) Stat 704: Data Analysis I, Fall 2010 1 / 26 Additive predictors

More information

Nonparametric Regression

Nonparametric Regression Nonparametric Regression John Fox Department of Sociology McMaster University 1280 Main Street West Hamilton, Ontario Canada L8S 4M4 jfox@mcmaster.ca February 2004 Abstract Nonparametric regression analysis

More information

Use of Extreme Value Statistics in Modeling Biometric Systems

Use of Extreme Value Statistics in Modeling Biometric Systems Use of Extreme Value Statistics in Modeling Biometric Systems Similarity Scores Two types of matching: Genuine sample Imposter sample Matching scores Enrolled sample 0.95 0.32 Probability Density Decision

More information

STATISTICS (STAT) Statistics (STAT) 1

STATISTICS (STAT) Statistics (STAT) 1 Statistics (STAT) 1 STATISTICS (STAT) STAT 2013 Elementary Statistics (A) Prerequisites: MATH 1483 or MATH 1513, each with a grade of "C" or better; or an acceptable placement score (see placement.okstate.edu).

More information

Soft Threshold Estimation for Varying{coecient Models 2 ations of certain basis functions (e.g. wavelets). These functions are assumed to be smooth an

Soft Threshold Estimation for Varying{coecient Models 2 ations of certain basis functions (e.g. wavelets). These functions are assumed to be smooth an Soft Threshold Estimation for Varying{coecient Models Artur Klinger, Universitat Munchen ABSTRACT: An alternative penalized likelihood estimator for varying{coecient regression in generalized linear models

More information

Simple fitting of subject-specific curves for longitudinal data

Simple fitting of subject-specific curves for longitudinal data STATISTICS IN MEDICINE Statist Med 2004; 00:1 24 Simple fitting of subject-specific curves for longitudinal data M Durbán 1,, J Harezlak 2,MP Wand 3,RJ Carroll 4 1 Department of Statistics, Universidad

More information

Stat 8053, Fall 2013: Additive Models

Stat 8053, Fall 2013: Additive Models Stat 853, Fall 213: Additive Models We will only use the package mgcv for fitting additive and later generalized additive models. The best reference is S. N. Wood (26), Generalized Additive Models, An

More information

Comment. J. L. FRENCH, E. E. KAMMANN, and M. F? WAND. a: 1 = -{lly -X~-Z~U~~'+AU~Z~U). sion involves minimization of

Comment. J. L. FRENCH, E. E. KAMMANN, and M. F? WAND. a: 1 = -{lly -X~-Z~U~~'+AU~Z~U). sion involves minimization of French, Kammann, and Wand: Comment J. L. FRENCH, E. E. KAMMANN, and M. F? WAND Comment 1. INTRODUCTION Semiparametric nonlinear mixed models are a useful addition to the regression modeling arsenal, and

More information

Spline Models. Introduction to CS and NCS. Regression splines. Smoothing splines

Spline Models. Introduction to CS and NCS. Regression splines. Smoothing splines Spline Models Introduction to CS and NCS Regression splines Smoothing splines 3 Cubic Splines a knots: a< 1 < 2 < < m

More information

Analysis of Panel Data. Third Edition. Cheng Hsiao University of Southern California CAMBRIDGE UNIVERSITY PRESS

Analysis of Panel Data. Third Edition. Cheng Hsiao University of Southern California CAMBRIDGE UNIVERSITY PRESS Analysis of Panel Data Third Edition Cheng Hsiao University of Southern California CAMBRIDGE UNIVERSITY PRESS Contents Preface to the ThirdEdition Preface to the Second Edition Preface to the First Edition

More information

PSY 9556B (Feb 5) Latent Growth Modeling

PSY 9556B (Feb 5) Latent Growth Modeling PSY 9556B (Feb 5) Latent Growth Modeling Fixed and random word confusion Simplest LGM knowing how to calculate dfs How many time points needed? Power, sample size Nonlinear growth quadratic Nonlinear growth

More information

Lecture 27, April 24, Reading: See class website. Nonparametric regression and kernel smoothing. Structured sparse additive models (GroupSpAM)

Lecture 27, April 24, Reading: See class website. Nonparametric regression and kernel smoothing. Structured sparse additive models (GroupSpAM) School of Computer Science Probabilistic Graphical Models Structured Sparse Additive Models Junming Yin and Eric Xing Lecture 7, April 4, 013 Reading: See class website 1 Outline Nonparametric regression

More information

Free Knot Polynomial Spline Confidence Intervals

Free Knot Polynomial Spline Confidence Intervals Free Knot Polynomial Spline Confidence Intervals Vincent W. Mao Linda H. Zhao Department of Statistics University of Pennsylvania Philadelphia, PA 19104-6302 lzhao@wharton.upenn.edu Abstract We construct

More information

Bootstrapping Methods

Bootstrapping Methods Bootstrapping Methods example of a Monte Carlo method these are one Monte Carlo statistical method some Bayesian statistical methods are Monte Carlo we can also simulate models using Monte Carlo methods

More information

The theory of the linear model 41. Theorem 2.5. Under the strong assumptions A3 and A5 and the hypothesis that

The theory of the linear model 41. Theorem 2.5. Under the strong assumptions A3 and A5 and the hypothesis that The theory of the linear model 41 Theorem 2.5. Under the strong assumptions A3 and A5 and the hypothesis that E(Y X) =X 0 b 0 0 the F-test statistic follows an F-distribution with (p p 0, n p) degrees

More information

SAS PROC GLM and PROC MIXED. for Recovering Inter-Effect Information

SAS PROC GLM and PROC MIXED. for Recovering Inter-Effect Information SAS PROC GLM and PROC MIXED for Recovering Inter-Effect Information Walter T. Federer Biometrics Unit Cornell University Warren Hall Ithaca, NY -0 biometrics@comell.edu Russell D. Wolfinger SAS Institute

More information

Functional Modeling of Longitudinal Data with the SSM Procedure

Functional Modeling of Longitudinal Data with the SSM Procedure Paper SAS1580-2015 Functional Modeling of Longitudinal Data with the SSM Procedure Rajesh Selukar, SAS Institute Inc. ABSTRACT In many studies, a continuous response variable is repeatedly measured over

More information

Assessing the Quality of the Natural Cubic Spline Approximation

Assessing the Quality of the Natural Cubic Spline Approximation Assessing the Quality of the Natural Cubic Spline Approximation AHMET SEZER ANADOLU UNIVERSITY Department of Statisticss Yunus Emre Kampusu Eskisehir TURKEY ahsst12@yahoo.com Abstract: In large samples,

More information

Approximate Smoothing Spline Methods for Large Data Sets in the Binary Case Dong Xiang, SAS Institute Inc. Grace Wahba, University of Wisconsin at Mad

Approximate Smoothing Spline Methods for Large Data Sets in the Binary Case Dong Xiang, SAS Institute Inc. Grace Wahba, University of Wisconsin at Mad DEPARTMENT OF STATISTICS University of Wisconsin 1210 West Dayton St. Madison, WI 53706 TECHNICAL REPORT NO. 982 September 30, 1997 Approximate Smoothing Spline Methods for Large Data Sets in the Binary

More information

Package lmesplines. R topics documented: February 20, Version

Package lmesplines. R topics documented: February 20, Version Version 1.1-10 Package lmesplines February 20, 2015 Title Add smoothing spline modelling capability to nlme. Author Rod Ball Maintainer Andrzej Galecki

More information

NONPARAMETRIC REGRESSION SPLINES FOR GENERALIZED LINEAR MODELS IN THE PRESENCE OF MEASUREMENT ERROR

NONPARAMETRIC REGRESSION SPLINES FOR GENERALIZED LINEAR MODELS IN THE PRESENCE OF MEASUREMENT ERROR NONPARAMETRIC REGRESSION SPLINES FOR GENERALIZED LINEAR MODELS IN THE PRESENCE OF MEASUREMENT ERROR J. D. Maca July 1, 1997 Abstract The purpose of this manual is to demonstrate the usage of software for

More information

Linear penalized spline model estimation using ranked set sampling technique

Linear penalized spline model estimation using ranked set sampling technique Hacettepe Journal of Mathematics and Statistics Volume 46 (4) (2017), 669 683 Linear penalized spline model estimation using ranked set sampling technique Al Kadiri M A Abstract Benets of using Ranked

More information

Last time... Bias-Variance decomposition. This week

Last time... Bias-Variance decomposition. This week Machine learning, pattern recognition and statistical data modelling Lecture 4. Going nonlinear: basis expansions and splines Last time... Coryn Bailer-Jones linear regression methods for high dimensional

More information

Fitting latency models using B-splines in EPICURE for DOS

Fitting latency models using B-splines in EPICURE for DOS Fitting latency models using B-splines in EPICURE for DOS Michael Hauptmann, Jay Lubin January 11, 2007 1 Introduction Disease latency refers to the interval between an increment of exposure and a subsequent

More information

Section 4 Matching Estimator

Section 4 Matching Estimator Section 4 Matching Estimator Matching Estimators Key Idea: The matching method compares the outcomes of program participants with those of matched nonparticipants, where matches are chosen on the basis

More information

A Method for Comparing Multiple Regression Models

A Method for Comparing Multiple Regression Models CSIS Discussion Paper No. 141 A Method for Comparing Multiple Regression Models Yuki Hiruta Yasushi Asami Department of Urban Engineering, the University of Tokyo e-mail: hiruta@ua.t.u-tokyo.ac.jp asami@csis.u-tokyo.ac.jp

More information

Week 5: Multiple Linear Regression II

Week 5: Multiple Linear Regression II Week 5: Multiple Linear Regression II Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Adjusted R

More information

Week 4: Simple Linear Regression III

Week 4: Simple Linear Regression III Week 4: Simple Linear Regression III Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Goodness of

More information

A Fast Clustering Algorithm with Application to Cosmology. Woncheol Jang

A Fast Clustering Algorithm with Application to Cosmology. Woncheol Jang A Fast Clustering Algorithm with Application to Cosmology Woncheol Jang May 5, 2004 Abstract We present a fast clustering algorithm for density contour clusters (Hartigan, 1975) that is a modified version

More information

Statistical Modeling with Spline Functions Methodology and Theory

Statistical Modeling with Spline Functions Methodology and Theory This is page 1 Printer: Opaque this Statistical Modeling with Spline Functions Methodology and Theory Mark H Hansen University of California at Los Angeles Jianhua Z Huang University of Pennsylvania Charles

More information

Median and Extreme Ranked Set Sampling for penalized spline estimation

Median and Extreme Ranked Set Sampling for penalized spline estimation Appl Math Inf Sci 10, No 1, 43-50 (016) 43 Applied Mathematics & Information Sciences An International Journal http://dxdoiorg/1018576/amis/10014 Median and Extreme Ranked Set Sampling for penalized spline

More information

Constrained Penalized Splines

Constrained Penalized Splines Constrained Penalized Splines Mary C Meyer Colorado State University October 15, 2010 Summary The penalized spline is a popular method for function estimation when the assumption of smoothness is valid.

More information

A Modified Weibull Distribution

A Modified Weibull Distribution IEEE TRANSACTIONS ON RELIABILITY, VOL. 52, NO. 1, MARCH 2003 33 A Modified Weibull Distribution C. D. Lai, Min Xie, Senior Member, IEEE, D. N. P. Murthy, Member, IEEE Abstract A new lifetime distribution

More information

Penalized Spline Model-Based Estimation of the Finite Populations Total from Probability-Proportional-to-Size Samples

Penalized Spline Model-Based Estimation of the Finite Populations Total from Probability-Proportional-to-Size Samples Journal of Of cial Statistics, Vol. 19, No. 2, 2003, pp. 99±117 Penalized Spline Model-Based Estimation of the Finite Populations Total from Probability-Proportional-to-Size Samples Hui Zheng 1 and Roderick

More information

Fly wing length data Sokal and Rohlf Box 10.1 Ch13.xls. on chalk board

Fly wing length data Sokal and Rohlf Box 10.1 Ch13.xls. on chalk board Model Based Statistics in Biology. Part IV. The General Linear Model. Multiple Explanatory Variables. Chapter 13.6 Nested Factors (Hierarchical ANOVA ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6,

More information

Generalized Additive Models

Generalized Additive Models Generalized Additive Models Statistics 135 Autumn 2005 Copyright c 2005 by Mark E. Irwin Generalized Additive Models GAMs are one approach to non-parametric regression in the multiple predictor setting.

More information

Parallel line analysis and relative potency in SoftMax Pro 7 Software

Parallel line analysis and relative potency in SoftMax Pro 7 Software APPLICATION NOTE Parallel line analysis and relative potency in SoftMax Pro 7 Software Introduction Biological assays are frequently analyzed with the help of parallel line analysis (PLA). PLA is commonly

More information

Nonparametric Regression and Generalized Additive Models Part I

Nonparametric Regression and Generalized Additive Models Part I SPIDA, June 2004 Nonparametric Regression and Generalized Additive Models Part I Robert Andersen McMaster University Plan of the Lecture 1. Detecting nonlinearity Fitting a linear model to a nonlinear

More information

ST512. Fall Quarter, Exam 1. Directions: Answer questions as directed. Please show work. For true/false questions, circle either true or false.

ST512. Fall Quarter, Exam 1. Directions: Answer questions as directed. Please show work. For true/false questions, circle either true or false. ST512 Fall Quarter, 2005 Exam 1 Name: Directions: Answer questions as directed. Please show work. For true/false questions, circle either true or false. 1. (42 points) A random sample of n = 30 NBA basketball

More information

Smoothing Dissimilarities for Cluster Analysis: Binary Data and Functional Data

Smoothing Dissimilarities for Cluster Analysis: Binary Data and Functional Data Smoothing Dissimilarities for Cluster Analysis: Binary Data and unctional Data David B. University of South Carolina Department of Statistics Joint work with Zhimin Chen University of South Carolina Current

More information

Moving Beyond Linearity

Moving Beyond Linearity Moving Beyond Linearity Basic non-linear models one input feature: polynomial regression step functions splines smoothing splines local regression. more features: generalized additive models. Polynomial

More information

Wildfire Chances and Probabilistic Risk Assessment

Wildfire Chances and Probabilistic Risk Assessment Wildfire Chances and Probabilistic Risk Assessment D R Brillinger, H K Preisler and H M Naderi Statistics Department University of California Berkeley, CA, 97-386 Pacific Southwest Research Station USDA

More information

Using Multivariate Adaptive Regression Splines (MARS ) to enhance Generalised Linear Models. Inna Kolyshkina PriceWaterhouseCoopers

Using Multivariate Adaptive Regression Splines (MARS ) to enhance Generalised Linear Models. Inna Kolyshkina PriceWaterhouseCoopers Using Multivariate Adaptive Regression Splines (MARS ) to enhance Generalised Linear Models. Inna Kolyshkina PriceWaterhouseCoopers Why enhance GLM? Shortcomings of the linear modelling approach. GLM being

More information

by John Barnard and Walter T. Federer Cornell University

by John Barnard and Walter T. Federer Cornell University Genstat and SAS Programs for Recovering Interblock, Interrow-column, and Intergradient Information by John Barnard and Walter T. Federer Cornell University BU-1383-M 1 anuary 1997 0. Abstrast Genstat and

More information

Statistical Modeling with Spline Functions Methodology and Theory

Statistical Modeling with Spline Functions Methodology and Theory This is page 1 Printer: Opaque this Statistical Modeling with Spline Functions Methodology and Theory Mark H. Hansen University of California at Los Angeles Jianhua Z. Huang University of Pennsylvania

More information

Notes on Simulations in SAS Studio

Notes on Simulations in SAS Studio Notes on Simulations in SAS Studio If you are not careful about simulations in SAS Studio, you can run into problems. In particular, SAS Studio has a limited amount of memory that you can use to write

More information

Multivariable Regression Modelling

Multivariable Regression Modelling Multivariable Regression Modelling A review of available spline packages in R. Aris Perperoglou for TG2 ISCB 2015 Aris Perperoglou for TG2 Multivariable Regression Modelling ISCB 2015 1 / 41 TG2 Members

More information

Outline. Topic 16 - Other Remedies. Ridge Regression. Ridge Regression. Ridge Regression. Robust Regression. Regression Trees. Piecewise Linear Model

Outline. Topic 16 - Other Remedies. Ridge Regression. Ridge Regression. Ridge Regression. Robust Regression. Regression Trees. Piecewise Linear Model Topic 16 - Other Remedies Ridge Regression Robust Regression Regression Trees Outline - Fall 2013 Piecewise Linear Model Bootstrapping Topic 16 2 Ridge Regression Modification of least squares that addresses

More information

Estimation and Inference by the Method of Projection Minimum Distance. Òscar Jordà Sharon Kozicki U.C. Davis Bank of Canada

Estimation and Inference by the Method of Projection Minimum Distance. Òscar Jordà Sharon Kozicki U.C. Davis Bank of Canada Estimation and Inference by the Method of Projection Minimum Distance Òscar Jordà Sharon Kozicki U.C. Davis Bank of Canada The Paper in a Nutshell: An Efficient Limited Information Method Step 1: estimate

More information

Introduction to Mixed Models: Multivariate Regression

Introduction to Mixed Models: Multivariate Regression Introduction to Mixed Models: Multivariate Regression EPSY 905: Multivariate Analysis Spring 2016 Lecture #9 March 30, 2016 EPSY 905: Multivariate Regression via Path Analysis Today s Lecture Multivariate

More information

Package gamm4. July 25, Index 10

Package gamm4. July 25, Index 10 Version 0.2-5 Author Simon Wood, Fabian Scheipl Package gamm4 July 25, 2017 Maintainer Simon Wood Title Generalized Additive Mixed Models using 'mgcv' and 'lme4' Description

More information

Missing Data Analysis for the Employee Dataset

Missing Data Analysis for the Employee Dataset Missing Data Analysis for the Employee Dataset 67% of the observations have missing values! Modeling Setup Random Variables: Y i =(Y i1,...,y ip ) 0 =(Y i,obs, Y i,miss ) 0 R i =(R i1,...,r ip ) 0 ( 1

More information

Introduction to Statistical Analyses in SAS

Introduction to Statistical Analyses in SAS Introduction to Statistical Analyses in SAS Programming Workshop Presented by the Applied Statistics Lab Sarah Janse April 5, 2017 1 Introduction Today we will go over some basic statistical analyses in

More information

Unified Methods for Censored Longitudinal Data and Causality

Unified Methods for Censored Longitudinal Data and Causality Mark J. van der Laan James M. Robins Unified Methods for Censored Longitudinal Data and Causality Springer Preface v Notation 1 1 Introduction 8 1.1 Motivation, Bibliographic History, and an Overview of

More information

CPSC 340: Machine Learning and Data Mining. Principal Component Analysis Fall 2016

CPSC 340: Machine Learning and Data Mining. Principal Component Analysis Fall 2016 CPSC 340: Machine Learning and Data Mining Principal Component Analysis Fall 2016 A2/Midterm: Admin Grades/solutions will be posted after class. Assignment 4: Posted, due November 14. Extra office hours:

More information

Doubly Cyclic Smoothing Splines and Analysis of Seasonal Daily Pattern of CO2 Concentration in Antarctica

Doubly Cyclic Smoothing Splines and Analysis of Seasonal Daily Pattern of CO2 Concentration in Antarctica Boston-Keio Workshop 2016. Doubly Cyclic Smoothing Splines and Analysis of Seasonal Daily Pattern of CO2 Concentration in Antarctica... Mihoko Minami Keio University, Japan August 15, 2016 Joint work with

More information

Inference in mixed models in R - beyond the usual asymptotic likelihood ratio test

Inference in mixed models in R - beyond the usual asymptotic likelihood ratio test 1 / 42 Inference in mixed models in R - beyond the usual asymptotic likelihood ratio test Søren Højsgaard 1 Ulrich Halekoh 2 1 Department of Mathematical Sciences Aalborg University, Denmark sorenh@math.aau.dk

More information

Package RLRsim. November 4, 2016

Package RLRsim. November 4, 2016 Type Package Package RLRsim November 4, 2016 Title Exact (Restricted) Likelihood Ratio Tests for Mixed and Additive Models Version 3.1-3 Date 2016-11-03 Maintainer Fabian Scheipl

More information

Course Business. types of designs. " Week 4. circle back to if we have time later. ! Two new datasets for class today:

Course Business. types of designs.  Week 4. circle back to if we have time later. ! Two new datasets for class today: Course Business! Two new datasets for class today:! CourseWeb: Course Documents " Sample Data " Week 4! Still need help with lmertest?! Other fixed effect questions?! Effect size in lecture slides on CourseWeb.

More information

STATISTICS (STAT) 200 Level Courses Registration Restrictions: STAT 250: Required Prerequisites: not Schedule Type: Mason Core: STAT 346:

STATISTICS (STAT) 200 Level Courses Registration Restrictions: STAT 250: Required Prerequisites: not Schedule Type: Mason Core: STAT 346: Statistics (STAT) 1 STATISTICS (STAT) 200 Level Courses STAT 250: Introductory Statistics I. 3 credits. Elementary introduction to statistics. Topics include descriptive statistics, probability, and estimation

More information

Smoothing non-stationary noise of the Nigerian Stock Exchange All-Share Index data using variable coefficient functions

Smoothing non-stationary noise of the Nigerian Stock Exchange All-Share Index data using variable coefficient functions Smoothing non-stationary noise of the Nigerian Stock Exchange All-Share Index data using variable coefficient functions 1 Alabi Nurudeen Olawale, 2 Are Stephen Olusegun 1 Department of Mathematics and

More information

Hierarchical Generalized Linear Models

Hierarchical Generalized Linear Models Generalized Multilevel Linear Models Introduction to Multilevel Models Workshop University of Georgia: Institute for Interdisciplinary Research in Education and Human Development 07 Generalized Multilevel

More information

Modeling Criminal Careers as Departures From a Unimodal Population Age-Crime Curve: The Case of Marijuana Use

Modeling Criminal Careers as Departures From a Unimodal Population Age-Crime Curve: The Case of Marijuana Use Modeling Criminal Careers as Departures From a Unimodal Population Curve: The Case of Marijuana Use Donatello Telesca, Elena A. Erosheva, Derek A. Kreader, & Ross Matsueda April 15, 2014 extends Telesca

More information

Homework. Gaussian, Bishop 2.3 Non-parametric, Bishop 2.5 Linear regression Pod-cast lecture on-line. Next lectures:

Homework. Gaussian, Bishop 2.3 Non-parametric, Bishop 2.5 Linear regression Pod-cast lecture on-line. Next lectures: Homework Gaussian, Bishop 2.3 Non-parametric, Bishop 2.5 Linear regression 3.0-3.2 Pod-cast lecture on-line Next lectures: I posted a rough plan. It is flexible though so please come with suggestions Bayes

More information

A Bayesian approach to detect time-specific group differences between nonlinear temporal curves

A Bayesian approach to detect time-specific group differences between nonlinear temporal curves University of Iowa Iowa Research Online Theses and Dissertations Spring 2016 A Bayesian approach to detect time-specific group differences between nonlinear temporal curves Melissa Anna Maria Pugh University

More information