Nonparametric Mixed-Effects Models for Longitudinal Data

Size: px
Start display at page:

Download "Nonparametric Mixed-Effects Models for Longitudinal Data"

Transcription

1 Nonparametric Mixed-Effects Models for Longitudinal Data Zhang Jin-Ting Dept of Stat & Appl Prob National University of Sinagpore University of Seoul, South Korea, 7 p.1/26

2 OUTLINE The Motivating Data Various Parametric/Nonparametric ME Models Various Fitting Approaches Smoothing Parameter Selection Real Data Application Other ME Models University of Seoul, South Korea, 7 p.2/26

3 The Motivating Data The ACTG 388 Data (Park and Wu 4) 1 Raw Curves CD4 count 6 Response CD4 cell counts Covariate Time Week patients. Total Number of Observations University of Seoul, South Korea, 7 p.3/26

4 The Motivating Data Six Selected Subjects 6 Subj 8 6 Subj Subj Subj Subj Subj CD4 count Week University of Seoul, South Korea, 7 p.4/26

5 The Motivating Data Pointwise Means 1 Pointwise Raw Means CD4 count Week University of Seoul, South Korea, 7 p.5/26

6 The Motivating Data For the ACTG 388 data, the following Population-Mean ME Model is proper: GP where : measurement errors for -th measurement of -th subject : smooth fixed-effects function : smooth random-effects function of -th subject : individual function of -th subject Aim: Estimate, and ; Predict and University of Seoul, South Korea, 7 p.6/26

7 Various Parametric/Nonparametric ME Models Parametric Mixed-Effects Models In classical longitudinal data analysis, parametric mixed-effects models are often used. Linear Mixed-effects Models: i.i.d : Parametric fixed-effects : Parametric random-effects : Measurement errors University of Seoul, South Korea, 7 p.7/26

8 Various Parametric/Nonparametric ME Models Nonlinear Mixed-effects Models: i.i.d where is some known nonlinear function. See Davidian and Giltinan (1995), Vonesh and Chinchilli (1996), among others. University of Seoul, South Korea, 7 p.8/26

9 Various Parametric/Nonparametric ME Models Advantages: May take many covariates into account Easy to fit and analyze via EM algorithm Methodologies well developed Disadvantages: Need valid parametric assumptions May lead to misleading conclusions Not robust against model misspecification University of Seoul, South Korea, 7 p.9/26

10 Various Parametric/Nonparametric ME Models Nonparametric Mixed-Effects Models Recently, various nonparametric mixed-effects models are proposed. Population-Mean ME Model GP GP : nonparametric fixed-effects function : nonparametric random-effects function : Measurement error process See Zhang et al. (1998), Rice and Wu (1), Wu and Zhang (2, 6) among others University of Seoul, South Korea, 7 p.1/26

11 Various Parametric/Nonparametric ME Models Varying Coefficient ME Model GP GP where is the vector of some unknown smooth coefficient functions. See Wu and Zhang (6) University of Seoul, South Korea, 7 p.11/26

12 Various Parametric/Nonparametric ME Models Random Coefficient ME Model GP GP -th random coefficient function. is the where the University of Seoul, South Korea, 7 p.12/26

13 Various Parametric/Nonparametric ME Models Advantages: Flexible to fit longitudinal data Robust against model misspecification Disadvantages: Only involve a few covariates May computationally intensive Methodologies under developing See Wu and Zhang (6) and the references therein. University of Seoul, South Korea, 7 p.13/26

14 Various Fitting Approaches In the above nonparametric ME models, the nonparametric components such as should be fitted using some smoothing technique. The major smoothing techniques include Regression Spline Method (Eubank, 1999) and Smoothing Spline Method (Wahba, 199, Green and Silverman 1994) Penalized Spline Method (Ruppert, Wand and Carroll, 3) Local Polynomial Method (Fan and Gijbels, 1996) To adopt the above smoothing methods, we shall use the Population Mean ME model as an example. University of Seoul, South Korea, 7 p.14/26

15 Various Fitting Approaches respectively. the The Regression Spline Method Key Ideas: Approximate the nonparametric FE component by a regression spline, and approximate the nonaparametric RE components by regression splines and are two regression spline bases of dimensions and Then Population-Mean ME model can be approximated by where where and. This is a Standard Linear Mixed-effects (LME) Model. See Rice and Wu (1), Wu and Zhang (6, Chapter 5) for more details. University of Seoul, South Korea, 7 p.15/26

16 Various Fitting Approaches The Smoothing Spline Method Key Ideas: Take into account the roughness of and simultaneously via introducing roughness penalty. For example, for the cubic smoothing spline method, it is to find and to minimize the following criterion: Loglik where Loglik is the log-likelihood function evaluated at the design time points, and are the associated smoothing parameters. See Brumback and Rice (1998), Wu and Zhang (6, Chapter 6) for more details. University of Seoul, South Korea, 7 p.16/26

17 Various Fitting Approaches The Penalized Spline Method Key Ideas: Take into account the roughness of approximating and by regression splines penalizing the associated coefficients. That is to find following criterion: and and simultaneously via and, and to minimize the Loglik where Loglik is the log-likelihood function evaluated at the design time points and based on the regression spline approximations, and are the associated smoothing parameters. See Wu and Zhang (6, Chapter 7) for more details. University of Seoul, South Korea, 7 p.17/26

18 Various Fitting Approaches The Local Polynomial Method Key Ideas: At any fixed time point, the Population-Mean model can be approx- imated by a standard LME model via approximating and by polynomials of some order. The resulting LME can be fitted by the existing approaches for LME models. See Wu and Zhang (2, 6, Chapter 4) for more details. University of Seoul, South Korea, 7 p.18/26

19 Smoothing Parameter Selection Mixed Effects Fits Let the above methods and for the Population Mean Model, the Fixed-Effects Fits at can be expressed as be all the design time points. For where is the smoother matrix for, and the Random Effects Fits at is where the smoother matrix for all the random-effects evaluated at the design time points. University of Seoul, South Korea, 7 p.19/26

20 Smoothing Parameter Selection Goodness of Fit and Model Complexity Smoothing Parameter Selection attempts to trade off Goodness of Fit and Model Complexity. Goodness of Fit can be measured by the Log-likelihood: Loglik Const Cov (1) The larger the Loglik, the better Goodness of Fit of the modeling, indicating that the data are fitted very closely by the model. Model Complexity can be measured by the Trace of Smoother Matrix Model Complexity for Fixed-Effects df tr, indicating how complicate of the model is for fitting the fixed effects. Model Comlexity for Random-Effects df tr, indicating how complicate of the model is for fitting the random effects. University of Seoul, South Korea, 7 p.2/26

21 Smoothing Parameter Selection AIC and BIC A Criterion should trade off Goodness of Fit and Model Complexity for Fixed-effects and Random-effects. For example, for the Population Mean Model and for the regression spline method, we shall define AIC Loglik df df, BIC Loglik df df, where and are the number of knots for and. For the smoothing spline and penalized spline methods, and should be replaced by and respectively. University of Seoul, South Korea, 7 p.21/26

22 Real Data Application AIC and BIC for ACTG 388 Data x (a) AIC Value 4.72 x (b) BIC K v =1 K v =2 K v = K seems a good choice University of Seoul, South Korea, 7 p.22/26

23 Real Data Application Overall Fits: CD4 count (a) Fitted individual functions 5 1 Week CD4 count (b) Fitted mean function with ± 2 SD Week x 1 4 (c) Fitted covariance function (d) Fitted correlation function 5 1 Covariance 4 3 Correlation Week 1 5 Week Week 1 5 Week 15 University of Seoul, South Korea, 7 p.23/26

24 Real Data Application Individual Fits: Subj Subj Subj CD4 count Subj Subj Subj 59 raw data population individual 5 1 Week University of Seoul, South Korea, 7 p.24/26

25 Other ME Models The Methodologies proposed above can be applied to other ME Models, e.g.: Semiparametric ME Models: GP Generalized Nonparametric ME Model: is known. where Other ME models.. University of Seoul, South Korea, 7 p.25/26

26 End of the Talk Thank You University of Seoul, South Korea, 7 p.26/26

Analyzing Longitudinal Data Using Regression Splines

Analyzing Longitudinal Data Using Regression Splines Analyzing Longitudinal Data Using Regression Splines Zhang Jin-Ting Dept of Stat & Appl Prob National University of Sinagpore August 18, 6 DSAP, NUS p.1/16 OUTLINE Motivating Longitudinal Data Parametric

More information

davidr Cornell University

davidr Cornell University 1 NONPARAMETRIC RANDOM EFFECTS MODELS AND LIKELIHOOD RATIO TESTS Oct 11, 2002 David Ruppert Cornell University www.orie.cornell.edu/ davidr (These transparencies and preprints available link to Recent

More information

Non-Parametric and Semi-Parametric Methods for Longitudinal Data

Non-Parametric and Semi-Parametric Methods for Longitudinal Data PART III Non-Parametric and Semi-Parametric Methods for Longitudinal Data CHAPTER 8 Non-parametric and semi-parametric regression methods: Introduction and overview Xihong Lin and Raymond J. Carroll Contents

More information

Nonparametric regression using kernel and spline methods

Nonparametric regression using kernel and spline methods Nonparametric regression using kernel and spline methods Jean D. Opsomer F. Jay Breidt March 3, 016 1 The statistical model When applying nonparametric regression methods, the researcher is interested

More information

1D Regression. i.i.d. with mean 0. Univariate Linear Regression: fit by least squares. Minimize: to get. The set of all possible functions is...

1D Regression. i.i.d. with mean 0. Univariate Linear Regression: fit by least squares. Minimize: to get. The set of all possible functions is... 1D Regression i.i.d. with mean 0. Univariate Linear Regression: fit by least squares. Minimize: to get. The set of all possible functions is... 1 Non-linear problems What if the underlying function is

More information

Nonparametric Risk Attribution for Factor Models of Portfolios. October 3, 2017 Kellie Ottoboni

Nonparametric Risk Attribution for Factor Models of Portfolios. October 3, 2017 Kellie Ottoboni Nonparametric Risk Attribution for Factor Models of Portfolios October 3, 2017 Kellie Ottoboni Outline The problem Page 3 Additive model of returns Page 7 Euler s formula for risk decomposition Page 11

More information

Generalized Additive Models

Generalized Additive Models :p Texts in Statistical Science Generalized Additive Models An Introduction with R Simon N. Wood Contents Preface XV 1 Linear Models 1 1.1 A simple linear model 2 Simple least squares estimation 3 1.1.1

More information

Nonparametric Regression Methods for Longitudinal Data Analysis

Nonparametric Regression Methods for Longitudinal Data Analysis Nonparametric Regression Methods for Longitudinal Data Analysis HULIN WU University of Rochester Dept. of Biostatistics and Computer Biology Rochester, New York JIN-TING ZHANG National University of Singapore

More information

Last time... Bias-Variance decomposition. This week

Last time... Bias-Variance decomposition. This week Machine learning, pattern recognition and statistical data modelling Lecture 4. Going nonlinear: basis expansions and splines Last time... Coryn Bailer-Jones linear regression methods for high dimensional

More information

Improved smoothing spline regression by combining estimates of dierent smoothness

Improved smoothing spline regression by combining estimates of dierent smoothness Available online at www.sciencedirect.com Statistics & Probability Letters 67 (2004) 133 140 Improved smoothing spline regression by combining estimates of dierent smoothness Thomas C.M. Lee Department

More information

Generalized Additive Model

Generalized Additive Model Generalized Additive Model by Huimin Liu Department of Mathematics and Statistics University of Minnesota Duluth, Duluth, MN 55812 December 2008 Table of Contents Abstract... 2 Chapter 1 Introduction 1.1

More information

Ludwig Fahrmeir Gerhard Tute. Statistical odelling Based on Generalized Linear Model. íecond Edition. . Springer

Ludwig Fahrmeir Gerhard Tute. Statistical odelling Based on Generalized Linear Model. íecond Edition. . Springer Ludwig Fahrmeir Gerhard Tute Statistical odelling Based on Generalized Linear Model íecond Edition. Springer Preface to the Second Edition Preface to the First Edition List of Examples List of Figures

More information

Linear Penalized Spline Model Estimation Using Ranked Set Sampling Technique

Linear Penalized Spline Model Estimation Using Ranked Set Sampling Technique Linear Penalized Spline Model Estimation Using Ranked Set Sampling Technique Al Kadiri M. A. Abstract Benefits of using Ranked Set Sampling (RSS) rather than Simple Random Sampling (SRS) are indeed significant

More information

NONPARAMETRIC REGRESSION WIT MEASUREMENT ERROR: SOME RECENT PR David Ruppert Cornell University

NONPARAMETRIC REGRESSION WIT MEASUREMENT ERROR: SOME RECENT PR David Ruppert Cornell University NONPARAMETRIC REGRESSION WIT MEASUREMENT ERROR: SOME RECENT PR David Ruppert Cornell University www.orie.cornell.edu/ davidr (These transparencies, preprints, and references a link to Recent Talks and

More information

Smoothing parameterselection forsmoothing splines: a simulation study

Smoothing parameterselection forsmoothing splines: a simulation study Computational Statistics & Data Analysis 42 (2003) 139 148 www.elsevier.com/locate/csda Smoothing parameterselection forsmoothing splines: a simulation study Thomas C.M. Lee Department of Statistics, Colorado

More information

Nonparametric Approaches to Regression

Nonparametric Approaches to Regression Nonparametric Approaches to Regression In traditional nonparametric regression, we assume very little about the functional form of the mean response function. In particular, we assume the model where m(xi)

More information

Moving Beyond Linearity

Moving Beyond Linearity Moving Beyond Linearity The truth is never linear! 1/23 Moving Beyond Linearity The truth is never linear! r almost never! 1/23 Moving Beyond Linearity The truth is never linear! r almost never! But often

More information

Incorporating Geospatial Data in House Price Indexes: A Hedonic Imputation Approach with Splines. Robert J. Hill and Michael Scholz

Incorporating Geospatial Data in House Price Indexes: A Hedonic Imputation Approach with Splines. Robert J. Hill and Michael Scholz Incorporating Geospatial Data in House Price Indexes: A Hedonic Imputation Approach with Splines Robert J. Hill and Michael Scholz Department of Economics University of Graz, Austria OeNB Workshop Vienna,

More information

Median and Extreme Ranked Set Sampling for penalized spline estimation

Median and Extreme Ranked Set Sampling for penalized spline estimation Appl Math Inf Sci 10, No 1, 43-50 (016) 43 Applied Mathematics & Information Sciences An International Journal http://dxdoiorg/1018576/amis/10014 Median and Extreme Ranked Set Sampling for penalized spline

More information

The linear mixed model: modeling hierarchical and longitudinal data

The linear mixed model: modeling hierarchical and longitudinal data The linear mixed model: modeling hierarchical and longitudinal data Analysis of Experimental Data AED The linear mixed model: modeling hierarchical and longitudinal data 1 of 44 Contents 1 Modeling Hierarchical

More information

Curve fitting using linear models

Curve fitting using linear models Curve fitting using linear models Rasmus Waagepetersen Department of Mathematics Aalborg University Denmark September 28, 2012 1 / 12 Outline for today linear models and basis functions polynomial regression

More information

Knot-Placement to Avoid Over Fitting in B-Spline Scedastic Smoothing. Hirokazu Yanagihara* and Megu Ohtaki**

Knot-Placement to Avoid Over Fitting in B-Spline Scedastic Smoothing. Hirokazu Yanagihara* and Megu Ohtaki** Knot-Placement to Avoid Over Fitting in B-Spline Scedastic Smoothing Hirokazu Yanagihara* and Megu Ohtaki** * Department of Mathematics, Faculty of Science, Hiroshima University, Higashi-Hiroshima, 739-8529,

More information

P-spline ANOVA-type interaction models for spatio-temporal smoothing

P-spline ANOVA-type interaction models for spatio-temporal smoothing P-spline ANOVA-type interaction models for spatio-temporal smoothing Dae-Jin Lee and María Durbán Universidad Carlos III de Madrid Department of Statistics IWSM Utrecht 2008 D.-J. Lee and M. Durban (UC3M)

More information

Lecture 17: Smoothing splines, Local Regression, and GAMs

Lecture 17: Smoothing splines, Local Regression, and GAMs Lecture 17: Smoothing splines, Local Regression, and GAMs Reading: Sections 7.5-7 STATS 202: Data mining and analysis November 6, 2017 1 / 24 Cubic splines Define a set of knots ξ 1 < ξ 2 < < ξ K. We want

More information

STAT 705 Introduction to generalized additive models

STAT 705 Introduction to generalized additive models STAT 705 Introduction to generalized additive models Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis II 1 / 22 Generalized additive models Consider a linear

More information

Functional Data Analysis

Functional Data Analysis Functional Data Analysis Venue: Tuesday/Thursday 1:25-2:40 WN 145 Lecturer: Giles Hooker Office Hours: Wednesday 2-4 Comstock 1186 Ph: 5-1638 e-mail: gjh27 Texts and Resources Ramsay and Silverman, 2007,

More information

Nonparametric and Semiparametric Econometrics Lecture Notes for Econ 221. Yixiao Sun Department of Economics, University of California, San Diego

Nonparametric and Semiparametric Econometrics Lecture Notes for Econ 221. Yixiao Sun Department of Economics, University of California, San Diego Nonparametric and Semiparametric Econometrics Lecture Notes for Econ 221 Yixiao Sun Department of Economics, University of California, San Diego Winter 2007 Contents Preface ix 1 Kernel Smoothing: Density

More information

CH9.Generalized Additive Model

CH9.Generalized Additive Model CH9.Generalized Additive Model Regression Model For a response variable and predictor variables can be modeled using a mean function as follows: would be a parametric / nonparametric regression or a smoothing

More information

Linear penalized spline model estimation using ranked set sampling technique

Linear penalized spline model estimation using ranked set sampling technique Hacettepe Journal of Mathematics and Statistics Volume 46 (4) (2017), 669 683 Linear penalized spline model estimation using ranked set sampling technique Al Kadiri M A Abstract Benets of using Ranked

More information

Lecture 16: High-dimensional regression, non-linear regression

Lecture 16: High-dimensional regression, non-linear regression Lecture 16: High-dimensional regression, non-linear regression Reading: Sections 6.4, 7.1 STATS 202: Data mining and analysis November 3, 2017 1 / 17 High-dimensional regression Most of the methods we

More information

Soft Threshold Estimation for Varying{coecient Models 2 ations of certain basis functions (e.g. wavelets). These functions are assumed to be smooth an

Soft Threshold Estimation for Varying{coecient Models 2 ations of certain basis functions (e.g. wavelets). These functions are assumed to be smooth an Soft Threshold Estimation for Varying{coecient Models Artur Klinger, Universitat Munchen ABSTRACT: An alternative penalized likelihood estimator for varying{coecient regression in generalized linear models

More information

Lecture 24: Generalized Additive Models Stat 704: Data Analysis I, Fall 2010

Lecture 24: Generalized Additive Models Stat 704: Data Analysis I, Fall 2010 Lecture 24: Generalized Additive Models Stat 704: Data Analysis I, Fall 2010 Tim Hanson, Ph.D. University of South Carolina T. Hanson (USC) Stat 704: Data Analysis I, Fall 2010 1 / 26 Additive predictors

More information

Smoothing Dissimilarities for Cluster Analysis: Binary Data and Functional Data

Smoothing Dissimilarities for Cluster Analysis: Binary Data and Functional Data Smoothing Dissimilarities for Cluster Analysis: Binary Data and unctional Data David B. University of South Carolina Department of Statistics Joint work with Zhimin Chen University of South Carolina Current

More information

Generalized additive models I

Generalized additive models I I Patrick Breheny October 6 Patrick Breheny BST 764: Applied Statistical Modeling 1/18 Introduction Thus far, we have discussed nonparametric regression involving a single covariate In practice, we often

More information

Feature Subset Selection for Logistic Regression via Mixed Integer Optimization

Feature Subset Selection for Logistic Regression via Mixed Integer Optimization Feature Subset Selection for Logistic Regression via Mixed Integer Optimization Yuichi TAKANO (Senshu University, Japan) Toshiki SATO (University of Tsukuba) Ryuhei MIYASHIRO (Tokyo University of Agriculture

More information

Doubly Cyclic Smoothing Splines and Analysis of Seasonal Daily Pattern of CO2 Concentration in Antarctica

Doubly Cyclic Smoothing Splines and Analysis of Seasonal Daily Pattern of CO2 Concentration in Antarctica Boston-Keio Workshop 2016. Doubly Cyclic Smoothing Splines and Analysis of Seasonal Daily Pattern of CO2 Concentration in Antarctica... Mihoko Minami Keio University, Japan August 15, 2016 Joint work with

More information

Statistics & Analysis. Fitting Generalized Additive Models with the GAM Procedure in SAS 9.2

Statistics & Analysis. Fitting Generalized Additive Models with the GAM Procedure in SAS 9.2 Fitting Generalized Additive Models with the GAM Procedure in SAS 9.2 Weijie Cai, SAS Institute Inc., Cary NC July 1, 2008 ABSTRACT Generalized additive models are useful in finding predictor-response

More information

Nonparametric and Semiparametric Linear Mixed Models

Nonparametric and Semiparametric Linear Mixed Models Nonparametric and Semiparametric Linear Mixed Models Megan J. Waterman Department of Defense, U.S. Government, USA. Jeffrey B. Birch Virginia Polytechnic Institute and State University, Blacksburg, VA

More information

Package freeknotsplines

Package freeknotsplines Version 1.0.1 Date 2018-05-17 Package freeknotsplines June 10, 2018 Title Algorithms for Implementing Free-Knot Splines Author , Philip Smith , Pierre Lecuyer

More information

Lecture 27, April 24, Reading: See class website. Nonparametric regression and kernel smoothing. Structured sparse additive models (GroupSpAM)

Lecture 27, April 24, Reading: See class website. Nonparametric regression and kernel smoothing. Structured sparse additive models (GroupSpAM) School of Computer Science Probabilistic Graphical Models Structured Sparse Additive Models Junming Yin and Eric Xing Lecture 7, April 4, 013 Reading: See class website 1 Outline Nonparametric regression

More information

Package ICsurv. February 19, 2015

Package ICsurv. February 19, 2015 Package ICsurv February 19, 2015 Type Package Title A package for semiparametric regression analysis of interval-censored data Version 1.0 Date 2014-6-9 Author Christopher S. McMahan and Lianming Wang

More information

Package lmesplines. R topics documented: February 20, Version

Package lmesplines. R topics documented: February 20, Version Version 1.1-10 Package lmesplines February 20, 2015 Title Add smoothing spline modelling capability to nlme. Author Rod Ball Maintainer Andrzej Galecki

More information

NONPARAMETRIC REGRESSION SPLINES FOR GENERALIZED LINEAR MODELS IN THE PRESENCE OF MEASUREMENT ERROR

NONPARAMETRIC REGRESSION SPLINES FOR GENERALIZED LINEAR MODELS IN THE PRESENCE OF MEASUREMENT ERROR NONPARAMETRIC REGRESSION SPLINES FOR GENERALIZED LINEAR MODELS IN THE PRESENCE OF MEASUREMENT ERROR J. D. Maca July 1, 1997 Abstract The purpose of this manual is to demonstrate the usage of software for

More information

100 Myung Hwan Na log-hazard function. The discussion section of Abrahamowicz, et al.(1992) contains a good review of many of the papers on the use of

100 Myung Hwan Na log-hazard function. The discussion section of Abrahamowicz, et al.(1992) contains a good review of many of the papers on the use of J. KSIAM Vol.3, No.2, 99-106, 1999 SPLINE HAZARD RATE ESTIMATION USING CENSORED DATA Myung Hwan Na Abstract In this paper, the spline hazard rate model to the randomly censored data is introduced. The

More information

What is machine learning?

What is machine learning? Machine learning, pattern recognition and statistical data modelling Lecture 12. The last lecture Coryn Bailer-Jones 1 What is machine learning? Data description and interpretation finding simpler relationship

More information

Computational Physics PHYS 420

Computational Physics PHYS 420 Computational Physics PHYS 420 Dr Richard H. Cyburt Assistant Professor of Physics My office: 402c in the Science Building My phone: (304) 384-6006 My email: rcyburt@concord.edu My webpage: www.concord.edu/rcyburt

More information

Goals of the Lecture. SOC6078 Advanced Statistics: 9. Generalized Additive Models. Limitations of the Multiple Nonparametric Models (2)

Goals of the Lecture. SOC6078 Advanced Statistics: 9. Generalized Additive Models. Limitations of the Multiple Nonparametric Models (2) SOC6078 Advanced Statistics: 9. Generalized Additive Models Robert Andersen Department of Sociology University of Toronto Goals of the Lecture Introduce Additive Models Explain how they extend from simple

More information

Smoothing Scatterplots Using Penalized Splines

Smoothing Scatterplots Using Penalized Splines Smoothing Scatterplots Using Penalized Splines 1 What do we mean by smoothing? Fitting a "smooth" curve to the data in a scatterplot 2 Why would we want to fit a smooth curve to the data in a scatterplot?

More information

Nonparametric Smoothing of Yield Curves

Nonparametric Smoothing of Yield Curves Review of Quantitative Finance and Accounting, 9 (1997): 251 267 1997 Kluwer Academic Publishers, Boston. Manufactured in The Netherlands. Nonparametric Smoothing of Yield Curves CARSTEN TANGGAARD Department

More information

Interactive Graphics. Lecture 9: Introduction to Spline Curves. Interactive Graphics Lecture 9: Slide 1

Interactive Graphics. Lecture 9: Introduction to Spline Curves. Interactive Graphics Lecture 9: Slide 1 Interactive Graphics Lecture 9: Introduction to Spline Curves Interactive Graphics Lecture 9: Slide 1 Interactive Graphics Lecture 13: Slide 2 Splines The word spline comes from the ship building trade

More information

Splines. Patrick Breheny. November 20. Introduction Regression splines (parametric) Smoothing splines (nonparametric)

Splines. Patrick Breheny. November 20. Introduction Regression splines (parametric) Smoothing splines (nonparametric) Splines Patrick Breheny November 20 Patrick Breheny STA 621: Nonparametric Statistics 1/46 Introduction Introduction Problems with polynomial bases We are discussing ways to estimate the regression function

More information

A Graphical Analysis of Simultaneously Choosing the Bandwidth and Mixing Parameter for Semiparametric Regression Techniques

A Graphical Analysis of Simultaneously Choosing the Bandwidth and Mixing Parameter for Semiparametric Regression Techniques Virginia Commonwealth University VCU Scholars Compass Theses and Dissertations Graduate School 2009 A Graphical Analysis of Simultaneously Choosing the Bandwidth and Mixing Parameter for Semiparametric

More information

Additive hedonic regression models for the Austrian housing market ERES Conference, Edinburgh, June

Additive hedonic regression models for the Austrian housing market ERES Conference, Edinburgh, June for the Austrian housing market, June 14 2012 Ao. Univ. Prof. Dr. Fachbereich Stadt- und Regionalforschung Technische Universität Wien Dr. Strategic Risk Management Bank Austria UniCredit, Wien Inhalt

More information

Optimal designs for comparing curves

Optimal designs for comparing curves Optimal designs for comparing curves Holger Dette, Ruhr-Universität Bochum Maria Konstantinou, Ruhr-Universität Bochum Kirsten Schorning, Ruhr-Universität Bochum FP7 HEALTH 2013-602552 Outline 1 Motivation

More information

Linear Mixed Model Robust Regression

Linear Mixed Model Robust Regression Linear Mixed Model Robust Regression Megan J. Waterman, Jeffrey B. Birch, and Oliver Schabenberger November 5, 2006 Abstract Mixed models are powerful tools for the analysis of clustered data and many

More information

arxiv: v1 [stat.me] 2 Jun 2017

arxiv: v1 [stat.me] 2 Jun 2017 Inference for Penalized Spline Regression: Improving Confidence Intervals by Reducing the Penalty arxiv:1706.00865v1 [stat.me] 2 Jun 2017 Ning Dai School of Statistics University of Minnesota daixx224@umn.edu

More information

Estimating Curves and Derivatives with Parametric Penalized Spline Smoothing

Estimating Curves and Derivatives with Parametric Penalized Spline Smoothing Estimating Curves and Derivatives with Parametric Penalized Spline Smoothing Jiguo Cao, Jing Cai Department of Statistics & Actuarial Science, Simon Fraser University, Burnaby, BC, Canada V5A 1S6 Liangliang

More information

LOESS curve fitted to a population sampled from a sine wave with uniform noise added. The LOESS curve approximates the original sine wave.

LOESS curve fitted to a population sampled from a sine wave with uniform noise added. The LOESS curve approximates the original sine wave. LOESS curve fitted to a population sampled from a sine wave with uniform noise added. The LOESS curve approximates the original sine wave. http://en.wikipedia.org/wiki/local_regression Local regression

More information

Independent Components Analysis through Product Density Estimation

Independent Components Analysis through Product Density Estimation Independent Components Analysis through Product Density Estimation 'frevor Hastie and Rob Tibshirani Department of Statistics Stanford University Stanford, CA, 94305 { hastie, tibs } @stat.stanford. edu

More information

Smoothing and Forecasting Mortality Rates with P-splines. Iain Currie. Data and problem. Plan of talk

Smoothing and Forecasting Mortality Rates with P-splines. Iain Currie. Data and problem. Plan of talk Smoothing and Forecasting Mortality Rates with P-splines Iain Currie Heriot Watt University Data and problem Data: CMI assured lives : 20 to 90 : 1947 to 2002 Problem: forecast table to 2046 London, June

More information

Theoretical and Practical Aspects of Penalized Spline Smoothing

Theoretical and Practical Aspects of Penalized Spline Smoothing Theoretical and Practical Aspects of Penalized Spline Smoothing Dissertation zur Erlangung des Grades eines Doktors der Wirtschaftswissenschaften (Dr rer pol) der Fakultät für Wirtschaftswissenschaften

More information

Smoothing-splines Mixed-effects Models in R using the sme Package: a Tutorial

Smoothing-splines Mixed-effects Models in R using the sme Package: a Tutorial Smoothing-splines Mixed-effects Models in R using the sme Package: a Tutorial Maurice Berk Imperial College London maurice@mauriceberk.com February 9, 2018 Abstract In this vignette, the user is guided

More information

Model Assessment and Selection. Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer

Model Assessment and Selection. Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer Model Assessment and Selection Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer 1 Model Training data Testing data Model Testing error rate Training error

More information

Missing Data Analysis for the Employee Dataset

Missing Data Analysis for the Employee Dataset Missing Data Analysis for the Employee Dataset 67% of the observations have missing values! Modeling Setup For our analysis goals we would like to do: Y X N (X, 2 I) and then interpret the coefficients

More information

Analysis of Panel Data. Third Edition. Cheng Hsiao University of Southern California CAMBRIDGE UNIVERSITY PRESS

Analysis of Panel Data. Third Edition. Cheng Hsiao University of Southern California CAMBRIDGE UNIVERSITY PRESS Analysis of Panel Data Third Edition Cheng Hsiao University of Southern California CAMBRIDGE UNIVERSITY PRESS Contents Preface to the ThirdEdition Preface to the Second Edition Preface to the First Edition

More information

Nonparametric Estimation of Distribution Function using Bezier Curve

Nonparametric Estimation of Distribution Function using Bezier Curve Communications for Statistical Applications and Methods 2014, Vol. 21, No. 1, 105 114 DOI: http://dx.doi.org/10.5351/csam.2014.21.1.105 ISSN 2287-7843 Nonparametric Estimation of Distribution Function

More information

Handling Sparse and Missing Data in Functional Data Analysis: A Functional Mixed-Effects Model Approach. Kimberly L. Ward

Handling Sparse and Missing Data in Functional Data Analysis: A Functional Mixed-Effects Model Approach. Kimberly L. Ward Handling Sparse and Missing Data in Functional Data Analysis: A Functional Mixed-Effects Model Approach by Kimberly L. Ward A Thesis Presented in Partial Fulfillment of the Requirements for the Degree

More information

Latent Curve Models. A Structural Equation Perspective WILEY- INTERSCIENΠKENNETH A. BOLLEN

Latent Curve Models. A Structural Equation Perspective WILEY- INTERSCIENΠKENNETH A. BOLLEN Latent Curve Models A Structural Equation Perspective KENNETH A. BOLLEN University of North Carolina Department of Sociology Chapel Hill, North Carolina PATRICK J. CURRAN University of North Carolina Department

More information

Lecture 13: Model selection and regularization

Lecture 13: Model selection and regularization Lecture 13: Model selection and regularization Reading: Sections 6.1-6.2.1 STATS 202: Data mining and analysis October 23, 2017 1 / 17 What do we know so far In linear regression, adding predictors always

More information

AN ADDITIVE BIVARIATE HIERARCHICAL MODEL FOR FUNCTIONAL DATA AND RELATED COMPUTATIONS. A Dissertation ANDREW MIDDLETON REDD

AN ADDITIVE BIVARIATE HIERARCHICAL MODEL FOR FUNCTIONAL DATA AND RELATED COMPUTATIONS. A Dissertation ANDREW MIDDLETON REDD AN ADDITIVE BIVARIATE HIERARCHICAL MODEL FOR FUNCTIONAL DATA AND RELATED COMPUTATIONS A Dissertation by ANDREW MIDDLETON REDD Submitted to the Office of Graduate Studies of Texas A&M University in partial

More information

Applied Statistics : Practical 9

Applied Statistics : Practical 9 Applied Statistics : Practical 9 This practical explores nonparametric regression and shows how to fit a simple additive model. The first item introduces the necessary R commands for nonparametric regression

More information

Splines and penalized regression

Splines and penalized regression Splines and penalized regression November 23 Introduction We are discussing ways to estimate the regression function f, where E(y x) = f(x) One approach is of course to assume that f has a certain shape,

More information

Principal component models for sparse functional data

Principal component models for sparse functional data Principal component models for sparse functional data GARETH M. JAMES Marshall School of Business, University of Southern California, Los Angeles, California 90089-0809 gareth@usc.edu TREVOR J. HASTIE

More information

Bayesian Time-Stratified-Petersen estimators for abundance. Sampling Protocol. Simon J. Bonner (UBC) Carl James Schwarz (SFU)

Bayesian Time-Stratified-Petersen estimators for abundance. Sampling Protocol. Simon J. Bonner (UBC) Carl James Schwarz (SFU) Bayesian Time-Stratified-Petersen estimators for abundance Sampling Protocol Simon J. Bonner (UBC) Carl James Schwarz (SFU) cschwarz@stat.sfu.ca Simple-Petersen or Stratified-Petersen methods are often

More information

Comment. J. L. FRENCH, E. E. KAMMANN, and M. F? WAND. a: 1 = -{lly -X~-Z~U~~'+AU~Z~U). sion involves minimization of

Comment. J. L. FRENCH, E. E. KAMMANN, and M. F? WAND. a: 1 = -{lly -X~-Z~U~~'+AU~Z~U). sion involves minimization of French, Kammann, and Wand: Comment J. L. FRENCH, E. E. KAMMANN, and M. F? WAND Comment 1. INTRODUCTION Semiparametric nonlinear mixed models are a useful addition to the regression modeling arsenal, and

More information

Machine Learning and Data Mining. Clustering (1): Basics. Kalev Kask

Machine Learning and Data Mining. Clustering (1): Basics. Kalev Kask Machine Learning and Data Mining Clustering (1): Basics Kalev Kask Unsupervised learning Supervised learning Predict target value ( y ) given features ( x ) Unsupervised learning Understand patterns of

More information

Linear Methods for Regression and Shrinkage Methods

Linear Methods for Regression and Shrinkage Methods Linear Methods for Regression and Shrinkage Methods Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer 1 Linear Regression Models Least Squares Input vectors

More information

REPLACING MLE WITH BAYESIAN SHRINKAGE CAS ANNUAL MEETING NOVEMBER 2018 GARY G. VENTER

REPLACING MLE WITH BAYESIAN SHRINKAGE CAS ANNUAL MEETING NOVEMBER 2018 GARY G. VENTER REPLACING MLE WITH BAYESIAN SHRINKAGE CAS ANNUAL MEETING NOVEMBER 2018 GARY G. VENTER ESTIMATION Problems with MLE known since Charles Stein 1956 paper He showed that when estimating 3 or more means, shrinking

More information

BASIC LOESS, PBSPLINE & SPLINE

BASIC LOESS, PBSPLINE & SPLINE CURVES AND SPLINES DATA INTERPOLATION SGPLOT provides various methods for fitting smooth trends to scatterplot data LOESS An extension of LOWESS (Locally Weighted Scatterplot Smoothing), uses locally weighted

More information

Resampling Methods. Levi Waldron, CUNY School of Public Health. July 13, 2016

Resampling Methods. Levi Waldron, CUNY School of Public Health. July 13, 2016 Resampling Methods Levi Waldron, CUNY School of Public Health July 13, 2016 Outline and introduction Objectives: prediction or inference? Cross-validation Bootstrap Permutation Test Monte Carlo Simulation

More information

9.1 Random coefficients models Constructed data Consumer preference mapping of carrots... 10

9.1 Random coefficients models Constructed data Consumer preference mapping of carrots... 10 St@tmaster 02429/MIXED LINEAR MODELS PREPARED BY THE STATISTICS GROUPS AT IMM, DTU AND KU-LIFE Module 9: R 9.1 Random coefficients models...................... 1 9.1.1 Constructed data........................

More information

Nonparametric Methods Recap

Nonparametric Methods Recap Nonparametric Methods Recap Aarti Singh Machine Learning 10-701/15-781 Oct 4, 2010 Nonparametric Methods Kernel Density estimate (also Histogram) Weighted frequency Classification - K-NN Classifier Majority

More information

This is called a linear basis expansion, and h m is the mth basis function For example if X is one-dimensional: f (X) = β 0 + β 1 X + β 2 X 2, or

This is called a linear basis expansion, and h m is the mth basis function For example if X is one-dimensional: f (X) = β 0 + β 1 X + β 2 X 2, or STA 450/4000 S: February 2 2005 Flexible modelling using basis expansions (Chapter 5) Linear regression: y = Xβ + ɛ, ɛ (0, σ 2 ) Smooth regression: y = f (X) + ɛ: f (X) = E(Y X) to be specified Flexible

More information

Package robustgam. January 7, 2013

Package robustgam. January 7, 2013 Package robustgam January 7, 2013 Type Package Title Robust Estimation for Generalized Additive Models Version 0.1.5 Date 2013-1-6 Author Raymond K. W. Wong Maintainer Raymond K. W. Wong

More information

Spline Models. Introduction to CS and NCS. Regression splines. Smoothing splines

Spline Models. Introduction to CS and NCS. Regression splines. Smoothing splines Spline Models Introduction to CS and NCS Regression splines Smoothing splines 3 Cubic Splines a knots: a< 1 < 2 < < m

More information

Preface to the Second Edition. Preface to the First Edition. 1 Introduction 1

Preface to the Second Edition. Preface to the First Edition. 1 Introduction 1 Preface to the Second Edition Preface to the First Edition vii xi 1 Introduction 1 2 Overview of Supervised Learning 9 2.1 Introduction... 9 2.2 Variable Types and Terminology... 9 2.3 Two Simple Approaches

More information

A Bayesian approach to detect time-specific group differences between nonlinear temporal curves

A Bayesian approach to detect time-specific group differences between nonlinear temporal curves University of Iowa Iowa Research Online Theses and Dissertations Spring 2016 A Bayesian approach to detect time-specific group differences between nonlinear temporal curves Melissa Anna Maria Pugh University

More information

Assessing the Quality of the Natural Cubic Spline Approximation

Assessing the Quality of the Natural Cubic Spline Approximation Assessing the Quality of the Natural Cubic Spline Approximation AHMET SEZER ANADOLU UNIVERSITY Department of Statisticss Yunus Emre Kampusu Eskisehir TURKEY ahsst12@yahoo.com Abstract: In large samples,

More information

Nonparametric Regression

Nonparametric Regression Nonparametric Regression John Fox Department of Sociology McMaster University 1280 Main Street West Hamilton, Ontario Canada L8S 4M4 jfox@mcmaster.ca February 2004 Abstract Nonparametric regression analysis

More information

PSY 9556B (Feb 5) Latent Growth Modeling

PSY 9556B (Feb 5) Latent Growth Modeling PSY 9556B (Feb 5) Latent Growth Modeling Fixed and random word confusion Simplest LGM knowing how to calculate dfs How many time points needed? Power, sample size Nonlinear growth quadratic Nonlinear growth

More information

Penalized Spline Model-Based Estimation of the Finite Populations Total from Probability-Proportional-to-Size Samples

Penalized Spline Model-Based Estimation of the Finite Populations Total from Probability-Proportional-to-Size Samples Journal of Of cial Statistics, Vol. 19, No. 2, 2003, pp. 99±117 Penalized Spline Model-Based Estimation of the Finite Populations Total from Probability-Proportional-to-Size Samples Hui Zheng 1 and Roderick

More information

Lecture 7: Splines and Generalized Additive Models

Lecture 7: Splines and Generalized Additive Models Lecture 7: and Generalized Additive Models Computational Statistics Thierry Denœux April, 2016 Introduction Overview Introduction Simple approaches Polynomials Step functions Regression splines Natural

More information

Package robustgam. February 20, 2015

Package robustgam. February 20, 2015 Type Package Package robustgam February 20, 2015 Title Robust Estimation for Generalized Additive Models Version 0.1.7 Date 2013-5-7 Author Raymond K. W. Wong, Fang Yao and Thomas C. M. Lee Maintainer

More information

Package GAMBoost. February 19, 2015

Package GAMBoost. February 19, 2015 Version 1.2-3 Package GAMBoost February 19, 2015 Title Generalized linear and additive models by likelihood based boosting Author Harald Binder Maintainer Harald Binder

More information

ME 261: Numerical Analysis Lecture-12: Numerical Interpolation

ME 261: Numerical Analysis Lecture-12: Numerical Interpolation 1 ME 261: Numerical Analysis Lecture-12: Numerical Interpolation Md. Tanver Hossain Department of Mechanical Engineering, BUET http://tantusher.buet.ac.bd 2 Inverse Interpolation Problem : Given a table

More information

Nonparametric Regression and Generalized Additive Models Part I

Nonparametric Regression and Generalized Additive Models Part I SPIDA, June 2004 Nonparametric Regression and Generalized Additive Models Part I Robert Andersen McMaster University Plan of the Lecture 1. Detecting nonlinearity Fitting a linear model to a nonlinear

More information

Extending the GLM. Outline. Mixed effects motivation Evaluating mixed effects methods Three methods. Conclusions. Overview

Extending the GLM. Outline. Mixed effects motivation Evaluating mixed effects methods Three methods. Conclusions. Overview Extending the GLM So far, we have considered the GLM for one run in one subject The same logic can be applied to multiple runs and multiple subjects GLM Stats For any given region, we can evaluate the

More information

Machine Learning. Topic 4: Linear Regression Models

Machine Learning. Topic 4: Linear Regression Models Machine Learning Topic 4: Linear Regression Models (contains ideas and a few images from wikipedia and books by Alpaydin, Duda/Hart/ Stork, and Bishop. Updated Fall 205) Regression Learning Task There

More information

Multivariable Regression Modelling

Multivariable Regression Modelling Multivariable Regression Modelling A review of available spline packages in R. Aris Perperoglou for TG2 ISCB 2015 Aris Perperoglou for TG2 Multivariable Regression Modelling ISCB 2015 1 / 41 TG2 Members

More information

CS 229 Midterm Review

CS 229 Midterm Review CS 229 Midterm Review Course Staff Fall 2018 11/2/2018 Outline Today: SVMs Kernels Tree Ensembles EM Algorithm / Mixture Models [ Focus on building intuition, less so on solving specific problems. Ask

More information