NONPARAMETRIC REGRESSION WIT MEASUREMENT ERROR: SOME RECENT PR David Ruppert Cornell University

Size: px
Start display at page:

Download "NONPARAMETRIC REGRESSION WIT MEASUREMENT ERROR: SOME RECENT PR David Ruppert Cornell University"

Transcription

1 NONPARAMETRIC REGRESSION WIT MEASUREMENT ERROR: SOME RECENT PR David Ruppert Cornell University davidr (These transparencies, preprints, and references a link to Recent Talks and Recent Papers Work done jointly with Scott Berry, Berry Consultants R. J. Carroll, Texas A & M Jeff Maca, Novartis John Staudenmayer, U Mass

2 OUTLINE The problem nonparametric regression with m error Review of the currently available estimators New Bayesian spline approach (Berry, Carroll, 2002, JASA) Simulation results

3 THE PROBLEM OF MEASUREMENT ERR ILLUSTRATION 1.5 Gold Standard data true curve Spline Data with measureme y y x w

4 THE PROBLEM OF MEASUREMENT ERR ILLUSTRATION Gold Standard data true curve Spline Data with measureme y 0 y x w

5 THE PROBLEM OF MEASUREMENT ERR The regression model is where is only known to be smooth Observe and where var( ) = normally distributed Measurement error variance is estimated from! cate data. (Observe,.)

6 THE PROBLEM OF MEASUREMENT ERROR Measurement error occurs in a wide variety of pro Measuring nutrient intake Measuring airborne lead exposure Measuring blood pressure C"$# dating The effects of measurement error are: biased estimates of the regression curve increase in the perceived variability about the re

7 THE PROBLEM OF MEASUREMENT ERROR Other than the work of Fan and Truong (1993, A had been little done on nonparametric regression w ment error until Carroll, Maca, and Ruppert (1999, Biometrika) ( Berry, Carroll, Ruppert (2002, JASA) (BCR)

8 % REVIEW OF CURRENT ESTIMATORS Globally consistent nonparametric regression by d kernels (Fan and Truong, 1993, Annals) does not work so well % Fan & Truong show very poor asymptotic rat gence we have simulations showing poor finite-samp no methodology for bandwidth selection or infer

9 REVIEW OF CURRENT ESTIMATORS Standard measurement error method: SIMEX ' functional no assumptions on & very general can be applied to nearly any m error problem, parametric or nonparametric Structural Spline Regression splines for basic regression model Mixtures of normals for covariate density model Emphasis is on flexible parametric modeling, not ric modeling. (Little or no difference in practice.)

10 SIMEX The SIMEX method is due to Cook & Stefanski (1 The theory is in Carroll, et al. (1996, JASA) Also see Carroll, Ruppert, and Stefanski (1995, M Error in Nonlinear Models) SIMEX has been previously applied to parametric makes no assumptions about the true s. (Fun results in estimators which are approximately co consistent at least to order ( Here is the method, defined via a graph. ).

11 SIMEX, ILLUSTRATED Illustration of SIMEX SIMEX Estimate 0.8 Coefficient Naive Estimate * * + +, Measurement Error Variance

12 SIMEX CMR applied the SIMEX to nonparametric regress CMR have asymptotic theory in the local polynom (LPR) context. The estimators have the usual rates of converge They are approximately consistent, to order ( An asymptotic theory with rates seems very difficu but, simulations in CMR indicate that SIMEX/sp little better than SIMEX/kernel problem seems due to undersmoothing

13 SIMEX Staudenmayer (2000, Cornell PhD thesis) looked selection for SIMEX/LPR. With better bandwidth selection, SIMEX/LPR i with other methods.

14 9 STRUCTURAL MODELING The regression of on the observed is - / If we had a model 687 for and if we knew we could estimate 687 by minimizing over the ;: " < We need two things to make this work: convenient flexible form for convenient flexible distribution for. >=?1

15 H H D Model REGRESSION SPLINES D I " A 647 BC EJ A E:GF E: The key remaining issue: the joint distribution of ' CMR used a mixtures of normals for & and Gi to estimate the parameters. % This is an extension to measurement error Roeder & Wasserman (JASA, 1997).

16 FULLY BAYESIAN MODEL What s New? Answer: Fully Bayesian MCMC method in BCR Uses splines smoothing or penalized P-splines in this talk Structural are iid normal but seems robust to violations of normality

17 FULLY BAYESIAN MODEL Smoothing parameter is automatic Inference adjusts for the data-based smoothing p for measurement error Allow global confidence bands

18 FULLY BAYESIAN MODEL PARAMET Regression Model is a P-spline ;L iid K M Measurement Error Model N where N iid K ;L Structural Model AOPQ P iid K Parameters: 7 R SO P P

19 % P R R T Priors 7 is K FULLY BAYESIAN MODEL PARAMET 5L UT SV " ing parameter.] T ;Y Z[]\ Z^ is Gamma R is Inv-Gamma ;Y R ]\ is Inv-Gamma ;Y ]\ OP _=`PQSa P is K ;Y P ]\ P is Inv-Gamma Hyperparameters: Y R where is known. [W ]\ by ]\ by PQ]\ PQc=`PdSa BX R i P by Z all fixed at values making the priors noninformat E.g., a P e ).

20 % % & " Iterate through 7 R GIBBS SAMPLING P SO P ft S gs 9. All steps except one are easy, either gamma, inv or normal E.g., Here 7 ih 'kj l monqp rts other parameters u v/rtw x T V " xkh y m/z R x T V " is one of the other parameters Essentially we re fitting a spline to the impute observed s

21 % 7 Estimate of 7, call it { GIBBS SAMPLING, is x T V " x h averaged over T and.

22 < &. < R 9 GIBBS SAMPLING The exception to the sampling being quick and e Metropolis-Hastings step is needed for S " OPQ M P 7 } vi~>k < E: " 647 ƒ0 ih ' < < P A < OP>

23 7 7 W p BAYESIAN INFERENCE Let be the spline basis function evaluated on a i c' some interval, &. i c' is the curve on &.. { is the estimated curve. Let ˆ be the 4 < W MCMC sample quantile of 7 7 rš~ < { grid SD 7 Then, 7 { ˆ Œ cž SD 7 is a < i c' &. % simultaneous confidence band fo

24 7 7 W p ˆ BAYESIAN INFERENCE Let be derivatives of the spline basis function e i c' fine grid over &. i c' is the curve s derivative on &. { Let ˆ Then, is a < i i' on &. is the estimated derivative. be the 4 < W MCMC sample quantile of 7 7 rš~ < { grid SD 7 7 { Œ cž SD % simultaneous confidence band for 7

25 SIMULATIONS The six cases were considered. in each case. Case 1 The regression function is w A 1-12 S 1 wš E0. S e LC LCœ with, M,, OP P and Gold Standard data true curve Spline y x

26 Case 2 Same as Case 1 except.

27 Case 3 A modification of Case 1 above except that

28 1 J < Case 4 Case 1 of CMR so that 1[Ÿd -1, with OP L P L and. -12 et1d J 8, M 12 ž J L L,

29 Case 5 A modification of Case 4 of CMR so that with g, M LCo -12 e, S L i wš 12 ], OP Lª and

30 Case 6 The same as Case 1 above except that is chi-square(4) random variable. (Tests robustness tion of the structural assumptions.)

31 Mean Squared Bias «k Method Case 1 Case 2 Case 3 Case 4 Ca Naive ,108 3, Bayes Structural, 5 knots Structural, 15 knots Mean Squared Error «[ Method Case 1 Case 2 Case 3 Case 4 Ca Naive ,155 3, Bayes , Structural, 5 knots , Structural, 15 knots Results based on 200 Monte Carlo simulations for each was not included in the table it was not among the b

32 wš 8 is K R L L EXAMPLE SIMULATED for all ± 15 knot quadratic P-splines 2,000 iterations of Gibbs. First 667 deleted as bur

33 Gold Standard Data with measur y 0 y x wbar wbar mhat x x

34 < 10 beta(1) 0.5 yhat(1) 1.5 yhat(51) yhat(101) log(gamma) x(1) log(σ ) e sigma u mu x Results of Gibbs Sampling. Every twentieth ite Note: 4 ² ³ 4 LCœ < and. Also, log

35 < < < < <. { EXAMPLE SIMULATED What does the Bayes approach work so well? Here s tion: Bayes uses all possible information to estimate cially,. <. µ A µ A A ih other param. 0 rez v S0 ² ` ih. other param. 0 _rez v 0. { A 4 el C

36 DISCUSSION With the work of CMR and BCR we now have r ficient estimators for nonparametric regression w ment error. SIMEX (LPR and splines) in CMR (Flexible) Structural splines in CMR Fully Bayesian (hardcore structural) in BCR

37 DISCUSSION With BCR we have a methodology that automatically selects the amount of smoothing estimates the unknown s allows inference that takes account of the effects parameter selection and measurement error Most efficient methods appear to be structural, th may be competitive hardcore structural methods seem reasonably r

davidr Cornell University

davidr Cornell University 1 NONPARAMETRIC RANDOM EFFECTS MODELS AND LIKELIHOOD RATIO TESTS Oct 11, 2002 David Ruppert Cornell University www.orie.cornell.edu/ davidr (These transparencies and preprints available link to Recent

More information

Bayesian Smoothing and Regression Splines for Measurement Error Problems

Bayesian Smoothing and Regression Splines for Measurement Error Problems Bayesian Smoothing and Regression Splines for Measurement Error Problems Scott M. Berry, Raymond J. Carroll, and David Ruppert In the presence of covariate measurement error, estimating a regression function

More information

NONPARAMETRIC REGRESSION SPLINES FOR GENERALIZED LINEAR MODELS IN THE PRESENCE OF MEASUREMENT ERROR

NONPARAMETRIC REGRESSION SPLINES FOR GENERALIZED LINEAR MODELS IN THE PRESENCE OF MEASUREMENT ERROR NONPARAMETRIC REGRESSION SPLINES FOR GENERALIZED LINEAR MODELS IN THE PRESENCE OF MEASUREMENT ERROR J. D. Maca July 1, 1997 Abstract The purpose of this manual is to demonstrate the usage of software for

More information

Nonparametric Mixed-Effects Models for Longitudinal Data

Nonparametric Mixed-Effects Models for Longitudinal Data Nonparametric Mixed-Effects Models for Longitudinal Data Zhang Jin-Ting Dept of Stat & Appl Prob National University of Sinagpore University of Seoul, South Korea, 7 p.1/26 OUTLINE The Motivating Data

More information

Nonparametric regression using kernel and spline methods

Nonparametric regression using kernel and spline methods Nonparametric regression using kernel and spline methods Jean D. Opsomer F. Jay Breidt March 3, 016 1 The statistical model When applying nonparametric regression methods, the researcher is interested

More information

Statistical Matching using Fractional Imputation

Statistical Matching using Fractional Imputation Statistical Matching using Fractional Imputation Jae-Kwang Kim 1 Iowa State University 1 Joint work with Emily Berg and Taesung Park 1 Introduction 2 Classical Approaches 3 Proposed method 4 Application:

More information

Linear Modeling with Bayesian Statistics

Linear Modeling with Bayesian Statistics Linear Modeling with Bayesian Statistics Bayesian Approach I I I I I Estimate probability of a parameter State degree of believe in specific parameter values Evaluate probability of hypothesis given the

More information

Calibration and emulation of TIE-GCM

Calibration and emulation of TIE-GCM Calibration and emulation of TIE-GCM Serge Guillas School of Mathematics Georgia Institute of Technology Jonathan Rougier University of Bristol Big Thanks to Crystal Linkletter (SFU-SAMSI summer school)

More information

A noninformative Bayesian approach to small area estimation

A noninformative Bayesian approach to small area estimation A noninformative Bayesian approach to small area estimation Glen Meeden School of Statistics University of Minnesota Minneapolis, MN 55455 glen@stat.umn.edu September 2001 Revised May 2002 Research supported

More information

Markov Chain Monte Carlo (part 1)

Markov Chain Monte Carlo (part 1) Markov Chain Monte Carlo (part 1) Edps 590BAY Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Spring 2018 Depending on the book that you select for

More information

A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection (Kohavi, 1995)

A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection (Kohavi, 1995) A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection (Kohavi, 1995) Department of Information, Operations and Management Sciences Stern School of Business, NYU padamopo@stern.nyu.edu

More information

Approximate Bayesian Computation. Alireza Shafaei - April 2016

Approximate Bayesian Computation. Alireza Shafaei - April 2016 Approximate Bayesian Computation Alireza Shafaei - April 2016 The Problem Given a dataset, we are interested in. The Problem Given a dataset, we are interested in. The Problem Given a dataset, we are interested

More information

Modeling Criminal Careers as Departures From a Unimodal Population Age-Crime Curve: The Case of Marijuana Use

Modeling Criminal Careers as Departures From a Unimodal Population Age-Crime Curve: The Case of Marijuana Use Modeling Criminal Careers as Departures From a Unimodal Population Curve: The Case of Marijuana Use Donatello Telesca, Elena A. Erosheva, Derek A. Kreader, & Ross Matsueda April 15, 2014 extends Telesca

More information

Moving Beyond Linearity

Moving Beyond Linearity Moving Beyond Linearity The truth is never linear! 1/23 Moving Beyond Linearity The truth is never linear! r almost never! 1/23 Moving Beyond Linearity The truth is never linear! r almost never! But often

More information

Programs for MDE Modeling and Conditional Distribution Calculation

Programs for MDE Modeling and Conditional Distribution Calculation Programs for MDE Modeling and Conditional Distribution Calculation Sahyun Hong and Clayton V. Deutsch Improved numerical reservoir models are constructed when all available diverse data sources are accounted

More information

Spatially Adaptive Bayesian Penalized Regression Splines (P-splines)

Spatially Adaptive Bayesian Penalized Regression Splines (P-splines) Spatially Adaptive Bayesian Penalized Regression Splines (P-splines) Veerabhadran BALADANDAYUTHAPANI, Bani K. MALLICK, and Raymond J. CARROLL In this article we study penalized regression splines (P-splines),

More information

Bayesian Estimation for Skew Normal Distributions Using Data Augmentation

Bayesian Estimation for Skew Normal Distributions Using Data Augmentation The Korean Communications in Statistics Vol. 12 No. 2, 2005 pp. 323-333 Bayesian Estimation for Skew Normal Distributions Using Data Augmentation Hea-Jung Kim 1) Abstract In this paper, we develop a MCMC

More information

1 Methods for Posterior Simulation

1 Methods for Posterior Simulation 1 Methods for Posterior Simulation Let p(θ y) be the posterior. simulation. Koop presents four methods for (posterior) 1. Monte Carlo integration: draw from p(θ y). 2. Gibbs sampler: sequentially drawing

More information

Analyzing Longitudinal Data Using Regression Splines

Analyzing Longitudinal Data Using Regression Splines Analyzing Longitudinal Data Using Regression Splines Zhang Jin-Ting Dept of Stat & Appl Prob National University of Sinagpore August 18, 6 DSAP, NUS p.1/16 OUTLINE Motivating Longitudinal Data Parametric

More information

Nonparametric Estimation of Distribution Function using Bezier Curve

Nonparametric Estimation of Distribution Function using Bezier Curve Communications for Statistical Applications and Methods 2014, Vol. 21, No. 1, 105 114 DOI: http://dx.doi.org/10.5351/csam.2014.21.1.105 ISSN 2287-7843 Nonparametric Estimation of Distribution Function

More information

arxiv: v1 [stat.me] 2 Jun 2017

arxiv: v1 [stat.me] 2 Jun 2017 Inference for Penalized Spline Regression: Improving Confidence Intervals by Reducing the Penalty arxiv:1706.00865v1 [stat.me] 2 Jun 2017 Ning Dai School of Statistics University of Minnesota daixx224@umn.edu

More information

Level-set MCMC Curve Sampling and Geometric Conditional Simulation

Level-set MCMC Curve Sampling and Geometric Conditional Simulation Level-set MCMC Curve Sampling and Geometric Conditional Simulation Ayres Fan John W. Fisher III Alan S. Willsky February 16, 2007 Outline 1. Overview 2. Curve evolution 3. Markov chain Monte Carlo 4. Curve

More information

Markov chain Monte Carlo methods

Markov chain Monte Carlo methods Markov chain Monte Carlo methods (supplementary material) see also the applet http://www.lbreyer.com/classic.html February 9 6 Independent Hastings Metropolis Sampler Outline Independent Hastings Metropolis

More information

Monte Carlo Methods and Statistical Computing: My Personal E

Monte Carlo Methods and Statistical Computing: My Personal E Monte Carlo Methods and Statistical Computing: My Personal Experience Department of Mathematics & Statistics Indian Institute of Technology Kanpur November 29, 2014 Outline Preface 1 Preface 2 3 4 5 6

More information

Nested Sampling: Introduction and Implementation

Nested Sampling: Introduction and Implementation UNIVERSITY OF TEXAS AT SAN ANTONIO Nested Sampling: Introduction and Implementation Liang Jing May 2009 1 1 ABSTRACT Nested Sampling is a new technique to calculate the evidence, Z = P(D M) = p(d θ, M)p(θ

More information

Non-Parametric and Semi-Parametric Methods for Longitudinal Data

Non-Parametric and Semi-Parametric Methods for Longitudinal Data PART III Non-Parametric and Semi-Parametric Methods for Longitudinal Data CHAPTER 8 Non-parametric and semi-parametric regression methods: Introduction and overview Xihong Lin and Raymond J. Carroll Contents

More information

Statistical techniques for data analysis in Cosmology

Statistical techniques for data analysis in Cosmology Statistical techniques for data analysis in Cosmology arxiv:0712.3028; arxiv:0911.3105 Numerical recipes (the bible ) Licia Verde ICREA & ICC UB-IEEC http://icc.ub.edu/~liciaverde outline Lecture 1: Introduction

More information

AMCMC: An R interface for adaptive MCMC

AMCMC: An R interface for adaptive MCMC AMCMC: An R interface for adaptive MCMC by Jeffrey S. Rosenthal * (February 2007) Abstract. We describe AMCMC, a software package for running adaptive MCMC algorithms on user-supplied density functions.

More information

MCMC Diagnostics. Yingbo Li MATH Clemson University. Yingbo Li (Clemson) MCMC Diagnostics MATH / 24

MCMC Diagnostics. Yingbo Li MATH Clemson University. Yingbo Li (Clemson) MCMC Diagnostics MATH / 24 MCMC Diagnostics Yingbo Li Clemson University MATH 9810 Yingbo Li (Clemson) MCMC Diagnostics MATH 9810 1 / 24 Convergence to Posterior Distribution Theory proves that if a Gibbs sampler iterates enough,

More information

Bootstrapping Methods

Bootstrapping Methods Bootstrapping Methods example of a Monte Carlo method these are one Monte Carlo statistical method some Bayesian statistical methods are Monte Carlo we can also simulate models using Monte Carlo methods

More information

COPYRIGHTED MATERIAL CONTENTS

COPYRIGHTED MATERIAL CONTENTS PREFACE ACKNOWLEDGMENTS LIST OF TABLES xi xv xvii 1 INTRODUCTION 1 1.1 Historical Background 1 1.2 Definition and Relationship to the Delta Method and Other Resampling Methods 3 1.2.1 Jackknife 6 1.2.2

More information

Splines. Patrick Breheny. November 20. Introduction Regression splines (parametric) Smoothing splines (nonparametric)

Splines. Patrick Breheny. November 20. Introduction Regression splines (parametric) Smoothing splines (nonparametric) Splines Patrick Breheny November 20 Patrick Breheny STA 621: Nonparametric Statistics 1/46 Introduction Introduction Problems with polynomial bases We are discussing ways to estimate the regression function

More information

Dynamic Thresholding for Image Analysis

Dynamic Thresholding for Image Analysis Dynamic Thresholding for Image Analysis Statistical Consulting Report for Edward Chan Clean Energy Research Center University of British Columbia by Libo Lu Department of Statistics University of British

More information

Missing Data Analysis for the Employee Dataset

Missing Data Analysis for the Employee Dataset Missing Data Analysis for the Employee Dataset 67% of the observations have missing values! Modeling Setup For our analysis goals we would like to do: Y X N (X, 2 I) and then interpret the coefficients

More information

100 Myung Hwan Na log-hazard function. The discussion section of Abrahamowicz, et al.(1992) contains a good review of many of the papers on the use of

100 Myung Hwan Na log-hazard function. The discussion section of Abrahamowicz, et al.(1992) contains a good review of many of the papers on the use of J. KSIAM Vol.3, No.2, 99-106, 1999 SPLINE HAZARD RATE ESTIMATION USING CENSORED DATA Myung Hwan Na Abstract In this paper, the spline hazard rate model to the randomly censored data is introduced. The

More information

Missing Data Analysis for the Employee Dataset

Missing Data Analysis for the Employee Dataset Missing Data Analysis for the Employee Dataset 67% of the observations have missing values! Modeling Setup Random Variables: Y i =(Y i1,...,y ip ) 0 =(Y i,obs, Y i,miss ) 0 R i =(R i1,...,r ip ) 0 ( 1

More information

Introduction to WinBUGS

Introduction to WinBUGS Introduction to WinBUGS Joon Jin Song Department of Statistics Texas A&M University Introduction: BUGS BUGS: Bayesian inference Using Gibbs Sampling Bayesian Analysis of Complex Statistical Models using

More information

Machine Learning / Jan 27, 2010

Machine Learning / Jan 27, 2010 Revisiting Logistic Regression & Naïve Bayes Aarti Singh Machine Learning 10-701/15-781 Jan 27, 2010 Generative and Discriminative Classifiers Training classifiers involves learning a mapping f: X -> Y,

More information

Nonparametric and Semiparametric Econometrics Lecture Notes for Econ 221. Yixiao Sun Department of Economics, University of California, San Diego

Nonparametric and Semiparametric Econometrics Lecture Notes for Econ 221. Yixiao Sun Department of Economics, University of California, San Diego Nonparametric and Semiparametric Econometrics Lecture Notes for Econ 221 Yixiao Sun Department of Economics, University of California, San Diego Winter 2007 Contents Preface ix 1 Kernel Smoothing: Density

More information

Ludwig Fahrmeir Gerhard Tute. Statistical odelling Based on Generalized Linear Model. íecond Edition. . Springer

Ludwig Fahrmeir Gerhard Tute. Statistical odelling Based on Generalized Linear Model. íecond Edition. . Springer Ludwig Fahrmeir Gerhard Tute Statistical odelling Based on Generalized Linear Model íecond Edition. Springer Preface to the Second Edition Preface to the First Edition List of Examples List of Figures

More information

Section 4 Matching Estimator

Section 4 Matching Estimator Section 4 Matching Estimator Matching Estimators Key Idea: The matching method compares the outcomes of program participants with those of matched nonparticipants, where matches are chosen on the basis

More information

Nonparametric Risk Attribution for Factor Models of Portfolios. October 3, 2017 Kellie Ottoboni

Nonparametric Risk Attribution for Factor Models of Portfolios. October 3, 2017 Kellie Ottoboni Nonparametric Risk Attribution for Factor Models of Portfolios October 3, 2017 Kellie Ottoboni Outline The problem Page 3 Additive model of returns Page 7 Euler s formula for risk decomposition Page 11

More information

Splines and penalized regression

Splines and penalized regression Splines and penalized regression November 23 Introduction We are discussing ways to estimate the regression function f, where E(y x) = f(x) One approach is of course to assume that f has a certain shape,

More information

Bayesian Inference for Sample Surveys

Bayesian Inference for Sample Surveys Bayesian Inference for Sample Surveys Trivellore Raghunathan (Raghu) Director, Survey Research Center Professor of Biostatistics University of Michigan Distinctive features of survey inference 1. Primary

More information

Statistics (STAT) Statistics (STAT) 1. Prerequisites: grade in C- or higher in STAT 1200 or STAT 1300 or STAT 1400

Statistics (STAT) Statistics (STAT) 1. Prerequisites: grade in C- or higher in STAT 1200 or STAT 1300 or STAT 1400 Statistics (STAT) 1 Statistics (STAT) STAT 1200: Introductory Statistical Reasoning Statistical concepts for critically evaluation quantitative information. Descriptive statistics, probability, estimation,

More information

SPATIALLY-ADAPTIVE PENALTIES FOR SPLINE FITTING

SPATIALLY-ADAPTIVE PENALTIES FOR SPLINE FITTING SPATIALLY-ADAPTIVE PENALTIES FOR SPLINE FITTING David Ruppert and Raymond J. Carroll January 6, 1999 Revised, July 17, 1999 Abstract We study spline fitting with a roughness penalty that adapts to spatial

More information

A Nonparametric Bayesian Approach to Detecting Spatial Activation Patterns in fmri Data

A Nonparametric Bayesian Approach to Detecting Spatial Activation Patterns in fmri Data A Nonparametric Bayesian Approach to Detecting Spatial Activation Patterns in fmri Data Seyoung Kim, Padhraic Smyth, and Hal Stern Bren School of Information and Computer Sciences University of California,

More information

Nonparametric Density Estimation

Nonparametric Density Estimation Nonparametric Estimation Data: X 1,..., X n iid P where P is a distribution with density f(x). Aim: Estimation of density f(x) Parametric density estimation: Fit parametric model {f(x θ) θ Θ} to data parameter

More information

Homework. Gaussian, Bishop 2.3 Non-parametric, Bishop 2.5 Linear regression Pod-cast lecture on-line. Next lectures:

Homework. Gaussian, Bishop 2.3 Non-parametric, Bishop 2.5 Linear regression Pod-cast lecture on-line. Next lectures: Homework Gaussian, Bishop 2.3 Non-parametric, Bishop 2.5 Linear regression 3.0-3.2 Pod-cast lecture on-line Next lectures: I posted a rough plan. It is flexible though so please come with suggestions Bayes

More information

What is machine learning?

What is machine learning? Machine learning, pattern recognition and statistical data modelling Lecture 12. The last lecture Coryn Bailer-Jones 1 What is machine learning? Data description and interpretation finding simpler relationship

More information

Lecture 24: Generalized Additive Models Stat 704: Data Analysis I, Fall 2010

Lecture 24: Generalized Additive Models Stat 704: Data Analysis I, Fall 2010 Lecture 24: Generalized Additive Models Stat 704: Data Analysis I, Fall 2010 Tim Hanson, Ph.D. University of South Carolina T. Hanson (USC) Stat 704: Data Analysis I, Fall 2010 1 / 26 Additive predictors

More information

Bayesian Analysis of Extended Lomax Distribution

Bayesian Analysis of Extended Lomax Distribution Bayesian Analysis of Extended Lomax Distribution Shankar Kumar Shrestha and Vijay Kumar 2 Public Youth Campus, Tribhuvan University, Nepal 2 Department of Mathematics and Statistics DDU Gorakhpur University,

More information

CHAPTER 1 INTRODUCTION

CHAPTER 1 INTRODUCTION Introduction CHAPTER 1 INTRODUCTION Mplus is a statistical modeling program that provides researchers with a flexible tool to analyze their data. Mplus offers researchers a wide choice of models, estimators,

More information

Package sspse. August 26, 2018

Package sspse. August 26, 2018 Type Package Version 0.6 Date 2018-08-24 Package sspse August 26, 2018 Title Estimating Hidden Population Size using Respondent Driven Sampling Data Maintainer Mark S. Handcock

More information

Predictor Selection Algorithm for Bayesian Lasso

Predictor Selection Algorithm for Bayesian Lasso Predictor Selection Algorithm for Baesian Lasso Quan Zhang Ma 16, 2014 1 Introduction The Lasso [1] is a method in regression model for coefficients shrinkage and model selection. It is often used in the

More information

Assessing the Quality of the Natural Cubic Spline Approximation

Assessing the Quality of the Natural Cubic Spline Approximation Assessing the Quality of the Natural Cubic Spline Approximation AHMET SEZER ANADOLU UNIVERSITY Department of Statisticss Yunus Emre Kampusu Eskisehir TURKEY ahsst12@yahoo.com Abstract: In large samples,

More information

Canopy Light: Synthesizing multiple data sources

Canopy Light: Synthesizing multiple data sources Canopy Light: Synthesizing multiple data sources Tree growth depends upon light (previous example, lab 7) Hard to measure how much light an ADULT tree receives Multiple sources of proxy data Exposed Canopy

More information

Statistical Modeling with Spline Functions Methodology and Theory

Statistical Modeling with Spline Functions Methodology and Theory This is page 1 Printer: Opaque this Statistical Modeling with Spline Functions Methodology and Theory Mark H. Hansen University of California at Los Angeles Jianhua Z. Huang University of Pennsylvania

More information

Chapter 6: Examples 6.A Introduction

Chapter 6: Examples 6.A Introduction Chapter 6: Examples 6.A Introduction In Chapter 4, several approaches to the dual model regression problem were described and Chapter 5 provided expressions enabling one to compute the MSE of the mean

More information

1D Regression. i.i.d. with mean 0. Univariate Linear Regression: fit by least squares. Minimize: to get. The set of all possible functions is...

1D Regression. i.i.d. with mean 0. Univariate Linear Regression: fit by least squares. Minimize: to get. The set of all possible functions is... 1D Regression i.i.d. with mean 0. Univariate Linear Regression: fit by least squares. Minimize: to get. The set of all possible functions is... 1 Non-linear problems What if the underlying function is

More information

Outline. Topic 16 - Other Remedies. Ridge Regression. Ridge Regression. Ridge Regression. Robust Regression. Regression Trees. Piecewise Linear Model

Outline. Topic 16 - Other Remedies. Ridge Regression. Ridge Regression. Ridge Regression. Robust Regression. Regression Trees. Piecewise Linear Model Topic 16 - Other Remedies Ridge Regression Robust Regression Regression Trees Outline - Fall 2013 Piecewise Linear Model Bootstrapping Topic 16 2 Ridge Regression Modification of least squares that addresses

More information

Sequential Estimation in Item Calibration with A Two-Stage Design

Sequential Estimation in Item Calibration with A Two-Stage Design Sequential Estimation in Item Calibration with A Two-Stage Design Yuan-chin Ivan Chang Institute of Statistical Science, Academia Sinica, Taipei, Taiwan In this paper we apply a two-stage sequential design

More information

R package mcll for Monte Carlo local likelihood estimation

R package mcll for Monte Carlo local likelihood estimation R package mcll for Monte Carlo local likelihood estimation Minjeong Jeon University of California, Berkeley Sophia Rabe-Hesketh University of California, Berkeley February 4, 2013 Cari Kaufman University

More information

Comparison of Methods for Analyzing and Interpreting Censored Exposure Data

Comparison of Methods for Analyzing and Interpreting Censored Exposure Data Comparison of Methods for Analyzing and Interpreting Censored Exposure Data Paul Hewett Ph.D. CIH Exposure Assessment Solutions, Inc. Gary H. Ganser Ph.D. West Virginia University Comparison of Methods

More information

Clustering Relational Data using the Infinite Relational Model

Clustering Relational Data using the Infinite Relational Model Clustering Relational Data using the Infinite Relational Model Ana Daglis Supervised by: Matthew Ludkin September 4, 2015 Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015

More information

Handling Data with Three Types of Missing Values:

Handling Data with Three Types of Missing Values: Handling Data with Three Types of Missing Values: A Simulation Study Jennifer Boyko Advisor: Ofer Harel Department of Statistics University of Connecticut Storrs, CT May 21, 2013 Jennifer Boyko Handling

More information

Nonparametric Approaches to Regression

Nonparametric Approaches to Regression Nonparametric Approaches to Regression In traditional nonparametric regression, we assume very little about the functional form of the mean response function. In particular, we assume the model where m(xi)

More information

Cumulative Distribution Function (CDF) Deconvolution

Cumulative Distribution Function (CDF) Deconvolution Cumulative Distribution Function (CDF) Deconvolution Thomas Kincaid September 20, 2013 Contents 1 Introduction 1 2 Preliminaries 1 3 Read the Simulated Variables Data File 2 4 Illustration of Extraneous

More information

BART STAT8810, Fall 2017

BART STAT8810, Fall 2017 BART STAT8810, Fall 2017 M.T. Pratola November 1, 2017 Today BART: Bayesian Additive Regression Trees BART: Bayesian Additive Regression Trees Additive model generalizes the single-tree regression model:

More information

The framework of the BCLA and its applications

The framework of the BCLA and its applications NOTE: Please cite the references as: Zhu, T.T., Dunkley, N., Behar, J., Clifton, D.A., and Clifford, G.D.: Fusing Continuous-Valued Medical Labels Using a Bayesian Model Annals of Biomedical Engineering,

More information

Semiparametric Mixed Effecs with Hierarchical DP Mixture

Semiparametric Mixed Effecs with Hierarchical DP Mixture Semiparametric Mixed Effecs with Hierarchical DP Mixture R topics documented: April 21, 2007 hdpm-package........................................ 1 hdpm............................................ 2 hdpmfitsetup........................................

More information

Learning the Structure of Deep, Sparse Graphical Models

Learning the Structure of Deep, Sparse Graphical Models Learning the Structure of Deep, Sparse Graphical Models Hanna M. Wallach University of Massachusetts Amherst wallach@cs.umass.edu Joint work with Ryan Prescott Adams & Zoubin Ghahramani Deep Belief Networks

More information

Goals of the Lecture. SOC6078 Advanced Statistics: 9. Generalized Additive Models. Limitations of the Multiple Nonparametric Models (2)

Goals of the Lecture. SOC6078 Advanced Statistics: 9. Generalized Additive Models. Limitations of the Multiple Nonparametric Models (2) SOC6078 Advanced Statistics: 9. Generalized Additive Models Robert Andersen Department of Sociology University of Toronto Goals of the Lecture Introduce Additive Models Explain how they extend from simple

More information

A Fast Clustering Algorithm with Application to Cosmology. Woncheol Jang

A Fast Clustering Algorithm with Application to Cosmology. Woncheol Jang A Fast Clustering Algorithm with Application to Cosmology Woncheol Jang May 5, 2004 Abstract We present a fast clustering algorithm for density contour clusters (Hartigan, 1975) that is a modified version

More information

Theoretical Concepts of Machine Learning

Theoretical Concepts of Machine Learning Theoretical Concepts of Machine Learning Part 2 Institute of Bioinformatics Johannes Kepler University, Linz, Austria Outline 1 Introduction 2 Generalization Error 3 Maximum Likelihood 4 Noise Models 5

More information

Stochastic Simulation: Algorithms and Analysis

Stochastic Simulation: Algorithms and Analysis Soren Asmussen Peter W. Glynn Stochastic Simulation: Algorithms and Analysis et Springer Contents Preface Notation v xii I What This Book Is About 1 1 An Illustrative Example: The Single-Server Queue 1

More information

Bayesian Modelling with JAGS and R

Bayesian Modelling with JAGS and R Bayesian Modelling with JAGS and R Martyn Plummer International Agency for Research on Cancer Rencontres R, 3 July 2012 CRAN Task View Bayesian Inference The CRAN Task View Bayesian Inference is maintained

More information

Article. Bayesian inference for finite population quantiles from unequal probability samples

Article. Bayesian inference for finite population quantiles from unequal probability samples Component of Statistics Canada Catalogue no. 1-001-X Business Survey Methods Division Article Bayesian inference for finite population quantiles from unequal probability samples by December 01 How to obtain

More information

Journal of the Royal Statistical Society. Series B (Statistical Methodology), Vol. 60, No. 2. (1998), pp

Journal of the Royal Statistical Society. Series B (Statistical Methodology), Vol. 60, No. 2. (1998), pp Automatic Bayesian Curve Fitting D. G. T. Denison; B. K. Mallick; A. F. M. Smith Journal of the Royal Statistical Society. Series B (Statistical Methodology), Vol. 60, No. 2. (1998), pp. 333-350. Stable

More information

Adaptive Metric Nearest Neighbor Classification

Adaptive Metric Nearest Neighbor Classification Adaptive Metric Nearest Neighbor Classification Carlotta Domeniconi Jing Peng Dimitrios Gunopulos Computer Science Department Computer Science Department Computer Science Department University of California

More information

Generalized Additive Models

Generalized Additive Models :p Texts in Statistical Science Generalized Additive Models An Introduction with R Simon N. Wood Contents Preface XV 1 Linear Models 1 1.1 A simple linear model 2 Simple least squares estimation 3 1.1.1

More information

Overview of various smoothers

Overview of various smoothers Chapter 2 Overview of various smoothers A scatter plot smoother is a tool for finding structure in a scatter plot: Figure 2.1: CD4 cell count since seroconversion for HIV infected men. CD4 counts vs Time

More information

ADAPTIVE METROPOLIS-HASTINGS SAMPLING, OR MONTE CARLO KERNEL ESTIMATION

ADAPTIVE METROPOLIS-HASTINGS SAMPLING, OR MONTE CARLO KERNEL ESTIMATION ADAPTIVE METROPOLIS-HASTINGS SAMPLING, OR MONTE CARLO KERNEL ESTIMATION CHRISTOPHER A. SIMS Abstract. A new algorithm for sampling from an arbitrary pdf. 1. Introduction Consider the standard problem of

More information

Clustering. Discover groups such that samples within a group are more similar to each other than samples across groups.

Clustering. Discover groups such that samples within a group are more similar to each other than samples across groups. Clustering 1 Clustering Discover groups such that samples within a group are more similar to each other than samples across groups. 2 Clustering Discover groups such that samples within a group are more

More information

MCMC Methods for data modeling

MCMC Methods for data modeling MCMC Methods for data modeling Kenneth Scerri Department of Automatic Control and Systems Engineering Introduction 1. Symposium on Data Modelling 2. Outline: a. Definition and uses of MCMC b. MCMC algorithms

More information

Bayesian Methods. David Rosenberg. April 11, New York University. David Rosenberg (New York University) DS-GA 1003 April 11, / 19

Bayesian Methods. David Rosenberg. April 11, New York University. David Rosenberg (New York University) DS-GA 1003 April 11, / 19 Bayesian Methods David Rosenberg New York University April 11, 2017 David Rosenberg (New York University) DS-GA 1003 April 11, 2017 1 / 19 Classical Statistics Classical Statistics David Rosenberg (New

More information

Supplementary Notes on Multiple Imputation. Stephen du Toit and Gerhard Mels Scientific Software International

Supplementary Notes on Multiple Imputation. Stephen du Toit and Gerhard Mels Scientific Software International Supplementary Notes on Multiple Imputation. Stephen du Toit and Gerhard Mels Scientific Software International Part A: Comparison with FIML in the case of normal data. Stephen du Toit Multivariate data

More information

Statistical Analysis Using Combined Data Sources: Discussion JPSM Distinguished Lecture University of Maryland

Statistical Analysis Using Combined Data Sources: Discussion JPSM Distinguished Lecture University of Maryland Statistical Analysis Using Combined Data Sources: Discussion 2011 JPSM Distinguished Lecture University of Maryland 1 1 University of Michigan School of Public Health April 2011 Complete (Ideal) vs. Observed

More information

Composite Likelihood Data Augmentation for Within-Network Statistical Relational Learning

Composite Likelihood Data Augmentation for Within-Network Statistical Relational Learning Composite Likelihood Data Augmentation for Within-Network Statistical Relational Learning Joseph J. Pfeiffer III 1 Jennifer Neville 1 Paul Bennett 2 1 Purdue University 2 Microsoft Research ICDM 2014,

More information

Variability in Annual Temperature Profiles

Variability in Annual Temperature Profiles Variability in Annual Temperature Profiles A Multivariate Spatial Analysis of Regional Climate Model Output Tamara Greasby, Stephan Sain Institute for Mathematics Applied to Geosciences, National Center

More information

Estimating Curves and Derivatives with Parametric Penalized Spline Smoothing

Estimating Curves and Derivatives with Parametric Penalized Spline Smoothing Estimating Curves and Derivatives with Parametric Penalized Spline Smoothing Jiguo Cao, Jing Cai Department of Statistics & Actuarial Science, Simon Fraser University, Burnaby, BC, Canada V5A 1S6 Liangliang

More information

DEPARTMENT OF STATISTICS

DEPARTMENT OF STATISTICS Department of Statistics 1 DEPARTMENT OF STATISTICS Office in Statistics Building, Room 102 (970) 491-5269 or (970) 491-6546 stat.colostate.edu (http://www.stat.colostate.edu) Don Estep, Department Chair

More information

Journal of Statistical Software

Journal of Statistical Software JSS Journal of Statistical Software October 2015, Volume 67, Issue 3. doi: 10.18637/jss.v067.i03 Threshold Value Estimation Using Adaptive Two-Stage Plans in R Shawn Mankad Cornell University George Michailidis

More information

CHAPTER 11 EXAMPLES: MISSING DATA MODELING AND BAYESIAN ANALYSIS

CHAPTER 11 EXAMPLES: MISSING DATA MODELING AND BAYESIAN ANALYSIS Examples: Missing Data Modeling And Bayesian Analysis CHAPTER 11 EXAMPLES: MISSING DATA MODELING AND BAYESIAN ANALYSIS Mplus provides estimation of models with missing data using both frequentist and Bayesian

More information

Chapter 7: Dual Modeling in the Presence of Constant Variance

Chapter 7: Dual Modeling in the Presence of Constant Variance Chapter 7: Dual Modeling in the Presence of Constant Variance 7.A Introduction An underlying premise of regression analysis is that a given response variable changes systematically and smoothly due to

More information

Computational Physics PHYS 420

Computational Physics PHYS 420 Computational Physics PHYS 420 Dr Richard H. Cyburt Assistant Professor of Physics My office: 402c in the Science Building My phone: (304) 384-6006 My email: rcyburt@concord.edu My webpage: www.concord.edu/rcyburt

More information

Package alphastable. June 15, 2018

Package alphastable. June 15, 2018 Title Inference for Stable Distribution Version 0.1.0 Package stable June 15, 2018 Author Mahdi Teimouri, Adel Mohammadpour, Saralees Nadarajah Maintainer Mahdi Teimouri Developed

More information

Discussion on Bayesian Model Selection and Parameter Estimation in Extragalactic Astronomy by Martin Weinberg

Discussion on Bayesian Model Selection and Parameter Estimation in Extragalactic Astronomy by Martin Weinberg Discussion on Bayesian Model Selection and Parameter Estimation in Extragalactic Astronomy by Martin Weinberg Phil Gregory Physics and Astronomy Univ. of British Columbia Introduction Martin Weinberg reported

More information

Estimating survival from Gray s flexible model. Outline. I. Introduction. I. Introduction. I. Introduction

Estimating survival from Gray s flexible model. Outline. I. Introduction. I. Introduction. I. Introduction Estimating survival from s flexible model Zdenek Valenta Department of Medical Informatics Institute of Computer Science Academy of Sciences of the Czech Republic I. Introduction Outline II. Semi parametric

More information

GAMs semi-parametric GLMs. Simon Wood Mathematical Sciences, University of Bath, U.K.

GAMs semi-parametric GLMs. Simon Wood Mathematical Sciences, University of Bath, U.K. GAMs semi-parametric GLMs Simon Wood Mathematical Sciences, University of Bath, U.K. Generalized linear models, GLM 1. A GLM models a univariate response, y i as g{e(y i )} = X i β where y i Exponential

More information