Estimation in the Time Domain

Size: px
Start display at page:

Download "Estimation in the Time Domain"

Transcription

1 Estimation in the Time Domain Al Nosedal University of Toronto March 9, 2016 Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

2 The R function ar.yw( ) An R function is available for solving the Yule-Walker equations, ar.yw( ). Pass any sequence of errors or residuals to the function and the function will solve a sequence of models, AR(1)... AR(m) and pick the best model using the criterion of minimizing AIC. It should be noted that the Yule-Walker estimates are not maximum likelihood estimates, so that AIC is only approximated. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

3 AR(3) Example # Simulating 2000 AR(3) errors; error<- arima.sim(n=2000, list(ar=c(1.4,-0.31,-0.126) ),sd=3 ); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

4 AR(3) Example # Simulating 2000 AR(3) errors; plot.ts(error,main="ar(3) errors, n=2000"); abline(0,0); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

5 AR(3) Example AR(3) errors, n=2000 error Time Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

6 ar.yw() z<-ar.yw(error); names(z); ## [1] "order" "ar" "var.pred" "x.mean" ## [5] "aic" "n.used" "order.max" "partiala ## [9] "resid" "method" "series" "frequenc ## [13] "call" "asy.var.coef" Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

7 ar.yw() z$aic[1:8]; ## ## ## ## Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

8 This is a common format for AIC values. With AIC (or BIC), the values themselves are unique only up to a common constant, and it is the minimum value and differences that are important. Therefore, all values are frequently reported as differences from the minimum value. Using AIC, the AR(3) model is identified as the best. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

9 ar.yw() z$order; ## [1] 3 # The order, m, of the model # chosen by AIC. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

10 ar.yw() z$ar; ## [1] # The estimated values of: # a_1, a_2,..., a_m for # the chosen order. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

11 ar.yw() z$var.pred; ## [1] # The estimate for # variance of w. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

12 In some cases, there is a desire to find ˆφ 1,..., ˆφ m and/ or σ 2 w for a different model. In that case, the command ar.yw(error, aic = FALSE, order.max=m) will ignore AIC and force the fitting of some specified order m. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

13 For each candidate model, AR(0) to AR(5), the command ar.yw(error, aic = FALSE, order.max =m) can be used to fit a specified order. Because information criteria assumes maximum likelihood estimates, a slightly different command can be used to get just those values ar.mle(error, aic = FALSE, order.max =m) (we use it to obtain ˆσ 2 w ). Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

14 BIC estimate A reasonable estimate would be BIC nln(ˆσ 2 w ) + mln(n), using ar.mle( ) to estimate σ 2 w. (ˆσ 2 w = variance estimate, n = sample size or number of observations in your series, m = specified order). Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

15 ar.yw() z.0<-ar.mle(error,aic=false,order.max=0); z.1<-ar.mle(error,aic=false,order.max=1); z.2<-ar.mle(error,aic=false,order.max=2); z.3<-ar.mle(error,aic=false,order.max=3); z.4<-ar.mle(error,aic=false,order.max=4); z.5<-ar.mle(error,aic=false,order.max=5); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

16 Model σw 2 BIC BIC Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

17 The choice of ar.yw( ) versus ar.mle( ) for estimation of σ 2 w was of no impact. However, ar.mle( ) may run slow or even fail to converge for large sets of data. The results from our table correctly identify the AR(3) model as the model that generated that data. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

18 A sequence of hypothesis tests Let ˆφ m m be the estimate of φ m for the model AR(m). It is known that ˆφ m m has a N(0, 1/n) distribution if the true model has order less than m. Therefore z n ˆφ m m is a standard Normal if the order is less than m and can be treated as a test statistic for H 0 : the order is less than m vs H a : the order is at least m. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

19 A sequence of tests for the order Model ˆφm m (mle) z p-value Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

20 It seems clear that the correct order is 3 from these hypothesis tests. Put more precisely, the data suggests that an order of at least 3 is required, but there is no evidence that an order higher than 3 is required. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

21 Hypothesis tests presented in a plot If the values ˆφ m m for m = 1, 2,... are plotted versus m, with lines at ±2/ n, the display becomes a visual representation of the above hypothesis tests. When ˆφ m m falls far outside the lines ±2/ n, there is evidence that the model is at least of order m; when the values begin to fall far inside the lines, the indications are that the order is no higher than the last large spike. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

22 The Partial Autocorrelation Plot pacf(error); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

23 The Partial Autocorrelation Plot Series error Partial ACF Lag Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

24 Example. The lake data This series {Y t, t = 1, 2,..., 98} has already been studied (see assignment 1). In this example we shall consider the problem of fitting an AR process directly to the data without first removing any trend component. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

25 Example. The lake data # Reading data; lake<-read.table(file="lake-huron.dat",header=false); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

26 Example. The lake data plot.ts(lake,xlab="t",ylab="level"); level t Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

27 Example. The lake data new.lake<-unlist(lake) - mean(unlist(lake)) # mean-corrected data Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

28 Example. The lake data acf(new.lake) Series new.lake ACF Lag Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

29 Example. The lake data pacf(new.lake) Series new.lake Partial ACF Lag Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

30 Example. The lake data mod1.yw<-ar.yw(new.lake); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

31 Example. The lake data mod1.yw$aic[1:11]; ## ## ## ## Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

32 Example. The lake data mod1.yw$order; ## [1] 2 Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

33 Example. The lake data mod1.yw$ar; ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

34 Example. The lake data mod1.yw$var.pred; ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

35 Example. The lake data mod1.yw$asy.var.coef; ## [,1] [,2] ## [1,] ## [2,] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

36 Example. The lake data 95% Confidence Bounds for ˆφ 1. mod1.yw$ar[1]-1.96*sqrt(mod1.yw$asy.var.coef[1,1]); ## [1] mod1.yw$ar[1]+1.96*sqrt(mod1.yw$asy.var.coef[1,1]) ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

37 Example. The lake data 95% Confidence Bounds for ˆφ 2. mod1.yw$ar[2]-1.96*sqrt(mod1.yw$asy.var.coef[2,2]); ## [1] mod1.yw$ar[2]+1.96*sqrt(mod1.yw$asy.var.coef[2,2]) ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

38 ar.yw() m.0<-ar.mle(new.lake,aic=false,order.max=0); m.1<-ar.mle(new.lake,aic=false,order.max=1); m.2<-ar.mle(new.lake,aic=false,order.max=2); m.3<-ar.mle(new.lake,aic=false,order.max=3); m.4<-ar.mle(new.lake,aic=false,order.max=4); m.5<-ar.mle(new.lake,aic=false,order.max=5); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

39 Model σw 2 BIC BIC Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

40 Example. Dow Jones # Reading data; dow<-read.table(file="dowj.dat",header=false) Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

41 Example. Dow Jones plot.ts(dow,xlab="t",ylab="dow Index") Dow Index t Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

42 Example. Dow Jones mode(dow); ## [1] "list" Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

43 Example. Dow Jones # Computing first differences; diff.dow<-diff(unlist(dow)); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

44 Example. Dow Jones plot.ts(diff.dow,xlab="t",ylab="diff(dow Index)"); abline(h=mean(diff.dow), lty=2, col="red"); diff(dow Index) Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

45 Example. Dow Jones acf(diff.dow); Series diff.dow ACF Lag Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

46 Example. Dow Jones pacf(diff.dow); Series diff.dow Partial ACF Lag Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

47 Example. Dow Jones mean(diff.dow); ## [1] new.diff<-diff.dow - mean(diff.dow); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

48 Example. Dow Jones mod2.yw<-ar.yw(new.diff); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

49 Example. Dow Jones mod2.yw$aic[1:8]; ## ## ## 6 7 ## Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

50 Example. Dow Jones mod2.yw$order; ## [1] 1 Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

51 Example. Dow Jones mod2.yw$ar; ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

52 Example. Dow Jones mod2.yw$var.pred; ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

53 Example. Dow Jones mod2.yw$asy.var.coef; ## [,1] ## [1,] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

54 Example. Dow Jones 95% Confidence Bounds for ˆφ 1. mod2.yw$ar[1]-1.96*sqrt(mod2.yw$asy.var.coef[1,1]); ## [1] mod2.yw$ar[1]+1.96*sqrt(mod2.yw$asy.var.coef[1,1]) ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

55 Installing a package install.packages("forecast", dependencies = TRUE) Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

56 Loading library library(forecast); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

57 Example. Dow Jones... again auto.arima(dow); ## Series: dow ## ARIMA(1,1,0) with drift ## ## Coefficients: ## ar1 drift ## ## s.e ## ## sigma^2 estimated as : log likelihood= ## AIC=75.38 AICc=75.71 BIC=82.41 Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

58 Example. Dow Jones... again 95% Confidence bounds for φ *(0.1058); ## [1] *(0.1058); ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

= 75. See Hamilton p. 111 on solving PACF given ACF.

= 75. See Hamilton p. 111 on solving PACF given ACF. Washington University Fall 2009 Department of Economics James Morley Economics 518B Homework #2 GDP and ARMA Models Due: Tuesday 9/15 As with the first homework assignment, try to conserve paper in presenting

More information

Intro to ARMA models. FISH 507 Applied Time Series Analysis. Mark Scheuerell 15 Jan 2019

Intro to ARMA models. FISH 507 Applied Time Series Analysis. Mark Scheuerell 15 Jan 2019 Intro to ARMA models FISH 507 Applied Time Series Analysis Mark Scheuerell 15 Jan 2019 Topics for today Review White noise Random walks Autoregressive (AR) models Moving average (MA) models Autoregressive

More information

SYS 6021 Linear Statistical Models

SYS 6021 Linear Statistical Models SYS 6021 Linear Statistical Models Project 2 Spam Filters Jinghe Zhang Summary The spambase data and time indexed counts of spams and hams are studied to develop accurate spam filters. Static models are

More information

Package tseriesentropy

Package tseriesentropy Package tseriesentropy April 15, 2017 Title Entropy Based Analysis and Tests for Time Series Date 2017-04-15 Version 0.6-0 Author Simone Giannerini Depends R (>= 2.14.0) Imports cubature, methods, parallel,

More information

Stat 500 lab notes c Philip M. Dixon, Week 10: Autocorrelated errors

Stat 500 lab notes c Philip M. Dixon, Week 10: Autocorrelated errors Week 10: Autocorrelated errors This week, I have done one possible analysis and provided lots of output for you to consider. Case study: predicting body fat Body fat is an important health measure, but

More information

The Time Series Forecasting System Charles Hallahan, Economic Research Service/USDA, Washington, DC

The Time Series Forecasting System Charles Hallahan, Economic Research Service/USDA, Washington, DC The Time Series Forecasting System Charles Hallahan, Economic Research Service/USDA, Washington, DC INTRODUCTION The Time Series Forecasting System (TSFS) is a component of SAS/ETS that provides a menu-based

More information

Model Diagnostic tests

Model Diagnostic tests Model Diagnostic tests 1. Multicollinearity a) Pairwise correlation test Quick/Group stats/ correlations b) VIF Step 1. Open the EViews workfile named Fish8.wk1. (FROM DATA FILES- TSIME) Step 2. Select

More information

Section 4.1: Time Series I. Jared S. Murray The University of Texas at Austin McCombs School of Business

Section 4.1: Time Series I. Jared S. Murray The University of Texas at Austin McCombs School of Business Section 4.1: Time Series I Jared S. Murray The University of Texas at Austin McCombs School of Business 1 Time Series Data and Dependence Time-series data are simply a collection of observations gathered

More information

Chapter 15: Forecasting

Chapter 15: Forecasting Chapter 15: Forecasting In this chapter: 1. Forecasting chicken consumption using OLS (UE 15.1, Equation 6.8, p. 501) 2. Forecasting chicken consumption using a generalized least squares (GLS) model estimated

More information

Stat 8053, Fall 2013: Additive Models

Stat 8053, Fall 2013: Additive Models Stat 853, Fall 213: Additive Models We will only use the package mgcv for fitting additive and later generalized additive models. The best reference is S. N. Wood (26), Generalized Additive Models, An

More information

Adaptive spline autoregression threshold method in forecasting Mitsubishi car sales volume at PT Srikandi Diamond Motors

Adaptive spline autoregression threshold method in forecasting Mitsubishi car sales volume at PT Srikandi Diamond Motors IOP Conference Series: Materials Science and Engineering PAPER OPEN ACCESS Adaptive spline autoregression threshold method in forecasting Mitsubishi car sales volume at PT Srikandi Diamond Motors To cite

More information

Lecture 13: Model selection and regularization

Lecture 13: Model selection and regularization Lecture 13: Model selection and regularization Reading: Sections 6.1-6.2.1 STATS 202: Data mining and analysis October 23, 2017 1 / 17 What do we know so far In linear regression, adding predictors always

More information

Aaron Daniel Chia Huang Licai Huang Medhavi Sikaria Signal Processing: Forecasting and Modeling

Aaron Daniel Chia Huang Licai Huang Medhavi Sikaria Signal Processing: Forecasting and Modeling Aaron Daniel Chia Huang Licai Huang Medhavi Sikaria Signal Processing: Forecasting and Modeling Abstract Forecasting future events and statistics is problematic because the data set is a stochastic, rather

More information

Package robets. March 6, Type Package

Package robets. March 6, Type Package Type Package Package robets March 6, 2018 Title Forecasting Time Series with Robust Exponential Smoothing Version 1.4 Date 2018-03-06 We provide an outlier robust alternative of the function ets() in the

More information

Problem set for Week 7 Linear models: Linear regression, multiple linear regression, ANOVA, ANCOVA

Problem set for Week 7 Linear models: Linear regression, multiple linear regression, ANOVA, ANCOVA ECL 290 Statistical Models in Ecology using R Problem set for Week 7 Linear models: Linear regression, multiple linear regression, ANOVA, ANCOVA Datasets in this problem set adapted from those provided

More information

Package freeknotsplines

Package freeknotsplines Version 1.0.1 Date 2018-05-17 Package freeknotsplines June 10, 2018 Title Algorithms for Implementing Free-Knot Splines Author , Philip Smith , Pierre Lecuyer

More information

GW - WINKS SDA Windows KwikStat Statistical Data Analysis for Time Series Programs. Getting Started Guide

GW - WINKS SDA Windows KwikStat Statistical Data Analysis for Time Series Programs. Getting Started Guide GW - WINKS SDA Windows KwikStat Statistical Data Analysis for Time Series Programs Getting Started Guide. An Overview of GW-WINKS Time Series Programs GW WINKS is an add-on component of the WINKS SDA Statistical

More information

Variable selection is intended to select the best subset of predictors. But why bother?

Variable selection is intended to select the best subset of predictors. But why bother? Chapter 10 Variable Selection Variable selection is intended to select the best subset of predictors. But why bother? 1. We want to explain the data in the simplest way redundant predictors should be removed.

More information

Resources for statistical assistance. Quantitative covariates and regression analysis. Methods for predicting continuous outcomes.

Resources for statistical assistance. Quantitative covariates and regression analysis. Methods for predicting continuous outcomes. Resources for statistical assistance Quantitative covariates and regression analysis Carolyn Taylor Applied Statistics and Data Science Group (ASDa) Department of Statistics, UBC January 24, 2017 Department

More information

CPSC 340: Machine Learning and Data Mining. Feature Selection Fall 2017

CPSC 340: Machine Learning and Data Mining. Feature Selection Fall 2017 CPSC 340: Machine Learning and Data Mining Feature Selection Fall 2017 Assignment 2: Admin 1 late day to hand in tonight, 2 for Wednesday, answers posted Thursday. Extra office hours Thursday at 4pm (ICICS

More information

Video Traffic Modeling Using Seasonal ARIMA Models

Video Traffic Modeling Using Seasonal ARIMA Models Video Traffic Modeling Using Seasonal ARIMA Models Abdel-Karim Al-Tamimi and Raj Jain Washington University in Saint Louis Saint Louis, MO 63130 Jain@cse.wustl.edu These slides are available on-line at:

More information

Annexes : Sorties SAS pour l'exercice 3. Code SAS. libname process 'G:\Enseignements\M2ISN-Series temp\sas\';

Annexes : Sorties SAS pour l'exercice 3. Code SAS. libname process 'G:\Enseignements\M2ISN-Series temp\sas\'; Annexes : Sorties SAS pour l'exercice 3 Code SAS libname process 'G:\Enseignements\M2ISN-Series temp\sas\'; /* Etape 1 - Création des données*/ proc iml; phi={1-1.583 0.667-0.083}; theta={1}; y=armasim(phi,

More information

Repeated Measures Part 4: Blood Flow data

Repeated Measures Part 4: Blood Flow data Repeated Measures Part 4: Blood Flow data /* bloodflow.sas */ options linesize=79 pagesize=100 noovp formdlim='_'; title 'Two within-subjecs factors: Blood flow data (NWK p. 1181)'; proc format; value

More information

SAS Econometrics and Time Series Analysis 1.1 for JMP

SAS Econometrics and Time Series Analysis 1.1 for JMP SAS Econometrics and Time Series Analysis 1.1 for JMP SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2011. SAS Econometrics and Time Series Analysis

More information

Differentiation of Cognitive Abilities across the Lifespan. Online Supplement. Elliot M. Tucker-Drob

Differentiation of Cognitive Abilities across the Lifespan. Online Supplement. Elliot M. Tucker-Drob 1 Differentiation of Cognitive Abilities across the Lifespan Online Supplement Elliot M. Tucker-Drob This online supplement reports the results of an alternative set of analyses performed on a single sample

More information

Module 29.1: nag tsa identify Time Series Analysis Identification. Contents

Module 29.1: nag tsa identify Time Series Analysis Identification. Contents Time Series Analysis Module Contents Module 29.1: nag tsa identify Time Series Analysis Identification nag tsa identify contains procedures for analysing time series data. Contents Introduction..............................................................

More information

Graphical Tools for Exploring and Analyzing Data From ARIMA Time Series Models

Graphical Tools for Exploring and Analyzing Data From ARIMA Time Series Models Statistics Preprints Statistics 3-29-1999 Graphical Tools for Exploring and Analyzing Data From ARIMA Time Series Models William Q. Meeker Iowa State University, wqmeeker@iastate.edu Follow this and additional

More information

Bootstrapping Methods

Bootstrapping Methods Bootstrapping Methods example of a Monte Carlo method these are one Monte Carlo statistical method some Bayesian statistical methods are Monte Carlo we can also simulate models using Monte Carlo methods

More information

[spa-temp.inf] Spatial-temporal information

[spa-temp.inf] Spatial-temporal information [spa-temp.inf] Spatial-temporal information VI Table of Contents for Spatial-temporal information I. Spatial-temporal information........................................... VI - 1 A. Cohort-survival method.........................................

More information

Univariate Extreme Value Analysis. 1 Block Maxima. Practice problems using the extremes ( 2.0 5) package. 1. Pearson Type III distribution

Univariate Extreme Value Analysis. 1 Block Maxima. Practice problems using the extremes ( 2.0 5) package. 1. Pearson Type III distribution Univariate Extreme Value Analysis Practice problems using the extremes ( 2.0 5) package. 1 Block Maxima 1. Pearson Type III distribution (a) Simulate 100 maxima from samples of size 1000 from the gamma

More information

SAS Econometrics and Time Series Analysis 2 for JMP. Third Edition

SAS Econometrics and Time Series Analysis 2 for JMP. Third Edition SAS Econometrics and Time Series Analysis 2 for JMP Third Edition The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2013. SAS Econometrics and Time Series Analysis 2

More information

Math 121. Graphing Rational Functions Fall 2016

Math 121. Graphing Rational Functions Fall 2016 Math 121. Graphing Rational Functions Fall 2016 1. Let x2 85 x 2 70. (a) State the domain of f, and simplify f if possible. (b) Find equations for the vertical asymptotes for the graph of f. (c) For each

More information

Practical 4: Mixed effect models

Practical 4: Mixed effect models Practical 4: Mixed effect models This practical is about how to fit (generalised) linear mixed effects models using the lme4 package. You may need to install it first (using either the install.packages

More information

MPhil computer package lesson: getting started with Eviews

MPhil computer package lesson: getting started with Eviews MPhil computer package lesson: getting started with Eviews Ryoko Ito (ri239@cam.ac.uk, itoryoko@gmail.com, www.itoryoko.com ) 1. Creating an Eviews workfile 1.1. Download Wage data.xlsx from my homepage:

More information

On Choosing the Right Coordinate Transformation Method

On Choosing the Right Coordinate Transformation Method On Choosing the Right Coordinate Transformation Method Yaron A. Felus 1 and Moshe Felus 1 The Survey of Israel, Tel-Aviv Surveying Engineering Department, MI Halperin Felus Surveying and Photogrammetry

More information

ST440/540: Applied Bayesian Analysis. (5) Multi-parameter models - Initial values and convergence diagn

ST440/540: Applied Bayesian Analysis. (5) Multi-parameter models - Initial values and convergence diagn (5) Multi-parameter models - Initial values and convergence diagnostics Tuning the MCMC algoritm MCMC is beautiful because it can handle virtually any statistical model and it is usually pretty easy to

More information

Model selection Outline for today

Model selection Outline for today Model selection Outline for today The problem of model selection Choose among models by a criterion rather than significance testing Criteria: Mallow s C p and AIC Search strategies: All subsets; stepaic

More information

2016 Stat-Ease, Inc. & CAMO Software

2016 Stat-Ease, Inc. & CAMO Software Multivariate Analysis and Design of Experiments in practice using The Unscrambler X Frank Westad CAMO Software fw@camo.com Pat Whitcomb Stat-Ease pat@statease.com Agenda Goal: Part 1: Part 2: Show how

More information

RSM Split-Plot Designs & Diagnostics Solve Real-World Problems

RSM Split-Plot Designs & Diagnostics Solve Real-World Problems RSM Split-Plot Designs & Diagnostics Solve Real-World Problems Shari Kraber Pat Whitcomb Martin Bezener Stat-Ease, Inc. Stat-Ease, Inc. Stat-Ease, Inc. 221 E. Hennepin Ave. 221 E. Hennepin Ave. 221 E.

More information

One Factor Experiments

One Factor Experiments One Factor Experiments 20-1 Overview Computation of Effects Estimating Experimental Errors Allocation of Variation ANOVA Table and F-Test Visual Diagnostic Tests Confidence Intervals For Effects Unequal

More information

Stat 5100 Handout #12.f SAS: Time Series Case Study (Unit 7)

Stat 5100 Handout #12.f SAS: Time Series Case Study (Unit 7) Stat 5100 Handout #12.f SAS: Time Series Case Study (Unit 7) Data: Weekly sales (in thousands of units) of Super Tech Videocassette Tapes over 161 weeks [see Bowerman & O Connell Forecasting and Time Series:

More information

Serial Correlation and Heteroscedasticity in Time series Regressions. Econometric (EC3090) - Week 11 Agustín Bénétrix

Serial Correlation and Heteroscedasticity in Time series Regressions. Econometric (EC3090) - Week 11 Agustín Bénétrix Serial Correlation and Heteroscedasticity in Time series Regressions Econometric (EC3090) - Week 11 Agustín Bénétrix 1 Properties of OLS with serially correlated errors OLS still unbiased and consistent

More information

1 Simple Linear Regression

1 Simple Linear Regression Math 158 Jo Hardin R code 1 Simple Linear Regression Consider a dataset from ISLR on credit scores. Because we don t know the sampling mechanism used to collect the data, we are unable to generalize the

More information

Zero-Inflated Poisson Regression

Zero-Inflated Poisson Regression Chapter 329 Zero-Inflated Poisson Regression Introduction The zero-inflated Poisson (ZIP) regression is used for count data that exhibit overdispersion and excess zeros. The data distribution combines

More information

Modelling and simulation of seismic reflectivity

Modelling and simulation of seismic reflectivity Modelling reflectivity Modelling and simulation of seismic reflectivity Rita Aggarwala, Michael P. Lamoureux, and Gary F. Margrave ABSTRACT We decompose the reflectivity series obtained from a seismic

More information

Make Forecasts with Data exported from Google Analytics

Make Forecasts with Data exported from Google Analytics Beyond Google Analytics. Using C# to integrate Google Analytics with R statistical server Juan Antonio Breña Moral Abstract Commercial Web sites are designed, planned and launched in many cases by Marketing

More information

STA121: Applied Regression Analysis

STA121: Applied Regression Analysis STA121: Applied Regression Analysis Variable Selection - Chapters 8 in Dielman Artin Department of Statistical Science October 23, 2009 Outline Introduction 1 Introduction 2 3 4 Variable Selection Model

More information

Table Of Contents. Table Of Contents

Table Of Contents. Table Of Contents Statistics Table Of Contents Table Of Contents Basic Statistics... 7 Basic Statistics Overview... 7 Descriptive Statistics Available for Display or Storage... 8 Display Descriptive Statistics... 9 Store

More information

The Truth behind PGA Tour Player Scores

The Truth behind PGA Tour Player Scores The Truth behind PGA Tour Player Scores Sukhyun Sean Park, Dong Kyun Kim, Ilsung Lee May 7, 2016 Abstract The main aim of this project is to analyze the variation in a dataset that is obtained from the

More information

Cross-validation and the Bootstrap

Cross-validation and the Bootstrap Cross-validation and the Bootstrap In the section we discuss two resampling methods: cross-validation and the bootstrap. These methods refit a model of interest to samples formed from the training set,

More information

Generalized Least Squares (GLS) and Estimated Generalized Least Squares (EGLS)

Generalized Least Squares (GLS) and Estimated Generalized Least Squares (EGLS) Generalized Least Squares (GLS) and Estimated Generalized Least Squares (EGLS) Linear Model in matrix notation for the population Y = Xβ + Var ( ) = In GLS, the error covariance matrix is known In EGLS

More information

Computation for the Introduction to MCMC Chapter of Handbook of Markov Chain Monte Carlo By Charles J. Geyer Technical Report No.

Computation for the Introduction to MCMC Chapter of Handbook of Markov Chain Monte Carlo By Charles J. Geyer Technical Report No. Computation for the Introduction to MCMC Chapter of Handbook of Markov Chain Monte Carlo By Charles J. Geyer Technical Report No. 679 School of Statistics University of Minnesota July 29, 2010 Abstract

More information

Introduction to mixed-effects regression for (psycho)linguists

Introduction to mixed-effects regression for (psycho)linguists Introduction to mixed-effects regression for (psycho)linguists Martijn Wieling Department of Humanities Computing, University of Groningen Groningen, April 21, 2015 1 Martijn Wieling Introduction to mixed-effects

More information

The goal of this handout is to allow you to install R on a Windows-based PC and to deal with some of the issues that can (will) come up.

The goal of this handout is to allow you to install R on a Windows-based PC and to deal with some of the issues that can (will) come up. Fall 2010 Handout on Using R Page: 1 The goal of this handout is to allow you to install R on a Windows-based PC and to deal with some of the issues that can (will) come up. 1. Installing R First off,

More information

Pattern Recognition. Kjell Elenius. Speech, Music and Hearing KTH. March 29, 2007 Speech recognition

Pattern Recognition. Kjell Elenius. Speech, Music and Hearing KTH. March 29, 2007 Speech recognition Pattern Recognition Kjell Elenius Speech, Music and Hearing KTH March 29, 2007 Speech recognition 2007 1 Ch 4. Pattern Recognition 1(3) Bayes Decision Theory Minimum-Error-Rate Decision Rules Discriminant

More information

Generalized least squares (GLS) estimates of the level-2 coefficients,

Generalized least squares (GLS) estimates of the level-2 coefficients, Contents 1 Conceptual and Statistical Background for Two-Level Models...7 1.1 The general two-level model... 7 1.1.1 Level-1 model... 8 1.1.2 Level-2 model... 8 1.2 Parameter estimation... 9 1.3 Empirical

More information

National College of Ireland. Project Submission Sheet 2015/2016. School of Computing

National College of Ireland. Project Submission Sheet 2015/2016. School of Computing National College of Ireland Project Submission Sheet 2015/2016 School of Computing Student Name: Anicia Lafayette-Madden Student ID: 15006590 Programme: M.Sc Data Analytics Year: 2015-2016 Module: Configuration

More information

# R script for Applied Time Series Analysis for Ecologists # Day 1 # 24 March # written by Mark Scheuerell

# R script for Applied Time Series Analysis for Ecologists # Day 1 # 24 March # written by Mark Scheuerell # R script for Applied Time Series Analysis for Ecologists # Day 1 # 24 March 2014 # written by Mark Scheuerell #------------------------- # load required libraries #------------------------- require("tseries")

More information

Bayesian Statistics Group 8th March Slice samplers. (A very brief introduction) The basic idea

Bayesian Statistics Group 8th March Slice samplers. (A very brief introduction) The basic idea Bayesian Statistics Group 8th March 2000 Slice samplers (A very brief introduction) The basic idea lacements To sample from a distribution, simply sample uniformly from the region under the density function

More information

Markov Chain Monte Carlo (part 1)

Markov Chain Monte Carlo (part 1) Markov Chain Monte Carlo (part 1) Edps 590BAY Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Spring 2018 Depending on the book that you select for

More information

Package FWDselect. December 19, 2015

Package FWDselect. December 19, 2015 Title Selecting Variables in Regression Models Version 2.1.0 Date 2015-12-18 Author Marta Sestelo [aut, cre], Nora M. Villanueva [aut], Javier Roca-Pardinas [aut] Maintainer Marta Sestelo

More information

Statistical Consulting Topics Using cross-validation for model selection. Cross-validation is a technique that can be used for model evaluation.

Statistical Consulting Topics Using cross-validation for model selection. Cross-validation is a technique that can be used for model evaluation. Statistical Consulting Topics Using cross-validation for model selection Cross-validation is a technique that can be used for model evaluation. We often fit a model to a full data set and then perform

More information

Variogram Inversion and Uncertainty Using Dynamic Data. Simultaneouos Inversion with Variogram Updating

Variogram Inversion and Uncertainty Using Dynamic Data. Simultaneouos Inversion with Variogram Updating Variogram Inversion and Uncertainty Using Dynamic Data Z. A. Reza (zreza@ualberta.ca) and C. V. Deutsch (cdeutsch@civil.ualberta.ca) Department of Civil & Environmental Engineering, University of Alberta

More information

22s:152 Applied Linear Regression

22s:152 Applied Linear Regression 22s:152 Applied Linear Regression Chapter 22: Model Selection In model selection, the idea is to find the smallest set of variables which provides an adequate description of the data. We will consider

More information

Overview. Monte Carlo Methods. Statistics & Bayesian Inference Lecture 3. Situation At End Of Last Week

Overview. Monte Carlo Methods. Statistics & Bayesian Inference Lecture 3. Situation At End Of Last Week Statistics & Bayesian Inference Lecture 3 Joe Zuntz Overview Overview & Motivation Metropolis Hastings Monte Carlo Methods Importance sampling Direct sampling Gibbs sampling Monte-Carlo Markov Chains Emcee

More information

Standard Errors in OLS Luke Sonnet

Standard Errors in OLS Luke Sonnet Standard Errors in OLS Luke Sonnet Contents Variance-Covariance of ˆβ 1 Standard Estimation (Spherical Errors) 2 Robust Estimation (Heteroskedasticity Constistent Errors) 4 Cluster Robust Estimation 7

More information

22s:152 Applied Linear Regression

22s:152 Applied Linear Regression 22s:152 Applied Linear Regression Chapter 22: Model Selection In model selection, the idea is to find the smallest set of variables which provides an adequate description of the data. We will consider

More information

Package mcmcse. February 15, 2013

Package mcmcse. February 15, 2013 Package mcmcse February 15, 2013 Version 1.0-1 Date 2012 Title Monte Carlo Standard Errors for MCMC Author James M. Flegal and John Hughes Maintainer James M. Flegal

More information

A New Measure of the Cluster Hypothesis

A New Measure of the Cluster Hypothesis A New Measure of the Cluster Hypothesis Mark D. Smucker 1 and James Allan 2 1 Department of Management Sciences University of Waterloo 2 Center for Intelligent Information Retrieval Department of Computer

More information

GEOP 505/MATH 587 Fall 02 Homework 6

GEOP 505/MATH 587 Fall 02 Homework 6 GEOP 55/MATH 587 Fall 2 Homework 6 In grading these homeworks, I found that the major problem that students had was a misunderstanding of the difference between the original time series z, the differenced

More information

Chronology development, statistical analysis

Chronology development, statistical analysis A R S T A N Guide for computer program ARSTAN, by Edward R. Cook and Richard L. Holmes Adapted from Users Manual for Program ARSTAN, in Tree-Ring Chronologies of Western North America: California, eastern

More information

Package MARX. June 16, 2017

Package MARX. June 16, 2017 Package MARX June 16, 2017 Title Simulation, Estimation and Selection of MARX Models Version 0.1 Date 2017-06-16 Author [aut, cre, cph], Alain Hecq [ctb], Lenard Lieb [ctb] Maintainer

More information

Non-linear Modelling Solutions to Exercises

Non-linear Modelling Solutions to Exercises Non-linear Modelling Solutions to Exercises April 13, 2017 Contents 1 Non-linear Modelling 2 1.6 Exercises.................................................... 2 1.6.1 Chick Weight.............................................

More information

Lab Session 1. Introduction to Eviews

Lab Session 1. Introduction to Eviews Albert-Ludwigs University Freiburg Department of Empirical Economics Time Series Analysis, Summer 2009 Dr. Sevtap Kestel To see the data of m1: 1 Lab Session 1 Introduction to Eviews We introduce the basic

More information

Binary Diagnostic Tests Clustered Samples

Binary Diagnostic Tests Clustered Samples Chapter 538 Binary Diagnostic Tests Clustered Samples Introduction A cluster randomization trial occurs when whole groups or clusters of individuals are treated together. In the twogroup case, each cluster

More information

NOTE: Any use of trade, product or firm names is for descriptive purposes only and does not imply endorsement by the U.S. Government.

NOTE: Any use of trade, product or firm names is for descriptive purposes only and does not imply endorsement by the U.S. Government. U.S. Geological Survey (USGS) MMA(1) NOTE: Any use of trade, product or firm names is for descriptive purposes only and does not imply endorsement by the U.S. Government. NAME MMA, A Computer Code for

More information

Sistemática Teórica. Hernán Dopazo. Biomedical Genomics and Evolution Lab. Lesson 03 Statistical Model Selection

Sistemática Teórica. Hernán Dopazo. Biomedical Genomics and Evolution Lab. Lesson 03 Statistical Model Selection Sistemática Teórica Hernán Dopazo Biomedical Genomics and Evolution Lab Lesson 03 Statistical Model Selection Facultad de Ciencias Exactas y Naturales Universidad de Buenos Aires Argentina 2013 Statistical

More information

Bayesian Networks Inference (continued) Learning

Bayesian Networks Inference (continued) Learning Learning BN tutorial: ftp://ftp.research.microsoft.com/pub/tr/tr-95-06.pdf TAN paper: http://www.cs.huji.ac.il/~nir/abstracts/frgg1.html Bayesian Networks Inference (continued) Learning Machine Learning

More information

MINI-PAPER A Gentle Introduction to the Analysis of Sequential Data

MINI-PAPER A Gentle Introduction to the Analysis of Sequential Data MINI-PAPER by Rong Pan, Ph.D., Assistant Professor of Industrial Engineering, Arizona State University We, applied statisticians and manufacturing engineers, often need to deal with sequential data, which

More information

False Analog Data Injection Attack Towards Topology Errors: Formulation and Feasibility Analysis

False Analog Data Injection Attack Towards Topology Errors: Formulation and Feasibility Analysis False Analog Data Injection Attack Towards Topology Errors: Formulation and Feasibility Analysis Yuqi Zhou, Jorge Cisneros-Saldana, Le Xie Department of Electrical and Computer Engineering Texas A&M University

More information

CPSC 340: Machine Learning and Data Mining

CPSC 340: Machine Learning and Data Mining CPSC 340: Machine Learning and Data Mining Feature Selection Original version of these slides by Mark Schmidt, with modifications by Mike Gelbart. Admin Assignment 3: Due Friday Midterm: Feb 14 in class

More information

CS490D: Introduction to Data Mining Prof. Chris Clifton

CS490D: Introduction to Data Mining Prof. Chris Clifton CS490D: Introduction to Data Mining Prof. Chris Clifton April 5, 2004 Mining of Time Series Data Time-series database Mining Time-Series and Sequence Data Consists of sequences of values or events changing

More information

Discussion Notes 3 Stepwise Regression and Model Selection

Discussion Notes 3 Stepwise Regression and Model Selection Discussion Notes 3 Stepwise Regression and Model Selection Stepwise Regression There are many different commands for doing stepwise regression. Here we introduce the command step. There are many arguments

More information

Chapter 15 Mixed Models. Chapter Table of Contents. Introduction Split Plot Experiment Clustered Data References...

Chapter 15 Mixed Models. Chapter Table of Contents. Introduction Split Plot Experiment Clustered Data References... Chapter 15 Mixed Models Chapter Table of Contents Introduction...309 Split Plot Experiment...311 Clustered Data...320 References...326 308 Chapter 15. Mixed Models Chapter 15 Mixed Models Introduction

More information

Normal Plot of the Effects (response is Mean free height, Alpha = 0.05)

Normal Plot of the Effects (response is Mean free height, Alpha = 0.05) Percent Normal Plot of the Effects (response is Mean free height, lpha = 5) 99 95 Effect Type Not Significant Significant 90 80 70 60 50 40 30 20 E F actor C D E Name C D E 0 5 E -0.3-0. Effect 0. 0.3

More information

On the Test and Estimation of Fractional Parameter. in ARFIMA Model: Bootstrap Approach

On the Test and Estimation of Fractional Parameter. in ARFIMA Model: Bootstrap Approach Applied Mathematical Sciences, Vol. 8, 2014, no. 96, 4783-4792 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ams.2014.46498 On the Test and Estimation of Fractional Parameter in ARFIMA Model:

More information

The Perils of Unfettered In-Sample Backtesting

The Perils of Unfettered In-Sample Backtesting The Perils of Unfettered In-Sample Backtesting Tyler Yeats June 8, 2015 Abstract When testing a financial investment strategy, it is common to use what is known as a backtest, or a simulation of how well

More information

RocLab 1.0. Rock Mass Strength Analysis using the Generalized Hoek-Brown failure criterion

RocLab 1.0. Rock Mass Strength Analysis using the Generalized Hoek-Brown failure criterion RocLab 1.0 Rock Mass Strength Analysis using the Generalized Hoek-Brown failure criterion RocLab is a software program for determining rock mass strength parameters, based on the latest version of the

More information

SOCY7706: Longitudinal Data Analysis Instructor: Natasha Sarkisian. Panel Data Analysis: Fixed Effects Models

SOCY7706: Longitudinal Data Analysis Instructor: Natasha Sarkisian. Panel Data Analysis: Fixed Effects Models SOCY776: Longitudinal Data Analysis Instructor: Natasha Sarkisian Panel Data Analysis: Fixed Effects Models Fixed effects models are similar to the first difference model we considered for two wave data

More information

Small Libraries of Protein Fragments through Clustering

Small Libraries of Protein Fragments through Clustering Small Libraries of Protein Fragments through Clustering Varun Ganapathi Department of Computer Science Stanford University June 8, 2005 Abstract When trying to extract information from the information

More information

Polynomial Curve Fitting of Execution Time of Binary Search in Worst Case in Personal Computer

Polynomial Curve Fitting of Execution Time of Binary Search in Worst Case in Personal Computer Polynomial Curve Fitting of Execution Time of Binary Search in Worst Case in Personal Computer Dipankar Das Assistant Professor, The Heritage Academy, Kolkata, India Abstract Curve fitting is a well known

More information

A Sequential ESM Track Association Algorithm Based on the Use of Information Theoretic Criteria

A Sequential ESM Track Association Algorithm Based on the Use of Information Theoretic Criteria A Sequential ESM Track Association Algorithm Based on the Use of Information Theoretic Criteria Yifeng Zhou and Jim Mickeal Radar Electronic Warfare Section Defence R&D Canada - Ottawa 371 Carling Avenue,

More information

Package batchmeans. R topics documented: July 4, Version Date

Package batchmeans. R topics documented: July 4, Version Date Package batchmeans July 4, 2016 Version 1.0-3 Date 2016-07-03 Title Consistent Batch Means Estimation of Monte Carlo Standard Errors Author Murali Haran and John Hughes

More information

Descriptive Statistics, Standard Deviation and Standard Error

Descriptive Statistics, Standard Deviation and Standard Error AP Biology Calculations: Descriptive Statistics, Standard Deviation and Standard Error SBI4UP The Scientific Method & Experimental Design Scientific method is used to explore observations and answer questions.

More information

**************************************************************************************************************** * Single Wave Analyses.

**************************************************************************************************************** * Single Wave Analyses. ASDA2 ANALYSIS EXAMPLE REPLICATION SPSS C11 * Syntax for Analysis Example Replication C11 * Use data sets previously prepared in SAS, refer to SAS Analysis Example Replication for C11 for details. ****************************************************************************************************************

More information

Annotated multitree output

Annotated multitree output Annotated multitree output A simplified version of the two high-threshold (2HT) model, applied to two experimental conditions, is used as an example to illustrate the output provided by multitree (version

More information

Clustering. Mihaela van der Schaar. January 27, Department of Engineering Science University of Oxford

Clustering. Mihaela van der Schaar. January 27, Department of Engineering Science University of Oxford Department of Engineering Science University of Oxford January 27, 2017 Many datasets consist of multiple heterogeneous subsets. Cluster analysis: Given an unlabelled data, want algorithms that automatically

More information

COS Lecture 13 Autonomous Robot Navigation

COS Lecture 13 Autonomous Robot Navigation COS 495 - Lecture 13 Autonomous Robot Navigation Instructor: Chris Clark Semester: Fall 2011 1 Figures courtesy of Siegwart & Nourbakhsh Control Structure Prior Knowledge Operator Commands Localization

More information

1. Estimation equations for strip transect sampling, using notation consistent with that used to

1. Estimation equations for strip transect sampling, using notation consistent with that used to Web-based Supplementary Materials for Line Transect Methods for Plant Surveys by S.T. Buckland, D.L. Borchers, A. Johnston, P.A. Henrys and T.A. Marques Web Appendix A. Introduction In this on-line appendix,

More information

Chapter 6 Continued: Partitioning Methods

Chapter 6 Continued: Partitioning Methods Chapter 6 Continued: Partitioning Methods Partitioning methods fix the number of clusters k and seek the best possible partition for that k. The goal is to choose the partition which gives the optimal

More information