Estimation in the Time Domain
|
|
- Clementine Dorsey
- 5 years ago
- Views:
Transcription
1 Estimation in the Time Domain Al Nosedal University of Toronto March 9, 2016 Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
2 The R function ar.yw( ) An R function is available for solving the Yule-Walker equations, ar.yw( ). Pass any sequence of errors or residuals to the function and the function will solve a sequence of models, AR(1)... AR(m) and pick the best model using the criterion of minimizing AIC. It should be noted that the Yule-Walker estimates are not maximum likelihood estimates, so that AIC is only approximated. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
3 AR(3) Example # Simulating 2000 AR(3) errors; error<- arima.sim(n=2000, list(ar=c(1.4,-0.31,-0.126) ),sd=3 ); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
4 AR(3) Example # Simulating 2000 AR(3) errors; plot.ts(error,main="ar(3) errors, n=2000"); abline(0,0); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
5 AR(3) Example AR(3) errors, n=2000 error Time Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
6 ar.yw() z<-ar.yw(error); names(z); ## [1] "order" "ar" "var.pred" "x.mean" ## [5] "aic" "n.used" "order.max" "partiala ## [9] "resid" "method" "series" "frequenc ## [13] "call" "asy.var.coef" Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
7 ar.yw() z$aic[1:8]; ## ## ## ## Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
8 This is a common format for AIC values. With AIC (or BIC), the values themselves are unique only up to a common constant, and it is the minimum value and differences that are important. Therefore, all values are frequently reported as differences from the minimum value. Using AIC, the AR(3) model is identified as the best. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
9 ar.yw() z$order; ## [1] 3 # The order, m, of the model # chosen by AIC. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
10 ar.yw() z$ar; ## [1] # The estimated values of: # a_1, a_2,..., a_m for # the chosen order. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
11 ar.yw() z$var.pred; ## [1] # The estimate for # variance of w. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
12 In some cases, there is a desire to find ˆφ 1,..., ˆφ m and/ or σ 2 w for a different model. In that case, the command ar.yw(error, aic = FALSE, order.max=m) will ignore AIC and force the fitting of some specified order m. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
13 For each candidate model, AR(0) to AR(5), the command ar.yw(error, aic = FALSE, order.max =m) can be used to fit a specified order. Because information criteria assumes maximum likelihood estimates, a slightly different command can be used to get just those values ar.mle(error, aic = FALSE, order.max =m) (we use it to obtain ˆσ 2 w ). Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
14 BIC estimate A reasonable estimate would be BIC nln(ˆσ 2 w ) + mln(n), using ar.mle( ) to estimate σ 2 w. (ˆσ 2 w = variance estimate, n = sample size or number of observations in your series, m = specified order). Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
15 ar.yw() z.0<-ar.mle(error,aic=false,order.max=0); z.1<-ar.mle(error,aic=false,order.max=1); z.2<-ar.mle(error,aic=false,order.max=2); z.3<-ar.mle(error,aic=false,order.max=3); z.4<-ar.mle(error,aic=false,order.max=4); z.5<-ar.mle(error,aic=false,order.max=5); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
16 Model σw 2 BIC BIC Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
17 The choice of ar.yw( ) versus ar.mle( ) for estimation of σ 2 w was of no impact. However, ar.mle( ) may run slow or even fail to converge for large sets of data. The results from our table correctly identify the AR(3) model as the model that generated that data. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
18 A sequence of hypothesis tests Let ˆφ m m be the estimate of φ m for the model AR(m). It is known that ˆφ m m has a N(0, 1/n) distribution if the true model has order less than m. Therefore z n ˆφ m m is a standard Normal if the order is less than m and can be treated as a test statistic for H 0 : the order is less than m vs H a : the order is at least m. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
19 A sequence of tests for the order Model ˆφm m (mle) z p-value Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
20 It seems clear that the correct order is 3 from these hypothesis tests. Put more precisely, the data suggests that an order of at least 3 is required, but there is no evidence that an order higher than 3 is required. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
21 Hypothesis tests presented in a plot If the values ˆφ m m for m = 1, 2,... are plotted versus m, with lines at ±2/ n, the display becomes a visual representation of the above hypothesis tests. When ˆφ m m falls far outside the lines ±2/ n, there is evidence that the model is at least of order m; when the values begin to fall far inside the lines, the indications are that the order is no higher than the last large spike. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
22 The Partial Autocorrelation Plot pacf(error); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
23 The Partial Autocorrelation Plot Series error Partial ACF Lag Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
24 Example. The lake data This series {Y t, t = 1, 2,..., 98} has already been studied (see assignment 1). In this example we shall consider the problem of fitting an AR process directly to the data without first removing any trend component. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
25 Example. The lake data # Reading data; lake<-read.table(file="lake-huron.dat",header=false); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
26 Example. The lake data plot.ts(lake,xlab="t",ylab="level"); level t Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
27 Example. The lake data new.lake<-unlist(lake) - mean(unlist(lake)) # mean-corrected data Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
28 Example. The lake data acf(new.lake) Series new.lake ACF Lag Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
29 Example. The lake data pacf(new.lake) Series new.lake Partial ACF Lag Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
30 Example. The lake data mod1.yw<-ar.yw(new.lake); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
31 Example. The lake data mod1.yw$aic[1:11]; ## ## ## ## Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
32 Example. The lake data mod1.yw$order; ## [1] 2 Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
33 Example. The lake data mod1.yw$ar; ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
34 Example. The lake data mod1.yw$var.pred; ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
35 Example. The lake data mod1.yw$asy.var.coef; ## [,1] [,2] ## [1,] ## [2,] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
36 Example. The lake data 95% Confidence Bounds for ˆφ 1. mod1.yw$ar[1]-1.96*sqrt(mod1.yw$asy.var.coef[1,1]); ## [1] mod1.yw$ar[1]+1.96*sqrt(mod1.yw$asy.var.coef[1,1]) ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
37 Example. The lake data 95% Confidence Bounds for ˆφ 2. mod1.yw$ar[2]-1.96*sqrt(mod1.yw$asy.var.coef[2,2]); ## [1] mod1.yw$ar[2]+1.96*sqrt(mod1.yw$asy.var.coef[2,2]) ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
38 ar.yw() m.0<-ar.mle(new.lake,aic=false,order.max=0); m.1<-ar.mle(new.lake,aic=false,order.max=1); m.2<-ar.mle(new.lake,aic=false,order.max=2); m.3<-ar.mle(new.lake,aic=false,order.max=3); m.4<-ar.mle(new.lake,aic=false,order.max=4); m.5<-ar.mle(new.lake,aic=false,order.max=5); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
39 Model σw 2 BIC BIC Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
40 Example. Dow Jones # Reading data; dow<-read.table(file="dowj.dat",header=false) Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
41 Example. Dow Jones plot.ts(dow,xlab="t",ylab="dow Index") Dow Index t Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
42 Example. Dow Jones mode(dow); ## [1] "list" Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
43 Example. Dow Jones # Computing first differences; diff.dow<-diff(unlist(dow)); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
44 Example. Dow Jones plot.ts(diff.dow,xlab="t",ylab="diff(dow Index)"); abline(h=mean(diff.dow), lty=2, col="red"); diff(dow Index) Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
45 Example. Dow Jones acf(diff.dow); Series diff.dow ACF Lag Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
46 Example. Dow Jones pacf(diff.dow); Series diff.dow Partial ACF Lag Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
47 Example. Dow Jones mean(diff.dow); ## [1] new.diff<-diff.dow - mean(diff.dow); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
48 Example. Dow Jones mod2.yw<-ar.yw(new.diff); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
49 Example. Dow Jones mod2.yw$aic[1:8]; ## ## ## 6 7 ## Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
50 Example. Dow Jones mod2.yw$order; ## [1] 1 Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
51 Example. Dow Jones mod2.yw$ar; ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
52 Example. Dow Jones mod2.yw$var.pred; ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
53 Example. Dow Jones mod2.yw$asy.var.coef; ## [,1] ## [1,] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
54 Example. Dow Jones 95% Confidence Bounds for ˆφ 1. mod2.yw$ar[1]-1.96*sqrt(mod2.yw$asy.var.coef[1,1]); ## [1] mod2.yw$ar[1]+1.96*sqrt(mod2.yw$asy.var.coef[1,1]) ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
55 Installing a package install.packages("forecast", dependencies = TRUE) Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
56 Loading library library(forecast); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
57 Example. Dow Jones... again auto.arima(dow); ## Series: dow ## ARIMA(1,1,0) with drift ## ## Coefficients: ## ar1 drift ## ## s.e ## ## sigma^2 estimated as : log likelihood= ## AIC=75.38 AICc=75.71 BIC=82.41 Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
58 Example. Dow Jones... again 95% Confidence bounds for φ *(0.1058); ## [1] *(0.1058); ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58
= 75. See Hamilton p. 111 on solving PACF given ACF.
Washington University Fall 2009 Department of Economics James Morley Economics 518B Homework #2 GDP and ARMA Models Due: Tuesday 9/15 As with the first homework assignment, try to conserve paper in presenting
More informationIntro to ARMA models. FISH 507 Applied Time Series Analysis. Mark Scheuerell 15 Jan 2019
Intro to ARMA models FISH 507 Applied Time Series Analysis Mark Scheuerell 15 Jan 2019 Topics for today Review White noise Random walks Autoregressive (AR) models Moving average (MA) models Autoregressive
More informationSYS 6021 Linear Statistical Models
SYS 6021 Linear Statistical Models Project 2 Spam Filters Jinghe Zhang Summary The spambase data and time indexed counts of spams and hams are studied to develop accurate spam filters. Static models are
More informationPackage tseriesentropy
Package tseriesentropy April 15, 2017 Title Entropy Based Analysis and Tests for Time Series Date 2017-04-15 Version 0.6-0 Author Simone Giannerini Depends R (>= 2.14.0) Imports cubature, methods, parallel,
More informationStat 500 lab notes c Philip M. Dixon, Week 10: Autocorrelated errors
Week 10: Autocorrelated errors This week, I have done one possible analysis and provided lots of output for you to consider. Case study: predicting body fat Body fat is an important health measure, but
More informationThe Time Series Forecasting System Charles Hallahan, Economic Research Service/USDA, Washington, DC
The Time Series Forecasting System Charles Hallahan, Economic Research Service/USDA, Washington, DC INTRODUCTION The Time Series Forecasting System (TSFS) is a component of SAS/ETS that provides a menu-based
More informationModel Diagnostic tests
Model Diagnostic tests 1. Multicollinearity a) Pairwise correlation test Quick/Group stats/ correlations b) VIF Step 1. Open the EViews workfile named Fish8.wk1. (FROM DATA FILES- TSIME) Step 2. Select
More informationSection 4.1: Time Series I. Jared S. Murray The University of Texas at Austin McCombs School of Business
Section 4.1: Time Series I Jared S. Murray The University of Texas at Austin McCombs School of Business 1 Time Series Data and Dependence Time-series data are simply a collection of observations gathered
More informationChapter 15: Forecasting
Chapter 15: Forecasting In this chapter: 1. Forecasting chicken consumption using OLS (UE 15.1, Equation 6.8, p. 501) 2. Forecasting chicken consumption using a generalized least squares (GLS) model estimated
More informationStat 8053, Fall 2013: Additive Models
Stat 853, Fall 213: Additive Models We will only use the package mgcv for fitting additive and later generalized additive models. The best reference is S. N. Wood (26), Generalized Additive Models, An
More informationAdaptive spline autoregression threshold method in forecasting Mitsubishi car sales volume at PT Srikandi Diamond Motors
IOP Conference Series: Materials Science and Engineering PAPER OPEN ACCESS Adaptive spline autoregression threshold method in forecasting Mitsubishi car sales volume at PT Srikandi Diamond Motors To cite
More informationLecture 13: Model selection and regularization
Lecture 13: Model selection and regularization Reading: Sections 6.1-6.2.1 STATS 202: Data mining and analysis October 23, 2017 1 / 17 What do we know so far In linear regression, adding predictors always
More informationAaron Daniel Chia Huang Licai Huang Medhavi Sikaria Signal Processing: Forecasting and Modeling
Aaron Daniel Chia Huang Licai Huang Medhavi Sikaria Signal Processing: Forecasting and Modeling Abstract Forecasting future events and statistics is problematic because the data set is a stochastic, rather
More informationPackage robets. March 6, Type Package
Type Package Package robets March 6, 2018 Title Forecasting Time Series with Robust Exponential Smoothing Version 1.4 Date 2018-03-06 We provide an outlier robust alternative of the function ets() in the
More informationProblem set for Week 7 Linear models: Linear regression, multiple linear regression, ANOVA, ANCOVA
ECL 290 Statistical Models in Ecology using R Problem set for Week 7 Linear models: Linear regression, multiple linear regression, ANOVA, ANCOVA Datasets in this problem set adapted from those provided
More informationPackage freeknotsplines
Version 1.0.1 Date 2018-05-17 Package freeknotsplines June 10, 2018 Title Algorithms for Implementing Free-Knot Splines Author , Philip Smith , Pierre Lecuyer
More informationGW - WINKS SDA Windows KwikStat Statistical Data Analysis for Time Series Programs. Getting Started Guide
GW - WINKS SDA Windows KwikStat Statistical Data Analysis for Time Series Programs Getting Started Guide. An Overview of GW-WINKS Time Series Programs GW WINKS is an add-on component of the WINKS SDA Statistical
More informationVariable selection is intended to select the best subset of predictors. But why bother?
Chapter 10 Variable Selection Variable selection is intended to select the best subset of predictors. But why bother? 1. We want to explain the data in the simplest way redundant predictors should be removed.
More informationResources for statistical assistance. Quantitative covariates and regression analysis. Methods for predicting continuous outcomes.
Resources for statistical assistance Quantitative covariates and regression analysis Carolyn Taylor Applied Statistics and Data Science Group (ASDa) Department of Statistics, UBC January 24, 2017 Department
More informationCPSC 340: Machine Learning and Data Mining. Feature Selection Fall 2017
CPSC 340: Machine Learning and Data Mining Feature Selection Fall 2017 Assignment 2: Admin 1 late day to hand in tonight, 2 for Wednesday, answers posted Thursday. Extra office hours Thursday at 4pm (ICICS
More informationVideo Traffic Modeling Using Seasonal ARIMA Models
Video Traffic Modeling Using Seasonal ARIMA Models Abdel-Karim Al-Tamimi and Raj Jain Washington University in Saint Louis Saint Louis, MO 63130 Jain@cse.wustl.edu These slides are available on-line at:
More informationAnnexes : Sorties SAS pour l'exercice 3. Code SAS. libname process 'G:\Enseignements\M2ISN-Series temp\sas\';
Annexes : Sorties SAS pour l'exercice 3 Code SAS libname process 'G:\Enseignements\M2ISN-Series temp\sas\'; /* Etape 1 - Création des données*/ proc iml; phi={1-1.583 0.667-0.083}; theta={1}; y=armasim(phi,
More informationRepeated Measures Part 4: Blood Flow data
Repeated Measures Part 4: Blood Flow data /* bloodflow.sas */ options linesize=79 pagesize=100 noovp formdlim='_'; title 'Two within-subjecs factors: Blood flow data (NWK p. 1181)'; proc format; value
More informationSAS Econometrics and Time Series Analysis 1.1 for JMP
SAS Econometrics and Time Series Analysis 1.1 for JMP SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2011. SAS Econometrics and Time Series Analysis
More informationDifferentiation of Cognitive Abilities across the Lifespan. Online Supplement. Elliot M. Tucker-Drob
1 Differentiation of Cognitive Abilities across the Lifespan Online Supplement Elliot M. Tucker-Drob This online supplement reports the results of an alternative set of analyses performed on a single sample
More informationModule 29.1: nag tsa identify Time Series Analysis Identification. Contents
Time Series Analysis Module Contents Module 29.1: nag tsa identify Time Series Analysis Identification nag tsa identify contains procedures for analysing time series data. Contents Introduction..............................................................
More informationGraphical Tools for Exploring and Analyzing Data From ARIMA Time Series Models
Statistics Preprints Statistics 3-29-1999 Graphical Tools for Exploring and Analyzing Data From ARIMA Time Series Models William Q. Meeker Iowa State University, wqmeeker@iastate.edu Follow this and additional
More informationBootstrapping Methods
Bootstrapping Methods example of a Monte Carlo method these are one Monte Carlo statistical method some Bayesian statistical methods are Monte Carlo we can also simulate models using Monte Carlo methods
More information[spa-temp.inf] Spatial-temporal information
[spa-temp.inf] Spatial-temporal information VI Table of Contents for Spatial-temporal information I. Spatial-temporal information........................................... VI - 1 A. Cohort-survival method.........................................
More informationUnivariate Extreme Value Analysis. 1 Block Maxima. Practice problems using the extremes ( 2.0 5) package. 1. Pearson Type III distribution
Univariate Extreme Value Analysis Practice problems using the extremes ( 2.0 5) package. 1 Block Maxima 1. Pearson Type III distribution (a) Simulate 100 maxima from samples of size 1000 from the gamma
More informationSAS Econometrics and Time Series Analysis 2 for JMP. Third Edition
SAS Econometrics and Time Series Analysis 2 for JMP Third Edition The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2013. SAS Econometrics and Time Series Analysis 2
More informationMath 121. Graphing Rational Functions Fall 2016
Math 121. Graphing Rational Functions Fall 2016 1. Let x2 85 x 2 70. (a) State the domain of f, and simplify f if possible. (b) Find equations for the vertical asymptotes for the graph of f. (c) For each
More informationPractical 4: Mixed effect models
Practical 4: Mixed effect models This practical is about how to fit (generalised) linear mixed effects models using the lme4 package. You may need to install it first (using either the install.packages
More informationMPhil computer package lesson: getting started with Eviews
MPhil computer package lesson: getting started with Eviews Ryoko Ito (ri239@cam.ac.uk, itoryoko@gmail.com, www.itoryoko.com ) 1. Creating an Eviews workfile 1.1. Download Wage data.xlsx from my homepage:
More informationOn Choosing the Right Coordinate Transformation Method
On Choosing the Right Coordinate Transformation Method Yaron A. Felus 1 and Moshe Felus 1 The Survey of Israel, Tel-Aviv Surveying Engineering Department, MI Halperin Felus Surveying and Photogrammetry
More informationST440/540: Applied Bayesian Analysis. (5) Multi-parameter models - Initial values and convergence diagn
(5) Multi-parameter models - Initial values and convergence diagnostics Tuning the MCMC algoritm MCMC is beautiful because it can handle virtually any statistical model and it is usually pretty easy to
More informationModel selection Outline for today
Model selection Outline for today The problem of model selection Choose among models by a criterion rather than significance testing Criteria: Mallow s C p and AIC Search strategies: All subsets; stepaic
More information2016 Stat-Ease, Inc. & CAMO Software
Multivariate Analysis and Design of Experiments in practice using The Unscrambler X Frank Westad CAMO Software fw@camo.com Pat Whitcomb Stat-Ease pat@statease.com Agenda Goal: Part 1: Part 2: Show how
More informationRSM Split-Plot Designs & Diagnostics Solve Real-World Problems
RSM Split-Plot Designs & Diagnostics Solve Real-World Problems Shari Kraber Pat Whitcomb Martin Bezener Stat-Ease, Inc. Stat-Ease, Inc. Stat-Ease, Inc. 221 E. Hennepin Ave. 221 E. Hennepin Ave. 221 E.
More informationOne Factor Experiments
One Factor Experiments 20-1 Overview Computation of Effects Estimating Experimental Errors Allocation of Variation ANOVA Table and F-Test Visual Diagnostic Tests Confidence Intervals For Effects Unequal
More informationStat 5100 Handout #12.f SAS: Time Series Case Study (Unit 7)
Stat 5100 Handout #12.f SAS: Time Series Case Study (Unit 7) Data: Weekly sales (in thousands of units) of Super Tech Videocassette Tapes over 161 weeks [see Bowerman & O Connell Forecasting and Time Series:
More informationSerial Correlation and Heteroscedasticity in Time series Regressions. Econometric (EC3090) - Week 11 Agustín Bénétrix
Serial Correlation and Heteroscedasticity in Time series Regressions Econometric (EC3090) - Week 11 Agustín Bénétrix 1 Properties of OLS with serially correlated errors OLS still unbiased and consistent
More information1 Simple Linear Regression
Math 158 Jo Hardin R code 1 Simple Linear Regression Consider a dataset from ISLR on credit scores. Because we don t know the sampling mechanism used to collect the data, we are unable to generalize the
More informationZero-Inflated Poisson Regression
Chapter 329 Zero-Inflated Poisson Regression Introduction The zero-inflated Poisson (ZIP) regression is used for count data that exhibit overdispersion and excess zeros. The data distribution combines
More informationModelling and simulation of seismic reflectivity
Modelling reflectivity Modelling and simulation of seismic reflectivity Rita Aggarwala, Michael P. Lamoureux, and Gary F. Margrave ABSTRACT We decompose the reflectivity series obtained from a seismic
More informationMake Forecasts with Data exported from Google Analytics
Beyond Google Analytics. Using C# to integrate Google Analytics with R statistical server Juan Antonio Breña Moral Abstract Commercial Web sites are designed, planned and launched in many cases by Marketing
More informationSTA121: Applied Regression Analysis
STA121: Applied Regression Analysis Variable Selection - Chapters 8 in Dielman Artin Department of Statistical Science October 23, 2009 Outline Introduction 1 Introduction 2 3 4 Variable Selection Model
More informationTable Of Contents. Table Of Contents
Statistics Table Of Contents Table Of Contents Basic Statistics... 7 Basic Statistics Overview... 7 Descriptive Statistics Available for Display or Storage... 8 Display Descriptive Statistics... 9 Store
More informationThe Truth behind PGA Tour Player Scores
The Truth behind PGA Tour Player Scores Sukhyun Sean Park, Dong Kyun Kim, Ilsung Lee May 7, 2016 Abstract The main aim of this project is to analyze the variation in a dataset that is obtained from the
More informationCross-validation and the Bootstrap
Cross-validation and the Bootstrap In the section we discuss two resampling methods: cross-validation and the bootstrap. These methods refit a model of interest to samples formed from the training set,
More informationGeneralized Least Squares (GLS) and Estimated Generalized Least Squares (EGLS)
Generalized Least Squares (GLS) and Estimated Generalized Least Squares (EGLS) Linear Model in matrix notation for the population Y = Xβ + Var ( ) = In GLS, the error covariance matrix is known In EGLS
More informationComputation for the Introduction to MCMC Chapter of Handbook of Markov Chain Monte Carlo By Charles J. Geyer Technical Report No.
Computation for the Introduction to MCMC Chapter of Handbook of Markov Chain Monte Carlo By Charles J. Geyer Technical Report No. 679 School of Statistics University of Minnesota July 29, 2010 Abstract
More informationIntroduction to mixed-effects regression for (psycho)linguists
Introduction to mixed-effects regression for (psycho)linguists Martijn Wieling Department of Humanities Computing, University of Groningen Groningen, April 21, 2015 1 Martijn Wieling Introduction to mixed-effects
More informationThe goal of this handout is to allow you to install R on a Windows-based PC and to deal with some of the issues that can (will) come up.
Fall 2010 Handout on Using R Page: 1 The goal of this handout is to allow you to install R on a Windows-based PC and to deal with some of the issues that can (will) come up. 1. Installing R First off,
More informationPattern Recognition. Kjell Elenius. Speech, Music and Hearing KTH. March 29, 2007 Speech recognition
Pattern Recognition Kjell Elenius Speech, Music and Hearing KTH March 29, 2007 Speech recognition 2007 1 Ch 4. Pattern Recognition 1(3) Bayes Decision Theory Minimum-Error-Rate Decision Rules Discriminant
More informationGeneralized least squares (GLS) estimates of the level-2 coefficients,
Contents 1 Conceptual and Statistical Background for Two-Level Models...7 1.1 The general two-level model... 7 1.1.1 Level-1 model... 8 1.1.2 Level-2 model... 8 1.2 Parameter estimation... 9 1.3 Empirical
More informationNational College of Ireland. Project Submission Sheet 2015/2016. School of Computing
National College of Ireland Project Submission Sheet 2015/2016 School of Computing Student Name: Anicia Lafayette-Madden Student ID: 15006590 Programme: M.Sc Data Analytics Year: 2015-2016 Module: Configuration
More information# R script for Applied Time Series Analysis for Ecologists # Day 1 # 24 March # written by Mark Scheuerell
# R script for Applied Time Series Analysis for Ecologists # Day 1 # 24 March 2014 # written by Mark Scheuerell #------------------------- # load required libraries #------------------------- require("tseries")
More informationBayesian Statistics Group 8th March Slice samplers. (A very brief introduction) The basic idea
Bayesian Statistics Group 8th March 2000 Slice samplers (A very brief introduction) The basic idea lacements To sample from a distribution, simply sample uniformly from the region under the density function
More informationMarkov Chain Monte Carlo (part 1)
Markov Chain Monte Carlo (part 1) Edps 590BAY Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Spring 2018 Depending on the book that you select for
More informationPackage FWDselect. December 19, 2015
Title Selecting Variables in Regression Models Version 2.1.0 Date 2015-12-18 Author Marta Sestelo [aut, cre], Nora M. Villanueva [aut], Javier Roca-Pardinas [aut] Maintainer Marta Sestelo
More informationStatistical Consulting Topics Using cross-validation for model selection. Cross-validation is a technique that can be used for model evaluation.
Statistical Consulting Topics Using cross-validation for model selection Cross-validation is a technique that can be used for model evaluation. We often fit a model to a full data set and then perform
More informationVariogram Inversion and Uncertainty Using Dynamic Data. Simultaneouos Inversion with Variogram Updating
Variogram Inversion and Uncertainty Using Dynamic Data Z. A. Reza (zreza@ualberta.ca) and C. V. Deutsch (cdeutsch@civil.ualberta.ca) Department of Civil & Environmental Engineering, University of Alberta
More information22s:152 Applied Linear Regression
22s:152 Applied Linear Regression Chapter 22: Model Selection In model selection, the idea is to find the smallest set of variables which provides an adequate description of the data. We will consider
More informationOverview. Monte Carlo Methods. Statistics & Bayesian Inference Lecture 3. Situation At End Of Last Week
Statistics & Bayesian Inference Lecture 3 Joe Zuntz Overview Overview & Motivation Metropolis Hastings Monte Carlo Methods Importance sampling Direct sampling Gibbs sampling Monte-Carlo Markov Chains Emcee
More informationStandard Errors in OLS Luke Sonnet
Standard Errors in OLS Luke Sonnet Contents Variance-Covariance of ˆβ 1 Standard Estimation (Spherical Errors) 2 Robust Estimation (Heteroskedasticity Constistent Errors) 4 Cluster Robust Estimation 7
More information22s:152 Applied Linear Regression
22s:152 Applied Linear Regression Chapter 22: Model Selection In model selection, the idea is to find the smallest set of variables which provides an adequate description of the data. We will consider
More informationPackage mcmcse. February 15, 2013
Package mcmcse February 15, 2013 Version 1.0-1 Date 2012 Title Monte Carlo Standard Errors for MCMC Author James M. Flegal and John Hughes Maintainer James M. Flegal
More informationA New Measure of the Cluster Hypothesis
A New Measure of the Cluster Hypothesis Mark D. Smucker 1 and James Allan 2 1 Department of Management Sciences University of Waterloo 2 Center for Intelligent Information Retrieval Department of Computer
More informationGEOP 505/MATH 587 Fall 02 Homework 6
GEOP 55/MATH 587 Fall 2 Homework 6 In grading these homeworks, I found that the major problem that students had was a misunderstanding of the difference between the original time series z, the differenced
More informationChronology development, statistical analysis
A R S T A N Guide for computer program ARSTAN, by Edward R. Cook and Richard L. Holmes Adapted from Users Manual for Program ARSTAN, in Tree-Ring Chronologies of Western North America: California, eastern
More informationPackage MARX. June 16, 2017
Package MARX June 16, 2017 Title Simulation, Estimation and Selection of MARX Models Version 0.1 Date 2017-06-16 Author [aut, cre, cph], Alain Hecq [ctb], Lenard Lieb [ctb] Maintainer
More informationNon-linear Modelling Solutions to Exercises
Non-linear Modelling Solutions to Exercises April 13, 2017 Contents 1 Non-linear Modelling 2 1.6 Exercises.................................................... 2 1.6.1 Chick Weight.............................................
More informationLab Session 1. Introduction to Eviews
Albert-Ludwigs University Freiburg Department of Empirical Economics Time Series Analysis, Summer 2009 Dr. Sevtap Kestel To see the data of m1: 1 Lab Session 1 Introduction to Eviews We introduce the basic
More informationBinary Diagnostic Tests Clustered Samples
Chapter 538 Binary Diagnostic Tests Clustered Samples Introduction A cluster randomization trial occurs when whole groups or clusters of individuals are treated together. In the twogroup case, each cluster
More informationNOTE: Any use of trade, product or firm names is for descriptive purposes only and does not imply endorsement by the U.S. Government.
U.S. Geological Survey (USGS) MMA(1) NOTE: Any use of trade, product or firm names is for descriptive purposes only and does not imply endorsement by the U.S. Government. NAME MMA, A Computer Code for
More informationSistemática Teórica. Hernán Dopazo. Biomedical Genomics and Evolution Lab. Lesson 03 Statistical Model Selection
Sistemática Teórica Hernán Dopazo Biomedical Genomics and Evolution Lab Lesson 03 Statistical Model Selection Facultad de Ciencias Exactas y Naturales Universidad de Buenos Aires Argentina 2013 Statistical
More informationBayesian Networks Inference (continued) Learning
Learning BN tutorial: ftp://ftp.research.microsoft.com/pub/tr/tr-95-06.pdf TAN paper: http://www.cs.huji.ac.il/~nir/abstracts/frgg1.html Bayesian Networks Inference (continued) Learning Machine Learning
More informationMINI-PAPER A Gentle Introduction to the Analysis of Sequential Data
MINI-PAPER by Rong Pan, Ph.D., Assistant Professor of Industrial Engineering, Arizona State University We, applied statisticians and manufacturing engineers, often need to deal with sequential data, which
More informationFalse Analog Data Injection Attack Towards Topology Errors: Formulation and Feasibility Analysis
False Analog Data Injection Attack Towards Topology Errors: Formulation and Feasibility Analysis Yuqi Zhou, Jorge Cisneros-Saldana, Le Xie Department of Electrical and Computer Engineering Texas A&M University
More informationCPSC 340: Machine Learning and Data Mining
CPSC 340: Machine Learning and Data Mining Feature Selection Original version of these slides by Mark Schmidt, with modifications by Mike Gelbart. Admin Assignment 3: Due Friday Midterm: Feb 14 in class
More informationCS490D: Introduction to Data Mining Prof. Chris Clifton
CS490D: Introduction to Data Mining Prof. Chris Clifton April 5, 2004 Mining of Time Series Data Time-series database Mining Time-Series and Sequence Data Consists of sequences of values or events changing
More informationDiscussion Notes 3 Stepwise Regression and Model Selection
Discussion Notes 3 Stepwise Regression and Model Selection Stepwise Regression There are many different commands for doing stepwise regression. Here we introduce the command step. There are many arguments
More informationChapter 15 Mixed Models. Chapter Table of Contents. Introduction Split Plot Experiment Clustered Data References...
Chapter 15 Mixed Models Chapter Table of Contents Introduction...309 Split Plot Experiment...311 Clustered Data...320 References...326 308 Chapter 15. Mixed Models Chapter 15 Mixed Models Introduction
More informationNormal Plot of the Effects (response is Mean free height, Alpha = 0.05)
Percent Normal Plot of the Effects (response is Mean free height, lpha = 5) 99 95 Effect Type Not Significant Significant 90 80 70 60 50 40 30 20 E F actor C D E Name C D E 0 5 E -0.3-0. Effect 0. 0.3
More informationOn the Test and Estimation of Fractional Parameter. in ARFIMA Model: Bootstrap Approach
Applied Mathematical Sciences, Vol. 8, 2014, no. 96, 4783-4792 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ams.2014.46498 On the Test and Estimation of Fractional Parameter in ARFIMA Model:
More informationThe Perils of Unfettered In-Sample Backtesting
The Perils of Unfettered In-Sample Backtesting Tyler Yeats June 8, 2015 Abstract When testing a financial investment strategy, it is common to use what is known as a backtest, or a simulation of how well
More informationRocLab 1.0. Rock Mass Strength Analysis using the Generalized Hoek-Brown failure criterion
RocLab 1.0 Rock Mass Strength Analysis using the Generalized Hoek-Brown failure criterion RocLab is a software program for determining rock mass strength parameters, based on the latest version of the
More informationSOCY7706: Longitudinal Data Analysis Instructor: Natasha Sarkisian. Panel Data Analysis: Fixed Effects Models
SOCY776: Longitudinal Data Analysis Instructor: Natasha Sarkisian Panel Data Analysis: Fixed Effects Models Fixed effects models are similar to the first difference model we considered for two wave data
More informationSmall Libraries of Protein Fragments through Clustering
Small Libraries of Protein Fragments through Clustering Varun Ganapathi Department of Computer Science Stanford University June 8, 2005 Abstract When trying to extract information from the information
More informationPolynomial Curve Fitting of Execution Time of Binary Search in Worst Case in Personal Computer
Polynomial Curve Fitting of Execution Time of Binary Search in Worst Case in Personal Computer Dipankar Das Assistant Professor, The Heritage Academy, Kolkata, India Abstract Curve fitting is a well known
More informationA Sequential ESM Track Association Algorithm Based on the Use of Information Theoretic Criteria
A Sequential ESM Track Association Algorithm Based on the Use of Information Theoretic Criteria Yifeng Zhou and Jim Mickeal Radar Electronic Warfare Section Defence R&D Canada - Ottawa 371 Carling Avenue,
More informationPackage batchmeans. R topics documented: July 4, Version Date
Package batchmeans July 4, 2016 Version 1.0-3 Date 2016-07-03 Title Consistent Batch Means Estimation of Monte Carlo Standard Errors Author Murali Haran and John Hughes
More informationDescriptive Statistics, Standard Deviation and Standard Error
AP Biology Calculations: Descriptive Statistics, Standard Deviation and Standard Error SBI4UP The Scientific Method & Experimental Design Scientific method is used to explore observations and answer questions.
More information**************************************************************************************************************** * Single Wave Analyses.
ASDA2 ANALYSIS EXAMPLE REPLICATION SPSS C11 * Syntax for Analysis Example Replication C11 * Use data sets previously prepared in SAS, refer to SAS Analysis Example Replication for C11 for details. ****************************************************************************************************************
More informationAnnotated multitree output
Annotated multitree output A simplified version of the two high-threshold (2HT) model, applied to two experimental conditions, is used as an example to illustrate the output provided by multitree (version
More informationClustering. Mihaela van der Schaar. January 27, Department of Engineering Science University of Oxford
Department of Engineering Science University of Oxford January 27, 2017 Many datasets consist of multiple heterogeneous subsets. Cluster analysis: Given an unlabelled data, want algorithms that automatically
More informationCOS Lecture 13 Autonomous Robot Navigation
COS 495 - Lecture 13 Autonomous Robot Navigation Instructor: Chris Clark Semester: Fall 2011 1 Figures courtesy of Siegwart & Nourbakhsh Control Structure Prior Knowledge Operator Commands Localization
More information1. Estimation equations for strip transect sampling, using notation consistent with that used to
Web-based Supplementary Materials for Line Transect Methods for Plant Surveys by S.T. Buckland, D.L. Borchers, A. Johnston, P.A. Henrys and T.A. Marques Web Appendix A. Introduction In this on-line appendix,
More informationChapter 6 Continued: Partitioning Methods
Chapter 6 Continued: Partitioning Methods Partitioning methods fix the number of clusters k and seek the best possible partition for that k. The goal is to choose the partition which gives the optimal
More information