Estimation in the Time Domain

Size: px

Start display at page:

Download "Estimation in the Time Domain"

Clementine Dorsey
5 years ago
Views:

1 Estimation in the Time Domain Al Nosedal University of Toronto March 9, 2016 Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

2 The R function ar.yw( ) An R function is available for solving the Yule-Walker equations, ar.yw( ). Pass any sequence of errors or residuals to the function and the function will solve a sequence of models, AR(1)... AR(m) and pick the best model using the criterion of minimizing AIC. It should be noted that the Yule-Walker estimates are not maximum likelihood estimates, so that AIC is only approximated. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

3 AR(3) Example # Simulating 2000 AR(3) errors; error<- arima.sim(n=2000, list(ar=c(1.4,-0.31,-0.126) ),sd=3 ); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

4 AR(3) Example # Simulating 2000 AR(3) errors; plot.ts(error,main="ar(3) errors, n=2000"); abline(0,0); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

5 AR(3) Example AR(3) errors, n=2000 error Time Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

6 ar.yw() z<-ar.yw(error); names(z); ## [1] "order" "ar" "var.pred" "x.mean" ## [5] "aic" "n.used" "order.max" "partiala ## [9] "resid" "method" "series" "frequenc ## [13] "call" "asy.var.coef" Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

7 ar.yw() z$aic[1:8]; ## ## ## ## Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

8 This is a common format for AIC values. With AIC (or BIC), the values themselves are unique only up to a common constant, and it is the minimum value and differences that are important. Therefore, all values are frequently reported as differences from the minimum value. Using AIC, the AR(3) model is identified as the best. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

9 ar.yw() z$order; ## [1] 3 # The order, m, of the model # chosen by AIC. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

10 ar.yw() z$ar; ## [1] # The estimated values of: # a_1, a_2,..., a_m for # the chosen order. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

11 ar.yw() z$var.pred; ## [1] # The estimate for # variance of w. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

12 In some cases, there is a desire to find ˆφ 1,..., ˆφ m and/ or σ 2 w for a different model. In that case, the command ar.yw(error, aic = FALSE, order.max=m) will ignore AIC and force the fitting of some specified order m. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

13 For each candidate model, AR(0) to AR(5), the command ar.yw(error, aic = FALSE, order.max =m) can be used to fit a specified order. Because information criteria assumes maximum likelihood estimates, a slightly different command can be used to get just those values ar.mle(error, aic = FALSE, order.max =m) (we use it to obtain ˆσ 2 w ). Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

14 BIC estimate A reasonable estimate would be BIC nln(ˆσ 2 w ) + mln(n), using ar.mle( ) to estimate σ 2 w. (ˆσ 2 w = variance estimate, n = sample size or number of observations in your series, m = specified order). Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

15 ar.yw() z.0<-ar.mle(error,aic=false,order.max=0); z.1<-ar.mle(error,aic=false,order.max=1); z.2<-ar.mle(error,aic=false,order.max=2); z.3<-ar.mle(error,aic=false,order.max=3); z.4<-ar.mle(error,aic=false,order.max=4); z.5<-ar.mle(error,aic=false,order.max=5); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

16 Model σw 2 BIC BIC Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

17 The choice of ar.yw( ) versus ar.mle( ) for estimation of σ 2 w was of no impact. However, ar.mle( ) may run slow or even fail to converge for large sets of data. The results from our table correctly identify the AR(3) model as the model that generated that data. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

18 A sequence of hypothesis tests Let ˆφ m m be the estimate of φ m for the model AR(m). It is known that ˆφ m m has a N(0, 1/n) distribution if the true model has order less than m. Therefore z n ˆφ m m is a standard Normal if the order is less than m and can be treated as a test statistic for H 0 : the order is less than m vs H a : the order is at least m. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

19 A sequence of tests for the order Model ˆφm m (mle) z p-value Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

20 It seems clear that the correct order is 3 from these hypothesis tests. Put more precisely, the data suggests that an order of at least 3 is required, but there is no evidence that an order higher than 3 is required. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

21 Hypothesis tests presented in a plot If the values ˆφ m m for m = 1, 2,... are plotted versus m, with lines at ±2/ n, the display becomes a visual representation of the above hypothesis tests. When ˆφ m m falls far outside the lines ±2/ n, there is evidence that the model is at least of order m; when the values begin to fall far inside the lines, the indications are that the order is no higher than the last large spike. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

22 The Partial Autocorrelation Plot pacf(error); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

23 The Partial Autocorrelation Plot Series error Partial ACF Lag Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

24 Example. The lake data This series {Y t, t = 1, 2,..., 98} has already been studied (see assignment 1). In this example we shall consider the problem of fitting an AR process directly to the data without first removing any trend component. Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

25 Example. The lake data # Reading data; lake<-read.table(file="lake-huron.dat",header=false); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

26 Example. The lake data plot.ts(lake,xlab="t",ylab="level"); level t Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

27 Example. The lake data new.lake<-unlist(lake) - mean(unlist(lake)) # mean-corrected data Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

28 Example. The lake data acf(new.lake) Series new.lake ACF Lag Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

29 Example. The lake data pacf(new.lake) Series new.lake Partial ACF Lag Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

30 Example. The lake data mod1.yw<-ar.yw(new.lake); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

31 Example. The lake data mod1.yw$aic[1:11]; ## ## ## ## Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

32 Example. The lake data mod1.yw$order; ## [1] 2 Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

33 Example. The lake data mod1.yw$ar; ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

34 Example. The lake data mod1.yw$var.pred; ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

35 Example. The lake data mod1.yw$asy.var.coef; ## [,1] [,2] ## [1,] ## [2,] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

36 Example. The lake data 95% Confidence Bounds for ˆφ 1. mod1.yw$ar[1]-1.96*sqrt(mod1.yw$asy.var.coef[1,1]); ## [1] mod1.yw$ar[1]+1.96*sqrt(mod1.yw$asy.var.coef[1,1]) ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

37 Example. The lake data 95% Confidence Bounds for ˆφ 2. mod1.yw$ar[2]-1.96*sqrt(mod1.yw$asy.var.coef[2,2]); ## [1] mod1.yw$ar[2]+1.96*sqrt(mod1.yw$asy.var.coef[2,2]) ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

38 ar.yw() m.0<-ar.mle(new.lake,aic=false,order.max=0); m.1<-ar.mle(new.lake,aic=false,order.max=1); m.2<-ar.mle(new.lake,aic=false,order.max=2); m.3<-ar.mle(new.lake,aic=false,order.max=3); m.4<-ar.mle(new.lake,aic=false,order.max=4); m.5<-ar.mle(new.lake,aic=false,order.max=5); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

39 Model σw 2 BIC BIC Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

40 Example. Dow Jones # Reading data; dow<-read.table(file="dowj.dat",header=false) Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

41 Example. Dow Jones plot.ts(dow,xlab="t",ylab="dow Index") Dow Index t Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

42 Example. Dow Jones mode(dow); ## [1] "list" Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

43 Example. Dow Jones # Computing first differences; diff.dow<-diff(unlist(dow)); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

44 Example. Dow Jones plot.ts(diff.dow,xlab="t",ylab="diff(dow Index)"); abline(h=mean(diff.dow), lty=2, col="red"); diff(dow Index) Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

45 Example. Dow Jones acf(diff.dow); Series diff.dow ACF Lag Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

46 Example. Dow Jones pacf(diff.dow); Series diff.dow Partial ACF Lag Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

47 Example. Dow Jones mean(diff.dow); ## [1] new.diff<-diff.dow - mean(diff.dow); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

48 Example. Dow Jones mod2.yw<-ar.yw(new.diff); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

49 Example. Dow Jones mod2.yw$aic[1:8]; ## ## ## 6 7 ## Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

50 Example. Dow Jones mod2.yw$order; ## [1] 1 Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

51 Example. Dow Jones mod2.yw$ar; ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

52 Example. Dow Jones mod2.yw$var.pred; ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

53 Example. Dow Jones mod2.yw$asy.var.coef; ## [,1] ## [1,] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

54 Example. Dow Jones 95% Confidence Bounds for ˆφ 1. mod2.yw$ar[1]-1.96*sqrt(mod2.yw$asy.var.coef[1,1]); ## [1] mod2.yw$ar[1]+1.96*sqrt(mod2.yw$asy.var.coef[1,1]) ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

55 Installing a package install.packages("forecast", dependencies = TRUE) Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

56 Loading library library(forecast); Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

57 Example. Dow Jones... again auto.arima(dow); ## Series: dow ## ARIMA(1,1,0) with drift ## ## Coefficients: ## ar1 drift ## ## s.e ## ## sigma^2 estimated as : log likelihood= ## AIC=75.38 AICc=75.71 BIC=82.41 Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

58 Example. Dow Jones... again 95% Confidence bounds for φ *(0.1058); ## [1] *(0.1058); ## [1] Al Nosedal University of Toronto Estimation in the Time Domain March 9, / 58

= 75. See Hamilton p. 111 on solving PACF given ACF.

= 75. See Hamilton p. 111 on solving PACF given ACF. Washington University Fall 2009 Department of Economics James Morley Economics 518B Homework #2 GDP and ARMA Models Due: Tuesday 9/15 As with the first homework assignment, try to conserve paper in presenting