xsmle  A Command to Estimate Spatial Panel Data Models in Stata


 Cameron Berry
 1 years ago
 Views:
Transcription
1 xsmle  A Command to Estimate Spatial Panel Data Models in Stata Federico Belotti?, Gordon Hughes,AndreaPianoMortari?? CEIS, University of Rome Tor Vergata School of Economics, University of Edinburg Italian Stata Users Group Meeting Florence, November
2 Outline 1 Motivation 2 Options common to all spatial models The weighting matrix Models specific options xsmle postestimation command 3
3 A general specification for Spatial Panel models: nx KX KX nx y it = + y it 1 + w ij y jt + x itk k + w ij x jtk k + j=1 k=1 k=1 j=1 + µ i + t + it (1) nx it = m ij it + it i =1,...,n t =1,...,T (2) j=1 Static Models ( = 0 ) and Dynamic Models ( 6= 0,Yuetal.(2008)) if =0and! Spatial Autoregressive Model with Auto Regressive disturbances (SAC) if if =0! Spatial Durbin Model (SDM) =0and =0! Spatial Autoregressive Model (SAR) if =0and =0! Spatial Error Model (SEM) if =0, =0,andµ i = P n j=1 w ij µ i + i! Generalised Spatial Panel Random E ects model (GSPRE)
4 Anumberofspatialrelatedroutineshavebeenwrittenbyusersandavailable through SSC. A noncomprehensive list includes: Data management and visualization shp2dta by K. Crow spmat by D.M. Drukker et al spwmatrix by P.W. Jeanty spmap by M. Pisati geocode3 by S. Bernhard Cross sectional data spreg: SAR,SEM,SACvia ML or GS2SLS by D. M. Drukker et al spivreg: SACvia GS2SLS by D. M. Drukker et al spmlreg: SAR,SEM,SDM,SACvia ML by P.W. Jeanty spatreg: SAR,SEMvia ML by M. Pisati spautoreg: SAR,SEM,SDM,SACvia ML or GS2SLS by E.A. Shehata Panel data spreg*xt suite SAR, SEM, SDM, SAC via LS, GLS, GMM or GS2SLS by E.A. Shehata
5 DGP replications y it = nx w ij y jt +0.3x 1it +0.7x 2it + µ i + t + it n =1,...,188 t =1,...,5 j=1 where the nuisance parameters µ i (i =1,...,n) are drawn from an iid standard Gaussian random variable. To allow for dependence between the unitspecific e ects and the regressors, we generate the latter as follows x kit =0.4µ i +( ) 1/2 z kit, where k =1, 2andthez kit is an iid standard Gaussian random variable. =0.3 =0.5 =0.7 bias MSE bias MSE bias MSE xsmle spregfext xtivreg
6 Options common to all spatial models The weighting matrix Models specific options xsmle postestimation command xsmle fits (balanced) Spatial Panel data models via maximum likelihood (ML) Requirements: (At least) Stata Version 10 The n n matrix of spatial weights. xsmle will deal with the longitudinal dimension automatically Data must be tsset or xtset
7 Options common to all spatial models The weighting matrix Models specific options xsmle postestimation command The basic xsmle syntax is the following xsmle depvar indepvars if in weight, options The default model is the randome ects SAR model Only aweight are allowed but the declared weight variable must be constant within each panel unit The mi prefix is allowed Factor variables are allowed
8 Options common to all spatial models The weighting matrix Models specific options xsmle postestimation command Options common to all spatial models model(name) specifiesthespatialmodeltobeestimated.maybesar for the SpatialAutoRegressive model, sdm for the Spatial Durbin Model, sem for the SpatialError Model, sac for the SpatialAutoregressive with Spatially Autocorrelated Errors Model, gspre for the Generalised Spatial Random E ects Model. re use the random e ects estimator; the default. This option cannot be specified when model(sac). fe use the fixed e ects estimator. This option cannot be specified when model(gspre). type(type options [, leeyu]) specifies fixede ects type; only for fe estimators. May be ind for spatial fixed e ects, time for time fixed e ects or both for both spatial and time fixed e ects. Suboption leeyu allows to transform the data according to Lee and Yu (2010) approach and can be used only when type(ind).
9 Options common to all spatial models The weighting matrix Models specific options xsmle postestimation command noconstant suppresses the constant term in the model. Only for re estimators. noeffects suppresses the computation of direct, indirect and total e ects. nsim(#) sets the number of simulations for the LeSage and Pace (2009) procedure to compute the standard errors of the direct, indirect and total e ects. constraints(constraints list) appliesspecifiedlinearconstraints. from(init specs) specifies initial values for the coe cients. level(#) sets confidence level for confidence intervals; default is level(95). postscore save observationbyobservation scores in the estimation results list. posthessian save the Hessian corresponding to the full set of coe in the estimation results list. hausman performs the Hausman test. cients
10 Options common to all spatial models The weighting matrix Models specific options xsmle postestimation command Variance estimation This section describes the arguments of the vce(vcetype) option. oim observed information matrix. opg outer product of the gradient vectors. robust clustered sandwich estimator where clustvar is the panelvar. cluster clustvar clustered sandwich estimator. dkraay(#) DriscollKraay robust estimator. Where # is the maximum lag used in the calculation.
11 Options common to all spatial models The weighting matrix Models specific options xsmle postestimation command In xsmle the spatial weighting matrix can be astatamatrix a spmat object In both cases the matrix can be standardized or not. e.g. a Stata matrix can be created using matrix define, imported from Mata using st matrix( string scalar name, real matrix) or imported from GIS softwares like GeoDa using spwmatrix gal using path to gal file, wname(name of the matrix) spmat objects are created by spmat spmat import name of the object using path to file
12 Options common to all spatial models The weighting matrix Models specific options xsmle postestimation command SAR model wmatrix(name) specifies the weight matrix for the spatialautoregressive term. dlag includes (time) lagged dependent variable in the model. SDM model wmatrix(name) specifies the weight matrix for the spatialautoregressive term. dmatrix(name) specifies the weight matrix for the spatially lagged regressors; default is to use the matrix specified in wmat(name). durbin(dvarlist) specifiestheregressorsthathavetobespatiallylagged; default is to lag all independent variables specified in varlist. dlag includes (time) lagged dependent variable in the model.
13 Options common to all spatial models The weighting matrix Models specific options xsmle postestimation command SEM model ematrix(name) specifies the weight matrix for the spatialautocorrelated error term. SAC model wmatrix(name) specifies the weight matrix for the spatialautoregressive term. ematrix(name) specifies the weight matrix for the spatialautocorrelated error term. GSPRE model wmatrix(name) specifies the weight matrix for the spatialautocorrelated randome ects. ematrix(name) specifies the weight matrix for the spatialautocorrelated error term. error(#) defines the structure of the model. # is equal to 1 when 6= 6= 0,#isequalto2when =0,#isequalto3when =0,#is equal to 4 when =.
14 Options common to all spatial models The weighting matrix Models specific options xsmle postestimation command command allows to postestimate spatial fixed or random e ects. The methods implemented in this command are the panel data extensions of those available in Drukker, Prucha, and Raciborski (2011) predict type newvar if in, statistic where statistic includes: rform the default, calculates predicted values from the reducedform equation: y it =(I n W ) 1 (x it + i ) limited predicted values based on the limited information set. This option is available only when model(sac). naive predicted values based on the observed values of y it = Wy it + x it + i xb calculates the linear prediction including the fixed or random e ect x it + i. a estimates i,thefixedorrandome ect. In the case of fixede ects models, this statistic is allowed only when type(ind)
15 DGP  Fixed e ects SDM y it = nx w ij y jt +0.5x 1it 0.3x 2it 0.2x 3it +0.3 j=1 nx w ij x 2it +0.9 j=1 nx w ij x 1it + j=1 nx w ij x 3it + µ i + t + it n =1,...,188 t =1,...,5 j=1 where the nuisance parameters µ i (i =1,...,n) are drawn from an iid standard Gaussian random variable. To allow for dependence between the unitspecific e ects and the regressors, we generate the latter as follows x kit =0.4µ i +( ) 1/2 z kit, where k =1, 2, 3, z 1it is standard Gaussian, z 2it is N(0, )andz 3it is N(0, 2 2 ).
16 .. *** load a dta dataset containing the spatial contiguity matrix. use ASL_contiguity_mat_ns.dta, clear. *** get an spmat objects from dta. spmat dta W W*, replace. *** Summarize the spmat obj. spmat summarize W, links Summary of spatialweighting object W Matrix Description Dimensions 188 x 188 Stored as 188 x 188 Links total 906 min 1 mean max
17 . ** Fixedeffects Durbin model (correctly specified, row normalized W). xsmle y x1 x2 x3, wmat(w) model(sdm) fe type(ind) nsim(500) nolog Warning: All regressors will be spatially lagged SDM with spatial fixedeffects Number of obs = 940 Group variable: id Number of groups = 188 Time variable: t Panel length = 5 Rsq: within = between = overall = Mean of fixedeffects = Loglikelihood = y Coef. Std. Err. z P> z [95% Conf. Interval] Main x x x Wx x x x Spatial rho Variance sigma2_e [CONTINUES]
18 In a spatial setting, the e ect of an explanatory variable change in a particular unit a ects not only that unit but also its neighbors (LeSage and Pace, 2009). 2 3 =(I W ) 1 nk 4 k w 12 k w 1n k w 21 k k w 2n k w n1 k w n2 k k 7 5 If we have only 2 units and 1 regressor: apple SAR and SAC! (I W ) apple 1 0 SEM! 0 1 apple SDM! (I W ) 1 1 w 12 1 w
19 [CONTINUES] Direct x x x Indirect x x x Total x x x estimates store sdm_fe
20 ** Fixedeffects Durbin model (correctly specified, row normalized W). xsmle y x1 x2 x3, wmat(w) model(sdm) re type(ind) nsim(500) nolog noeff Warning: Option type(ind) will be ignored Warning: All regressors will be spatially lagged SDM with randomeffects Number of obs = 940 Group variable: id Number of groups = 188 Time variable: t Panel length = 5 Rsq: within = between = overall = Loglikelihood = y Coef. Std. Err. z P> z [95% Conf. Interval] Main x x x _cons Wx x x x Spatial rho Variance lgt_theta sigma_e estimates store sdm_re
21 hausman sdm_fe sdm_re, eq(1:1 2:2 3:3)  Coefficients  (b) (B) (bb) sqrt(diag(v_bv_b)) sdm_fe sdm_re Difference S.E. comp1 x x x comp2 x x x comp3 rho b = consistent under Ho and Ha; obtained from xsmle B = inconsistent under Ha, efficient under Ho; obtained from xsmle Test: Ho: difference in coefficients not systematic chi2(7) = (bb) [(V_bV_B)^(1)](bB) = chi2<0 ==> model fitted on these data fails to meet the asymptotic assumptions of the Hausman test; see suest for a generalized test
22 . ** Fixedeffects Durbin model (correctly specified, row normalized W). xsmle y x1 x2 x3, wmat(w) model(sdm) fe type(ind) hausman noeff nolog Warning: All regressors will be spatially lagged... estimating randomeffects model to perform Hausman test SDM with spatial fixedeffects Number of obs = 940 Group variable: id Number of groups = 188 Time variable: t Panel length = 5 Rsq: within = between = overall = Mean of fixedeffects = Loglikelihood = y Coef. Std. Err. z P> z [95% Conf. Interval] Main x x x Wx x x x Spatial rho Variance sigma2_e Ho: difference in coeffs not systematic chi2(7) = Prob>=chi2 =
23 . *********. predict yhat, rform. predict yhat1, naive. predict alphahat, a. sum alpha alphahat Variable Obs Mean Std. Dev. Min Max alpha alphahat
24
25 Using matrix notation the SDM ( = 0) may be derived from a SEM model hence { y = X + u u = W u + u(1 W ) = y(1 W ) = X (1 W )+ y = W y + X WX + y = W y + X + WX + and test the following constraints 1 =0and 6= 0) the model is a SAR 2 = ) the model is a SEM.
26 ** Test for SAR. test [Wx]x1 = [Wx]x2 = [Wx]x3 = 0 ( 1) [Wx]x1  [Wx]x2 = 0 ( 2) [Wx]x1  [Wx]x3 = 0 ( 3) [Wx]x1 = 0 chi2( 3) = Prob > chi2 = ** Test for SEM. testnl ([Wx]x1 = [Spatial]rho*[Main]x1) ([Wx]x2 = [Spatial]rho*[Main]x2) ([ > Wx]x3 = [Spatial]rho*[Main]x3) (1) [Wx]x1 = [Spatial]rho*[Main]x1 (2) [Wx]x2 = [Spatial]rho*[Main]x2 (3) [Wx]x3 = [Spatial]rho*[Main]x3 chi2(3) = Prob > chi2 =
27 ** SAC Model. xsmle y x1 x2 x3, wmat(w) emat(w) model(sac) fe type(ind) noeff nolog SAC with spatial fixedeffects Number of obs = 940 Group variable: id Number of groups = 188 Time variable: t Panel length = 5 Rsq: within = between = overall = Mean of fixedeffects = Loglikelihood = y Coef. Std. Err. z P> z [95% Conf. Interval] Main x x x Spatial rho lambda Variance sigma2_e
28 SDM vs SAC. qui xsmle y x1 x2 x3, wmat(w) model(sdm) fe type(ind) noeff nolog. estat ic Model Obs ll(null) ll(model) df AIC BIC Note: N=Obs used in calculating BIC; see [R] BIC note. qui xsmle y x1 x2 x3, wmat(w) emat(w) model(sac) fe type(ind) noeff nolog. estat ic Model Obs ll(null) ll(model) df AIC BIC Note: N=Obs used in calculating BIC; see [R] BIC note
29 Lee, L. F. and Yu, J. (2010). of spatial autoregressive panel data models with fixed e ects. Journal of Econometrics, 154(2): LeSage, J. P. and Pace, R. K. (2009). Introduction to Spatial Econometrics. Taylor & Francis. Yu, J., de Jong, R., and Lee, L. F. (2008). Quasimaximum likelihood estimators for spatial dynamic panel data with fixed e ects when both n and tarelarge.journal of Econometrics, 146:
Bivariate (Simple) Regression Analysis
Revised July 2018 Bivariate (Simple) Regression Analysis This set of notes shows how to use Stata to estimate a simple (twovariable) regression equation. It assumes that you have set Stata up on your
More informationSOCY7706: Longitudinal Data Analysis Instructor: Natasha Sarkisian. Panel Data Analysis: Fixed Effects Models
SOCY776: Longitudinal Data Analysis Instructor: Natasha Sarkisian Panel Data Analysis: Fixed Effects Models Fixed effects models are similar to the first difference model we considered for two wave data
More informationPanel Data 4: Fixed Effects vs Random Effects Models
Panel Data 4: Fixed Effects vs Random Effects Models Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised April 4, 2017 These notes borrow very heavily, sometimes verbatim,
More informationpiecewise ginireg 1 Piecewise Gini Regressions in Stata Jan Ditzen 1 Shlomo Yitzhaki 2 September 8, 2017
piecewise ginireg 1 Piecewise Gini Regressions in Stata Jan Ditzen 1 Shlomo Yitzhaki 2 1 HeriotWatt University, Edinburgh, UK Center for Energy Economics Research and Policy (CEERP) 2 The Hebrew University
More informationInstrumental variables, bootstrapping, and generalized linear models
The Stata Journal (2003) 3, Number 4, pp. 351 360 Instrumental variables, bootstrapping, and generalized linear models James W. Hardin Arnold School of Public Health University of South Carolina Columbia,
More informationMay 24, Emil Coman 1 Yinghui Duan 2 Daren Anderson 3
Assessing Health Disparities in Intensive Longitudinal Data: Gender Differences in Granger Causality Between Primary Care Provider and Emergency Room Usage, Assessed with Medicaid Insurance Claims May
More informationFitting adjusted limited dependent variable mixture models to EQ5D
The Stata Journal (2015) 15, Number 3, pp. 737 750 Fitting adjusted limited dependent variable mixture models to EQ5D Mónica Hernández Alava School of Health and Related Research Health Economics and
More informationWeek 4: Simple Linear Regression III
Week 4: Simple Linear Regression III Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Goodness of
More informationTitle. Syntax. stata.com. intreg Interval regression. weight. indepvars. intreg depvar 1 depvar 2. , options. Description. options
Title stata.com intreg Interval regression Syntax Menu Description Options Remarks and examples Stored results Methods and formulas References Also see Syntax intreg depvar 1 depvar 2 [ indepvars ] [ if
More informationStata Training. AGRODEP Technical Note 08. April Manuel Barron and Pia Basurto
AGRODEP Technical Note 08 April 2013 Stata Training Manuel Barron and Pia Basurto AGRODEP Technical Notes are designed to document stateoftheart tools and methods. They are circulated in order to help
More informationAn Introduction to Stata Part II: Data Analysis
An Introduction to Stata Part II: Data Analysis Kerry L. Papps 1. Overview Dofiles Sorting a dataset Combining datasets Creating a dataset of means or medians etc. Weights Panel data capabilities Dummy
More informationLab 2: OLS regression
Lab 2: OLS regression Andreas Beger February 2, 2009 1 Overview This lab covers basic OLS regression in Stata, including: multivariate OLS regression reporting coefficients with different confidence intervals
More informationComputing Murphy Topelcorrected variances in a heckprobit model with endogeneity
The Stata Journal (2010) 10, Number 2, pp. 252 258 Computing Murphy Topelcorrected variances in a heckprobit model with endogeneity Juan Muro Department of Statistics, Economic Structure, and International
More informationWeek 4: Simple Linear Regression II
Week 4: Simple Linear Regression II Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Algebraic properties
More informationSpeeding Up the ARDL Estimation Command:
Speeding Up the ARDL Estimation Command: A Case Study in Efficient Programming in Stata and Mata Sebastian Kripfganz 1 Daniel C. Schneider 2 1 University of Exeter 2 Max Planck Institute for Demographic
More informationMultipleimputation analysis using Stata s mi command
Multipleimputation analysis using Stata s mi command Yulia Marchenko Senior Statistician StataCorp LP 2009 UK Stata Users Group Meeting Yulia Marchenko (StataCorp) Multipleimputation analysis using mi
More informationschooling.log 7/5/2006
 log: C:\dnb\schooling.log log type: text opened on: 5 Jul 2006, 09:03:57. /* schooling.log */ > use schooling;. gen age2=age76^2;. /* OLS (inconsistent) */ > reg lwage76
More informationWeek 10: Heteroskedasticity II
Week 10: Heteroskedasticity II Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Dealing with heteroskedasticy
More informationCH5: CORR & SIMPLE LINEAR REFRESSION =======================================
STAT 430 SAS Examples SAS5 ===================== ssh xyz@glue.umd.edu, tap sas913 (old sas82), sas https://www.statlab.umd.edu/sasdoc/sashtml/onldoc.htm CH5: CORR & SIMPLE LINEAR REFRESSION =======================================
More informationHeteroskedasticity and Homoskedasticity, and HomoskedasticityOnly Standard Errors
Heteroskedasticity and Homoskedasticity, and HomoskedasticityOnly Standard Errors (Section 5.4) What? Consequences of homoskedasticity Implication for computing standard errors What do these two terms
More informationStata Session 2. Tarjei Havnes. University of Oslo. Statistics Norway. ECON 4136, UiO, 2012
Stata Session 2 Tarjei Havnes 1 ESOP and Department of Economics University of Oslo 2 Research department Statistics Norway ECON 4136, UiO, 2012 Tarjei Havnes (University of Oslo) Stata Session 2 ECON
More informationINTRODUCTION TO PANEL DATA ANALYSIS
INTRODUCTION TO PANEL DATA ANALYSIS USING EVIEWS FARIDAH NAJUNA MISMAN, PhD FINANCE DEPARTMENT FACULTY OF BUSINESS & MANAGEMENT UiTM JOHOR PANEL DATA WORKSHOP23&24 MAY 2017 1 OUTLINE 1. Introduction 2.
More informationStat 500 lab notes c Philip M. Dixon, Week 10: Autocorrelated errors
Week 10: Autocorrelated errors This week, I have done one possible analysis and provided lots of output for you to consider. Case study: predicting body fat Body fat is an important health measure, but
More informationProgramming MLE models in STATA 1
Programming MLE models in STATA 1 Andreas Beger 30 November 2008 1 Overview You have learned about a lot of different MLE models so far, and most of them are available as predefined commands in STATA.
More informationA quick introduction to STATA
A quick introduction to STATA Data files and other resources for the course book Introduction to Econometrics by Stock and Watson is available on: http://wps.aw.com/aw_stock_ie_3/178/45691/11696965.cw/index.html
More informationHealth Disparities (HD): It s just about comparing two groups
A review of modern methods of estimating the size of health disparities May 24, 2017 Emil Coman 1 Helen Wu 2 1 UConn Health Disparities Institute, 2 UConn Health Modern Modeling conference, May 2224,
More informationPackage DBfit. October 25, 2018
Type Package Package DBfit October 25, 2018 Title A Double Bootstrap Method for Analyzing Linear Models with Autoregressive Errors Version 1.0 Date 20181016 Author Joseph W. McKean and Shaofeng Zhang
More informationWeek 11: Interpretation plus
Week 11: Interpretation plus Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline A bit of a patchwork
More informationWeek 5: Multiple Linear Regression II
Week 5: Multiple Linear Regression II Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Adjusted R
More informationTHE LINEAR PROBABILITY MODEL: USING LEAST SQUARES TO ESTIMATE A REGRESSION EQUATION WITH A DICHOTOMOUS DEPENDENT VARIABLE
PLS 802 Spring 2018 Professor Jacoby THE LINEAR PROBABILITY MODEL: USING LEAST SQUARES TO ESTIMATE A REGRESSION EQUATION WITH A DICHOTOMOUS DEPENDENT VARIABLE This handout shows the log of a Stata session
More informationTwoStage Least Squares
Chapter 316 TwoStage Least Squares Introduction This procedure calculates the twostage least squares (2SLS) estimate. This method is used fit models that include instrumental variables. 2SLS includes
More informationPackage mvprobit. November 2, 2015
Version 0.18 Date 20151102 Title Multivariate Probit Models Package mvprobit November 2, 2015 Author Arne Henningsen Maintainer Arne Henningsen
More informationSource:
Time Series Source: http://www.princeton.edu/~otorres/stata/ Time series data is data collected over time for a single or a group of variables. Date variable For this kind of data the first thing to do
More informationGetting started with Stata 2017: Cheatsheet
Getting started with Stata 2017: Cheatsheet 4. september 2017 1 Get started Graphical user interface (GUI). Clickable. Simple. Commands. Allows for use of dole. Easy to keep track. Command window: Write
More informationrange: [1,20] units: 1 unique values: 20 missing.: 0/20 percentiles: 10% 25% 50% 75% 90%
 log: \Term 2\Lecture_2s\regression1a.log log type: text opened on: 22 Feb 2008, 03:29:09. cmdlog using " \Term 2\Lecture_2s\regression1a.do" (cmdlog \Term 2\Lecture_2s\regression1a.do
More informationWeek 9: Modeling II. Marcelo Coca Perraillon. Health Services Research Methods I HSMP University of Colorado Anschutz Medical Campus
Week 9: Modeling II Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Taking the log Retransformation
More informationPage 1. Notes: MB allocated to data 2. Stata running in batch mode. . do 2simpowervarests.do. . capture log close. .
tm / / / / / / / / / / / / 101 Copyright 19842009 Statistics/Data Analysis StataCorp 4905 Lakeway Drive College Station, Texas 77845 USA 800STATAPC http://wwwstatacom 9796964600 stata@statacom 9796964601
More informationMODULE ONE, PART THREE: READING DATA INTO STATA, CREATING AND RECODING VARIABLES, AND ESTIMATING AND TESTING MODELS IN STATA
MODULE ONE, PART THREE: READING DATA INTO STATA, CREATING AND RECODING VARIABLES, AND ESTIMATING AND TESTING MODELS IN STATA This Part Three of Module One provides a cookbooktype demonstration of the
More informationMissing Data Analysis for the Employee Dataset
Missing Data Analysis for the Employee Dataset 67% of the observations have missing values! Modeling Setup Random Variables: Y i =(Y i1,...,y ip ) 0 =(Y i,obs, Y i,miss ) 0 R i =(R i1,...,r ip ) 0 ( 1
More informationIntroduction to Hierarchical Linear Model. HsuehSheng Wu CFDR Workshop Series January 30, 2017
Introduction to Hierarchical Linear Model HsuehSheng Wu CFDR Workshop Series January 30, 2017 1 Outline What is Hierarchical Linear Model? Why do nested data create analytic problems? Graphic presentation
More informationSerial Correlation and Heteroscedasticity in Time series Regressions. Econometric (EC3090)  Week 11 Agustín Bénétrix
Serial Correlation and Heteroscedasticity in Time series Regressions Econometric (EC3090)  Week 11 Agustín Bénétrix 1 Properties of OLS with serially correlated errors OLS still unbiased and consistent
More informationBirkbeck College Department of Economics, Mathematics and Statistics.
Birkbeck College Department of Economics, Mathematics and Statistics. Graduate Certificates and Diplomas Economics, Finance, Financial Engineering 2012 Applied Statistics and Econometrics INTRODUCTION
More informationMaximum Likelihood estimation: Stata vs. Gauss
Maximum Likelihood estimation: Stata vs. Gauss Index Motivation Objective The Maximum Likelihood Method Capabilities: Stata vs Gauss Conclusions Motivation Stata is a powerful and flexible statistical
More informationSAS/ETS 14.2 User s Guide. The SPATIALREG Procedure
SAS/ETS 14.2 User s Guide The SPATIALREG Procedure This document is an individual chapter from SAS/ETS 14.2 User s Guide. The correct bibliographic citation for this manual is as follows: SAS Institute
More informationoptimization_machine_probit_bush106.c
optimization_machine_probit_bush106.c. probit ybush black00 south hispanic00 income owner00 dwnom1n dwnom2n Iteration 0: log likelihood = 299.27289 Iteration 1: log likelihood = 154.89847 Iteration 2:
More information25 Working with categorical data and factor variables
25 Working with categorical data and factor variables Contents 25.1 Continuous, categorical, and indicator variables 25.1.1 Converting continuous variables to indicator variables 25.1.2 Converting continuous
More informationPAM 4280/ECON 3710: The Economics of Risky Health Behaviors Fall 2015 Professor John Cawley TA Christine Coyer. Stata Basics for PAM 4280/ECON 3710
PAM 4280/ECON 3710: The Economics of Risky Health Behaviors Fall 2015 Professor John Cawley TA Christine Coyer Stata Basics for PAM 4280/ECON 3710 I Introduction Stata is one of the most commonly used
More informationPackage sphet. March 16, 2018
Version 1.7 Date 20180316 Package sphet March 16, 2018 Title Estimation of Spatial Autoregressive Models with and without Heteroscedasticity Author Maintainer Depends R (>= 3.0.1) Imports nlme, spdep
More informationECONOMICS 452 TIME SERIES WITH STATA
1 ECONOMICS 452 01 Introduction TIME SERIES WITH STATA This manual is intended for the first half of the Economics 452 course and introduces some of the time series capabilities in Stata 8 I will be writing
More informationWorking Paper. Spatial Differencing: Estimation and Inference. Highlights. Federico Belotti, Edoardo Di Porto & Gianluca Santoni
No 201710 June Working Paper Spatial Differencing: Estimation and Inference Federico Belotti, Edoardo Di Porto & Gianluca Santoni Highlights Spatial differencing is a spatial data transformation pioneered
More informationEmpirical Asset Pricing
Department of Mathematics and Statistics, University of Vaasa, Finland Texas A&M University, May June, 2013 As of May 17, 2013 Part I Stata Introduction 1 Stata Introduction Interface Commands Command
More informationAn Introductory Guide to Stata
An Introductory Guide to Stata Scott L. Minkoff Assistant Professor Department of Political Science Barnard College sminkoff@barnard.edu Updated: July 9, 2012 1 TABLE OF CONTENTS ABOUT THIS GUIDE... 4
More informationFurther processing of estimation results: Basic programming with matrices
The Stata Journal (2005) 5, Number 1, pp. 83 91 Further processing of estimation results: Basic programming with matrices Ian Watson ACIRRT, University of Sydney i.watson@econ.usyd.edu.au Abstract. Rather
More informationLinear Model Selection and Regularization. especially usefull in high dimensions p>>100.
Linear Model Selection and Regularization especially usefull in high dimensions p>>100. 1 Why Linear Model Regularization? Linear models are simple, BUT consider p>>n, we have more features than data records
More informationRobust Linear Regression (Passing Bablok MedianSlope)
Chapter 314 Robust Linear Regression (Passing Bablok MedianSlope) Introduction This procedure performs robust linear regression estimation using the PassingBablok (1988) medianslope algorithm. Their
More informationConditional and Unconditional Regression with No Measurement Error
Conditional and with No Measurement Error /* reg2ways.sas */ %include 'readsenic.sas'; title2 ''; proc reg; title3 'Conditional Regression'; model infrisk = stay census; proc calis cov; /* Analyze the
More informationIntroduction to Bayesian Analysis in Stata
tools Introduction to Bayesian Analysis in Gustavo Sánchez Corp LLC September 15, 2017 Porto, Portugal tools 1 Bayesian analysis: 2 Basic Concepts The tools 14: The command 15: The bayes prefix Postestimation
More informationLabor Economics with STATA. Estimating the Human Capital Model Using Artificial Data
Labor Economics with STATA Liyousew G. Borga December 2, 2015 Estimating the Human Capital Model Using Artificial Data Liyou Borga Labor Economics with STATA December 2, 2015 84 / 105 Outline 1 The Human
More informationExample 1 of panel data : Data for 6 airlines (groups) over 15 years (time periods) Example 1
Panel data set Consists of n entities or subjects (e.g., firms and states), each of which includes T observations measured at 1 through t time period. total number of observations : nt Panel data have
More informationDescriptives. Graph. [DataSet1] C:\Documents and Settings\BuroK\Desktop\Prestige.sav
GET FILE='C:\Documents and Settings\BuroK\Desktop\Prestige.sav'. DESCRIPTIVES VARIABLES=prestige education income women /STATISTICS=MEAN STDDEV MIN MAX. Descriptives Input Missing Value Handling Resources
More informationMissing Data Part II: Multiple Imputation Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 24, 2015
Missing Data Part II: Multiple Imputation Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 24, 2015 Warning: I teach about Multiple Imputation with some trepidation.
More informationIntroduction to Stata: An Inclass Tutorial
Introduction to Stata: An I. The Basics  Stata is a commanddriven statistical software program. In other words, you type in a command, and Stata executes it. You can use the dropdown menus to avoid
More informationReview of Stata II AERC Training Workshop Nairobi, May 2002
Review of Stata II AERC Training Workshop Nairobi, 2024 May 2002 This note provides more information on the basics of Stata that should help you with the exercises in the remaining sessions of the workshop.
More informationGeneralized Least Squares (GLS) and Estimated Generalized Least Squares (EGLS)
Generalized Least Squares (GLS) and Estimated Generalized Least Squares (EGLS) Linear Model in matrix notation for the population Y = Xβ + Var ( ) = In GLS, the error covariance matrix is known In EGLS
More informationA First Tutorial in Stata
A First Tutorial in Stata Stan Hurn Queensland University of Technology National Centre for Econometric Research www.ncer.edu.au Stan Hurn (NCER) Stata Tutorial 1 / 66 Table of contents 1 Preliminaries
More informationPerforming Cluster Bootstrapped Regressions in R
Performing Cluster Bootstrapped Regressions in R Francis L. Huang / October 6, 2016 Supplementary material for: Using Cluster Bootstrapping to Analyze Nested Data with a Few Clusters in Educational and
More informationCompute MI estimates of coefficients using previously saved estimation results
Title mi estimate using Estimation using previously saved estimation results Syntax Compute MI estimates of coefficients using previously saved estimation results mi estimate using miestfile [, options
More informationStata versions 12 & 13 Week 4 Practice Problems
Stata versions 12 & 13 Week 4 Practice Problems SOLUTIONS 1 Practice Screen Capture a Create a word document Name it using the convention lastname_lab1docx (eg bigelow_lab1docx) b Using your browser, go
More informationLinear regression Number of obs = 6,866 F(16, 326) = Prob > F = Rsquared = Root MSE =
 /*** To demonstrate use of 2SLS ***/ * Case: In the early 1990's Tanzania implemented a FP program to reduce fertility, which was among the highest in the world * The FP program had two main components:
More informationSTATA TUTORIAL B. Rabin with modifications by T. Marsh
STATA TUTORIAL B. Rabin with modifications by T. Marsh 5.2.05 (content also from http://www.ats.ucla.edu/stat/spss/faq/compare_packages.htm) Why choose Stata? Stata has a wide array of predefined statistical
More informationRegression. Page 1. Notes. Output Created Comments Data. 26Mar :31:18. Input. C:\Documents and Settings\BuroK\Desktop\Data Sets\Prestige.
GET FILE='C:\Documents and Settings\BuroK\Desktop\DataSets\Prestige.sav'. GET FILE='E:\MacEwan\Teaching\Stat252\Data\SPSS_data\MENTALID.sav'. DATASET ACTIVATE DataSet1. DATASET CLOSE DataSet2. GET FILE='E:\MacEwan\Teaching\Stat252\Data\SPSS_data\survey_part.sav'.
More informationMissing Data Part II: Multiple Imputation & Maximum Likelihood
Missing Data Part II: Multiple Imputation & Maximum Likelihood Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 12, 2017 Warning: I teach about Multiple
More informationEstimation and Inference by the Method of Projection Minimum Distance. Òscar Jordà Sharon Kozicki U.C. Davis Bank of Canada
Estimation and Inference by the Method of Projection Minimum Distance Òscar Jordà Sharon Kozicki U.C. Davis Bank of Canada The Paper in a Nutshell: An Efficient Limited Information Method Step 1: estimate
More informationLecture 13: Model selection and regularization
Lecture 13: Model selection and regularization Reading: Sections 6.16.2.1 STATS 202: Data mining and analysis October 23, 2017 1 / 17 What do we know so far In linear regression, adding predictors always
More informationIntroduction to Mixed Models: Multivariate Regression
Introduction to Mixed Models: Multivariate Regression EPSY 905: Multivariate Analysis Spring 2016 Lecture #9 March 30, 2016 EPSY 905: Multivariate Regression via Path Analysis Today s Lecture Multivariate
More informationset mem 10m we can also decide to have the more separation line on the screen or not when the software displays results: set more on set more off
Setting up Stata We are going to allocate 10 megabites to the dataset. You do not want to allocate to much memory to the dataset because the more memory you allocate to the dataset, the less memory will
More informationRepeated Measures Part 4: Blood Flow data
Repeated Measures Part 4: Blood Flow data /* bloodflow.sas */ options linesize=79 pagesize=100 noovp formdlim='_'; title 'Two withinsubjecs factors: Blood flow data (NWK p. 1181)'; proc format; value
More informationCrossvalidation and the Bootstrap
Crossvalidation and the Bootstrap In the section we discuss two resampling methods: crossvalidation and the bootstrap. 1/44 Crossvalidation and the Bootstrap In the section we discuss two resampling
More informationCHAPTER 1 INTRODUCTION
Introduction CHAPTER 1 INTRODUCTION Mplus is a statistical modeling program that provides researchers with a flexible tool to analyze their data. Mplus offers researchers a wide choice of models, estimators,
More informationThe linear mixed model: modeling hierarchical and longitudinal data
The linear mixed model: modeling hierarchical and longitudinal data Analysis of Experimental Data AED The linear mixed model: modeling hierarchical and longitudinal data 1 of 44 Contents 1 Modeling Hierarchical
More informationTimeVarying Volatility and ARCH Models
TimeVarying Volatility and ARCH Models ARCH MODEL AND TIMEVARYING VOLATILITY In this lesson we'll use Gretl to estimate several models in which the variance of the dependent variable changes over time.
More informationTitle. Syntax. svy: tabulate oneway Oneway tables for survey data. Basic syntax. svy: tabulate varname. Full syntax
Title svy: tabulate oneway Oneway tables for survey data Syntax Basic syntax svy: tabulate varname Full syntax svy [ vcetype ] [, svy options ] : tabulate varname [ if ] [ in ] [, tabulate options display
More information**************************************************************************************************************** * Single Wave Analyses.
ASDA2 ANALYSIS EXAMPLE REPLICATION SPSS C11 * Syntax for Analysis Example Replication C11 * Use data sets previously prepared in SAS, refer to SAS Analysis Example Replication for C11 for details. ****************************************************************************************************************
More informationMS&E 226: Small Data
MS&E 226: Small Data Lecture 13: The bootstrap (v3) Ramesh Johari ramesh.johari@stanford.edu 1 / 30 Resampling 2 / 30 Sampling distribution of a statistic For this lecture: There is a population model
More informationIntroduction to Programming in Stata
Introduction to in Stata Laron K. University of Missouri Goals Goals Replicability! Goals Replicability! Simplicity/efficiency Goals Replicability! Simplicity/efficiency Take a peek under the hood! Data
More informationModern Spatial Econometrics in Practice: AGuidetoGeoDa, GeoDaSpace and PySAL
Modern Spatial Econometrics in Practice: AGuidetoGeoDa, GeoDaSpace and PySAL Luc Anselin Sergio J. Rey November 10, 2014 DRAFT Do Not Quote Copyright 2014 Luc Anselin and Sergio J. Rey, All Rights Reserved
More informationxtdpdml: Linear Dynamic PanelData Estimation using Maximum Likelihood and Structural Equation Modeling
Abstract xtdpdml: Linear Dynamic PanelData Estimation using Maximum Likelihood and Structural Equation Modeling Richard Williams, University of Notre Dame (rwilliam@nd.edu) Paul D. Allison, University
More informationPackage gppm. July 5, 2018
Version 0.2.0 Title Gaussian Process Panel Modeling Package gppm July 5, 2018 Provides an implementation of Gaussian process panel modeling (GPPM). GPPM is described in Karch (2016; )
More informationNonparametric MixedEffects Models for Longitudinal Data
Nonparametric MixedEffects Models for Longitudinal Data Zhang JinTing Dept of Stat & Appl Prob National University of Sinagpore University of Seoul, South Korea, 7 p.1/26 OUTLINE The Motivating Data
More informationCluster Randomization Create Cluster Means Dataset
Chapter 270 Cluster Randomization Create Cluster Means Dataset Introduction A cluster randomization trial occurs when whole groups or clusters of individuals are treated together. Examples of such clusters
More information1. (20%) a) Calculate and plot the impulse response functions for the model. u1t 1 u 2t 1
ECONOMETRICS II, Fall 2016 Bent E. Sørensen Final Exam. December 2. 7 questions. 1. 20% a Calculate and plot the impulse response functions for the model x1t x 2t = u1t u 2t 1.5.3.2 u1t 1 u 2t 1 1 2 0.5
More informationA Quick Guide to Stata 8 for Windows
Université de Lausanne, HEC Applied Econometrics II Kurt Schmidheiny October 22, 2003 A Quick Guide to Stata 8 for Windows 2 1 Introduction A Quick Guide to Stata 8 for Windows This guide introduces the
More informationDetecting and Circumventing Collinearity or IllConditioning Problems
Chapter 8 Detecting and Circumventing Collinearity or IllConditioning Problems Section 8.1 Introduction Multicollinearity/Collinearity/IllConditioning The terms multicollinearity, collinearity, and illconditioning
More information3 Nonlinear Regression
CSC 4 / CSC D / CSC C 3 Sometimes linear models are not sufficient to capture the realworld phenomena, and thus nonlinear models are necessary. In regression, all such models will have the same basic
More informationSimulating Multivariate Normal Data
Simulating Multivariate Normal Data You have a population correlation matrix and wish to simulate a set of data randomly sampled from a population with that structure. I shall present here code and examples
More informationNCSS Statistical Software. Robust Regression
Chapter 308 Introduction Multiple regression analysis is documented in Chapter 305 Multiple Regression, so that information will not be repeated here. Refer to that chapter for in depth coverage of multiple
More informationECON Introductory Econometrics Seminar 4
ECON4150  Introductory Econometrics Seminar 4 Stock and Watson EE8.2 April 28, 2015 Stock and Watson EE8.2 ECON4150  Introductory Econometrics Seminar 4 April 28, 2015 1 / 20 Current Population Survey
More information1 Training/Validation/Testing
CPSC 340 Final (Fall 2015) Name: Student Number: Please enter your information above, turn off cellphones, space yourselves out throughout the room, and wait until the official start of the exam to begin.
More information