Session 2: Fixed and Random Effects Estimation
|
|
- Grant George
- 5 years ago
- Views:
Transcription
1 Session 2: Fixed and Random Effects Estimation Principal, Developing Trade Consultants Ltd. ARTNeT/RIS Capacity Building Workshop on the Use of Gravity Modeling Thursday, November 10,
2 Outline Fixed Effects Estimation 1 Fixed Effects Estimation
3 Using Dummies to Capture Multilateral Resistance The AvW Gravity Model Aggregate Data ) ] X ij = ln(y i ) + ln (E j ln(y ) + (1 σ) [lnτ ij + lnπ i + lnp j The term ln(y ) is common across all exporters and importers; thus, it can be captured through a constant in the regression model. ) The term ln (E j + lnp j is constant across all importers for a given exporter; thus, it can be captured through an importer dummy variable (fixed effect). The term ln ( ) Y i + lnπi is constant across all exporters for a given importer; thus, it can be captured through an 3 exporter dummy variable (fixed effect).
4 Using Dummies to Capture Multilateral Resistance Advantages of Dummy Variables An aggregate gravity model with a constant, and dummy variables for each exporter and each importer will therefore take proper account of multilateral resistance, and should produce unbiased estimates. Very simple to estimate, but takes account of some sophisticated effects. NxN observations, but N+N dummies; degrees of freedom are usually sufficient. 4
5 Using Dummies to Capture Multilateral Resistance Disadvantages of Dummy Variables Dimensionality quickly becomes an issue with sectoral models: N+N can be in the hundreds, or thousands. Because of collinearity constraints, we cannot identify separate effects due to factors that vary in the exporter or importer dimensions. Only factors varying bilaterally can be identified. 5
6 Using Dummies to Capture Multilateral Resistance Estimation using panel data techniques can make it possible to reduce the dimensionality problem somewhat, but it remains an issue in large/detailed datasets. To deal with the collinearity problem, variables can sometimes be transformed so as to vary by country pair: Sum of exporter and importer values. Average of exporter and importer values, etc. Try to go back to theory to see if this is an appropriate thing to do in a given circumstance. 6
7 Dummies in Sectoral Gravity Models As suggested previously, things get even more complicated with sectoral gravity models. Dummy variables need to be specified in the importer-sector, exporter-sector, and sector dimensions, because: ( ln X k ij ) ( = ln Yi k (1 σ k ) ) ( + ln E k j ) ( ln [ lnτ k ij + lnπ k i + lnp k j Y k) + ] In addition, trade costs need to be interacted with sector dummies in order to take account of varying elasticities of substitution across sectors. 7
8 Dummies in Sectoral Gravity Models Depending on the level of sectoral disaggregation used, this approach can result in huge numbers of parameters. Models can take a long time, and a big computer, to estimate. It is usually much easier to estimate separate models for each sector. 8
9 Dummies and Fixed Effects in Stata Option 1: enter the dummies manually and use OLS: tab importer, gen(imp_dum_*) reg ln_trade... imp_dum_*, robust Option 2: use a panel estimator (OLS + a trick) to account for one set of dummies: iis importers xtreg ln_trade..., robust fe 9
10 Random Effects: An Alternative to Dummies Fixed effects (dummy variables) are one way of accounting for unobserved heterogeneity across countries, in this case due to multilateral resistance. A common alternative in the econometrics literature is random effects: Fixed effects allow for free or structureless variation; Random effects require that unobserved heterogeneity obey some probability constraints, i.e. follow a particular distribution. 10
11 Random Effects: An Alternative to Dummies Advantages of Random Effects The dimensionality constraint is greatly relaxed: even very large models can be estimated relatively quickly. Allows inclusion of variables (like GDP) that vary in the same dimension as the random effects. Simple to estimate single-dimensional RE models in Stata: iis importers xtreg ln_trade ln_gdp_imp [etc.], re robust. 11
12 Random Effects: An Alternative to Dummies Disadvantages of Random Effects Random effects rely on a strong assumption: multilateral resistance is normally distributed across countries, with a given standard deviation. The AvW model tells us that multilateral resistance is important, but it doesn t tell us anything about its distribution. In practice, compare RE and FE estimates. 12
13 Random Effects: An Alternative to Dummies Distance coefficient from the original gravity model without fixed effects = ***. Distance coefficient from the fixed effects gravity model = ***. Distance coefficient from the random effects (importer) gravity model = *** Not statistically different from OLS, but significantly different from fixed effects. CAUTION! 13
14 Testing for Fixed vs. Random Effects Fixed effects estimates are always consistent, even if the true model is random effects. Random effects are only consistent if the true model is random effects, in which case they are also efficient. Intuitively, random effects are an acceptable simplification when the difference between the two sets of coefficients is small. 14
15 Testing for Fixed vs. Random Effects The Hausman test formalizes this intuition: it tests for a statistically significant difference between the two sets of coefficients. If the null hypothesis is rejected, random effects are inappropriate (inconsistent). If the null hypothesis is not rejected, random effects are an acceptable simplification, but either estimator can be used. 15
16 Testing for Fixed vs. Random Effects Beware: the Hausman test has poor properties in practice! Stata s implementation has to use non-robust covariance matrices. Sometimes, the test statistic cannot even be calculated due to a breakdown in its assumptions. If you are serious about random effects, try the Hausman test, but take it with a grain of salt. 16
17 Testing for Fixed vs. Random Effects In Stata, adopt the following procedure to test for fixed versus random effects: Estimate the fixed effects model without the robust option, then issue the command estimates store fixed. Estimate the random effects model without the robust option, then issue the command estimates store random. Issue the command hausman fixed random. 17
18 The Baier-Bergstrand Model It would be nice to have a methodology that enables us to account for the MR terms, but also include data that only varies by exporter or importer. Random effects can do the job, but only at the price of a strong assumption. Fixed effects requires us to play with the data to make variables that vary (artificially) by country pair. Baier and Bergstrand (2009 JIE) give us an alternative. 18
19 The Baier-Bergstrand Model Recall that the MR terms are complex, non-linear functions of trade costs in the AvW model: } ( ) 1 σk Π k 1 σk i = C Ej k j=1 ( P k j { τ k ij P k j ) 1 σk = C j=1 { τ k ij Π k i Y k } 1 σk Y k i Y k The Baier-Bergstrand model uses a first order Taylor series expansion to approximate the MR terms. 19
20 The Baier-Bergstrand Model The Taylor series approach gives a gravity model that looks like this: ln ( ) X ij = ln(yi ) + ln ( ) E j ln(y )+ [ (1 σ) lnτ ij i θ i lnτ ij j θ j lnτ ij + i ] θ i θ j lnτ ij j The two θ terms are GDP weights, i.e. θ i = Y i Y. 20
21 The Baier-Bergstrand Model To estimate the Baier-Bergstrand model, simply: Calculate the GDP weights. For each trade cost variable lnτ ij calculate lnτij = lnτ ij i θ i lnτ ij j θ j lnτ ij + i j θ i θ j lnτ ij. Remember to take logs before taking the averages! Estimate the gravity model using OLS, as usual, but including the transformed variables along with GDP, i.e.: ln ( X ij ) = ln(yi ) + ln ( E j ) ln(y ) + lnτ ij 21
22 The most common approach to the aggregate gravity model is to use fixed effects (dummy variables) by importer and by exporter. Random effects can also be used, but they rely on a stronger and possibly invalid assumption. For sectoral gravity models, the simplest approach is to estimate separately, sector by sector. The Baier-Bergstrand model provides a simple alternative that makes it possible to account for multilateral resistance at the same time as including exporter- and importer-specific variables. 22
BASIC STEPS TO DO A SIMPLE PANEL DATA ANALYSIS IN STATA
BASIC STEPS TO DO A SIMPLE PANEL DATA ANALYSIS IN STATA By: Mahyudin Ahmad @ 2017 Basic steps to do a panel data analysis in STATA Page 1 Outline Outline: 1. Setting up commands 2. Importing data to Stata
More informationWeek 4: Simple Linear Regression II
Week 4: Simple Linear Regression II Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Algebraic properties
More informationBig Data Methods. Chapter 5: Machine learning. Big Data Methods, Chapter 5, Slide 1
Big Data Methods Chapter 5: Machine learning Big Data Methods, Chapter 5, Slide 1 5.1 Introduction to machine learning What is machine learning? Concerned with the study and development of algorithms that
More informationset mem 10m we can also decide to have the more separation line on the screen or not when the software displays results: set more on set more off
Setting up Stata We are going to allocate 10 megabites to the dataset. You do not want to allocate to much memory to the dataset because the more memory you allocate to the dataset, the less memory will
More informationINTRODUCTION TO PANEL DATA ANALYSIS
INTRODUCTION TO PANEL DATA ANALYSIS USING EVIEWS FARIDAH NAJUNA MISMAN, PhD FINANCE DEPARTMENT FACULTY OF BUSINESS & MANAGEMENT UiTM JOHOR PANEL DATA WORKSHOP-23&24 MAY 2017 1 OUTLINE 1. Introduction 2.
More informationSerial Correlation and Heteroscedasticity in Time series Regressions. Econometric (EC3090) - Week 11 Agustín Bénétrix
Serial Correlation and Heteroscedasticity in Time series Regressions Econometric (EC3090) - Week 11 Agustín Bénétrix 1 Properties of OLS with serially correlated errors OLS still unbiased and consistent
More informationTwo-Stage Least Squares
Chapter 316 Two-Stage Least Squares Introduction This procedure calculates the two-stage least squares (2SLS) estimate. This method is used fit models that include instrumental variables. 2SLS includes
More informationUNBIASED ESTIMATION OF DESTINATION CHOICE MODELS WITH ATTRACTION CONSTRAINTS
UNBIASED ESTIMATION OF DESTINATION CHOICE MODELS WITH ATTRACTION CONSTRAINTS Vince Bernardin, PhD Steven Trevino John Gliebe, PhD APRIL 14, 2014 WHAT S WRONG WITH ESTIMATING DOUBLY CONSTRAINED DESTINATION
More informationOLS Assumptions and Goodness of Fit
OLS Assumptions and Goodness of Fit A little warm-up Assume I am a poor free-throw shooter. To win a contest I can choose to attempt one of the two following challenges: A. Make three out of four free
More informationBuilding Better Parametric Cost Models
Building Better Parametric Cost Models Based on the PMI PMBOK Guide Fourth Edition 37 IPDI has been reviewed and approved as a provider of project management training by the Project Management Institute
More informationAdditive hedonic regression models for the Austrian housing market ERES Conference, Edinburgh, June
for the Austrian housing market, June 14 2012 Ao. Univ. Prof. Dr. Fachbereich Stadt- und Regionalforschung Technische Universität Wien Dr. Strategic Risk Management Bank Austria UniCredit, Wien Inhalt
More informationPANEL DATA REGRESSION MODELS IN EVIEWS: Pooled OLS, Fixed or Random effect model?
PANEL DATA REGRESSION MODELS IN EVIEWS: Pooled OLS, Fixed or Random effect model? ADESETE, Ahmed Adefemi 12/6/2017 2 PANEL DATA REGRESSION MODELS IN EVIEWS: Pooled OLS, Fixed or Random effect model? Panel
More informationSection 2.3: Simple Linear Regression: Predictions and Inference
Section 2.3: Simple Linear Regression: Predictions and Inference Jared S. Murray The University of Texas at Austin McCombs School of Business Suggested reading: OpenIntro Statistics, Chapter 7.4 1 Simple
More informationINTRODUCTION to. Program in Statistics and Methodology (PRISM) Daniel Blake & Benjamin Jones January 15, 2010
INTRODUCTION to Program in Statistics and Methodology (PRISM) Daniel Blake & Benjamin Jones January 15, 2010 While we are waiting Everyone who wishes to work along with the presentation should log onto
More informationA First Tutorial in Stata
A First Tutorial in Stata Stan Hurn Queensland University of Technology National Centre for Econometric Research www.ncer.edu.au Stan Hurn (NCER) Stata Tutorial 1 / 66 Table of contents 1 Preliminaries
More informationSpatial Patterns Point Pattern Analysis Geographic Patterns in Areal Data
Spatial Patterns We will examine methods that are used to analyze patterns in two sorts of spatial data: Point Pattern Analysis - These methods concern themselves with the location information associated
More informationSOCY7706: Longitudinal Data Analysis Instructor: Natasha Sarkisian. Panel Data Analysis: Fixed Effects Models
SOCY776: Longitudinal Data Analysis Instructor: Natasha Sarkisian Panel Data Analysis: Fixed Effects Models Fixed effects models are similar to the first difference model we considered for two wave data
More informationSection 3.2: Multiple Linear Regression II. Jared S. Murray The University of Texas at Austin McCombs School of Business
Section 3.2: Multiple Linear Regression II Jared S. Murray The University of Texas at Austin McCombs School of Business 1 Multiple Linear Regression: Inference and Understanding We can answer new questions
More informationStatistical Analysis of Metabolomics Data. Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte
Statistical Analysis of Metabolomics Data Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte Outline Introduction Data pre-treatment 1. Normalization 2. Centering,
More informationCan double click the data file and it should open STATA
ECO 445: International Trade Professor Jack Rossbach Instructions on Doing Gravity Regressions in STATA Important: If you don t know how to use a command, use the help command in R. For example, type help
More informationZ-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown
Z-TEST / Z-STATISTIC: used to test hypotheses about µ when the population standard deviation is known and population distribution is normal or sample size is large T-TEST / T-STATISTIC: used to test hypotheses
More informationGRETL FOR TODDLERS!! CONTENTS. 1. Access to the econometric software A new data set: An existent data set: 3
GRETL FOR TODDLERS!! JAVIER FERNÁNDEZ-MACHO CONTENTS 1. Access to the econometric software 3 2. Loading and saving data: the File menu 3 2.1. A new data set: 3 2.2. An existent data set: 3 2.3. Importing
More informationSection 3.4: Diagnostics and Transformations. Jared S. Murray The University of Texas at Austin McCombs School of Business
Section 3.4: Diagnostics and Transformations Jared S. Murray The University of Texas at Austin McCombs School of Business 1 Regression Model Assumptions Y i = β 0 + β 1 X i + ɛ Recall the key assumptions
More informationWeek 5: Multiple Linear Regression II
Week 5: Multiple Linear Regression II Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Adjusted R
More informationWeek 4: Simple Linear Regression III
Week 4: Simple Linear Regression III Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Goodness of
More informationIntroduction to SAS. I. Understanding the basics In this section, we introduce a few basic but very helpful commands.
Center for Teaching, Research and Learning Research Support Group American University, Washington, D.C. Hurst Hall 203 rsg@american.edu (202) 885-3862 Introduction to SAS Workshop Objective This workshop
More informationVIDAEXPERT: DATA ANALYSIS Here is the Statistics button.
Here is the Statistics button. After creating dataset you can analyze it in different ways. First, you can calculate statistics. Open Statistics dialog, Common tabsheet, click Calculate. Min, Max: minimal
More informationIntroduction to STATA
Center for Teaching, Research and Learning Research Support Group American University, Washington, D.C. Hurst Hall 203 rsg@american.edu (202) 885-3862 Introduction to STATA WORKSHOP OBJECTIVE: This workshop
More informationWeek 10: Heteroskedasticity II
Week 10: Heteroskedasticity II Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Dealing with heteroskedasticy
More informationSOS3003 Applied data analysis for social science Lecture note Erling Berge Department of sociology and political science NTNU.
SOS3003 Applied data analysis for social science Lecture note 04-2009 Erling Berge Department of sociology and political science NTNU Erling Berge 2009 1 Missing data Literature Allison, Paul D 2002 Missing
More informationFrequently Asked Questions Updated 2006 (TRIM version 3.51) PREPARING DATA & RUNNING TRIM
Frequently Asked Questions Updated 2006 (TRIM version 3.51) PREPARING DATA & RUNNING TRIM * Which directories are used for input files and output files? See menu-item "Options" and page 22 in the manual.
More informationChapter 7: Dual Modeling in the Presence of Constant Variance
Chapter 7: Dual Modeling in the Presence of Constant Variance 7.A Introduction An underlying premise of regression analysis is that a given response variable changes systematically and smoothly due to
More informationLecture 2: Analyzing Algorithms: The 2-d Maxima Problem
Lecture 2: Analyzing Algorithms: The 2-d Maxima Problem (Thursday, Jan 29, 1998) Read: Chapter 1 in CLR. Analyzing Algorithms: In order to design good algorithms, we must first agree the criteria for measuring
More informationLecture 3: Linear Classification
Lecture 3: Linear Classification Roger Grosse 1 Introduction Last week, we saw an example of a learning task called regression. There, the goal was to predict a scalar-valued target from a set of features.
More informationRobust Linear Regression (Passing- Bablok Median-Slope)
Chapter 314 Robust Linear Regression (Passing- Bablok Median-Slope) Introduction This procedure performs robust linear regression estimation using the Passing-Bablok (1988) median-slope algorithm. Their
More informationHeteroskedasticity and Homoskedasticity, and Homoskedasticity-Only Standard Errors
Heteroskedasticity and Homoskedasticity, and Homoskedasticity-Only Standard Errors (Section 5.4) What? Consequences of homoskedasticity Implication for computing standard errors What do these two terms
More informationCSE 573: Artificial Intelligence Autumn 2010
CSE 573: Artificial Intelligence Autumn 2010 Lecture 16: Machine Learning Topics 12/7/2010 Luke Zettlemoyer Most slides over the course adapted from Dan Klein. 1 Announcements Syllabus revised Machine
More informationMean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242
Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242 Creation & Description of a Data Set * 4 Levels of Measurement * Nominal, ordinal, interval, ratio * Variable Types
More informationMissing Data Analysis for the Employee Dataset
Missing Data Analysis for the Employee Dataset 67% of the observations have missing values! Modeling Setup For our analysis goals we would like to do: Y X N (X, 2 I) and then interpret the coefficients
More informationModel selection and validation 1: Cross-validation
Model selection and validation 1: Cross-validation Ryan Tibshirani Data Mining: 36-462/36-662 March 26 2013 Optional reading: ISL 2.2, 5.1, ESL 7.4, 7.10 1 Reminder: modern regression techniques Over the
More informationExploring Econometric Model Selection Using Sensitivity Analysis
Exploring Econometric Model Selection Using Sensitivity Analysis William Becker Paolo Paruolo Andrea Saltelli Nice, 2 nd July 2013 Outline What is the problem we are addressing? Past approaches Hoover
More informationCluster Analysis. Prof. Thomas B. Fomby Department of Economics Southern Methodist University Dallas, TX April 2008 April 2010
Cluster Analysis Prof. Thomas B. Fomby Department of Economics Southern Methodist University Dallas, TX 7575 April 008 April 010 Cluster Analysis, sometimes called data segmentation or customer segmentation,
More informationIntroduction to Mixed Models: Multivariate Regression
Introduction to Mixed Models: Multivariate Regression EPSY 905: Multivariate Analysis Spring 2016 Lecture #9 March 30, 2016 EPSY 905: Multivariate Regression via Path Analysis Today s Lecture Multivariate
More informationTransitivity and Triads
1 / 32 Tom A.B. Snijders University of Oxford May 14, 2012 2 / 32 Outline 1 Local Structure Transitivity 2 3 / 32 Local Structure in Social Networks From the standpoint of structural individualism, one
More informationMigration and the Labour Market: Data and Intro to STATA
Migration and the Labour Market: Data and Intro to STATA Prof. Dr. Otto-Friedrich-University of Bamberg, Meeting May 27 and June 9, 2010 Contents of today s meeting 1 Repetition of last meeting Repetition
More informationApplied Statistics and Econometrics Lecture 6
Applied Statistics and Econometrics Lecture 6 Giuseppe Ragusa Luiss University gragusa@luiss.it http://gragusa.org/ March 6, 2017 Luiss University Empirical application. Data Italian Labour Force Survey,
More informationRobust Regression. Robust Data Mining Techniques By Boonyakorn Jantaranuson
Robust Regression Robust Data Mining Techniques By Boonyakorn Jantaranuson Outline Introduction OLS and important terminology Least Median of Squares (LMedS) M-estimator Penalized least squares What is
More informationMultiple Regression White paper
+44 (0) 333 666 7366 Multiple Regression White paper A tool to determine the impact in analysing the effectiveness of advertising spend. Multiple Regression In order to establish if the advertising mechanisms
More information7. Decision or classification trees
7. Decision or classification trees Next we are going to consider a rather different approach from those presented so far to machine learning that use one of the most common and important data structure,
More informationChapter 19 Estimating the dose-response function
Chapter 19 Estimating the dose-response function Dose-response Students and former students often tell me that they found evidence for dose-response. I know what they mean because that's the phrase I have
More informationResources for statistical assistance. Quantitative covariates and regression analysis. Methods for predicting continuous outcomes.
Resources for statistical assistance Quantitative covariates and regression analysis Carolyn Taylor Applied Statistics and Data Science Group (ASDa) Department of Statistics, UBC January 24, 2017 Department
More informationPractical OmicsFusion
Practical OmicsFusion Introduction In this practical, we will analyse data, from an experiment which aim was to identify the most important metabolites that are related to potato flesh colour, from an
More informationFeature Detectors and Descriptors: Corners, Lines, etc.
Feature Detectors and Descriptors: Corners, Lines, etc. Edges vs. Corners Edges = maxima in intensity gradient Edges vs. Corners Corners = lots of variation in direction of gradient in a small neighborhood
More informationEconometric Tools 1: Non-Parametric Methods
University of California, Santa Cruz Department of Economics ECON 294A (Fall 2014) - Stata Lab Instructor: Manuel Barron 1 Econometric Tools 1: Non-Parametric Methods 1 Introduction This lecture introduces
More informationStatistical Analysis of List Experiments
Statistical Analysis of List Experiments Kosuke Imai Princeton University Joint work with Graeme Blair October 29, 2010 Blair and Imai (Princeton) List Experiments NJIT (Mathematics) 1 / 26 Motivation
More informationSegmentation: Clustering, Graph Cut and EM
Segmentation: Clustering, Graph Cut and EM Ying Wu Electrical Engineering and Computer Science Northwestern University, Evanston, IL 60208 yingwu@northwestern.edu http://www.eecs.northwestern.edu/~yingwu
More informationShared Variables and Interference
Solved Shared Variables and Interference CS 536: Science of Programming, Fall 2018 A. Why Parallel programs can coordinate their work using shared variables, but it s important for threads to not interfere
More informationData Validation Option Best Practices
Data Validation Option Best Practices 1993-2016 Informatica LLC. No part of this document may be reproduced or transmitted in any form, by any means (electronic, photocopying, recording or otherwise) without
More informationDiscrete Optimization. Lecture Notes 2
Discrete Optimization. Lecture Notes 2 Disjunctive Constraints Defining variables and formulating linear constraints can be straightforward or more sophisticated, depending on the problem structure. The
More informationThis module presents the star schema, an alternative to 3NF schemas intended for analytical databases.
Topic 3.3: Star Schema Design This module presents the star schema, an alternative to 3NF schemas intended for analytical databases. Star Schema Overview The star schema is a simple database architecture
More informationMissing Data Techniques
Missing Data Techniques Paul Philippe Pare Department of Sociology, UWO Centre for Population, Aging, and Health, UWO London Criminometrics (www.crimino.biz) 1 Introduction Missing data is a common problem
More informationSpatial Outlier Detection
Spatial Outlier Detection Chang-Tien Lu Department of Computer Science Northern Virginia Center Virginia Tech Joint work with Dechang Chen, Yufeng Kou, Jiang Zhao 1 Spatial Outlier A spatial data point
More informationWeek 11: Interpretation plus
Week 11: Interpretation plus Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline A bit of a patchwork
More informationSPM Users Guide. This guide elaborates on powerful ways to combine the TreeNet and GPS engines to achieve model compression and more.
SPM Users Guide Model Compression via ISLE and RuleLearner This guide elaborates on powerful ways to combine the TreeNet and GPS engines to achieve model compression and more. Title: Model Compression
More informationRUDIMENTS OF STATA. After entering this command the data file WAGE1.DTA is loaded into memory.
J.M. Wooldridge Michigan State University RUDIMENTS OF STATA This handout covers the most often encountered Stata commands. It is not comprehensive, but the summary will allow you to do basic data management
More informationSTATA Tutorial. Introduction to Econometrics. by James H. Stock and Mark W. Watson. to Accompany
STATA Tutorial to Accompany Introduction to Econometrics by James H. Stock and Mark W. Watson STATA Tutorial to accompany Stock/Watson Introduction to Econometrics Copyright 2003 Pearson Education Inc.
More informationNotes on Simulations in SAS Studio
Notes on Simulations in SAS Studio If you are not careful about simulations in SAS Studio, you can run into problems. In particular, SAS Studio has a limited amount of memory that you can use to write
More informationMigration and the Labour Market. STATA: An Introduction into the Basics. Dr. Ehsan Vallizadeh. Bamberg, May 17, 2018
Migration and the Labour Market STATA: An Introduction into the Basics Dr. Ehsan Vallizadeh Bamberg, May 17, 2018 Contents I II III What do we have to do for the paper Brief introduction into the datasets
More informationCPSC 340: Machine Learning and Data Mining. Probabilistic Classification Fall 2017
CPSC 340: Machine Learning and Data Mining Probabilistic Classification Fall 2017 Admin Assignment 0 is due tonight: you should be almost done. 1 late day to hand it in Monday, 2 late days for Wednesday.
More informationLab 07: Multiple Linear Regression: Variable Selection
Lab 07: Multiple Linear Regression: Variable Selection OBJECTIVES 1.Use PROC REG to fit multiple regression models. 2.Learn how to find the best reduced model. 3.Variable diagnostics and influential statistics
More informationPredictive Analysis: Evaluation and Experimentation. Heejun Kim
Predictive Analysis: Evaluation and Experimentation Heejun Kim June 19, 2018 Evaluation and Experimentation Evaluation Metrics Cross-Validation Significance Tests Evaluation Predictive analysis: training
More informationData Mining. Data preprocessing. Hamid Beigy. Sharif University of Technology. Fall 1395
Data Mining Data preprocessing Hamid Beigy Sharif University of Technology Fall 1395 Hamid Beigy (Sharif University of Technology) Data Mining Fall 1395 1 / 15 Table of contents 1 Introduction 2 Data preprocessing
More informationWorking Paper No. 782
Working Paper No. 782 Feasible Estimation of Linear Models with N-fixed Effects by Fernando Rios-Avila* Levy Economics Institute of Bard College December 2013 *Acknowledgements: This paper has benefited
More informationLecture 27, April 24, Reading: See class website. Nonparametric regression and kernel smoothing. Structured sparse additive models (GroupSpAM)
School of Computer Science Probabilistic Graphical Models Structured Sparse Additive Models Junming Yin and Eric Xing Lecture 7, April 4, 013 Reading: See class website 1 Outline Nonparametric regression
More informationAnalysis of Panel Data. Third Edition. Cheng Hsiao University of Southern California CAMBRIDGE UNIVERSITY PRESS
Analysis of Panel Data Third Edition Cheng Hsiao University of Southern California CAMBRIDGE UNIVERSITY PRESS Contents Preface to the ThirdEdition Preface to the Second Edition Preface to the First Edition
More informationEstimation of Bilateral Connections in a Network: Copula vs. Maximum Entropy
Estimation of Bilateral Connections in a Network: Copula vs. Maximum Entropy Pallavi Baral and Jose Pedro Fique Department of Economics Indiana University at Bloomington 1st Annual CIRANO Workshop on Networks
More informationSELECTION OF A MULTIVARIATE CALIBRATION METHOD
SELECTION OF A MULTIVARIATE CALIBRATION METHOD 0. Aim of this document Different types of multivariate calibration methods are available. The aim of this document is to help the user select the proper
More informationStandard Errors in OLS Luke Sonnet
Standard Errors in OLS Luke Sonnet Contents Variance-Covariance of ˆβ 1 Standard Estimation (Spherical Errors) 2 Robust Estimation (Heteroskedasticity Constistent Errors) 4 Cluster Robust Estimation 7
More informationUsing Machine Learning to Optimize Storage Systems
Using Machine Learning to Optimize Storage Systems Dr. Kiran Gunnam 1 Outline 1. Overview 2. Building Flash Models using Logistic Regression. 3. Storage Object classification 4. Storage Allocation recommendation
More informationPhysics 736. Experimental Methods in Nuclear-, Particle-, and Astrophysics. - Statistical Methods -
Physics 736 Experimental Methods in Nuclear-, Particle-, and Astrophysics - Statistical Methods - Karsten Heeger heeger@wisc.edu Course Schedule and Reading course website http://neutrino.physics.wisc.edu/teaching/phys736/
More informationCurve fitting. Lab. Formulation. Truncation Error Round-off. Measurement. Good data. Not as good data. Least squares polynomials.
Formulating models We can use information from data to formulate mathematical models These models rely on assumptions about the data or data not collected Different assumptions will lead to different models.
More informationBox-Cox Transformation for Simple Linear Regression
Chapter 192 Box-Cox Transformation for Simple Linear Regression Introduction This procedure finds the appropriate Box-Cox power transformation (1964) for a dataset containing a pair of variables that are
More informationFeature Extractors. CS 188: Artificial Intelligence Fall Some (Vague) Biology. The Binary Perceptron. Binary Decision Rule.
CS 188: Artificial Intelligence Fall 2008 Lecture 24: Perceptrons II 11/24/2008 Dan Klein UC Berkeley Feature Extractors A feature extractor maps inputs to feature vectors Dear Sir. First, I must solicit
More informationFeature Selection. CE-725: Statistical Pattern Recognition Sharif University of Technology Spring Soleymani
Feature Selection CE-725: Statistical Pattern Recognition Sharif University of Technology Spring 2013 Soleymani Outline Dimensionality reduction Feature selection vs. feature extraction Filter univariate
More informationShared Variables and Interference
Illinois Institute of Technology Lecture 24 Shared Variables and Interference CS 536: Science of Programming, Spring 2018 A. Why Parallel programs can coordinate their work using shared variables, but
More information5.5 Regression Estimation
5.5 Regression Estimation Assume a SRS of n pairs (x, y ),..., (x n, y n ) is selected from a population of N pairs of (x, y) data. The goal of regression estimation is to take advantage of a linear relationship
More informationLecture 26: Missing data
Lecture 26: Missing data Reading: ESL 9.6 STATS 202: Data mining and analysis December 1, 2017 1 / 10 Missing data is everywhere Survey data: nonresponse. 2 / 10 Missing data is everywhere Survey data:
More informationCPSC 340: Machine Learning and Data Mining. Kernel Trick Fall 2017
CPSC 340: Machine Learning and Data Mining Kernel Trick Fall 2017 Admin Assignment 3: Due Friday. Midterm: Can view your exam during instructor office hours or after class this week. Digression: the other
More informationAutomated Parameterization of the Joint Space Dynamics of a Robotic Arm. Josh Petersen
Automated Parameterization of the Joint Space Dynamics of a Robotic Arm Josh Petersen Introduction The goal of my project was to use machine learning to fully automate the parameterization of the joint
More informationEvaluating Robot Systems
Evaluating Robot Systems November 6, 2008 There are two ways of constructing a software design. One way is to make it so simple that there are obviously no deficiencies. And the other way is to make it
More informationData transformation in multivariate quality control
Motto: Is it normal to have normal data? Data transformation in multivariate quality control J. Militký and M. Meloun The Technical University of Liberec Liberec, Czech Republic University of Pardubice
More informationCustom Fields in QuickBooks
Custom Fields in QuickBooks November 20, 2013 By Charlie Russell 41 Replies Every business has some sort of unique information that is important to its operation. While QuickBooks Desktop provides the
More informationData Preprocessing. Data Mining 1
Data Preprocessing Today s real-world databases are highly susceptible to noisy, missing, and inconsistent data due to their typically huge size and their likely origin from multiple, heterogenous sources.
More informationWeka ( )
Weka ( http://www.cs.waikato.ac.nz/ml/weka/ ) The phases in which classifier s design can be divided are reflected in WEKA s Explorer structure: Data pre-processing (filtering) and representation Supervised
More informationCS 229: Machine Learning Final Report Identifying Driving Behavior from Data
CS 9: Machine Learning Final Report Identifying Driving Behavior from Data Robert F. Karol Project Suggester: Danny Goodman from MetroMile December 3th 3 Problem Description For my project, I am looking
More informationMachine Learning in Biology
Università degli studi di Padova Machine Learning in Biology Luca Silvestrin (Dottorando, XXIII ciclo) Supervised learning Contents Class-conditional probability density Linear and quadratic discriminant
More informationOptimal designs for comparing curves
Optimal designs for comparing curves Holger Dette, Ruhr-Universität Bochum Maria Konstantinou, Ruhr-Universität Bochum Kirsten Schorning, Ruhr-Universität Bochum FP7 HEALTH 2013-602552 Outline 1 Motivation
More informationRCR.R User Guide. Table of Contents. Version 0.9
RCR.R User Guide Version 0.9 Table of Contents 1 Introduction... 2 1.1 Obtaining the program... 2 1.2 Installation... 3 1.3 Running the example file... 3 1.4 Updates and technical support... 3 2 Estimating
More informationIntroduction to Lexical Analysis
Introduction to Lexical Analysis Outline Informal sketch of lexical analysis Identifies tokens in input string Issues in lexical analysis Lookahead Ambiguities Specifying lexical analyzers (lexers) Regular
More informationPosition Error Reduction of Kinematic Mechanisms Using Tolerance Analysis and Cost Function
Position Error Reduction of Kinematic Mechanisms Using Tolerance Analysis and Cost Function B.Moetakef-Imani, M.Pour Department of Mechanical Engineering, Faculty of Engineering, Ferdowsi University of
More information