Kim Mannemar Sønderskov. Stata. A practical introduction. Hans Reitzels Forlag

Size: px
Start display at page:

Download "Kim Mannemar Sønderskov. Stata. A practical introduction. Hans Reitzels Forlag"

Transcription

1 Stata

2

3 Kim Mannemar Sønderskov Stata A practical introduction Hans Reitzels Forlag

4 Stata: A practical introduction Edition 1, print run 1 The authors and Hans Reitzel Publishers, 2015 Publishing editor: Martin Laurberg Translator: Tam McTurk and the author Cover: Louise Glargaard Perlmutter/Louises design Graphic design: Louise Glargaard Perlmutter/Louises design Layout and typesetting: Lone Bjarkow/Louises design Font: Minion Pro, Taz and ScalaSans Printed by Livonia Press Printed in Latvia 2015 ISBN This book has been subjected to a process of independent peer review at the manuscript stage. All rights reserved. Mechanical, photographic or any other reproduction or copying of this book is permitted only in accordance with the agreement between the Ministry of Education and Copydan. Any other usage without the written consent of the publisher is prohibited by Danish copyright law. The only exception consists of short extracts for use in reviews.

5 Contents 1. Introduction Making the most of the book Online material Notation Getting started with Stata and data: User interface and data inspection The primary user interface Open datasets Commands Inspection of data/descriptive statistics Overview of observations and variables in the dataset with -describe Codebook with -codebook set more off The output from -codebook- and types of variables Codebook information for the selected variables Notes with -notes Notes in the codebook Inspecting the data matrix with -browse Structure of the data matrix...31 Missing values Inspect selected variables with -browse Sort data with -sort- and -gsort Frequency distributions with -tabulate Frequency distributions with -codebook Frequency distributions for multiple variables with -tab Visual inspection of data with -inspect- and -histogram Detailed histogram with the graphics module and -histogram Descriptive statistics with -summarize Using the Review, Variables and Command windows Using do-files and the Do-file Editor Open the Do-file Editor and do-files Ways to use the editor...40 Run commands from the Do-file Editor...41

6 2.4.3 Commenting on commands in the do-file Entering long lines in the do-file Saving datasets and do-files...44 *Auto-save do-files Opening existing do-files Open the do-file with solutions to the exercises in this book Stata's help functions, and external resources...46 External resources Updates Data manipulation 1: Entering, deleting and documenting data Assigning and changing variable labels Assigning labels with the Variables manager...51 Editing variable labels Add and edit notes Assigning existing value labels to variables Data entry Creating new variables with -generate Stata s variable types Entering data in matrices *The commands used to create variables and input data Creating new value labels...58 Enter data for variables with value labels Deleting variables with -drop- and -keep Implicit references to multiple variables (varlist) Use of wildcards: * and?...61 Use of hyphens: Deleting observations with -drop Data manipulation II: Modification of existing variables Navigating (large) datasets Searching datasets...65 Search for specific words in variable names and labels with -lookfor *Advanced searching with -ds *Searching in notes with -notes search Rename or copy variables with -rename-, -generate- and -clonevar *Rearrange the sequence of variables with -order Finding the dialogue box for a command

7 4.3 Recode variables with -recode Simple recoding...70 Checking the recoding: Cross-tabulation with -tabulate Assigning labels and notes to recoded variables Recoding from more than one value...73 Other references to multiple numbers Recode values to missing values Multiple types of missing values Copying and editing value labels when recoding Recoding multiple variables simultaneously with -recode Recoding variables from metric to ordinal scale Recoding with -replace- and -if One or two equals signs?...86 Operators...86 Be careful with missing values!...86 The and operator (&) The or operator ( ) and -inlist recode- and -if Recoding to dummy variable with -tabulate Other types of recoding and creating new variables Tab expansion and variable name abbreviations *Abbreviations of variable names *Abbreviations, varlist and wildcards Univariate analyses Save output to log files...95 The log file s file type File location and Stata s active directory...96 Existing log files...96 Closing log files Recommended use of log files User-written programs Univariate analyses of categorical variables Descriptive analysis with -tabulate Graphic illustration Pie chart with -graph pie Bar chart with -catplot- and -histogram Disappearing graphics...107

8 5.3.3 Inference Confidence interval and hypothesis tests for proportions with -ciand -bitest An alternative for normal sampling distributions: -prtest Univariate analyses of metric variables Descriptive analyses of metric data with -summarize Graphic illustration with -histogram Inference with -ci Inspecting the log file Reporting results: exporting tables and graphics Exporting graphs Manual export of tables *Command-based export of tables with -tabout General command structure Options if- and -in- qualifiers Prefixes Expressions weight- and -using General command syntax Command abbreviations Option abbreviations Common bivariate analyses Selected bivariate analysis techniques Basic table analysis: cross-tabulation with -tabulate Multiple bivariate tables with -tab Comparison of two groups' proportions Strength of the relationship Graphic illustration with -catplot Inference with -prtest *Small samples: Fisher s exact test Bivariate analyses of multinomial variables The correlation measure lambda with -lambda Graphic illustration bivariate relationships with multinomial dependent variables Inference: Pearson's chi-squared *Small samples: Fisher s exact test...135

9 6.3.3 Graphic illustration: Multinomial independent variables and nominal/ordinal dependent variables Bivariate analyses of ordinal-scaled variables Graphic illustration with -catplot Inference for gamma and tau-b Comparison of two groups' means Graphic illustration with -graph dot Inference with -ttest *Heteroscedasticity? ANOVA comparison of means for multiple groups Graphic illustration with -graph dot Inference with -oneway *Heteroscedasticity? Bivariate analyses of metric variables Multiple correlation coefficients/a correlation matrix Inference with -pwcorr Graphic illustration: scatterplot and sunflowerplot A clearer example of a scatterplot Graphs in general: -twoway-, tab display, etc Label observations OLS line/trend line Disappearing graphs Tab display with -set autotabgraphs Recall and save graphs with -graph The graph editor Stata without data Immediate versions of -bitest-, -ci-, etc *Stata as a calculator Using saved results Saved results in r() Individual results Practical use of saved results Towards multivariate analysis Constructing reflective indices Selection of indicators Missing values Replace with neutral category Replace with the variable mean

10 Replace with the observation s mean for other items Changing the range Identical range for discrete variables Identical range for continuous variables *Standardisation/z-transformation with -egen Changing direction Testing measurement validity Item-item analysis Item-scale analysis Test of reliability with -alpha The output from the alpha command The actual index construction Summation with -alpha Use -alpha- to replace missing values with the mean of the other items Rescaling with -generate Distribution analysis Command loops with -foreach Linear regression with OLS Bivariate linear regression Metric independent variable Standardised regression coefficients Dichotomous independent variable: comparison of two groups' means with -regress Categorical independent: comparison of means of multiple groups with -regress Manual dummy regression Dummy regression with factor variables (i.var) Joint F-test of categorical variable with -testparm Change of reference category Multivariate analysis with multiple linear regression Control variables Estimation Heteroscedasticity robust standard errors Multiple models estimated with identical observations Tabulation and export of regression results Export with -esttab

11 8.4 Multiple comparisons with -margins margins-, -pwcompare- and -contrast Comparing and testing multiple differences Summing up differences with the option -groups Adjusted mean Graphic illustration of results from linear regression *Bivariate relationships with -graph dot- and -twoway Multivariate relationships, categorical variables with -marginsplot *Multivariate relationships, metric variables with -marginscontplot *Multivariate relationships metric variables and categorical variables *Illustration of results from multiple models with -coefplot Assumptions and diagnostics for the linear model Linearity in the parameters Multiple acpr plots with -foreach Absence of influential observations Leverage-versus-residual-squared plot with -lvr2plot Cook s distance with -predict Partial regression plot with -avplot- and -avplots DFBETA Normally distributed error terms Histogram of the residuals distribution q-q-plot of the residuals distribution Homoscedasticity Absence of perfect (and strong) multicollinearity Modelling and illustration of non-linear relationships Logarithmic transformations Quadratic and polynomial models Graphic illustration *Use of macros Local and global macros Global macros Interaction in linear models: Estimation, interpretation and illustration A longer example Question, theory, hypothesis and model Operationalisation

12 Estimating the interaction model Estimation with factor variable notation Evaluation of assumptions Reporting of relevant estimates/statistics Interpretation of coefficients and associated P-values from interaction models Interpretation with -margins Illustration of the relationship Plotting marginal effects Plotting the predicted relationships Estimating and modelling other types of interaction Metric primary; dichotomous interacting Dichotomous primary; metric interacting Logistic regression Preparing Stata and data for logistic regression Coding the dependent variable Example: red or blue bloc? Estimation of logistic models with -logit The output from -logit *Alternative measures of fit with -fitstat Interpretation of the results from logistic regression Interpretation with log-odds *Multiple comparisons of log-odds coefficients Interpretation with changes in predicted probabilities and -margins Interpretation with average marginal effects (AME) *Interpretation with marginal effect at the mean *Interpretation with marginal effect for typical/interesting observations Interpretation with changes in predicted probabilities *Interpretation using odds ratios *Inverted odds-ratios Graphic illustration Metric independent variable Categorical independent variable Assumptions and diagnostics Linearity/correct functional form...270

13 Graphic inspection of linearity in bivariate analyses Graphic inspection of linearity in multivariate analyses The distribution of the error terms Absence of perfect (and strong) multicollinearity Number of observations and computational challenges *Lack of variation on the dependent variable within categories of an independent variable *Complete or quasi-complete separation *Lack of convergence Appendix A: More about the Results window Font colours in the Results window Lack of output Delete the content of the Results window Appendix B: Evaluate coder reliability with -datasignature- and merge Procedure 1: Comparison of dataset with -datasignature Procedure 2: Merging datasets with -merge Rename the variables in the control dataset, using names that differ from those in the original dataset Merge the two datasets Determine whether observations have different values and, if so, identify them Correction of errors Appendix C: Vertical bar charts for univariate distribution with -catplot Appendix D: -prtest- to test univariate proportions Appendix E: Multivariate analysis of categorical variables with table analysis Inference and graphic illustration Appendix F: More about index construction Constructing a formative index Item-scale analysis with gamma References Index...299

14

15 1. Introduction This book serves as an introduction to the statistics programme Stata. The primary audience is students and others using statistical software for the first time. Experienced users of other statistical programmes who would like to learn Stata or Stata users lacking a comprehensive introduction will probably also find it useful. The book is based on Stata 13 for Windows. Stata for Mac (and for Unix) only differ very slightly from the Windows version, and most of the differences are described in the text. The purpose of the book is to help the reader to perform and reproduce statistical analyses and data processing with Stata as efficiently as possible. Method ological considerations and the statistical background for analyses are sometimes only treated superficially, so the book should probably not be used as the only resource for statistical analyses. The book addresses three main topics: 1) Navigating Stata, 2) Data manipulation and documentation, and 3) Basic univariate, bivariate and multivariate analysis techniques including graphical illustrations. Chapter 2 introduces the most important elements of Stata, including the main user interface, the data matrix, the Do-file Editor and the help system. It also looks at inspecting data using tables, graphics and statistics. Chapters 3 and 4 deal with data processing, i.e. entry, recoding, documentation, etc. The following two chapters (5 and 6) deal with frequently used univariate and bivariate analysis techniques, as well as topics such as graphics. Chapter 7 looks at the construction of reflective indices, whilst chapters 8 11 deal with linear and binary regression analysis, regression diagnostics, and regression interaction models. 1.1 Making the most of the book The book is based on two didactic principles: people learn best by doing; and people learn best if what they are doing makes sense. 1. Introduction 15

16 As a consequence of the first principle, the book contains a number of exercises for the reader. By doing the exercises and using the functions described with real datasets, readers will develop their skills and improve their understanding of Stata. Unlike most textbooks, the exercises are dotted throughout the text, immediately after the descriptions of the functions. The book also builds sequentially upon the exercises in order to derive maximum benefit from the process, readers should do the exercises when they encounter them in the text. The choice of continuous exercises was based on several years experience teaching Stata. 1 As well as clear learning benefits, it also allows readers to produce the screenshots that would otherwise take up so much space in a book like this. Some screenshots will, however, be included. The second principle means that the degree of difficulty increases steadily, and attempts have been made to structure the book so that the functions presented make sense in real-life analysis situations. This, in turn, means that some functions may not be described from start to finish at one single point in the book. Instead, some functions are referred to several times, and picked up again when they will (hopefully) make more sense. However, for the benefit of readers who would like to learn more about specific functions right away, extensive use has been made of cross-referencing throughout the book. The index can also be used for this purpose, of course. References are also made to functions in the statistics program SPSS. These are solely for the benefit of SPSS users and may be ignored by other readers. 1.2 Online material At you will find datasets and solutions to the exercises in the book (see Section 2.5). Any errata and additional material will also be published there. Mac users who attempt to download datasets and do-files may find that the file does not download but instead opens in a new window. This can be overcome by right-clicking on the file, selecting Download Linked File As and then choosing the folder in which to save the file. 1 I should like to take this opportunity to thank the many students who have provided constructive comments on my teaching materials over the years. I should also like to thank Lene Aarøe, Lotte Bøgh Andersen, Mette Bisgaard, Katrine Degn, Camilla Bjarnøe Jensen, Erik Gahner Larsen, Søren Heldgaard Olesen, Rune Stubager, editor Martin Laurberg and three reviewers for their comments on this book. And thank you to Søren Risbjerg Thomsen and Peter Thisted Dinesen for a great deal of insightful input regarding the use of Stata and statistics. Any errors, omissions and ambiguities are, of course, my own responsibility. Finally, I would like to thank Rune Stubager and Jørgen Goul Andersen, who made some of the exercise material available to me. 16 Stata

17 1.3 Notation In the interests of clarity, the book makes use of a very small number of typographical functions. Instructions for using the menu system (which will be introduced in the next section) are written in. As such, means click on in the menu bar, then and then on. Command names (e.g. ), everything that is part of the Stata interface, keyboard combinations (e.g. ) and variable names in the active data set (e.g. ) are also written in. Commands, user interfaces, etc. are introduced in the next section. ## Exercises are marked like this section. The number refers to the numbering in the file containing the solutions. Commands that can be run from the Do-file Editor or command window are in boxes with shaded backgrounds like this. The Do-file Editor and Command window are introduced in the next section. These types of commands will often contain general references to variable names such as var1 in the next line: summarize var1 General names like these, which you have to replace with actual variable names, are in italics. *Entire sections that you can skip without losing the thread are marked with an asterisk, like this section. Box 1.1 Explanatory comments that you can bypass without losing the thread are in boxes like this. 1. Introduction 17

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SAS SAS (originally Statistical Analysis Software ) is a commercial statistical software package based on a powerful programming

More information

STATA 13 INTRODUCTION

STATA 13 INTRODUCTION STATA 13 INTRODUCTION Catherine McGowan & Elaine Williamson LONDON SCHOOL OF HYGIENE & TROPICAL MEDICINE DECEMBER 2013 0 CONTENTS INTRODUCTION... 1 Versions of STATA... 1 OPENING STATA... 1 THE STATA

More information

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SPSS SPSS (originally Statistical Package for the Social Sciences ) is a commercial statistical software package with an easy-to-use

More information

Getting started with Stata 2017: Cheat-sheet

Getting started with Stata 2017: Cheat-sheet Getting started with Stata 2017: Cheat-sheet 4. september 2017 1 Get started Graphical user interface (GUI). Clickable. Simple. Commands. Allows for use of do-le. Easy to keep track. Command window: Write

More information

Introduction to Stata. Written by Yi-Chi Chen

Introduction to Stata. Written by Yi-Chi Chen Introduction to Stata Written by Yi-Chi Chen Center for Social Science Computation & Research 145 Savery Hall University of Washington Seattle, WA 98195 U.S.A (206)543-8110 September 2002 http://julius.csscr.washington.edu/pdf/stata.pdf

More information

Product Catalog. AcaStat. Software

Product Catalog. AcaStat. Software Product Catalog AcaStat Software AcaStat AcaStat is an inexpensive and easy-to-use data analysis tool. Easily create data files or import data from spreadsheets or delimited text files. Run crosstabulations,

More information

1. Basic Steps for Data Analysis Data Editor. 2.4.To create a new SPSS file

1. Basic Steps for Data Analysis Data Editor. 2.4.To create a new SPSS file 1 SPSS Guide 2009 Content 1. Basic Steps for Data Analysis. 3 2. Data Editor. 2.4.To create a new SPSS file 3 4 3. Data Analysis/ Frequencies. 5 4. Recoding the variable into classes.. 5 5. Data Analysis/

More information

Correctly Compute Complex Samples Statistics

Correctly Compute Complex Samples Statistics SPSS Complex Samples 15.0 Specifications Correctly Compute Complex Samples Statistics When you conduct sample surveys, use a statistics package dedicated to producing correct estimates for complex sample

More information

SPSS QM II. SPSS Manual Quantitative methods II (7.5hp) SHORT INSTRUCTIONS BE CAREFUL

SPSS QM II. SPSS Manual Quantitative methods II (7.5hp) SHORT INSTRUCTIONS BE CAREFUL SPSS QM II SHORT INSTRUCTIONS This presentation contains only relatively short instructions on how to perform some statistical analyses in SPSS. Details around a certain function/analysis method not covered

More information

Dr. Barbara Morgan Quantitative Methods

Dr. Barbara Morgan Quantitative Methods Dr. Barbara Morgan Quantitative Methods 195.650 Basic Stata This is a brief guide to using the most basic operations in Stata. Stata also has an on-line tutorial. At the initial prompt type tutorial. In

More information

Intro to Stata. University of Virginia Library data.library.virginia.edu. September 16, 2014

Intro to Stata. University of Virginia Library data.library.virginia.edu. September 16, 2014 to 1/12 Intro to University of Virginia Library data.library.virginia.edu September 16, 2014 Getting to Know to 2/12 Strengths Available A full-featured statistical programming language For Windows, Mac

More information

8. MINITAB COMMANDS WEEK-BY-WEEK

8. MINITAB COMMANDS WEEK-BY-WEEK 8. MINITAB COMMANDS WEEK-BY-WEEK In this section of the Study Guide, we give brief information about the Minitab commands that are needed to apply the statistical methods in each week s study. They are

More information

An Introductory Guide to Stata

An Introductory Guide to Stata An Introductory Guide to Stata Scott L. Minkoff Assistant Professor Department of Political Science Barnard College sminkoff@barnard.edu Updated: July 9, 2012 1 TABLE OF CONTENTS ABOUT THIS GUIDE... 4

More information

Introduction. About this Document. What is SPSS. ohow to get SPSS. oopening Data

Introduction. About this Document. What is SPSS. ohow to get SPSS. oopening Data Introduction About this Document This manual was written by members of the Statistical Consulting Program as an introduction to SPSS 12.0. It is designed to assist new users in familiarizing themselves

More information

Introduction to STATA

Introduction to STATA Introduction to STATA Duah Dwomoh, MPhil School of Public Health, University of Ghana, Accra July 2016 International Workshop on Impact Evaluation of Population, Health and Nutrition Programs Learning

More information

Intermediate Stata Workshop. Hsueh-Sheng Wu CFDR Workshop Series Spring 2009

Intermediate Stata Workshop. Hsueh-Sheng Wu CFDR Workshop Series Spring 2009 Intermediate Stata Workshop Hsueh-Sheng Wu CFDR Workshop Series Spring 2009 Outline of the Workshop A brief review of topics covered in the Introduction to Stata workshop (Fall 2008) Commands for data

More information

Introduction to Stata. Getting Started. This is the simple command syntax in Stata and more conditions can be added as shown in the examples.

Introduction to Stata. Getting Started. This is the simple command syntax in Stata and more conditions can be added as shown in the examples. Getting Started Command Syntax command varlist, option This is the simple command syntax in Stata and more conditions can be added as shown in the examples. Preamble mkdir tutorial /* to create a new directory,

More information

CHAPTER 7 EXAMPLES: MIXTURE MODELING WITH CROSS- SECTIONAL DATA

CHAPTER 7 EXAMPLES: MIXTURE MODELING WITH CROSS- SECTIONAL DATA Examples: Mixture Modeling With Cross-Sectional Data CHAPTER 7 EXAMPLES: MIXTURE MODELING WITH CROSS- SECTIONAL DATA Mixture modeling refers to modeling with categorical latent variables that represent

More information

JMP Book Descriptions

JMP Book Descriptions JMP Book Descriptions The collection of JMP documentation is available in the JMP Help > Books menu. This document describes each title to help you decide which book to explore. Each book title is linked

More information

Creating a data file and entering data

Creating a data file and entering data 4 Creating a data file and entering data There are a number of stages in the process of setting up a data file and analysing the data. The flow chart shown on the next page outlines the main steps that

More information

Ivy s Business Analytics Foundation Certification Details (Module I + II+ III + IV + V)

Ivy s Business Analytics Foundation Certification Details (Module I + II+ III + IV + V) Ivy s Business Analytics Foundation Certification Details (Module I + II+ III + IV + V) Based on Industry Cases, Live Exercises, & Industry Executed Projects Module (I) Analytics Essentials 81 hrs 1. Statistics

More information

Data Management - 50%

Data Management - 50% Exam 1: SAS Big Data Preparation, Statistics, and Visual Exploration Data Management - 50% Navigate within the Data Management Studio Interface Register a new QKB Create and connect to a repository Define

More information

Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242

Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242 Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242 Creation & Description of a Data Set * 4 Levels of Measurement * Nominal, ordinal, interval, ratio * Variable Types

More information

Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition

Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Online Learning Centre Technology Step-by-Step - Minitab Minitab is a statistical software application originally created

More information

STATISTICS (STAT) Statistics (STAT) 1

STATISTICS (STAT) Statistics (STAT) 1 Statistics (STAT) 1 STATISTICS (STAT) STAT 2013 Elementary Statistics (A) Prerequisites: MATH 1483 or MATH 1513, each with a grade of "C" or better; or an acceptable placement score (see placement.okstate.edu).

More information

Choosing the Right Procedure

Choosing the Right Procedure 3 CHAPTER 1 Choosing the Right Procedure Functional Categories of Base SAS Procedures 3 Report Writing 3 Statistics 3 Utilities 4 Report-Writing Procedures 4 Statistical Procedures 5 Efficiency Issues

More information

Correctly Compute Complex Samples Statistics

Correctly Compute Complex Samples Statistics PASW Complex Samples 17.0 Specifications Correctly Compute Complex Samples Statistics When you conduct sample surveys, use a statistics package dedicated to producing correct estimates for complex sample

More information

A quick introduction to STATA

A quick introduction to STATA A quick introduction to STATA Data files and other resources for the course book Introduction to Econometrics by Stock and Watson is available on: http://wps.aw.com/aw_stock_ie_3/178/45691/11696965.cw/index.html

More information

Introduction to Stata - Session 1

Introduction to Stata - Session 1 Introduction to Stata - Session 1 Simon, Hong based on Andrea Papini ECON 3150/4150, UiO January 15, 2018 1 / 33 Preparation Before we start Sit in teams of two Download the file auto.dta from the course

More information

Data analysis using Stata , AMSE Master (M1), Spring semester

Data analysis using Stata , AMSE Master (M1), Spring semester Data analysis using Stata 2016-2017, AMSE Master (M1), Spring semester Notes Marc Sangnier Data analysis using Stata Virtually infinite number of tasks for data analysis. Almost infinite number of commands

More information

The Power and Sample Size Application

The Power and Sample Size Application Chapter 72 The Power and Sample Size Application Contents Overview: PSS Application.................................. 6148 SAS Power and Sample Size............................... 6148 Getting Started:

More information

Getting Started with JMP at ISU

Getting Started with JMP at ISU Getting Started with JMP at ISU 1 Introduction JMP (pronounced like jump ) is the new campus-wide standard statistical package for introductory statistics courses at Iowa State University. JMP is produced

More information

AcaStat User Manual. Version 8.3 for Mac and Windows. Copyright 2014, AcaStat Software. All rights Reserved.

AcaStat User Manual. Version 8.3 for Mac and Windows. Copyright 2014, AcaStat Software. All rights Reserved. AcaStat User Manual Version 8.3 for Mac and Windows Copyright 2014, AcaStat Software. All rights Reserved. http://www.acastat.com Table of Contents INTRODUCTION... 5 GETTING HELP... 5 INSTALLATION... 5

More information

Data Science. Data Analyst. Data Scientist. Data Architect

Data Science. Data Analyst. Data Scientist. Data Architect Data Science Data Analyst Data Analysis in Excel Programming in R Introduction to Python/SQL/Tableau Data Visualization in R / Tableau Exploratory Data Analysis Data Scientist Inferential Statistics &

More information

Choosing the Right Procedure

Choosing the Right Procedure 3 CHAPTER 1 Choosing the Right Procedure Functional Categories of Base SAS Procedures 3 Report Writing 3 Statistics 3 Utilities 4 Report-Writing Procedures 4 Statistical Procedures 6 Available Statistical

More information

Introduction to Stata: An In-class Tutorial

Introduction to Stata: An In-class Tutorial Introduction to Stata: An I. The Basics - Stata is a command-driven statistical software program. In other words, you type in a command, and Stata executes it. You can use the drop-down menus to avoid

More information

Excel 2010 with XLSTAT

Excel 2010 with XLSTAT Excel 2010 with XLSTAT J E N N I F E R LE W I S PR I E S T L E Y, PH.D. Introduction to Excel 2010 with XLSTAT The layout for Excel 2010 is slightly different from the layout for Excel 2007. However, with

More information

Research Methods for Business and Management. Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel

Research Methods for Business and Management. Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel Research Methods for Business and Management Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel A Simple Example- Gym Purpose of Questionnaire- to determine the participants involvement

More information

Workshop for empirical trade analysis. December 2015 Bangkok, Thailand

Workshop for empirical trade analysis. December 2015 Bangkok, Thailand Workshop for empirical trade analysis December 2015 Bangkok, Thailand Cosimo Beverelli (WTO) Rainer Lanz (WTO) Content a. Resources b. Stata windows c. Organization of the Bangkok_Dec_2015\Stata folder

More information

MINITAB Release Comparison Chart Release 14, Release 13, and Student Versions

MINITAB Release Comparison Chart Release 14, Release 13, and Student Versions Technical Support Free technical support Worksheet Size All registered users, including students Registered instructors Number of worksheets Limited only by system resources 5 5 Number of cells per worksheet

More information

A quick introduction to STATA:

A quick introduction to STATA: 1 Revised September 2008 A quick introduction to STATA: (by E. Bernhardsen, with additions by H. Goldstein) 1. How to access STATA from the pc s at the computer lab After having logged in you have to log

More information

SAS Training Spring 2006

SAS Training Spring 2006 SAS Training Spring 2006 Coxe/Maner/Aiken Introduction to SAS: This is what SAS looks like when you first open it: There is a Log window on top; this will let you know what SAS is doing and if SAS encountered

More information

Stata: A Brief Introduction Biostatistics

Stata: A Brief Introduction Biostatistics Stata: A Brief Introduction Biostatistics 140.621 2005-2006 1. Statistical Packages There are many statistical packages (Stata, SPSS, SAS, Splus, etc.) Statistical packages can be used for Analysis Data

More information

You will learn: The structure of the Stata interface How to open files in Stata How to modify variable and value labels How to manipulate variables

You will learn: The structure of the Stata interface How to open files in Stata How to modify variable and value labels How to manipulate variables Jennie Murack You will learn: The structure of the Stata interface How to open files in Stata How to modify variable and value labels How to manipulate variables How to conduct basic descriptive statistics

More information

Quick Start Guide Jacob Stolk PhD Simone Stolk MPH November 2018

Quick Start Guide Jacob Stolk PhD Simone Stolk MPH November 2018 Quick Start Guide Jacob Stolk PhD Simone Stolk MPH November 2018 Contents Introduction... 1 Start DIONE... 2 Load Data... 3 Missing Values... 5 Explore Data... 6 One Variable... 6 Two Variables... 7 All

More information

Lab 2: OLS regression

Lab 2: OLS regression Lab 2: OLS regression Andreas Beger February 2, 2009 1 Overview This lab covers basic OLS regression in Stata, including: multivariate OLS regression reporting coefficients with different confidence intervals

More information

Introduction to STATA

Introduction to STATA Center for Teaching, Research and Learning Research Support Group American University, Washington, D.C. Hurst Hall 203 rsg@american.edu (202) 885-3862 Introduction to STATA WORKSHOP OBJECTIVE: This workshop

More information

Four steps in an effective workflow...

Four steps in an effective workflow... Four steps in an effective workflow... 1. Cleaning data Things to do: Verify your data are accurate Variables should be well named Variables should be properly labeled Ask yourself: Do the variables have

More information

StatCalc User Manual. Version 9 for Mac and Windows. Copyright 2018, AcaStat Software. All rights Reserved.

StatCalc User Manual. Version 9 for Mac and Windows. Copyright 2018, AcaStat Software. All rights Reserved. StatCalc User Manual Version 9 for Mac and Windows Copyright 2018, AcaStat Software. All rights Reserved. http://www.acastat.com Table of Contents Introduction... 4 Getting Help... 4 Uninstalling StatCalc...

More information

An introduction to SPSS

An introduction to SPSS An introduction to SPSS To open the SPSS software using U of Iowa Virtual Desktop... Go to https://virtualdesktop.uiowa.edu and choose SPSS 24. Contents NOTE: Save data files in a drive that is accessible

More information

Index. Bar charts, 106 bartlett.test function, 159 Bottles dataset, 69 Box plots, 113

Index. Bar charts, 106 bartlett.test function, 159 Bottles dataset, 69 Box plots, 113 Index A Add-on packages information page, 186 187 Linux users, 191 Mac users, 189 mirror sites, 185 Windows users, 187 aggregate function, 62 Analysis of variance (ANOVA), 152 anova function, 152 as.data.frame

More information

Right-click on whatever it is you are trying to change Get help about the screen you are on Help Help Get help interpreting a table

Right-click on whatever it is you are trying to change Get help about the screen you are on Help Help Get help interpreting a table Q Cheat Sheets What to do when you cannot figure out how to use Q What to do when the data looks wrong Right-click on whatever it is you are trying to change Get help about the screen you are on Help Help

More information

Technical Support Minitab Version Student Free technical support for eligible products

Technical Support Minitab Version Student Free technical support for eligible products Technical Support Free technical support for eligible products All registered users (including students) All registered users (including students) Registered instructors Not eligible Worksheet Size Number

More information

JMP 10 Student Edition Quick Guide

JMP 10 Student Edition Quick Guide JMP 10 Student Edition Quick Guide Instructions presume an open data table, default preference settings and appropriately typed, user-specified variables of interest. RMC = Click Right Mouse Button Graphing

More information

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SAS code SAS (originally Statistical Analysis Software) is a commercial statistical software package based on a powerful programming

More information

Using SPSS with The Fundamentals of Political Science Research

Using SPSS with The Fundamentals of Political Science Research Using SPSS with The Fundamentals of Political Science Research Paul M. Kellstedt and Guy D. Whitten Department of Political Science Texas A&M University c Paul M. Kellstedt and Guy D. Whitten 2009 Contents

More information

A Short Guide to Stata 10 for Windows

A Short Guide to Stata 10 for Windows A Short Guide to Stata 10 for Windows 1. Introduction 2 2. The Stata Environment 2 3. Where to get help 2 4. Opening and Saving Data 3 5. Importing Data 4 6. Data Manipulation 5 7. Descriptive Statistics

More information

Empirical trade analysis

Empirical trade analysis Empirical trade analysis Introduction to Stata Cosimo Beverelli World Trade Organization Cosimo Beverelli Stata introduction Bangkok, 18-21 Dec 2017 1 / 23 Outline 1 Resources 2 How Stata looks like 3

More information

Subject index. ASCII data, reading comma-separated fixed column multiple lines per observation

Subject index. ASCII data, reading comma-separated fixed column multiple lines per observation Subject index Symbols %fmt... 106 110 * abbreviation character... 374 377 * comment indicator...346 + combining strings... 124 125 - abbreviation character... 374 377.,.a,.b,...,.z missing values.. 130

More information

Basic concepts and terms

Basic concepts and terms CHAPTER ONE Basic concepts and terms I. Key concepts Test usefulness Reliability Construct validity Authenticity Interactiveness Impact Practicality Assessment Measurement Test Evaluation Grading/marking

More information

UNIT 4. Research Methods in Business

UNIT 4. Research Methods in Business UNIT 4 Preparing Data for Analysis:- After data are obtained through questionnaires, interviews, observation or through secondary sources, they need to be edited. The blank responses, if any have to be

More information

Introduction to SAS. I. Understanding the basics In this section, we introduce a few basic but very helpful commands.

Introduction to SAS. I. Understanding the basics In this section, we introduce a few basic but very helpful commands. Center for Teaching, Research and Learning Research Support Group American University, Washington, D.C. Hurst Hall 203 rsg@american.edu (202) 885-3862 Introduction to SAS Workshop Objective This workshop

More information

WELCOME! Lecture 3 Thommy Perlinger

WELCOME! Lecture 3 Thommy Perlinger Quantitative Methods II WELCOME! Lecture 3 Thommy Perlinger Program Lecture 3 Cleaning and transforming data Graphical examination of the data Missing Values Graphical examination of the data It is important

More information

Minitab 17 commands Prepared by Jeffrey S. Simonoff

Minitab 17 commands Prepared by Jeffrey S. Simonoff Minitab 17 commands Prepared by Jeffrey S. Simonoff Data entry and manipulation To enter data by hand, click on the Worksheet window, and enter the values in as you would in any spreadsheet. To then save

More information

Antrix Academy of Data Science TM

Antrix Academy of Data Science TM TM Preparing for MOST Promising Career Opportunities in Data Analytics... Excel Tableau SAS Excel & SQL IBM SPSS Business Analytics COURSES # Duration* 1 Excel Proficiency 5 Hrs 2 Data Analytics with SAS

More information

Organizing Your Data. Jenny Holcombe, PhD UT College of Medicine Nuts & Bolts Conference August 16, 3013

Organizing Your Data. Jenny Holcombe, PhD UT College of Medicine Nuts & Bolts Conference August 16, 3013 Organizing Your Data Jenny Holcombe, PhD UT College of Medicine Nuts & Bolts Conference August 16, 3013 Learning Objectives Identify Different Types of Variables Appropriately Naming Variables Constructing

More information

Data Mining. SPSS Clementine k-means Algorithm. Spring 2010 Instructor: Dr. Masoud Yaghini. Clementine

Data Mining. SPSS Clementine k-means Algorithm. Spring 2010 Instructor: Dr. Masoud Yaghini. Clementine Data Mining SPSS 12.0 6. k-means Algorithm Spring 2010 Instructor: Dr. Masoud Yaghini Outline K-Means Algorithm in K-Means Node References K-Means Algorithm in Overview The k-means method is a clustering

More information

Exploring Data. This guide describes the facilities in SPM to gain initial insights about a dataset by viewing and generating descriptive statistics.

Exploring Data. This guide describes the facilities in SPM to gain initial insights about a dataset by viewing and generating descriptive statistics. This guide describes the facilities in SPM to gain initial insights about a dataset by viewing and generating descriptive statistics. 2018 by Minitab Inc. All rights reserved. Minitab, SPM, SPM Salford

More information

Chapter 17: INTERNATIONAL DATA PRODUCTS

Chapter 17: INTERNATIONAL DATA PRODUCTS Chapter 17: INTERNATIONAL DATA PRODUCTS After the data processing and data analysis, a series of data products were delivered to the OECD. These included public use data files and codebooks, compendia

More information

book 2014/5/6 15:21 page v #3 List of figures List of tables Preface to the second edition Preface to the first edition

book 2014/5/6 15:21 page v #3 List of figures List of tables Preface to the second edition Preface to the first edition book 2014/5/6 15:21 page v #3 Contents List of figures List of tables Preface to the second edition Preface to the first edition xvii xix xxi xxiii 1 Data input and output 1 1.1 Input........................................

More information

- 1 - Fig. A5.1 Missing value analysis dialog box

- 1 - Fig. A5.1 Missing value analysis dialog box WEB APPENDIX Sarstedt, M. & Mooi, E. (2019). A concise guide to market research. The process, data, and methods using SPSS (3 rd ed.). Heidelberg: Springer. Missing Value Analysis and Multiple Imputation

More information

Probabilistic Analysis Tutorial

Probabilistic Analysis Tutorial Probabilistic Analysis Tutorial 2-1 Probabilistic Analysis Tutorial This tutorial will familiarize the user with the Probabilistic Analysis features of Swedge. In a Probabilistic Analysis, you can define

More information

SAS/STAT 13.1 User s Guide. The Power and Sample Size Application

SAS/STAT 13.1 User s Guide. The Power and Sample Size Application SAS/STAT 13.1 User s Guide The Power and Sample Size Application This document is an individual chapter from SAS/STAT 13.1 User s Guide. The correct bibliographic citation for the complete manual is as

More information

An Introduction to Stata Part II: Data Analysis

An Introduction to Stata Part II: Data Analysis An Introduction to Stata Part II: Data Analysis Kerry L. Papps 1. Overview Do-files Sorting a dataset Combining datasets Creating a dataset of means or medians etc. Weights Panel data capabilities Dummy

More information

Introduction to Mplus

Introduction to Mplus Introduction to Mplus May 12, 2010 SPONSORED BY: Research Data Centre Population and Life Course Studies PLCS Interdisciplinary Development Initiative Piotr Wilk piotr.wilk@schulich.uwo.ca OVERVIEW Mplus

More information

Learn What s New. Statistical Software

Learn What s New. Statistical Software Statistical Software Learn What s New Upgrade now to access new and improved statistical features and other enhancements that make it even easier to analyze your data. The Assistant Data Customization

More information

Resources for statistical assistance. Quantitative covariates and regression analysis. Methods for predicting continuous outcomes.

Resources for statistical assistance. Quantitative covariates and regression analysis. Methods for predicting continuous outcomes. Resources for statistical assistance Quantitative covariates and regression analysis Carolyn Taylor Applied Statistics and Data Science Group (ASDa) Department of Statistics, UBC January 24, 2017 Department

More information

STATA TUTORIAL B. Rabin with modifications by T. Marsh

STATA TUTORIAL B. Rabin with modifications by T. Marsh STATA TUTORIAL B. Rabin with modifications by T. Marsh 5.2.05 (content also from http://www.ats.ucla.edu/stat/spss/faq/compare_packages.htm) Why choose Stata? Stata has a wide array of pre-defined statistical

More information

A Quick Guide to Stata 8 for Windows

A Quick Guide to Stata 8 for Windows Université de Lausanne, HEC Applied Econometrics II Kurt Schmidheiny October 22, 2003 A Quick Guide to Stata 8 for Windows 2 1 Introduction A Quick Guide to Stata 8 for Windows This guide introduces the

More information

STATS PAD USER MANUAL

STATS PAD USER MANUAL STATS PAD USER MANUAL For Version 2.0 Manual Version 2.0 1 Table of Contents Basic Navigation! 3 Settings! 7 Entering Data! 7 Sharing Data! 8 Managing Files! 10 Running Tests! 11 Interpreting Output! 11

More information

Assignments Fill out this form to do the assignments or see your scores.

Assignments Fill out this form to do the assignments or see your scores. Assignments Assignment schedule General instructions for online assignments Troubleshooting technical problems Fill out this form to do the assignments or see your scores. Login Course: Statistics W21,

More information

An Introduction to Stata Part I: Data Management

An Introduction to Stata Part I: Data Management An Introduction to Stata Part I: Data Management Kerry L. Papps 1. Overview These two classes aim to give you the necessary skills to get started using Stata for empirical research. The first class will

More information

MS in Applied Statistics: Study Guide for the Data Science concentration Comprehensive Examination. 1. MAT 456 Applied Regression Analysis

MS in Applied Statistics: Study Guide for the Data Science concentration Comprehensive Examination. 1. MAT 456 Applied Regression Analysis MS in Applied Statistics: Study Guide for the Data Science concentration Comprehensive Examination. The Part II comprehensive examination is a three-hour closed-book exam that is offered on the second

More information

CLAREMONT MCKENNA COLLEGE. Fletcher Jones Student Peer to Peer Technology Training Program. Basic Statistics using Stata

CLAREMONT MCKENNA COLLEGE. Fletcher Jones Student Peer to Peer Technology Training Program. Basic Statistics using Stata CLAREMONT MCKENNA COLLEGE Fletcher Jones Student Peer to Peer Technology Training Program Basic Statistics using Stata An Introduction to Stata A Comparison of Statistical Packages... 3 Opening Stata...

More information

Study Guide. Module 1. Key Terms

Study Guide. Module 1. Key Terms Study Guide Module 1 Key Terms general linear model dummy variable multiple regression model ANOVA model ANCOVA model confounding variable squared multiple correlation adjusted squared multiple correlation

More information

range: [1,20] units: 1 unique values: 20 missing.: 0/20 percentiles: 10% 25% 50% 75% 90%

range: [1,20] units: 1 unique values: 20 missing.: 0/20 percentiles: 10% 25% 50% 75% 90% ------------------ log: \Term 2\Lecture_2s\regression1a.log log type: text opened on: 22 Feb 2008, 03:29:09. cmdlog using " \Term 2\Lecture_2s\regression1a.do" (cmdlog \Term 2\Lecture_2s\regression1a.do

More information

SPSS: AN OVERVIEW. V.K. Bhatia Indian Agricultural Statistics Research Institute, New Delhi

SPSS: AN OVERVIEW. V.K. Bhatia Indian Agricultural Statistics Research Institute, New Delhi SPSS: AN OVERVIEW V.K. Bhatia Indian Agricultural Statistics Research Institute, New Delhi-110012 The abbreviation SPSS stands for Statistical Package for the Social Sciences and is a comprehensive system

More information

Making Tables and Figures

Making Tables and Figures Making Tables and Figures Don Quick Colorado State University Tables and figures are used in most fields of study to provide a visual presentation of important information to the reader. They are used

More information

User Services Spring 2008 OBJECTIVES Introduction Getting Help Instructors

User Services Spring 2008 OBJECTIVES  Introduction Getting Help  Instructors User Services Spring 2008 OBJECTIVES Use the Data Editor of SPSS 15.0 to to import data. Recode existing variables and compute new variables Use SPSS utilities and options Conduct basic statistical tests.

More information

Problem set for Week 7 Linear models: Linear regression, multiple linear regression, ANOVA, ANCOVA

Problem set for Week 7 Linear models: Linear regression, multiple linear regression, ANOVA, ANCOVA ECL 290 Statistical Models in Ecology using R Problem set for Week 7 Linear models: Linear regression, multiple linear regression, ANOVA, ANCOVA Datasets in this problem set adapted from those provided

More information

Regression. Dr. G. Bharadwaja Kumar VIT Chennai

Regression. Dr. G. Bharadwaja Kumar VIT Chennai Regression Dr. G. Bharadwaja Kumar VIT Chennai Introduction Statistical models normally specify how one set of variables, called dependent variables, functionally depend on another set of variables, called

More information

STM103 Spring 2008 INTRODUCTION TO STATA 8.0

STM103 Spring 2008 INTRODUCTION TO STATA 8.0 STM103 Spring 2008 INTRODUCTION TO STATA 8.0 1. PREPARING BEFORE ENTERING THE LAB... 2 Getting the shared dataset... 2 Assignment 1 preparation... 2 2. STARTING A STATA SESSION... 3 Opening, Saving, and

More information

Table of Contents (As covered from textbook)

Table of Contents (As covered from textbook) Table of Contents (As covered from textbook) Ch 1 Data and Decisions Ch 2 Displaying and Describing Categorical Data Ch 3 Displaying and Describing Quantitative Data Ch 4 Correlation and Linear Regression

More information

Brief Guide on Using SPSS 10.0

Brief Guide on Using SPSS 10.0 Brief Guide on Using SPSS 10.0 (Use student data, 22 cases, studentp.dat in Dr. Chang s Data Directory Page) (Page address: http://www.cis.ysu.edu/~chang/stat/) I. Processing File and Data To open a new

More information

CCSSM Curriculum Analysis Project Tool 1 Interpreting Functions in Grades 9-12

CCSSM Curriculum Analysis Project Tool 1 Interpreting Functions in Grades 9-12 Tool 1: Standards for Mathematical ent: Interpreting Functions CCSSM Curriculum Analysis Project Tool 1 Interpreting Functions in Grades 9-12 Name of Reviewer School/District Date Name of Curriculum Materials:

More information

SPSS for Survey Analysis

SPSS for Survey Analysis STC: SPSS for Survey Analysis 1 SPSS for Survey Analysis STC: SPSS for Survey Analysis 2 SPSS for Surveys: Contents Background Information... 4 Opening and creating new documents... 5 Starting SPSS...

More information

AcaStat User Manual. Version 10 for Mac and Windows. Copyright 2018, AcaStat Software. All rights Reserved.

AcaStat User Manual. Version 10 for Mac and Windows. Copyright 2018, AcaStat Software. All rights Reserved. AcaStat User Manual Version 10 for Mac and Windows Copyright 2018, AcaStat Software. All rights Reserved. http://www.acastat.com Table of Contents NEW IN VERSION 10... 6 INTRODUCTION... 7 GETTING HELP...

More information

Advanced Stata Skills

Advanced Stata Skills Advanced Stata Skills Contents Stata output in Excel & Word... 2 Copy as Table... 2 putexcel... 2 tabout (v3)... 2 estout / esttab... 2 Common Options... 2 Nice Looking Tables in Stata... 3 Descriptive

More information

REALCOM-IMPUTE: multiple imputation using MLwin. Modified September Harvey Goldstein, Centre for Multilevel Modelling, University of Bristol

REALCOM-IMPUTE: multiple imputation using MLwin. Modified September Harvey Goldstein, Centre for Multilevel Modelling, University of Bristol REALCOM-IMPUTE: multiple imputation using MLwin. Modified September 2014 by Harvey Goldstein, Centre for Multilevel Modelling, University of Bristol This description is divided into two sections. In the

More information

Forfattere Intro to SPSS 19.0 Description

Forfattere Intro to SPSS 19.0 Description Forfattere Nicholas Fritsche Rasmus Porsgaard Casper Voigt Rasmussen Martin Klint Hansen Morten Christoffersen Ulrick Tøttrup Niels Yding Sørensen Morten Mondrup Andreassen Jesper Pedersen Intro to SPSS

More information