7/18/16. Review. Review of Homework. Lecture 3: Programming Statistics in R. Questions from last lecture? Problems with Stata? Problems with Excel?
|
|
- Eunice Patterson
- 5 years ago
- Views:
Transcription
1 Lecture 3: Programming Statistics in R Christopher S. Hollenbeak, PhD Jane R. Schubart, PhD The Outcomes Research Toolbox Review Questions from last lecture? Problems with Stata? Problems with Excel? 2 Review of Homework Create age and HLA mismatch categories Perform t tests for age, cold ischemia time, surgery duration, and HLA mismatches Perform chi-square tests for sex and race Use summarize command with if statement to get summaries Move summaries to Excel and format Make graph 3 1
2 cd /Volumes/Hollenbeak/Teaching/Residents/OutcomesResearch/ clear insheet using "ltd_data.csv" generate age039=0 replace age039=1 if age < 40 generate age4049=0 replace age4049=1 if age >= 40 & age < 50 generate age5059=0 replace age5059=1 if age >= 50 & age < 60 generate age60=0 replace age60=1 if age >= 60 generate ab0=0 replace ab0=1 if abmm==0 generate ab1=0 replace ab1=1 if abmm==1 generate ab2=0 replace ab2=1 if abmm==2 generate ab3=0 replace ab3=1 if abmm==3 generate ab4=0 replace ab4=1 if abmm==4 summarize age age039 age4049 age5059 age60 female male black nonblack coldtime dur_surg abmm ab0 ab1 ab2 ab3 ab4 if ssi==0, sep(0) summarize age age039 age4049 age5059 age60 female male black nonblack coldtime dur_surg abmm ab0 ab1 ab2 ab3 ab4 if ssi==1, sep(0) ttest age, by(ssi) ttest coldtime, by(ssi) ttest dur_surg, by(ssi) ttest abmm, by(ssi) tab female ssi, chi2 tab black ssi, chi2 twoway scatter los age, title("relationship between Age and LOS") xtitle("age (Years)") ytitle("hospital Stay (Days) ) 4 Stata Tip To find a variable quickly, use the command lookfor This will search all variable names and labels for your text For example: lookfor bmi will identify two variables in the data set that contain the text bmi bmi (which is patient s bmi at listing) bmi_d, which is donor BMI at hepatectomy 5 Overview R is an statistical computing environment Based on the S language (like S-Plus) Free and open source Extensive add-on packages available Also free and open source 2
3 Overview R has become the most widely used statistical software More flexible than Stata More extensible than Stata More up-to-date than Stata R language is object-oriented Usually requires fewer steps Can be unintuitive Did I mention it is free?? Common Tasks in R Import data Create variables Subset data Univariate statistics Chi-square tests T-tests ANOVA Graphics Histograms Scatterplots Boxplots R Interface We use RStudio to run R Process is (almost) the same as Stata Write a script file of commands Please make sure to save your script files Run blocks of commands and retrieve output Move to Word for cleaning Move output to Excel for formatting 3
4 R Interface Script Files Variables Console / Text output Browser Differences Between R and Stata Stata holds one data set only, and all commands apply to that data set R can hold multiple data sets Commands and variables must specific which data set Stata code is commands, separated by spaces, options after a single comma R code is functions, with parentheses, and options separated by multiple commas R is object oriented, so process is 1) create an object, 2) summarize the object Setting the Working Directory It is good practice to point R to a directory where it will thereafter retrieve files and write files The function is setwd() For example, if I have my.csv file with liver transplant data sitting in a projects folder Mac/Unix: setwd( ~/projects/ltd/ ) Windows: setwd( c:/users/chollenbeak/ projects/ltd/ ) Note that directories are separated by forward slashes / not backslashes \ even in Windows 4
5 Running a Command To run a command in R: Highlight the line of code in your script file Mac: Cmd + Enter Windows: Ctrl + Enter Import Data Create a comma-separate-value text file Can be done in Excel Save As CSV Variable names in first row R stores data in objects called data frames You can import, create, and subset as many data frames as you want during an R session R Command: read.csv Import the data using the command read.csv() e.g. data1 <- read.csv( c:/projects/ltd_data.csv ) Or, if you have your working directory set to c:/projects/ already, you can simply use: data1 <- read.csv( ltd_data.csv ) This creates a data object called data1 When you want to use these data, refer to this object The <- is called the assignment operator and assigns a value to the object 5
6 Viewing Data To view your data, click on the name of the data frame in the Environment window The data set will then appear in the script window Referring to Variables All variables are associated with a data frame To call a variable, use object$variable For example, to compute the mean cost contained in data frame data1, use: mean(data1$cost1) Create New Variables Variables should be created inside a data frame To create a new variable, use the assignment operator Example: Assume you have a male dummy variable but no female dummy variable Create a female dummy variable using: data1$female <- 1 data$male 6
7 Create New Variables Assume you have a sex variable coded as M and F You need to create male and female dummy variables Use the ifelse() function data$newvar <- ifelse(condition, value_if_true, value_if_false) data1$male <- ifelse(data1$gender== M,1,0) data1$female <- 1- data1$male Create New Variables Assume you have a race variable coded as White, Black, Hispanic, Asian, and Other You need to create race/ethnicity dummy variables data1$white <- ifelse(data1$race== White,1,0) data1$black <- ifelse(data1$race== Black,1,0) data1$hispanic <- ifelse(data1$race== Hispanic,1,0) data1$asian <- ifelse(data1$race== Asian,1,0) data1$other <- ifelse(data1$race== Other,1,0) Create New Variables Assume you have a dummy variables for race and need a categorical version You can embed the ifelse() functions data1$race <- ifelse(data1$white==1, white, ifelse(data1$black==1, black, ifelse(data1$hispanic==1, hispanic, ifelse(data1$asian==1, asian, other )))) 7
8 Subsetting Data To pull out a subset of observations use the subset command Create a new data frame based on the original data_new <- subset(dataset_old, condition) e.g. data2 <- subset(data1, data1$male==1) Like Stata, R distinguishes between the logical equals (==) and the assignment equals (=) Summary Statistics R has functions for all the summary statistics you care to present N: length() Mean: mean() Standard Deviation: sd() Sum: sum() Minimum: min() Maximum: max() However, it lacks a single, easy function to create a table of descriptive statistics No equivalent of summarize in Stata) Summary Statistics The solution is to write your own You can create your own functions in R using the function() command Here is mine: summarize <- function(var){ n <- length(var) m <- mean(var) s <- sd(var) v <- sum(var) return(cbind(n, m, s, v)) } 8
9 Summary Statistics To create a table of summary statistics 1. Run the summarize command in R 2. Copy the output to Word 3. Replace spaces with tabs The special character for tab is ^t May need to replace two adjacent tabs (^t^t) with a single tab The special character for paragraph is ^p 4. Copy to Excel 5. Format in Excel Univariate Statistics Categorical variables Contingency tables Chi-square tests Continuous variables T-tests ANOVA Contingency Tables To obtain a tabulation of data, use the table() command For example, to get a tabulation of HLA A and B mismatches table(data1$abmm) Note that the output is not formatted neatly Do your formatting in Excel 9
10 Contingency Tables To get the percents, you can find the total observations using the length command and the divide the table table(data1$abmm)/length(data1$abmm) Contingency Tables To get a cross-tabulation, include both variables e.g. To study whether mortality differs between men and women: > table(data1$male, data1$died) Chi-square Test To test whether the difference in the distribution is significant, use the chisq.test() command Note that this command is applied to the table, not the variables > chisq.test(table(data1$male, data1$died)) Pearson's Chi-squared test with Yates' continuity correction data: table(data1$male, data1$died) X-squared = , df = 1, p-value =
11 T-Test To compare the means between two groups, use the t.test() command Syntax is t.test(vdepvar ~ indvar) For example, to test whether the LOS differs between men and women: t.test(data1$los ~ data1$male) Welch Two Sample t-test data: data1$los by data1$male t = , df = , p-value = alternative hypothesis: true difference in means is not equal to 0 95 percent confidence interval: sample estimates: mean in group 0 mean in group ANOVA To compare a continuous variable across more than two groups use the command aov() For example, to test whether LOS varies over HLA A and B mismatches: anova1 <- aov(data1$los ~ data1$abmm) summary(anova1) 11
12 ANOVA aov() creates an ANOVA object called anova1 summary() gives us the results > anova1 <- aov(data1$los ~ data1$abmm) > summary(anova1) Df Sum Sq Mean Sq F value Pr(>F) data1$abmm *** Residuals Signif. codes: 0 *** ** 0.01 * Missing Data R stores missing values as NA R cannot run analyses on missing data It will not produce output and will not tell you why You must identify variables with missing data and drop those observations Use the is.na() function to identify an observation with missing values Missing Data Example: Assume you can tell that the male dummy variable has some missing values. You need to drop those observations Strategy 1: Create a subset with no missing values data2 <- subset(data1, is.na(data1$male)==false) t.test(data2$los ~ data2$male) Strategy 2: Use original data set but limit the analysis to non-missing observations t.test(data1$los[is.na(data1$male)==false] ~ data1$male[is.na(data1$male)==false]) 12
13 Graphics in R R has fantastic graphing capabilities Almost any aspect of the graph can be controlled by the user The opposite of the Stata approach We will discuss three types of graphs Histograms Scatterplots Boxplots Histogram To summarize the distribution of a continuous variable, plot a histogram The R command is hist() To summarize the age of transplant recipients: hist(data1$age) Histogram of data1$age Frequency data1$age 13
14 Histograms Can improve this by changing options: main= Main Title ylab= Y axis label xlab= X axis label col= Gives the boxes color box() Adds a box around the inner plot R graphics are extremely flexible and customizable! hist(data1$cci_index, ylab="frequency", xlab="comorbidities", main="distribution of Charlson Comorbidity Index", col="red, breaks=25) box() Distribution of Age Frequency age 14
15 Scatterplot To create a scatterplot use the plot() function For example, to look at the correlation between one-year cost and five-year cost plot(data1$age, data1$los) data1$los data1$age Scatterplot Again we can customize with additional options pch=x Change the plot symbol xlim=c(lower,upper) Set x axis limits ylim=c(lower,upper) Set y axis limits cex=x Symbol scaling factor Default is 1, Half size is.5, Double size is 2, etc. 15
16 plot(data1$age, data1$los, ylab= Total Length of Stay", xlab= Age at Transplant", main="correlation Between Age and LOS", pch=16, cex=1.25, xlim=c(0,80), ylim=c(0,400)) Correlation Between Age and LOS Total Length of Stay Age at Transplant Boxplot To summarize distributions across strata Use boxplot(variable ~ strata) command For example, to plot the distribution of costs across stage: boxplot(data1$age ~ data1$abmm) 16
17 Boxplot We can customize outline=false Turn off outliers notch=true Adds notches to the boxes col= color Fills boxes with color boxplot(data1$los ~ data1$abmm, outline=false, notch=true, ylim=c(0,60), xlab="hla-a and -B Mismatches", ylab="inpatient Length of Stay", col= turquoise") Inpatient Length of Stay HLA A and B Mismatches 17
18 Annotating Graphs R has a few functions that make annotating graphs particularly easy 1. Add a vertical or horizontal line abline(v=x) puts a vertical line at x abline(h=y) puts a horizontal line at y We usually add these lines to the zeros or a null value 2. Add text to a graph text(x, y, text ) puts text at (x,y) We frequently add p-values to graphs Exporting Graphics Graphics can be exported directly from Rstudio PDF is the preferred format Export Save as PDF Other formats are available (eps, tiff, etc.) Export Save as image Homework Revisit the homework from Lecture 2 Do it all in R 18
Dr. Barbara Morgan Quantitative Methods
Dr. Barbara Morgan Quantitative Methods 195.650 Basic Stata This is a brief guide to using the most basic operations in Stata. Stata also has an on-line tutorial. At the initial prompt type tutorial. In
More informationAn Introduction to R- Programming
An Introduction to R- Programming Hadeel Alkofide, Msc, PhD NOT a biostatistician or R expert just simply an R user Some slides were adapted from lectures by Angie Mae Rodday MSc, PhD at Tufts University
More informationRegression Lab 1. The data set cholesterol.txt available on your thumb drive contains the following variables:
Regression Lab The data set cholesterol.txt available on your thumb drive contains the following variables: Field Descriptions ID: Subject ID sex: Sex: 0 = male, = female age: Age in years chol: Serum
More information8. MINITAB COMMANDS WEEK-BY-WEEK
8. MINITAB COMMANDS WEEK-BY-WEEK In this section of the Study Guide, we give brief information about the Minitab commands that are needed to apply the statistical methods in each week s study. They are
More informationLECTURE NOTES FOR ECO231 COMPUTER APPLICATIONS I. Part Two. Introduction to R Programming. RStudio. November Written by. N.
LECTURE NOTES FOR ECO231 COMPUTER APPLICATIONS I Part Two Introduction to R Programming RStudio November 2016 Written by N.Nilgün Çokça Introduction to R Programming 5 Installing R & RStudio 5 The R Studio
More informationAn Introduction to the R Commander
An Introduction to the R Commander BIO/MAT 460, Spring 2011 Christopher J. Mecklin Department of Mathematics & Statistics Biomathematics Research Group Murray State University Murray, KY 42071 christopher.mecklin@murraystate.edu
More informationBIOSTATISTICS LABORATORY PART 1: INTRODUCTION TO DATA ANALYIS WITH STATA: EXPLORING AND SUMMARIZING DATA
BIOSTATISTICS LABORATORY PART 1: INTRODUCTION TO DATA ANALYIS WITH STATA: EXPLORING AND SUMMARIZING DATA Learning objectives: Getting data ready for analysis: 1) Learn several methods of exploring the
More informationBrief Guide on Using SPSS 10.0
Brief Guide on Using SPSS 10.0 (Use student data, 22 cases, studentp.dat in Dr. Chang s Data Directory Page) (Page address: http://www.cis.ysu.edu/~chang/stat/) I. Processing File and Data To open a new
More informationTYPES OF VARIABLES, STRUCTURE OF DATASETS, AND BASIC STATA LAYOUT
PRIMER FOR ACS OUTCOMES RESEARCH COURSE: TYPES OF VARIABLES, STRUCTURE OF DATASETS, AND BASIC STATA LAYOUT STEP 1: Install STATA statistical software. STEP 2: Read through this primer and complete the
More informationAcaStat User Manual. Version 8.3 for Mac and Windows. Copyright 2014, AcaStat Software. All rights Reserved.
AcaStat User Manual Version 8.3 for Mac and Windows Copyright 2014, AcaStat Software. All rights Reserved. http://www.acastat.com Table of Contents INTRODUCTION... 5 GETTING HELP... 5 INSTALLATION... 5
More informationLAB #1: DESCRIPTIVE STATISTICS WITH R
NAVAL POSTGRADUATE SCHOOL LAB #1: DESCRIPTIVE STATISTICS WITH R Statistics (OA3102) Lab #1: Descriptive Statistics with R Goal: Introduce students to various R commands for descriptive statistics. Lab
More informationApplied Regression Modeling: A Business Approach
i Applied Regression Modeling: A Business Approach Computer software help: SAS SAS (originally Statistical Analysis Software ) is a commercial statistical software package based on a powerful programming
More informationSubject. Creating a diagram. Dataset. Importing the data file. Descriptive statistics with TANAGRA.
Subject Descriptive statistics with TANAGRA. The aim of descriptive statistics is to describe the main features of a collection of data in quantitative terms 1. The visualization of the whole data table
More informationSPSS. (Statistical Packages for the Social Sciences)
Inger Persson SPSS (Statistical Packages for the Social Sciences) SHORT INSTRUCTIONS This presentation contains only relatively short instructions on how to perform basic statistical calculations in SPSS.
More informationR syntax guide. Richard Gonzalez Psychology 613. August 27, 2015
R syntax guide Richard Gonzalez Psychology 613 August 27, 2015 This handout will help you get started with R syntax. There are obviously many details that I cannot cover in these short notes but these
More informationMinitab 17 commands Prepared by Jeffrey S. Simonoff
Minitab 17 commands Prepared by Jeffrey S. Simonoff Data entry and manipulation To enter data by hand, click on the Worksheet window, and enter the values in as you would in any spreadsheet. To then save
More informationBasic Stata Tutorial
Basic Stata Tutorial By Brandon Heck Downloading Stata To obtain Stata, select your country of residence and click Go. Then, assuming you are a student, click New Educational then click Students. The capacity
More informationIntroduction to Stata Toy Program #1 Basic Descriptives
Introduction to Stata 2018-19 Toy Program #1 Basic Descriptives Summary The goal of this toy program is to get you in and out of a Stata session and, along the way, produce some descriptive statistics.
More informationStata version 13. First Session. January I- Launching and Exiting Stata Launching Stata Exiting Stata..
Stata version 13 January 2015 I- Launching and Exiting Stata... 1. Launching Stata... 2. Exiting Stata.. II - Toolbar, Menu bar and Windows.. 1. Toolbar Key.. 2. Menu bar Key..... 3. Windows..... III -...
More informationStata version 12. Lab Session 1 February Preliminary: How to Screen Capture.. 2. Preliminary: How to Keep a Log of Your Stata Session..
Stata version 12 Lab Session 1 February 2013 1. Preliminary: How to Screen Capture.. 2. Preliminary: How to Keep a Log of Your Stata Session.. 3. Preliminary: How to Save a Stata Graph... 4. Enter Data:
More informationR Workshop Guide. 1 Some Programming Basics. 1.1 Writing and executing code in R
R Workshop Guide This guide reviews the examples we will cover in today s workshop. It should be a helpful introduction to R, but for more details, you can access a more extensive user guide for R on the
More informationSAS Training Spring 2006
SAS Training Spring 2006 Coxe/Maner/Aiken Introduction to SAS: This is what SAS looks like when you first open it: There is a Log window on top; this will let you know what SAS is doing and if SAS encountered
More informationI Launching and Exiting Stata. Stata will ask you if you would like to check for updates. Update now or later, your choice.
I Launching and Exiting Stata 1. Launching Stata Stata can be launched in either of two ways: 1) in the stata program, click on the stata application; or 2) double click on the short cut that you have
More informationStata v 12 Illustration. First Session
Launch Stata PC Users Stata v 12 Illustration Mac Users START > ALL PROGRAMS > Stata; or Double click on the Stata icon on your desktop APPLICATIONS > STATA folder > Stata; or Double click on the Stata
More informationR for IR. Created by Narren Brown, Grinnell College, and Diane Saphire, Trinity University
R for IR Created by Narren Brown, Grinnell College, and Diane Saphire, Trinity University For presentation at the June 2013 Meeting of the Higher Education Data Sharing Consortium Table of Contents I.
More informationIQR = number. summary: largest. = 2. Upper half: Q3 =
Step by step box plot Height in centimeters of players on the 003 Women s Worldd Cup soccer team. 157 1611 163 163 164 165 165 165 168 168 168 170 170 170 171 173 173 175 180 180 Determine the 5 number
More informationSPSS QM II. SPSS Manual Quantitative methods II (7.5hp) SHORT INSTRUCTIONS BE CAREFUL
SPSS QM II SHORT INSTRUCTIONS This presentation contains only relatively short instructions on how to perform some statistical analyses in SPSS. Details around a certain function/analysis method not covered
More informationIntro to Stata. University of Virginia Library data.library.virginia.edu. September 16, 2014
to 1/12 Intro to University of Virginia Library data.library.virginia.edu September 16, 2014 Getting to Know to 2/12 Strengths Available A full-featured statistical programming language For Windows, Mac
More informationA Short Guide to Stata 10 for Windows
A Short Guide to Stata 10 for Windows 1. Introduction 2 2. The Stata Environment 2 3. Where to get help 2 4. Opening and Saving Data 3 5. Importing Data 4 6. Data Manipulation 5 7. Descriptive Statistics
More informationIntroduction to Minitab 1
Introduction to Minitab 1 We begin by first starting Minitab. You may choose to either 1. click on the Minitab icon in the corner of your screen 2. go to the lower left and hit Start, then from All Programs,
More informationResearch Methods for Business and Management. Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel
Research Methods for Business and Management Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel A Simple Example- Gym Purpose of Questionnaire- to determine the participants involvement
More informationIntroduction to R. Introduction to Econometrics W
Introduction to R Introduction to Econometrics W3412 Begin Download R from the Comprehensive R Archive Network (CRAN) by choosing a location close to you. Students are also recommended to download RStudio,
More informationPractical 2: Plotting
Practical 2: Plotting Complete this sheet as you work through it. If you run into problems, then ask for help - don t skip sections! Open Rstudio and store any files you download or create in a directory
More informationExcel 2010 with XLSTAT
Excel 2010 with XLSTAT J E N N I F E R LE W I S PR I E S T L E Y, PH.D. Introduction to Excel 2010 with XLSTAT The layout for Excel 2010 is slightly different from the layout for Excel 2007. However, with
More informationSTATA 13 INTRODUCTION
STATA 13 INTRODUCTION Catherine McGowan & Elaine Williamson LONDON SCHOOL OF HYGIENE & TROPICAL MEDICINE DECEMBER 2013 0 CONTENTS INTRODUCTION... 1 Versions of STATA... 1 OPENING STATA... 1 THE STATA
More informationIntroduction to R. UCLA Statistical Consulting Center R Bootcamp. Irina Kukuyeva September 20, 2010
UCLA Statistical Consulting Center R Bootcamp Irina Kukuyeva ikukuyeva@stat.ucla.edu September 20, 2010 Outline 1 Introduction 2 Preliminaries 3 Working with Vectors and Matrices 4 Data Sets in R 5 Overview
More informationIntroduction to R. Andy Grogan-Kaylor October 22, Contents
Introduction to R Andy Grogan-Kaylor October 22, 2018 Contents 1 Background 2 2 Introduction 2 3 Base R and Libraries 3 4 Working Directory 3 5 Writing R Code or Script 4 6 Graphical User Interface 4 7
More informationQuick Start Guide Jacob Stolk PhD Simone Stolk MPH November 2018
Quick Start Guide Jacob Stolk PhD Simone Stolk MPH November 2018 Contents Introduction... 1 Start DIONE... 2 Load Data... 3 Missing Values... 5 Explore Data... 6 One Variable... 6 Two Variables... 7 All
More informationImporting and visualizing data in R. Day 3
Importing and visualizing data in R Day 3 R data.frames Like pandas in python, R uses data frame (data.frame) object to support tabular data. These provide: Data input Row- and column-wise manipulation
More informationA (very) brief introduction to R
A (very) brief introduction to R You typically start R at the command line prompt in a command line interface (CLI) mode. It is not a graphical user interface (GUI) although there are some efforts to produce
More informationBluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition
Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Online Learning Centre Technology Step-by-Step - Minitab Minitab is a statistical software application originally created
More informationLAB 1 INSTRUCTIONS DESCRIBING AND DISPLAYING DATA
LAB 1 INSTRUCTIONS DESCRIBING AND DISPLAYING DATA This lab will assist you in learning how to summarize and display categorical and quantitative data in StatCrunch. In particular, you will learn how to
More informationfile:///users/williams03/a/workshops/2015.march/final/intro_to_r.html
Intro to R R is a functional programming language, which means that most of what one does is apply functions to objects. We will begin with a brief introduction to R objects and how functions work, and
More informationSelected Introductory Statistical and Data Manipulation Procedures. Gordon & Johnson 2002 Minitab version 13.
Minitab@Oneonta.Manual: Selected Introductory Statistical and Data Manipulation Procedures Gordon & Johnson 2002 Minitab version 13.0 Minitab@Oneonta.Manual: Selected Introductory Statistical and Data
More informationA Quick Guide to Stata 8 for Windows
Université de Lausanne, HEC Applied Econometrics II Kurt Schmidheiny October 22, 2003 A Quick Guide to Stata 8 for Windows 2 1 Introduction A Quick Guide to Stata 8 for Windows This guide introduces the
More informationModule 1: Introduction RStudio
Module 1: Introduction RStudio Contents Page(s) Installing R and RStudio Software for Social Network Analysis 1-2 Introduction to R Language/ Syntax 3 Welcome to RStudio 4-14 A. The 4 Panes 5 B. Calculator
More informationStata version 14 Also works for versions 13 & 12. Lab Session 1 February Preliminary: How to Screen Capture..
Stata version 14 Also works for versions 13 & 12 Lab Session 1 February 2016 1. Preliminary: How to Screen Capture.. 2. Preliminary: How to Keep a Log of Your Stata Session.. 3. Preliminary: How to Save
More informationAn Introduction to R 2.2 Statistical graphics
An Introduction to R 2.2 Statistical graphics Dan Navarro (daniel.navarro@adelaide.edu.au) School of Psychology, University of Adelaide ua.edu.au/ccs/people/dan DSTO R Workshop, 29-Apr-2015 Scatter plots
More informationIntroduction to StatsDirect, 15/03/2017 1
INTRODUCTION TO STATSDIRECT PART 1... 2 INTRODUCTION... 2 Why Use StatsDirect... 2 ACCESSING STATSDIRECT FOR WINDOWS XP... 4 DATA ENTRY... 5 Missing Data... 6 Opening an Excel Workbook... 6 Moving around
More informationINSTRUCTIONS FOR USING MICROSOFT EXCEL PERFORMING DESCRIPTIVE AND INFERENTIAL STATISTICS AND GRAPHING
APPENDIX INSTRUCTIONS FOR USING MICROSOFT EXCEL PERFORMING DESCRIPTIVE AND INFERENTIAL STATISTICS AND GRAPHING (Developed by Dr. Dale Vogelien, Kennesaw State University) ** For a good review of basic
More informationWeek 1: Introduction to Stata
Week 1: Introduction to Stata Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ALL RIGHTS RESERVED 1 Outline Log
More informationYou will learn: The structure of the Stata interface How to open files in Stata How to modify variable and value labels How to manipulate variables
Jennie Murack You will learn: The structure of the Stata interface How to open files in Stata How to modify variable and value labels How to manipulate variables How to conduct basic descriptive statistics
More informationApplied Regression Modeling: A Business Approach
i Applied Regression Modeling: A Business Approach Computer software help: SPSS SPSS (originally Statistical Package for the Social Sciences ) is a commercial statistical software package with an easy-to-use
More informationAn Introductory Guide to Stata
An Introductory Guide to Stata Scott L. Minkoff Assistant Professor Department of Political Science Barnard College sminkoff@barnard.edu Updated: July 9, 2012 1 TABLE OF CONTENTS ABOUT THIS GUIDE... 4
More informationExploring and Understanding Data Using R.
Exploring and Understanding Data Using R. Loading the data into an R data frame: variable
More informationIntroduction to R and R-Studio Toy Program #1 R Essentials. This illustration Assumes that You Have Installed R and R-Studio
Introduction to R and R-Studio 2018-19 Toy Program #1 R Essentials This illustration Assumes that You Have Installed R and R-Studio If you have not already installed R and RStudio, please see: Windows
More informationMinitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D.
Minitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D. Introduction to Minitab The interface for Minitab is very user-friendly, with a spreadsheet orientation. When you first launch Minitab, you will see
More information1. Basic Steps for Data Analysis Data Editor. 2.4.To create a new SPSS file
1 SPSS Guide 2009 Content 1. Basic Steps for Data Analysis. 3 2. Data Editor. 2.4.To create a new SPSS file 3 4 3. Data Analysis/ Frequencies. 5 4. Recoding the variable into classes.. 5 5. Data Analysis/
More informationExploratory data analysis with one and two variables
Exploratory data analysis with one and two variables Instructions for Lab # 1 Statistics 111 - Probability and Statistical Inference DUE DATE: Upload on Sakai on July 10 Lab Objective To explore data with
More informationIndex. Bar charts, 106 bartlett.test function, 159 Bottles dataset, 69 Box plots, 113
Index A Add-on packages information page, 186 187 Linux users, 191 Mac users, 189 mirror sites, 185 Windows users, 187 aggregate function, 62 Analysis of variance (ANOVA), 152 anova function, 152 as.data.frame
More informationHow to extract suicide statistics by country from the. WHO Mortality Database Online Tool
Instructions for users How to extract suicide statistics by country from the WHO Mortality Database Online Tool This guide explains how to access suicide statistics and make graphs and tables, or export
More informationAfter opening Stata for the first time: set scheme s1mono, permanently
Stata 13 HELP Getting help Type help command (e.g., help regress). If you don't know the command name, type lookup topic (e.g., lookup regression). Email: tech-support@stata.com. Put your Stata serial
More informationTable of Contents (As covered from textbook)
Table of Contents (As covered from textbook) Ch 1 Data and Decisions Ch 2 Displaying and Describing Categorical Data Ch 3 Displaying and Describing Quantitative Data Ch 4 Correlation and Linear Regression
More informationData Analyst Nanodegree Syllabus
Data Analyst Nanodegree Syllabus Discover Insights from Data with Python, R, SQL, and Tableau Before You Start Prerequisites : In order to succeed in this program, we recommend having experience working
More informationData Analyst Nanodegree Syllabus
Data Analyst Nanodegree Syllabus Discover Insights from Data with Python, R, SQL, and Tableau Before You Start Prerequisites : In order to succeed in this program, we recommend having experience working
More informationECO375 Tutorial 1 Introduction to Stata
ECO375 Tutorial 1 Introduction to Stata Matt Tudball University of Toronto Mississauga September 14, 2017 Matt Tudball (University of Toronto) ECO375H5 September 14, 2017 1 / 25 What Is Stata? Stata is
More informationInternational Graduate School of Genetic and Molecular Epidemiology (GAME) Computing Notes and Introduction to Stata
International Graduate School of Genetic and Molecular Epidemiology (GAME) Computing Notes and Introduction to Stata Paul Dickman September 2003 1 A brief introduction to Stata Starting the Stata program
More informationHomework 1 Excel Basics
Homework 1 Excel Basics Excel is a software program that is used to organize information, perform calculations, and create visual displays of the information. When you start up Excel, you will see the
More informationWELCOME! Lecture 3 Thommy Perlinger
Quantitative Methods II WELCOME! Lecture 3 Thommy Perlinger Program Lecture 3 Cleaning and transforming data Graphical examination of the data Missing Values Graphical examination of the data It is important
More informationPDQ-Notes. Reynolds Farley. PDQ-Note 3 Displaying Your Results in the Expert Query Window
PDQ-Notes Reynolds Farley PDQ-Note 3 Displaying Your Results in the Expert Query Window PDQ-Note 3 Displaying Your Results in the Expert Query Window Most of your options for configuring your query results
More informationAPI-202 Empirical Methods II Spring 2004 A SHORT INTRODUCTION TO STATA 8.0
API-202 Empirical Methods II Spring 2004 A SHORT INTRODUCTION TO STATA 8.0 Course materials and data sets will assume that you are using Stata to complete the analysis. Stata is available on all of the
More informationChecking whether the protocol was followed: gender and age 51
Checking whether the protocol was followed: gender and age 51 Session 4: Checking whether the protocol was followed: gender and age In the data cleaning workbook there are two worksheets which form the
More informationExcel Tips and FAQs - MS 2010
BIOL 211D Excel Tips and FAQs - MS 2010 Remember to save frequently! Part I. Managing and Summarizing Data NOTE IN EXCEL 2010, THERE ARE A NUMBER OF WAYS TO DO THE CORRECT THING! FAQ1: How do I sort my
More informationGetting started with Stata 2017: Cheat-sheet
Getting started with Stata 2017: Cheat-sheet 4. september 2017 1 Get started Graphical user interface (GUI). Clickable. Simple. Commands. Allows for use of do-le. Easy to keep track. Command window: Write
More informationAcaStat User Manual. Version 10 for Mac and Windows. Copyright 2018, AcaStat Software. All rights Reserved.
AcaStat User Manual Version 10 for Mac and Windows Copyright 2018, AcaStat Software. All rights Reserved. http://www.acastat.com Table of Contents NEW IN VERSION 10... 6 INTRODUCTION... 7 GETTING HELP...
More informationHow to Download and Install R The R software can be downloaded from: Click on download R link.
Tutorial 1: Getting Acquainted with R (Windows version) How to Download and Install R The R software can be downloaded from: www.r-project.org. Click on download R link. Choose a CRAN mirror from a location
More informationBasics of Plotting Data
Basics of Plotting Data Luke Chang Last Revised July 16, 2010 One of the strengths of R over other statistical analysis packages is its ability to easily render high quality graphs. R uses vector based
More informationIBMSPSSSTATL1P: IBM SPSS Statistics Level 1
SPSS IBMSPSSSTATL1P IBMSPSSSTATL1P: IBM SPSS Statistics Level 1 Version: 4.4 QUESTION NO: 1 Which statement concerning IBM SPSS Statistics application windows is correct? A. At least one Data Editor window
More informationBIO 360: Vertebrate Physiology Lab 9: Graphing in Excel. Lab 9: Graphing: how, why, when, and what does it mean? Due 3/26
Lab 9: Graphing: how, why, when, and what does it mean? Due 3/26 INTRODUCTION Graphs are one of the most important aspects of data analysis and presentation of your of data. They are visual representations
More informationEcon Stata Tutorial I: Reading, Organizing and Describing Data. Sanjaya DeSilva
Econ 329 - Stata Tutorial I: Reading, Organizing and Describing Data Sanjaya DeSilva September 8, 2008 1 Basics When you open Stata, you will see four windows. 1. The Results window list all the commands
More informationGraph tool instructions and R code
Graph tool instructions and R code 1) Prepare data: tab-delimited format Data need to be inputted in a tab-delimited format. This can be easily achieved by preparing the data in a spread sheet program
More informationMINITAB 17 BASICS REFERENCE GUIDE
MINITAB 17 BASICS REFERENCE GUIDE Dr. Nancy Pfenning September 2013 After starting MINITAB, you'll see a Session window above and a worksheet below. The Session window displays non-graphical output such
More informationStata versions 12 & 13 Week 4 Practice Problems
Stata versions 12 & 13 Week 4 Practice Problems SOLUTIONS 1 Practice Screen Capture a Create a word document Name it using the convention lastname_lab1docx (eg bigelow_lab1docx) b Using your browser, go
More informationA Short Introduction to STATA
A Short Introduction to STATA 1) Introduction: This session serves to link everyone from theoretical equations to tangible results under the amazing promise of Stata! Stata is a statistical package that
More informationBasic Medical Statistics Course
Basic Medical Statistics Course S0 SPSS Intro November 2013 Wilma Heemsbergen w.heemsbergen@nki.nl 1 13.00 ~ 15.30 Database (20 min) SPSS (40 min) Short break Exercise (60 min) This Afternoon During the
More informationIntro to Stata for Political Scientists
Intro to Stata for Political Scientists Andrew S. Rosenberg Junior PRISM Fellow Department of Political Science Workshop Description This is an Introduction to Stata I will assume little/no prior knowledge
More informationIntroduction to R (BaRC Hot Topics)
Introduction to R (BaRC Hot Topics) George Bell September 30, 2011 This document accompanies the slides from BaRC s Introduction to R and shows the use of some simple commands. See the accompanying slides
More informationAn Introduction to Stata Part II: Data Analysis
An Introduction to Stata Part II: Data Analysis Kerry L. Papps 1. Overview Do-files Sorting a dataset Combining datasets Creating a dataset of means or medians etc. Weights Panel data capabilities Dummy
More informationIntroduction to Stata First Session. I- Launching and Exiting Stata Launching Stata Exiting Stata..
Introduction to Stata 2016-17 01. First Session I- Launching and Exiting Stata... 1. Launching Stata... 2. Exiting Stata.. II - Toolbar, Menu bar and Windows.. 1. Toolbar Key.. 2. Menu bar Key..... 3.
More informationANSWERS -- Prep for Psyc350 Laboratory Final Statistics Part Prep a
ANSWERS -- Prep for Psyc350 Laboratory Final Statistics Part Prep a Put the following data into an spss data set: Be sure to include variable and value labels and missing value specifications for all variables
More informationStata: A Brief Introduction Biostatistics
Stata: A Brief Introduction Biostatistics 140.621 2005-2006 1. Statistical Packages There are many statistical packages (Stata, SPSS, SAS, Splus, etc.) Statistical packages can be used for Analysis Data
More informationECOLOGY OF AMPHIBIANS & REPTILES. A Statistical R Primer
ECOLOGY OF AMPHIBIANS & REPTILES A Statistical R Primer Jeffrey Row, Stephen Lougheed & Grégory Bulté Queen s University Biological Station May, 2017 Row, Lougheed & Bulté; Ecology of Amphibians and Reptiles
More informationfor statistical analyses
Using for statistical analyses Robert Bauer Warnemünde, 05/16/2012 Day 6 - Agenda: non-parametric alternatives to t-test and ANOVA (incl. post hoc tests) Wilcoxon Rank Sum/Mann-Whitney U-Test Kruskal-Wallis
More informationLecture 1: Statistical Reasoning 2. Lecture 1. Simple Regression, An Overview, and Simple Linear Regression
Lecture Simple Regression, An Overview, and Simple Linear Regression Learning Objectives In this set of lectures we will develop a framework for simple linear, logistic, and Cox Proportional Hazards Regression
More informationEmpirical Reasoning Center R Workshop (Summer 2016) Session 1. 1 Writing and executing code in R. 1.1 A few programming basics
Empirical Reasoning Center R Workshop (Summer 2016) Session 1 This guide reviews the examples we will cover in today s workshop. It should be a helpful introduction to R, but for more details, the ERC
More informationProduct Catalog. AcaStat. Software
Product Catalog AcaStat Software AcaStat AcaStat is an inexpensive and easy-to-use data analysis tool. Easily create data files or import data from spreadsheets or delimited text files. Run crosstabulations,
More informationUnit 1 Review of BIOSTATS 540 Practice Problems SOLUTIONS - Stata Users
BIOSTATS 640 Spring 2018 Review of Introductory Biostatistics STATA solutions Page 1 of 13 Key Comments begin with an * Commands are in bold black I edited the output so that it appears here in blue Unit
More informationInstall RStudio from - use the standard installation.
Session 1: Reading in Data Before you begin: Install RStudio from http://www.rstudio.com/ide/download/ - use the standard installation. Go to the course website; http://faculty.washington.edu/kenrice/rintro/
More informationStep-by-step user instructions to the hamlet-package
Step-by-step user instructions to the hamlet-package Teemu Daniel Laajala May 26, 2018 Contents 1 Analysis workflow 2 2 Loading data into R 2 2.1 Excel format data.......................... 4 2.2 CSV-files...............................
More informationBasic Medical Statistics Course
Basic Medical Statistics Course S0 SPSS Intro December 2014 Wilma Heemsbergen w.heemsbergen@nki.nl This Afternoon 13.00 ~ 15.00 SPSS lecture Short break Exercise 2 Database Example 3 Types of data Type
More information