R commander an introduction

Size: px
Start display at page:

Download "R commander an introduction"

Transcription

1 R commander an introduction free, user-friendly, and powerful software Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

2 Useful sites R is a free software with powerful tools The Comprehensive R Archives Network -> Windows -> base -> Download R for Windows Textbook : Simple R by John Verzani

3 Features of R R is free. R is open-source and runs on UNIX, Windows and Macintosh. R has an excellent built-in help system. R has excellent graphing capabilities. Students can easily migrate to the commercially supported S-Plus program if commercial software is desired. R's language has a powerful, easy to learn syntax with many built-in statistical functions. The language is easy to extend with user-written functions. R is a computer programming language. For programmers it will feel more familiar than others and for new computer users, the next leap to programming will not be so large.

4 Starting the R

5 Data manipulation Data input Data types Importing data Exporting data Viewing data Value labels Missing data Data management Variables Operators Sorting data Merging data Subsetting data Source: (Quick r)

6 [Data Input] Data types Vectors a <- c(1,2,5.3,6,-2,4) #numeric vector b <- c("one","two","three") #character vector c <- c(true,true,true,false,true,false) #logical vector a[c(2,4)] #2nd and 4th elements of vector Matrices # generates 5 x 4 numeric matrix y<-matrix(1:20, nrow=5,ncol=4) y[,4] # 4th column of matrix y[3,] # 3rd row of matrix y[2:4,1:3] # rows 2,3,4 of columns 1,2,3

7 [Data Input] Data types Dataframes d <- c(1,2,3,4) e <- c("red", "white", "red", NA) f <- c(true,true,true,false) mydata <- data.frame(d,e,f) names(mydata) <- c("id","color","passed") # variable names Lists # example of a list with 4 components - # a string, a numeric vector, a matrix, and a scaler w <- list(name="fred", mynumbers=a, mymatrix=y, age=5.3) Factors gender <- c(rep("male",20), rep("female", 30)) gender <- factor(gender) # R now treats gender as a nominal variable summary(gender)

8 [Data Input] Importing data From CSV file malaria <-read.table("c:\\r_data\\malaria.csv", header=true, sep=",") From Excel library(rodbc) channel <- odbcconnectexcel("c:\\r_data\\malaria.xls") malaria <- sqlfetch(channel, "mal") *odbcconnectexcel is only usable with 32-bit Windows From txt file malaria <- read.table("c:\\ R_data\\malaria.txt", header=true, sep="\t")

9 [Data Input] Exporting data To an CSV file write.table(malaria, "C:\\ R_data\\mal01.csv", row.names=f) To a tab delimited text file write.table(malaria, "C:\\ R_data\\mal02.txt", sep="\t", row.names=f)

10 Viewing data ls() # list objects in the working environment names(malaria) # list the variables in malaria str(malaria) # list the structure of malaria levels(malaria $v1) # list levels of factor v1 in malaria malaria$v1<-factor(malaria$mal) dim(malaria) # dimensions of an malaria class(malaria) # class of an malaria (numeric, matrix, dataframe, etc) malaria # print malaria head(malaria, n=10) # print first 10 rows of malaria tail(malaria, n=5) # print last 5 rows of malaria summary(malaria)

11 Value labels # variable v1 is coded 1, 2 or 3 # we want to attach value labels 1=red, 2=blue, 3=green v1<-c(1,1,1,2,2,3) v2 <- factor(v1, levels = c(1,2,3), labels = c("red", "blue", "green"))

12 Missing data Testing for missing values y <- c(1,2,3,na) is.na(y) # returns a vector (F F F T) Recoding values to missing malaria[malaria$age==99, age"] <- NA Excluding missing values from analyses x <- c(1,2,na,3) mean(x) # returns NA mean(x, na.rm=true) # returns 2

13 Help > help(mean) >?mean

14 Data manipulation Data input Data types Importing data Exporting data Viewing data Value labels Missing data Data management Variables Operators Sorting data Merging data Subsetting data

15 [Data management] Variables Recoding variables # create 2 age categories malaria$agecat <- ifelse(malaria$age >7, c( student"), c( baby")) attach(malaria) malaria$agecat2[age > 7] <- "student" malaria$agecat2[age <= 7] <- "baby" detach(malaria)

16 [Data management] Operators Comparison operators == equals!= not equals <= less than or equals >= greater than or equals = assignment (same as <- ) Logical operators & and or! not

17 [Data management] Sorting Data # sort by mal newdata <- malaria[order(malaria$mal),] # sort by mal and age newdata2 <- malaria[order(malaria$mal, malaria$age),] #sort by mal (ascending) and age (descending) newdata3 <- malaria[order(malaria$mal, -malaria$age),] Avoid Attach command when sorting the data

18 [Data management] Merging Data Raw dataset malaria2<-read.table("c:\\r_data\\malaria.csv", header=true, sep=",") Adding rows extra<-read.table ("C:\\R_data\\extra15.csv",header=T, sep=",") malaria3<-rbind(malaria2,extra) Adding columns region<-read.table ("C:\\R_data\\region.csv", header=t, sep=",") malaria4<-merge(malaria3, region, by="subject")

19 [Data management] Subsetting Data mal.1 <- subset(malaria,mal==1) summary(mal.1) mal.baby <- subset(malaria, mal == 1 & age < 8)

20 Installing R commander You need to first install R and then R commander.

21 Starting the R commander > library(rcmdr)

22 R commander windows

23 Importing datasets

24 Select the data set by clicking on this box

25 Checking continuous variables Statistics->Means options Single-sample t-test Independent samples t-test Paired t-test One-way ANOVA Multi-way ANOVA

26 Q Read Pepers.xls. Test whether the mean of angle is zero or not. Write down the null and alternative hypotheses. 26

27 single-sample t-test (Pepers.xls) Statistics > Means > Single-sample t-test (Enter the proposed mean (Null hypothesis: mu=))

28

29 1.2 Suppose that the mean of angle is already known to be 2. And you wan to claim that it is not true based on your data. Write down your hypotheses for this claim. * Perform statistical test with R commander. What do you conclude? 29

30 single-sample t-test (Pepers.xls) Statistics > Summaries > Shapiro-Wilk test of normality This is a hypothesis tests with the null hypothesis that the data comes from a normal distribution.

31 Q Read pulse.xls. What kinds of test can be used to see the difference between pre and post variables. * Write down null and alternative hypothesis. 2.2 Perform parametric and non-parametric tests for the above hypothesis using R commander. What do you conclude? 31

32 paired t-test (Pepers.xls) Statistics > Means > Paired t-test

33

34 paired t-test (Pepers.xls) Statistics > nonparametric tests > Pairedsamples Wilcoxon test

35

36 Q What would be the purpose of analyzing insul.xls. 3.2 Perform explanatory analysis (Statistics>Summaries) using R commander. Interpret the results. 3.3 What are the hypotheses to compare the glucose levels of 5 groups? 3.4 Perform ANOVA using R commander and interpret the results. 3.5 Perform post-hoc analyses to explain the differences between groups. 3.6 How would you compare group A (conc=1,2) and group B (conc=4,5)? Do this using R commander. 36

37 insul.xls Effect of glucose concentration on Insulin Measured the amount of insulin secretion after administration of five different concentrations of glucose into pancreatic tissue (animal experiments) Characteristics for each group Statistics > Summaries (according to the study objective) Graphs (according to the study objective) variable conc must be declared as a factor variable! 37

38

39

40 Conc 1,2 < 3 < conc 4,5 Graphs->Boxplot

41 insul.xls One-Way ANOVA Statistics > Means > One-way ANOVA Pairwise comparisons of means Tukey post-hoc comparison procedure (default)

42

43 t-test for (1,2) vs (4,5) comparison Re-define variables Data > Manage variable in active data set > Recode variables > select conc variable New variable name or prefix for multiple recodes : new Enter recode directives 1:2=1; 3=NA; 4:5=2 conc=3 as a missing Equality of variance test should be carried out before the t-test Statistics > Variances > Two variances F-test the variances are equal Statistics > Means > Independent samples t-test Mean concentration difference between two new groups (variances are assumed to be equal) Significant Insul.xls 43

44

45 Variance ratio test of the two groups: Statistics > Variances > Two variances F-test

46 Independent samples t-test (equal variances)

47 Insul.xls Nonparametric way of comparing (1,2) vs (4,5) Statistics > Nonparametric tests > Two sample Wilcoxon test 47

48 taillite2.sav data vehtype='vehicle Type group='group - Light On=1 Light Off=2 position='light Position speedzn='speed Zone resptime='response Time follotme='following Time in Vedio Frames folltmec='following Time in Categories ; resptme(continounous) difference by Vehtype(dichotomous) variable=> Analysis of variance? Looking at only Group=1 48

49 Q Read taillite2.sav. What is the aim of analyzing this data? 4.2 Apply ANOVA to see the differences of resptime by the level of vehtype. 4.3 Test the normality assumption. 4.4 Perform a non-parametric test to check the differences of resptime by the level of vehtype. 4.5 Do log-transformation and normality check. 4.6 Do ANOVA with log transformed variable. 4.7 Perform a non-parametric test for the log transformed variable. 4.8 Compare the results of (4.2 and 4.6), (4.4 and 4.7) and explain. 49

50 taillite2.sav data Trying ANOVA Statistics > Means > One-way ANOVA Response variable : resptime, Groups : vehtype Grouping variables should be converted as factor variables (Data > Manage variable in active data set > Convert numeric variables to factors) A significant difference between Vehtypes on resptime? 50

51 taillite2.sav data Normality test Statistics > Summaries > Shapiro-Wilk test of normality For normality test for Vehtype, by(taillite2$resptime, taillite2$vehtype, shapiro.test) Reject the null!! ANOVA can not be conducted. 51

52

53 taillite2.sav data Trying nonparametric way (Kruskal-Wallis test) Statistics > Nonparametric tests > Kruskal-Wallis test p=0.259 No difference between groups! 53

54 taillite2.sav data Data > Manage variable in active data set > Compute new variable New variable name : lresp Expression to compute : log(resptime) Normality test for lresp Edit command line as by(taillite2$lresp, taillite2$vehtype, shapiro.test) 54

55

56 taillite2.sav data Trying ANOVA with lresp p=0.063 What do you conclude? 56

57 electric.xls housize = 'House Size' income = 'Family Income aircapac = 'Air Conditioning Capacity applindx = 'Appliance Index family = 'Number of Family Members peak = 'Peak Hour Electric Load' ; Aim: Selecting variables that affect the variable peak (Maximum amount of electricity) and finding the regression equation Statistics > Fit models > Linear regression Create command line first if you want to use the stepwise method for model selection (use step(model) function) 57

58 Q Read eletric.xls and explain the purpose of the analysis. 5.2 Perform step-wise regression using peak as a dependent variable. Interpret the results. (Exclude variable family) Statistics -> Fit models -> Linear Regression 58

59

60

61 3D graphics

62 Rcmdr R commander was developed as an easy to use graphical user interface (GUI) for R Rcmdr is not perfect yet, but has been updated Expecting menu screen in Korean and Korean fonts variability

Introduction to Statistics using R/Rstudio

Introduction to Statistics using R/Rstudio Introduction to Statistics using R/Rstudio R and Rstudio Getting Started Assume that R for Windows and Macs already installed on your laptop. (Instructions for installations sent) R on Windows R on MACs

More information

for statistical analyses

for statistical analyses Using for statistical analyses Robert Bauer Warnemünde, 05/16/2012 Day 6 - Agenda: non-parametric alternatives to t-test and ANOVA (incl. post hoc tests) Wilcoxon Rank Sum/Mann-Whitney U-Test Kruskal-Wallis

More information

Product Catalog. AcaStat. Software

Product Catalog. AcaStat. Software Product Catalog AcaStat Software AcaStat AcaStat is an inexpensive and easy-to-use data analysis tool. Easily create data files or import data from spreadsheets or delimited text files. Run crosstabulations,

More information

STENO Introductory R-Workshop: Loading a Data Set Tommi Suvitaival, Steno Diabetes Center June 11, 2015

STENO Introductory R-Workshop: Loading a Data Set Tommi Suvitaival, Steno Diabetes Center June 11, 2015 STENO Introductory R-Workshop: Loading a Data Set Tommi Suvitaival, tsvv@steno.dk, Steno Diabetes Center June 11, 2015 Contents 1 Introduction 1 2 Recap: Variables 2 3 Data Containers 2 3.1 Vectors................................................

More information

Lab #9: ANOVA and TUKEY tests

Lab #9: ANOVA and TUKEY tests Lab #9: ANOVA and TUKEY tests Objectives: 1. Column manipulation in SAS 2. Analysis of variance 3. Tukey test 4. Least Significant Difference test 5. Analysis of variance with PROC GLM 6. Levene test for

More information

An Introduction to the R Commander

An Introduction to the R Commander An Introduction to the R Commander BIO/MAT 460, Spring 2011 Christopher J. Mecklin Department of Mathematics & Statistics Biomathematics Research Group Murray State University Murray, KY 42071 christopher.mecklin@murraystate.edu

More information

Index. Bar charts, 106 bartlett.test function, 159 Bottles dataset, 69 Box plots, 113

Index. Bar charts, 106 bartlett.test function, 159 Bottles dataset, 69 Box plots, 113 Index A Add-on packages information page, 186 187 Linux users, 191 Mac users, 189 mirror sites, 185 Windows users, 187 aggregate function, 62 Analysis of variance (ANOVA), 152 anova function, 152 as.data.frame

More information

PSS718 - Data Mining

PSS718 - Data Mining Lecture 3 Hacettepe University, IPS, PSS October 10, 2016 Data is important Data -> Information -> Knowledge -> Wisdom Dataset a collection of data, a.k.a. matrix, table. Observation a row of a dataset,

More information

STATS PAD USER MANUAL

STATS PAD USER MANUAL STATS PAD USER MANUAL For Version 2.0 Manual Version 2.0 1 Table of Contents Basic Navigation! 3 Settings! 7 Entering Data! 7 Sharing Data! 8 Managing Files! 10 Running Tests! 11 Interpreting Output! 11

More information

SPSS. (Statistical Packages for the Social Sciences)

SPSS. (Statistical Packages for the Social Sciences) Inger Persson SPSS (Statistical Packages for the Social Sciences) SHORT INSTRUCTIONS This presentation contains only relatively short instructions on how to perform basic statistical calculations in SPSS.

More information

Learn What s New. Statistical Software

Learn What s New. Statistical Software Statistical Software Learn What s New Upgrade now to access new and improved statistical features and other enhancements that make it even easier to analyze your data. The Assistant Data Customization

More information

Regression Lab 1. The data set cholesterol.txt available on your thumb drive contains the following variables:

Regression Lab 1. The data set cholesterol.txt available on your thumb drive contains the following variables: Regression Lab The data set cholesterol.txt available on your thumb drive contains the following variables: Field Descriptions ID: Subject ID sex: Sex: 0 = male, = female age: Age in years chol: Serum

More information

Entering and Outputting Data 2 nd best TA ever: Steele H. Valenzuela February 2-6, 2015

Entering and Outputting Data 2 nd best TA ever: Steele H. Valenzuela February 2-6, 2015 Entering and Outputting Data 2 nd best TA ever: Steele H. Valenzuela February 2-6, 2015 Contents Things to Know Before You Begin.................................... 1 Entering and Outputting Data......................................

More information

Introduction to R Commander

Introduction to R Commander Introduction to R Commander 1. Get R and Rcmdr to run 2. Familiarize yourself with Rcmdr 3. Look over Rcmdr metadata (Fox, 2005) 4. Start doing stats / plots with Rcmdr Tasks 1. Clear Workspace and History.

More information

Introductory Guide to SAS:

Introductory Guide to SAS: Introductory Guide to SAS: For UVM Statistics Students By Richard Single Contents 1 Introduction and Preliminaries 2 2 Reading in Data: The DATA Step 2 2.1 The DATA Statement............................................

More information

Goals of this course. Crash Course in R. Getting Started with R. What is R? What is R? Getting you setup to use R under Windows

Goals of this course. Crash Course in R. Getting Started with R. What is R? What is R? Getting you setup to use R under Windows Oxford Spring School, April 2013 Effective Presentation ti Monday morning lecture: Crash Course in R Robert Andersen Department of Sociology University of Toronto And Dave Armstrong Department of Political

More information

Introduction to Statistical Analyses in SAS

Introduction to Statistical Analyses in SAS Introduction to Statistical Analyses in SAS Programming Workshop Presented by the Applied Statistics Lab Sarah Janse April 5, 2017 1 Introduction Today we will go over some basic statistical analyses in

More information

AcaStat User Manual. Version 8.3 for Mac and Windows. Copyright 2014, AcaStat Software. All rights Reserved.

AcaStat User Manual. Version 8.3 for Mac and Windows. Copyright 2014, AcaStat Software. All rights Reserved. AcaStat User Manual Version 8.3 for Mac and Windows Copyright 2014, AcaStat Software. All rights Reserved. http://www.acastat.com Table of Contents INTRODUCTION... 5 GETTING HELP... 5 INSTALLATION... 5

More information

Data analysis using Microsoft Excel

Data analysis using Microsoft Excel Introduction to Statistics Statistics may be defined as the science of collection, organization presentation analysis and interpretation of numerical data from the logical analysis. 1.Collection of Data

More information

A Manual for the Multivariate Permutation Test for Correlations

A Manual for the Multivariate Permutation Test for Correlations A Manual for the Multivariate Permutation Test for Correlations Jennifer Urbano Blackford, Geunyoung Kim, Niels Waller & Paul Yoder 1 Table of Contents 1 Introduction..... 1 2 Software Requirements...

More information

SPSS QM II. SPSS Manual Quantitative methods II (7.5hp) SHORT INSTRUCTIONS BE CAREFUL

SPSS QM II. SPSS Manual Quantitative methods II (7.5hp) SHORT INSTRUCTIONS BE CAREFUL SPSS QM II SHORT INSTRUCTIONS This presentation contains only relatively short instructions on how to perform some statistical analyses in SPSS. Details around a certain function/analysis method not covered

More information

36-402/608 HW #1 Solutions 1/21/2010

36-402/608 HW #1 Solutions 1/21/2010 36-402/608 HW #1 Solutions 1/21/2010 1. t-test (20 points) Use fullbumpus.r to set up the data from fullbumpus.txt (both at Blackboard/Assignments). For this problem, analyze the full dataset together

More information

Introduction to R (BaRC Hot Topics)

Introduction to R (BaRC Hot Topics) Introduction to R (BaRC Hot Topics) George Bell September 30, 2011 This document accompanies the slides from BaRC s Introduction to R and shows the use of some simple commands. See the accompanying slides

More information

Multivariate Capability Analysis

Multivariate Capability Analysis Multivariate Capability Analysis Summary... 1 Data Input... 3 Analysis Summary... 4 Capability Plot... 5 Capability Indices... 6 Capability Ellipse... 7 Correlation Matrix... 8 Tests for Normality... 8

More information

R commander an Introduction

R commander an Introduction R commander an Introduction Natasha A. Karp nk3@sanger.ac.uk May 2010 Preface This material is intended as an introductory guide to data analysis with R commander. It was produced as part of an applied

More information

Introduction to R Cmdr

Introduction to R Cmdr Introduction to R Cmdr MARS 6910 Spring 2015 David Hyrenbach Starting R Some Basic Unix Commands (http://cran.r-project.org/ doc/contrib/short-refcard.pdf) ls() show objects in search path; () default

More information

Step-by-step user instructions to the hamlet-package

Step-by-step user instructions to the hamlet-package Step-by-step user instructions to the hamlet-package Teemu Daniel Laajala May 26, 2018 Contents 1 Analysis workflow 2 2 Loading data into R 2 2.1 Excel format data.......................... 4 2.2 CSV-files...............................

More information

Organizing Your Data. Jenny Holcombe, PhD UT College of Medicine Nuts & Bolts Conference August 16, 3013

Organizing Your Data. Jenny Holcombe, PhD UT College of Medicine Nuts & Bolts Conference August 16, 3013 Organizing Your Data Jenny Holcombe, PhD UT College of Medicine Nuts & Bolts Conference August 16, 3013 Learning Objectives Identify Different Types of Variables Appropriately Naming Variables Constructing

More information

Nuts and Bolts Research Methods Symposium

Nuts and Bolts Research Methods Symposium Organizing Your Data Jenny Holcombe, PhD UT College of Medicine Nuts & Bolts Conference August 16, 3013 Topics to Discuss: Types of Variables Constructing a Variable Code Book Developing Excel Spreadsheets

More information

and R Commander (Rcmdr) 1 by the example

and R Commander (Rcmdr) 1 by the example and R Commander (Rcmdr) 1 by the example 1. Introduction...1 Conventions:...2 2. Starting R and Rcmdr...3 Starting R...3 Starting R Commander (Rcmdr)...3 3. Importing data from Excel...4 4. Viewing and

More information

Frequency Tables. Chapter 500. Introduction. Frequency Tables. Types of Categorical Variables. Data Structure. Missing Values

Frequency Tables. Chapter 500. Introduction. Frequency Tables. Types of Categorical Variables. Data Structure. Missing Values Chapter 500 Introduction This procedure produces tables of frequency counts and percentages for categorical and continuous variables. This procedure serves as a summary reporting tool and is often used

More information

R commander an Introduction

R commander an Introduction R commander an Introduction Natasha A. Karp nk3@sanger.ac.uk May 2010 Preface This material is intended as an introductory guide to data analysis with R commander. It was produced as part of an applied

More information

Dr. Barbara Morgan Quantitative Methods

Dr. Barbara Morgan Quantitative Methods Dr. Barbara Morgan Quantitative Methods 195.650 Basic Stata This is a brief guide to using the most basic operations in Stata. Stata also has an on-line tutorial. At the initial prompt type tutorial. In

More information

Minitab 17 commands Prepared by Jeffrey S. Simonoff

Minitab 17 commands Prepared by Jeffrey S. Simonoff Minitab 17 commands Prepared by Jeffrey S. Simonoff Data entry and manipulation To enter data by hand, click on the Worksheet window, and enter the values in as you would in any spreadsheet. To then save

More information

MINITAB Release Comparison Chart Release 14, Release 13, and Student Versions

MINITAB Release Comparison Chart Release 14, Release 13, and Student Versions Technical Support Free technical support Worksheet Size All registered users, including students Registered instructors Number of worksheets Limited only by system resources 5 5 Number of cells per worksheet

More information

STATISTICS FOR PSYCHOLOGISTS

STATISTICS FOR PSYCHOLOGISTS STATISTICS FOR PSYCHOLOGISTS SECTION: JAMOVI CHAPTER: USING THE SOFTWARE Section Abstract: This section provides step-by-step instructions on how to obtain basic statistical output using JAMOVI, both visually

More information

Minitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D.

Minitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D. Minitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D. Introduction to Minitab The interface for Minitab is very user-friendly, with a spreadsheet orientation. When you first launch Minitab, you will see

More information

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SAS SAS (originally Statistical Analysis Software ) is a commercial statistical software package based on a powerful programming

More information

Statistical Package for the Social Sciences INTRODUCTION TO SPSS SPSS for Windows Version 16.0: Its first version in 1968 In 1975.

Statistical Package for the Social Sciences INTRODUCTION TO SPSS SPSS for Windows Version 16.0: Its first version in 1968 In 1975. Statistical Package for the Social Sciences INTRODUCTION TO SPSS SPSS for Windows Version 16.0: Its first version in 1968 In 1975. SPSS Statistics were designed INTRODUCTION TO SPSS Objective About the

More information

Cluster Randomization Create Cluster Means Dataset

Cluster Randomization Create Cluster Means Dataset Chapter 270 Cluster Randomization Create Cluster Means Dataset Introduction A cluster randomization trial occurs when whole groups or clusters of individuals are treated together. Examples of such clusters

More information

Assumption 1: Groups of data represent random samples from their respective populations.

Assumption 1: Groups of data represent random samples from their respective populations. Tutorial 6: Comparing Two Groups Assumptions The following methods for comparing two groups are based on several assumptions. The type of test you use will vary based on whether these assumptions are met

More information

User Manual Mail Merge

User Manual Mail Merge User Manual Mail Merge Version: 1.0 Mail Merge Date: 27-08-2013 How to print letters using Mail Merge You can use Mail Merge to create a series of documents, such as a standard letter that you want to

More information

Table Of Contents. Table Of Contents

Table Of Contents. Table Of Contents Statistics Table Of Contents Table Of Contents Basic Statistics... 7 Basic Statistics Overview... 7 Descriptive Statistics Available for Display or Storage... 8 Display Descriptive Statistics... 9 Store

More information

In this computer exercise we will work with the analysis of variance in R. We ll take a look at the following topics:

In this computer exercise we will work with the analysis of variance in R. We ll take a look at the following topics: UPPSALA UNIVERSITY Department of Mathematics Måns Thulin, thulin@math.uu.se Analysis of regression and variance Fall 2011 COMPUTER EXERCISE 2: One-way ANOVA In this computer exercise we will work with

More information

Creating a data file and entering data

Creating a data file and entering data 4 Creating a data file and entering data There are a number of stages in the process of setting up a data file and analysing the data. The flow chart shown on the next page outlines the main steps that

More information

AcaStat User Manual. Version 10 for Mac and Windows. Copyright 2018, AcaStat Software. All rights Reserved.

AcaStat User Manual. Version 10 for Mac and Windows. Copyright 2018, AcaStat Software. All rights Reserved. AcaStat User Manual Version 10 for Mac and Windows Copyright 2018, AcaStat Software. All rights Reserved. http://www.acastat.com Table of Contents NEW IN VERSION 10... 6 INTRODUCTION... 7 GETTING HELP...

More information

Want to Do a Better Job? - Select Appropriate Statistical Analysis in Healthcare Research

Want to Do a Better Job? - Select Appropriate Statistical Analysis in Healthcare Research Want to Do a Better Job? - Select Appropriate Statistical Analysis in Healthcare Research Liping Huang, Center for Home Care Policy and Research, Visiting Nurse Service of New York, NY, NY ABSTRACT The

More information

Forfattere Intro to SPSS 19.0 Description

Forfattere Intro to SPSS 19.0 Description Forfattere Nicholas Fritsche Rasmus Porsgaard Casper Voigt Rasmussen Martin Klint Hansen Morten Christoffersen Ulrick Tøttrup Niels Yding Sørensen Morten Mondrup Andreassen Jesper Pedersen Intro to SPSS

More information

IST 3108 Data Analysis and Graphics Using R. Summarizing Data Data Import-Export

IST 3108 Data Analysis and Graphics Using R. Summarizing Data Data Import-Export IST 3108 Data Analysis and Graphics Using R Summarizing Data Data Import-Export Engin YILDIZTEPE, PhD Working with Vectors and Logical Subscripts >xsum(x) how many of the values were less than

More information

To finish the current project and start a new project. File Open a text data

To finish the current project and start a new project. File Open a text data GGEbiplot version 5 In addition to being the most complete, most powerful, and most user-friendly software package for biplot analysis, GGEbiplot also has powerful components for on-the-fly data manipulation,

More information

An introduction to R WS 2013/2014

An introduction to R WS 2013/2014 An introduction to R WS 2013/2014 Dr. Noémie Becker (AG Metzler) Dr. Sonja Grath (AG Parsch) Special thanks to: Dr. Martin Hutzenthaler (previously AG Metzler, now University of Frankfurt) course development,

More information

R syntax guide. Richard Gonzalez Psychology 613. August 27, 2015

R syntax guide. Richard Gonzalez Psychology 613. August 27, 2015 R syntax guide Richard Gonzalez Psychology 613 August 27, 2015 This handout will help you get started with R syntax. There are obviously many details that I cannot cover in these short notes but these

More information

WINKS SDA 7. Version 7

WINKS SDA 7. Version 7 WINKS SDA 7 Version 7 (For BASIC and PROFESSIONAL Editions of WINKS SDA) PowerPoint Slides for this Guide are svailable at the website Click Instructors. www.texasoft.com TexaSoft, 2015 Do these tutorials

More information

Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition

Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Online Learning Centre Technology Step-by-Step - Minitab Minitab is a statistical software application originally created

More information

Regression III: Advanced Methods

Regression III: Advanced Methods Lecture 2: Software Introduction Regression III: Advanced Methods William G. Jacoby Department of Political Science Michigan State University jacoby@msu.edu Getting Started with R What is R? A tiny R session

More information

An Introduction to R- Programming

An Introduction to R- Programming An Introduction to R- Programming Hadeel Alkofide, Msc, PhD NOT a biostatistician or R expert just simply an R user Some slides were adapted from lectures by Angie Mae Rodday MSc, PhD at Tufts University

More information

R: BASICS. Andrea Passarella. (plus some additions by Salvatore Ruggieri)

R: BASICS. Andrea Passarella. (plus some additions by Salvatore Ruggieri) R: BASICS Andrea Passarella (plus some additions by Salvatore Ruggieri) BASIC CONCEPTS R is an interpreted scripting language Types of interactions Console based Input commands into the console Examine

More information

Experimental epidemiology analyses with R and R commander. Lars T. Fadnes Centre for International Health University of Bergen

Experimental epidemiology analyses with R and R commander. Lars T. Fadnes Centre for International Health University of Bergen Experimental epidemiology analyses with R and R commander Lars T. Fadnes Centre for International Health University of Bergen 1 Click to add an outline 2 How to install R commander? - install.packages("rcmdr",

More information

Minitab 18 Feature List

Minitab 18 Feature List Minitab 18 Feature List * New or Improved Assistant Measurement systems analysis * Capability analysis Graphical analysis Hypothesis tests Regression DOE Control charts * Graphics Scatterplots, matrix

More information

Learn about the Display options Complete Review Questions and Activities Complete Training Survey

Learn about the Display options Complete Review Questions and Activities Complete Training Survey Intended Audience: Staff members who will be using the AdHoc reporting tools to query the Campus database. Description: To learn filter and report design capabilities available in Campus. Time: 3 hours

More information

R for IR. Created by Narren Brown, Grinnell College, and Diane Saphire, Trinity University

R for IR. Created by Narren Brown, Grinnell College, and Diane Saphire, Trinity University R for IR Created by Narren Brown, Grinnell College, and Diane Saphire, Trinity University For presentation at the June 2013 Meeting of the Higher Education Data Sharing Consortium Table of Contents I.

More information

Right-click on whatever it is you are trying to change Get help about the screen you are on Help Help Get help interpreting a table

Right-click on whatever it is you are trying to change Get help about the screen you are on Help Help Get help interpreting a table Q Cheat Sheets What to do when you cannot figure out how to use Q What to do when the data looks wrong Right-click on whatever it is you are trying to change Get help about the screen you are on Help Help

More information

Problem set for Week 7 Linear models: Linear regression, multiple linear regression, ANOVA, ANCOVA

Problem set for Week 7 Linear models: Linear regression, multiple linear regression, ANOVA, ANCOVA ECL 290 Statistical Models in Ecology using R Problem set for Week 7 Linear models: Linear regression, multiple linear regression, ANOVA, ANCOVA Datasets in this problem set adapted from those provided

More information

1/22/2018. Multivariate Applications in Ecology (BSC 747) Ecological datasets are very often large and complex

1/22/2018. Multivariate Applications in Ecology (BSC 747) Ecological datasets are very often large and complex Multivariate Applications in Ecology (BSC 747) Ecological datasets are very often large and complex Modern integrative approaches have allowed for collection of more data, challenge is proper integration

More information

R Commander Tutorial

R Commander Tutorial R Commander Tutorial Introduction R is a powerful, freely available software package that allows analyzing and graphing data. However, for somebody who does not frequently use statistical software packages,

More information

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SAS code SAS (originally Statistical Analysis Software) is a commercial statistical software package based on a powerful programming

More information

Subset Selection in Multiple Regression

Subset Selection in Multiple Regression Chapter 307 Subset Selection in Multiple Regression Introduction Multiple regression analysis is documented in Chapter 305 Multiple Regression, so that information will not be repeated here. Refer to that

More information

Introduction to R, Github and Gitlab

Introduction to R, Github and Gitlab Introduction to R, Github and Gitlab 27/11/2018 Pierpaolo Maisano Delser mail: maisanop@tcd.ie ; pm604@cam.ac.uk Outline: Why R? What can R do? Basic commands and operations Data analysis in R Github and

More information

STAT 571A Advanced Statistical Regression Analysis. Introduction to R NOTES

STAT 571A Advanced Statistical Regression Analysis. Introduction to R NOTES STAT 571A Advanced Statistical Regression Analysis Introduction to R NOTES 2015 University of Arizona Statistics GIDP. All rights reserved, except where previous rights exist. No part of this material

More information

Technical Support Minitab Version Student Free technical support for eligible products

Technical Support Minitab Version Student Free technical support for eligible products Technical Support Free technical support for eligible products All registered users (including students) All registered users (including students) Registered instructors Not eligible Worksheet Size Number

More information

pairwise.t.test(dataset$measurement, dataset$group, p.adj = bonferroni ) TukeyHSD(aov(dataset$measurement~dataset$group))

pairwise.t.test(dataset$measurement, dataset$group, p.adj = bonferroni ) TukeyHSD(aov(dataset$measurement~dataset$group)) Tutorial 9: Comparing Three or More Groups One-way (single-factor) ANOVA (analysis of variance) Used to compare means of 3 or more groups based on a single explanatory (independent) variable, or factor.

More information

Excel 2010 with XLSTAT

Excel 2010 with XLSTAT Excel 2010 with XLSTAT J E N N I F E R LE W I S PR I E S T L E Y, PH.D. Introduction to Excel 2010 with XLSTAT The layout for Excel 2010 is slightly different from the layout for Excel 2007. However, with

More information

SPSS Statistics 19.0 Fix Pack 2 Fix List Release notes Abstract Content Number Description

SPSS Statistics 19.0 Fix Pack 2 Fix List Release notes Abstract Content Number Description SPSS Statistics 19.0 Fix Pack 2 Fix List Release notes Abstract A comprehensive list of defect corrections for the SPSS Statistics 19.0 Fix Pack 2. Details of the fixes are listed below. If you have questions

More information

Version 1.6. UDW+ Quick Start Guide to Functionality. Program Services Office & Decision Support Group

Version 1.6. UDW+ Quick Start Guide to Functionality. Program Services Office & Decision Support Group Version 1.6 UDW+ Quick Start Guide to Functionality Program Services Office & Decision Support Group Table of Contents Access... 2 Log in/system Requirements... 2 Data Refresh... 2 00. FAME Chartfield

More information

Introduction to STATA

Introduction to STATA Introduction to STATA Duah Dwomoh, MPhil School of Public Health, University of Ghana, Accra July 2016 International Workshop on Impact Evaluation of Population, Health and Nutrition Programs Learning

More information

This document is designed to get you started with using R

This document is designed to get you started with using R An Introduction to R This document is designed to get you started with using R We will learn about what R is and its advantages over other statistics packages the basics of R plotting data and graphs What

More information

Surviving SPSS.

Surviving SPSS. Surviving SPSS http://dataservices.gmu.edu/workshops/spss http://dataservices.gmu.edu/software/spss Debby Kermer George Mason University Libraries Data Services Research Consultant Mason Data Services

More information

STAT 2607 REVIEW PROBLEMS Word problems must be answered in words of the problem.

STAT 2607 REVIEW PROBLEMS Word problems must be answered in words of the problem. STAT 2607 REVIEW PROBLEMS 1 REMINDER: On the final exam 1. Word problems must be answered in words of the problem. 2. "Test" means that you must carry out a formal hypothesis testing procedure with H0,

More information

FreeJSTAT for Windows. Manual

FreeJSTAT for Windows. Manual FreeJSTAT for Windows Manual (c) Copyright Masato Sato, 1998-2018 1 Table of Contents 1. Introduction 3 2. Functions List 6 3. Data Input / Output 7 4. Summary Statistics 8 5. t-test 9 6. ANOVA 10 7. Contingency

More information

Biology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 2 Working with data in Excel and exporting to JMP Introduction

Biology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 2 Working with data in Excel and exporting to JMP Introduction Biology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 2 Working with data in Excel and exporting to JMP Introduction In this exercise, we will learn how to reorganize and reformat a data

More information

Miscellaneous Code. Chapter Remove or search for duplicated GPS locations in a data frame. Contents

Miscellaneous Code. Chapter Remove or search for duplicated GPS locations in a data frame. Contents Chapter 10 Miscellaneous Code Contents 10.1 Remove or search for duplicated GPS locations in a data frame..... 172 10.2 Need to convert back to a matrix to be able to export the data or manipulate the

More information

Introduction to R. UCLA Statistical Consulting Center R Bootcamp. Irina Kukuyeva September 20, 2010

Introduction to R. UCLA Statistical Consulting Center R Bootcamp. Irina Kukuyeva September 20, 2010 UCLA Statistical Consulting Center R Bootcamp Irina Kukuyeva ikukuyeva@stat.ucla.edu September 20, 2010 Outline 1 Introduction 2 Preliminaries 3 Working with Vectors and Matrices 4 Data Sets in R 5 Overview

More information

DATA DEFINITION PHASE

DATA DEFINITION PHASE Twoway Analysis of Variance Unlike previous problems in the manual, the present problem involves two independent variables (gender of juror and type of crime committed by defendant). There are two levels

More information

CS130/230 Lecture 6 Introduction to StatView

CS130/230 Lecture 6 Introduction to StatView Thursday, January 15, 2004 Intro to StatView CS130/230 Lecture 6 Introduction to StatView StatView is a statistical analysis program that allows: o Data management in a spreadsheet-like format o Graphs

More information

IST Computational Tools for Statistics I. DEÜ, Department of Statistics

IST Computational Tools for Statistics I. DEÜ, Department of Statistics IST 1051 Computational Tools for Statistics I 1 DEÜ, Department of Statistics Course Objectives Computational Tools for Statistics-I course can increase the understanding of statistics and helps to learn

More information

Python for Data Analysis. Prof.Sushila Aghav-Palwe Assistant Professor MIT

Python for Data Analysis. Prof.Sushila Aghav-Palwe Assistant Professor MIT Python for Data Analysis Prof.Sushila Aghav-Palwe Assistant Professor MIT Four steps to apply data analytics: 1. Define your Objective What are you trying to achieve? What could the result look like? 2.

More information

Introduction to R. Andy Grogan-Kaylor October 22, Contents

Introduction to R. Andy Grogan-Kaylor October 22, Contents Introduction to R Andy Grogan-Kaylor October 22, 2018 Contents 1 Background 2 2 Introduction 2 3 Base R and Libraries 3 4 Working Directory 3 5 Writing R Code or Script 4 6 Graphical User Interface 4 7

More information

WINKS SDA Windows KwikStat Statistical Data Analysis and Graphs Getting Started Guide

WINKS SDA Windows KwikStat Statistical Data Analysis and Graphs Getting Started Guide WINKS SDA Windows KwikStat Statistical Data Analysis and Graphs Getting Started Guide 2011 Version 6A Do these tutorials first This series of tutorials provides a quick start to using WINKS. Feel free

More information

Pair-Wise Multiple Comparisons (Simulation)

Pair-Wise Multiple Comparisons (Simulation) Chapter 580 Pair-Wise Multiple Comparisons (Simulation) Introduction This procedure uses simulation analyze the power and significance level of three pair-wise multiple-comparison procedures: Tukey-Kramer,

More information

Monitoring and Improving Quality of Data Handling

Monitoring and Improving Quality of Data Handling Monitoring and Improving Quality of Data Handling The purpose of this document is to: (a) (b) (c) Maximise the quality of the research process once the question has been formulated and the study designed.

More information

An Introduction to Using R

An Introduction to Using R An Introduction to Using R Dino Christenson & Scott Powell Ohio StateUniversity November 20, 2007 Introduction to R Outline I. What is R? II. Why use R? III. Where to get R? IV. GUI & scripts V. Objects

More information

Numerical Methods 5633

Numerical Methods 5633 Numerical Methods 5633 Lecture 1 Marina Krstic Marinkovic marina.marinkovic@cern.ch School of Mathematics Trinity College Dublin Marina Krstic Marinkovic 1 / 15 5633-Numerical Methods R programming https://www.r-project.org/

More information

Introduction to R: Using R for statistics and data analysis

Introduction to R: Using R for statistics and data analysis Why use R? Introduction to R: Using R for statistics and data analysis George W Bell, Ph.D. BaRC Hot Topics November 2015 Bioinformatics and Research Computing Whitehead Institute http://barc.wi.mit.edu/hot_topics/

More information

Introduction to R: Using R for statistics and data analysis

Introduction to R: Using R for statistics and data analysis Why use R? Introduction to R: Using R for statistics and data analysis George W Bell, Ph.D. BaRC Hot Topics November 2014 Bioinformatics and Research Computing Whitehead Institute http://barc.wi.mit.edu/hot_topics/

More information

MINITAB 17 BASICS REFERENCE GUIDE

MINITAB 17 BASICS REFERENCE GUIDE MINITAB 17 BASICS REFERENCE GUIDE Dr. Nancy Pfenning September 2013 After starting MINITAB, you'll see a Session window above and a worksheet below. The Session window displays non-graphical output such

More information

Getting Started with JMP at ISU

Getting Started with JMP at ISU Getting Started with JMP at ISU 1 Introduction JMP (pronounced like jump ) is the new campus-wide standard statistical package for introductory statistics courses at Iowa State University. JMP is produced

More information

Data Mining. ❷Chapter 2 Basic Statistics. Asso.Prof.Dr. Xiao-dong Zhu. Business School, University of Shanghai for Science & Technology

Data Mining. ❷Chapter 2 Basic Statistics. Asso.Prof.Dr. Xiao-dong Zhu. Business School, University of Shanghai for Science & Technology ❷Chapter 2 Basic Statistics Business School, University of Shanghai for Science & Technology 2016-2017 2nd Semester, Spring2017 Contents of chapter 1 1 recording data using computers 2 3 4 5 6 some famous

More information

Maximizing Statistical Interactions Part II: Database Issues Provided by: The Biostatistics Collaboration Center (BCC) at Northwestern University

Maximizing Statistical Interactions Part II: Database Issues Provided by: The Biostatistics Collaboration Center (BCC) at Northwestern University Maximizing Statistical Interactions Part II: Database Issues Provided by: The Biostatistics Collaboration Center (BCC) at Northwestern University While your data tables or spreadsheets may look good to

More information

Intro to Stata. University of Virginia Library data.library.virginia.edu. September 16, 2014

Intro to Stata. University of Virginia Library data.library.virginia.edu. September 16, 2014 to 1/12 Intro to University of Virginia Library data.library.virginia.edu September 16, 2014 Getting to Know to 2/12 Strengths Available A full-featured statistical programming language For Windows, Mac

More information

Computer lab 2 Course: Introduction to R for Biologists

Computer lab 2 Course: Introduction to R for Biologists Computer lab 2 Course: Introduction to R for Biologists April 23, 2012 1 Scripting As you have seen, you often want to run a sequence of commands several times, perhaps with small changes. An efficient

More information