Assumption 1: Groups of data represent random samples from their respective populations.
|
|
- Diane Phelps
- 5 years ago
- Views:
Transcription
1 Tutorial 6: Comparing Two Groups Assumptions The following methods for comparing two groups are based on several assumptions. The type of test you use will vary based on whether these assumptions are met or not. Assumption 1: Groups of data represent random samples from their respective populations. Assumption 2: Numerical data are normally distributed (follow a Gaussian curve) in each population. Assumption 3: The standard deviations of the sample groups are equal. You may also see this referred to as the assumption of homogeneity of variance. If Assumption 1 is violated, then you should probably not even be performing statistical analyses on your data, so let s move on to violations of the other two assumptions If both Assumption 2 and 3 are met, proceed with a Two-sample T-test. If Assumption 2 is met, but Assumption 3 is violated, proceed with Welch s approximate T-test. If Assumption 2 is violated, but Assumption 3 is met, proceed with Mann-Whitney U-test. If both Assumption 2 and 3 are violated, your situation is far less than ideal. Your best option is to try transforming your data to hopefully meet the assumption of normality, which would allow you to perform Welch s approximate T-test. If that doesn t work, you may be forced to perform a Mann-Whitney U-test; however, the results should be interpreted with caution because the power of this analysis is greatly reduced when variances are unequal. Normality Testing Before doing any statistical tests, you should first test the normality of your data to see whether it follows a Gaussian distribution (normal bell curve). If data are normal, you should proceed with parametric analyses. If data are non-normal, you should either attempt transformations to make data normal or proceed with non-parametric alternatives. The Shapiro-Wilk normality test, shapiro.test(), is the most commonly used normality test. The output will give a P-value, which if less than 0.05 indicates that data are non-normal. shapiro.test(dataset$measurement) D'Agostino-Pearson Omnibus K2 normality test, dagotest(), is used for data a sample size of at least 20. It requires the fbasics package. library(fbasics) dagotest(measurement) 1
2 Variance Testing The variance of two normally distributed samples can be compared using an F test. The group variable in the second example must only have two groups (levels) for this method to work. var.test(sample1, sample2) var.test(dataset$measurement~dataset$group) Bartlett test for homogeneity of variances is for use on normally data, but can handle unequal sample sizes. bartlett.test(dataset$measurement~dataset$group) Levene test used on non-normal data (Modified Levene s Equal Variance). Requires the car package. library(car) levenetest(dataset$measurement, dataset$group) Data Transformations If data are non-normally distributed, it is common to transform data in an attempt to make them normal. Every attempt at transformation should be followed by a normality test, and you should also graph your data to see what it looks like. Logarithmic transformation is used for right-skewed data with no negative values. If there are any zero values in your dataset, add one to all values. Natural Log You can assign your transformed data to a name ( logdata in this case). If the dataset contains zeros, add 1 to all values (accomplished by adding +1 ) log(dataset$measurement) logdata=log(dataset$measurement) log(dataset$measurement+1) Base 10 Log log10(dataset$measurement) 2
3 Square-root transformation is used on count data following the Poisson distribution. sqrt(dataset$measurement) Arc-sine transformation is used on proportion (percentage) data. asin(dataset$measurement/100) asin(sqrt(dataset$measurement/100)) Note: There are many more options for transformation not listed here. These are just some of the more common ones. Adding a Column to an Existing Data frame After performing a transformation or other calculation, it is useful to append the new data to your original dataset (dataframe format). Such calculations produce a vector of the same length as the number of rows in the data frame can be added to the data frame for the duration of your R session. This does not affect the original data file, but if you wish to save those changes, you can export the newer data frame for later use (covered at the end of this tutorial). This example takes an existing column of measurements in the dataset, multiplies each value by 5 and places them in a new column newcolumn at the end of the data frame. dataset$newcolumn=dataset$measurement*5 This takes an existing column of measurements in the dataset, natural log transforms each data point and places them in a new column logdata at the end of the data frame. dataset$logdata=log(dataset$measurement) Two-sample T-test Used to test whether the means of two groups are equal. For manipulative studies, a t-test compares one control group and one treatment group. For observational studies, a t-test compares two groups that differ in some categorical way (location, habitat, community type, etc.). The default is a Welch two-sample t-test that does not assume equal variance. There is an argument, var.equal=, that is by default set to FALSE. t.test(sample1, sample2, var.equal=false) 3
4 Setting the var.equal= argument to TRUE treats variances as equal between groups and performs a normal two-sample t-test. To be proper, data should be checked for homogeneity of variance, var.test(), prior to performing any statistical test. t.test(sample1, sample2, var.equal=true) The above examples compare vectors of data. You can also compare columns in a dataset as long as they represent your sample groups. t.test(dataset$sample1, dataset$sample2, var.equal=t) If all of the sample data is together in one column (dataset$measurement) with a second column indicating the two groups (dataset$grouping), the tilde symbol (~) can be used to tell R to separate the measurements into the groups. The grouping variable must only have two levels (groups) for this to work. t.test(dataset$measurement~dataset$grouping, var.equal=t) Paired Two-sample T-test Used to test whether the mean difference between two groups equals a certain value. Common experimental designs: 1) Before and after same individuals/samples/plots measured before and after some treatment (before restoration vs. after restoration) or at different time points. 2) Adjacent plot design pairs of adjacent plots are used to control for localized environmental variability; one plot in each pair is manipulated Adding the paired=true argument performs a paired t-test. t.test(sample1, sample2, var.equal=true, paired=true) t.test(dataset$sample1, dataset$sample2, var.equal=true, paired=true) t.test(dataset$measurement~dataset$grouping, var.equal= TRUE, paired=true) 4
5 Two-Sample Wilcoxon Rank Sum (Mann-Whitney) Test This is a non-parametric equivalent for a two-sample t-test. This test is used when data are nonnormal (distribution of data do not follow normal curve). Note: This may be called a Mann-Whitney U-test in other statistical software packages. wilcox.test(sample1, sample2) wilcox.test(dataset$sample1, dataset$sample2) wilcox.test(dataset$measurement~dataset$grouping) Wilcoxon Signed Rank Test This is a non-parametric alternative to a paired t-test. wilcox.test(sample1, sample2, paired=true) wilcox.test(dataset$sample1, dataset$sample2, paired=t) wilcox.test(dataset$measurement~dataset$grouping, paired=true) Exporting Data from R Data frames, matrices, and other outputs generated in R can be exported to a.csv file to save and use later. If not using a working directory, a full pathname must be given for where the file is to be saved. In this example, dataset is the name of the dataset (data frame, matrix, etc.) you want to export. The file= argument tells R the location (pathname) for the location and name of the file you want to create. The pathname must be in quotes. write.csv(dataset, file="/users/johndoe/desktop/ dataset.csv") write.csv(dataset, file="c:/documents and Settings/ Owner/Desktop/dataset.csv") If you have set a working directory, you do not need provide the full pathname, only the name of the file you want to create is required. The resulting file will be saved in the folder you set as your working directory. setwd("/users/johndoe/desktop/") setwd("c:/documents and Settings/Owner/Desktop/") write.csv(dataset, dataset.csv") 5
6 Tutorial Code setwd("/users/johndoe/desktop/") example=read.csv("r_example_dataframe.csv") young=example[grep("young", example$age), "Richness"] young old=example[grep("old", example$age), "Richness"] old shapiro.test(young) shapiro.test(old) library(fbasics) dagotest(example$richness) var.test(young, old) var.test(example$richness~example$age) bartlett.test(example$richness~example$age) library(car) levenetest(example$richness, example$age) levenetest(example$richness~example$age) #natural log transformation log.young=log(young) log.young #base 10 log transformation log10.young=log10(young) log10.young #square root transformation sqrt.young=sqrt(young) sqrt.young 6
7 #arcsine transformation asin.young=asin(young/10) #divide Young richness values by max. value (10) to get proportion data asin.young #Two-sample (Student's) T-test t.test(example$richness~example$age, var.equal=t) t.test(example$richness~example$age, var.equal=f) #Paired T-test t.test(example$richness~example$age, paired=t) #Wilcoxon Rank-Sum Test (non-parametric) wilcox.test(example$richness~example$age) #Paired Wilcoxon (non-parametric) wilcox.test(example$richness~example$age, paired=t) #save.csv file without working directory set write.csv(example, file="/users/johndoe/desktop/r_example_dataframe_revised.csv") #With working directory set write.csv(example, file="r_example_dataframe_revised.csv") 7
command.name(measurement, grouping, argument1=true, argument2=3, argument3= word, argument4=c( A, B, C ))
Tutorial 3: Data Manipulation Anatomy of an R Command Every command has a unique name. These names are specific to the program and case-sensitive. In the example below, command.name is the name of the
More informationThe Power and Sample Size Application
Chapter 72 The Power and Sample Size Application Contents Overview: PSS Application.................................. 6148 SAS Power and Sample Size............................... 6148 Getting Started:
More informationCluster Randomization Create Cluster Means Dataset
Chapter 270 Cluster Randomization Create Cluster Means Dataset Introduction A cluster randomization trial occurs when whole groups or clusters of individuals are treated together. Examples of such clusters
More informationProduct Catalog. AcaStat. Software
Product Catalog AcaStat Software AcaStat AcaStat is an inexpensive and easy-to-use data analysis tool. Easily create data files or import data from spreadsheets or delimited text files. Run crosstabulations,
More informationIn this computer exercise we will work with the analysis of variance in R. We ll take a look at the following topics:
UPPSALA UNIVERSITY Department of Mathematics Måns Thulin, thulin@math.uu.se Analysis of regression and variance Fall 2011 COMPUTER EXERCISE 2: One-way ANOVA In this computer exercise we will work with
More informationSAS/STAT 13.1 User s Guide. The Power and Sample Size Application
SAS/STAT 13.1 User s Guide The Power and Sample Size Application This document is an individual chapter from SAS/STAT 13.1 User s Guide. The correct bibliographic citation for the complete manual is as
More informationMinitab 17 commands Prepared by Jeffrey S. Simonoff
Minitab 17 commands Prepared by Jeffrey S. Simonoff Data entry and manipulation To enter data by hand, click on the Worksheet window, and enter the values in as you would in any spreadsheet. To then save
More informationpairwise.t.test(dataset$measurement, dataset$group, p.adj = bonferroni ) TukeyHSD(aov(dataset$measurement~dataset$group))
Tutorial 9: Comparing Three or More Groups One-way (single-factor) ANOVA (analysis of variance) Used to compare means of 3 or more groups based on a single explanatory (independent) variable, or factor.
More informationSTATS PAD USER MANUAL
STATS PAD USER MANUAL For Version 2.0 Manual Version 2.0 1 Table of Contents Basic Navigation! 3 Settings! 7 Entering Data! 7 Sharing Data! 8 Managing Files! 10 Running Tests! 11 Interpreting Output! 11
More informationInterval Estimation. The data set belongs to the MASS package, which has to be pre-loaded into the R workspace prior to use.
Interval Estimation It is a common requirement to efficiently estimate population parameters based on simple random sample data. In the R tutorials of this section, we demonstrate how to compute the estimates.
More informationfor statistical analyses
Using for statistical analyses Robert Bauer Warnemünde, 05/16/2012 Day 6 - Agenda: non-parametric alternatives to t-test and ANOVA (incl. post hoc tests) Wilcoxon Rank Sum/Mann-Whitney U-Test Kruskal-Wallis
More informationIndex. Bar charts, 106 bartlett.test function, 159 Bottles dataset, 69 Box plots, 113
Index A Add-on packages information page, 186 187 Linux users, 191 Mac users, 189 mirror sites, 185 Windows users, 187 aggregate function, 62 Analysis of variance (ANOVA), 152 anova function, 152 as.data.frame
More information2) familiarize you with a variety of comparative statistics biologists use to evaluate results of experiments;
A. Goals of Exercise Biology 164 Laboratory Using Comparative Statistics in Biology "Statistics" is a mathematical tool for analyzing and making generalizations about a population from a number of individual
More informationBluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition
Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Online Learning Centre Technology Step-by-Step - Minitab Minitab is a statistical software application originally created
More informationStatistical Tests for Variable Discrimination
Statistical Tests for Variable Discrimination University of Trento - FBK 26 February, 2015 (UNITN-FBK) Statistical Tests for Variable Discrimination 26 February, 2015 1 / 31 General statistics Descriptional:
More informationFreeJSTAT for Windows. Manual
FreeJSTAT for Windows Manual (c) Copyright Masato Sato, 1998-2018 1 Table of Contents 1. Introduction 3 2. Functions List 6 3. Data Input / Output 7 4. Summary Statistics 8 5. t-test 9 6. ANOVA 10 7. Contingency
More informationMatlab Tutorial. The value assigned to a variable can be checked by simply typing in the variable name:
1 Matlab Tutorial 1- What is Matlab? Matlab is a powerful tool for almost any kind of mathematical application. It enables one to develop programs with a high degree of functionality. The user can write
More informationStatistical Analysis of Metabolomics Data. Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte
Statistical Analysis of Metabolomics Data Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte Outline Introduction Data pre-treatment 1. Normalization 2. Centering,
More informationFathom Dynamic Data TM Version 2 Specifications
Data Sources Fathom Dynamic Data TM Version 2 Specifications Use data from one of the many sample documents that come with Fathom. Enter your own data by typing into a case table. Paste data from other
More informationEksamen ERN4110, 6/ VEDLEGG SPSS utskrifter til oppgavene (Av plasshensyn kan utskriftene være noe redigert)
Eksamen ERN4110, 6/9-2018 VEDLEGG SPSS utskrifter til oppgavene (Av plasshensyn kan utskriftene være noe redigert) 1 Oppgave 1 Datafila I SPSS: Variabelnavn Beskrivelse Kjønn Kjønn (1=Kvinne, 2=Mann) Studieinteresse
More informationThe first few questions on this worksheet will deal with measures of central tendency. These data types tell us where the center of the data set lies.
Instructions: You are given the following data below these instructions. Your client (Courtney) wants you to statistically analyze the data to help her reach conclusions about how well she is teaching.
More informationTable Of Contents. Table Of Contents
Statistics Table Of Contents Table Of Contents Basic Statistics... 7 Basic Statistics Overview... 7 Descriptive Statistics Available for Display or Storage... 8 Display Descriptive Statistics... 9 Store
More informationPsychology 282 Lecture #21 Outline Categorical IVs in MLR: Effects Coding and Contrast Coding
Psychology 282 Lecture #21 Outline Categorical IVs in MLR: Effects Coding and Contrast Coding In the previous lecture we learned how to incorporate a categorical research factor into a MLR model by using
More informationSPSS TRAINING SPSS VIEWS
SPSS TRAINING SPSS VIEWS Dataset Data file Data View o Full data set, structured same as excel (variable = column name, row = record) Variable View o Provides details for each variable (column in Data
More informationStatistical Research Consultants Bangladesh (SRCBD) Testing for Normality using SPSS
Testing for Normality using SPSS An assessment of the normality of data is a prerequisite for many statistical tests because normal data is an underlying assumption in parametric testing. There are two
More informationIntroduction to R. Introduction to Econometrics W
Introduction to R Introduction to Econometrics W3412 Begin Download R from the Comprehensive R Archive Network (CRAN) by choosing a location close to you. Students are also recommended to download RStudio,
More informationIntroduction to Geospatial Analysis
Introduction to Geospatial Analysis Introduction to Geospatial Analysis 1 Descriptive Statistics Descriptive statistics. 2 What and Why? Descriptive Statistics Quantitative description of data Why? Allow
More informationMATH3880 Introduction to Statistics and DNA MATH5880 Statistics and DNA Practical Session Monday, 16 November pm BRAGG Cluster
MATH3880 Introduction to Statistics and DNA MATH5880 Statistics and DNA Practical Session Monday, 6 November 2009 3.00 pm BRAGG Cluster This document contains the tasks need to be done and completed by
More informationVocabulary. 5-number summary Rule. Area principle. Bar chart. Boxplot. Categorical data condition. Categorical variable.
5-number summary 68-95-99.7 Rule Area principle Bar chart Bimodal Boxplot Case Categorical data Categorical variable Center Changing center and spread Conditional distribution Context Contingency table
More informationSTA 570 Spring Lecture 5 Tuesday, Feb 1
STA 570 Spring 2011 Lecture 5 Tuesday, Feb 1 Descriptive Statistics Summarizing Univariate Data o Standard Deviation, Empirical Rule, IQR o Boxplots Summarizing Bivariate Data o Contingency Tables o Row
More informationIntegrated Math I. IM1.1.3 Understand and use the distributive, associative, and commutative properties.
Standard 1: Number Sense and Computation Students simplify and compare expressions. They use rational exponents and simplify square roots. IM1.1.1 Compare real number expressions. IM1.1.2 Simplify square
More informationIntroduction to scientific programming in R
Introduction to scientific programming in R John M. Drake & Pejman Rohani 1 Introduction This course will use the R language programming environment for computer modeling. The purpose of this exercise
More informationMacros and ODS. SAS Programming November 6, / 89
Macros and ODS The first part of these slides overlaps with last week a fair bit, but it doesn t hurt to review as this code might be a little harder to follow. SAS Programming November 6, 2014 1 / 89
More informationCCSSM Curriculum Analysis Project Tool 1 Interpreting Functions in Grades 9-12
Tool 1: Standards for Mathematical ent: Interpreting Functions CCSSM Curriculum Analysis Project Tool 1 Interpreting Functions in Grades 9-12 Name of Reviewer School/District Date Name of Curriculum Materials:
More informationResources for statistical assistance. Quantitative covariates and regression analysis. Methods for predicting continuous outcomes.
Resources for statistical assistance Quantitative covariates and regression analysis Carolyn Taylor Applied Statistics and Data Science Group (ASDa) Department of Statistics, UBC January 24, 2017 Department
More informationData Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski
Data Analysis and Solver Plugins for KSpread USER S MANUAL Tomasz Maliszewski tmaliszewski@wp.pl Table of Content CHAPTER 1: INTRODUCTION... 3 1.1. ABOUT DATA ANALYSIS PLUGIN... 3 1.3. ABOUT SOLVER PLUGIN...
More informationIntroduction to Mplus
Introduction to Mplus May 12, 2010 SPONSORED BY: Research Data Centre Population and Life Course Studies PLCS Interdisciplinary Development Initiative Piotr Wilk piotr.wilk@schulich.uwo.ca OVERVIEW Mplus
More informationNuts and Bolts Research Methods Symposium
Organizing Your Data Jenny Holcombe, PhD UT College of Medicine Nuts & Bolts Conference August 16, 3013 Topics to Discuss: Types of Variables Constructing a Variable Code Book Developing Excel Spreadsheets
More informationModule 1: Introduction RStudio
Module 1: Introduction RStudio Contents Page(s) Installing R and RStudio Software for Social Network Analysis 1-2 Introduction to R Language/ Syntax 3 Welcome to RStudio 4-14 A. The 4 Panes 5 B. Calculator
More informationMultivariate Capability Analysis
Multivariate Capability Analysis Summary... 1 Data Input... 3 Analysis Summary... 4 Capability Plot... 5 Capability Indices... 6 Capability Ellipse... 7 Correlation Matrix... 8 Tests for Normality... 8
More informationMean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242
Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242 Creation & Description of a Data Set * 4 Levels of Measurement * Nominal, ordinal, interval, ratio * Variable Types
More informationCorrectly Compute Complex Samples Statistics
SPSS Complex Samples 15.0 Specifications Correctly Compute Complex Samples Statistics When you conduct sample surveys, use a statistics package dedicated to producing correct estimates for complex sample
More informationMicrosoft Access XP Queries. Student Manual
Microsoft Access XP Queries Student Manual Duplication is prohibited without the written consent of The Abreon Group. Foster Plaza 10 680 Andersen Drive Suite 500 Pittsburgh, PA 15220 412.539.1800 800.338.5185
More informationE-Campus Inferential Statistics - Part 2
E-Campus Inferential Statistics - Part 2 Group Members: James Jones Question 4-Isthere a significant difference in the mean prices of the stores? New Textbook Prices New Price Descriptives 95% Confidence
More informationCHAPTER 6. The Normal Probability Distribution
The Normal Probability Distribution CHAPTER 6 The normal probability distribution is the most widely used distribution in statistics as many statistical procedures are built around it. The central limit
More informationWriting Reports with Report Designer and SSRS 2014 Level 1
Writing Reports with Report Designer and SSRS 2014 Level 1 Duration- 2days About this course In this 2-day course, students are introduced to the foundations of report writing with Microsoft SQL Server
More informationFurther Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables
Further Maths Notes Common Mistakes Read the bold words in the exam! Always check data entry Remember to interpret data with the multipliers specified (e.g. in thousands) Write equations in terms of variables
More informationThe ctest Package. January 3, 2000
R objects documented: The ctest Package January 3, 2000 bartlett.test....................................... 1 binom.test........................................ 2 cor.test.........................................
More informationPage 1. Graphical and Numerical Statistics
TOPIC: Description Statistics In this tutorial, we show how to use MINITAB to produce descriptive statistics, both graphical and numerical, for an existing MINITAB dataset. The example data come from Exercise
More informationMeasures of Dispersion
Measures of Dispersion 6-3 I Will... Find measures of dispersion of sets of data. Find standard deviation and analyze normal distribution. Day 1: Dispersion Vocabulary Measures of Variation (Dispersion
More informationDealing with Categorical Data Types in a Designed Experiment
Dealing with Categorical Data Types in a Designed Experiment Part II: Sizing a Designed Experiment When Using a Binary Response Best Practice Authored by: Francisco Ortiz, PhD STAT T&E COE The goal of
More informationMeasures of Dispersion
Lesson 7.6 Objectives Find the variance of a set of data. Calculate standard deviation for a set of data. Read data from a normal curve. Estimate the area under a curve. Variance Measures of Dispersion
More informationExploring and Understanding Data Using R.
Exploring and Understanding Data Using R. Loading the data into an R data frame: variable
More informationTHE L.L. THURSTONE PSYCHOMETRIC LABORATORY UNIVERSITY OF NORTH CAROLINA. Forrest W. Young & Carla M. Bann
Forrest W. Young & Carla M. Bann THE L.L. THURSTONE PSYCHOMETRIC LABORATORY UNIVERSITY OF NORTH CAROLINA CB 3270 DAVIE HALL, CHAPEL HILL N.C., USA 27599-3270 VISUAL STATISTICS PROJECT WWW.VISUALSTATS.ORG
More informationCategorical Data in a Designed Experiment Part 2: Sizing with a Binary Response
Categorical Data in a Designed Experiment Part 2: Sizing with a Binary Response Authored by: Francisco Ortiz, PhD Version 2: 19 July 2018 Revised 18 October 2018 The goal of the STAT COE is to assist in
More informationJMP Book Descriptions
JMP Book Descriptions The collection of JMP documentation is available in the JMP Help > Books menu. This document describes each title to help you decide which book to explore. Each book title is linked
More informationR commander an Introduction
R commander an Introduction Natasha A. Karp nk3@sanger.ac.uk May 2010 Preface This material is intended as an introductory guide to data analysis with R commander. It was produced as part of an applied
More informationMultivariate analyses in ecology. Cluster (part 2) Ordination (part 1 & 2)
Multivariate analyses in ecology Cluster (part 2) Ordination (part 1 & 2) 1 Exercise 9B - solut 2 Exercise 9B - solut 3 Exercise 9B - solut 4 Exercise 9B - solut 5 Multivariate analyses in ecology Cluster
More informationAverages and Variation
Averages and Variation 3 Copyright Cengage Learning. All rights reserved. 3.1-1 Section 3.1 Measures of Central Tendency: Mode, Median, and Mean Copyright Cengage Learning. All rights reserved. 3.1-2 Focus
More informationSPSS. (Statistical Packages for the Social Sciences)
Inger Persson SPSS (Statistical Packages for the Social Sciences) SHORT INSTRUCTIONS This presentation contains only relatively short instructions on how to perform basic statistical calculations in SPSS.
More informationGeneral Program Description
General Program Description This program is designed to interpret the results of a sampling inspection, for the purpose of judging compliance with chosen limits. It may also be used to identify outlying
More informationSubset Selection in Multiple Regression
Chapter 307 Subset Selection in Multiple Regression Introduction Multiple regression analysis is documented in Chapter 305 Multiple Regression, so that information will not be repeated here. Refer to that
More informationAutomatic Selection of Compiler Options Using Non-parametric Inferential Statistics
Automatic Selection of Compiler Options Using Non-parametric Inferential Statistics Masayo Haneda Peter M.W. Knijnenburg Harry A.G. Wijshoff LIACS, Leiden University Motivation An optimal compiler optimization
More informationDescriptive Statistics, Standard Deviation and Standard Error
AP Biology Calculations: Descriptive Statistics, Standard Deviation and Standard Error SBI4UP The Scientific Method & Experimental Design Scientific method is used to explore observations and answer questions.
More informationPackage st. July 8, 2015
Version 1.2.5 Date 2015-07-08 Package st July 8, 2015 Title Shrinkage t Statistic and Correlation-Adjusted t-score Author Rainer Opgen-Rhein, Verena Zuber, and Korbinian Strimmer. Maintainer Korbinian
More informationGetting Started with MATLAB
APPENDIX B Getting Started with MATLAB MATLAB software is a computer program that provides the user with a convenient environment for many types of calculations in particular, those that are related to
More informationR commander an introduction
R commander an introduction free, user-friendly, and powerful software Ho Kim SCHOOL OF PUBLIC HEALTH, SNU Useful sites R is a free software with powerful tools The Comprehensive R Archives Network http://cran.r-project.org/
More informationA. Incorrect! This would be the negative of the range. B. Correct! The range is the maximum data value minus the minimum data value.
AP Statistics - Problem Drill 05: Measures of Variation No. 1 of 10 1. The range is calculated as. (A) The minimum data value minus the maximum data value. (B) The maximum data value minus the minimum
More informationKernel Density Estimation (KDE)
Kernel Density Estimation (KDE) Previously, we ve seen how to use the histogram method to infer the probability density function (PDF) of a random variable (population) using a finite data sample. In this
More informationThemes in the Texas CCRS - Mathematics
1. Compare real numbers. a. Classify numbers as natural, whole, integers, rational, irrational, real, imaginary, &/or complex. b. Use and apply the relative magnitude of real numbers by using inequality
More informationJMP 10 Student Edition Quick Guide
JMP 10 Student Edition Quick Guide Instructions presume an open data table, default preference settings and appropriately typed, user-specified variables of interest. RMC = Click Right Mouse Button Graphing
More informationTHIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010
THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL STOR 455 Midterm September 8, INSTRUCTIONS: BOTH THE EXAM AND THE BUBBLE SHEET WILL BE COLLECTED. YOU MUST PRINT YOUR NAME AND SIGN THE HONOR PLEDGE
More informationOrganizing Your Data. Jenny Holcombe, PhD UT College of Medicine Nuts & Bolts Conference August 16, 3013
Organizing Your Data Jenny Holcombe, PhD UT College of Medicine Nuts & Bolts Conference August 16, 3013 Learning Objectives Identify Different Types of Variables Appropriately Naming Variables Constructing
More informationFrequently Asked Questions Updated 2006 (TRIM version 3.51) PREPARING DATA & RUNNING TRIM
Frequently Asked Questions Updated 2006 (TRIM version 3.51) PREPARING DATA & RUNNING TRIM * Which directories are used for input files and output files? See menu-item "Options" and page 22 in the manual.
More informationProject and Production Management Prof. Arun Kanda Department of Mechanical Engineering Indian Institute of Technology, Delhi
Project and Production Management Prof. Arun Kanda Department of Mechanical Engineering Indian Institute of Technology, Delhi Lecture - 8 Consistency and Redundancy in Project networks In today s lecture
More informationIntroductory Guide to SAS:
Introductory Guide to SAS: For UVM Statistics Students By Richard Single Contents 1 Introduction and Preliminaries 2 2 Reading in Data: The DATA Step 2 2.1 The DATA Statement............................................
More informationBox-Cox Transformation for Simple Linear Regression
Chapter 192 Box-Cox Transformation for Simple Linear Regression Introduction This procedure finds the appropriate Box-Cox power transformation (1964) for a dataset containing a pair of variables that are
More informationWorkload Characterization Techniques
Workload Characterization Techniques Raj Jain Washington University in Saint Louis Saint Louis, MO 63130 Jain@cse.wustl.edu These slides are available on-line at: http://www.cse.wustl.edu/~jain/cse567-08/
More informationIf the active datasheet is empty when the StatWizard appears, a dialog box is displayed to assist in entering data.
StatWizard Summary The StatWizard is designed to serve several functions: 1. It assists new users in entering data to be analyzed. 2. It provides a search facility to help locate desired statistical procedures.
More informationWhat s New in Spotfire DXP 1.1. Spotfire Product Management January 2007
What s New in Spotfire DXP 1.1 Spotfire Product Management January 2007 Spotfire DXP Version 1.1 This document highlights the new capabilities planned for release in version 1.1 of Spotfire DXP. In this
More informationLAB #2: SAMPLING, SAMPLING DISTRIBUTIONS, AND THE CLT
NAVAL POSTGRADUATE SCHOOL LAB #2: SAMPLING, SAMPLING DISTRIBUTIONS, AND THE CLT Statistics (OA3102) Lab #2: Sampling, Sampling Distributions, and the Central Limit Theorem Goal: Use R to demonstrate sampling
More informationA Guide to Using Some Basic MATLAB Functions
A Guide to Using Some Basic MATLAB Functions UNC Charlotte Robert W. Cox This document provides a brief overview of some of the essential MATLAB functionality. More thorough descriptions are available
More informationStatCalc User Manual. Version 9 for Mac and Windows. Copyright 2018, AcaStat Software. All rights Reserved.
StatCalc User Manual Version 9 for Mac and Windows Copyright 2018, AcaStat Software. All rights Reserved. http://www.acastat.com Table of Contents Introduction... 4 Getting Help... 4 Uninstalling StatCalc...
More informationFigure 1. Figure 2. The BOOTSTRAP
The BOOTSTRAP Normal Errors The definition of error of a fitted variable from the variance-covariance method relies on one assumption- that the source of the error is such that the noise measured has a
More informationRClimTool USER MANUAL
RClimTool USER MANUAL By Lizeth Llanos Herrera, student Statistics This tool is designed to support, process automation and analysis of climatic series within the agreement of CIAT-MADR. It is not intended
More informationTable of Contents (As covered from textbook)
Table of Contents (As covered from textbook) Ch 1 Data and Decisions Ch 2 Displaying and Describing Categorical Data Ch 3 Displaying and Describing Quantitative Data Ch 4 Correlation and Linear Regression
More informationLab 5 - Risk Analysis, Robustness, and Power
Type equation here.biology 458 Biometry Lab 5 - Risk Analysis, Robustness, and Power I. Risk Analysis The process of statistical hypothesis testing involves estimating the probability of making errors
More informationUniversity of Alberta
A Brief Introduction to MATLAB University of Alberta M.G. Lipsett 2008 MATLAB is an interactive program for numerical computation and data visualization, used extensively by engineers for analysis of systems.
More informationOne way ANOVA when the data are not normally distributed (The Kruskal-Wallis test).
One way ANOVA when the data are not normally distributed (The Kruskal-Wallis test). Suppose you have a one way design, and want to do an ANOVA, but discover that your data are seriously not normal? Just
More informationfor Windows User guide Exeter Software Statistical software for biologists Version 3.3
for Windows Statistical software for biologists Version 3.3 User guide F. James Rohlf Dennis E. Slice Department of Ecology and Evolution State University of New York Stony Brook, NY 11794 Exeter Software
More informationIntroduction to R (BaRC Hot Topics)
Introduction to R (BaRC Hot Topics) George Bell September 30, 2011 This document accompanies the slides from BaRC s Introduction to R and shows the use of some simple commands. See the accompanying slides
More informationFrequency Distributions
Displaying Data Frequency Distributions After collecting data, the first task for a researcher is to organize and summarize the data so that it is possible to get a general overview of the results. Remember,
More informationQUEEN MARY, UNIVERSITY OF LONDON. Introduction to Statistics
QUEEN MARY, UNIVERSITY OF LONDON MTH 4106 Introduction to Statistics Practical 1 10 January 2012 In this practical you will be introduced to the statistical computing package called Minitab. You will use
More information13 File Structures. Source: Foundations of Computer Science Cengage Learning. Objectives After studying this chapter, the student should be able to:
13 File Structures 13.1 Source: Foundations of Computer Science Cengage Learning Objectives After studying this chapter, the student should be able to: Define two categories of access methods: sequential
More informationPITMAN Randomization Tests Version 5.00S (c) "One of many STATOOLS(tm)..." by. Gerard E. Dallal 54 High Plain Road Andover, MA 01810
PITMAN Randomization Tests Version 5.00S (c) 1985-1991 "One of many STATOOLS(tm)..." by Gerard E. Dallal 54 High Plain Road Andover, MA 01810 PITMAN performs one- and two-sample exact randomization tests.
More informationNonparametric Testing
Nonparametric Testing in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. mark@excelmasterseries.com
More informationThe goal of this handout is to allow you to install R on a Windows-based PC and to deal with some of the issues that can (will) come up.
Fall 2010 Handout on Using R Page: 1 The goal of this handout is to allow you to install R on a Windows-based PC and to deal with some of the issues that can (will) come up. 1. Installing R First off,
More informationMinitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D.
Minitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D. Introduction to Minitab The interface for Minitab is very user-friendly, with a spreadsheet orientation. When you first launch Minitab, you will see
More informationWillmar Public Schools Curriculum Map
Subject Area Mathematics Senior High Course Name Advanced Algebra 2B Date June 2010 Advanced Algebra 2B and Algebra 2B courses parallel each other in content and time. The Advanced Algebra 2B class is
More informationLab #9: ANOVA and TUKEY tests
Lab #9: ANOVA and TUKEY tests Objectives: 1. Column manipulation in SAS 2. Analysis of variance 3. Tukey test 4. Least Significant Difference test 5. Analysis of variance with PROC GLM 6. Levene test for
More information