PLS205 Lab 1 January 9, Laboratory Topics 1 & 2
|
|
- Tiffany Garrison
- 6 years ago
- Views:
Transcription
1 PLS205 Lab 1 January 9, 2014 Laboratory Topics 1 & 2 Welcome, introduction, logistics, and organizational matters Introduction to SAS Writing and running programs saving results checking for errors Different ways to input/import data Proc Means, Proc Univariate (testing for normality) Hypothesis testing t-tests: One sample Two sample (Independent) Two sample (Paired) Power Calculations using Proc Power Proc Sort, Proc Print, Proc Means Nifty SAS Program: Critical values generator Niftier webpage APPENDIX: Data input examples Logistics and Organizational Matters 1. Homework is due at the beginning of lab, with 10 points off for every day it's late. If you don't submit it by the time the homework key is posted (usually 24 hours later), you will receive a zero. 2. Print the lab handouts before coming to lab they will be posted on the class website each week by Wednesday night at the latest. 3. To log on to the lab computers, you need a UCD user ID and password. 4. Bring a diskette/flashdrive to lab to copy examples from the class directory (G:\PLS205\*.*). 5. This is a demanding class, so make use of all your resources office hours, lab handouts, homework keys, each other (the 205 Buddy System). Introduction to SAS (your new best friend?) To open SAS Version 9.3: START All Programs Class Software SAS SAS 9.3 (English) The SAS Display Manager There are three basic Windows, listed in the order you should view them: 1) The Program Editor window: Where you tell SAS what to do. 2) The Log window: Where SAS tells you what it did and (usually) what you did wrong. 3) The Output window: Where you find the results of your analysis (i.e. the good stuff). Lab 1.1
2 Example 1 From ST&D p. 29 [Lab1ex1.sas] Data BirdCount * Creates a new data set called "BirdCount" Input Field Birds * Tells SAS the names of variables Cards * A throwback to the old days * SEMICOLON! SEMICOLON! SEMICOLON! Proc Means mean var std stderr cv Data = BirdCount Var Birds * Generate these requested statistics for the variable "Birds" in the dataset "BirdCount" Run Quit Output Analysis Variable : Birds Coeff of Mean Variance Std Dev Std Error Variation Things to Learn 1. Run (submit) a SAS program with a simple click on the running man icon. 2. Move between windows to scan for red-type errors (Log) and then view results (Output). 3. Clear Log and Output windows with a simple click on the blank page icon. 4. Save program to disk. From Program Editor window: File Save as. 5. Save output to disk. From Output window: File Save as. 6. Set the line size for output to 76 characters (the perfect fit for 10 point Courier font on a page with 1" margins): Tools Options System Log and procedure output control SAS log and procedure output Double click linesize Example 2 PROC UNIVARIATE test of Normality From ST&D pg. 30 [Lab1ex2.sas] Data Barley Input tells SAS to please read to the end of the line Cards Proc Univariate normal plot Data = Barley var Extract * Test for normality and generate plots for the variable Extract in the dataset Barley Run Quit Lab 1.2
3 Comments on the code 1. Use in the input statement when you have more Cards on a row than input variables. 2. The word "plot" in Proc Univariate is an example of an option. Its function is to generate several graphical displays of the data, including a stem-and-leaf display, a boxplot, and a normal probability plot (a.k.a. quantile-quantile or Q-Q plot) [see ST&D for interpretation of these displays: pages 30-32, ]. 3. The word "normal" in Proc Univariate is another option. Its function is to carry out tests for normality. In this class, we will be using the Shapiro-Wilk test for normality. Output Variable: Extract Moments N 14 Sum Weights 14 Mean Sum Observations Std Deviation Variance Skewness Kurtosis Uncorrected SS Corrected SS Coeff Variation Std Error Mean Basic Statistical Measures Location Variability Mean Std Deviation Median Variance Mode Range Interquartile Range Tests for Location: Mu0=0 Test -Statistic p Value Student's t t Pr > t <.0001 Sign M 7 Pr >= M Signed Rank S 52.5 Pr >= S Tests for Normality Test --Statistic p Value Shapiro-Wilk W Pr < W Kolmogorov-Smirnov D Pr > D > Cramer-von Mises W-Sq Pr > W-Sq > Anderson-Darling A-Sq Pr > A-Sq > Stem Leaf # Boxplot *-----* Lab 1.3
4 Normal Probability Plot * *++* * *+*+ * * * *+ ++*+* +++* * NOTE: The Shapiro-Wilk W statistic measures the linear correlation between the data and their normal scores. The closer W is to 1, the better correlated the distribution is to a normal distribution. Normality is rejected when W is sufficiently smaller than one, that is, when the value Pr < W is less than In this example, p = > 0.05, so we conclude the data exhibit a normal distribution. Example 3 PROC TTEST One sample [Lab1ex3.sas] One sample TTEST To use Proc TTEST to do a t-test (e.g. testing if µ = xx), we must create: new variable = old variable expected µ In the following example, we will test the hypothesis that µ = 78 by creating a new variable TEST78 = Extract We will then perform a t-test for the new variable against the hypothesis µ= 0 (see similar example ST&D pg ). Data Barley Input Extract78 = Extract * Here's that new variable Cards Proc Print * Proc Print displays the inputted data, a nice check Proc TTEST Var Extract * T TEST original variety Var Extract78 * T TEST new variable Extract78 Run Quit Output [Note: In your work, you would accompany this output with a line of interpretation.] The one sample PROC TTEST produces a nice Q-Q graph and tests the probability that the mean is 0 (In this case P<0.0001, since 0 is not included in the confidence vale). For the original Extract variable: Lab 1.4
5 Variable: Extract N Mean Std Dev Std Err Mean 95% CL Mean Std Dev DF t Value Pr > t <.0001 To test if the mean is equal to a certain value, We generate a new variable Extract78 =Extract 78 (the mean we want to test). TTEST of this new variable produces the same nice graphs as before (not shown). Variable: Extract78 N Mean Std Dev Std Err Minimum Maximum Mean 95% CL Mean Std Dev DF t Value Pr > t <.0001 Things to Notice 1. The t-test is highly significant (p < 0.001) so we reject H The 95% confidence interval of the mean for Extract is [ ]. See that the value 78 is far above the upper limit of this confidence interval. That is why the test is highly significant. 3. The 95% confidence interval of the mean for Extract78 is [-2.77 to -1.35] which does not include the 0, so the mean of this sample is significantly different from 78 (P<0.0001). Try repeating the exercise using (the lower extreme of the confidence interval) as the Null Mean. What is the expected probability of the t-test? Lab 1.5
6 Example 4 PROC TTEST 2 independent samples [Lab1ex4.sas] A classification variable (named in this case Trt) is required to tell SAS which values belong to each group. Alphanumeric variables are indicated by $ after the name. Data Barley Input Trt $ Cards Var Var Var Var Var Var Var Var Var Var Var Var Var Var Var Var Var Var Var Var Var Var Var Var Var Var Var Var Proc TTEST class trt * assumes independent samples var extract Proc sort by Trt Proc Univariate normal plot Data = Barley var Extract by Trt Run Quit Confidence Interval TTEST Trt N Mean Std Dev Std Err Var Var Trt Mean 95% CL Mean Var Var Method Variances DF t Value Pr > t Pooled Equal Satterthwaite Unequal Equality of Variances Method Num DF Den DF F Value Pr > F Folded F In this case there are no significant differences in malt extract between the two varieties (P=0.16). The test for the equality of variances is NS so we use the P value for the Equal variances. If the test for the equality of variances is significant use the P value for the Unequal variances Lab 1.6
7 Tests for Normality using the by statement Test Statistic p Value Var1 Shapiro-Wilk W Pr < W NS (then we do not reject Normality) Var2 Shapiro-Wilk W Pr < W NS Example 5 PROC TTEST paired samples [Lab1ex5.sas] The two values are taken from the same experimental unit (NOT INDEPENDENT). For example, assume that Var2 is the same sample extracted at a different temperature. The code for paired TTEST is different. Data Barley Input Var1 *paired samples Cards Proc TTEST paired var1*var2 * assumes paired samples Run Quit The Paired TTEST Procedure generates a new variable equal to the difference: Var1 - Var2 and then performs a one sample TTEST to see if that difference is 0. Note that the 95% confidence interval [ ] does not include the 0. This agrees with the highly significance of this test P< N Mean Std Dev Std Err Minimum Maximum Mean 95% CL Mean Std Dev 95% CL Std Dev DF t Value Pr > t Lab 1.7
8 Example 6 Power calculation with PROC POWER [Lab1ex6.sas] One Sample power test. What is the power of a test to detect a difference between the observed mean of and alternative means of and (the same value)? proc power onesamplemeans mean = ntotal = 14 stddev = 1.23 nullmean= alpha= 0.05 power =. run quit The POWER Procedure One-sample t Test for Mean Fixed Scenario Elements Distribution Method Normal Exact Alpha 0.05 Mean Standard Deviation 1.23 Total Sample Size 14 Number of Sides 2 Computed Power Index Null Mean Power >.999 Things to Notice 1. The. after power indicates that you are requesting the power 2. The onesamplemeans is one line of code up to the. It is split in multiple lines to make it easier to read 3. The power to detect a difference from a null mean of 77 is 0.846, and the power increases to almost 1 when the alternative mean is 78. The minimum value of the power is =alpha when the alternative mean is the same as the observed mean. You generally want a power of at least 0.80 (80%). Notice that a 95% confidence interval of the mean is [ ] excludes both 77 and 78. See that the value 78 is far above the upper limit of this confidence interval. That is why the test is highly significant. Lab 1.8
9 Proc Power can be also used to estimate the number of samples required to obtain a certain power proc power onesamplemeans mean = ntotal =. stddev = 1.23 nullmean= 77 alpha= 0.05 power = run The POWER Procedure One-sample t Test for Mean Fixed Scenario Elements Distribution Normal Method Exact Null Mean 77 Alpha 0.05 Mean Standard Deviation 1.23 Number of Sides 2 Computed N Total Nominal Actual N Index Power Power Total SAS rounds the number estimation conservatively to the upper number if there are decimals, to guarantee at least the requested power. Two sample power test. What is the power of a test to detect a difference between two samples with the following mean and variances: Mean Variance N Sample Sample Mean difference= 5 Pooled s= SQRT( (15+17)/2)= 4 (not the same as the average of the standard deviations) proc power twosamplemeans test=diff meandiff = 5 stddev = 4 npergroup = power =. run quit Lab 1.9
10 The POWER Procedure Two-sample t Test for Mean Difference Distribution Normal Method Exact Mean Difference 5 Standard Deviation 4 Number of Sides 2 Null Difference 0 Alpha 0.05 Computed Power N Per Index Group Power Example 7 [Lab1ex7.sas] This next example illustrates the use of Proc Sort, Proc Print, and Proc Means: Data Grades Input StudentNo GradUG $ HWGrade Midterm Final * $ indicates a non-numeric class variable FinalGrade = 0.25*HWgrade *Midterm *Final Cards 13 G G G UG G UG Proc Sort * Orders the data by the variable named below By StudentNo Proc Print * Displays the inputted data in whatever order you wish Title 'Roster in order of Student Number' ID StudentNo Var HWGrade Midterm Final FinalGrade Proc Means n mean std var stderr maxdec=1* MaxDec limits all numbers to 1 decimal place Title 'Descriptive statistics' Var HWGrade Midterm Final FinalGrade Proc Sort By GradUG * Sorting is needed because of the Proc Means below Proc Means n mean std var stderr maxdec=1 Title 'Descriptive statistics by student level' Var HWGrade Midterm Final FinalGrade By GradUG * Without Proc Sort above, this would confuse SAS Proc Plot Plot Final*FinalGrade * Generates plot of Final (y) vs. FinalGrade(x) Run Quit Lab 1.10
11 Note: If you add a title to one Proc statement but not to the others, all the Proc outputs will have the same label. In fact, they will carry over to future programs! To avoid confusion, you should label everything, especially as your programs become more complicated and the output more profuse. Nifty SAS Program [SASCritValues.sas] Tables of critical values rarely contain the exact values you are looking for. Here's a way to use SAS to find critical values and p-values with precision: Data ValueFinder TITLE 'CRITICAL VALUES' * The functions below find the critical value for a specified probability 'p' * where 'p' is the proportion of the area to the **LEFT** of the critical value * [e.g will be the 'p' for a 5% two-tailed test] Nvalue = PROBIT (0.975) * This is Z Tvalue = TINV (0.975, 20) * This is t (p, df) Chivalue = CINV (0.975, 20) * This is chi-square (p, df) Fvalue = FINV (0.975, 20, 4) * This is F (p, NUM df, DEN df) TITLE 'PROBABILITY' * These functions return the probability that an observation is < x Nprob = PROBNORM (1.96) * Z Tprob = PROBT (2.086, 20) * t Chiprob = PROBCHI (34.2, 20) * chi-square Fprob = PROBF (8.56, 20, 4) * F Proc Print Run Quit Very very handy but if you use this, please be aware of what SAS is telling you, namely that it is the areas to the LEFT of the critical values that are being considered. Double-check your results with a table until you get the hang of it. Niftier Website There are a lot of free critical values calculators available on-line as well. Feel free to use them, but be sure you understand how they work. The best way to do this is by checking some test values against the tables in the book (or on the class webpage). A good site: Caution: Be aware of what these calculators are telling you, namely that it is the areas to the LEFT or RIGHT of the critical values that are being considered. Double-check your results with a table until you get the hang of it. Lab 1.11
12 APPENDIX: Data Input Examples Students lose a shocking number of points on homeworks and exams due to incorrect data input (i.e. careless typographical errors). Very rarely should you ever have to input data number-by-number because almost all the datasets will be provided to you already typed into Word documents. The challenge you have is to structure your data input routine in SAS such that it will read correctly whatever you cut-and-paste into your code. The "Do-End-loops" illustrated below may look complicated, but it is worth your time to understand how they work, especially as our data sets become bigger and bigger. Example dataset 1 5 treatments with 5 replications each Possible SAS data entry code: Data Example1 Input Treatment Do Replication = 1 to 5 Input Output End Cards A B C D E A B C D E If this is scary, you can also paste the above table into Excel and manipulate it (again, by cutting and pasting and transposing, not by retyping numbers) to give you something like this: A 3.08 A 5.51 A 5.07 A 4.41 A 3.85 B 3.3 B 3.19 B 4.29 B 1.87 B 1.32 C 5.73 C 5.18 C 5.06 Lab 1.12
13 Once you are here, the SAS code is straightforward: Data Example1 Input Treatment Response Cards A 3.08 A E 2.91 E 2.58 C 3.96 C 3.74 D 1.87 D 3.3 D 2.64 D 3.08 D 3.85 E 2.25 E 4.78 E 3.13 E 2.91 E 2.58 The two approaches are equivalent, but as the data sets become bigger, the Excel manipulations needed for the second approach will become more and more cumbersome. Example data set 2 Combinations of treatments with 10 replications each Trt1A Trt1B Trt2A Trt2B Trt2C Trt2A Trt2B Trt2C Possible SAS data entry code: Data Example2 Do Trt1 = 1 to 2 Do Trt2 = 1 to 3 Do Rep = 1 to 10 Input Output End End End Lab 1.13
14 Cards Here we ve set up the input routine in such a way that we could just cut-and-paste the data table into SAS. No chance for typographical errors. Example data set 3 Each data point identified by four classification variables A1 A2 C1 C2 C3 C4 D1 D2 D3 D1 D2 D3 D1 D2 D3 D1 D2 D3 B B B B B B B B Possible SAS data entry code: Data Example3 Do ClassA = 1 to 2 Do ClassB = 1 to 4 Do ClassC = 1 to 4 Do ClassD = 1 to 3 Input Output End End End End Cards Lab 1.14
15 Voila! Without the Do-End loops, the same dataset would be five times as large because you would have to input the individual classification address for each and every data point (e.g. A2, B3, C2, D1). Again, this may seem unnecessary to you now but please take the time to learn it. And if you have any questions, just ask. Example data set 3 Each data point identified by four classification variables data read input score datalines The following statements invoke the TTEST procedure to test if the mean test score is equal to 30. proc ttest data=read h0=30 var score freq count run Lab 1.15
Laboratory Topics 1 & 2
PLS205 Lab 1 January 12, 2012 Laboratory Topics 1 & 2 Welcome, introduction, logistics, and organizational matters Introduction to SAS Writing and running programs; saving results; checking for errors
More informationSTAT 503 Fall Introduction to SAS
Getting Started Introduction to SAS 1) Download all of the files, sas programs (.sas) and data files (.dat) into one of your directories. I would suggest using your H: drive if you are using a computer
More informationTHE UNIVERSITY OF BRITISH COLUMBIA FORESTRY 430 and 533. Time: 50 minutes 40 Marks FRST Marks FRST 533 (extra questions)
THE UNIVERSITY OF BRITISH COLUMBIA FORESTRY 430 and 533 MIDTERM EXAMINATION: October 14, 2005 Instructor: Val LeMay Time: 50 minutes 40 Marks FRST 430 50 Marks FRST 533 (extra questions) This examination
More informationEXST3201 Mousefeed01 Page 1
EXST3201 Mousefeed01 Page 1 3 /* 4 Examine differences among the following 6 treatments 5 N/N85 fed normally before weaning and 85 kcal/wk after 6 N/R40 fed normally before weaning and 40 kcal/wk after
More informationSTAT:5400 Computing in Statistics
STAT:5400 Computing in Statistics Introduction to SAS Lecture 18 Oct 12, 2015 Kate Cowles 374 SH, 335-0727 kate-cowles@uiowaedu SAS SAS is the statistical software package most commonly used in business,
More informationSTAT:5201 Applied Statistic II
STAT:5201 Applied Statistic II Two-Factor Experiment (one fixed blocking factor, one fixed factor of interest) Randomized complete block design (RCBD) Primary Factor: Day length (short or long) Blocking
More informationTHIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010
THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL STOR 455 Midterm September 8, INSTRUCTIONS: BOTH THE EXAM AND THE BUBBLE SHEET WILL BE COLLECTED. YOU MUST PRINT YOUR NAME AND SIGN THE HONOR PLEDGE
More informationSoci Statistics for Sociologists
University of North Carolina Chapel Hill Soci708-001 Statistics for Sociologists Fall 2009 Professor François Nielsen Stata Commands for Module 7 Inference for Distributions For further information on
More informationThe SAS interface is shown in the following screen shot:
The SAS interface is shown in the following screen shot: There are several items of importance shown in the screen shot First there are the usual main menu items, such as File, Edit, etc I seldom use anything
More informationBaruch College STA Senem Acet Coskun
Baruch College STA 9750 BOOK BUY A Predictive Mode Senem Acet Coskun Table of Contents Summary 3 Why this topic? 4 Data Sources 6 Variable Definitions 7 Descriptive Statistics 8 Univariate Analysis 9 Two-Sample
More informationEXST SAS Lab Lab #6: More DATA STEP tasks
EXST SAS Lab Lab #6: More DATA STEP tasks Objectives 1. Working from an current folder 2. Naming the HTML output data file 3. Dealing with multiple observations on an input line 4. Creating two SAS work
More informationLand Cover Stratified Accuracy Assessment For Digital Elevation Model derived from Airborne LIDAR Dade County, Florida
Land Cover Stratified Accuracy Assessment For Digital Elevation Model derived from Airborne LIDAR Dade County, Florida FINAL REPORT Submitted October 2004 Prepared by: Daniel Gann Geographic Information
More informationLab #1: Introduction to Basic SAS Operations
Lab #1: Introduction to Basic SAS Operations Getting Started: OVERVIEW OF SAS (access lab pages at http://www.stat.lsu.edu/exstlab/) There are several ways to open the SAS program. You may have a SAS icon
More informationLab #9: ANOVA and TUKEY tests
Lab #9: ANOVA and TUKEY tests Objectives: 1. Column manipulation in SAS 2. Analysis of variance 3. Tukey test 4. Least Significant Difference test 5. Analysis of variance with PROC GLM 6. Levene test for
More informationIntroductory Guide to SAS:
Introductory Guide to SAS: For UVM Statistics Students By Richard Single Contents 1 Introduction and Preliminaries 2 2 Reading in Data: The DATA Step 2 2.1 The DATA Statement............................................
More informationSTA 570 Spring Lecture 5 Tuesday, Feb 1
STA 570 Spring 2011 Lecture 5 Tuesday, Feb 1 Descriptive Statistics Summarizing Univariate Data o Standard Deviation, Empirical Rule, IQR o Boxplots Summarizing Bivariate Data o Contingency Tables o Row
More informationStat 302 Statistical Software and Its Applications SAS: Data I/O
Stat 302 Statistical Software and Its Applications SAS: Data I/O Yen-Chi Chen Department of Statistics, University of Washington Autumn 2016 1 / 33 Getting Data Files Get the following data sets from the
More informationCluster Randomization Create Cluster Means Dataset
Chapter 270 Cluster Randomization Create Cluster Means Dataset Introduction A cluster randomization trial occurs when whole groups or clusters of individuals are treated together. Examples of such clusters
More information* Sample SAS program * Data set is from Dean and Voss (1999) Design and Analysis of * Experiments. Problem 3, page 129.
SAS Most popular Statistical software worldwide. SAS claims that its products are used at over 40,000 sites, including at 90% of the Fortune 500. This will not be all SAS as they make other products, such
More informationPrepare a stem-and-leaf graph for the following data. In your final display, you should arrange the leaves for each stem in increasing order.
Chapter 2 2.1 Descriptive Statistics A stem-and-leaf graph, also called a stemplot, allows for a nice overview of quantitative data without losing information on individual observations. It can be a good
More informationSPSS. (Statistical Packages for the Social Sciences)
Inger Persson SPSS (Statistical Packages for the Social Sciences) SHORT INSTRUCTIONS This presentation contains only relatively short instructions on how to perform basic statistical calculations in SPSS.
More informationAn introduction to SPSS
An introduction to SPSS To open the SPSS software using U of Iowa Virtual Desktop... Go to https://virtualdesktop.uiowa.edu and choose SPSS 24. Contents NOTE: Save data files in a drive that is accessible
More informationIntermediate SAS: Statistics
Intermediate SAS: Statistics OIT TSS 293-4444 oithelp@mail.wvu.edu oit.wvu.edu/training/classmat/sas/ Table of Contents Procedures... 2 Two-sample t-test:... 2 Paired differences t-test:... 2 Chi Square
More informationCentering and Interactions: The Training Data
Centering and Interactions: The Training Data A random sample of 150 technical support workers were first given a test of their technical skill and knowledge, and then randomly assigned to one of three
More information2) familiarize you with a variety of comparative statistics biologists use to evaluate results of experiments;
A. Goals of Exercise Biology 164 Laboratory Using Comparative Statistics in Biology "Statistics" is a mathematical tool for analyzing and making generalizations about a population from a number of individual
More informationCSC 328/428 Summer Session I 2002 Data Analysis for the Experimenter FINAL EXAM
options pagesize=53 linesize=76 pageno=1 nodate; proc format; value $stcktyp "1"="Growth" "2"="Combined" "3"="Income"; data invstmnt; input stcktyp $ perform; label stkctyp="type of Stock" perform="overall
More informationIntroduction to Statistical Analyses in SAS
Introduction to Statistical Analyses in SAS Programming Workshop Presented by the Applied Statistics Lab Sarah Janse April 5, 2017 1 Introduction Today we will go over some basic statistical analyses in
More information5b. Descriptive Statistics - Part II
5b. Descriptive Statistics - Part II In this lab we ll cover how you can calculate descriptive statistics that we discussed in class. We also learn how to summarize large multi-level databases efficiently,
More informationCut Out The Cut And Paste: SAS Macros For Presenting Statistical Output ABSTRACT INTRODUCTION
Cut Out The Cut And Paste: SAS Macros For Presenting Statistical Output Myungshin Oh, UCLA Department of Biostatistics Mel Widawski, UCLA School of Nursing ABSTRACT We, as statisticians, often spend more
More informationAverages and Variation
Averages and Variation 3 Copyright Cengage Learning. All rights reserved. 3.1-1 Section 3.1 Measures of Central Tendency: Mode, Median, and Mean Copyright Cengage Learning. All rights reserved. 3.1-2 Focus
More informationST Lab 1 - The basics of SAS
ST 512 - Lab 1 - The basics of SAS What is SAS? SAS is a programming language based in C. For the most part SAS works in procedures called proc s. For instance, to do a correlation analysis there is proc
More informationMinitab Guide for MA330
Minitab Guide for MA330 The purpose of this guide is to show you how to use the Minitab statistical software to carry out the statistical procedures discussed in your textbook. The examples usually are
More informationSAS Training Spring 2006
SAS Training Spring 2006 Coxe/Maner/Aiken Introduction to SAS: This is what SAS looks like when you first open it: There is a Log window on top; this will let you know what SAS is doing and if SAS encountered
More informationThe Power and Sample Size Application
Chapter 72 The Power and Sample Size Application Contents Overview: PSS Application.................................. 6148 SAS Power and Sample Size............................... 6148 Getting Started:
More informationChoosing the Right Procedure
3 CHAPTER 1 Choosing the Right Procedure Functional Categories of Base SAS Procedures 3 Report Writing 3 Statistics 3 Utilities 4 Report-Writing Procedures 4 Statistical Procedures 6 Available Statistical
More informationEnter your UID and password. Make sure you have popups allowed for this site.
Log onto: https://apps.csbs.utah.edu/ Enter your UID and password. Make sure you have popups allowed for this site. You may need to go to preferences (right most tab) and change your client to Java. I
More informationStat 302 Statistical Software and Its Applications SAS: Data I/O & Descriptive Statistics
Stat 302 Statistical Software and Its Applications SAS: Data I/O & Descriptive Statistics Fritz Scholz Department of Statistics, University of Washington Winter Quarter 2015 February 19, 2015 2 Getting
More informationTable Of Contents. Table Of Contents
Statistics Table Of Contents Table Of Contents Basic Statistics... 7 Basic Statistics Overview... 7 Descriptive Statistics Available for Display or Storage... 8 Display Descriptive Statistics... 9 Store
More informationTable of Contents (As covered from textbook)
Table of Contents (As covered from textbook) Ch 1 Data and Decisions Ch 2 Displaying and Describing Categorical Data Ch 3 Displaying and Describing Quantitative Data Ch 4 Correlation and Linear Regression
More informationRepeated Measures Part 4: Blood Flow data
Repeated Measures Part 4: Blood Flow data /* bloodflow.sas */ options linesize=79 pagesize=100 noovp formdlim='_'; title 'Two within-subjecs factors: Blood flow data (NWK p. 1181)'; proc format; value
More informationMath 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency
Math 1 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency lowest value + highest value midrange The word average: is very ambiguous and can actually refer to the mean,
More informationChapter 3: Data Description - Part 3. Homework: Exercises 1-21 odd, odd, odd, 107, 109, 118, 119, 120, odd
Chapter 3: Data Description - Part 3 Read: Sections 1 through 5 pp 92-149 Work the following text examples: Section 3.2, 3-1 through 3-17 Section 3.3, 3-22 through 3.28, 3-42 through 3.82 Section 3.4,
More informationChapter 2. Descriptive Statistics: Organizing, Displaying and Summarizing Data
Chapter 2 Descriptive Statistics: Organizing, Displaying and Summarizing Data Objectives Student should be able to Organize data Tabulate data into frequency/relative frequency tables Display data graphically
More informationChapter 3 - Displaying and Summarizing Quantitative Data
Chapter 3 - Displaying and Summarizing Quantitative Data 3.1 Graphs for Quantitative Data (LABEL GRAPHS) August 25, 2014 Histogram (p. 44) - Graph that uses bars to represent different frequencies or relative
More informationExercise 1: Introduction to Stata
Exercise 1: Introduction to Stata New Stata Commands use describe summarize stem graph box histogram log on, off exit New Stata Commands Downloading Data from the Web I recommend that you use Internet
More informationReading data in SAS and Descriptive Statistics
P8130 Recitation 1: Reading data in SAS and Descriptive Statistics Zilan Chai Sep. 18 th /20 th 2017 Outline Intro to SAS (windows, basic rules) Getting Data into SAS Descriptive Statistics SAS Windows
More informationRobust Linear Regression (Passing- Bablok Median-Slope)
Chapter 314 Robust Linear Regression (Passing- Bablok Median-Slope) Introduction This procedure performs robust linear regression estimation using the Passing-Bablok (1988) median-slope algorithm. Their
More informationBIOSTATISTICS LABORATORY PART 1: INTRODUCTION TO DATA ANALYIS WITH STATA: EXPLORING AND SUMMARIZING DATA
BIOSTATISTICS LABORATORY PART 1: INTRODUCTION TO DATA ANALYIS WITH STATA: EXPLORING AND SUMMARIZING DATA Learning objectives: Getting data ready for analysis: 1) Learn several methods of exploring the
More informationCreating a data file and entering data
4 Creating a data file and entering data There are a number of stages in the process of setting up a data file and analysing the data. The flow chart shown on the next page outlines the main steps that
More informationApplied Regression Modeling: A Business Approach
i Applied Regression Modeling: A Business Approach Computer software help: SAS code SAS (originally Statistical Analysis Software) is a commercial statistical software package based on a powerful programming
More informationDepending on the computer you find yourself in front of, here s what you ll need to do to open SPSS.
1 SPSS 11.5 for Windows Introductory Assignment Material covered: Opening an existing SPSS data file, creating new data files, generating frequency distributions and descriptive statistics, obtaining printouts
More informationChoosing the Right Procedure
3 CHAPTER 1 Choosing the Right Procedure Functional Categories of Base SAS Procedures 3 Report Writing 3 Statistics 3 Utilities 4 Report-Writing Procedures 4 Statistical Procedures 5 Efficiency Issues
More informationIt s Proc Tabulate Jim, but not as we know it!
Paper SS02 It s Proc Tabulate Jim, but not as we know it! Robert Walls, PPD, Bellshill, UK ABSTRACT PROC TABULATE has received a very bad press in the last few years. Most SAS Users have come to look on
More informationNotes on Simulations in SAS Studio
Notes on Simulations in SAS Studio If you are not careful about simulations in SAS Studio, you can run into problems. In particular, SAS Studio has a limited amount of memory that you can use to write
More informationAssignment 5.5. Nothing here to hand in
Assignment 5.5 Nothing here to hand in Load the tidyverse before we start: library(tidyverse) ## Loading tidyverse: ggplot2 ## Loading tidyverse: tibble ## Loading tidyverse: tidyr ## Loading tidyverse:
More informationChapter 6: DESCRIPTIVE STATISTICS
Chapter 6: DESCRIPTIVE STATISTICS Random Sampling Numerical Summaries Stem-n-Leaf plots Histograms, and Box plots Time Sequence Plots Normal Probability Plots Sections 6-1 to 6-5, and 6-7 Random Sampling
More informationMultivariate Capability Analysis
Multivariate Capability Analysis Summary... 1 Data Input... 3 Analysis Summary... 4 Capability Plot... 5 Capability Indices... 6 Capability Ellipse... 7 Correlation Matrix... 8 Tests for Normality... 8
More informationWithin-Cases: Multivariate approach part one
Within-Cases: Multivariate approach part one /* sleep2.sas */ options linesize=79 noovp formdlim=' '; title "Student's Sleep data: Matched t-tests with proc reg"; data bedtime; infile 'studentsleep.data'
More informationChapter 2 Describing, Exploring, and Comparing Data
Slide 1 Chapter 2 Describing, Exploring, and Comparing Data Slide 2 2-1 Overview 2-2 Frequency Distributions 2-3 Visualizing Data 2-4 Measures of Center 2-5 Measures of Variation 2-6 Measures of Relative
More informationFactorial ANOVA with SAS
Factorial ANOVA with SAS /* potato305.sas */ options linesize=79 noovp formdlim='_' ; title 'Rotten potatoes'; title2 ''; proc format; value tfmt 1 = 'Cool' 2 = 'Warm'; data spud; infile 'potato2.data'
More informationIntroductory SAS example
Introductory SAS example STAT:5201 1 Introduction SAS is a command-driven statistical package; you enter statements in SAS s language, submit them to SAS, and get output. A fairly friendly user interface
More informationMinitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D.
Minitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D. Introduction to Minitab The interface for Minitab is very user-friendly, with a spreadsheet orientation. When you first launch Minitab, you will see
More informationChemical Reaction dataset ( https://stat.wvu.edu/~cjelsema/data/chemicalreaction.txt )
JMP Output from Chapter 9 Factorial Analysis through JMP Chemical Reaction dataset ( https://stat.wvu.edu/~cjelsema/data/chemicalreaction.txt ) Fitting the Model and checking conditions Analyze > Fit Model
More informationAssignment 0. Nothing here to hand in
Assignment 0 Nothing here to hand in The questions here have solutions attached. Follow the solutions to see what to do, if you cannot otherwise guess. Though there is nothing here to hand in, it is very
More informationExample how not to do it: JMP in a nutshell 1 HR, 17 Apr Subject Gender Condition Turn Reactiontime. A1 male filler
JMP in a nutshell 1 HR, 17 Apr 2018 The software JMP Pro 14 is installed on the Macs of the Phonetics Institute. Private versions can be bought from
More informationSTAT 7000: Experimental Statistics I
STAT 7000: Experimental Statistics I 2. A Short SAS Tutorial Peng Zeng Department of Mathematics and Statistics Auburn University Fall 2009 Peng Zeng (Auburn University) STAT 7000 Lecture Notes Fall 2009
More informationSTATS PAD USER MANUAL
STATS PAD USER MANUAL For Version 2.0 Manual Version 2.0 1 Table of Contents Basic Navigation! 3 Settings! 7 Entering Data! 7 Sharing Data! 8 Managing Files! 10 Running Tests! 11 Interpreting Output! 11
More informationExample 5.25: (page 228) Screenshots from JMP. These examples assume post-hoc analysis using a Protected LSD or Protected Welch strategy.
JMP Output from Chapter 5 Factorial Analysis through JMP Example 5.25: (page 228) Screenshots from JMP. These examples assume post-hoc analysis using a Protected LSD or Protected Welch strategy. Fitting
More informationStatistical Good Practice Guidelines. 1. Introduction. Contents. SSC home Using Excel for Statistics - Tips and Warnings
Statistical Good Practice Guidelines SSC home Using Excel for Statistics - Tips and Warnings On-line version 2 - March 2001 This is one in a series of guides for research and support staff involved in
More informationExcel Tips and FAQs - MS 2010
BIOL 211D Excel Tips and FAQs - MS 2010 Remember to save frequently! Part I. Managing and Summarizing Data NOTE IN EXCEL 2010, THERE ARE A NUMBER OF WAYS TO DO THE CORRECT THING! FAQ1: How do I sort my
More informationSAS/STAT 13.1 User s Guide. The Power and Sample Size Application
SAS/STAT 13.1 User s Guide The Power and Sample Size Application This document is an individual chapter from SAS/STAT 13.1 User s Guide. The correct bibliographic citation for the complete manual is as
More informationLab 5 - Risk Analysis, Robustness, and Power
Type equation here.biology 458 Biometry Lab 5 - Risk Analysis, Robustness, and Power I. Risk Analysis The process of statistical hypothesis testing involves estimating the probability of making errors
More informationEXAMS IN THE GENESIS GRADEBOOK
EXAMS IN THE GENESIS GRADEBOOK I. Introduction to Exams in the Genesis Gradebook II. Steps to Grading Exams in Genesis III. Setting Up Exams A. Selecting the Averaging Method for an Exam B. Adding Sections
More informationAnalysis of variance and regression. November 13, 2007
Analysis of variance and regression November 13, 2007 SAS language The SAS environments Reading in, data-step Summary statistics Subsetting data More on reading in, missing values Combination of data sets
More informationAnalysis of variance and regression. November 13, 2007
Analysis of variance and regression November 13, 2007 SAS language The SAS environments Reading in, data-step Summary statistics Subsetting data More on reading in, missing values Combination of data sets
More information14.2 Do Two Distributions Have the Same Means or Variances?
14.2 Do Two Distributions Have the Same Means or Variances? 615 that this is wasteful, since it yields much more information than just the median (e.g., the upper and lower quartile points, the deciles,
More informationCREATING THE DISTRIBUTION ANALYSIS
Chapter 12 Examining Distributions Chapter Table of Contents CREATING THE DISTRIBUTION ANALYSIS...176 BoxPlot...178 Histogram...180 Moments and Quantiles Tables...... 183 ADDING DENSITY ESTIMATES...184
More informationAPPENDIX. Appendix 2. HE Staining Examination Result: Distribution of of BALB/c
APPENDIX Appendix 2. HE Staining Examination Result: Distribution of of BALB/c mice nucleus liver cells changes in percents between control group and intervention groups. Descriptives Groups Statistic
More informationSurvey of Math: Excel Spreadsheet Guide (for Excel 2016) Page 1 of 9
Survey of Math: Excel Spreadsheet Guide (for Excel 2016) Page 1 of 9 Contents 1 Introduction to Using Excel Spreadsheets 2 1.1 A Serious Note About Data Security.................................... 2 1.2
More informationINTRODUCTION SAS Prepared by A. B. Billings West Virginia University May 1999 (updated August 2006)
INTRODUCTION To SAS Prepared by A. B. Billings West Virginia University May 1999 (updated August 2006) 1 Getting Started with SAS SAS stands for Statistical Analysis System. SAS is a computer software
More informationT-test og variansanalyse i SAS. T-test og variansanalyse i SAS p.1/18
T-test og variansanalyse i SAS T-test og variansanalyse i SAS p.1/18 T-test og variansanalyse i SAS T-test (Etstik, tostik, parrede observationer) Variansanalyse SAS-procedurer: PROC TTEST PROC GLM T-test
More informationExcel 2010 with XLSTAT
Excel 2010 with XLSTAT J E N N I F E R LE W I S PR I E S T L E Y, PH.D. Introduction to Excel 2010 with XLSTAT The layout for Excel 2010 is slightly different from the layout for Excel 2007. However, with
More informationLecture 1 Getting Started with SAS
SAS for Data Management, Analysis, and Reporting Lecture 1 Getting Started with SAS Portions reproduced with permission of SAS Institute Inc., Cary, NC, USA Goals of the course To provide skills required
More informationNonparametric Testing
Nonparametric Testing in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. mark@excelmasterseries.com
More informationCHAPTER 1. Introduction. Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data.
1 CHAPTER 1 Introduction Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data. Variable: Any characteristic of a person or thing that can be expressed
More informationChapter 5: The beast of bias
Chapter 5: The beast of bias Self-test answers SELF-TEST Compute the mean and sum of squared error for the new data set. First we need to compute the mean: + 3 + + 3 + 2 5 9 5 3. Then the sum of squared
More informationSTAT 2607 REVIEW PROBLEMS Word problems must be answered in words of the problem.
STAT 2607 REVIEW PROBLEMS 1 REMINDER: On the final exam 1. Word problems must be answered in words of the problem. 2. "Test" means that you must carry out a formal hypothesis testing procedure with H0,
More informationStatistical Methods. Instructor: Lingsong Zhang. Any questions, ask me during the office hour, or me, I will answer promptly.
Statistical Methods Instructor: Lingsong Zhang 1 Issues before Class Statistical Methods Lingsong Zhang Office: Math 544 Email: lingsong@purdue.edu Phone: 765-494-7913 Office Hour: Monday 1:00 pm - 2:00
More informationMean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242
Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242 Creation & Description of a Data Set * 4 Levels of Measurement * Nominal, ordinal, interval, ratio * Variable Types
More information1 Introduction to Using Excel Spreadsheets
Survey of Math: Excel Spreadsheet Guide (for Excel 2007) Page 1 of 6 1 Introduction to Using Excel Spreadsheets This section of the guide is based on the file (a faux grade sheet created for messing with)
More informationST706: Spring One-Way Random effects example.
ST706: Spring 2013. One-Way Random effects example. DATA IS FROM CONTROL ARM OF SKIN CANCER STUDY AT DARTMOUTH. DC0 TO DC5 ARE MEASURES OF DIETARY INTAKE OVER 6 YEARS. For illustration here we treat the
More informationMore Summer Program t-shirts
ICPSR Blalock Lectures, 2003 Bootstrap Resampling Robert Stine Lecture 2 Exploring the Bootstrap Questions from Lecture 1 Review of ideas, notes from Lecture 1 - sample-to-sample variation - resampling
More informationData can be in the form of numbers, words, measurements, observations or even just descriptions of things.
+ What is Data? Data is a collection of facts. Data can be in the form of numbers, words, measurements, observations or even just descriptions of things. In most cases, data needs to be interpreted and
More informationPage 1. Graphical and Numerical Statistics
TOPIC: Description Statistics In this tutorial, we show how to use MINITAB to produce descriptive statistics, both graphical and numerical, for an existing MINITAB dataset. The example data come from Exercise
More informationStatistics and Data Analysis. Common Pitfalls in SAS Statistical Analysis Macros in a Mass Production Environment
Common Pitfalls in SAS Statistical Analysis Macros in a Mass Production Environment Huei-Ling Chen, Merck & Co., Inc., Rahway, NJ Aiming Yang, Merck & Co., Inc., Rahway, NJ ABSTRACT Four pitfalls are commonly
More informationQuantitative - One Population
Quantitative - One Population The Quantitative One Population VISA procedures allow the user to perform descriptive and inferential procedures for problems involving one population with quantitative (interval)
More informationOnline Supplementary Appendix for. Dziak, Nahum-Shani and Collins (2012), Multilevel Factorial Experiments for Developing Behavioral Interventions:
Online Supplementary Appendix for Dziak, Nahum-Shani and Collins (2012), Multilevel Factorial Experiments for Developing Behavioral Interventions: Power, Sample Size, and Resource Considerations 1 Appendix
More informationEksamen ERN4110, 6/ VEDLEGG SPSS utskrifter til oppgavene (Av plasshensyn kan utskriftene være noe redigert)
Eksamen ERN4110, 6/9-2018 VEDLEGG SPSS utskrifter til oppgavene (Av plasshensyn kan utskriftene være noe redigert) 1 Oppgave 1 Datafila I SPSS: Variabelnavn Beskrivelse Kjønn Kjønn (1=Kvinne, 2=Mann) Studieinteresse
More informationSection 4 General Factorial Tutorials
Section 4 General Factorial Tutorials General Factorial Part One: Categorical Introduction Design-Ease software version 6 offers a General Factorial option on the Factorial tab. If you completed the One
More informationFactorial ANOVA. Skipping... Page 1 of 18
Factorial ANOVA The potato data: Batches of potatoes randomly assigned to to be stored at either cool or warm temperature, infected with one of three bacterial types. Then wait a set period. The dependent
More information