Stat-340 Term Test Spring Term


Part 1 - Multiple Choice

Enter your answers to the multiple choice questions on the provided bubble sheets. Each multiple choice question is worth 1 mark; there is no correction for guessing. Be sure your student name and number are completed on the bubble sheets.

1. How many observations and variables are contained in the following dataset?

    data blah;
       infile datalines;
       length name $10 sex $1 partnername $10 partnersex $1;
       input name $ sex $ / partnername $ partnersex $;
       datalines;
    Carl M
    Lois F
    Matthew M
    Fred M
    Selina F
    David M
    Tim M
    Kim .
    ;;;;

(a) 8 observations, 2 variables.
(b) 4 observations, 4 variables.
(c) 8 observations, 4 variables.
(d) 3 observations, 4 variables.
(e) 4 observations, 2 variables.

Solution: (b)
Option A - 14% chose. Notice the slash in the input statement, which makes SAS go to a new line for the last 2 variables.
Option B - 60% chose.
Option C - 24% chose. See (a).
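The effect of the slash in question 1 can be checked directly in SAS. The following is a minimal sketch, not part of the original test: the dataset name pairs and the four data lines are made up for illustration. Proc Contents reports the number of observations and variables.

```sas
/* Hypothetical data: each observation spans two input lines */
data pairs;
   infile datalines;
   length name $10 sex $1 partnername $10 partnersex $1;
   /* the slash moves the input pointer to the next line */
   input name $ sex $ / partnername $ partnersex $;
   datalines;
Ann F
Bob M
Cat F
Dan M
;;;;
run;

/* 4 input lines are read two at a time, so expect 2 observations and 4 variables */
proc contents data=pairs;
run;
```

The same reasoning applied to the 8 data lines in question 1 gives 4 observations and 4 variables.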

2. Which of the following is TRUE about BY group processing?

(a) A different analysis can be performed for each BY group.
(b) The BY variable must be a numeric or date variable.
(c) The data does not have to be grouped together by values of the BY variables.
(d) The BY groups can have different numbers of variables.
(e) BY group processing can be done for any procedure.

Solution: (e)
Option D - 11% chose. All of the BY groups are subsets of the data and so have the same number of variables.
Option E - 88% chose. BY variables can be any type.

3. Which of the following is correct about the standard error of a statistic?

(a) The se measures how much the sample size changes in a simulation study.
(b) The se measures the standard deviation of the population slope over bootstrap samples from the data.
(c) The se measures the standard deviation of the Gini-estimate of the standard deviation between different populations.
(d) The se measures the increase in the number of calories for each additional gram of fat.
(e) The se measures how much a statistic will vary when new samples are taken from a population.

Solution: (e)
Option A - The SE never measures variation in the sample size.
Option B - Population parameters (slopes) are fixed and do not vary.
Option C - This doesn't even make sense - there is only one population.
Option D - This is the definition of a slope and not a standard error.
Option E - 95% chose.

4. Consider the following segment of code:

    data birthdays;
       infile datalines;
       length name $30;
       input name $ bdate:yymmdd10.;
       format bdate mmddyy8.;
       datalines;
    carl 63/02/01
    lois 48/14/02
    fred 58/06/03
    tim 52/07/04
    dave 63/12/31
    ;;;;
    proc print data=birthdays;

© 2015 Carl James Schwarz

Which of the following is correct?

(a) The birth day for Carl will be displayed as 02/01/63.
(b) The birth day for Lois will be displayed as 14/02/48.
(c) The birth day for Fred will be displayed as ;
(d) The birth day for Tim will be displayed as 04/07/1952.
(e) The birth day for Dave will be displayed as a missing value.

Solution: (a)
Option A - 73% chose.
Option D - 14% chose. The mmddyy8. output format only has length 8, so it shows 2-digit years.

5. Consider the following code:

    data blah;
       infile datalines;
       length name sex $10.;
       input name sex age weight;
       if age > 30 then delete;
       drop weight;
       datalines;
    A f B m C F D M E F. 43
    ;;;;

Which of the following is correct?

(a) The blah dataset has 5 observations and 4 variables.
(b) The blah dataset has 4 observations and 4 variables.
(c) The blah dataset has 3 observations and 3 variables.
(d) The blah dataset has 4 observations and 3 variables.
(e) The blah dataset has 5 observations and 3 variables.

Solution: (d)
Option B - 15% chose. The Drop statement removes a variable.
Option D - 65% chose.
Option E - 11% chose. The If statement removes an observation.

6. Which of the following is correct?

(a) PROC GLM is used to test hypotheses about population mean proportions.
(b) PROC FREQ is used to test hypotheses about sample proportions.

(c) PROC REG is used to test hypotheses about population slopes.
(d) PROC GENMOD is used to test hypotheses about sample proportions.
(e) PROC TTEST is used to test hypotheses about paired sample means.

Solution: (c)
Option A - 14% chose. There is no such thing as a MEAN proportion!
Option C - 61% chose.
Option D - 10% chose. Hypotheses are ALWAYS about POPULATION parameters, not sample statistics.
Option E - 15% chose. Hypotheses are ALWAYS about POPULATION parameters, not sample statistics.

7. Consider the following SAS code:

    data blah;
       infile datalines dlm=',' dsd missover;
       input v1 v2 v3 v4 v5 v6;
       datalines;
    1,,2,3,4,5,6,7,8,9
    2,3,.,5,6,7,8,9,0
    3,4,5,.,6,7,8,9,0,1,2
    9,8,7,,6,5,4,3,2,1,0
    7,,6,,5,,9,,4,,3,,1,,
    ;;;;

Which of the following is correct?

(a) The value of v2 in the first observation is 2.
(b) The value of v3 in the second observation is 5.
(c) The value of v4 in the third observation is missing.
(d) The value of v6 in the fourth observation is 4.
(e) The value of v3 in the fifth observation is 5.

Solution: (c)
Option C - 95% chose.

8. Consider the following SAS code:

    data blah;
       infile datalines;
       length surname $10 sex $1;
       input surname sex age;
       datalines;
    schwarz m 56

    schwarz f 53
    zhao f 48
    zhao m 52
    sun m 27
    chao f 23
    chao m 27
    ;;;;
    proc sort data=blah;
       by surname;
    proc transpose data=blah out=transblah;
       by surname;
       var age;
       id sex;

Which of the following is correct?

(a) The resulting transblah dataset has 3 observations.
(b) The value of the variable M for the first observation in the transblah dataset is 56.
(c) The observation for surname Sun will have the value of 27 for the ages of both sexes.
(d) The value of the variable F for the last observation in the transblah dataset is 23.
(e) The 4th observation in the transblah dataset will have 52 as the value for the M variable.

Solution: (e)
Option A - 16% chose. There are 4 distinct values for the Surname variable, so the resulting dataset will have 4 observations.
Option B - 28% chose. Don't forget to sort before transposing.
Option C - 11% chose. Because Sun does not have a complete set of variables, the missing variables will be set to missing.
Option E - 39% chose.

9. Consider the following SAS code:

    proc tabulate data=accidents missing;
       class month Accident_Severity;
       var fatality;
       table Accident_Severity ALL, month*fatality*mean*f=7.2;

Which of the following is correct?

(a) The Accident_Severity variable will be along the top of the table (the columns).
(b) The mean number of fatalities in each month and Accident_Severity will be found.
(c) Each row of the table will correspond to a different value of the Accident_Severity variable, with the final row a summary over all codes.

(d) The missing option on the Proc statement ensures that missing values are ignored during the tabulation.
(e) If the Accident_Severity variable had 3 levels, and if the month variable had 12 levels, the table would have 36 cells.

Solution: (c)
Option B - 40% chose. Month is not used in the Table statement.
Option C - 35% chose.
Option D - 12% chose. The missing option also tabulates the missing values.
Option E - 12% chose. The ALL option will generate a row at the end for all codes.

10. Consider the following piece of SAS code:

    data blah;
       infile datalines;
       length name $10 sex $1;
       input name sex YearOfBirth;
       Age = 2015 - YearOfBirth;
       datalines;
    Carl M 1956
    Lois Fred..
    Matthew M 1926
    Marianne F -1
    David M 1922
    Julia F 2016
    ;;;;

Which of the following is correct?

(a) The computed value of Age for Carl is 59.
(b) The computed value of Age for Fred is 0.
(c) The computed value of Age for Marianne is missing.
(d) The computed value of Age for David is
(e) The computed value of Age for Julia is missing.

Solution: (a)
Option A - 96% chose.

Part II - Long Answer

Stat Spring Term - Term Test 1

Name
Student Number:

Put your name and student number on the upper right of each of the following pages as well, in case the pages get separated. Answer the following questions in the space provided. Be sure that your answers are legible. The marks given to these questions are 5, 6, 3, 4, and 7 respectively.

1. Interpretation - 5 Marks:

Consider the following output from an analysis of the cereal dataset:

Write a SHORT paragraph here summarizing the results.

Solution: The relationship between the calories/serving and the grams of fat/serving was investigated using linear regression (Figure 1). The fitted equation is Calories = (Fat). There was strong evidence that the slope is different from 0 (p < .0001). For every additional gram of fat, the calories/serving is expected to increase by 9.8 (SE 2.2) calories/gram of fat.

Common problems in solutions from students include:

- Reporting too many decimal places. Seldom do you need to report more than two significant digits.
- The intercept is usually not of interest, so you don't usually spend any time discussing it.
- The whole point of regression is to estimate the slope, so the discussion needs to be about the slope. Many students discussed differences in means (which is not sensible), or differences in the mean among groups, which is again not sensible. These students were likely confusing regression with ANOVA.
- Don't just give the table values as facts; add some interpretation to the information in the table. For example, many students had sentences such as "The parameter estimate for Fat was 9.8. The standard error was 2.2. The t-value was 4.44 and the p-value was <.0001 so we rejected the null hypothesis." These types of sentences provide no useful information to the reader over and above the table.

2. Reading and Recodes - 6 Marks:

The csv file named atus.csv contains the following fields on television viewing from the American Time of Use Study.

- ID Number
- Name (up to 30 characters)
- Sex (single letter code)
- Age at time of interview. For example, 26y3m indicates the subject was 26 years and 3 months old.
- Number of minutes of television watched.

The first few lines of the data file are as follows:

    ID, name, sex, age, tvmin
    123ABCDEF, Schwarz, m, 58y10m, 20
    LJD1234LJ, Lank, m, 61y2m,
    LLJJ, Swartz, F, 21y10m, 75
    LLKD2343K, Duncan, f, 87y2m, 150
    OUEROE, Smith, f, 8y2m, 236

Write SAS code to do the following:

- Read in the data from the csv file as noted above.
- Convert the year/month age data to a decimal year, e.g. 26y3m is converted to 26.25 years (3 months is 1/4 of a year).
- Recode the sex variable. Either f or F is recoded as female; either m or M is recoded as male; other values are recoded as illegal sex.
- Recode the decimal age to 3 age classes. Ages (including 16 but excluding 25) are recoded to 16-24; ages (including 25 but excluding 40) are recoded to 25-39; ages (including 40 but excluding 70) are recoded to 40-69. Other ages are recoded to out of frame.
- Check your recodes for both sex and age using appropriate procedures.

Put your SAS code here and the page overleaf (if needed)

One possible solution

    data atus;
       infile datalines dlm=',' dsd missover firstobs=2;  /* Need dsd, dlm and firstobs= */
       length id $10 name $20 sex $1 cage $10;
       length cagey cagem $10;          /* temporary character values */
       length newsex $10 ageclass $20;  /* recoded values need longer lengths */
       input id $ name $ sex $ cage $ minutes;

       /* convert input age to decimal age */
       wherey = index(cage, "y");                        /* where is the y */
       cagey  = substr(cage, 1, wherey-1);               /* extract the age in years */
       agey   = input(cagey, f30.0);                     /* convert the years to a number */
       wherem = index(cage, "m");                        /* where is the m */
       cagem  = substr(cage, wherey+1, wherem-wherey-1); /* extract the months */
       agem   = input(cagem, f30.0);                     /* convert the months to a number */
       age    = agey + agem/12;                          /* make decimal age */

       /* recode the sex */
       sex = upcase(sex);   /* convert to upper case */
       newsex = 'illegal';
       if sex = "F" then newsex = 'female';
       if sex = "M" then newsex = 'male';

       /* recode the age classes */
       ageclass = 'out of frame';
       if 16 <= age < 25 then ageclass = '16-24';
       if 25 <= age < 40 then ageclass = '25-39';
       if 40 <= age < 70 then ageclass = '40-69';
       datalines;
    ID, name, sex, age, tvmin
    123ABCDEF, Schwarz, m, 58y10m, 20
    LJD1234LJ, Lank, m, 61y2m,
    LLJJ, Swartz, F, 21y10m, 75
    LLKD2343K, Duncan, f, 87y2m, 150
    OUEROE, Smith, f, 8y2m, 236
    ;;;;

    proc print data=atus;
       title2 'Data after coding';

    /* check the recodes */
    /* You need to use Proc Tabulate/SGplot and compare the OLD values to the NEW values */
    proc tabulate data=atus missing;
       title2 'check the recodes';
       class sex newsex age ageclass;
       table sex, newsex*n*f=5.0;    /* check sex coding */
       table age, ageclass*n*f=5.0;  /* possible but very long table */

    /* because age is a continuous variable, it is better to use sgplot to check the recodes */
    proc sgplot data=atus;
       title2 'check the recodes for age';
       scatter x=ageclass y=age;

Comments about student responses:

- I've used the Datalines option, but you could replace it with the actual file named atus.csv. You could use Proc Import as well to read in the data using

      proc import file='atus.csv' out=atus replace;

- Many students didn't use/forgot to correct for upper/lower case of the gender values. Rather than

      if gender = 'f' then gender = 'F';
      if gender = 'm' then gender = 'M';

  use the upcase() function directly as shown above.

- Be careful of code such as

      data blah;
         length sex $1;
         input sex;
         if sex = 'f' then sex = 'female';

  Because sex is defined with length 1, the new value of female gets truncated to 1 character. So you either have to define sex with a longer length, or define a new variable (as I did above) with a longer length to hold the new values.

- Be careful of code such as

      data blah;
         ...
         if 16 <= age < 25 then age = '16-24';

  Here you are using the age variable as both character and numeric. This won't work. You likely want a separate character variable for the age class, as I did in my solution.

- Some students assumed that the month always started in the 4th position. It may not. See the solution above for a completely general solution. Always try to code in the most general fashion possible so that it works in all cases.

- Using Proc Print to check your recodes is not sufficient, as you will only be able to check if the recoding worked for the first few records. You need to use Proc Tabulate and Proc SGplot as shown above and as was done in our assignments.

- Notice that

      proc tabulate data=blah;
         class newsex;
         table newsex;

  doesn't provide enough information to see that the old sex values have been properly recoded to the newsex variable. See the solution above.

- Some students tried code along the lines of

      data blah;
         infile ...
         input ... age yyymmm;

  There is no informat in SAS to handle this case and you need to use the methods as shown above. The only useful informats needed are for dates, times, and datetime values.
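As an alternative to locating the y and m with index() and substr(), the age string can also be split with the scan() function. This is a sketch under stated assumptions, not the method used in the solution: the dataset name agecheck and the three test values are hypothetical, and the age string is assumed to always have the form <years>y<months>m in lower case.

```sas
/* Hypothetical test values for the age string */
data agecheck;
   length cage $10;
   input cage $;
   /* scan() treats the letters y and m as delimiters, so no fixed positions are assumed */
   agey = input(scan(cage, 1, 'ym'), 8.);   /* first piece: years   */
   agem = input(scan(cage, 2, 'ym'), 8.);   /* second piece: months */
   age  = agey + agem/12;                   /* decimal age          */
   datalines;
26y3m
8y2m
58y10m
;;;;
run;

/* 26y3m should come out as 26.25 */
proc print data=agecheck;
run;
```

Like the solution above, this does not depend on the month starting in a particular column.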

3. Trends in TV watching - 3 Marks:

We are now interested in comparing the average TV watched between sexes and among age classes (see previous question), and examining if the trends over age classes are the same for both sexes. Here is some output from such an analysis:

    Source         DF   Type III SS   Mean Square   F Value   Pr > F
    sex                                                       <.0001
    ageclass                                                  <.0001
    sex*ageclass

(a) Write a (very) short paragraph on your conclusions from the above analysis.

WRITE YOUR PARAGRAPH HERE

Solution: We performed an analysis of variance (ANOVA) to investigate if the changes in the mean number of minutes of TV viewing across the age classes were similar for the two sexes. There was no evidence that the change in the mean TV watched across the age classes varied between the sexes (p = 0.78), i.e. there was no evidence that the trends across age classes were not parallel for the two sexes. There was strong evidence that there were differences in the mean amount of TV watched between the sexes and among the age classes (both p < .0001).

Comments on student answers:

- We never say that there was evidence of parallelism, but rather we say that there was no evidence of non-parallelism. The reason for this is that with a large enough sample size, we can always find evidence that the trends are non-parallel, but the non-parallelism may be minuscule.

(b) Give the SAS code that would give the above results. Just the procedure code is needed - no data step is needed. You may assume that the dataset is called atus and contains variables sex, ageclass, and tvwatched for the number of minutes of TV watched by the respondent. Assume that the data were collected from an SRS, so it is NOT necessary to weight the analysis.

Put your SAS code here:

Solution

    proc glm data=atus;
       class sex ageclass;
       model tvwatched = sex ageclass sex*ageclass;
    run;

Comments on student answers:

- Many students used Proc Genmod. This procedure is usually only used for logistic and similar models and not for standard ANOVAs.
- You need terms for the main effects and the interactions to produce the above table.
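The marginal means with confidence limits for this model can be requested from the same procedure. The sketch below is an illustration only and was not part of the test solution. It assumes that the ODS table holding the LSMEANS confidence limits in Proc GLM is named LSMeanCL; the ods trace statement writes the actual table names to the log so that this assumption can be checked. The dataset name lsm_raw is hypothetical.

```sas
ods trace on;    /* lists the ODS table names in the log */

proc glm data=atus;
   class sex ageclass;
   model tvwatched = sex ageclass sex*ageclass;
   lsmeans sex*ageclass / cl;        /* marginal means with confidence limits */
   ods output LSMeanCL=lsm_raw;      /* assumed table name; verify in the log */
run;
quit;

ods trace off;

/* inspect the variable names before renaming them for plotting */
proc contents data=lsm_raw;
run;
```

Checking the variable names with Proc Contents first avoids guessing what the estimate and limit columns are called.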

4. Profile Plot - 4 Marks:

The output from the procedure to analyze the ATUS included estimates of the marginal means (the LSmeans) along with the upper and lower confidence limits on each marginal mean. Create a suitable profile plot comparing the changes in mean TV watched across the age classes for the two sexes. Be sure to label the axes properly.

You can assume that the analysis procedure created a data set (named mylsmeans) with the following variables:

- sex
- ageclass
- estimate, the estimate of the marginal mean TV watched (minutes)
- lcl, the lower confidence bound on the mean
- ucl, the upper confidence bound on the mean

Put your SAS code here:

One possible solution

    proc sgplot data=mylsmeans;
       title2 'profile plot of mean tv watched';
       scatter x=ageclass y=estimate / group=sex;
       series  x=ageclass y=estimate / group=sex;
       highlow x=ageclass low=lcl high=ucl / group=sex;
       xaxis label='Age class';
       yaxis label='Mean TV watched (minutes) with 95% confidence interval';

Comments on student solutions:

- Several students used a Proc Means to try and find some averages. I'm guessing that they just copied a solution that looked similar on past exams. Here the dataset is ready to be plotted and no further processing is needed before using Proc SGplot.

5. More analyses of the ATUS study - 7 Marks

There are two files for the ATUS study. The first dataset (named tvwatch) records TV watching habits and has the following information:

- ID - the ID Number of the family
- MinTV - Number of minutes of television watched for the selected person from the household.

The second dataset (named demoinfo) contains demographic and other information about the respondent's household (including the respondent) with the following information:

- ID - the ID Number of the family
- name of household member
- sex - the sex of the household member coded as f or m.
- empstatus - the employment status (employed or unemployed, coded as em or un) of the household member at the time of interview

So for each subject in the tvwatch dataset, there can be 1 or more observations in the demoinfo dataset.

Write SAS code to accomplish the following tasks:

- Process the demoinfo data to count the number of household members, the number of males, and the number of employed members. Hint: remember how you counted the number of females in the vehicles dataset from the Accidents analysis.
- Combine the TV time dataset and the data set from the previous step.
- Remove any records where there are more than 4 people in the household.
- Compute the mean number of minutes watched for each combination of number of males and the number of employed members and save the results to a data set. [You can make up an ODS table name if needed].

Put your SAS code here and overleaf (if needed).

    /* create variables for male/female and employment status */
    proc sort data=demoinfo;
       by id;
    data demoinfo;
       set demoinfo;
       ismale = 0;
       if sex = 'm' then ismale = 1;       /* code 1 or 0 for number of males */
       isemp = 0;
       if empstatus = 'em' then isemp = 1; /* code 1 or 0 for number employed */

    /* count the members, the males, and the employed members in each household */
    proc means data=demoinfo noprint;
       by id;
       var ismale isemp;
       output out=sumdemo n=nmembers sum=nmale nemp;

    /* combine the two datasets */
    proc sort data=tvwatch;
       by id;
    data both;
       merge tvwatch sumdemo;   /* merge with the household summary, not the raw demoinfo */
       by id;
       if nmembers > 4 then delete;  /* remove households with more than 4 members */

    /* get the mean tv watched */
    proc sort data=both;
       by nmale nemp;
    proc means data=both;
       by nmale nemp;
       var mintv;
       output out=meantv mean=mean_tv;

    /* or you could use proc glm and a lsmeans */
    proc glm data=both;
       class nmale nemp;
       model mintv = nmale nemp nmale*nemp;
       lsmeans nmale*nemp;
       ods output lsmeans=mylsmeans;

Comments about student solutions:

- Many students had difficulty with part 1 of the question. This was the hardest part of the question. You could also try variants of a Proc Tabulate, but that is likely to be more difficult to do.
- Most students had no problems with the merge and deletion steps.
- You could also use Proc Tabulate for the final step, but this is actually more difficult to implement in practise than the given solutions.
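The household counts in the first task can also be obtained in a single step with Proc SQL. This is a hedged alternative sketch rather than the approach taken in the solution; the output dataset name sumdemo_sql is made up, and the coding of sex and empstatus (m, em) is taken from the question. In SAS a comparison such as sex = 'm' evaluates to 1 or 0, so summing it counts the matching rows.

```sas
proc sql;
   create table sumdemo_sql as
   select id,
          count(*)              as nmembers,  /* household size       */
          sum(sex = 'm')        as nmale,     /* number of males      */
          sum(empstatus = 'em') as nemp       /* number employed      */
   from demoinfo
   group by id;
quit;
```

The result has one row per household and could be merged with tvwatch by id in the same way as sumdemo.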

Statistics about the term test:

There is some evidence that grades on the assignments are related to the grades on the term tests, as seen in the pairwise plots below.



8. MINITAB COMMANDS WEEK-BY-WEEK 8. MINITAB COMMANDS WEEK-BY-WEEK In this section of the Study Guide, we give brief information about the Minitab commands that are needed to apply the statistical methods in each week s study. They are

More information

PSY 9556B (Feb 5) Latent Growth Modeling

PSY 9556B (Feb 5) Latent Growth Modeling PSY 9556B (Feb 5) Latent Growth Modeling Fixed and random word confusion Simplest LGM knowing how to calculate dfs How many time points needed? Power, sample size Nonlinear growth quadratic Nonlinear growth

More information

Example1D.1.sas. * Procedures : ; * 1. print to show the dataset. ;

Example1D.1.sas. * Procedures : ; * 1. print to show the dataset. ; Example1D.1.sas * SAS example program 1D.1 ; * 1. Create a dataset called prob from the following data: ; * age prob lb ub ; * 24.25.20.31 ; * 36.26.21.32 ; * 48.28.24.33 ; * 60.31.28.36 ; * 72.35.32.39

More information

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SAS code SAS (originally Statistical Analysis Software) is a commercial statistical software package based on a powerful programming

More information

The SAS interface is shown in the following screen shot:

The SAS interface is shown in the following screen shot: The SAS interface is shown in the following screen shot: There are several items of importance shown in the screen shot First there are the usual main menu items, such as File, Edit, etc I seldom use anything

More information

Statistical Good Practice Guidelines. 1. Introduction. Contents. SSC home Using Excel for Statistics - Tips and Warnings

Statistical Good Practice Guidelines. 1. Introduction. Contents. SSC home Using Excel for Statistics - Tips and Warnings Statistical Good Practice Guidelines SSC home Using Excel for Statistics - Tips and Warnings On-line version 2 - March 2001 This is one in a series of guides for research and support staff involved in

More information

WELCOME! Lecture 3 Thommy Perlinger

WELCOME! Lecture 3 Thommy Perlinger Quantitative Methods II WELCOME! Lecture 3 Thommy Perlinger Program Lecture 3 Cleaning and transforming data Graphical examination of the data Missing Values Graphical examination of the data It is important

More information

Stat 302 Statistical Software and Its Applications SAS: Data I/O & Descriptive Statistics

Stat 302 Statistical Software and Its Applications SAS: Data I/O & Descriptive Statistics Stat 302 Statistical Software and Its Applications SAS: Data I/O & Descriptive Statistics Fritz Scholz Department of Statistics, University of Washington Winter Quarter 2015 February 19, 2015 2 Getting

More information

Introduction to SAS. Cristina Murray-Krezan Research Assistant Professor of Internal Medicine Biostatistician, CTSC

Introduction to SAS. Cristina Murray-Krezan Research Assistant Professor of Internal Medicine Biostatistician, CTSC Introduction to SAS Cristina Murray-Krezan Research Assistant Professor of Internal Medicine Biostatistician, CTSC cmurray-krezan@salud.unm.edu 20 August 2018 What is SAS? Statistical Analysis System,

More information

Chemical Reaction dataset ( https://stat.wvu.edu/~cjelsema/data/chemicalreaction.txt )

Chemical Reaction dataset ( https://stat.wvu.edu/~cjelsema/data/chemicalreaction.txt ) JMP Output from Chapter 9 Factorial Analysis through JMP Chemical Reaction dataset ( https://stat.wvu.edu/~cjelsema/data/chemicalreaction.txt ) Fitting the Model and checking conditions Analyze > Fit Model

More information

SAS Online Training: Course contents: Agenda:

SAS Online Training: Course contents: Agenda: SAS Online Training: Course contents: Agenda: (1) Base SAS (6) Clinical SAS Online Training with Real time Projects (2) Advance SAS (7) Financial SAS Training Real time Projects (3) SQL (8) CV preparation

More information

Lastly, in case you don t already know this, and don t have Excel on your computers, you can get it for free through IT s website under software.

Lastly, in case you don t already know this, and don t have Excel on your computers, you can get it for free through IT s website under software. Welcome to Basic Excel, presented by STEM Gateway as part of the Essential Academic Skills Enhancement, or EASE, workshop series. Before we begin, I want to make sure we are clear that this is by no means

More information

Factorial ANOVA with SAS

Factorial ANOVA with SAS Factorial ANOVA with SAS /* potato305.sas */ options linesize=79 noovp formdlim='_' ; title 'Rotten potatoes'; title2 ''; proc format; value tfmt 1 = 'Cool' 2 = 'Warm'; data spud; infile 'potato2.data'

More information

Robust Linear Regression (Passing- Bablok Median-Slope)

Robust Linear Regression (Passing- Bablok Median-Slope) Chapter 314 Robust Linear Regression (Passing- Bablok Median-Slope) Introduction This procedure performs robust linear regression estimation using the Passing-Bablok (1988) median-slope algorithm. Their

More information

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SAS SAS (originally Statistical Analysis Software ) is a commercial statistical software package based on a powerful programming

More information

2. Don t forget semicolons and RUN statements The two most common programming errors.

2. Don t forget semicolons and RUN statements The two most common programming errors. Randy s SAS hints March 7, 2013 1. Always begin your programs with internal documentation. * ***************** * Program =test1, Randy Ellis, March 8, 2013 ***************; 2. Don t forget semicolons and

More information

Base and Advance SAS

Base and Advance SAS Base and Advance SAS BASE SAS INTRODUCTION An Overview of the SAS System SAS Tasks Output produced by the SAS System SAS Tools (SAS Program - Data step and Proc step) A sample SAS program Exploring SAS

More information

STAT 7000: Experimental Statistics I

STAT 7000: Experimental Statistics I STAT 7000: Experimental Statistics I 2. A Short SAS Tutorial Peng Zeng Department of Mathematics and Statistics Auburn University Fall 2009 Peng Zeng (Auburn University) STAT 7000 Lecture Notes Fall 2009

More information

PR3 & PR4 CBR Activities Using EasyData for CBL/CBR Apps

PR3 & PR4 CBR Activities Using EasyData for CBL/CBR Apps Summer 2006 I2T2 Process Page 23. PR3 & PR4 CBR Activities Using EasyData for CBL/CBR Apps The TI Exploration Series for CBR or CBL/CBR books, are all written for the old CBL/CBR Application. Now we can

More information

EXST3201 Mousefeed01 Page 1

EXST3201 Mousefeed01 Page 1 EXST3201 Mousefeed01 Page 1 3 /* 4 Examine differences among the following 6 treatments 5 N/N85 fed normally before weaning and 85 kcal/wk after 6 N/R40 fed normally before weaning and 40 kcal/wk after

More information

ANSWERS -- Prep for Psyc350 Laboratory Final Statistics Part Prep a

ANSWERS -- Prep for Psyc350 Laboratory Final Statistics Part Prep a ANSWERS -- Prep for Psyc350 Laboratory Final Statistics Part Prep a Put the following data into an spss data set: Be sure to include variable and value labels and missing value specifications for all variables

More information

Quantitative - One Population

Quantitative - One Population Quantitative - One Population The Quantitative One Population VISA procedures allow the user to perform descriptive and inferential procedures for problems involving one population with quantitative (interval)

More information

PLS205 Lab 1 January 9, Laboratory Topics 1 & 2

PLS205 Lab 1 January 9, Laboratory Topics 1 & 2 PLS205 Lab 1 January 9, 2014 Laboratory Topics 1 & 2 Welcome, introduction, logistics, and organizational matters Introduction to SAS Writing and running programs saving results checking for errors Different

More information

Paper S Data Presentation 101: An Analyst s Perspective

Paper S Data Presentation 101: An Analyst s Perspective Paper S1-12-2013 Data Presentation 101: An Analyst s Perspective Deanna Chyn, University of Michigan, Ann Arbor, MI Anca Tilea, University of Michigan, Ann Arbor, MI ABSTRACT You are done with the tedious

More information

Week 4: Simple Linear Regression III

Week 4: Simple Linear Regression III Week 4: Simple Linear Regression III Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Goodness of

More information

Getting Started with the SGPLOT Procedure

Getting Started with the SGPLOT Procedure ABSTRACT Getting Started with the SGPLOT Procedure Joshua M. Horstman, Nested Loop Consulting Do you want to create highly-customizable, publication-ready graphics in just minutes using SAS? This workshop

More information

Frequency Tables. Chapter 500. Introduction. Frequency Tables. Types of Categorical Variables. Data Structure. Missing Values

Frequency Tables. Chapter 500. Introduction. Frequency Tables. Types of Categorical Variables. Data Structure. Missing Values Chapter 500 Introduction This procedure produces tables of frequency counts and percentages for categorical and continuous variables. This procedure serves as a summary reporting tool and is often used

More information

MHPE 494: Data Analysis. Welcome! The Analytic Process

MHPE 494: Data Analysis. Welcome! The Analytic Process MHPE 494: Data Analysis Alan Schwartz, PhD Department of Medical Education Memoona Hasnain,, MD, PhD, MHPE Department of Family Medicine College of Medicine University of Illinois at Chicago Welcome! Your

More information

Preparing for Data Analysis

Preparing for Data Analysis Preparing for Data Analysis Prof. Andrew Stokes March 21, 2017 Managing your data Entering the data into a database Reading the data into a statistical computing package Checking the data for errors and

More information

From Manual to Automatic with Overdrive - Using SAS to Automate Report Generation Faron Kincheloe, Baylor University, Waco, TX

From Manual to Automatic with Overdrive - Using SAS to Automate Report Generation Faron Kincheloe, Baylor University, Waco, TX Paper 152-27 From Manual to Automatic with Overdrive - Using SAS to Automate Report Generation Faron Kincheloe, Baylor University, Waco, TX ABSTRACT This paper is a case study of how SAS products were

More information

Correctly Compute Complex Samples Statistics

Correctly Compute Complex Samples Statistics SPSS Complex Samples 15.0 Specifications Correctly Compute Complex Samples Statistics When you conduct sample surveys, use a statistics package dedicated to producing correct estimates for complex sample

More information

Factorial ANOVA. Skipping... Page 1 of 18

Factorial ANOVA. Skipping... Page 1 of 18 Factorial ANOVA The potato data: Batches of potatoes randomly assigned to to be stored at either cool or warm temperature, infected with one of three bacterial types. Then wait a set period. The dependent

More information

STAT:5400 Computing in Statistics

STAT:5400 Computing in Statistics STAT:5400 Computing in Statistics Introduction to SAS Lecture 18 Oct 12, 2015 Kate Cowles 374 SH, 335-0727 kate-cowles@uiowaedu SAS SAS is the statistical software package most commonly used in business,

More information

CH5: CORR & SIMPLE LINEAR REFRESSION =======================================

CH5: CORR & SIMPLE LINEAR REFRESSION ======================================= STAT 430 SAS Examples SAS5 ===================== ssh xyz@glue.umd.edu, tap sas913 (old sas82), sas https://www.statlab.umd.edu/sasdoc/sashtml/onldoc.htm CH5: CORR & SIMPLE LINEAR REFRESSION =======================================

More information

IQR = number. summary: largest. = 2. Upper half: Q3 =

IQR = number. summary: largest. = 2. Upper half: Q3 = Step by step box plot Height in centimeters of players on the 003 Women s Worldd Cup soccer team. 157 1611 163 163 164 165 165 165 168 168 168 170 170 170 171 173 173 175 180 180 Determine the 5 number

More information

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SPSS SPSS (originally Statistical Package for the Social Sciences ) is a commercial statistical software package with an easy-to-use

More information

Research Methods for Business and Management. Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel

Research Methods for Business and Management. Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel Research Methods for Business and Management Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel A Simple Example- Gym Purpose of Questionnaire- to determine the participants involvement

More information

WHO STEPS Surveillance Support Materials. STEPS Epi Info Training Guide

WHO STEPS Surveillance Support Materials. STEPS Epi Info Training Guide STEPS Epi Info Training Guide Department of Chronic Diseases and Health Promotion World Health Organization 20 Avenue Appia, 1211 Geneva 27, Switzerland For further information: www.who.int/chp/steps WHO

More information

Poisson Regressions for Complex Surveys

Poisson Regressions for Complex Surveys Poisson Regressions for Complex Surveys Overview Researchers often use sample survey methodology to obtain information about a large population by selecting and measuring a sample from that population.

More information

DSCI 325: Handout 2 Getting Data into SAS Spring 2017

DSCI 325: Handout 2 Getting Data into SAS Spring 2017 DSCI 325: Handout 2 Getting Data into SAS Spring 2017 Data sets come in many different formats. In some situations, data sets are stored on paper (e.g., surveys) and other times data are stored in huge

More information

STA 570 Spring Lecture 5 Tuesday, Feb 1

STA 570 Spring Lecture 5 Tuesday, Feb 1 STA 570 Spring 2011 Lecture 5 Tuesday, Feb 1 Descriptive Statistics Summarizing Univariate Data o Standard Deviation, Empirical Rule, IQR o Boxplots Summarizing Bivariate Data o Contingency Tables o Row

More information

An Introduction to SAS University Edition

An Introduction to SAS University Edition An Introduction to SAS University Edition Ron Cody From An Introduction to SAS University Edition. Full book available for purchase here. Contents List of Programs... xi About This Book... xvii About the

More information

Chapter 6: DESCRIPTIVE STATISTICS

Chapter 6: DESCRIPTIVE STATISTICS Chapter 6: DESCRIPTIVE STATISTICS Random Sampling Numerical Summaries Stem-n-Leaf plots Histograms, and Box plots Time Sequence Plots Normal Probability Plots Sections 6-1 to 6-5, and 6-7 Random Sampling

More information

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski Data Analysis and Solver Plugins for KSpread USER S MANUAL Tomasz Maliszewski tmaliszewski@wp.pl Table of Content CHAPTER 1: INTRODUCTION... 3 1.1. ABOUT DATA ANALYSIS PLUGIN... 3 1.3. ABOUT SOLVER PLUGIN...

More information

Statistical Tests for Variable Discrimination

Statistical Tests for Variable Discrimination Statistical Tests for Variable Discrimination University of Trento - FBK 26 February, 2015 (UNITN-FBK) Statistical Tests for Variable Discrimination 26 February, 2015 1 / 31 General statistics Descriptional:

More information

Macros and ODS. SAS Programming November 6, / 89

Macros and ODS. SAS Programming November 6, / 89 Macros and ODS The first part of these slides overlaps with last week a fair bit, but it doesn t hurt to review as this code might be a little harder to follow. SAS Programming November 6, 2014 1 / 89

More information

MINITAB 17 BASICS REFERENCE GUIDE

MINITAB 17 BASICS REFERENCE GUIDE MINITAB 17 BASICS REFERENCE GUIDE Dr. Nancy Pfenning September 2013 After starting MINITAB, you'll see a Session window above and a worksheet below. The Session window displays non-graphical output such

More information

WINKS SDA Windows KwikStat Statistical Data Analysis and Graphs Getting Started Guide

WINKS SDA Windows KwikStat Statistical Data Analysis and Graphs Getting Started Guide WINKS SDA Windows KwikStat Statistical Data Analysis and Graphs Getting Started Guide 2011 Version 6A Do these tutorials first This series of tutorials provides a quick start to using WINKS. Feel free

More information

Computational Mathematics/Information Technology. Worksheet 2 Iteration and Excel

Computational Mathematics/Information Technology. Worksheet 2 Iteration and Excel Computational Mathematics/Information Technology Worksheet 2 Iteration and Excel This sheet uses Excel and the method of iteration to solve the problem f(x) = 0. It introduces user functions and self referencing

More information

3. Almost always use system options options compress =yes nocenter; /* mostly use */ options ps=9999 ls=200;

3. Almost always use system options options compress =yes nocenter; /* mostly use */ options ps=9999 ls=200; Randy s SAS hints, updated Feb 6, 2014 1. Always begin your programs with internal documentation. * ***************** * Program =test1, Randy Ellis, first version: March 8, 2013 ***************; 2. Don

More information

Introduction. About this Document. What is SPSS. ohow to get SPSS. oopening Data

Introduction. About this Document. What is SPSS. ohow to get SPSS. oopening Data Introduction About this Document This manual was written by members of the Statistical Consulting Program as an introduction to SPSS 12.0. It is designed to assist new users in familiarizing themselves

More information

STATS PAD USER MANUAL

STATS PAD USER MANUAL STATS PAD USER MANUAL For Version 2.0 Manual Version 2.0 1 Table of Contents Basic Navigation! 3 Settings! 7 Entering Data! 7 Sharing Data! 8 Managing Files! 10 Running Tests! 11 Interpreting Output! 11

More information

2) familiarize you with a variety of comparative statistics biologists use to evaluate results of experiments;

2) familiarize you with a variety of comparative statistics biologists use to evaluate results of experiments; A. Goals of Exercise Biology 164 Laboratory Using Comparative Statistics in Biology "Statistics" is a mathematical tool for analyzing and making generalizations about a population from a number of individual

More information

Baruch College STA Senem Acet Coskun

Baruch College STA Senem Acet Coskun Baruch College STA 9750 BOOK BUY A Predictive Mode Senem Acet Coskun Table of Contents Summary 3 Why this topic? 4 Data Sources 6 Variable Definitions 7 Descriptive Statistics 8 Univariate Analysis 9 Two-Sample

More information

A Step by Step Guide to Learning SAS

A Step by Step Guide to Learning SAS A Step by Step Guide to Learning SAS 1 Objective Familiarize yourselves with the SAS programming environment and language. Learn how to create and manipulate data sets in SAS and how to use existing data

More information

- 1 - Fig. A5.1 Missing value analysis dialog box

- 1 - Fig. A5.1 Missing value analysis dialog box WEB APPENDIX Sarstedt, M. & Mooi, E. (2019). A concise guide to market research. The process, data, and methods using SPSS (3 rd ed.). Heidelberg: Springer. Missing Value Analysis and Multiple Imputation

More information

Lab 3 (80 pts.) - Assessing the Normality of Data Objectives: Creating and Interpreting Normal Quantile Plots

Lab 3 (80 pts.) - Assessing the Normality of Data Objectives: Creating and Interpreting Normal Quantile Plots STAT 350 (Spring 2015) Lab 3: SAS Solutions 1 Lab 3 (80 pts.) - Assessing the Normality of Data Objectives: Creating and Interpreting Normal Quantile Plots Note: The data sets are not included in the solutions;

More information

Experiment 1 CH Fall 2004 INTRODUCTION TO SPREADSHEETS

Experiment 1 CH Fall 2004 INTRODUCTION TO SPREADSHEETS Experiment 1 CH 222 - Fall 2004 INTRODUCTION TO SPREADSHEETS Introduction Spreadsheets are valuable tools utilized in a variety of fields. They can be used for tasks as simple as adding or subtracting

More information

Lab 1: Introduction to Data

Lab 1: Introduction to Data 1 Lab 1: Introduction to Data Some define Statistics as the field that focuses on turning information into knowledge. The first step in that process is to summarize and describe the raw information the

More information

THE L.L. THURSTONE PSYCHOMETRIC LABORATORY UNIVERSITY OF NORTH CAROLINA. Forrest W. Young & Carla M. Bann

THE L.L. THURSTONE PSYCHOMETRIC LABORATORY UNIVERSITY OF NORTH CAROLINA. Forrest W. Young & Carla M. Bann Forrest W. Young & Carla M. Bann THE L.L. THURSTONE PSYCHOMETRIC LABORATORY UNIVERSITY OF NORTH CAROLINA CB 3270 DAVIE HALL, CHAPEL HILL N.C., USA 27599-3270 VISUAL STATISTICS PROJECT WWW.VISUALSTATS.ORG

More information

Multiple Linear Regression Excel 2010Tutorial For use when at least one independent variable is qualitative

Multiple Linear Regression Excel 2010Tutorial For use when at least one independent variable is qualitative Multiple Linear Regression Excel 2010Tutorial For use when at least one independent variable is qualitative This tutorial combines information on how to obtain regression output for Multiple Linear Regression

More information

Contents of SAS Programming Techniques

Contents of SAS Programming Techniques Contents of SAS Programming Techniques Chapter 1 About SAS 1.1 Introduction 1.1.1 SAS modules 1.1.2 SAS module classification 1.1.3 SAS features 1.1.4 Three levels of SAS techniques 1.1.5 Chapter goal

More information

Want to Do a Better Job? - Select Appropriate Statistical Analysis in Healthcare Research

Want to Do a Better Job? - Select Appropriate Statistical Analysis in Healthcare Research Want to Do a Better Job? - Select Appropriate Statistical Analysis in Healthcare Research Liping Huang, Center for Home Care Policy and Research, Visiting Nurse Service of New York, NY, NY ABSTRACT The

More information