Stat-340 Term Test Spring Term
|
|
- Daniel Jennings
- 5 years ago
- Views:
Transcription
1 Stat-340 Term Test Spring Term Part 1 - Multiple Choice Enter your answers to the multiple choice questions on the provided bubble sheets. Each of the multiple choice question is worth 1 mark there is no correction for guessing. Be sure your student name and number are completed on the bubble sheets. 1. How many observations and variables are contained in the following dataset? data blah; infile datalines; length name $10 sex $1 partnername $10 partnersex $1; input name $ sex $ / partnername $ partnersex $; datalines; Carl M Lois F Matthew M Fred M Selina F David M Tim M Kim. ;;;; (a) 8 observations; 2 variables. (b) 4 observations, 4 variables. (c) 8 observation, 4 variables. (d) 3 observations, 4 variables. (e) 4 observations, 2 variables. Solution: (b) Option A - 14% chose Notice the slash in the input which makes SAS go to a new line for the last 2 variables. Option B - 60% chose Option C - 24% chose See (a) 1
2 2. Which of the following is TRUE about By group processing? (a) A different analysis can be performed for each BY group. (b) The BY variable must be a numeric or date variable. (c) The data does not have to be grouped together by values of the BY variables (d) The BY groups can have different numbers of variables. (e) BY group processing can be done for any procedure. Solution: (e) Option D - 11% chose All of the by groups are subsets of the data and so have the same number of variables. Option E - 88% chose By variables can be any type. 3. Which of the following is correct about a standard error of a statistic. (a) The se measures how much the sample size changes in a simulation study. (b) The se measures the standard deviation of the population slope over bootstraps samples from the data. (c) The se measures the standard deviation of the Gini-estimate of the standard deviation between different populations. (d) The se measures the increase in the number of calories for each additional gram of fat. (e) The se measures now much a statistic will vary when new samples are taken from a population. Solution: (e) Option A - SE never measure variation of sample size. Option B - Population parameters (slopes) are fixed and do not vary. Option C - This doesn t even make sense - there is only one population. Option D - This is the definition of a slope and not a standard error. Option E - 95% chose Consider the following segment of code: data birthdays; infile datalines; length name $30; input name $ bdate:yymmdd10.; format bdate mmddyy8.; datalines; carl 63/02/01 lois 48/14/02 fred 58/06/03 tim 52/07/04 dave 63/12/31 ;;;; proc print data=birthdays; c 2015 Carl James Schwarz 2
3 Which of the following is correct? (a) The birth day for Carl will be displayed as 02/01/63. (b) The birth day for Lois will be displayed as 14/02/48. (c) The birth day for Fred will be displayed as ; (d) The birth day for Tim will be displayed as 04/07/1952. (e) The birth day for Dave will be display as a missing value. Solution: (a) Option A - 73% chose Option D - 14% chose The mmmddyy out-format only has length 8 so show 2 digit years. 5. Consider the following code data blah; infile datalines; length name sex $10.; input name sex age weight; if age > 30 then delete; drop weight; datalines; A f B m C F D M E F. 43 ;;;; Which of the following is correct? (a) The blah dataset has 5 observations and 4 variables. (b) The blah dataset has 4 observations and 4 variables/ (c) The blah dataset has 3 observations and 3 variables; (d) The blah dataset has 4 observation and 3 variables; (e) The blah dataset has 5 observations and 3 variables. Solution: (d) Option B - 15% chose The Drop statement removes a variable. Option D - 65% chose Option E - 11% chose The If statement removes an observation. 6. Which of the following is correct? (a) PROC GLM is used to test hypotheses about population mean proportions. (b) PROC FREQ is used to test hypotheses about sample proportions. c 2015 Carl James Schwarz 3
4 (c) PROC REG is used to test hypotheses about population slopes. (d) PROC GENMOD is used to test hypotheses about sample proportions. (e) PROC TTEST is used to test hypotheses about paired sample means. Solution: (c) Option A - 14% chose There is no such thing as a MEAN proportion! Option C - 61% chose Option D - 10% chose Hypotheses are ALWAYS about POPULATION parameters, not sample statistics. Option E - 15% chose Hypotheses are ALWAYS about POPULATION parameters, not sample statistics. 7. Consider the following SAS code: data blah; infile datalines dlm=, input v1 v2 v3 v4 v5 v6; datalines; 1,,2,3,4,5,6,7,8,9 2,3,.,5,6,7,8,9,0 3,4,5,.,6,7,8,9,0,1,2 9,8,7,,6,5,4,3,2,1,0 7,,6,,5,,9,,4,,3,,1,, ;;;; dsd missover; Which of the following is correct? (a) The value of v2 in the first observation is 2. (b) The value of v3 in the second observation is 5. (c) The value of v4 in the third observation is missing. (d) The value of v6 in the fourth observation is 4. (e) The value of v3 in the fifth observation is 5. Solution: (c) Option C - 95% chose in Consider the following SAS code: data blah; infile datalines; length surname $10 sex $1; input surname sex age; datalines; schwarz m 56 c 2015 Carl James Schwarz 4
5 schwarz f 53 zhao f 48 zhao m 52 sun m 27 chao f 23 chao m 27 ;;;; proc sort data=blah; by surname; proc transpose data=blah out=transblah; by surname; var age; id sex; Which of the following is correct? (a) The resulting transblah dataset has 3 observations. (b) The value of the variable M for the first observation in the transblah dataset is 56. (c) The observation for surname Sun will have the value of 27 for the the ages of both sexes. (d) The value of the variable F for last observation in the transblah dataset is 23. (e) The 4th observation in the transblah dataset will have 52 as the value for the M variable. Solution: (e) Option A - 16% chose. There are 4 distinct values for the Surname variable so the resulting dataset will have 4 observations. Option B - 28% chose. Don t forget to sort before transposing. Option C - 11% chose. Because Sun does not have a complete set of a variables, the missing variables will be set to missing. Option E - 39% chose. 9. Consider the following SAS code: proc tabulate data=accidents missing; class month Accident_Severity; var fatality; table Accident_Severity ALL, month*fatality*mean*f=7.2; Which of the following is correct? (a) The Accident_Severity variable will be along the top of the table (the columns). (b) The mean number of fatalities in each month and Accident_Severity will be found. (c) Each row of the table will correspond to a different value of the Accident_Severity variable, with the final row a summary over all codes. c 2015 Carl James Schwarz 5
6 (d) The missing option on the Proc statement ensures that missing values are ignored during the tabulation. (e) If the Accident_Severity variable had 3 levels, and if the month variable had 12 levels, the table would have 36 cells. Solution: (c) Option B - 40% chose Month is not used in the Table statement. Option C - 35% chose Option D - 12% chose The missing option also tabulates the missing values. Option E - 12% chose The ALL option will generate a row at the end for all codes. 10. Consider the following piece of SAS code: data blah; infile datalines; length name $10 sex $1; input name sex YearOfBirth; Age = YearOfBirth; datalines; Carl M 1956 Lois Fred.. Matthew M 1926 Marianne F -1 David M 1922 Julia F 2016 ;;;; Which of the following is correct: (a) The computed value of Age for Carl is 59. (b) The computed value of Age for Fred is 0. (c) The computed value of Age for Marianne is missing. (d) The computed value of Age for David is (e) The computed value of Age for Julia is missing. Solution: (a) Option A - 96% chose c 2015 Carl James Schwarz 6
7 Part II - Long Answer Stat Spring Term - Term Test 1 Name Student Number: Put your name and student number on the upper right of each of the following pages as well in case the pages get separated. Answer the following questions in the space provided. Be sure that your answers are legible. The marks given to these questions are 5, 6, 3, 4, and 7 respectively. c 2015 Carl James Schwarz 7
8 1. Interpretation - 5 Marks: Consider the following output from an analysis of the cereal dataset: Write a SHORT paragraph here summarizing the results. Solution: The relationship between the calories/serving and the grams of fat/serving was investigated using linear regression (Figure 1). The fitted equation is Calories = (F at) There was strong evidence that the slope is different from 0 (p <.0001). For every gram of fat, the calories/serving is expected to increase by 9.8 (SE 2.2) calories/gram of fat. Common problems in solutions from students include: Reporting too many decimal places. Seldom do you need to report more than two significant digits. The intercept is usually not of interest and so you don t usually spend anytime discussing it. The whole point of regression is to estimate the slope. So the discussion needs to be about the slope. Many students discussed differences in means (which is not sensible), or differences in the mean among groups which is again not sensible. These students were likely confusing regression with ANOVA. Don t just give the table values as facts add some interpretation to the information in the table. For example, many student had sentences such as The parameter estimate for Fat was 9.8. The c 2015 Carl James Schwarz 8
9 standard error was The t-value was 4.44 and the p-value was <.0001 so we rejected the null hypothesis. These types of sentence provide no useful information to the reader over and above the table. c 2015 Carl James Schwarz 9
10 2. Reading and Recodes - 6 Marks: The csv file named atus.csv contains the following fields on television viewing from the American Time of Use Study. ID Number Name (up to 30 characters) Sex (single letter code) Age at time of interview. For example 26y3m indicates the subject was 26 years and 3 months old. Number of minutes of television watched. The first few lines of the data file are as follows: ID, name, sex, age, tvmin 123ABCDEF, Schwarz, m, 58y10m, 20 LJD1234LJ, Lank, m, 61y2m, LLJJ, Swartz, F, 21y10m, 75 LLKD2343K, Duncan, f, 87y2m, 150 OUEROE, Smith, f, 8y2m, 236 Write SAS code to do the following: Read in the data from the csv file as noted above. Convert the year/month age data to a decimal year, e.g. 26y3m is converted to years (3 months is 1/4 of a year). Recode the sex variable. Either f or F is recoded as female; either m or M is recoded as male; other values are recoded as illegal sex. Recode the decimal age to 3 age classes. Ages (including 16 but excluding 25) are recoded to 16-24; ages (including 25 but excluding 40) are recoded to 25-39; ages (including 40 but excluding 70) are recoded to Other ages are recoded to out of frame. Check your recodes for both sex and age using appropriate procedures. Put your SAS code here and the page overleaf (if needed) c 2015 Carl James Schwarz 10
11 One possible solution data atus; infile datalines dlm=, dsd missover firstobs=2; /* Need dsd, dlm and firstobs= length id $10 name $20 sex $1 cage $10; length cagey cagem $10; /* temporary character values */ length newsex $10 ageclass $20; /* recoded values need longer lengths */ input id $ name $ sex $ cage $ minutes; /* convert input age to decimal age */ wherey = index(cage, "y"); /* where is the y */ cagey = substr(cage, 1, wherey-1); /* extract the age in years */ agey = input(cagey, f30.0); /* convert to age in years to number */ wherem = index(cage, "m"); /* where is the m */ cagem = substr(cage, wherey+1, wherem-wherey-1); /* extract the months */ agem = input(cagem, f30.0); /* extract the months */ age = agey + agem/12; /* make decimal age */ /* recode the sex */ sex = upcase(sex); /* convert to upper case */ newsex = illegal ; if sex = F then newsex = female ; if sex = "M" then newsex = male ; /* recode the age classes */ ageclass = out of frame ; if 16 <= age < 25 then ageclass = ; if 25 <= age < 40 then ageclass = ; if 40 <= age < 70 then ageclass = ; datalines; 123ABCDEF, Schwarz, m, 58y10m, 20 LJD1234LJ, Lank, m, 61y2m, LLJJ, Swartz, F, 21y10m, 75 LLKD2343K, Duncan, f, 87y2m, 150 OUEROE, Smith, f, 8y2m, 236 ;;;; proc print data=atus; title2 Data after coding ; /* check the recodes */ /* You need to use Proc Tabulate/SGplot and compare the OLD values to the NEW values */ proc tabulate data=atus missing; c 2015 Carl James Schwarz 11
12 title2 check the recodes ; class sex newsex age ageclass; table sex, newsex *n*f=5.0; /* check sex coding */ table age, ageclass*n*f=5.0; /* possible but very long table */ /* because age is a continuous variable, it is better to use sgplot to check the recodes*/ proc sgplot data=atus; title2 check the recodes for age ; scatter x=ageclass y=age; Comments about student responses: I ve used the Datalines option, but you could replace it with the actual file named people.csv. You could use Proc Import as well to read in the data using proc import file= people.csv out=atus replace; Many students didn t use/forgot to correct for upper/lower case of the gender values. Rather than if gender = f then gender = F ; if gender = m then gender = M ; use the upcase() function directly as shown above. Be careful of code such as data blah; length sex $1; input sex; if sex = f then sex = female ; Because sex is defined with length 1, the new value of female gets truncated to 1 character. So you either have to define sex with a longer length, or define a new variable (as I did above) with a longer length to hold the new values. Be careful of code such as data blah;... if 16 <= age < 25 then age = ; Here you are using the age variable as both character and numeric. This won t work. You likely want a separate character variable for the age class as I did in my solution. Some students always thought that the month started in the 4th position. It may not. See the solution above for a completely general solution. Always try and code stuff in the most general fashion possible so that it works in all cases. c 2015 Carl James Schwarz 12
13 Using Proc Print to check your recodes is not sufficient, as you will only be able to check if the recoding worked for the first few records. You need to use Proc Tabulateand Proc SGplot as shown above and as was done in our assignments. Notice that proc tabulate data=blah; class newsex; table newsex; doesn t provide enough information to see that the values of oldest have been properly recoded to the newest variable. See the solution above. Some student tried code along the lines of data blah; infilel... input... age yyymmm; There is no informat in SASto hand this case and you need to use the methods as shown above. The only useful infomats needed are for dates, times, and datetime values. c 2015 Carl James Schwarz 13
14 3. Trends in TV watching - 3 Marks: We are now interested in comparing the average TV watched between sexes and among age classes (see previous question), and examining if the trends over age classes are the same for both sexes. Here is some output from such an analysis Source DF Type III SS Mean Square F Value Pr > F sex <.0001 ageclass <.0001 sex*ageclass (a) Write a (very) short paragraph on your conclusions from the above analysis. WRITE YOUR PARAGRAPH HERE Solution: We performed an analysis of variance (ANOVA) to investigate if the changes in the mean number of minutes of TV viewing across the age classes were similar for the two sexes. There was no evidence that the change in the mean TV watched across the age classes varied between the sexes (p = 0.78)., i.e. there was no evidence that the trends across age classes were not parallel for the two sexes. There was strong evidence that there were difference in the mean amount of TV watched between the sexes and among the age classes (both p <.0001). Comments on student answers: We never say that there was evidence of parallelism, bur rather we say that there was no evidence of non-parallelism. The reason for this is that with a large enough sample size, we can always find evidence that the trends are non-parallel, but the non-parallelism may be miniscule. (b) Give the SAS code that would give the above results. Just the procedure code is needed - no data step is needed. You may assume that the dataset is called atus and contains variables sex, ageclass, and tvwatched for the number of minutes of TV watched by the respondent. Assume that the data were collected from an SRS, so it is NOT necessary to weight the analysis. Put your SAS code here: Solution proc glm data=atus; class sex ageclass; model tvwatched= sex ageclass sex*ageclass; Comments on student answers: Many students used Proc Genmod. This procedure is usually only used for logistic and similar models and not for standard ANOVAs. You need terms for the main effects and the interactions to produce the above table. c 2015 Carl James Schwarz 14
15 4. Profile Plot - 4 Marks: The output from the procedure to analyze the ATUS included estimates of the marginal means (the LSmeans) along with the upper and lower confidence limits on each each marginal mean. Create a suitable profile plot comparing the changes in mean TV watched across the age classes for the two sexes. Be sure to label the axes properly. You can assume that the analysis procedure created a data set (named mylsmeans) with the following variables. sex ageclass estimate of the marginal mean TV watched (minutes) lcl, the lower confidence bound on the mean ucl, the upper confidence bound on the mean Put your SAS code here: One possible solution proc sgplot data=mylsmeans; title2 profile plot of mean tv watched ; scatter x=ageclass y=estimate / group=sex; series x=ageclass y=estimate / group=sex; highlow x=ageclass lower=lcl upper=ucl / group=sex; xaxis label= Age class ; yaxis label= Mean TV watched (minutes) with 95% confidence interval ; Comments on student solutions: Several students used a Proc Means to try and find some averages. I m guessing that they just copied a solution that looked similar on past exams. Here the dataset is ready to be plotted and no further processing is needed before using Proc SGplot. c 2015 Carl James Schwarz 15
16 5. More analyses of the ATUS study. - 7 Marks There are two files for the ATUS study. The first dataset (named tvwatch) records TV watching habits and has the following information ID - the ID Number of the family MinTV - Number of minutes of television watched for the selected person from the household. The second dataset (named demoinfo) contains demographic and other information about the respondent s household (including the respondent) with the following information: ID - the ID Number of the family name of household member sex - the sex of the household member coded as f or m. empstatus - the employment status (employed or unemployed, coded as em or un) of the household member at the time of interview So for each subject in the tvwatch dataset, there can be 1 or more observations in the demoinfo dataset. Write SAS code to accomplish the following tasks Processes the demoinfo data to count the number of household members, the number of males, and the number of employed members. Hint: remember how your counted the number of females in the vehicles dataset from the Accidents analysis. Combines the TV time dataset and the data set from the previous step. Removes any records where there are more than 4 people in the household. Computes the mean number of minutes watched for each combination of number of males and the number of employed members and saves the results to a data set. [You can make up an ODS table name if needed]. Put your SAS code here and overleaf (if needed). c 2015 Carl James Schwarz 16
17 /* create variables for male/female and employment status */ proc sort data=demoinfo; by id; data demoinfo; set demoinfo; ismale = 0; if sex = m then ismale=1; /* code 1 or 0 for number of males */ isemp = 0; if empstatus = em then isemp = 1; proc means data=demoinfo noprint; /* count number of males */ by id; var ismale isemp; output out=sumdemo n=nmembers sum=nmale nemp; /* combine the two datasets */ data both; merge tvwatch demoinfo; by id; if nmembers > 4 then delete; /* remove households with more than 4 members */ /* get the mean tv watched */ proc sort data=both; by nmale hemp; proc means data=both; by nmale nemp; var mintv; output out=meantv mean=mean_tv; /* or you could use proc glm and a lsmeans */ proc glm data=both; class nmale nemp; model mintv = nmale nemp nmale*nemp; lsmeans nmale*nemp; ods output lsmeans=mylsmeans; Comments about student solutions: Many students had difficulty with part 1 of the question. This was the hardest part of the question. You could also try variants of a Proc Tabulate but that is likely to be more difficult to do. Most students had no problems with the merges and deletion step. You could also use Proc Tabulate for the final step, but this is actually more difficult to implement in practise than the given solutions. c 2015 Carl James Schwarz 17
18 Statistics about the term test: c 2015 Carl James Schwarz 18
19 There is some evidence that grades on the assignments is related to the grades on the term tests as seen in the pairwise plots below. c 2015 Carl James Schwarz 19
20 c 2015 Carl James Schwarz 20
Introduction to Statistical Analyses in SAS
Introduction to Statistical Analyses in SAS Programming Workshop Presented by the Applied Statistics Lab Sarah Janse April 5, 2017 1 Introduction Today we will go over some basic statistical analyses in
More informationTHIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010
THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL STOR 455 Midterm September 8, INSTRUCTIONS: BOTH THE EXAM AND THE BUBBLE SHEET WILL BE COLLECTED. YOU MUST PRINT YOUR NAME AND SIGN THE HONOR PLEDGE
More informationLab #9: ANOVA and TUKEY tests
Lab #9: ANOVA and TUKEY tests Objectives: 1. Column manipulation in SAS 2. Analysis of variance 3. Tukey test 4. Least Significant Difference test 5. Analysis of variance with PROC GLM 6. Levene test for
More informationSAS Training Spring 2006
SAS Training Spring 2006 Coxe/Maner/Aiken Introduction to SAS: This is what SAS looks like when you first open it: There is a Log window on top; this will let you know what SAS is doing and if SAS encountered
More informationIntroductory Guide to SAS:
Introductory Guide to SAS: For UVM Statistics Students By Richard Single Contents 1 Introduction and Preliminaries 2 2 Reading in Data: The DATA Step 2 2.1 The DATA Statement............................................
More informationStat-340 Assignment Spring Term
Stat-340 Assignment 2 2015 Spring Term Part 1 - Breakfast cereals - Easy In this part of the assignment, you will learn how to: importing a *.csv file using Proc Import; use Proc Tabulate; create dot,
More information1. Basic Steps for Data Analysis Data Editor. 2.4.To create a new SPSS file
1 SPSS Guide 2009 Content 1. Basic Steps for Data Analysis. 3 2. Data Editor. 2.4.To create a new SPSS file 3 4 3. Data Analysis/ Frequencies. 5 4. Recoding the variable into classes.. 5 5. Data Analysis/
More informationData Management - 50%
Exam 1: SAS Big Data Preparation, Statistics, and Visual Exploration Data Management - 50% Navigate within the Data Management Studio Interface Register a new QKB Create and connect to a repository Define
More informationST Lab 1 - The basics of SAS
ST 512 - Lab 1 - The basics of SAS What is SAS? SAS is a programming language based in C. For the most part SAS works in procedures called proc s. For instance, to do a correlation analysis there is proc
More informationA. Using the data provided above, calculate the sampling variance and standard error for S for each week s data.
WILD 502 Lab 1 Estimating Survival when Animal Fates are Known Today s lab will give you hands-on experience with estimating survival rates using logistic regression to estimate the parameters in a variety
More informationINTRODUCTION TO SAS HOW SAS WORKS READING RAW DATA INTO SAS
TO SAS NEED FOR SAS WHO USES SAS WHAT IS SAS? OVERVIEW OF BASE SAS SOFTWARE DATA MANAGEMENT FACILITY STRUCTURE OF SAS DATASET SAS PROGRAM PROGRAMMING LANGUAGE ELEMENTS OF THE SAS LANGUAGE RULES FOR SAS
More informationDr. Barbara Morgan Quantitative Methods
Dr. Barbara Morgan Quantitative Methods 195.650 Basic Stata This is a brief guide to using the most basic operations in Stata. Stata also has an on-line tutorial. At the initial prompt type tutorial. In
More informationEXST SAS Lab Lab #6: More DATA STEP tasks
EXST SAS Lab Lab #6: More DATA STEP tasks Objectives 1. Working from an current folder 2. Naming the HTML output data file 3. Dealing with multiple observations on an input line 4. Creating two SAS work
More informationCSC 328/428 Summer Session I 2002 Data Analysis for the Experimenter FINAL EXAM
options pagesize=53 linesize=76 pageno=1 nodate; proc format; value $stcktyp "1"="Growth" "2"="Combined" "3"="Income"; data invstmnt; input stcktyp $ perform; label stkctyp="type of Stock" perform="overall
More informationBUSINESS ANALYTICS. 96 HOURS Practical Learning. DexLab Certified. Training Module. Gurgaon (Head Office)
SAS (Base & Advanced) Analytics & Predictive Modeling Tableau BI 96 HOURS Practical Learning WEEKDAY & WEEKEND BATCHES CLASSROOM & LIVE ONLINE DexLab Certified BUSINESS ANALYTICS Training Module Gurgaon
More informationSTAT 2607 REVIEW PROBLEMS Word problems must be answered in words of the problem.
STAT 2607 REVIEW PROBLEMS 1 REMINDER: On the final exam 1. Word problems must be answered in words of the problem. 2. "Test" means that you must carry out a formal hypothesis testing procedure with H0,
More informationStat 302 Statistical Software and Its Applications SAS: Data I/O
Stat 302 Statistical Software and Its Applications SAS: Data I/O Yen-Chi Chen Department of Statistics, University of Washington Autumn 2016 1 / 33 Getting Data Files Get the following data sets from the
More informationSPSS QM II. SPSS Manual Quantitative methods II (7.5hp) SHORT INSTRUCTIONS BE CAREFUL
SPSS QM II SHORT INSTRUCTIONS This presentation contains only relatively short instructions on how to perform some statistical analyses in SPSS. Details around a certain function/analysis method not covered
More informationCentering and Interactions: The Training Data
Centering and Interactions: The Training Data A random sample of 150 technical support workers were first given a test of their technical skill and knowledge, and then randomly assigned to one of three
More informationBrief Guide on Using SPSS 10.0
Brief Guide on Using SPSS 10.0 (Use student data, 22 cases, studentp.dat in Dr. Chang s Data Directory Page) (Page address: http://www.cis.ysu.edu/~chang/stat/) I. Processing File and Data To open a new
More informationEXST SAS Lab Lab #8: More data step and t-tests
EXST SAS Lab Lab #8: More data step and t-tests Objectives 1. Input a text file in column input 2. Output two data files from a single input 3. Modify datasets with a KEEP statement or option 4. Prepare
More informationMultiple Regression White paper
+44 (0) 333 666 7366 Multiple Regression White paper A tool to determine the impact in analysing the effectiveness of advertising spend. Multiple Regression In order to establish if the advertising mechanisms
More informationSTA9750 Lecture I OUTLINE 1. WELCOME TO 9750!
STA9750 Lecture I OUTLINE 1. Welcome to STA9750! a. Blackboard b. Tentative syllabus c. Remote access to SAS 2. Introduction to reading data with SAS a. Manual input b. Reading from a text file c. Import
More informationAURA ACADEMY SAS TRAINING. Opposite Hanuman Temple, Srinivasa Nagar East, Ameerpet,Hyderabad Page 1
SAS TRAINING SAS/BASE BASIC THEORY & RULES ETC SAS WINDOWING ENVIRONMENT CREATION OF LIBRARIES SAS PROGRAMMING (BRIEFLY) - DATASTEP - PROC STEP WAYS TO READ DATA INTO SAS BACK END PROCESS OF DATASTEP INSTALLATION
More informationStat 5100 Handout #14.a SAS: Logistic Regression
Stat 5100 Handout #14.a SAS: Logistic Regression Example: (Text Table 14.3) Individuals were randomly sampled within two sectors of a city, and checked for presence of disease (here, spread by mosquitoes).
More informationLevel I: Getting comfortable with my data in SAS. Descriptive Statistics
Level I: Getting comfortable with my data in SAS. Descriptive Statistics Quick Review of reading Data into SAS Preparing Data 1. Variable names in the first row make sure they are appropriate for the statistical
More informationTable Of Contents. Table Of Contents
Statistics Table Of Contents Table Of Contents Basic Statistics... 7 Basic Statistics Overview... 7 Descriptive Statistics Available for Display or Storage... 8 Display Descriptive Statistics... 9 Store
More informationUnit 1 Review of BIOSTATS 540 Practice Problems SOLUTIONS - Stata Users
BIOSTATS 640 Spring 2018 Review of Introductory Biostatistics STATA solutions Page 1 of 13 Key Comments begin with an * Commands are in bold black I edited the output so that it appears here in blue Unit
More informationIntermediate SAS: Statistics
Intermediate SAS: Statistics OIT TSS 293-4444 oithelp@mail.wvu.edu oit.wvu.edu/training/classmat/sas/ Table of Contents Procedures... 2 Two-sample t-test:... 2 Paired differences t-test:... 2 Chi Square
More informationAn introduction to SPSS
An introduction to SPSS To open the SPSS software using U of Iowa Virtual Desktop... Go to https://virtualdesktop.uiowa.edu and choose SPSS 24. Contents NOTE: Save data files in a drive that is accessible
More informationSAS data statements and data: /*Factor A: angle Factor B: geometry Factor C: speed*/
STAT:5201 Applied Statistic II (Factorial with 3 factors as 2 3 design) Three-way ANOVA (Factorial with three factors) with replication Factor A: angle (low=0/high=1) Factor B: geometry (shape A=0/shape
More informationMaximizing Statistical Interactions Part II: Database Issues Provided by: The Biostatistics Collaboration Center (BCC) at Northwestern University
Maximizing Statistical Interactions Part II: Database Issues Provided by: The Biostatistics Collaboration Center (BCC) at Northwestern University While your data tables or spreadsheets may look good to
More informationSPSS. (Statistical Packages for the Social Sciences)
Inger Persson SPSS (Statistical Packages for the Social Sciences) SHORT INSTRUCTIONS This presentation contains only relatively short instructions on how to perform basic statistical calculations in SPSS.
More informationWeek 6, Week 7 and Week 8 Analyses of Variance
Week 6, Week 7 and Week 8 Analyses of Variance Robyn Crook - 2008 In the next few weeks we will look at analyses of variance. This is an information-heavy handout so take your time reading it, and don
More informationExcel 2010 with XLSTAT
Excel 2010 with XLSTAT J E N N I F E R LE W I S PR I E S T L E Y, PH.D. Introduction to Excel 2010 with XLSTAT The layout for Excel 2010 is slightly different from the layout for Excel 2007. However, with
More informationCreating a data file and entering data
4 Creating a data file and entering data There are a number of stages in the process of setting up a data file and analysing the data. The flow chart shown on the next page outlines the main steps that
More informationPsychology 282 Lecture #21 Outline Categorical IVs in MLR: Effects Coding and Contrast Coding
Psychology 282 Lecture #21 Outline Categorical IVs in MLR: Effects Coding and Contrast Coding In the previous lecture we learned how to incorporate a categorical research factor into a MLR model by using
More information8. MINITAB COMMANDS WEEK-BY-WEEK
8. MINITAB COMMANDS WEEK-BY-WEEK In this section of the Study Guide, we give brief information about the Minitab commands that are needed to apply the statistical methods in each week s study. They are
More informationPSY 9556B (Feb 5) Latent Growth Modeling
PSY 9556B (Feb 5) Latent Growth Modeling Fixed and random word confusion Simplest LGM knowing how to calculate dfs How many time points needed? Power, sample size Nonlinear growth quadratic Nonlinear growth
More informationExample1D.1.sas. * Procedures : ; * 1. print to show the dataset. ;
Example1D.1.sas * SAS example program 1D.1 ; * 1. Create a dataset called prob from the following data: ; * age prob lb ub ; * 24.25.20.31 ; * 36.26.21.32 ; * 48.28.24.33 ; * 60.31.28.36 ; * 72.35.32.39
More informationApplied Regression Modeling: A Business Approach
i Applied Regression Modeling: A Business Approach Computer software help: SAS code SAS (originally Statistical Analysis Software) is a commercial statistical software package based on a powerful programming
More informationThe SAS interface is shown in the following screen shot:
The SAS interface is shown in the following screen shot: There are several items of importance shown in the screen shot First there are the usual main menu items, such as File, Edit, etc I seldom use anything
More informationStatistical Good Practice Guidelines. 1. Introduction. Contents. SSC home Using Excel for Statistics - Tips and Warnings
Statistical Good Practice Guidelines SSC home Using Excel for Statistics - Tips and Warnings On-line version 2 - March 2001 This is one in a series of guides for research and support staff involved in
More informationWELCOME! Lecture 3 Thommy Perlinger
Quantitative Methods II WELCOME! Lecture 3 Thommy Perlinger Program Lecture 3 Cleaning and transforming data Graphical examination of the data Missing Values Graphical examination of the data It is important
More informationStat 302 Statistical Software and Its Applications SAS: Data I/O & Descriptive Statistics
Stat 302 Statistical Software and Its Applications SAS: Data I/O & Descriptive Statistics Fritz Scholz Department of Statistics, University of Washington Winter Quarter 2015 February 19, 2015 2 Getting
More informationIntroduction to SAS. Cristina Murray-Krezan Research Assistant Professor of Internal Medicine Biostatistician, CTSC
Introduction to SAS Cristina Murray-Krezan Research Assistant Professor of Internal Medicine Biostatistician, CTSC cmurray-krezan@salud.unm.edu 20 August 2018 What is SAS? Statistical Analysis System,
More informationChemical Reaction dataset ( https://stat.wvu.edu/~cjelsema/data/chemicalreaction.txt )
JMP Output from Chapter 9 Factorial Analysis through JMP Chemical Reaction dataset ( https://stat.wvu.edu/~cjelsema/data/chemicalreaction.txt ) Fitting the Model and checking conditions Analyze > Fit Model
More informationSAS Online Training: Course contents: Agenda:
SAS Online Training: Course contents: Agenda: (1) Base SAS (6) Clinical SAS Online Training with Real time Projects (2) Advance SAS (7) Financial SAS Training Real time Projects (3) SQL (8) CV preparation
More informationLastly, in case you don t already know this, and don t have Excel on your computers, you can get it for free through IT s website under software.
Welcome to Basic Excel, presented by STEM Gateway as part of the Essential Academic Skills Enhancement, or EASE, workshop series. Before we begin, I want to make sure we are clear that this is by no means
More informationFactorial ANOVA with SAS
Factorial ANOVA with SAS /* potato305.sas */ options linesize=79 noovp formdlim='_' ; title 'Rotten potatoes'; title2 ''; proc format; value tfmt 1 = 'Cool' 2 = 'Warm'; data spud; infile 'potato2.data'
More informationRobust Linear Regression (Passing- Bablok Median-Slope)
Chapter 314 Robust Linear Regression (Passing- Bablok Median-Slope) Introduction This procedure performs robust linear regression estimation using the Passing-Bablok (1988) median-slope algorithm. Their
More informationApplied Regression Modeling: A Business Approach
i Applied Regression Modeling: A Business Approach Computer software help: SAS SAS (originally Statistical Analysis Software ) is a commercial statistical software package based on a powerful programming
More information2. Don t forget semicolons and RUN statements The two most common programming errors.
Randy s SAS hints March 7, 2013 1. Always begin your programs with internal documentation. * ***************** * Program =test1, Randy Ellis, March 8, 2013 ***************; 2. Don t forget semicolons and
More informationBase and Advance SAS
Base and Advance SAS BASE SAS INTRODUCTION An Overview of the SAS System SAS Tasks Output produced by the SAS System SAS Tools (SAS Program - Data step and Proc step) A sample SAS program Exploring SAS
More informationSTAT 7000: Experimental Statistics I
STAT 7000: Experimental Statistics I 2. A Short SAS Tutorial Peng Zeng Department of Mathematics and Statistics Auburn University Fall 2009 Peng Zeng (Auburn University) STAT 7000 Lecture Notes Fall 2009
More informationPR3 & PR4 CBR Activities Using EasyData for CBL/CBR Apps
Summer 2006 I2T2 Process Page 23. PR3 & PR4 CBR Activities Using EasyData for CBL/CBR Apps The TI Exploration Series for CBR or CBL/CBR books, are all written for the old CBL/CBR Application. Now we can
More informationEXST3201 Mousefeed01 Page 1
EXST3201 Mousefeed01 Page 1 3 /* 4 Examine differences among the following 6 treatments 5 N/N85 fed normally before weaning and 85 kcal/wk after 6 N/R40 fed normally before weaning and 40 kcal/wk after
More informationANSWERS -- Prep for Psyc350 Laboratory Final Statistics Part Prep a
ANSWERS -- Prep for Psyc350 Laboratory Final Statistics Part Prep a Put the following data into an spss data set: Be sure to include variable and value labels and missing value specifications for all variables
More informationQuantitative - One Population
Quantitative - One Population The Quantitative One Population VISA procedures allow the user to perform descriptive and inferential procedures for problems involving one population with quantitative (interval)
More informationPLS205 Lab 1 January 9, Laboratory Topics 1 & 2
PLS205 Lab 1 January 9, 2014 Laboratory Topics 1 & 2 Welcome, introduction, logistics, and organizational matters Introduction to SAS Writing and running programs saving results checking for errors Different
More informationPaper S Data Presentation 101: An Analyst s Perspective
Paper S1-12-2013 Data Presentation 101: An Analyst s Perspective Deanna Chyn, University of Michigan, Ann Arbor, MI Anca Tilea, University of Michigan, Ann Arbor, MI ABSTRACT You are done with the tedious
More informationWeek 4: Simple Linear Regression III
Week 4: Simple Linear Regression III Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Goodness of
More informationGetting Started with the SGPLOT Procedure
ABSTRACT Getting Started with the SGPLOT Procedure Joshua M. Horstman, Nested Loop Consulting Do you want to create highly-customizable, publication-ready graphics in just minutes using SAS? This workshop
More informationFrequency Tables. Chapter 500. Introduction. Frequency Tables. Types of Categorical Variables. Data Structure. Missing Values
Chapter 500 Introduction This procedure produces tables of frequency counts and percentages for categorical and continuous variables. This procedure serves as a summary reporting tool and is often used
More informationMHPE 494: Data Analysis. Welcome! The Analytic Process
MHPE 494: Data Analysis Alan Schwartz, PhD Department of Medical Education Memoona Hasnain,, MD, PhD, MHPE Department of Family Medicine College of Medicine University of Illinois at Chicago Welcome! Your
More informationPreparing for Data Analysis
Preparing for Data Analysis Prof. Andrew Stokes March 21, 2017 Managing your data Entering the data into a database Reading the data into a statistical computing package Checking the data for errors and
More informationFrom Manual to Automatic with Overdrive - Using SAS to Automate Report Generation Faron Kincheloe, Baylor University, Waco, TX
Paper 152-27 From Manual to Automatic with Overdrive - Using SAS to Automate Report Generation Faron Kincheloe, Baylor University, Waco, TX ABSTRACT This paper is a case study of how SAS products were
More informationCorrectly Compute Complex Samples Statistics
SPSS Complex Samples 15.0 Specifications Correctly Compute Complex Samples Statistics When you conduct sample surveys, use a statistics package dedicated to producing correct estimates for complex sample
More informationFactorial ANOVA. Skipping... Page 1 of 18
Factorial ANOVA The potato data: Batches of potatoes randomly assigned to to be stored at either cool or warm temperature, infected with one of three bacterial types. Then wait a set period. The dependent
More informationSTAT:5400 Computing in Statistics
STAT:5400 Computing in Statistics Introduction to SAS Lecture 18 Oct 12, 2015 Kate Cowles 374 SH, 335-0727 kate-cowles@uiowaedu SAS SAS is the statistical software package most commonly used in business,
More informationCH5: CORR & SIMPLE LINEAR REFRESSION =======================================
STAT 430 SAS Examples SAS5 ===================== ssh xyz@glue.umd.edu, tap sas913 (old sas82), sas https://www.statlab.umd.edu/sasdoc/sashtml/onldoc.htm CH5: CORR & SIMPLE LINEAR REFRESSION =======================================
More informationIQR = number. summary: largest. = 2. Upper half: Q3 =
Step by step box plot Height in centimeters of players on the 003 Women s Worldd Cup soccer team. 157 1611 163 163 164 165 165 165 168 168 168 170 170 170 171 173 173 175 180 180 Determine the 5 number
More informationApplied Regression Modeling: A Business Approach
i Applied Regression Modeling: A Business Approach Computer software help: SPSS SPSS (originally Statistical Package for the Social Sciences ) is a commercial statistical software package with an easy-to-use
More informationResearch Methods for Business and Management. Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel
Research Methods for Business and Management Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel A Simple Example- Gym Purpose of Questionnaire- to determine the participants involvement
More informationWHO STEPS Surveillance Support Materials. STEPS Epi Info Training Guide
STEPS Epi Info Training Guide Department of Chronic Diseases and Health Promotion World Health Organization 20 Avenue Appia, 1211 Geneva 27, Switzerland For further information: www.who.int/chp/steps WHO
More informationPoisson Regressions for Complex Surveys
Poisson Regressions for Complex Surveys Overview Researchers often use sample survey methodology to obtain information about a large population by selecting and measuring a sample from that population.
More informationDSCI 325: Handout 2 Getting Data into SAS Spring 2017
DSCI 325: Handout 2 Getting Data into SAS Spring 2017 Data sets come in many different formats. In some situations, data sets are stored on paper (e.g., surveys) and other times data are stored in huge
More informationSTA 570 Spring Lecture 5 Tuesday, Feb 1
STA 570 Spring 2011 Lecture 5 Tuesday, Feb 1 Descriptive Statistics Summarizing Univariate Data o Standard Deviation, Empirical Rule, IQR o Boxplots Summarizing Bivariate Data o Contingency Tables o Row
More informationAn Introduction to SAS University Edition
An Introduction to SAS University Edition Ron Cody From An Introduction to SAS University Edition. Full book available for purchase here. Contents List of Programs... xi About This Book... xvii About the
More informationChapter 6: DESCRIPTIVE STATISTICS
Chapter 6: DESCRIPTIVE STATISTICS Random Sampling Numerical Summaries Stem-n-Leaf plots Histograms, and Box plots Time Sequence Plots Normal Probability Plots Sections 6-1 to 6-5, and 6-7 Random Sampling
More informationData Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski
Data Analysis and Solver Plugins for KSpread USER S MANUAL Tomasz Maliszewski tmaliszewski@wp.pl Table of Content CHAPTER 1: INTRODUCTION... 3 1.1. ABOUT DATA ANALYSIS PLUGIN... 3 1.3. ABOUT SOLVER PLUGIN...
More informationStatistical Tests for Variable Discrimination
Statistical Tests for Variable Discrimination University of Trento - FBK 26 February, 2015 (UNITN-FBK) Statistical Tests for Variable Discrimination 26 February, 2015 1 / 31 General statistics Descriptional:
More informationMacros and ODS. SAS Programming November 6, / 89
Macros and ODS The first part of these slides overlaps with last week a fair bit, but it doesn t hurt to review as this code might be a little harder to follow. SAS Programming November 6, 2014 1 / 89
More informationMINITAB 17 BASICS REFERENCE GUIDE
MINITAB 17 BASICS REFERENCE GUIDE Dr. Nancy Pfenning September 2013 After starting MINITAB, you'll see a Session window above and a worksheet below. The Session window displays non-graphical output such
More informationWINKS SDA Windows KwikStat Statistical Data Analysis and Graphs Getting Started Guide
WINKS SDA Windows KwikStat Statistical Data Analysis and Graphs Getting Started Guide 2011 Version 6A Do these tutorials first This series of tutorials provides a quick start to using WINKS. Feel free
More informationComputational Mathematics/Information Technology. Worksheet 2 Iteration and Excel
Computational Mathematics/Information Technology Worksheet 2 Iteration and Excel This sheet uses Excel and the method of iteration to solve the problem f(x) = 0. It introduces user functions and self referencing
More information3. Almost always use system options options compress =yes nocenter; /* mostly use */ options ps=9999 ls=200;
Randy s SAS hints, updated Feb 6, 2014 1. Always begin your programs with internal documentation. * ***************** * Program =test1, Randy Ellis, first version: March 8, 2013 ***************; 2. Don
More informationIntroduction. About this Document. What is SPSS. ohow to get SPSS. oopening Data
Introduction About this Document This manual was written by members of the Statistical Consulting Program as an introduction to SPSS 12.0. It is designed to assist new users in familiarizing themselves
More informationSTATS PAD USER MANUAL
STATS PAD USER MANUAL For Version 2.0 Manual Version 2.0 1 Table of Contents Basic Navigation! 3 Settings! 7 Entering Data! 7 Sharing Data! 8 Managing Files! 10 Running Tests! 11 Interpreting Output! 11
More information2) familiarize you with a variety of comparative statistics biologists use to evaluate results of experiments;
A. Goals of Exercise Biology 164 Laboratory Using Comparative Statistics in Biology "Statistics" is a mathematical tool for analyzing and making generalizations about a population from a number of individual
More informationBaruch College STA Senem Acet Coskun
Baruch College STA 9750 BOOK BUY A Predictive Mode Senem Acet Coskun Table of Contents Summary 3 Why this topic? 4 Data Sources 6 Variable Definitions 7 Descriptive Statistics 8 Univariate Analysis 9 Two-Sample
More informationA Step by Step Guide to Learning SAS
A Step by Step Guide to Learning SAS 1 Objective Familiarize yourselves with the SAS programming environment and language. Learn how to create and manipulate data sets in SAS and how to use existing data
More information- 1 - Fig. A5.1 Missing value analysis dialog box
WEB APPENDIX Sarstedt, M. & Mooi, E. (2019). A concise guide to market research. The process, data, and methods using SPSS (3 rd ed.). Heidelberg: Springer. Missing Value Analysis and Multiple Imputation
More informationLab 3 (80 pts.) - Assessing the Normality of Data Objectives: Creating and Interpreting Normal Quantile Plots
STAT 350 (Spring 2015) Lab 3: SAS Solutions 1 Lab 3 (80 pts.) - Assessing the Normality of Data Objectives: Creating and Interpreting Normal Quantile Plots Note: The data sets are not included in the solutions;
More informationExperiment 1 CH Fall 2004 INTRODUCTION TO SPREADSHEETS
Experiment 1 CH 222 - Fall 2004 INTRODUCTION TO SPREADSHEETS Introduction Spreadsheets are valuable tools utilized in a variety of fields. They can be used for tasks as simple as adding or subtracting
More informationLab 1: Introduction to Data
1 Lab 1: Introduction to Data Some define Statistics as the field that focuses on turning information into knowledge. The first step in that process is to summarize and describe the raw information the
More informationTHE L.L. THURSTONE PSYCHOMETRIC LABORATORY UNIVERSITY OF NORTH CAROLINA. Forrest W. Young & Carla M. Bann
Forrest W. Young & Carla M. Bann THE L.L. THURSTONE PSYCHOMETRIC LABORATORY UNIVERSITY OF NORTH CAROLINA CB 3270 DAVIE HALL, CHAPEL HILL N.C., USA 27599-3270 VISUAL STATISTICS PROJECT WWW.VISUALSTATS.ORG
More informationMultiple Linear Regression Excel 2010Tutorial For use when at least one independent variable is qualitative
Multiple Linear Regression Excel 2010Tutorial For use when at least one independent variable is qualitative This tutorial combines information on how to obtain regression output for Multiple Linear Regression
More informationContents of SAS Programming Techniques
Contents of SAS Programming Techniques Chapter 1 About SAS 1.1 Introduction 1.1.1 SAS modules 1.1.2 SAS module classification 1.1.3 SAS features 1.1.4 Three levels of SAS techniques 1.1.5 Chapter goal
More informationWant to Do a Better Job? - Select Appropriate Statistical Analysis in Healthcare Research
Want to Do a Better Job? - Select Appropriate Statistical Analysis in Healthcare Research Liping Huang, Center for Home Care Policy and Research, Visiting Nurse Service of New York, NY, NY ABSTRACT The
More information