Week 3/4 [06+ Sept.] Class Activities. File: week sep07.doc Directory: \\Muserver2\USERS\B\\baileraj\Classes\sta402\handouts

Size: px
Start display at page:

Download "Week 3/4 [06+ Sept.] Class Activities. File: week sep07.doc Directory: \\Muserver2\USERS\B\\baileraj\Classes\sta402\handouts"

Transcription

1 Week 3/4 [06+ Sept.] Class Activities File: week sep07.doc Directory: \\Muserver2\USERS\B\\baileraj\Classes\sta402\handouts Week 3 Topic -- REPORT WRITING * Introduce the Output Delivery System (ODS) for customizing procedure output * PROC TABULATE for producing nicely-formatted tables Week 4 Topic INTRODUCTION TO MODELING PROCS * REG and GLM primarily Bonus Material conversational UNIX ODS References Gupta, S. (2003) Quick Results with the Output Delivery System. SAS Institute Inc., Cary, NC USA. Delwiche LD and Slaughter SJ. (2003) The Little SAS Book: A Primer, 3rd edition. SAS Institute. Cary, NC, USA. [pages ] Haworth LE (2001) Output Delivery System: The Basics. SAS Institute Inc. Cary, NC USA. ODS Basics What is ODS? * method of delivering output in a variety of formats (other than the default listing format ) * options available include HTML, Rich Text Format (RTF), PS, PDF, SAS data sets Basic ODS Terminology destinations locations to which ODS routes output (e.g. LISTING, HTML, RTF, PRINTER, PDF, OUTPUT new data set) objects output entities created by ODS to store the formatted results styles font/color/other attributes of a report Basic syntax of ODS statements * identify output objects; ODS TRACE ON </options>;

2 * open output destination; ODS destination <FILE=filename>; * create SAS data set with output object; ODS OUTPUT output-object-name=sas-data-set-name; * [optional] select particular objects for inclusion; ODS <destination> SELECT output-object-name; PROC PROC PROC ODS <destination> CLOSE; ODS TRACE OFF; ODS to different file types A familiar example proc format; value totfmt 0='none' 1-HIGH='some' ; data d1; infile "\\Muserver2\USERS\B\BAILERAJ\public.www\classes\sta402\SASprograms\ch2-dat.txt" firstobs=16 expandtabs missover pad ; * infile 'M:\public.www\classes\sta402\SAS-programs\ch2-dat.txt' firstobs=16 expandtabs missover pad ; animal conc brood1 brood2 brood3 total 2.; cbrood3 = brood3; format cbrood3 totfmt.; label animal = animal ID number; label conc = Nitrofen concentration; label brood1 = number of young in first brood; label brood2 = number of young in 2nd brood; label brood3 = number of young in 3rd brood; label total = total young produced in three broods; proc print;

3 where conc=0; /* aside: ODS LISTING open as a default. You can have multiple destinations open simultaneously. If you want to close the LISTING destination before generating output then type ODS LISTING CLOSE; before issuing the PROC for which output is desired. */ /* generate HTML files with objects from 3 PROCs */ ODS TRACE ON; * ODS HTML file='m:\public.www\classes\sta402\sas-programs\day6- example.html ; ODS HTML file="\\muserver2\users\b\baileraj\public.www\classes\sta402\sasprograms\ods-html-example.html ; proc plot; plot total*conc=cbrood3 / vaxis=0 to 40 by 2; proc freq; table conc*cbrood3 / nopct nocol chisq trend exact; proc univariate plot; by conc; var total; ODS HTML CLOSE; ODS TRACE OFF; /* now generate HTML files with additional linkage info */ ODS TRACE ON; ODS HTML path='\\muserver2\users\b\baileraj\public.www\classes\sta402\sasprograms body = day6-example2.html /* Output objects */ contents = day6-example2-toc.html /* Table of contents */ frame = day6-example2-frame.html /* organizes display */ newfile = NONE; /* all results to one file*/ /* old code where M drive referenced vs. specification of the full path ODS HTML path='m:\public.www\classes\sta402\sas-programs body = day6-example2.html /* Output objects */ contents = day6-example2-toc.html /* Table of contents */ frame = day6-example2-frame.html /* organizes display */ newfile = NONE; /* all results to one file*/ */ /* comment: by default, opens a new body file for each part of output so the

4 newfile=none directs all output to the same body file newfile=page creates new body file for each page of output */ proc plot; plot total*conc=cbrood3 / vaxis=0 to 40 by 2; proc freq; table conc*cbrood3 / nopct nocol chisq trend exact; proc univariate plot; by conc; var total; ODS HTML CLOSE; ODS TRACE OFF; /* select on one of the output objects for inclusion */ *ODS HTML file='m:\public.www\classes\sta402\sas-programs\day6- example3.html ; ODS HTML file= \\Muserver2\USERS\B\BAILERAJ\public.www\classes\sta402\SASprograms\day6-example3.html ; ODS HTML SELECT SSPLOTS; ODS HTML SHOW; /* write details to SASLOG confirming object sel. */ proc univariate plot; by conc; var total; ODS HTML CLOSE; /* select different destinations */ options orientation=landscape nocenter nodate; ODS ESCAPECHAR= ^ ; /* for fancy formatting later */ /* old program with M drive reference ODS RTF file='m:\public.www\classes\sta402\sas-programs\day6-example.rtf ; ODS PDF file='m:\public.www\classes\sta402\sas-programs\day6-example.pdf ;

5 ODS PS file='m:\public.www\classes\sta402\sas-programs\day6-example.ps ; */ ODS RTF file='\\muserver2\users\b\baileraj\public.www\classes\sta402\sasprograms\day6-example.rtf ; ODS PDF file='\\muserver2\users\b\baileraj\public.www\classes\sta402\sasprograms\day6-example.pdf ; ODS PS file='\\muserver2\users\b\baileraj\public.www\classes\sta402\sasprograms\day6-example.ps ; Title Plot of number of young vs. Nitrofen concentration^{super a} ; Footnote1 ^{super a}s=some young produced in Brood 3, n= no young produced in Brood 3 ; proc plot; plot total*conc=cbrood3 / vaxis=0 to 40 by 2; ODS RTF CLOSE; ODS PDF CLOSE; ODS PS CLOSE; ODS to create output data sets proc sort data=d1; by conc; ODS TRACE ON; /* see what ODS objects are created by univariate */ proc univariate data=d1; by conc; var total; ODS TRACE OFF; ODS OUTPUT Quantiles=data_quant; /* extract quantiles */ proc univariate data=d1; by conc; var total; ODS OUTPUT CLOSE; proc print data=data_quant; Var Obs conc Name Quantile Estimate 1 0 total 100% Max total 99% total 95% total 90% total 75% Q total 50% Median total 25% Q total 10% total 5% total 1% total 0% Min edited output total 100% Max total 99% total 95% total 90% total 75% Q3 6.0

6 total 50% Median total 25% Q total 10% total 5% total 1% total 0% Min 0.0 /* can also create multiple data sets */ ODS OUTPUT Quantiles(MATCH_ALL=conc_name_macro)=data_quant; proc univariate data=d1; by conc; var total; ODS OUTPUT CLOSE; proc print data=data_quant; from the SAS LOG file NOTE: The data set WORK.DATA_QUANT has 11 observations and 4 variables. NOTE: The above message was for the following by-group: Nitrofen concentration=0 NOTE: The data set WORK.DATA_QUANT1 has 11 observations and 4 variables. NOTE: The above message was for the following by-group: Nitrofen concentration=80 NOTE: The data set WORK.DATA_QUANT2 has 11 observations and 4 variables. NOTE: The above message was for the following by-group: Nitrofen concentration=160 NOTE: The data set WORK.DATA_QUANT3 has 11 observations and 4 variables. NOTE: The above message was for the following by-group: Nitrofen concentration=235 NOTE: The data set WORK.DATA_QUANT4 has 11 observations and 4 variables. NOTE: The above message was for the following by-group: Nitrofen concentration=310 /* write the data set names to the SAS LOG */ %put The conc_name_macro variables contains the following data sets &conc_name_macro; 76 %put The conc_name_macro variables contains the following data sets &conc_name_macro; The conc_name_macro variables contains the following data sets DATA_QUANT DATA_QUANT1 DATA_QUANT2 DATA_QUANT3 DATA_QUANT4&conc_name_macro; /* merge the concentration summary files to create single table */ data c0; set DATA_QUANT; rename Estimate=C0_Est; key=_n_; drop VarName conc; data c80; set DATA_QUANT1; rename Estimate=C80_Est; key=_n_; drop VarName conc; data c160; set DATA_QUANT2; rename Estimate=C160_Est; key=_n_; drop VarName conc; data c235; set DATA_QUANT3; rename Estimate=C235_Est; key=_n_; drop VarName conc; data c310; set DATA_QUANT4; rename Estimate=C310_Est; key=_n_; drop VarName conc; data all; merge c0 c80 c160 c235 c310; by key; drop key; proc print data=all; Obs Quantile C0_Est C80_Est C160_Est C235_Est C310_Est

7 1 100% Max % % % % Q % Median % Q % % % % Min /* extract the rows-observations corresponding to the 5 number summary */ data fivenum; set all; if _n_=1 or _n_=5 or _n_=6 or _n_=7 or _n_=11; proc print; Obs Quantile C0_Est C80_Est C160_Est C235_Est C310_Est 1 100% Max % Q % Median % Q % Min Using ODS OUTPUT to create dataset in a simulation /* Extracting coefficients from simple linear regression simulation */ options formdlim="-" nodate; /* generate simulation data sets Y ~ N(mu(x)= 3+2x, sigma=2) */ data sims; do dataset=1 to 1000; do x=1 to 10; y = 3 + 2*x + 2*rannor(0); output; end; end; /* DEBUG: print to check generated data */ proc print data=sims; /* SORT for data set */ proc sort data=sims; by dataset;

8 /* USE OUTEST to extract the estimated coefficients */ proc reg data=sims outest=myparms; by dataset; model y=x; proc print data=myparms; /* HISTOGRAM for estimated slope */ proc gchart data=work.myparms; vbar x; /* Re-do this with ODS */ *ods trace on; * determine what output objects are constructed; ods output ParameterEstimates=reg_coefs; proc reg data=sims; by dataset; model y=x; proc print data=reg_coefs; ods output close; *ods trace off; proc print data=reg_coefs; proc contents data=reg_coefs; data slopes; set reg_coefs; if Variable="x"; slope=estimate; keep dataset slope; data intercepts; set reg_coefs; if Variable="Intercept"; Intercept = Estimate; keep dataset intercept; data both; merge slopes intercepts; by dataset;

9 proc gplot data=both; title "Plot of estimated slope vs. estimated intercept"; plot slope*intercept; proc gchart data=both; title "Sampling distribution of the estimated slope"; vbar slope; proc gchart data=both title "Sampling distribution of the estimated intercept"; vbar intercept; proc print data=slopes; PROC TABULATE (producing fancier results tables in SAS) PROC TABULATE <option(s)>; CLASS variable(s) </ options>; * identify non-numeric vars; FREQ variable; * identify variable containing frequency of observation; TABLE <<page-expression,> row-expression,> column-expression</ table-option(s)>; VAR analysis-variable(s)</ options>; * identify analysis vars; WEIGHT variable; * identify variable name e.g. sampling wts; * FORMATTING related subcommands CLASSLEV variable(s) / STYLE=<style-element-name PARENT> <[style-attribute-specification(s)] >; KEYLABEL keyword-1='description-1' <...keyword-n='description-n'>; KEYWORD keyword(s) / STYLE=<style-element-name PARENT> <[style-attribute-specification(s)] >;

10 [* check out results of search for Tabulate syntax on SAS doc] Comments: * concatenation (blank) operator * crossing (*) operator * format modifiers * grouping elements (parentheses) operator * ALL class variable data d1; infile 'M:\public.www\classes\sta402\SAS-programs\ch2-dat.txt' firstobs=16 expandtabs missover pad ; animal conc brood1 brood2 brood3 total 2.; proc tabulate data=d1; class conc; var brood1 brood2 brood3 total; table (brood1 brood2 brood3 total)*conc, min q1 median q3 max; proc tabulate data=d1; class conc; var total; table conc= Nitrofen Concentration all, total (mean var);

11 Week 04+/- [12+ Sept.] Class Activities AN INTRODUCTION TO STATISTICAL MODELING * PROC REG for linear modeling (a very basic introduction) * PROC GLM for anova models Other normal response modeling ANOVA balanced anova models Non-normal response modeling GENMOD generalized linear models LOGISTIC [grouped] binary regression PROBIT [grouped] binary regression (INVERSECL) CATMOD categorical data modeling Failure time modeling LIFEREG accelerated failure time models PHREG Cox s PH model And more REGRESSION using PROC REG Basic Model: Y i = X i + i [ simple linear regression ] = X i1 + 2 X i2 + 3 X i3 + 4 X i4 + 5 X i5 + ij [ multiple linear regression ] Error Assumption: ij ~ indep. N(0, 2 ) i=1,2,,n [observations]

12 /* example sas program that does simple linear regression */ options ls=75; data example1; input year nboats manatees; cards; ; /* WARNING: ODS RTF will place TITLE information along With SAS date/time/page number as part of a header in the RTP document. Check out Print Preview or view the header. */ ODS RTF file='d:\baileraj\classes\fall 2003\sta402\SAS-programs\linregoutput.rtf ; proc reg; title Number of Manatees killed regressed on the number of boats registered in Florida ; model manatees = nboats / p r cli clm; plot manatees*nboats= o p.*nboats= + / overlay; plot r.*nboats r.*p.; ODS RTF CLOSE;

13 Source DF Analysis of Variance Sum of Squares Mean Square F Value Pr > F Model <.0001 Error Corrected Total Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var Variable DF Parameter Estimates Parameter Estimate Standard Error t Value Pr > t Intercept nboats <.0001

14 Obs Dep Var manatees Predicted Value Output Statistics Std Error Mean Predict 95% CL Mean 95% CL Predict Residual Std Error Residual Student Residual Output Statistics Obs ** 3 ** 4 ** 5 6 * 7 **** 8 ** 9 Cook's D

15 Output Statistics Obs * * 14 Cook's D Sum of Residuals 0 Sum of Squared Residuals Predicted Residual SS (PRESS) Multiple Regression with indicator variables Log(Brain Weight) i = Log(Body Weight) i + i Log(Brain Weight) i = Log(Body Weight) i + 2 I dinoi + i Log(Brain Weight) i = Log(Body Weight) i + 2 I dinoi + 3 I dinoi Log(Body Weight) i + i data mrexample; * Lunneborg (1994); * body weight brain example; input species $ bodywt logbody = log10(bodywt); logbrain = log10(brainwt); idino = 0; if (species="diplodoc" or species="tricerat" or species="brachios") then idino=1; idinobod = idino*logbody; cards; beaver cow wolf goat guipig diplodocus asielephant donkey horse potarmonkey cat giraffe gorilla human afrelephant triceratops rhemonkey kangaroo hamster mouse rabbit sheep jaguar chimp brachiosaurus rat mole pig

16 ; ODS RTF file='d:\baileraj\classes\fall 2003\sta402\SAS-programs\mregoutput.rtf ; proc print; title brain wt - body wt data ; proc univariate; var bodywt brainwt; id species; proc reg; title2 allometric scaling - brain and body wt. ; title3 [All Species combined] ; model logbrain=logbody; plot logbrain*logbody="o" p.*logbody="+" / overlay; plot r.*logbody; proc reg; title2 Dinosaurs fitted with potentially different line ; model logbrain=logbody idino idinobod; plot logbrain*logbody="o" p.*logbody="+" / overlay; plot r.*logbody; proc reg; title2 Dinosaurs fitted with potentially different INTERCEPTS ; model logbrain=logbody idino; plot logbrain*logbody="o" p.*logbody="+" / overlay; plot r.*logbody; ODS RTF CLOSE; Obs species bodywt brainwt logbody logbrain idino idinobod 1 beaver cow wolf goat guipig diplodoc asieleph donkey horse

17 Obs species bodywt brainwt logbody logbrain idino idinobod 10 potarmon cat giraffe gorilla human afreleph tricerat rhemonke kangaroo hamster mouse rabbit sheep jaguar chimp brachios rat mole pig

18 brain wt - body wt data The UNIVARIATE Procedure Variable: bodywt BODY WEIGHT Moments N 28 Sum Weights 28 Mean Sum Observations Std Deviation Variance Skewness Kurtosis Uncorrected SS Corrected SS Coeff Variation Std Error Mean Basic Statistical Measures Location Variability Mean Std Deviation Median Variance Mode. Range Interquartile Range Tests for Location: Mu0=0 Test Statistic p Value Student's t t Pr > t Sign M 14 Pr >= M <.0001 Signed Rank S 203 Pr >= S <.0001 Quantiles (Definition 5) Quantile Estimate 100% Max % % % % Q % Median % Q % % 0.120

19 brain wt - body wt data The UNIVARIATE Procedure Variable: bodywt Quantiles (Definition 5) Quantile Estimate 1% % Min Extreme Observations Lowest Highest Value species Obs Value species Obs mouse asieleph hamster afreleph mole tricerat rat diplodoc guipig brachios 25

20 brain wt - body wt data The UNIVARIATE Procedure Variable: brainwt BRAIN WEIGHT Moments N 28 Sum Weights 28 Mean Sum Observations Std Deviation Variance Skewness Kurtosis Uncorrected SS Corrected SS Coeff Variation Std Error Mean Basic Statistical Measures Location Variability Mean Std Deviation 1335 Median Variance Mode Range 5712 Interquartile Range Tests for Location: Mu0=0 Test Statistic p Value Student's t t Pr > t Sign M 14 Pr >= M <.0001 Signed Rank S 203 Pr >= S <.0001 Quantiles (Definition 5) Quantile Estimate 100% Max % % % % Q % Median % Q % % 1.00

21 brain wt - body wt data The UNIVARIATE Procedure Variable: brainwt Quantiles (Definition 5) Quantile Estimate 1% % Min 0.40 Extreme Observations Lowest Highest Value species Obs Value species Obs 0.4 mouse horse hamster giraffe rat human mole asieleph guipig afreleph 15

22 Source Analysis of Variance DF Sum of Squares Mean Square F Value Pr > F Model <.0001 Error Corrected Total Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var Variable DF Parameter Estimates Parameter Estimate Standard Error t Value Pr > t Intercept <.0001 logbody <.0001

23 Source Analysis of Variance DF Sum of Squares Mean Square F Value Pr > F Model <.0001 Error Corrected Total Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var Variable DF Parameter Estimates Parameter Estimate Standard Error t Value Pr > t Intercept <.0001 logbody <.0001 idino idinobod

24 Source Analysis of Variance DF Sum of Squares Mean Square F Value Pr > F Model <.0001 Error Corrected Total Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var Variable DF Parameter Estimates Parameter Estimate Standard Error t Value Pr > t Intercept <.0001 logbody <.0001 idino <.0001 One-way ANOVA Basic Model: Y ij = μ i + ij [ cell means coding] = μ + i + ij [ effects coding] (constraint for estimation? 1 =0 or g =0 or i =0) Error Assumption: ij ~ indep. N(0, 2 ) i=1,2,,g [treatments or populations] j=1,2,,n i [replications] H 0 : μ 1 = = μ g or equivalently, H 0 : 1 = = g =0 /* Bacteria in meat under 4 different conditions */ options ls = 75; data meat;

25 input condition $ datalines; Plastic 7.66 Plastic 6.98 Plastic 7.80 Vacuum 5.26 Vacuum 5.44 Vacuum 5.80 Mixed 7.41 Mixed 7.33 Mixed 7.04 Co Co Co ; title bacteria growth under 4 packaging conditions; ODS RTF file='d:\baileraj\classes\fall 2003\sta402\SAS-programs\onewayoutput.rtf ; proc boxplot; plot logcount*condition; proc glm data=meat order=data; title2 fitting the one-way anova model via GLM; class condition; model logcount = condition; means condition / bon tukey scheffe cldiff lines; lsmeans condition / cl pdiff; contrast plastic vs. rest condition ; output out=new p=yhat r=resid stdr=eresid; proc plot data=new; title2 residual analyses; plot resid*yhat; proc univariate data=new plot; var resid; proc boxplot data=new; plot resid*condition; ODS RTF CLOSE;

26 bacteria growth under 4 packaging conditions fitting the one-way anova model via GLM The GLM Procedure Class condition Class Level Information Levels Values 4 Plastic Vacuum Mixed Co2 Number of observations 12

27 bacteria growth under 4 packaging conditions fitting the one-way anova model via GLM The GLM Procedure Dependent Variable: logcount Source DF Sum of Squares Mean Square F Value Pr > F Model <.0001 Error Corrected Total R-Square Coeff Var Root MSE logcount Mean Source DF Type I SS Mean Square F Value Pr > F condition <.0001 Source DF Type III SS Mean Square F Value Pr > F condition <.0001

28 bacteria growth under 4 packaging conditions fitting the one-way anova model via GLM The GLM Procedure Tukey's Studentized Range (HSD) Test for logcount NOTE: This test controls the Type I experimentwise error rate. Alpha 0.05 Error Degrees of Freedom 8 Error Mean Square Critical Value of Studentized Range Minimum Significant Difference 0.89 Comparisons significant at the 0.05 level are indicated by ***. condition Comparison Difference Between Means Simultaneous 95% Confidence Limits Plastic - Mixed Plastic - Vacuum *** Plastic - Co *** Mixed - Plastic Mixed - Vacuum *** Mixed - Co *** Vacuum - Plastic *** Vacuum - Mixed *** Vacuum - Co *** Co2 - Plastic *** Co2 - Mixed *** Co2 - Vacuum ***

29 bacteria growth under 4 packaging conditions fitting the one-way anova model via GLM The GLM Procedure Bonferroni (Dunn) t Tests for logcount NOTE: This test controls the Type I experimentwise error rate, but it generally has a higher Type II error rate than Tukey's for all pairwise comparisons. Alpha 0.05 Error Degrees of Freedom 8 Error Mean Square Critical Value of t Minimum Significant Difference Comparisons significant at the 0.05 level are indicated by ***. condition Comparison Difference Between Means Simultaneous 95% Confidence Limits Plastic - Mixed Plastic - Vacuum *** Plastic - Co *** Mixed - Plastic Mixed - Vacuum *** Mixed - Co *** Vacuum - Plastic *** Vacuum - Mixed *** Vacuum - Co *** Co2 - Plastic *** Co2 - Mixed *** Co2 - Vacuum ***

30 bacteria growth under 4 packaging conditions fitting the one-way anova model via GLM The GLM Procedure Scheffe's Test for logcount NOTE: This test controls the Type I experimentwise error rate, but it generally has a higher Type II error rate than Tukey's for all pairwise comparisons. Alpha 0.05 Error Degrees of Freedom 8 Error Mean Square Critical Value of F Minimum Significant Difference Comparisons significant at the 0.05 level are indicated by ***. condition Comparison Difference Between Means Simultaneous 95% Confidence Limits Plastic - Mixed Plastic - Vacuum *** Plastic - Co *** Mixed - Plastic Mixed - Vacuum *** Mixed - Co *** Vacuum - Plastic *** Vacuum - Mixed *** Vacuum - Co *** Co2 - Plastic *** Co2 - Mixed *** Co2 - Vacuum ***

31 bacteria growth under 4 packaging conditions fitting the one-way anova model via GLM The GLM Procedure Tukey's Studentized Range (HSD) Test for logcount NOTE: This test controls the Type I experimentwise error rate, but it generally has a higher Type II error rate than REGWQ. Alpha 0.05 Error Degrees of Freedom 8 Error Mean Square Critical Value of Studentized Range Minimum Significant Difference 0.89 Means with the same letter are not significantly different. Tukey Grouping Mean N condition A Plastic A A Mixed B Vacuum C Co2

32 bacteria growth under 4 packaging conditions fitting the one-way anova model via GLM The GLM Procedure Bonferroni (Dunn) t Tests for logcount NOTE: This test controls the Type I experimentwise error rate, but it generally has a higher Type II error rate than REGWQ. Alpha 0.05 Error Degrees of Freedom 8 Error Mean Square Critical Value of t Minimum Significant Difference Means with the same letter are not significantly different. Bon Grouping Mean N condition A Plastic A A Mixed B Vacuum C Co2

33 bacteria growth under 4 packaging conditions fitting the one-way anova model via GLM The GLM Procedure Scheffe's Test for logcount NOTE: This test controls the Type I experimentwise error rate. Alpha 0.05 Error Degrees of Freedom 8 Error Mean Square Critical Value of F Minimum Significant Difference Means with the same letter are not significantly different. Scheffe Grouping Mean N condition A Plastic A A Mixed B Vacuum C Co2

34 bacteria growth under 4 packaging conditions fitting the one-way anova model via GLM The GLM Procedure Least Squares Means condition logcount LSMEAN LSMEAN Number Plastic Vacuum Mixed Co Least Squares Means for effect condition Pr > t for H0: LSMean(i)=LSMean(j) Dependent Variable: logcount i/j < < < < < <.0001 <.0001 <.0001 condition logcount LSMEAN 95% Confidence Limits Plastic Vacuum Mixed Co i Least Squares Means for Effect condition j Difference Between Means 95% Confidence Limits for LSMean(i)-LSMean(j) NOTE: To ensure overall protection level, only probabilities associated with pre-planned comparisons should be used.

35 bacteria growth under 4 packaging conditions fitting the one-way anova model via GLM The GLM Procedure Dependent Variable: logcount Contrast DF Contrast SS Mean Square F Value Pr > F plastic vs. rest <.0001

36 bacteria growth under 4 packaging conditions residual analyses Plot of resid*yhat. Legend: A = 1 obs, B = 2 obs, etc. resid 0.4 ˆ A 0.3 ˆ A A 0.2 ˆ A A A 0.1 ˆ A 0.0 ˆ A -0.1 ˆ -0.2 ˆ A A -0.3 ˆ -0.4 ˆ A -0.5 ˆ A ƒˆƒƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒƒˆ ƒƒƒƒƒƒƒƒƒˆƒ yhat

37 bacteria growth under 4 packaging conditions residual analyses The UNIVARIATE Procedure Variable: resid Moments N 12 Sum Weights 12 Mean 0 Sum Observations 0 Std Deviation Variance Skewness Kurtosis Uncorrected SS Corrected SS Coeff Variation. Std Error Mean Basic Statistical Measures Location Variability Mean Std Deviation Median Variance Mode Range Interquartile Range Tests for Location: Mu0=0 Test Statistic p Value Student's t t 0 Pr > t Sign M 1 Pr >= M Signed Rank S 2 Pr >= S Quantiles (Definition 5) Quantile Estimate 100% Max % % % % Q % Median % Q % % -0.50

38 bacteria growth under 4 packaging conditions residual analyses The UNIVARIATE Procedure Variable: resid Quantiles (Definition 5) Quantile Estimate 1% % Min Extreme Observations Lowest Highest Value Obs Value Obs Stem Leaf # Boxplot *-----* Multiply Stem.Leaf by 10**-1 Normal Probability Plot * *++ * ++++ * *+* *++++ * *+ * +++ +*++ *

39 Two-way ANOVA/ Factorial example 2 - interaction plots Patient Waiting Time data Factorial anova model Basic Model: Y ijk = μ ij + ijk [ cell means coding] = μ + i + j + ( ) ij + ijk [ effects coding] Error Assumption: ijk ~ indep. N(0, 2 ) i=1,2,,g [treatments or populations] j=1,2,,n i [replications] H 0 : all ( ) ij =0 [no interaction] H 0 : all i =0 [no A main effect] H 0 : all j =0 [no B main effect] ODS ESCAPECHAR= ^ ; /* for fancy formatting later */ ODS RTF file='d:\baileraj\classes\fall 2003\sta402\SAS-programs\twowayoutput.rtf ; title Two-way ANOVA/ Factorial example 2 - interaction plots; title2 Patient Waiting Time data; data cwait; input doctype $ practype $ cards; gen group 15 gen group 20 gen group 25 gen group 20 gen solo 20 gen solo 25 gen solo 30 gen solo 25 spec group 30 spec group 25 spec group 30 spec group 35 spec solo 25 spec solo 20 spec solo 30 spec solo 30 ; proc print; proc sort; by doctype practype; proc means noprint; by doctype practype; output out=factmean mean=timemean; proc plot data=factmean; plot timemean*doctype=practype / vaxis=0 to 35 by 5; proc glm data=cwait order=data; class doctype practype; model time=doctype practype; /* equivalent model statement

40 Two-way ANOVA/ Factorial example 2 - interaction plots Patient Waiting Time data model time=doctype practype doctype*practype; */ output out=new p=yhat r=resid; lsmeans doctype practype doctype*practype / stderr pdiff; means doctype practype doctype*practype / tukey; ODS RTF CLOSE; Obs doctype practype time 1 gen group 15 2 gen group 20 3 gen group 25 4 gen group 20 5 gen solo 20 6 gen solo 25 7 gen solo 30 8 gen solo 25 9 spec group spec group spec group spec group spec solo spec solo spec solo spec solo 30

41 Two-way ANOVA/ Factorial example 2 - interaction plots Patient Waiting Time data Plot of timemean*doctype. Symbol is value of practype. g s timemean 35 ˆ 30 ˆ 25 ˆ s 20 ˆ g 15 ˆ 10 ˆ 5 ˆ 0 ˆ ƒƒˆƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒˆƒƒ gen spec doctype

THE UNIVERSITY OF BRITISH COLUMBIA FORESTRY 430 and 533. Time: 50 minutes 40 Marks FRST Marks FRST 533 (extra questions)

THE UNIVERSITY OF BRITISH COLUMBIA FORESTRY 430 and 533. Time: 50 minutes 40 Marks FRST Marks FRST 533 (extra questions) THE UNIVERSITY OF BRITISH COLUMBIA FORESTRY 430 and 533 MIDTERM EXAMINATION: October 14, 2005 Instructor: Val LeMay Time: 50 minutes 40 Marks FRST 430 50 Marks FRST 533 (extra questions) This examination

More information

CSC 328/428 Summer Session I 2002 Data Analysis for the Experimenter FINAL EXAM

CSC 328/428 Summer Session I 2002 Data Analysis for the Experimenter FINAL EXAM options pagesize=53 linesize=76 pageno=1 nodate; proc format; value $stcktyp "1"="Growth" "2"="Combined" "3"="Income"; data invstmnt; input stcktyp $ perform; label stkctyp="type of Stock" perform="overall

More information

Centering and Interactions: The Training Data

Centering and Interactions: The Training Data Centering and Interactions: The Training Data A random sample of 150 technical support workers were first given a test of their technical skill and knowledge, and then randomly assigned to one of three

More information

EXST3201 Mousefeed01 Page 1

EXST3201 Mousefeed01 Page 1 EXST3201 Mousefeed01 Page 1 3 /* 4 Examine differences among the following 6 treatments 5 N/N85 fed normally before weaning and 85 kcal/wk after 6 N/R40 fed normally before weaning and 40 kcal/wk after

More information

STAT:5400 Computing in Statistics

STAT:5400 Computing in Statistics STAT:5400 Computing in Statistics Introduction to SAS Lecture 18 Oct 12, 2015 Kate Cowles 374 SH, 335-0727 kate-cowles@uiowaedu SAS SAS is the statistical software package most commonly used in business,

More information

THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010

THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010 THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL STOR 455 Midterm September 8, INSTRUCTIONS: BOTH THE EXAM AND THE BUBBLE SHEET WILL BE COLLECTED. YOU MUST PRINT YOUR NAME AND SIGN THE HONOR PLEDGE

More information

STAT 503 Fall Introduction to SAS

STAT 503 Fall Introduction to SAS Getting Started Introduction to SAS 1) Download all of the files, sas programs (.sas) and data files (.dat) into one of your directories. I would suggest using your H: drive if you are using a computer

More information

STAT:5201 Applied Statistic II

STAT:5201 Applied Statistic II STAT:5201 Applied Statistic II Two-Factor Experiment (one fixed blocking factor, one fixed factor of interest) Randomized complete block design (RCBD) Primary Factor: Day length (short or long) Blocking

More information

Factorial ANOVA. Skipping... Page 1 of 18

Factorial ANOVA. Skipping... Page 1 of 18 Factorial ANOVA The potato data: Batches of potatoes randomly assigned to to be stored at either cool or warm temperature, infected with one of three bacterial types. Then wait a set period. The dependent

More information

Factorial ANOVA with SAS

Factorial ANOVA with SAS Factorial ANOVA with SAS /* potato305.sas */ options linesize=79 noovp formdlim='_' ; title 'Rotten potatoes'; title2 ''; proc format; value tfmt 1 = 'Cool' 2 = 'Warm'; data spud; infile 'potato2.data'

More information

Week 05 Class Activities

Week 05 Class Activities Week 05 Class Activities File: week-05-24sep07.doc Directory (hp/compaq): C:\baileraj\Classes\Fall 2007\sta402\handouts Directory: \\Muserver2\USERS\B\\baileraj\Classes\sta402\handouts Better Graphics

More information

Using PC SAS/ASSIST* for Statistical Analyses

Using PC SAS/ASSIST* for Statistical Analyses Using PC SAS/ASSIST* for Statistical Analyses Margaret A. Nemeth, Monsanto Company lptroductjon SAS/ASSIST, a user friendly, menu driven applications system, is available on several platforms. This paper

More information

1 The SAS System 23:01 Friday, November 9, 2012

1 The SAS System 23:01 Friday, November 9, 2012 2101f12HW9chickwts.log Saved: Wednesday, November 14, 2012 6:50:49 PM Page 1 of 3 1 The SAS System 23:01 Friday, November 9, 2012 NOTE: Copyright (c) 2002-2010 by SAS Institute Inc., Cary, NC, USA. NOTE:

More information

Lab #9: ANOVA and TUKEY tests

Lab #9: ANOVA and TUKEY tests Lab #9: ANOVA and TUKEY tests Objectives: 1. Column manipulation in SAS 2. Analysis of variance 3. Tukey test 4. Least Significant Difference test 5. Analysis of variance with PROC GLM 6. Levene test for

More information

* Sample SAS program * Data set is from Dean and Voss (1999) Design and Analysis of * Experiments. Problem 3, page 129.

* Sample SAS program * Data set is from Dean and Voss (1999) Design and Analysis of * Experiments. Problem 3, page 129. SAS Most popular Statistical software worldwide. SAS claims that its products are used at over 40,000 sites, including at 90% of the Fortune 500. This will not be all SAS as they make other products, such

More information

EXST SAS Lab Lab #6: More DATA STEP tasks

EXST SAS Lab Lab #6: More DATA STEP tasks EXST SAS Lab Lab #6: More DATA STEP tasks Objectives 1. Working from an current folder 2. Naming the HTML output data file 3. Dealing with multiple observations on an input line 4. Creating two SAS work

More information

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SAS code SAS (originally Statistical Analysis Software) is a commercial statistical software package based on a powerful programming

More information

CH5: CORR & SIMPLE LINEAR REFRESSION =======================================

CH5: CORR & SIMPLE LINEAR REFRESSION ======================================= STAT 430 SAS Examples SAS5 ===================== ssh xyz@glue.umd.edu, tap sas913 (old sas82), sas https://www.statlab.umd.edu/sasdoc/sashtml/onldoc.htm CH5: CORR & SIMPLE LINEAR REFRESSION =======================================

More information

The Kenton Study. (Applied Linear Statistical Models, 5th ed., pp , Kutner et al., 2005) Page 1 of 5

The Kenton Study. (Applied Linear Statistical Models, 5th ed., pp , Kutner et al., 2005) Page 1 of 5 The Kenton Study The Kenton Food Company wished to test four different package designs for a new breakfast cereal. Twenty stores, with approximately equal sales volumes, were selected as the experimental

More information

Repeated Measures Part 4: Blood Flow data

Repeated Measures Part 4: Blood Flow data Repeated Measures Part 4: Blood Flow data /* bloodflow.sas */ options linesize=79 pagesize=100 noovp formdlim='_'; title 'Two within-subjecs factors: Blood flow data (NWK p. 1181)'; proc format; value

More information

A Step by Step Guide to Learning SAS

A Step by Step Guide to Learning SAS A Step by Step Guide to Learning SAS 1 Objective Familiarize yourselves with the SAS programming environment and language. Learn how to create and manipulate data sets in SAS and how to use existing data

More information

Data Management - 50%

Data Management - 50% Exam 1: SAS Big Data Preparation, Statistics, and Visual Exploration Data Management - 50% Navigate within the Data Management Studio Interface Register a new QKB Create and connect to a repository Define

More information

Within-Cases: Multivariate approach part one

Within-Cases: Multivariate approach part one Within-Cases: Multivariate approach part one /* sleep2.sas */ options linesize=79 noovp formdlim=' '; title "Student's Sleep data: Matched t-tests with proc reg"; data bedtime; infile 'studentsleep.data'

More information

Outline. Topic 16 - Other Remedies. Ridge Regression. Ridge Regression. Ridge Regression. Robust Regression. Regression Trees. Piecewise Linear Model

Outline. Topic 16 - Other Remedies. Ridge Regression. Ridge Regression. Ridge Regression. Robust Regression. Regression Trees. Piecewise Linear Model Topic 16 - Other Remedies Ridge Regression Robust Regression Regression Trees Outline - Fall 2013 Piecewise Linear Model Bootstrapping Topic 16 2 Ridge Regression Modification of least squares that addresses

More information

Contrasts and Multiple Comparisons

Contrasts and Multiple Comparisons Contrasts and Multiple Comparisons /* onewaymath.sas */ title2 'Oneway with contrasts and multiple comparisons (Exclude Other/DK)'; %include 'readmath.sas'; if ethnic ne 6; /* Otherwise, throw the case

More information

Week 1 [20-24 Aug 07] Class Activities. FILE: \\Muserver2\USERS\B\baileraj\Classes\sta402\handouts\week-01-22aug07.doc

Week 1 [20-24 Aug 07] Class Activities. FILE: \\Muserver2\USERS\B\baileraj\Classes\sta402\handouts\week-01-22aug07.doc Week 1 [20-24 Aug 07] Class Activities FILE: \\Muserver2\USERS\B\baileraj\Classes\sta402\handouts\week-01-22aug07.doc 0. Roster STA 402 (n=3) Menickelly,Matthew Mix, Nathan Weber, Madeline STA 502 (n=11)

More information

STAT 2607 REVIEW PROBLEMS Word problems must be answered in words of the problem.

STAT 2607 REVIEW PROBLEMS Word problems must be answered in words of the problem. STAT 2607 REVIEW PROBLEMS 1 REMINDER: On the final exam 1. Word problems must be answered in words of the problem. 2. "Test" means that you must carry out a formal hypothesis testing procedure with H0,

More information

Laboratory Topics 1 & 2

Laboratory Topics 1 & 2 PLS205 Lab 1 January 12, 2012 Laboratory Topics 1 & 2 Welcome, introduction, logistics, and organizational matters Introduction to SAS Writing and running programs; saving results; checking for errors

More information

Introductory Guide to SAS:

Introductory Guide to SAS: Introductory Guide to SAS: For UVM Statistics Students By Richard Single Contents 1 Introduction and Preliminaries 2 2 Reading in Data: The DATA Step 2 2.1 The DATA Statement............................................

More information

The ANOVA Procedure (Chapter)

The ANOVA Procedure (Chapter) SAS/STAT 9.3 User s Guide The ANOVA Procedure (Chapter) SAS Documentation This document is an individual chapter from SAS/STAT 9.3 User s Guide. The correct bibliographic citation for the complete manual

More information

Module 3: SAS. 3.1 Initial explorative analysis 02429/MIXED LINEAR MODELS PREPARED BY THE STATISTICS GROUPS AT IMM, DTU AND KU-LIFE

Module 3: SAS. 3.1 Initial explorative analysis 02429/MIXED LINEAR MODELS PREPARED BY THE STATISTICS GROUPS AT IMM, DTU AND KU-LIFE St@tmaster 02429/MIXED LINEAR MODELS PREPARED BY THE STATISTICS GROUPS AT IMM, DTU AND KU-LIFE Module 3: SAS 3.1 Initial explorative analysis....................... 1 3.1.1 SAS JMP............................

More information

Cell means coding and effect coding

Cell means coding and effect coding Cell means coding and effect coding /* mathregr_3.sas */ %include 'readmath.sas'; title2 ''; /* The data step continues */ if ethnic ne 6; /* Otherwise, throw the case out */ /* Indicator dummy variables

More information

Analysis of variance - ANOVA

Analysis of variance - ANOVA Analysis of variance - ANOVA Based on a book by Julian J. Faraway University of Iceland (UI) Estimation 1 / 50 Anova In ANOVAs all predictors are categorical/qualitative. The original thinking was to try

More information

Recall the expression for the minimum significant difference (w) used in the Tukey fixed-range method for means separation:

Recall the expression for the minimum significant difference (w) used in the Tukey fixed-range method for means separation: Topic 11. Unbalanced Designs [ST&D section 9.6, page 219; chapter 18] 11.1 Definition of missing data Accidents often result in loss of data. Crops are destroyed in some plots, plants and animals die,

More information

Statistics Lab #7 ANOVA Part 2 & ANCOVA

Statistics Lab #7 ANOVA Part 2 & ANCOVA Statistics Lab #7 ANOVA Part 2 & ANCOVA PSYCH 710 7 Initialize R Initialize R by entering the following commands at the prompt. You must type the commands exactly as shown. options(contrasts=c("contr.sum","contr.poly")

More information

Paper ODS, YES! Odious, NO! An Introduction to the SAS Output Delivery System

Paper ODS, YES! Odious, NO! An Introduction to the SAS Output Delivery System Paper 149-25 ODS, YES! Odious, NO! An Introduction to the SAS Output Delivery System Lara Bryant, University of North Carolina at Chapel Hill, Chapel Hill, NC Sally Muller, University of North Carolina

More information

Contents of SAS Programming Techniques

Contents of SAS Programming Techniques Contents of SAS Programming Techniques Chapter 1 About SAS 1.1 Introduction 1.1.1 SAS modules 1.1.2 SAS module classification 1.1.3 SAS features 1.1.4 Three levels of SAS techniques 1.1.5 Chapter goal

More information

Quick Results with the Output Delivery System

Quick Results with the Output Delivery System Paper 58-27 Quick Results with the Output Delivery System Sunil K. Gupta, Gupta Programming, Simi Valley, CA ABSTRACT SAS s new Output Delivery System (ODS) opens a whole new world of options in generating

More information

Lab #1: Introduction to Basic SAS Operations

Lab #1: Introduction to Basic SAS Operations Lab #1: Introduction to Basic SAS Operations Getting Started: OVERVIEW OF SAS (access lab pages at http://www.stat.lsu.edu/exstlab/) There are several ways to open the SAS program. You may have a SAS icon

More information

PLS205 Lab 1 January 9, Laboratory Topics 1 & 2

PLS205 Lab 1 January 9, Laboratory Topics 1 & 2 PLS205 Lab 1 January 9, 2014 Laboratory Topics 1 & 2 Welcome, introduction, logistics, and organizational matters Introduction to SAS Writing and running programs saving results checking for errors Different

More information

SAS CURRICULUM. BASE SAS Introduction

SAS CURRICULUM. BASE SAS Introduction SAS CURRICULUM BASE SAS Introduction Data Warehousing Concepts What is a Data Warehouse? What is a Data Mart? What is the difference between Relational Databases and the Data in Data Warehouse (OLTP versus

More information

Using SAS to Analyze CYP-C Data: Introduction to Procedures. Overview

Using SAS to Analyze CYP-C Data: Introduction to Procedures. Overview Using SAS to Analyze CYP-C Data: Introduction to Procedures CYP-C Research Champion Webinar July 14, 2017 Jason D. Pole, PhD Overview SAS overview revisited Introduction to SAS Procedures PROC FREQ PROC

More information

Baruch College STA Senem Acet Coskun

Baruch College STA Senem Acet Coskun Baruch College STA 9750 BOOK BUY A Predictive Mode Senem Acet Coskun Table of Contents Summary 3 Why this topic? 4 Data Sources 6 Variable Definitions 7 Descriptive Statistics 8 Univariate Analysis 9 Two-Sample

More information

Stat 5100 Handout #19 SAS: Influential Observations and Outliers

Stat 5100 Handout #19 SAS: Influential Observations and Outliers Stat 5100 Handout #19 SAS: Influential Observations and Outliers Example: Data collected on 50 countries relevant to a cross-sectional study of a lifecycle savings hypothesis, which states that the response

More information

Base and Advance SAS

Base and Advance SAS Base and Advance SAS BASE SAS INTRODUCTION An Overview of the SAS System SAS Tasks Output produced by the SAS System SAS Tools (SAS Program - Data step and Proc step) A sample SAS program Exploring SAS

More information

The NESTED Procedure (Chapter)

The NESTED Procedure (Chapter) SAS/STAT 9.3 User s Guide The NESTED Procedure (Chapter) SAS Documentation This document is an individual chapter from SAS/STAT 9.3 User s Guide. The correct bibliographic citation for the complete manual

More information

Statistics and Data Analysis. Common Pitfalls in SAS Statistical Analysis Macros in a Mass Production Environment

Statistics and Data Analysis. Common Pitfalls in SAS Statistical Analysis Macros in a Mass Production Environment Common Pitfalls in SAS Statistical Analysis Macros in a Mass Production Environment Huei-Ling Chen, Merck & Co., Inc., Rahway, NJ Aiming Yang, Merck & Co., Inc., Rahway, NJ ABSTRACT Four pitfalls are commonly

More information

5:2 LAB RESULTS - FOLLOW-UP ANALYSES FOR FACTORIAL

5:2 LAB RESULTS - FOLLOW-UP ANALYSES FOR FACTORIAL 5:2 LAB RESULTS - FOLLOW-UP ANALYSES FOR FACTORIAL T1. n F and n C for main effects = 2 + 2 + 2 = 6 (i.e., 2 observations in each of 3 cells for other factor) Den t = SQRT[3.333x(1/6+1/6)] = 1.054 Den

More information

SAS data statements and data: /*Factor A: angle Factor B: geometry Factor C: speed*/

SAS data statements and data: /*Factor A: angle Factor B: geometry Factor C: speed*/ STAT:5201 Applied Statistic II (Factorial with 3 factors as 2 3 design) Three-way ANOVA (Factorial with three factors) with replication Factor A: angle (low=0/high=1) Factor B: geometry (shape A=0/shape

More information

STA 570 Spring Lecture 5 Tuesday, Feb 1

STA 570 Spring Lecture 5 Tuesday, Feb 1 STA 570 Spring 2011 Lecture 5 Tuesday, Feb 1 Descriptive Statistics Summarizing Univariate Data o Standard Deviation, Empirical Rule, IQR o Boxplots Summarizing Bivariate Data o Contingency Tables o Row

More information

Week 6, Week 7 and Week 8 Analyses of Variance

Week 6, Week 7 and Week 8 Analyses of Variance Week 6, Week 7 and Week 8 Analyses of Variance Robyn Crook - 2008 In the next few weeks we will look at analyses of variance. This is an information-heavy handout so take your time reading it, and don

More information

Generalized Least Squares (GLS) and Estimated Generalized Least Squares (EGLS)

Generalized Least Squares (GLS) and Estimated Generalized Least Squares (EGLS) Generalized Least Squares (GLS) and Estimated Generalized Least Squares (EGLS) Linear Model in matrix notation for the population Y = Xβ + Var ( ) = In GLS, the error covariance matrix is known In EGLS

More information

BIOMETRICS INFORMATION

BIOMETRICS INFORMATION BIOMETRICS INFORMATION (You re 95% likely to need this information) PAMPHLET NO. # 57 DATE: September 5, 1997 SUBJECT: Interpreting Main Effects when a Two-way Interaction is Present Interpreting the analysis

More information

INTRODUCTION TO SAS HOW SAS WORKS READING RAW DATA INTO SAS

INTRODUCTION TO SAS HOW SAS WORKS READING RAW DATA INTO SAS TO SAS NEED FOR SAS WHO USES SAS WHAT IS SAS? OVERVIEW OF BASE SAS SOFTWARE DATA MANAGEMENT FACILITY STRUCTURE OF SAS DATASET SAS PROGRAM PROGRAMMING LANGUAGE ELEMENTS OF THE SAS LANGUAGE RULES FOR SAS

More information

SAS/STAT 13.1 User s Guide. The NESTED Procedure

SAS/STAT 13.1 User s Guide. The NESTED Procedure SAS/STAT 13.1 User s Guide The NESTED Procedure This document is an individual chapter from SAS/STAT 13.1 User s Guide. The correct bibliographic citation for the complete manual is as follows: SAS Institute

More information

An introduction to SPSS

An introduction to SPSS An introduction to SPSS To open the SPSS software using U of Iowa Virtual Desktop... Go to https://virtualdesktop.uiowa.edu and choose SPSS 24. Contents NOTE: Save data files in a drive that is accessible

More information

Introductory SAS example

Introductory SAS example Introductory SAS example STAT:5201 1 Introduction SAS is a command-driven statistical package; you enter statements in SAS s language, submit them to SAS, and get output. A fairly friendly user interface

More information

Stat 5100 Handout #6 SAS: Linear Regression Remedial Measures

Stat 5100 Handout #6 SAS: Linear Regression Remedial Measures Stat 5100 Handout #6 SAS: Linear Regression Remedial Measures Example: Age and plasma level for 25 healthy children in a study are reported. Of interest is how plasma level depends on age. (Text Table

More information

Stat 302 Statistical Software and Its Applications SAS: Data I/O & Descriptive Statistics

Stat 302 Statistical Software and Its Applications SAS: Data I/O & Descriptive Statistics Stat 302 Statistical Software and Its Applications SAS: Data I/O & Descriptive Statistics Fritz Scholz Department of Statistics, University of Washington Winter Quarter 2015 February 19, 2015 2 Getting

More information

6:1 LAB RESULTS -WITHIN-S ANOVA

6:1 LAB RESULTS -WITHIN-S ANOVA 6:1 LAB RESULTS -WITHIN-S ANOVA T1/T2/T3/T4. SStotal =(1-12) 2 + + (18-12) 2 = 306.00 = SSpill + SSsubj + SSpxs df = 9-1 = 8 P1 P2 P3 Ms Ms-Mg 1 8 15 8.0-4.0 SSsubj= 3x(-4 2 + ) 4 17 15 12.0 0 = 96.0 13

More information

Introduction to Statistical Analyses in SAS

Introduction to Statistical Analyses in SAS Introduction to Statistical Analyses in SAS Programming Workshop Presented by the Applied Statistics Lab Sarah Janse April 5, 2017 1 Introduction Today we will go over some basic statistical analyses in

More information

Stat 500 lab notes c Philip M. Dixon, Week 10: Autocorrelated errors

Stat 500 lab notes c Philip M. Dixon, Week 10: Autocorrelated errors Week 10: Autocorrelated errors This week, I have done one possible analysis and provided lots of output for you to consider. Case study: predicting body fat Body fat is an important health measure, but

More information

Chapter 2 (was Ch. 8) (September 21, 2008; 4:54 p.m.) Constructing a data set for analysis: reading, combining, managing and manipulating data sets

Chapter 2 (was Ch. 8) (September 21, 2008; 4:54 p.m.) Constructing a data set for analysis: reading, combining, managing and manipulating data sets Chapter 2 (was Ch. 8) (September 21, 2008 4:54 p.m.) Constructing a data set for analysis: reading, combining, managing and manipulating data sets 2. Constructing a data set for analysis: reading, combining,

More information

SAS Example A10. Output Delivery System (ODS) Sample Data Set sales.txt. Examples of currently available ODS destinations: Mervyn Marasinghe

SAS Example A10. Output Delivery System (ODS) Sample Data Set sales.txt. Examples of currently available ODS destinations: Mervyn Marasinghe SAS Example A10 data sales infile U:\Documents\...\sales.txt input Region : $8. State $2. +1 Month monyy5. Headcnt Revenue Expenses format Month monyy5. Revenue dollar12.2 proc sort by Region State Month

More information

Introduction. About this Document. What is SPSS. ohow to get SPSS. oopening Data

Introduction. About this Document. What is SPSS. ohow to get SPSS. oopening Data Introduction About this Document This manual was written by members of the Statistical Consulting Program as an introduction to SPSS 12.0. It is designed to assist new users in familiarizing themselves

More information

Further Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables

Further Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables Further Maths Notes Common Mistakes Read the bold words in the exam! Always check data entry Remember to interpret data with the multipliers specified (e.g. in thousands) Write equations in terms of variables

More information

BUSINESS ANALYTICS. 96 HOURS Practical Learning. DexLab Certified. Training Module. Gurgaon (Head Office)

BUSINESS ANALYTICS. 96 HOURS Practical Learning. DexLab Certified. Training Module. Gurgaon (Head Office) SAS (Base & Advanced) Analytics & Predictive Modeling Tableau BI 96 HOURS Practical Learning WEEKDAY & WEEKEND BATCHES CLASSROOM & LIVE ONLINE DexLab Certified BUSINESS ANALYTICS Training Module Gurgaon

More information

Please login. Take a seat Login with your HawkID Locate SAS 9.3. Raise your hand if you need assistance. Start / All Programs / SAS / SAS 9.

Please login. Take a seat Login with your HawkID Locate SAS 9.3. Raise your hand if you need assistance. Start / All Programs / SAS / SAS 9. Please login Take a seat Login with your HawkID Locate SAS 9.3 Start / All Programs / SAS / SAS 9.3 (64 bit) Raise your hand if you need assistance Introduction to SAS Procedures Sarah Bell Overview Review

More information

Chapter Seven: Multiple Regression II

Chapter Seven: Multiple Regression II Chapter Seven: Multiple Regression II Factorial ANOVA and Related Topics 4.1 A One-way Example The following is a textbook example taken from Neter et al.'s Applied Linear Statistical Models. The Kenton

More information

Regression Lab 1. The data set cholesterol.txt available on your thumb drive contains the following variables:

Regression Lab 1. The data set cholesterol.txt available on your thumb drive contains the following variables: Regression Lab The data set cholesterol.txt available on your thumb drive contains the following variables: Field Descriptions ID: Subject ID sex: Sex: 0 = male, = female age: Age in years chol: Serum

More information

Subset Selection in Multiple Regression

Subset Selection in Multiple Regression Chapter 307 Subset Selection in Multiple Regression Introduction Multiple regression analysis is documented in Chapter 305 Multiple Regression, so that information will not be repeated here. Refer to that

More information

Choosing the Right Procedure

Choosing the Right Procedure 3 CHAPTER 1 Choosing the Right Procedure Functional Categories of Base SAS Procedures 3 Report Writing 3 Statistics 3 Utilities 4 Report-Writing Procedures 4 Statistical Procedures 6 Available Statistical

More information

Stat 302 Statistical Software and Its Applications SAS: Data I/O

Stat 302 Statistical Software and Its Applications SAS: Data I/O Stat 302 Statistical Software and Its Applications SAS: Data I/O Yen-Chi Chen Department of Statistics, University of Washington Autumn 2016 1 / 33 Getting Data Files Get the following data sets from the

More information

Chapters 18, 19, 20 Solutions. Page 1 of 14. Demographics from COLLEGE Data Set

Chapters 18, 19, 20 Solutions. Page 1 of 14. Demographics from COLLEGE Data Set 18.2 proc tabulate data=learn.college format=7.; class schoolsize gender scholarship; table schoolsize ALL, gender scholarship ALL; n = ' '; Demographics from COLLEGE Data Set ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ

More information

SAS Programs SAS Lecture 4 Procedures. Aidan McDermott, April 18, Outline. Internal SAS formats. SAS Formats

SAS Programs SAS Lecture 4 Procedures. Aidan McDermott, April 18, Outline. Internal SAS formats. SAS Formats SAS Programs SAS Lecture 4 Procedures Aidan McDermott, April 18, 2006 A SAS program is in an imperative language consisting of statements. Each statement ends in a semi-colon. Programs consist of (at least)

More information

THE L.L. THURSTONE PSYCHOMETRIC LABORATORY UNIVERSITY OF NORTH CAROLINA. Forrest W. Young & Carla M. Bann

THE L.L. THURSTONE PSYCHOMETRIC LABORATORY UNIVERSITY OF NORTH CAROLINA. Forrest W. Young & Carla M. Bann Forrest W. Young & Carla M. Bann THE L.L. THURSTONE PSYCHOMETRIC LABORATORY UNIVERSITY OF NORTH CAROLINA CB 3270 DAVIE HALL, CHAPEL HILL N.C., USA 27599-3270 VISUAL STATISTICS PROJECT WWW.VISUALSTATS.ORG

More information

Choosing the Right Procedure

Choosing the Right Procedure 3 CHAPTER 1 Choosing the Right Procedure Functional Categories of Base SAS Procedures 3 Report Writing 3 Statistics 3 Utilities 4 Report-Writing Procedures 4 Statistical Procedures 5 Efficiency Issues

More information

22S:166. Checking Values of Numeric Variables

22S:166. Checking Values of Numeric Variables 22S:1 Computing in Statistics Lecture 24 Nov. 2, 2016 1 Checking Values of Numeric Variables range checks when you know what the range of possible values is for a given quantitative variable internal consistency

More information

Stat 5303 (Oehlert): Unbalanced Factorial Examples 1

Stat 5303 (Oehlert): Unbalanced Factorial Examples 1 Stat 5303 (Oehlert): Unbalanced Factorial Examples 1 > section

More information

Model Selection and Inference

Model Selection and Inference Model Selection and Inference Merlise Clyde January 29, 2017 Last Class Model for brain weight as a function of body weight In the model with both response and predictor log transformed, are dinosaurs

More information

Intermediate SAS: Working with Data

Intermediate SAS: Working with Data Intermediate SAS: Working with Data OIT Technical Support Services 293-4444 oithelp@mail.wvu.edu oit.wvu.edu/training/classmat/sas/ Table of Contents Getting set up for the Intermediate SAS workshop:...

More information

ST Lab 1 - The basics of SAS

ST Lab 1 - The basics of SAS ST 512 - Lab 1 - The basics of SAS What is SAS? SAS is a programming language based in C. For the most part SAS works in procedures called proc s. For instance, to do a correlation analysis there is proc

More information

The SAS interface is shown in the following screen shot:

The SAS interface is shown in the following screen shot: The SAS interface is shown in the following screen shot: There are several items of importance shown in the screen shot First there are the usual main menu items, such as File, Edit, etc I seldom use anything

More information

Introduction to STATA

Introduction to STATA Introduction to STATA Duah Dwomoh, MPhil School of Public Health, University of Ghana, Accra July 2016 International Workshop on Impact Evaluation of Population, Health and Nutrition Programs Learning

More information

AURA ACADEMY SAS TRAINING. Opposite Hanuman Temple, Srinivasa Nagar East, Ameerpet,Hyderabad Page 1

AURA ACADEMY SAS TRAINING. Opposite Hanuman Temple, Srinivasa Nagar East, Ameerpet,Hyderabad Page 1 SAS TRAINING SAS/BASE BASIC THEORY & RULES ETC SAS WINDOWING ENVIRONMENT CREATION OF LIBRARIES SAS PROGRAMMING (BRIEFLY) - DATASTEP - PROC STEP WAYS TO READ DATA INTO SAS BACK END PROCESS OF DATASTEP INSTALLATION

More information

SAS CLINICAL SYLLABUS. DURATION: - 60 Hours

SAS CLINICAL SYLLABUS. DURATION: - 60 Hours SAS CLINICAL SYLLABUS DURATION: - 60 Hours BASE SAS PART - I Introduction To Sas System & Architecture History And Various Modules Features Variables & Sas Syntax Rules Sas Data Sets Data Set Options Operators

More information

Basic Concepts #6: Introduction to Report Writing

Basic Concepts #6: Introduction to Report Writing Basic Concepts #6: Introduction to Report Writing Using By-line, PROC Report, PROC Means, PROC Freq JC Wang By-Group Processing By-group processing in a procedure step, a BY line identifies each group

More information

Part 1. Getting Started. Chapter 1 Creating a Simple Report 3. Chapter 2 PROC REPORT: An Introduction 13. Chapter 3 Creating Breaks 57

Part 1. Getting Started. Chapter 1 Creating a Simple Report 3. Chapter 2 PROC REPORT: An Introduction 13. Chapter 3 Creating Breaks 57 Part 1 Getting Started Chapter 1 Creating a Simple Report 3 Chapter 2 PROC REPORT: An Introduction 13 Chapter 3 Creating Breaks 57 Chapter 4 Only in the LISTING Destination 75 Chapter 5 Creating and Modifying

More information

Dr. Barbara Morgan Quantitative Methods

Dr. Barbara Morgan Quantitative Methods Dr. Barbara Morgan Quantitative Methods 195.650 Basic Stata This is a brief guide to using the most basic operations in Stata. Stata also has an on-line tutorial. At the initial prompt type tutorial. In

More information

2011 NAICC ARM 8 Introduction Training, Jan. 2011

2011 NAICC ARM 8 Introduction Training, Jan. 2011 2011 NAICC ARM 8 Introduction Training, Jan. 2011 General Overview of ARM Menu Choices 1) Help: a) F1, any Help button, or displays help on current Topic. b) F5 or on a study editor displays help about

More information

1 Downloading files and accessing SAS. 2 Sorting, scatterplots, correlation and regression

1 Downloading files and accessing SAS. 2 Sorting, scatterplots, correlation and regression Statistical Methods and Computing, 22S:30/105 Instructor: Cowles Lab 2 Feb. 6, 2015 1 Downloading files and accessing SAS. We will be using the billion.dat dataset again today, as well as the OECD dataset

More information

a. divided by the. 1) Always round!! a) Even if class width comes out to a, go up one.

a. divided by the. 1) Always round!! a) Even if class width comes out to a, go up one. Probability and Statistics Chapter 2 Notes I Section 2-1 A Steps to Constructing Frequency Distributions 1 Determine number of (may be given to you) a Should be between and classes 2 Find the Range a The

More information

Robust Linear Regression (Passing- Bablok Median-Slope)

Robust Linear Regression (Passing- Bablok Median-Slope) Chapter 314 Robust Linear Regression (Passing- Bablok Median-Slope) Introduction This procedure performs robust linear regression estimation using the Passing-Bablok (1988) median-slope algorithm. Their

More information

SAS Training Spring 2006

SAS Training Spring 2006 SAS Training Spring 2006 Coxe/Maner/Aiken Introduction to SAS: This is what SAS looks like when you first open it: There is a Log window on top; this will let you know what SAS is doing and if SAS encountered

More information

15 Wyner Statistics Fall 2013

15 Wyner Statistics Fall 2013 15 Wyner Statistics Fall 2013 CHAPTER THREE: CENTRAL TENDENCY AND VARIATION Summary, Terms, and Objectives The two most important aspects of a numerical data set are its central tendencies and its variation.

More information

Fathom Dynamic Data TM Version 2 Specifications

Fathom Dynamic Data TM Version 2 Specifications Data Sources Fathom Dynamic Data TM Version 2 Specifications Use data from one of the many sample documents that come with Fathom. Enter your own data by typing into a case table. Paste data from other

More information

Introduction to Stata: An In-class Tutorial

Introduction to Stata: An In-class Tutorial Introduction to Stata: An I. The Basics - Stata is a command-driven statistical software program. In other words, you type in a command, and Stata executes it. You can use the drop-down menus to avoid

More information

Acquisition Description Exploration Examination Understanding what data is collected. Characterizing properties of data.

Acquisition Description Exploration Examination Understanding what data is collected. Characterizing properties of data. Summary Statistics Acquisition Description Exploration Examination what data is collected Characterizing properties of data. Exploring the data distribution(s). Identifying data quality problems. Selecting

More information

An Introduction to ODS for Statistical Graphics in SAS 9.1 Robert N. Rodriguez SAS Institute Inc., Cary, North Carolina, USA

An Introduction to ODS for Statistical Graphics in SAS 9.1 Robert N. Rodriguez SAS Institute Inc., Cary, North Carolina, USA An Introduction to ODS for Statistical Graphics in SAS 9.1 Robert N. Rodriguez SAS Institute Inc., Cary, North Carolina, USA ABSTRACT In SAS 9.1, over two dozen SAS/STAT and SAS/ETS procedures have been

More information

STAT 3304/5304 Introduction to Statistical Computing. Introduction to SAS

STAT 3304/5304 Introduction to Statistical Computing. Introduction to SAS STAT 3304/5304 Introduction to Statistical Computing Introduction to SAS What is SAS? SAS (originally an acronym for Statistical Analysis System, now it is not an acronym for anything) is a program designed

More information