Week 3/4 [06+ Sept.] Class Activities. File: week sep07.doc Directory: \\Muserver2\USERS\B\\baileraj\Classes\sta402\handouts
|
|
- Rebecca Golden
- 5 years ago
- Views:
Transcription
1 Week 3/4 [06+ Sept.] Class Activities File: week sep07.doc Directory: \\Muserver2\USERS\B\\baileraj\Classes\sta402\handouts Week 3 Topic -- REPORT WRITING * Introduce the Output Delivery System (ODS) for customizing procedure output * PROC TABULATE for producing nicely-formatted tables Week 4 Topic INTRODUCTION TO MODELING PROCS * REG and GLM primarily Bonus Material conversational UNIX ODS References Gupta, S. (2003) Quick Results with the Output Delivery System. SAS Institute Inc., Cary, NC USA. Delwiche LD and Slaughter SJ. (2003) The Little SAS Book: A Primer, 3rd edition. SAS Institute. Cary, NC, USA. [pages ] Haworth LE (2001) Output Delivery System: The Basics. SAS Institute Inc. Cary, NC USA. ODS Basics What is ODS? * method of delivering output in a variety of formats (other than the default listing format ) * options available include HTML, Rich Text Format (RTF), PS, PDF, SAS data sets Basic ODS Terminology destinations locations to which ODS routes output (e.g. LISTING, HTML, RTF, PRINTER, PDF, OUTPUT new data set) objects output entities created by ODS to store the formatted results styles font/color/other attributes of a report Basic syntax of ODS statements * identify output objects; ODS TRACE ON </options>;
2 * open output destination; ODS destination <FILE=filename>; * create SAS data set with output object; ODS OUTPUT output-object-name=sas-data-set-name; * [optional] select particular objects for inclusion; ODS <destination> SELECT output-object-name; PROC PROC PROC ODS <destination> CLOSE; ODS TRACE OFF; ODS to different file types A familiar example proc format; value totfmt 0='none' 1-HIGH='some' ; data d1; infile "\\Muserver2\USERS\B\BAILERAJ\public.www\classes\sta402\SASprograms\ch2-dat.txt" firstobs=16 expandtabs missover pad ; * infile 'M:\public.www\classes\sta402\SAS-programs\ch2-dat.txt' firstobs=16 expandtabs missover pad ; animal conc brood1 brood2 brood3 total 2.; cbrood3 = brood3; format cbrood3 totfmt.; label animal = animal ID number; label conc = Nitrofen concentration; label brood1 = number of young in first brood; label brood2 = number of young in 2nd brood; label brood3 = number of young in 3rd brood; label total = total young produced in three broods; proc print;
3 where conc=0; /* aside: ODS LISTING open as a default. You can have multiple destinations open simultaneously. If you want to close the LISTING destination before generating output then type ODS LISTING CLOSE; before issuing the PROC for which output is desired. */ /* generate HTML files with objects from 3 PROCs */ ODS TRACE ON; * ODS HTML file='m:\public.www\classes\sta402\sas-programs\day6- example.html ; ODS HTML file="\\muserver2\users\b\baileraj\public.www\classes\sta402\sasprograms\ods-html-example.html ; proc plot; plot total*conc=cbrood3 / vaxis=0 to 40 by 2; proc freq; table conc*cbrood3 / nopct nocol chisq trend exact; proc univariate plot; by conc; var total; ODS HTML CLOSE; ODS TRACE OFF; /* now generate HTML files with additional linkage info */ ODS TRACE ON; ODS HTML path='\\muserver2\users\b\baileraj\public.www\classes\sta402\sasprograms body = day6-example2.html /* Output objects */ contents = day6-example2-toc.html /* Table of contents */ frame = day6-example2-frame.html /* organizes display */ newfile = NONE; /* all results to one file*/ /* old code where M drive referenced vs. specification of the full path ODS HTML path='m:\public.www\classes\sta402\sas-programs body = day6-example2.html /* Output objects */ contents = day6-example2-toc.html /* Table of contents */ frame = day6-example2-frame.html /* organizes display */ newfile = NONE; /* all results to one file*/ */ /* comment: by default, opens a new body file for each part of output so the
4 newfile=none directs all output to the same body file newfile=page creates new body file for each page of output */ proc plot; plot total*conc=cbrood3 / vaxis=0 to 40 by 2; proc freq; table conc*cbrood3 / nopct nocol chisq trend exact; proc univariate plot; by conc; var total; ODS HTML CLOSE; ODS TRACE OFF; /* select on one of the output objects for inclusion */ *ODS HTML file='m:\public.www\classes\sta402\sas-programs\day6- example3.html ; ODS HTML file= \\Muserver2\USERS\B\BAILERAJ\public.www\classes\sta402\SASprograms\day6-example3.html ; ODS HTML SELECT SSPLOTS; ODS HTML SHOW; /* write details to SASLOG confirming object sel. */ proc univariate plot; by conc; var total; ODS HTML CLOSE; /* select different destinations */ options orientation=landscape nocenter nodate; ODS ESCAPECHAR= ^ ; /* for fancy formatting later */ /* old program with M drive reference ODS RTF file='m:\public.www\classes\sta402\sas-programs\day6-example.rtf ; ODS PDF file='m:\public.www\classes\sta402\sas-programs\day6-example.pdf ;
5 ODS PS file='m:\public.www\classes\sta402\sas-programs\day6-example.ps ; */ ODS RTF file='\\muserver2\users\b\baileraj\public.www\classes\sta402\sasprograms\day6-example.rtf ; ODS PDF file='\\muserver2\users\b\baileraj\public.www\classes\sta402\sasprograms\day6-example.pdf ; ODS PS file='\\muserver2\users\b\baileraj\public.www\classes\sta402\sasprograms\day6-example.ps ; Title Plot of number of young vs. Nitrofen concentration^{super a} ; Footnote1 ^{super a}s=some young produced in Brood 3, n= no young produced in Brood 3 ; proc plot; plot total*conc=cbrood3 / vaxis=0 to 40 by 2; ODS RTF CLOSE; ODS PDF CLOSE; ODS PS CLOSE; ODS to create output data sets proc sort data=d1; by conc; ODS TRACE ON; /* see what ODS objects are created by univariate */ proc univariate data=d1; by conc; var total; ODS TRACE OFF; ODS OUTPUT Quantiles=data_quant; /* extract quantiles */ proc univariate data=d1; by conc; var total; ODS OUTPUT CLOSE; proc print data=data_quant; Var Obs conc Name Quantile Estimate 1 0 total 100% Max total 99% total 95% total 90% total 75% Q total 50% Median total 25% Q total 10% total 5% total 1% total 0% Min edited output total 100% Max total 99% total 95% total 90% total 75% Q3 6.0
6 total 50% Median total 25% Q total 10% total 5% total 1% total 0% Min 0.0 /* can also create multiple data sets */ ODS OUTPUT Quantiles(MATCH_ALL=conc_name_macro)=data_quant; proc univariate data=d1; by conc; var total; ODS OUTPUT CLOSE; proc print data=data_quant; from the SAS LOG file NOTE: The data set WORK.DATA_QUANT has 11 observations and 4 variables. NOTE: The above message was for the following by-group: Nitrofen concentration=0 NOTE: The data set WORK.DATA_QUANT1 has 11 observations and 4 variables. NOTE: The above message was for the following by-group: Nitrofen concentration=80 NOTE: The data set WORK.DATA_QUANT2 has 11 observations and 4 variables. NOTE: The above message was for the following by-group: Nitrofen concentration=160 NOTE: The data set WORK.DATA_QUANT3 has 11 observations and 4 variables. NOTE: The above message was for the following by-group: Nitrofen concentration=235 NOTE: The data set WORK.DATA_QUANT4 has 11 observations and 4 variables. NOTE: The above message was for the following by-group: Nitrofen concentration=310 /* write the data set names to the SAS LOG */ %put The conc_name_macro variables contains the following data sets &conc_name_macro; 76 %put The conc_name_macro variables contains the following data sets &conc_name_macro; The conc_name_macro variables contains the following data sets DATA_QUANT DATA_QUANT1 DATA_QUANT2 DATA_QUANT3 DATA_QUANT4&conc_name_macro; /* merge the concentration summary files to create single table */ data c0; set DATA_QUANT; rename Estimate=C0_Est; key=_n_; drop VarName conc; data c80; set DATA_QUANT1; rename Estimate=C80_Est; key=_n_; drop VarName conc; data c160; set DATA_QUANT2; rename Estimate=C160_Est; key=_n_; drop VarName conc; data c235; set DATA_QUANT3; rename Estimate=C235_Est; key=_n_; drop VarName conc; data c310; set DATA_QUANT4; rename Estimate=C310_Est; key=_n_; drop VarName conc; data all; merge c0 c80 c160 c235 c310; by key; drop key; proc print data=all; Obs Quantile C0_Est C80_Est C160_Est C235_Est C310_Est
7 1 100% Max % % % % Q % Median % Q % % % % Min /* extract the rows-observations corresponding to the 5 number summary */ data fivenum; set all; if _n_=1 or _n_=5 or _n_=6 or _n_=7 or _n_=11; proc print; Obs Quantile C0_Est C80_Est C160_Est C235_Est C310_Est 1 100% Max % Q % Median % Q % Min Using ODS OUTPUT to create dataset in a simulation /* Extracting coefficients from simple linear regression simulation */ options formdlim="-" nodate; /* generate simulation data sets Y ~ N(mu(x)= 3+2x, sigma=2) */ data sims; do dataset=1 to 1000; do x=1 to 10; y = 3 + 2*x + 2*rannor(0); output; end; end; /* DEBUG: print to check generated data */ proc print data=sims; /* SORT for data set */ proc sort data=sims; by dataset;
8 /* USE OUTEST to extract the estimated coefficients */ proc reg data=sims outest=myparms; by dataset; model y=x; proc print data=myparms; /* HISTOGRAM for estimated slope */ proc gchart data=work.myparms; vbar x; /* Re-do this with ODS */ *ods trace on; * determine what output objects are constructed; ods output ParameterEstimates=reg_coefs; proc reg data=sims; by dataset; model y=x; proc print data=reg_coefs; ods output close; *ods trace off; proc print data=reg_coefs; proc contents data=reg_coefs; data slopes; set reg_coefs; if Variable="x"; slope=estimate; keep dataset slope; data intercepts; set reg_coefs; if Variable="Intercept"; Intercept = Estimate; keep dataset intercept; data both; merge slopes intercepts; by dataset;
9 proc gplot data=both; title "Plot of estimated slope vs. estimated intercept"; plot slope*intercept; proc gchart data=both; title "Sampling distribution of the estimated slope"; vbar slope; proc gchart data=both title "Sampling distribution of the estimated intercept"; vbar intercept; proc print data=slopes; PROC TABULATE (producing fancier results tables in SAS) PROC TABULATE <option(s)>; CLASS variable(s) </ options>; * identify non-numeric vars; FREQ variable; * identify variable containing frequency of observation; TABLE <<page-expression,> row-expression,> column-expression</ table-option(s)>; VAR analysis-variable(s)</ options>; * identify analysis vars; WEIGHT variable; * identify variable name e.g. sampling wts; * FORMATTING related subcommands CLASSLEV variable(s) / STYLE=<style-element-name PARENT> <[style-attribute-specification(s)] >; KEYLABEL keyword-1='description-1' <...keyword-n='description-n'>; KEYWORD keyword(s) / STYLE=<style-element-name PARENT> <[style-attribute-specification(s)] >;
10 [* check out results of search for Tabulate syntax on SAS doc] Comments: * concatenation (blank) operator * crossing (*) operator * format modifiers * grouping elements (parentheses) operator * ALL class variable data d1; infile 'M:\public.www\classes\sta402\SAS-programs\ch2-dat.txt' firstobs=16 expandtabs missover pad ; animal conc brood1 brood2 brood3 total 2.; proc tabulate data=d1; class conc; var brood1 brood2 brood3 total; table (brood1 brood2 brood3 total)*conc, min q1 median q3 max; proc tabulate data=d1; class conc; var total; table conc= Nitrofen Concentration all, total (mean var);
11 Week 04+/- [12+ Sept.] Class Activities AN INTRODUCTION TO STATISTICAL MODELING * PROC REG for linear modeling (a very basic introduction) * PROC GLM for anova models Other normal response modeling ANOVA balanced anova models Non-normal response modeling GENMOD generalized linear models LOGISTIC [grouped] binary regression PROBIT [grouped] binary regression (INVERSECL) CATMOD categorical data modeling Failure time modeling LIFEREG accelerated failure time models PHREG Cox s PH model And more REGRESSION using PROC REG Basic Model: Y i = X i + i [ simple linear regression ] = X i1 + 2 X i2 + 3 X i3 + 4 X i4 + 5 X i5 + ij [ multiple linear regression ] Error Assumption: ij ~ indep. N(0, 2 ) i=1,2,,n [observations]
12 /* example sas program that does simple linear regression */ options ls=75; data example1; input year nboats manatees; cards; ; /* WARNING: ODS RTF will place TITLE information along With SAS date/time/page number as part of a header in the RTP document. Check out Print Preview or view the header. */ ODS RTF file='d:\baileraj\classes\fall 2003\sta402\SAS-programs\linregoutput.rtf ; proc reg; title Number of Manatees killed regressed on the number of boats registered in Florida ; model manatees = nboats / p r cli clm; plot manatees*nboats= o p.*nboats= + / overlay; plot r.*nboats r.*p.; ODS RTF CLOSE;
13 Source DF Analysis of Variance Sum of Squares Mean Square F Value Pr > F Model <.0001 Error Corrected Total Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var Variable DF Parameter Estimates Parameter Estimate Standard Error t Value Pr > t Intercept nboats <.0001
14 Obs Dep Var manatees Predicted Value Output Statistics Std Error Mean Predict 95% CL Mean 95% CL Predict Residual Std Error Residual Student Residual Output Statistics Obs ** 3 ** 4 ** 5 6 * 7 **** 8 ** 9 Cook's D
15 Output Statistics Obs * * 14 Cook's D Sum of Residuals 0 Sum of Squared Residuals Predicted Residual SS (PRESS) Multiple Regression with indicator variables Log(Brain Weight) i = Log(Body Weight) i + i Log(Brain Weight) i = Log(Body Weight) i + 2 I dinoi + i Log(Brain Weight) i = Log(Body Weight) i + 2 I dinoi + 3 I dinoi Log(Body Weight) i + i data mrexample; * Lunneborg (1994); * body weight brain example; input species $ bodywt logbody = log10(bodywt); logbrain = log10(brainwt); idino = 0; if (species="diplodoc" or species="tricerat" or species="brachios") then idino=1; idinobod = idino*logbody; cards; beaver cow wolf goat guipig diplodocus asielephant donkey horse potarmonkey cat giraffe gorilla human afrelephant triceratops rhemonkey kangaroo hamster mouse rabbit sheep jaguar chimp brachiosaurus rat mole pig
16 ; ODS RTF file='d:\baileraj\classes\fall 2003\sta402\SAS-programs\mregoutput.rtf ; proc print; title brain wt - body wt data ; proc univariate; var bodywt brainwt; id species; proc reg; title2 allometric scaling - brain and body wt. ; title3 [All Species combined] ; model logbrain=logbody; plot logbrain*logbody="o" p.*logbody="+" / overlay; plot r.*logbody; proc reg; title2 Dinosaurs fitted with potentially different line ; model logbrain=logbody idino idinobod; plot logbrain*logbody="o" p.*logbody="+" / overlay; plot r.*logbody; proc reg; title2 Dinosaurs fitted with potentially different INTERCEPTS ; model logbrain=logbody idino; plot logbrain*logbody="o" p.*logbody="+" / overlay; plot r.*logbody; ODS RTF CLOSE; Obs species bodywt brainwt logbody logbrain idino idinobod 1 beaver cow wolf goat guipig diplodoc asieleph donkey horse
17 Obs species bodywt brainwt logbody logbrain idino idinobod 10 potarmon cat giraffe gorilla human afreleph tricerat rhemonke kangaroo hamster mouse rabbit sheep jaguar chimp brachios rat mole pig
18 brain wt - body wt data The UNIVARIATE Procedure Variable: bodywt BODY WEIGHT Moments N 28 Sum Weights 28 Mean Sum Observations Std Deviation Variance Skewness Kurtosis Uncorrected SS Corrected SS Coeff Variation Std Error Mean Basic Statistical Measures Location Variability Mean Std Deviation Median Variance Mode. Range Interquartile Range Tests for Location: Mu0=0 Test Statistic p Value Student's t t Pr > t Sign M 14 Pr >= M <.0001 Signed Rank S 203 Pr >= S <.0001 Quantiles (Definition 5) Quantile Estimate 100% Max % % % % Q % Median % Q % % 0.120
19 brain wt - body wt data The UNIVARIATE Procedure Variable: bodywt Quantiles (Definition 5) Quantile Estimate 1% % Min Extreme Observations Lowest Highest Value species Obs Value species Obs mouse asieleph hamster afreleph mole tricerat rat diplodoc guipig brachios 25
20 brain wt - body wt data The UNIVARIATE Procedure Variable: brainwt BRAIN WEIGHT Moments N 28 Sum Weights 28 Mean Sum Observations Std Deviation Variance Skewness Kurtosis Uncorrected SS Corrected SS Coeff Variation Std Error Mean Basic Statistical Measures Location Variability Mean Std Deviation 1335 Median Variance Mode Range 5712 Interquartile Range Tests for Location: Mu0=0 Test Statistic p Value Student's t t Pr > t Sign M 14 Pr >= M <.0001 Signed Rank S 203 Pr >= S <.0001 Quantiles (Definition 5) Quantile Estimate 100% Max % % % % Q % Median % Q % % 1.00
21 brain wt - body wt data The UNIVARIATE Procedure Variable: brainwt Quantiles (Definition 5) Quantile Estimate 1% % Min 0.40 Extreme Observations Lowest Highest Value species Obs Value species Obs 0.4 mouse horse hamster giraffe rat human mole asieleph guipig afreleph 15
22 Source Analysis of Variance DF Sum of Squares Mean Square F Value Pr > F Model <.0001 Error Corrected Total Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var Variable DF Parameter Estimates Parameter Estimate Standard Error t Value Pr > t Intercept <.0001 logbody <.0001
23 Source Analysis of Variance DF Sum of Squares Mean Square F Value Pr > F Model <.0001 Error Corrected Total Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var Variable DF Parameter Estimates Parameter Estimate Standard Error t Value Pr > t Intercept <.0001 logbody <.0001 idino idinobod
24 Source Analysis of Variance DF Sum of Squares Mean Square F Value Pr > F Model <.0001 Error Corrected Total Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var Variable DF Parameter Estimates Parameter Estimate Standard Error t Value Pr > t Intercept <.0001 logbody <.0001 idino <.0001 One-way ANOVA Basic Model: Y ij = μ i + ij [ cell means coding] = μ + i + ij [ effects coding] (constraint for estimation? 1 =0 or g =0 or i =0) Error Assumption: ij ~ indep. N(0, 2 ) i=1,2,,g [treatments or populations] j=1,2,,n i [replications] H 0 : μ 1 = = μ g or equivalently, H 0 : 1 = = g =0 /* Bacteria in meat under 4 different conditions */ options ls = 75; data meat;
25 input condition $ datalines; Plastic 7.66 Plastic 6.98 Plastic 7.80 Vacuum 5.26 Vacuum 5.44 Vacuum 5.80 Mixed 7.41 Mixed 7.33 Mixed 7.04 Co Co Co ; title bacteria growth under 4 packaging conditions; ODS RTF file='d:\baileraj\classes\fall 2003\sta402\SAS-programs\onewayoutput.rtf ; proc boxplot; plot logcount*condition; proc glm data=meat order=data; title2 fitting the one-way anova model via GLM; class condition; model logcount = condition; means condition / bon tukey scheffe cldiff lines; lsmeans condition / cl pdiff; contrast plastic vs. rest condition ; output out=new p=yhat r=resid stdr=eresid; proc plot data=new; title2 residual analyses; plot resid*yhat; proc univariate data=new plot; var resid; proc boxplot data=new; plot resid*condition; ODS RTF CLOSE;
26 bacteria growth under 4 packaging conditions fitting the one-way anova model via GLM The GLM Procedure Class condition Class Level Information Levels Values 4 Plastic Vacuum Mixed Co2 Number of observations 12
27 bacteria growth under 4 packaging conditions fitting the one-way anova model via GLM The GLM Procedure Dependent Variable: logcount Source DF Sum of Squares Mean Square F Value Pr > F Model <.0001 Error Corrected Total R-Square Coeff Var Root MSE logcount Mean Source DF Type I SS Mean Square F Value Pr > F condition <.0001 Source DF Type III SS Mean Square F Value Pr > F condition <.0001
28 bacteria growth under 4 packaging conditions fitting the one-way anova model via GLM The GLM Procedure Tukey's Studentized Range (HSD) Test for logcount NOTE: This test controls the Type I experimentwise error rate. Alpha 0.05 Error Degrees of Freedom 8 Error Mean Square Critical Value of Studentized Range Minimum Significant Difference 0.89 Comparisons significant at the 0.05 level are indicated by ***. condition Comparison Difference Between Means Simultaneous 95% Confidence Limits Plastic - Mixed Plastic - Vacuum *** Plastic - Co *** Mixed - Plastic Mixed - Vacuum *** Mixed - Co *** Vacuum - Plastic *** Vacuum - Mixed *** Vacuum - Co *** Co2 - Plastic *** Co2 - Mixed *** Co2 - Vacuum ***
29 bacteria growth under 4 packaging conditions fitting the one-way anova model via GLM The GLM Procedure Bonferroni (Dunn) t Tests for logcount NOTE: This test controls the Type I experimentwise error rate, but it generally has a higher Type II error rate than Tukey's for all pairwise comparisons. Alpha 0.05 Error Degrees of Freedom 8 Error Mean Square Critical Value of t Minimum Significant Difference Comparisons significant at the 0.05 level are indicated by ***. condition Comparison Difference Between Means Simultaneous 95% Confidence Limits Plastic - Mixed Plastic - Vacuum *** Plastic - Co *** Mixed - Plastic Mixed - Vacuum *** Mixed - Co *** Vacuum - Plastic *** Vacuum - Mixed *** Vacuum - Co *** Co2 - Plastic *** Co2 - Mixed *** Co2 - Vacuum ***
30 bacteria growth under 4 packaging conditions fitting the one-way anova model via GLM The GLM Procedure Scheffe's Test for logcount NOTE: This test controls the Type I experimentwise error rate, but it generally has a higher Type II error rate than Tukey's for all pairwise comparisons. Alpha 0.05 Error Degrees of Freedom 8 Error Mean Square Critical Value of F Minimum Significant Difference Comparisons significant at the 0.05 level are indicated by ***. condition Comparison Difference Between Means Simultaneous 95% Confidence Limits Plastic - Mixed Plastic - Vacuum *** Plastic - Co *** Mixed - Plastic Mixed - Vacuum *** Mixed - Co *** Vacuum - Plastic *** Vacuum - Mixed *** Vacuum - Co *** Co2 - Plastic *** Co2 - Mixed *** Co2 - Vacuum ***
31 bacteria growth under 4 packaging conditions fitting the one-way anova model via GLM The GLM Procedure Tukey's Studentized Range (HSD) Test for logcount NOTE: This test controls the Type I experimentwise error rate, but it generally has a higher Type II error rate than REGWQ. Alpha 0.05 Error Degrees of Freedom 8 Error Mean Square Critical Value of Studentized Range Minimum Significant Difference 0.89 Means with the same letter are not significantly different. Tukey Grouping Mean N condition A Plastic A A Mixed B Vacuum C Co2
32 bacteria growth under 4 packaging conditions fitting the one-way anova model via GLM The GLM Procedure Bonferroni (Dunn) t Tests for logcount NOTE: This test controls the Type I experimentwise error rate, but it generally has a higher Type II error rate than REGWQ. Alpha 0.05 Error Degrees of Freedom 8 Error Mean Square Critical Value of t Minimum Significant Difference Means with the same letter are not significantly different. Bon Grouping Mean N condition A Plastic A A Mixed B Vacuum C Co2
33 bacteria growth under 4 packaging conditions fitting the one-way anova model via GLM The GLM Procedure Scheffe's Test for logcount NOTE: This test controls the Type I experimentwise error rate. Alpha 0.05 Error Degrees of Freedom 8 Error Mean Square Critical Value of F Minimum Significant Difference Means with the same letter are not significantly different. Scheffe Grouping Mean N condition A Plastic A A Mixed B Vacuum C Co2
34 bacteria growth under 4 packaging conditions fitting the one-way anova model via GLM The GLM Procedure Least Squares Means condition logcount LSMEAN LSMEAN Number Plastic Vacuum Mixed Co Least Squares Means for effect condition Pr > t for H0: LSMean(i)=LSMean(j) Dependent Variable: logcount i/j < < < < < <.0001 <.0001 <.0001 condition logcount LSMEAN 95% Confidence Limits Plastic Vacuum Mixed Co i Least Squares Means for Effect condition j Difference Between Means 95% Confidence Limits for LSMean(i)-LSMean(j) NOTE: To ensure overall protection level, only probabilities associated with pre-planned comparisons should be used.
35 bacteria growth under 4 packaging conditions fitting the one-way anova model via GLM The GLM Procedure Dependent Variable: logcount Contrast DF Contrast SS Mean Square F Value Pr > F plastic vs. rest <.0001
36 bacteria growth under 4 packaging conditions residual analyses Plot of resid*yhat. Legend: A = 1 obs, B = 2 obs, etc. resid 0.4 ˆ A 0.3 ˆ A A 0.2 ˆ A A A 0.1 ˆ A 0.0 ˆ A -0.1 ˆ -0.2 ˆ A A -0.3 ˆ -0.4 ˆ A -0.5 ˆ A ƒˆƒƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒƒˆ ƒƒƒƒƒƒƒƒƒˆƒ yhat
37 bacteria growth under 4 packaging conditions residual analyses The UNIVARIATE Procedure Variable: resid Moments N 12 Sum Weights 12 Mean 0 Sum Observations 0 Std Deviation Variance Skewness Kurtosis Uncorrected SS Corrected SS Coeff Variation. Std Error Mean Basic Statistical Measures Location Variability Mean Std Deviation Median Variance Mode Range Interquartile Range Tests for Location: Mu0=0 Test Statistic p Value Student's t t 0 Pr > t Sign M 1 Pr >= M Signed Rank S 2 Pr >= S Quantiles (Definition 5) Quantile Estimate 100% Max % % % % Q % Median % Q % % -0.50
38 bacteria growth under 4 packaging conditions residual analyses The UNIVARIATE Procedure Variable: resid Quantiles (Definition 5) Quantile Estimate 1% % Min Extreme Observations Lowest Highest Value Obs Value Obs Stem Leaf # Boxplot *-----* Multiply Stem.Leaf by 10**-1 Normal Probability Plot * *++ * ++++ * *+* *++++ * *+ * +++ +*++ *
39 Two-way ANOVA/ Factorial example 2 - interaction plots Patient Waiting Time data Factorial anova model Basic Model: Y ijk = μ ij + ijk [ cell means coding] = μ + i + j + ( ) ij + ijk [ effects coding] Error Assumption: ijk ~ indep. N(0, 2 ) i=1,2,,g [treatments or populations] j=1,2,,n i [replications] H 0 : all ( ) ij =0 [no interaction] H 0 : all i =0 [no A main effect] H 0 : all j =0 [no B main effect] ODS ESCAPECHAR= ^ ; /* for fancy formatting later */ ODS RTF file='d:\baileraj\classes\fall 2003\sta402\SAS-programs\twowayoutput.rtf ; title Two-way ANOVA/ Factorial example 2 - interaction plots; title2 Patient Waiting Time data; data cwait; input doctype $ practype $ cards; gen group 15 gen group 20 gen group 25 gen group 20 gen solo 20 gen solo 25 gen solo 30 gen solo 25 spec group 30 spec group 25 spec group 30 spec group 35 spec solo 25 spec solo 20 spec solo 30 spec solo 30 ; proc print; proc sort; by doctype practype; proc means noprint; by doctype practype; output out=factmean mean=timemean; proc plot data=factmean; plot timemean*doctype=practype / vaxis=0 to 35 by 5; proc glm data=cwait order=data; class doctype practype; model time=doctype practype; /* equivalent model statement
40 Two-way ANOVA/ Factorial example 2 - interaction plots Patient Waiting Time data model time=doctype practype doctype*practype; */ output out=new p=yhat r=resid; lsmeans doctype practype doctype*practype / stderr pdiff; means doctype practype doctype*practype / tukey; ODS RTF CLOSE; Obs doctype practype time 1 gen group 15 2 gen group 20 3 gen group 25 4 gen group 20 5 gen solo 20 6 gen solo 25 7 gen solo 30 8 gen solo 25 9 spec group spec group spec group spec group spec solo spec solo spec solo spec solo 30
41 Two-way ANOVA/ Factorial example 2 - interaction plots Patient Waiting Time data Plot of timemean*doctype. Symbol is value of practype. g s timemean 35 ˆ 30 ˆ 25 ˆ s 20 ˆ g 15 ˆ 10 ˆ 5 ˆ 0 ˆ ƒƒˆƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒˆƒƒ gen spec doctype
THE UNIVERSITY OF BRITISH COLUMBIA FORESTRY 430 and 533. Time: 50 minutes 40 Marks FRST Marks FRST 533 (extra questions)
THE UNIVERSITY OF BRITISH COLUMBIA FORESTRY 430 and 533 MIDTERM EXAMINATION: October 14, 2005 Instructor: Val LeMay Time: 50 minutes 40 Marks FRST 430 50 Marks FRST 533 (extra questions) This examination
More informationCSC 328/428 Summer Session I 2002 Data Analysis for the Experimenter FINAL EXAM
options pagesize=53 linesize=76 pageno=1 nodate; proc format; value $stcktyp "1"="Growth" "2"="Combined" "3"="Income"; data invstmnt; input stcktyp $ perform; label stkctyp="type of Stock" perform="overall
More informationCentering and Interactions: The Training Data
Centering and Interactions: The Training Data A random sample of 150 technical support workers were first given a test of their technical skill and knowledge, and then randomly assigned to one of three
More informationEXST3201 Mousefeed01 Page 1
EXST3201 Mousefeed01 Page 1 3 /* 4 Examine differences among the following 6 treatments 5 N/N85 fed normally before weaning and 85 kcal/wk after 6 N/R40 fed normally before weaning and 40 kcal/wk after
More informationSTAT:5400 Computing in Statistics
STAT:5400 Computing in Statistics Introduction to SAS Lecture 18 Oct 12, 2015 Kate Cowles 374 SH, 335-0727 kate-cowles@uiowaedu SAS SAS is the statistical software package most commonly used in business,
More informationTHIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010
THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL STOR 455 Midterm September 8, INSTRUCTIONS: BOTH THE EXAM AND THE BUBBLE SHEET WILL BE COLLECTED. YOU MUST PRINT YOUR NAME AND SIGN THE HONOR PLEDGE
More informationSTAT 503 Fall Introduction to SAS
Getting Started Introduction to SAS 1) Download all of the files, sas programs (.sas) and data files (.dat) into one of your directories. I would suggest using your H: drive if you are using a computer
More informationSTAT:5201 Applied Statistic II
STAT:5201 Applied Statistic II Two-Factor Experiment (one fixed blocking factor, one fixed factor of interest) Randomized complete block design (RCBD) Primary Factor: Day length (short or long) Blocking
More informationFactorial ANOVA. Skipping... Page 1 of 18
Factorial ANOVA The potato data: Batches of potatoes randomly assigned to to be stored at either cool or warm temperature, infected with one of three bacterial types. Then wait a set period. The dependent
More informationFactorial ANOVA with SAS
Factorial ANOVA with SAS /* potato305.sas */ options linesize=79 noovp formdlim='_' ; title 'Rotten potatoes'; title2 ''; proc format; value tfmt 1 = 'Cool' 2 = 'Warm'; data spud; infile 'potato2.data'
More informationWeek 05 Class Activities
Week 05 Class Activities File: week-05-24sep07.doc Directory (hp/compaq): C:\baileraj\Classes\Fall 2007\sta402\handouts Directory: \\Muserver2\USERS\B\\baileraj\Classes\sta402\handouts Better Graphics
More informationUsing PC SAS/ASSIST* for Statistical Analyses
Using PC SAS/ASSIST* for Statistical Analyses Margaret A. Nemeth, Monsanto Company lptroductjon SAS/ASSIST, a user friendly, menu driven applications system, is available on several platforms. This paper
More information1 The SAS System 23:01 Friday, November 9, 2012
2101f12HW9chickwts.log Saved: Wednesday, November 14, 2012 6:50:49 PM Page 1 of 3 1 The SAS System 23:01 Friday, November 9, 2012 NOTE: Copyright (c) 2002-2010 by SAS Institute Inc., Cary, NC, USA. NOTE:
More informationLab #9: ANOVA and TUKEY tests
Lab #9: ANOVA and TUKEY tests Objectives: 1. Column manipulation in SAS 2. Analysis of variance 3. Tukey test 4. Least Significant Difference test 5. Analysis of variance with PROC GLM 6. Levene test for
More information* Sample SAS program * Data set is from Dean and Voss (1999) Design and Analysis of * Experiments. Problem 3, page 129.
SAS Most popular Statistical software worldwide. SAS claims that its products are used at over 40,000 sites, including at 90% of the Fortune 500. This will not be all SAS as they make other products, such
More informationEXST SAS Lab Lab #6: More DATA STEP tasks
EXST SAS Lab Lab #6: More DATA STEP tasks Objectives 1. Working from an current folder 2. Naming the HTML output data file 3. Dealing with multiple observations on an input line 4. Creating two SAS work
More informationApplied Regression Modeling: A Business Approach
i Applied Regression Modeling: A Business Approach Computer software help: SAS code SAS (originally Statistical Analysis Software) is a commercial statistical software package based on a powerful programming
More informationCH5: CORR & SIMPLE LINEAR REFRESSION =======================================
STAT 430 SAS Examples SAS5 ===================== ssh xyz@glue.umd.edu, tap sas913 (old sas82), sas https://www.statlab.umd.edu/sasdoc/sashtml/onldoc.htm CH5: CORR & SIMPLE LINEAR REFRESSION =======================================
More informationThe Kenton Study. (Applied Linear Statistical Models, 5th ed., pp , Kutner et al., 2005) Page 1 of 5
The Kenton Study The Kenton Food Company wished to test four different package designs for a new breakfast cereal. Twenty stores, with approximately equal sales volumes, were selected as the experimental
More informationRepeated Measures Part 4: Blood Flow data
Repeated Measures Part 4: Blood Flow data /* bloodflow.sas */ options linesize=79 pagesize=100 noovp formdlim='_'; title 'Two within-subjecs factors: Blood flow data (NWK p. 1181)'; proc format; value
More informationA Step by Step Guide to Learning SAS
A Step by Step Guide to Learning SAS 1 Objective Familiarize yourselves with the SAS programming environment and language. Learn how to create and manipulate data sets in SAS and how to use existing data
More informationData Management - 50%
Exam 1: SAS Big Data Preparation, Statistics, and Visual Exploration Data Management - 50% Navigate within the Data Management Studio Interface Register a new QKB Create and connect to a repository Define
More informationWithin-Cases: Multivariate approach part one
Within-Cases: Multivariate approach part one /* sleep2.sas */ options linesize=79 noovp formdlim=' '; title "Student's Sleep data: Matched t-tests with proc reg"; data bedtime; infile 'studentsleep.data'
More informationOutline. Topic 16 - Other Remedies. Ridge Regression. Ridge Regression. Ridge Regression. Robust Regression. Regression Trees. Piecewise Linear Model
Topic 16 - Other Remedies Ridge Regression Robust Regression Regression Trees Outline - Fall 2013 Piecewise Linear Model Bootstrapping Topic 16 2 Ridge Regression Modification of least squares that addresses
More informationContrasts and Multiple Comparisons
Contrasts and Multiple Comparisons /* onewaymath.sas */ title2 'Oneway with contrasts and multiple comparisons (Exclude Other/DK)'; %include 'readmath.sas'; if ethnic ne 6; /* Otherwise, throw the case
More informationWeek 1 [20-24 Aug 07] Class Activities. FILE: \\Muserver2\USERS\B\baileraj\Classes\sta402\handouts\week-01-22aug07.doc
Week 1 [20-24 Aug 07] Class Activities FILE: \\Muserver2\USERS\B\baileraj\Classes\sta402\handouts\week-01-22aug07.doc 0. Roster STA 402 (n=3) Menickelly,Matthew Mix, Nathan Weber, Madeline STA 502 (n=11)
More informationSTAT 2607 REVIEW PROBLEMS Word problems must be answered in words of the problem.
STAT 2607 REVIEW PROBLEMS 1 REMINDER: On the final exam 1. Word problems must be answered in words of the problem. 2. "Test" means that you must carry out a formal hypothesis testing procedure with H0,
More informationLaboratory Topics 1 & 2
PLS205 Lab 1 January 12, 2012 Laboratory Topics 1 & 2 Welcome, introduction, logistics, and organizational matters Introduction to SAS Writing and running programs; saving results; checking for errors
More informationIntroductory Guide to SAS:
Introductory Guide to SAS: For UVM Statistics Students By Richard Single Contents 1 Introduction and Preliminaries 2 2 Reading in Data: The DATA Step 2 2.1 The DATA Statement............................................
More informationThe ANOVA Procedure (Chapter)
SAS/STAT 9.3 User s Guide The ANOVA Procedure (Chapter) SAS Documentation This document is an individual chapter from SAS/STAT 9.3 User s Guide. The correct bibliographic citation for the complete manual
More informationModule 3: SAS. 3.1 Initial explorative analysis 02429/MIXED LINEAR MODELS PREPARED BY THE STATISTICS GROUPS AT IMM, DTU AND KU-LIFE
St@tmaster 02429/MIXED LINEAR MODELS PREPARED BY THE STATISTICS GROUPS AT IMM, DTU AND KU-LIFE Module 3: SAS 3.1 Initial explorative analysis....................... 1 3.1.1 SAS JMP............................
More informationCell means coding and effect coding
Cell means coding and effect coding /* mathregr_3.sas */ %include 'readmath.sas'; title2 ''; /* The data step continues */ if ethnic ne 6; /* Otherwise, throw the case out */ /* Indicator dummy variables
More informationAnalysis of variance - ANOVA
Analysis of variance - ANOVA Based on a book by Julian J. Faraway University of Iceland (UI) Estimation 1 / 50 Anova In ANOVAs all predictors are categorical/qualitative. The original thinking was to try
More informationRecall the expression for the minimum significant difference (w) used in the Tukey fixed-range method for means separation:
Topic 11. Unbalanced Designs [ST&D section 9.6, page 219; chapter 18] 11.1 Definition of missing data Accidents often result in loss of data. Crops are destroyed in some plots, plants and animals die,
More informationStatistics Lab #7 ANOVA Part 2 & ANCOVA
Statistics Lab #7 ANOVA Part 2 & ANCOVA PSYCH 710 7 Initialize R Initialize R by entering the following commands at the prompt. You must type the commands exactly as shown. options(contrasts=c("contr.sum","contr.poly")
More informationPaper ODS, YES! Odious, NO! An Introduction to the SAS Output Delivery System
Paper 149-25 ODS, YES! Odious, NO! An Introduction to the SAS Output Delivery System Lara Bryant, University of North Carolina at Chapel Hill, Chapel Hill, NC Sally Muller, University of North Carolina
More informationContents of SAS Programming Techniques
Contents of SAS Programming Techniques Chapter 1 About SAS 1.1 Introduction 1.1.1 SAS modules 1.1.2 SAS module classification 1.1.3 SAS features 1.1.4 Three levels of SAS techniques 1.1.5 Chapter goal
More informationQuick Results with the Output Delivery System
Paper 58-27 Quick Results with the Output Delivery System Sunil K. Gupta, Gupta Programming, Simi Valley, CA ABSTRACT SAS s new Output Delivery System (ODS) opens a whole new world of options in generating
More informationLab #1: Introduction to Basic SAS Operations
Lab #1: Introduction to Basic SAS Operations Getting Started: OVERVIEW OF SAS (access lab pages at http://www.stat.lsu.edu/exstlab/) There are several ways to open the SAS program. You may have a SAS icon
More informationPLS205 Lab 1 January 9, Laboratory Topics 1 & 2
PLS205 Lab 1 January 9, 2014 Laboratory Topics 1 & 2 Welcome, introduction, logistics, and organizational matters Introduction to SAS Writing and running programs saving results checking for errors Different
More informationSAS CURRICULUM. BASE SAS Introduction
SAS CURRICULUM BASE SAS Introduction Data Warehousing Concepts What is a Data Warehouse? What is a Data Mart? What is the difference between Relational Databases and the Data in Data Warehouse (OLTP versus
More informationUsing SAS to Analyze CYP-C Data: Introduction to Procedures. Overview
Using SAS to Analyze CYP-C Data: Introduction to Procedures CYP-C Research Champion Webinar July 14, 2017 Jason D. Pole, PhD Overview SAS overview revisited Introduction to SAS Procedures PROC FREQ PROC
More informationBaruch College STA Senem Acet Coskun
Baruch College STA 9750 BOOK BUY A Predictive Mode Senem Acet Coskun Table of Contents Summary 3 Why this topic? 4 Data Sources 6 Variable Definitions 7 Descriptive Statistics 8 Univariate Analysis 9 Two-Sample
More informationStat 5100 Handout #19 SAS: Influential Observations and Outliers
Stat 5100 Handout #19 SAS: Influential Observations and Outliers Example: Data collected on 50 countries relevant to a cross-sectional study of a lifecycle savings hypothesis, which states that the response
More informationBase and Advance SAS
Base and Advance SAS BASE SAS INTRODUCTION An Overview of the SAS System SAS Tasks Output produced by the SAS System SAS Tools (SAS Program - Data step and Proc step) A sample SAS program Exploring SAS
More informationThe NESTED Procedure (Chapter)
SAS/STAT 9.3 User s Guide The NESTED Procedure (Chapter) SAS Documentation This document is an individual chapter from SAS/STAT 9.3 User s Guide. The correct bibliographic citation for the complete manual
More informationStatistics and Data Analysis. Common Pitfalls in SAS Statistical Analysis Macros in a Mass Production Environment
Common Pitfalls in SAS Statistical Analysis Macros in a Mass Production Environment Huei-Ling Chen, Merck & Co., Inc., Rahway, NJ Aiming Yang, Merck & Co., Inc., Rahway, NJ ABSTRACT Four pitfalls are commonly
More information5:2 LAB RESULTS - FOLLOW-UP ANALYSES FOR FACTORIAL
5:2 LAB RESULTS - FOLLOW-UP ANALYSES FOR FACTORIAL T1. n F and n C for main effects = 2 + 2 + 2 = 6 (i.e., 2 observations in each of 3 cells for other factor) Den t = SQRT[3.333x(1/6+1/6)] = 1.054 Den
More informationSAS data statements and data: /*Factor A: angle Factor B: geometry Factor C: speed*/
STAT:5201 Applied Statistic II (Factorial with 3 factors as 2 3 design) Three-way ANOVA (Factorial with three factors) with replication Factor A: angle (low=0/high=1) Factor B: geometry (shape A=0/shape
More informationSTA 570 Spring Lecture 5 Tuesday, Feb 1
STA 570 Spring 2011 Lecture 5 Tuesday, Feb 1 Descriptive Statistics Summarizing Univariate Data o Standard Deviation, Empirical Rule, IQR o Boxplots Summarizing Bivariate Data o Contingency Tables o Row
More informationWeek 6, Week 7 and Week 8 Analyses of Variance
Week 6, Week 7 and Week 8 Analyses of Variance Robyn Crook - 2008 In the next few weeks we will look at analyses of variance. This is an information-heavy handout so take your time reading it, and don
More informationGeneralized Least Squares (GLS) and Estimated Generalized Least Squares (EGLS)
Generalized Least Squares (GLS) and Estimated Generalized Least Squares (EGLS) Linear Model in matrix notation for the population Y = Xβ + Var ( ) = In GLS, the error covariance matrix is known In EGLS
More informationBIOMETRICS INFORMATION
BIOMETRICS INFORMATION (You re 95% likely to need this information) PAMPHLET NO. # 57 DATE: September 5, 1997 SUBJECT: Interpreting Main Effects when a Two-way Interaction is Present Interpreting the analysis
More informationINTRODUCTION TO SAS HOW SAS WORKS READING RAW DATA INTO SAS
TO SAS NEED FOR SAS WHO USES SAS WHAT IS SAS? OVERVIEW OF BASE SAS SOFTWARE DATA MANAGEMENT FACILITY STRUCTURE OF SAS DATASET SAS PROGRAM PROGRAMMING LANGUAGE ELEMENTS OF THE SAS LANGUAGE RULES FOR SAS
More informationSAS/STAT 13.1 User s Guide. The NESTED Procedure
SAS/STAT 13.1 User s Guide The NESTED Procedure This document is an individual chapter from SAS/STAT 13.1 User s Guide. The correct bibliographic citation for the complete manual is as follows: SAS Institute
More informationAn introduction to SPSS
An introduction to SPSS To open the SPSS software using U of Iowa Virtual Desktop... Go to https://virtualdesktop.uiowa.edu and choose SPSS 24. Contents NOTE: Save data files in a drive that is accessible
More informationIntroductory SAS example
Introductory SAS example STAT:5201 1 Introduction SAS is a command-driven statistical package; you enter statements in SAS s language, submit them to SAS, and get output. A fairly friendly user interface
More informationStat 5100 Handout #6 SAS: Linear Regression Remedial Measures
Stat 5100 Handout #6 SAS: Linear Regression Remedial Measures Example: Age and plasma level for 25 healthy children in a study are reported. Of interest is how plasma level depends on age. (Text Table
More informationStat 302 Statistical Software and Its Applications SAS: Data I/O & Descriptive Statistics
Stat 302 Statistical Software and Its Applications SAS: Data I/O & Descriptive Statistics Fritz Scholz Department of Statistics, University of Washington Winter Quarter 2015 February 19, 2015 2 Getting
More information6:1 LAB RESULTS -WITHIN-S ANOVA
6:1 LAB RESULTS -WITHIN-S ANOVA T1/T2/T3/T4. SStotal =(1-12) 2 + + (18-12) 2 = 306.00 = SSpill + SSsubj + SSpxs df = 9-1 = 8 P1 P2 P3 Ms Ms-Mg 1 8 15 8.0-4.0 SSsubj= 3x(-4 2 + ) 4 17 15 12.0 0 = 96.0 13
More informationIntroduction to Statistical Analyses in SAS
Introduction to Statistical Analyses in SAS Programming Workshop Presented by the Applied Statistics Lab Sarah Janse April 5, 2017 1 Introduction Today we will go over some basic statistical analyses in
More informationStat 500 lab notes c Philip M. Dixon, Week 10: Autocorrelated errors
Week 10: Autocorrelated errors This week, I have done one possible analysis and provided lots of output for you to consider. Case study: predicting body fat Body fat is an important health measure, but
More informationChapter 2 (was Ch. 8) (September 21, 2008; 4:54 p.m.) Constructing a data set for analysis: reading, combining, managing and manipulating data sets
Chapter 2 (was Ch. 8) (September 21, 2008 4:54 p.m.) Constructing a data set for analysis: reading, combining, managing and manipulating data sets 2. Constructing a data set for analysis: reading, combining,
More informationSAS Example A10. Output Delivery System (ODS) Sample Data Set sales.txt. Examples of currently available ODS destinations: Mervyn Marasinghe
SAS Example A10 data sales infile U:\Documents\...\sales.txt input Region : $8. State $2. +1 Month monyy5. Headcnt Revenue Expenses format Month monyy5. Revenue dollar12.2 proc sort by Region State Month
More informationIntroduction. About this Document. What is SPSS. ohow to get SPSS. oopening Data
Introduction About this Document This manual was written by members of the Statistical Consulting Program as an introduction to SPSS 12.0. It is designed to assist new users in familiarizing themselves
More informationFurther Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables
Further Maths Notes Common Mistakes Read the bold words in the exam! Always check data entry Remember to interpret data with the multipliers specified (e.g. in thousands) Write equations in terms of variables
More informationBUSINESS ANALYTICS. 96 HOURS Practical Learning. DexLab Certified. Training Module. Gurgaon (Head Office)
SAS (Base & Advanced) Analytics & Predictive Modeling Tableau BI 96 HOURS Practical Learning WEEKDAY & WEEKEND BATCHES CLASSROOM & LIVE ONLINE DexLab Certified BUSINESS ANALYTICS Training Module Gurgaon
More informationPlease login. Take a seat Login with your HawkID Locate SAS 9.3. Raise your hand if you need assistance. Start / All Programs / SAS / SAS 9.
Please login Take a seat Login with your HawkID Locate SAS 9.3 Start / All Programs / SAS / SAS 9.3 (64 bit) Raise your hand if you need assistance Introduction to SAS Procedures Sarah Bell Overview Review
More informationChapter Seven: Multiple Regression II
Chapter Seven: Multiple Regression II Factorial ANOVA and Related Topics 4.1 A One-way Example The following is a textbook example taken from Neter et al.'s Applied Linear Statistical Models. The Kenton
More informationRegression Lab 1. The data set cholesterol.txt available on your thumb drive contains the following variables:
Regression Lab The data set cholesterol.txt available on your thumb drive contains the following variables: Field Descriptions ID: Subject ID sex: Sex: 0 = male, = female age: Age in years chol: Serum
More informationSubset Selection in Multiple Regression
Chapter 307 Subset Selection in Multiple Regression Introduction Multiple regression analysis is documented in Chapter 305 Multiple Regression, so that information will not be repeated here. Refer to that
More informationChoosing the Right Procedure
3 CHAPTER 1 Choosing the Right Procedure Functional Categories of Base SAS Procedures 3 Report Writing 3 Statistics 3 Utilities 4 Report-Writing Procedures 4 Statistical Procedures 6 Available Statistical
More informationStat 302 Statistical Software and Its Applications SAS: Data I/O
Stat 302 Statistical Software and Its Applications SAS: Data I/O Yen-Chi Chen Department of Statistics, University of Washington Autumn 2016 1 / 33 Getting Data Files Get the following data sets from the
More informationChapters 18, 19, 20 Solutions. Page 1 of 14. Demographics from COLLEGE Data Set
18.2 proc tabulate data=learn.college format=7.; class schoolsize gender scholarship; table schoolsize ALL, gender scholarship ALL; n = ' '; Demographics from COLLEGE Data Set ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
More informationSAS Programs SAS Lecture 4 Procedures. Aidan McDermott, April 18, Outline. Internal SAS formats. SAS Formats
SAS Programs SAS Lecture 4 Procedures Aidan McDermott, April 18, 2006 A SAS program is in an imperative language consisting of statements. Each statement ends in a semi-colon. Programs consist of (at least)
More informationTHE L.L. THURSTONE PSYCHOMETRIC LABORATORY UNIVERSITY OF NORTH CAROLINA. Forrest W. Young & Carla M. Bann
Forrest W. Young & Carla M. Bann THE L.L. THURSTONE PSYCHOMETRIC LABORATORY UNIVERSITY OF NORTH CAROLINA CB 3270 DAVIE HALL, CHAPEL HILL N.C., USA 27599-3270 VISUAL STATISTICS PROJECT WWW.VISUALSTATS.ORG
More informationChoosing the Right Procedure
3 CHAPTER 1 Choosing the Right Procedure Functional Categories of Base SAS Procedures 3 Report Writing 3 Statistics 3 Utilities 4 Report-Writing Procedures 4 Statistical Procedures 5 Efficiency Issues
More information22S:166. Checking Values of Numeric Variables
22S:1 Computing in Statistics Lecture 24 Nov. 2, 2016 1 Checking Values of Numeric Variables range checks when you know what the range of possible values is for a given quantitative variable internal consistency
More informationStat 5303 (Oehlert): Unbalanced Factorial Examples 1
Stat 5303 (Oehlert): Unbalanced Factorial Examples 1 > section
More informationModel Selection and Inference
Model Selection and Inference Merlise Clyde January 29, 2017 Last Class Model for brain weight as a function of body weight In the model with both response and predictor log transformed, are dinosaurs
More informationIntermediate SAS: Working with Data
Intermediate SAS: Working with Data OIT Technical Support Services 293-4444 oithelp@mail.wvu.edu oit.wvu.edu/training/classmat/sas/ Table of Contents Getting set up for the Intermediate SAS workshop:...
More informationST Lab 1 - The basics of SAS
ST 512 - Lab 1 - The basics of SAS What is SAS? SAS is a programming language based in C. For the most part SAS works in procedures called proc s. For instance, to do a correlation analysis there is proc
More informationThe SAS interface is shown in the following screen shot:
The SAS interface is shown in the following screen shot: There are several items of importance shown in the screen shot First there are the usual main menu items, such as File, Edit, etc I seldom use anything
More informationIntroduction to STATA
Introduction to STATA Duah Dwomoh, MPhil School of Public Health, University of Ghana, Accra July 2016 International Workshop on Impact Evaluation of Population, Health and Nutrition Programs Learning
More informationAURA ACADEMY SAS TRAINING. Opposite Hanuman Temple, Srinivasa Nagar East, Ameerpet,Hyderabad Page 1
SAS TRAINING SAS/BASE BASIC THEORY & RULES ETC SAS WINDOWING ENVIRONMENT CREATION OF LIBRARIES SAS PROGRAMMING (BRIEFLY) - DATASTEP - PROC STEP WAYS TO READ DATA INTO SAS BACK END PROCESS OF DATASTEP INSTALLATION
More informationSAS CLINICAL SYLLABUS. DURATION: - 60 Hours
SAS CLINICAL SYLLABUS DURATION: - 60 Hours BASE SAS PART - I Introduction To Sas System & Architecture History And Various Modules Features Variables & Sas Syntax Rules Sas Data Sets Data Set Options Operators
More informationBasic Concepts #6: Introduction to Report Writing
Basic Concepts #6: Introduction to Report Writing Using By-line, PROC Report, PROC Means, PROC Freq JC Wang By-Group Processing By-group processing in a procedure step, a BY line identifies each group
More informationPart 1. Getting Started. Chapter 1 Creating a Simple Report 3. Chapter 2 PROC REPORT: An Introduction 13. Chapter 3 Creating Breaks 57
Part 1 Getting Started Chapter 1 Creating a Simple Report 3 Chapter 2 PROC REPORT: An Introduction 13 Chapter 3 Creating Breaks 57 Chapter 4 Only in the LISTING Destination 75 Chapter 5 Creating and Modifying
More informationDr. Barbara Morgan Quantitative Methods
Dr. Barbara Morgan Quantitative Methods 195.650 Basic Stata This is a brief guide to using the most basic operations in Stata. Stata also has an on-line tutorial. At the initial prompt type tutorial. In
More information2011 NAICC ARM 8 Introduction Training, Jan. 2011
2011 NAICC ARM 8 Introduction Training, Jan. 2011 General Overview of ARM Menu Choices 1) Help: a) F1, any Help button, or displays help on current Topic. b) F5 or on a study editor displays help about
More information1 Downloading files and accessing SAS. 2 Sorting, scatterplots, correlation and regression
Statistical Methods and Computing, 22S:30/105 Instructor: Cowles Lab 2 Feb. 6, 2015 1 Downloading files and accessing SAS. We will be using the billion.dat dataset again today, as well as the OECD dataset
More informationa. divided by the. 1) Always round!! a) Even if class width comes out to a, go up one.
Probability and Statistics Chapter 2 Notes I Section 2-1 A Steps to Constructing Frequency Distributions 1 Determine number of (may be given to you) a Should be between and classes 2 Find the Range a The
More informationRobust Linear Regression (Passing- Bablok Median-Slope)
Chapter 314 Robust Linear Regression (Passing- Bablok Median-Slope) Introduction This procedure performs robust linear regression estimation using the Passing-Bablok (1988) median-slope algorithm. Their
More informationSAS Training Spring 2006
SAS Training Spring 2006 Coxe/Maner/Aiken Introduction to SAS: This is what SAS looks like when you first open it: There is a Log window on top; this will let you know what SAS is doing and if SAS encountered
More information15 Wyner Statistics Fall 2013
15 Wyner Statistics Fall 2013 CHAPTER THREE: CENTRAL TENDENCY AND VARIATION Summary, Terms, and Objectives The two most important aspects of a numerical data set are its central tendencies and its variation.
More informationFathom Dynamic Data TM Version 2 Specifications
Data Sources Fathom Dynamic Data TM Version 2 Specifications Use data from one of the many sample documents that come with Fathom. Enter your own data by typing into a case table. Paste data from other
More informationIntroduction to Stata: An In-class Tutorial
Introduction to Stata: An I. The Basics - Stata is a command-driven statistical software program. In other words, you type in a command, and Stata executes it. You can use the drop-down menus to avoid
More informationAcquisition Description Exploration Examination Understanding what data is collected. Characterizing properties of data.
Summary Statistics Acquisition Description Exploration Examination what data is collected Characterizing properties of data. Exploring the data distribution(s). Identifying data quality problems. Selecting
More informationAn Introduction to ODS for Statistical Graphics in SAS 9.1 Robert N. Rodriguez SAS Institute Inc., Cary, North Carolina, USA
An Introduction to ODS for Statistical Graphics in SAS 9.1 Robert N. Rodriguez SAS Institute Inc., Cary, North Carolina, USA ABSTRACT In SAS 9.1, over two dozen SAS/STAT and SAS/ETS procedures have been
More informationSTAT 3304/5304 Introduction to Statistical Computing. Introduction to SAS
STAT 3304/5304 Introduction to Statistical Computing Introduction to SAS What is SAS? SAS (originally an acronym for Statistical Analysis System, now it is not an acronym for anything) is a program designed
More information