Cut Out The Cut And Paste: SAS Macros For Presenting Statistical Output ABSTRACT INTRODUCTION

Size: px
Start display at page:

Download "Cut Out The Cut And Paste: SAS Macros For Presenting Statistical Output ABSTRACT INTRODUCTION"

Transcription

1 Cut Out The Cut And Paste: SAS Macros For Presenting Statistical Output Myungshin Oh, UCLA Department of Biostatistics Mel Widawski, UCLA School of Nursing ABSTRACT We, as statisticians, often spend more time cutting and pasting from pure SAS output in making a report for clients, than we spend writing the programs. This is especially true, when we perform the same PROC step for many variables over and over and get hundreds pages of output, we need to somehow summarize them. In order to save time, these Macros produce a summarized version of output containing just the information needed. We have developed Macros for paired t-test, reliability analysis (Cronbach s alpha), Wilcoxon test, and analysis of variance. The Macro for paired t-test combines information from PROC MEANS, PROC TTEST, and PROC CORR. We often need to present reliability information on a number of scales and sub-scales simultaneously; this macro produces a condensed table. Combining the information from PROC NPAR1WAY and PROC MEANS, our Wilcoxon macro presents a condensed table including Wilcoxon statistics and medians, which are more understandable by clients than the rank sums. The last of our macros is designed for repeated measure analysis using PROC GLM. It produces as simple table as containing main ANOVA table with descriptive statistics in each time and each combination of classes, eliminating much extraneous information. INTRODUCTION The job of a statistician is taken up with preparing data for analysis, determining the appropriate analysis, and programming statistical procedures. Unfortunately much of the time is actually spent cutting, pasting, and reformatting summarized but informative reports for clients. Much of the time SAS output is much too voluminous to present to the client just as it is produced. And often the output of more than one procedure needs to be combined for meaningful reports. This requires long and tedious work, such as cutting the essential information from SAS pure output, pasting it to separate editor program and rearranging it for the client to catch the information easily. The complete output from containing all possible information is often only meaningful for statisticians, and distracting to clients. The output often needs to be rearranged to clearly present what has been found. Finally, information from disparate procedures is more meaningful when presented in single tables. Frequently we perform the same PROC step for many variables over and over and produce hundreds pages of output; however we usually only need the few key values for each variable to present to the client. For example, after performing PROC GLM or PROC TTEST for several variables, we go through all the output detail to check all statistical information. However, if all of the diagnostics look good, all we want to report for the client is the values of the statistic computed, degrees of freedom, and p-values from each test. What we are left with is an endless simple cut and paste job. To save our time in cutting, pasting and editing for presenting statistical output, we wrote SAS Macro programs to produce the SAS output from repeated PROC steps, save it as SAS data sets, and create a condensed table with the information we need displayed in the format we want. The names of the target data sets, target variables, and some options to control the procedure(s) are supplied to the macro in the form of macro parameters. The output of the macros is the condensed, informative, and hopefully, clear output usually presented SAS LOG WINDOW. The four macros are listed here: 1. PTTEST: Paired t-test with correlations. 2. WILCOX: Wilcoxon test (two sample rank-sum test) and medians 3. GLM: Analysis of variance limited to type III sum of squares and mean and standard deviations. 4. ALPHA: Reliability analysis (Cronbach s alpha) on a number of scales and sub-scales simultaneously. Supplemental material, including SAS source code for the macros, sample programs that call the macros, and data for these macros, is available on the web ( There will be links to the macros from this staff web page at the UCLA School of Nursing Web site. Since you may find some of our decisions differ from your preferences, the macros are presented completely commented. This should enable to modify them for your requirements. We have found that the time for producing these macros is approximately equal to the programming and cutting and pasting involved in 2½ moderately large runs involving these procedures. If you find our choices adequate for your needs then you have saved even that overhead. 1

2 PAIRED T-TEST Paired t-tests are used to test for differences between two means when measures are taken at two time points, or those being measured are matched in some way and independence cannot be assumed. Standard SAS output from PROC TTEST for paired samples contains more information than is usually needed for presentation to the client. Statistics like the confidence limits of mean differences, and confidence limits of standard deviations of differences usually are not meaningful to the client. In fact, clients prefer seeing actual means rather than mean differences. Clients often would like to see correlations between measures (or two time points) in addition to the standard t-test output. The PTTEST macro produces a single condensed table containing: 1. Basic descriptive statistics for each of the two variables including means, standard deviations, medians, the minimum values, and the maximum values 2. T-test statistics and corresponding p-values 3. Pearson correlations between the two variables and corresponding p-values We have found that this is the information our clients usually request, and easily understand. SAMPLE MACRO CALL AND OUTPUT This is a sample macro call for the PTTEST macro that uses the TRIAL data set. This data set is available on our web site and also available at the SAS site. It is one of the sample datasets for use with SAS examples. In the TRIAL data set, two responses, Y1 and Y2, are each measure three times for each subject (pre-treatment, post-treatment, and in a later follow-up). Thus, the variable PREY1 is the value on measure Y1 pre-treatment. We want to perform four t-tests to see whether there are significant different between pre and post, and between post and follow-up (for variable Y1 and Y2). We provide values for the parameters required by this macro through the definition of Macro variables. The macro variables LIBR and data provide the input library name and the input data-set name, respectively. These two macro variables are common in other macros. The number of pair(s) to be compared is 4 for npair macro variable, and listv1 and listv2 have the first and second variable names for each pair. In out sample input below, the first variable name in listv1, PREY1 and the first variable name in listv2, POSTY1 (post treatment for y2) is one pair-variables for paired t-test. Title is for the output title. Then, call the PTTEST macro with these all macro input macro variable variables. %let LIBR = work; /* input library name */ %let data = trial; /* input data set name */ %let npair = 4; /* number of pairs of variables */ %let listv1 = prey1 prey2 posty1 posty2; /* 1st variable of pair (e.g. pre) */ %let listv2 = posty1 posty2 foly1 foly2 ; /* 2nd variable of pair (e.g. post) */ %let title = (Pre vs. Post) & (Post vs. Follow-up); %pttest (&LIBR, &data, &npair, &listv1, &listv2, &title); RUN; The output produced by this macro appears in LOG WINDOW in the MS Windows version of SAS. It is reproduced in the box below. As you can see information from a number of statistical procedures are combined in a single easily accessible table. Mean, median, standard deviation, minimum and maximum are obtained from PROC MEANS. The t-test is obtained from PROC TTEST, and the correlations from PROC CORR. Any title you choose will be used for the table. The output for each of the 4 pairs is presented separated by a dotted line. ****** PAIRED T TESTS : (Pre vs. Post) & (Post vs. Follow-up) TTest Corr Variable Label N Mean Med STD Min Max Df (ProbT) (ProbF) PreY1 Pre Y PostY1 Post Y (0.0003) (0.8379) PreY2 Pre Y PostY2 Post Y (0.0797) (0.1850) PostY1 Post Y FolY1 FollUp Y (0.2435) (0.1686) PostY2 Post Y FolY2 FollUp Y (0.0005) (0.9879) This example was chosen because it was available not because it is a good example of the use of the procedure. But it does afford an opportunity to explore each statistic presented. The medians are approximately equal to the means 2

3 so we can tell that the distribution is at least symmetrical. The standard deviations for the most part are equivalent, though the Y2 follow-up may be a problem. Where there appears to be a significant difference the pre versus post for Y1 the difference is also reflected in the medians, and in the minimum and maximum values. This precludes worry about a ceiling effect. However The lack of relationship between pre and post measures for Y1 implies the people are responding to the treatment differently. On the other hand even thought there is a trend toward a decrease for prepost Y2, the mild correlation indicates that there is some tendency toward equal change in Y2 for different subjects. It might be useful to contrast the table above to the output you would obtain from running each of the procedures separately. This output is presented below. The MEANS Procedure Variable Label N Mean Median Std Dev Minimum Maximum PreY1 Pre Y PreY2 Pre Y PostY1 Post Y PostY2 Post Y FolY1 FollUp Y FolY2 FollUp Y page break here The TTEST Procedure Statistics Lower CL Upper CL Lower CL Upper CL Difference N Mean Mean Mean Std Dev Std Dev Std Dev Std Err Minimum Maximum PreY1 - PostY PreY2 - PostY PostY1 - FolY PostY2 - FolY T-Tests Difference DF t Value Pr > t PreY1 - PostY PreY2 - PostY PostY1 - FolY PostY2 - FolY page break here The CORR Procedure 4 With Variables: PostY1 PostY2 FolY1 FolY2 4 Variables: PreY1 PreY2 PostY1 PostY2 Simple Statistics Variable N Mean Std Dev Sum Minimum Maximum Label PostY Post Y1 PostY Post Y2 FolY FollUp Y1 FolY FollUp Y2 PreY Pre Y1 PreY Pre Y2 Pearson Correlation Coefficients, N = 18 Prob > r under H0: Rho=0 PreY1 PreY2 PostY1 PostY2 PostY Post Y PostY Post Y FolY FollUp Y FolY FollUp Y

4 Notice that this output is much more verbose, and elements are not easily assimilated together. In addition the program is longer than the simple macro call. The statistics of interest, such as correlation, are often buried in a larger table. As more comparisons are added the hunt, cut, and paste time increases. WILCOXON TWO SAMPLE RANK SUM TEST The Wilcoxon rank sum test is used when the criterion variable is not normalizable and two groups are to be compared. The WILCOXON macro presents a condensed table for Wilcoxon two sample rank-sum test, including Wilcoxon statistics. Medians are presented instead of the rank sums in standard NPAR1WAY output as they tend to be more accessible to our clients. While it is possible to have a significant Wilcoxon test when the medians are the same, this tends to raise doubts about the adequacy of the test for the purpose. The condensed output the macro produces contains medians and numbers for target variables for each of the two groups, combined with Wilcoxon s rank-sum test statistics and their corresponding p-values. In addition p-values less than or equal to 0.05 are flagged with *******, and p-value less than or equal to 0.1 are flagged with *. SAMPLE MACRO CALL AND OUTPUT Let us assume that we want to compare the effects for treatment A with treatment B separately at each time period for both variables Y1 and Y2. In addition, suppose that the variables of interest are completely ill conditioned for parametric analysis. In that case we may find Wilcoxon two sample rank-sum tests more appropriate. Here we are using the TRIAL data set even though there is no evidence that parametric statistics are inappropriate here. The where macro variable supplies a WHERE statement, similar to what you would use in PROC NPAR1WAY, that restricts the target data set. Since this sample data set has more than 2 groups in the TRT variable, this macro variable is very useful to restrict the analysis to the 2 target groups. The class macro variable supplies the name of a group variable, and the var macro variable contains the name(s) of the target variable(s) to be analyzed. %let LIBR = work ; /* input library name */ %let data = trial; /* input data set name */ %let where = where (trt="a" or trt="b") ; /* WHERE statement in NPAR1WAY */ %let class = trt ; /* class (group) variable */ %let var = PreY1 PostY1 FolY1 DiffY1 PreY2 PostY2 FolY2 DiffY2; %let title = Between Treatments (A vs. B); %wilcox(&libr, &data, &where, &class, &var, &title); RUN; The Wilcox macro output is presented below. For each target variable, the medians and the numbers of observation for each of the two groups, and test statistics comparing the two groups are aligned in a row. Follow-up of Y2 (FOLY2) and difference between pretreatment and follow-up of Y2 (DIFFY2) are significantly different (p-value <=0.05) between treatments A and B. These significant comparisons are flagged in the output for quick scanning. The test of the difference is analogous to the test of an interaction between time and group in an analyses of variance. We present one sided tests as we always know the direction, the macro can be easily modified to perform two sided test. WILCOXON - Between Treatments (A vs. B) Summary Stats are Medians Values in Parentheses () are N's Tests are Approximate t from Wilcoxon Rank Sums NAME A B Approx t p>=t PreY PostY FolY DiffY PreY PostY FolY ******** DiffY ******** 4

5 Note that unlike parametric statistics where the mean difference should equal the differences of the means when the N is constant, the median of the differences does not necessarily equal the median of the differences. Compare this with the output of the standard procedures presented below for just two of the six variables in our table. In addition the following output does not contain medians. Wilcoxon - Between Treatments (A vs. B) 1 The NPAR1WAY Procedure Wilcoxon Scores (Rank Sums) for Variable PreY1 Classified by Variable Trt Sum of Expected Std Dev Mean Trt N Scores Under H0 Under H0 Score A B Average scores were used for ties. Wilcoxon Two-Sample Test Statistic Normal Approximation Z One-Sided Pr < Z Two-Sided Pr > Z t Approximation One-Sided Pr < Z Two-Sided Pr > Z Z includes a continuity correction of 0.5. Kruskal-Wallis Test Chi-Square DF 1 Pr > Chi-Square page break here Wilcoxon - Between Treatments (A vs. B) 2 The NPAR1WAY Procedure Wilcoxon Scores (Rank Sums) for Variable PostY1 Classified by Variable Trt Sum of Expected Std Dev Mean Trt N Scores Under H0 Under H0 Score A B Average scores were used for ties. Wilcoxon Two-Sample Test Statistic Normal Approximation Z One-Sided Pr < Z Two-Sided Pr > Z t Approximation One-Sided Pr < Z Two-Sided Pr > Z Z includes a continuity correction of 0.5. Kruskal-Wallis Test Chi-Square DF 1 Pr > Chi-Square page break here

6 ANALYSIS OF VARIANCE The GLM macro is designed for a more condensed output from PROC GLM. It will handle repeated measures analysis or independent groups analysis at a single time period. It produces a main ANOVA table for Type III sum of squares, and mean and standard deviations at each level of the class variable and at each time point. All of the output comes from PROC GLM. Incomplete models may be specified. This macro is most useful when the same model will be used for a number of different dependent measures (e.g. Y1 and Y2). The repeated measures are provided and time periods represented as separate variables in the dataset. For example, PREY1 is the measurement of Y1 before treatment; POSTY1 is the measurement of Y1 after treatment. SAMPLE MACRO CALL AND OUTPUT This sample macro call performs a repeated measures analysis of variance. Time is the repeated factor with 3 levels (Pre, Post, Follow-up) and treatment is the between groups factor with 3 levels as well (A, B and Control). The data come from the same TRIAL data set we have seen earlier. Main effects and interactions will be included in the model. The nclass macro variable provides the number of class(independent) variables (factors) and the class macro variable provides name(s) of the grouping variables (factors) to be used in the GLM model. They are listed with a space between them as you would in a CLASS statement in PROC GLM. The model macro variable provides model specifications as you would in a MODEL statement in GLM. If the model specification is null, the full model with all class variables and interactions will be used. The means macro variable controls the output of variable means, it is specified as you would in a MEANS statement. In order to request means broken down by effects those effects must be included in the model macro variable as well. The remaining three macro variables provide information on how to interpret the dependent variable list. The ndep macro variable indicates the number of dependent measures (variable sets analyzed in separate analysis), the time macro variable indicates the number of time points (variables of measurements at each time point), and deplist lists all of variable names from the data set needed for analysis. Sets of variables for each measure are separated by a (vertical bar). There should be as many variables in each set of variable names on the deplist as there are time periods. In the example below we are using two different measures Y1 and Y2, and measuring them at three time periods each. %let LIBR = work ; /* input library name */ %let data = trial; /* input data set name */ %let nclass = 1 ; /* number of class(indep.) var. */ %let class = trt; /* names of class(indep.) var.(s)*/ %let model =; /* model specification, leave */ /* null for the full model */ %let means = trt; /* MEANS statement in GLM */ %let ndep = 2 ; /* # of dep. Var. sets (measures)*/ %let time = 3 ; /* # of time periods(variables) */ %let deplist = PreY1 PostY1 FolY1 PreY2 PostY2 FolY2; %glm(&libr, &data, &nclass, &class, &model, &means, &ndep, &time,&deplist) ; RUN; As the sample macro call indicates, two separate PROC GLM s will be run using the same GLM model. We provide PROC GLM with two dependent variable sets of three variables each (for the 3 time periods). Notice that for Y1 there is significant time effect and we can see the trend by checking the means at each time point and in each group. That is, each group s means for Y1 increase across time. Though, it seems that the increase for the control group is slightly slower reaching the peak by follow-up rather than immediately after treatment as were done by groups A and B, no significant interaction was detected. For Y2, the time main effect, and the time by treatment interaction effect are significant. Generally, if an interaction effect is significant that drives the discussion. When the interaction is significant you must take treatment into account when discussing time, and you must consider time when discussing treatment. Scanning the means for the measure Y2 reveals that the source of the interaction depend on increases over time for the CONTROL and treatment B as compared with the relatively stable treatment A. 6

7 ******* Summary of GLM (Repeated : 3 time-point): Model = TRT ########### Dependent Variables : Time1=PREY1 Time2=POSTY1 Time3=FOLY1 PRE Y1 POST Y1 FOLLUP Y1 SOURCE DF SumSquare MeanSquare F Value Pr > F BetweenS Trt BetweenS Error WithinSu time <.0001 WithinSu time*trt WithinSu Error(time) Means/Stdev in each Level of Class variables -- TRT N Mean_T1 Sd_T1 Mean_T2 Sd_T2 Mean_T3 Sd_T3 A B Control ########### Dependent Variables : Time1=PREY2 Time2=POSTY2 Time3=FOLY2 PRE Y2 POST Y2 FOLLUP Y2 SOURCE DF SumSquare MeanSquare F Value Pr > F BetweenS Trt BetweenS Error WithinSu time <.0001 WithinSu time*trt <.0001 WithinSu Error(time) Means/Stdev in each Level of Class variables -- TRT N Mean_T1 Sd_T1 Mean_T2 Sd_T2 Mean_T3 Sd_T3 A B Control In retrospect, we could have further condensed the output into a single table summarizing the results followed by tables of means. In such a table the error variances may be eliminated and the output restricted to the degrees of freedom, the F values, and the p-values. We took the approach we did since most of our clients would like to see the means with the presentation of each analysis. If we develop an additional method of presenting the results the macro will be referenced at the same web site. RELIABILITY ANALYSIS Reliability analysis is commonly used to assess internal consistency of several variables that measure a single underlying factor / latent structure. One of the most common measures of internal consistency of a scale is Chronbach s Alpha. Very often data contains several sub-scales, and clients would like information on the reliability of each sub-scale as well as the scale as a whole. Especially in scale construction it is often useful to present information on alpha with item deleted. This is useful for determining items that do not seem to fit with the over all scale, and also items that may need to be reversed. All of this information and more is presented when the ALPHA option is requested on PROC CORR; however, presentation of this information by PROC CORR separates the basic information on scale and sub-scale reliability. The ALPHA macro presents a summary table of Cronbach s alphas for scales and sub-scales at the beginning of the output. If the option to present the alphas with the item deleted ( alpha with deleted variable ) is chosen that is presented for each scale or sub-scale in turn following the summary table. Finally complete correlation matrices are also available on request, but these are presented in the OUTPUT WINDOW instead of the LOG WINDOW. SAMPLE MACRO CALL AND OUTPUT The PSYCHDAT data set has 6 items assessing metal stability of psychiatric patients. Assume that there are two different scales; SCALE1 contains ANXIETY, DEPRESS, and SLEEP, and SEX, LIFE and WGHCHG (Weight Change) are for SCAL2. The sample macro call below requests reliability information for each of these two scales and for the total scale. The ncorr macro variable specifies the number of scales, which is 3 in this example. The nv, var, and scale macro variables are the number of variables in each scale, the variable names, and a label for each scale(s), respectively. Thus, in the example below the first scale contains 3 variables; those variables are ANXIETY, DEPRESS, and SLEEP; and the scale is labeled SCALE 1. Information for each scale should be separated by and be arranged in the same order on the %LET statements for each macro variable. The dvar macro variable controls the production of tables for alpha with deleted variable. Code dvar =1 on the %LET statement to produce alpha 7

8 with deleted variable in LOG WINDOW. If you want to see the original PROC CORR output in OUTPUT WINDOW, set the prtout macro variable equal to 1. %let LIBR = work; /* input library name */ %let data = psychdat; /* input data set name */ %let ncorr = 3; /* number of scales */ %let nv = 3 3 6; /* number of variables in scale */ %let var = Anxiety Depress Sleep Sex Life WghChg Anxiety--WghChg; %let scale = scale 1 scale 2 Total scale; %let dvar = 1; /* 1 requests Alpha with deleted variable */ %let prtout = ; /* 1 requests PROC CORR output in WINDOW */ %alpha (&LIBR, &data, &ncorr, &var, &scale, &nv, &dvar, &prtout); RUN; Notice that we are not requesting the complete print out in the example above, but we are requesting to see the alpha with deleted variable presented below the summary table. Consider the output produced by the ALPHA macro as specified in our sample call. The first section is raw, standardized Alpha and variable names for each scale presented in a single table. This is the default output of the macro. Since we specified dvar=1 above, the alpha with deleted variable section for each scale will follow this summary table. Notice that the TOTAL SCALE, which includes all 6 variables, has a higher standardized alpha score of than either of the sub-scales. When we examine the alpha with deleted variable section for the TOTAL SCALE we can see that the standardized alpha increases when the variable SEX is removed from the scale. Thus, removing that variable would make the TOTAL SCALE more internally consistent. We use the standardized alpha for this because the variables used in this example are not scaled consistently. ****** Cronbachs ALPHA ******** ---- A L P H A ---- SCALE # of var. N RAW STANDARDIZED scale scale Total scale Variables in Scale scale 1 Anxiety Depress Sleep scale 2 Sex Life WghChg Total scale Anxiety--WghChg ******* Cronbach Alpha With Deleted Variable ******* scale 1 - Raw Alpha: Std Alpha: Raw Variables Standardized Variables Deleted var Corr.W/Total Alpha Del Corr.W/Total Alpha Del Anxiety Depress Sleep scale 2 - Raw Alpha: Std Alpha: Raw Variables Standardized Variables Deleted var Corr.W/Total Alpha Del Corr.W/Total Alpha Del Sex Life WghChg Total scale - Raw Alpha: Std Alpha: Raw Variables Standardized Variables Deleted var Corr.W/Total Alpha Del Corr.W/Total Alpha Del Anxiety Depress Sleep Sex Life WghChg

9 CONCLUSION We hope you will find these macros useful in saving time spent on cutting, pasting and reformatting. Feel free to use the macros as is or modify them as needed. Distribute the macros to others please leave the attributions to us intact in the macros. If you modify them add your attribution to ours. We have presented four macros above for modifying SAS output for presentation to clients. These macros are: 1. PTTEST: Paired t-test with correlations. 2. WILCOX: Wilcoxon test (two sample rank-sum test) and medians 3. GLM: Analysis of variance limited to type III sum of squares and mean and standard deviations. 4. ALPHA: Reliability analysis (Cronbach s alpha) on a number of scales and sub-scales simultaneously. One caution should be noted: if you use these macros but your variable naming conventions differ markedly from ours (i.e. super long variable names), or formats used are ones we have not anticipated, then the macros may not work correctly. For those interested in the internals of the macros we make use of these features of SAS: 1. Macro variables, and Macro statements to drive the program. 2. PROC CONTENTS to capture variable names, labels, and formats. 3. Output statements in procedures to produce SAS data sets containing the statistics of interest. 4. ODS specifications in procedures to produce additional SAS data sets containing statistics. 5. Data management procedures such as PROC SORT and PROC TRANSPOSE to manipulate file to make them easier to use. 6. Data steps for merging, reformatting, and producing tables. In addition to the macros presented here, there are some additional macros available on our web site, including the following: 1. XMEANS which presents all of the groups for each variable together rather than all variables grouped together for each group. 2. UNIFORM presents descriptive statistics along with information on normality and skew. 3. CHI presents summaries of chi-square statistics for a number of tables in a single condensed table. (always examine the complete output for error messages as well) In addition, a number of general-purpose file and data management macros will be made available at this site. REFERENCES Art Carpenter. (1998), Carpenter s Complete Guide to the SAS Macro Language, NC: SAS Institute Inc. SAS Institute Inc., SAS/STAT User s Guide, Version 8, Cary, NC: SAS Institute Inc., pp. SAS Institute Inc., SAS Language Reference, Version 8, Cary, NC: SAS Institute Inc., pp. SAS Institute Inc. (1990), SAS Guide to Macro Processing, Version 6, Second Edition, Cary, NC: SAS Institute Inc. ACKNOWLEDGMENTS We would like to acknowledge our clients, especially Drs. Donna Vredevoe and Deborah Koniak-Griffin, whose requirements drove us to develop these macros. CONTACT INFORMATION MyungShin Oh UCLA adumas@hanmail.net Mel Widawski UCLA mel@ucla.edu 9

Introduction to Statistical Analyses in SAS

Introduction to Statistical Analyses in SAS Introduction to Statistical Analyses in SAS Programming Workshop Presented by the Applied Statistics Lab Sarah Janse April 5, 2017 1 Introduction Today we will go over some basic statistical analyses in

More information

PLS205 Lab 1 January 9, Laboratory Topics 1 & 2

PLS205 Lab 1 January 9, Laboratory Topics 1 & 2 PLS205 Lab 1 January 9, 2014 Laboratory Topics 1 & 2 Welcome, introduction, logistics, and organizational matters Introduction to SAS Writing and running programs saving results checking for errors Different

More information

Online Supplementary Appendix for. Dziak, Nahum-Shani and Collins (2012), Multilevel Factorial Experiments for Developing Behavioral Interventions:

Online Supplementary Appendix for. Dziak, Nahum-Shani and Collins (2012), Multilevel Factorial Experiments for Developing Behavioral Interventions: Online Supplementary Appendix for Dziak, Nahum-Shani and Collins (2012), Multilevel Factorial Experiments for Developing Behavioral Interventions: Power, Sample Size, and Resource Considerations 1 Appendix

More information

Intermediate SAS: Statistics

Intermediate SAS: Statistics Intermediate SAS: Statistics OIT TSS 293-4444 oithelp@mail.wvu.edu oit.wvu.edu/training/classmat/sas/ Table of Contents Procedures... 2 Two-sample t-test:... 2 Paired differences t-test:... 2 Chi Square

More information

Want to Do a Better Job? - Select Appropriate Statistical Analysis in Healthcare Research

Want to Do a Better Job? - Select Appropriate Statistical Analysis in Healthcare Research Want to Do a Better Job? - Select Appropriate Statistical Analysis in Healthcare Research Liping Huang, Center for Home Care Policy and Research, Visiting Nurse Service of New York, NY, NY ABSTRACT The

More information

Lecture 1 Getting Started with SAS

Lecture 1 Getting Started with SAS SAS for Data Management, Analysis, and Reporting Lecture 1 Getting Started with SAS Portions reproduced with permission of SAS Institute Inc., Cary, NC, USA Goals of the course To provide skills required

More information

Cluster Randomization Create Cluster Means Dataset

Cluster Randomization Create Cluster Means Dataset Chapter 270 Cluster Randomization Create Cluster Means Dataset Introduction A cluster randomization trial occurs when whole groups or clusters of individuals are treated together. Examples of such clusters

More information

SAS/STAT 13.1 User s Guide. The NESTED Procedure

SAS/STAT 13.1 User s Guide. The NESTED Procedure SAS/STAT 13.1 User s Guide The NESTED Procedure This document is an individual chapter from SAS/STAT 13.1 User s Guide. The correct bibliographic citation for the complete manual is as follows: SAS Institute

More information

One way ANOVA when the data are not normally distributed (The Kruskal-Wallis test).

One way ANOVA when the data are not normally distributed (The Kruskal-Wallis test). One way ANOVA when the data are not normally distributed (The Kruskal-Wallis test). Suppose you have a one way design, and want to do an ANOVA, but discover that your data are seriously not normal? Just

More information

Statistics and Data Analysis. Common Pitfalls in SAS Statistical Analysis Macros in a Mass Production Environment

Statistics and Data Analysis. Common Pitfalls in SAS Statistical Analysis Macros in a Mass Production Environment Common Pitfalls in SAS Statistical Analysis Macros in a Mass Production Environment Huei-Ling Chen, Merck & Co., Inc., Rahway, NJ Aiming Yang, Merck & Co., Inc., Rahway, NJ ABSTRACT Four pitfalls are commonly

More information

Laboratory Topics 1 & 2

Laboratory Topics 1 & 2 PLS205 Lab 1 January 12, 2012 Laboratory Topics 1 & 2 Welcome, introduction, logistics, and organizational matters Introduction to SAS Writing and running programs; saving results; checking for errors

More information

Product Catalog. AcaStat. Software

Product Catalog. AcaStat. Software Product Catalog AcaStat Software AcaStat AcaStat is an inexpensive and easy-to-use data analysis tool. Easily create data files or import data from spreadsheets or delimited text files. Run crosstabulations,

More information

Macros and ODS. SAS Programming November 6, / 89

Macros and ODS. SAS Programming November 6, / 89 Macros and ODS The first part of these slides overlaps with last week a fair bit, but it doesn t hurt to review as this code might be a little harder to follow. SAS Programming November 6, 2014 1 / 89

More information

Lab #9: ANOVA and TUKEY tests

Lab #9: ANOVA and TUKEY tests Lab #9: ANOVA and TUKEY tests Objectives: 1. Column manipulation in SAS 2. Analysis of variance 3. Tukey test 4. Least Significant Difference test 5. Analysis of variance with PROC GLM 6. Levene test for

More information

Stat 302 Statistical Software and Its Applications SAS: Data I/O

Stat 302 Statistical Software and Its Applications SAS: Data I/O Stat 302 Statistical Software and Its Applications SAS: Data I/O Yen-Chi Chen Department of Statistics, University of Washington Autumn 2016 1 / 33 Getting Data Files Get the following data sets from the

More information

Data Analysis and Hypothesis Testing Using the Python ecosystem

Data Analysis and Hypothesis Testing Using the Python ecosystem ARISTOTLE UNIVERSITY OF THESSALONIKI Data Analysis and Hypothesis Testing Using the Python ecosystem t-test & ANOVAs Stavros Demetriadis Assc. Prof., School of Informatics, Aristotle University of Thessaloniki

More information

Organizing Your Data. Jenny Holcombe, PhD UT College of Medicine Nuts & Bolts Conference August 16, 3013

Organizing Your Data. Jenny Holcombe, PhD UT College of Medicine Nuts & Bolts Conference August 16, 3013 Organizing Your Data Jenny Holcombe, PhD UT College of Medicine Nuts & Bolts Conference August 16, 3013 Learning Objectives Identify Different Types of Variables Appropriately Naming Variables Constructing

More information

The NESTED Procedure (Chapter)

The NESTED Procedure (Chapter) SAS/STAT 9.3 User s Guide The NESTED Procedure (Chapter) SAS Documentation This document is an individual chapter from SAS/STAT 9.3 User s Guide. The correct bibliographic citation for the complete manual

More information

Stat 302 Statistical Software and Its Applications SAS: Data I/O & Descriptive Statistics

Stat 302 Statistical Software and Its Applications SAS: Data I/O & Descriptive Statistics Stat 302 Statistical Software and Its Applications SAS: Data I/O & Descriptive Statistics Fritz Scholz Department of Statistics, University of Washington Winter Quarter 2015 February 19, 2015 2 Getting

More information

186 Statistics, Data Analysis and Modeling. Proceedings of MWSUG '95

186 Statistics, Data Analysis and Modeling. Proceedings of MWSUG '95 A Statistical Analysis Macro Library in SAS Carl R. Haske, Ph.D., STATPROBE, nc., Ann Arbor, M Vivienne Ward, M.S., STATPROBE, nc., Ann Arbor, M ABSTRACT Statistical analysis plays a major role in pharmaceutical

More information

Land Cover Stratified Accuracy Assessment For Digital Elevation Model derived from Airborne LIDAR Dade County, Florida

Land Cover Stratified Accuracy Assessment For Digital Elevation Model derived from Airborne LIDAR Dade County, Florida Land Cover Stratified Accuracy Assessment For Digital Elevation Model derived from Airborne LIDAR Dade County, Florida FINAL REPORT Submitted October 2004 Prepared by: Daniel Gann Geographic Information

More information

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski Data Analysis and Solver Plugins for KSpread USER S MANUAL Tomasz Maliszewski tmaliszewski@wp.pl Table of Content CHAPTER 1: INTRODUCTION... 3 1.1. ABOUT DATA ANALYSIS PLUGIN... 3 1.3. ABOUT SOLVER PLUGIN...

More information

6:1 LAB RESULTS -WITHIN-S ANOVA

6:1 LAB RESULTS -WITHIN-S ANOVA 6:1 LAB RESULTS -WITHIN-S ANOVA T1/T2/T3/T4. SStotal =(1-12) 2 + + (18-12) 2 = 306.00 = SSpill + SSsubj + SSpxs df = 9-1 = 8 P1 P2 P3 Ms Ms-Mg 1 8 15 8.0-4.0 SSsubj= 3x(-4 2 + ) 4 17 15 12.0 0 = 96.0 13

More information

R-Square Coeff Var Root MSE y Mean

R-Square Coeff Var Root MSE y Mean STAT:50 Applied Statistics II Exam - Practice 00 possible points. Consider a -factor study where each of the factors has 3 levels. The factors are Diet (,,3) and Drug (A,B,C) and there are n = 3 observations

More information

Choosing the Right Procedure

Choosing the Right Procedure 3 CHAPTER 1 Choosing the Right Procedure Functional Categories of Base SAS Procedures 3 Report Writing 3 Statistics 3 Utilities 4 Report-Writing Procedures 4 Statistical Procedures 5 Efficiency Issues

More information

SPSS INSTRUCTION CHAPTER 9

SPSS INSTRUCTION CHAPTER 9 SPSS INSTRUCTION CHAPTER 9 Chapter 9 does no more than introduce the repeated-measures ANOVA, the MANOVA, and the ANCOVA, and discriminant analysis. But, you can likely envision how complicated it can

More information

humor... May 3, / 56

humor... May 3, / 56 humor... May 3, 2017 1 / 56 Power As discussed previously, power is the probability of rejecting the null hypothesis when the null is false. Power depends on the effect size (how far from the truth the

More information

Factorial ANOVA. Skipping... Page 1 of 18

Factorial ANOVA. Skipping... Page 1 of 18 Factorial ANOVA The potato data: Batches of potatoes randomly assigned to to be stored at either cool or warm temperature, infected with one of three bacterial types. Then wait a set period. The dependent

More information

The Power and Sample Size Application

The Power and Sample Size Application Chapter 72 The Power and Sample Size Application Contents Overview: PSS Application.................................. 6148 SAS Power and Sample Size............................... 6148 Getting Started:

More information

Simulating Multivariate Normal Data

Simulating Multivariate Normal Data Simulating Multivariate Normal Data You have a population correlation matrix and wish to simulate a set of data randomly sampled from a population with that structure. I shall present here code and examples

More information

SAS/STAT 13.1 User s Guide. The Power and Sample Size Application

SAS/STAT 13.1 User s Guide. The Power and Sample Size Application SAS/STAT 13.1 User s Guide The Power and Sample Size Application This document is an individual chapter from SAS/STAT 13.1 User s Guide. The correct bibliographic citation for the complete manual is as

More information

A SAS and Java Application for Reporting Clinical Trial Data. Kevin Kane MSc Infoworks (Data Handling) Limited

A SAS and Java Application for Reporting Clinical Trial Data. Kevin Kane MSc Infoworks (Data Handling) Limited A SAS and Java Application for Reporting Clinical Trial Data Kevin Kane MSc Infoworks (Data Handling) Limited Reporting Clinical Trials Is Resource Intensive! Reporting a clinical trial program for a new

More information

Conditional and Unconditional Regression with No Measurement Error

Conditional and Unconditional Regression with No Measurement Error Conditional and with No Measurement Error /* reg2ways.sas */ %include 'readsenic.sas'; title2 ''; proc reg; title3 'Conditional Regression'; model infrisk = stay census; proc calis cov; /* Analyze the

More information

Laboratory for Two-Way ANOVA: Interactions

Laboratory for Two-Way ANOVA: Interactions Laboratory for Two-Way ANOVA: Interactions For the last lab, we focused on the basics of the Two-Way ANOVA. That is, you learned how to compute a Brown-Forsythe analysis for a Two-Way ANOVA, as well as

More information

Table of Contents (As covered from textbook)

Table of Contents (As covered from textbook) Table of Contents (As covered from textbook) Ch 1 Data and Decisions Ch 2 Displaying and Describing Categorical Data Ch 3 Displaying and Describing Quantitative Data Ch 4 Correlation and Linear Regression

More information

Choosing the Right Procedure

Choosing the Right Procedure 3 CHAPTER 1 Choosing the Right Procedure Functional Categories of Base SAS Procedures 3 Report Writing 3 Statistics 3 Utilities 4 Report-Writing Procedures 4 Statistical Procedures 6 Available Statistical

More information

Unit 8 SUPPLEMENT Normal, T, Chi Square, F, and Sums of Normals

Unit 8 SUPPLEMENT Normal, T, Chi Square, F, and Sums of Normals BIOSTATS 540 Fall 017 8. SUPPLEMENT Normal, T, Chi Square, F and Sums of Normals Page 1 of Unit 8 SUPPLEMENT Normal, T, Chi Square, F, and Sums of Normals Topic 1. Normal Distribution.. a. Definition..

More information

What s New in Spotfire DXP 1.1. Spotfire Product Management January 2007

What s New in Spotfire DXP 1.1. Spotfire Product Management January 2007 What s New in Spotfire DXP 1.1 Spotfire Product Management January 2007 Spotfire DXP Version 1.1 This document highlights the new capabilities planned for release in version 1.1 of Spotfire DXP. In this

More information

Paper S Data Presentation 101: An Analyst s Perspective

Paper S Data Presentation 101: An Analyst s Perspective Paper S1-12-2013 Data Presentation 101: An Analyst s Perspective Deanna Chyn, University of Michigan, Ann Arbor, MI Anca Tilea, University of Michigan, Ann Arbor, MI ABSTRACT You are done with the tedious

More information

Averages and Variation

Averages and Variation Averages and Variation 3 Copyright Cengage Learning. All rights reserved. 3.1-1 Section 3.1 Measures of Central Tendency: Mode, Median, and Mean Copyright Cengage Learning. All rights reserved. 3.1-2 Focus

More information

Tutorial: Using Tina Vision s Quantitative Pattern Recognition Tool.

Tutorial: Using Tina Vision s Quantitative Pattern Recognition Tool. Tina Memo No. 2014-004 Internal Report Tutorial: Using Tina Vision s Quantitative Pattern Recognition Tool. P.D.Tar. Last updated 07 / 06 / 2014 ISBE, Medical School, University of Manchester, Stopford

More information

Frequency Distributions

Frequency Distributions Displaying Data Frequency Distributions After collecting data, the first task for a researcher is to organize and summarize the data so that it is possible to get a general overview of the results. Remember,

More information

CH5: CORR & SIMPLE LINEAR REFRESSION =======================================

CH5: CORR & SIMPLE LINEAR REFRESSION ======================================= STAT 430 SAS Examples SAS5 ===================== ssh xyz@glue.umd.edu, tap sas913 (old sas82), sas https://www.statlab.umd.edu/sasdoc/sashtml/onldoc.htm CH5: CORR & SIMPLE LINEAR REFRESSION =======================================

More information

for statistical analyses

for statistical analyses Using for statistical analyses Robert Bauer Warnemünde, 05/16/2012 Day 6 - Agenda: non-parametric alternatives to t-test and ANOVA (incl. post hoc tests) Wilcoxon Rank Sum/Mann-Whitney U-Test Kruskal-Wallis

More information

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency Math 1 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency lowest value + highest value midrange The word average: is very ambiguous and can actually refer to the mean,

More information

STATS PAD USER MANUAL

STATS PAD USER MANUAL STATS PAD USER MANUAL For Version 2.0 Manual Version 2.0 1 Table of Contents Basic Navigation! 3 Settings! 7 Entering Data! 7 Sharing Data! 8 Managing Files! 10 Running Tests! 11 Interpreting Output! 11

More information

Introduction to SAS proc calis

Introduction to SAS proc calis Introduction to SAS proc calis /* path1.sas */ %include 'SenicRead.sas'; title2 ''; /************************************************************************ * * * Cases are hospitals * * * * stay Average

More information

A Lazy Programmer s Macro for Descriptive Statistics Tables

A Lazy Programmer s Macro for Descriptive Statistics Tables Paper SA19-2011 A Lazy Programmer s Macro for Descriptive Statistics Tables Matthew C. Fenchel, M.S., Cincinnati Children s Hospital Medical Center, Cincinnati, OH Gary L. McPhail, M.D., Cincinnati Children

More information

Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition

Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Online Learning Centre Technology Step-by-Step - Minitab Minitab is a statistical software application originally created

More information

Paper CC-016. METHODOLOGY Suppose the data structure with m missing values for the row indices i=n-m+1,,n can be re-expressed by

Paper CC-016. METHODOLOGY Suppose the data structure with m missing values for the row indices i=n-m+1,,n can be re-expressed by Paper CC-016 A macro for nearest neighbor Lung-Chang Chien, University of North Carolina at Chapel Hill, Chapel Hill, NC Mark Weaver, Family Health International, Research Triangle Park, NC ABSTRACT SAS

More information

CSC 328/428 Summer Session I 2002 Data Analysis for the Experimenter FINAL EXAM

CSC 328/428 Summer Session I 2002 Data Analysis for the Experimenter FINAL EXAM options pagesize=53 linesize=76 pageno=1 nodate; proc format; value $stcktyp "1"="Growth" "2"="Combined" "3"="Income"; data invstmnt; input stcktyp $ perform; label stkctyp="type of Stock" perform="overall

More information

Notes on Simulations in SAS Studio

Notes on Simulations in SAS Studio Notes on Simulations in SAS Studio If you are not careful about simulations in SAS Studio, you can run into problems. In particular, SAS Studio has a limited amount of memory that you can use to write

More information

Research Methods for Business and Management. Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel

Research Methods for Business and Management. Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel Research Methods for Business and Management Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel A Simple Example- Gym Purpose of Questionnaire- to determine the participants involvement

More information

PHARMACOKINETIC STATISTICAL ANALYSIS SYSTEM - - A SAS/AF AND SAS/FSP APPLICATION

PHARMACOKINETIC STATISTICAL ANALYSIS SYSTEM - - A SAS/AF AND SAS/FSP APPLICATION PHARMACOKINETIC STATISTICAL ANALYSIS SYSTEM - - A SAS/AF AND SAS/FSP APPLICATION Sharon M. Passe, Hoffmann-La Roche Inc. Andrea L Contino, Hoffmann-La Roche Inc. ABSTRACT The statistician responsible for

More information

CDAA No. 4 - Part Two - Multiple Regression - Initial Data Screening

CDAA No. 4 - Part Two - Multiple Regression - Initial Data Screening CDAA No. 4 - Part Two - Multiple Regression - Initial Data Screening Variables Entered/Removed b Variables Entered GPA in other high school, test, Math test, GPA, High school math GPA a Variables Removed

More information

SAS/STAT 14.3 User s Guide The SURVEYFREQ Procedure

SAS/STAT 14.3 User s Guide The SURVEYFREQ Procedure SAS/STAT 14.3 User s Guide The SURVEYFREQ Procedure This document is an individual chapter from SAS/STAT 14.3 User s Guide. The correct bibliographic citation for this manual is as follows: SAS Institute

More information

SAS/STAT 13.1 User s Guide. The SURVEYFREQ Procedure

SAS/STAT 13.1 User s Guide. The SURVEYFREQ Procedure SAS/STAT 13.1 User s Guide The SURVEYFREQ Procedure This document is an individual chapter from SAS/STAT 13.1 User s Guide. The correct bibliographic citation for the complete manual is as follows: SAS

More information

D-Optimal Designs. Chapter 888. Introduction. D-Optimal Design Overview

D-Optimal Designs. Chapter 888. Introduction. D-Optimal Design Overview Chapter 888 Introduction This procedure generates D-optimal designs for multi-factor experiments with both quantitative and qualitative factors. The factors can have a mixed number of levels. For example,

More information

2) familiarize you with a variety of comparative statistics biologists use to evaluate results of experiments;

2) familiarize you with a variety of comparative statistics biologists use to evaluate results of experiments; A. Goals of Exercise Biology 164 Laboratory Using Comparative Statistics in Biology "Statistics" is a mathematical tool for analyzing and making generalizations about a population from a number of individual

More information

THE L.L. THURSTONE PSYCHOMETRIC LABORATORY UNIVERSITY OF NORTH CAROLINA. Forrest W. Young & Carla M. Bann

THE L.L. THURSTONE PSYCHOMETRIC LABORATORY UNIVERSITY OF NORTH CAROLINA. Forrest W. Young & Carla M. Bann Forrest W. Young & Carla M. Bann THE L.L. THURSTONE PSYCHOMETRIC LABORATORY UNIVERSITY OF NORTH CAROLINA CB 3270 DAVIE HALL, CHAPEL HILL N.C., USA 27599-3270 VISUAL STATISTICS PROJECT WWW.VISUALSTATS.ORG

More information

Data Management - 50%

Data Management - 50% Exam 1: SAS Big Data Preparation, Statistics, and Visual Exploration Data Management - 50% Navigate within the Data Management Studio Interface Register a new QKB Create and connect to a repository Define

More information

Chapter 2. Descriptive Statistics: Organizing, Displaying and Summarizing Data

Chapter 2. Descriptive Statistics: Organizing, Displaying and Summarizing Data Chapter 2 Descriptive Statistics: Organizing, Displaying and Summarizing Data Objectives Student should be able to Organize data Tabulate data into frequency/relative frequency tables Display data graphically

More information

Lecture Notes 3: Data summarization

Lecture Notes 3: Data summarization Lecture Notes 3: Data summarization Highlights: Average Median Quartiles 5-number summary (and relation to boxplots) Outliers Range & IQR Variance and standard deviation Determining shape using mean &

More information

Feature Selection. CE-725: Statistical Pattern Recognition Sharif University of Technology Spring Soleymani

Feature Selection. CE-725: Statistical Pattern Recognition Sharif University of Technology Spring Soleymani Feature Selection CE-725: Statistical Pattern Recognition Sharif University of Technology Spring 2013 Soleymani Outline Dimensionality reduction Feature selection vs. feature extraction Filter univariate

More information

PharmaSUG Paper SP04

PharmaSUG Paper SP04 PharmaSUG 2015 - Paper SP04 Means Comparisons and No Hard Coding of Your Coefficient Vector It Really Is Possible! Frank Tedesco, United Biosource Corporation, Blue Bell, Pennsylvania ABSTRACT When doing

More information

Soci Statistics for Sociologists

Soci Statistics for Sociologists University of North Carolina Chapel Hill Soci708-001 Statistics for Sociologists Fall 2009 Professor François Nielsen Stata Commands for Module 7 Inference for Distributions For further information on

More information

STAT 3304/5304 Introduction to Statistical Computing. Introduction to SAS

STAT 3304/5304 Introduction to Statistical Computing. Introduction to SAS STAT 3304/5304 Introduction to Statistical Computing Introduction to SAS What is SAS? SAS (originally an acronym for Statistical Analysis System, now it is not an acronym for anything) is a program designed

More information

An Econometric Study: The Cost of Mobile Broadband

An Econometric Study: The Cost of Mobile Broadband An Econometric Study: The Cost of Mobile Broadband Zhiwei Peng, Yongdon Shin, Adrian Raducanu IATOM13 ENAC January 16, 2014 Zhiwei Peng, Yongdon Shin, Adrian Raducanu (UCLA) The Cost of Mobile Broadband

More information

Sta$s$cs & Experimental Design with R. Barbara Kitchenham Keele University

Sta$s$cs & Experimental Design with R. Barbara Kitchenham Keele University Sta$s$cs & Experimental Design with R Barbara Kitchenham Keele University 1 Comparing two or more groups Part 5 2 Aim To cover standard approaches for independent and dependent groups For two groups Student

More information

The SAS interface is shown in the following screen shot:

The SAS interface is shown in the following screen shot: The SAS interface is shown in the following screen shot: There are several items of importance shown in the screen shot First there are the usual main menu items, such as File, Edit, etc I seldom use anything

More information

Nonparametric Testing

Nonparametric Testing Nonparametric Testing in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. mark@excelmasterseries.com

More information

Introductory Guide to SAS:

Introductory Guide to SAS: Introductory Guide to SAS: For UVM Statistics Students By Richard Single Contents 1 Introduction and Preliminaries 2 2 Reading in Data: The DATA Step 2 2.1 The DATA Statement............................................

More information

Table Of Contents. Table Of Contents

Table Of Contents. Table Of Contents Statistics Table Of Contents Table Of Contents Basic Statistics... 7 Basic Statistics Overview... 7 Descriptive Statistics Available for Display or Storage... 8 Display Descriptive Statistics... 9 Store

More information

SAS Structural Equation Modeling 1.3 for JMP

SAS Structural Equation Modeling 1.3 for JMP SAS Structural Equation Modeling 1.3 for JMP SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2012. SAS Structural Equation Modeling 1.3 for JMP. Cary,

More information

Binary Diagnostic Tests Clustered Samples

Binary Diagnostic Tests Clustered Samples Chapter 538 Binary Diagnostic Tests Clustered Samples Introduction A cluster randomization trial occurs when whole groups or clusters of individuals are treated together. In the twogroup case, each cluster

More information

STATA 13 INTRODUCTION

STATA 13 INTRODUCTION STATA 13 INTRODUCTION Catherine McGowan & Elaine Williamson LONDON SCHOOL OF HYGIENE & TROPICAL MEDICINE DECEMBER 2013 0 CONTENTS INTRODUCTION... 1 Versions of STATA... 1 OPENING STATA... 1 THE STATA

More information

Right-click on whatever it is you are trying to change Get help about the screen you are on Help Help Get help interpreting a table

Right-click on whatever it is you are trying to change Get help about the screen you are on Help Help Get help interpreting a table Q Cheat Sheets What to do when you cannot figure out how to use Q What to do when the data looks wrong Right-click on whatever it is you are trying to change Get help about the screen you are on Help Help

More information

Using PROC SQL to Generate Shift Tables More Efficiently

Using PROC SQL to Generate Shift Tables More Efficiently ABSTRACT SESUG Paper 218-2018 Using PROC SQL to Generate Shift Tables More Efficiently Jenna Cody, IQVIA Shift tables display the change in the frequency of subjects across specified categories from baseline

More information

Week 4: Simple Linear Regression III

Week 4: Simple Linear Regression III Week 4: Simple Linear Regression III Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Goodness of

More information

Chapter 15 Mixed Models. Chapter Table of Contents. Introduction Split Plot Experiment Clustered Data References...

Chapter 15 Mixed Models. Chapter Table of Contents. Introduction Split Plot Experiment Clustered Data References... Chapter 15 Mixed Models Chapter Table of Contents Introduction...309 Split Plot Experiment...311 Clustered Data...320 References...326 308 Chapter 15. Mixed Models Chapter 15 Mixed Models Introduction

More information

Exam Questions A00-281

Exam Questions A00-281 Exam Questions A00-281 SAS Certified Clinical Trials Programmer Using SAS 9 Accelerated Version https://www.2passeasy.com/dumps/a00-281/ 1.Given the following data at WORK DEMO: Which SAS program prints

More information

Multiple Regression White paper

Multiple Regression White paper +44 (0) 333 666 7366 Multiple Regression White paper A tool to determine the impact in analysing the effectiveness of advertising spend. Multiple Regression In order to establish if the advertising mechanisms

More information

It s Proc Tabulate Jim, but not as we know it!

It s Proc Tabulate Jim, but not as we know it! Paper SS02 It s Proc Tabulate Jim, but not as we know it! Robert Walls, PPD, Bellshill, UK ABSTRACT PROC TABULATE has received a very bad press in the last few years. Most SAS Users have come to look on

More information

Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242

Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242 Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242 Creation & Description of a Data Set * 4 Levels of Measurement * Nominal, ordinal, interval, ratio * Variable Types

More information

Regression Lab 1. The data set cholesterol.txt available on your thumb drive contains the following variables:

Regression Lab 1. The data set cholesterol.txt available on your thumb drive contains the following variables: Regression Lab The data set cholesterol.txt available on your thumb drive contains the following variables: Field Descriptions ID: Subject ID sex: Sex: 0 = male, = female age: Age in years chol: Serum

More information

Frequently Asked Questions Updated 2006 (TRIM version 3.51) PREPARING DATA & RUNNING TRIM

Frequently Asked Questions Updated 2006 (TRIM version 3.51) PREPARING DATA & RUNNING TRIM Frequently Asked Questions Updated 2006 (TRIM version 3.51) PREPARING DATA & RUNNING TRIM * Which directories are used for input files and output files? See menu-item "Options" and page 22 in the manual.

More information

Chapter 3 Analyzing Normal Quantitative Data

Chapter 3 Analyzing Normal Quantitative Data Chapter 3 Analyzing Normal Quantitative Data Introduction: In chapters 1 and 2, we focused on analyzing categorical data and exploring relationships between categorical data sets. We will now be doing

More information

Statistics 1 - Basic Commands. Basic Commands. Consider the data set: {15, 22, 32, 31, 52, 41, 11}

Statistics 1 - Basic Commands. Basic Commands. Consider the data set: {15, 22, 32, 31, 52, 41, 11} Statistics 1 - Basic Commands http://mathbits.com/mathbits/tisection/statistics1/basiccommands.htm Page 1 of 3 Entering Data: Basic Commands Consider the data set: {15, 22, 32, 31, 52, 41, 11} Data is

More information

Enter your UID and password. Make sure you have popups allowed for this site.

Enter your UID and password. Make sure you have popups allowed for this site. Log onto: https://apps.csbs.utah.edu/ Enter your UID and password. Make sure you have popups allowed for this site. You may need to go to preferences (right most tab) and change your client to Java. I

More information

Tasks Menu Reference. Introduction. Data Management APPENDIX 1

Tasks Menu Reference. Introduction. Data Management APPENDIX 1 229 APPENDIX 1 Tasks Menu Reference Introduction 229 Data Management 229 Report Writing 231 High Resolution Graphics 232 Low Resolution Graphics 233 Data Analysis 233 Planning Tools 235 EIS 236 Remote

More information

MHPE 494: Data Analysis. Welcome! The Analytic Process

MHPE 494: Data Analysis. Welcome! The Analytic Process MHPE 494: Data Analysis Alan Schwartz, PhD Department of Medical Education Memoona Hasnain,, MD, PhD, MHPE Department of Family Medicine College of Medicine University of Illinois at Chicago Welcome! Your

More information

Confidence interval for sample mean = Upper and lower confidence interval for sample standard deviation = Sample standard error =

Confidence interval for sample mean = Upper and lower confidence interval for sample standard deviation = Sample standard error = A Macro To Perform A T-Test For 2 Independent Samples Using Sufficient Statistics Lan-Feng Tsai, Edwards Lifesciences LLC, Irvine, California Abstract The T-test is a commonly used statistical test to

More information

STA 570 Spring Lecture 5 Tuesday, Feb 1

STA 570 Spring Lecture 5 Tuesday, Feb 1 STA 570 Spring 2011 Lecture 5 Tuesday, Feb 1 Descriptive Statistics Summarizing Univariate Data o Standard Deviation, Empirical Rule, IQR o Boxplots Summarizing Bivariate Data o Contingency Tables o Row

More information

Using SAS Macros to Extract P-values from PROC FREQ

Using SAS Macros to Extract P-values from PROC FREQ SESUG 2016 ABSTRACT Paper CC-232 Using SAS Macros to Extract P-values from PROC FREQ Rachel Straney, University of Central Florida This paper shows how to leverage the SAS Macro Facility with PROC FREQ

More information

Factorial ANOVA with SAS

Factorial ANOVA with SAS Factorial ANOVA with SAS /* potato305.sas */ options linesize=79 noovp formdlim='_' ; title 'Rotten potatoes'; title2 ''; proc format; value tfmt 1 = 'Cool' 2 = 'Warm'; data spud; infile 'potato2.data'

More information

Predict Outcomes and Reveal Relationships in Categorical Data

Predict Outcomes and Reveal Relationships in Categorical Data PASW Categories 18 Specifications Predict Outcomes and Reveal Relationships in Categorical Data Unleash the full potential of your data through predictive analysis, statistical learning, perceptual mapping,

More information

Eksamen ERN4110, 6/ VEDLEGG SPSS utskrifter til oppgavene (Av plasshensyn kan utskriftene være noe redigert)

Eksamen ERN4110, 6/ VEDLEGG SPSS utskrifter til oppgavene (Av plasshensyn kan utskriftene være noe redigert) Eksamen ERN4110, 6/9-2018 VEDLEGG SPSS utskrifter til oppgavene (Av plasshensyn kan utskriftene være noe redigert) 1 Oppgave 1 Datafila I SPSS: Variabelnavn Beskrivelse Kjønn Kjønn (1=Kvinne, 2=Mann) Studieinteresse

More information

A new graphic for one-way ANOVA

A new graphic for one-way ANOVA A new graphic for one-way ANOVA Bob Pruzek, SUNY Albany [July 2011] This document describes and illustrates a new elemental graphic for one-way analysis of variance, i.e., ANOVA. The primary motivation

More information

( ) = Y ˆ. Calibration Definition A model is calibrated if its predictions are right on average: ave(response Predicted value) = Predicted value.

( ) = Y ˆ. Calibration Definition A model is calibrated if its predictions are right on average: ave(response Predicted value) = Predicted value. Calibration OVERVIEW... 2 INTRODUCTION... 2 CALIBRATION... 3 ANOTHER REASON FOR CALIBRATION... 4 CHECKING THE CALIBRATION OF A REGRESSION... 5 CALIBRATION IN SIMPLE REGRESSION (DISPLAY.JMP)... 5 TESTING

More information

THE ALIGNED RANK TRANSFORM PROCEDURE

THE ALIGNED RANK TRANSFORM PROCEDURE Kansas State University Libraries pplied Statistics in Agriculture 1990-2nd Annual Conference Proceedings THE ALIGNED RANK TRANSFORM PROCEDURE James J. Higgins R. Clifford Blair Suleiman Tashtoush Follow

More information