Inter-Rater Agreement in Medical Studies: Analysis Tools within the SAS System
|
|
- Anis Gregory
- 5 years ago
- Views:
Transcription
1 Inter-Rater Agreement in Medical Studies: Analysis Tools within the SAS System Dipl.Ing. Dr. Barbara Schneider, University of Vienna, Austria Abstract In this presentation the intraclass correlation coefficient, a graphical technique to check agreement visually and the κ-measure are discussed. Emphasis is laid on how these statistical methods and techniques can be performed within the SAS System and if they are not available within SAS Software how they can be implemented. Data from clinical trials are presented. SAS code is reported using the following tools: PROC GLM, PROC GPLOT, PROC FREQ, Macro Language and SAS/IML. Introduction Measurement in medical studies is very often an imprecise task and the errors associated with this should be quantified and understood. Since measurement error can seriously affect statistical analyses and interpretation, it is important to assess the amount of such error by calculating a riability index. To investigate agreement in continuous variables a graphical technique proposed by Bland and Altman (1986) and the intraclass correlation coefficient are used. For categorial data the κ-measure is appropriate. Studies comparing two or more methods are common. The aim of these studies is to see if the methods agree well enough for one method to replace the others or for the methods to be used interchangeably. The same considerations apply to studies comparing two or more observers using one method. An important aspect of method or observer comparison is the comparison of the repeatability of each observer or method. Graphical Technique The inter-observer (within observer or between methods) agreement is demonstrated by a graphical technique suggested by Bland and Altman (Lancet, Feb. 8, 1989).
2 In order to assess inter-observer agreement graphically, the difference between the two measurements of each of the two observers are plotted against the mean of the two values of the observers for each patient. Limits of agreement, defined as twice the standard deviation of the difference between the observers, are calculated and plotted in the figure. If we suppose that these differences follow a normal distribution, 95% of the differences will lie between these limits. To demonstrate the procedure, data from Altman (1991) are used comparing two measurement methods. Measurements of transmitral volumetric flow (MF) by Doppler echocardiography and left ventricular stroke volume (SV) by crosssectional echocardiography in 21 patients without aortic valve disease are shown. The following program implements this technique within the SAS System and produces this plot: DIFF MEAN
3 goptions reset=(axis, legend, pattern, symbol, title, footnote) norotate hpos=0 vpos=0 htext= ftext= ctext= target= gaccess= gsfmode= ; title; data ms; input patient mf sv; cards; ; proc print data=o; data o; set o; d1=m_diff+2*s_diff; d2=m_diff-2*s_diff; call symput("m1",m_diff); call symput("d1",d1); call symput("d2",d2); symbol1 c=black v=circle i=none l=1; symbol2 c=black v=none i=join l=1; proc gplot data=temp; plot diff*mean=1 r*mean=2 / overlay vref= &m1 &d1 &d2 lvref=2 cvref=black caxis=black vaxis=&l to &u by &s ; title; %mend p; %p(mf,sv,-20,20,10); quit; %macro p(x,y,l,u,s); data temp; set ms; mean=mean(&x,&y); diff=&y-&x; r=0; /*in order to show perfect agreement: diff=0 */ proc univariate data=temp normal; var mean diff ; output out=o mean= m_mean m_diff std= s_mean s_diff ;
4 Intraclass correlation coefficient To quantify agreement the intraclass correlation coefficient (ICC) can be used for continuous variables. Roughly speaking the ICC can be viewed as a ratio of the variance of interest over the sum of the variance of interest plus error. Analysis of variance models are applied to calculate the ICC. To estimate the repeatability of each method or observer a one-way random effect model is performed. The resulting variance components are taken to calculate the ICC. Let x ij denote the ith rating (repetition) (i=1,...,r) on the jth target (e.g. patient) (j=1,...,n). Then the following linear model is assumed : x ij = µ + b j + e ij. In this equation, the component µ is the overall population mean of the ratings. b j is the difference from µ of the mean across the repeated jth target ratings. e ij represents the residual component. The component b j is assumed to vary normally with a mean of zero and a variance of σ T 2. It is also assumed that the e ij terms are distributed independently and normally with a mean of zero and a variance of σ E 2. The ICC takes the form of the following ratio: ICC = σ T 2 / (σ T 2 + σ E 2 ) To estimate the ICC variance components have to be estimated. PROC VARCOMP within the SAS System computes the necessary variance components but unfortunately does not provide an output data set. Therefore PROC GLM has to be used. The next program shows how to produce an estimate of the ICC within the SAS System.
5 data msu ; set libname.name; %macro i(t,y); data f ; keep t y; set msu; y=&y; /* the variable containing the scores of interest is mapped to an internal variable y */ t= &t; /* the variable containing targets (random effect ) information is mapped to t*/ proc glm data=f outstat=s1 noprint; class t ; model y= t ; random t ; data s1; set s1 ; mss=ss/df; if _TYPE_="SS3" or _TYPE_="ERROR"; proc transpose data=s1 out=o1 prefix=ms_; var mss; id _SOURCE_; proc transpose data=s1 out=o2 prefix=df_; var df; id _SOURCE_; data icc1; merge o1 o2; r_1=(df_error)/(df_t+1); r=r_1+1; vare=ms_error; vart=(ms_t-vare)/r; ICC=vart/(vart+vare); title2 "estimation of the ICC - scoring variable : &y"; proc print data=icc1 noobs; var icc; title2; %mend i; %i(tname,varname); Investigating the agreement between two or more observers or methods a twoway random effects model is appropriate. Let x ijl denote the ith rating (repetition) (i=1,...,r) on the jth target (e.g. patient) (j=1,...,n) of the lth observer (method) (l=1,...,k). Then the following linear model is assumed : x ijl = µ + a l + b j +(ab) lj + e ijl. In this equation, the component µ is the overall population mean of the ratings. a l is the difference from µ of the mean of the lth judge s ratings. b j is defined as before. (ab) lj is the interaction term between judges and targets. e ijl represents the error term. The component a l is assumed to vary normally with a mean of zero
6 and a variance of σ 2 J. b j is assumed to vary normally with a mean of zero and a variance of σ 2 T. (ab) lj are assumed to be mutually independent with a mean of zero and a variance of σ 2 I It is also assumed that e ijl are distributed independently and normally with a mean of zero and a variance of σ 2 E. Then the ICC takes the form of the following ratio: ICC = σ T 2 / (σ T 2 +σ J 2 + σ I 2 + σ E 2 ) In the absence of repeated ratings by each judge on each target, the interaction and error components cannot be estimated separately. This situation will be taken into account using the program statement if ms_error=. then ms_error=0; The following program shows how to produce an estimate of the ICC within the SAS System : data msu ; set libname.name; %macro i(t,j,y); data f ; keep t j y; set msu; y=&y; t=&t; j=&j; /* variable containing judge information */ proc glm data=f outstat=s2 ; class t j; model y= j t t*j; random t j t*j ; data s2; set s2 ; mss=ss/df; if _TYPE_="SS3" or _TYPE_="ERROR"; proc transpose data=s2 out=o3 prefix=ms_; var mss; id _SOURCE_; id _SOURCE_; data icc2; merge o3 o4; n=df_t+1; k_1=df_j; k=df_j+1; r=(1+df_error/(n*k)); if ms_error=. then ms_error=0; /* formal reasons*/ vare=ms_error; vari=(ms_t_j-vare)/r; varj=(ms_j-ms_t_j)/(r*n); vart=(ms_t-ms_t_j)/(r*k); icc=vart/(vart+varj+vari+vare); title2 estimation of the ICC - scoring variable : &y ; proc print data=icc2 noobs; title2; %mend i; %i(tname,jname,varname); proc transpose data=s2 out=o4 prefix=df_; var df;
7 Example As before data from measurements of transmitral volumetric flow (MF) by Doppler echocardiography and left ventricular stroke volume (SV) by crosssectional echocardiography in 21 patients without aortic valve disease are taken to estimate the ICC. This is a method comparison analysis. For each patient (target) and per method (judge) only one measurement is taken. In this situation the interaction and the error component cannot be estimated separately as stated above. The dataset ms is multivariate structured. To make the information stored in ms available for computing the ICC it has to be reshaped: data msu ; keep patient methode value; set ms; methode=1; value=mf; output; methode=2; value=sv; output; Running the macro: %i(patient,methode,value); produces the output: estimation of the ICC - scoring variable : value ICC κ-measure
8 Agreement between categorial assessments is usually considered as a problem of comparing the ability of different raters (observers, judges) to classify subjects (targets) into one of several groups. The investgations outlined below will also apply to method comparison studies for catigorial data; studies that compare two alternative categorization schemes. Categorial data can be formed in a contingency table where the row levels represent the ratings of one observer (method) and the column levels the ratings of onother observer (method). Suppose p ij is the probability of a target being classified in the ith category by n the first observer and the jth category by the second observer. Then p 0 = is p ii i = 1 the probability that the observers agree. If the ratings were independent, the n probability of agreement would be p e = p p. Thus p 0 - p e is the amount of i= 1 i.. i agreement beyond that expected by chance. The κ-measure (κ coefficient) is defined as: κ = p p 0 e. The κ - measure of agreement can be interpreted as 1 pe the chance corrected proportional agreement. A weakness of the κ is that it takes no account of the degree of disagreement i.e. all disagreements are treated (weighted) equally. When the categories are ordered (ordinal scaled data), it is sometimes preferable to put different weights to disagreements according to the magnitude of discrepancy. This idea leads to the so called weighted κ. Weighted κ is obtained by weighting the frequencies in each cell of the table according to their distance from the diagonal that indicates agreement. For a detailed discussion see e.g. Fleiss (1981). PROC FREQ within the SAS System can be used to compute the κ measures. Example Data show the classification by two radiologists of 85 xeromammograms. 0 corresponds to normal, 1 to benign diseas, 2 to suspicion of cancer and 3 to cancer.
9 /* For n*n Tables */ /* Simple and Weighted kappa */ /* Altman p.403 Bsp. Tab.14.2 */ data k1; input x y weight; cards; ; proc freq data=k1; tables x*y / agree; weight weight; This results in the output: TABLE OF X BY Y X Y Frequency Percent Row Pct Col Pct Total ƒƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒˆ ƒƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒˆ ƒƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒˆ ƒƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒˆ ƒƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒˆƒƒƒƒƒƒƒƒˆ Total STATISTICS FOR TABLE OF X BY Y Test of Symmetry Statistic = DF = 6 Prob = Kappa Coefficients Statistic Value ASE 95% Confidence Bounds ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
10 Simple Kappa Weighted Kappa Sample Size = 85 The following IML code shows an alternative approach for computing the κ - measure including estimates of the standard error and 95% confidence limits. Comparing the IML approach to PROC FREQ it provides more flexibility in specifying weights. Another advantage is that the IML program is not restricted to n*n tables. If it seems to make sense r*s tables can be used. Restrictions for data entry are that the values of the score variables have to range from 0 to n and the dimension of the smallest squared table in which the resulting r*s table fits has to be specified. A detailed description of the mathematical formulas used in the IML statements is given in Fleiss (1981). /* Not only for n*n Tables */ /* Weighted kappa */ /* Altman S.403 Example Tab.14.2 */ %let dim=4; /* weights for simple kappa*/ %let w=%str( { , , , } ); data index; do ind1=1 to &dim; do ind2=1 to &dim; output; end; end; data k1; input x y weight; cards; ; data k; set k1; drop weight i; do i=1 to weight; output; end; proc freq data=k; This results in the output: tables y*x / out=m1 noprint; proc sort data=m1; by x y; data m1; set m1; ind1=x+1; ind2=y+1; data m2; merge index m1; by ind1 ind2; data m2; keep ind1 ind2 count; set m2; if count=. then count=0; proc transpose data=m2 out=mat prefix=c; var count; id ind2; by ind1; data mat; keep c1-c&dim; set mat; /*** IML ***/ proc iml; w=&w; use mat; read all into x; n=sum(x); x1=x#w; po=(1/n)*sum(x1); csum=x[+,]; rsum=x[,+]; KAPPA y=rsum*csum; y_=csum*rsum; y=(1/n)*y; p=(1/n)*y; p_=(1/n)*(1/n)*y_; pcsum=p[+,]; prsum=p[,+]; wi=w#prsum; wj=w#pcsum; wqi=wj[,+]; wqj=wi[+,]; wij=j(&dim,&dim); do i=1 to &dim; do j=1 to &dim; wij[i,j]=wqi[i,]+wqj[,j]; end; end; wq=(w-wij)##2; pw=p#wq; pws=sum(pw); y1=y#w; pe=(1/n)*sum(y1); pke=sum(p_); kappa=(po-pe)/(1-pe); print kappa; var0=(1/(n*(1-pke)*(1- pke)))*(pws-pke*pke); std=sqrt(var0); cl_l=kappa-1.96*std; cl_u=kappa+1.96*std; print var0 std; print cl_l cl_u; quit;
11 VAR0 STD CL_L CL_U SAS, Base SAS, SAS/STAT, SAS/GRAPH and SAS/IML software are registered trademarks of SAS Institute Inc., Cary, NC, USA Address for correspondence: Dipl.Ing. Dr. Barbara Schneider Institut für Medizinische Statistik Universität Wien Schwarzspanierstr. 17 A Wien Austria
12 REFERENCES: Bland J.M., Altman D. G. (1986) Statistical Methods for Assessing Agreement between two Methods of Clinical Measurement, THE LANCET, Feb. 8, 1986 Altman D. G. (1991) Practical Statistics for Medical Research, 1st edn. Chapman&Hall Fleiss J. L. et al. (1979), Inter-examiner Reliability in Caries Trials, J Dent Res 58(29: , February 1979 Fleiss J. L. (1981) Statistical Methods for Rates and Proportions, 2nd edn. New York: Wiley Shrout P. E., Fleiss J. L. (1979) Intraclass Correlations: Uses in Assessing Rater Reliability,Psychological Bulletin 1979, Vol. 86, No. 2, SAS Language: Reference, Version 6, First Edition SAS Procedures Guide, Version 6, Third Edition SAS/GRAPH Software: Reference, Version 6, First Edition, Volumes 1 and 2 SAS/IML Software: Usage and Reference, Version 6, First Edition SAS/STAT Software: Reference, Version 6, First Edition, Volumes 1 and 2
Teaching statistics and the SAS System
Teaching statistics and the SAS System Ass. Prof. Dipl.Ing. Dr. Barbara Schneider Institute for Medical Statistics, University of Vienna,Vienna, Austria Abstract This presentation offers ideas and examples
More informationSAS/STAT 13.1 User s Guide. The NESTED Procedure
SAS/STAT 13.1 User s Guide The NESTED Procedure This document is an individual chapter from SAS/STAT 13.1 User s Guide. The correct bibliographic citation for the complete manual is as follows: SAS Institute
More informationSpatial Patterns Point Pattern Analysis Geographic Patterns in Areal Data
Spatial Patterns We will examine methods that are used to analyze patterns in two sorts of spatial data: Point Pattern Analysis - These methods concern themselves with the location information associated
More informationGraphical Techniques for Displaying Multivariate Data
Graphical Techniques for Displaying Multivariate Data James R. Schwenke Covance Periapproval Services, Inc. Brian J. Fergen Pfizer Inc * Abstract When measuring several response variables, multivariate
More informationR FUNCTIONS IN SCRIPT FILE agree.coeff2.r
B.1 The R Software. - 3 - R FUNCTIONS IN SCRIPT FILE agree.coeff2.r If your analysis is limited to two raters, then you may organize your data in a contingency table that shows the count of subjects by
More informationBiostat Methods STAT 5820/6910 Handout #4: Chi-square, Fisher s, and McNemar s Tests
Biostat Methods STAT 5820/6910 Handout #4: Chi-square, Fisher s, and McNemar s Tests Example 1: 152 patients were randomly assigned to 4 dose groups in a clinical study. During the course of the study,
More informationPicturing Statistics Diana Suhr, University of Northern Colorado
Picturing Statistics Diana Suhr, University of Northern Colorado Abstract Statistical results could be easier to understand if you visualize them. This Hands On Workshop will give you an opportunity to
More informationFrequencies, Unequal Variance Weights, and Sampling Weights: Similarities and Differences in SAS
ABSTRACT Paper 1938-2018 Frequencies, Unequal Variance Weights, and Sampling Weights: Similarities and Differences in SAS Robert M. Lucas, Robert M. Lucas Consulting, Fort Collins, CO, USA There is confusion
More informationThe NESTED Procedure (Chapter)
SAS/STAT 9.3 User s Guide The NESTED Procedure (Chapter) SAS Documentation This document is an individual chapter from SAS/STAT 9.3 User s Guide. The correct bibliographic citation for the complete manual
More information186 Statistics, Data Analysis and Modeling. Proceedings of MWSUG '95
A Statistical Analysis Macro Library in SAS Carl R. Haske, Ph.D., STATPROBE, nc., Ann Arbor, M Vivienne Ward, M.S., STATPROBE, nc., Ann Arbor, M ABSTRACT Statistical analysis plays a major role in pharmaceutical
More informationWant to Do a Better Job? - Select Appropriate Statistical Analysis in Healthcare Research
Want to Do a Better Job? - Select Appropriate Statistical Analysis in Healthcare Research Liping Huang, Center for Home Care Policy and Research, Visiting Nurse Service of New York, NY, NY ABSTRACT The
More informationChapter 1 Introduction. Chapter Contents
Chapter 1 Introduction Chapter Contents OVERVIEW OF SAS/STAT SOFTWARE................... 17 ABOUT THIS BOOK.............................. 17 Chapter Organization............................. 17 Typographical
More informationA Generalized Procedure to Create SAS /Graph Error Bar Plots
Generalized Procedure to Create SS /Graph Error Bar Plots Sanjiv Ramalingam, Consultant, Octagon Research Solutions, Inc. BSTRCT Different methodologies exist to create error bar related plots. Procedures
More informationOnline Supplementary Appendix for. Dziak, Nahum-Shani and Collins (2012), Multilevel Factorial Experiments for Developing Behavioral Interventions:
Online Supplementary Appendix for Dziak, Nahum-Shani and Collins (2012), Multilevel Factorial Experiments for Developing Behavioral Interventions: Power, Sample Size, and Resource Considerations 1 Appendix
More informationOutline. Topic 16 - Other Remedies. Ridge Regression. Ridge Regression. Ridge Regression. Robust Regression. Regression Trees. Piecewise Linear Model
Topic 16 - Other Remedies Ridge Regression Robust Regression Regression Trees Outline - Fall 2013 Piecewise Linear Model Bootstrapping Topic 16 2 Ridge Regression Modification of least squares that addresses
More informationContents of SAS Programming Techniques
Contents of SAS Programming Techniques Chapter 1 About SAS 1.1 Introduction 1.1.1 SAS modules 1.1.2 SAS module classification 1.1.3 SAS features 1.1.4 Three levels of SAS techniques 1.1.5 Chapter goal
More informationTHIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010
THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL STOR 455 Midterm September 8, INSTRUCTIONS: BOTH THE EXAM AND THE BUBBLE SHEET WILL BE COLLECTED. YOU MUST PRINT YOUR NAME AND SIGN THE HONOR PLEDGE
More informationAn introduction to SPSS
An introduction to SPSS To open the SPSS software using U of Iowa Virtual Desktop... Go to https://virtualdesktop.uiowa.edu and choose SPSS 24. Contents NOTE: Save data files in a drive that is accessible
More informationApplied Regression Modeling: A Business Approach
i Applied Regression Modeling: A Business Approach Computer software help: SAS code SAS (originally Statistical Analysis Software) is a commercial statistical software package based on a powerful programming
More informationSAS CLINICAL SYLLABUS. DURATION: - 60 Hours
SAS CLINICAL SYLLABUS DURATION: - 60 Hours BASE SAS PART - I Introduction To Sas System & Architecture History And Various Modules Features Variables & Sas Syntax Rules Sas Data Sets Data Set Options Operators
More informationSTA 570 Spring Lecture 5 Tuesday, Feb 1
STA 570 Spring 2011 Lecture 5 Tuesday, Feb 1 Descriptive Statistics Summarizing Univariate Data o Standard Deviation, Empirical Rule, IQR o Boxplots Summarizing Bivariate Data o Contingency Tables o Row
More informationDisplaying Multiple Graphs to Quickly Assess Patient Data Trends
Paper AD11 Displaying Multiple Graphs to Quickly Assess Patient Data Trends Hui Ping Chen and Eugene Johnson, Eli Lilly and Company, Indianapolis, IN ABSTRACT Populating multiple graphs, up to 15, on a
More informationBinary Diagnostic Tests Clustered Samples
Chapter 538 Binary Diagnostic Tests Clustered Samples Introduction A cluster randomization trial occurs when whole groups or clusters of individuals are treated together. In the twogroup case, each cluster
More informationTwo useful macros to nudge SAS to serve you
Two useful macros to nudge SAS to serve you David Izrael, Michael P. Battaglia, Abt Associates Inc., Cambridge, MA Abstract This paper offers two macros that augment the power of two SAS procedures: LOGISTIC
More informationUsing PROC SQL to Generate Shift Tables More Efficiently
ABSTRACT SESUG Paper 218-2018 Using PROC SQL to Generate Shift Tables More Efficiently Jenna Cody, IQVIA Shift tables display the change in the frequency of subjects across specified categories from baseline
More informationStatistical Pattern Recognition
Statistical Pattern Recognition Features and Feature Selection Hamid R. Rabiee Jafar Muhammadi Spring 2012 http://ce.sharif.edu/courses/90-91/2/ce725-1/ Agenda Features and Patterns The Curse of Size and
More informationA SAS Macro for measuring and testing global balance of categorical covariates
A SAS Macro for measuring and testing global balance of categorical covariates Camillo, Furio and D Attoma,Ida Dipartimento di Scienze Statistiche, Università di Bologna via Belle Arti,41-40126- Bologna,
More informationSAS Training BASE SAS CONCEPTS BASE SAS:
SAS Training BASE SAS CONCEPTS BASE SAS: Dataset concept and creating a dataset from internal data Capturing data from external files (txt, CSV and tab) Capturing Non-Standard data (date, time and amounts)
More informationMultivariate Capability Analysis
Multivariate Capability Analysis Summary... 1 Data Input... 3 Analysis Summary... 4 Capability Plot... 5 Capability Indices... 6 Capability Ellipse... 7 Correlation Matrix... 8 Tests for Normality... 8
More informationMultiple Regression White paper
+44 (0) 333 666 7366 Multiple Regression White paper A tool to determine the impact in analysing the effectiveness of advertising spend. Multiple Regression In order to establish if the advertising mechanisms
More informationChoosing the Right Procedure
3 CHAPTER 1 Choosing the Right Procedure Functional Categories of Base SAS Procedures 3 Report Writing 3 Statistics 3 Utilities 4 Report-Writing Procedures 4 Statistical Procedures 5 Efficiency Issues
More informationVALIDITY OF 95% t-confidence INTERVALS UNDER SOME TRANSECT SAMPLING STRATEGIES
Libraries Conference on Applied Statistics in Agriculture 1996-8th Annual Conference Proceedings VALIDITY OF 95% t-confidence INTERVALS UNDER SOME TRANSECT SAMPLING STRATEGIES Stephen N. Sly Jeffrey S.
More informationA Format to Make the _TYPE_ Field of PROC MEANS Easier to Interpret Matt Pettis, Thomson West, Eagan, MN
Paper 045-29 A Format to Make the _TYPE_ Field of PROC MEANS Easier to Interpret Matt Pettis, Thomson West, Eagan, MN ABSTRACT: PROC MEANS analyzes datasets according to the variables listed in its Class
More informationSTAT 5200 Handout #25. R-Square & Design Matrix in Mixed Models
STAT 5200 Handout #25 R-Square & Design Matrix in Mixed Models I. R-Square in Mixed Models (with Example from Handout #20): For mixed models, the concept of R 2 is a little complicated (and neither PROC
More informationStatistics and Data Analysis. Common Pitfalls in SAS Statistical Analysis Macros in a Mass Production Environment
Common Pitfalls in SAS Statistical Analysis Macros in a Mass Production Environment Huei-Ling Chen, Merck & Co., Inc., Rahway, NJ Aiming Yang, Merck & Co., Inc., Rahway, NJ ABSTRACT Four pitfalls are commonly
More informationBland-Altman Plot and Analysis
Chapter 04 Bland-Altman Plot and Analysis Introduction The Bland-Altman (mean-difference or limits of agreement) plot and analysis is used to compare two measurements of the same variable. That is, it
More informationModule 3: SAS. 3.1 Initial explorative analysis 02429/MIXED LINEAR MODELS PREPARED BY THE STATISTICS GROUPS AT IMM, DTU AND KU-LIFE
St@tmaster 02429/MIXED LINEAR MODELS PREPARED BY THE STATISTICS GROUPS AT IMM, DTU AND KU-LIFE Module 3: SAS 3.1 Initial explorative analysis....................... 1 3.1.1 SAS JMP............................
More informationWindows Application Using.NET and SAS to Produce Custom Rater Reliability Reports Sailesh Vezzu, Educational Testing Service, Princeton, NJ
Windows Application Using.NET and SAS to Produce Custom Rater Reliability Reports Sailesh Vezzu, Educational Testing Service, Princeton, NJ ABSTRACT Some of the common measures used to monitor rater reliability
More informationData Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski
Data Analysis and Solver Plugins for KSpread USER S MANUAL Tomasz Maliszewski tmaliszewski@wp.pl Table of Content CHAPTER 1: INTRODUCTION... 3 1.1. ABOUT DATA ANALYSIS PLUGIN... 3 1.3. ABOUT SOLVER PLUGIN...
More informationCluster Randomization Create Cluster Means Dataset
Chapter 270 Cluster Randomization Create Cluster Means Dataset Introduction A cluster randomization trial occurs when whole groups or clusters of individuals are treated together. Examples of such clusters
More informationEvaluation Measures. Sebastian Pölsterl. April 28, Computer Aided Medical Procedures Technische Universität München
Evaluation Measures Sebastian Pölsterl Computer Aided Medical Procedures Technische Universität München April 28, 2015 Outline 1 Classification 1. Confusion Matrix 2. Receiver operating characteristics
More informationRanking Between the Lines
Ranking Between the Lines A %MACRO for Interpolated Medians By Joe Lorenz SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in
More informationPaper CC-016. METHODOLOGY Suppose the data structure with m missing values for the row indices i=n-m+1,,n can be re-expressed by
Paper CC-016 A macro for nearest neighbor Lung-Chang Chien, University of North Carolina at Chapel Hill, Chapel Hill, NC Mark Weaver, Family Health International, Research Triangle Park, NC ABSTRACT SAS
More informationStatistics and Data Analysis. Paper
Paper 264-27 Using SAS to Perform Robust I-Sample Analysis of Means Type Randomization Tests for Variances for Unbalanced Designs Peter Wludyka, University of North Florida, Jacksonville, FL Ping Sa, University
More informationIntermediate SAS: Statistics
Intermediate SAS: Statistics OIT TSS 293-4444 oithelp@mail.wvu.edu oit.wvu.edu/training/classmat/sas/ Table of Contents Procedures... 2 Two-sample t-test:... 2 Paired differences t-test:... 2 Chi Square
More informationCut Out The Cut And Paste: SAS Macros For Presenting Statistical Output ABSTRACT INTRODUCTION
Cut Out The Cut And Paste: SAS Macros For Presenting Statistical Output Myungshin Oh, UCLA Department of Biostatistics Mel Widawski, UCLA School of Nursing ABSTRACT We, as statisticians, often spend more
More information8. MINITAB COMMANDS WEEK-BY-WEEK
8. MINITAB COMMANDS WEEK-BY-WEEK In this section of the Study Guide, we give brief information about the Minitab commands that are needed to apply the statistical methods in each week s study. They are
More informationMAT 110 WORKSHOP. Updated Fall 2018
MAT 110 WORKSHOP Updated Fall 2018 UNIT 3: STATISTICS Introduction Choosing a Sample Simple Random Sample: a set of individuals from the population chosen in a way that every individual has an equal chance
More informationWorkload Characterization Techniques
Workload Characterization Techniques Raj Jain Washington University in Saint Louis Saint Louis, MO 63130 Jain@cse.wustl.edu These slides are available on-line at: http://www.cse.wustl.edu/~jain/cse567-08/
More informationStatistical Pattern Recognition
Statistical Pattern Recognition Features and Feature Selection Hamid R. Rabiee Jafar Muhammadi Spring 2013 http://ce.sharif.edu/courses/91-92/2/ce725-1/ Agenda Features and Patterns The Curse of Size and
More informationSimulating Multivariate Normal Data
Simulating Multivariate Normal Data You have a population correlation matrix and wish to simulate a set of data randomly sampled from a population with that structure. I shall present here code and examples
More informationStatements with the Same Function in Multiple Procedures
67 CHAPTER 3 Statements with the Same Function in Multiple Procedures Overview 67 Statements 68 BY 68 FREQ 70 QUIT 72 WEIGHT 73 WHERE 77 Overview Several statements are available and have the same function
More informationBluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition
Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Online Learning Centre Technology Step-by-Step - Minitab Minitab is a statistical software application originally created
More informationSAS PROC GLM and PROC MIXED. for Recovering Inter-Effect Information
SAS PROC GLM and PROC MIXED for Recovering Inter-Effect Information Walter T. Federer Biometrics Unit Cornell University Warren Hall Ithaca, NY -0 biometrics@comell.edu Russell D. Wolfinger SAS Institute
More informationChapter 13 Multivariate Techniques. Chapter Table of Contents
Chapter 13 Multivariate Techniques Chapter Table of Contents Introduction...279 Principal Components Analysis...280 Canonical Correlation...289 References...298 278 Chapter 13. Multivariate Techniques
More informationQuantitative - One Population
Quantitative - One Population The Quantitative One Population VISA procedures allow the user to perform descriptive and inferential procedures for problems involving one population with quantitative (interval)
More informationProduct Catalog. AcaStat. Software
Product Catalog AcaStat Software AcaStat AcaStat is an inexpensive and easy-to-use data analysis tool. Easily create data files or import data from spreadsheets or delimited text files. Run crosstabulations,
More informationData Annotations in Clinical Trial Graphs Sudhir Singh, i3 Statprobe, Cary, NC
PharmaSUG2010 - Paper TT16 Data Annotations in Clinical Trial Graphs Sudhir Singh, i3 Statprobe, Cary, NC ABSTRACT Graphical representation of clinical data is used for concise visual presentations of
More informationThe DMSPLIT Procedure
The DMSPLIT Procedure The DMSPLIT Procedure Overview Procedure Syntax PROC DMSPLIT Statement FREQ Statement TARGET Statement VARIABLE Statement WEIGHT Statement Details Examples Example 1: Creating a Decision
More informationUsing Templates Created by the SAS/STAT Procedures
Paper 081-29 Using Templates Created by the SAS/STAT Procedures Yanhong Huang, Ph.D. UMDNJ, Newark, NJ Jianming He, Solucient, LLC., Berkeley Heights, NJ ABSTRACT SAS procedures provide a large quantity
More informationSAS Macros CORR_P and TANGO: Interval Estimation for the Difference Between Correlated Proportions in Dependent Samples
Paper SD-03 SAS Macros CORR_P and TANGO: Interval Estimation for the Difference Between Correlated Proportions in Dependent Samples Patricia Rodríguez de Gil, Jeanine Romano Thanh Pham, Diep Nguyen, Jeffrey
More informationCoders' Corner. Paper ABSTRACT GLOBAL STATEMENTS INTRODUCTION
Paper 70-26 Data Visualization of Outliers from a Health Research Perspective Using SAS/GRAPH and the Annotate Facility Nadia Redmond Kaiser Permanente Center for Health Research, Portland, Oregon ABSTRACT
More informationRobust Linear Regression (Passing- Bablok Median-Slope)
Chapter 314 Robust Linear Regression (Passing- Bablok Median-Slope) Introduction This procedure performs robust linear regression estimation using the Passing-Bablok (1988) median-slope algorithm. Their
More informationUsing SAS Macro to Include Statistics Output in Clinical Trial Summary Table
Using SAS Macro to Include Statistics Output in Clinical Trial Summary Table Amy C. Young, Ischemia Research and Education Foundation, San Francisco, CA Sharon X. Zhou, Ischemia Research and Education
More informationCreating Forest Plots Using SAS/GRAPH and the Annotate Facility
PharmaSUG2011 Paper TT12 Creating Forest Plots Using SAS/GRAPH and the Annotate Facility Amanda Tweed, Millennium: The Takeda Oncology Company, Cambridge, MA ABSTRACT Forest plots have become common in
More informationFor example, the system. 22 may be represented by the augmented matrix
Matrix Solutions to Linear Systems A matrix is a rectangular array of elements. o An array is a systematic arrangement of numbers or symbols in rows and columns. Matrices (the plural of matrix) may be
More informationPaper SDA-11. Logistic regression will be used for estimation of net error for the 2010 Census as outlined in Griffin (2005).
Paper SDA-11 Developing a Model for Person Estimation in Puerto Rico for the 2010 Census Coverage Measurement Program Colt S. Viehdorfer, U.S. Census Bureau, Washington, DC This report is released to inform
More informationCLUSTER ANALYSIS. V. K. Bhatia I.A.S.R.I., Library Avenue, New Delhi
CLUSTER ANALYSIS V. K. Bhatia I.A.S.R.I., Library Avenue, New Delhi-110 012 In multivariate situation, the primary interest of the experimenter is to examine and understand the relationship amongst the
More informationSAS data statements and data: /*Factor A: angle Factor B: geometry Factor C: speed*/
STAT:5201 Applied Statistic II (Factorial with 3 factors as 2 3 design) Three-way ANOVA (Factorial with three factors) with replication Factor A: angle (low=0/high=1) Factor B: geometry (shape A=0/shape
More informationStatistical Pattern Recognition
Statistical Pattern Recognition Features and Feature Selection Hamid R. Rabiee Jafar Muhammadi Spring 2014 http://ce.sharif.edu/courses/92-93/2/ce725-2/ Agenda Features and Patterns The Curse of Size and
More informationIntroduction to Statistical Analyses in SAS
Introduction to Statistical Analyses in SAS Programming Workshop Presented by the Applied Statistics Lab Sarah Janse April 5, 2017 1 Introduction Today we will go over some basic statistical analyses in
More informationMultiple Graphical and Tabular Reports on One Page, Multiple Ways to Do It Niraj J Pandya, CT, USA
Paper TT11 Multiple Graphical and Tabular Reports on One Page, Multiple Ways to Do It Niraj J Pandya, CT, USA ABSTRACT Creating different kind of reports for the presentation of same data sounds a normal
More informationAnalysis of Variance in R
nalysis of Variance in R Dale arr R Training: University of Glasgow Dale arr (R Training: University of Glasgow) nalysis of Variance in R 1 / 19 When is NOV applicable? When you wish to assess the independent/joint
More informationThe STANDARD Procedure
1135 CHAPTER 35 The STANDARD Procedure Overview 1135 Procedure Syntax 1137 PROC STANDARD Statement 1138 BY Statement 1140 FREQ Statement 1141 VAR Statement 1141 WEIGHT Statement 1142 Results 1142 Missing
More informationThe EMCLUS Procedure. The EMCLUS Procedure
The EMCLUS Procedure Overview Procedure Syntax PROC EMCLUS Statement VAR Statement INITCLUS Statement Output from PROC EMCLUS EXAMPLES-SECTION Example 1: Syntax for PROC FASTCLUS Example 2: Use of the
More informationIntroduction to SAS. I. Understanding the basics In this section, we introduce a few basic but very helpful commands.
Center for Teaching, Research and Learning Research Support Group American University, Washington, D.C. Hurst Hall 203 rsg@american.edu (202) 885-3862 Introduction to SAS Workshop Objective This workshop
More informationAn Efficient Method to Create Titles for Multiple Clinical Reports Using Proc Format within A Do Loop Youying Yu, PharmaNet/i3, West Chester, Ohio
PharmaSUG 2012 - Paper CC12 An Efficient Method to Create Titles for Multiple Clinical Reports Using Proc Format within A Do Loop Youying Yu, PharmaNet/i3, West Chester, Ohio ABSTRACT Do you know how to
More informationIAT 355 Visual Analytics. Data and Statistical Models. Lyn Bartram
IAT 355 Visual Analytics Data and Statistical Models Lyn Bartram Exploring data Example: US Census People # of people in group Year # 1850 2000 (every decade) Age # 0 90+ Sex (Gender) # Male, female Marital
More informationSPSS INSTRUCTION CHAPTER 9
SPSS INSTRUCTION CHAPTER 9 Chapter 9 does no more than introduce the repeated-measures ANOVA, the MANOVA, and the ANCOVA, and discriminant analysis. But, you can likely envision how complicated it can
More informationResponse to API 1163 and Its Impact on Pipeline Integrity Management
ECNDT 2 - Tu.2.7.1 Response to API 3 and Its Impact on Pipeline Integrity Management Munendra S TOMAR, Martin FINGERHUT; RTD Quality Services, USA Abstract. Knowing the accuracy and reliability of ILI
More informationExploring and Understanding Data Using R.
Exploring and Understanding Data Using R. Loading the data into an R data frame: variable
More informationA SAS Macro to Generate Caterpillar Plots. Guochen Song, i3 Statprobe, Cary, NC
PharmaSUG2010 - Paper CC21 A SAS Macro to Generate Caterpillar Plots Guochen Song, i3 Statprobe, Cary, NC ABSTRACT Caterpillar plots are widely used in meta-analysis and it only requires a click in software
More informationDATA CLASSIFICATORY TECHNIQUES
DATA CLASSIFICATORY TECHNIQUES AMRENDER KUMAR AND V.K.BHATIA Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-110 012 akjha@iasri.res.in 1. Introduction Rudimentary, exploratory
More informationAcquisition Description Exploration Examination Understanding what data is collected. Characterizing properties of data.
Summary Statistics Acquisition Description Exploration Examination what data is collected Characterizing properties of data. Exploring the data distribution(s). Identifying data quality problems. Selecting
More informationAutomating the Plotting of Significant Effects in Analysis of Variance
Automating the Plotting of Significant Effects in Analysis of Variance Huixing Tang & Matthew Grover, The Psychological Corporation ABSTRACT The current SAS procedures for analysis of variance (e.g., PROC
More informationMIXED_RELIABILITY: A SAS Macro for Estimating Lambda and Assessing the Trustworthiness of Random Effects in Multilevel Models
SESUG 2015 Paper SD189 MIXED_RELIABILITY: A SAS Macro for Estimating Lambda and Assessing the Trustworthiness of Random Effects in Multilevel Models Jason A. Schoeneberger ICF International Bethany A.
More informationLaboratory for Two-Way ANOVA: Interactions
Laboratory for Two-Way ANOVA: Interactions For the last lab, we focused on the basics of the Two-Way ANOVA. That is, you learned how to compute a Brown-Forsythe analysis for a Two-Way ANOVA, as well as
More informationGetting to Know Your Data
Chapter 2 Getting to Know Your Data 2.1 Exercises 1. Give three additional commonly used statistical measures (i.e., not illustrated in this chapter) for the characterization of data dispersion, and discuss
More informationClustering and Visualisation of Data
Clustering and Visualisation of Data Hiroshi Shimodaira January-March 28 Cluster analysis aims to partition a data set into meaningful or useful groups, based on distances between data points. In some
More informationPaper Abstract. Introduction. SAS Version 7/8 Web Tools. Using ODS to Create HTML Formatted Output. Background
Paper 43-25 The International Studies Project : SAS Version 7/8 Web Tools To The Rescue Lilin She, UNC-CH, Department Of Biostatistics, Chapel Hill, NC Jeffrey M. Abolafia, UNC-CH, Department Of Biostatistics,
More informationGeneralized Least Squares (GLS) and Estimated Generalized Least Squares (EGLS)
Generalized Least Squares (GLS) and Estimated Generalized Least Squares (EGLS) Linear Model in matrix notation for the population Y = Xβ + Var ( ) = In GLS, the error covariance matrix is known In EGLS
More informationApplied Multivariate Analysis
Department of Mathematics and Statistics, University of Vaasa, Finland Spring 2017 Choosing Statistical Method 1 Choice an appropriate method 2 Cross-tabulation More advance analysis of frequency tables
More informationPoisson Regressions for Complex Surveys
Poisson Regressions for Complex Surveys Overview Researchers often use sample survey methodology to obtain information about a large population by selecting and measuring a sample from that population.
More informationModule I: Clinical Trials a Practical Guide to Design, Analysis, and Reporting 1. Fundamentals of Trial Design
Module I: Clinical Trials a Practical Guide to Design, Analysis, and Reporting 1. Fundamentals of Trial Design Randomized the Clinical Trails About the Uncontrolled Trails The protocol Development The
More informationChapter 13 Introduction to Graphics Using SAS/GRAPH (Self-Study)
Chapter 13 Introduction to Graphics Using SAS/GRAPH (Self-Study) 13.1 Introduction... 2 13.2 Creating Bar and Pie Charts... 8 13.3 Creating Plots... 20 13-2 Chapter 13 Introduction to Graphics Using SAS/GRAPH
More informationFurther Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables
Further Maths Notes Common Mistakes Read the bold words in the exam! Always check data entry Remember to interpret data with the multipliers specified (e.g. in thousands) Write equations in terms of variables
More informationLab #9: ANOVA and TUKEY tests
Lab #9: ANOVA and TUKEY tests Objectives: 1. Column manipulation in SAS 2. Analysis of variance 3. Tukey test 4. Least Significant Difference test 5. Analysis of variance with PROC GLM 6. Levene test for
More informationA SAS and Java Application for Reporting Clinical Trial Data. Kevin Kane MSc Infoworks (Data Handling) Limited
A SAS and Java Application for Reporting Clinical Trial Data Kevin Kane MSc Infoworks (Data Handling) Limited Reporting Clinical Trials Is Resource Intensive! Reporting a clinical trial program for a new
More informationAn Application of PROC NLP to Survey Sample Weighting
An Application of PROC NLP to Survey Sample Weighting Talbot Michael Katz, Analytic Data Information Technologies, New York, NY ABSTRACT The classic weighting formula for survey respondents compensates
More information2 = Disagree 3 = Neutral 4 = Agree 5 = Strongly Agree. Disagree
PharmaSUG 2012 - Paper HO01 Multiple Techniques for Scoring Quality of Life Questionnaires Brandon Welch, Rho, Inc., Chapel Hill, NC Seungshin Rhee, Rho, Inc., Chapel Hill, NC ABSTRACT In the clinical
More information