Statistics and Data Analysis. Common Pitfalls in SAS Statistical Analysis Macros in a Mass Production Environment
|
|
- Archibald Daniel
- 6 years ago
- Views:
Transcription
1 Common Pitfalls in SAS Statistical Analysis Macros in a Mass Production Environment Huei-Ling Chen, Merck & Co., Inc., Rahway, NJ Aiming Yang, Merck & Co., Inc., Rahway, NJ ABSTRACT Four pitfalls are commonly encountered in the statistical analysis of clinical trials. The first pitfall can occur in a cross-over study when the McNemar's test is used to estimate the significance level of treatment group differences for a binary outcome result. The second pitfall concerns the ordering in PROC LOGISTIC. The third pitfall involves PROC GLM and its handling of character CLASS variables. The fourth pitfall relates to non-estimable issues with PROC GLM. In a massive clinical trial program production environment, we need to watch for these pitfalls when recycling, reusing, or creating macros. The four pitfalls are described in detail. Recommendations and SAS coding solutions are provided. KEYWORDS Clinical Trial, Cross Over Study, McNemar's Test, PROC LOGISTIC, PROC GLM INTRODUCTION In clinical trials, statistical programmers usually have to deliver a lot of statistical analysis results in tight timelines. Very often a programmer would use some existing macros. In this type of mass production environment, it is important that the macros are robust and generate correct results. This paper lists several undesirable scenarios and provides possible methods/techniques to prevent them from happening. For example, in a cross over study, the McNemar's test is used to estimate the significance level between treatments for a binary outcome result. Usually the testing is carried out on a 2 X 2 frequency table. For some relatively rare events, the single column or single row happens. In this case, a warning message will pop out in the log. To prevent this warning message, a small screening step within the macro could be added beforehand to decide whether the method should be applied. Another example is to use the DESCENDING option properly in a PROC LOGISTIC procedure. The third example is that character treatment group variable, sometimes, could cause a wrong estimation in a CONTRAST or ESTIMATE statement in a PROC GLM procedure. The last example is to recommend using LSMeanDIFFCL in PROC GLM to avoid a potential problem caused by multicollinear data. It is important that every programmer examine existing programs carefully and make an effort to enhance the robustness of new macros. EXAMPLE 1: MCNEMAR'S TEST IN PROC FREQ When estimating the significance level between treatments for a binary outcome result for a 2 * 2 crossover design, the McNemar's test is often to be used. The 2-way adverse event (AE) table is like the following: Treatment A (AE1) With AE without AE Treatment B With AE a b (AE2) without AE c d The McNemar's test statistic is shown below: The following code could carry out the testing. proc freq data=temp; weight count; table AE1*AE2/agree; output out=mcnemar(keep= _MCNEM_ P_MCNEM) agree; Pitfall In clinical trials, safety analysis is a must. The clinician and statistician would like to know whether the occurrence rate of an adverse event is more likely to occur in one treatment arm than another. Since the response variable is the occurrence of an adverse event, due to the nature of the data and the relative rarity of the event, it is highly possible to have cells containing zero in the 2 by 2 table. For example, in the following pseudo data, 100 patients are grouped into two groups: 50 patients 1
2 receive treatment A in the first period and switch to treatment B in the second period. Another 50 patients are in treatment B in the first period and switch to treatment A in the second period. Assume that 98 patients do not have any adverse event in both periods. Two patients have an adverse event in treatment A. This means there is no adverse event for treatment B. The 2 way table would be like the following: Treatment B Treatment A With AE without AE With AE 0 0 without AE 2 98 PROC FREQ will not be able to carry out the analysis if either variable in the 2 * 2 table has less than 2 non-missing levels. There will be a note in the log indicating the data is not suitable to perform the test. SAS will not create a dataset if there is an output statement in the PROC FREQ procedure. Hence, there will be a warning message. NOTE: No statistics are computed for ae1 * ae2 since ae2 has less than 2 nonmissing levels. WARNING: No OUTPUT data set is produced because no statistics can be computed for this table, which has a row or column variable with less than 2 nonmissing levels. For another example, in the same pseudo data, 100 patients are grouped into two groups: 50 patients receive treatment A in the first period and switch to treatment B in the second period. Another 50 patients are in treatment B in the first period and switch to treatment A in the second period. Assume that 98 patients do not have any adverse event in both periods. Two patients have the adverse event in both the treatment A and treatment B period. This means that there is no patient who has the adverse event only in one treatment period. The 2-way table would be like the following: Treatment B Treatment A With AE without AE With AE 2 0 without AE 0 98 The McNemar's test would not be able to carry out the analysis if the data in a 2 * 2 table having zero frequencies in the offdiagonal cells. If a NOPRINT option is specified, there will be a note message in the log indicating there are no discordant data to compute McNemar's test. NOTE: There are no discordant pairs when computing McNemar's test, for the table of ae1 by ae2. However, SAS still will create a dataset when there is an output statement in the PROC FREQ procedure, though the data contain missing values. Suggested Solution These two examples show that a warning message is not necessarily produced if a test could not be carried out. If the data result in a single row or single column in a 2-way table, there will be a warning message. If the data satisfy the 2 X 2 table layout but with zero frequency in off-diagonal cells, the warning message will not show. A common practice for presenting the statistics when a test could not be carried out is simply put a 'N/A' message in the table output. For the first example, this paper provides the following technique to avoid the warning message. proc sql ; select sum(count) into :countr0 from temp where AE1=0; select sum(count) into :countr1 from temp where AE1=1; select sum(count) into :countc0 from temp where AE2=0; select sum(count) into :countc1 from temp where AE2=1; 2
3 %if (&countr0=0 or &countr1=0 or &countc0=0 or &countc1=0) %then %do; data McNemar; _MCNEM_ =.; P_MCNEM =.; %end; %else %do; proc freq data=temp noprint; weight count; table AE1*AE2/agree; output out=mcnemar(keep=_mcnem_ P_MCNEM) agree; %end; For the second example, there is no warning message. Programmers and statisticians should carefully check the SAS output, or, if the NOPRINT is specified, check the note message related to the "No discordant pair'" information. EXAMPLE 2: PROC LOGISTIC / DESCENDING In clinical trial or health care research, a binary variable is often the key result of interest in the study. In a clinical trial, for example, the binary result could be whether there is an AE event or not. In health care research, the binary result could be if a patient enters the long term care facility after being discharged from the hospital. PROC LOGISTIC is frequently used to estimate the impact from the various factors on the probability of having this event. Prior to using the procedure, it is important to know the value of the binary dependent variable. The DESCENDING option should be added to the PROC LOGISTIC statement if the binary dependent variable has value 1 (YES) vs. 0 (NO). The code is as following: proc logistic data=raw outstat=_outstat descending; class trt; model resp2 = trt age; On the other hand, the DESCENDING option should not be added if the binary dependent variable has value 1 (YES) vs. 2 (NO). If one does not know the data and blindly uses the procedure, the result might be interpreted in the wrong direction because the meaning of the sign of the coefficient is opposite from what is assumed. To compare the result of using DESCENDING with the result of not using it, here are three examples. Pitfall One pseudo dataset is created to present the issue. Example 1: Correct Approach --- binary value 1 (YES) vs. 0 (NO) / with DESCENDING option Standard Wald Parameter DF Estimate Error Chi-Square Pr > ChiSq Intercept trt trt trt age
4 Example 2: Correct Approach --- binary value 1 (YES) vs. 2 (NO) / without DESCENDING option Standard Wald Parameter DF Estimate Error Chi-Square Pr > ChiSq Intercept trt trt trt age Example 3: Incorrect Approach --- binary value 0 (NO) vs. 1 (YES) / without DESCENDING option Standard Wald Parameter DF Estimate Error Chi-Square Pr > ChiSq Intercept trt trt trt age The first two outputs have exactly same results because they are estimating the same thing the probability of having a binary response of YES. In the third output, the signs of the estimates are opposite from expected, because this incorrect approach is estimating the probability of having binary response as NO. Suggested Solution It is likely that many SAS/STAT users know that the DESCENDING option in PROC LOGISTIC model should be added if the binary response variable takes value 1 and 0. However, sometimes people use a macro borrowed from other studies, without checking whether the data are appropriate to use beforehand. To avoid this type of mistake, an ideal macro should alert the user to consider the use of the DESCENDING option in the PROC LOGISTIC model to prevent the wrong interpretation. One solution to avoid the misuse of the procedure is a bulletproofing. See below: proc sql noprint; select distinct(&var) into :resp separated by ' ' from raw; %if %scan(&resp,1) ne 0 or %scan(&resp,2) ne 1 or %length(%sysfunc(compress(&resp))) ne 2 %then %do; %put *** the response variable is not 0 / 1 ***; %end; There are many possible approaches to implement this procedure. How to proceed in the macro is up to the macro developer's preference. One possible approach is to stop executing the macro. Another Approach is to drop the DESCENDING statement from the model. If the binary response variable has value 1 (YES) vs. 2 (NO), converting 2 to 0 is also an appropriate solution. EXAMPLE 3: PROC GLM (WITH CHARACTER TREATMENT GROUP) In PROC GLM or PROC MIXED, the coefficient vector estimates β impact from the independent variables on the dependent variables. When testing a null hypothesis using the estimated coefficients, a CONTRAST statement could be added to perform a custom hypothesis test by specifying an L vector or matrix for testing the hypothesis L * β= 0. For example, say there are two treatment groups in a clinical trial study. The null hypothesis is that there is no difference between treatments in the percentage change from baseline in a lab test. In the CONTRAST statement, the L vector should be specified as the following: 4
5 proc glm data=raw outstat=_outstat; class trt; model pchg = trt / ss3; CONTRAST 'trtdiff' trt 1-1; The vector specified in the CONTRAST statement should align with the ordering of the CLASS variable. Assuming the variable TRT has values 1 and 2, and the coefficients of these two treatments are β1 and β2, then the contrast statement above tests β1 β2 = 0. Without knowing the correct ordering of the CLASS variable, the testing statistics might be calculated for a wrong null hypothesis. Pitfall A dose range studies are common in phase II clinical trials. If the study interest is on the right doses of single primary therapy drug as well and a co-administrated drug, then it is possible to have at least 10 treatment arms in this study. Whether the treatment group variable is a character variable or a numeric variable will cause a difference. The sorting order of the CLASS variables by default is ORDER = FORMATED. For unformatted numeric variables, the levels are ordered by their numeric value. However, for character variables, the sorting order is based on the following order: blank! " # $ % & ' ( ) * +, -. / : ; < = A B C D E F G H I J K L M N O P Q R S T U V W X Y Z[ \] ˆ_ a b c d e f g h i j k l m n o p q r s t u v w x y z { } ~ Based on this rule, if the treatment group variable is a character variable with length 2 bytes and with the following value: "1 ", "2 ", "3 ", "4 ", "5 ", "6 ", "7 ", "8 ", "9 ", "10". The ordering will not be like the ordering of numeric variable: Instead, it will be the following: On the other hand, if the treatment group variable is a character variable with length 2 bytes but with the following value: " 1", " 2", " 3", " 4", " 5", " 6", " 7", " 8", " 9", "10". The ordering will still be what people usually expect: Suggested Solution To see the ordering of the CLASS variable, add option E in CONTRAST statement. This will display the entire L vector and hence it is useful in confirming the ordering of parameters for specifying L. Here is the example code and output with E option. proc glm data=raw outstat=_outstat; class trta; model pchg= trta / ss3; CONTRAST 'TRENDPB_5' trta / E; The GLM Procedure Coefficients for Contrast TRENDPB_5 Row 1 Intercept 0 trta 1-2 5
6 trta 2-1 trta 3 0 trta 4 1 trta 5 2 trta 6 0 trta 7 0 trta 8 0 trta 9 0 trta 10 0 Dependent Variable: pchg Sum of Source DF Squares Mean Square F Value Pr > F Model Error Corrected Total R-Square Coeff Var Root MSE pchg Mean Source DF Type III SS Mean Square F Value Pr > F trta Contrast DF Contrast SS Mean Square F Value Pr > F TRENDPB_ It is recommended having that treatment group variables be numeric because it could avoid the possible wrong use of the contrast statement. EXAMPLE 4: PROC GLM (NON ESTIMABLE) In a clinical trial, efficacy and / or safety are always the main interest. The drug effect could be a categorical variable or a continuous variable. To estimate the treatment difference on a continuous variable, the ANOVA or ANCOVA model is a popular method. In SAS, PROC GLM or PROC MIXED are very often used for this purpose. When data have no multicollinearity issues, there is more than one way to calculate a treatment difference and its confidence level. One way is to use the LSMEANS statement and ODS to generate the output table LSMeanDIFFCL. Another way is to use the ESTIMATE statement. Here is an example of code in which the ODS output statement with LSMeanDIFFCL and LSMEANS statements are used together: ods output LSMeanDIFFCL=lsmdifci; proc glm data=lipids outstat=glmout ; class trt site region; model ldl = trt site region/e1 ss1 ss3; lsmeans trt/pdiff cl; 6
7 The output is here Obs Effect Dependent i j LowerCL Difference UpperCL 1 trt ldl Some statisticians and programmers use the ESTIMATE statement to calculate a confidence interval for a treatment difference. title 'GLM MODEL FOR TABLE CREATION'; ods output Estimates=estimate; proc glm data=lipids outstat=glmout ; class trt site region; model ldl = trt site region/e1 ss1 ss3; estimate 'diff2_1' trt -1 1 ; data rmse; set glmout (where=(_source_ eq 'ERROR') keep=_source_ DF SS); t = tinv(0.975, DF); rmse = sqrt(ss/df); byid = 1; call symput ('rmse', trim(left(put(rmse, 8.2))) ); data estimate; set estimate; byid = 1; data part2; merge estimate rmse; by byid; uci = -(estimate - tinv(.975,df)*stderr) ; lci = -(estimate + tinv(.975,df)*stderr) ; This way could generate exactly the same confidence interval as long as the LSMEANS are estimable. Obs Dependent Parameter Estimate uci lci The two approaches have the same confidence interval. 1 ldl diff2_ Pitfall Each LS-mean is computed as L * β where L is the coefficient matrix associated with the least-squares mean and β is the estimate of the fixed-effects parameter vector. As in the LSMEANS statement, the L is tested for estimability, and if this test fails, PROC GLM displays "Non-est" for the LS-means entries. It could be that the chosen independent variables are highly correlated. In this case, the model should be corrected. Here, the least squares means are non-estimable, and hence, using the ODS LSMeanDIFFCL statement combined with LSMEANS to retrieve the confidence interval will generate a warning message as following: WARNING: Output 'LSMeanDIFFCL' was not created 7
8 However, if the second way is used here, the output listing would have the following output, though the warning message won't show up. The GLM Procedure Least Squares Means trt ldl LSMEAN 1 Non-est 2 Non-est trt ldl LSMEAN 95% Confidence Limits Without checking the SAS output, this way still could generate a confidence interval but that is not valid. Obs Dependent Parameter Estimate uci lci 1 ldl diff2_ In a mass production environment, some statisticians and programmers may only check if the log has a "WARNING" or "ERROR" messages. In this scenario, an inappropriate model as in the example might not be noticed. Suggested Solution People should not just rely on checking if there are "WARNING" or "ERROR" messages in the log. They should read the output listings to see if the model is appropriate for the data. Using the ODS LSMeanDIFFCL statement combined with LSMEANS to retrieve the confidence interval is recommended to detect the Non-estimable issues. To be safe, programmers should not only check the log file but also output listing file for any "Non-est" message. CONCLUSION When generating large amounts, sometimes it could be hundreds or thousands, of analysis reports, the common practice is to save the output to the PDF or RTF or WORD document. Prior to delivery of the tables to the reviewer, there are several quality checks that should be done. First, the front line programmers and statisticians should review the SAS logs and SAS output listings. Second, the manager should review the tables. However, in a tight timeline, programmers and statisticians may end up only checking for warning or error message in the log file. A manager who reviews the results might just look at the numbers in the final tables. In this kind of a quality checking chain, checking the SAS output is often skipped. However, that missing step may be actually the most essential part. The first scenario illustrates how an expected warning message could be avoided. The second and third ones show how the model could be used incorrectly even though there is no warning and error message in the log. The last scenario demonstrates that even though there is more than one way in SAS to produce results, different methods provide certain advantages. In this paper, one of the advantages shown is that the method used could detect the data problem and the user alerted that there was an issue with the model used. REFERENCES Wuwei Wayne Feng and Dong Ding SAS@ APPLICATION IN 2 * 2 CROSSOVER CLINICAL TRIAL, in Proceedings of the Pharmaceutical SAS Users Group Conference (PharmaSUG 2004) John Troxell Bulletproofing and Knowledge Encapsulation in Statistical Macros, in Proceedings of the Pharmaceutical SAS Users Group Conference (PharmaSUG 2002) ACKNOWLEDGMENTS The author would like to thank John Troxell and Beilei Xu of Merck Research Laboratories for their Advices on this paper/presentation. CONTACT INFORMATION Your comments and questions are valued and encouraged. Contact the authors at: 8
9 Huei-Ling Chen Merck & Co., Inc. 126 Lincoln Avenue P.O. Box 2000 Rahway, NJ Phone: Aiming Yang Merck & Co., Inc. 126 Lincoln Avenue P.O. Box 2000 Rahway, NJ Phone: TRADEMARK SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. indicates USA registration. Other brand and product names are trademarks of their respective companies. 9
CC13 An Automatic Process to Compare Files. Simon Lin, Merck & Co., Inc., Rahway, NJ Huei-Ling Chen, Merck & Co., Inc., Rahway, NJ
CC13 An Automatic Process to Compare Files Simon Lin, Merck & Co., Inc., Rahway, NJ Huei-Ling Chen, Merck & Co., Inc., Rahway, NJ ABSTRACT Comparing different versions of output files is often performed
More informationPharmaSUG Paper SP04
PharmaSUG 2015 - Paper SP04 Means Comparisons and No Hard Coding of Your Coefficient Vector It Really Is Possible! Frank Tedesco, United Biosource Corporation, Blue Bell, Pennsylvania ABSTRACT When doing
More informationWant to Do a Better Job? - Select Appropriate Statistical Analysis in Healthcare Research
Want to Do a Better Job? - Select Appropriate Statistical Analysis in Healthcare Research Liping Huang, Center for Home Care Policy and Research, Visiting Nurse Service of New York, NY, NY ABSTRACT The
More informationFactorial ANOVA. Skipping... Page 1 of 18
Factorial ANOVA The potato data: Batches of potatoes randomly assigned to to be stored at either cool or warm temperature, infected with one of three bacterial types. Then wait a set period. The dependent
More informationA Taste of SDTM in Real Time
A Taste of SDTM in Real Time Changhong Shi, Merck & Co., Inc., Rahway, NJ Beilei Xu, Merck & Co., Inc., Rahway, NJ ABSTRACT The Study Data Tabulation Model (SDTM) is a Clinical Data Interchange Standards
More informationIndenting with Style
ABSTRACT Indenting with Style Bill Coar, Axio Research, Seattle, WA Within the pharmaceutical industry, many SAS programmers rely heavily on Proc Report. While it is used extensively for summary tables
More informationData Edit-checks Integration using ODS Tagset Niraj J. Pandya, Element Technologies Inc., NJ Vinodh Paida, Impressive Systems Inc.
PharmaSUG2011 - Paper DM03 Data Edit-checks Integration using ODS Tagset Niraj J. Pandya, Element Technologies Inc., NJ Vinodh Paida, Impressive Systems Inc., TX ABSTRACT In the Clinical trials data analysis
More informationPooling Clinical Data: Key points and Pitfalls. October 16, 2012 Phuse 2012 conference, Budapest Florence Buchheit
Pooling Clinical Data: Key points and Pitfalls October 16, 2012 Phuse 2012 conference, Budapest Florence Buchheit Introduction Are there any pre-defined rules to pool clinical data? Are there any pre-defined
More informationFactorial ANOVA with SAS
Factorial ANOVA with SAS /* potato305.sas */ options linesize=79 noovp formdlim='_' ; title 'Rotten potatoes'; title2 ''; proc format; value tfmt 1 = 'Cool' 2 = 'Warm'; data spud; infile 'potato2.data'
More informationAn Efficient Tool for Clinical Data Check
PharmaSUG 2018 - Paper AD-16 An Efficient Tool for Clinical Data Check Chao Su, Merck & Co., Inc., Rahway, NJ Shunbing Zhao, Merck & Co., Inc., Rahway, NJ Cynthia He, Merck & Co., Inc., Rahway, NJ ABSTRACT
More informationSAS Training BASE SAS CONCEPTS BASE SAS:
SAS Training BASE SAS CONCEPTS BASE SAS: Dataset concept and creating a dataset from internal data Capturing data from external files (txt, CSV and tab) Capturing Non-Standard data (date, time and amounts)
More informationPharmaSUG Paper TT11
PharmaSUG 2014 - Paper TT11 What is the Definition of Global On-Demand Reporting within the Pharmaceutical Industry? Eric Kammer, Novartis Pharmaceuticals Corporation, East Hanover, NJ ABSTRACT It is not
More informationClinical Data Visualization using TIBCO Spotfire and SAS
ABSTRACT SESUG Paper RIV107-2017 Clinical Data Visualization using TIBCO Spotfire and SAS Ajay Gupta, PPD, Morrisville, USA In Pharmaceuticals/CRO industries, you may receive requests from stakeholders
More informationLet s Get FREQy with our Statistics: Data-Driven Approach to Determining Appropriate Test Statistic
PharmaSUG 2018 - Paper EP-09 Let s Get FREQy with our Statistics: Data-Driven Approach to Determining Appropriate Test Statistic Richann Watson, DataRich Consulting, Batavia, OH Lynn Mullins, PPD, Cincinnati,
More informationUsing PROC SQL to Generate Shift Tables More Efficiently
ABSTRACT SESUG Paper 218-2018 Using PROC SQL to Generate Shift Tables More Efficiently Jenna Cody, IQVIA Shift tables display the change in the frequency of subjects across specified categories from baseline
More informationBiostat Methods STAT 5820/6910 Handout #4: Chi-square, Fisher s, and McNemar s Tests
Biostat Methods STAT 5820/6910 Handout #4: Chi-square, Fisher s, and McNemar s Tests Example 1: 152 patients were randomly assigned to 4 dose groups in a clinical study. During the course of the study,
More informationPharmaSUG Paper PO12
PharmaSUG 2015 - Paper PO12 ABSTRACT Utilizing SAS for Cross-Report Verification in a Clinical Trials Setting Daniel Szydlo, Fred Hutchinson Cancer Research Center, Seattle, WA Iraj Mohebalian, Fred Hutchinson
More informationRepeated Measures Part 4: Blood Flow data
Repeated Measures Part 4: Blood Flow data /* bloodflow.sas */ options linesize=79 pagesize=100 noovp formdlim='_'; title 'Two within-subjecs factors: Blood flow data (NWK p. 1181)'; proc format; value
More information%ANYTL: A Versatile Table/Listing Macro
Paper AD09-2009 %ANYTL: A Versatile Table/Listing Macro Yang Chen, Forest Research Institute, Jersey City, NJ ABSTRACT Unlike traditional table macros, %ANTL has only 3 macro parameters which correspond
More informationAn Efficient Method to Create Titles for Multiple Clinical Reports Using Proc Format within A Do Loop Youying Yu, PharmaNet/i3, West Chester, Ohio
PharmaSUG 2012 - Paper CC12 An Efficient Method to Create Titles for Multiple Clinical Reports Using Proc Format within A Do Loop Youying Yu, PharmaNet/i3, West Chester, Ohio ABSTRACT Do you know how to
More informationAn Introduction to Visit Window Challenges and Solutions
ABSTRACT Paper 125-2017 An Introduction to Visit Window Challenges and Solutions Mai Ngo, SynteractHCR In clinical trial studies, statistical programmers often face the challenge of subjects visits not
More information186 Statistics, Data Analysis and Modeling. Proceedings of MWSUG '95
A Statistical Analysis Macro Library in SAS Carl R. Haske, Ph.D., STATPROBE, nc., Ann Arbor, M Vivienne Ward, M.S., STATPROBE, nc., Ann Arbor, M ABSTRACT Statistical analysis plays a major role in pharmaceutical
More informationProgrammatic Automation of Categorizing and Listing Specific Clinical Terms
SESUG 2012 Paper CT-13 Programmatic Automation of Categorizing and Listing Specific Clinical Terms Ravi Kankipati, Pinnacle Technical Resources, Dallas, TX Abhilash Chimbirithy, Accenture, Florham Park,
More informationThis paper describes a report layout for reporting adverse events by study consumption pattern and explains its programming aspects.
PharmaSUG China 2015 Adverse Event Data Programming for Infant Nutrition Trials Ganesh Lekurwale, Singapore Clinical Research Institute, Singapore Parag Wani, Singapore Clinical Research Institute, Singapore
More informationA Practical and Efficient Approach in Generating AE (Adverse Events) Tables within a Clinical Study Environment
A Practical and Efficient Approach in Generating AE (Adverse Events) Tables within a Clinical Study Environment Abstract Jiannan Hu Vertex Pharmaceuticals, Inc. When a clinical trial is at the stage of
More informationUsing Templates Created by the SAS/STAT Procedures
Paper 081-29 Using Templates Created by the SAS/STAT Procedures Yanhong Huang, Ph.D. UMDNJ, Newark, NJ Jianming He, Solucient, LLC., Berkeley Heights, NJ ABSTRACT SAS procedures provide a large quantity
More informationOne Project, Two Teams: The Unblind Leading the Blind
ABSTRACT PharmaSUG 2017 - Paper BB01 One Project, Two Teams: The Unblind Leading the Blind Kristen Reece Harrington, Rho, Inc. In the pharmaceutical world, there are instances where multiple independent
More informationSAS/STAT 13.1 User s Guide. The NESTED Procedure
SAS/STAT 13.1 User s Guide The NESTED Procedure This document is an individual chapter from SAS/STAT 13.1 User s Guide. The correct bibliographic citation for the complete manual is as follows: SAS Institute
More informationIntroduction to Statistical Analyses in SAS
Introduction to Statistical Analyses in SAS Programming Workshop Presented by the Applied Statistics Lab Sarah Janse April 5, 2017 1 Introduction Today we will go over some basic statistical analyses in
More informationThe NESTED Procedure (Chapter)
SAS/STAT 9.3 User s Guide The NESTED Procedure (Chapter) SAS Documentation This document is an individual chapter from SAS/STAT 9.3 User s Guide. The correct bibliographic citation for the complete manual
More informationUsing SAS Macros to Extract P-values from PROC FREQ
SESUG 2016 ABSTRACT Paper CC-232 Using SAS Macros to Extract P-values from PROC FREQ Rachel Straney, University of Central Florida This paper shows how to leverage the SAS Macro Facility with PROC FREQ
More informationSquare Peg, Square Hole Getting Tables to Fit on Slides in the ODS Destination for PowerPoint
PharmaSUG 2018 - Paper DV-01 Square Peg, Square Hole Getting Tables to Fit on Slides in the ODS Destination for PowerPoint Jane Eslinger, SAS Institute Inc. ABSTRACT An output table is a square. A slide
More informationPharmaSUG Paper PO10
PharmaSUG 2013 - Paper PO10 How to make SAS Drug Development more efficient Xiaopeng Li, Celerion Inc., Lincoln, NE Chun Feng, Celerion Inc., Lincoln, NE Peng Chai, Celerion Inc., Lincoln, NE ABSTRACT
More informationAssessing superiority/futility in a clinical trial: from multiplicity to simplicity with SAS
PharmaSUG2010 Paper SP10 Assessing superiority/futility in a clinical trial: from multiplicity to simplicity with SAS Phil d Almada, Duke Clinical Research Institute (DCRI), Durham, NC Laura Aberle, Duke
More informationAnalysis of Complex Survey Data with SAS
ABSTRACT Analysis of Complex Survey Data with SAS Christine R. Wells, Ph.D., UCLA, Los Angeles, CA The differences between data collected via a complex sampling design and data collected via other methods
More informationPharmaSUG China. model to include all potential prognostic factors and exploratory variables, 2) select covariates which are significant at
PharmaSUG China A Macro to Automatically Select Covariates from Prognostic Factors and Exploratory Factors for Multivariate Cox PH Model Yu Cheng, Eli Lilly and Company, Shanghai, China ABSTRACT Multivariate
More informationWhen Simpler is Better Visualizing Laboratory Data Using SG Procedures Wei Cheng, Isis Pharmaceuticals, Inc., Carlsbad, CA
When Simpler is Better Visualizing Laboratory Data Using SG Procedures Wei Cheng, Isis Pharmaceuticals, Inc., Carlsbad, CA ABSTRACT In SAS 9.2, SAS/GRAPH introduces a family of new procedures to create
More informationPharmaSUG China
PharmaSUG China 2016-39 Smart Statistical Graphics A Comparison Between SAS and TIBCO Spotfire In Data Visualization Yi Gu, Roche Product Development in Asia Pacific, Shanghai, China ABSTRACT Known for
More informationA SAS Macro to Automate the Process of Pooling Sites
A SAS Macro to Automate the Process of Pooling Sites Changhong Shi, Merck & Co., Inc. Rahway, NJ Lili Chen, Merck & Co., Inc. Rahway, NJ ABSTRACT In an international clinical trial, the patients are located
More informationRun your reports through that last loop to standardize the presentation attributes
PharmaSUG2011 - Paper TT14 Run your reports through that last loop to standardize the presentation attributes Niraj J. Pandya, Element Technologies Inc., NJ ABSTRACT Post Processing of the report could
More informationCreating output datasets using SQL (Structured Query Language) only Andrii Stakhniv, Experis Clinical, Ukraine
ABSTRACT PharmaSUG 2015 Paper QT22 Andrii Stakhniv, Experis Clinical, Ukraine PROC SQL is one of the most powerful procedures in SAS. With this tool we can easily manipulate data and create a large number
More informationIntermediate SAS: Statistics
Intermediate SAS: Statistics OIT TSS 293-4444 oithelp@mail.wvu.edu oit.wvu.edu/training/classmat/sas/ Table of Contents Procedures... 2 Two-sample t-test:... 2 Paired differences t-test:... 2 Chi Square
More informationCentering and Interactions: The Training Data
Centering and Interactions: The Training Data A random sample of 150 technical support workers were first given a test of their technical skill and knowledge, and then randomly assigned to one of three
More informationPharmaSUG Paper PO21
PharmaSUG 2015 - Paper PO21 Evaluating SDTM SUPP Domain For AdaM - Trash Can Or Buried Treasure Xiaopeng Li, Celerion, Lincoln, NE Yi Liu, Celerion, Lincoln, NE Chun Feng, Celerion, Lincoln, NE ABSTRACT
More informationAutomated Checking Of Multiple Files Kathyayini Tappeta, Percept Pharma Services, Bridgewater, NJ
PharmaSUG 2015 - Paper QT41 Automated Checking Of Multiple Files Kathyayini Tappeta, Percept Pharma Services, Bridgewater, NJ ABSTRACT Most often clinical trial data analysis has tight deadlines with very
More informationPaper CC-016. METHODOLOGY Suppose the data structure with m missing values for the row indices i=n-m+1,,n can be re-expressed by
Paper CC-016 A macro for nearest neighbor Lung-Chang Chien, University of North Carolina at Chapel Hill, Chapel Hill, NC Mark Weaver, Family Health International, Research Triangle Park, NC ABSTRACT SAS
More informationThere s No Such Thing as Normal Clinical Trials Data, or Is There? Daphne Ewing, Octagon Research Solutions, Inc., Wayne, PA
Paper HW04 There s No Such Thing as Normal Clinical Trials Data, or Is There? Daphne Ewing, Octagon Research Solutions, Inc., Wayne, PA ABSTRACT Clinical Trials data comes in all shapes and sizes depending
More informationPharmaSUG Paper TT10 Creating a Customized Graph for Adverse Event Incidence and Duration Sanjiv Ramalingam, Octagon Research Solutions Inc.
Abstract PharmaSUG 2011 - Paper TT10 Creating a Customized Graph for Adverse Event Incidence and Duration Sanjiv Ramalingam, Octagon Research Solutions Inc. Adverse event (AE) analysis is a critical part
More informationModule I: Clinical Trials a Practical Guide to Design, Analysis, and Reporting 1. Fundamentals of Trial Design
Module I: Clinical Trials a Practical Guide to Design, Analysis, and Reporting 1. Fundamentals of Trial Design Randomized the Clinical Trails About the Uncontrolled Trails The protocol Development The
More informationCell means coding and effect coding
Cell means coding and effect coding /* mathregr_3.sas */ %include 'readmath.sas'; title2 ''; /* The data step continues */ if ethnic ne 6; /* Otherwise, throw the case out */ /* Indicator dummy variables
More informationAutomate Clinical Trial Data Issue Checking and Tracking
PharmaSUG 2018 - Paper AD-31 ABSTRACT Automate Clinical Trial Data Issue Checking and Tracking Dale LeSueur and Krishna Avula, Regeneron Pharmaceuticals Inc. Well organized and properly cleaned data are
More informationReal Time Clinical Trial Oversight with SAS
PharmaSUG 2017 - Paper DA01 Real Time Clinical Trial Oversight with SAS Ashok Gunuganti, Trevena ABSTRACT A clinical trial is an expensive and complex undertaking with multiple teams working together to
More informationAdvanced Visualization using TIBCO Spotfire and SAS
PharmaSUG 2018 - Paper DV-04 ABSTRACT Advanced Visualization using TIBCO Spotfire and SAS Ajay Gupta, PPD, Morrisville, USA In Pharmaceuticals/CRO industries, you may receive requests from stakeholders
More informationSorting big datasets. Do we really need it? Daniil Shliakhov, Experis Clinical, Kharkiv, Ukraine
PharmaSUG 2015 - Paper QT21 Sorting big datasets. Do we really need it? Daniil Shliakhov, Experis Clinical, Kharkiv, Ukraine ABSTRACT Very often working with big data causes difficulties for SAS programmers.
More informationBUSINESS ANALYTICS. 96 HOURS Practical Learning. DexLab Certified. Training Module. Gurgaon (Head Office)
SAS (Base & Advanced) Analytics & Predictive Modeling Tableau BI 96 HOURS Practical Learning WEEKDAY & WEEKEND BATCHES CLASSROOM & LIVE ONLINE DexLab Certified BUSINESS ANALYTICS Training Module Gurgaon
More informationPreparing the Office of Scientific Investigations (OSI) Requests for Submissions to FDA
PharmaSUG 2018 - Paper EP15 Preparing the Office of Scientific Investigations (OSI) Requests for Submissions to FDA Ellen Lin, Wei Cui, Ran Li, and Yaling Teng Amgen Inc, Thousand Oaks, CA ABSTRACT The
More informationCFB: A Programming Pattern for Creating Change from Baseline Datasets Lei Zhang, Celgene Corporation, Summit, NJ
Paper TT13 CFB: A Programming Pattern for Creating Change from Baseline Datasets Lei Zhang, Celgene Corporation, Summit, NJ ABSTRACT In many clinical studies, Change from Baseline analysis is frequently
More informationABSTRACT INTRODUCTION WORK FLOW AND PROGRAM SETUP
A SAS Macro Tool for Selecting Differentially Expressed Genes from Microarray Data Huanying Qin, Laia Alsina, Hui Xu, Elisa L. Priest Baylor Health Care System, Dallas, TX ABSTRACT DNA Microarrays measure
More informationJMP Clinical. Release Notes. Version 5.0
JMP Clinical Version 5.0 Release Notes Creativity involves breaking out of established patterns in order to look at things in a different way. Edward de Bono JMP, A Business Unit of SAS SAS Campus Drive
More informationData Management - 50%
Exam 1: SAS Big Data Preparation, Statistics, and Visual Exploration Data Management - 50% Navigate within the Data Management Studio Interface Register a new QKB Create and connect to a repository Define
More informationIt s Proc Tabulate Jim, but not as we know it!
Paper SS02 It s Proc Tabulate Jim, but not as we know it! Robert Walls, PPD, Bellshill, UK ABSTRACT PROC TABULATE has received a very bad press in the last few years. Most SAS Users have come to look on
More informationA SAS and Java Application for Reporting Clinical Trial Data. Kevin Kane MSc Infoworks (Data Handling) Limited
A SAS and Java Application for Reporting Clinical Trial Data Kevin Kane MSc Infoworks (Data Handling) Limited Reporting Clinical Trials Is Resource Intensive! Reporting a clinical trial program for a new
More informationPharmaSUG China Mina Chen, Roche (China) Holding Ltd.
PharmaSUG China 2017-50 Writing Efficient Queries in SAS Using PROC SQL with Teradata Mina Chen, Roche (China) Holding Ltd. ABSTRACT The emergence of big data, as well as advancements in data science approaches
More informationLab #9: ANOVA and TUKEY tests
Lab #9: ANOVA and TUKEY tests Objectives: 1. Column manipulation in SAS 2. Analysis of variance 3. Tukey test 4. Least Significant Difference test 5. Analysis of variance with PROC GLM 6. Levene test for
More informationA SAS Macro Utility to Modify and Validate RTF Outputs for Regional Analyses Jagan Mohan Achi, PPD, Austin, TX Joshua N. Winters, PPD, Rochester, NY
PharmaSUG 2014 - Paper BB14 A SAS Macro Utility to Modify and Validate RTF Outputs for Regional Analyses Jagan Mohan Achi, PPD, Austin, TX Joshua N. Winters, PPD, Rochester, NY ABSTRACT Clinical Study
More informationAn Approach Finding the Right Tolerance Level for Clinical Data Acceptance
Paper P024 An Approach Finding the Right Tolerance Level for Clinical Data Acceptance Karen Walker, Walker Consulting LLC, Chandler Arizona ABSTRACT Highly broadcasted zero tolerance initiatives for database
More informationPharmaSUG 2013 CC26 Automating the Labeling of X- Axis Sanjiv Ramalingam, Vertex Pharmaceuticals, Inc., Cambridge, MA
PharmaSUG 2013 CC26 Automating the Labeling of X- Axis Sanjiv Ramalingam, Vertex Pharmaceuticals, Inc., Cambridge, MA ABSTRACT Labeling of the X-axis usually involves a tedious axis statement specifying
More informationCreating Forest Plots Using SAS/GRAPH and the Annotate Facility
PharmaSUG2011 Paper TT12 Creating Forest Plots Using SAS/GRAPH and the Annotate Facility Amanda Tweed, Millennium: The Takeda Oncology Company, Cambridge, MA ABSTRACT Forest plots have become common in
More information%MAKE_IT_COUNT: An Example Macro for Dynamic Table Programming Britney Gilbert, Juniper Tree Consulting, Porter, Oklahoma
Britney Gilbert, Juniper Tree Consulting, Porter, Oklahoma ABSTRACT Today there is more pressure on programmers to deliver summary outputs faster without sacrificing quality. By using just a few programming
More informationOne-Step Change from Baseline Calculations
Paper CC08 One-Step Change from Baseline Calculations Nancy Brucken, i3 Statprobe, Ann Arbor, MI ABSTRACT Change from baseline is a common measure of safety and/or efficacy in clinical trials. The traditional
More informationCut Out The Cut And Paste: SAS Macros For Presenting Statistical Output ABSTRACT INTRODUCTION
Cut Out The Cut And Paste: SAS Macros For Presenting Statistical Output Myungshin Oh, UCLA Department of Biostatistics Mel Widawski, UCLA School of Nursing ABSTRACT We, as statisticians, often spend more
More informationBY S NOTSORTED OPTION Karuna Samudral, Octagon Research Solutions, Inc., Wayne, PA Gregory M. Giddings, Centocor R&D Inc.
ABSTRACT BY S NOTSORTED OPTION Karuna Samudral, Octagon Research Solutions, Inc., Wayne, PA Gregory M. Giddings, Centocor R&D Inc., Malvern, PA What if the usual sort and usual group processing would eliminate
More informationPharmaSUG China 2018 Paper AD-62
PharmaSUG China 2018 Paper AD-62 Decomposition and Reconstruction of TLF Shells - A Simple, Fast and Accurate Shell Designer Chengeng Tian, dmed Biopharmaceutical Co., Ltd., Shanghai, China ABSTRACT Table/graph
More informationRecall the expression for the minimum significant difference (w) used in the Tukey fixed-range method for means separation:
Topic 11. Unbalanced Designs [ST&D section 9.6, page 219; chapter 18] 11.1 Definition of missing data Accidents often result in loss of data. Crops are destroyed in some plots, plants and animals die,
More informationGet SAS sy with PROC SQL Amie Bissonett, Pharmanet/i3, Minneapolis, MN
PharmaSUG 2012 - Paper TF07 Get SAS sy with PROC SQL Amie Bissonett, Pharmanet/i3, Minneapolis, MN ABSTRACT As a data analyst for genetic clinical research, I was often working with familial data connecting
More informationCorrectly Compute Complex Samples Statistics
SPSS Complex Samples 15.0 Specifications Correctly Compute Complex Samples Statistics When you conduct sample surveys, use a statistics package dedicated to producing correct estimates for complex sample
More informationWhat Do You Mean My CSV Doesn t Match My SAS Dataset?
SESUG 2016 Paper CC-132 What Do You Mean My CSV Doesn t Match My SAS Dataset? Patricia Guldin, Merck & Co., Inc; Young Zhuge, Merck & Co., Inc. ABSTRACT Statistical programmers are responsible for delivering
More informationGenerating Least Square Means, Standard Error, Observed Mean, Standard Deviation and Confidence Intervals for Treatment Differences using Proc Mixed
Generating Least Square Means, Standard Error, Observed Mean, Standard Deviation and Confidence Intervals for Treatment Differences using Proc Mixed Richann Watson ABSTRACT Have you ever wanted to calculate
More informationPhUse Practical Uses of the DOW Loop in Pharmaceutical Programming Richard Read Allen, Peak Statistical Services, Evergreen, CO, USA
PhUse 2009 Paper Tu01 Practical Uses of the DOW Loop in Pharmaceutical Programming Richard Read Allen, Peak Statistical Services, Evergreen, CO, USA ABSTRACT The DOW-Loop was originally developed by Don
More informationSubset Selection in Multiple Regression
Chapter 307 Subset Selection in Multiple Regression Introduction Multiple regression analysis is documented in Chapter 305 Multiple Regression, so that information will not be repeated here. Refer to that
More informationCreating Macro Calls using Proc Freq
Creating Macro Calls using Proc Freq, Educational Testing Service, Princeton, NJ ABSTRACT Imagine you were asked to get a series of statistics/tables for each country in the world. You have the data, but
More informationContrasts and Multiple Comparisons
Contrasts and Multiple Comparisons /* onewaymath.sas */ title2 'Oneway with contrasts and multiple comparisons (Exclude Other/DK)'; %include 'readmath.sas'; if ethnic ne 6; /* Otherwise, throw the case
More informationHow to write ADaM specifications like a ninja.
Poster PP06 How to write ADaM specifications like a ninja. Caroline Francis, Independent SAS & Standards Consultant, Torrevieja, Spain ABSTRACT To produce analysis datasets from CDISC Study Data Tabulation
More informationAre you Still Afraid of Using Arrays? Let s Explore their Advantages
Paper CT07 Are you Still Afraid of Using Arrays? Let s Explore their Advantages Vladyslav Khudov, Experis Clinical, Kharkiv, Ukraine ABSTRACT At first glance, arrays in SAS seem to be a complicated and
More informationPharmaceuticals, Health Care, and Life Sciences. An Approach to CDISC SDTM Implementation for Clinical Trials Data
An Approach to CDISC SDTM Implementation for Clinical Trials Data William T. Chen, Merck Research Laboratories, Rahway, NJ Margaret M. Coughlin, Merck Research Laboratories, Rahway, NJ ABSTRACT The Clinical
More informationTo conceptualize the process, the table below shows the highly correlated covariates in descending order of their R statistic.
Automating the process of choosing among highly correlated covariates for multivariable logistic regression Michael C. Doherty, i3drugsafety, Waltham, MA ABSTRACT In observational studies, there can be
More informationFigure 1. Table shell
Reducing Statisticians Programming Load: Automated Statistical Analysis with SAS and XML Michael C. Palmer, Zurich Biostatistics, Inc., Morristown, NJ Cecilia A. Hale, Zurich Biostatistics, Inc., Morristown,
More informationTHIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010
THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL STOR 455 Midterm September 8, INSTRUCTIONS: BOTH THE EXAM AND THE BUBBLE SHEET WILL BE COLLECTED. YOU MUST PRINT YOUR NAME AND SIGN THE HONOR PLEDGE
More informationStatistics, Data Analysis & Econometrics
ST009 PROC MI as the Basis for a Macro for the Study of Patterns of Missing Data Carl E. Pierchala, National Highway Traffic Safety Administration, Washington ABSTRACT The study of missing data patterns
More informationPharmaSUG Paper SP09
PharmaSUG 2013 - Paper SP09 SAS 9.3: Better graphs, Easier lives for SAS programmers, PK scientists and pharmacometricians Alice Zong, Janssen Research & Development, LLC, Spring House, PA ABSTRACT Data
More informationTools to Facilitate the Creation of Pooled Clinical Trials Databases
Paper AD10 Tools to Facilitate the Creation of Pooled Clinical Trials Databases Patricia Majcher, Johnson & Johnson Pharmaceutical Research & Development, L.L.C., Raritan, NJ ABSTRACT Data collected from
More informationPaper An Automated Reporting Macro to Create Cell Index An Enhanced Revisit. Shi-Tao Yeh, GlaxoSmithKline, King of Prussia, PA
ABSTRACT Paper 236-28 An Automated Reporting Macro to Create Cell Index An Enhanced Revisit When generating tables from SAS PROC TABULATE or PROC REPORT to summarize data, sometimes it is necessary to
More informationEudraCT release notes
Wednesday 13 January 2016 EMA/19969/2016 Information Management Division EudraCT release notes Results version: 10.2.1.0 Goal: Restoration of service Release date: 13 January 2016 Environment: Production
More informationStatistics Lab #7 ANOVA Part 2 & ANCOVA
Statistics Lab #7 ANOVA Part 2 & ANCOVA PSYCH 710 7 Initialize R Initialize R by entering the following commands at the prompt. You must type the commands exactly as shown. options(contrasts=c("contr.sum","contr.poly")
More informationSAS data statements and data: /*Factor A: angle Factor B: geometry Factor C: speed*/
STAT:5201 Applied Statistic II (Factorial with 3 factors as 2 3 design) Three-way ANOVA (Factorial with three factors) with replication Factor A: angle (low=0/high=1) Factor B: geometry (shape A=0/shape
More informationCH5: CORR & SIMPLE LINEAR REFRESSION =======================================
STAT 430 SAS Examples SAS5 ===================== ssh xyz@glue.umd.edu, tap sas913 (old sas82), sas https://www.statlab.umd.edu/sasdoc/sashtml/onldoc.htm CH5: CORR & SIMPLE LINEAR REFRESSION =======================================
More informationStat 5100 Handout #14.a SAS: Logistic Regression
Stat 5100 Handout #14.a SAS: Logistic Regression Example: (Text Table 14.3) Individuals were randomly sampled within two sectors of a city, and checked for presence of disease (here, spread by mosquitoes).
More informationAmie Bissonett, inventiv Health Clinical, Minneapolis, MN
PharmaSUG 2013 - Paper TF12 Let s get SAS sy Amie Bissonett, inventiv Health Clinical, Minneapolis, MN ABSTRACT The SAS language has a plethora of procedures, data step statements, functions, and options
More informationMultiple imputation using chained equations: Issues and guidance for practice
Multiple imputation using chained equations: Issues and guidance for practice Ian R. White, Patrick Royston and Angela M. Wood http://onlinelibrary.wiley.com/doi/10.1002/sim.4067/full By Gabrielle Simoneau
More informationProgramming Gems that are worth learning SQL for! Pamela L. Reading, Rho, Inc., Chapel Hill, NC
Paper CC-05 Programming Gems that are worth learning SQL for! Pamela L. Reading, Rho, Inc., Chapel Hill, NC ABSTRACT For many SAS users, learning SQL syntax appears to be a significant effort with a low
More information