Statistics and Data Analysis. Common Pitfalls in SAS Statistical Analysis Macros in a Mass Production Environment

Size: px
Start display at page:

Download "Statistics and Data Analysis. Common Pitfalls in SAS Statistical Analysis Macros in a Mass Production Environment"

Transcription

1 Common Pitfalls in SAS Statistical Analysis Macros in a Mass Production Environment Huei-Ling Chen, Merck & Co., Inc., Rahway, NJ Aiming Yang, Merck & Co., Inc., Rahway, NJ ABSTRACT Four pitfalls are commonly encountered in the statistical analysis of clinical trials. The first pitfall can occur in a cross-over study when the McNemar's test is used to estimate the significance level of treatment group differences for a binary outcome result. The second pitfall concerns the ordering in PROC LOGISTIC. The third pitfall involves PROC GLM and its handling of character CLASS variables. The fourth pitfall relates to non-estimable issues with PROC GLM. In a massive clinical trial program production environment, we need to watch for these pitfalls when recycling, reusing, or creating macros. The four pitfalls are described in detail. Recommendations and SAS coding solutions are provided. KEYWORDS Clinical Trial, Cross Over Study, McNemar's Test, PROC LOGISTIC, PROC GLM INTRODUCTION In clinical trials, statistical programmers usually have to deliver a lot of statistical analysis results in tight timelines. Very often a programmer would use some existing macros. In this type of mass production environment, it is important that the macros are robust and generate correct results. This paper lists several undesirable scenarios and provides possible methods/techniques to prevent them from happening. For example, in a cross over study, the McNemar's test is used to estimate the significance level between treatments for a binary outcome result. Usually the testing is carried out on a 2 X 2 frequency table. For some relatively rare events, the single column or single row happens. In this case, a warning message will pop out in the log. To prevent this warning message, a small screening step within the macro could be added beforehand to decide whether the method should be applied. Another example is to use the DESCENDING option properly in a PROC LOGISTIC procedure. The third example is that character treatment group variable, sometimes, could cause a wrong estimation in a CONTRAST or ESTIMATE statement in a PROC GLM procedure. The last example is to recommend using LSMeanDIFFCL in PROC GLM to avoid a potential problem caused by multicollinear data. It is important that every programmer examine existing programs carefully and make an effort to enhance the robustness of new macros. EXAMPLE 1: MCNEMAR'S TEST IN PROC FREQ When estimating the significance level between treatments for a binary outcome result for a 2 * 2 crossover design, the McNemar's test is often to be used. The 2-way adverse event (AE) table is like the following: Treatment A (AE1) With AE without AE Treatment B With AE a b (AE2) without AE c d The McNemar's test statistic is shown below: The following code could carry out the testing. proc freq data=temp; weight count; table AE1*AE2/agree; output out=mcnemar(keep= _MCNEM_ P_MCNEM) agree; Pitfall In clinical trials, safety analysis is a must. The clinician and statistician would like to know whether the occurrence rate of an adverse event is more likely to occur in one treatment arm than another. Since the response variable is the occurrence of an adverse event, due to the nature of the data and the relative rarity of the event, it is highly possible to have cells containing zero in the 2 by 2 table. For example, in the following pseudo data, 100 patients are grouped into two groups: 50 patients 1

2 receive treatment A in the first period and switch to treatment B in the second period. Another 50 patients are in treatment B in the first period and switch to treatment A in the second period. Assume that 98 patients do not have any adverse event in both periods. Two patients have an adverse event in treatment A. This means there is no adverse event for treatment B. The 2 way table would be like the following: Treatment B Treatment A With AE without AE With AE 0 0 without AE 2 98 PROC FREQ will not be able to carry out the analysis if either variable in the 2 * 2 table has less than 2 non-missing levels. There will be a note in the log indicating the data is not suitable to perform the test. SAS will not create a dataset if there is an output statement in the PROC FREQ procedure. Hence, there will be a warning message. NOTE: No statistics are computed for ae1 * ae2 since ae2 has less than 2 nonmissing levels. WARNING: No OUTPUT data set is produced because no statistics can be computed for this table, which has a row or column variable with less than 2 nonmissing levels. For another example, in the same pseudo data, 100 patients are grouped into two groups: 50 patients receive treatment A in the first period and switch to treatment B in the second period. Another 50 patients are in treatment B in the first period and switch to treatment A in the second period. Assume that 98 patients do not have any adverse event in both periods. Two patients have the adverse event in both the treatment A and treatment B period. This means that there is no patient who has the adverse event only in one treatment period. The 2-way table would be like the following: Treatment B Treatment A With AE without AE With AE 2 0 without AE 0 98 The McNemar's test would not be able to carry out the analysis if the data in a 2 * 2 table having zero frequencies in the offdiagonal cells. If a NOPRINT option is specified, there will be a note message in the log indicating there are no discordant data to compute McNemar's test. NOTE: There are no discordant pairs when computing McNemar's test, for the table of ae1 by ae2. However, SAS still will create a dataset when there is an output statement in the PROC FREQ procedure, though the data contain missing values. Suggested Solution These two examples show that a warning message is not necessarily produced if a test could not be carried out. If the data result in a single row or single column in a 2-way table, there will be a warning message. If the data satisfy the 2 X 2 table layout but with zero frequency in off-diagonal cells, the warning message will not show. A common practice for presenting the statistics when a test could not be carried out is simply put a 'N/A' message in the table output. For the first example, this paper provides the following technique to avoid the warning message. proc sql ; select sum(count) into :countr0 from temp where AE1=0; select sum(count) into :countr1 from temp where AE1=1; select sum(count) into :countc0 from temp where AE2=0; select sum(count) into :countc1 from temp where AE2=1; 2

3 %if (&countr0=0 or &countr1=0 or &countc0=0 or &countc1=0) %then %do; data McNemar; _MCNEM_ =.; P_MCNEM =.; %end; %else %do; proc freq data=temp noprint; weight count; table AE1*AE2/agree; output out=mcnemar(keep=_mcnem_ P_MCNEM) agree; %end; For the second example, there is no warning message. Programmers and statisticians should carefully check the SAS output, or, if the NOPRINT is specified, check the note message related to the "No discordant pair'" information. EXAMPLE 2: PROC LOGISTIC / DESCENDING In clinical trial or health care research, a binary variable is often the key result of interest in the study. In a clinical trial, for example, the binary result could be whether there is an AE event or not. In health care research, the binary result could be if a patient enters the long term care facility after being discharged from the hospital. PROC LOGISTIC is frequently used to estimate the impact from the various factors on the probability of having this event. Prior to using the procedure, it is important to know the value of the binary dependent variable. The DESCENDING option should be added to the PROC LOGISTIC statement if the binary dependent variable has value 1 (YES) vs. 0 (NO). The code is as following: proc logistic data=raw outstat=_outstat descending; class trt; model resp2 = trt age; On the other hand, the DESCENDING option should not be added if the binary dependent variable has value 1 (YES) vs. 2 (NO). If one does not know the data and blindly uses the procedure, the result might be interpreted in the wrong direction because the meaning of the sign of the coefficient is opposite from what is assumed. To compare the result of using DESCENDING with the result of not using it, here are three examples. Pitfall One pseudo dataset is created to present the issue. Example 1: Correct Approach --- binary value 1 (YES) vs. 0 (NO) / with DESCENDING option Standard Wald Parameter DF Estimate Error Chi-Square Pr > ChiSq Intercept trt trt trt age

4 Example 2: Correct Approach --- binary value 1 (YES) vs. 2 (NO) / without DESCENDING option Standard Wald Parameter DF Estimate Error Chi-Square Pr > ChiSq Intercept trt trt trt age Example 3: Incorrect Approach --- binary value 0 (NO) vs. 1 (YES) / without DESCENDING option Standard Wald Parameter DF Estimate Error Chi-Square Pr > ChiSq Intercept trt trt trt age The first two outputs have exactly same results because they are estimating the same thing the probability of having a binary response of YES. In the third output, the signs of the estimates are opposite from expected, because this incorrect approach is estimating the probability of having binary response as NO. Suggested Solution It is likely that many SAS/STAT users know that the DESCENDING option in PROC LOGISTIC model should be added if the binary response variable takes value 1 and 0. However, sometimes people use a macro borrowed from other studies, without checking whether the data are appropriate to use beforehand. To avoid this type of mistake, an ideal macro should alert the user to consider the use of the DESCENDING option in the PROC LOGISTIC model to prevent the wrong interpretation. One solution to avoid the misuse of the procedure is a bulletproofing. See below: proc sql noprint; select distinct(&var) into :resp separated by ' ' from raw; %if %scan(&resp,1) ne 0 or %scan(&resp,2) ne 1 or %length(%sysfunc(compress(&resp))) ne 2 %then %do; %put *** the response variable is not 0 / 1 ***; %end; There are many possible approaches to implement this procedure. How to proceed in the macro is up to the macro developer's preference. One possible approach is to stop executing the macro. Another Approach is to drop the DESCENDING statement from the model. If the binary response variable has value 1 (YES) vs. 2 (NO), converting 2 to 0 is also an appropriate solution. EXAMPLE 3: PROC GLM (WITH CHARACTER TREATMENT GROUP) In PROC GLM or PROC MIXED, the coefficient vector estimates β impact from the independent variables on the dependent variables. When testing a null hypothesis using the estimated coefficients, a CONTRAST statement could be added to perform a custom hypothesis test by specifying an L vector or matrix for testing the hypothesis L * β= 0. For example, say there are two treatment groups in a clinical trial study. The null hypothesis is that there is no difference between treatments in the percentage change from baseline in a lab test. In the CONTRAST statement, the L vector should be specified as the following: 4

5 proc glm data=raw outstat=_outstat; class trt; model pchg = trt / ss3; CONTRAST 'trtdiff' trt 1-1; The vector specified in the CONTRAST statement should align with the ordering of the CLASS variable. Assuming the variable TRT has values 1 and 2, and the coefficients of these two treatments are β1 and β2, then the contrast statement above tests β1 β2 = 0. Without knowing the correct ordering of the CLASS variable, the testing statistics might be calculated for a wrong null hypothesis. Pitfall A dose range studies are common in phase II clinical trials. If the study interest is on the right doses of single primary therapy drug as well and a co-administrated drug, then it is possible to have at least 10 treatment arms in this study. Whether the treatment group variable is a character variable or a numeric variable will cause a difference. The sorting order of the CLASS variables by default is ORDER = FORMATED. For unformatted numeric variables, the levels are ordered by their numeric value. However, for character variables, the sorting order is based on the following order: blank! " # $ % & ' ( ) * +, -. / : ; < = A B C D E F G H I J K L M N O P Q R S T U V W X Y Z[ \] ˆ_ a b c d e f g h i j k l m n o p q r s t u v w x y z { } ~ Based on this rule, if the treatment group variable is a character variable with length 2 bytes and with the following value: "1 ", "2 ", "3 ", "4 ", "5 ", "6 ", "7 ", "8 ", "9 ", "10". The ordering will not be like the ordering of numeric variable: Instead, it will be the following: On the other hand, if the treatment group variable is a character variable with length 2 bytes but with the following value: " 1", " 2", " 3", " 4", " 5", " 6", " 7", " 8", " 9", "10". The ordering will still be what people usually expect: Suggested Solution To see the ordering of the CLASS variable, add option E in CONTRAST statement. This will display the entire L vector and hence it is useful in confirming the ordering of parameters for specifying L. Here is the example code and output with E option. proc glm data=raw outstat=_outstat; class trta; model pchg= trta / ss3; CONTRAST 'TRENDPB_5' trta / E; The GLM Procedure Coefficients for Contrast TRENDPB_5 Row 1 Intercept 0 trta 1-2 5

6 trta 2-1 trta 3 0 trta 4 1 trta 5 2 trta 6 0 trta 7 0 trta 8 0 trta 9 0 trta 10 0 Dependent Variable: pchg Sum of Source DF Squares Mean Square F Value Pr > F Model Error Corrected Total R-Square Coeff Var Root MSE pchg Mean Source DF Type III SS Mean Square F Value Pr > F trta Contrast DF Contrast SS Mean Square F Value Pr > F TRENDPB_ It is recommended having that treatment group variables be numeric because it could avoid the possible wrong use of the contrast statement. EXAMPLE 4: PROC GLM (NON ESTIMABLE) In a clinical trial, efficacy and / or safety are always the main interest. The drug effect could be a categorical variable or a continuous variable. To estimate the treatment difference on a continuous variable, the ANOVA or ANCOVA model is a popular method. In SAS, PROC GLM or PROC MIXED are very often used for this purpose. When data have no multicollinearity issues, there is more than one way to calculate a treatment difference and its confidence level. One way is to use the LSMEANS statement and ODS to generate the output table LSMeanDIFFCL. Another way is to use the ESTIMATE statement. Here is an example of code in which the ODS output statement with LSMeanDIFFCL and LSMEANS statements are used together: ods output LSMeanDIFFCL=lsmdifci; proc glm data=lipids outstat=glmout ; class trt site region; model ldl = trt site region/e1 ss1 ss3; lsmeans trt/pdiff cl; 6

7 The output is here Obs Effect Dependent i j LowerCL Difference UpperCL 1 trt ldl Some statisticians and programmers use the ESTIMATE statement to calculate a confidence interval for a treatment difference. title 'GLM MODEL FOR TABLE CREATION'; ods output Estimates=estimate; proc glm data=lipids outstat=glmout ; class trt site region; model ldl = trt site region/e1 ss1 ss3; estimate 'diff2_1' trt -1 1 ; data rmse; set glmout (where=(_source_ eq 'ERROR') keep=_source_ DF SS); t = tinv(0.975, DF); rmse = sqrt(ss/df); byid = 1; call symput ('rmse', trim(left(put(rmse, 8.2))) ); data estimate; set estimate; byid = 1; data part2; merge estimate rmse; by byid; uci = -(estimate - tinv(.975,df)*stderr) ; lci = -(estimate + tinv(.975,df)*stderr) ; This way could generate exactly the same confidence interval as long as the LSMEANS are estimable. Obs Dependent Parameter Estimate uci lci The two approaches have the same confidence interval. 1 ldl diff2_ Pitfall Each LS-mean is computed as L * β where L is the coefficient matrix associated with the least-squares mean and β is the estimate of the fixed-effects parameter vector. As in the LSMEANS statement, the L is tested for estimability, and if this test fails, PROC GLM displays "Non-est" for the LS-means entries. It could be that the chosen independent variables are highly correlated. In this case, the model should be corrected. Here, the least squares means are non-estimable, and hence, using the ODS LSMeanDIFFCL statement combined with LSMEANS to retrieve the confidence interval will generate a warning message as following: WARNING: Output 'LSMeanDIFFCL' was not created 7

8 However, if the second way is used here, the output listing would have the following output, though the warning message won't show up. The GLM Procedure Least Squares Means trt ldl LSMEAN 1 Non-est 2 Non-est trt ldl LSMEAN 95% Confidence Limits Without checking the SAS output, this way still could generate a confidence interval but that is not valid. Obs Dependent Parameter Estimate uci lci 1 ldl diff2_ In a mass production environment, some statisticians and programmers may only check if the log has a "WARNING" or "ERROR" messages. In this scenario, an inappropriate model as in the example might not be noticed. Suggested Solution People should not just rely on checking if there are "WARNING" or "ERROR" messages in the log. They should read the output listings to see if the model is appropriate for the data. Using the ODS LSMeanDIFFCL statement combined with LSMEANS to retrieve the confidence interval is recommended to detect the Non-estimable issues. To be safe, programmers should not only check the log file but also output listing file for any "Non-est" message. CONCLUSION When generating large amounts, sometimes it could be hundreds or thousands, of analysis reports, the common practice is to save the output to the PDF or RTF or WORD document. Prior to delivery of the tables to the reviewer, there are several quality checks that should be done. First, the front line programmers and statisticians should review the SAS logs and SAS output listings. Second, the manager should review the tables. However, in a tight timeline, programmers and statisticians may end up only checking for warning or error message in the log file. A manager who reviews the results might just look at the numbers in the final tables. In this kind of a quality checking chain, checking the SAS output is often skipped. However, that missing step may be actually the most essential part. The first scenario illustrates how an expected warning message could be avoided. The second and third ones show how the model could be used incorrectly even though there is no warning and error message in the log. The last scenario demonstrates that even though there is more than one way in SAS to produce results, different methods provide certain advantages. In this paper, one of the advantages shown is that the method used could detect the data problem and the user alerted that there was an issue with the model used. REFERENCES Wuwei Wayne Feng and Dong Ding SAS@ APPLICATION IN 2 * 2 CROSSOVER CLINICAL TRIAL, in Proceedings of the Pharmaceutical SAS Users Group Conference (PharmaSUG 2004) John Troxell Bulletproofing and Knowledge Encapsulation in Statistical Macros, in Proceedings of the Pharmaceutical SAS Users Group Conference (PharmaSUG 2002) ACKNOWLEDGMENTS The author would like to thank John Troxell and Beilei Xu of Merck Research Laboratories for their Advices on this paper/presentation. CONTACT INFORMATION Your comments and questions are valued and encouraged. Contact the authors at: 8

9 Huei-Ling Chen Merck & Co., Inc. 126 Lincoln Avenue P.O. Box 2000 Rahway, NJ Phone: Aiming Yang Merck & Co., Inc. 126 Lincoln Avenue P.O. Box 2000 Rahway, NJ Phone: TRADEMARK SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. indicates USA registration. Other brand and product names are trademarks of their respective companies. 9

CC13 An Automatic Process to Compare Files. Simon Lin, Merck & Co., Inc., Rahway, NJ Huei-Ling Chen, Merck & Co., Inc., Rahway, NJ

CC13 An Automatic Process to Compare Files. Simon Lin, Merck & Co., Inc., Rahway, NJ Huei-Ling Chen, Merck & Co., Inc., Rahway, NJ CC13 An Automatic Process to Compare Files Simon Lin, Merck & Co., Inc., Rahway, NJ Huei-Ling Chen, Merck & Co., Inc., Rahway, NJ ABSTRACT Comparing different versions of output files is often performed

More information

PharmaSUG Paper SP04

PharmaSUG Paper SP04 PharmaSUG 2015 - Paper SP04 Means Comparisons and No Hard Coding of Your Coefficient Vector It Really Is Possible! Frank Tedesco, United Biosource Corporation, Blue Bell, Pennsylvania ABSTRACT When doing

More information

Want to Do a Better Job? - Select Appropriate Statistical Analysis in Healthcare Research

Want to Do a Better Job? - Select Appropriate Statistical Analysis in Healthcare Research Want to Do a Better Job? - Select Appropriate Statistical Analysis in Healthcare Research Liping Huang, Center for Home Care Policy and Research, Visiting Nurse Service of New York, NY, NY ABSTRACT The

More information

Factorial ANOVA. Skipping... Page 1 of 18

Factorial ANOVA. Skipping... Page 1 of 18 Factorial ANOVA The potato data: Batches of potatoes randomly assigned to to be stored at either cool or warm temperature, infected with one of three bacterial types. Then wait a set period. The dependent

More information

A Taste of SDTM in Real Time

A Taste of SDTM in Real Time A Taste of SDTM in Real Time Changhong Shi, Merck & Co., Inc., Rahway, NJ Beilei Xu, Merck & Co., Inc., Rahway, NJ ABSTRACT The Study Data Tabulation Model (SDTM) is a Clinical Data Interchange Standards

More information

Indenting with Style

Indenting with Style ABSTRACT Indenting with Style Bill Coar, Axio Research, Seattle, WA Within the pharmaceutical industry, many SAS programmers rely heavily on Proc Report. While it is used extensively for summary tables

More information

Data Edit-checks Integration using ODS Tagset Niraj J. Pandya, Element Technologies Inc., NJ Vinodh Paida, Impressive Systems Inc.

Data Edit-checks Integration using ODS Tagset Niraj J. Pandya, Element Technologies Inc., NJ Vinodh Paida, Impressive Systems Inc. PharmaSUG2011 - Paper DM03 Data Edit-checks Integration using ODS Tagset Niraj J. Pandya, Element Technologies Inc., NJ Vinodh Paida, Impressive Systems Inc., TX ABSTRACT In the Clinical trials data analysis

More information

Pooling Clinical Data: Key points and Pitfalls. October 16, 2012 Phuse 2012 conference, Budapest Florence Buchheit

Pooling Clinical Data: Key points and Pitfalls. October 16, 2012 Phuse 2012 conference, Budapest Florence Buchheit Pooling Clinical Data: Key points and Pitfalls October 16, 2012 Phuse 2012 conference, Budapest Florence Buchheit Introduction Are there any pre-defined rules to pool clinical data? Are there any pre-defined

More information

Factorial ANOVA with SAS

Factorial ANOVA with SAS Factorial ANOVA with SAS /* potato305.sas */ options linesize=79 noovp formdlim='_' ; title 'Rotten potatoes'; title2 ''; proc format; value tfmt 1 = 'Cool' 2 = 'Warm'; data spud; infile 'potato2.data'

More information

An Efficient Tool for Clinical Data Check

An Efficient Tool for Clinical Data Check PharmaSUG 2018 - Paper AD-16 An Efficient Tool for Clinical Data Check Chao Su, Merck & Co., Inc., Rahway, NJ Shunbing Zhao, Merck & Co., Inc., Rahway, NJ Cynthia He, Merck & Co., Inc., Rahway, NJ ABSTRACT

More information

SAS Training BASE SAS CONCEPTS BASE SAS:

SAS Training BASE SAS CONCEPTS BASE SAS: SAS Training BASE SAS CONCEPTS BASE SAS: Dataset concept and creating a dataset from internal data Capturing data from external files (txt, CSV and tab) Capturing Non-Standard data (date, time and amounts)

More information

PharmaSUG Paper TT11

PharmaSUG Paper TT11 PharmaSUG 2014 - Paper TT11 What is the Definition of Global On-Demand Reporting within the Pharmaceutical Industry? Eric Kammer, Novartis Pharmaceuticals Corporation, East Hanover, NJ ABSTRACT It is not

More information

Clinical Data Visualization using TIBCO Spotfire and SAS

Clinical Data Visualization using TIBCO Spotfire and SAS ABSTRACT SESUG Paper RIV107-2017 Clinical Data Visualization using TIBCO Spotfire and SAS Ajay Gupta, PPD, Morrisville, USA In Pharmaceuticals/CRO industries, you may receive requests from stakeholders

More information

Let s Get FREQy with our Statistics: Data-Driven Approach to Determining Appropriate Test Statistic

Let s Get FREQy with our Statistics: Data-Driven Approach to Determining Appropriate Test Statistic PharmaSUG 2018 - Paper EP-09 Let s Get FREQy with our Statistics: Data-Driven Approach to Determining Appropriate Test Statistic Richann Watson, DataRich Consulting, Batavia, OH Lynn Mullins, PPD, Cincinnati,

More information

Using PROC SQL to Generate Shift Tables More Efficiently

Using PROC SQL to Generate Shift Tables More Efficiently ABSTRACT SESUG Paper 218-2018 Using PROC SQL to Generate Shift Tables More Efficiently Jenna Cody, IQVIA Shift tables display the change in the frequency of subjects across specified categories from baseline

More information

Biostat Methods STAT 5820/6910 Handout #4: Chi-square, Fisher s, and McNemar s Tests

Biostat Methods STAT 5820/6910 Handout #4: Chi-square, Fisher s, and McNemar s Tests Biostat Methods STAT 5820/6910 Handout #4: Chi-square, Fisher s, and McNemar s Tests Example 1: 152 patients were randomly assigned to 4 dose groups in a clinical study. During the course of the study,

More information

PharmaSUG Paper PO12

PharmaSUG Paper PO12 PharmaSUG 2015 - Paper PO12 ABSTRACT Utilizing SAS for Cross-Report Verification in a Clinical Trials Setting Daniel Szydlo, Fred Hutchinson Cancer Research Center, Seattle, WA Iraj Mohebalian, Fred Hutchinson

More information

Repeated Measures Part 4: Blood Flow data

Repeated Measures Part 4: Blood Flow data Repeated Measures Part 4: Blood Flow data /* bloodflow.sas */ options linesize=79 pagesize=100 noovp formdlim='_'; title 'Two within-subjecs factors: Blood flow data (NWK p. 1181)'; proc format; value

More information

%ANYTL: A Versatile Table/Listing Macro

%ANYTL: A Versatile Table/Listing Macro Paper AD09-2009 %ANYTL: A Versatile Table/Listing Macro Yang Chen, Forest Research Institute, Jersey City, NJ ABSTRACT Unlike traditional table macros, %ANTL has only 3 macro parameters which correspond

More information

An Efficient Method to Create Titles for Multiple Clinical Reports Using Proc Format within A Do Loop Youying Yu, PharmaNet/i3, West Chester, Ohio

An Efficient Method to Create Titles for Multiple Clinical Reports Using Proc Format within A Do Loop Youying Yu, PharmaNet/i3, West Chester, Ohio PharmaSUG 2012 - Paper CC12 An Efficient Method to Create Titles for Multiple Clinical Reports Using Proc Format within A Do Loop Youying Yu, PharmaNet/i3, West Chester, Ohio ABSTRACT Do you know how to

More information

An Introduction to Visit Window Challenges and Solutions

An Introduction to Visit Window Challenges and Solutions ABSTRACT Paper 125-2017 An Introduction to Visit Window Challenges and Solutions Mai Ngo, SynteractHCR In clinical trial studies, statistical programmers often face the challenge of subjects visits not

More information

186 Statistics, Data Analysis and Modeling. Proceedings of MWSUG '95

186 Statistics, Data Analysis and Modeling. Proceedings of MWSUG '95 A Statistical Analysis Macro Library in SAS Carl R. Haske, Ph.D., STATPROBE, nc., Ann Arbor, M Vivienne Ward, M.S., STATPROBE, nc., Ann Arbor, M ABSTRACT Statistical analysis plays a major role in pharmaceutical

More information

Programmatic Automation of Categorizing and Listing Specific Clinical Terms

Programmatic Automation of Categorizing and Listing Specific Clinical Terms SESUG 2012 Paper CT-13 Programmatic Automation of Categorizing and Listing Specific Clinical Terms Ravi Kankipati, Pinnacle Technical Resources, Dallas, TX Abhilash Chimbirithy, Accenture, Florham Park,

More information

This paper describes a report layout for reporting adverse events by study consumption pattern and explains its programming aspects.

This paper describes a report layout for reporting adverse events by study consumption pattern and explains its programming aspects. PharmaSUG China 2015 Adverse Event Data Programming for Infant Nutrition Trials Ganesh Lekurwale, Singapore Clinical Research Institute, Singapore Parag Wani, Singapore Clinical Research Institute, Singapore

More information

A Practical and Efficient Approach in Generating AE (Adverse Events) Tables within a Clinical Study Environment

A Practical and Efficient Approach in Generating AE (Adverse Events) Tables within a Clinical Study Environment A Practical and Efficient Approach in Generating AE (Adverse Events) Tables within a Clinical Study Environment Abstract Jiannan Hu Vertex Pharmaceuticals, Inc. When a clinical trial is at the stage of

More information

Using Templates Created by the SAS/STAT Procedures

Using Templates Created by the SAS/STAT Procedures Paper 081-29 Using Templates Created by the SAS/STAT Procedures Yanhong Huang, Ph.D. UMDNJ, Newark, NJ Jianming He, Solucient, LLC., Berkeley Heights, NJ ABSTRACT SAS procedures provide a large quantity

More information

One Project, Two Teams: The Unblind Leading the Blind

One Project, Two Teams: The Unblind Leading the Blind ABSTRACT PharmaSUG 2017 - Paper BB01 One Project, Two Teams: The Unblind Leading the Blind Kristen Reece Harrington, Rho, Inc. In the pharmaceutical world, there are instances where multiple independent

More information

SAS/STAT 13.1 User s Guide. The NESTED Procedure

SAS/STAT 13.1 User s Guide. The NESTED Procedure SAS/STAT 13.1 User s Guide The NESTED Procedure This document is an individual chapter from SAS/STAT 13.1 User s Guide. The correct bibliographic citation for the complete manual is as follows: SAS Institute

More information

Introduction to Statistical Analyses in SAS

Introduction to Statistical Analyses in SAS Introduction to Statistical Analyses in SAS Programming Workshop Presented by the Applied Statistics Lab Sarah Janse April 5, 2017 1 Introduction Today we will go over some basic statistical analyses in

More information

The NESTED Procedure (Chapter)

The NESTED Procedure (Chapter) SAS/STAT 9.3 User s Guide The NESTED Procedure (Chapter) SAS Documentation This document is an individual chapter from SAS/STAT 9.3 User s Guide. The correct bibliographic citation for the complete manual

More information

Using SAS Macros to Extract P-values from PROC FREQ

Using SAS Macros to Extract P-values from PROC FREQ SESUG 2016 ABSTRACT Paper CC-232 Using SAS Macros to Extract P-values from PROC FREQ Rachel Straney, University of Central Florida This paper shows how to leverage the SAS Macro Facility with PROC FREQ

More information

Square Peg, Square Hole Getting Tables to Fit on Slides in the ODS Destination for PowerPoint

Square Peg, Square Hole Getting Tables to Fit on Slides in the ODS Destination for PowerPoint PharmaSUG 2018 - Paper DV-01 Square Peg, Square Hole Getting Tables to Fit on Slides in the ODS Destination for PowerPoint Jane Eslinger, SAS Institute Inc. ABSTRACT An output table is a square. A slide

More information

PharmaSUG Paper PO10

PharmaSUG Paper PO10 PharmaSUG 2013 - Paper PO10 How to make SAS Drug Development more efficient Xiaopeng Li, Celerion Inc., Lincoln, NE Chun Feng, Celerion Inc., Lincoln, NE Peng Chai, Celerion Inc., Lincoln, NE ABSTRACT

More information

Assessing superiority/futility in a clinical trial: from multiplicity to simplicity with SAS

Assessing superiority/futility in a clinical trial: from multiplicity to simplicity with SAS PharmaSUG2010 Paper SP10 Assessing superiority/futility in a clinical trial: from multiplicity to simplicity with SAS Phil d Almada, Duke Clinical Research Institute (DCRI), Durham, NC Laura Aberle, Duke

More information

Analysis of Complex Survey Data with SAS

Analysis of Complex Survey Data with SAS ABSTRACT Analysis of Complex Survey Data with SAS Christine R. Wells, Ph.D., UCLA, Los Angeles, CA The differences between data collected via a complex sampling design and data collected via other methods

More information

PharmaSUG China. model to include all potential prognostic factors and exploratory variables, 2) select covariates which are significant at

PharmaSUG China. model to include all potential prognostic factors and exploratory variables, 2) select covariates which are significant at PharmaSUG China A Macro to Automatically Select Covariates from Prognostic Factors and Exploratory Factors for Multivariate Cox PH Model Yu Cheng, Eli Lilly and Company, Shanghai, China ABSTRACT Multivariate

More information

When Simpler is Better Visualizing Laboratory Data Using SG Procedures Wei Cheng, Isis Pharmaceuticals, Inc., Carlsbad, CA

When Simpler is Better Visualizing Laboratory Data Using SG Procedures Wei Cheng, Isis Pharmaceuticals, Inc., Carlsbad, CA When Simpler is Better Visualizing Laboratory Data Using SG Procedures Wei Cheng, Isis Pharmaceuticals, Inc., Carlsbad, CA ABSTRACT In SAS 9.2, SAS/GRAPH introduces a family of new procedures to create

More information

PharmaSUG China

PharmaSUG China PharmaSUG China 2016-39 Smart Statistical Graphics A Comparison Between SAS and TIBCO Spotfire In Data Visualization Yi Gu, Roche Product Development in Asia Pacific, Shanghai, China ABSTRACT Known for

More information

A SAS Macro to Automate the Process of Pooling Sites

A SAS Macro to Automate the Process of Pooling Sites A SAS Macro to Automate the Process of Pooling Sites Changhong Shi, Merck & Co., Inc. Rahway, NJ Lili Chen, Merck & Co., Inc. Rahway, NJ ABSTRACT In an international clinical trial, the patients are located

More information

Run your reports through that last loop to standardize the presentation attributes

Run your reports through that last loop to standardize the presentation attributes PharmaSUG2011 - Paper TT14 Run your reports through that last loop to standardize the presentation attributes Niraj J. Pandya, Element Technologies Inc., NJ ABSTRACT Post Processing of the report could

More information

Creating output datasets using SQL (Structured Query Language) only Andrii Stakhniv, Experis Clinical, Ukraine

Creating output datasets using SQL (Structured Query Language) only Andrii Stakhniv, Experis Clinical, Ukraine ABSTRACT PharmaSUG 2015 Paper QT22 Andrii Stakhniv, Experis Clinical, Ukraine PROC SQL is one of the most powerful procedures in SAS. With this tool we can easily manipulate data and create a large number

More information

Intermediate SAS: Statistics

Intermediate SAS: Statistics Intermediate SAS: Statistics OIT TSS 293-4444 oithelp@mail.wvu.edu oit.wvu.edu/training/classmat/sas/ Table of Contents Procedures... 2 Two-sample t-test:... 2 Paired differences t-test:... 2 Chi Square

More information

Centering and Interactions: The Training Data

Centering and Interactions: The Training Data Centering and Interactions: The Training Data A random sample of 150 technical support workers were first given a test of their technical skill and knowledge, and then randomly assigned to one of three

More information

PharmaSUG Paper PO21

PharmaSUG Paper PO21 PharmaSUG 2015 - Paper PO21 Evaluating SDTM SUPP Domain For AdaM - Trash Can Or Buried Treasure Xiaopeng Li, Celerion, Lincoln, NE Yi Liu, Celerion, Lincoln, NE Chun Feng, Celerion, Lincoln, NE ABSTRACT

More information

Automated Checking Of Multiple Files Kathyayini Tappeta, Percept Pharma Services, Bridgewater, NJ

Automated Checking Of Multiple Files Kathyayini Tappeta, Percept Pharma Services, Bridgewater, NJ PharmaSUG 2015 - Paper QT41 Automated Checking Of Multiple Files Kathyayini Tappeta, Percept Pharma Services, Bridgewater, NJ ABSTRACT Most often clinical trial data analysis has tight deadlines with very

More information

Paper CC-016. METHODOLOGY Suppose the data structure with m missing values for the row indices i=n-m+1,,n can be re-expressed by

Paper CC-016. METHODOLOGY Suppose the data structure with m missing values for the row indices i=n-m+1,,n can be re-expressed by Paper CC-016 A macro for nearest neighbor Lung-Chang Chien, University of North Carolina at Chapel Hill, Chapel Hill, NC Mark Weaver, Family Health International, Research Triangle Park, NC ABSTRACT SAS

More information

There s No Such Thing as Normal Clinical Trials Data, or Is There? Daphne Ewing, Octagon Research Solutions, Inc., Wayne, PA

There s No Such Thing as Normal Clinical Trials Data, or Is There? Daphne Ewing, Octagon Research Solutions, Inc., Wayne, PA Paper HW04 There s No Such Thing as Normal Clinical Trials Data, or Is There? Daphne Ewing, Octagon Research Solutions, Inc., Wayne, PA ABSTRACT Clinical Trials data comes in all shapes and sizes depending

More information

PharmaSUG Paper TT10 Creating a Customized Graph for Adverse Event Incidence and Duration Sanjiv Ramalingam, Octagon Research Solutions Inc.

PharmaSUG Paper TT10 Creating a Customized Graph for Adverse Event Incidence and Duration Sanjiv Ramalingam, Octagon Research Solutions Inc. Abstract PharmaSUG 2011 - Paper TT10 Creating a Customized Graph for Adverse Event Incidence and Duration Sanjiv Ramalingam, Octagon Research Solutions Inc. Adverse event (AE) analysis is a critical part

More information

Module I: Clinical Trials a Practical Guide to Design, Analysis, and Reporting 1. Fundamentals of Trial Design

Module I: Clinical Trials a Practical Guide to Design, Analysis, and Reporting 1. Fundamentals of Trial Design Module I: Clinical Trials a Practical Guide to Design, Analysis, and Reporting 1. Fundamentals of Trial Design Randomized the Clinical Trails About the Uncontrolled Trails The protocol Development The

More information

Cell means coding and effect coding

Cell means coding and effect coding Cell means coding and effect coding /* mathregr_3.sas */ %include 'readmath.sas'; title2 ''; /* The data step continues */ if ethnic ne 6; /* Otherwise, throw the case out */ /* Indicator dummy variables

More information

Automate Clinical Trial Data Issue Checking and Tracking

Automate Clinical Trial Data Issue Checking and Tracking PharmaSUG 2018 - Paper AD-31 ABSTRACT Automate Clinical Trial Data Issue Checking and Tracking Dale LeSueur and Krishna Avula, Regeneron Pharmaceuticals Inc. Well organized and properly cleaned data are

More information

Real Time Clinical Trial Oversight with SAS

Real Time Clinical Trial Oversight with SAS PharmaSUG 2017 - Paper DA01 Real Time Clinical Trial Oversight with SAS Ashok Gunuganti, Trevena ABSTRACT A clinical trial is an expensive and complex undertaking with multiple teams working together to

More information

Advanced Visualization using TIBCO Spotfire and SAS

Advanced Visualization using TIBCO Spotfire and SAS PharmaSUG 2018 - Paper DV-04 ABSTRACT Advanced Visualization using TIBCO Spotfire and SAS Ajay Gupta, PPD, Morrisville, USA In Pharmaceuticals/CRO industries, you may receive requests from stakeholders

More information

Sorting big datasets. Do we really need it? Daniil Shliakhov, Experis Clinical, Kharkiv, Ukraine

Sorting big datasets. Do we really need it? Daniil Shliakhov, Experis Clinical, Kharkiv, Ukraine PharmaSUG 2015 - Paper QT21 Sorting big datasets. Do we really need it? Daniil Shliakhov, Experis Clinical, Kharkiv, Ukraine ABSTRACT Very often working with big data causes difficulties for SAS programmers.

More information

BUSINESS ANALYTICS. 96 HOURS Practical Learning. DexLab Certified. Training Module. Gurgaon (Head Office)

BUSINESS ANALYTICS. 96 HOURS Practical Learning. DexLab Certified. Training Module. Gurgaon (Head Office) SAS (Base & Advanced) Analytics & Predictive Modeling Tableau BI 96 HOURS Practical Learning WEEKDAY & WEEKEND BATCHES CLASSROOM & LIVE ONLINE DexLab Certified BUSINESS ANALYTICS Training Module Gurgaon

More information

Preparing the Office of Scientific Investigations (OSI) Requests for Submissions to FDA

Preparing the Office of Scientific Investigations (OSI) Requests for Submissions to FDA PharmaSUG 2018 - Paper EP15 Preparing the Office of Scientific Investigations (OSI) Requests for Submissions to FDA Ellen Lin, Wei Cui, Ran Li, and Yaling Teng Amgen Inc, Thousand Oaks, CA ABSTRACT The

More information

CFB: A Programming Pattern for Creating Change from Baseline Datasets Lei Zhang, Celgene Corporation, Summit, NJ

CFB: A Programming Pattern for Creating Change from Baseline Datasets Lei Zhang, Celgene Corporation, Summit, NJ Paper TT13 CFB: A Programming Pattern for Creating Change from Baseline Datasets Lei Zhang, Celgene Corporation, Summit, NJ ABSTRACT In many clinical studies, Change from Baseline analysis is frequently

More information

ABSTRACT INTRODUCTION WORK FLOW AND PROGRAM SETUP

ABSTRACT INTRODUCTION WORK FLOW AND PROGRAM SETUP A SAS Macro Tool for Selecting Differentially Expressed Genes from Microarray Data Huanying Qin, Laia Alsina, Hui Xu, Elisa L. Priest Baylor Health Care System, Dallas, TX ABSTRACT DNA Microarrays measure

More information

JMP Clinical. Release Notes. Version 5.0

JMP Clinical. Release Notes. Version 5.0 JMP Clinical Version 5.0 Release Notes Creativity involves breaking out of established patterns in order to look at things in a different way. Edward de Bono JMP, A Business Unit of SAS SAS Campus Drive

More information

Data Management - 50%

Data Management - 50% Exam 1: SAS Big Data Preparation, Statistics, and Visual Exploration Data Management - 50% Navigate within the Data Management Studio Interface Register a new QKB Create and connect to a repository Define

More information

It s Proc Tabulate Jim, but not as we know it!

It s Proc Tabulate Jim, but not as we know it! Paper SS02 It s Proc Tabulate Jim, but not as we know it! Robert Walls, PPD, Bellshill, UK ABSTRACT PROC TABULATE has received a very bad press in the last few years. Most SAS Users have come to look on

More information

A SAS and Java Application for Reporting Clinical Trial Data. Kevin Kane MSc Infoworks (Data Handling) Limited

A SAS and Java Application for Reporting Clinical Trial Data. Kevin Kane MSc Infoworks (Data Handling) Limited A SAS and Java Application for Reporting Clinical Trial Data Kevin Kane MSc Infoworks (Data Handling) Limited Reporting Clinical Trials Is Resource Intensive! Reporting a clinical trial program for a new

More information

PharmaSUG China Mina Chen, Roche (China) Holding Ltd.

PharmaSUG China Mina Chen, Roche (China) Holding Ltd. PharmaSUG China 2017-50 Writing Efficient Queries in SAS Using PROC SQL with Teradata Mina Chen, Roche (China) Holding Ltd. ABSTRACT The emergence of big data, as well as advancements in data science approaches

More information

Lab #9: ANOVA and TUKEY tests

Lab #9: ANOVA and TUKEY tests Lab #9: ANOVA and TUKEY tests Objectives: 1. Column manipulation in SAS 2. Analysis of variance 3. Tukey test 4. Least Significant Difference test 5. Analysis of variance with PROC GLM 6. Levene test for

More information

A SAS Macro Utility to Modify and Validate RTF Outputs for Regional Analyses Jagan Mohan Achi, PPD, Austin, TX Joshua N. Winters, PPD, Rochester, NY

A SAS Macro Utility to Modify and Validate RTF Outputs for Regional Analyses Jagan Mohan Achi, PPD, Austin, TX Joshua N. Winters, PPD, Rochester, NY PharmaSUG 2014 - Paper BB14 A SAS Macro Utility to Modify and Validate RTF Outputs for Regional Analyses Jagan Mohan Achi, PPD, Austin, TX Joshua N. Winters, PPD, Rochester, NY ABSTRACT Clinical Study

More information

An Approach Finding the Right Tolerance Level for Clinical Data Acceptance

An Approach Finding the Right Tolerance Level for Clinical Data Acceptance Paper P024 An Approach Finding the Right Tolerance Level for Clinical Data Acceptance Karen Walker, Walker Consulting LLC, Chandler Arizona ABSTRACT Highly broadcasted zero tolerance initiatives for database

More information

PharmaSUG 2013 CC26 Automating the Labeling of X- Axis Sanjiv Ramalingam, Vertex Pharmaceuticals, Inc., Cambridge, MA

PharmaSUG 2013 CC26 Automating the Labeling of X- Axis Sanjiv Ramalingam, Vertex Pharmaceuticals, Inc., Cambridge, MA PharmaSUG 2013 CC26 Automating the Labeling of X- Axis Sanjiv Ramalingam, Vertex Pharmaceuticals, Inc., Cambridge, MA ABSTRACT Labeling of the X-axis usually involves a tedious axis statement specifying

More information

Creating Forest Plots Using SAS/GRAPH and the Annotate Facility

Creating Forest Plots Using SAS/GRAPH and the Annotate Facility PharmaSUG2011 Paper TT12 Creating Forest Plots Using SAS/GRAPH and the Annotate Facility Amanda Tweed, Millennium: The Takeda Oncology Company, Cambridge, MA ABSTRACT Forest plots have become common in

More information

%MAKE_IT_COUNT: An Example Macro for Dynamic Table Programming Britney Gilbert, Juniper Tree Consulting, Porter, Oklahoma

%MAKE_IT_COUNT: An Example Macro for Dynamic Table Programming Britney Gilbert, Juniper Tree Consulting, Porter, Oklahoma Britney Gilbert, Juniper Tree Consulting, Porter, Oklahoma ABSTRACT Today there is more pressure on programmers to deliver summary outputs faster without sacrificing quality. By using just a few programming

More information

One-Step Change from Baseline Calculations

One-Step Change from Baseline Calculations Paper CC08 One-Step Change from Baseline Calculations Nancy Brucken, i3 Statprobe, Ann Arbor, MI ABSTRACT Change from baseline is a common measure of safety and/or efficacy in clinical trials. The traditional

More information

Cut Out The Cut And Paste: SAS Macros For Presenting Statistical Output ABSTRACT INTRODUCTION

Cut Out The Cut And Paste: SAS Macros For Presenting Statistical Output ABSTRACT INTRODUCTION Cut Out The Cut And Paste: SAS Macros For Presenting Statistical Output Myungshin Oh, UCLA Department of Biostatistics Mel Widawski, UCLA School of Nursing ABSTRACT We, as statisticians, often spend more

More information

BY S NOTSORTED OPTION Karuna Samudral, Octagon Research Solutions, Inc., Wayne, PA Gregory M. Giddings, Centocor R&D Inc.

BY S NOTSORTED OPTION Karuna Samudral, Octagon Research Solutions, Inc., Wayne, PA Gregory M. Giddings, Centocor R&D Inc. ABSTRACT BY S NOTSORTED OPTION Karuna Samudral, Octagon Research Solutions, Inc., Wayne, PA Gregory M. Giddings, Centocor R&D Inc., Malvern, PA What if the usual sort and usual group processing would eliminate

More information

PharmaSUG China 2018 Paper AD-62

PharmaSUG China 2018 Paper AD-62 PharmaSUG China 2018 Paper AD-62 Decomposition and Reconstruction of TLF Shells - A Simple, Fast and Accurate Shell Designer Chengeng Tian, dmed Biopharmaceutical Co., Ltd., Shanghai, China ABSTRACT Table/graph

More information

Recall the expression for the minimum significant difference (w) used in the Tukey fixed-range method for means separation:

Recall the expression for the minimum significant difference (w) used in the Tukey fixed-range method for means separation: Topic 11. Unbalanced Designs [ST&D section 9.6, page 219; chapter 18] 11.1 Definition of missing data Accidents often result in loss of data. Crops are destroyed in some plots, plants and animals die,

More information

Get SAS sy with PROC SQL Amie Bissonett, Pharmanet/i3, Minneapolis, MN

Get SAS sy with PROC SQL Amie Bissonett, Pharmanet/i3, Minneapolis, MN PharmaSUG 2012 - Paper TF07 Get SAS sy with PROC SQL Amie Bissonett, Pharmanet/i3, Minneapolis, MN ABSTRACT As a data analyst for genetic clinical research, I was often working with familial data connecting

More information

Correctly Compute Complex Samples Statistics

Correctly Compute Complex Samples Statistics SPSS Complex Samples 15.0 Specifications Correctly Compute Complex Samples Statistics When you conduct sample surveys, use a statistics package dedicated to producing correct estimates for complex sample

More information

What Do You Mean My CSV Doesn t Match My SAS Dataset?

What Do You Mean My CSV Doesn t Match My SAS Dataset? SESUG 2016 Paper CC-132 What Do You Mean My CSV Doesn t Match My SAS Dataset? Patricia Guldin, Merck & Co., Inc; Young Zhuge, Merck & Co., Inc. ABSTRACT Statistical programmers are responsible for delivering

More information

Generating Least Square Means, Standard Error, Observed Mean, Standard Deviation and Confidence Intervals for Treatment Differences using Proc Mixed

Generating Least Square Means, Standard Error, Observed Mean, Standard Deviation and Confidence Intervals for Treatment Differences using Proc Mixed Generating Least Square Means, Standard Error, Observed Mean, Standard Deviation and Confidence Intervals for Treatment Differences using Proc Mixed Richann Watson ABSTRACT Have you ever wanted to calculate

More information

PhUse Practical Uses of the DOW Loop in Pharmaceutical Programming Richard Read Allen, Peak Statistical Services, Evergreen, CO, USA

PhUse Practical Uses of the DOW Loop in Pharmaceutical Programming Richard Read Allen, Peak Statistical Services, Evergreen, CO, USA PhUse 2009 Paper Tu01 Practical Uses of the DOW Loop in Pharmaceutical Programming Richard Read Allen, Peak Statistical Services, Evergreen, CO, USA ABSTRACT The DOW-Loop was originally developed by Don

More information

Subset Selection in Multiple Regression

Subset Selection in Multiple Regression Chapter 307 Subset Selection in Multiple Regression Introduction Multiple regression analysis is documented in Chapter 305 Multiple Regression, so that information will not be repeated here. Refer to that

More information

Creating Macro Calls using Proc Freq

Creating Macro Calls using Proc Freq Creating Macro Calls using Proc Freq, Educational Testing Service, Princeton, NJ ABSTRACT Imagine you were asked to get a series of statistics/tables for each country in the world. You have the data, but

More information

Contrasts and Multiple Comparisons

Contrasts and Multiple Comparisons Contrasts and Multiple Comparisons /* onewaymath.sas */ title2 'Oneway with contrasts and multiple comparisons (Exclude Other/DK)'; %include 'readmath.sas'; if ethnic ne 6; /* Otherwise, throw the case

More information

How to write ADaM specifications like a ninja.

How to write ADaM specifications like a ninja. Poster PP06 How to write ADaM specifications like a ninja. Caroline Francis, Independent SAS & Standards Consultant, Torrevieja, Spain ABSTRACT To produce analysis datasets from CDISC Study Data Tabulation

More information

Are you Still Afraid of Using Arrays? Let s Explore their Advantages

Are you Still Afraid of Using Arrays? Let s Explore their Advantages Paper CT07 Are you Still Afraid of Using Arrays? Let s Explore their Advantages Vladyslav Khudov, Experis Clinical, Kharkiv, Ukraine ABSTRACT At first glance, arrays in SAS seem to be a complicated and

More information

Pharmaceuticals, Health Care, and Life Sciences. An Approach to CDISC SDTM Implementation for Clinical Trials Data

Pharmaceuticals, Health Care, and Life Sciences. An Approach to CDISC SDTM Implementation for Clinical Trials Data An Approach to CDISC SDTM Implementation for Clinical Trials Data William T. Chen, Merck Research Laboratories, Rahway, NJ Margaret M. Coughlin, Merck Research Laboratories, Rahway, NJ ABSTRACT The Clinical

More information

To conceptualize the process, the table below shows the highly correlated covariates in descending order of their R statistic.

To conceptualize the process, the table below shows the highly correlated covariates in descending order of their R statistic. Automating the process of choosing among highly correlated covariates for multivariable logistic regression Michael C. Doherty, i3drugsafety, Waltham, MA ABSTRACT In observational studies, there can be

More information

Figure 1. Table shell

Figure 1. Table shell Reducing Statisticians Programming Load: Automated Statistical Analysis with SAS and XML Michael C. Palmer, Zurich Biostatistics, Inc., Morristown, NJ Cecilia A. Hale, Zurich Biostatistics, Inc., Morristown,

More information

THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010

THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010 THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL STOR 455 Midterm September 8, INSTRUCTIONS: BOTH THE EXAM AND THE BUBBLE SHEET WILL BE COLLECTED. YOU MUST PRINT YOUR NAME AND SIGN THE HONOR PLEDGE

More information

Statistics, Data Analysis & Econometrics

Statistics, Data Analysis & Econometrics ST009 PROC MI as the Basis for a Macro for the Study of Patterns of Missing Data Carl E. Pierchala, National Highway Traffic Safety Administration, Washington ABSTRACT The study of missing data patterns

More information

PharmaSUG Paper SP09

PharmaSUG Paper SP09 PharmaSUG 2013 - Paper SP09 SAS 9.3: Better graphs, Easier lives for SAS programmers, PK scientists and pharmacometricians Alice Zong, Janssen Research & Development, LLC, Spring House, PA ABSTRACT Data

More information

Tools to Facilitate the Creation of Pooled Clinical Trials Databases

Tools to Facilitate the Creation of Pooled Clinical Trials Databases Paper AD10 Tools to Facilitate the Creation of Pooled Clinical Trials Databases Patricia Majcher, Johnson & Johnson Pharmaceutical Research & Development, L.L.C., Raritan, NJ ABSTRACT Data collected from

More information

Paper An Automated Reporting Macro to Create Cell Index An Enhanced Revisit. Shi-Tao Yeh, GlaxoSmithKline, King of Prussia, PA

Paper An Automated Reporting Macro to Create Cell Index An Enhanced Revisit. Shi-Tao Yeh, GlaxoSmithKline, King of Prussia, PA ABSTRACT Paper 236-28 An Automated Reporting Macro to Create Cell Index An Enhanced Revisit When generating tables from SAS PROC TABULATE or PROC REPORT to summarize data, sometimes it is necessary to

More information

EudraCT release notes

EudraCT release notes Wednesday 13 January 2016 EMA/19969/2016 Information Management Division EudraCT release notes Results version: 10.2.1.0 Goal: Restoration of service Release date: 13 January 2016 Environment: Production

More information

Statistics Lab #7 ANOVA Part 2 & ANCOVA

Statistics Lab #7 ANOVA Part 2 & ANCOVA Statistics Lab #7 ANOVA Part 2 & ANCOVA PSYCH 710 7 Initialize R Initialize R by entering the following commands at the prompt. You must type the commands exactly as shown. options(contrasts=c("contr.sum","contr.poly")

More information

SAS data statements and data: /*Factor A: angle Factor B: geometry Factor C: speed*/

SAS data statements and data: /*Factor A: angle Factor B: geometry Factor C: speed*/ STAT:5201 Applied Statistic II (Factorial with 3 factors as 2 3 design) Three-way ANOVA (Factorial with three factors) with replication Factor A: angle (low=0/high=1) Factor B: geometry (shape A=0/shape

More information

CH5: CORR & SIMPLE LINEAR REFRESSION =======================================

CH5: CORR & SIMPLE LINEAR REFRESSION ======================================= STAT 430 SAS Examples SAS5 ===================== ssh xyz@glue.umd.edu, tap sas913 (old sas82), sas https://www.statlab.umd.edu/sasdoc/sashtml/onldoc.htm CH5: CORR & SIMPLE LINEAR REFRESSION =======================================

More information

Stat 5100 Handout #14.a SAS: Logistic Regression

Stat 5100 Handout #14.a SAS: Logistic Regression Stat 5100 Handout #14.a SAS: Logistic Regression Example: (Text Table 14.3) Individuals were randomly sampled within two sectors of a city, and checked for presence of disease (here, spread by mosquitoes).

More information

Amie Bissonett, inventiv Health Clinical, Minneapolis, MN

Amie Bissonett, inventiv Health Clinical, Minneapolis, MN PharmaSUG 2013 - Paper TF12 Let s get SAS sy Amie Bissonett, inventiv Health Clinical, Minneapolis, MN ABSTRACT The SAS language has a plethora of procedures, data step statements, functions, and options

More information

Multiple imputation using chained equations: Issues and guidance for practice

Multiple imputation using chained equations: Issues and guidance for practice Multiple imputation using chained equations: Issues and guidance for practice Ian R. White, Patrick Royston and Angela M. Wood http://onlinelibrary.wiley.com/doi/10.1002/sim.4067/full By Gabrielle Simoneau

More information

Programming Gems that are worth learning SQL for! Pamela L. Reading, Rho, Inc., Chapel Hill, NC

Programming Gems that are worth learning SQL for! Pamela L. Reading, Rho, Inc., Chapel Hill, NC Paper CC-05 Programming Gems that are worth learning SQL for! Pamela L. Reading, Rho, Inc., Chapel Hill, NC ABSTRACT For many SAS users, learning SQL syntax appears to be a significant effort with a low

More information