PharmaSUG Paper PO22

Similar documents
How to write ADaM specifications like a ninja.

Material covered in the Dec 2014 FDA Binding Guidances

An Alternate Way to Create the Standard SDTM Domains

Traceability Look for the source of your analysis results

PharmaSUG 2014 PO16. Category CDASH SDTM ADaM. Submission in standardized tabular form. Structure Flexible Rigid Flexible * No Yes Yes

ADaM Compliance Starts with ADaM Specifications

An Efficient Solution to Efficacy ADaM Design and Implementation

Deriving Rows in CDISC ADaM BDS Datasets

Creating an ADaM Data Set for Correlation Analyses

Dealing with changing versions of SDTM and Controlled Terminology (CT)

Applying ADaM Principles in Developing a Response Analysis Dataset

PharmaSUG Paper PO10

Improving Metadata Compliance and Assessing Quality Metrics with a Standards Library

The Benefits of Traceability Beyond Just From SDTM to ADaM in CDISC Standards Maggie Ci Jiang, Teva Pharmaceuticals, Great Valley, PA

It s All About Getting the Source and Codelist Implementation Right for ADaM Define.xml v2.0

Xiangchen (Bob) Cui, Tathabbai Pakalapati, Qunming Dong Vertex Pharmaceuticals, Cambridge, MA

Introduction to ADaM and What s new in ADaM

Programming checks: Reviewing the overall quality of the deliverables without parallel programming

PharmaSUG Paper PO03

DCDISC Users Group. Nate Freimark Omnicare Clinical Research Presented on

PharmaSUG Paper PO21

ABSTRACT INTRODUCTION WHERE TO START? 1. DATA CHECK FOR CONSISTENCIES

Hands-On ADaM ADAE Development Sandra Minjoe, Accenture Life Sciences, Wayne, Pennsylvania

The Implementation of Display Auto-Generation with Analysis Results Metadata Driven Method

Lex Jansen Octagon Research Solutions, Inc.

Optimization of the traceability when applying an ADaM Parallel Conversion Method

From Implementing CDISC Using SAS. Full book available for purchase here. About This Book... xi About The Authors... xvii Acknowledgments...

PharmaSUG Paper DS06 Designing and Tuning ADaM Datasets. Songhui ZHU, K&L Consulting Services, Fort Washington, PA

Submission-Ready Define.xml Files Using SAS Clinical Data Integration Melissa R. Martinez, SAS Institute, Cary, NC USA

Paper FC02. SDTM, Plus or Minus. Barry R. Cohen, Octagon Research Solutions, Wayne, PA

PharmaSUG2014 Paper DS09

Customer oriented CDISC implementation

ADaM for Medical Devices: Extending the Current ADaM Structures

Implementing CDISC Using SAS. Full book available for purchase here.

How a Metadata Repository enables dynamism and automation in SDTM-like dataset generation

How to review a CRF - A statistical programmer perspective

ADaM and traceability: Chiesi experience

Yes! The basic principles of ADaM are also best practice for our industry Yes! ADaM a standard with enforceable rules and recognized structures Yes!

Advanced Visualization using TIBCO Spotfire and SAS

esubmission - Are you really Compliant?

CDISC SDTM and ADaM Real World Issues

Hands-On ADaM ADAE Development Sandra Minjoe, Accenture Life Sciences, Wayne, Pennsylvania Kim Minkalis, Accenture Life Sciences, Wayne, Pennsylvania

Pharmaceuticals, Health Care, and Life Sciences. An Approach to CDISC SDTM Implementation for Clinical Trials Data

Doctor's Prescription to Re-engineer Process of Pinnacle 21 Community Version Friendly ADaM Development

Harmonizing CDISC Data Standards across Companies: A Practical Overview with Examples

Making a List, Checking it Twice (Part 1): Techniques for Specifying and Validating Analysis Datasets

Traceability in the ADaM Standard Ed Lombardi, SynteractHCR, Inc., Carlsbad, CA

AUTOMATED CREATION OF SUBMISSION-READY ARTIFACTS SILAS MCKEE

Why organizations need MDR system to manage clinical metadata?

SAS Clinical Data Integration 2.4

What is the ADAM OTHER Class of Datasets, and When Should it be Used? John Troxell, Data Standards Consulting

CDASH Standards and EDC CRF Library. Guang-liang Wang September 18, Q3 DCDISC Meeting

Preparing the Office of Scientific Investigations (OSI) Requests for Submissions to FDA

Sandra Minjoe, Accenture Life Sciences John Brega, PharmaStat. PharmaSUG Single Day Event San Francisco Bay Area

PharmaSUG. companies. This paper. will cover how. processes, a fairly linear. before moving. be carried out. Lifecycle. established.

Streamline SDTM Development and QC

Riepilogo e Spazio Q&A

PhUSE US Connect 2019

SAS Application to Automate a Comprehensive Review of DEFINE and All of its Components

Automated Creation of Submission-Ready Artifacts Silas McKee, Accenture, Pennsylvania, USA Lourdes Devenney, Accenture, Pennsylvania, USA

Creating Define-XML v2 with the SAS Clinical Standards Toolkit 1.6 Lex Jansen, SAS

SAS Clinical Data Integration 2.6

Common Programming Errors in CDISC Data

Leveraging Study Data Reviewer s Guide (SDRG) in Building FDA s Confidence in Sponsor s Submitted Datasets

A Taste of SDTM in Real Time

ADaM Reviewer s Guide Interpretation and Implementation

Clarifications About ADaM Implementation Provided in ADaMIG Version 1.1

CDASH MODEL 1.0 AND CDASHIG 2.0. Kathleen Mellars Special Thanks to the CDASH Model and CDASHIG Teams

PharmaSUG Paper CC02

Customizing SAS Data Integration Studio to Generate CDISC Compliant SDTM 3.1 Domains

DIA 11234: CDER Data Standards Common Issues Document webinar questions

Study Data Reviewer s Guide Completion Guideline

Metadata and ADaM.

Edwin Ponraj Thangarajan, PRA Health Sciences, Chennai, India Giri Balasubramanian, PRA Health Sciences, Chennai, India

Considerations on creation of SDTM datasets for extended studies

Codelists Here, Versions There, Controlled Terminology Everywhere Shelley Dunn, Regulus Therapeutics, San Diego, California

PharmaSUG Paper DS24

PharmaSUG Paper AD03

Working with Composite Endpoints: Constructing Analysis Data Pushpa Saranadasa, Merck & Co., Inc., Upper Gwynedd, PA

Introduction to ADaM standards

PhUSE Paper SD09. "Overnight" Conversion to SDTM Datasets Ready for SDTM Submission Niels Mathiesen, mathiesen & mathiesen, Basel, Switzerland

Analysis Data Model Implementation Guide Version 1.1 (Draft) Prepared by the CDISC Analysis Data Model Team

START CONVERTING FROM TEXT DATE/TIME VALUES

Making the most of SAS Jobs in LSAF

Clinical Metadata Metadata management with a CDISC mindset

Beyond OpenCDISC: Using Define.xml Metadata to Ensure End-to-End Submission Integrity. John Brega Linda Collins PharmaStat LLC

From SAP to BDS: The Nuts and Bolts Nancy Brucken, i3 Statprobe, Ann Arbor, MI Paul Slagle, United BioSource Corp., Ann Arbor, MI

Planning to Pool SDTM by Creating and Maintaining a Sponsor-Specific Controlled Terminology Database

From ODM to SDTM: An End-to-End Approach Applied to Phase I Clinical Trials

Standards Metadata Management (System)

PhUSE EU Connect Paper PP15. Stop Copying CDISC Standards. Craig Parry, SyneQuaNon, Diss, England

OpenCDISC Validator 1.4 What s New?

Advanced Data Visualization using TIBCO Spotfire and SAS using SDTM. Ajay Gupta, PPD

Once the data warehouse is assembled, its customers will likely

Creating a Patient Profile using CDISC SDTM Marc Desgrousilliers, Clinovo, Sunnyvale, CA Romain Miralles, Clinovo, Sunnyvale, CA

Step Up Your ADaM Compliance Game Ramesh Ayyappath & Graham Oakley

The Submission Data File System Automating the Creation of CDISC SDTM and ADaM Datasets

Clinical Data Visualization using TIBCO Spotfire and SAS

Improving CDISC SDTM Data Quality & Compliance Right from the Beginning

PharmaSUG Paper DS-24. Family of PARAM***: PARAM, PARAMCD, PARAMN, PARCATy(N), PARAMTYP

Transcription:

PharmaSUG 2015 - Paper PO22 Challenges in Developing ADSL with Baseline Data Hongyu Liu, Vertex Pharmaceuticals Incorporated, Boston, MA Hang Pang, Vertex Pharmaceuticals Incorporated, Boston, MA ABSTRACT For a CDISC compliant submission, the ADSL (Analysis Data Subject Level) domain is the minimum required analysis dataset. However, developing an ADSL dataset can be quite challenging. One of the challenges is how to acquire and organize the data for ADSL creation. Should ADSL be developed using SDTM data as its source data so that all ADaM (Analysis Data Model) domains are programmed independently without recursion? Or should ADSL be developed partially depending on other ADaM domain s programming? This paper will discuss various models that a sponsor may consider during the ADSL dataset generation. INTRODUCTION ADSL is the subject-level analysis dataset for ADaM. ADSL and its related metadata are required in a CDISCbased submission of data from a clinical trial even if no other analysis datasets are submitted [1, 2]. The FDA defines that the core variables, including all covariates presented in the study protocol should be included in each ADaM dataset [3, 4]. These core variables are typically developed and included in the ADSL dataset. Examples of key core variables include: study/protocol number, center/site number, geographic region, country, treatment assignment information, sex, age, race, analysis population flags (e.g., Intent-to-Treat (ITT), Full Analysis Set (FAS), Safety, Per-Protocol (PP)), and other demographic variables. Although CDISC Guidance does not explicitly specify that ADSL should include subject variables (BL), the FDA states clearly in Study Data Technical Conformation Guide that; In addition to the variables specified for ADSL in the ADaMIG such as those previously listed in the core variables section, the sponsor should include multiple additional variables representing various subject s/covariates presented in the study protocol. [3, 4] For some clinical trials, these variables can be easily derived directly from SDTM domains without involving sophisticated programming. However and most likely, ADSL needs to include some data that may require a complicated derivation and can t be simply transferred from SDTM data. Also, those variables are usually programmed in other individual ADaM domains. It is very costly if these variables are programmed with the same logic or even a macro in both ADSL and individual domains. An alternative is to only program these variables once in individual ADaM domains and then map these values to ADSL. Now, the data flow relevant to the programming sequence and relationship between ADSL and individual domains may result in some processing issues, namely, recursion. Recursion, if not properly performed, may greatly increase the risk of errors in both ADSL development and validation. For ADaM data development, we also need to consider the following fundamentals: traceability, analysis-ready, and programming efficiency, as well as FDA requirement compliance etc., though these are not the discussion focus of this paper. There is no clear consensus in the industry on how ADSL should be programmed. It is now obvious that ADSL development is very challenging [5], even five years after CDISC ADaM and ADaMIG was published and accepted by the industry. For this paper, four different models are discussed based on our programming experiences and understanding. 1

FOUR PROPOSED ADSL DEVELOPMENT MODELS MODEL 1: ONE-STEP ADSL DERIVATION This is the simplest development approach among the four models described in this paper for ADSL programming (see Figure 1 below). It is only meaningful when all core and variables that are needed in ADSL domain can be easily derived from source data without any complicated procedure. For example, values that will be included in ADSL are the SDTM flagged values and the corresponding ADaM data use same definition for values. These values can be directly mapped into ADSL variables from source data. After ADSL is developed, all other ADaM domains can be programmed with common variables (core and ) from ADSL. The ADSL development does not depend on other ADaM domain development. LB ADSL (Core+BL) AD (Core+BL) AD (Core+BL) ADLB (Core+BL) Other (Core+BL) Figure 1 One-Step ADSL Derivation MODEL 2: ONE STEP ADSL DERIVATION WITH AN ADDITIONAL SUBJECT LEVEL DATASET For most of the scenarios in the late stage trials, variables cannot be derived directly from SDTM data without further processing. Even derivations for some core population flags such as a protocol deviation flag are relatively complicated. Their derivations usually are performed in individual ADaM domains. One of the options is to program these variables in both ADSL and their relevant individual ADaM domains using the same algorithm. Obviously, doing so will not only double the programming work but will also increase the risk of error. Thus, it is not a practical approach. Some companies in the industry have a practice with adding another subject level ADaM dataset, such as ADBASE (different dataset name may be used) [6, 7]. In this model, ADSL and other ADaM domains do not include variables. These variables will be programmed at the end of the ADaM development and in a separate ADaM dataset as shown in Figure 2. AD (Core) AD (Core) ADSL (Core) ADLB (Core) ADBASE LB Other (Core) Figure 2 ADSL Derivation with an additional subject level dataset 2

MODEL 3: ADSL DERIVED LAST This model develops an analysis dataset for all core variables first, say ADCORE for example, then develops all ADaM domains except ADSL. ADCORE is still a subject level dataset with one observation per subject structure. This dataset provides all core variables that are needed for all ADaM domains development. After all ADaM domains that are relevant to ADSL data are programmed, the ADSL dataset will be created last. This model is similar to model 2, however, variables are not included in the individual ADaM domains, only ADSL. AD (Core) AD (Core) ADCORE ADLB (Core) ADSL (Core+BL) LB Other (Core) Figure 3 ADSL Derived Last MODEL 4: TWO-STEP ADSL DERIVATION This model includes two steps in programming ADSL: 1) to derive variables which can be programmed directly from datasets for ADSL (draft). Most of the core variables can be derived at this step. And 2) to derive other ADSL variables which cannot be programmed without relatively complicated procedures from datasets, including variables or per protocol population flag from the relevant ADaM domains as shown below in Figure 4. Once the ADSL is drafted out with core variables, other ADaM domains that contain data required for variables in ADSL will be programmed. Note that not all ADaM domains need to be drafted at this stage, only the ones with relevant data are required. At the second step of ADSL development, the drafted ADSL merges with those drafted ADaM datasets to populate required values for variables in the ADSL. After all ADSL variables are populated, individual ADaM domains will be developed based on data and ADSL data for core and variables. AD (draft) AD (Core+BL) AD (draft) AD (Core+BL) ADSL (draft) ADDV (draft) ADSL (Core+BL) ADLB (Core+BL) DV Other (draft) ADyy (Core+BL) ADxx (Core+BL) Figure 4 Two-Step ADSL Derivation 3

CONSIDERATION AND DISCUSSION We may summarize the four ADSL derivation models and their advantages and challenges in table 1 below. Table 1 ADSL derivation models and their advantages and challenges Model ADSL Content Other ADaM Domain Content 1 All core and important variables included. All core and Advantage 1. Requirements by CDISC ADaMIG and FDA Study Data Technical Conformance Guide completely met. 2. No dependency on any other ADaM domains. 3. Simple and clear data flow from source to ADaM. 4. Early completion of ADSL development. 5. No additional ADaM domain needed for fulfilling core and variable requirements. 6. Analysis-ready principle met. Challenge Application only to clinical trials that both core and variables required for ADSL can be derived directly from source data without substantial complicated algorithms. May be used in most of the phase 1 trials. For many of the late phase trials it will be very challenging to use this approach. 2 Core variables included, but not variables. Core variables included, but not variables. 1. No dependence on any other ADaM domains. 2. Simple and clear data flow from source to ADaM 3. Early completion of ADSL development. 4. ADSL contains core variables that will be needed for other ADaM domain development. 1. Requirements by CDISC ADaMIG and FDA Study Data Technical Conformance Guide not completely met. 2. Additional ADaM needed for important variables that are not included in ADSL and individual ADaM domains. 3. An additional data merge step with ADBASE will be necessary for TFL creation. 4. CDISC ADaM Analysis- Ready principle not met. 4

CONSIDERATION AND DISCUSSION CONT. Model ADSL Content Other ADaM Domain Content Advantage Challenge 3 All core and Core variables included, but not variables. 1. Requirements for ADSL by CDISC ADaMIG and FDA Study Data Technical Conformance Guide completely met. 2. No dependency of other ADaM domain development on ADSL development. 3. Simple and clear data flow from source to ADaM. 1. Important variables are not included in individual ADaM domains. An additional data merge step with ADSL will be necessary for TFL creation. 2. CDISC ADaM Analysis-Ready principle not completely met. 3. Early completion of ADSL development not practical. ADSL cannot be finalized before other ADaM domains relevant to variables are done. 4. Additional subject level ADaM dataset needs to be programmed. 4 All core and All core and 1. Requirements by CDISC ADaMIG and FDA Study Data Technical Conformance Guide completely met. 2. Analysis-ready principle met. 3. No additional ADaM domain needed for fulfilling core and variable requirements. 4. Only draft ADaM domains relevant to variables need to be developed earlier for ADSL. 1. Data flow needs to account for recursion. 2. Dependency of ADSL development on some other ADaM domains. ADSL cannot be finalized without drafting other ADaM domains relevant to variables. 5

CONCLUSION There are various challenges when developing the ADSL domain with variable involvement. There are some fundamental factors that we need to take into consideration: traceability, analysis-ready, programming efficiency, clearness of connection between content and source data, and FDA requirements compliance, etc. It seems that there s no perfect solution or approach which currently exists or has been endorsed throughout the industry. Model 1 is the best option when ADSL is developed from (e.g. Phase 1 studies) as it has a simple process and clear traceability, meets FDA requirements and CDISC Analysis-ready principles. For late stage studies, we would prefer Model 4. This approach can be used in most of the programming scenarios and creates ADSL and other ADaM datasets that will meet both the CDISC guidelines and the FDA Study Data Technical Conformance Guide recommendations with both core and variables. REFERENCES [1] CDISC Analysis Data Model (ADaM) Implementation Guide, Version 1.0 Final. Published by CDISC December 17, 2009. Available at http://www.cdisc.org. [2] CDISC Analysis Data Model (ADaM), Version 2.1 Final. Published by CDISC December 17, 2009. Available at http://www.cdisc.org. [3] FDA Guidance for Industry: Providing Regulatory Submissions in Electronic Format Standardized Study Data. Available at http://www.fda.gov/forindustry/datastandards/studydatastandards/default.htm, December, 2014. [4] FDA Technical Specifications Document: Study Data Technical Conformance Guide. Available at http://www.fda.gov/forindustry/datastandards/studydatastandards/default.htm, December, 2014. [5] The Biggest Challenges of ADaM Terek Peterson and David Izard, from the Conference Proceedings for NESUG 2010. [6] Does All One-Record-per-Subject Data Belong in ADSL Sandra Minjoe, from the Conference Proceedings for PharmaSUG 2012. [7] Interpreting CDISC ADaM IG through Users Interpretation Angelo Tinazzi, from the Conference Proceedings for PhUSE 2013. ACKNOWLEDGMENTS The authors would like to thank Tracy Turschman, Jun Yu and the Vertex Statistical Programming group for their input, support and encouragement. CONTACT INFORMATION The authors can be reached at: Hongyu Liu Vertex Pharmaceuticals Incorporated 50 Northern Ave. Boston, MA 02210 hongyu_liu@vrtx.com Hang Pang Vertex Pharmaceuticals Incorporated 50 Northern Ave. Boston, MA 02210 hang_pang@vrtx.com SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. indicates USA registration. 6