Beyond OpenCDISC: Using Define.xml Metadata to Ensure End-to-End Submission Integrity. John Brega Linda Collins PharmaStat LLC

Similar documents
Introduction to Define.xml

Lex Jansen Octagon Research Solutions, Inc.

Sandra Minjoe, Accenture Life Sciences John Brega, PharmaStat. PharmaSUG Single Day Event San Francisco Bay Area

Creating Define-XML v2 with the SAS Clinical Standards Toolkit 1.6 Lex Jansen, SAS

Define.xml 2.0: More Functional, More Challenging

It s All About Getting the Source and Codelist Implementation Right for ADaM Define.xml v2.0

Dealing with changing versions of SDTM and Controlled Terminology (CT)

Out-of-the-box %definexml

Material covered in the Dec 2014 FDA Binding Guidances

What is high quality study metadata?

Lex Jansen Octagon Research Solutions, Inc.

ADaM Reviewer s Guide Interpretation and Implementation

The Wonderful World of Define.xml.. Practical Uses Today. Mark Wheeldon, CEO, Formedix DC User Group, Washington, 9 th December 2008

Implementing CDISC Using SAS. Full book available for purchase here.

Study Data Reviewer s Guide

Doctor's Prescription to Re-engineer Process of Pinnacle 21 Community Version Friendly ADaM Development

Study Data Reviewer s Guide. FDA/PhUSE Project Summary

PhUSE EU Connect 2018 SI05. Define ing the Future. Nicola Perry and Johan Schoeman

Comparison of FDA and PMDA Requirements for Electronic Submission of Study Data

From Implementing CDISC Using SAS. Full book available for purchase here. About This Book... xi About The Authors... xvii Acknowledgments...

Aquila's Lunch And Learn CDISC The FDA Data Standard. Disclosure Note 1/17/2014. Host: Josh Boutwell, MBA, RAC CEO Aquila Solutions, LLC

NCI/CDISC or User Specified CT

CDASH Standards and EDC CRF Library. Guang-liang Wang September 18, Q3 DCDISC Meeting

Business & Decision Life Sciences

Managing CDISC version changes: how & when to implement? Presented by Lauren Shinaberry, Project Manager Business & Decision Life Sciences

Study Data Reviewer s Guide Completion Guideline

Optimization of the traceability when applying an ADaM Parallel Conversion Method

PhUSE US Connect 2019

Submission-Ready Define.xml Files Using SAS Clinical Data Integration Melissa R. Martinez, SAS Institute, Cary, NC USA

Taming the SHREW. SDTM Heuristic Research and Evaluation Workshop

Edwin Ponraj Thangarajan, PRA Health Sciences, Chennai, India Giri Balasubramanian, PRA Health Sciences, Chennai, India

SDTM-ETL TM. New features in version 1.6. Author: Jozef Aerts XML4Pharma July SDTM-ETL TM : New features in v.1.6

Adding, editing and managing links to external documents in define.xml

A SDTM Legacy Data Conversion

Revision of Technical Conformance Guide on Electronic Study Data Submissions

How to handle different versions of SDTM & DEFINE generation in a Single Study?

Helping The Define.xml User

esubmission - Are you really Compliant?

DIA 11234: CDER Data Standards Common Issues Document webinar questions

Now let s take a look

Introduction to ADaM and What s new in ADaM

OpenCDISC Validator 1.4 What s New?

Experience of electronic data submission via Gateway to PMDA

CDISC SDTM and ADaM Real World Issues

Updates on CDISC Standards Validation

Robust approach to create Define.xml v2.0. Vineet Jain

How to write ADaM specifications like a ninja.

Traceability Look for the source of your analysis results

Tips on Creating a Strategy for a CDISC Submission Rajkumar Sharma, Nektar Therapeutics, San Francisco, CA

Generating Define.xml from Pinnacle 21 Community

Deriving Rows in CDISC ADaM BDS Datasets

Advantages of a real end-to-end approach with CDISC standards

Improving CDISC SDTM Data Quality & Compliance Right from the Beginning

Automate Analysis Results Metadata in the Define-XML v2.0. Hong Qi, Majdoub Haloui, Larry Wu, Gregory T Golm Merck & Co., Inc.

Legacy to SDTM Conversion Workshop: Tools and Techniques

SAS Application to Automate a Comprehensive Review of DEFINE and All of its Components

PharmaSUG Paper AD03

ADaM and traceability: Chiesi experience

Making a List, Checking it Twice (Part 1): Techniques for Specifying and Validating Analysis Datasets

Introduction to ADaM standards

Creating Define-XML version 2 including Analysis Results Metadata with the SAS Clinical Standards Toolkit

SDTM-ETL 3.1 User Manual and Tutorial

An Efficient Solution to Efficacy ADaM Design and Implementation

Dataset-XML - A New CDISC Standard

Best Practice for Explaining Validation Results in the Study Data Reviewer s Guide

CDISC Standards and the Semantic Web

Step Up Your ADaM Compliance Game Ramesh Ayyappath & Graham Oakley

Streamline SDTM Development and QC

Common Programming Errors in CDISC Data

SAS offers technology to facilitate working with CDISC standards : the metadata perspective.

ADaM Implementation Guide Status Update

define.xml: A Crash Course Frank DiIorio

Customizing SAS Data Integration Studio to Generate CDISC Compliant SDTM 3.1 Domains

SDTM-ETL 4.0 Preview of New Features

Use of Traceability Chains in Study Data and Metadata for Regulatory Electronic Submission

PharmaSUG Paper DS24

Improving Metadata Compliance and Assessing Quality Metrics with a Standards Library

Harmonizing CDISC Data Standards across Companies: A Practical Overview with Examples

PMDA Valida+on Rules. Preparing for your next submission to Japanese Pharmaceu6cals and Medical Devices Agency. Max Kanevsky December 10, 2015

SAS Clinical Data Integration 2.6

Standards Driven Innovation

Standards Implementation: It Should be Simple Right? Thursday January 18, 2018

Xiangchen (Bob) Cui, Tathabbai Pakalapati, Qunming Dong Vertex Pharmaceuticals, Cambridge, MA

Introduction to define.xml

Paper FC02. SDTM, Plus or Minus. Barry R. Cohen, Octagon Research Solutions, Wayne, PA

Introduction to define.xml

TS04. Running OpenCDISC from SAS. Mark Crangle

Why organizations need MDR system to manage clinical metadata?

Paper DS07 PhUSE 2017 CDISC Transport Standards - A Glance. Giri Balasubramanian, PRA Health Sciences Edwin Ponraj Thangarajan, PRA Health Sciences

CDASH MODEL 1.0 AND CDASHIG 2.0. Kathleen Mellars Special Thanks to the CDASH Model and CDASHIG Teams

From SDTM to displays, through ADaM & Analyses Results Metadata, a flight on board METADATA Airlines

Applying ADaM Principles in Developing a Response Analysis Dataset

SAS Clinical Data Integration 2.4

Codelists Here, Versions There, Controlled Terminology Everywhere Shelley Dunn, Regulus Therapeutics, San Diego, California

Considerations on creation of SDTM datasets for extended studies

The Submission Data File System Automating the Creation of CDISC SDTM and ADaM Datasets

Pharmaceuticals, Health Care, and Life Sciences. An Approach to CDISC SDTM Implementation for Clinical Trials Data

Define.xml tools supporting SEND/SDTM data process

Challenges with the interpreta/on of CDISC - Who can we trust?

Standards Metadata Management (System)

Transcription:

Beyond OpenCDISC: Using Define.xml Metadata to Ensure End-to-End Submission Integrity John Brega Linda Collins PharmaStat LLC

Topics Part 1: A Standard with Many Uses Status of the Define.xml Standard What s new in Define 2.0 Documenting everything in a submission Uses Other than Documentation Part 2: Using Analysis Specification Metadata to Drive Data Production and Documentation Alternative Approaches to the Analysis Process Turning Specifications into Programs and Data Turning Specifications and Data into Define.xml Using Define.xml to Verify the Integrity of the ectd Package Driving Analysis Results and Define.xml from Metadata

Part 1: A Standard with Many Uses Status of the Define.xml Standard Define.xml 1.0 was published in 2015 Ten years without updates is quite a run Define.xml 2.0 Draft published in August, 2012 Define.xml 2.0 Final published in March, 2013 Yes, we ve already had it for two years! The Analysis Results Metadata (ARM) extension was published in January, 2015 OpenCDISC Community supports version 2.0. You can use it to create and edit Define content, and generate XML. Does not support ARM yet.

What s New in Define 2.0 You need to round up more details for Comments, Methods, Origins and Codelists It accommodates more useful descriptions of datasets New, stricter rules are enforced for some content The key innovation is to use WHERE clauses to define value-level items. This is a game-changer. Analysis Results Metadata! The first visualization of ARM was in the ADaM 2.0 Implementation Guide, published in 2006. We ve been holding our breath for almost nine years

Analysis Results Metadata in Define 2.0

Not Just for SDTM or ADaM Hardly anything in the define.xml standard even references CDISC data standards. You can use it for any tabulations or analysis data. (In fact, maybe any research data.) (In fact, maybe any data ) Studies, datasets, variables, value-level items, codelists, acrfs and Reviewer s Guides are still the same things, with or without a CDISC standard

Why Use it for Item 11 or Other Data? The same 1999 Guidance that defined the Item 11 data format also has a specification for define.pdf. Why not just produce that in a Word doc and pdf it? Define.xml yields better documentation at a lower cost. The 1999 define.pdf spec has no place for value-level metadata. This can be a big shortcoming, especially for analysis data. The format does not support the same precision and quality of documentation that your reviewers will get in other studies. Automated production and validation tools for define.xml reduce manual production and QC by a lot, and deliver better quality and integrity in the final product. It s a great QC check for the data as well. You can use one documentation process for all your data.

Not Just for Documentation Define.xml is based on the ODM data interchange standard, so it can also be used for purposes unrelated to documentation. There are current applications that use it simply as a metadata repository. The XML format CDISC terminology files you can download from the NCI website are Define.xml 2.0. OpenCDISC Community uses these files as its terminology repositories. This is how you install new terminology versions in the software. Sponsors can use it to manage their sponsor-defined extensions to CDISC terminology.

Part 2: Using Analysis Specification Metadata to Drive Data Production and Documentation Analysis Process Alternatives Turning Specifications into Programs and Data Turning Specifications and Data into Define.xml Using Define.xml to Verify the Integrity of the ectd Submission Package Driving Analysis Results and Define.xml from Metadata Conclusions

Producing an Analysis for Regulatory Submission Dataset specs Codelist specs Report specs Task 1: Write specifications based on SAP and Mockups SDTM Datasets Task 2: Analysis Task 3: Documentation Prepare submission data & docs Create Analysis Datasets Analysis Report Programs ADaM Datasets Tables, Listings, Graphs Selected code Submission.xpt Define.xml Data Guide

SDTM Datasets Create Analysis Datasets Dataset specs Codelist specs Report specs ADaM Datasets Analysis Programs Tables, Listings, Graphs The analysis task is driven by dataset and report specifications based on the SAP and mockups

SDTM Datasets Create Analysis Datasets Dataset specs Codelist specs Report specs Standards set requirements for the end result, but say nothing about a process to get there. ADaM Datasets Prepare submission data & docs That s your call Analysis Programs Scope of Standards Tables, Listings, Graphs Selected code Submission.xpt Define.xml Data Guide

SDTM Datasets Create Analysis Datasets Dataset specs Codelist specs Report specs Metadata can be derived from the datasets and edited to include derivations and other information ADaM Datasets Dataset metadata Prepare submission data & docs Analysis Programs Not necessarily the best process.. Tables, Listings, Graphs Selected code Submission.xpt Define.xml Data Guide

SDTM Datasets Create Analysis Datasets ADaM Datasets Dataset specs Codelist specs Report specs Why not use the same specifications for both production and documentation? Analysis Programs Prepare submission data & docs Tables, Listings, Graphs Selected code Submission.xpt Define.xml Data Guide

SDTM Datasets Create Analysis Datasets ADaM Datasets Dataset specs Codelist specs Report specs Using the same metadata for production can ensure the integrity of the entire package of data and documentation. Analysis Programs Prepare submission data & docs Tables, Listings, Graphs Selected code Submission.xpt Define.xml Data Guide

Typical steps to produce ectd Step Confirm that datasets are clean Convert datasets to transport format Prepare define.xml input metadata Generate define.xml Run OpenCDISC diagnostics Write Analysis Data Reviewer s Guide Description User defined QC process. **Consider user diagnostics** User defined conversion program. Required categories of metadata are study, document, dataset, variable, value, and codelist. **Consider user diagnostics and methods to embed links** User defined or OTS generation process. Review diagnostics and make corrections as needed. Diagnostics that are considered acceptable should be written up in the ADRG. Use the PhUSE ADRG template as referenced in FDA s Technical Conformance Guide. Provide details based on analysis dataset design. When complete, convert to PDF. Follow ADRG Guidelines to finalize.

Typical steps continued Step Description Generate define.pdf if using define.xml 1.0 This step is not needed when using define.xml 2.0. Copy datasets and documentation to delivery folder FDA now asks for programs to be submitted. Identify and segregate all submitted code. Folder names are based on ectd standards: \analysis \adam \datasets \programs Identify programs to submit. Consider including a program index in the documents submitted, or in ADRG. Finish hyperlinking of pdf documents if using define.xml 1.0. Test your hyperlinks! Additional hyperlinking is not needed when using define.xml 2.0.

What does package integrity mean? Define.xml (and associated docs) match dataset content: Contains all (and only) existing datasets and variables Match on all attributes Content matches associated codelists Datasets are consistent with relevant CDISC standards (if used): Consistent with published naming conventions and attributes Populated per rules in standards Define.xml (and associated docs) are technically conforming: Well-formed XML Render properly using style sheet All navigation and links work correctly Legitimate values for non-displayed variables (mandatory, signif digits) In Define.xml 2.0 the package can include analysis results metadata This will involve new consistency rules

Structural Consistency Consistency between define and the structure of datasets: All/only the datasets in submission are in define All/only the variables in submission are in define Data description matches content of datasets Dataset names, labels, descriptions and keys match Variable names, attributes match structure of datasets

Conformance with Standards

Define.xml Dataset level Name Label Variable level Name Label Length Type Format Codelist Values Adxx.xpt Name Label Name Label Length Type Format Values Coded values ADaM IG Name Label Name Label Length Type Format Values (per IG rules)

CDISC standard conformance

If OpenCDISC over-errors: what to do?

Review, fix, and document

ADRG Conformance Summary

Other Useful Consistency Checks Key variables in domain actually identify unique records A Codelist should reflect the variable s permissible value set (for example, for a lab dataset list only the units that were used, not all possible lab units) Codelist values are appropriate type for variable No unused codelist names Length of variable is appropriate for a numeric variable (e.g. 8, not 1) Order of variables in define.xml matches order in dataset

Samples of define.xml metadata based on CDISC pilot project data

CDISC pilot project metadata

Alternate CDISC pilot metadata

Embedded links in the define.xml can take the user directly to more detailed explanations in the ADRG.

Define.xml 2.0: What s Changing? Value-level much more flexible Ability to attach comments to entries Where clauses Analysis Results metadata Some metadata structures will be different

title1 Table 14.2.2 ; title2 Laboratory Summary ; data file ; set adamdata.adlbhy ; where (paramcd = 'HYLAW') ; keep usubjid paramcd avalc avisit avisitn ; run ;

Conclusions Using the same metadata to drive both production and documentation makes a better quality product. Most processes, and many diagnostics, will apply whether the data is CDISC or not. Use OpenCDISC, but also consider developing your own additional diagnostics. Define 2.0 requires somewhat different metadata from 1.0, but allows greater flexibility and clarity. In Define 2.0, the concept of integrated metadata can extend to analysis results.

SDTM Datasets Create Analysis Datasets Dataset specs Codelist specs Specifications based on SAP and Mockups Report specs ADaM Datasets Analysis Programs Prepare submission data & docs Tables, Listings, Graphs Selected code Submission.xpt Define.xml Data Guide