Legacy to SDTM Conversion Workshop: Tools and Techniques

Similar documents
How a Metadata Repository enables dynamism and automation in SDTM-like dataset generation

Optimization of the traceability when applying an ADaM Parallel Conversion Method

Lex Jansen Octagon Research Solutions, Inc.

How to write ADaM specifications like a ninja.

CDISC SDTM and ADaM Real World Issues

Cost-Benefit Analysis of Retrospective vs. Prospective Data Standardization

DIA 11234: CDER Data Standards Common Issues Document webinar questions

SAS offers technology to facilitate working with CDISC standards : the metadata perspective.

Implementing CDISC Using SAS. Full book available for purchase here.

From Implementing CDISC Using SAS. Full book available for purchase here. About This Book... xi About The Authors... xvii Acknowledgments...

PhUSE Paper SD09. "Overnight" Conversion to SDTM Datasets Ready for SDTM Submission Niels Mathiesen, mathiesen & mathiesen, Basel, Switzerland

Introduction to Define.xml

PhUSE US Connect 2019

Conversion of Company Standard Trial Data to SDTM

CDISC Variable Mapping and Control Terminology Implementation Made Easy

Out-of-the-box %definexml

CDASH MODEL 1.0 AND CDASHIG 2.0. Kathleen Mellars Special Thanks to the CDASH Model and CDASHIG Teams

Paper FC02. SDTM, Plus or Minus. Barry R. Cohen, Octagon Research Solutions, Wayne, PA

Standards Driven Innovation

Lex Jansen Octagon Research Solutions, Inc.

CDASH Standards and EDC CRF Library. Guang-liang Wang September 18, Q3 DCDISC Meeting

DCDISC Users Group. Nate Freimark Omnicare Clinical Research Presented on

An Efficient Solution to Efficacy ADaM Design and Implementation

Study Data Reviewer s Guide Completion Guideline

Submission-Ready Define.xml Files Using SAS Clinical Data Integration Melissa R. Martinez, SAS Institute, Cary, NC USA

The Wonderful World of Define.xml.. Practical Uses Today. Mark Wheeldon, CEO, Formedix DC User Group, Washington, 9 th December 2008

Material covered in the Dec 2014 FDA Binding Guidances

Streamline SDTM Development and QC

Beyond OpenCDISC: Using Define.xml Metadata to Ensure End-to-End Submission Integrity. John Brega Linda Collins PharmaStat LLC

Tips on Creating a Strategy for a CDISC Submission Rajkumar Sharma, Nektar Therapeutics, San Francisco, CA

Managing CDISC version changes: how & when to implement? Presented by Lauren Shinaberry, Project Manager Business & Decision Life Sciences

Why organizations need MDR system to manage clinical metadata?

CDISC Migra+on. PhUSE 2010 Berlin. 47 of the top 50 biopharmaceu+cal firms use Cytel sofware to design, simulate and analyze their clinical studies.

Creating Define-XML v2 with the SAS Clinical Standards Toolkit 1.6 Lex Jansen, SAS

Best Practices for E2E DB build process and Efficiency on CDASH to SDTM data Tao Yang, FMD K&L, Nanjing, China

Introduction to ADaM and What s new in ADaM

Sandra Minjoe, Accenture Life Sciences John Brega, PharmaStat. PharmaSUG Single Day Event San Francisco Bay Area

Step Up Your ADaM Compliance Game Ramesh Ayyappath & Graham Oakley

SAS Clinical Data Integration 2.4

Advantages of a real end-to-end approach with CDISC standards

Leveraging Study Data Reviewer s Guide (SDRG) in Building FDA s Confidence in Sponsor s Submitted Datasets

SAS Clinical Data Integration 2.6

From SDTM to displays, through ADaM & Analyses Results Metadata, a flight on board METADATA Airlines

SAS Application to Automate a Comprehensive Review of DEFINE and All of its Components

SAS (Statistical Analysis Software/System)

Advanced Data Visualization using TIBCO Spotfire and SAS using SDTM. Ajay Gupta, PPD

The Submission Data File System Automating the Creation of CDISC SDTM and ADaM Datasets

ADaM Compliance Starts with ADaM Specifications

A SDTM Legacy Data Conversion

Automate Analysis Results Metadata in the Define-XML v2.0. Hong Qi, Majdoub Haloui, Larry Wu, Gregory T Golm Merck & Co., Inc.

Use of Traceability Chains in Study Data and Metadata for Regulatory Electronic Submission

Hanming Tu, Accenture, Berwyn, USA

Data Science Services Dirk Engfer Page 1 of 5

Reporting & Visualisation : D un Dun standard maison au format CDISC 02/02/2016 CDISC GUF 1

Optimization of the traceability when applying an ADaM Parallel Conversion Method

AUTOMATED CREATION OF SUBMISSION-READY ARTIFACTS SILAS MCKEE

Edwin Ponraj Thangarajan, PRA Health Sciences, Chennai, India Giri Balasubramanian, PRA Health Sciences, Chennai, India

Aquila's Lunch And Learn CDISC The FDA Data Standard. Disclosure Note 1/17/2014. Host: Josh Boutwell, MBA, RAC CEO Aquila Solutions, LLC

Study Composer: a CRF design tool enabling the re-use of CDISC define.xml metadata

How to handle different versions of SDTM & DEFINE generation in a Single Study?

ABSTRACT INTRODUCTION WHERE TO START? 1. DATA CHECK FOR CONSISTENCIES

Customizing SAS Data Integration Studio to Generate CDISC Compliant SDTM 3.1 Domains

Customer oriented CDISC implementation

esubmission - Are you really Compliant?

Planning to Pool SDTM by Creating and Maintaining a Sponsor-Specific Controlled Terminology Database

Now let s take a look

A Taste of SDTM in Real Time

Converting Data to the SDTM Standard Using SAS Data Integration Studio

The Benefits of Traceability Beyond Just From SDTM to ADaM in CDISC Standards Maggie Ci Jiang, Teva Pharmaceuticals, Great Valley, PA

Metadata and ADaM.

Preparing the Office of Scientific Investigations (OSI) Requests for Submissions to FDA

SDTM Implementation Guide Clear as Mud: Strategies for Developing Consistent Company Standards

SDTM-ETL TM. New features in version 1.6. Author: Jozef Aerts XML4Pharma July SDTM-ETL TM : New features in v.1.6

OpenCDISC Validator 1.4 What s New?

Automated Creation of Submission-Ready Artifacts Silas McKee, Accenture, Pennsylvania, USA Lourdes Devenney, Accenture, Pennsylvania, USA

Harnessing the Web to Streamline Statistical Programming Processes

An Alternate Way to Create the Standard SDTM Domains

Standardizing FDA Data to Improve Success in Pediatric Drug Development

SAS Clinical Data Integration Server 2.1

Improving Metadata Compliance and Assessing Quality Metrics with a Standards Library

Challenges with the interpreta/on of CDISC - Who can we trust?

Dataset-XML - A New CDISC Standard

One-PROC-Away: The Essence of an Analysis Database Russell W. Helms, Ph.D. Rho, Inc.

Adding, editing and managing links to external documents in define.xml

Updates on CDISC Standards Validation

SIMPLIFY AND STREAMLINE USING PYTHON

Programming checks: Reviewing the overall quality of the deliverables without parallel programming

Data Integrity through DEFINE.PDF and DEFINE.XML

Automate Clinical Trial Data Issue Checking and Tracking

ODM The Operational Efficiency Model: Using ODM to Deliver Proven Cost and Time Savings in Study Set-up

Common Programming Errors in CDISC Data

Taming the SHREW. SDTM Heuristic Research and Evaluation Workshop

Global Checklist to QC SDTM Lab Data Murali Marneni, PPD, LLC, Morrisville, NC Sekhar Badam, PPD, LLC, Morrisville, NC

Power Data Explorer (PDE) - Data Exploration in an All-In-One Dynamic Report Using SAS & EXCEL

PharmaSUG 2014 PO16. Category CDASH SDTM ADaM. Submission in standardized tabular form. Structure Flexible Rigid Flexible * No Yes Yes

CRF Design for Data Standards. David A. Scocca

Developing an Integrated Platform

PharmaSUG Paper AD03

PhUSE EU Connect Paper PP15. Stop Copying CDISC Standards. Craig Parry, SyneQuaNon, Diss, England

Standards Metadata Management (System)

Transcription:

Legacy to SDTM Conversion Workshop: Tools and Techniques Mike Todd President Nth Analytics

Legacy Data Old studies never die Legacy studies are often required for submissions or pharmacovigilence. Often there are multiple Legacy systems, disparate from each other Problem: design an efficient method for converting legacy data to SDTM

Legacy CDISC Implementation Goals Design a strategy such that: No knowledge needed of system that originally produced the legacy data Applicable to files from any system Implementation is flexible enough to adapt to different study designs Minimal programming support required for maintenance Reasonable cost

Study Scenarios Well-documented Raw data available Analysis data reliable Study report, SAP, CRF available Someone familiar with the study available to answer questions Less well-documented Analysis data either not available, or not reliable No SAP Study report missing appendices No one remembers the study Requires a lot of prework before automated methods can be applied

Well Documented Studies This presentation focuses on the best case These studies lend themselves to automated, non-expert driven solutions

The Other Ones, Briefly Becomes a data management problem Problematic data may be excluded or imputed, so long as a reasonable, welldocumented process can be defined Replace unreliable lab normal ranges with published ranges Requires expert data manager to identify problems and propose solutions

Well-Documented Studies ETL Process

Our approach Start with the analysis files Transform these to SDTM Usually contain most of the data required for SDTM Better ones tend to be self-documenting Compare the analysis files to the SDTM domains Ensure that required and expected SDTM variables are available Understand all derivations from SAP or study report

Issues Looking Back Raw Data SDTM ADaM Looking Ahead Legacy Study Report Raw data, analysis data, or combination? SDTM is designed to represent raw data raw data not included must be documented Must match results in legacy CSR or explain why

ETL Process Define how raw/analysis data fits into SDTM domains and variables Match data to required, permitted and expected SDTM data when possible Provide an automated mechanism for specifying the data sources and algorithms Basis for the FDA-mandated DEFINE.PDF documentation Provide the metadata for the SDTM files

Implementing an ETL Process Programs read table-driven metadata to translate the analysis data into SDTM formats Tells the SAS code which analysis variables populate the SDTM variables Indicates when specialized code is required All code is developed to be generic using the metadata to indicate when variations are required New studies only require changes to metadata

Sample Mapping Spreadsheet

Process Without Automation Project 1 Data 1 Data 1 Data Data 1 Project Programs Reports Project 2 Data Project Programs Reports etc.

ETL Process

Software Market leader : SAS Data Integration Studio Formerly ETL Studio $80K per server Off-the-Shelf ETL Metadata: Excel and Access work very well Converts easily to DEFINE.PDF SAS macros read metadata, generate custom SAS code to create SDTM domains from source data Generates standalone, submission-ready programs

Data Flow Process Programmer Legacy Study Report SAS Macros Labor- Intensive Annotated CRF Raw Data Data Manager Raw Data Annotated CRF SDTM Data Metadata ETL System SDTM Data DM CO SUPPDM SDTM Implementation Guide Legacy Analysis Data AE Etc... SUPPAE Statistician Standalone SDTM Programs DEFINE.XML Validation Reports Automated

Validation: SDTM Structure SDTM compliance checks Conformance with Implementation Guide rules can be automated Variable names, labels, type etc. are correct All required variables have values, etc.

Validation: Source Data Validation Verify metadata Data manager/statistician Manual review process Possible approaches independent programming for each raw dataset, verify raw to SDTM conversion for a random sample of subjects

Validation: All Raw Data Mapped? Leverage metadata Use metadata to query raw data structure and determine differences List of raw data variables not mapped Raw datasets vs. domains Raw data variables to SDTM variables

Validation: New SDTM/ADaM Match Legacy Study Report? Demonstrate that programs using SDTM (or ADaM) data can reproduce results in the legacy study report Nasty no shortcuts Tasks can split among multiple programmers they will be needed!

Advantages Does not disrupt existing clinical trial systems It works for all legacy data Reduces cost since only metadata changes for each new study Only new study specific situations cause additional coding Changes to CDISC standards are made to the metadata avoiding costly programming revisions

Advantages (continued) Self-documenting Metadata easily converted to DEFINE.PDF If you follow the system, metadata are guaranteed to provide complete and accurate documentation Generates submission-ready SAS programs System macros create standalone code Code looks good because it is machine-written Use PROC COMPARE to verify that standalone programs accuracy create the SDTM datasets

Challenges Incomplete Analysis Data Have to go back to raw data for domains/variables not covered by analysis data Example: Inclusion/Exclusion Incomplete Raw Data If annotated CRF is unavailable, it may be difficult to determine what certain raw data actually represent. Data quality issues: was the data cleaned?

Challenges (continued) Trial Design and Subject domains Trial Design and Subject domains have to be created manually ADaM Ideally, ADaM data are created from SDTM data For derived variables already in analysis datasets, recreate them, then use original variables for validation

Summary Messy legacy studies require a lot of data management expertise before automated methods can be used Metadata is the key for successful automation Off-the-shelf tools can provide a powerful ETL solution