TS04. Running OpenCDISC from SAS. Mark Crangle

Similar documents
ADaM Compliance Starts with ADaM Specifications

OpenCDISC Validator 1.4 What s New?

An Efficient Solution to Efficacy ADaM Design and Implementation

Controlling OpenCDISC using R. Martin Gregory PhUSE 2012 Budapest, October 2012

Implementing CDISC Using SAS. Full book available for purchase here.

From Implementing CDISC Using SAS. Full book available for purchase here. About This Book... xi About The Authors... xvii Acknowledgments...

Improving CDISC SDTM Data Quality & Compliance Right from the Beginning

PhUSE US Connect 2019

Out-of-the-box %definexml

Improving Metadata Compliance and Assessing Quality Metrics with a Standards Library

Beyond OpenCDISC: Using Define.xml Metadata to Ensure End-to-End Submission Integrity. John Brega Linda Collins PharmaStat LLC

Doctor's Prescription to Re-engineer Process of Pinnacle 21 Community Version Friendly ADaM Development

DCDISC Users Group. Nate Freimark Omnicare Clinical Research Presented on

Creating Define-XML v2 with the SAS Clinical Standards Toolkit

Introduction to ADaM and What s new in ADaM

define.xml: A Crash Course Frank DiIorio

How to write ADaM specifications like a ninja.

Step Up Your ADaM Compliance Game Ramesh Ayyappath & Graham Oakley

SAS offers technology to facilitate working with CDISC standards : the metadata perspective.

Creating Define-XML v2 with the SAS Clinical Standards Toolkit 1.6 Lex Jansen, SAS

esubmission - Are you really Compliant?

Introduction to Define.xml

Lex Jansen Octagon Research Solutions, Inc.

Harmonizing CDISC Data Standards across Companies: A Practical Overview with Examples

Use of Traceability Chains in Study Data and Metadata for Regulatory Electronic Submission

Material covered in the Dec 2014 FDA Binding Guidances

Prove It! Importance and Methodologies of Validation in Clinical Trials Reporting

SAS Clinical Data Integration 2.4

How to Automate Validation with Pinnacle 21 Command Line Interface and SAS

Streamline SDTM Development and QC

It s All About Getting the Source and Codelist Implementation Right for ADaM Define.xml v2.0

Dealing with changing versions of SDTM and Controlled Terminology (CT)

Define.xml tools supporting SEND/SDTM data process

SAS Online Training: Course contents: Agenda:

AUTOMATED CREATION OF SUBMISSION-READY ARTIFACTS SILAS MCKEE

Automate Analysis Results Metadata in the Define-XML v2.0. Hong Qi, Majdoub Haloui, Larry Wu, Gregory T Golm Merck & Co., Inc.

Business & Decision Life Sciences

SAS Application to Automate a Comprehensive Review of DEFINE and All of its Components

Define.xml 2.0: More Functional, More Challenging

ADaM and traceability: Chiesi experience

PharmaSUG Paper AD03

SAS Clinical Data Integration 2.6

Lex Jansen Octagon Research Solutions, Inc.

Customizing SAS Data Integration Studio to Generate CDISC Compliant SDTM 3.1 Domains

Adding, editing and managing links to external documents in define.xml

Legacy to SDTM Conversion Workshop: Tools and Techniques

Making a List, Checking it Twice (Part 1): Techniques for Specifying and Validating Analysis Datasets

From SDTM to displays, through ADaM & Analyses Results Metadata, a flight on board METADATA Airlines

Edwin Ponraj Thangarajan, PRA Health Sciences, Chennai, India Giri Balasubramanian, PRA Health Sciences, Chennai, India

Submission-Ready Define.xml Files Using SAS Clinical Data Integration Melissa R. Martinez, SAS Institute, Cary, NC USA

Data Science Services Dirk Engfer Page 1 of 5

ADaM for Medical Devices: Extending the Current ADaM Structures

Planning to Pool SDTM by Creating and Maintaining a Sponsor-Specific Controlled Terminology Database

CDISC Variable Mapping and Control Terminology Implementation Made Easy

CDISC SDTM and ADaM Real World Issues

PhUSE EU Connect 2018 SI05. Define ing the Future. Nicola Perry and Johan Schoeman

Robust approach to create Define.xml v2.0. Vineet Jain

Automated Creation of Submission-Ready Artifacts Silas McKee, Accenture, Pennsylvania, USA Lourdes Devenney, Accenture, Pennsylvania, USA

Experience of electronic data submission via Gateway to PMDA

Optimization of the traceability when applying an ADaM Parallel Conversion Method

What is the ADAM OTHER Class of Datasets, and When Should it be Used? John Troxell, Data Standards Consulting

THE DATA DETECTIVE HINTS AND TIPS FOR INDEPENDENT PROGRAMMING QC. PhUSE Bethan Thomas DATE PRESENTED BY

Traceability in the ADaM Standard Ed Lombardi, SynteractHCR, Inc., Carlsbad, CA

PharmaSUG Paper CC02

Taming the SHREW. SDTM Heuristic Research and Evaluation Workshop

Advantages of a real end-to-end approach with CDISC standards

Application of SDTM Trial Design at GSK. 9 th of December 2010

Download Instructions

SDTM-ETL TM. New features in version 1.6. Author: Jozef Aerts XML4Pharma July SDTM-ETL TM : New features in v.1.6

The Implementation of Display Auto-Generation with Analysis Results Metadata Driven Method

Creating an ADaM Data Set for Correlation Analyses

Updates on CDISC Standards Validation

ADaM Reviewer s Guide Interpretation and Implementation

R1 Test Case that tests this Requirement Comments Manage Users User Role Management

Creating Define-XML version 2 including Analysis Results Metadata with the SAS Clinical Standards Toolkit

Metadata and ADaM.

How to handle different versions of SDTM & DEFINE generation in a Single Study?

The Submission Data File System Automating the Creation of CDISC SDTM and ADaM Datasets

Biostatistics & SAS programming. Kevin Zhang

Riepilogo e Spazio Q&A

What Is SAS? CHAPTER 1 Essential Concepts of Base SAS Software

Importing CSV Data to All Character Variables Arthur L. Carpenter California Occidental Consultants, Anchorage, AK

Sandra Minjoe, Accenture Life Sciences John Brega, PharmaStat. PharmaSUG Single Day Event San Francisco Bay Area

How to validate clinical data more efficiently with SAS Clinical Standards Toolkit

The Benefits of Traceability Beyond Just From SDTM to ADaM in CDISC Standards Maggie Ci Jiang, Teva Pharmaceuticals, Great Valley, PA

Dataset-XML - A New CDISC Standard

PharmaSUG Paper DS24

Revision of Technical Conformance Guide on Electronic Study Data Submissions

Why organizations need MDR system to manage clinical metadata?

Study Data Reviewer s Guide Completion Guideline

%check_codelist: A SAS macro to check SDTM domains against controlled terminology

CS05 Creating define.xml from a SAS program

CDISC Standards End-to-End: Enabling QbD in Data Management Sam Hume

Lab #3: Probability, Simulations, Distributions:

SAS, XML, and CDISC. Anthony T Friebel XML Development Manager, SAS XML Libname Engine Architect SAS Institute Inc.

Organizing Deliverables for Clinical Trials The Concept of Analyses and its Implementation in EXACT

ABSTRACT MORE THAN SYNTAX ORGANIZE YOUR WORK THE SAS ENTERPRISE GUIDE PROJECT. Paper 50-30

NCI/CDISC or User Specified CT

Clinical Metadata Metadata management with a CDISC mindset

Stat 302 Statistical Software and Its Applications SAS: Data I/O

Transcription:

TS04 Running OpenCDISC from SAS Mark Crangle

Introduction The OpenCDISC validator is a tool used to check the compliance of datasets with CDISC standards Open-source Freely available and created by team of experts Metadata driven Java based Can validate a range of CDISC datasets Validation rules can be easily modified Usable on a variety of operating systems Solution can be applied to many types of dataset Tool is packaged with a graphical user interface but can also be run from a command line interface (CLI) 2

Introduction Commonly a package of datasets is only checked for CDISC compliance once all are complete If any issues are identified then the dataset is updated and re-validated Inefficient as can result in rework Changes could affect other dataset/analysis programs that use the data More efficient solution would be to check CDISC compliance at the same time as other validation activities Issues can be identified and fixed early The OpenCDISC Validator can be configured to work more accurately for individual datasets The CLI can be used directly from a SAS session Tool can be run alongside other validation activities Results can be read back in to a SAS dataset, analysed and reported with other validation output 3

The OpenCDISC Validator The OpenCDISC validator can be used to create the Define.xml that is required for electronic submission 4

The OpenCDISC Validator The As mentioned OpenCDISC above, the tool validator can be used can to be used validate to create different the types Define.xml of CDISC that standard is required items. for electronic submission 5

The OpenCDISC Validator As It is mentioned possible to above, include the datasets tool can in xpt, be xml used or to validate delimited different format types of CDISC standard items. 6

The OpenCDISC Validator It Datasets is possible to be to include checked datasets are in specified xpt, xml here or delimited format 7

The OpenCDISC Validator Datasets The to be checked configuration are specified specifies the here checks that will be performed on the data and the metadata that these checks will be using. The validator is packaged with configuration files but these can be edited or new files created 8

The OpenCDISC Validator The A define.xml file configuration for the dataset specifies specified the can be checks supplied that to will be provide performed more on the relevant data checks and the metadata that these checks will be using. The validator is packaged with configuration files but these can be edited or new files created 9

The OpenCDISC Validator A The define.xml validation file for report the can dataset be specified generated can in be supplied excel or CSV. to provide Excel format more is relevant the easiest checks to read but the CSV file can be more easily read into a SAS dataset if required The tool is packaged with different versions of the ADaM, SDTM and SEND Controlled Terminology which can be selected here. 10

Validator Configuration The rules the validator uses are specified in xml configuration files The tool is packaged with preset configuration files for each release of CDISC standards Configuration file has section for global rules and then one for each dataset type In ADaM configuration, one for ADSL and one for BDS Each section is further divided into the variable metadata and rules specific to that datatype. 11

Validator Configuration The configuration file can be viewed in a web browser to show the validation rules in a user friendly format Unique Yes/No identifier flag that defines for each if rule. each For rule many is applied of the by rules the the validator. ID corresponds Each rule to will the have ID of an a active check flag for each dataset from type the validation where it is checks applied published allowing it by to CDISC be easily switched off 12

Validator Configuration Setting for Individual Datasets Some of the checks are used to check consistency between datasets but these are not needed for checking individual datasets Checking an ADaM package contains the required ADSL dataset. (AD0001) Checking subjects present in a dataset are also included in ADSL/DM. (AD0053) To de-activate a specific check the xml file needs to be edited in a text editor Rules are applied to each dataset type by the <val:validationruleref> tag <val:validationruleref RuleID="AD0001" Active="Yes"/> To de-activate a check set Active to No Once the file is saved the change can be easily seen in the web-browser view 13

Command Line Interface The GUI is the most commonly used way to run the validator but it is also accessible using a command line interface. This is accessed using the file validator-cli-version.jar (eg. validatorcli-1.5.jar) Same options as the GUI are available as parameters: Parameter Valid Values Description - task Validate, Generate Validate data or generate a Define.xml - type SDTM, ADaM, Data Standard/Structure to validate SEND, Define, Custom - source Path to the source data files - config Path to the xml configuration document specifying the rules/metadata to validate - config:define Path of the define.xml for the study - config:cdisc CDISC Controlled Terminology Version - report Path and filename where the validation report will be saved - report:type Excel, CSV Report format 14

Command Line Interface Running The command line interface can be used through the command prompt Navigate to the folder where the CLI file is located Run the command java -jar validator-cli-1.5.jar adding the parameters needed For example java -jar validator-cli-1.5.jar -task=validate -type=adam -source="u:/test data/*.xpt" -config="u:/test data/opencdisc-validator/config/config-adam-1.0.xml" -config:cdisc=2011-07-22 -report="u:/test data/opencdisc-validator/reports/report.xls" -report:type=excel 15

Using OpenCDISC in dataset validation Combine CLI with ability to run system commands from SAS to control the OpenCDISC validator from within SAS The edited configuration file is used to eliminate error messages that are caused by only submitting one dataset This is handled by a macro that can be called in a dataset validation program Example /*Code to create validation dataset*/ ODS RTF FILE = outputfile.rtf PROC COMPARE BASE=dev COMPARE=val;... RUN; %runopencdisc(version, dataset, etc.); ODS RTF CLOSE; 16

Using OpenCDISC in dataset validation - Steps Validator outputs report to CSV file as easier to read into SAS Read CSV file into SAS dataset PROC IMPORT OUT=test DATAFILE = "U:\OpenCDISC ADLB.csv" DBMS=CSV REPLACE; GETNAMES=YES; DATAROW=2; GUESSINGROWS=100000; RUN; Summarise issues PROC FREQ data=report NOPRINT; TABLE rule_id*message / OUT=numobs; RUN; 17

Further Processing Report could be checked programmatically and further code run conditionally if common issues found Common issues is that PARAM must have the same value within each unique value of PARAMCD (1-1 correspondence) Difficult to see complete issue just by searching through report Check if report contains this issue then run code to show all unique combinations of PARAM/PARAMCD so can easily see where the issue lies. Permanent copies of datasets containing OpenCDISC report could be kept to allow tracking of issues Any warnings/notes that have been investigated and deemed acceptable could be flagged so that they are not reported each time Tracking of common issues could be used to identify training needs across department 18

Conclusion CDISC compliance is something that must be considered at all stages of dataset design, development and validation Only checking compliance after all datasets are complete is inefficient and could require re-work Using the CLI, the OpenCDISC validator can be controlled from within a SAS session to check individual datasets This can be done inline with other validation activities OpenCDISC report can be processed by SAS to summarise issues Reporting can be done alongside output from other validation methods to create a complete record of dataset quality 19

ANY QUESTIONS? 20