define.xml: A Crash Course Frank DiIorio

Similar documents
Lex Jansen Octagon Research Solutions, Inc.

Now let s take a look

Introduction to Define.xml

Lex Jansen Octagon Research Solutions, Inc.

Creating Define-XML v2 with the SAS Clinical Standards Toolkit 1.6 Lex Jansen, SAS

PhUSE US Connect 2019

Brave New World: How to Adapt to the CDISC Statistical Computing Environment

Beyond OpenCDISC: Using Define.xml Metadata to Ensure End-to-End Submission Integrity. John Brega Linda Collins PharmaStat LLC

TS04. Running OpenCDISC from SAS. Mark Crangle

Material covered in the Dec 2014 FDA Binding Guidances

The Wonderful World of Define.xml.. Practical Uses Today. Mark Wheeldon, CEO, Formedix DC User Group, Washington, 9 th December 2008

Dataset-XML - A New CDISC Standard

Helping The Define.xml User

Advantages of a real end-to-end approach with CDISC standards

Creating Define-XML version 2 including Analysis Results Metadata with the SAS Clinical Standards Toolkit

Improving Metadata Compliance and Assessing Quality Metrics with a Standards Library

Legacy to SDTM Conversion Workshop: Tools and Techniques

Submission-Ready Define.xml Files Using SAS Clinical Data Integration Melissa R. Martinez, SAS Institute, Cary, NC USA

esubmission - Are you really Compliant?

PhUSE Paper SD09. "Overnight" Conversion to SDTM Datasets Ready for SDTM Submission Niels Mathiesen, mathiesen & mathiesen, Basel, Switzerland

Streamline SDTM Development and QC

Define.xml 2.0: More Functional, More Challenging

Out-of-the-box %definexml

SDTM-ETL TM. New features in version 1.6. Author: Jozef Aerts XML4Pharma July SDTM-ETL TM : New features in v.1.6

SAS offers technology to facilitate working with CDISC standards : the metadata perspective.

SDTM-ETL 3.2 User Manual and Tutorial

XML. Jonathan Geisler. April 18, 2008

PharmaSUG Paper AD03

How to handle different versions of SDTM & DEFINE generation in a Single Study?

Aquila's Lunch And Learn CDISC The FDA Data Standard. Disclosure Note 1/17/2014. Host: Josh Boutwell, MBA, RAC CEO Aquila Solutions, LLC

Managing CDISC version changes: how & when to implement? Presented by Lauren Shinaberry, Project Manager Business & Decision Life Sciences

Sandra Minjoe, Accenture Life Sciences John Brega, PharmaStat. PharmaSUG Single Day Event San Francisco Bay Area

PharmaSUG 2014 PO16. Category CDASH SDTM ADaM. Submission in standardized tabular form. Structure Flexible Rigid Flexible * No Yes Yes

Edwin Ponraj Thangarajan, PRA Health Sciences, Chennai, India Giri Balasubramanian, PRA Health Sciences, Chennai, India

Preparing the Office of Scientific Investigations (OSI) Requests for Submissions to FDA

From raw data to submission: A metadata-driven, repository-based process of data conversion to CDISC models

Doctor's Prescription to Re-engineer Process of Pinnacle 21 Community Version Friendly ADaM Development

OpenCDISC Validator 1.4 What s New?

Robust approach to create Define.xml v2.0. Vineet Jain

CDISC SDTM and ADaM Real World Issues

XML. Objectives. Duration. Audience. Pre-Requisites

ADaM Compliance Starts with ADaM Specifications

Improving CDISC SDTM Data Quality & Compliance Right from the Beginning

Creating Define-XML v2 with the SAS Clinical Standards Toolkit

AD07 A Tool to Automate TFL Bundling

SDTM-ETL. New features in version 3.2. SDTM-ETLTM: New features in v.3.2

How to write ADaM specifications like a ninja.

CDISC Variable Mapping and Control Terminology Implementation Made Easy

Automated Creation of Submission-Ready Artifacts Silas McKee, Accenture, Pennsylvania, USA Lourdes Devenney, Accenture, Pennsylvania, USA

From Implementing CDISC Using SAS. Full book available for purchase here. About This Book... xi About The Authors... xvii Acknowledgments...

PhUSE Protocol Representation: The Forgotten CDISC Model

Customizing SAS Data Integration Studio to Generate CDISC Compliant SDTM 3.1 Domains

Standards Driven Innovation

Prove It! Importance and Methodologies of Validation in Clinical Trials Reporting

Study Composer: a CRF design tool enabling the re-use of CDISC define.xml metadata

Harnessing the Web to Streamline Statistical Programming Processes

Display the XML Files for Disclosure to Public by Using User-defined XSL Zhiping Yan, BeiGene, Beijing, China Huadan Li, BeiGene, Beijing, China

Why organizations need MDR system to manage clinical metadata?

SDTM Automation with Standard CRF Pages Taylor Markway, SCRI Development Innovations, Carrboro, NC

PhUSE EU Connect Paper PP15. Stop Copying CDISC Standards. Craig Parry, SyneQuaNon, Diss, England

XSL Languages. Adding styles to HTML elements are simple. Telling a browser to display an element in a special font or color, is easy with CSS.

Adding, editing and managing links to external documents in define.xml

Implementing CDISC Using SAS. Full book available for purchase here.

An Efficient Solution to Efficacy ADaM Design and Implementation

CDISC Standards and the Semantic Web

Understanding Page Template Components. Brandon Scheirman Instructional Designer, OmniUpdate

Structured documents

SAS Application to Automate a Comprehensive Review of DEFINE and All of its Components

XML4Pharma's ODM Study Designer New features of version 2010-R1 and 2010-R2

Automation of makefile For Use in Clinical Development Nalin Tikoo, BioMarin Pharmaceutical Inc., Novato, CA

XSLT: How Do We Use It?

R1 Test Case that tests this Requirement Comments Manage Users User Role Management

Step Up Your ADaM Compliance Game Ramesh Ayyappath & Graham Oakley

SAS, XML, and CDISC. Anthony T Friebel XML Development Manager, SAS XML Libname Engine Architect SAS Institute Inc.

From SDTM to displays, through ADaM & Analyses Results Metadata, a flight on board METADATA Airlines

COP 4814 Florida International University Kip Irvine XSLT. Updated: 2/9/2016 Based on Goldberg, Chapter 2. Irvine COP 4814

SDTM-ETL 3.1 User Manual and Tutorial

Harmonizing CDISC Data Standards across Companies: A Practical Overview with Examples

SDTM Attribute Checking Tool Ellen Xiao, Merck & Co., Inc., Rahway, NJ

Dealing with changing versions of SDTM and Controlled Terminology (CT)

Define.xml tools supporting SEND/SDTM data process

ODS TAGSETS - a Powerful Reporting Method

1. Please, please, please look at the style sheets job aid that I sent to you some time ago in conjunction with this document.

Tips on Creating a Strategy for a CDISC Submission Rajkumar Sharma, Nektar Therapeutics, San Francisco, CA

Style Sheet A. Bellaachia Page: 22

PharmaSUG Paper DS16

Introduction to ADaM and What s new in ADaM

Semantic Web. XSLT: XML Transformation. Morteza Amini. Sharif University of Technology Fall 95-96

Standards Implementation: It Should be Simple Right? Thursday January 18, 2018

Taming Rave: How to control data collection standards?

SAS Clinical Data Integration 2.6

SAS CLINICAL SYLLABUS. DURATION: - 60 Hours

SAS Clinical Data Integration 2.4

INTERNET & WEB APPLICATION DEVELOPMENT SWE 444. Fall Semester (081) Module 4 (III): XSL

HTML/CSS Lesson Plans

It s All About Getting the Source and Codelist Implementation Right for ADaM Define.xml v2.0

Data Integrity through DEFINE.PDF and DEFINE.XML

Best Practices for E2E DB build process and Efficiency on CDASH to SDTM data Tao Yang, FMD K&L, Nanjing, China

Database Systems: Design, Implementation, and Management Tenth Edition. Chapter 14 Database Connectivity and Web Technologies

Figure 1. Table shell

Transcription:

sponsor requests ODM extensions XSL-FO schema/xsd define.xml define.xml: A Crash Course metadata tables XML4Pharma metadata interface (the other) define.pdf ODM itext Frank DiIorio CodeCrafters, Inc. Philadelphia PA XSL XMLPad XML Mapper Xpath JavaScript HTML CSS validation define.pdf Oracle/database define version x old school brute force metadata storage SAS Clinical Standards Toolkit CDISC standard version x Remember define.pdf? Purpose: document deliverables Datasets: description, structure, sort order Variables: attributes, codes, derivation, et al. Created using: Metadata, SAS macros Contents validated by: Visual inspection Programmatic checks of the metadata FDA now requests define.xml, aka CDSISC s Case Report Tabulation Data Definition Specification And visually it resembles define.pdf 1

define.xml: Dataset-Level (transformed by XSL) define.xml: Variable-Level (transformed by XSL) 2

define.xml: Similar, but define.xml differs from define classic : Unlike a PDF, it is easily machine-readable It follows a strictly defined format (an extension of CDISC s ODM schema) It s meatier than define.pdf, requiring much richer metadata Requires validation of Syntax Semantics (compliance with schema) Clearly, we re dealing with something new and complex This Presentation Briefly reviews XML basics Describes metadata needed to support construction of define.xml Presents one way to build the XML file Shows how to validate the file Discusses define.pdf (no, not that define.pdf!) Focuses on define Version 1 but identifies issues relevant to Version 2 Is simply an overview of the file creation and validation process 3

XML Basics Extensible Markup Language: plain text with mark-up ( tags ) similar in look & feel to HTML Content is user-defined, by schemas Files are collections of elements, each of which can have one or more attributes. Elements can, and usually are, arranged in a hierarchy. Unlike HTML, emphasis is on data content, not its display XML is part of a family of specifications that include: XSL transforms XML into another format XPath navigates within the document. Used by XSL. XSD/Schema defines rules for content and structure of an XML file XSLFO formatting objects (XSL-like, creates PDF) XML Basics, Illustrated Study element OID attribute of Study element Element hierarchy: GlobalVariables is child of Study Schema specifies which elements can repeat Schema specifies valid attribute values 4

define.xml Basics What s the advantage over define.pdf? The document is machine readable. This encourages exchange of data between orgs/app s. define.xml must be valid from two perspectives: Syntax Content (compliance with schema) define schema/content An extension of the CDISC Operational Data Model (ODM) Schema controls content, not display Rules for names, attributes, number of occurrences, order of nodes, etc. A value can conform to the schema but still be wrong! (e.g., type is Integer but really should be Float) Available at CDISC, OpenCDISC web sites Determining what goes where is, arguably, the hardest part of the file creation process. Element Order Define Metadata 1.0.xml Start of OpenCDISC XML file showing element order and content: 5

What You ll Need An XML Viewer/Editor (display ODM schema, define.xml, XSL) such as: XMLpad SAS XML Mapper Validator OpenCDISC SAS Clinical Standards Toolkit XML4Pharma Can be supplemented with home-grown tools Knowledge and patience W3Schools.com, other sites/books Between the Tags: Metadata Metadata Drives the creation of the XML And can also be used for various tasks throughout the project life cycle (next slide) Metadata tables can include: Study-level: protocol name, standard name/version Datasets: name, structure, key fields Variables: attributes, controlled terminology usage, derivation/crf source Values: detail of variable values (test codes, etc.) Computational algorithms: extended and/or repeated derivations Controlled terms: descriptions and values of coded/enumerated terminology 6

Metadata: Usage Throughout Study Life Cycle Study setup \study \data \prog m data. config. sdtm. adam Variables Table %cre8 Spec %attrib %dom Split %dom Chk %cr XFDF %xpt domain variable type length label order definitionprog definitionsub EDC / raw program / dataset spec export validate domain define other XPT blankcrf. pdf use crflocation core %def XML define. xml/pdf %def PDF Metadata Issues Design Ideally, maps (directly/views) to XML elements and attributes with a minimum of transformation Should be sensitive to changes in standards: define.xml data (SDTM, ADaM) Storage The metadata should be regarded as a valuable corporate asset. So don t store it in Excel! Oracle or similar enterprise-level database is a far better choice (though more resource intensive). 7

Metadata Issues: Entry (Dataset-Level) Metadata Issues: Entry (Variable-Level) 8

Building the XML Many ways to do this, among them: SAS Clinical Standards Toolkit Brute force: Macros, DATA steps Benefits: extreme flexibility with respect to order of dataset display, control of Comments content, selection of XSL, etc. Also, tool (macros) can perform XML validation, create ZIP file of deliverables Drawbacks: lots of code; has to be responsive to changes in the standards Building (or not) the XSL XSL transforms XML into other formats (HTML is the most common) and makes the XML reader friendly. Since the define XML is in a predictable format, transformation of any file for any study can be done with a standard XSL file (the XML Promise ) The XSL is identified by a reference in the XML: <?xml version="1.0" encoding="iso-8859-1"?> <?xml-stylesheet type="text/xsl" href= define.xsl"?> Your choice: Use XSL found in the CDISC pilots Write your own (as with define.xml: flexibility, at the cost of writing a lot of code) 9

A Word About XSL Before writing your own XSL, consider Different type of language: badly shaped learning curve (for most of us) Think about functionality to provide over and above CDISC-supplied files Table sorting, printing Additional navigation (next/previous table, etc.) Consider whether the sponsor will accept the XSL (ActiveX, JavaScript, security considerations) Sample XSL from Early CDISC Pilot <!-- ***************************************** Element selection requires --> <!-- Code List Items knowledge of XPath --> <!-- ***************************************** --> <xsl:if test="/odm:odm/odm:study/odm:metadataversion/odm: Syntax is similar to XML CodeList[odm:CodeListItem]"> Inclusion of pure HTML <div id="decodelist"> <xsl:for-each select="/odm:odm/odm:study/odm:metadataversion/odm CodeList[odm:CodeListItem]"> The XSL can build <fieldset> HTML statements <xsl:attribute name="id">cl.<xsl:value-of select="@oid"/></xsl:attribute> <legend>code List - <xsl:value-of select="@name"/>, Reference Name (<xsl:value-of select="@oid"/>) </legend> <table> <tr class="header"> <th scope="col" class="codedvalue">coded Value</th> <th scope="col">decode</th> </tr> <xsl:for-each select="./odm:codelistitem"> Coding of XSL can dramatically affect transformation and readability of an XML file, as shown in next slides 10

define.xml: Style Sheet 1 The difference is in the HTML created by the XSL, not in the XML itself! define.xml: Style Sheet 2 The difference is in the HTML created by the XSL, not in the XML itself! 11

Did We Get It Right? Validating the XML Recall define.pdf v. define.xml discussion: different, more stringent and definable validation requirements Ensures names/values, attributes, occurrences, order of nodes conform to the schema. But we can t validate that the data makes sense! Var. length of 20 may be valid according to the schema, but if length in the dataset was >20, problem lies elsewhere Tools OpenCDISC SAS Clinical Standards Toolkit XML4Pharma CDISC Define.xml Checker Home-grown (specialized, client-requested checks) Validation: OpenCDISC V1.3 Rules http://www.opencdisc.org/projects/validator/cdisc-define.xml-1.0-validation-rules Level of severity is arguable! 12

Validation: OpenCDISC Results (Summary) Validation report has become part of our deliverables to the client. Inclusion of any item flagged as an Error or Warning must be explained. Validation: OpenCDISC Results (Detail) 13

You re Not Done Yet: define.pdf You mean define.xml No, define.pdf a PDF rendering of the XML Why (oh why, oh why,?) How Read the XML with SAS XML maps, then use REPORT for the various pieces (Jansen paper) itext open source library (Java) XSL-FO (Formatting Objects) document description language Our old friend, Brute Force (next slide) define.pdf: Brute Force, No Finesse definexml.sas data work.defpdf_value; set work.value; write value-level XML definexmlpdf.sas ODS PROCLABEL, other proc report data=work.defpdf_value; Calling Program %setup(project=study) %definexml( parameters ) %definexmlpdf( parameters ) 14

define.pdf: define.xml Transformed Everything You Know Is Wrong, aka Define- XML Version 2 Various issues with Version 1 Representation of Controlled Terminology Lack of specificity and flexibility of value-level metadata Limited linking to external documents (acrf, reviewer s guide, et al.) Didn t link ADaM metadata to TFLs. No results-level metadata. These and other issues are addressed in Version 2, now called Define- XML. Implications for developers and end-users: More metadata Changes to metadata entry interface Modification of XML-writing and ancillary tools New validation rules Training: developers, end-users, among others Pilot is available on CDISC web site. OpenCDISC 1.5 has validation config file for Version 2. 15

Closing Comments The process to create define.xml is more complex than define.pdf: New technologies More moving partss metadata, XML, XSL, Stringent validation Keys: Organizational commitment Transparent access to robust metadata Tools that facilitate flexible display (especially important to CROs) Thank You! Your comments are valued and encouraged: frank@codecraftersinc.com 16