Introduction to using the CALERIE Public Use Database

Similar documents
Practical Approaches for Using the CTN Datashare: Proposing Parameters, Creating Data Tables, and Analyzing Results.

3. SOURCES OF DATA. 3.1 Data Management Activities

How to write ADaM specifications like a ninja.

Chemistry & Physics Department

Legacy to SDTM Conversion Workshop: Tools and Techniques

Creating an ADaM Data Set for Correlation Analyses

Implementing CDISC Using SAS. Full book available for purchase here.

PPMI Data Access. Chelsea Caspell-Garcia, Eric Foster. The University of Iowa. PPMI Annual Meeting May 12-13, 2015

CDASH MODEL 1.0 AND CDASHIG 2.0. Kathleen Mellars Special Thanks to the CDASH Model and CDASHIG Teams

Study Data Reviewer s Guide Completion Guideline

Programming checks: Reviewing the overall quality of the deliverables without parallel programming

Pooling Clinical Data: Key points and Pitfalls. October 16, 2012 Phuse 2012 conference, Budapest Florence Buchheit

Using Excel to Troubleshoot EMIS Data

A Taste of SDTM in Real Time

SAS Application to Automate a Comprehensive Review of DEFINE and All of its Components

Automate Clinical Trial Data Issue Checking and Tracking

DCDISC Users Group. Nate Freimark Omnicare Clinical Research Presented on

SAS (Statistical Analysis Software/System)

An Introduction to Analysis (and Repository) Databases (ARDs)

PORTAL User Guide Proficiency Online Reporting and Trend AnaLysis

CDISC SDTM and ADaM Real World Issues

CDISC Laboratory Standards Release Notes. for. Base Model Version Schema Version Microbiology Extension Review Version

ACCUPLACER Placement Validity Study Guide

WarwickWARE. Data Blender Manual

GeneSys. Unit Six.Two: Administering a 360 Project. genesysonline.net. psytech.com

Once the data warehouse is assembled, its customers will likely

From Implementing CDISC Using SAS. Full book available for purchase here. About This Book... xi About The Authors... xvii Acknowledgments...

Patrice M. Anderson Instructional Designer

Preparing the Office of Scientific Investigations (OSI) Requests for Submissions to FDA

Submission-Ready Define.xml Files Using SAS Clinical Data Integration Melissa R. Martinez, SAS Institute, Cary, NC USA

Importing Geochemical Data

OpenCDISC Validator 1.4 What s New?

1. Getting Started Navigating the Gateway Configuring chambers questions Advertising Application Administration 13

Maximizing Statistical Interactions Part II: Database Issues Provided by: The Biostatistics Collaboration Center (BCC) at Northwestern University

One-Step Change from Baseline Calculations

Web Report Library User Guide

2015 PMF Adult Education End of Year Data Collection Processes Released May 2015

Clinical Data Visualization using TIBCO Spotfire and SAS

The Data Curation Profiles Toolkit: Interview Worksheet

HEALTH QUALITY ONTARIO. Quality Improvement Reporting & Analysis Platform (QI RAP) User Guide

Lab Session: Cost Management of Software/CIS Development Project (using 2016 Microsoft Project tool) Lab Manual

Using UWSuperior and Turnitin Together

Depending on the computer you find yourself in front of, here s what you ll need to do to open SPSS.

How to Access If Rubrics does not appear on your course navbar, click Edit Course, Tools, Rubrics to activate..

Introduction to ADaM and What s new in ADaM

Automate Analysis Results Metadata in the Define-XML v2.0. Hong Qi, Majdoub Haloui, Larry Wu, Gregory T Golm Merck & Co., Inc.

Use of Traceability Chains in Study Data and Metadata for Regulatory Electronic Submission

CFB: A Programming Pattern for Creating Change from Baseline Datasets Lei Zhang, Celgene Corporation, Summit, NJ

An Introduction to Visit Window Challenges and Solutions

Edwin Ponraj Thangarajan, PRA Health Sciences, Chennai, India Giri Balasubramanian, PRA Health Sciences, Chennai, India

The SA (Special Assistance) Form CA-908 Deliverable is due the 5th day of each month for the previous months data.

Follow these instructions to navigate to your application:


Introduction. Welcome to the new CPD Area!

AVANTUS TRAINING PTE LTD

Using PROC SQL to Generate Shift Tables More Efficiently

Acknowledgements. Date of publication. The purpose of this guide

Material covered in the Dec 2014 FDA Binding Guidances

Cloud CFO Trial: Step by Step Guide

ehepqual- HCV Quality of Care Performance Measure Program

Computer Lab # 2 Project Cost Management (with Microsoft Project 2007) Lab Manual (with Master s sets of input and output data for Team # 0)

One-PROC-Away: The Essence of an Analysis Database Russell W. Helms, Ph.D. Rho, Inc.

Biostatistics & SAS programming. Kevin Zhang

Planning to Pool SDTM by Creating and Maintaining a Sponsor-Specific Controlled Terminology Database

Overview. Experiment Specifications. This tutorial will enable you to

Parental responsibility measures: attendance data collection (PRM-A) Instructions for local authorities on how to use COLLECT 2015

All About PlexSet Technology Data Analysis in nsolver Software

OpenClinica - Data Import & Export

Demystifying Laboratory Data Reconciliation. The issues at hand

1. Study Registration. 2. Confirm Registration

Data Import and Quality Control in Geochemistry for ArcGIS

Locating the Dropbox Tool:

Medication Precertification Requests

Consolidate Trial Balances

Working with Standards and Duplicates in Geochemistry for ArcGIS

SPSS 11.5 for Windows Assignment 2

Cohuborate Ltd Warranty Services User Manual

Document Version: 1.0. Purpose: This document provides an overview of IBM Clinical Development released by the IBM Corporation.

Improving Metadata Compliance and Assessing Quality Metrics with a Standards Library

Using an ICPSR set-up file to create a SAS dataset

Now let s take a look

SharePoint 2013 Business Intelligence

Working with Composite Endpoints: Constructing Analysis Data Pushpa Saranadasa, Merck & Co., Inc., Upper Gwynedd, PA

Exceptions Management Guide

PharmaSUG Paper DS24

Sandra Minjoe, Accenture Life Sciences John Brega, PharmaStat. PharmaSUG Single Day Event San Francisco Bay Area

Alpha Cycler Endpoint PCR range

Paper FC02. SDTM, Plus or Minus. Barry R. Cohen, Octagon Research Solutions, Wayne, PA

Admissions & Intro to Report Editing Participants Guide

Importing from Blackboard Learn Grade Center Data to Banner 9 User Learning Scenarios

Chat Activity. Moodle: Collaborative Activities & Blocks. Creating Chats

An Efficient Method to Create Titles for Multiple Clinical Reports Using Proc Format within A Do Loop Youying Yu, PharmaNet/i3, West Chester, Ohio

Standards Driven Innovation

STAT 7000: Experimental Statistics I

Data Science Services Dirk Engfer Page 1 of 5

How do I request a new user name or delete the account of an existing user?

Interactive Data Submission System (IDSS) Frequently Asked Questions

MAY CANVAS UPDATES FOR TEACHERS

Automation of SDTM Programming in Oncology Disease Response Domain Yiwen Wang, Yu Cheng, Ju Chen Eli Lilly and Company, China

Skillsoft: Step-by-Step Guide

Transcription:

Introduction to using the CALERIE Public Use Database

Outline Where to find information Data download instructions Database installation Raw and Analysis datasets Important Data usage notes Major data domains

Where to find information / get help Go to calerie.duke.edu In the Quick Navigation panel on the left, click on Database Documentation Guide to using the Database (similar to this presentation) Data Contents: Evaluation schedule Visit codes Rawdata Contents: datasets and variables in raw database Analysis Data Contents: datasets and variables in analysis database Analysis Dataset Details: detailed derivations and value lists of analysis dataset variables

Where to find information / get help (cont d) Data Handling Rules : rules for derivation of key study endpoints eg, RMR, Core body Temperature, QOL scoring algorithms, Adherence to CR, etc. Forms: CRFs (with and without annotation), and documentation for non-crf data. or send questions to calerie@dm.duke.edu

Data Download Instructions Go to calerie.duke.edu In the Quick Navigation panel on the left, click on Apply for Samples & Data Analysis At bottom of page click on Click here to register and download the public database Fill out database submission form: email address, name, institution, department, and submit. Click on ASCII Database or SAS Database to download ascii_database.zip (15.7 MB) or sas_database.zip (19.2 MB)

Install SAS database Create or designate a folder for the data. Click on downloaded sas_database.zip à SAS_database folder contains 2 folders: Rawdata contains raw data transport file à crf.xpt Analysis_data contains analysis data transport files à analysis.xpt and residual.xpt Highlight the desired data folder (Analysis_data, Rawdata or SAS_database for all data) and extract to the designated folder. Instructions and sample code for installing SAS transport files and formats using proc copy are in the guide to using the public database Note special steps for installing SAS data formats

Install ASCII data files Create or designate a folder for the data. Click on downloaded ASCII_database.zip à ASCII_database folder contains 2 folders: Raw_ascii_data contains (84) raw data csv files analysis_ascii_data contains (52) analysis data csv files Highlight the desired data folder (analysis_ascii_data, raw_asci_data or ASCII_database for all data) and extract to the designated folder. Instructions for converting csv files to Excel spreadsheets are in guide_to_using_the _public_database.pdf

Raw Datasets Raw datasets correspond to data collected on CRFs, data received from central labs (eg Blood chemistry, DLW, DXA), electronic files of data collected from instruments (eg RMR, Core temperature) in its original database structure. Raw dataset variables can be mapped to annotated CRFs (or documentation for non-crf data sources) in Database Documentation / Forms / CALERIE CRF Annotated Form pages 1-155 etc. Raw data structure is not convenient for analysis eg data for a single assessment might be split into multiple raw datasets that need to be merged, no derived endpoints, etc. Raw datasets will not be useful for most users.

Analysis Datasets Analysis datasets are derived from raw datasets according to the study data handling rules and transformed to an analyzable structure. Most analysis datasets have 1 record per Subject per study Visit*. Analysis dataset variables are documented in Database Documentation / Data Contents / Analysis Dataset Details Use Analysis datasets (not raw)!

Analysis Populations The database includes all subjects who attended at least one in clinic screening visit (n=1,069), but only 220 were randomized and 218 started the intervention. Make sure to use only the relevant subject population for an analysis! Randomized subjects: SUBJECT1.RAND=1 Subjects who started intervention: SUBJECT1.INTERVEN=1 Randomized treatment arm: IVRSRAND.TX= A (CR) B (AL)

Study Visits Major Study Visits: Baseline (4-6 weeks before randomization) Month 6* Month 12 Month 18* Month 24 *at months 6 and 18, subjects in AL arm had limited set of evaluations. Major Study Visits consist of one or more Sub-Visits. Eg, there were up to 6 Sub-visits at Baseline Study assessments were done one or more times at Baseline and at some or all follow-up study visits. See evaluation schedule.

Study Visits (cont d) Some assessments (eg Weight, RMR, DXA, DLW) were done multiple times in a study visit. Eg DLW, DXA and RMR were performed 2x at Baseline for all subjects DXA was performed 2x at Month 6 for CR subjects For evaluations performed more than once at a Major Study Visit, the analysis dataset has a record for each individual evaluation, AND another record for the average of all evaluations at that Visit.

Study Visits (cont d) eg The Analysis dataset TEERQ (TEE from DLW analysis) has: 1 record for Baseline 1 DLW data (VISIT=4) 1 record for Baseline 2 DLW data (VISIT=5) 1 record for Baseline Mean DLW data (VISIT=0) (average over BL1 and BL2) Make sure to include only the relevant visits in your Analysis! You probably don t want to include all 3 Baseline records!

VISIT codes 0=Baseline (mean across all Baseline visits) 1-3 : Screening Visits 1-3 (weights only) 4=BL1 (Baseline 1 covers up to 3 sub-visits) 5=BL2 (Baseline 2 covers up to 4 sub-visits) 6=Randomization 7=Month 1 8=Month 3 9=Month 6 (covers 4 subvisits) 10=Month 9 11=Month 12 (covers 5 subvisits) 12=Month 18 (covers 3 subvisits) 13=Month 24 (covers 5 subvisits)

Key Variables DEIDNUM unique subject identifier, exists in all datasets. VISIT visit identifier. Most datasets have one record per DEIDNUM/VISIT, and can be merged by DEIDNUM/VISIT Randomized Intervention Arm: IVRSRAND.TX

Some important Analysis Datasets SUBJECT1: subject level data 1 record per DEIDNUM IVRSRAND.TX: randomized treatment arm The following datasets have 1 record per DEIDNUM / VISIT (with separate records for Baseline 1, Baseline 2, and Baseline mean) CLWTVIS: Clinic Weights, (averaged over multiple Sub-Visits in each VISIT) DXAA: Body Composition via DXA TEERQ: TEE via DLW RMRA: Resting Metabolic Rate COREMPA: Core Body Temperature

Important Analysis Datasets (cont d) OCLABFLT : Laboratory assays PCTCR : Calculated % Caloric Restriction based on TEE from DLW and changes in FM and FFM from DXA And many more! See summary of important data domains in Guide to the public database