Fancy Data Visualisations Without Additional Toolkits

Similar documents
When Simpler is Better Visualizing Laboratory Data Using SG Procedures Wei Cheng, Isis Pharmaceuticals, Inc., Carlsbad, CA

PharmaSUG China

Creating Forest Plots Using SAS/GRAPH and the Annotate Facility

Select the Points You ll Use. Tech Assignment: Find a Quadratic Function for College Costs

Standard Safety Visualization Set-up Using Spotfire

Easing into Data Exploration, Reporting, and Analytics Using SAS Enterprise Guide

Working with Composite Endpoints: Constructing Analysis Data Pushpa Saranadasa, Merck & Co., Inc., Upper Gwynedd, PA

A Combined AE + CM Graph using SAS

How to write ADaM specifications like a ninja.

SAS Report Viewer 8.2 Documentation

A Visual Revolution Statistical Graphics in SAS 9.2 Bob Newman, Amadeus Software Limited

Technology Assignment: Scatter Plots

Now let s take a look

Real Time Clinical Trial Oversight with SAS

SAS macro package to automate coding graphs with ClinXport.

Multiple Graphical and Tabular Reports on One Page, Multiple Ways to Do It Niraj J Pandya, CT, USA

Advanced Data Visualization using TIBCO Spotfire and SAS using SDTM. Ajay Gupta, PPD

Asthma UK data portal user guide

SAS CLINICAL SYLLABUS. DURATION: - 60 Hours

Indenting with Style

%EventChart: A Macro to Visualize Data with Multiple Timed Events

Microsoft Excel Office 2016/2013/2010/2007 Tips and Tricks

Patient Profile Graphs using SAS. Sanjay Matange, SAS Institute, Inc.

Need a Scientific Journal Ready Graphic? No Problem!

Study Composer: a CRF design tool enabling the re-use of CDISC define.xml metadata

testo Comfort Software Professional 4 Instruction manual

From Clicking to Coding: Using ODS Graphics Designer as a Tool to Learn Graph Template Language

Models for Nurses: Quadratic Model ( ) Linear Model Dx ( ) x Models for Doctors:

Advanced Visualization using TIBCO Spotfire and SAS

Automate Clinical Trial Data Issue Checking and Tracking

The first thing we ll need is some numbers. I m going to use the set of times and drug concentration levels in a patient s bloodstream given below.

A Picture is worth 3000 words!! 3D Visualization using SAS Suhas R. Sanjee, Novartis Institutes for Biomedical Research, INC.

Creating Accessible, Section 508 Compliant Documents

Submission-Ready Define.xml Files Using SAS Clinical Data Integration Melissa R. Martinez, SAS Institute, Cary, NC USA

PharmaSUG China Big Insights in Small Data with RStudio Shiny Mina Chen, Roche Product Development in Asia Pacific, Shanghai, China

Tips and Guidance for Analyzing Data. Executive Summary

Advanced Graphs using Axis Tables

Entering and Managing Data in EvaluationWeb for CDC Program Announcement PS

North Carolina Community Colleges Business Intelligence Reporting System

SAS Graph a Million with the SGPLOT Procedure. Prashant Hebbar, Sanjay Matange

SAS Visual Analytics 8.2: Getting Started with Reports

Appendix C. Vernier Tutorial

An Introduction to Analysis (and Repository) Databases (ARDs)

KIN 147 Lab 02: Acceleration Data Analysis

Separate Text Across Cells The Convert Text to Columns Wizard can help you to divide the text into columns separated with specific symbols.

Scottish Improvement Skills

Guidance for building Study and CRF in OpenClinica

SAS Report Viewer 8.3 Documentation

MS SharePoint Tip Sheet

Entering and Managing Data in EvaluationWeb for CDC Program Announcement PS

Programming checks: Reviewing the overall quality of the deliverables without parallel programming

How to review a CRF - A statistical programmer perspective

Using Microsoft Excel to write SAS code Andrew Boyd, Quanticate, Edinburgh, UK

Create Flowcharts Using Annotate Facility. Priya Saradha & Gurubaran Veeravel

Figure 1. Table shell

Graphical Analysis of Data using Microsoft Excel [2016 Version]

MIS 0855 Data Science (Section 006) Fall 2017 In-Class Exercise (Day 18) Finding Bad Data in Excel

Integrated Clinical Systems, Inc. announces JReview 13.1 with new AE Incidence

addition + =5+C2 adds 5 to the value in cell C2 multiplication * =F6*0.12 multiplies the value in cell F6 by 0.12

Scientific Graphing in Excel 2013

Easy CSR In-Text Table Automation, Oh My

Displaying Multiple Graphs to Quickly Assess Patient Data Trends

Submission Guidelines

Welsh Clinical Communications Gateway (WCCG) User Guide

Customizing Survival Curves

Excel 2010 Charts and Graphs

A Breeze through SAS options to Enter a Zero-filled row Kajal Tahiliani, ICON Clinical Research, Warrington, PA

Clinical Data Visualization using TIBCO Spotfire and SAS

SAS (Statistical Analysis Software/System)

UIS USER GUIDE SEPTEMBER 2013 USER GUIDE FOR UIS.STAT (BETA)

SpineAnalyzer 3.2 Demonstrator Edition Installation Guide & Release Notes

What s New in Spotfire DXP 1.1. Spotfire Product Management January 2007

Heart Foundation South Australian Heart Maps

Statistical Analysis Using SPSS for Windows Getting Started (Ver. 2018/10/30) The numbers of figures in the SPSS_screenshot.pptx are shown in red.

PharmaSUG Paper PO10

The Power and Sample Size Application

Analysing crime data in Maps for Office and ArcGIS Online

CostX 6.6 Release Notes

Introduction to CS graphs and plots in Excel Jacek Wiślicki, Laurent Babout,

Integrating SAS and Non-SAS Tools and Systems for Behavioral Health Data Collection, Processing, and Reporting

SAS (Statistical Analysis Software/System)

A Taste of SDTM in Real Time

What could ODS graphics do about Box Plot?

Module I: Clinical Trials a Practical Guide to Design, Analysis, and Reporting 1. Fundamentals of Trial Design

One Project, Two Teams: The Unblind Leading the Blind

Institutional administrator guide. bestpractice.bmj.com

Studying in the Sciences

One-PROC-Away: The Essence of an Analysis Database Russell W. Helms, Ph.D. Rho, Inc.

Once the data warehouse is assembled, its customers will likely

Opening a Document: Printer Selection:

Remodeling Your Office A New Look for the SAS Add-In for Microsoft Office

Developing Graphical Standards: A Collaborative, Cross-Functional Approach Mayur Uttarwar, Seattle Genetics, Inc., Bothell, WA

Patient Portal User Guide The Patient s Guide to Using the Portal

Power BI 1 - Create a dashboard on powerbi.com... 1 Power BI 2 - Model Data with the Power BI Desktop... 1

San Francisco State University

Advantages of a real end-to-end approach with CDISC standards

Metadata integrated programming

Omron Health Management Software Ver 1.3 User s Help Manual. Copyright Omron Healthcare Co., Ltd. World Rights Reserved

How to Make a Line Graph with Phase Lines in Excel 2007

testo Comfort Software Professional 4 Instruction manual

Transcription:

Paper CT11 Fancy Data Visualisations Without Additional Toolkits Kirsty Parker-Hodds, Veramed, Twickenham, UK ABSTRACT Clinical trials often involve collecting data of varying quality from sites around the world. Clinicians and other study team members may not have immediate access to the study databases which can make visualising data difficult during the blinded phase of a study when looking for data issues. This paper presents a tool created using SAS that exports data into Excel in order to display individual patient profile plots of blood pressure data over time. This Excel tool can be used by any member of the study team and has drop-down menus to display graphs of different measurements for subjects. This tool has been extended to produce static PDF outputs of each patient profile for ease of viewing two study endpoints side by side. Simple adaptations of the code allows these tools to easily be adapted and replicated over multiple studies for different purposes of displaying data. INTRODUCTION This paper discusses the creation of two tools that enable study teams to visualise data during the blinded phase of a clinical trial. The paper highlights the importance of creating tools that are used by everyone in the study team without the need for fancy software. Instead, packages that are widely accessible and familiar to all team members can be used to display data to a high standard. The software used for viewing the tools discussed in this paper are Microsoft Excel and Adobe PDF Reader, neither of which are currently used to their full potential to display data in the pharmaceutical industry. THE PROBLEM Clinical trials often have varying quality of data. Data issues and queries can arise which require discussion amongst the study team. Data visualisation is very important when discussing data issues with other study team members and ensures that everyone fully understands the problem. Although programmers and statisticians have instant access to study data, this is usually not the case for the rest of the clinical team. During a recent study that collected longitudinal blood pressure data, it became clear that it is difficult to identify data issues when looking at single data points. Given the preceding and following values, a value might be valid for one subject but an outlier for another. One method for finding any potential data issues or outliers for data collected over time is to plot the data for each subject ensuring that all data points are roughly following an expected trend. THE TOOL A tool is required to plot data by subject making it easy for the whole study team to visualise the data. This ensures that data issues are found more efficiently and can then be discussed amongst all team members. For the tool to be successful it requires a package that is widely accessible and familiar to everyone in the team, so that it can be used to its full potential. The tool also requires good graphics, so that when data is plotted it is clear to see and well displayed. THE DATA This paper focuses on using tools to analyse subjects with abnormal data points in longitudinal data measuring blood pressure. Systolic and diastolic blood pressure are the study endpoints being considered in this paper. The tools can be applied to other studies analysing different end points. FIRST TOOL - EXCEL The first tool is created in Microsoft Excel. This software is part of Microsoft Office, it is widely available and is familiar to most people. The tool creates multiple tabs in Excel so that data can be displayed in numerous ways. The tabs include demographic variables and the type of data (either systolic or diastolic blood pressure) which can be filtered on and in turn fed into a cover page. The cover page displays data plotted by subject and type of blood pressure, chosen by using drop down menus. The cover page displays the chosen subject s demographics and a listing of the numeric data used to create the plot. When presenting this tool to the study team many advantages can be found: it displays data in a clear and simple manner, the use of drop down menus allows a reviewer to choose which subject and type of data to be viewed in detail, it is simple to code, full listings of data can be viewed on the different tabs and it is easy to adapt for use in another study. However, although data is displayed in an informative manner it is very time consuming to click between each subject and the specific type of blood pressure that needs to be reviewed. Therefore, finding subjects which need investigating due to abnormalities in their data is time consuming. 1

EXAMPLE Below is an example of the cover page which the Excel tool creates. The diastolic blood pressure at day 1144 (visit 14) clearly does not fit the trend for this subject. However, the reading of 98mmHg is perfectly valid for another subject. Therefore, this data point may not have necessarily stood out as a potential data issue when reviewing the total dataset. The screenshot below displays an example of the cover page of the Excel tool, with the subject s demographics displayed in the top left-hand corner and the type of blood pressure being investigated next to it. The example shows how the tool is used in a study examining blood pressure, where it is important to look at the data for both systolic and diastolic blood pressure. Therefore, a drop-down menu is created for both subject and type. The data used to create the plot is also displayed as a listing on the right-hand side of the page, so that it is easy to view the numerical data. The minimum, maximum, range and treatment start and end date are displayed at the bottom of the page. Figure 1: The Excel tool for a chosen subject and parameter CODE SAS is used to export the data into Excel, which is then used to program the tool. The main function used in the Excel file is VLOOKUP, this is used to find the column of data required from the tab containing the listing of all the data for subjects and different types of blood pressure. This selected column of data is then displayed on the cover page and is used to create the plot. When using VLOOKUP, error messages can occur if cells being selected are of different data types, so to avoid this an IFERROR formula can be used. This formula is IFERROR(IF(VLOOKUP(),NA(),VLOOKUP()),NA()). To create the drop-down menus for subject and type, a point and click method is used by going to data validation, changing Allow: to list and then clicking OK. Figure 2: Creating a drop-down menu in Excel 2

SECOND TOOL - ADOBE As previously discussed in this paper the Excel tool is informative and easy to use but is too time consuming to change between different subjects and types of blood pressure. Therefore, the tool is extended to PDF which can be viewed in Adobe Reader. Again, this is software most people are familiar with and feel comfortable using. By extending the tool to PDF it allowed static versions of the plots to be created. This made it quicker and more efficient for study team members to view the data for all subjects. The PDF tool enables two study endpoints to be viewed side by side for each subject, resulting in any potential data abnormalities or queries to be identified in a shorter period of time. The PDF tool included additional figure annotations such as displaying a data point in a different colour if it did not meet a certain criterion, identifying any issues with the blood pressure gauge for a subject at a visit and the addition of a vertical reference line at the screening visit and a horizontal reference line at the baseline value for each subject. It is highlighted in the plots when a new blood pressure gauge is used, this is important because different machines have slightly different calibrations. Additionally, tables are included underneath the figures to show the visit for each data point, the value at that visit, the change from baseline and the change from the previous visit. Some tips to the PDF tool are: Tip 1: If you want to quickly browse through the PDF pages to find subjects with remarkably different data points, then you should ensure the axes remain fixed over all pages. Tip 2: The plots dimensions can be adapted to perfectly fit a computer screen in full screen mode in Adobe Reader by using Ctrl+L. This adaptation enables an ease of viewing for study team members working on small laptop screens. Tip 3: In the PDF abnormal values are given a different colour for extra emphasis. Tip 4: It is very simple to adapt the code to work for other studies, simply change the dataset and variables being used. The main disadvantage of the PDF tool is the initial coding takes longer than the Excel tool. EXAMPLE It can be seen from the example below of the PDF tool that the type of data being measured and the subject s demographics are included in the title. The reference line for the screening and baseline visits can be seen on the figures. These reference lines enable the reader to see how much the data varies from the baseline value and whether the trend of the data for that subject stays consistent over the visits. Figure 3: One PDF page, for a chosen subject displaying two study endpoints side by side 3

CODE The code to produce the PDF tool is created in SAS. Some important parts of the code include: noautolegend - this suppresses the automatic creation of the legend, dattrmap - allows specification of the colours used depending on values in the data, refline - adds a horizontal reference line for the baseline value and a vertical reference line at the screening visit. The table at the bottom of the figures is created using xaxistable, this requires a dataset with the variables being displayed. This method of annotating figures has many formatting options which are easy to use, some of these include position, location and labelattrs. An example of the dataset used to create the table is shown in Figure 4. proc sgplot data=plots noautolegend dattrmap=myattrmap; where subject=&subject and type='diastolic Blood Pressure'; by subject type; title " Diastolic Blood Pressure Profile Plot Subject &subject"; title2 "Site:&pp_siteid (&pp_country) Investigator:&pp_invnam"; title3 "Age:&pp_age Gender:&pp_sex"; scatter x=day y=dxa_result / group=flag markerattrs=(symbol=circlefilled) attrid=myid; option nobyline; xaxis label="time (Days)" values=(-30 to 1270 by 50); yaxis label="blood Pressure (mmhg)" values=(0.2 to 1.4 by 0.2); refline 0 / axis=x lineattrs=(color=black pattern=dot); refline &pp_baseline / axis=y lineattrs=(color=black pattern=dot) label="baseline"; xaxistable visit_no reading_c chg_c chg_previous_visit_no_c/ position=bottom location=outside labelattrs= (size=8pt); run; Figure 4: An example of the dataset needed for xaxistable CONCLUSION In conclusion, both the Excel and PDF tools are informative and easy for the rest of the study team to use. The tools enable visualisation when identifying and discussing potential data abnormalities and queries. It is initially more time consuming to code the PDF tool in comparison to the Excel tool, but now both tools exist they would be easy to replicate for different types of studies by changing the dataset going into the code and the variables used. Each tool explores a different aspect of the data and so they are best used in parallel. The PDF tool can be used to quickly compare two study endpoints across multiple subjects. This can identify any data points that need further investigation and discussion with the rest of the study team. The Excel tool can then be used to look at the desired subject in detail as the tool contains more in-depth information. This enables a thorough discussion about any data queries within the study team. This paper has shown that fancy software is not needed to create tools that visually represent data well and the rest of the study team find easy to use. 4

ACKNOWLEDGMENTS I would like to thank Nick Cowans for his help and support. CONTACT INFORMATION Your comments and questions are valued and encouraged. Contact the author at: Kirsty Parker-Hodds Veramed Ltd 5th Floor Regal House 70 London Road Twickenham TW1 3QS 020 3696 7240 kirsty.parkerhodds@veramed.co.uk www.veramed.co.uk Brand and product names are trademarks of their respective companies. 5