Dynamic Data Integration in Clinical Research

Size: px
Start display at page:

Download "Dynamic Data Integration in Clinical Research"

Transcription

1 Dynamic Data Integration in Clinical Research Judith R. Logan, MD, MS Department of Medical Informatics & Clinical Epidemiology Oregon Health & Science University Buenos Aires, June 16, 2011

2 in collaboration with Lois M. L. Delcambre, PhD James F. Terwilliger, PhD Scott Britell Vandana Kapoor Gabe Buckmaster with funding support from Collins Medical Trust National Institutes of Health, National Library of Medicine

3 The problem Data, data, data How do you we integrate these silos of data for research? *

4 The motivation

5 National Endoscopic Database ProVation Medical

6 Creating an analytical dataset From a single data source Data extraction Data transformation (units, categorization, etc) Loading into an analytical database From multiple data sources Repeat the same steps for each source then integrate the data Integrate or federate the databases first, then follow those steps

7 Our principles: The data analyst (the domain expert)should make the integration decisions. To do this, the data analyst should have an easy to understand data schema information about the context in which the data was collected tools to make integration easy Integration decisions should be dynamic, made at the time of the study and not just once

8 Overview of GUAVA and MultiClass SQL data sources SQL DBA dashboard channels G-trees Analyst dashboard Select records/field s Classification of data Analytical dataset File GUI-As-View (GUAVA) MultiClass

9 The G-tree An XML representation of the natural schema with contextual annotations The natural schema often mimics the user interface in which the data is collected one table per screen The contextual annotations are information about how the data is collected, like Was the data element required? What were the options for a menu or option group? What was the text on the screen?

10 The G-tree

11 - Patient - COLONOSCOPY The G-tree in a display + History + Physical Exam + Liver Disease - Indications - Evaluation of suspected malignancy (Group) Hereditary Nonpolyposis Colorectal Cancer Familial Adenomatous Polyposis Ulcerative Colitis Crohn s disease + History of colorectal cancer - History of adenomatous polyps History of breast or ovarian cancer Option group; Yes, No Age at diagnosis + Family history of adenomatous polyps + Family history of colorectal cancer + Abnormal XRay Average risk + Evaluation of Blood Loss (Group) + Therapeutic Procedure (Group)

12 G-tree as a query screen

13 Great! How do I get a G-tree? SQL Annotations

14 Overview of GUAVA and MultiClass SQL data sources SQL DBA dashboard channels G-trees Analyst dashboard Select records/field s Classification of data Analytical dataset File GUI-As-View (GUAVA) MultiClass

15 What the database expert (DBA) does The channel is a series of transformations defined by the DBA that change the database schema to the natural schema Vertical Split (VSplit) Vertical Merge (VMerge) Horizontal Split (HSplit) Horizontal Merge (HMerge) Pivot Unpivot Function Application (Apply) Renaming

16 What the database expert (DBA) does A query to the natural schema is transformed in the channel to a query that the database understands. The transformed query is then run against the database, returning data matching the natural schema The data itself is not transformed The analyst never has to know what the database looks like

17 Why have a channel? Employee table, generic schema ID Attribute Value 1 Name Joe 1 Gender male 1 Exp 1 Employee table natural schema ID Name Age Gender Exp 1 Joe male 1 2 Sue Name Sue 2 Age 31 2 Exp 4

18 O VSplit Operator Before: Patient Patient After: Patient_Details Patient_ID SSN FirstName LastName HomePhone CellPhone Patient_ID Patient_ID SSN HomePhone FirstName CellPhone LastName VSplit(Patient, Patient_Details, {SSN, FirstName, LastName}) Represents the decision to split some of the columns into their own table 18

19 O VMerge Operator Patient Before: Patient_Details After: Patient Patient_ID Patient_ID SSN HomePhone FirstName CellPhone LastName VMerge(Patient, Patient_Details) Patient_ID SSN FirstName LastName HomePhone CellPhone Represents the decision to merge two tables into a single table

20 O Pivot Operator Before: Patient Patient_ID Attribute Value Pivot(Patient, Patient_ID) After: Patient Patient_ID SSN FirstName LastName HomePhone CellPhone Represents the decision to transform a generic table into a traditional table.

21 O Unpivot Operator Before: Patient Patient_ID SSN FirstName LastName HomePhone CellPhone After: Patient Patient_ID Attribute Value Unpivot(Patient) Represents the decision to transform a table into (key, attribute, value) rows 21

22 Putting operators together to build a channel Physical DB DDL/DML O1 O2 O3 O4 On Queries Results Natural Schema

23 Overview of GUAVA and MultiClass SQL data sources SQL DBA dashboard channels G-trees Analyst dashboard Select records/field s Classification of data Analytical dataset File GUI-As-View (GUAVA) MultiClass

24 The Analyst Dashboard The data analyst selects the G-tree, which automatically links to the channel file and data source and is displayed as a tree which makes formulation of queries easy The data analyst designs a query with inclusion/exclusion criteria and a data dictionary Using the G-tree, the analyst maps the query elements to the data source For integrating data, the query elements are mapped to more than one data source The analyst runs the query to obtain an analytical dataset The analyst saves the query to be run again if needed

25 The Analyst Dashboard The data analyst designs a query with inclusion/exclusion criteria and a data dictionary The data analyst selects the G-tree, which automatically links to the channel file and data source and is displayed as a tree which makes formulation of queries easy Using the G-tree, the analyst maps the query elements to the data source For integrating data, the query elements are mapped to more than one data source The analyst runs the query to obtain an analytical dataset The analyst saves the query to be run again if needed

26 1. Define inclusion/exclusion criteria PolypIncidence.owl Study Inclusion criteria Exclusion criteria Data dictionary Data sources Mapping Results Recent Studies Filters allow you to limit the rows to be included in the result set. PolypIncidence.owl PolypRemoval2008.owl CecalIntubationRate.owl SedMedDocumentation.ow Name l Options Description Procs2008 Include colonoscopies performed in 2008 only AgeGT18 Include data on patients greater than age 18 only Exit Inclusion criteria Edit Save Delete Delete

27 2. Build the data dictionary PolypIncidence.owl Study Filters Inclusion criteria Data Exclusion criteria elements Data dictionary Data sources Mapping Results Recent Studies Data elements describe the columns to be included in the result set. PolypIncidence.owl PolypRemoval2008.owl CecalIntubationRate.owl Name Description Null? SedMedDocumentation.ow l Age Polyp Depth Options Age at time of procedure, may be calculated as date of procedure - date of birth Polyp found on examination Value Data Dictionary Exit Description Depth reached on examination Yes Value Description proximal terminal ileum to cecum Edit Delete distal cecum to rectum Save Delete Done

28 3. Select one or more data sources PolypIncidence.owl Study Inclusion criteria Exclusion criteria Data dictionary Data sources Recent Studies Select data sources to be used in the study. PolypIncidence.owl PolypRemoval2008.owl Open data source file: CecalIntubationRate.owl SedMedDocumentation.ow l Open data source file: Data Sources CORIv4.guv ProVation.guv Edit Edit Delete Delete Mapping Open data source file: Browse Results Add data source Options Exit Done

29 4. Map the elements to each data source PolypIncidence.owl CORIv4 Provider Patient dateofbirth gender Procedure Egd Colonoscopy procdate Procedure Recent Studies depthreached Finding Polyp Tumor Multiple polyps PolypIncidence.owl PolypRemoval2008.owl Depth = Proximal CecalIntubationRate.owl SedMedDocumentation.ow l Options depthreached Depth = Distal depthreached Mapping: Depth Data Element Exit in in cecum ascending colon hepatic flexure transverse colon splenic flexure descending colon sigmoid colon rectum cecum ascending colon hepatic flexure transverse colon splenic flexure descending colon sigmoid colon rectum

30 5. Run the query SQL data sources SQL channels GUAVA server G-trees Analyst dashboard Select records/field s Classification of data Analytical dataset File

31 What happens when the analyst clicks Run? The query is formulated using relational algebra and sent to the GUAVA server The GUAVA server applies the query to one end of the channel The query is transformed in the channel to an equivalent query in the appropriate database language (components in the channel are specific to the database system)

32 What happens when the analyst clicks Run? The transformed query is run against the database and retrieves all data, which are returned to the GUAVA server The data transformations (the mappings) are then applied The data sets are integrated The integrated data is sent to the data analyst who outputs it to a statistical package.

33 Saving your work This could be a lot of work. What if you want to run the study again? Or modify it a little? The study (the query) can be saved we have demonstrated the ability to save it as an Operational Data Model (ODM) file this file can be imported and rerun or modified

34 How have we used this? We have already seen the motivating example (CORI). Currently CORI has over 2 million records in the National Endoscopic Database and is beginning to receive ProVation data. The CORI data is from two versions We have demonstrated the ability to integrate data from a study on breast cancer, part captured electronically and part with scannable paper forms

35 SQL SQL File Our goal Integrate with statistical software Deploy in a grid computing environment such as the National Cancer Institute cagrid project

36 Publications Terwilliger JF, Delcambre LML, Logan J. The User Interface is the Conceptual Model. Proc 25 th International Conference on Conceptual Modeling (ER2006), Tucson, AZ. November 6-9, 2006, Logan JR, Terwilliger JF, Delcambre LML. Exploiting the User Interface for Tomorrow's Clinical Data Analysis. Journal on Information Technology in Healthcare, April 2008, 6(2): Reprinted from Today s Information for Tomorrow s Improvements 2007, an international conference addressing Information Technology and Communications in Health (ITCH 2007). Terwilliger JF, Delcambre LML, Logan JR. Querying Through a User Interface. Data and Knowledge Engineering, 2007;63: Logan JR, Britell S, Delcambre LML, Kapoor V, Buckmaster JG. Representing Multi- Database Study Schemas for Reusability. Proc AMIA Summits Trans Sci : PMC Terwilliger JF, LML Delcambre, JR Logan, D Maier, DW Archer, J Steinhauer, S Britell. Enabling the Revisitation of Fine-Grained, Clinical Information. Proc of the 1 st ACM International Health Informatics Symp (IHI '10). 2010;: Britell S, JG Buckmaster, JR Logan, LML Delcambre, V Kapoor. Providing Domain Analysts with Natural Schemas to Enable Clinical Research. Accepted for publication: Journal of Biomedical Informatics.

Context-Sensitive Clinical Data Integration

Context-Sensitive Clinical Data Integration Context-Sensitive Clinical Data Integration James F. Terwilliger 1,LoisM.L.Delcambre 1, and Judith Logan 2 1 Department of Computer Science Portland State University, Portland OR 97207, USA {jterwill,

More information

Translating an ER Diagram to a Relational Schema

Translating an ER Diagram to a Relational Schema Translating an ER Diagram to a Relational Schema CS386/586 Introduction to Database Systems, Lois Delcambre 1999-2009 Slide 1 Translate each entity set into a table, with keys. Entity set: represented

More information

The IEEE Metadata Standard for Supporting Big Data Management

The IEEE Metadata Standard for Supporting Big Data Management The IEEE Metadata Standard for Supporting Big Data Management Alex MH Kuo 1,2 (Ph.D) 1 School of Health Information Science University of Victoria, BC, Canada. 2 CEDAR, School of Medicine University of

More information

VS2010 C# Programming - DB intro 1

VS2010 C# Programming - DB intro 1 VS2010 C# Programming - DB intro 1 Topics Database Relational - linked tables SQL ADO.NET objects Referencing Data Using the Wizard Displaying data 1 VS2010 C# Programming - DB intro 2 Database A collection

More information

Interoperability and Semantics in Use- Application of UML, XMI and MDA to Precision Medicine and Cancer Research

Interoperability and Semantics in Use- Application of UML, XMI and MDA to Precision Medicine and Cancer Research Interoperability and Semantics in Use- Application of UML, XMI and MDA to Precision Medicine and Cancer Research Ian Fore, D.Phil. Associate Director, Biorepository and Pathology Informatics Senior Program

More information

Collecting Public Health Data at a Global Scale. D. Cenk Erdil, PhD Marist College

Collecting Public Health Data at a Global Scale. D. Cenk Erdil, PhD Marist College Collecting Public Health Data at a Global Scale D. Cenk Erdil, PhD Marist College DataCloud 2015 November 15, 2015 Outline Background A Public Health CPS Data Collection Marist MAgIC Summary Q&A 2 Background

More information

INTRODUCTION to SAS STATISTICAL PACKAGE LAB 3

INTRODUCTION to SAS STATISTICAL PACKAGE LAB 3 Topics: Data step Subsetting Concatenation and Merging Reference: Little SAS Book - Chapter 5, Section 3.6 and 2.2 Online documentation Exercise I LAB EXERCISE The following is a lab exercise to give you

More information

NOTICE: This is the author s version of a work that was accepted for publication

NOTICE: This is the author s version of a work that was accepted for publication NOTICE: This is the author s version of a work that was accepted for publication in Journal of Biomedical Informatics. Changes resulting from the publishing process, such as peer review, editing, corrections,

More information

The BIRO software. 1st EUBIROD Annual Meeting, Dasman Centre, Kuwait City

The BIRO software. 1st EUBIROD Annual Meeting, Dasman Centre, Kuwait City The BIRO software 1st EUBIROD Annual Meeting, Dasman Centre, Kuwait City BIROBox BIRO Box: a Graphical User Interface to configure and run all BIRO software tools for the local BIRO system XML files BIROBox

More information

Supporting Imprecision in Multidimensional Databases Using Granularities

Supporting Imprecision in Multidimensional Databases Using Granularities Supporting Imprecision in Multidimensional Databases Using Granularities Torben Bach Pedersen Christian S. Jensen Curtis E. Dyreson Center for Health Information Services, Kommunedata, P.O. Pedersens Vej

More information

HEREDITARY COLORECTAL CANCER: INFORMATION-BASED APPROACH. Elena Manilich. Submitted in partial fulfillment of the requirements

HEREDITARY COLORECTAL CANCER: INFORMATION-BASED APPROACH. Elena Manilich. Submitted in partial fulfillment of the requirements !!!! HEREDITARY COLORECTAL CANCER: INFORMATION-BASED APPROACH By Elena Manilich Submitted in partial fulfillment of the requirements For the degree of Doctor of Philosophy Dissertation Adviser: Dr. Z.

More information

Create a SAS Program to create the following files from the PREC2 sas data set created in LAB2.

Create a SAS Program to create the following files from the PREC2 sas data set created in LAB2. Topics: Data step Subsetting Concatenation and Merging Reference: Little SAS Book - Chapter 5, Section 3.6 and 2.2 Online documentation Exercise I LAB EXERCISE The following is a lab exercise to give you

More information

The Beau Biden Cancer Moonshot and the NCI Blue Ribbon Panel Recommendations. NAACCR Registry of the Future Presentation June 19, 2017

The Beau Biden Cancer Moonshot and the NCI Blue Ribbon Panel Recommendations. NAACCR Registry of the Future Presentation June 19, 2017 The Beau Biden Cancer Moonshot and the NCI Blue Ribbon Panel Recommendations NAACCR Registry of the Future Presentation June 19, 2017 The Beau Biden Cancer Moonshot Accelerate progress in cancer, including

More information

Graphical User Interfaces as Updatable Views

Graphical User Interfaces as Updatable Views Portland State University PDXScholar Dissertations and Theses Dissertations and Theses 2009 Graphical User Interfaces as Updatable Views James Felger Terwilliger Portland State University Let us know how

More information

January 16, Re: Request for Comment: Data Access and Data Sharing Policy. Dear Dr. Selby:

January 16, Re: Request for Comment: Data Access and Data Sharing Policy. Dear Dr. Selby: Dr. Joe V. Selby, MD, MPH Executive Director Patient-Centered Outcomes Research Institute 1828 L Street, NW, Suite 900 Washington, DC 20036 Submitted electronically at: http://www.pcori.org/webform/data-access-and-data-sharing-policypublic-comment

More information

In-Memory Technology in Life Sciences

In-Memory Technology in Life Sciences in Life Sciences Dr. Matthieu-P. Schapranow In-Memory Database Applications in Healthcare 2016 Apr Intelligent Healthcare Networks in the 21 st Century? Hospital Research Center Laboratory Researcher Clinician

More information

Original Article (HNPCC) Downloaded from tmuj.iautmu.ac.ir at 19: on Friday March 22nd Log rank

Original Article (HNPCC) Downloaded from tmuj.iautmu.ac.ir at 19: on Friday March 22nd Log rank > >>?> @ >> @ 44 40 > @?>? @>@? @ @>??> 4 > >@? 4 3 2 > < E @> > @ >> 2 @ @ >> < @? < @??> >

More information

The NIH Collaboratory Distributed Research Network: A Privacy Protecting Method for Sharing Research Data Sets

The NIH Collaboratory Distributed Research Network: A Privacy Protecting Method for Sharing Research Data Sets The NIH Collaboratory Distributed Research Network: A Privacy Protecting Method for Sharing Research Data Sets Jeffrey Brown, Lesley Curtis, and Rich Platt June 13, 2014 Previously The NIH Collaboratory:

More information

CTSI Module 8 Workshop Introduction to Biomedical Informatics, Part V

CTSI Module 8 Workshop Introduction to Biomedical Informatics, Part V CTSI Module 8 Workshop Introduction to Biomedical Informatics, Part V Practical Tools: Data Processing & Analysis William Hsu, PhD Assistant Professor Medical Imaging Informatics Group Dept of Radiological

More information

Figure 1: Relationship among Three Entities/Tables

Figure 1: Relationship among Three Entities/Tables Creating a Relational Database in Access Farrokh Alemi, Ph.D. The objective of this note is to help you understand how a relational database is organized as a collection of tables, linked to each other.

More information

Anomaly Detection. You Chen

Anomaly Detection. You Chen Anomaly Detection You Chen 1 Two questions: (1) What is Anomaly Detection? (2) What are Anomalies? Anomaly detection refers to the problem of finding patterns in data that do not conform to expected behavior

More information

GRID COMPUTING IN MEDICAL APPLICATIONS

GRID COMPUTING IN MEDICAL APPLICATIONS GRID COMPUTING IN MEDICAL APPLICATIONS P. Cerello, INFN, Sezione di Torino, Torino, Italy. Abstract Medical Applications can exploit GRID Services in many ways: some of them are computing intensive and

More information

DATABASE MANAGERS. Basic database queries. Open the file Pfizer vs FDA.mdb, then double click to open the table Pfizer payments.

DATABASE MANAGERS. Basic database queries. Open the file Pfizer vs FDA.mdb, then double click to open the table Pfizer payments. DATABASE MANAGERS We ve already seen how spreadsheets can filter data and calculate subtotals. But spreadsheets are limited by the amount of data they can handle (about 65,000 rows for Excel 2003). Database

More information

Interoperability. Doug Fridsma, MD PhD President & CEO, AMIA

Interoperability. Doug Fridsma, MD PhD President & CEO, AMIA Interoperability Doug Fridsma, MD PhD President & CEO, AMIA Getting to Interoperability To get to interoperability (or to avoid information blocking) we need a common understanding of the problem Can t

More information

ICTR UW Institute of Clinical and Translational Research. i2b2 User Guide. Version 1.0 Updated 9/11/2017

ICTR UW Institute of Clinical and Translational Research. i2b2 User Guide. Version 1.0 Updated 9/11/2017 ICTR UW Institute of Clinical and Translational Research i2b2 User Guide Version 1.0 Updated 9/11/2017 Table of Contents Background/Search Criteria... 2 Accessing i2b2... 3 Navigating the Workbench...

More information

SEMANTIC SUPPORT FOR MEDICAL IMAGE SEARCH AND RETRIEVAL

SEMANTIC SUPPORT FOR MEDICAL IMAGE SEARCH AND RETRIEVAL SEMANTIC SUPPORT FOR MEDICAL IMAGE SEARCH AND RETRIEVAL Wang Wei, Payam M. Barnaghi School of Computer Science and Information Technology The University of Nottingham Malaysia Campus {Kcy3ww, payam.barnaghi}@nottingham.edu.my

More information

EVIDENCE SEARCHING IN EBM. By: Masoud Mohammadi

EVIDENCE SEARCHING IN EBM. By: Masoud Mohammadi EVIDENCE SEARCHING IN EBM By: Masoud Mohammadi Steps in EBM Auditing the outcome Defining the question or problem Applying the results Searching for the evidence Critically appraising the literature Clinical

More information

Alpha 1 i2b2 User Guide

Alpha 1 i2b2 User Guide Alpha 1 i2b2 User Guide About i2b2 Accessing i2b2 Data Available in i2b2 Navigating the Workbench Workbench Screen Layout Additional Workbench Features Creating and Running a Query Creating a Query Excluding

More information

Executive Summary for deliverable D6.1: Definition of the PFS services (requirements, initial design)

Executive Summary for deliverable D6.1: Definition of the PFS services (requirements, initial design) Electronic Health Records for Clinical Research Executive Summary for deliverable D6.1: Definition of the PFS services (requirements, initial design) Project acronym: EHR4CR Project full title: Electronic

More information

Pivoting Entity-Attribute-Value data Using MapReduce for Bulk Extraction

Pivoting Entity-Attribute-Value data Using MapReduce for Bulk Extraction Pivoting Entity-Attribute-Value data Using MapReduce for Bulk Extraction Augustus Kamau Faculty Member, Computer Science, DeKUT Overview Entity-Attribute-Value (EAV) data Pivoting MapReduce Experiment

More information

The Charité University Hospital Berlin is one of the largest University Hospitals in Europe.

The Charité University Hospital Berlin is one of the largest University Hospitals in Europe. 1 2 The Charité University Hospital Berlin is one of the largest University Hospitals in Europe. It has a yearly revenue of 1.3 Billion Euros of which 158 million euros comes from outside funding. There

More information

Renae Barger, Executive Director NN/LM Middle Atlantic Region

Renae Barger, Executive Director NN/LM Middle Atlantic Region Renae Barger, Executive Director NN/LM Middle Atlantic Region rbarger@pitt.edu http://nnlm.gov/mar/ DANJ Meeting, November 4, 2011 Advanced PubMed (20 min) General Information PubMed Citation Types Automatic

More information

User Manual. phr.mtbc.com

User Manual. phr.mtbc.com User Manual Table of Contents Introduction Appointments Appointment History Claims History CCDA Report Demographics Health History Lab Reports Online Payment Secure Messages Health Recommendation Patient

More information

WHO ICD11 Wiki LexWiki, Semantic MediaWiki and the International Classification of Diseases

WHO ICD11 Wiki LexWiki, Semantic MediaWiki and the International Classification of Diseases WHO ICD11 Wiki LexWiki, Semantic MediaWiki and the International Classification of Diseases Guoqian Jiang, PhD Harold Solbrig Division of Biomedical Statistics and Informatics Mayo Clinic College of Medicine

More information

School of Computing and Information Technology. Examination Paper Autumn Session 2017

School of Computing and Information Technology. Examination Paper Autumn Session 2017 School of Computing and Information Technology CSIT115 Data Management and Security Wollongong Campus Student to complete: Family name Other names Student number Table number Examination Paper Autumn Session

More information

ICPC-2e-v.4.2_SQL Technical Guide

ICPC-2e-v.4.2_SQL Technical Guide (version 4.2.4) ICPC-2e-v.4.2_SQL Technical Guide This document guides you in the use of the distributed SQL database and the zipped data files. Version 4.2.4 Terje Sagen MD The Norwegian Directorate of

More information

Data-Transformation on historical data using the RDF Data Cube Vocabulary

Data-Transformation on historical data using the RDF Data Cube Vocabulary Data-Transformation on historical data using the RD Data Cube Vocabulary Sebastian Bayerl, Michael Granitzer Department of Media Computer Science University of Passau SWIB15 Semantic Web in Libraries 22.10.2015

More information

DSE 203 DAY 1: REVIEW OF DBMS CONCEPTS

DSE 203 DAY 1: REVIEW OF DBMS CONCEPTS DSE 203 DAY 1: REVIEW OF DBMS CONCEPTS Data Models A specification that precisely defines The structure of the data The fundamental operations on the data The logical language to specify queries on the

More information

i2b2 User Guide University of Minnesota Clinical and Translational Science Institute

i2b2 User Guide University of Minnesota Clinical and Translational Science Institute Clinical and Translational Science Institute i2b2 User Guide i2b2 is a tool for discovering research cohorts using existing, de-identified, clinical data This guide is provided by the Office of Biomedical

More information

International Journal of Scientific & Engineering Research, Volume 7, Issue 2, February ISSN

International Journal of Scientific & Engineering Research, Volume 7, Issue 2, February ISSN International Journal of Scientific & Engineering Research, Volume 7, Issue 2, February-2016 1271 Design and Implementing Cancer Data Warehouse to Support Clinical Decisions Alaa Khalaf Hamoud, Dr. Hasan

More information

2017 GridGain Systems, Inc. In-Memory Performance Durability of Disk

2017 GridGain Systems, Inc. In-Memory Performance Durability of Disk In-Memory Performance Durability of Disk Meeting the Challenges of Fast Data in Healthcare with In-Memory Technologies Akmal Chaudhri Technology Evangelist GridGain Agenda Introduction Fast Data in Healthcare

More information

CloudExpo November 2017 Tomer Levi

CloudExpo November 2017 Tomer Levi CloudExpo November 2017 Tomer Levi About me Full Stack Engineer @ Intel s Advanced Analytics group. Artificial Intelligence unit at Intel. Responsible for (1) Radical improvement of critical processes

More information

Keh-Dong Shiang, Department of Biostatistics & Department of Diabetes, City of Hope National Medical Center, Duarte, CA

Keh-Dong Shiang, Department of Biostatistics & Department of Diabetes, City of Hope National Medical Center, Duarte, CA Validating Data Via PROC SQL Keh-Dong Shiang, Department of Biostatistics & Department of Diabetes, City of Hope National Medical Center, Duarte, CA ABSTRACT The Structured Query Language (SQL) is a standardized

More information

An Overview of various methodologies used in Data set Preparation for Data mining Analysis

An Overview of various methodologies used in Data set Preparation for Data mining Analysis An Overview of various methodologies used in Data set Preparation for Data mining Analysis Arun P Kuttappan 1, P Saranya 2 1 M. E Student, Dept. of Computer Science and Engineering, Gnanamani College of

More information

DATA VALIDATION RESOURCES RCHC Data Group Webinar By Ben Fouts MPH November 14, 2017

DATA VALIDATION RESOURCES RCHC Data Group Webinar By Ben Fouts MPH November 14, 2017 DATA VALIDATION RESOURCES RCHC Data Group Webinar By Ben Fouts MPH November 14, 2017 Agenda 1. Documents on new RCHC Peer Collaboration web page 2. Data Validation Resources (Introduction to the new version

More information

Table of Contents. Page 1 of 51

Table of Contents. Page 1 of 51 Table of Contents Introduction/Background/Search Criteria...2 Accessing i2b2.3 Navigating the Workbench..14 Resize the Workspace 17 Constructing and Running a Query.18 Selecting Query Criteria. 18 Building

More information

Searching PubMed. Enter your concepts into the search box and click Search. Your results are displayed below the search box.

Searching PubMed. Enter your concepts into the search box and click Search. Your results are displayed below the search box. Searching PubMed UCL Library Services, Gower St., London WC1E 6BT 020 7679 7700 E-mail: library@ucl.ac.uk http://www.ucl.ac.uk/library/ 1. What is PubMed? http://www.pubmed.gov PubMed is a free interface

More information

isystoc User Guide

isystoc User Guide isystoc 4.3.1 User Guide Table of Contents isystoc 4.3.1 User Guide...4 About isystoc...5 Logging into isystoc...6 Automatic Logout / Refresh...7 isystoc Icons...8 Using isystoc...9 Search Options...10

More information

Contents (click on a header to go to that section)

Contents (click on a header to go to that section) PPM+ Unplanned Patient List (UPL) User Guide Contents (click on a header to go to that section) Unplanned Patient List (UPL)... 2 Example UPL... 2 Adding a Patient to the UPL... 3 Updating UPL Details...

More information

REDCap User Guide 3 Entering Patient Data on REDCap

REDCap User Guide 3 Entering Patient Data on REDCap REDCap User Guide 3 Entering Patient Data on REDCap Prerequisite: We also use REDCap to collect patient data, and that is done within the Data project. Steps: 1. First go to globalsurg.org/gs3, and this

More information

AUTOMATED REAL-TIME OBJECTS DETECTION IN COLONOSCOPY VIDEOS FOR QUALITY MEASUREMENTS. Muthukudage Jayantha Kumara

AUTOMATED REAL-TIME OBJECTS DETECTION IN COLONOSCOPY VIDEOS FOR QUALITY MEASUREMENTS. Muthukudage Jayantha Kumara AUTOMATED REAL-TIME OBJECTS DETECTION IN COLONOSCOPY VIDEOS FOR QUALITY MEASUREMENTS Muthukudage Jayantha Kumara Dissertation Prepared for the Degree of DOCTOR OF PHILOSOPHY UNIVERSITY OF NORTH TEXAS August

More information

Magic-5. Medical Applications in a GRID Infrastructure Connection. Ivan De Mitri* on behalf of MAGIC-5 collaboration

Magic-5. Medical Applications in a GRID Infrastructure Connection. Ivan De Mitri* on behalf of MAGIC-5 collaboration Magic-5 Medical Applications in a GRID Infrastructure Connection * on behalf of MAGIC-5 collaboration *Dipartimento di Fisica dell Università di Lecce and INFN Lecce, Italy ivan.demitri@le.infn.it HEALTHGRID

More information

Where Are We? Next Few Lectures. Integrity Constraints Motivation. Constraints in E/R Diagrams. Keys in E/R Diagrams

Where Are We? Next Few Lectures. Integrity Constraints Motivation. Constraints in E/R Diagrams. Keys in E/R Diagrams Where Are We? Introduction to Data Management CSE 344 Lecture 15: Constraints We know quite a bit about using a DBMS Start with real-world problem, design ER diagram From ER diagram to relations -> conceptual

More information

ehealth action in the EU

ehealth action in the EU ehealth action in the EU ehealth for smart and inclusive growth 13 February 2014 Jerome Boehm DG SANCO ehealth and Health Technology Assessment General Health Objectives of the EU cooperation on ehealth

More information

The HUMANE roadmaps towards future human-machine networks Oxford, UK 21 March 2017

The HUMANE roadmaps towards future human-machine networks Oxford, UK 21 March 2017 The HUMANE roadmaps towards future human-machine networks Oxford, UK 21 March 2017 Eva Jaho, ATC e.jaho@atc.gr 1 Outline HMNs Trends: How are HMNs evolving? The need for future-thinking and roadmaps of

More information

Oracle Database 10g Express

Oracle Database 10g Express Oracle Database 10g Express This tutorial prepares the Oracle Database 10g Express Edition Developer to perform common development and administrative tasks of Oracle Database 10g Express Edition. Objectives

More information

The URL of the whole system is:

The URL of the whole system is: &6)LQDO3URMHFW )DOO %RRN6WRUH'DWDEDVH :HE%DVHG 0DQDJHPHQW,PSOHPHQWDWLRQ 7R 'U6RQ&DR7UDQ )URP 7HDP %R'X

More information

NCI Thesaurus, managing towards an ontology

NCI Thesaurus, managing towards an ontology NCI Thesaurus, managing towards an ontology CENDI/NKOS Workshop October 22, 2009 Gilberto Fragoso Outline Background on EVS The NCI Thesaurus BiomedGT Editing Plug-in for Protege Semantic Media Wiki supports

More information

DB Fundamentals Exam.

DB Fundamentals Exam. IBM 000-610 DB2 10.1 Fundamentals Exam TYPE: DEMO http://www.examskey.com/000-610.html Examskey IBM 000-610 exam demo product is here for you to test the quality of the product. This IBM 000-610 demo also

More information

IBM InfoSphere Information Server Version 8 Release 7. Reporting Guide SC

IBM InfoSphere Information Server Version 8 Release 7. Reporting Guide SC IBM InfoSphere Server Version 8 Release 7 Reporting Guide SC19-3472-00 IBM InfoSphere Server Version 8 Release 7 Reporting Guide SC19-3472-00 Note Before using this information and the product that it

More information

A Vision for Bigger Biomedical Data: Integration of REDCap with Other Data Sources

A Vision for Bigger Biomedical Data: Integration of REDCap with Other Data Sources A Vision for Bigger Biomedical Data: Integration of REDCap with Other Data Sources Ram Gouripeddi Assistant Professor, Department of Biomedical Informatics, University of Utah Senior Biomedical Informatics

More information

Acquiring Experience with Ontology and Vocabularies

Acquiring Experience with Ontology and Vocabularies Acquiring Experience with Ontology and Vocabularies Walt Melo Risa Mayan Jean Stanford The author's affiliation with The MITRE Corporation is provided for identification purposes only, and is not intended

More information

BioGrid Australia - Health Through Information

BioGrid Australia - Health Through Information Images and Oracle Database 11g BioGrid Australia - Health Through Information PRANABH JAIN and NAOMI RAFAEL Presented by Susan Mavris, Oracle Multimedia Agenda Purpose and Description of BioGrid Oracle

More information

Hyperion Interactive Reporting Reports & Dashboards Essentials

Hyperion Interactive Reporting Reports & Dashboards Essentials Oracle University Contact Us: +27 (0)11 319-4111 Hyperion Interactive Reporting 11.1.1 Reports & Dashboards Essentials Duration: 5 Days What you will learn The first part of this course focuses on two

More information

Hanming Tu, Accenture, Berwyn, USA

Hanming Tu, Accenture, Berwyn, USA Hanming Tu, Accenture, Berwyn, USA Agenda Issue Statement Create Mapping Build Reusable Codes Define Repeatable Workflow Check compliance Conclusion Copyright 2016 Accenture. All rights reserved. 2 Issue

More information

National metadata repository for databases of registers and trials

National metadata repository for databases of registers and trials National metadata repository for databases of registers and trials IBE, Medical Faculty, Ludwig-Maximilians-Universität München, Germany Institute for Medical Informatics (IMISE), Universität Leipzig,

More information

OpenClinica - Data Import & Export

OpenClinica - Data Import & Export OpenClinica - Data Import & Export Purpose of this document: To provide instruction and best practice guidelines for the import and export of study data into and out of OpenClinica. Version: OpenClinica

More information

ACS-3902 Fall Ron McFadyen 3D21 Slides are based on chapter 5 (7 th edition) (chapter 3 in 6 th edition)

ACS-3902 Fall Ron McFadyen 3D21 Slides are based on chapter 5 (7 th edition) (chapter 3 in 6 th edition) ACS-3902 Fall 2016 Ron McFadyen 3D21 ron.mcfadyen@acs.uwinnipeg.ca Slides are based on chapter 5 (7 th edition) (chapter 3 in 6 th edition) 1 The Relational Data Model and Relational Database Constraints

More information

Annotating PDFs for ereturn version 1.0; August 28, 2007; full release

Annotating PDFs for ereturn version 1.0; August 28, 2007; full release Annotating PDFs for ereturn version 1.0; August 28, 2007; full release 1. Introduction eproof files are self-contained PDF documents for viewing on-screen and for printing. They contain all appropriate

More information

DONE! You can now close the browser.

DONE! You can now close the browser. Visit My Doctor Online at kp.org/mydoctor. Prepare for your visit This form will help you prepare for your upcoming visit with your doctor. You can complete it on your computer (Mac or PC) and e-mail it

More information

Physician Quality Reporting System Program Year Group Practice Reporting Option (GPRO) Web Interface XML Specification

Physician Quality Reporting System Program Year Group Practice Reporting Option (GPRO) Web Interface XML Specification Centers for Medicare & Medicaid Services CMS expedited Life Cycle (XLC) Physician Quality Reporting System Program Year 2013 Group Practice Reporting Option (GPRO) Web Interface XML Specification Version:

More information

Towards Comprehensive Longitudinal Healthcare Data Capture

Towards Comprehensive Longitudinal Healthcare Data Capture Towards Comprehensive Longitudinal Healthcare Data Capture Delroy Cameron *, Varun Bhagwan +, Amit P. Sheth * Kno.e.sis Center, Wright State University * Dayton OH 45324 IBM Almaden Research Center + San

More information

DESIGN AND IMPLEMENTATION OF AN AUTOMATION MAPPING

DESIGN AND IMPLEMENTATION OF AN AUTOMATION MAPPING DESIGN AND IMPLEMENTATION OF AN AUTOMATION TOOL FOR HL7 RIM-TO-RELATIONAL DATABASE MAPPING Shagufta Umer, Muhammad Afzal, Maqbool Hussain, Hafiz Farooq Ahmad, Khalid Latif Presenter NUST SCHOOL OF ELECTRICAL

More information

Available Online through

Available Online through Available Online through www.ijptonline.com ISSN: 0975-766X CODEN: IJPTFI Research Article ANALYSIS OF CT LIVER IMAGES FOR TUMOUR DIAGNOSIS BASED ON CLUSTERING TECHNIQUE AND TEXTURE FEATURES M.Krithika

More information

Considerations of Analysis of Healthcare Claims Data

Considerations of Analysis of Healthcare Claims Data Considerations of Analysis of Healthcare Claims Data ABSTRACT Healthcare related data is estimated to grow exponentially over the next few years, especially with the growing adaptation of electronic medical

More information

Efficient Data Distribution in CDBMS based on Location and User Information for Real Time Applications

Efficient Data Distribution in CDBMS based on Location and User Information for Real Time Applications ISSN (e): 2250 3005 Volume, 05 Issue, 03 March 2015 International Journal of Computational Engineering Research (IJCER) Efficient Data Distribution in CDBMS based on Location and User Information for Real

More information

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E Powering Knowledge Discovery Insights from big data with Linguamatics I2E Gain actionable insights from unstructured data The world now generates an overwhelming amount of data, most of it written in natural

More information

Introduction to Azure DocumentDB. Jeff Renz, BI Architect RevGen Partners

Introduction to Azure DocumentDB. Jeff Renz, BI Architect RevGen Partners Introduction to Azure DocumentDB Jeff Renz, BI Architect RevGen Partners Thank You Presenting Sponsors Gain insights through familiar tools while balancing monitoring and managing user created content

More information

Bridging Clinical and Research Data through Informatics

Bridging Clinical and Research Data through Informatics Bridging Clinical and Research Data through Informatics Bas de Veer MS University of Washington ITHS Introduction One of 62 CTSA sites nationally Approximately 160 faculty and staff More than 6,000 researchers

More information

REDCap under FHIR Enhancing electronic data capture with FHIR capability

REDCap under FHIR Enhancing electronic data capture with FHIR capability REDCap under FHIR Enhancing electronic data capture with FHIR capability Hugo Leroux Research Scientist 09 August 2017 THE AUSTRALIAN E-HEALTH RESEARCH CENTRE, HEALTH AND BIOSECURITY Rationale Clinical

More information

Secure Enterprise Access to Support Collaboration on Clinical Research

Secure Enterprise Access to Support Collaboration on Clinical Research Secure Enterprise Access to Support Collaboration on Clinical Research Oracle9iR2 Database Security Oracle World, Sept. 9, 2003 Nitin Sawhney, Ph.D. Cal Collins and Tom Hickerson Akaza Research, Cambridge,

More information

Building a Data Strategy for a Digital World

Building a Data Strategy for a Digital World Building a Data Strategy for a Digital World Jason Hunter, CTO, APAC Data Challenge: Pushing the Limits of What's Possible The Art of the Possible Multiple Government Agencies Data Hub 100 s of Service

More information

Calder Library Resident Orientation

Calder Library Resident Orientation University of Miami Scholarly Repository Faculty Research, Publications, and Presentations Department of Health Informatics 1-1-2015 Calder Library Resident Orientation Carmen Bou-Crick University of Miami

More information

A Pipeline for Computer Aided Polyp Detection and Segmentation in Ct Colonography

A Pipeline for Computer Aided Polyp Detection and Segmentation in Ct Colonography IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-issn: 2278-2834,p- ISSN: 2278-8735.Volume 9, Issue 4, Ver. III (Jul - Aug. 2014), PP 68-75 A Pipeline for Computer Aided Polyp Detection

More information

Using Ontologies for Data and Semantic Integration

Using Ontologies for Data and Semantic Integration Using Ontologies for Data and Semantic Integration Monica Crubézy Stanford Medical Informatics, Stanford University ~~ November 4, 2003 Ontologies Conceptualize a domain of discourse, an area of expertise

More information

Towards An Era Of Epidemiological Databases For Autoimmune Diseases

Towards An Era Of Epidemiological Databases For Autoimmune Diseases IEEE 12 TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING NOVEMBER 11-13, LARNACA, CYPRUS Towards An Era Of Epidemiological Databases For Autoimmune Diseases Vassiliki Gkantouna 1, Marina

More information

A New GPU-Based Level Set Method for Medical Image Segmentation

A New GPU-Based Level Set Method for Medical Image Segmentation A New GPU-Based Level Set Method for Medical Image Segmentation Wenzhe Xue Research Assistant Radiology Department Mayo Clinic, Scottsdale, AZ Ph.D. Student Biomedical Informatics Arizona State University,

More information

Optimization of the PubMed Automatic Term Mapping

Optimization of the PubMed Automatic Term Mapping 238 Medical Informatics in a United and Healthy Europe K.-P. Adlassnig et al. (Eds.) IOS Press, 2009 2009 European Federation for Medical Informatics. All rights reserved. doi:10.3233/978-1-60750-044-5-238

More information

3D Human Internal Organs Representation Model (Updates)

3D Human Internal Organs Representation Model (Updates) 3D Human Internal Organs Representation Model (Updates) ISO/IEC JTC 1/SC 24/WG 9 & Web3D Meetings January 21-24, 2019 Myeong Won Lee (The University of Suwon) Requirements (1) 3D printing and scanning

More information

ehepqual- HCV Quality of Care Performance Measure Program

ehepqual- HCV Quality of Care Performance Measure Program NEW YORK STATE DEPARTMENT OF HEALTH AIDS INSTITUTE ehepqual- HCV Quality of Care Performance Measure Program USERS GUIDE A GUIDE FOR PRIMARY CARE AND HEPATITIS C CARE PROVIDERS * * For use with ehepqual,

More information

POLICY. Create a governance process to manage requests to extract de- identified data from the Information Exchange (IE).

POLICY. Create a governance process to manage requests to extract de- identified data from the Information Exchange (IE). Academic Health Center Office of Biomedical Health Informatics POLICY Extraction of De- Identifiable Data from the Information Exchange Approved Proposal Purpose Create a governance process to manage requests

More information

A National Metadata Repository for Empirical Research in Germany

A National Metadata Repository for Empirical Research in Germany A National Metadata Repository for Empirical Research in Germany 15 th International Open Forum on Metadata Jürgen Stausberg 1, Johannes Drepper 2, Matthias Löbe 3, Sylvie MN Ngouongo 1, Philippe Verplancke

More information

NeuroLOG WP1 Sharing Data & Metadata

NeuroLOG WP1 Sharing Data & Metadata Software technologies for integration of process and data in medical imaging NeuroLOG WP1 Sharing Data & Metadata Franck MICHEL Paris, May 18 th 2010 NeuroLOG ANR-06-TLOG-024 http://neurolog.polytech.unice.fr

More information

Profiling Medical Journal Articles Using a Gene Ontology Semantic Tagger. Mahmoud El-Haj Paul Rayson Scott Piao Jo Knight

Profiling Medical Journal Articles Using a Gene Ontology Semantic Tagger. Mahmoud El-Haj Paul Rayson Scott Piao Jo Knight Profiling Medical Journal Articles Using a Gene Ontology Semantic Tagger Mahmoud El-Haj Paul Rayson Scott Piao Jo Knight Origin and Outcomes Currently funded through a Wellcome Trust Seed award Collaboration

More information

Designing is the most important phase of software development. It requires

Designing is the most important phase of software development. It requires Chapter 7 System Design Designing is the most important phase of software development. It requires careful planning and thinking on the part of system designer. Designing software means to plan how the

More information

CDISC Journal. Generating a cabig Patient Study Calendar from a Study Design in ODM with Study Design Model Extension. By Jozef Aerts.

CDISC Journal. Generating a cabig Patient Study Calendar from a Study Design in ODM with Study Design Model Extension. By Jozef Aerts. CDISC Journal Clinical Data Interchange Standards Consortium october 2011 Generating a cabig Patient Study Calendar from a Study Design in ODM with Study Design Model Extension By Jozef Aerts Abstract

More information

KEY BASED APPROACH FOR INTEGRATION OF HETEROGENEOUS DATA SOURCES

KEY BASED APPROACH FOR INTEGRATION OF HETEROGENEOUS DATA SOURCES KEY BASED APPROACH FOR INTEGRATION OF HETEROGENEOUS DATA SOURCES 1 KAMSURIAH AHMAD, 1 TENGKU SITI FATIMAH TENGKU WOOK, 2 REDUAN SAMAD 1 Strategic Management Research Group Faculty of Information Science

More information

Microsoft Exam Querying Microsoft SQL Server 2012 Version: 13.0 [ Total Questions: 153 ]

Microsoft Exam Querying Microsoft SQL Server 2012 Version: 13.0 [ Total Questions: 153 ] s@lm@n Microsoft Exam 70-461 Querying Microsoft SQL Server 2012 Version: 13.0 [ Total Questions: 153 ] Question No : 1 CORRECT TEXT Microsoft 70-461 : Practice Test You have a database named Sales that

More information

Tutorial. Identification of Variants Using GATK. Sample to Insight. November 21, 2017

Tutorial. Identification of Variants Using GATK. Sample to Insight. November 21, 2017 Identification of Variants Using GATK November 21, 2017 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com AdvancedGenomicsSupport@qiagen.com

More information

Maximizing Statistical Interactions Part II: Database Issues Provided by: The Biostatistics Collaboration Center (BCC) at Northwestern University

Maximizing Statistical Interactions Part II: Database Issues Provided by: The Biostatistics Collaboration Center (BCC) at Northwestern University Maximizing Statistical Interactions Part II: Database Issues Provided by: The Biostatistics Collaboration Center (BCC) at Northwestern University While your data tables or spreadsheets may look good to

More information