D360: Unlock the value of your scientific data Solving Informatics Problems for Translational Research

Similar documents
Building innovative drug discovery alliances. Migrating to ChemAxon

Integration in the 21 st -Century Enterprise. Thomas Blackadar American Chemical Society Meeting New York, September 10, 2003

Life Sciences Oracle Based Solutions. June 2004

From Visual Data Exploration and Analysis to Scientific Conclusions

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E

Rapid Application Development using InforSense Open Workflow and Oracle Chemistry Cartridge Technologies

The Information Platform of the Future. MarkLogic and Smartlogic

XML in the bipharmaceutical

Building innovative drug discovery alliances. Knime Desktop tools for chemists

Turning Data Science into a reality with TIBCO Spotfire

Louis J Culot Vice President, Enterprise Applications & New Business CambridgeSoft Corporation

Discovery Net : A UK e-science Pilot Project for Grid-based Knowledge Discovery Services. Patrick Wendel Imperial College, London

Unstructured Text in Big Data The Elephant in the Room

JChem Extensions for KNIME KNIME.com products

Technical Computing with MATLAB

Supporting Customer Growth Strategies by Anticipating Market Change End-to-end Optimization of Value Chains

[ PARADIGM SCIENTIFIC SEARCH ] A POWERFUL SOLUTION for Enterprise-Wide Scientific Information Access

a measurable difference

Chem3D Ultra 10.0 Ultimate Modeling, Visualization & Analysis Suite. Inventory Ultra 10.0 Ultimate Chemical Materials Managemend Suite

Cost effective Cheminformatics for Small Chemistry Teams Integrated Within Larger Discovery Groups

Morphit 5. Edge User Group Meeting 2017 Ted Hawkins PhD VP Product Delivery

A chemistry friendly system integrating drug design tools and a consistent visual interface Xin Zhang, Christian Baber

Big Data in Translational Science

Transitioning to Symyx

ChemInformatics in SharePoint: A Big Pharma Perspective

Transforming the Economics of

An early challenge that LabKey users encounter is what is the best way to get their data into the server and what are the appropriate data types to

Utilizing Container Technology to Streamline Data Science

Modernizing Healthcare IT for the Data-driven Cognitive Era Storage and Software-Defined Infrastructure

Semantic Technologies in a Chemical Context

Data Immersion : Providing Integrated Data to Infinity Scientists. Kevin Gilpin Principal Engineer Infinity Pharmaceuticals October 19, 2004

ISO INTERNATIONAL STANDARD. Health informatics Genomic Sequence Variation Markup Language (GSVML)

Build Scientific Computing Infrastructure with Rebar3 and Docker. Eric Sage

Composite Data Virtualization Maximizing Value from Enterprise Data Warehouse Investments

Cost-Benefit Analysis of Retrospective vs. Prospective Data Standardization

Enhanced retrieval using semantic technologies:

Data systems supporting chemical informatics and small molecule discovery for crop protection research.

Composite Software Data Virtualization The Five Most Popular Uses of Data Virtualization

Where do these data come from? What technologies do they use?? Whatever they use, they need models (schemas, metadata, )

Sunil G. Singh Bill Caulkins. DBMS Consulting October 2009 Life Sciences HUB Focus Group Session 24

Third generation of Data Virtualization

Optimizing Your Analytics Life Cycle with SAS & Teradata. Rick Lower

Moving away from ISIS software with ChemAxon's Help. Debra Kassabian Novartis Institutes for Biomedical Research (NIBR) IT

AVOIDING SILOED DATA AND SILOED DATA MANAGEMENT

<Insert Picture Here> Managing Oracle Exadata Database Machine with Oracle Enterprise Manager 11g

<Insert Picture Here> Enterprise Data Management using Grid Technology

One Search Many Answers

An Introduction to Analysis (and Repository) Databases (ARDs)

EPHAR Certification. EUROPEAN CERTIFIED PHARMACOLOGIST (EuCP) Guidelines for Certification

OnCommand Insight 7.1 Planning Guide

Fast Innovation requires Fast IT

How Managers and Executives Can Leverage SAS Enterprise Guide

Use of Semantic Technologies at Eli Lilly and Company. J Phil Brooks Information Consultant, SE Data Team Discover IT Eli Lilly and Company

Ecosystems Research & Environmental Assessment Biologist Department Division/Region Community Location Environment Wildlife Igloolik Nunavut

SAS 9.4 Intelligence Platform: Overview, Second Edition

GE Healthcare. Visualize Analyze. Realize. IN Cell Miner HCM Data management for high-content analysis and screening

Project II. argument/reasoning based on the dataset)

Power of the Portfolio. Copyright 2012 EMC Corporation. All rights reserved.

E-Notebook User Guide

The Cambridge Bio-Medical-Cloud An OpenStack platform for medical analytics and biomedical research

Setting up a CIDOC CRM Adoption and Use Strategy CIDOC CRM: Success Stories, Challenges and New Perspective

The Emerging Data Lake IT Strategy

Customer Success Story Los Alamos National Laboratory

Comprehensive Capabilities Comparison

The Value of Data Modeling for the Data-Driven Enterprise

Accelerate Your Enterprise Private Cloud Initiative

Manage your environmental monitoring data with power, depth and ease

How a Metadata Repository enables dynamism and automation in SDTM-like dataset generation

Cray's YarcData claims early success for graph database appliance

Introduction to the Active Everywhere Database

The Why, What, and How of Cisco Tetration

Key. General. Host Account Functionality. Included, no-cost Not Included Add-on/Low cost $ Add-on/High cost $$$

Information retrieval (IR)

Contractual Approaches to Data Protection in Clinical Research Projects

Directorate of Open and Distance Learning. Information on Courses, Eligibility and Fee Structure

Project Name. The Eclipse Integrated Computational Environment. Jay Jay Billings, ORNL Parent Project. None selected yet.

Data Mining Technologies for Bioinformatics Sequences

Security analytics: From data to action Visual and analytical approaches to detecting modern adversaries

Oracle Enterprise Manager 12c Sybase ASE Database Plug-in

Composite Data Virtualization Eight Ways Composite Data Virtualization Adds Value to Enterprise Data Warehousing

Insid&Out: how to save Medicinal chemists from data-related frustration

Cypress Adopts Questa Formal Apps to Create Pristine IP

Enabling Open Science: Data Discoverability, Access and Use. Jo McEntyre Head of Literature Services

Revolutionary mobile health technology

Bringing Connected Diagnostics to Scale

Know Your Customer. c360 Microsoft Dynamics CRM 4.0 Product Catalog

HOW TO ENABLE AFFORDABLE ENTERPRISE VIDEO FOR EVERYONE

Building a Data Strategy for a Digital World

Data Governance: Are Governance Models Keeping Up?

The #1 Key to Removing the Chaos. in Modern Analytical Environments

REACH-IT Stakeholder Workshop. REACH-IT Architecture

Cisco Industrial Network Director

WEBMETHODS AGILITY FOR THE DIGITAL ENTERPRISE WEBMETHODS. What you can expect from webmethods

Introduction to Data Science

Spotfire Data Science with Hadoop Using Spotfire Data Science to Operationalize Data Science in the Age of Big Data

Oracle Enterprise Manager 12c IBM DB2 Database Plug-in

New York ISO - Dynamic Data Response. Copyr i ght 2014 O SIs oft, LLC.

Geospatial Enterprise Search. June

Big Data Specialized Studies

McAfee Total Protection for Data Loss Prevention

Transcription:

D360: Unlock the value of your scientific data Solving Informatics Problems for Translational Research Dr. Fabian Bös, Senior Application Scientist Certara Spain SL Martin-Kollar-Str. 17, 81829 Munich www.certara.com/d360

Certara is the amalgamation of 3 companies covering different areas of drug discovery & development Tripos - Discovery Simcyp Preclinical Pharsight Clinical Building product portfolio through acquisition and organic growth to address scientific data needs for Translational approaches to life science research. 3 key areas of interest In Silico modeling and prediction across discovery/development Improved prediction of biological events from the molecular to the whole organism level Leverage of technologies from formerly disparate areas Data access within and between expertise silos Improved access to and thus use of data for all scientists throughout the process Providing an ability to generate answers to a very wide range of scientific questions Integration of data access with modeling Model creation for specific discovery and development end points Model deployment throughout the discovery and development process 2

Software Portfolio 3

Certara Consulting 4

Big Data, Complex Data, Many Sources More disciplines are generating more data than ever Combinatorial chemistry High throughput screening High content screening Proteomics Genomics The data is inherently complex Context a value only has meaning when you know how it was obtained Relationships 11010100101001010010101001010101001010111 010101101100100000101001010101111101010010 10001010010101010100101101000100100010001 00010010010010010101011001010101010101000 01010010010100100100010101001010101001010 10010001000101001010010010100101001010010 10101001010101011100010100010100010001001 00101001001010010010001001111010100101000 meaning can only be derived when the right relationships between data are in place Data is stored in many sources Scientists often manually construct their datasets after using different applications Systems built to help scientists are custom to address a single bottleneck 5

Hard Coded Data Access Chemical data access Safety Data Access Research reporting Chemical Inventory Scientific literature search Discovery information portal Structure visualization Data manipulation & analysis Data analysis In Vitro data access Electronic notebook Research communication 6

A Data Network In Vitro Results Observations Patients Projects Batches Compounds Clinical Studies Treatments Assays Preclinical Studies Animals Treatments Observations 7

A Data Access Problem Discovery Discovery data available in multiple data sources Data is accessible, through an internally built application but Application was last updated ~2 years ago Requires a team of FTEs to support (just the application) Supports general user workflows for accessing project data in standard views But User requests for enhancements not satisfied for some time Many important data sources not integrated In vivo data Licensed commercial databases Very few analysis tools or integrations with other applications complex workflow 8

The Discovery Data Network in D360 Data Network for discovery data Data from chemistry and biology databases cast in terms of entities of interest Compounds Targets User can choose the data level Compound, Salt, Batch/Lot, Project, Assay Na + Salts Projects Assays Supports a wide variety of workflows: Standard Project data views Exploratory SAR data mining Assay QA/QC Batches Results Initial system deployed in ~ 4 weeks Currently in production at top 10 pharma I can get datasets and perform analyses that were difficult of not impossible before 9

A Data Access Problem Preclinical Safety Safety Study Data entered into a commercial system Data is accessible, but Only at a study level, one study at a time Easy to get all data for a project Monitoring and Study Report Study ABC-123 Study Groups Animals Very difficult for Pathologists to understand data across studies e.g. Historical control group behavior Can take days/weeks to answer project team questions on current study findings Can take days/weeks to answer regulatory body questions The situation gets worse when additional data is brought into the picture TK data is stored in different LIMS system Assay Data is stored in a different system Findings & Measurements 10

The Safety Data Network in D360 Data Network for preclinical safety Data from Safety database cast in terms of entities of interest User can choose the data level Study, Study Group, Animal, Observation, treatment Supports a wide variety of workflows: Cross study control group analysis Study Design from historical data Logistical Study monitoring System deployed in ~ 2 weeks Currently in production at top 10 pharma Cross study analysis of control groups can now be performed in minutes rather than weeks 11

Expanding this Data Network Additional Data Entities Chemical Structures TK parameters Discovery Assay Data From Additional Data sources Chemical Structure Cartridge LIMS system Assay database Adds additional workflows: Improved preclinical study design Structure-Safety relationships Exposure-Safety relationships Clinical finding Assay data relationships Current status Configuring TK data 12

What does the user of D360 see? Any user of the system can build a query Without knowledge of specific data location/format Without knowledge of Oracle, web services and other related technologies System is configured to show scientific entities of interest Presents data in the user s context 13

User Adoption The Google Effect: Everyone is used to Google very simple to use Not Every user wants to build a query D360 allows search capabilities to be Googleised Users who build data searches in D360 can widgetise them Leverages the intellectual assets of the team 70 14

Recent D360 Partnerships - Building the ecosystem Connectivity with the Cortellis web based data sources. Connectivity with MarvinSketch and JChem Cartridge. Connectivity with pre-clinical safety data from Pristima or customized data marts. Competitive intelligence decision support. 15

Summary Provides access to scientific data Accesses scientific data from structured storage Single databases, Multiple databases, non-database sources (web services) D360 transforms scientific data (mere access is not enough) Transforms from a complex structure into something a user actually finds useful Presents scientific data So a user can build a query themselves Spreadsheets, forms, grids, graphs and charts, correlation matrices, formatting, statistics, equations, etc. Whatever is most appropriate to the user s workflow Help the user derive meaning Shares scientific results and supports decision making Workspaces, annotations, notifications Integration with productivity tools and existing corporate systems 16