Rapid Application Development using InforSense Open Workflow and Oracle Chemistry Cartridge Technologies

1 Rapid Application Development using InforSense Open Workflow and Oracle Chemistry Cartridge Technologies
Anthony C. Arvanites, Lead Discovery Informatics

2 Company Introduction
Founded: 1999
Platform: Combining chemical and genetic screening to discover drugs
Number of Scientists: 20
Areas of Focus: CNS, parasitic diseases, drug rescue
Business Model: Organic growth (grants & partnership)

3 What are Chemical Data Cartridges?
A chemical data cartridge is an add-on component that extends Oracle so that chemical objects can be handled like other standard data types within an industry-standard SQL framework. It lets you store, index, manipulate and search chemical objects, such as molecules and reactions, directly within Oracle databases.
(Diagram: Oracle Server)

4 Sample SQL Statements
SELECT mol FROM moltable WHERE CsCartridge.MoleculeContains(mol,'Nc1cc(O)c(C(O)=O)cc1','','')=1;
SELECT mol FROM moltable WHERE CsCartridge.MoleculeContains(mol,'Nc1cc(O)c(C(O)=O)cc1','','similar=yes,simthreshold=80')=1;
Query data using structural matching or similarity
Calculate derived properties such as molecular weight and chemical formula
Calculate molecular descriptors or chemical fingerprints
Import and export external representations, such as MDL mol files or SMILES strings
Each chemical cartridge has its own list of features: Daylight DayCart, MDL Direct, CambridgeSoft, Accelrys DS Accord, IDBS ChemXtra
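The two statements above differ only in the options string passed as the last argument. A minimal Python sketch of how a client might assemble such a query is shown here. The helper name `build_molecule_contains_query` and the bind-variable names `:query` and `:options` are hypothetical; only the operator name and the `similar=yes,simthreshold=...` option text come from the slide.

```python
def build_molecule_contains_query(table, column, similar=False, sim_threshold=None):
    """Return (sql, options) for a CsCartridge.MoleculeContains search.

    The SQL shape mirrors the slide's examples; this helper and its
    bind-variable names are illustrative assumptions, not a vendor API.
    """
    # Table and column names cannot be bound, so accept only simple names.
    for name in (table, column):
        if not name.isidentifier():
            raise ValueError(f"unsafe identifier: {name!r}")

    options = ""  # empty options string: plain substructure search
    if similar:
        if sim_threshold is None:
            raise ValueError("sim_threshold is required for a similarity search")
        options = f"similar=yes,simthreshold={int(sim_threshold)}"

    sql = (
        f"SELECT {column} FROM {table} "
        f"WHERE CsCartridge.MoleculeContains({column}, :query, '', :options) = 1"
    )
    return sql, options


sql, options = build_molecule_contains_query(
    "moltable", "mol", similar=True, sim_threshold=80
)
print(sql)
print(options)
```

Passing the query structure and the options as bind values keeps user-supplied SMILES out of the SQL text itself.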

5 InforSense 2.0: Oracle Edition
IOE (InforSense Oracle Edition) accelerates productivity by giving access to Oracle 10g analytics through the intuitive InforSense workflow-based visual analytics environment.
Workflow-based analytics
Real-time in-database analysis
Select, connect and execute data, analysis and visualization components to build analytical processes
Deploy analytical processes as web services/applications
Quick integration of third-party Oracle solutions
Oracle components: Data Processing, Data Mining, Statistics, BLAST, Text

6 InforSense as an Integration Platform
The InforSense IOE platform can serve as an interface that connects the latest database technology, the chemical data cartridge, with the power of Oracle 10g analytics, making it an ideal rapid application development platform.
Existing derive or search & replace components can be used to call chemical cartridge operators.

7 DayCart Component Prototype Using Existing IOE Nodes
operator contains ( smiles1 IN VARCHAR2_OR_CLOB, smiles2 IN VARCHAR2_OR_CLOB ) => NUMBER
operator smi2cansmi ( smiles IN VARCHAR2_OR_CLOB, type IN NUMBER ) => VARCHAR2_OR_CLOB
Type is either 0 or 1, for unique or absolute SMILES
operator tanimoto ( fp_or_smi1 IN VARCHAR2_OR_CLOB, fp_or_smi2 IN VARCHAR2_OR_CLOB ) => NUMBER
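For readers unfamiliar with the similarity measure behind the tanimoto operator, here is a minimal Python sketch of the standard Tanimoto coefficient on bit fingerprints: the number of bits set in both fingerprints divided by the number set in either. It is not the DayCart implementation. Storing fingerprints as Python integers and returning 0.0 for two empty fingerprints are assumptions made for illustration.

```python
def tanimoto(fp1: int, fp2: int) -> float:
    """Tanimoto coefficient of two bit fingerprints held as Python ints."""
    common = bin(fp1 & fp2).count("1")  # bits set in both fingerprints
    total = bin(fp1 | fp2).count("1")   # bits set in at least one fingerprint
    # Two empty fingerprints have no defined ratio; 0.0 is a chosen convention.
    return common / total if total else 0.0


fp_a = 0b101101  # 4 bits set
fp_b = 0b100111  # 4 bits set, 3 shared with fp_a
print(tanimoto(fp_a, fp_b))
```

Here 3 bits are shared and 5 bits are set in at least one fingerprint, so the coefficient is 3/5.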

8 CambridgeSoft's Oracle Cartridge Based Structure to Name or Name to Structure Conversion
CSCARTRIDGE.CONVERTCDX.CDXTONAME (STRUCTURE)
CSCARTRIDGE.MOLECULECONTAINS (STRUCTURE, Query,'','')
Expression Editor built into the Oracle Derive node
Can be used when analyzing unstructured data (text mining) with Oracle Text or InforSense's TextSense

9 Prototype of Reusable DayCart Service Nodes
Drag, drop and connect to build workflows
Reusable service nodes
Quickly integrate new cartridge functionality (e.g. DayCart v4.9's conversion toolkit)
Build and deploy custom solutions in less than a day
Components can be programmed using Java

10 Examples of the Daylight DayCart Components
Daylight contrib code: MOL2SMI and SMI2MOL
vcs_desalt and vcs_normalize operators: remove molecular fragments found in c$dcischem.salts; SMIRKS-based structure normalization of input SMILES based on c$dcischem.transform
Access Dayprop via Dayproptalk from DayCart
exact, graph, blob and role index creation
Substructure match and filter operators: exact, contains
Similarity operators: tanimoto, euclid, Tversky
Generate Daylight fingerprints operator: smi2fp (user able to define min, max and number of bits)
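The desalting step listed above, dropping fragments that appear in a salts table, can be illustrated with a deliberately naive Python sketch. The `SALT_FRAGMENTS` set is a hypothetical stand-in for the contents of c$dcischem.salts. The sketch compares fragment strings literally, so it assumes the input fragments and the salt entries are written identically (for example, already canonicalized). A real implementation would work on parsed molecules rather than text.

```python
# Hypothetical stand-in for the rows of the salts table.
SALT_FRAGMENTS = {"[Na+]", "[K+]", "[Cl-]", "Cl"}


def strip_salts(smiles: str, salts=SALT_FRAGMENTS) -> str:
    """Drop dot-separated SMILES fragments that exactly match a known salt."""
    fragments = smiles.split(".")  # '.' separates disconnected components
    kept = [frag for frag in fragments if frag not in salts]
    return ".".join(kept)


print(strip_salts("CC(=O)[O-].[Na+]"))  # sodium acetate with the counter-ion removed
```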

11 Chemical Warehouse / Modeling Application
Chemical Warehouse
Capability of integrating with other in-house chemical databases (e.g. compound repository, subscription databases, virtual collections)
Easily create / modify using a visual programming environment
Capability of performing external or in-database data-mining analytics
Support chemical registrations
Capability of integrating with other databases (e.g. BioAssay, ActivityBase)
Modeling
Predictive ADME models
Deployment of application to benefit bench scientists
Oracle Data Mining Support Vector Machine (SVM) model
Decision Tree classification models
Blood Brain Barrier Permeation (1670 compounds classified BBB+ or BBB-)
P-glycoprotein (PgP), Human Intestinal Absorption (HIA), Torsades de Pointes (TdP)

12 Data Entry and Processing of Commercial Databases using DayCart
Creating a compound warehouse:
Collect individual database files from compound vendors
Data entry and processing using an Oracle-based chemical cartridge & IOE
Generate indexes using components or SQL*Plus
Deploy via a web portal

13 Creating a chemical warehouse via in-database processing
Six cleaned commercial compound vendor databases are being preprocessed to union matching data columns (e.g. structure, CAS number, name, availability), creating a master chemical data warehouse within Oracle. DayCart structure indexing (exact, role, graph, blob) was performed to enable fast structure searching on over 1 million compounds.
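The union of matching columns described above can be sketched in a few lines of Python. An in-memory SQLite database stands in for Oracle here, and every table name, column name and row (`vendor_a`, `vendor_b`, `master_compounds`) is hypothetical. The point is only that differently named vendor columns are mapped onto one shared layout with UNION ALL, inside the database.

```python
import sqlite3

# In-memory SQLite is only a stand-in for the Oracle warehouse.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Two vendor tables with different names for the same kinds of column.
    CREATE TABLE vendor_a (smiles TEXT, cas_no TEXT, name TEXT, availability TEXT);
    CREATE TABLE vendor_b (structure TEXT, cas TEXT, compound_name TEXT, in_stock TEXT);
    INSERT INTO vendor_a VALUES ('CCO', '64-17-5', 'ethanol', 'yes');
    INSERT INTO vendor_b VALUES ('c1ccccc1', '71-43-2', 'benzene', 'yes');
    INSERT INTO vendor_b VALUES ('CC(C)=O', '67-64-1', 'acetone', 'no');

    -- Map each vendor's columns onto one master layout and tag the source.
    CREATE TABLE master_compounds AS
        SELECT 'vendor_a' AS source, smiles, cas_no, name, availability FROM vendor_a
        UNION ALL
        SELECT 'vendor_b', structure, cas, compound_name, in_stock FROM vendor_b;
""")

rows = conn.execute(
    "SELECT source, smiles, cas_no FROM master_compounds ORDER BY cas_no"
).fetchall()
for row in rows:
    print(row)
```

Keeping a source column makes it possible to trace each master row back to the vendor file it came from.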

14 MDL & Daylight Similarity Searching
Workflow query results
Execute chemical searches using different chemical cartridges in one reusable and deployable workflow
Link results to tables or views that access compound plate or biological data

15 Oracle Data Mining Blood Brain Barrier Model
Launch Oracle Data Miner from within KDE
The Blood Brain Barrier dataset in SD format is converted into Daylight SMILES via contrib code and imported into an Oracle table. The dataset is joined with a repository of molecular descriptors and used to produce an SVM model, all within the Oracle database. Published evaluation metrics are used to assess the performance of the model.
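The slide does not say which evaluation metrics were used. As an assumption, the Python sketch below computes three that are commonly reported for two-class models such as BBB+ / BBB-: sensitivity, specificity and overall accuracy. The function name and the example labels are illustrative only and are not results from the presentation.

```python
def classification_metrics(actual, predicted, positive="BBB+"):
    """Sensitivity, specificity and accuracy for a two-class prediction."""
    tp = fp = tn = fn = 0
    for truth, guess in zip(actual, predicted):
        if truth == positive:
            if guess == positive:
                tp += 1  # positive compound predicted positive
            else:
                fn += 1  # positive compound missed
        else:
            if guess == positive:
                fp += 1  # negative compound predicted positive
            else:
                tn += 1  # negative compound predicted negative
    total = tp + fp + tn + fn
    return {
        "sensitivity": tp / (tp + fn) if (tp + fn) else 0.0,
        "specificity": tn / (tn + fp) if (tn + fp) else 0.0,
        "accuracy": (tp + tn) / total if total else 0.0,
    }


# Made-up labels purely to exercise the function.
actual = ["BBB+", "BBB+", "BBB+", "BBB-", "BBB-", "BBB-", "BBB-", "BBB+"]
predicted = ["BBB+", "BBB+", "BBB-", "BBB-", "BBB-", "BBB+", "BBB-", "BBB+"]
print(classification_metrics(actual, predicted))
```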

16 Predictive ADME solution using Oracle Data Mining
A wrapped reusable service node
Blood Brain Barrier (Oracle SVM)
P-glycoprotein (KDE Decision Tree)
Human Intestinal Absorption (Weka Decision Tree)
Torsades de Pointes (libsvm)
Effect of Molecular Descriptor Feature Selection in Support Vector Machine Classification of Pharmacokinetic and Toxicological Properties of Chemical Agents. J. Chem. Inf. Comput. Sci., Vol. 44 (2004).
Prediction of P-Glycoprotein Substrates by a Support Vector Machine Approach. J. Chem. Inf. Comput. Sci., Vol. 44, No. 4 (2004).
Prediction of Human Intestinal Absorption of Drug Compounds from Molecular Structure. J. Chem. Inf. Comput. Sci., Vol. 38 (1998).
Blood-Brain Barrier Permeation Models: Discrimination between Potential CNS and Non-CNS Drugs Including P-Glycoprotein Substrates. J. Chem. Inf. Comput. Sci., Vol. 44, No. 1 (2004).

17 Deploying an application within InforSense

18 A Deployed Solution for Internal R&D

19 Questions?
InforSense's workflow technology will be used as a rapid application development platform within Cambria Biosciences so that custom solutions can be deployed to staff scientists.


End-to-End data mining feature integration, transformation and selection with Datameer Datameer, Inc. All rights reserved. End-to-End data mining feature integration, transformation and selection with Datameer Fastest time to Insights Rapid Data Integration Zero coding data integration Wizard-led data integration & No ETL

More information

Enabling Science. IDBS Products HARDWARE REQUIREMENTS. ID Business Solutions

Enabling Science. IDBS Products HARDWARE REQUIREMENTS. ID Business Solutions Enabling Science IDBS Products HARDWARE REQUIREMENTS ID Business Solutions www.idbs.com IDBS 2016 Information in this document is subject to change without notice. The software described in this document

More information

PYRAMID Headline Features. April 2018 Release

PYRAMID Headline Features. April 2018 Release PYRAMID 2018.03 April 2018 Release The April release of Pyramid brings a big list of over 40 new features and functional upgrades, designed to make Pyramid s OS the leading solution for customers wishing

More information

Knowledge Discovery in Data Bases

Knowledge Discovery in Data Bases Knowledge Discovery in Data Bases Chien-Chung Chan Department of CS University of Akron Akron, OH 44325-4003 2/24/99 1 Why KDD? We are drowning in information, but starving for knowledge John Naisbett

More information

The Data Mining usage in Production System Management

The Data Mining usage in Production System Management The Data Mining usage in Production System Management Pavel Vazan, Pavol Tanuska, Michal Kebisek Abstract The paper gives the pilot results of the project that is oriented on the use of data mining techniques

More information

Call: Hyperion Planning Course Content:35-40hours Course Outline Planning Overview

Call: Hyperion Planning Course Content:35-40hours Course Outline Planning Overview Hyperion Planning Course Content:35-40hours Course Outline Planning Overview Oracle's Enterprise Performance Management Planning Architecture Planning and Essbase Navigating Workspace Launching Workspace

More information

Giving Your Headings Meaningful Names (Desktop and Plus) p. 158 Rearranging the Order of the Output p. 160 Formatting Data p. 163 Formatting Columns

Giving Your Headings Meaningful Names (Desktop and Plus) p. 158 Rearranging the Order of the Output p. 160 Formatting Data p. 163 Formatting Columns Acknowledgments p. xxi Introduction p. xxiii Getting Started with Discoverer An Overview of Discoverer p. 3 Business Intelligence and Your Organization p. 4 Business Intelligence and Trends p. 5 Discoverer's

More information

From Corkscrew to Swiss Army Knife

From Corkscrew to Swiss Army Knife From Corkscrew to Swiss Army Knife The evolving role of KNIME at DNS Tim Parrott & Brock Luty September 16, 2016 Dart NeuroScience (DNS) To create drugs that enhance Long Term Memory (LTM) in Humans Chemical

More information

As a reference, please find a version of the Machine Learning Process described in the diagram below.

As a reference, please find a version of the Machine Learning Process described in the diagram below. PREDICTION OVERVIEW In this experiment, two of the Project PEACH datasets will be used to predict the reaction of a user to atmospheric factors. This experiment represents the first iteration of the Machine

More information

How to Create a Reaction Answer Set

How to Create a Reaction Answer Set How to Create a Reaction Answer Set Find all relevant reactions based on criteria you specify Search the world s largest, publicly available source of reactions and quickly find highly relevant results,

More information

Module 1.Introduction to Business Objects. Vasundhara Sector 14-A, Plot No , Near Vaishali Metro Station,Ghaziabad

Module 1.Introduction to Business Objects. Vasundhara Sector 14-A, Plot No , Near Vaishali Metro Station,Ghaziabad Module 1.Introduction to Business Objects New features in SAP BO BI 4.0. Data Warehousing Architecture. Business Objects Architecture. SAP BO Data Modelling SAP BO ER Modelling SAP BO Dimensional Modelling

More information

Introducing SAS Model Manager 15.1 for SAS Viya

Introducing SAS Model Manager 15.1 for SAS Viya ABSTRACT Paper SAS2284-2018 Introducing SAS Model Manager 15.1 for SAS Viya Glenn Clingroth, Robert Chu, Steve Sparano, David Duling SAS Institute Inc. SAS Model Manager has been a popular product since

More information

From Visual Data Exploration and Analysis to Scientific Conclusions

From Visual Data Exploration and Analysis to Scientific Conclusions From Visual Data Exploration and Analysis to Scientific Conclusions Alexandra Vamvakidou, PhD September 15th, 2016 HUMAN HEALTH ENVIRONMENTAL HEALTH 2014 PerkinElmer The Power of a Visual Data We Collect

More information

Introduction to Federation Server

Introduction to Federation Server Introduction to Federation Server Alex Lee IBM Information Integration Solutions Manager of Technical Presales Asia Pacific 2006 IBM Corporation WebSphere Federation Server Federation overview Tooling

More information

Data Management Glossary

Data Management Glossary Data Management Glossary A Access path: The route through a system by which data is found, accessed and retrieved Agile methodology: An approach to software development which takes incremental, iterative

More information

Mapping the materials genome:

Mapping the materials genome: Brown University: March 29 th 2012 Mapping the materials genome: transforming data to knowledge for enabling Materials-by-Design Krishna Rajan Wilkinson Professor of Interdisciplinary Engineering Iowa

More information

Quick Reference Guide

Quick Reference Guide Quick Reference Guide Table of Contents Homepage My Settings Generate a Structure from a Name Reactions Query tab Query tab Add further Search Conditions Results General Overview 7 Results Reactions tab

More information

Getting Started with SciFinder 2007

Getting Started with SciFinder 2007 Getting Started with SciFinder 2007 for Windows November 2006 Copyright 2006 American Chemical Society. All Rights Reserved. SciFinder is a registered trademark of the American Chemical Society. Getting

More information

pka-prospector Release OpenEye Scientific Software, Inc.

pka-prospector Release OpenEye Scientific Software, Inc. pka-prospector Release 1.0.0.3 OpenEye Scientific Software, Inc. September 25, 2013 CONTENTS 1 Front Matter 1 2 Installation 3 2.1 Linux................................................... 3 2.2 Windows.................................................

More information

Week 1 Unit 1: Introduction to Data Science

Week 1 Unit 1: Introduction to Data Science Week 1 Unit 1: Introduction to Data Science The next 6 weeks What to expect in the next 6 weeks? 2 Curriculum flow (weeks 1-3) Business & Data Understanding 1 2 3 Data Preparation Modeling (1) Introduction

More information

Developing Applications with Business Intelligence Beans and Oracle9i JDeveloper: Our Experience. IOUG 2003 Paper 406

Developing Applications with Business Intelligence Beans and Oracle9i JDeveloper: Our Experience. IOUG 2003 Paper 406 Developing Applications with Business Intelligence Beans and Oracle9i JDeveloper: Our Experience IOUG 2003 Paper 406 Chris Claterbos claterbos@vlamis.com Vlamis Software Solutions, Inc. (816) 781-2880

More information

Bioqueries: A Social Community Sharing Experiences while Querying Biological Linked Data (

Bioqueries: A Social Community Sharing Experiences while Querying Biological Linked Data ( Bioqueries: A Social Community Sharing Experiences while Querying Biological Linked Data (http://bioqueries.uma.es) María Jesús García-Godoy, Ismael Navas-Delgado, José Francisco Aldana Montes Computing

More information

1 Topic. Image classification using Knime.

1 Topic. Image classification using Knime. 1 Topic Image classification using Knime. The aim of image mining is to extract valuable knowledge from image data. In the context of supervised image classification, we want to assign automatically a

More information

Enterprise Data Catalog for Microsoft Azure Tutorial

Enterprise Data Catalog for Microsoft Azure Tutorial Enterprise Data Catalog for Microsoft Azure Tutorial VERSION 10.2 JANUARY 2018 Page 1 of 45 Contents Tutorial Objectives... 4 Enterprise Data Catalog Overview... 5 Overview... 5 Objectives... 5 Enterprise

More information

enanomapper database, search tools and templates Nina Jeliazkova, Nikolay Kochev IdeaConsult Ltd. Sofia, Bulgaria

enanomapper database, search tools and templates Nina Jeliazkova, Nikolay Kochev IdeaConsult Ltd. Sofia, Bulgaria enanomapper database, search tools and templates Nina Jeliazkova, Nikolay Kochev IdeaConsult Ltd. Sofia, Bulgaria www.ideaconsult.net Ø enanomapper database: data model, technology; NANoREG data transfer

More information

Suggested Experience Required Exams Recommended Teradata Courses. TE Teradata 12 Basics

Suggested Experience Required Exams Recommended Teradata Courses. TE Teradata 12 Basics Exam Objectives Teradata 12 Certification Track Use the convenient matrix as a reference to Teradata 12 Certification exam objectives and requirements. A suggested range of experience and recommended Teradata

More information

Data Mining. Introduction. Piotr Paszek. (Piotr Paszek) Data Mining DM KDD 1 / 44

Data Mining. Introduction. Piotr Paszek. (Piotr Paszek) Data Mining DM KDD 1 / 44 Data Mining Piotr Paszek piotr.paszek@us.edu.pl Introduction (Piotr Paszek) Data Mining DM KDD 1 / 44 Plan of the lecture 1 Data Mining (DM) 2 Knowledge Discovery in Databases (KDD) 3 CRISP-DM 4 DM software

More information

ChemInformatics in SharePoint: A Big Pharma Perspective

ChemInformatics in SharePoint: A Big Pharma Perspective ChemInformatics in SharePoint: A Big Pharma Perspective Location: ChemAxon UGM, Budapest, Hungary Date: 19 th May 2010 Presenter: Luke Bullard Pfizer Internal Use What s all the fuss about??? The answer

More information