RDF Workshop. Building an RDF representation of the the ChEMBL Database. Mark Davies. ChEMBL Group, Technical Lead 30/04/2014
|
|
- Martha Ford
- 6 years ago
- Views:
Transcription
1 RDF Workshop Building an RDF representation of the the ChEMBL Database Mark Davies ChEMBL Group, Technical Lead 30/04/2014
2 Overview Brief introduction to ChEMBL database Approaches to mapping relational data to RDF data model based on ChEMBL experience New features in ChEMBL RDF (version 18) Future ChEMBL plans
3 What is ChEMBL? Open access database for drug discovery Freely available (searchable and downloadable) Content: Bioactivity data manually extracted from the primary medicinal chemistry literature from journals such as J. Med. Chem. Subset of data from PubChem Deposited data e.g. neglected disease screening, GSK kinase set Bioactivity data is associated with a biological target and a chemical structure Compounds are stored in a structure searchable format Protein targets are linked to protein sequences in UniProt Updated regularly with new data Secure searching ( )
4 ChEMBL ChEMBL 18 Release 1,359,508 compounds 12,419,715 activities 1,042,374 assays 9,414 targets 53,298 documents 19 bioactivity sources 6 compound-only sources
5 What does ChEMBL data look like? Compound Activity Assay Target Ref
6 How can I access ChEMBL data? Web Services Widgets Website Downloads Virtual Machine (mychembl) Semantic Web
7 ChEMBL + Semantic Web The creation of the RDF version of ChEMBL is funded by the Open PHACTS project - Migrate the ChEMBL relational data model to RDF based data model triplify everything RDF generation is part of official ChEMBL release process Identify and use ontologies important in the field of bioactivity data ChEMBL RDF to be made available through EBI RDF Platform
8 Goals of the ChEMBL RDF conversion Responding to the demands of the community Academic and more recently industry Semantic data conversion and querying Reasoning/inferencing - providing a starting point for the community Ensure the conversion is part of the ChEMBL release cycle ChEMBL data model is still evolving so almost impossible for external efforts to keep up to speed with changes
9 ChEMBL RDF Compound Bioactivity Assay Target Ref ftp://ftp.ebi.ac.uk/pub/databases/chembl/chembl-rdf/
10 Conversion process ChEMBL Relational Schema ChEMBL RDF Schema
11 Migrating a relational data model Where to start? What is your goal? How often will it be updated? Who is the audience? What tools to use? Do you write you own ontology? Which ontologies to use? Which format to use? Will it be made available to the public? Two approaches were used to convert the ChEMBL relational model to an RDF based model Approach 1: Semi-automated using the D2RQ Platform Approach 2: Manual model building
12 Approach 1: Semi-automated using the D2RQ Platform
13 D2RQ platform overview Query a non-rdf database using SPARQL Access the content of the database as Linked Data over the Web Create custom dumps of the database in RDF formats for loading into an RDF store Access information in a non- RDF database using the Apache Jena API
14 ChEMBL relational schema 3 core domains Compound Activity Target 52 tables (52 primary keys J ) 341 columns 4 data types (40 if length, scale and precision included) Many indexes, constraints, triggers..
15 D2RQ: Prerequisites Download the software from Java 1.5 or higher Oracle users will need to download database driver
16 D2RQ: Example usage Aims Create RDF representation of your relational data model Run and test SPARQL queries against your database Online data access and representation Process Generate D2R Mapping File Start up D2R Server using D2R Mapping File Refine model Support databases Oracle, SQL Server, PostgreSQL, MySQL, HSQLDB,
17 D2RQ: Mapping File Creation Command used to generate D2R Mapping File (you just need a database connection string): $>./generate-mapping -o example_d2r_mapping.ttl \! -u <USER> \! -p <PASSWORD> \! -d oracle.jdbc.driver.oracledriver jdbc:oracle:thin:@<server>:<port>:<database>! Command above will inspect database and create RDF based definitions of tables and relationships between tables Possible to skip/restrict schemas/tables/columns with additional argument useful for Oracle
18 D2RQ: Mapping file map: db: vocab: rdf: rdfs: xsd: d2rq: jdbc: < map:database a d2rq:database;! d2rq:jdbcdriver "oracle.jdbc.driver.oracledriver";! d2rq:jdbcdsn "jdbc:oracle:thin:@<server>:<port>:<database>";! d2rq:username "<USER>";! d2rq:password "<PASSWORD>";!.!! # Table CHEMBL_18.ACTION_TYPE! map:chembl_18_action_type a d2rq:classmap;! d2rq:datastorage map:database;! d2rq:uripattern "CHEMBL_18/ACTION_TYPE/@@CHEMBL_18.ACTION_TYPE.ACTION_TYPE urlify@@";! d2rq:class vocab:chembl_18_action_type;! d2rq:classdefinitionlabel "CHEMBL_18.ACTION_TYPE";!.! map:chembl_18_action_type label a d2rq:propertybridge;! d2rq:belongstoclassmap map:chembl_18_action_type;! d2rq:property rdfs:label;! d2rq:pattern "ACTION_TYPE #@@CHEMBL_18.ACTION_TYPE.ACTION_TYPE@@";!.! map:chembl_18_action_type_action_type a d2rq:propertybridge;! d2rq:belongstoclassmap map:chembl_18_action_type;! d2rq:property vocab:chembl_18_action_type_action_type;! d2rq:propertydefinitionlabel "ACTION_TYPE ACTION_TYPE";! d2rq:column "CHEMBL_18.ACTION_TYPE.ACTION_TYPE";!.! map:chembl_18_action_type_description a d2rq:propertybridge;! d2rq:belongstoclassmap map:chembl_18_action_type;! d2rq:property vocab:chembl_18_action_type_description;! d2rq:propertydefinitionlabel "ACTION_TYPE DESCRIPTION";! d2rq:column "CHEMBL_18.ACTION_TYPE.DESCRIPTION";!.! map:chembl_18_action_type_parent_type a d2rq:propertybridge;! d2rq:belongstoclassmap map:chembl_18_action_type;! d2rq:property vocab:chembl_18_action_type_parent_type;! d2rq:propertydefinitionlabel "ACTION_TYPE PARENT_TYPE";! d2rq:column "CHEMBL_18.ACTION_TYPE.PARENT_TYPE";!.!
19 D2R Server SPARQL endpoint and explorer Browsing database contents Resolvable URIs Content negotiation Downloading contents of BLOBs/CLOBs Serving the vocabulary Publishing metadata Command used to start D2R server (assuming you have generated mapping file): $>./d2r-server example_d2r_mapping.ttl!
20 D2R Server
21 D2R Server
22 (Quick) D2RQ Demo
23 D2RQ: Data modeling It is possible to model data and also create more meaningful class and property names Approach 1 Edit the mapping file using advanced features of the D2RQ query language: Approach 2 Create mapping file based on restricted set of database objects e.g. users, schemas, views, materialised views modeling within the database
24 D2RQ: Optimisations Review D2R server deployment Increase D2R server heap space Review configurations settings, such as page sizes, resultset limits Use latest built-in D2RQ optimisations, by specifying d2rq:usealloptimizations (or --fast flag on server startup) Use D2RQ s dump-rdf command to export RDF representation of database Exported RDF can then be imported into a triplestore, e.g. Virtuoso
25 D2RQ: Limitations General limitations Integration of multiple databases not possible achieved within the database Read only access update extension is available Limited inferencing available Named graphs not supported Users tend to end up creating weird mapping files Database models are often not perfect or clean, which complicates mapping file creation process Just mapping to RDF is not enough
26 Approach 2: Manual model building
27 Approach 2 outline Building a basic ontology and instantiate with data Steps: Define entities in data source Define relationships between entities Define properties of the entities Identify and use external ontologies
28 Model building considerations Review available technologies/languages Preferred may not have good RDF support/libs RDF data formats e.g. rdf, ttl, n3 All are interchangeable, but some are considered more readable and offer reduced size
29 ChEMBL relational schema revisited References Activities Targets Assays Molecules Binding Sites MOAs Drugs
30 ChEMBL Entities/Classes Journal Source Target- Component Document Assay Protein- Classification Substance Activity Target Bio- Component ChEMBL 17 classes
31 ChEMBL class definition An OWL based ontology used to define ChEMBL classes OWL snippet used to define ChEMBL : < :Assay! rdf:type owl:class ;! rdfs:label "ChEMBL Assay Class"^^xsd:string ;! rdfs:subclassof :ChEMBL.! Tools available to help build and write ontologies, e.g. Protégé and TopBraid Composer
32 Entity RDF representation Entity names become classes, which allow you to type your data! chembl_assay:chembl a cco:assay.! chembl_assay: and cco: are prefixes and a is a turtle shorthand for rdf:type
33 ChEMBL Entity relationships Journal Source Target- Component Document Assay Protein- Classification Substance Activity Target Bio- Component
34 Relationship RDF representation Relationships defined between instances of your entities are object properties! chembl_assay:chembl a cco:assay ;! cco:hastarget chembl_target:chembl ;! cco:hasactivity chembl_activity:chembl_act_ !
35 ChEMBL assay attributes Identify attributes from database you want to include in RDF model Map attribute types e.g. integers, strings, Booleans Some attributes map to external resources/ontologies see later Denormalisation of relational data, e.g. FKs
36 Attribute RDF representation Attributes you define for your classes are datatype properties! chembl_assay:chembl a cco:assay ;! cco:hastarget chembl_target:chembl ;! cco:hasactivity chembl_activity:chembl_act_ ;! rdfs:label "CHEMBL615672" ;! cco:chemblid "CHEMBL615672" ;! cco:assaytype "Functional" ;! cco:assaycelltype "3LL cell line" ;! cco:organismname "Mus musculus.!! Good practice to add a rdfs:label to all instances
37 More examples ChEMBL Entity properties Substance Target Activity TopBraid Composer (Free Edition)
38 Mapping to external ontologies Examples of ontologies/taxonomies mapped to in ChEMBL RDF include: BioAssay Ontology (BAO) ChEBI Chemical Infomation Ontology (CHEMINF) Bibliographic Ontology Unit Ontology (UO) QUDT Ontology Semantic Science Ontology (SIO) Cell Line Ontology (CLO) Experimental Factor Ontology (EFO)
39 External ontologies/taxonomies Identification of relevant external ontologies Community consensus + recommendations BioPortal - Ontology Lookup Service -
40 ChEMBL assay data Journal Source Target- Component Document Assay Protein- Classification Substance Activity Target Bio- Component
41 ChEMBL assay annotation The assay is the central component to the ChEMBL data model Current model not ideal Single category - binding, functional, ADMET, physchem Unstructured/free text used to describe assay Many assay parameters not captured although often not available Ontologies are now being used to improve ChEMBL assay annotations - ChEMBL_17 onwards Mappings to BAO bioassays, assay_format, endpoints
42 BioAssay Ontology Bioassay parent class, 92 descendant classes How do we map to all these BAO assay classes?
43 External ontology mapping process In many cases mapping is straight forward Use common bridging identifier e.g. UniProt Simple text based conversion e.g. units - actually units not so straight forward in ChEMBL Some mappings require complex rules e.g. assay details Multiple database parameters Complex text processing Manual curation Tools available to assist with mapping process BioPortal Annotator ( Zooma ( ChEMBL Assay Annotator
44 BioPortal Annotator ChEMBL Assay Description Restricted to Ontology interest (optional) Results API available
45 BioPortal Annotator Example CHEMBL assay description Induction of apoptosis in human Jurkat T cells overexpressing Neo assessed as loss in mitochondrial membrane potential at 30 ug/ml after 36 hrs by DiO6-based flow cytometry (Rvb = 5.4%) More information here: Now use BioPortal Annotator to annotate
46 ChEMBL Assay Annotator ChEMBL Assay Annotator developed by Samuel Croset Aim is to map ChEMBL assays to BAO assay classes Tailored mapping rules developed
47 External mapping representation In this example defining assay_format:! chembl_assay:chembl a cco:assay ;! cco:hastarget chembl_target:chembl ;! cco:hasactivity chembl_activity:chembl_act_ ;! rdfs:label "CHEMBL615672" ;! cco:chemblid "CHEMBL615672" ;! cco:assaytype "Functional" ;! cco:assaycelltype "3LL cell line" ;! cco:organismname "Mus musculus ;! bao:bao_ bao:bao_ !!! BAO_ = has_assay_format BAO_ = Cell based
48 ChEMBL Core Ontology (CCO) The skeleton schema used to store ChEMBL classes, object properties and datatype properties The file is also RDF, so can be queried independent of an instances ftp://ftp.ebi.ac.uk/pub/databases/chembl/chembl-rdf/18.0/ cco.ttl.gz Namespace: Initial focus on Substance (Molecule) and Target Classification In future an additional mapping file may be provided, which maps/aligns ChEMBL classes and properties to external resources
49 ChEMBL Core Ontology (CCO) Classes Substance Classes Target Classes
50 ChEMBL RDF schema
51 The (raw) end result
52 Querying ChEMBL data Need to load files into triplestore (Virtuoso Open Source) CCO ChEMBL SPARQL Interface/LD Browser Reactome Triplestore UniProt Triplestore ChEMBL Data ChEMBL Triplestore External Ontos e.g. BAO Bio2RDF
53 Using external RDF data sources Questions to think about when using external RDF data sources Who creates resource and the RDF representation? When was the resource last updated? When was RDF last updated? Does the data model make sense? Basic queries work? Shared entities and ontologies? Any data licensing issues?
54 VoID can help VoID = Vocabulary of Interlinked Datasets Acts a bridge between publishers and users EBI RDF resources provide a VoID (just an extra RDF file) Information contained in VoID Creation timestamps Publisher details Versioning Ontologies/vocabularies used Licensing Data formats available and where they live (not just RDF) More complex information such as Subsets and Linksets
55 Quick look at the ChEMBL VoID
56 Model building recommendations Technology review URIs should resolve and be future proofed Ensure the correct external namespaces are being used Add rdfs:label to everything Consider using identifiers for ontology names instead of textual descriptions As small descriptive ontology grows consistent naming conventions can breakdown Not used for CCO, but may consider future format switch e.g. CCO_ = ChEMBL Activity, CCO_ = ChEMBL Assay and so on
57 Technology stack Triple Processing Groovy OpenRDF Sesame Java API ( Rapper useful command line utility Triplestore/Storage Virtuoso Open Source Edition (Upgrade to Version 7 planned) Raw.ttl files available to download from ChEMBL FTP site ftp://ftp.ebi.ac.uk/pub/databases/chembl/chembl-rdf/ Domain/class specific.ttl files created helps processing and loading
58 New features ChEMBL 18 RDF More data, now 409,989,782 triples New types of data Binding sites, cell lines, mechanism of action New properties Molecule hierarchy mappings Target complex mappings Assay parameters Improved mappings to the BAO ontology assay_format, e.g. biochemical, physiochemical, cell based, Some example queries now follow ->
59 Example query 1 Get all human cell-lines: PREFIX rdfs: < PREFIX dcterms: < PREFIX bao: < PREFIX cco: < SELECT?cellLine?cellName! WHERE {!?cellline a cco:cellline ;! cco:taxonomy < ;! rdfs:label?cellname.! }!
60 Example query 2 Get all compounds that have been tested in a cell-based (bao:bao_ ) toxicity assay in HepG2 cells: PREFIX rdfs: < PREFIX dcterms: < PREFIX bao: < PREFIX cco: < SELECT?mol?assayDesc! WHERE {!?mol?p cco:substance ;! cco:substancetype?moltype ;! cco:hasactivity?activity.!?activity cco:hasassay?assay.!?assay cco:assaycelltype 'HepG2' ;! cco:assaytype 'Toxicity' ;! bao:bao_ bao:bao_ ;! dcterms:description?assaydesc.! }!
61 Example query 3 Get all concentration response assays (bao:bao_ ) for monoamine receptor targets (CHEMBL_PC_1266): PREFIX rdfs: < PREFIX dcterms: < PREFIX bao: < PREFIX cco: < SELECT distinct?assay?assaydesc! WHERE{! < cco:hastargetdescendant?target.!?target cco:hasassay?assay.!?assay cco:hasactivity?activity ;! dcterms:description?assaydesc.!?activity bao:bao_ ?endpoint.!?endpoint rdfs:subclassof bao:bao_ ! }!
62 Example query 4 Get the number of ADME assays carried out in organismbased (bao:bao_ ) format for FDA approved drugs: PREFIX rdfs: < PREFIX dcterms: < PREFIX bao: < PREFIX cco: < SELECT?molname?mol (count(distinct?assay) as?assay_count)! WHERE{!?assay a cco:assay ;! cco:assaytype 'ADME' ;! bao:bao_ bao:bao_ !?assay cco:hasactivity?activity.!?activity cco:hasmolecule?mol.!?mol cco:highestdevelopmentphase 4 ;! rdfs:label?molname.! }! GROUP BY?molname?mol! ORDER BY DESC(count(distinct?assay))!
63 Example query 5 Get all cell-lines that have been used in physical property assays (bao:bao_ ): PREFIX rdfs: < PREFIX dcterms: < PREFIX bao: < PREFIX cco: < SELECT?cellLine?assay! WHERE {!?cellline a cco:cellline ;! cco:iscelllineforassay?assay.!?assay cco:hasactivity?activity.!?activity bao:bao_ ?endpoint.!?endpoint rdfs:subclassof bao:bao_ ! }!
64 Example query 6 Get all Protein Kinase (CHEMBL_PC_1100) inhibitor binding sites: PREFIX rdfs: < PREFIX dcterms:< PREFIX bao: < PREFIX cco:< SELECT?target?bindingSite?siteName?inhibitor! WHERE{!?bindingSite a cco:bindingsite ;! cco:bindingsitename?sitename ;! cco:hastarget?target.! < cco:hastargetdescendant?target.!?target rdfs:label?targetname ;! cco:istargetformechanism?mechanism.!?mechanism cco:mechanismactiontype 'INHIBITOR' ;! cco:mechanismdescription?mechanismdesc ;! cco:hasmolecule?molecule.!?molecule rdfs:label?inhibitor.! }!
65 Future Plans: SureChEMBL December 2013 EMBL-EBI acquired SureChem a leading chemistry patent mining product from Digital Science, Macmillan Group SureChem provides a live (updated daily) view chemical patent space Rebranded SureChEMBL
66 Open PHACTS extension Open PHACTS project is keen to include patent data in future extensions to the project ENSO approved - funding to include SureChEMBL data in Open PHACTS RDF conversion, target indexing and API development EBI-RDF project benefit from RDF conversion SureChEMBL is updated daily, compared to quarterly ChEMBL updates Interesting challenge for us creating exports and systems loading SureChEMBL
67 Open PHACTS Platform Apps Core Platform Identity Resolution Service Identifier Management Service Adenosine receptor 2a P12374 EC CS4532 Linked Data API (RDF/XML, TTL, JSON) Semantic Workflow Engine Data Cache (Virtuoso Triple Store) Chemistry Registration Normalisation & Q/C Domain Specific Services Indexing VoID VoID VoID VoID VoID Nanopub Nanopub Nanopub Public Ontologies Db Db Public Content Db Commercial Db User Annotations (slide author: Lee Harland)
68 Summary Review of the ChEMBL database Two approaches used to modeling ChEMBL data Approach 2 used to build RDF representation of ChEMBL New features included in ChEMBL_18 release Model enhancements More data Plans for the future Patents Open PHACTS
69 Acknowledgements Groups and people involved in the RDF representation of ChEMBL include: ChEMBL Group Anna Gaulton Samual Croset John Overington EBI-RDF Group Andy Jenkinson Simon Jupp James Malone Open PHACTS Alasdair Gray Antonis Loizou Lee Harland Egon Willighagen
Mapping Relational data to RDF
RDF and RDB 2 D2RQ Mapping Relational data to RDF Suppose we have data in a relational database that we want to export as RDF 1. Choose an RDF vocabulary to represent the data 2. Define a mapping from
More informationRDF and RDB 2 D2RQ. Mapping Relational data to RDF D2RQ. D2RQ Features. Suppose we have data in a relational database that we want to export as RDF
Mapping Relational data to RDF RDF and RDB 2 D2RQ Suppose we have data in a relational database that we want to export as RDF 1. Choose an RDF vocabulary to represent the data 2. Define a mapping from
More informationRajashree Deka Tetherless World Constellation Rensselaer Polytechnic Institute
Rajashree Deka Tetherless World Constellation Rensselaer Polytechnic Institute Ø The majority of data underpinning the Web are stored in Relational Databases (RDB). Ø Advantages: Secure and scalable architecture.
More informationThe Implementation of Semantic Web Technology in Traditional Plant Medicine
The Implementation of Semantic Web Technology in Traditional Plant Medicine Nur Ana 1, A la Syauqi 2, M Faisal 3 123 Informatics Engineering, Faculty Science and Technology State Islamic University Maulana
More informationIntroduction to RDF and the Semantic Web for the life sciences
Introduction to RDF and the Semantic Web for the life sciences Simon Jupp Sample Phenotypes and Ontologies Team European Bioinformatics Institute jupp@ebi.ac.uk Practical sessions Converting data to RDF
More informationSureChem and ChEMBL. ACS CINF webinar. John P. Overington & Nicko Goncharoff
SureChem and ChEMBL ACS CINF webinar John P. Overington & Nicko Goncharoff 8 th April 2014 Assay/Target ChEMBL Data for Drug Discovery 1. Scientific facts 3. Insight, tools and resources for translational
More informationBIOLOGICAL PATHWAYS AND THE SEMANTIC WEB
BIOLOGICAL PATHWAYS AND THE SEMANTIC WEB Andra Waagmeester, Tina Kutmon, Egon Willighagen, and Alex Pico Univ. Maastricht, NL, and Gladstone Institutes, CA, USA What we will talk about today Introduc*on
More informationW3C Workshop on RDF Access to Relational Databases October, 2007 Boston, MA, USA D2RQ. Lessons Learned
W3C Workshop on RDF Access to Relational Databases 25-26 October, 2007 Boston, MA, USA D2RQ Lessons Learned Christian Bizer Richard Cyganiak Freie Universität Berlin The D2RQ Plattform 2002: D2R MAP dump
More informationAn overview of RDB2RDF techniques and tools
An overview of RDB2RDF techniques and tools DERI Reading Group Presentation Nuno Lopes August 26, 2009 Main purpose of RDB2RDF WG... standardize a language for mapping Relational Database schemas into
More informationLanguages and tools for building and using ontologies. Simon Jupp, James Malone
An overview of ontology technology Languages and tools for building and using ontologies Simon Jupp, James Malone jupp@ebi.ac.uk, malone@ebi.ac.uk Outline Languages OWL and OBO classes, individuals, relations,
More informationenanomapper database, search tools and templates Nina Jeliazkova, Nikolay Kochev IdeaConsult Ltd. Sofia, Bulgaria
enanomapper database, search tools and templates Nina Jeliazkova, Nikolay Kochev IdeaConsult Ltd. Sofia, Bulgaria www.ideaconsult.net Ø enanomapper database: data model, technology; NANoREG data transfer
More informationTaking a view on bio-ontologies. Simon Jupp Functional Genomics Production Team ICBO, 2012 Graz, Austria
Taking a view on bio-ontologies Simon Jupp Functional Genomics Production Team ICBO, 2012 Graz, Austria Who we are European Bioinformatics Institute one of world s largest bio data and service providers
More informationPractical experience: FP7 DebugIT project
Practical experience: Setting up a Clinical Data Repository FP7 DebugIT project Douglas Teodoro On behalf of DebugIT partners Oslo, 29 August 2011 Scope The Clinical Data Repository (CDR) components. How
More informationThe D2RQ mapping language. Richard Cyganiak Presentation to W3C RDB2RDF XG, 23 May 2008
The D2RQ mapping language Richard Cyganiak Presentation to W3C RDB2RDF XG, 23 May 2008 D2RQ DB-to-RDF mapper written in Java In: any JDBC database Out: SPARQL, Linked Data, or Jena API GPL, popular, easy
More informationFrameWeb. Vítor E. Silva Souza ~ vitorsouza
FrameWeb Vítor E. Silva Souza (vitorsouza@inf.ufes.br) http://www.inf.ufes.br/ ~ vitorsouza Department of Informatics Federal University of Espírito Santo (Ufes), Vitória, ES Brazil License for use and
More informationQuerying the Semantic Web
Querying the Semantic Web CSE 595 Semantic Web Instructor: Dr. Paul Fodor Stony Brook University http://www3.cs.stonybrook.edu/~pfodor/courses/cse595.html Lecture Outline SPARQL Infrastructure Basics:
More informationCOMPUTER AND INFORMATION SCIENCE JENA DB. Group Abhishek Kumar Harshvardhan Singh Abhisek Mohanty Suhas Tumkur Chandrashekhara
JENA DB Group - 10 Abhishek Kumar Harshvardhan Singh Abhisek Mohanty Suhas Tumkur Chandrashekhara OUTLINE Introduction Data Model Query Language Implementation Features Applications Introduction Open Source
More information3. Queries Applied Artificial Intelligence Prof. Dr. Bernhard Humm Faculty of Computer Science Hochschule Darmstadt University of Applied Sciences
3. Queries Applied Artificial Intelligence Prof. Dr. Bernhard Humm Faculty of Computer Science Hochschule Darmstadt University of Applied Sciences 1 Retrospective Knowledge Representation (1/2) What is
More informationUpdate: MIRIAM Registry and SBO
Update: MIRIAM Registry and SBO Nick Juty, EMBL-EBI 3rd Sept, 2011 Overview MIRIAM Registry MIRIAM Guidelines.. MIRIAM Registry content URIs (URN form), example Summary/current developments SBO Purpose
More informationOpen PHACTS. An Introduction and Explanation March Acknowledgements: Contains contributions from across the Open PHACTS partners.
Open PHACTS An Introduction and Explanation March 2012 Acknowledgements: Contains contributions from across the Open PHACTS partners. Public Domain Drug Discovery Data: Pharma are accessing, processing,
More informationBuilding Blocks of Linked Data
Building Blocks of Linked Data Technological foundations Identifiers: URIs Data Model: RDF Terminology and Semantics: RDFS, OWL 23,019,148 People s Republic of China 20,693,000 population located in capital
More informationIncremental Export of Relational Database Contents into RDF Graphs
National Technical University of Athens School of Electrical and Computer Engineering Multimedia, Communications & Web Technologies Incremental Export of Relational Database Contents into RDF Graphs Nikolaos
More informationDay 2. RISIS Linked Data Course
Day 2 RISIS Linked Data Course Overview of the Course: Friday 9:00-9:15 Coffee 9:15-9:45 Introduction & Reflection 10:30-11:30 SPARQL Query Language 11:30-11:45 Coffee 11:45-12:30 SPARQL Hands-on 12:30-13:30
More informationEnhanced Search and Vocabulary Access for the Shipboard Automated Meteorological and Oceanographic System (SAMOS) using the Semantic Web Stack
Enhanced Search and Vocabulary Access for the Shipboard Automated Meteorological and Oceanographic System (SAMOS) using the Semantic Web Stack Author: Nkemdirim Dockery Center for Ocean-Atmospheric Prediction
More informationBUILDING THE SEMANTIC WEB
BUILDING THE SEMANTIC WEB You might have come across the term Semantic Web Applications often, during talks about the future of Web apps. Check out what this is all about There are two aspects to the possible
More informationA Semantic Model for Federated Queries Over a Normalized Corpus
A Semantic Model for Federated Queries Over a Normalized Corpus Samuel Croset, Christoph Grabmüller, Dietrich Rebholz-Schuhmann 17 th March 2010, Hinxton EBI is an Outstation of the European Molecular
More informationKnowledge Representation RDF Turtle Namespace
Knowledge Representation RDF Turtle Namespace Jan Pettersen Nytun, UiA 1 URIs Identify Web Resources Web addresses are the most common URIs, i.e., uniform Resource Locators (URLs). RDF resources are usually
More informationSemi-Automatic Discovery of Meaningful Ontology from a Relational Database
University of Colorado, Boulder CU Scholar Computer Science Graduate Theses & Dissertations Computer Science Spring 1-1-2011 Semi-Automatic Discovery of Meaningful Ontology from a Relational Database David
More informationSELF-SERVICE SEMANTIC DATA FEDERATION
SELF-SERVICE SEMANTIC DATA FEDERATION WE LL MAKE YOU A DATA SCIENTIST Contact: IPSNP Computing Inc. Chris Baker, CEO Chris.Baker@ipsnp.com (506) 721 8241 BIG VISION: SELF-SERVICE DATA FEDERATION Biomedical
More informationJENA: A Java API for Ontology Management
JENA: A Java API for Ontology Management Hari Rajagopal IBM Corporation Page Agenda Background Intro to JENA Case study Tools and methods Questions Page The State of the Web Today The web is more Syntactic
More informationOpen PHACTS Explorer: Pharmacology by Enzyme Family
Open PHACTS Explorer: Pharmacology by Enzyme Family This document is a tutorial for using Open PHACTS Explorer (explorer.openphacts.org) to obtain pharmacological information for families of enzymes classified
More informationData management and integration
Development of Predictive Toxicology Applications An OpenTox Workshop 19 Sep 2010, Rhodes, Greece Data management and integration presented by Nina Jeliazkova (Ideaconsult Ltd., Bulgaria) Outline Ontology
More informationUsing RDF to Model the Structure and Process of Systems
Using RDF to Model the Structure and Process of Systems Marko A. Rodriguez Jennifer H. Watkins Johan Bollen Los Alamos National Laboratory {marko,jhw,jbollen}@lanl.gov Carlos Gershenson New England Complex
More informationSemantic Integration with Apache Jena and Apache Stanbol
Semantic Integration with Apache Jena and Apache Stanbol All Things Open Raleigh, NC Oct. 22, 2014 Overview Theory (~10 mins) Application Examples (~10 mins) Technical Details (~25 mins) What do we mean
More informationOrchestrating Music Queries via the Semantic Web
Orchestrating Music Queries via the Semantic Web Milos Vukicevic, John Galletly American University in Bulgaria Blagoevgrad 2700 Bulgaria +359 73 888 466 milossmi@gmail.com, jgalletly@aubg.bg Abstract
More informationCHAPTER 1 INTRODUCTION
1 CHAPTER 1 INTRODUCTION Most of today s Web content is intended for the use of humans rather than machines. While searching documents on the Web using computers, human interpretation is required before
More informationSPARQL UniProt.RDF. Everyone has had some introduction slash knowledge of RDF.
SPARQL UniProt.RDF Everyone has had some introduction slash knowledge of RDF. Jerven Bolleman Developer Swiss-Prot Group Swiss Institute of Bioinformatics Tutorial plan You should have used Topbraid composer
More informationSADI Semantic Web Services
SADI Semantic Web Services London, UK 8 December 8 2011 SADI Semantic Web Services Instructor: Luke McCarthy http:// sadiframework.org/training/ 2 Contents 2.1 Introduction to Semantic Web Services 2.1
More informationR2RML by Assertion: A Semi-Automatic Tool for Generating Customised R2RML Mappings
R2RML by Assertion: A Semi-Automatic Tool for Generating Customised R2RML Mappings Luís Eufrasio T. Neto 1, Vânia Maria P. Vidal 1, Marco A. Casanova 2, José Maria Monteiro 1 1 Federal University of Ceará,
More informationTools for Mapping Ontologies to Relational Databases: A Comparative Evaluation
Tools for Mapping Ontologies to Relational Databases: A Comparative Evaluation Dorin Moldovan, Marcel Antal, Dan Valea, Claudia Pop, Tudor Cioara, Ionut Anghel, Ioan Salomie Computer Science Department
More informationCustomisable Curation Workflows in Argo
Customisable Curation Workflows in Argo Rafal Rak*, Riza Batista-Navarro, Andrew Rowley, Jacob Carter and Sophia Ananiadou National Centre for Text Mining, University of Manchester, UK *Corresponding author:
More informationA discovery platform for translational research
A discovery platform for translational research - DisGeNET-RDF&SPARQL - Usage and Modeling Challenges Núria Queralt Rosinach Integrative Biomedical Informatics Group (IBI) Research Programme on Biomedical
More informationSemantic Web Fundamentals
Semantic Web Fundamentals Web Technologies (706.704) 3SSt VU WS 2017/18 Vedran Sabol with acknowledgements to P. Höfler, V. Pammer, W. Kienreich ISDS, TU Graz December 11 th 2017 Overview What is Semantic
More informationComparative Study of RDB to RDF Mapping using D2RQ and R2RML Mapping Languages
International Journal of Information Sciences and Application. ISSN 0974-2255 Volume 10, Number 1 (2018), pp. 23-36 International Research Publication House http://www.irphouse.com Comparative Study of
More informationProfiles Research Networking Software API Guide
Profiles Research Networking Software API Guide Documentation Version: March 13, 2013 Software Version: ProfilesRNS_1.0.3 Table of Contents Overview... 2 PersonID, URI, and Aliases... 3 1) Profiles RNS
More informationJena.
Jena http://openjena.org/ The Beginning... From: McBride, Brian Date: Mon, 28 Aug 2000 13:40:03 +0100 To: "RDF Interest (E-mail)" A few weeks ago I posted
More informationA Semantic Web-Based Approach for Harvesting Multilingual Textual. definitions from Wikipedia to support ICD-11 revision
A Semantic Web-Based Approach for Harvesting Multilingual Textual Definitions from Wikipedia to Support ICD-11 Revision Guoqian Jiang 1,* Harold R. Solbrig 1 and Christopher G. Chute 1 1 Department of
More informationSemantic Web Fundamentals
Semantic Web Fundamentals Web Technologies (706.704) 3SSt VU WS 2018/19 with acknowledgements to P. Höfler, V. Pammer, W. Kienreich ISDS, TU Graz January 7 th 2019 Overview What is Semantic Web? Technology
More informationContents. G52IWS: The Semantic Web. The Semantic Web. Semantic web elements. Semantic Web technologies. Semantic Web Services
Contents G52IWS: The Semantic Web Chris Greenhalgh 2007-11-10 Introduction to the Semantic Web Semantic Web technologies Overview RDF OWL Semantic Web Services Concluding comments 1 See Developing Semantic
More informationKNIME Enalos+ Molecular Descriptor nodes
KNIME Enalos+ Molecular Descriptor nodes A Brief Tutorial Novamechanics Ltd Contact: info@novamechanics.com Version 1, June 2017 Table of Contents Introduction... 1 Step 1-Workbench overview... 1 Step
More informationDigital Public Space: Publishing Datasets
Digital Public Space: Publishing Datasets Mo McRoberts, April 2012 I. Organise your data into sets. Implications Your data should ideally exist within a conceptual hierarchy (even if it's a singlelevel
More informationINF3580 SPRING 2014 EXERCISES WEEK 4
INF3580 SPRING 2014 EXERCISES WEEK 4 Martin G. Skjæveland 10 mars 2014 4 SPARQL Read Semantic Web Programming: chapter 6. Foundations of Semantic Web Technologies: chapter 7. 4.1 Query engine In this exercise
More informationThe P2 Registry
The P2 Registry -------------------------------------- Where the Semantic Web and Web 2.0 meet Digital Preservation David Tarrant, Steve Hitchcock & Les Carr davetaz / sh94r / lac @ecs.soton.ac.uk School
More informationDeveloping a Benchmark Suite for Semantic Web Data from Existing Workflows
Developing a Benchmark Suite for Semantic Web Data from Existing Workflows Antonis Troumpoukis 1, Angelos Charalambidis 1, Giannis Mouchakis 1, Stasinos Konstantopoulos 1, Ronald Siebes 2, Victor de Boer
More informationFuseki Server Installation
Fuseki Server Installation Related task of the project (Task # and full name): Author: Prepared by: Approved by: Task 43 Ontology standard and Metadata Sachin Deshmukh Sachin Deshmukh Richard Kaye Page:
More informationMapping Relational Data to RDF with Virtuoso's RDF Views
Mapping Relational Data to RDF with Virtuoso's RDF Views Among its many talents, OpenLink Virtuoso Universal Server includes SPARQL support and an RDF data store tightly integrated with its relational
More informationDBPedia (dbpedia.org)
Matt Harbers Databases and the Web April 22 nd, 2011 DBPedia (dbpedia.org) What is it? DBpedia is a community whose goal is to provide a web based open source data set of RDF triples based on Wikipedia
More informationThe Semantic Institution: An Agenda for Publishing Authoritative Scholarly Facts. Leslie Carr
The Semantic Institution: An Agenda for Publishing Authoritative Scholarly Facts Leslie Carr http://id.ecs.soton.ac.uk/people/60 What s the Web For? To share information 1. Ad hoc home pages 2. Structured
More informationApache Marmotta. Multimedia Management
for Multimedia Management Jakob Frank, Thomas Kurz http://marmotta.apache.org/ Who are we? Jakob Frank Researcher at Salzburg Research Solution Architect at Redlink GmbH ASF Committer of Marmotta Thomas
More informationBuilding innovative drug discovery alliances. Knime Desktop tools for chemists
Building innovative drug discovery alliances Knime Desktop tools for chemists Evotec AG, Knime desktop tools for chemists, May 2011 Agenda Knime Getting project data from Excel spreadsheets Getting project
More informationKNIME Enalos+ Modelling nodes
KNIME Enalos+ Modelling nodes A Brief Tutorial Novamechanics Ltd Contact: info@novamechanics.com Version 1, June 2017 Table of Contents Introduction... 1 Step 1-Workbench overview... 1 Step 2-Building
More informationGrid Resources Search Engine based on Ontology
based on Ontology 12 E-mail: emiao_beyond@163.com Yang Li 3 E-mail: miipl606@163.com Weiguang Xu E-mail: miipl606@163.com Jiabao Wang E-mail: miipl606@163.com Lei Song E-mail: songlei@nudt.edu.cn Jiang
More informationCOMP9321 Web Application Engineering
COMP9321 Web Application Engineering Semester 2, 2017 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 5 http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid=2465 1 Semantic
More informationSoftware + Services for Data Storage, Management, Discovery, and Re-Use
Software + Services for Data Storage, Management, Discovery, and Re-Use CODATA 22 Conference Stellenbosch, South Africa 25 October 2010 Alex D. Wade Director Scholarly Communication Microsoft External
More informationLinked Data in the Clouds : a Sindice.com perspective
Linked Data in the Clouds : a Sindice.com perspective Giovanni Tummarello, FBK - DERI Copyright 2008. All rights reserved. Some definitions Linked Open Data: Official Definition The data that is available
More informationSemantic Web Technologies
1/33 Semantic Web Technologies Lecture 11: SWT for the Life Sciences 4: BioRDF and Scientifc Workflows Maria Keet email: keet -AT- inf.unibz.it home: http://www.meteck.org blog: http://keet.wordpress.com/category/computer-science/72010-semwebtech/
More informationRDF Mapper easy conversion of relational databases to RDF
RDF Mapper easy conversion of relational databases to RDF Eliot Bytyçi, Lule Ahmedi and Granit Gashi University of Prishtina Hasan Prishtina, 10000, Prishtinë, Kosovo {eliot.bytyci, lule.ahmedi}@uni-pr.edu,
More informationAdvances in Data Integration & Representation in Systems Biology
Advances in Data Integration & Representation in Systems Biology Susie Stephens Principal Product Manager, Life Sciences Oracle susie.stephens@oracle.com Outline Systems Biology Data Requirements Semantic
More informationOpen And Linked Data Oracle proposition Subtitle
Presented with Open And Linked Data Oracle proposition Subtitle Pascal GUY Master Sales Consultant Cloud Infrastructure France May 30, 2017 Copyright 2014, Oracle and/or its affiliates. All rights reserved.
More informationFrom Raw Sensor Data to Semantic Web Triples Information Flow in Semantic Sensor Networks
From Raw Sensor Data to Semantic Web Triples Information Flow in Semantic Sensor Networks Nikolaos Konstantinou, Ph.D. Autonomic and Grid Computing group Athens Information Technology Lecture Outline Introduction
More informationThe ELIXIR of Linked Data
The ELIXIR of Linked Data Professor Carole Goble (UK node) Barend Mons (NL node), Helen Parkinson (EMBL-EBI node) The Interoperability Services Backbone Team European Life Sciences Infrastructure for Biological
More informationBlink Project: Linked Open Data for Countway Library Final report for Phase 1 (June-Nov. 2012) Prepared by Sophia Cheng
Blink Project: Linked Open Data for Countway Library Final report for Phase 1 (June-Nov. 2012) Prepared by Sophia Cheng Summary We propose to improve the usefulness and discoverability of Countway Library
More informationMaximizing the Value of STM Content through Semantic Enrichment. Frank Stumpf December 1, 2009
Maximizing the Value of STM Content through Semantic Enrichment Frank Stumpf December 1, 2009 What is Semantics and Semantic Processing? Content Knowledge Framework Technology Framework Search Text Images
More informationEnrichment of Sensor Descriptions and Measurements Using Semantic Technologies. Student: Alexandra Moraru Mentor: Prof. Dr.
Enrichment of Sensor Descriptions and Measurements Using Semantic Technologies Student: Alexandra Moraru Mentor: Prof. Dr. Dunja Mladenić Environmental Monitoring automation Traffic Monitoring integration
More informationFrom the Web to the Semantic Web: RDF and RDF Schema
From the Web to the Semantic Web: RDF and RDF Schema Languages for web Master s Degree Course in Computer Engineering - (A.Y. 2016/2017) The Semantic Web [Berners-Lee et al., Scientific American, 2001]
More informationPublishing Statistical Data and Geospatial Data as Linked Data Creating a Semantic Data Platform
Publishing Statistical Data and Geospatial Data as Linked Data Creating a Semantic Data Platform Hans Viehmann Product Manager EMEA ORACLE Corporation January 22, 2017 @SpatialHannes Safe Harbor Statement
More informationEmerging practices for mapping and linking life sciences data using RDF - a case series
Emerging practices for mapping and linking life sciences data using RDF - a case series M. Scott Marshall, PhD 1, Richard Boyce, PhD 2, Helena F. Deus, PhD 3, Jun Zhao, PhD 4, Egon L Willighagen, PhD 5,
More informationSemantics. Matthew J. Graham CACR. Methods of Computational Science Caltech, 2011 May 10. matthew graham
Semantics Matthew J. Graham CACR Methods of Computational Science Caltech, 2011 May 10 semantic web The future of the Internet (Web 3.0) Decentralized platform for distributed knowledge A web of databases
More informationGenerating of RDF graph from a relational database using Jena API
Generating of RDF graph from a relational database using Jena API Jamal BAKKAS #1, Mohamed BAHAJ #2 # Department of Mathematics and computer science, University Hassan I, FSTS FST Settat, Km 3, B.P.:57,
More informationGábor Imre MADFAST SIMILARITY SEARCH
Gábor Imre MADFAST SIMILARITY SEARCH How fast is MadFast? Some numbers measured on an Amazon EC2 c3.8xlarge/r3.8xlarge machine How fast is MadFast? Some numbers measured on an Amazon EC2 c3.8xlarge/r3.8xlarge
More informationSolving problem of semantic terminology in digital library
International Journal of Advances in Intelligent Informatics ISSN: 2442-6571 20 Solving problem of semantic terminology in digital library Herlina Jayadianti Universitas Pembangunan Nasional Veteran Yogyakarta,
More informationCreating provenance supergraphs using pingbacks The Provenance Sharing Network
Creating provenance supergraphs using pingbacks The Provenance Sharing Network Nicholas J Car MODSIM 2015, 4 th December 2015 LAND & WATER Outline Supergraphs PROV-AQ Overview Pingback messages PROMS extensions
More informationKnowledge Representation VII - IKT507. SPARQL stands for SPARQL Protocol And RDF Query Language
Knowledge Representation VII - IKT507 This sentence is false! PARQL stands for PARQL Protocol And RDF Query Language Jan Pettersen Nytun, UiA 1 The emantic Web tack Hierarchy of languages, where each layer
More informationDatabase of historical places, persons, and lemmas
Database of historical places, persons, and lemmas Natalia Korchagina Outline 1. Introduction 1.1 Swiss Law Sources Foundation as a Digital Humanities project 1.2 Data to be stored 1.3 Final goal: how
More informationISA Action 1.17: A Reusable INSPIRE Reference Platform (ARE3NA)
ISA Action 1.17: A Reusable INSPIRE Reference Platform (ARE3NA) Expert contract supporting the Study on RDF and PIDs for INSPIRE Deliverable D.EC.3.2 RDF in INSPIRE Open issues, tools, and implications
More informationRDF /RDF-S Providing Framework Support to OWL Ontologies
RDF /RDF-S Providing Framework Support to OWL Ontologies Rajiv Pandey #, Dr.Sanjay Dwivedi * # Amity Institute of information Technology, Amity University Lucknow,India * Dept.Of Computer Science,BBA University
More informationLinked data and its role in the semantic web. Dave Reynolds, Epimorphics
Linked data and its role in the semantic web Dave Reynolds, Epimorphics Ltd @der42 Roadmap What is linked data? Modelling Strengths and weaknesses Examples Access other topics image: Leo Oosterloo @ flickr.com
More informationVocabulary and Semantics in the Virtual Observatory
Vocabulary and Semantics in the Virtual Observatory Norman Gray VO-TECH / AstroGrid / Uni. Leicester / Uni. Glasgow, UK VOEvent BoF, ADASS, London, 2007 September 24 rdf Resource Description Framework
More informationSemantic Technologies and CDISC Standards. Frederik Malfait, Information Architect, IMOS Consulting Scott Bahlavooni, Independent
Semantic Technologies and CDISC Standards Frederik Malfait, Information Architect, IMOS Consulting Scott Bahlavooni, Independent Part I Introduction to Semantic Technology Resource Description Framework
More informationQuick Reference Guide
Quick Reference Guide Contents 1. The Query Page 3 2. Constructing Queries: Reactions 4 Substances 5 Medical Chemistry 7 Literature 8 Properties 9 Natural Products 10 3. Results: Filters 11 Analysis View
More informationMaking BioPAX SPARQL
Making BioPAX SPARQL hands on... start a terminal create a directory jena_workspace, move into that directory download jena.jar (http://tinyurl.com/3vlp7rw) download biopax data (http://www.biopax.org/junk/homosapiens.nt
More informationUniversity of Bath. Publication date: Document Version Publisher's PDF, also known as Version of record. Link to publication
Citation for published version: Patel, M & Duke, M 2004, 'Knowledge Discovery in an Agents Environment' Paper presented at European Semantic Web Symposium 2004, Heraklion, Crete, UK United Kingdom, 9/05/04-11/05/04,.
More informationMetadata. Week 4 LBSC 671 Creating Information Infrastructures
Metadata Week 4 LBSC 671 Creating Information Infrastructures Muddiest Points Memory madness Hard drives, DVD s, solid state disks, tape, Digitization Images, audio, video, compression, file names, Where
More informationMinimal Metadata Standards and MIIDI Reports
Dryad-UK Workshop Wolfson College, Oxford 12 September 2011 Minimal Metadata Standards and MIIDI Reports David Shotton, Silvio Peroni and Tanya Gray Image BioInformatics Research Group Department of Zoology
More informationSemantic Web and Natural Language Processing
Semantic Web and Natural Language Processing Wiltrud Kessler Institut für Maschinelle Sprachverarbeitung Universität Stuttgart Semantic Web Winter 2014/2015 This work is licensed under a Creative Commons
More informationSemantic Information Retrieval: An Ontology and RDFbased
Semantic Information Retrieval: An Ontology and RDFbased Model S. Mahaboob Hussain Assistant Professor, CSE Prathyusha Kanakam Assistant Professor, CSE D. Suryanarayana Professor, CSE Swathi Gunnam PG
More informationExploring the Generation and Integration of Publishable Scientific Facts Using the Concept of Nano-publications
Exploring the Generation and Integration of Publishable Scientific Facts Using the Concept of Nano-publications Amanda Clare 1,3, Samuel Croset 2,3 (croset@ebi.ac.uk), Christoph Grabmueller 2,3, Senay
More informationAn R2RML Mapping Management API in Java. Making an API Independent of its Dependencies
An R2RML Mapping Management API in Java Making an API Independent of its Dependencies Marius Strandhaug Master s Thesis Spring 2014 Abstract When developing an Application Programming Interface (API),
More informationBridging the Gap between Semantic Web and Networked Sensors: A Position Paper
Bridging the Gap between Semantic Web and Networked Sensors: A Position Paper Xiang Su and Jukka Riekki Intelligent Systems Group and Infotech Oulu, FIN-90014, University of Oulu, Finland {Xiang.Su,Jukka.Riekki}@ee.oulu.fi
More informationIntroduction to metadata cleansing using SPARQL update queries. April 2014 PwC EU Services
Introduction to metadata cleansing using SPARQL update queries April 2014 PwC EU Services Learning objectives By the end of this module, you will have an understanding of: How to transform your metadata
More information