A discovery platform for translational research
|
|
- Cecily Dean
- 6 years ago
- Views:
Transcription
1 A discovery platform for translational research - DisGeNET-RDF&SPARQL - Usage and Modeling Challenges Núria Queralt Rosinach Integrative Biomedical Informatics Group (IBI) Research Programme on Biomedical Informatics (GRIB) Hospital del Mar Research Institute (IMIM) Pompeu Fabra University (UPF) Barcelona
2 Outline How can DisGeNET help your research? DisGeNET Discovery Platform Overview DisGeNET Linked Open Data Introduction RDF-LD Description: Data Model, VoID, Interlinking Implementation Accessibility Documentation Use Cases Querying the DisGeNET-RDF Hands-on
3 How can DisGeNET help your research?
4 Big Questions 4 Big Data -Data Bases -Literature -OMICS -Animal models - Human Biology Understanding Human Diseases Medical Sciences -EMR, EHR, IoT -Imaging -Patient registries -Clinical trials -Epidemiologic studies - PPI Genotype Phenotype Comorbidities Environment (life-style, chemicals, radiation, infections, clinical care intervention, ) DDI
5 Translational Research -Data Bases -Literature -OMICS -Animal models - Molecular Understanding Human Diseases Patient -EMR, EHR, IoT -Imaging -Patient registries -Clinical trials -Epidemiologic studies - Key in Translational Research Genotype Environment Phenotype Decision-making Prevention Diagnosis Therapies Research Discovery
6 Access to Gene-Disease Associations Mental retardation -?- SOX3 SOX3 OMIM:300123; OMIM: ORPHA393; ORPHA90695; ORPHA3157; ORPHA79495; ORPHA67045 Mental Retardation; Panhypopituitarism; 46,XX sex reversal 3 No Data MESH:C538613; MESH:C538613
7 Access to Gene-Disease Associations Mental retardation -?- SOX3 SOX3 OMIM:300123; OMIM: ORPHA393; ORPHA90695; ORPHA3157; ORPHA79495; ORPHA67045 Mental Retardation; Panhypopituitarism; 46,XX sex reversal 3 No Data MESH:C538613; MESH:C Lack of: Normalization Semantic integration Data model harmonization Unified access
8 Knowledge platform on human gene-disease associations (GDAs) Integrates information from expert-curated databases and from the literature (text mining) All disease areas Supporting evidence Analysis tools Piñero et al. DisGeNET: a discovery platform for the dynamical exploration of human diseases and their genes. Database (2015) Vol. 2015: article ID bav028, (2015)
9 Research Questions Which genes are associated to Marfan syndrome? Which disease genes have approved drugs annotated? Which disease genes have differential expression? Which disease genes share a pathway? Is there genetic variation related to the MECP2 and Rett Syndrome association? What evidence supports the association between APP gene and Alzheimer Disease? Which genes and evidence support the comorbidity between Chronic Kidney disease and Diabetes Mellitus, Type 2? Evidence ANALYSIS KNOWLEDGE DISCOVERY ACTIONABLE INFORMATION
10 DisGeNET Discovery Platform Overview
11 DisGeNET Implementation Text mining Biomedical databases Bio-Entity Finder and Relation Extraction Gene-disease associations Gene-disease associations
12 DisGeNET Sources DisGeNET v3.0 GAD LHGDN CURATED PREDICTED LITERATURE
13 Data Integration EVIDENCE DisGeNET ONTOLOGY S = Scurated + Spredicted + Sliterature Score Source Article Gene-Disease Sentence SNP Disease Gene MeSH Class UMLS STY Protein Panther Class Pathway Use of Standards and controlled vocabularies
14 DisGeNET Statistics (May 15th, 2015) Annual Release DisGeNET v3.0 Source Genes Diseases Associations Curated 7,878 6,761 26,522 Predicted 2,557 2,003 9,536 Literature 16,298 11, ,175 All 17,181 14, , % Large volume of information unlocked by text mining the literature
15 Tools for exploration (Onexus) Usage stats (Ago2014-Ago2015): 12,040 users, 22,696 sessions (4:33 min/session) 14,494 downloads (database, Cytoscape plugin, RDF/Nanopubs) DisGeNET used in +20 publications, cited in +60 articles
16 Web Interface SOX3 NCBI:6658 Panhypopituitarism UMLS:C
17 Cytoscape Plugin SOX3 NCBI:6658 Panhypopituitarism UMLS:C
18 DisGeNET Linked Open Data
19 DisGeNET as Linked Open Data RDF and trusty nanopublications URIs: RDF providers or SIO Use of standards (11 ontologies in NCBO) Metadata description (W3C HCLS) Interlinking Bio2RDF Linked Life Data Access Download Data Dump SPARQL Endpoint Faceted Browser Open PHACTS Nanopublication Network Open license Datahub Software BioHackathon Aug 2014
20 DisGeNET-RDF
21 Data Model How to describe an association? Gene associated Disease a) As a property S P O b) As a class Gene Association Disease O P S P O
22 Data Model How to describe an association? Gene associated Disease a) As a property S P O b) As a class Gene Association Disease O P S P O
23 Data Model How to describe an association? Gene associated Disease a) As a property Provenance and Evidence S RDF triples P O b) As a class Gene Association Disease O P S P O
24 Ontology-based integration Data Model DisGeNET Association Type Ontology DisGeNET Standards Shared IDs Standard ontologies rdf:type Gene Association Disease O P S P O
25 Data Model Semantic Annotation: Standard ontologies Prefix Namespace Vocabularies ncit NCI Thesaurus sio SIO up UniProt void VoID foaf FOAF Vocabulary dcterms DCMI Terms rdf RDF rdfs RDF Schema xsd XML Schema owl OWL skos SKOS
26 Data Model Semantic Annotation: Standard ontologies Prefix Namespace Vocabularies ncit NCI Thesaurus sio SIO up UniProt void VoID foaf FOAF Vocabulary dcterms DCMI Terms rdf RDF rdfs RDF Schema xsd XML Schema owl OWL skos SKOS RDF Structure
27 Data Model Semantic Annotation: Standard ontologies Prefix Namespace Vocabularies ncit NCI Thesaurus sio SIO up UniProt void VoID foaf FOAF Vocabulary dcterms DCMI Terms rdf RDF rdfs RDF Schema xsd XML Schema owl OWL skos SKOS Biomedical entities Relationships RDF Structure
28 Data Model Semantic Annotation: Standard ontologies Prefix Namespace Vocabularies ncit NCI Thesaurus sio SIO up UniProt void VoID foaf FOAF Vocabulary dcterms DCMI Terms rdf RDF rdfs RDF Schema xsd XML Schema owl OWL skos SKOS Biomedical entities Relationships Metadata RDF Structure
29 Data Model URIs in DisGeNET: shared, cool & dereferenceable ID Normalization DisGeNET URIs: Estable URIs from primary data providers Identifiers.org Unique association attributes
30 Data Model URIs in DisGeNET: shared, cool & dereferenceable ID Normalization Gene-Disease Association::DisGeNET ID Entity URI Semantics Gene-Disease Association DGNf5cb3969d75871f05a5d5f984f8dfc34 sio:sio_ PubMed article ncit:c47902 Source Score ncbigene:4728_umls:c _association_disgenet Score dctypes:dataset, dcat:distribution ncit:c25338 SNP ncit:c18279
31 Data Model URIs in DisGeNET: shared, cool & dereferenceable ID Normalization Gene::NCBI Gene ID Entity URI Semantics Gene ncit:c16612 HGNC Gene Symbol ncit:c43568 Protein ncit:c17021 Panther Class /PC00211 rdfs:class Pathway ncit:c20633
32 Data Model URIs in DisGeNET: shared, cool & dereferenceable ID Normalization Disease::UMLS Concept Unique Identifier (CUI) Entity URI Semantics Disease C ncit:c7057 MeSH Class rdfs:class UMLS Semantic Type umlssn.owl#t047 rdfs:class Phenotype sio:sio_ Cross References Human Disease Ontology, MesH, OMIM, Orphanet, Decipher, NCIt, ICD9, Human Phenotype Ontology
33 Data Model
34 Data Model /DisGeNET-RDF-Example.ttl (Turtle)
35 Metada Dataset Description DisGeNET-RDF VoID file (Vocabulary of Interlinked Datasets) Gene Disease Association Disease Class Pathway SNP STY PubMed Panther Class Protein DisGeNET-RDF Pathway HGNC Symbol subsets Dataset DisGeNET Database DisGeNET CTD UniProt ClinVar MGD BeFree 1º sources
36 Interlinking DisGeNET -- RDF link -> LOD cloud Dataset 1 Dataset 2 DisGeNET skos:exactmatch UniProt Important in Federated Queries!
37 Interlinking?s skos:exactmatch?o Biomedical Databases and Disease Terminologies NCBI Gene PubMed Orphanet UniProt DisGeNET OMIM dbsnp UMLS DBpedia MeSH
38 DisGeNET as Linked Open Data Interlinking: 4,962,315 RDF links to RDF datasets in the LOD (more statistics)
39 Federated Query Support SPARQL 1.1: SERVICE <sparql endpoint> {} Skos:exactMatch GDA ID Disease ID Gene ID
40 Implementation DisGeNET RDF data, VoID dataset description, and six OWL ontologies loaded into the RDF Store Total number of triples: 24,882,432 (8,5G) LODEStar: SPARQL + LD Browser SPARQL Endpoint Faceted Browser Hardware: Usage Restrictions SPARQL: only SELECT, DESCRIBE, ASK performance opt: Max # of rows per result Max query cost estimation time Max query execution time Security: basic setup
41 Accessibility Download: RDF dump + linksets Faceted Browser SPARQL endpoint EBI::LODEStar SPARQL + Linked Data Browser Open PHACTS APIs
42 Documentation Descriptions RDF Schema Points of access SPARQL query
43 Use Cases What genes are associated to Marfan syndrome? What evidence supports the association between APP gene and Alzheimer Disease? What disease classes are associated with APP gene? Which genes and evidence support the comorbidity between Chronic Kidney disease and Diabetes Mellitus, Type 2? What SNPs are related to the MECP2 and Rett Syndrome association? Which diseases are associated to post-translational modifications type of association? What disease genes are hitted by compounds in ChEMBL? What disease genes have differential expression in Gene Expression Atlas? What disease genes are in WikiPathways? Find compounds (from ChEMBL) that target genes (from DisGeNET) that participate in the same pathway (from WikiPathways)
44 Querying the DisGeNET-RDF
45 SPARQL QUERIES Not easy RDF Schema-aware Performance issues Optimal queries: there is a trade off between the amount of time you spend analyzing and transforming the query and the performance gains of those transformations Technology-dependant crossing a lot of information decrease speed (making the system fails): better local Other approaches on development Q/A based on natural language Linked Data Fragments ElasticSearch
46 Querying DisGeNET SPARQL Queries over DisGeNET data SPARQL query graph
47 Data Model
48 Querying DisGeNET SPARQL Queries over DisGeNET data Minimal Resource Description Graph?label?type rdf:type?subject?rdfsource?comment?title?id SELECT DISTINCT * FROM < WHERE{?subject rdf:type?type ; rdfs:label?label; rdfs:comment?comment ; dcterms:identifier?id ; dcterms:title?title ; void:indataset?rdfsource. } LIMIT 100
49 Querying DisGeNET SPARQL Queries over DisGeNET data Gene-Disease Association Graph sio:sio_ rdf:type gda/dgnf5cb3969d75871f05a5d 5f984f8dfc34 SELECT DISTINCT?gda FROM < WHERE{?gda rdf:type sio:sio_ } LIMIT 100 Which is the sio:sio_ class? For each?gda, show me the?gene and the?disease associated, and the?typeofassociation For each?gda, show me the?gene and the?disease associated, the?paper, and the?sentence description of the relationship in the paper For each?gda show me the?gene,?disease,?source, and the level of?evidence of the association For each gene-disease pair show me the score?value For each?gda show me the?snp Go to the Web and understand and execute Q1.1-Q1.4
50 Querying DisGeNET SPARQL Queries over DisGeNET data Gene Graph Gene rdf:type SELECT DISTINCT?gene FROM < WHERE{?gene rdf:type ncit:c } LIMIT 100
51 Querying DisGeNET SPARQL Queries over DisGeNET data Gene Graph Gene rdf:type SELECT DISTINCT?gene FROM < WHERE{?gene a ncit:c } LIMIT 100 For each?gene show me:?identifier,?name,?genesymbol?protein(s)?panther class(es) and?pantherclassname?pathway(s) and?pathwayname Go to web and understand/execute Q1.5
52 Querying DisGeNET SPARQL Queries over DisGeNET data Disease Graph Disease rdf:type /id/c SELECT DISTINCT?disease FROM < WHERE{?disease a Disease. } LIMIT 100 For the disease < show me: the disease?name, MeSH disease class?label, and the umlssty?title show all cross-references to other disease terminologies Go to the Web and understand/execute Q1.6
53 Querying DisGeNET SPARQL Queries over DisGeNET data Ontology Walking queries Grouping of similar instances Filtering data Query data by classes Ontologies loaded in our RDF triple store: SIO, DO, ORDO, NCIT, HPO, and ECO (OWL) Go to the Web and understand/execute Q1.7and Q1.11
54 Querying DisGeNET SPARQL Queries over DisGeNET data Disease-Phenotype Association Graph (curated from HPO) Disease rdf:type /id/c ?link Disease rdf:type sio: is-manifested-as?phenotype Go to the Web and understand/execute Q1.10 and Q1.12
55 Querying DisGeNET SPARQL Queries over DisGeNET data Disease-Phenotype Association Graph (curated from HPO) Why this model? SELECT DISTINCT?disease count(distinct?hpdisease) as?hpdiseases count(distinct?phenotype) as?phenotypes WHERE {?disease rdf:type ncit:c7057.?disease skos:exactmatch?hpdisease.?hpdisease sio:sio_000341?phenotype. } ORDER BY DESC(?hpdiseases) LIMIT 100 SELECT DISTINCT?disease?hpdisease count(distinct?phenotype) as?phenotypes WHERE {?disease rdf:type ncit:c7057.?disease skos:exactmatch?hpdisease.?hpdisease sio:sio_000341?phenotype. FILTER (?disease = < } GROUP BY?disease?hpdisease
56 Querying DisGeNET + LOD cloud Federated Queries: DisGeNET + external datasets Go to the Web and understand/execute the Federated Queries
57 Use Cases What genes are associated to Marfan syndrome? What evidence supports the association between APP gene and Alzheimer Disease? What disease classes are associated with APP gene? Which genes and evidence support the comorbidity between Chronic Kidney disease and Diabetes Mellitus, Type 2? What SNPs are related to the MECP2 and Rett Syndrome association? Which diseases are associated to post-translational modifications type of association? What disease genes are hitted by compounds in ChEMBL? What disease genes have differential expression in Gene Expression Atlas? What disease genes are in WikiPathways? Find compounds (from ChEMBL) that target genes (from DisGeNET) that participate in the same pathway (from WikiPathways)
58 Acknowledgments IBI Group Alba Gutiérrez-Sacristán Àlex Bravo Janet Piñero Núria Queralt Rosinach Alexia Giannoula Miguel A. Mayer Laura I. Furlong Ferran Sanz PRBB Computational Seminars
59 Thanks for your attention! Questions are welcome
Publishing DisGeNET as Nanopublications
Publishing DisGeNET as Nanopublications Núria Queralt-Rosinach a, Tobias Kuhn b, Christine Chichester c, Michel Dumontier d, Ferran Sanz a, and Laura I. Furlong a,* a IBI group, Research Programme on Biomedical
More informationA Semantic Web-Based Approach for Harvesting Multilingual Textual. definitions from Wikipedia to support ICD-11 revision
A Semantic Web-Based Approach for Harvesting Multilingual Textual Definitions from Wikipedia to Support ICD-11 Revision Guoqian Jiang 1,* Harold R. Solbrig 1 and Christopher G. Chute 1 1 Department of
More informationBIOLOGICAL PATHWAYS AND THE SEMANTIC WEB
BIOLOGICAL PATHWAYS AND THE SEMANTIC WEB Andra Waagmeester, Tina Kutmon, Egon Willighagen, and Alex Pico Univ. Maastricht, NL, and Gladstone Institutes, CA, USA What we will talk about today Introduc*on
More informationThe ELIXIR of Linked Data
The ELIXIR of Linked Data Professor Carole Goble (UK node) Barend Mons (NL node), Helen Parkinson (EMBL-EBI node) The Interoperability Services Backbone Team European Life Sciences Infrastructure for Biological
More informationIntroduction to RDF and the Semantic Web for the life sciences
Introduction to RDF and the Semantic Web for the life sciences Simon Jupp Sample Phenotypes and Ontologies Team European Bioinformatics Institute jupp@ebi.ac.uk Practical sessions Converting data to RDF
More informationData Integration Framework of Pharmacology Databases Using Ontology
Data Integration Framework of Pharmacology Databases Using Ontology Phimphan Thipphayasaeng 1, Poonpong Boonbrahm 1, Marut Buranarach 2 and Anunchai Assawamakin 3 1 School of Informatics, Walailak University,
More informationSemantic Technologies and CDISC Standards. Frederik Malfait, Information Architect, IMOS Consulting Scott Bahlavooni, Independent
Semantic Technologies and CDISC Standards Frederik Malfait, Information Architect, IMOS Consulting Scott Bahlavooni, Independent Part I Introduction to Semantic Technology Resource Description Framework
More informationCore Technology Development Team Meeting
Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationLanguages and tools for building and using ontologies. Simon Jupp, James Malone
An overview of ontology technology Languages and tools for building and using ontologies Simon Jupp, James Malone jupp@ebi.ac.uk, malone@ebi.ac.uk Outline Languages OWL and OBO classes, individuals, relations,
More informationDay 2. RISIS Linked Data Course
Day 2 RISIS Linked Data Course Overview of the Course: Friday 9:00-9:15 Coffee 9:15-9:45 Introduction & Reflection 10:30-11:30 SPARQL Query Language 11:30-11:45 Coffee 11:45-12:30 SPARQL Hands-on 12:30-13:30
More informationSemantic Annotation and Linking of Medical Educational Resources
5 th European IFMBE MBEC, Budapest, September 14-18, 2011 Semantic Annotation and Linking of Medical Educational Resources N. Dovrolis 1, T. Stefanut 2, S. Dietze 3, H.Q. Yu 3, C. Valentine 3 & E. Kaldoudi
More informationwarwick.ac.uk/lib-publications
Original citation: Zhao, Lei, Lim Choi Keung, Sarah Niukyun and Arvanitis, Theodoros N. (2016) A BioPortalbased terminology service for health data interoperability. In: Unifying the Applications and Foundations
More informationSELF-SERVICE SEMANTIC DATA FEDERATION
SELF-SERVICE SEMANTIC DATA FEDERATION WE LL MAKE YOU A DATA SCIENTIST Contact: IPSNP Computing Inc. Chris Baker, CEO Chris.Baker@ipsnp.com (506) 721 8241 BIG VISION: SELF-SERVICE DATA FEDERATION Biomedical
More informationThe CALBC RDF Triple store: retrieval over large literature content
The CALBC RDF Triple store: retrieval over large literature content Samuel Croset, Christoph Grabmüller, Chen Li, Silverstras Kavaliauskas, Dietrich Rebholz-Schuhmann croset@ebi.ac.uk 10 th December 2010,
More informationBUILDING THE SEMANTIC WEB
BUILDING THE SEMANTIC WEB You might have come across the term Semantic Web Applications often, during talks about the future of Web apps. Check out what this is all about There are two aspects to the possible
More informationNCI Thesaurus, managing towards an ontology
NCI Thesaurus, managing towards an ontology CENDI/NKOS Workshop October 22, 2009 Gilberto Fragoso Outline Background on EVS The NCI Thesaurus BiomedGT Editing Plug-in for Protege Semantic Media Wiki supports
More informationSemantic Web Company. PoolParty - Server. PoolParty - Technical White Paper.
Semantic Web Company PoolParty - Server PoolParty - Technical White Paper http://www.poolparty.biz Table of Contents Introduction... 3 PoolParty Technical Overview... 3 PoolParty Components Overview...
More informationBioNav: An Ontology-Based Framework to Discover Semantic Links in the Cloud of Linked Data
BioNav: An Ontology-Based Framework to Discover Semantic Links in the Cloud of Linked Data María-Esther Vidal 1, Louiqa Raschid 2, Natalia Márquez 1, Jean Carlo Rivera 1, and Edna Ruckhaus 1 1 Universidad
More informationSemantics. Matthew J. Graham CACR. Methods of Computational Science Caltech, 2011 May 10. matthew graham
Semantics Matthew J. Graham CACR Methods of Computational Science Caltech, 2011 May 10 semantic web The future of the Internet (Web 3.0) Decentralized platform for distributed knowledge A web of databases
More informationHPO Workbench Documentation
HPO Workbench Documentation Release 1 Monarch Initiative Dec 17, 2018 Contents: 1 HPOannotQC 1 1.1 Installing and running HPO Annotation Q/C.............................. 1 1.2 HPO Annotation Small Files.......................................
More informationKnowledge Representations. How else can we represent knowledge in addition to formal logic?
Knowledge Representations How else can we represent knowledge in addition to formal logic? 1 Common Knowledge Representations Formal Logic Production Rules Semantic Nets Schemata and Frames 2 Production
More informationAggregating Educational Data for Patient Empowerment
Nick Portokallidis, George Drosatos, Eleni Kaldoudi School of Medicine, Democritus University of Thrace 6 th Panhellenic Conference on Biomedical Technology Athens, Greece, 6-8 May 2015 This work was supported
More informationKorea Institute of Oriental Medicine, South Korea 2 Biomedical Knowledge Engineering Laboratory,
A Medical Treatment System based on Traditional Korean Medicine Ontology Sang-Kyun Kim 1, SeJin Nam 2, Dong-Hun Park 1, Yong-Taek Oh 1, Hyunchul Jang 1 1 Literature & Informatics Research Division, Korea
More informationA Semantic Model for Federated Queries Over a Normalized Corpus
A Semantic Model for Federated Queries Over a Normalized Corpus Samuel Croset, Christoph Grabmüller, Dietrich Rebholz-Schuhmann 17 th March 2010, Hinxton EBI is an Outstation of the European Molecular
More informationSemantic challenges in sharing dataset metadata and creating federated dataset catalogs
Linked Open Data in Agriculture MACS-G20 Workshop in Berlin, September 27th 28th, 2017 Semantic challenges in sharing dataset metadata and creating federated dataset catalogs The example of the CIARD RING
More informationAdvances in Data Integration & Representation in Systems Biology
Advances in Data Integration & Representation in Systems Biology Susie Stephens Principal Product Manager, Life Sciences Oracle susie.stephens@oracle.com Outline Systems Biology Data Requirements Semantic
More informationLinked Data for the Life Sciences. Release 2. Michel Dumontier Associate Professor, Carleton University. on behalf of the Bio2RDF team
Release 2 Linked Data for the Life Sciences Michel Dumontier Associate Professor, Carleton University on behalf of the Bio2RDF team is an open source framework that makes biological data available on the
More informationDisease Information and Semantic Web
Rheinische Friedrich-Wilhelms-Universität Bonn Institute of Computer Science III Disease Information and Semantic Web Master s Thesis Supervisor: Prof. Sören Auer, Heiner OberKampf Turan Gojayev München,
More informationClinVar. Jennifer Lee, PhD, NCBI/NLM/NIH ClinVar
ClinVar What is ClinVar ClinVar is a freely available, central archive for associating observed variation with supporting clinical and experimental evidence for a wide range of disorders. The database
More informationSemantic MediaWiki A Tool for Collaborative Vocabulary Development Harold Solbrig Division of Biomedical Informatics Mayo Clinic
Semantic MediaWiki A Tool for Collaborative Vocabulary Development Harold Solbrig Division of Biomedical Informatics Mayo Clinic Outline MediaWiki what it is, how it works Semantic MediaWiki MediaWiki
More informationSemantic e-science. Bibliographic Cloud
Semantic e-science in the Bibliographic Cloud Connecting the data with the literature Semantic e-science in the Bibliographic Cloud Connecting the data with the literature Semantic e-science in the Bibliographic
More informationa paradigm for the Semantic Web Linked Data Angelica Lo Duca IIT-CNR Linked Open Data:
Linked Data Angelica Lo Duca IIT-CNR angelica.loduca@iit.cnr.it Linked Open Data: a paradigm for the Semantic Web Linked Data are a series of best practices to connect structured data through the Web.
More informationLinked Data and RDF. COMP60421 Sean Bechhofer
Linked Data and RDF COMP60421 Sean Bechhofer sean.bechhofer@manchester.ac.uk Building a Semantic Web Annotation Associating metadata with resources Integration Integrating information sources Inference
More informationPowering Knowledge Discovery. Insights from big data with Linguamatics I2E
Powering Knowledge Discovery Insights from big data with Linguamatics I2E Gain actionable insights from unstructured data The world now generates an overwhelming amount of data, most of it written in natural
More informationInteroperability and Semantics in Use- Application of UML, XMI and MDA to Precision Medicine and Cancer Research
Interoperability and Semantics in Use- Application of UML, XMI and MDA to Precision Medicine and Cancer Research Ian Fore, D.Phil. Associate Director, Biorepository and Pathology Informatics Senior Program
More informationA Knowledge-Based System for the Specification of Variables in Clinical Trials
A Knowledge-Based System for the Specification of Variables in Clinical Trials Matthias Löbe, Barbara Strotmann, Kai-Uwe Hoop, Roland Mücke Institute for Medical Informatics, Statistics and Epidemiology
More informationUnstructured Text in Big Data The Elephant in the Room
Unstructured Text in Big Data The Elephant in the Room David Milward ICIC, October 2013 Click Unstructured to to edit edit Master Master Big title Data style title style Big Data Volume, Variety, Velocity
More informationThe Data Web and Linked Data.
Mustafa Jarrar Lecture Notes, Knowledge Engineering (SCOM7348) University of Birzeit 1 st Semester, 2011 Knowledge Engineering (SCOM7348) The Data Web and Linked Data. Dr. Mustafa Jarrar University of
More informationCOMP9321 Web Application Engineering
COMP9321 Web Application Engineering Semester 2, 2017 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 5 http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid=2465 1 Semantic
More informationWhat is FAIR? 5 th International Summer School on Rare Disease and Orphan Drug Registries. Claudio Carta 1 and Marco Roos 2
5 th International Summer School on Rare Disease and Orphan Drug Registries What is FAIR? Claudio Carta 1 and Marco Roos 2 1 National Centre for Rare Diseases Istituto Superiore di Sanità, Rome, Italy
More informationLinked Data: Fast, low cost semantic interoperability for health care?
Linked Data: Fast, low cost semantic interoperability for health care? About the presentation Part I: Motivation Why we need semantic operability in health care Why enhancing existing systems to increase
More informationSemantic Annotation, Search and Analysis
Semantic Annotation, Search and Analysis Borislav Popov, Ontotext Ontology A machine readable conceptual model a common vocabulary for sharing information machine-interpretable definitions of concepts in
More informationFacilitating Semantic Alignment of EBI Resources
Facilitating Semantic Alignment of EBI Resources 17 th March, 2017 Tony Burdett Technical Co-ordinator Samples, Phenotypes and Ontologies Team www.ebi.ac.uk What is EMBL-EBI? Europe s home for biological
More informationThe Implementation of Semantic Web Technology in Traditional Plant Medicine
The Implementation of Semantic Web Technology in Traditional Plant Medicine Nur Ana 1, A la Syauqi 2, M Faisal 3 123 Informatics Engineering, Faculty Science and Technology State Islamic University Maulana
More informationIntegrating TCGA Clinical Data Using Metadata-driven Tools and NLP
Integrating TCGA Clinical Data Using Metadata-driven Tools and NLP Guoqian Jiang, MD, PhD, Mayo Clinic Guergana Savova, PhD, HMS/Boston Children's Hospital Rebecca Jacobson, MD,MS, University of Pittsburgh
More informationEmerging practices for mapping and linking life sciences data using RDF - a case series
Emerging practices for mapping and linking life sciences data using RDF - a case series M. Scott Marshall, PhD 1, Richard Boyce, PhD 2, Helena F. Deus, PhD 3, Jun Zhao, PhD 4, Egon L Willighagen, PhD 5,
More informationA preliminary investigation into SPARQL query complexity and federation in Bio2RDF
A preliminary investigation into SPARQL query complexity and federation in Bio2RDF Carlos Buil-Aranda 1, Martín Ugarte 1, Marcelo Arenas 1, and Michel Dumontier 2 1 Department of Computer Science, Pontificia
More informationMeSH: A Thesaurus for PubMed
Resources and tools for bibliographic research MeSH: A Thesaurus for PubMed October 24, 2012 What is MeSH? Who uses MeSH? Why use MeSH? Searching by using the MeSH Database What is MeSH? Acronym for Medical
More informationSemantic Technology and CDISC Standards
Paper DH03 Semantic Technology and CDISC Standards Frederik Malfait, IMOS Consulting, Switzerland Scott Bahlavooni, Independent, Boston, USA ABSTRACT Over the past few years there has been increased interest
More informationCustomisable Curation Workflows in Argo
Customisable Curation Workflows in Argo Rafal Rak*, Riza Batista-Navarro, Andrew Rowley, Jacob Carter and Sophia Ananiadou National Centre for Text Mining, University of Manchester, UK *Corresponding author:
More informationMeasuring inter-annotator agreement in GO annotations
Measuring inter-annotator agreement in GO annotations Camon EB, Barrell DG, Dimmer EC, Lee V, Magrane M, Maslen J, Binns ns D, Apweiler R. An evaluation of GO annotation retrieval for BioCreAtIvE and GOA.
More informationSEMANTIC SUPPORT FOR MEDICAL IMAGE SEARCH AND RETRIEVAL
SEMANTIC SUPPORT FOR MEDICAL IMAGE SEARCH AND RETRIEVAL Wang Wei, Payam M. Barnaghi School of Computer Science and Information Technology The University of Nottingham Malaysia Campus {Kcy3ww, payam.barnaghi}@nottingham.edu.my
More informationBlink Project: Linked Open Data for Countway Library Final report for Phase 1 (June-Nov. 2012) Prepared by Sophia Cheng
Blink Project: Linked Open Data for Countway Library Final report for Phase 1 (June-Nov. 2012) Prepared by Sophia Cheng Summary We propose to improve the usefulness and discoverability of Countway Library
More informationQuerying the Semantic Web
Querying the Semantic Web CSE 595 Semantic Web Instructor: Dr. Paul Fodor Stony Brook University http://www3.cs.stonybrook.edu/~pfodor/courses/cse595.html Lecture Outline SPARQL Infrastructure Basics:
More informationDBpedia Data Processing and Integration Tasks in UnifiedViews
1 DBpedia Data Processing and Integration Tasks in Tomas Knap Semantic Web Company Markus Freudenberg Leipzig University Kay Müller Leipzig University 2 Introduction Agenda, Team 3 Agenda Team & Goal An
More informationSemantic Technology. Opportunities
Semantic Technology Opportunities Avinash Punekar Scientific Publishing Services April 2011 2 Semantic Technology April 2011 3 What is Semantic Technology? ² Semantic Web ² Web 3.0 ² Linked Open Data /
More informationThe IEEE Metadata Standard for Supporting Big Data Management
The IEEE Metadata Standard for Supporting Big Data Management Alex MH Kuo 1,2 (Ph.D) 1 School of Health Information Science University of Victoria, BC, Canada. 2 CEDAR, School of Medicine University of
More informationThe Internet for Social Machines The end of data sharing as we know it. Barend Mons Feb. 2019
The nternet for Social Machines The end of data sharing as we know it Barend Mons Feb. 2019 Which Gene Did You Mean? why bury it first and then mine it again? (2005) (2018) What does FA eventually entail?
More informationGeoSPARQL Support and Other Cool Features in Oracle 12c Spatial and Graph Linked Data Seminar Culture, Base Registries & Visualisations
GeoSPARQL Support and Other Cool Features in Oracle 12c Spatial and Graph Linked Data Seminar Culture, Base Registries & Visualisations Hans Viehmann Product Manager EMEA Oracle Corporation December 2,
More informationDeveloping a Benchmark Suite for Semantic Web Data from Existing Workflows
Developing a Benchmark Suite for Semantic Web Data from Existing Workflows Antonis Troumpoukis 1, Angelos Charalambidis 1, Giannis Mouchakis 1, Stasinos Konstantopoulos 1, Ronald Siebes 2, Victor de Boer
More informationThe National Cancer Institute's Thésaurus and Ontology
The National Cancer Institute's Thésaurus and Ontology Jennifer Golbeck 1, Gilberto Fragoso 2, Frank Hartel 2, Jim Hendler 1, Jim Oberthaler 2, Bijan Parsia 1 1 University of Maryland, College Park 2 National
More informationSemantic Web Fundamentals
Semantic Web Fundamentals Web Technologies (706.704) 3SSt VU WS 2018/19 with acknowledgements to P. Höfler, V. Pammer, W. Kienreich ISDS, TU Graz January 7 th 2019 Overview What is Semantic Web? Technology
More informationSemantic Web Fundamentals
Semantic Web Fundamentals Web Technologies (706.704) 3SSt VU WS 2017/18 Vedran Sabol with acknowledgements to P. Höfler, V. Pammer, W. Kienreich ISDS, TU Graz December 11 th 2017 Overview What is Semantic
More informationLinked Data and RDF. COMP60421 Sean Bechhofer
Linked Data and RDF COMP60421 Sean Bechhofer sean.bechhofer@manchester.ac.uk Building a Semantic Web Annotation Associating metadata with resources Integration Integrating information sources Inference
More informationLinked data and its role in the semantic web. Dave Reynolds, Epimorphics
Linked data and its role in the semantic web Dave Reynolds, Epimorphics Ltd @der42 Roadmap What is linked data? Modelling Strengths and weaknesses Examples Access other topics image: Leo Oosterloo @ flickr.com
More informationFrom the Web to the Semantic Web: RDF and RDF Schema
From the Web to the Semantic Web: RDF and RDF Schema Languages for web Master s Degree Course in Computer Engineering - (A.Y. 2016/2017) The Semantic Web [Berners-Lee et al., Scientific American, 2001]
More informationRDF Workshop. Building an RDF representation of the the ChEMBL Database. Mark Davies. ChEMBL Group, Technical Lead 30/04/2014
RDF Workshop Building an RDF representation of the the ChEMBL Database Mark Davies ChEMBL Group, Technical Lead 30/04/2014 Overview Brief introduction to ChEMBL database Approaches to mapping relational
More informationSemantic Integration with Apache Jena and Apache Stanbol
Semantic Integration with Apache Jena and Apache Stanbol All Things Open Raleigh, NC Oct. 22, 2014 Overview Theory (~10 mins) Application Examples (~10 mins) Technical Details (~25 mins) What do we mean
More informationMaximizing the Value of STM Content through Semantic Enrichment. Frank Stumpf December 1, 2009
Maximizing the Value of STM Content through Semantic Enrichment Frank Stumpf December 1, 2009 What is Semantics and Semantic Processing? Content Knowledge Framework Technology Framework Search Text Images
More informationExposing Provenance Metadata Using Different RDF Models
Exposing Provenance Metadata Using Different RDF Models Gang Fu 1, Evan Bolton 1, Nria Queralt Rosinach 2, Laura I. Furlong 2, Vinh Nguyen 3, Amit Sheth 3, Olivier Bodenreider 4, Michel Dumontier 5 1 gang.fu@nih.gov,
More informationLinked Data. The World is Your Database
Linked Data Dave Clarke Synaptica CEO Gene Loh Synaptica Software Architect The World is Your Database Agenda 1. What is Linked Data, and why is it good for you (15 mins) What is Linked Data 2. How it
More informationVisual Concept Detection and Linked Open Data at the TIB AV- Portal. Felix Saurbier, Matthias Springstein Hamburg, November 6 SWIB 2017
Visual Concept Detection and Linked Open Data at the TIB AV- Portal Felix Saurbier, Matthias Springstein Hamburg, November 6 SWIB 2017 Agenda 1. TIB and TIB AV-Portal 2. Automated Video Analysis 3. Visual
More informationKnowledge Representation RDF Turtle Namespace
Knowledge Representation RDF Turtle Namespace Jan Pettersen Nytun, UiA 1 URIs Identify Web Resources Web addresses are the most common URIs, i.e., uniform Resource Locators (URLs). RDF resources are usually
More informationBiomedical literature mining for knowledge discovery
Biomedical literature mining for knowledge discovery REZARTA ISLAMAJ DOĞAN National Center for Biotechnology Information National Library of Medicine Outline Biomedical Literature Access Challenges in
More informationProfiles Research Networking Software API Guide
Profiles Research Networking Software API Guide Documentation Version: March 13, 2013 Software Version: ProfilesRNS_1.0.3 Table of Contents Overview... 2 PersonID, URI, and Aliases... 3 1) Profiles RNS
More informationCOMPUTER AND INFORMATION SCIENCE JENA DB. Group Abhishek Kumar Harshvardhan Singh Abhisek Mohanty Suhas Tumkur Chandrashekhara
JENA DB Group - 10 Abhishek Kumar Harshvardhan Singh Abhisek Mohanty Suhas Tumkur Chandrashekhara OUTLINE Introduction Data Model Query Language Implementation Features Applications Introduction Open Source
More informationSEPA SPARQL Event Processing Architecture
SEPA SPARQL Event Processing Architecture Enabling distributed, context aware and interoperable Dynamic Linked Data and Web of Things applications Luca Roffia (luca.roffia@unibo.it) Web of Things: members
More informationSEBI: An Architecture for Biomedical Image Discovery, Interoperability and Reusability based on Semantic Enrichment
SEBI: An Architecture for Biomedical Image Discovery, Interoperability and Reusability based on Semantic Enrichment Ahmad C. Bukhari 1, Michael Krauthammer 2, Christopher J.O. Baker 1 1 Department of Computer
More informationSearching the Evidence in PubMed
CAMBRIDGE UNIVERSITY LIBRARY MEDICAL LIBRARY Supporting Literature Searching Searching the Evidence in PubMed July 2017 Supporting Literature Searching Searching the Evidence in PubMed How to access PubMed
More informationHumboldt-University of Berlin
Humboldt-University of Berlin Exploiting Link Structure to Discover Meaningful Associations between Controlled Vocabulary Terms exposé of diploma thesis of Andrej Masula 13th October 2008 supervisor: Louiqa
More information4) DAVE CLARKE. OASIS: Constructing knowledgebases around high resolution images using ontologies and Linked Data
require a change in development culture and thus training. 5. Impact and Benefits The project was delivered on time and on budget unusual for a project of this scale and the project was hailed as a great
More informationHarmonizing biocaddie Metadata Schemas for Indexing Clinical Research Datasets Using Semantic Web Technologies
Harmonizing biocaddie Metadata Schemas for Indexing Clinical Research Datasets Using Semantic Web Technologies Harold R. Solbrig 1, Guoqian Jiang 1 1 Mayo Clinic College of Medicine, Rochester, MN [solbrig.harold,
More informationIntroducing Linked Data
Introducing Linked Data (Part of this work was funded by PlanetData NoE FP7/2007-2013) Irini Fundulaki 1 1 Institute of Computer Science FORTH & W3C Greece Office Manager EICOS : 4th Meeting, Athens, Greece
More informationIntegrated Access to Biological Data. A use case
Integrated Access to Biological Data. A use case Marta González Fundación ROBOTIKER, Parque Tecnológico Edif 202 48970 Zamudio, Vizcaya Spain marta@robotiker.es Abstract. This use case reflects the research
More informationPrototyping a Biomedical Ontology Recommender Service
Prototyping a Biomedical Ontology Recommender Service Clement Jonquet Nigam H. Shah Mark A. Musen jonquet@stanford.edu 1 Ontologies & data & annota@ons (1/2) Hard for biomedical researchers to find the
More informationBuilding Blocks of Linked Data
Building Blocks of Linked Data Technological foundations Identifiers: URIs Data Model: RDF Terminology and Semantics: RDFS, OWL 23,019,148 People s Republic of China 20,693,000 population located in capital
More informationDigital Public Space: Publishing Datasets
Digital Public Space: Publishing Datasets Mo McRoberts, April 2012 I. Organise your data into sets. Implications Your data should ideally exist within a conceptual hierarchy (even if it's a singlelevel
More informationA Justification-based Semantic Framework for Representing, Evaluating and Utilizing Terminology Mappings
A Justification-based Semantic Framework for Representing, Evaluating and Utilizing Terminology Mappings Sajjad HUSSAIN a, Hong SUN b, Gokce B. Laleci ERTURKMEN c, Mustafa YUKSEL c, Charles MEAD d, Alasdair
More informationOntrez Project Report National Center for Biomedical Ontology November, 2007
Ontrez Project Report National Center for Biomedical Ontology November, 2007 Executive summary Currently, genomics data and data repositories in the public domain are expanding at an explosive pace. 1
More informationSemantic Web Technologies
1/33 Semantic Web Technologies Lecture 11: SWT for the Life Sciences 4: BioRDF and Scientifc Workflows Maria Keet email: keet -AT- inf.unibz.it home: http://www.meteck.org blog: http://keet.wordpress.com/category/computer-science/72010-semwebtech/
More informationMeSH : A Thesaurus for PubMed
Scuola di dottorato di ricerca in Scienze Molecolari Resources and tools for bibliographic research MeSH : A Thesaurus for PubMed What is MeSH? Who uses MeSH? Why use MeSH? Searching by using the MeSH
More informationSciMiner User s Manual
SciMiner User s Manual Copyright 2008 Junguk Hur. All rights reserved. Bioinformatics Program University of Michigan Ann Arbor, MI 48109, USA Email: juhur@umich.edu Homepage: http://jdrf.neurology.med.umich.edu/sciminer/
More informationThe R2R Framework: Christian Bizer, Andreas Schultz. 1 st International Workshop on Consuming Linked Data (COLD2010) Freie Universität Berlin
1 st International Workshop on Consuming Linked Data (COLD2010) November 8, 2010, Shanghai, China The R2R Framework: Publishing and Discovering i Mappings on the Web Christian Bizer, Andreas Schultz Freie
More informationTaking a view on bio-ontologies. Simon Jupp Functional Genomics Production Team ICBO, 2012 Graz, Austria
Taking a view on bio-ontologies Simon Jupp Functional Genomics Production Team ICBO, 2012 Graz, Austria Who we are European Bioinformatics Institute one of world s largest bio data and service providers
More informationJournal of Biomedical Semantics 2013, 4:37
Journal of Biomedical Semantics This Provisional PDF corresponds to the article as it appeared upon acceptance. Fully formatted PDF and full text (HTML) versions will be made available soon. PAV ontology:
More informationMeSH : A Thesaurus for PubMed
Resources and tools for bibliographic research MeSH : A Thesaurus for PubMed What is MeSH? Who uses MeSH? Why use MeSH? Searching by using the MeSH Database What is MeSH? http://www.ncbi.nlm.nih.gov/mesh
More informationA Unified Framework fro Managing Provenance Information in Translational Research
Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 2011 A Unified Framework fro Managing Provenance Information in Translational
More informationQuerying Semantic Web Data
Querying Semantic Web Data Lalana Kagal Decentralized Information Group MIT CSAIL Eric Prud'hommeaux Sanitation Engineer World Wide Web Consortium SPARQL Program Graph patterns Motivations for RDF RDF
More informationDocument Retrieval using Predication Similarity
Document Retrieval using Predication Similarity Kalpa Gunaratna 1 Kno.e.sis Center, Wright State University, Dayton, OH 45435 USA kalpa@knoesis.org Abstract. Document retrieval has been an important research
More informationLinked Data Practices for the Geospatial Community
Linked Data Practices for the Geospatial Community Talk subtitle Presented at GEOSS Workshop on Climate Boulder Colorado, 23 September 2011 Stephan Zednik, zednis@rpi.edu RPI / Tetherless World Constellation
More information