The ELIXIR of Linked Data


1 The ELIXIR of Linked Data. Professor Carole Goble (UK node), Barend Mons (NL node), Helen Parkinson (EMBL-EBI node). The Interoperability Services Backbone Team. European Life Sciences Infrastructure for Biological Information.

2 What is ELIXIR? An international distributed infrastructure for life-science information. It aims to: orchestrate the collection, quality control and archiving of biological data produced by life science experiments; integrate research data; ensure a seamless service provision that is easily accessible to all.

3 ELIXIR: An international distributed infrastructure for biological data. Major bioinformatics service providers (~130); Hub; 16 ELIXIR members; 4 observers.

4 Drivers: Infrastructure Providers. Crop and forest plants; marine metagenomics; human data; rare diseases. CORBEL (COordinated Research Infrastructures Building Enduring Life-science Services).

5 Rare diseases. Sample data (biobank databases); clinical data (registries and phenotypic databases); 1000 exomes; genomic data (WES, WGS); 1000 exomes + > 2500 from other projects; other omics data (transcriptomics, metabolomics, proteomics).

6 Drug prioritization for Huntington's Disease. Katerina Nosikova, Elizaveta Besedina, Eelke van der Horst, Peter-Bram 't Hoen, Marco Roos, Eleni Mina (Human Genetics department, LUMC, NL). Select drug compounds in Open PHACTS; select genes by phenotype matching in Monarch; filter on feasibility for treating HD; prioritized drug compounds.

8 Technical platforms. Data: secure and deliver core data resources. Standards: data management, reuse and integration (Findable, Accessible, Interoperable, Reusable). Tools: discoverable tools, services and connectors for data access and exploitation. Compute: robust technical platforms and clouds for secure data access, data exchange and compute. Training: training programme for professionals, bridging the computational biology skills gap.

9 Training: BYODs, data wrangling, governance and quality assurance. Linked Data experts, data experts from MycoBase and Human Protein Atlas. Tomato genome, phenotypic observations, variants.

10 Data: Basket of indicators, reflecting the multiple facets of bioinformatics resources. Indicators (mandatory and optional): 1) Scientific focus and quality of science, e.g. curational effort, benchmarking; 2) Community served by the resource, e.g. web statistics; 3) Quality of service, e.g. uptime, user support and training; 4) Legal and funding infrastructure, e.g. institutional support, use policy; 5) Impact and translational stories.

12 Compute Platform: Authentication, Archiving and Movement

13 Tools Interoperability and APIs. Describing workflows: common format for bioinformatics tool execution; rich: Linked Data allows for infinite metadata annotations and reasoning. Describing APIs: SWAGGER.json; API changes: semantic versioning; getting resources to have APIs. Describing tools: EDAM Ontology.
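The slide pairs API changes with semantic versioning. As a minimal sketch of that idea, assuming plain MAJOR.MINOR.PATCH version strings without pre-release tags, the function below classifies what a version bump signals to API consumers. The function names are illustrative and do not come from any ELIXIR tool.

```python
from typing import Tuple


def parse_version(version: str) -> Tuple[int, int, int]:
    """Split a 'MAJOR.MINOR.PATCH' string into three integers."""
    major, minor, patch = (int(part) for part in version.split("."))
    return major, minor, patch


def classify_change(old: str, new: str) -> str:
    """Name the kind of API change implied by moving from old to new."""
    old_v, new_v = parse_version(old), parse_version(new)
    if new_v <= old_v:
        # Same or lower version: nothing to upgrade to.
        return "not an upgrade"
    if new_v[0] != old_v[0]:
        # A MAJOR bump signals incompatible API changes.
        return "breaking"
    if new_v[1] != old_v[1]:
        # A MINOR bump signals added, backwards-compatible functionality.
        return "backwards-compatible feature"
    # Only PATCH changed: backwards-compatible bug fixes.
    return "backwards-compatible fix"


print(classify_change("1.4.2", "2.0.0"))
print(classify_change("1.4.2", "1.5.0"))
```

A client could use such a check to decide whether a new release of an API description needs manual review before it is adopted.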

14 [Luiz Olavo Bonino, DTL] RD-CONNECT, ODEXA4ALL A FAIRifying Architecture

15 FAIR Interoperability Backbone Services. Prepare for interop. Access from integrating frameworks, warehouses, API. Interoperability services: identifiers, ontologies, schemas. Preparing sources: onboarding datasets, content, API.

16 [Paul Kersey] Crop and forest plants. Various species: maize, pine, potato. Various data types: from genomes (sequences and annotations) to phenomes (traits). Various ontologies: Crop Ontology, Plant Ontology. Emerging standards: MIAPPE (Minimum Information About a Plant Phenotyping Experiment). Need for infrastructure: manage identifiers; register/access services and data sets; metadata-driven search.

17 Ontology Services Ontology mapping Data-Ontology Tools OLS3

18 Identifiers: the pivot of everything! Identifier Resolution Service (IRS2); Identifier Mapping Service (IMS).
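To make the two services concrete, here is a minimal sketch of what resolution and mapping do conceptually. It is not the interface of IRS2 or IMS: the prefixes, the URL templates on example.org and the function names are all hypothetical. Resolution turns a compact identifier (prefix:accession) into a URL, and mapping collects the identifiers that are linked to a starting identifier.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical prefix -> URL template table; a real service keeps a curated registry.
URL_TEMPLATES: Dict[str, str] = {
    "exdb": "https://example.org/exdb/{accession}",
    "exchem": "https://example.org/exchem/compound/{accession}",
}


def resolve(compact_id: str) -> str:
    """Turn 'prefix:accession' into a URL using the template for its prefix."""
    prefix, accession = compact_id.split(":", 1)
    return URL_TEMPLATES[prefix].format(accession=accession)


def equivalents(start: str, mappings: List[Tuple[str, str]]) -> Set[str]:
    """Collect every identifier reachable from start via symmetric mappings."""
    neighbours: Dict[str, Set[str]] = {}
    for left, right in mappings:
        neighbours.setdefault(left, set()).add(right)
        neighbours.setdefault(right, set()).add(left)
    seen = {start}
    todo = [start]
    while todo:
        current = todo.pop()
        for candidate in neighbours.get(current, ()):
            if candidate not in seen:
                seen.add(candidate)
                todo.append(candidate)
    return seen


links = [("exdb:A123", "exchem:77"), ("exchem:77", "exother:9")]
print(resolve("exdb:A123"))
print(sorted(equivalents("exdb:A123", links)))
```

Following mappings transitively is a design choice made here for brevity. Whether two identifiers count as equivalent generally depends on the use case, so a real mapping service may restrict which links are followed.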

19 FAIR Metadata at many levels. Tool that provisioned the dataset; interface, API and access; tool using the dataset; data record content; dataset; collection; mappings between datasets; mappings between entities; dataset profile.

21 Metadata Profiles and Dataset Registration Governance, Compliance, Release Protocols Dataset Profile

22 Search, Index and Linked Data. BioSolr (and Elasticsearch). Data: Diabetic nephropathy (EFO_ ).

23 Two tiers of data repository. Biological knowledge bases: curated and annotated biological entities and their relationships (UniProt, Ensembl, ChEMBL, Orphanet).

24 Two tiers of data repository. Biological knowledge bases: curated and annotated biological entities and their relationships (UniProt, Ensembl, ChEMBL, Orphanet). Data records are dynamic and incomplete: records update, diverge and merge over time; interpretation changes; identifier resolution varies over time; relationships between records are unstable; reproducibility is potentially compromised. Examples: a novel gene-rare disease relationship is reported; the consequences of a single nucleotide change in a regulatory genomic region are better understood.

25 Linksets. Legacy of Open PHACTS. Mappings are first class. Mappings between entities: provenance, versioning, mapping linksets. Data record content.

26 VoID: Vocabulary of Interlinked Datasets. Create a description of a Linkset that connects two datasets. Select datasets from existing descriptions. Capture link predicate and justification.
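A linkset description of the kind this slide outlines can be sketched as a few Turtle statements. The helper below is an illustrative sketch, not the tool shown in the talk: it uses the VoID terms void:Linkset, void:subjectsTarget, void:objectsTarget and void:linkPredicate, while the ex:justification property and all example.org URIs are placeholders assumed here for illustration.

```python
def linkset_turtle(linkset_uri: str, subjects_dataset: str, objects_dataset: str,
                   link_predicate: str, justification: str) -> str:
    """Render a minimal VoID linkset description as a Turtle string."""
    # Escape backslashes and double quotes so the literal stays valid Turtle.
    escaped = justification.replace("\\", "\\\\").replace('"', '\\"')
    lines = [
        "@prefix void: <http://rdfs.org/ns/void#> .",
        "@prefix skos: <http://www.w3.org/2004/02/skos/core#> .",
        "@prefix ex: <https://example.org/vocab#> .",  # hypothetical vocabulary
        "",
        f"<{linkset_uri}> a void:Linkset ;",
        f"    void:subjectsTarget <{subjects_dataset}> ;",
        f"    void:objectsTarget <{objects_dataset}> ;",
        f"    void:linkPredicate {link_predicate} ;",
        f'    ex:justification "{escaped}" .',
    ]
    return "\n".join(lines)


print(linkset_turtle(
    "https://example.org/linkset/a-to-b",
    "https://example.org/dataset/a",
    "https://example.org/dataset/b",
    "skos:exactMatch",
    "Both records describe the same chemical structure",
))
```

Keeping the link predicate and the justification in the description lets a consumer decide which mappings are appropriate for a given question.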

27 Legacy of Open PHACTS. Releasing Data Sets: Software-Like. Research Objects; Linked Data; manifests. Publishing data the software way: controlled data distribution, containers, builds, dependencies, versioning, verification. data-maven-plugin; Docker.
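One way to read "manifests", "versioning" and "verification" together is a release manifest that records a checksum per file, so a consumer can check that a downloaded release is intact. The sketch below illustrates that idea under assumptions made here: it does not reproduce the manifest format of Research Objects or of the data-maven-plugin, and the file names and contents are made up. Files are held in memory as bytes to keep the example self-contained.

```python
import hashlib
from typing import Dict, List


def checksum(content: bytes) -> str:
    """Return the SHA-256 hex digest of the given bytes."""
    return hashlib.sha256(content).hexdigest()


def build_manifest(name: str, version: str, files: Dict[str, bytes]) -> dict:
    """Describe a data release: its name, version and one checksum per file."""
    return {
        "name": name,
        "version": version,
        "files": {path: checksum(data) for path, data in sorted(files.items())},
    }


def verify(manifest: dict, files: Dict[str, bytes]) -> List[str]:
    """Return the paths that are missing or whose content no longer matches."""
    problems = []
    for path, expected in manifest["files"].items():
        if path not in files or checksum(files[path]) != expected:
            problems.append(path)
    return problems


release = {"compounds.ttl": b"example triples", "linkset.ttl": b"example links"}
manifest = build_manifest("demo-dataset", "1.0.0", release)
print(verify(manifest, release))
```

Pinning a dataset by name, version and checksum mirrors how software builds pin their dependencies, which is the analogy the slide draws.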

28 Genotype-Phenotype. Mapping terms; cross-linking datasets; tracking provenance; Linked Data services. Deans AR, Lewis SE, Huala E, Anzaldo SS, Ashburner M, et al. (2015) Finding Our Way through Phenotypes. PLoS Biol 13(1): e doi: /journal.pbio

29 Interoperating Applications Interoperability Services Backbone Interoperability Backbone Publishing FAIR Data

30 Linked Data Big Picture. Lower the barriers to linking data; connect related data that wasn't previously linked; self-describe and annotate data in a common, machine-readable form; expose linking as a first-class information element. Linked Data: "a term used to describe a recommended best practice for exposing, sharing, and connecting pieces of data, information, and knowledge on the Semantic Web using URIs and RDF." (Wikipedia)

31 Impact of Open PHACTS on ELIXIR. Linked Data components & know-how: identifiers & links; annotation & ontologies; dataset containers; integrate into off-the-shelf apps; publishing and consuming metadata & mappings; onboarding & release pipelines; APIs, search. Data: ... when it supports interoperability; ... retain native forms; ... preparation and maintenance; ... data governance.

32 Challenges of Linked Data. Getting data providers to generate LOD; getting agreement on URIs; choosing ontologies and relations; modelling challenges (data vs biological reality); appropriate Extract/Load/Transform pipelines; appropriate representation for datatypes; getting machine-readable dataset descriptions; provide an API; link resources to ontology terms; SPARQL fetish; expertise in the community to effectively produce/consume LD; services for finding and reusing URIs & ontologies; data annotation services (mapping data to ontologies).

33 [Mons]

35 Human data: The European Genome-phenome Archive (EGA)

What is FAIR? 5 th International Summer School on Rare Disease and Orphan Drug Registries. Claudio Carta 1 and Marco Roos 2

What is FAIR? 5 th International Summer School on Rare Disease and Orphan Drug Registries. Claudio Carta 1 and Marco Roos 2 5 th International Summer School on Rare Disease and Orphan Drug Registries What is FAIR? Claudio Carta 1 and Marco Roos 2 1 National Centre for Rare Diseases Istituto Superiore di Sanità, Rome, Italy

More information

Facilitating Semantic Alignment of EBI Resources

Facilitating Semantic Alignment of EBI Resources Facilitating Semantic Alignment of EBI Resources 17 th March, 2017 Tony Burdett Technical Co-ordinator Samples, Phenotypes and Ontologies Team www.ebi.ac.uk What is EMBL-EBI? Europe s home for biological

More information

FOCUS MEETING ON FAIR DATA DEVELOPMENTS. Luiz Olavo Bonino -

FOCUS MEETING ON FAIR DATA DEVELOPMENTS. Luiz Olavo Bonino - FOCUS MEETING ON FAIR DATA DEVELOPMENTS Luiz Olavo Bonino - luiz.bonino@dtls.nl SUMMARY What is FAIR data? The FAIR ecosystem Plans and how to realise Produces Consumes stewardship privacy? sustainability

More information

Introduction of the BYOD, dataset and group division

Introduction of the BYOD, dataset and group division Bring Your Own Data To Link Rare Disease Registries Introduction of the BYOD, dataset and group division Claudio Carta National Centre for Rare Diseases Istituto Superiore di Sanità Rome, Italy 1 1 st

More information

ELIXIR Human Data Use Case

ELIXIR Human Data Use Case ELIXIR Human Data Use Case Mikael Borg, ELIXIR Sweden ELIXIR-EXCELERATE is funded by the European Commission within the Research Infrastructures programme of Horizon 2020, grant agreement number 676559.

More information

I. Background and objectives

I. Background and objectives Meeting EMPHASIS ELIXIR 15 May 2018 : Data standards and Information Systems: strategies of the European infrastructures EMPHASIS and ELIXIR Participants : See appendix Main authors: C. Pommier and F.

More information

The European Commission s science and knowledge service

The European Commission s science and knowledge service The European Commission s science and knowledge service Joint Research Centre Directorate-General Joint Research Centre Directorate-General Health and Food Safety Knowledge generation centre for rare diseases

More information

Enabling Open Science: Data Discoverability, Access and Use. Jo McEntyre Head of Literature Services

Enabling Open Science: Data Discoverability, Access and Use. Jo McEntyre Head of Literature Services Enabling Open Science: Data Discoverability, Access and Use Jo McEntyre Head of Literature Services www.ebi.ac.uk About EMBL-EBI Part of the European Molecular Biology Laboratory International, non-profit

More information

ELIXIR Compute platform

ELIXIR Compute platform ELIXIR Compute platform Authors and contributors: Alexander Agafonov (UIT NO), Lars Ailo Bongo (UIT - NO), Mikael Borg (BILS - SE), Amelie Cornelis (EMBL-EBI), Rob Finn (EMBL-EBI), Montserrat Gonzalez

More information

Taking a view on bio-ontologies. Simon Jupp Functional Genomics Production Team ICBO, 2012 Graz, Austria

Taking a view on bio-ontologies. Simon Jupp Functional Genomics Production Team ICBO, 2012 Graz, Austria Taking a view on bio-ontologies Simon Jupp Functional Genomics Production Team ICBO, 2012 Graz, Austria Who we are European Bioinformatics Institute one of world s largest bio data and service providers

More information

A discovery platform for translational research

A discovery platform for translational research A discovery platform for translational research - DisGeNET-RDF&SPARQL - Usage and Modeling Challenges Núria Queralt Rosinach Integrative Biomedical Informatics Group (IBI) Research Programme on Biomedical

More information

TEXT MINING: THE NEXT DATA FRONTIER

TEXT MINING: THE NEXT DATA FRONTIER TEXT MINING: THE NEXT DATA FRONTIER An Infrastructural Approach Dr. Petr Knoth CORE (core.ac.uk) Knowledge Media institute, The Open University United Kingdom 2 OpenMinTeD Establish an open and sustainable

More information

How to store and visualize RNA-seq data

How to store and visualize RNA-seq data How to store and visualize RNA-seq data Gabriella Rustici Functional Genomics Group gabry@ebi.ac.uk EBI is an Outstation of the European Molecular Biology Laboratory. Talk summary How do we archive RNA-seq

More information

WheatIS: Progress report

WheatIS: Progress report WheatIS: Progress report WheatIS Annual meeting, San Diego, 9 January 2015 WheatIS data submission DSpace Beta-version to test: http://urgi.versailles.inra.fr/xmlui/ At the moment, available submission

More information

towards a federated infrastructure enabling integrated life science research

towards a federated infrastructure enabling integrated life science research towards a federated infrastructure enabling integrated life science research ELIXIR Innovation & SME forum March 18 2015 Ruben Kok & Jaap Heringa www.dtls.nl ZOOMING IN AND OUT OF LIFE LIFE @ ALL LEVELS

More information

The Internet for Social Machines The end of data sharing as we know it. Barend Mons Feb. 2019

The Internet for Social Machines The end of data sharing as we know it. Barend Mons Feb. 2019 The nternet for Social Machines The end of data sharing as we know it Barend Mons Feb. 2019 Which Gene Did You Mean? why bury it first and then mine it again? (2005) (2018) What does FA eventually entail?

More information

Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard

Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard University @mercecrosas mercecrosas.com Open Research Cloud, May 11, 2017 Best Practices

More information

Linked Data: Fast, low cost semantic interoperability for health care?

Linked Data: Fast, low cost semantic interoperability for health care? Linked Data: Fast, low cost semantic interoperability for health care? About the presentation Part I: Motivation Why we need semantic operability in health care Why enhancing existing systems to increase

More information

DBpedia Data Processing and Integration Tasks in UnifiedViews

DBpedia Data Processing and Integration Tasks in UnifiedViews 1 DBpedia Data Processing and Integration Tasks in Tomas Knap Semantic Web Company Markus Freudenberg Leipzig University Kay Müller Leipzig University 2 Introduction Agenda, Team 3 Agenda Team & Goal An

More information

Bio wikis. Paolo Romano Bioinformatics, National Cancer Research Institute, Genova

Bio wikis. Paolo Romano Bioinformatics, National Cancer Research Institute, Genova Bio wikis Paolo Romano (paolo.romano@istge.it) Bioinformatics, National Cancer Research Institute, Genova Outline o Wiki systems: aims and technologies o Working with wikis: practical issues for setting

More information

Indiana University Research Technology and the Research Data Alliance

Indiana University Research Technology and the Research Data Alliance Indiana University Research Technology and the Research Data Alliance Rob Quick Manager High Throughput Computing Operations Officer - OSG and SWAMP Board Member - RDA Organizational Assembly RDA Mission

More information

Unstructured Text in Big Data The Elephant in the Room

Unstructured Text in Big Data The Elephant in the Room Unstructured Text in Big Data The Elephant in the Room David Milward ICIC, October 2013 Click Unstructured to to edit edit Master Master Big title Data style title style Big Data Volume, Variety, Velocity

More information

An Oz Mammals Bioinformatics and Data Resource

An Oz Mammals Bioinformatics and Data Resource An Oz Mammals Bioinformatics and Data Resource Vicky Schneider, Andrew Pask, Denis O Meally, Philippa Griffin, Jeff Christiansen, Mike Charleston, Dominique Gorse, Andrew Treloar, Jason Williams, Rebecca

More information

SELF-SERVICE SEMANTIC DATA FEDERATION

SELF-SERVICE SEMANTIC DATA FEDERATION SELF-SERVICE SEMANTIC DATA FEDERATION WE LL MAKE YOU A DATA SCIENTIST Contact: IPSNP Computing Inc. Chris Baker, CEO Chris.Baker@ipsnp.com (506) 721 8241 BIG VISION: SELF-SERVICE DATA FEDERATION Biomedical

More information

Deliverable D4.3 Release of pilot version of data warehouse

Deliverable D4.3 Release of pilot version of data warehouse Deliverable D4.3 Release of pilot version of data warehouse Date: 10.05.17 HORIZON 2020 - INFRADEV Implementation and operation of cross-cutting services and solutions for clusters of ESFRI Grant Agreement

More information

Development of an Environment for Data Annotation in Bioengineering

Development of an Environment for Data Annotation in Bioengineering Development of an Environment for Data Annotation in Bioengineering Renato Galina Barbosa Azambuja Correia Instituto Superior Técnico, Universidade Técnica de Lisboa, Portugal renato_correia1@hotmail.com

More information

European Cloud Initiative: implementation status. Augusto BURGUEÑO ARJONA European Commission DG CNECT Unit C1: e-infrastructure and Science Cloud

European Cloud Initiative: implementation status. Augusto BURGUEÑO ARJONA European Commission DG CNECT Unit C1: e-infrastructure and Science Cloud European Cloud Initiative: implementation status Augusto BURGUEÑO ARJONA European Commission DG CNECT Unit C1: e-infrastructure and Science Cloud Political drivers for action EC Communication "European

More information

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Supplementary Note-- Williams et al The Image Data Resource: A Bioimage Data Integration and Publication Platform

Supplementary Note-- Williams et al The Image Data Resource: A Bioimage Data Integration and Publication Platform Supplementary Note-- Williams et al The Image Data Resource: A Bioimage Data Integration and Publication Platform 1. Exploring the IDR This current IDR web user interface (WUI) is based on the open source

More information

re3data.org - Making research data repositories visible and discoverable

re3data.org - Making research data repositories visible and discoverable re3data.org - Making research data repositories visible and discoverable Robert Ulrich, Karlsruhe Institute of Technology Hans-Jürgen Goebelbecker, Karlsruhe Institute of Technology Frank Scholze, Karlsruhe

More information

Extracting reproducible simulation studies from model repositories using the CombineArchive Toolkit

Extracting reproducible simulation studies from model repositories using the CombineArchive Toolkit Extracting reproducible simulation studies from model repositories using the CombineArchive Toolkit Martin Scharm, Dagmar Waltemath Department of Systems Biology and Bioinformatics University of Rostock

More information

Lessons Learned during Illumina s Secure DevOps Transition Kenneth G. Hartman Associate Director Cloud Products Security

Lessons Learned during Illumina s Secure DevOps Transition Kenneth G. Hartman Associate Director Cloud Products Security Lessons Learned during Illumina s Secure DevOps Transition Kenneth G. Hartman Associate Director Cloud Products Security 2018 Illumina, Inc. All rights reserved. QB Document #6608 At a Glance 2 https://www.illumina.com/company/news-center/feature-articles/illumina-at-a-glance.html

More information

Managing your data. Niclas Jareborg, NBIS

Managing your data. Niclas Jareborg, NBIS Managing your data Niclas Jareborg, NBIS niclas.jareborg@nbis.se How do you know how an old result was generated? The Research Data Life Cycle Data Publishing & Re-use Research Data Planning & Design Data

More information

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E Powering Knowledge Discovery Insights from big data with Linguamatics I2E Gain actionable insights from unstructured data The world now generates an overwhelming amount of data, most of it written in natural

More information

An Archiving System for Managing Evolution in the Data Web

An Archiving System for Managing Evolution in the Data Web An Archiving System for Managing Evolution in the Web Marios Meimaris *, George Papastefanatos and Christos Pateritsas * Institute for the Management of Information Systems, Research Center Athena, Greece

More information

Topics of the talk. Biodatabases. Data types. Some sequence terminology...

Topics of the talk. Biodatabases. Data types. Some sequence terminology... Topics of the talk Biodatabases Jarno Tuimala / Eija Korpelainen CSC What data are stored in biological databases? What constitutes a good database? Nucleic acid sequence databases Amino acid sequence

More information

Update: MIRIAM Registry and SBO

Update: MIRIAM Registry and SBO Update: MIRIAM Registry and SBO Nick Juty, EMBL-EBI 3rd Sept, 2011 Overview MIRIAM Registry MIRIAM Guidelines.. MIRIAM Registry content URIs (URN form), example Summary/current developments SBO Purpose

More information

Embracing Semantic Technology for Better Metadata Authoring in Biomedicine

Embracing Semantic Technology for Better Metadata Authoring in Biomedicine Embracing Semantic Technology for Better Metadata Authoring in Biomedicine Attila L. Egyedi, Martin J. O Connor, Marcos Martínez-Romero, Debra Willrett, Josef Hardi, John Graybeal, and Mark A. Musen Stanford

More information

SEBI: An Architecture for Biomedical Image Discovery, Interoperability and Reusability based on Semantic Enrichment

SEBI: An Architecture for Biomedical Image Discovery, Interoperability and Reusability based on Semantic Enrichment SEBI: An Architecture for Biomedical Image Discovery, Interoperability and Reusability based on Semantic Enrichment Ahmad C. Bukhari 1, Michael Krauthammer 2, Christopher J.O. Baker 1 1 Department of Computer

More information

OpenAIRE. Fostering the social and technical links that enable Open Science in Europe and beyond

OpenAIRE. Fostering the social and technical links that enable Open Science in Europe and beyond Alessia Bardi and Paolo Manghi, Institute of Information Science and Technologies CNR Katerina Iatropoulou, ATHENA, Iryna Kuchma and Gwen Franck, EIFL Pedro Príncipe, University of Minho OpenAIRE Fostering

More information

Fusion Registry 9 SDMX Data and Metadata Management System

Fusion Registry 9 SDMX Data and Metadata Management System Registry 9 Data and Management System Registry 9 is a complete and fully integrated statistical data and metadata management system using. Whether you require a metadata repository supporting a highperformance

More information

CORE: Improving access and enabling re-use of open access content using aggregations

CORE: Improving access and enabling re-use of open access content using aggregations CORE: Improving access and enabling re-use of open access content using aggregations Petr Knoth CORE (Connecting REpositories) Knowledge Media institute The Open University @petrknoth 1/39 Outline 1. The

More information

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved.

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. 1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. Integrating Complex Financial Workflows in Oracle Database Xavier Lopez Seamus Hayes Oracle PolarLake, LTD 2 Copyright 2011, Oracle

More information

Information Resources in Molecular Biology Marcela Davila-Lopez How many and where

Information Resources in Molecular Biology Marcela Davila-Lopez How many and where Information Resources in Molecular Biology Marcela Davila-Lopez (marcela.davila@medkem.gu.se) How many and where Data growth DB: What and Why A Database is a shared collection of logically related data,

More information

Paving the Rocky Road Toward Open and FAIR in the Field Sciences

Paving the Rocky Road Toward Open and FAIR in the Field Sciences Paving the Rocky Road Toward Open and FAIR Kerstin Lehnert Lamont-Doherty Earth Observatory, Columbia University IEDA (Interdisciplinary Earth Data Alliance), www.iedadata.org IGSN e.v., www.igsn.org Field

More information

A comprehensive and standardised e-infrastructure for analysing medical metabolic phenotype data. 1st September 2015

A comprehensive and standardised e-infrastructure for analysing medical metabolic phenotype data. 1st September 2015 Deliverable 8.4.1 Project ID 654241 Project Title Project Acronym Start Date of the Project Duration of the Project A comprehensive and standardised e-infrastructure for analysing medical metabolic phenotype

More information

Towards FAIRness: some reflections from an Earth Science perspective

Towards FAIRness: some reflections from an Earth Science perspective Towards FAIRness: some reflections from an Earth Science perspective Maggie Hellström ICOS Carbon Portal (& ENVRIplus & SND & Lund University ) Good data management in the Nordic countries Stockholm, October

More information

Exploring and Exploiting the Biological Maze. Presented By Vidyadhari Edupuganti Advisor Dr. Zoe Lacroix

Exploring and Exploiting the Biological Maze. Presented By Vidyadhari Edupuganti Advisor Dr. Zoe Lacroix Exploring and Exploiting the Biological Maze Presented By Vidyadhari Edupuganti Advisor Dr. Zoe Lacroix Motivation An abundance of biological data sources contain data about scientific entities, such as

More information

The CALBC RDF Triple store: retrieval over large literature content

The CALBC RDF Triple store: retrieval over large literature content The CALBC RDF Triple store: retrieval over large literature content Samuel Croset, Christoph Grabmüller, Chen Li, Silverstras Kavaliauskas, Dietrich Rebholz-Schuhmann croset@ebi.ac.uk 10 th December 2010,

More information

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper.

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper. Semantic Web Company PoolParty - Server PoolParty - Technical White Paper http://www.poolparty.biz Table of Contents Introduction... 3 PoolParty Technical Overview... 3 PoolParty Components Overview...

More information

DATA MANAGEMENT PLANS Requirements and Recommendations for H2020 Projects. Matthias Razum April 20, 2018

DATA MANAGEMENT PLANS Requirements and Recommendations for H2020 Projects. Matthias Razum April 20, 2018 DATA MANAGEMENT PLANS Requirements and Recommendations for H2020 Projects Matthias Razum April 20, 2018 DATA MANAGEMENT PLANS (DMP) typically state what data will be created and how, outline the plans

More information

The Final Updates. Philippe Rocca-Serra Alejandra Gonzalez-Beltran, Susanna-Assunta Sansone, Oxford e-research Centre, University of Oxford, UK

The Final Updates. Philippe Rocca-Serra Alejandra Gonzalez-Beltran, Susanna-Assunta Sansone, Oxford e-research Centre, University of Oxford, UK The Final Updates Supported by the NIH grant 1U24 AI117966-01 to UCSD PI, Co-Investigators at: Philippe Rocca-Serra Alejandra Gonzalez-Beltran, Susanna-Assunta Sansone, Oxford e-research Centre, University

More information

The Data Life Cycle a Researcher Perspective

The Data Life Cycle a Researcher Perspective The Data Life Cycle a Researcher Perspective Dr Philippa Griffin Bioinformatician/Research Fellow EMBL-ABR / Melbourne Bioinformatics / UoM - Fly population location (latitude) - Year collected - Frequency

More information

Welcome to the Pure International Conference. Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018

Welcome to the Pure International Conference. Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018 0 Welcome to the Pure International Conference Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018 1 Mendeley Data Use Synergies with Pure to Showcase Additional Research Outputs Nikhil Joshi Solutions

More information

New Approach to Graph Databases

New Approach to Graph Databases Paper PP05 New Approach to Graph Databases Anna Berg, Capish, Malmö, Sweden Henrik Drews, Capish, Malmö, Sweden Catharina Dahlbo, Capish, Malmö, Sweden ABSTRACT Graph databases have, during the past few

More information

Striving for efficiency

Striving for efficiency Ron Dekker Director CESSDA Striving for efficiency Realise the social data part of EOSC How to Get the Maximum from Research Data Prerequisites and Outcomes University of Tartu, 29 May 2018 Trends 1.Growing

More information

Semantic Annotation, Search and Analysis

Semantic Annotation, Search and Analysis Semantic Annotation, Search and Analysis Borislav Popov, Ontotext Ontology A machine readable conceptual model a common vocabulary for sharing information machine-interpretable definitions of concepts in

More information

Leveraging Software-Defined Storage to Meet Today and Tomorrow s Infrastructure Demands

Leveraging Software-Defined Storage to Meet Today and Tomorrow s Infrastructure Demands Leveraging Software-Defined Storage to Meet Today and Tomorrow s Infrastructure Demands Unleash Your Data Center s Hidden Power September 16, 2014 Molly Rector CMO, EVP Product Management & WW Marketing

More information

IT Challenges and Initiatives in Scientific Research

IT Challenges and Initiatives in Scientific Research IT Challenges and Initiatives in Scientific Research Alberto Di Meglio CERN openlab Deputy Head DOI: 10.5281/zenodo.9809 LHC Schedule 2009 2010 2011 2011 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022

More information

How FAIR am I? FAIR Principles and Interoperability of Data and Tools

How FAIR am I? FAIR Principles and Interoperability of Data and Tools How FAIR am I? FAIR Principles and Interoperability of Data and Tools Peter Doorn, DANS @pkdoorn @dansknaw Plan-Europe - Platform of National escience Centers in Europe PLAN-E meeting, April 27 & 28, 2017,

More information

Project Data Management

Project Data Management Project Management Niclas Jareborg, NBIS niclas.jareborg@bils.se Management Seminar, 2016-03-18 Four Facilities Short- and medium-term support Wide competence in bioinformatics distributed at all large

More information

XML in the bipharmaceutical
