Introduction to FAIRDOM (fair-dom.org): Findable Accessible Interoperable Reusable Data Operations Models

Similar documents
Extracting reproducible simulation studies from model repositories using the CombineArchive Toolkit

XML in the bipharmaceutical

About the Edinburgh Pathway Editor:

Maximizing the Value of STM Content through Semantic Enrichment. Frank Stumpf December 1, 2009

CACAO Training. Jim Hu and Suzi Aleksander Spring 2016

Acquiring Experience with Ontology and Vocabularies

Open Federated Social Networks Oscar Rodríguez Rocha

Searching the ENCODE Portal

Integration in the 21 st -Century Enterprise. Thomas Blackadar American Chemical Society Meeting New York, September 10, 2003

The ELIXIR of Linked Data

Integrating large, fast-moving, and heterogeneous data sets in biology.

Text mining tools for semantically enriching the scientific literature

Creating Accessible PDFs

Metadata Models for Experimental Science Data Management

Bioinformatics Data Distribution and Integration via Web Services and XML

Developing Online Databases and Serving Biological Research Data

Adobe Web Authoring using Adobe Dreamweaver Exam and objectives

> Semantic Web Use Cases and Case Studies

Update: MIRIAM Registry and SBO

SELF-SERVICE SEMANTIC DATA FEDERATION

Enabling Open Science: Data Discoverability, Access and Use. Jo McEntyre Head of Literature Services

Introduction to Data Management for Ocean Science Research

LATIHAN Identify the use of multimedia in various fields.

Welcome to the Pure International Conference. Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018

Ontology Servers and Metadata Vocabulary Repositories

USE QUICK ASSIST TO REMOTELY TROUBLESHOOT A FRIEND S COMPUTER

Building a Linked Open Data Knowledge Graph Henning Schoenenberger Michele Pasin. Frankfurt Book Fair 2017 October 11, 2017

Workshop on Web Archiving

How to share research data

SCAM Portfolio Scalability

Web Design Course Syllabus and Course Outline

PHASE 2 JOB AID 24 October 2016

SQA Advanced Unit specification. General information for centres. Unit title: Web Development Fundamentals. Unit code: HR7M 47

ISO INTERNATIONAL STANDARD. Health informatics Genomic Sequence Variation Markup Language (GSVML)

TextExpander Okta SCIM Configuration

1 Copyright 2013, Oracle and/or its affiliates. All rights reserved.

Microdata and schema.org

Bioqueries: A Social Community Sharing Experiences while Querying Biological Linked Data (

Creating and Viewing My Favorites

1. Introduction to the Common Language Infrastructure

INTRODUCTION TO THE ACS PUBLICATIONS PLATFORM

ARKive-ERA Project Lessons and Thoughts

BCH339N Systems Biology/Bioinformatics Spring 2018 Marcotte A Python programming primer

Technology in Action. Chapter Topics. Scope creep occurs when: 3/20/2013. Information Systems include all EXCEPT the following:

Cost effective Cheminformatics for Small Chemistry Teams Integrated Within Larger Discovery Groups

SBML to BioPAX. MIRIAM Annotations in use. Camille Laibe

Everyday Digital Skills Projects Plus skills learned through the project

World-Wide Web Protocols CS 571 Fall Kenneth L. Calvert All rights reserved

Chapter 11 Program Development and Programming Languages

Seema Sirpal Delhi University Computer Centre

Microsoft SharePoint Server

Biocomputing II Coursework guidance

enanomapper database, search tools and templates Nina Jeliazkova, Nikolay Kochev IdeaConsult Ltd. Sofia, Bulgaria

Foundation of Web Goal 4: Proficiency in Adobe Dreamweaver CC

Reaction kinetics database SABIO-RK

I R TECHNICAL RESEARCH REPORT. An XML-Based Approach to Integrating Semiconductor Process Information. by Jing Chen, Raymond A. Adomaitis TR

A cell-cycle knowledge integration framework

Taking a view on bio-ontologies. Simon Jupp Functional Genomics Production Team ICBO, 2012 Graz, Austria

ECAD & MCAD Model. Virtual Integration Using. Data Interoperability. Standards. Greg Pollari, Rockwell Collins

WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS

Developing Microsoft SharePoint Server 2013 Advanced Solutions

The MEG Metadata Schemas Registry Schemas and Ontologies: building a Semantic Infrastructure for GRIDs and digital libraries Edinburgh, 16 May 2003

USING THE SHARE POD. Share My Screen allows you and your guests to share your desktop live with an audience. It is useful for:

Spotlight Case Study: VectorWorks Spotlight Helps The Daily Show Deliver on Cue. Spotlight

Introduction to Computing using C++ Biomedical applications WELCOME TO CIS 1.5. Introduction to the course. Course structure

Data Management Plans. Sarah Jones Digital Curation Centre, Glasgow

AMNH Gerstner Scholars in Bioinformatics & Computational Biology Application Instructions

Designing an institutional research data management infrastructure for the life sciences

XML and Agent Communication

Online Photo Sharing with Flickr Website:

Managing your data. Niclas Jareborg, NBIS

Prof. Konstantinos Krampis Office: Rm. 467F Belfer Research Building Phone: (212) Fax: (212)

Bioinformatics Introduction. Sebastian Schmeier

Partners. Brief Description:

CGM v SVG. Computer Graphics Metafile v Scalable Vector Graphic. David Manock

MODULE 2 HTML 5 FUNDAMENTALS. HyperText. > Douglas Engelbart ( )

Linked Data and IIIF

All answers and help topics pertaining to Docsafe

Web Design and HTML. Web Page vs Web Site. Navigation. Links. A web page is a single page viewable using web browser. A web site is a set of web pages

Windows Presentation Foundation for.net Developers

Step by Step Instructions

Developing Microsoft SharePoint Server 2013 Core Solutions

Visualisation and Work Instructions

Introduction to BioHPC New User Training

SEMBIOSPHERE: A SEMANTIC WEB APPROACH TO RECOMMENDING MICROARRAY CLUSTERING SERVICES

A tutorial report for SENG Agent Based Software Engineering. Course Instructor: Dr. Behrouz H. Far. XML Tutorial.

GSLIS Technology Orientation Requirement (TOR)

Exploring and Exploiting the Biological Maze. Presented By Vidyadhari Edupuganti Advisor Dr. Zoe Lacroix

FIT 100 LAB Activity 3: Constructing HTML Documents

Build Scientific Computing Infrastructure with Rebar3 and Docker. Eric Sage

Languages and tools for building and using ontologies. Simon Jupp, James Malone

Course: 2553A Administering Microsoft SharePoint Portal Server 2003

Contents. G52IWS: The Semantic Web. The Semantic Web. Semantic web elements. Semantic Web technologies. Semantic Web Services

Lecture Telecooperation. D. Fensel Leopold-Franzens- Universität Innsbruck

Enter the site Title: Student Name s eportfolio Choose your Website Domain: Use a Subdomain of Weebly.com

Drupal 8 Webform: When Contact Form isn t enough

Alpha 1 i2b2 User Guide

Our greatest weakness lies in giving up. The most certain way to succeed is always to try just one more time. ~Thomas A. Edison

How to store and visualize RNA-seq data

ACDH AUSTRIAN CENTRE FOR DIGITAL HUMANITIES

Transcription:

Introduction to FAIRDOM (fair-dom.org): Findable Accessible Interoperable Reusable Data Operations Models Jon Olav Vik Centre for Integrative Genetics (CIGENE) IHA, BIOVIT, NMBU www.nmbu.no/prosjekter/digisal

Norges Forskningsråd, bioteknologiprogrammet. Liv, teknologi og verdiskapning. 12 forskerprosjekter + nettverksprosjektet "Digitalt Liv Norge". Totalt 360 millioner kroner over 5 år. 2

Den digitale laksen: et bibliotek (Storyboard fra animasjon i samarbeid med NMBUs kommunikasjonsavdeling og animatør Tor Martin Austad, visuallab.no.)

Menu FAIRDOM Findable Accessible Interoperable Reusable Data Operations Models ISA Investigation Study Assay Providing context for data Metadata = data about data Making data easy to navigate and use Your own data management plan

Electrofished trout A menagerie of biological data Heart deformation Reindeer tracking (obsolete file format ) Magnetic resonance imaging RNAseq gene expression Satellite vegetation index Heart electrophysiology Liver metabolomics 5 GB/sample 5 MB 500 kb

Now you. Team up with a friend Tell them your project data Then we will review

Responses 8-)

fair-dom.org manages to be useful for...

Data generation and management in the Digital Salmon Fabian Jacob Knowledge manager Jon Olav SPARQL/RDF with GBOL (Genetic Biology Ontology Language) 9

fairdomhub.org 10

16

API (application programming interface)

Investigation Study Assay: Scientists actually agreeing on something

The Investigation-Study-Assay (ISA) structure Programme Overarching research theme (The Digital Salmon) Project Research grant (DigiSal, GenoSysFat) Investigation A particular biological process, phenomenon or thing (typically corresponds to [plans for] one or more closely related papers) Study Experiment whose design reflects a specific biological research question Assay Standardized measurement or diagnostic experiment using a specific protocol (applied to material from a study)

Now you. Programme Overarching research theme (if applicable) Project Research grant Investigation A particular biological process, phenomenon or thing (typically corresponds to [plans for] one or more closely related papers) Study Experiment whose design reflects a specific biological research question Assay Standardized measurement or diagnostic experiment using a specific protocol (applied to material from a study) makes a Data File, may have a SOP (standard operating procedure),

Standards and metadata will save your sanity The analyst who had to work with people who didn't annotate their data Théodore Géricault, oil on canvas, 1822

Open formats? Reindeer tracking (obsolete file format )

Data about data Descriptive metadata (title, abstract, author, keywords) Structural metadata (linking related pieces of information) Administrative metadata (file format, access control, version history)

Minimum information standards

Encoding knowledge for humans and computers HTML (hypertext markup language) browser human-readable 25

Encoding knowledge for humans and computers SBML (systems biology markup language) automatic human-readable (PDF), or executable (C++, ) 26

Standards related to dynamical systems modelling co.mbine.org Fig.: Mosaic of standards, adapted from (Chelliah et al., 2009, DILS)

Self-assembling jigsaw puzzles MODELL om kalsium-ioner i røde blodceller inni hovedpulsåren DATA on Ca 2+ in erythrocytes in aorta d[ca]/dt =... 28

Self-assembling jigsaw puzzles: Ontological annotation MODELL om kalsium-ioner i røde blodceller inni hovedpulsåren FMA:3734 CHEBI:29108 FMA:62885 DATA CHEBI:29108 on Ca 2+ FMA:62885 in erythrocytes FMA:3734 in aorta d[ca]/dt =... New knowledge automatically connects to that which already exists 29

Now you. Three minutes. Choose one aspect of your work and list: Data formats that go into and out of it. Any databases (input and output). Relevant ontologies (lists of concepts and relationships between them), e.g. a namespace of chemical identifiers gene names a bacterial taxonomy Your primary users: Who will take your results further?

The data manager job Talk to each partner in the project, from lab techs to researchers and professors. Identify what data they take in and give out. Do they have lab protocols, bioinformatics pipelines,? What file formats and databases do they relate to? Where do they publish? Who uses their data? Guide them gently towards being conscious of standards and formats. Help them help themselves. When to tidy things up? when you hand it off to someone else, if not before

So, you want research data to be shareable and reusable? Data management plans must be made concrete in each project You need to allocate people time to coordinate and assist 32

Wrapping up You now have a flying start for your data management plan. Later today: FAIRDOM data management checklist, 80 minutes in pairs/groups, 60 minutes to consolidate for your own project. a strategy for gradually fleshing out the details, with your partners Take notes of questions arising, we're here to help!