EUDAT-B2FIND A FAIR and Interdisciplinary Discovery Portal for Research Data

Similar documents
EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal

B2FIND and Metadata Quality

B2FIND: EUDAT Metadata Service. Daan Broeder, et al. EUDAT Metadata Task Force

EUDAT. A European Collaborative Data Infrastructure. Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT

EUDAT- Towards a Global Collaborative Data Infrastructure

RADAR. Establishing a generic Research Data Repository: RESEARCH DATA REPOSITORY. Dr. Angelina Kraft

Inge Van Nieuwerburgh OpenAIRE NOAD Belgium. Tools&Services. OpenAIRE EUDAT. can be reused under the CC BY license

Towards a joint service catalogue for e-infrastructure services

DOIs for Research Data

Data Discovery - Introduction

The EUDAT Collaborative Data Infrastructure

EUDAT Training 2 nd EUDAT Conference, Rome October 28 th Introduction, Vision and Architecture. Giuseppe Fiameni CINECA Rob Baxter EPCC EUDAT members

EUDAT Towards a Collaborative Data Infrastructure

irods workflows for the data management in the EUDAT pan-european infrastructure

BPMN Processes for machine-actionable DMPs

EUDAT & SeaDataCloud

Data Staging and Data Movement with EUDAT. Course Introduction Helsinki 10 th -12 th September, Course Timetable TODAY

Specific requirements on the da ra metadata schema

Webinar Annotate data in the EUDAT CDI

OpenAIRE Guidelines Promoting Repositories Interoperability and Supporting Open Access Funder Mandates

The DataCite Metadata Schema. Frauke Ziedorn Workshop: Metadata and Persistent Identifiers for Social and Economic Data 7th May 2012

re3data.org - Making research data repositories visible and discoverable

Deliverable 6.4. Initial Data Management Plan. RINGO (GA no ) PUBLIC; R. Readiness of ICOS for Necessities of integrated Global Observations

Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan

EUDAT. Towards a pan-european Collaborative Data Infrastructure

RADAR Project. Data Archival and Publication as a Service. Matthias Razum FIZ Karlsruhe RESEARCH DATA REPOSITORIUM. Zurich, December 15, 2014

Data and visualization

Increasing access to OA material through metadata aggregation

Demos: DMP Assistant and Dataverse

Metadata Requirements to document Data Analyses and Syntax Files in a Virtual Research Environment (VRE) - The use case soeb 3

FAIR-aligned Scientific Repositories: Essential Infrastructure for Open and FAIR Data

Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard

WORKSHOP 28 TH /29 TH APRIL Christine Staiger

B2SAFE metadata management

Data management and discovery

USE CASES IN SEISMOLOGY. Alberto Michelini INGV

Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation

Registry Interchange Format: Collections and Services (RIF-CS) explained

EUDAT. Towards a Collaborative Data Infrastructure. Ari Lukkarinen CSC-IT Center for Science, Finland NORDUnet 2012 Oslo, 18 August 2012

For each use case, the business need, usage scenario and derived requirements are stated. 1.1 USE CASE 1: EXPLORE AND SEARCH FOR SEMANTIC ASSESTS

EUDAT - Open Data Services for Research

SEAD Data Services. Jim Best Practices in Data Infrastructure Workshop. Cooperative agreement #OCI

Striving for efficiency

Part 2: Current State of OAR Interoperability. Towards Repository Interoperability Berlin 10 Workshop 6 November 2012

Graph-based Data Integration in EUDAT Data Infrastructure

Data Replication: Automated move and copy of data. PRACE Advanced Training Course on Data Staging and Data Movement Helsinki, September 10 th 2013

Research Data Repository Interoperability Primer

EUDAT Common data infrastructure

EUDAT. Towards a pan-european Collaborative Data Infrastructure. Damien Lecarpentier CSC-IT Center for Science, Finland EUDAT User Forum, Barcelona

Making research data repositories visible and discoverable. Robert Ulrich Karlsruhe Institute of Technology

FREYA Connected Open Identifiers for Discovery, Access and Use of Research Resources

The European Commission s science and knowledge service. Joint Research Centre

CMIP6 Data Citation and Long- Term Archival

(Geo)DCAT-AP Status, Usage, Implementation Guidelines, Extensions

Why CERIF? Keith G Jeffery Scientific Coordinator ERCIM Anne Assserson eurocris. Keith G Jeffery SDSVoc Workshop Amsterdam

Ways for a Machine-actionable Processing Chain for Identifier, Metadata, and Data

OpenAIRE. Fostering the social and technical links that enable Open Science in Europe and beyond

COAR Interoperability Roadmap. Uppsala, May 21, 2012 COAR General Assembly

Initial Operating Capability & The INSPIRE Community Geoportal

GeoDCAT-AP Representing geographic metadata by using the "DCAT application profile for data portals in Europe"

Using Persistent Identifiers at

Welcome to the Pure International Conference. Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018

Robin Wilson Director. Digital Identifiers Metadata Services

Introduction

Interoperability Framework Recommendations

From Open Data to Data- Intensive Science through CERIF

Roy Lowry, Gwen Moncoiffe and Adam Leadbetter (BODC) Cathy Norton and Lisa Raymond (MBLWHOI Library) Ed Urban (SCOR) Peter Pissierssens (IODE Project

Remote Workflow Enactment using Docker and the Generic Execution Framework in EUDAT

ACDH AUSTRIAN CENTRE FOR DIGITAL HUMANITIES

RDF and Digital Libraries

Networking European Digital Repositories

GEOSS Data Management Principles: Importance and Implementation

Data Discovery Service for VI-SEEM

Basics in good research data management (RDM) for reviewing DMPs

PERSISTENT IDENTIFIERS FOR THE UK: SOCIAL AND ECONOMIC DATA

Towards repository interoperability

Metadata Catalogue Issues. Daan Broeder Max-Planck Institute for Psycholinguistics

Implementing the RDA Data Citation Recommendations for Long Tail Research Data. Stefan Pröll

Paving the Rocky Road Toward Open and FAIR in the Field Sciences

Using DCAT-AP for research data

RDDS: Metadata Development

Trust and Certification: the case for Trustworthy Digital Repositories. RDA Europe webinar, 14 February 2017 Ingrid Dillo, DANS, The Netherlands

Implementation of the CoreTrustSeal

OpenAIRE Guidelines for Data Archive Managers 1.0 December 2012

Archivierung und Publikation von Forschungsdaten mit RADAR

Harvesting Open Government Data with DCAT-AP

How to contribute information to AGRIS

INSPIRE Geoportal Rich user experience across member states services

The DOI Identifier. Drexel University. From the SelectedWorks of James Gross. James Gross, Drexel University. June 4, 2012

Networking European Digital Repositories

Reproducibility and FAIR Data in the Earth and Space Sciences

DataSTORRE Deposit Guide

Dataverse and DataTags

Indiana University Research Technology and the Research Data Alliance

Towards FAIRness: some reflections from an Earth Science perspective

RDMG Research Data Management Group at Freiburg University

Dutch View on URN:NBN and Related PID Services

RADAR - A repository for long tail data

Supporting Data Workflows at STFC. Brian Matthews Scientific Computing Department

Approaches to Making Dynamic Data Citeable Recommendations of the RDA Working Group Andreas Rauber

Transcription:

EUDAT-B2FIND A FAIR and Interdisciplinary Discovery Portal for Research Data Heinrich Widmann, DKRZ Claudia Martens, DKRZ Open Science Days, Berlin, 17 October 2017 www.eudat.eu EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-infrastructures. Contract No. 654065

Outline What is EUDAT-B2FIND? Which guidelines are followed? How we tackle the issue diversity vs. interoperability? How is the service implemented? Which data and domains does B2FIND cover? How are the metadata collected? How can you search the Discovery Portal? EUDAT B2FIND Open Science Days, 17. October 2017 2

What is EUDAT? The project EUDAT (see http://eudat.eu ) established a Common Data Infrastructe providing several generic services for interdisciplinary data management, driven by the communities EUDAT B2FIND Open Science Days, 17. October 2017 3

What is EUDAT-B2FIND? B2FIND is the metadata service of EUDAT consisting of An interdisciplinary metadata catalogue that spans a large number heterogeneous datasets harvested from various research communities and stored through the EUDAT service B2SHARE covering a wide range of highly diverse disciplines mapped to a common and unified schema An open search portal allowing researchers to find collections of scientific resources in a wide spread and cross-domain search space to access those resources through the given references in the metadata EUDAT B2FIND Open Science Days, 17. October 2017 4

Which guidelines are followed? The FAIR principles B2FIND approach Findability Discovery Portal with powerful search features based on rich metadata catalogue Accessibility Persistent Identifiers for unique resolvability of data objects Interoperability Interdisciplinary Catalogue based on Common Standards, Vocabs and Schema Reuseability Licenses, Provenance and Domain-relevant Information is provided EUDAT B2FIND Open Science Days, 17. October 2017 5

Heterogeneity How can data interoperability improved? Levels of Interoperability Homogeneity Disciplines Research Communities (Data Provider) Optional / Recomme nded Discipline specific 01010 10101 01010 01010 10101 01010 Schema A Schema B Schema C Mandatory Collect and extract MD Cross-disciplinary Data Repositories (e.g. B2SHARE or DataCite) DataCite B2FIND harvest and mapping Service Provider ( e.g. EUDAT-B2FIND ) B2FIND! 01010 10101010 0101010101010 0101010101 01010!

B2FIND How is the service implemented? uses standard APIs (as OAI- PMH) to harvest from various data providers performs a adaptable mapping of the domain specific metadata onto the unified B2FIND schema is based on CKAN (see ckan.org), an open source repository and portal software CKAN comes with a SOLR indexer, a Postgres DB and a power and RESTful API Extensible by open available and own developed CKAN extensions EUDAT B2FIND Open Science Days, 17. October 2017 7

Metadata Type B2FIND Field name B2FIND MD Schema (extract) based on DataCite 3.0 Allowed values Semantic definition Level of Obligation General Title Free text (unicode) A name or title a Mandatory 1 information resource is known Description Free text Additional info Recommended 0-1 Data Access Source Valid URL or URN Unique link to data resource 0-1 PID Persistent Identifier + persistent and Mandatory (1) 0-1 resolvable DOI Digital Object + citable 0-1 Provenance data Creator Discipline Publication Year Formal data Temporal Coverage Spatial Coverage Identifier ; -sep. list of names Main researchers Recommended 0-n involved in data prod. List of values from CV Field of research Recommended 0-n (Controlled Vocab) YYYY The year data are Recommended 1 published Interval of 2 DTimes The temporal limits of Optional 1-n [ Begin, End ] a date-time Spatial box or point The spatial limits of a Optional 1-n [[minlat,minlon ]] place. Occurence EUDAT B2FIND Open Science Days, 17. October 2017 8 1-3

Which data and domains does B2FIND covers? Ingestion Status of the MD Catalogue Humanities Social Sciences Natural Sciences Cross Discipline 19 communities > 600000 MD records EUDAT B2FIND Open Science Days, 17. October 2017 9

How can you search the Discovery Portal? B2FIND Discovery Portal Faceted Search and Data Access B2FIND provides faceted search for Free text Geo spatial Temporal coverage Publication year Textual facets as Tags Creator Discipline etc. Dataset view provides display of metadata : Spatial extent Table of field-value pairs Links to data resources EUDAT B2FIND Open Science Days, 17. October 2017 10

Data provider (Community) MD Generation and Specification MD Provider A MD Provider MD Provider How are the metadata collected? B2FIND Ingestion Workflow https://github.com/eudat-training/b2find-training Harvest specification : OAI-URL OAI subsets MD formats Mapping specification : XPATH rules Community specific MD schemas and MD Harvesting Mapping and Validation For joining B2FIND only a few preconditons has to be fulfilled Harvesting endpoint Spec. of MD format Gurantee data synchronisation by frequent and incremental data harvesting User (Scientist or Researcher) EUDAT-B2FIND Uploading and Indexer Search and Data Access EUDAT B2FIND Open Science Days, 17. October 2017 11

Summary EUDAT-B2FIND established an operative service based on agreed standards and guidelines as the FAIR principles provides a discovery portal with powerful faceted search functionalities is based on a joint catalogue of research data, combining many heterogeneous and cross-discipline sources achieves improved interoperability by homogenisation to a common metadata schema follows a low barrier approach for data providers to expose metadata to B2FIND EUDAT B2FIND Open Science Days, 17. October 2017 12

Thank you for your attention! Links : Info and Docs : http://eudat.eu/b2find Guidelines for data providers : http://b2find.eudat.eu/guidelines B2FIND Portal : http://b2find.eudat.eu Training : https://github.com/eudat-training/b2find-training Support : www.eudat.eu/support-request Contact : widmann@dkrz.de, martens@dkrz.de EUDAT B2FIND Open Science Days, 17. October 2017 13