EUDAT B2FIND: A Cross-Discipline Metadata Service and Discovery Portal

Heinrich Widmann, DKRZ
DI4R 2016, Krakow, 28 September 2016
www.eudat.eu
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-infrastructures. Contract No. 654065

Outline
  EUDAT and the B2 Service Suite
  Guidelines and Concepts
  B2FIND, EUDAT's Discovery Service
    MD Ingestion and the B2FIND Schema
    Disciplines, Communities and the MD Catalogue
    Data Access Identifiers
    Discovery Portal
  Outlook and Summary

EUDAT and the B2 Service Suite

EUDAT
The project:
  European Data Infrastructure (EUDAT), funded by the EU Horizon 2020 programme
  Started in 2011, now in its 2nd phase 'EUDAT2020', which will end in 2018
  From 2018 onwards: agreement of cooperation
Motivation:
  Manage the rising tide of research data
  Improve interoperability in a wide cross-disciplinary scope
Objective:
  Build up a Collaborative Data Infrastructure, based on common data services driven by the requirements of the research communities

B2 Service Suite
http://www.eudat.eu/services

Guidelines and Concepts

The FAIR principles and the B2FIND approach
Findability := ease with which information can be found
  B2FIND: powerful and easy-to-use search features and functionalities
Accessibility := ability to access [...] data stored within repositories
  B2FIND: unique and persistent identification and resolvability of data objects
Interoperability := "ability of multiple systems with different [...] structures to exchange data with minimal loss of content [...]" (NISO)
  B2FIND: comprehensive cross-disciplinary MD catalogue based on common standards and minimising loss of information
Reusability := ability to re-use data created by others
  B2FIND: cross-discipline approach and catalogue covering multiple sources

Levels of Interoperability
[Figure: three levels leading from heterogeneity to homogeneity.
  Research communities (data providers): MD generation in community-specific schemas (Schema A, Schema B, Schema C); information loss
  Data repositories (e.g. B2SHARE/B2SAFE, or aggregators such as DataCite): collect and extract MD (Schema B2S); information loss
  Service provider (e.g. EUDAT-B2FIND): B2FIND harvest and mapping to the B2FIND schema]

B2FIND MD Ingestion and Common Schema

B2FIND Ingestion Workflow
Workflow stages:
  Data provider (community): MD generation and specification by the MD providers, using community-specific MD schemas
  EUDAT-B2FIND: MD harvesting, mapping and validation, uploading and indexing
  User (scientist or researcher): search and data access
Harvest specification: OAI URL, OAI subsets, MD formats
Mapping specification: XPath rules
Only a few preconditions have to be fulfilled to join B2FIND:
  Harvesting endpoint
  Specification of the MD format
Guarantee data synchronisation by frequent and incremental data harvesting
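The harvest and mapping specifications above can be illustrated with a minimal sketch. It is not B2FIND's actual implementation: the endpoint URL, the set name, the sample record and the rule table MAPPING_RULES are hypothetical, and the target field names are assumptions chosen to resemble the B2FIND schema. Only the OAI-PMH request parameters (verb, metadataPrefix, set, from) and the Dublin Core namespaces are taken from the respective standards.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

NS = {"dc": "http://purl.org/dc/elements/1.1/"}

# Hypothetical mapping specification: target field -> XPath rule.
MAPPING_RULES = {
    "title": ".//dc:title",
    "description": ".//dc:description",
    "creator": ".//dc:creator",
    "publication_year": ".//dc:date",
    "source": ".//dc:identifier",
}

# Hypothetical Dublin Core record as a provider might expose it.
SAMPLE_RECORD = """<oai_dc:dc
    xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
    xmlns:dc="http://purl.org/dc/elements/1.1/">
  <dc:title>Sea surface temperature, North Atlantic</dc:title>
  <dc:creator>Doe, Jane</dc:creator>
  <dc:creator>Roe, Richard</dc:creator>
  <dc:date>2015-03-01</dc:date>
  <dc:identifier>http://example.org/dataset/42</dc:identifier>
</oai_dc:dc>"""


def build_harvest_url(base_url, metadata_prefix, set_spec=None, from_date=None):
    """Build an OAI-PMH ListRecords request; from_date makes it incremental."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec
    if from_date:
        params["from"] = from_date
    return base_url + "?" + urlencode(params)


def map_record(xml_text, rules=MAPPING_RULES):
    """Apply each XPath rule and collect the non-empty text values."""
    root = ET.fromstring(xml_text)
    mapped = {}
    for field, xpath in rules.items():
        values = [el.text.strip() for el in root.findall(xpath, NS) if el.text]
        if values:
            mapped[field] = values
    # Reduce a full date to the YYYY form used for the publication year.
    if "publication_year" in mapped:
        mapped["publication_year"] = mapped["publication_year"][0][:4]
    return mapped


harvest_url = build_harvest_url(
    "http://example.org/oai", "oai_dc", set_spec="climate", from_date="2016-09-01"
)
mapped = map_record(SAMPLE_RECORD)
```

Keeping the rules in a table rather than in code mirrors the idea that each community only has to supply a specification, not a program.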

B2FIND MD Schema (extract)

Metadata type | B2FIND field name | Allowed values | Semantic definition | Level of obligation | Occurrence
General information | Title | Free text (Unicode) | A name or title by which an information resource is known | Mandatory | 1
General information | Description | Free text | Additional info | Recommended | 0-1
Data access | Source | Valid URL or URN | Unique link to data resource | Mandatory (1) | 0-1
Data access | PID | Persistent Identifier | As Source, plus persistent and resolvable | Mandatory (1) | 0-1
Data access | DOI | Digital Object Identifier | As PID, plus citable | Mandatory (1) | 0-1
Provenance data | Creator | Semicolon-separated list of names | Main researchers involved in data production | Recommended | 0-n
Provenance data | Discipline | List of values from a CV (controlled vocabulary) | Field of research | Recommended | 0-n
Provenance data | Publication Year | YYYY | The year the data are published | Recommended | 1
Formal data | Temporal Coverage | Interval of 2 date-times [Begin, End] | The temporal limits of a date-time | Optional | 1-n
Formal data | Spatial Coverage | Spatial box or point [[minlat,minlon ]] | The spatial limits of a place | Optional | 1-n
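A validation step against this schema extract could look roughly like the following sketch. Several points are assumptions and not stated on the slide: the lower-case key names, the reading of "Mandatory (1)" as "at least one of Source, PID or DOI must be present", ISO dates for the temporal interval, and the coordinate order (lat, lon) for a point or (minlat, minlon, maxlat, maxlon) for a box.

```python
import re
from datetime import date


def _valid_interval(interval):
    """True if interval is a [begin, end] pair of ISO dates with begin <= end."""
    try:
        begin, end = (date.fromisoformat(v) for v in interval)
    except (TypeError, ValueError):
        return False
    return begin <= end


def _valid_spatial(coords):
    """True for a (lat, lon) point or a (minlat, minlon, maxlat, maxlon) box."""
    if len(coords) == 2:
        points = [coords]
    elif len(coords) == 4:
        points = [coords[:2], coords[2:]]
    else:
        return False
    return all(-90 <= lat <= 90 and -180 <= lon <= 180 for lat, lon in points)


def validate_record(record):
    """Return a list of problems; an empty list means the record passes."""
    problems = []
    if not record.get("title"):
        problems.append("title is mandatory")
    # Assumption: one of the three data access identifiers is required.
    if not any(record.get(key) for key in ("source", "pid", "doi")):
        problems.append("one of source, pid or doi is required")
    year = record.get("publication_year")
    if year is not None and not re.fullmatch(r"\d{4}", str(year)):
        problems.append("publication_year must have the form YYYY")
    interval = record.get("temporal_coverage")
    if interval is not None and not _valid_interval(interval):
        problems.append("temporal_coverage must be a valid [begin, end] interval")
    coords = record.get("spatial_coverage")
    if coords is not None and not _valid_spatial(coords):
        problems.append("spatial_coverage must be a valid point or box")
    return problems


# Hypothetical records for illustration only.
good = {
    "title": "Sea surface temperature",
    "doi": "10.1234/example",
    "publication_year": "2016",
    "temporal_coverage": ("2010-01-01", "2011-01-01"),
    "spatial_coverage": (50.0, 8.0),
}
bad = {"description": "no title", "publication_year": "16",
       "spatial_coverage": (95.0, 8.0)}
```

Returning a list of problems instead of raising on the first one suits a harvesting setting, where a provider benefits from seeing every issue in a record at once.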

B2FIND Disciplines, Communities and MD Catalogue

The Facet Discipline: Controlled Vocabulary
[Figure: tree of the fields of knowledge.
  Top level: Humanities, Social sciences, Natural sciences, Professionals
  Below (examples): Arts, History, Linguistics, Archaeology, Biology, Physics, Earth Sciences, Engineering
  Further below (examples): Elementary Particle Physics, Material science, Crystallography]
Taken from the List of Academic Disciplines (http://en.wikipedia.org/wiki/list_of_academic_disciplines_and_sub-disciplines) and The Fields of Knowledge (http://www.thingsmadethinkable.com/item/fields_of_knowledge.php?focus=natural_sciences)

Coverage of Disciplines in B2FIND

B2FIND MD Catalogue: Ingestion Status
Community groups: Humanities, Social Sciences, Natural Sciences, Cross Discipline
17 communities, > 450000 MD records

B2FIND Data Access

B2FIND Data Access Identifiers: Resolvability and Levels of Aggregation
[Figure: the dc:identifier value of a resource is stored in one of the B2FIND metadata fields Source, PID or DOI. Resolution and access run via a Handle server for PIDs (PID_1, PID_2, PID_3) and via the DOI resolver for DOIs, leading to a landing page, a data collection or the data. Further label: Stricter Policies.]
Identifier types and their properties:
  URL (Source): unique; persistent and resolvable uncertain (??); not citable (x)
  PID: unique, persistent and resolvable; not citable (x)
  DOI: unique, persistent, resolvable and citable
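How a harvested identifier might be sorted into the Source, PID or DOI field can be sketched as follows. This is an illustration under stated assumptions, not B2FIND's rule set: the prefix lists, the regular expressions and the function names are hypothetical, and the DOI pattern is only a commonly used approximation of the "10.prefix/suffix" form.

```python
import re

# Approximate syntax checks (assumptions, not authoritative definitions).
DOI_PATTERN = re.compile(r"10\.\d{4,9}/\S+")
HANDLE_PATTERN = re.compile(r"[0-9A-Za-z.]+/\S+")

DOI_PREFIXES = ("https://doi.org/", "http://doi.org/",
                "https://dx.doi.org/", "http://dx.doi.org/", "doi:")
HANDLE_PREFIXES = ("https://hdl.handle.net/", "http://hdl.handle.net/", "hdl:")


def strip_prefix(value, prefixes):
    """Return value without a known resolver prefix, or None if none matches."""
    lowered = value.lower()
    for prefix in prefixes:
        if lowered.startswith(prefix):
            return value[len(prefix):]
    return None


def classify_identifier(value):
    """Return 'DOI', 'PID', 'Source' or None for an identifier string."""
    value = value.strip()
    bare = strip_prefix(value, DOI_PREFIXES)
    if DOI_PATTERN.fullmatch(bare if bare is not None else value):
        return "DOI"
    bare = strip_prefix(value, HANDLE_PREFIXES)
    if bare is not None and HANDLE_PATTERN.fullmatch(bare):
        # A DOI is itself a handle whose prefix starts with "10.".
        return "DOI" if DOI_PATTERN.fullmatch(bare) else "PID"
    if re.match(r"(?i)(https?://|urn:)", value):
        return "Source"
    return None
```

For example, classify_identifier("hdl:11304/abc-123") yields "PID", while a plain web address such as "http://example.org/data/42" falls back to "Source". Checking the strictest type first reflects the ordering of the identifier types: every DOI would also pass as a PID or a URL, but not the other way round.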

Coverage of Data Access Identifiers

B2FIND Discovery Portal

B2FIND Discovery Portal: Faceted Search and Data Access
B2FIND provides faceted search for:
  Free text
  Geospatial coverage
  Temporal coverage
  Publication year
  Textual facets such as Tags, Creator, Discipline, etc.
The dataset view displays the metadata:
  Spatial extent
  Table of field-value pairs
  Links to data resources
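The idea of faceted search can be shown with a small in-memory sketch. It does not use or describe the portal's search backend; the records, field names and functions below are hypothetical and serve only to show how free-text, facet and year filters combine and how facet counts are derived from the current hit list.

```python
from collections import Counter

# Hypothetical catalogue entries for illustration only.
RECORDS = [
    {"title": "Ocean temperature profiles", "discipline": ["Earth Sciences"],
     "creator": ["Doe, Jane"], "publication_year": 2014},
    {"title": "Corpus of spoken Dutch", "discipline": ["Linguistics"],
     "creator": ["Roe, Richard"], "publication_year": 2012},
    {"title": "Ocean salinity grid", "discipline": ["Earth Sciences", "Physics"],
     "creator": ["Doe, Jane"], "publication_year": 2016},
]


def search(records, text=None, facets=None, years=None):
    """Filter by free text in the title, facet values and a year range."""
    facets = facets or {}
    hits = []
    for rec in records:
        if text and text.lower() not in rec["title"].lower():
            continue
        if any(value not in rec.get(field, []) for field, value in facets.items()):
            continue
        if years and not (years[0] <= rec["publication_year"] <= years[1]):
            continue
        hits.append(rec)
    return hits


def facet_counts(records, field):
    """Count how often each value of a textual facet occurs in the records."""
    return Counter(value for rec in records for value in rec.get(field, []))


ocean_hits = search(RECORDS, text="ocean")
discipline_counts = facet_counts(ocean_hits, "discipline")
physics_hits = search(RECORDS, text="ocean", facets={"discipline": "Physics"})
```

Computing the counts on the hit list rather than on the whole catalogue is what lets a user narrow a search step by step: each facet value shows how many results would remain after selecting it.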

Outlook and Summary

Outlook
Handle scalability and granularity issues
  Levels of aggregation
Metrics for key indicators and metadata quality
  Establish content-related quality assurance
Add further search and distribution channels, e.g.
  Use linked data: potential for semantic enrichment
  Annotation functionality: users link datasets to external reference materials (vocabularies, ontologies, etc.)
  Query-based taxonomies: enabling hierarchical search, e.g. in trees of disciplines

Summary
EUDAT-B2FIND
  has established an operational service based on agreed standards and guidelines such as the FAIR principles,
  provides a discovery portal with powerful search functionalities, and
  is based on a unique catalogue of research data, combining many heterogeneous and cross-discipline sources.
Improved interoperability is achieved by homogenisation to a common metadata schema.
Further efforts are being made to address the demands of the communities and data projects and to adapt the system to future challenges.

Thank you for your attention!
Links:
  Info: http://eudat.eu/b2find
  Portal: http://b2find.eudat.eu
Contact: www.eudat.eu/support-request, widmann@dkrz.de