WORKSHOP 28 TH /29 TH APRIL Christine Staiger

Similar documents
Persistent Identifiers for Audiovisual Archives and Cultural Heritage

Persistent Identifiers

PIDs for CLARIN. Daan Broeder CLARIN / Max-Planck Institute for Psycholinguistics

PID System for eresearch

EUDAT-B2FIND A FAIR and Interdisciplinary Discovery Portal for Research Data

Persistent identifiers, long-term access and the DiVA preservation strategy

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal

Using Persistent Identifiers at

EUDAT. A European Collaborative Data Infrastructure. Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT

Digital Preservation. Unique Identifiers

1. Understand what persistent identifiers are, how they work and the benefits to using them in a DSpace repository environment

Data Replication: Automated move and copy of data. PRACE Advanced Training Course on Data Staging and Data Movement Helsinki, September 10 th 2013

EUDAT. Towards a pan-european Collaborative Data Infrastructure

EUDAT. Towards a pan-european Collaborative Data Infrastructure. Damien Lecarpentier CSC-IT Center for Science, Finland EUDAT User Forum, Barcelona

Data management and discovery

Utilizing PBCore as a Foundation for Archiving and Workflow Management

For each use case, the business need, usage scenario and derived requirements are stated. 1.1 USE CASE 1: EXPLORE AND SEARCH FOR SEMANTIC ASSESTS

doi> Digital Object Identifier

EUDAT - Open Data Services for Research

EUDAT- Towards a Global Collaborative Data Infrastructure

ITTC Science of Communication Networks The University of Kansas EECS 784 Identifiers, Names, and Addressing

Data Discovery - Introduction

Dutch View on URN:NBN and Related PID Services

USE CASES IN SEISMOLOGY. Alberto Michelini INGV

DOI for Astronomical Data Centers: ESO. Hainaut, Bordelon, Grothkopf, Fourniol, Micol, Retzlaff, Sterzik, Stoehr [ESO] Enke, Riebe [AIP]

EUDAT Towards a Collaborative Data Infrastructure

The European Commission s science and knowledge service. Joint Research Centre

GeoDCAT-AP Representing geographic metadata by using the "DCAT application profile for data portals in Europe"

A Vision for Bigger Biomedical Data: Integration of REDCap with Other Data Sources

Ambiguities in the Implementation of the INSPIRE directive for Metadata. J.Walther, F.Schenk

Towards a joint service catalogue for e-infrastructure services

DOIs for Research Data

Inge Van Nieuwerburgh OpenAIRE NOAD Belgium. Tools&Services. OpenAIRE EUDAT. can be reused under the CC BY license

EUDAT. Towards a pan-european Collaborative Data Infrastructure

epic and the Handle System

Open Access to Publications in H2020

Joining the BRICKS Network - A Piece of Cake

Global ebusiness Interoperability Test Beds (GITB) Test Registry and Repository User Guide

Implementing the RDA Data Citation Recommendations for Long Tail Research Data. Stefan Pröll

EIDR: ID FORMAT Ver. 1.1 August 19, 2013

ResolutionDefinition - PILIN Team Wiki - Trac. Resolve. Retrieve. Reveal Association. Facets. Indirection. Association data. Retrieval Key.

An Entity Name Systems (ENS) for the [Semantic] Web

Handles at LC as of July 1999

BPMN Processes for machine-actionable DMPs

The e-depot in practice. Barbara Sierman Digital Preservation Officer Madrid,

FREYA Connected Open Identifiers for Discovery, Access and Use of Research Resources

Science Europe Consultation on Research Data Management

EUDAT Common data infrastructure

EUDAT. Towards a Collaborative Data Infrastructure. Ari Lukkarinen CSC-IT Center for Science, Finland NORDUnet 2012 Oslo, 18 August 2012

Information Infrastructure: Foundations for ABS Transformation. Stuart Girvan, Australian Bureau of Statistics MSIS Paris, April 2013.

Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard

EUDAT Training 2 nd EUDAT Conference, Rome October 28 th Introduction, Vision and Architecture. Giuseppe Fiameni CINECA Rob Baxter EPCC EUDAT members

Main focus of the of the presentation

B2FIND and Metadata Quality

1. CONCEPTUAL MODEL 1.1 DOMAIN MODEL 1.2 UML DIAGRAM

Afsaneh Teymourikhani 1 Saeedeh Akbari-Daryan 2

The Experimental Project of DOI Registration for Research Data at Japan Link Center (JaLC)

RADAR A Repository for Long Tail Data

Building for the Future

FAIR-aligned Scientific Repositories: Essential Infrastructure for Open and FAIR Data

Oracle Workflow. 1 Introduction. 2 Web Services Overview. 1.1 Intended Audience. 1.2 Related Documents. Web Services Guide

Implementation of Open-World, Integrative, Transparent, Collaborative Research Data Platforms: the University of Things (UoT)

SharePoint 2013 Power User

Drupal for Virtual Learning And Higher Education

Adoption of Data Citation Outcomes by BCO-DMO

Key Elements of Global Data Infrastructures

Resilient Linked Data. Dave Reynolds, Epimorphics

Persistent identifiers in the national bibliography context

Safe Havens in a Choppy Sea:

Creating a mytraining Learner Account

State of the Art in Data Citation

Ways for a Machine-actionable Processing Chain for Identifier, Metadata, and Data

LIBER Webinar: A Data Citation Roadmap for Scholarly Data Repositories

Strategy for long term preservation of material collected for the Netarchive by the Royal Library and the State and University Library 2014

SBML to BioPAX. MIRIAM Annotations in use. Camille Laibe

C2CAMP. (A Working Title) International Coordination for Science Data Infrastructure: A Symposium 1 Nov 2017

DOOR Digital Open Object Repository User Manual v1.0 July 23, 2006

Slide 1 & 2 Technical issues Slide 3 Technical expertise (continued...)

Pilot integration of an electronic lab notebook and an open source research data repository as part of a modular biomedical research data platform

DuraSpace FAIRness and GDPR

User Guide For LabCollector Workflow Manager

Service Guidelines. This document describes the key services and core policies underlying California Digital Library (CDL) s EZID Service.

MACHINE ACTIONABLE INTEGRATION OF DATACITE AND DDI METADATA

DOIs for Scientists. Kirsten Sachs Bibliothek & Dokumentation, DESY

GEOSS Data Management Principles: Importance and Implementation

Data and visualization

EIDR: ID FORMAT. Ver January 2012

Reproducible Workflows Biomedical Research. P Berlin, Germany

CLARIN s central infrastructure. Dieter Van Uytvanck CLARIN-PLUS Tools & Services Workshop 2 June 2016 Vienna

Harvesting Open Government Data with DCAT-AP

irods workflows for the data management in the EUDAT pan-european infrastructure

Coupled Computing and Data Analytics to support Science EGI Viewpoint Yannick Legré, EGI.eu Director

Application aware access and distribution of digital objects using Named Data Networking (NDN)

Digital Object Architecture

EUDAT. Towards a pan-european Collaborative Data Infrastructure

Technical specifications for the Open Annotation Service

Introduction

Persistent Identifier the data publishing perspective. Sünje Dallmeier-Tiessen, CERN 1

CrossRef tools for small publishers

Child Welfare Digital Services Sprint Review Presentation

Transcription:

Persistent Persistent Identifiers Identifiers (PIDs) (PIDs) WORKSHOP 28 TH /29 TH APRIL 2015 WORKSHOP 28 TH /29 TH APRIL 2015 Christine Staiger

Persistent Identifiers (PIDs) Pointers to data resources Digital Resources: Data, metadata, documents Real world objects: Species, patient, cell line Globally unique Exist infinitely long Used to identify and retrieve resources Examples: ISBNs, BSNs, DOIs, EPIC PIDS, URIs 2

Digital Object (DO) PID Data Metadata Synchronise PID, Data and Metadata during creation, maintenance and deletion of a digital object! 3

PIDs are static PID 1 PID 2 PID 3 PID 4 Data 1 Data 2 World of data infrastructure Data 4 Data 3 4

Workflow1: Change storage environment PID1 PID2 Storage site A Storage site B 5

Use Case 1: Digital repositories PIDs point to landing page of the digital repository showing metadata Real data can be downloaded from this page with another link E.g. B2SHARE, 3.TU Datacentrum PID http://hdl.handle.net/11304/3265434c-4b34-11e4-81ac-dcbd1b51435e resolves to https://b2share.eudat.eu/record/139 6

Use Case 2: Enabling data flows PIDs point to data directly If needed create another field specifying the data type to choose application 7

Use Case 2a: Retrieving information Use data in workflow via PID, NOT via actual location! 8

Use Case 2b: Enabling workflows Execute program hidden behind a PID 9

PID resolution: Example Handle Handle Resolution Collection of handle services Services consist of several sites Sites contain several serversß 10

PID resolution: Example Handle Handle Resolution Client Local HS Local HS Global HS Local HS Local HS Site 1 Site 2 Site 3. Site n Site 1 Site 2 #1 #2 #1 #2 #3 #4. #n 123.456/abc URL 4 http://www.acme.com/ URL 8 http://www.ideal.com/ 11

Resolving PIDs 1. Client sends request to Global to resolve 0.NA/123 (prefix handle for 123/456) 2. Global Responds with Service Information for 123 Global Registry E.g. Handle system 3. Client gets request to resolve hdl:123/456 IP. 4. Server responds with handle data Secondary Site B #1 ccxv cx cx ccxv cx cx #1 #2 #1 #2 #3 Primary Site Secondary Site A ccxv cx cx Service Information Local Handle Service Local Service

Example: Relationships between DOs PID: prefix2/suffix2 PID: prefix1/suffix1 Metadata: key1: key2: prefix1/suffix1 Part of/has part relationships Metadata: key1: key2: prefix2/suffix2 key3: prefix3/suffix3 PID: prefix3/suffix3 Metadata: key1: key2: prefix1/suffix1 Model cohort-patient relationship Model patient-samples relationship Which metadata to store with the PID and which in en extra catalogue?

Guidelines: Characteristics of PIDs What should be identifiable by a PID? Define what is data and what is metadata Information contained in PID entries: Location Checksums System specific information No information on context or contents! Don t mix PIDs with other IDs, e.g. database IDs Opacity: No assumptions about data context in PID 14

The handle system Offers a resolution service for PIDs Gives a lot of freedom for implementation, e.g. PID information types Software architecture designed for high availability and scalability Basis for several PID providers European Persistent Identifier Consortium PIDs and Digital Object Identifiers (EPIC) Employ handle service Provide extended APIs 15

Thank you! 16