OpenAIRE Guidelines for Data Archive Managers 1.0 December 2012

Similar documents
OpenAIRE Guidelines Promoting Repositories Interoperability and Supporting Open Access Funder Mandates

Illinois Data Bank Metadata Documentation

The DataCite Metadata Schema. Frauke Ziedorn Workshop: Metadata and Persistent Identifiers for Social and Economic Data 7th May 2012

NTM-Metadata-Schema Metadata-Schema for non-textual Materials

OpenAIRE Guidelines for CRIS Managers: Supporting Interoperability of Open Research Information through established standards

Part 2: Current State of OAR Interoperability. Towards Repository Interoperability Berlin 10 Workshop 6 November 2012

The OpenAIREplus Project

OpenAIRE Guidelines. Release 3.0. OpenAIRE

Open Access to Publications in H2020

OpenAIRE. Fostering the social and technical links that enable Open Science in Europe and beyond

OpenAIRE From Pilot to Service

DOIs for Research Data

OpenAIRE From Pilot to Service The Open Knowledge Infrastructure for Europe

OpenAIRE Open Knowledge Infrastructure for Europe

DataCite Metadata Schema Documentation for the Publication and Citation of Research Data

RADAR. Establishing a generic Research Data Repository: RESEARCH DATA REPOSITORY. Dr. Angelina Kraft

MACHINE ACTIONABLE INTEGRATION OF DATACITE AND DDI METADATA

Horizon Societies of Symbiotic Robot-Plant Bio-Hybrids as Social Architectural Artifacts. Deliverable D4.1

EUDAT-B2FIND A FAIR and Interdisciplinary Discovery Portal for Research Data

Towards repository interoperability

SHARING YOUR RESEARCH DATA VIA

From Open Data to Data- Intensive Science through CERIF

Interoperability Patterns in Digital Library Systems Federations. Paolo Manghi, Leonardo Candela, Pasquale Pagano

On Constructing Repository Infrastructures The D-NET Software Toolkit

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal

EUDAT. A European Collaborative Data Infrastructure. Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT

Increasing access to OA material through metadata aggregation

Specific requirements on the da ra metadata schema

Nuno Freire National Library of Portugal Lisbon, Portugal

OPENAIRE FP7 POST-GRANT OPEN ACCESS PILOT

Research Data Repository Interoperability Primer

COAR Interoperability project and Usage Statistcs

Personal Digital Information Project, Part 2: Hands-on Exercise

GEOSS Data Management Principles: Importance and Implementation

Using DCAT-AP for research data

COAR Interoperability Roadmap. Uppsala, May 21, 2012 COAR General Assembly

FREYA Connected Open Identifiers for Discovery, Access and Use of Research Resources

DataSTORRE Deposit Guide

META-SHARE: An Open Resource Exchange Infrastructure for Stimulating Research and Innovation

Orbis Cascade Alliance Content Creation & Dissemination Program Digital Collections Service. Enabling OAI & Mapping Fields in Digital Commons

OpenAIRE Guidelines for CRIS Managers

Persistent Identifier the data publishing perspective. Sünje Dallmeier-Tiessen, CERN 1

DOIs for Scientists. Kirsten Sachs Bibliothek & Dokumentation, DESY

Interoperability Framework Recommendations

Registry Interchange Format: Collections and Services (RIF-CS) explained

How to contribute information to AGRIS

Networking European Digital Repositories

Science Europe Consultation on Research Data Management

Towards a joint service catalogue for e-infrastructure services

NRF Open Access Statement

Large Scale Repository Auditing to ISO José Carvalho

Scholix Metadata Schema for Exchange of Scholarly Communication Links

RADAR - A repository for long tail data

Inge Van Nieuwerburgh OpenAIRE NOAD Belgium. Tools&Services. OpenAIRE EUDAT. can be reused under the CC BY license

OpenAire and BASE. Services supporting the Interoperability of the European Open Science Network. Lyon,

University of Bath. Publication date: Document Version Publisher's PDF, also known as Version of record. Link to publication

The Case for Interoperability for Open Access Repositories

Swedish National Data Service, SND Checklist Data Management Plan Checklist for Data Management Plan

A Model for Fine-Grained Data Citation

The Metadata Challenge:

Progress towards the EOSC

Infrastructure for the UK

Networking European Digital Repositories

Tracking data usage at NCAR s Research Data Archive. Steven Worley Computational and Information System Laboratory NCAR

Services to Make Sense of Data. Patricia Cruse, Executive Director, DataCite Council of Science Editors San Diego May 2017

Networking European Digital Repositories

The OpenAIRE Infrastructure

DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM

ISMTE Best Practices Around Data for Journals, and How to Follow Them" Brooks Hanson Director, Publications, AGU

CORE: Improving access and enabling re-use of open access content using aggregations

Comparing Open Source Digital Library Software

Dataset Documentation Reference Guide for Pure Users

Interoperability & Archives in the European Commission

Introduction to INEXDA s Metadata Schema

Persistent identifiers: jnbn, a JEE application for the management of a national NBN infrastructure

The DOI Identifier. Drexel University. From the SelectedWorks of James Gross. James Gross, Drexel University. June 4, 2012

PERSISTENT IDENTIFIERS FOR THE UK: SOCIAL AND ECONOMIC DATA

RADAR Project. Data Archival and Publication as a Service. Matthias Razum FIZ Karlsruhe RESEARCH DATA REPOSITORIUM. Zurich, December 15, 2014

User guide. Created by Ilse A. Rasmussen & Allan Leck Jensen. 27 August You ll find Organic Eprints here:

Building the CIARD Framework for Data and Information Sharing: the case of France & INRA

EUDAT Towards a Collaborative Data Infrastructure

Metadata Standards and Applications

1. CONCEPTUAL MODEL 1.1 DOMAIN MODEL 1.2 UML DIAGRAM

The European Commission s science and knowledge service. Joint Research Centre

The Design of a DLS for the Management of Very Large Collections of Archival Objects

OAI Static Repositories (work area F)

Data is the new Oil (Ann Winblad)

DRIVER: Building the Network for Accessing Digital Repositories across Europe

Metadata Harvesting Framework

Facilitate Open Science Training for European Research

ISO 2146 INTERNATIONAL STANDARD. Information and documentation Registry services for libraries and related organizations

Show me the data. The pilot UK Research Data Registry. 26 February 2014

How to share research data

BPMN Processes for machine-actionable DMPs

OAI-PMH. DRTC Indian Statistical Institute Bangalore

The Sunshine State Digital Network

RADAR A Repository for Long Tail Data

META-SHARE : the open exchange platform Overview-Current State-Towards v3.0

Content domain. Leonardo Candela, Paolo Manghi, Carlo Meghini. DL.org Autumn School Athens, 3-8 October th October 2010

Horizon 2020 and the Open Research Data pilot. Sarah Jones Digital Curation Centre, Glasgow

Transcription:

OpenAIRE Guidelines for Data Archive Managers 1.0 December 2012 OpenAIRE Guidelines for Data Archive Managers 1.0 Page 1 of 15

Contents Introduction... 3 Aim... 3 DataCite... 3 What s different... 3 Acknowledgements & Contributors... 5 Editors... 5 Experts & Reviewers... 5 License... 6 Version... 6 OpenAIRE application of DataCite Metadata Schema... 7 Access rights information... 12 Funding information... 12 Related publications and datasets information... 12 Embargo date information... 13 Use of OAI-PMH... 14 Metadata Format... 14 OpenAIRE OAI Set... 14 Set content... 14 Futher instructions... 15 OpenAIRE Guidelines for Data Archive Managers 1.0 Page 2 of 15

Introduction Aim The OpenAIRE Guidelines for Data Archive Managers 1.0 will provide instruction for data archive managers to expose their metadata in a way that is compatible with the OpenAIRE infrastructure. Data archives should be included in the OpenAIRE information space when data are related to a document e.g. a dataset cited by an article. By implementing the OpenAIRE Guidelines data archive managers are facilitating the creation of enhanced publications and building the stepping-stones for a linked data infrastructure for research. DataCite OpenAIRE has adopted the DataCite Metadata Schema as the basis for harvesting and importing metadata about datasets from data archives. The core mission of DataCite is to build and maintain a sustainable framework that makes it possible to cite data through the use of persistent identifiers, DOIs. OpenAIRE shares the goal of the DataCite Metadata Schema - to provide a domain agnostic metadata schema and provide interoperability through a small number of properties - making interoperability possible in the simplest manner possible and as a result keep the technical barriers for implementation as low as possible. We would like to thank DataCite for their kind support and help making these OpenAIRE guidelines for Data Archive Managers possible. What s different In this document you will find the needed properties to make your data archive compatible with the OpenAIRE infrastructure. OpenAIRE has adopted the DataCite Metadata Schema version 2.2 with some minor adjustments. OpenAIRE accepts other persistent identifier schemes and not only a DOI in 1.1 identifiertype. OpenAIRE recommends links to related publications and datasets i.e. 12. RelatedIdentifier property and attributes are mandatory when applicable. OpenAIRE Guidelines for Data Archive Managers 1.0 Page 3 of 15

OpenAIRE recommends using the 7. Contributor property to relate a dataset to funding information. DataCite optional properties 8. Date and 17. Description are mandatory in OpenAIRE. OpenAIRE enforces encoding schemes on DataCite property 16. Rights. OpenAIRE Guidelines for Data Archive Managers 1.0 Page 4 of 15

Acknowledgements & Contributors Editors Mikael K. Elbæk (Technical University of Denmark) Lars Holm Nielsen (CERN, Switzerland) Experts & Reviewers Samuele Kaplun (CERN, Switzerland) Paolo Manghi (CNR, Italy) Natalia Manola (University of Athens, Greece) Jochen Schirrwagen (University of Bielefeld, Germany) Birgit Schmidt (University of Goettingen,Germany) Mathias Lösch (University of Bielefeld, Germany) Frauke Ziedorn (DataCite, Germany) Pedro Principe (University of Minho, Portugal) Eloy Rodrigues (University of Minho, Portugal) Najla Rettberg (University of Goettingen, Germany) OpenAIRE Guidelines for Data Archive Managers 1.0 Page 5 of 15

License This work is licensed under the Creative Commons Attribution-ShareAlike 3.0 Unported License. Version 1.0 December 2012 Initial document OpenAIRE Guidelines for Data Archive Managers 1.0 Page 6 of 15

OpenAIRE application of DataCite Metadata Schema OpenAIRE builds on the DataCite Metadata Schema by making some of the otherwise optional DataCite properties mandatory, as well as enforcing more specific encoding schemes on the values of some DataCite properties. The table below details the differences from the DataCite Metadata Schema. Within OpenAIRE the use of elements and attributes is either: mandatory (M) = the element must always be present in the metadata record. An empty element is not allowed. mandatory when applicable (MA) = when the element can be obtained it must be present in the metadata record recommended (R) = the use of the element is recommended optional (O) = it is not important whether the element is used or not The recommended status is made primarily to encourage users to input certain elements when creating a metadata record to enhance services. ID DataCite-property Status Encoding schemes (if different from DataCite) 1 1 Identifier M DOI (Digital Object Identifier) is preferred. The format should be 10.1234/foo. 1.1 identifiertype M OpenAIRE allows for others types of identifiers than DOIs compared to DataCite. Controlled List: Allowed values: ARK DOI Handle PURL URI 1 Only differences to the DataCite Metadata Schema v2.2 are noted here. Please refer to encoding schemes and allowed values in http://schema.datacite.org/meta/kernel- OpenAIRE Guidelines for Data Archive Managers 1.0 Page 7 of 15

2 Creator M 2.1 creatorname M None, Recommended practice is family name, given name(s) 2.2 nameidentifier R 2.2.1 nameidentifierscheme R 3 Title M 3.1 titletype O 4 Publisher M 5 PublicationYear M 6 Subject O 6.1 subjectscheme O 7 Contributor MA Mandatory when applicable property in OpenAIRE instead of optional in DataCite. 7.1 contributortype MA 7.2 contributorname MA Applicable only when contributortype is Funder : Name of the funding entity. Example for European Commission funded research use European Commission. Specifically do not use the project acronym. 7.3 nameidentifier MA Applicable only when contributortype is Funder : A vocabulary of projects will be exposed by OpenAIRE through OAI-PMH, and available for all repository managers. Values will include the project name and projectid. OpenAIRE Guidelines for Data Archive Managers 1.0 Page 8 of 15

The projectid equals the Grant Agreement number, and is defined by the namespace 2 info:eurepo/grantagreement/funder/fund ingprogram/projectnumber/[juris diction]/[projectname]/[project Acronym]/ For OpenAIRE compatibility, the first three elements are mandatory and the last three are optional. Repositories may choose to use only the first part of the namespace (as in previous versions of the Guidelines), or use the extended version with six parts, which is recommended. When omitting fields in the extended version, the number of fields must nevertheless be preserved by using /. A correct example for omitting the the ProjectName field would therefore look like this: EC/FP7/12345/EU//OpenAIREplus 7.3.1 nameidentfierscheme MA Applicable only when contributortype is Funder : Controlled List Allowed values: 8 Date M Mandatory property in OpenAIRE instead of optional in DataCite. info 8.1 datetype M Use Issued for the date the resource is published or distributed. To indicate the end of an embargo period, use Available. 2 See http://wiki.surffoundation.nl/display/standards/info-eu-repo/#info-eu-repo- GrantAgreementIdentifiers OpenAIRE Guidelines for Data Archive Managers 1.0 Page 9 of 15

To indicate the start of an embargo period, use Accepted. Further datetypes may be specified. Please refer to DataCite Metadata Schema v2.2 for details. 9 Language R 10 ResourceType R 10.1 resourcetypegeneral R 11 AlternateIdentifier O 11.1 alternateidentifiertype O 12. RelatedIdentifier MA Mandatory when applicable property in OpenAIRE instead of optional in DataCite. 12.1 relatedidentifiertype MA Controlled List Allowed values (most common, please refer to DataCite Metadata Schema v2.2 for a complete list) 12.2 relationtype MA 13 Size O 14 Format O 15 Version O IsCitedBy (indicates that B includes A in a citation) Cites (indicates that A includes B in a citation) HasPart (indicates A includes the part B) IsReferencedBy (indicates A is used as a source of information by B) 16 Rights MA Mandatory when applicable property in OpenAIRE instead of optional in DataCite. Use values from vocabulary Access Rights at http://wiki.surf.nl/display/standards/info- OpenAIRE Guidelines for Data Archive Managers 1.0 Page 10 of 15

eu-repo/#info-eu-repo-accessrights; values are: info:eurepo/semantics/closedaccess info:eurepo/semantics/embargoedaccess info:eurepo/semantics/restrictedaccess info:eurepo/semantics/openaccess If the material is licensed under a Creative Commons license then you should provide links to applicable Creative Commons licenses, e.g.: http://creativecommons.org/lice nses/zero/1.0/ http://creativecommons.org/lice nses/by/3.0/ 17 Description MA Mandatory when applicable property in OpenAIRE instead of optional in DataCite. 17.1 descriptiontype MA The OpenAIRE Guidelines for Data Archive Managers are built on the DataCite Metadata Schema 3. The following properties are described to make full integration to the OpenAIRE information space and allow OpenAIRE to make links between publications and datasets; datasets and funding information. 3 DataCite Metadata Schema version 2.2: http://schema.datacite.org/meta/kernel- 2.2/index.html OpenAIRE Guidelines for Data Archive Managers 1.0 Page 11 of 15

Access rights information OpenAIRE uses the access rights to enable a better user experience by declaring the access rights clear and explicit in the portal. Access rights are specified using the 16. Rights property. Please see encoding scheme in the table above. An example: <rights>info:eu-repo/semantics/openaccess</rights> Funding information One of OpenAIRE s main goals is to link research output to (EC) research funding. The following application of the contributor property allows unique and persistent identification of the funder who has funded wholly or partly the dataset described. The following properties are mandatory when applicable to provide funding information: 7. Contributor; 7.1 contributortype; 7.2 contributername; 7.3 nameidentifier; 7.3.1 nameidentifierscheme: An example for linking a research output to OpenAIREplus FP7 project: <contributor contributortype="funder"> <contributorname> European Commission </contributorname> <nameidentifier nameidentifierscheme="info"> info:eu-repo/grantagreement/ec/fp7/282896 </nameidentifier> </contributor> Related publications and datasets information For OpenAIRE datasets are only relevant when they are related to a publication, therefore it is mandatory when applicable for dataset records that OpenAIRE shall harvest to have information about the related publication. DataCite Metadata Schema allows the description of the resource being registered (A) typically a dataset but not limited to that and the related resource (B) in the case of OpenAIRE typically a publication. Related publications/datasets must have the following properties: 12. RelatedIdentifier, 12.1 relatedidentifiertype, 12. relationtype An example: OpenAIRE Guidelines for Data Archive Managers 1.0 Page 12 of 15

<relatedidentifier relatedidentifiertype="doi" relationtype="isreferenced By"> 10.1234/foo </relatedidentifier> Embargo date information For OpenAIRE two main types of dates are relevant. When the data were made available, published or uploaded to a formal database, this is the date the data were Issued. Sometimes data may be embargoed for a period; this information should be managed by the data provider and expressed by exporting an Available date to indicate the end of an embargo period and an Accepted date to indicate the start of an embargo period. An example: <dates> <date datetype="issued">2011-12-01</date> </dates> An embargo example: <dates> <date datetype="accepted">2011-12-01</date> <date datetype="available">2012-12-01</date> </dates> OpenAIRE Guidelines for Data Archive Managers 1.0 Page 13 of 15

Use of OAI-PMH OpenAIRE uses the OAI-PMH v2.0 protocol for harvesting dataset metadata. Metadata Format OpenAIRE expects metadata to be encoded in the DataCite metadata format (metadataprefix oai_datacite) 4. For information on how to use the individual DataCite elements, please refer to OpenAIRE application as documented in these guidelines and the DataCite Metadata Schema version 2.2. OpenAIRE OAI Set For harvesting the records relevant to OpenAIRE, the use of a specific OAI set at the local repository is mandatory. The set must have the following characteristics: setname OpenAIRE_data setspec* openaire_data *A harvester only uses the setspec request to perform selective harvesting. The letters must be in small caps. Set content The specific content of the openaire_data set is to be determined at the local repository. Publications to be inserted in the OpenAIRE set must conform to the following criteria: All the resources that are related to peer-reviewed articles accepted for publishing. 4 See http://schema.datacite.org/oai/oai-1.0/ and http://oai.datacite.org/ OpenAIRE Guidelines for Data Archive Managers 1.0 Page 14 of 15

Futher instructions Best practices and cases of implementations will be available on the OpenAIRE support wiki TBA. OpenAIRE Guidelines for Data Archive Managers 1.0 Page 15 of 15