RADAR Project. Data Archival and Publication as a Service. Matthias Razum FIZ Karlsruhe RESEARCH DATA REPOSITORIUM. Zurich, December 15, 2014

Similar documents
Archivierung und Publikation von Forschungsdaten mit RADAR

RADAR A Repository for Long Tail Data

RADAR. Establishing a generic Research Data Repository: RESEARCH DATA REPOSITORY. Dr. Angelina Kraft

RADAR Introduction and Basic Concepts. Matthias Razum

RADAR - A repository for long tail data

Improving a Trustworthy Data Repository with ISO 16363

VI-SEEM Data Repository. Presented by: Panayiotis Charalambous

EUDAT-B2FIND A FAIR and Interdisciplinary Discovery Portal for Research Data

Illinois Data Bank Metadata Documentation

Making research data repositories visible and discoverable. Robert Ulrich Karlsruhe Institute of Technology

escidoc-based Virtual Research Environments Matthias Razum Frank Schwichtenberg

Reproducibility and FAIR Data in the Earth and Space Sciences

GEOSS Data Management Principles: Importance and Implementation

re3data.org - Making research data repositories visible and discoverable

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal

ZB MED Information Center Life Sciences

PubMan Workshop. This work is licensed under a Creative Commons Attribution 3.0 Germany License

DataSTORRE Deposit Guide

Data Management Checklist

Developing a Research Data Policy

SHARING YOUR RESEARCH DATA VIA

EUDAT & SeaDataCloud

Science Europe Consultation on Research Data Management

Horizon Societies of Symbiotic Robot-Plant Bio-Hybrids as Social Architectural Artifacts. Deliverable D4.1

DOIs for Research Data

Quality Assured (QA) data

For Attribution: Developing Data Attribution and Citation Practices and Standards

Research Data Management and Institutional Repositories

Scientific Research Data Management Policy

Welcome to the Pure International Conference. Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018

Indiana University Research Technology and the Research Data Alliance

Data management Backgrounds and steps to implementation; A pragmatic approach.

Data management and discovery

Edinburgh DataShare: Tackling research data in a DSpace institutional repository

Conducting a Self-Assessment of a Long-Term Archive for Interdisciplinary Scientific Data as a Trustworthy Digital Repository

Interdisciplinary Processes at the Digital Repository of Ireland

Checklist and guidance for a Data Management Plan, v1.0

re3data.org Registry of Research Data Repositories Peter Schirmbacher Humboldt-Universität zu Berlin ETD Hong Kong, September 25.

DCH-RP Trust-Building Report

Chemotion funded by. Göttingen eresearch Toolbox Series - Electronic Note Keeping. Nicole Jung.

EUROPEANA METADATA INGESTION , Helsinki, Finland

OpenAIRE. Fostering the social and technical links that enable Open Science in Europe and beyond

An overview of the OAIS and Representation Information

How to make your data open

DATA MANAGEMENT PLANS Requirements and Recommendations for H2020 Projects. Matthias Razum April 20, 2018

Data publication and discovery with Globus

Building on to the Digital Preservation Foundation at Harvard Library. Andrea Goethals ABCD-Library Meeting June 27, 2016

PERSISTENT IDENTIFIERS FOR THE UK: SOCIAL AND ECONOMIC DATA

T I F F A N Y C. C H A O G r a d u a t e S c h o o l o f L i b r a r y a n d I n f o r m a t i o n S c i e n c e U n i v e r s i t y o f I l l i n o

Inge Van Nieuwerburgh OpenAIRE NOAD Belgium. Tools&Services. OpenAIRE EUDAT. can be reused under the CC BY license

EUDAT. Towards a pan-european Collaborative Data Infrastructure

GSCB-Workshop on Ground Segment Evolution Strategy

Paving the Rocky Road Toward Open and FAIR in the Field Sciences

Open Access to Publications in H2020

Medici for Digital Cultural Heritage Libraries. George Tsouloupas, PhD The LinkSCEEM Project

Horizon 2020 and the Open Research Data pilot. Sarah Jones Digital Curation Centre, Glasgow

Data Curation Handbook Steps

Specific requirements on the da ra metadata schema

BPMN Processes for machine-actionable DMPs

Using DCAT-AP for research data

Pascal Gilles H-EOP-GT. Meeting ESA-FFG-Austrian Actors ESRIN, 24 th May 2016

Every Bit Counts. Publication and Citation of Data in the Earth Sciences MG&G Data Systems Advisory Committee Meeting 2009 Jens Klump et al.

Data Curation: Technical Challenges Facing Repositories. Brianna Marshall Jan. 9, 2014

DATA SHARING FOR BETTER SCIENCE

PID System for eresearch

Data Archival and Dissemination Tools to Support Your Research, Management, and Education

Dataverse and DataTags

Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard

Dataset Documentation Reference Guide for Pure Users

Linda Strick Fraunhofer FOKUS. EOSC Summit - Rules of Participation Workshop, Brussels 11th June 2018

The DOI Identifier. Drexel University. From the SelectedWorks of James Gross. James Gross, Drexel University. June 4, 2012

Perspectives on Open Data in Science Open Data in Science: Challenges & Opportunities for Europe

EUDAT Towards a Collaborative Data Infrastructure

re3data.org Registry of Research Data Repositories

Deliverable 6.4. Initial Data Management Plan. RINGO (GA no ) PUBLIC; R. Readiness of ICOS for Necessities of integrated Global Observations

About Knowledge Convergence. e-infrastructures Austria an interdisciplinary case study concerning research resources and their management

Data Curation Profile Plant Genomics

DRS Update. HL Digital Preservation Services & Library Technology Services Created 2/2017, Updated 4/2017

Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation

COAR Interoperability Roadmap. Uppsala, May 21, 2012 COAR General Assembly

CODE AND DATA MANAGEMENT. Toni Rosati Lynn Yarmey

Ensuring Proper Storage for Earth Science Data: The USGS Process to Certify Trusted Digital Repositories

Transitioning to Symyx

OpenAIRE Guidelines Promoting Repositories Interoperability and Supporting Open Access Funder Mandates

Persistent Identifier the data publishing perspective. Sünje Dallmeier-Tiessen, CERN 1

Future Core Ground Segment Scenarios

The Canadian Information Network for Research in the Social Sciences and Humanities.

Swedish National Data Service, SND Checklist Data Management Plan Checklist for Data Management Plan

NRF Open Access Statement

Facilitate Open Science Training for European Research

Tools for Data Management. Research Data Management : Session 3 9 th June 2015

DRIVER Step One towards a Pan-European Digital Repository Infrastructure

EUDAT. A European Collaborative Data Infrastructure. Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT

EUDAT. Towards a pan-european Collaborative Data Infrastructure

Data Archiving and Networked Services. Valentijn Gilissen, MA

Data Discovery - Introduction

The DataCite Metadata Schema. Frauke Ziedorn Workshop: Metadata and Persistent Identifiers for Social and Economic Data 7th May 2012

Trust and Certification: the case for Trustworthy Digital Repositories. RDA Europe webinar, 14 February 2017 Ingrid Dillo, DANS, The Netherlands

Adding Research Datasets to the UWA Research Repository

DCH-RP and PREFORMA Two case studies on the digital preservation of cultural heritage

Transcription:

RADAR Project Data Archival and Publication as a Service RESEARCH DATA REPOSITORIUM Matthias Razum FIZ Karlsruhe

The RADAR Project in a Nutshell RADAR = Research Data Repository Goal: Establish a interdisciplinary research data repository Project website: Project duration: September 2013 August 2015; potential extension for one more year Funded by 2

Focus of the Project Archival of research data as a generic service Long tail of research data (not Big Data ) Offerings Basic service: interdisciplinary data archival Extended service: data publication Operational research data management is out of scope 3

RADAR and the Domain Model 1. Private Domain 2. Collaborative Domain 3. Public Domain Archive 4. Dissemination Domain Researchers Workplace Institutional Infrastructure RADAR 2 Offerings: 1. Archival 2. Archival + Publication Portals, Researchers Data Selection Data Documentation Data Typen / Data Formats Business Model Infrastructure Software Metadata Standards Persistent Identifiers Contracts Interfaces Re-use Based on: Treloar, A., Harboe-Ree, C. (2008) Data management and the curation continuum. How the Monash experience is informing repository relationships. VALA2008 14th Biennial Conference, Melbourne and Klump, J. (2009) Managing the Data Continuum. Online: http://oa.helmholtz.de/fileadmin/user_upload/data_continuum/klump.pdf DataCite, Publishers 4

Envisioned Scope of Services /1 Reliable storage space for research data Generic metadata schema Managing license metadata Managing access rights Access may be restricted to the institution providing the data (resp. another authorized party) and service operator 5

Envisioned Scope of Services /2 Regular fixity checks Assign persistent identifiers (e.g., DOI or Handle) on data set or file level Management of storage quotas Bitstream preservation No functional long-term preservation! 6

Target Audience Researchers Archive (and publish) project-based research data Libraries and Research Institutions Institutional data archival Integration with existing institutional portals Cultural Heritage Organizations Long-term preservation of digitized materials Online access to web derivates Publishers Infrastructure for providing access to research data linked to publications 7

STEPS TO DATA PRESERVATION 8

Partners and Roles Business model SW Development FIZ IPB Scientific requirements LMU Operation of data center SCC RADAR TIB Contacts to publishers and learned societies Bitstream Preservation Data publication 9

RADAR Work Packages AP 1: Project Management (TIB/FIZ) AP 2: Requirements Analysis AP 3: Metadata Profiles AP 4: Data Management AP 5: Data Publication AP 6: Business Model and Legal Framework (IPB/LMU) (IPB/LMU, TIB) (FIZ/SCC) (TIB) (FIZ, SCC) AP 7: Evaluation (IPB/LMU) 10

RADAR Architecture Schematic Overview A User Interface API Management Repository Separation of the repository as a metadata store and the business logic from the data center via Storage API Aim: usage of more than one data center Storage API Data Center Storage interface 11

RADAR Architecture Detailed View A 12

A B Two services A Archiving B Data publication 13

SERVICE TYPE A: Archiving/Preservation A Aim: Trustworthy data preservation For whom? Completed research projects Internal resources, not part of a publication Handle Properties: Minimum metadata set (9 parameters) Handle Variable retention period: 5 to 15 years Bitstream preservation for storage period Regular reports on data integrity Access rights for selected groups/users 14

SERVICE TYPE B: Data publication with integrated preservation B Aim: Trustworthy preservation & traceable publication DOI DOI API For whom? Projects: Data basis for scientific papers Independent data publications (e.g. negative data) Digital representations Properties: Expanded metadata set for discipline-specific data DOI Unlimited storage period Regular reports on downstream use to data provider Access management (embargo & publisher services) 15

METADATA SCHEMA Mandatory properties * Identical to properties 1. identifier* Handle, DOI* 2. creator* Persons involved in producing the data 3. title* Study/Data title 4. publisher* Corporate/Institutional or personal name 5. production year Year, in which data was created or refers to 6. subject area Scientific fields appropriate for the resource 7. resource Resource s content (dataset, model, software ) 8. rights* Rights management statement (CC BY ) 9. rightsholder Institution/Person holding rights 16

METADATA SCHEMA Optional properties - for discipline-specific data descriptions 10. additional title Complementary textual information 11. description Further information (abstract ) 12. keyword Keywords describing the subject focus 13. contributor Associated institution/person (funder ) 14. language* Main language used or relevant to resource 15. alternate identifier* Unique string within its domain of issue (local identifier ) 16. related identifier* Identifiers of related resources 17. geo location* Region/Place where resource originated/refers to 18. data source Data origin (instrument, observation, trial ) 19. software type Software used for data production/processing/viewing 20. data processing Specifies further processing (statistics ) 21. related information Further information (database number ) * Identical to properties 17

RADAR Roadmap Software development 1. Middleware infrastructure 2. Archival service 3. Publikation service DSA certification Roll-out to further disciplines Workflows and interfaces to data providers 18

RESEARCH DATA REPOSITORIUM Questions?