Horizon Societies of Symbiotic Robot-Plant Bio-Hybrids as Social Architectural Artifacts. Deliverable D4.1

Similar documents
Open Access to Publications in H2020

Open Access & Open Data in H2020

Inge Van Nieuwerburgh OpenAIRE NOAD Belgium. Tools&Services. OpenAIRE EUDAT. can be reused under the CC BY license

Deliverable Final Data Management Plan

Feed the Future Innovation Lab for Peanut (Peanut Innovation Lab) Data Management Plan Version:

DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM

Deliverable Initial Data Management Plan

EC Horizon 2020 Pilot on Open Research Data applicants

OpenAIRE. Fostering the social and technical links that enable Open Science in Europe and beyond

Horizon2020/EURO Coordination and Support Actions. SOcietal Needs analysis and Emerging Technologies in the public Sector

EUDAT. A European Collaborative Data Infrastructure. Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT

Supporting the H2020 OA Mandate

OpenAIRE Guidelines for Data Archive Managers 1.0 December 2012

BPMN Processes for machine-actionable DMPs

Demos: DMP Assistant and Dataverse

Research Data Repository Interoperability Primer

Horizon 2020 Open Research Data Pilot: What is required? Sarah Jones Digital Curation Centre

How to contribute information to AGRIS

Invenio: A Modern Digital Library for Grey Literature

DELIVERABLE. D3.9 TransformingTransport Open Data Portal

Data Management Plan

OPENAIRE FP7 POST-GRANT OPEN ACCESS PILOT

Update on Dataverse Dryad-Dataverse Community Meeting. Mercè Crosas, Elizabeth Quigley & Eleni Castro. Data Science > IQSS > Harvard University

The OpenAIREplus Project

Invenio: a modern digital library system. <

OpenAIRE Open Knowledge Infrastructure for Europe

Developing a Research Data Policy

Part 2: Current State of OAR Interoperability. Towards Repository Interoperability Berlin 10 Workshop 6 November 2012

Application of machine learning and big data technologies in OpenAIRE system

DRIVER Step One towards a Pan-European Digital Repository Infrastructure

RADAR Project. Data Archival and Publication as a Service. Matthias Razum FIZ Karlsruhe RESEARCH DATA REPOSITORIUM. Zurich, December 15, 2014

CENTAUR: Cost Effective Neural Technique for Alleviation of Urban flood Risk. D4.4 Data Management Plan

Deliverable D7.3 Data Management Plan (M30)

Coordinating Optimisation of Complex Industrial Processes

RADAR. Establishing a generic Research Data Repository: RESEARCH DATA REPOSITORY. Dr. Angelina Kraft

Deliverable No. 4.1 SciChallenge Web Platform Early Prototype (Additional Report)

Integrated Process Control Based on Distributed In-Situ Sensors into Raw Materials and Energy. Project identity and dissemination material schedule

H2020-ICT

Lessons Learned. Implementing Rosetta in the Harold B. Lee Library

OpenAIRE From Pilot to Service The Open Knowledge Infrastructure for Europe

Data Curation Profile Plant Genetics / Corn Breeding

OpenAIRE From Pilot to Service

OpenAIRE Guidelines Promoting Repositories Interoperability and Supporting Open Access Funder Mandates

Research Data Management

Data management Backgrounds and steps to implementation; A pragmatic approach.

Writing a Data Management Plan A guide for the perplexed

The Water SWITCH-ON, Spatial Information Platform (SIP)

How to share research data

Data Management Plans. Sarah Jones Digital Curation Centre, Glasgow

Science Europe Consultation on Research Data Management

EUDAT-B2FIND A FAIR and Interdisciplinary Discovery Portal for Research Data

Reproducibility and FAIR Data in the Earth and Space Sciences

Interdisciplinary Processes at the Digital Repository of Ireland

Research Data Edinburgh: MANTRA & Edinburgh DataShare. Stuart Macdonald EDINA & Data Library University of Edinburgh

OpenAire and BASE. Services supporting the Interoperability of the European Open Science Network. Lyon,

Helix Nebula The Science Cloud

Introduction... 2 Preparation... 2 Implementation... 5 Operation... 7 Marketing... 9 Review and Outlook... 11

Scientific Research Data Management Policy

Tools for Data Management. Research Data Management : Session 3 9 th June 2015

Data publication and discovery with Globus

The European Research Council

EVACUATE PROJECT WEBSITE

FREYA Connected Open Identifiers for Discovery, Access and Use of Research Resources

IM&T Data Management Strategy Overview March 2013 CSIRO INFORMATION MANAGEMENT & TECHNOLOGY

Building a Digital Library Software

D2.2 Web Platform development

The DataCite Metadata Schema. Frauke Ziedorn Workshop: Metadata and Persistent Identifiers for Social and Economic Data 7th May 2012

Reproducibility and Reuse of Scientific Code Evolving the Role and Capabilities of Publishers

Cloud-based Rapid Elastic MAnufacturing

Metal Recovery from Low Grade Ores and Wastes Plus

Helping Journals to Upgrade Data Publications for Reusable Research

The e-depot in practice. Barbara Sierman Digital Preservation Officer Madrid,

Universidad de Extremadura 22 October Elsevier s approach to Open Access and latest developments. Edward Wedel-Larsen. Research Solutions

EUDAT- Towards a Global Collaborative Data Infrastructure

Data Management and Data Management Plans. Dr. Tomasz Miksa. TU Wien & SBA Research

RADAR - A repository for long tail data

Google indexed 3,3 billion of pages. Google s index contains 8,1 billion of websites

Digital The Harold B. Lee Library

B2SAFE metadata management

EUDAT Towards a Collaborative Data Infrastructure

CERN Open Data and Data Analysis Knowledge Preservation

Your Open Science and Research Publishing Platform. 1st SciShops Summer School

HORIZON2020 FRAMEWORK PROGRAMME TOPIC EUK Federated Cloud resource brokerage for mobile cloud services. D7.1 Initial publication package

RADAR A Repository for Long Tail Data

OpenAIRE Guidelines for CRIS Managers

Deliverable D5.3. World-wide E-infrastructure for structural biology. Grant agreement no.: Prototype of the new VRE portal functionality

Horizon 2020 INFRADEV Design studies. RICHFIELDS Working Packages 2 Deliverable D2.2. Project identity. Date delivered: M6

Persistent identifiers, long-term access and the DiVA preservation strategy

Show me the data. The pilot UK Research Data Registry. 26 February 2014

Dataverse and DataTags

Data Exchange and Conversion Utilities and Tools (DExT)

Digital Fabrication and Maker Movement in Education Making Computer supported Artefacts from Scratch

DURAFILE. D1.2 DURAFILE Technical Requirements

RDMG Research Data Management Group at Freiburg University

D5.2 FOODstars website WP5 Dissemination and networking

Showing it all a new interface for finding all Norwegian research output

NSF Data Management Plan Template Duke University Libraries Data and GIS Services

COAR Interoperability Roadmap. Uppsala, May 21, 2012 COAR General Assembly

Research Data Management: lessons learned - and still to learn

D8.1 Project website

Transcription:

Horizon 2020 Societies of Symbiotic Robot-Plant Bio-Hybrids as Social Architectural Artifacts Deliverable D4.1 Data management plan (open research data pilot) Date of preparation: 2015/09/30 Start date of project: 2015/04/01 Project coordinator: UPB Revision: 1 (r81) Duration: 48 months Classification: public Partners: lead: UPB contribution: Project website: http://florarobotica.eu/ H2020-FETPROACT-2014 Deliverable D4.1 Page 1 of 11

DELIVERABLE SUMMARY SHEET Grant agreement number: 640959 Project acronym: Title: flora robotica Societies of Symbiotic Robot-Plant Bio-Hybrids as Social Architectural Artifacts Deliverable N o : Deliverable D4.1 Due date: M6 Delivery date: 2015/09/30 Name: Description: Data management plan (open research data pilot) We report on identified options of data repositories, our data management approach, and individual data set types. Partners owning: UPB Partners contributed: Made available to: public Page 2 of 11 Deliverable D4.1

Contents 1 Introduction 4 2 Specifications for each data set 5 2.1 DATA1 publications.................................. 5 2.1.1 Data set reference and name.......................... 5 2.1.2 Data set description............................... 5 2.1.3 Standards and metadata............................ 5 2.1.4 Data sharing................................... 5 2.1.5 Archiving and preservation (including storage and backup).......... 5 2.2 DATA2 photo & video material of experiments................... 6 2.2.1 Data set reference and name.......................... 6 2.2.2 Data set description............................... 6 2.2.3 Standards and metadata............................ 6 2.2.4 Data sharing................................... 6 2.2.5 Archiving and preservation (including storage and backup).......... 6 2.3 DATA3 developed software.............................. 7 2.3.1 Data set reference and name.......................... 7 2.3.2 Data set description............................... 7 2.3.3 Standards and metadata............................ 7 2.3.4 Data sharing................................... 7 2.3.5 Archiving and preservation (including storage and backup).......... 7 2.4 DATA4 sensor data of experiments.......................... 8 2.4.1 Data set reference and name.......................... 8 2.4.2 Data set description............................... 8 2.4.3 Standards and metadata............................ 8 2.4.4 Data sharing................................... 8 2.4.5 Archiving and preservation (including storage and backup).......... 8 2.5 DATA5 logging & tracking data of experiments................... 9 2.5.1 Data set reference and name.......................... 9 2.5.2 Data set description............................... 9 2.5.3 Standards and metadata............................ 9 2.5.4 Data sharing................................... 9 2.5.5 Archiving and preservation (including storage and backup).......... 9 2.6 DATA6 logging data & images of simulations.................... 10 2.6.1 Data set reference and name.......................... 10 2.6.2 Data set description............................... 10 2.6.3 Standards and metadata............................ 10 2.6.4 Data sharing................................... 10 2.6.5 Archiving and preservation (including storage and backup).......... 10 3 Conclusion 11 Deliverable D4.1 Page 3 of 11

1 Introduction We have done a survey of available data repositories but we could not identify an appropriate repository that would fit the project s focus. This might be due to our nontraditional, innovative, and interdisciplinary approach of combining plant science, computer science, engineering, and architecture. For example, there are multiple data repositories that focus on plant science which are, however, very specific for genetic data, geographical data, etc. Therefore, we have decided to use a multipurpose data repository. We have chosen Zenodo 1 due to its assumed high reliability now and in the future. In addition, we expect to generate big amounts of data. The overhead of storing and handling all of that data seems not justifiably. Hence, we have decided for a hybrid approach of selecting data that we store only for the lifetime of the project on our project s Cloud service hosted by OwnCube e.u. 2 and for which we share data based on individual requests. A more selective choice of data will be uploaded to Zenodo allowing for preservation beyond the project s lifetime. Finally, we plan a strategy of storing almost all data by one service (Zenodo) for simplicity and as a holistic approach. At the same time, we allow to continue ongoing data sharing procedures of members of our project consortium, especially, concerning self-archiving practices of publications (conference papers and journal papers) and open-source availability of developed software via software repositories such as GitHub 3 and sourceforge 4. 1 see http://www.zenodo.org, Zenodo is developed by CERN under the EU FP7 project OpenAIREplus (grant agreement no. 283595). 2 OwnCube e.u., An der Liesing 2-34/7, 1230 Wien, Austria 3 https://github.com/ 4 http://sourceforge.net/ Page 4 of 11 Deliverable D4.1

2 Specifications for each data set 2.1 DATA1 publications 2.1.1. Data set reference and name DATA1 publications (conference papers, journal papers, monographs) 2.1.2. Data set description Scientific and peer-reviewed papers that are published by members of the project consortium. Groups with interest in this type of data set are the readerships of the respective conferences and journals. 2.1.3. Standards and metadata We will use Zenodo 5. Hence, Zenodo also defines and handles the metadata. Citing from their policies 6 : Metadata is licensed under CC0, except for email addresses. All metadata is exported via OAI-PMH and can be harvested. [... ] All metadata is stored internally in MARC [... ]. Metadata is exported in several standard formats such as MARCXML, Dublin Core, and DataCite Metadata Schema according to OpenAIRE Guidelines. Version control is not necessary. Furthermore, we keep for each publication in this data set a BIBTEX entry and its DOI 7. 2.1.4. Data sharing The publications will be uploaded as PDF (Portable Document Format) on the servers of Zenodo. In addition, an incomplete selection of papers will be uploaded to http://arxiv.org and members of the project consortium will also continue their self-archiving practices. 2.1.5. Archiving and preservation (including storage and backup) The data will be curated by Zenodo who state about their retention period: Items will be retained for the lifetime of the repository. This is currently the lifetime of the host laboratory CERN, which currently has an experimental programme defined for the next 20 years at least. [... ] In case of closure of the repository, best efforts will be made to integrate all content into suitable alternative institutional and/or subject based repositories. 8 5 see http://www.zenodo.org, Zenodo is developed by CERN under the EU FP7 project OpenAIREplus (grant agreement no. 283595). 6 https://zenodo.org/policies 7 Digital object identifier, see http://www.doi.org/ 8 https://zenodo.org/policies Deliverable D4.1 Page 5 of 11

2.2 DATA2 photo & video material of experiments 2.2.1. Data set reference and name DATA2 photo & video material of plant & robot experiments 2.2.2. Data set description Generated data, photos and videos taken during experiments with natural plants and robots. Data is of interest for plant scientists and roboticists. It partially underpins scientific publications. The data is probably too specific to allow for integration. 2.2.3. Standards and metadata Data is labeled with experiment ID. Photos are numbered in chronological order or alternatively labeled directly with the time in the experiment (e.g., minutes after start of experiment). Videos are labeled with the experiment ID and numbered chronologically if there are multiple parts. Version control is not necessary. For each experiment dataset there is a short textual description of the experiment itself, when it was started, when it was ended, and who supervised it. For the data that is uploaded to Zenodo, Zenodo defines and handles the metadata (see above). 2.2.4. Data sharing Given the amount of this kind of data that will be generated throughout the project, we do not plan a central data deposit for all of it. Individual photos and videos that are representable for the respective experiment and/or relevant for publications will be stored in the project s Cloud service (OwnCube e.u., An der Liesing 2-34/7, 1230 Wien, Austria) for the duration of the project, shared among project partners, and data requests for that data from outside the project will be handled individually. A smaller selection of that data will be uploaded to the servers of Zenodo and hence allow for easy sharing. 2.2.5. Archiving and preservation (including storage and backup) For the data stored on our cloud service we cannot guarantee preservation beyond the project s lifetime but that is taken care of by the smaller selection of data that will be uploaded to Zenodo (lifetime of the host laboratory CERN is 20+ years, see above). Page 6 of 11 Deliverable D4.1

2.3 DATA3 developed software 2.3.1. Data set reference and name DATA3 developed software 2.3.2. Data set description A variety of different software packages are going to be developed in this project including software to run microprocessors (e.g., Raspberry Pi 9, sensors, actuators, to run complete robot platforms, and also simulation software frameworks. The developed software is mostly of interest for roboticists while some of the simulation software might also be of interest for plant scientists. The software partially underpins scientific publications and is probably too specific to allow for integration. 2.3.3. Standards and metadata For service such as GitHub and sourceforge the metadata is very specific and depends on the respective hosting service. For the data that is uploaded to Zenodo, Zenodo defines and handles the metadata (see above). 2.3.4. Data sharing For software we plan a hybrid approach. For software that is under development (i.e., within the project s lifetime) we are going to use a project-internal software versioning and revision control tool based on Apache Subversion (SVN) 10 which is hosted by the Zentrum für Informationsund Medientechnologien (IMT) of the University of Paderborn (UPB, project coordinator). For software releases we are going to use standard and well accepted hosting services for scientific open-source software such as GitHub 11 and sourceforge 12 but also to provide the software via Zenodo 13. 2.3.5. Archiving and preservation (including storage and backup) For the software stored on our project-internal software versioning system we cannot guarantee preservation beyond the project s lifetime but that is taken care of by the released software packages that will be uploaded to Zenodo (lifetime of the host laboratory CERN is 20+ years, see above). 9 https://www.raspberrypi.org 10 https://subversion.apache.org/ 11 https://github.com/ 12 http://sourceforge.net/ 13 see http://www.zenodo.org Deliverable D4.1 Page 7 of 11

2.4 DATA4 sensor data of experiments 2.4.1. Data set reference and name DATA4 sensor data of plant & robot experiments 2.4.2. Data set description Generated sensor data acquired during experiments with natural plants and robots. This includes data from sensors monitoring the plants (temperature, humidity, etc.) and from sensors of the robots (proximity sensors, force sensors, etc.). Data is of interest for plant scientists and roboticists. It partially underpins scientific publications. The data is probably too specific to allow for integration. 2.4.3. Standards and metadata Data is labeled with experiment ID, numbered in chronological order or alternatively labeled directly with the time in the experiment. For each experiment dataset there is a short textual description of the experiment itself, when it was started, when it was ended, and who supervised it. For the data that is uploaded to Zenodo, Zenodo defines and handles the metadata (see above). Version control is not necessary. 2.4.4. Data sharing Given the amount of this kind of data that will be generated throughout the project, we do not plan a central data deposit for all of it. Individual photos and videos that are representable for the respective experiment and/or relevant for publications will be stored in the project s Cloud service (OwnCube e.u., An der Liesing 2-34/7, 1230 Wien, Austria) for the duration of the project, shared among project partners, and data requests for that data from outside the project will be handled individually. A smaller selection of that data will be uploaded to the servers of Zenodo and hence allow for easy sharing. 2.4.5. Archiving and preservation (including storage and backup) For the data stored on our cloud service we cannot guarantee preservation beyond the project s lifetime but that is taken care of by the smaller selection of data that will be uploaded to Zenodo (lifetime of the host laboratory CERN is 20+ years, see above). Page 8 of 11 Deliverable D4.1

2.5 DATA5 logging & tracking data of experiments 2.5.1. Data set reference and name DATA5 logging & tracking data of plant & robot experiments 2.5.2. Data set description Generated data acquired during experiments with natural plants and robots including data generated by image processing of photos and logging data of our hardware systems. Data is of interest for plant scientists and roboticists. It partially underpins scientific publications. The data is probably too specific to allow for integration. 2.5.3. Standards and metadata Data is labeled with experiment ID, numbered in chronological order or alternatively labeled directly with the time in the experiment. For each experiment dataset there is a short textual description of the experiment itself, when it was started, when it was ended, and who supervised it. For the data that is uploaded to Zenodo, Zenodo defines and handles the metadata (see above). Version control is not necessary. 2.5.4. Data sharing Given the amount of this kind of data that will be generated throughout the project, we do not plan a central data deposit for all of it. Individual photos and videos that are representable for the respective experiment and/or relevant for publications will be stored in the project s Cloud service (OwnCube e.u., An der Liesing 2-34/7, 1230 Wien, Austria) for the duration of the project, shared among project partners, and data requests for that data from outside the project will be handled individually. A smaller selection of that data will be uploaded to the servers of Zenodo and hence allow for easy sharing. 2.5.5. Archiving and preservation (including storage and backup) For the data stored on our cloud service we cannot guarantee preservation beyond the project s lifetime but that is taken care of by the smaller selection of data that will be uploaded to Zenodo (lifetime of the host laboratory CERN is 20+ years, see above). Deliverable D4.1 Page 9 of 11

2.6 DATA6 logging data & images of simulations 2.6.1. Data set reference and name DATA6 logging data & images of simulations 2.6.2. Data set description Generated data acquired during simulations of natural plants and robots including logging data of simulations, screenshots, and other visual representations of data. Data is of interest for plant scientists and roboticists. It partially underpins scientific publications. The data is probably too specific to allow for integration. 2.6.3. Standards and metadata Data is labeled with experiment ID, numbered in chronological order or alternatively labeled directly with the time in the experiment. For each experiment dataset there is a short textual description of the experiment itself, when it was started, when it was ended, and who supervised it. For the data that is uploaded to Zenodo, Zenodo defines and handles the metadata (see above). Version control is not necessary. 2.6.4. Data sharing Given the amount of this kind of data that will be generated throughout the project, we do not plan a central data deposit for all of it. Individual photos and videos that are representable for the respective experiment and/or relevant for publications will be stored in the project s Cloud service (OwnCube e.u., An der Liesing 2-34/7, 1230 Wien, Austria) for the duration of the project, shared among project partners, and data requests for that data from outside the project will be handled individually. A smaller selection of that data will be uploaded to the servers of Zenodo and hence allow for easy sharing. 2.6.5. Archiving and preservation (including storage and backup) For the data stored on our cloud service we cannot guarantee preservation beyond the project s lifetime but that is taken care of by the smaller selection of data that will be uploaded to Zenodo (lifetime of the host laboratory CERN is 20+ years, see above). Page 10 of 11 Deliverable D4.1

3 Conclusion A data management plan is not a fixed document, instead it might be refined later during the project. The feasibility and utility of this data management plan will be monitored during the next months and years of the project. Also the number of data requests from outside the consortium will be recorded. Depending on that experience we will consider appropriate actions and changes of this plan. Deliverable D4.1 Page 11 of 11