Science Europe Consultation on Research Data Management

Similar documents
Linda Strick Fraunhofer FOKUS. EOSC Summit - Rules of Participation Workshop, Brussels 11th June 2018

EC Horizon 2020 Pilot on Open Research Data applicants

DATA MANAGEMENT PLANS Requirements and Recommendations for H2020 Projects. Matthias Razum April 20, 2018

Checklist and guidance for a Data Management Plan, v1.0

NSF Data Management Plan Template Duke University Libraries Data and GIS Services

Data Management Checklist

Make your own data management plan

Swedish National Data Service, SND Checklist Data Management Plan Checklist for Data Management Plan

How to share research data

Focus: Themes within Introduction and Context

GEOSS Data Management Principles: Importance and Implementation

Data Management Plans. Sarah Jones Digital Curation Centre, Glasgow

Developing a Research Data Policy

Scientific Research Data Management Policy

Data Management Plan Generic Template Zach S. Henderson Library

Legal Issues in Data Management: A Practical Approach

Where to store research data during and after a project. Dr. Chris Emmerson Research Data Manager

re3data.org - Making research data repositories visible and discoverable

Basic Requirements for Research Infrastructures in Europe

Striving for efficiency

Trust and Certification: the case for Trustworthy Digital Repositories. RDA Europe webinar, 14 February 2017 Ingrid Dillo, DANS, The Netherlands

PS Mailing Services Ltd Data Protection Policy May 2018

PROJECT FINAL REPORT. Tel: Fax:

Data Management Planning

Towards FAIRness: some reflections from an Earth Science perspective

Research Data Management: Edinburgh University Library s Approach Dominic Tate

Horizon 2020 and the Open Research Data pilot. Sarah Jones Digital Curation Centre, Glasgow

RPg procedures for Research Data Management (RDM)

Open Data is a new paradigm in which research data are freely and openly shared, with full re-use rights. Open data ensures that research integrity

Horizon2020/EURO Coordination and Support Actions. SOcietal Needs analysis and Emerging Technologies in the public Sector

Assessing the FAIRness of Datasets in Trustworthy Digital Repositories: a 5 star scale

Interoperability and transparency The European context

Introduction to Data Management

Data Curation Handbook Steps

VdTÜV Statement on the Communication from the EU Commission A Digital Single Market Strategy for Europe

Guidelines for Depositors

Research Data Management Procedures and Guidance

NIS Directive : Call for Proposals

Trusted Digital Archives

INSPIRE status report

Fair data and open data: differences and consequences

DATA SELECTION AND APPRAISAL CHECKLIST University of Reading Research Data Archive

Data management Backgrounds and steps to implementation; A pragmatic approach.

Data Protection Policy

Data Quality Assessment Tool for health and social care. October 2018

Certification. F. Genova (thanks to I. Dillo and Hervé L Hours)

Islam21c.com Data Protection and Privacy Policy

Data driven transformation of the public sector Tallinn, Estonia Head of unit 22 September 2016 European Commission

Checklist for a Data Management Plan (v3.0, 17 March 2011)

European digital repository certification: the way forward

DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM

Google Cloud & the General Data Protection Regulation (GDPR)

DATA PROTECTION AND PRIVACY POLICY

Research Data Management

Deliverable 6.4. Initial Data Management Plan. RINGO (GA no ) PUBLIC; R. Readiness of ICOS for Necessities of integrated Global Observations

Indiana University Research Technology and the Research Data Alliance

Improving a Trustworthy Data Repository with ISO 16363

Aon Service Corporation Law Global Privacy Office. Aon Client Data Privacy Summary

Report of the Working Group on mhealth Assessment Guidelines February 2016 March 2017

DEVELOPING, ENABLING, AND SUPPORTING DATA AND REPOSITORY CERTIFICATION

Reproducibility and FAIR Data in the Earth and Space Sciences

CDISC Operating Procedure COP-001 Standards Development

Horizon 2020 Open Research Data Pilot: What is required? Sarah Jones Digital Curation Centre

Collection Policy. Policy Number: PP1 April 2015

Inge Van Nieuwerburgh OpenAIRE NOAD Belgium. Tools&Services. OpenAIRE EUDAT. can be reused under the CC BY license

TIA. Privacy Policy and Cookie Policy 5/25/18

Consolidation Team INSPIRE Annex I data specifications testing Call for Participation

Data Management Plan

ENISA s Position on the NIS Directive

DATA PRESERVATION AND SHARING INITIATIVE. 1. Aims of the EORTC QLG Data Repository project

Making research data repositories visible and discoverable. Robert Ulrich Karlsruhe Institute of Technology

EU General Data Protection Regulation (GDPR) Achieving compliance

Research ethics and data sharing: governance and integrity

Within the meanings of applicable data protection law (in particular EU Regulation 2016/679, the GDPR ):

«Prompting an EOSC in Practice»

ESFRI WORKSHOP ON RIs AND EOSC

SWIFT Customer Security Controls Framework and self-attestation via The KYC Registry Security Attestation Application FAQ

Global Specification Protocol for Organisations Certifying to an ISO Standard related to Market, Opinion and Social Research.

Asda. Privacy and Electronic Communications Regulations audit report

Data Management Dr Evelyn Flanagan

The Research Data Alliance (RDA): building global data connections CC BY-SA 4.0

SHARING YOUR RESEARCH DATA VIA

Welcome to the Pure International Conference. Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018

Privacy Statement. Your privacy and trust are important to us and this Privacy Statement ( Statement ) provides important information

Code Administration Code of Practice

World Wide Jobs Ltd t/a Findmyexpert.com Privacy Policy 12 th April 2018

Review of Implementation of the Global Research Council Action Plan towards Open Access to Publications

GDPR Processor Security Controls. GDPR Toolkit Version 1 Datagator Ltd

European Open Science Cloud Implementation roadmap: translating the vision into practice. September 2018

ISAO SO Product Outline

DROPBOX.COM - PRIVACY POLICY

Privacy Code of Conduct on mhealth apps the role of soft-law in enhancing trust ehealth Week 2016

28 September PI: John Chip Breier, Ph.D. Applied Ocean Physics & Engineering Woods Hole Oceanographic Institution

Best Practice Guidelines for the Development and Evaluation of Digital Humanities Projects

DataFlow and VIDaaS Workshop

Advancing code and data publication and peer review. Erika Pastrana, PhD Executive Editor, Nature Journals ALPSP_Sept 2018

Newsletters We may send out newsletters to our customers providing them with articles and information which we believe may be of interest to you.

Robin Wilson Director. Digital Identifiers Metadata Services

Cybersecurity eit. Software. Certification. Industrial Security Embedded System

ISO / IEC 27001:2005. A brief introduction. Dimitris Petropoulos Managing Director ENCODE Middle East September 2006

Transcription:

Science Europe Consultation on Research Data Management Consultation available until 30 April 2018 at http://scieur.org/rdm-consultation Introduction Science Europe and the Netherlands Organisation for Scientific Research (NWO) have launched an initiative for the voluntary international alignment of research data management (RDM) policies and practices. The aim of the initiative is to develop a set of core requirements for research data management plans and a list of criteria to select trusted repositories where researchers can safely store their data for sharing. Science Europe and NWO wish to work towards the adoption of these requirements by as many stakeholders as possible and towards the alignment of their current rules. About the Consultation This consultation constitutes the next step towards the development of core RDM requirements. Interested organisations are invited to provide their feedback on draft proposed requirements for data management plans (DMPs) and criteria for trusted repositories (see invitation letter). The consultation is structured as follows: A. Information on the responding organisation and contact person B. Core requirements for Data Management Plans C. Criteria for Trusted Repositories The consultation will run until 30 April 2018. In order to ease the preparation of the responses, the full list of questions is available in downloadable format here. However only responses submitted online by the deadline will be considered. Should you have any questions, please contact the responsible person at the Science Europe Office: marie.timmermann@scienceeurope.org A. Information on responding organisation and contact person Responding Organisation 1. The name of your organisation 2. Type of organisation Choose one of the following answers Research Funding Organisation Research Performing Organisation University 1 of 14

Learned Society or Scholarly/Researcher Association Research Infrastructure e-infrastructure Scientific Publisher Data Repository/Curation Centre Other: 3. Does your organisation currently have an RDM policy in place? Choose one of the following answers An RDM policy is in place An RDM policy is under development RDM policy is in place A Data Management Plan (DMP) template is used, but no fully-fledged RDM policy is in place Other: Contact Person 4. Your details Your name Your function within your organisation Your email address Your telephone number 5. Do you agree to be contacted by one of our experts in case we need more information regarding the feedback you are giving on behalf of your organisation? B. Core requirements for Data Management Plans The planning of data management from the beginning of the research process safeguards the full use of the data during the research and is essential for publishing, sharing and archiving data at the end of the research process. A data management plan (DMP) is a formal document that outlines how to handle data both during a research project and after the project is completed. DMPs reduce the risk of losing data, help to anticipate complex ownership and user right issues in advance and improve future collaborations when data has been made openly accessible. There is a general acceptance that data management practices should follow the FAIR principles, meaning that data should be Findable, Accessible, Interoperable, and Re-usable. Despite the general acceptance of these principles, there is globally a great variety in DMP templates and guidelines, which can be confusing for researchers. Thus, it is desirable to identify a core set of DMP items to facilitate increasingly multidisciplinary and globally collaborative research. There is an obvious need to have minimum standards for core DMP templates. The core set of DMP items need to be applicable to all fields of research. Also, guiding questions should be identified to ensure that key elements will be addressed in data management planning. Researchers do not have 2 of 14

to consider each topic in detail. They may refer to texts made available by research funding organisation, research performing organisations or to domain protocols broadly accepted by researchers in their field. 6. Does your organisation agree that there is a general need to align DMP requirements across national and European organisations? 6a. Why does your organisation not agree? 7. Would your organisation join others in supporting and implementing broadly accepted core principles? 7a. What timeframe do you expect to be needed to implement the requirements in your organisation s policy? 7b. What is the major reason or impediment for your organisation not to implement the requirements, and how could this be addressed? 8. Do you have general comments on the approach? Science Europe Survey The planning of data management from the beginning of the research process safeguards the full use of the data during the research and is essential for publishing, sharing and archiving data at the end of the research process. A data management plan (DMP) is a formal document that outlines how to handle data both during a research project and after the project is completed. The Science Europe Working Group on Research Data conducted a survey of European Research Funding and Research Performing Organisations on data policies and data management between vember 2017 and February 2018. Main topics covered by this survey were: research data policy coverage and stipulations; data management planning items; and evaluation of DMPs at RFOs or quality control at RPOs. Answers to the survey came from 18 countries and 22 Science Europe Member Organisations. The survey gave an overview of which DMP items were common in these organisations. In addition, a comparison was made of widely used templates and/or guidelines: 3 of 14

Generic DMP Template of the Digital Curation Centre (DCC) Guidelines on FAIR Data Management in Horizon 2020 Open Research Data and DMP requirements by the European Research Council The FAIR Guiding Principles for scientific data management Policy on data sharing by the Wellcome Trust The Finnish national DMP DMP items and guidelines by the Research Data Alliance (RDA) working groups and Force11 Taken together, it was possible to draft a core set of DMP items for a template consisting of five broad topics to be covered by a DMP together covering no more than 14 requirements/guiding questions. This is the minimum set of requirements that Science Europe is seeking consensus about. Indicative lists of related aspects to consider when developing a DMP have been added as well. The proposed five core DMP items presented here are already widely used and provide the major aspects needed to build a useful DMP: 1. Data description and collection or reuse of existing data 2. Documentation and data quality 3. Storage and backup 4. Ethical and legal compliance, codes of conduct 5. Data sharing and long-term preservation Science Europe welcomes your organisation s feedback on the core requirements as such, as well as on the Guiding questions. The Related aspects to consider when developing a DMP (indicative list) should be seen as a nonexhaustive list of aspects, supposed to serve as an example. Core Requirement 1: Data description and collection or reuse of existing data Guiding questions A. What is the type, format and volume of data? Related aspects to consider when developing a DMP (indicative list) Explain what kind of data you are collecting or producing. Explain what kind of existing data you will reuse. What type, format and volume of data? Clearly note what format(s) your data will be in, eg. plain text (.txt), commaseparated values (.csv), geo-referenced TIFF (.tif,.tfw). Explain why you have chosen certain formats. Decisions may be based on staff expertise, a preference for open formats, the standards accepted by data centres or widespread usage within a given community. Do your chosen formats and software enable sharing and long-term access to the data? By using standardised, interchangeable or open formats ensures the long-term usability of data; these are recommended for sharing and archiving. 4 of 14

Guiding questions B. How will data be collected, created or reused? Related aspects to consider when developing a DMP (indicative list) Are there any existing data that you can reuse? Outline how the data will be collected and processed: eg. via surveys, interviews, laboratory experiments, or observations. This should cover relevant standards or methods, quality assurance and data organisation. Briefly list the methodologies for data collection/generation or description of the existing data that will be used. 9. Does your organisation agree in general with the requirement Data description and collection or reuse of existing data and the accompanying guiding questions as described above? 9a. Why does your organisation not agree with the requirement as described above? 10. Are there any aspects missing? 10a. What aspects are missing? 11. Are there aspects that need more elaboration in the guiding questions? 11a. Which aspects need more elaboration in the guiding questions? 12. Are there aspects that are unnecessary? 12a. Which aspects are unnecessary and why? 5 of 14

Core requirement 2: Documentation and data quality Guiding questions A. What metadata and documentation will accompany data? B. Will you make sure unique and persistent identifier is in use (eg. DOI)? C. What data quality control measures do you use? Related aspects to consider when developing a DMP (indicative list) What metadata will be provided to help others identify and discover the data? What standards or methodologies will you use? Researchers are strongly encouraged to use community metadata standards where these are in place. Your data and metadata vocabularies, standards or methodologies should facilitate interoperability (also interdisciplinary interoperability) Indicate how the data will be organised during the project, mentioning, eg. naming conventions, version control and folder structures. Consistent, well-ordered research data will be easier to find, understand and reuse. Consider what other documentation is needed to enable reuse. This may include information on the methodology used to collect the data, analytical and procedural information, definitions of variables, units of measurement, any assumptions made, the format and file type of the data and software used to collect and/or process the data. Consider how you will capture this information and where it will be recorded, eg. in a database with links to each item, in a readme text file, in file headers, and so on. How might your data be reused in other contexts? Unique persistent identifiers should be applied so people can reliably and efficiently find your data. They also help you to track citations and reuse. Will you pursue getting a unique persistent identifier for your data? What quality assurance processes will you adopt? Explain how the consistency and quality of data collection will be controlled and documented. This may include processes such as calibration, repeat samples or measurements, standardised data capture, data entry validation, peer review of data or representation with controlled vocabularies. 13. Does your organisation agree in general with the requirement Documentation and data quality and the accompanying guiding questions as described above? 13a. Why does your organisation not agree with the requirement as described above? 14. Are there any aspects missing? 6 of 14

14a. What aspects are missing? 15. Are there aspects that need more elaboration in the guiding questions? 15a. Which aspects need more elaboration in the guiding questions? 16. Are there aspects that are unnecessary? 16a. Which aspects are unnecessary and why? Core requirement 3: Storage and backup Guiding questions A. How will data be stored and backed up during the research? B. How will you take care of data security and personal data protection during the research? Related aspects to consider when developing a DMP (indicative list) Describe where the data will be stored and backed up during the course of research activities. Also, how often the backup will be performed. The use of robust, managed storage with automatic backup, for example, that provided by university IT teams, is preferable. Storing data on laptops, computer hard drives or external storage devices alone is very risky. How will the data be recovered in the event of an incident? Consider data security, particularly if your data is sensitive eg. detailed personal data, politically sensitive information, or trade secrets. te the main risks and how these will be managed. Take into consideration the EU General Data Protection Regulation (GDPR) on data privacy. Are there any institutional data security policies in place? 17. Does your organisation agree in general with the requirement Storage and backup and the accompanying guiding questions as described above? 7 of 14

17a. Why does your organisation not agree with the requirement as described above? 18. Are there any aspects missing? 18a. What aspects are missing? 19. Are there aspects that need more elaboration in the guiding questions? 19a. Which aspects need more elaboration in the guiding questions? 20. Are there aspects that are unnecessary? 20a. Which aspects are unnecessary and why? 8 of 14

Core requirement 4: Ethical and legal compliance, codes of conduct Guiding questions A. How will you manage ethical issues and codes of conduct? B. How will you manage IPR, copyright, ownership and other legal issues? Related aspects to consider when developing a DMP (indicative list) Have you gained consent for data preservation and sharing? Follow the national and international codes of conducts, institutional ethical guidelines and check if ethical review is required for your research project. How will sensitive data be handled to ensure it is stored and transferred securely? Investigators carrying out research involving human participants should request consent to preserve and share the data. Consider how you will protect the identity of participants, eg. via anonymisation or using managed access procedures. Ethical issues may affect how you store and transfer data, who can see/use it and how long it is kept. You should demonstrate that you are aware of this and have planned accordingly. Are there any restrictions on the reuse of third-party data? State who will own the copyright and IPR of any existing data as well as new data that you will generate. For multi-partner projects, IPR ownership should be covered in the consortium agreement. Data sharing agreements may also be needed. Explain how the data will be licensed for reuse. Take in to consideration the EU General Data Protection Regulation (GDPR) on data privacy. 21. Does your organisation agree in general with the requirement Storage and backup and the accompanying guiding questions as described above? 21a. Why does your organisation not agree with the requirement as described above? 22. Are there any aspects missing? 22a. What aspects are missing? 23. Are there aspects that need more elaboration in the guiding questions? 9 of 14

23a. Which aspects need more elaboration in the guiding questions? 24. Are there aspects that are unnecessary? 24a. Which aspects are unnecessary and why? Core requirement 5: Data sharing and long-term preservation Guiding questions A. How and when will you share data (consider licences, data security/ protection, possible embargo reasons)? B. How do you select data for preservation and where data will be preserved long-term (eg. data repository or archive)? Related aspects to consider when developing a DMP (indicative list) How will you share the data eg. deposit in a trustworthy data repository, use a secure data service, handle data requests directly or use another mechanism? The methods used will depend on number of factors such as the type, size, complexity and sensitivity of the data. Outline the plans for data sharing and preservation how long will the data be retained? When will you make the data available? What is the expected timely release. For how long do you need exclusive use of the data and why? Will data sharing be postponed/restricted, eg. to publish, protect proprietary or seek patents? How will potential users find out about your data? Who will be able to use your data? If you need to restrict access to certain communities or apply data sharing agreements, explain why. What action will you take to overcome or minimise restrictions? Will a data sharing agreement (or equivalent) be required? What data must be retained/destroyed for contractual, legal, or regulatory purposes? How will you decide what data to keep? Describe the data to be preserved long-term. What are the foreseeable research uses for the data? Where will the data be deposited? If you do not propose to use an established repository, the data management plan should demonstrate that the data can be curated effectively beyond the lifetime of the grant. It helps to show that you have consulted with the repository to understand their policies and procedures, including any metadata standards, and costs involved. 10 of 14

Guiding questions C. What methods or software tools are needed to access data? D. Who will be responsible for data management (ie. data steward)? E. What are the costs and time needed for data management and making data FAIR? Related aspects to consider when developing a DMP (indicative list) How will potential users can access your data? Will you share data via a repository, handle requests directly or use another mechanism? Carefully consider the sustainability of software needed for accessing the data. Outline the roles and responsibilities for all activities, eg. data capture, metadata production, data quality, storage and backup, data archiving, and data sharing. Individual(s) should be named where possible. For collaborative projects, you should explain the co-ordination of data management responsibilities across partners. Who is responsible for implementing the DMP, and ensuring it is reviewed and revised? Will additional resources be needed to prepare data for deposit or meet any charges from data repositories? Have you costed in time and effort to prepare the data for sharing/preservation (data curation)? Carefully consider and justify any resources needed to deliver the data. These may include storage costs, hardware, staff time, costs of preparing data for deposit and repository charges. What costs if any will your selected data repository or archive charge? 25. Does your organisation agree in general with the requirement Storage and backup and the accompanying guiding questions as described above? 25a. Why does your organisation not agree with the requirement as described above? 26. Are there any aspects missing? 26a. What aspects are missing? 27. Are there aspects that need more elaboration in the guiding questions? 11 of 14

27a. Which aspects need more elaboration in the guiding questions? 28. Are there aspects that are unnecessary? 28a. Which aspects are unnecessary and why? Regarding Core requirements 1 5 29. Are there aspects that would seriously refrain your organisation from adopting these requirements? 29a. Which aspects are these, and why would they refrain your organisation from adopting the requirements? C. Criteria for trusted repositories There is no generally accepted list of trusted repositories, whereas registries of repositories mention more than 2,000 of them. This is confusing not only for researchers but also for funders, institutions and journals that seek long term preservation of research data. Certification organisations and processes do exist to ensure that repositories meet minimum standards of quality and reliability. However, currently less than 10% of listed repositories worldwide comply with these certifications. In due time repositories should account for the quality and trustworthiness of the services they offer; preferably by attaining some form of certification. For the time being, minimal requirements for trusted repositories are proposed here to ensure that repositories meet minimum standards of quality and reliability. The proposed set of requirements are based on a comparison of criteria for suitable platforms for research data sharing and archiving for different disciplines, set by a selection of sources (funders, performers, certification bodies, as well as publishers). 12 of 14

Minimal requirements for trusted repositories A trusted repository should have the following requirements: 1. Provision of a persistent and unique identifier (PID) identify the dataset ensure dataset persistence enable searching and retrieval of datasets maintain a repository-managed URI associated with each of those PIDs keep permanent IDs as tombstones even if the data have been retracted 2. Metadata ensure dataset persistence enable finding of datasets provide publicly available and maintained information even for retracted datasets 3. Data access & Usage licenses enable access to the dataset under well-specified conditions ensure dataset stability enable retrieval of datasets provide information about licensing and permissions 4. Machine Accessibility enable searching (and preferably retrieval) of datasets by automated processes ensure that at least intrinsic metadata is accessible in a structured and machine- readable form 5. Long-term Preservation ensure persistence of metadata and datasets explain the long-term preservation policies and plans guarantee the sustainability of a repository The above requirements are very well in line with criteria of existing certification systems. 30. Does your organisation agree in general with the criteria as described above? 30a. Why does your organisation not agree with the criteria as described above? 31. Are there any aspects missing? 31a. What aspects are missing? 13 of 14

32. Are there aspects that need more elaboration? 32a. Which aspects need more elaboration? 33. Are there aspects that are unnecessary? 33a. Which aspects are unnecessary and why? 34. Do you have any comments or suggestions on the proposed RDM requirements or criteria for trusted repositories in general? 14 of 14