FAIR-aligned Scientific Repositories: Essential Infrastructure for Open and FAIR Data

GeoDaRRs: What is the existing landscape and what gaps exist in that landscape for data producers and users?
7 August 2018
Shelley Stall, AGU Director, Data Programs
sstall@agu.org
@ShelleyStall

AGU's position statement on data affirms that Earth and space sciences data are a world heritage. Properly documented, credited, and preserved, they will help future scientists understand the Earth, planetary, and heliophysics systems.
https://sciencepolicy.agu.org/files/2013/07/agu-data-position-statement-final-2015.pdf
Photo credit: Rick Meyers on Unsplash

Researcher Challenges with Data Use
The top four issues accounted for 73% of respondents.
[Pie chart. Slice values shown: 20%, 20%, 17%, 17%, 9%, 9%, 4%, 4%. Categories: Data complexity; Lack of data standards and exchange standards; Finding relevant existing data (knowing what's out there); Data management and storage; Dealing with multiple data types; Other; Data volumes; Data access and file transfer.]
Source: Data Management Skills Gap Analysis, April 7, 2017, http://bfe-inf.org/document/skills-gap-analysis

There is an urgent need to improve the infrastructure supporting the reuse of scholarly data. - From The FAIR Guiding Principles for scientific data management and stewardship

FAIR Data Principles (applies to software too)
Findable: Assign persistent IDs (PIDs), provide rich metadata, register in a searchable resource
Accessible: Retrievable by their ID using a standard protocol; metadata remain accessible even when data are no longer available
Interoperable: Use formal, broadly applicable languages, standard vocabularies, and qualified references
Reusable: Rich, accurate metadata, clear licenses, provenance, use of community standards
Article in the Nature journal Scientific Data: Wilkinson, M. D. et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 3:160018, doi: 10.1038/sdata.2016.18 (2016).
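As a small illustration of "retrievable by their ID using a standard protocol", the sketch below turns a DOI into an HTTPS link through the public doi.org resolver. The function name `doi_to_url` and its validation rule are assumptions made for this example; they are not part of the talk or of any particular library.

```python
from urllib.parse import quote

# Public DOI proxy; a DOI appended to this prefix resolves over plain HTTPS.
DOI_RESOLVER = "https://doi.org/"


def doi_to_url(doi: str) -> str:
    """Turn a bare DOI (optionally prefixed with 'doi:') into a resolvable URL.

    Hypothetical helper for illustration only.
    """
    cleaned = doi.strip()
    if cleaned.lower().startswith("doi:"):
        cleaned = cleaned[4:].strip()
    # Very loose sanity check: DOIs start with "10." and contain a "/".
    if not cleaned.startswith("10.") or "/" not in cleaned:
        raise ValueError(f"not a DOI: {doi!r}")
    # Percent-encode characters that are unsafe in a URL path, keeping "/" intact.
    return DOI_RESOLVER + quote(cleaned, safe="/")


if __name__ == "__main__":
    # The DOI of the FAIR Guiding Principles article cited on this slide.
    print(doi_to_url("doi: 10.1038/sdata.2016.18"))
```

The same identifier works for people and for machines: a browser follows the link to a landing page, and a script can request the same URL over HTTP.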

Evolving Researcher Support
[Diagram. Labels as they appear on the slide:]
Internal IDs
Targeted Discovery
Community evolved standards
Clear Licenses
Global, unique PIDs
Citation Support
Machine and human accessible; documentation for multiple audiences
Researcher Communities
Cross Community Connections
International Discovery
Evolving

New Grant from the Laura and John Arnold Foundation (LJAF)
Align publishers and repositories in following best practices to enable FAIR and open data, and create workflows so that researchers have a simplified, common experience when submitting a paper to any leading Earth and space science journal. This will accelerate scientific discovery and enhance the integrity, transparency, and reproducibility of this data.

Enabling FAIR Data Project - Objectives
FAIR-aligned data repositories add value to research data, provide metadata and landing pages for discoverability, and support researchers with documentation guidance, citation support, and curation.
FAIR-aligned Earth, space, and environmental science publishers align their policies to establish a similar experience for researchers.
Data, software, and technology will be available through citations that resolve to repository landing pages.
Availability statements are provided.
Data are not placed in the supplemental information.

http://www.copdess.org/enabling-fair-data-project/
Funded by the Laura and John Arnold Foundation

FAIR-aligned Researcher Commitment
Locating trustworthy, community-accepted, FAIR-aligned repositories that support:
  Documenting data and software (and other research outputs where possible) to agreed community standards that describe provenance and enable discovery, assessment of reliability, and reuse
  Persistent identifiers for data and software (and other research outputs where possible)
  Licenses for data and software (and other research outputs where possible) that are as open as possible to enable the widest potential reuse
Citing data, software, physical samples, and other research products
Developing data availability statements
Preparing and managing data management plans, and keeping them as living documents
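The citation and availability-statement commitments above can be sketched in a few lines. This is only an illustration: the `DatasetRecord` fields, the "Creator (Year). Title. Repository. Identifier" layout, the statement wording, and the sample record (including its made-up DOI) are all assumptions, not a format prescribed by the talk or by any journal.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DatasetRecord:
    """Minimal, hypothetical description of a deposited dataset."""
    creators: List[str]
    year: int
    title: str
    repository: str
    doi: str


def format_citation(rec: DatasetRecord) -> str:
    """Build a reference-list entry: Creator (Year). Title. Repository. Identifier."""
    authors = "; ".join(rec.creators)
    return (f"{authors} ({rec.year}). {rec.title}. "
            f"{rec.repository}. https://doi.org/{rec.doi}")


def availability_statement(rec: DatasetRecord) -> str:
    """Build a one-sentence data availability statement pointing at the repository."""
    return (f"The data supporting this study are available from "
            f"{rec.repository} at https://doi.org/{rec.doi}.")


if __name__ == "__main__":
    # Entirely made-up record, for illustration only.
    example = DatasetRecord(
        creators=["Doe, J.", "Roe, R."],
        year=2018,
        title="Example stream gauge observations",
        repository="Example Repository",
        doi="10.1234/example.5678",
    )
    print(format_citation(example))
    print(availability_statement(example))
```

Both strings point at the same persistent identifier, so the reference list and the availability statement lead readers to the same repository landing page.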

FAIR-aligned Repository Commitment
Ensure that research outputs (e.g., data, software, technology, and physical samples) curated by repositories are open and FAIR, have essential documentation, and include human-readable and machine-readable metadata (e.g., on landing pages) in standard formats that are exposed and publicly discoverable.
Ingest and expose data to promote interoperability and reuse.
Ensure that unique, persistent identifiers are used for authors (e.g., ORCID), research objects (e.g., Digital Object Identifier), and physical samples (e.g., IGSN).
Create associations among the research outputs that they manage and other related entities.
Ensure that data and software have licenses that are as open as possible, and as protected as necessary.
Support peer review of related manuscripts by enabling access to the research outputs prior to publication.
Gain third-party validation of trustworthy and sustainable practices and capabilities.
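One way a landing page might expose machine-readable metadata is a schema.org `Dataset` description serialized as JSON-LD. The talk does not name a specific format, so the choice of schema.org is an assumption here, as are the function name `landing_page_jsonld` and every sample value (the DOI is made up, and the ORCID iD is the one ORCID publishes for its fictional example researcher).

```python
import json


def landing_page_jsonld(name, description, doi, license_url,
                        creator_name, creator_orcid):
    """Return a schema.org Dataset description as a plain dict.

    Hypothetical helper: it links the dataset to a DOI, a license,
    and an author identified by ORCID iD.
    """
    return {
        "@context": "https://schema.org/",
        "@type": "Dataset",
        "name": name,
        "description": description,
        "identifier": f"https://doi.org/{doi}",
        "license": license_url,
        "creator": {
            "@type": "Person",
            "@id": f"https://orcid.org/{creator_orcid}",
            "name": creator_name,
        },
    }


if __name__ == "__main__":
    record = landing_page_jsonld(
        name="Example stream gauge observations",
        description="Illustrative record; all values are placeholders.",
        doi="10.1234/example.5678",            # made-up DOI
        license_url="https://creativecommons.org/licenses/by/4.0/",
        creator_name="Josiah Carberry",        # ORCID's fictional researcher
        creator_orcid="0000-0002-1825-0097",
    )
    print(json.dumps(record, indent=2))
```

A repository would typically embed the resulting JSON in the landing page inside a `script` element of type `application/ld+json`, next to the human-readable description.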

Community-Driven Project
Partnership includes:
Science Data Communities: AGU / EGU; Earth Science Information Partners (ESIP); Research Data Alliance (RDA); EarthCube / Council for Data Facilities; FORCE11
Publishers: AGU; Proceedings of the National Academy of Sciences (PNAS); Nature; Science/AAAS; Elsevier; PLOS; Hindawi; Copernicus/EGU
Also listed on the slide: International Repositories (300+); National Computational Infrastructure (NCI); AuScope; Australian National Data Service (ANDS); Center for Open Science; DataCite / re3data; ORCID; CrossRef; CHORUS; Scholix; OSGeo; Pangaea; DataONE; Wiley
And Growing!!

Repository Finder Tool
Based on original work by Ruth Duerr, The Ronin Institute
Further designed by the Repository Guidance for Researchers (TAG A/D)
Co-Chairs: Danie Kinkade (BCO-DMO), Michael Witt (Purdue University)
Developed by: DataCite/re3data
Researchers: Completing two rounds of usability testing
Repositories.

Repositories: Get Ready to Be Found
Review your re3data.org record. If you don't have one, go to the page and select the Suggest button at the top. Enter your repository information.
Tune your record with this guide: http://bit.ly/repoguide
Summarize your update in an email to info@re3data.org

Thank you! Shelley Stall Director, AGU Data Program sstall@agu.org @ShelleyStall