Reproducibility and FAIR Data in the Earth and Space Sciences

Similar documents
FAIR-aligned Scientific Repositories: Essential Infrastructure for Open and FAIR Data

Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices, and Funder Requirements

CODATA: Data Citation Workshop Perspectives from Editors and Publishers. Brooks Hanson Director, Publications, AGU

ISMTE Best Practices Around Data for Journals, and How to Follow Them" Brooks Hanson Director, Publications, AGU

COALITION ON PUBLISHING DATA IN THE EARTH AND SPACE SCIENCES: A MODEL TO ADVANCE LEADING DATA PRACTICES IN SCHOLARLY PUBLISHING. Source: NSF.

5/16/2018. Researcher Challenges with Data Use. AGU s position statement on data affirms that

Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard

Persistent Identifier the data publishing perspective. Sünje Dallmeier-Tiessen, CERN 1

State of the Art in Data Citation

Data Exchange in the Earth Sciences

Callicott, Burton B, Scherer, David, Wesolek, Andrew. Published by Purdue University Press. For additional information about this book

Services to Make Sense of Data. Patricia Cruse, Executive Director, DataCite Council of Science Editors San Diego May 2017

Paving the Rocky Road Toward Open and FAIR in the Field Sciences

How to make your data open

DOIs for Research Data

FREYA Connected Open Identifiers for Discovery, Access and Use of Research Resources

PDS, DOIs, and the Literature. Anne Raugh, University of Maryland Edwin Henneken, Harvard-Smithsonian Center for Astrophysics

Welcome to the Pure International Conference. Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018

Data Management Plans. Sarah Jones Digital Curation Centre, Glasgow

DATAVERSE FOR JOURNALS

SHARING YOUR RESEARCH DATA VIA

Introducing the Springer Nature Data Support Services

Making data publication a first class research output

Supporting Data Stewardship Throughout the Data Life Cycle in the Solid Earth Sciences

Data Citation and Scholarship

GEOSS Data Management Principles: Importance and Implementation

re3data.org - Making research data repositories visible and discoverable

Open Science, FAIR data and effective data management

The Changing Role of Data Stewardship in Creating Trustworthy, Transdisciplinary High Performance Data Platforms for the Future

Dataverse and DataTags

Open Access, RIMS and persistent identifiers

Perspectives on Open Data in Science Open Data in Science: Challenges & Opportunities for Europe

The DOI Identifier. Drexel University. From the SelectedWorks of James Gross. James Gross, Drexel University. June 4, 2012

Science Europe Consultation on Research Data Management

Web of Science. Platform Release Nina Chang Product Release Date: December 10, 2017 EXTERNAL RELEASE DOCUMENTATION

Data Publishing and Data Linking Introducing SCHOLIX

Advancing code and data publication and peer review. Erika Pastrana, PhD Executive Editor, Nature Journals ALPSP_Sept 2018

The Experimental Project of DOI Registration for Research Data at Japan Link Center (JaLC)

NRF Open Access Statement

CrossRef tools for small publishers

Research Elsevier

Scholix Metadata Schema for Exchange of Scholarly Communication Links

INTEGRATION OPTIONS FOR PERSISTENT IDENTIFIERS IN OSGEO PROJECT REPOSITORIES

A Data Citation Roadmap for Scholarly Data Repositories

Towards FAIRness: some reflections from an Earth Science perspective

OpenAIRE. Fostering the social and technical links that enable Open Science in Europe and beyond

Reproducibility and Reuse of Scientific Code Evolving the Role and Capabilities of Publishers

Open Access to Publications in H2020

Striving for efficiency

Making research data repositories visible and discoverable. Robert Ulrich Karlsruhe Institute of Technology

Indiana University Research Technology and the Research Data Alliance

For Attribution: Developing Data Attribution and Citation Practices and Standards

Reproducible Workflows Biomedical Research. P Berlin, Germany

First Light for DOIs at ESO

Adoption of Data Citation Outcomes by BCO-DMO

DATA SHARING FOR BETTER SCIENCE

PERSISTENT IDENTIFIERS FOR THE UK: SOCIAL AND ECONOMIC DATA

RADAR. Establishing a generic Research Data Repository: RESEARCH DATA REPOSITORY. Dr. Angelina Kraft

OpenAIRE Guidelines Promoting Repositories Interoperability and Supporting Open Access Funder Mandates

WE HAVE SOME GREAT EARLY ADOPTERS

The Sunshine State Digital Network

European Open Science Cloud Implementation roadmap: translating the vision into practice. September 2018

Assessing the FAIRness of Datasets in Trustworthy Digital Repositories: a 5 star scale

ZB MED Information Center Life Sciences

RADAR Project. Data Archival and Publication as a Service. Matthias Razum FIZ Karlsruhe RESEARCH DATA REPOSITORIUM. Zurich, December 15, 2014

ASTRONOMY & PARTICLE PHYSICS CLUSTER

Implementation of Open-World, Integrative, Transparent, Collaborative Research Data Platforms: the University of Things (UoT)

Update on Dataverse Dryad-Dataverse Community Meeting. Mercè Crosas, Elizabeth Quigley & Eleni Castro. Data Science > IQSS > Harvard University

BPMN Processes for machine-actionable DMPs

SESAR, IGSN, & a vision for a Repository Portal and Hosted Collection Management

Response to RFI: Public Access to Digital Data Resulting From Federally Funded Scientific Research Office of Science and Technology Policy

Enabling Open Science: Data Discoverability, Access and Use. Jo McEntyre Head of Literature Services

Data Curation Profile Human Genomics

DATA MANAGEMENT PLANS Requirements and Recommendations for H2020 Projects. Matthias Razum April 20, 2018

Part 2: Current State of OAR Interoperability. Towards Repository Interoperability Berlin 10 Workshop 6 November 2012

Open Access compliance:

Deliverable 6.4. Initial Data Management Plan. RINGO (GA no ) PUBLIC; R. Readiness of ICOS for Necessities of integrated Global Observations

Implementation of the CoreTrustSeal

Reflections on Three Decades in Internet Time

The Role of Repositories and Journals in the Astronomy Research Lifecycle

Improving a Trustworthy Data Repository with ISO 16363

Using GitHub to open up your software project

National Materials Data Initiatives

Archivierung und Publikation von Forschungsdaten mit RADAR

OPEN DATA. Dr Arthur Smith Dr Marta Teperek 15/06/2015. Slides: University of Cambridge

Data Publication. HGF Alliance: Remote Sensing and Earth System Dynamics

The library s role in promoting the sharing of scientific research data

Taylor & Francis Online. A User Guide.

Making the most of metadata with Metadata 2020

Data Curation: Technical Challenges Facing Repositories. Brianna Marshall Jan. 9, 2014

Roy Lowry, Gwen Moncoiffe and Adam Leadbetter (BODC) Cathy Norton and Lisa Raymond (MBLWHOI Library) Ed Urban (SCOR) Peter Pissierssens (IODE Project

Scientific Research Data Management Policy

Developing a Research Data Policy

TEXT MINING: THE NEXT DATA FRONTIER

Applying Archival Science to Digital Curation: Advocacy for the Archivist s Role in Implementing and Managing Trusted Digital Repositories

Developing Data Management Plans (DMP) Scholarly Communication Initiative Mississippi State University Libraries March 25, 2015

Interoperability Framework Recommendations

EC Horizon 2020 Pilot on Open Research Data applicants

CUED Library Research Support. Dan Crane, Research Support Librarian

ORCID, Researchers & Repositories

Transcription:

Reproducibility and FAIR Data in the Earth and Space Sciences December 2017 Brooks Hanson Sr. VP, Publications, American Geophysical Union bhanson@agu.org

Earth and Space Science is Essential for Society Special AGU Journal Issue, April 2017 27 Commentaries across AGU journals illuminate the deep and growing benefits of research in the Earth and space sciences for humanity. **Driven by international shared data** Open Access; see link at http://publications.agu.org 2

AGU data position statement (began in 1997; updated in 2016) Earth and space sciences data are a world heritage. They should be preserved long-term for future use. They should be made openly available to the scientific community and the public as soon as possible. They should be accessible in usable formats with sufficient machine-readable documentation to allow informed re-use. These responsibilities are an integral part of scientific research shared by individual scientists, data stewards, research institutions, and funding organizations.

Enabling FAIR data in the scholarly literature Embrace that published papers are only part of the research record. They must contain useful and reliable 2-way links and identifiers to other secure resources for integrity and discoverability: Context (metadata) around these links is critical Data, software, repositories, samples (IGSN) Funding information Author information (ORCID, CREDIT, institutions) Reference information (semantic context is coming) We need efficient ways to help authors, publishers and repositories preserve these links: Standard, expected, sensible, easy. Data needs quality curation, metadata, provenance, linking.

Recent Alignment by Publishers, Repositories, and Funders Around Best Practices TOP (transparency and openness promotion guidelines)--2900 journals http://copdess.org (Coalition on Publishing Data in the Earth and Space Sciences) Statement of Commitment--most publishers and repositories in the Earth and space sciences Joint Declaration of Data Citation Principles--114 organizations Software Citation Principles: https://doi.org/10.7717/peerj-cs.86 Reproducibility conferences and outcomes (AAAS and other orgs) Best practices around: Clinical trials, Lab studies, Field data, Software, Industryacademic research Authorship (https://doi.org/10.1101/140228 submitted to PNAS) Quality/certification standards for repositories Challenge is practicing what you preach

Current status: Publishers increasingly requiring data (and code) availability Supplements still being heavily used (no metadata, pdf often) Growing use of repositories, domain and general (Figshare, Dryad, institutions). Few standards on metadata or linking (limiting discoverability; interoperability). Available from authors (yeah, right) still common unpublished references still commonly allowed Publishers implementing other identifiers (ORCID, IGSN, etc.) Author standards and expectations and declarations standardized Best practices for FAIR data are available Great examples in some disciplines/repositories of successful implementations and solutions but not widely adopted. Leveraging and scaling these solutions

Software: Current Status Leading journals have software transparency standards Community best practices emerging But...little uniformity in those best practices and limited awareness among authors, editors Key issues: Licenses Use MIT or other software license, not CC-BY (which require attribution and documentation of any changes) Github has limited metadata (can use zenodo as a landing page). Different IP issues than for data (treated differently by universities)

Grant from Laura and John Arnold Foundation (LJAF) to AGU Align/develop best practices and standards across the Earth and space sciences to enable FAIR data Develop common solution(s) for researchers, publishers, editorial systems, and data repositories

Community-Driven Project Partnership Includes: 9 Science Data Communities AGU Earth Science Information Partners Research Data Alliance (RDA) COPDESS Earthcube/CDF DataCite Publishers AGU PNAS Nature Science Repositories and COPDESS Signatories National Computational Infrastructure (NCI) AuScope Australian National Data Service Infrastructure Center for Open Science And Growing!!

The Goal: Publishers will adopt common standards in their editorial systems around workflows, datasets, metadata, acceptable repositories, and data citation. Will connect w/ repositories via api s for efficient metadata exchange. Repositories will adopt standards and best practices around, persistent identifiers, landing pages (for exposing metadata), access, embargoes (for peer-review), data citations, and licenses. Researchers will know what to expect across all journals and be able to prepare data early in workflow. 10

Progress so far, and Directions: 6+ working groups (we re calling them temporary action groups) to address needs among publishers and repositories. Major ESS publishers have agreed to eliminate data supplements all data in repositories with DOI s, curation. Repositories working on solutions to improve data curation and help researchers. Funders also engaged; strengthen and provide more directed DMP s (NRC can help here especially). Expanded declaration or principles around FAIR data supported and in progress Implementation begins in summer 2018

Larger Effort Needed Support and publicize these community efforts around best practices Publishers need to follow current best practices Get references out of supplements in online versions (all references in main text); open up references at Crossref (I4OC) Help authors (include data best practices and expectations into workshops, instructions...) Ensure integrity (no unpublished references; data availability statements). Societies should recognize data stewardship in awards and recognition (fellowship, academy membership). Funders need to standardize and strengthen DMP s guidelines and follow through on these (be more directive) Update guidelines and FAQs to follow best practices Support leading publishers Support leading repositories Implement identifiers fully (samples, affiliations, repositories.)

Efforts and Culture aligned for Reproducibility FAIR data effort across the Earth and space sciences publishers and repositories are implementing Recent surveys indicate much increased acceptance of need for data sharing and curation in the Earth and space sciences Need to reward groups/people that are following best practices Help, direction, and common outreach needed from funders NRC can greatly help here in providing directions to funders/societies/researchers Society alignment and support (AGU willing to lead and share what we have done with other societies)