Launching the. Data Curation Network NDS/MBDH 2018

Similar documents
Curating Research Data

Data Curation Handbook Steps

Striving for efficiency

Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard

Welcome to the Pure International Conference. Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018

DATA SHARING FOR BETTER SCIENCE

How to assist researchers in sharing their research data October 22, 2015

Data publication and discovery with Globus

The library s role in promoting the sharing of scientific research data

FAIR-aligned Scientific Repositories: Essential Infrastructure for Open and FAIR Data

JISC WORK PACKAGE: (Project Plan Appendix B, Version 2 )

Comparing Curricula for Digital Library. Digital Curation Education

Applying Archival Science to Digital Curation: Advocacy for the Archivist s Role in Implementing and Managing Trusted Digital Repositories

Demos: DMP Assistant and Dataverse

DOIs for Research Data

Developing a Research Data Policy

Data Curation Profile Human Genomics

Conducting a Self-Assessment of a Long-Term Archive for Interdisciplinary Scientific Data as a Trustworthy Digital Repository

DEVELOPING, ENABLING, AND SUPPORTING DATA AND REPOSITORY CERTIFICATION

Research Data Edinburgh: MANTRA & Edinburgh DataShare. Stuart Macdonald EDINA & Data Library University of Edinburgh

The Data Curation Profiles Toolkit: Interview Worksheet

Slide 1 & 2 Technical issues Slide 3 Technical expertise (continued...)

Science Europe Consultation on Research Data Management

Reproducibility and FAIR Data in the Earth and Space Sciences

GEOSS Data Management Principles: Importance and Implementation

Fair data and open data: differences and consequences

RDM through a UK lens - New Roles for Librarians?

Edinburgh DataShare: Tackling research data in a DSpace institutional repository

OpenAIRE. Fostering the social and technical links that enable Open Science in Europe and beyond

Paving the Rocky Road Toward Open and FAIR in the Field Sciences

bwfdm Communities - a Research Data Management Initiative in the State of Baden-Wuerttemberg

BUSINESS-BASED VALUE IN AN MDR

Research Data Management: Edinburgh University Library s Approach Dominic Tate

Indiana University Research Technology and the Research Data Alliance

Jisc Research Data Shared Service

Illinois Data Bank FIRST YEAR REVIEW COMPILED BY RESEARCH DATA SERVICE STAFF

Dataverse and DataTags

Data Life Cycle. Research. Access Collaborate. Acquire. Analyse. Comprehend. Plan. Manage Archive. Publish Reuse

Trust and Certification: the case for Trustworthy Digital Repositories. RDA Europe webinar, 14 February 2017 Ingrid Dillo, DANS, The Netherlands

Preservation and Access of Digital Audiovisual Assets at the Guggenheim

Towards FAIRness: some reflections from an Earth Science perspective

Reproducibility and Reuse of Scientific Code Evolving the Role and Capabilities of Publishers

National Materials Data Initiatives

Making research data repositories visible and discoverable. Robert Ulrich Karlsruhe Institute of Technology

Certification. F. Genova (thanks to I. Dillo and Hervé L Hours)

Data Curation: Technical Challenges Facing Repositories. Brianna Marshall Jan. 9, 2014

Susan Thomas, Project Manager. An overview of the project. Wellcome Library, 10 October

Persistent Identifier the data publishing perspective. Sünje Dallmeier-Tiessen, CERN 1

Developing Data Management Plans (DMP) Scholarly Communication Initiative Mississippi State University Libraries March 25, 2015

Web of Science. Platform Release Nina Chang Product Release Date: March 25, 2018 EXTERNAL RELEASE DOCUMENTATION

How to make your data open

Brown University Libraries Technology Plan

THE NATIONAL DATA SERVICE(S) & NDS CONSORTIUM A Call to Action for Accelerating Discovery Through Data Services we can Build Ed Seidel

Research Data Management and Institutional Repositories

DATAVERSE FOR JOURNALS

Data Management Checklist

For Attribution: Developing Data Attribution and Citation Practices and Standards

Scientific Research Data Management Policy

Conferenza GARR Moving Forward to deliver an «EOSC in Practice» by 2018

Assessing the FAIRness of Datasets in Trustworthy Digital Repositories: a 5 star scale

The What, Why, Who and How of Where: Building a Portal for Geospatial Data. Alan Darnell Director, Scholars Portal

Data Curation Profile Movement of Proteins

Ensuring Proper Storage for Earth Science Data: The USGS Process to Certify Trusted Digital Repositories

The RMap Project: Linking the Products of Research and Scholarly Communication Tim DiLauro

Developing a social science data platform. Ron Dekker Director CESSDA

Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices, and Funder Requirements

Federal Statistics, Multiple Data Sources, and Privacy Protection: Next Steps

Science Panel Discussion presentation: "A Data Sharing Story"

Engaging and Connecting Faculty:

DataSTORRE Deposit Guide

Cheshire 3 Framework White Paper: Implementing Support for Digital Repositories in a Data Grid Environment

TEXT MINING: THE NEXT DATA FRONTIER

Survey of Research Data Management Practices at the University of Pretoria

Open Science, FAIR data and effective data management

Cyber Partnership Blueprint: An Outline

Open Access, RIMS and persistent identifiers

The Changing Role of Data Stewardship in Creating Trustworthy, Transdisciplinary High Performance Data Platforms for the Future

Achieving effective risk management and continuous compliance with Deloitte and SAP

re3data.org - Making research data repositories visible and discoverable

January 16, Re: Request for Comment: Data Access and Data Sharing Policy. Dear Dr. Selby:

Trials And Tribulations Of Moving Forward With Digital Preservation Workflows And Strategies

Data Management Tools. Lizzy Rolando, Georgia Tech Aaron Trehub, Auburn University August 6, 2013

GNSSN. Global Nuclear Safety and Security Network

BPMN Processes for machine-actionable DMPs

Institutional Repository using DSpace. Yatrik Patel Scientist D (CS)

RN Workshop Series on Innovations in Scholarly Communication: plementing the Benefits of OAI (OAI3)

DATA SELECTION AND APPRAISAL CHECKLIST University of Reading Research Data Archive

Interdisciplinary Processes at the Digital Repository of Ireland

All of Us Research Program

Data Curation Profile Cornell University, Biophysics

ZB MED Information Center Life Sciences

SHARING YOUR RESEARCH DATA VIA

ProQuest Dissertations and Theses Overview. Austin McLean and Marlene Coles CGS Summer Workshop, July 2017

About Knowledge Convergence. e-infrastructures Austria an interdisciplinary case study concerning research resources and their management

COALITION ON PUBLISHING DATA IN THE EARTH AND SPACE SCIENCES: A MODEL TO ADVANCE LEADING DATA PRACTICES IN SCHOLARLY PUBLISHING. Source: NSF.

Services to Make Sense of Data. Patricia Cruse, Executive Director, DataCite Council of Science Editors San Diego May 2017

Fedora Commons: Taking on the Challenge of the Next Generation of Scholarly Communication

DSA WDS Partnership Working Group Catalogue of Common Requirements

Data Discovery - Introduction

Legal Issues in Data Management: A Practical Approach

Transcription:

NDS/MBDH 2018 Launching the Data Curation Network Lisa Johnston University of Minnesota Jake Carlson University of Michigan Cynthia Hudson-Vitale Penn State Univ. Heidi Imker University of Illinois Wendy Kozlowski Cornell University Robert Olendorf Penn State University Claire Stewart University of Minnesota Mara Blake Johns Hopkins University Joel Herndon Duke University Elizabeth Hull Dryad Data Repository Timothy M. McGeary Duke University 7-11-2018 Data Curation Network http://datacurationnetwork.org

Rise of the Data Sharing Culture Researchers are increasingly required/incentivised to share data Funder data sharing mandates Journal data sharing policies Disciplinary practices emphasis on transparency and reproducibility But! It s not enough to just share the files, well-curated data are more valuable! Goal of data curation Ingest and maintain (trusted digital repositories) in ways that make it findable, accessible, interoperable and reusable.

Challenges for IR Data Curation Services How to scale local data curation services across all disciplines? How many data curation experts are needed? Types: GIS, spreadsheet/tabular, statistical/survey, software code, video/audio Disciplines: genomic sequence, chemical spectra, bioinformatics Are there ways to more efficiently curate rare or infrequently generated data types? Might our institutions specialize in curation skills? Represent our academic expertise?

The Data Curation Network (DCN) addresses this challenge by collaboratively sharing data curation staff across a network of partner institutions and data repositories. Data Curation Network

The Data Curation Network (DCN) 3-year implementation phase launched June 2018 with funding from the Alfred P Sloan Foundation. Data Curation Network

The Data Curation Network (DCN) serves as the human layer in the data repository stack that provides specialized dataset curation and professional development training for an emerging data curator community. Data Curation Network

Katie Wilson Scientific Data Curator Ben Wiggins Digital Arts and Humanities Curator Valerie Collins DRUM coordinator Melinda Kernik GIS/Spatial Data Curator Shanda Hunt Public Health Data Curator Alicia Hofelich Mohr College of Liberal Arts Data Management Specialist

Chen Chui Data Management Consultant Johns Hopkins University Dave Fearon Data Management Consultant Johns Hopkins University

Susan Borda Data Workflows Specialist Rachel Woodbrook Data Curation Librarian

John Russell Associate Director Center for Humanities and Information Nathan Piekielek Geospatial Services Librarian

Jen Darragh RDM Consultant Duke Libraries Sophia Lafferty-Hess RDM Consultant Duke Libraries

Erin Clary Senior Curator Debra Fagan Curation and Technical Specialist

Wendy Kozlowski Data Curation Specialist Cornell University Library Erica Johns Research Data and Environmental Sciences Librarian Henrik Spoon Physics, Astronomy and Math Librarian

Heidi Imker Director, Research Data Service University of Illinois at Urbana-Champaign Hoa Luong Research Data Specialist University of Illinois at Urbana-Champaign Ashley Hetrick Research Data Specialist University of Illinois at Urbana-Champaign

DCN Workflow Uncurated Data Presenting scale and expertise challenges to individual institutions Ingest Appraise and Select DCN Facilitate Access Preserve Long-Term Curated Data at scale and with great efficiency through shared Data Curation Network

DCN Workflow Uncurated Data Presenting scale and expertise challenges to individual institutions Ingest Appraise and Select DCN Facilitate Access Preserve Long-Term Curated Data at scale and with great efficiency through shared Data Curation Network - Researchers deposit like normal

DCN Workflow Uncurated Data Presenting scale and expertise challenges to individual institutions Ingest Appraise and Select DCN Facilitate Access Preserve Long-Term Curated Data at scale and with great efficiency through shared Data Curation Network - Researchers deposit like normal - DCN functions as a microservice layer (the human layer in your repository stack )

DCN Workflow Uncurated Data Presenting scale and expertise challenges to individual institutions Ingest Appraise and Select DCN Facilitate Access Preserve Long-Term Curated Data at scale and with great efficiency through shared Data Curation Network - Researchers deposit like normal - DCN functions as a microservice layer (the human layer in your repository stack ) - Local institution maintain full responsibility for all technical functionality (eg. storage) and authority for local decision-making (what to ingest, how long to retain, etc.)

DCN Workflow Uncurated Data Presenting scale and expertise challenges to individual institutions Ingest Appraise and Select DCN Facilitate Access Preserve Long-Term Curated Data at scale and with great efficiency through shared Data Curation Network - Researchers deposit like normal - DCN functions as a microservice layer (the human layer in your repository stack ) - Local institution maintain full responsibility for all technical functionality (eg. storage) and authority for local decision-making (what to ingest, how long to retain, etc.) - Seamlessly integrates into all repository systems (Samvera, Fedora, DSpace, Dataverse, etc.)

DCN Workflow Uncurated Data Presenting scale and expertise challenges to individual institutions Ingest Appraise and Select DCN Facilitate Access Preserve Long-Term Curated Data at scale and with great efficiency through shared Data Curation Network Data Curation Network DCN Coordinator Workflow Review Assign CURATE Mediate Approve DCN Curator Workflow C Check files U Understand R Request A T Transform E Augment Evaluate for and and run missing file metadata FAIRness metadata files information formats

CURATE Steps in DCN Workflow DCN Curators will take CURATE steps for each data set, that includes: Check data files and read documentation Understand the data (try to), if not Request missing information or changes Augment the submission with metadata for findability Transform file formats for reuse and long-term preservation Evaluate and rate the overall submission for FAIRness.

DCN CURATE Steps Data Curation Network

Specialized Curation Training (2018-2020) Data Curation Network

DCN Implementation (2018-2021) Assessment Plan (two-prong) Is a networked approach to curating research data more efficient? Are curated data are more valuable? - Number of datasets - Frequency (high-volume time periods, etc.) - Variety (data file formats; range of disciplines) - Efficiency (time, costs) - Track reuse indicators (download counts, citations, alt-metrics) - Implement a DCN registry - Apply badges and metadata to signal that data sets curated by the DCN are FAIR.

In Year 3, the DCN will begin transitioning to a selfsustaining service model where institutional and disciplinary partners contribute data curation staff and central operations costs are offset by users of the Network. Data Curation Network

Data Curation-as-Service Data Curation Network

Sustainability and Expansion (2020-) Stakeholder Academic libraries with existing data curation services Benefits Gain access to data curation expertise in more disciplines/formats than locally available Academic libraries with limited to no resources for data curation services Disciplinary- and general-subject data repositories Are able to provide critical new data curation services when local resources are limited (without needing to hire); Receive better, more valuable data submissions from DCN partner institutions and customers; Have potential to partner with the DCN to expand the scope of curation support for new and/or less frequently encountered data types

6-year Roadmap toward Sustainability Support Transition from planning phase to sustaining phase Year 0 Year 1 Year 2 Year 3 Year 4 Year 5 Year 6 Sloan Grant Grant Funded (Y1-Y2) transition to partnership model (Y3) Curation-as-service (Y4-6) Timing 2016-17 2018-19 2020-21 2022-2023 Phase Planning Implementation Transition Sustaining Partners 6 academic institutions 8 academic institutions and 2 disciplinary partners Recruit new partners as use and demand dictate Mission: With a proven and appealing value-proposition, the Data Curation Network will expand into a sustainable entity that grows beyond our initial partner institutions.

Thanks! https://datacurationnetwork.org Twitter #DataCurationNetwork