Update on Dataverse Dryad-Dataverse Community Meeting. Mercè Crosas, Elizabeth Quigley & Eleni Castro. Data Science > IQSS > Harvard University

Similar documents
Dataverse 4.0 & Beyond. Eleni Castro > Ins/tute for Quan/ta/ve Social Science (IQSS), Harvard University

DATAVERSE FOR JOURNALS

Helping Journals to Upgrade Data Publications for Reusable Research

Dataverse and DataTags

Science Panel Discussion presentation: "A Data Sharing Story"

Astronomy Dataverse: enabling astronomer data publishing.

Demos: DMP Assistant and Dataverse

Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard

DATA SHARING FOR BETTER SCIENCE

A Data Sharing System

SHARING YOUR RESEARCH DATA VIA

Securing Dataverse with an Adapted Command Design Pattern. Gustavo Durand, Michael Bar-Sinai, Merce Crosas SecDev - September 26, 2017

Reproducibility and FAIR Data in the Earth and Space Sciences

Dataverse: Modular Storage and Migration to the Cloud

DOIs for Research Data

RDM, a view from Vancouver

Queen s University Library. Research Data Management (RDM) Workflow

Horizon Societies of Symbiotic Robot-Plant Bio-Hybrids as Social Architectural Artifacts. Deliverable D4.1

DRS Update. HL Digital Preservation Services & Library Technology Services Created 2/2017, Updated 4/2017

Universidad de Extremadura 22 October Elsevier s approach to Open Access and latest developments. Edward Wedel-Larsen. Research Solutions

Data Curation Profile Human Genomics

Executive Committee Meeting

The Smithsonian/NASA Astrophysics Data System

Persistent Identifier the data publishing perspective. Sünje Dallmeier-Tiessen, CERN 1

The library s role in promoting the sharing of scientific research data

Interoperability Between Institutional and Data Repositories: a Pilot Project at MIT

Reproducibility and Reuse of Scientific Code Evolving the Role and Capabilities of Publishers

Dataverse Usability Evaluation: Findings & Recommendations. Presented by Eric Gibbs Lin Lin Elizabeth Quigley

Core Technology Development Team Meeting

How to share research data

Making research data repositories visible and discoverable. Robert Ulrich Karlsruhe Institute of Technology

Specific requirements on the da ra metadata schema

Executive Committee Meeting

NRF Open Access Statement

Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices, and Funder Requirements

Click to edit Master title style

PDS, DOIs, and the Literature. Anne Raugh, University of Maryland Edwin Henneken, Harvard-Smithsonian Center for Astrophysics

Carelyn Campbell, Ben Blaiszik, Laura Bartolo. November 1, 2016

Pilot integration of an electronic lab notebook and an open source research data repository as part of a modular biomedical research data platform

re3data.org - Making research data repositories visible and discoverable

INSTITUTIONAL REPOSITORY SERVICES

Data publication and discovery with Globus

A Data Citation Roadmap for Scholarly Data Repositories

Welcome to the Pure International Conference. Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018

Minimal Metadata Standards and MIIDI Reports

Security Research at Harvard SEAS. Stephen Chong Asst. Prof. of Computer Science Harvard SEAS Cybersecurity Awareness Day Oct

Building on to the Digital Preservation Foundation at Harvard Library. Andrea Goethals ABCD-Library Meeting June 27, 2016

The Open Monolith. Keeping Your Codebase (and Your Headaches) CON3449. Matthew sbgrid.

Indiana University Research Technology and the Research Data Alliance

LIBER Webinar: A Data Citation Roadmap for Scholarly Data Repositories

ZB MED Information Center Life Sciences

Core Technology Development Team Meeting

Roy Lowry, Gwen Moncoiffe and Adam Leadbetter (BODC) Cathy Norton and Lisa Raymond (MBLWHOI Library) Ed Urban (SCOR) Peter Pissierssens (IODE Project

Ag Data Commons: Harnessing the Power of Digital Agriculture Cynthia Parr USDA ARS National Agricultural Library

EUDAT. A European Collaborative Data Infrastructure. Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT

OpenAIRE. Fostering the social and technical links that enable Open Science in Europe and beyond

Digital repositories as research infrastructure: a UK perspective

Implementation of Open-World, Integrative, Transparent, Collaborative Research Data Platforms: the University of Things (UoT)

Introducing the Springer Nature Data Support Services

Dublin Core Metadata for Research Data Lessons Learned in a Real-World Scenario with datorium


Facilitate Open Science Training for European Research

Inge Van Nieuwerburgh OpenAIRE NOAD Belgium. Tools&Services. OpenAIRE EUDAT. can be reused under the CC BY license

Scopus Development Focus

PERSISTENT IDENTIFIERS FOR THE UK: SOCIAL AND ECONOMIC DATA

Core Technology Development Team Meeting

DataBridge: CREATING BRIDGES TO FIND DARK DATA. Vol. 3, No. 5 July 2015 RENCI WHITE PAPER SERIES. The Team

Data Citation and Scholarship

Data Curation: Technical Challenges Facing Repositories. Brianna Marshall Jan. 9, 2014

Web of Science. Platform Release Nina Chang Product Release Date: March 25, 2018 EXTERNAL RELEASE DOCUMENTATION

FAIR-aligned Scientific Repositories: Essential Infrastructure for Open and FAIR Data

CrossRef tools for small publishers

Melvyl Webinar UC and OCLC Roadmap Discussion

Data Citation. DataONE Community Engagement & Outreach Working Group

SEAD Data Services. Jim Best Practices in Data Infrastructure Workshop. Cooperative agreement #OCI

Bengkel Kelestarian Jurnal Pusat Sitasi Malaysia. Digital Object Identifier Way Forward. 12 Januari 2017

Making data publication a first class research output

EUDAT Towards a Collaborative Data Infrastructure

The DOI Identifier. Drexel University. From the SelectedWorks of James Gross. James Gross, Drexel University. June 4, 2012

The Changing Role of Data Stewardship in Creating Trustworthy, Transdisciplinary High Performance Data Platforms for the Future

Introduction... 2 Preparation... 2 Implementation... 5 Operation... 7 Marketing... 9 Review and Outlook... 11

Executive Committee Meeting

WEB OF SCIENCE TM RELEASE NOTES v5.21

Digital The Harold B. Lee Library

January 16, Re: Request for Comment: Data Access and Data Sharing Policy. Dear Dr. Selby:

Basics in good research data management (RDM) for reviewing DMPs

BPMN Processes for machine-actionable DMPs

Data Curation Profile Movement of Proteins

UC Irvine LAUC-I and Library Staff Research

Building on Existing Communities: the Virtual Astronomical Observatory (and NIST)

NOW ON. Mike Takats Thomson Reuters April 30, 2013

Open web-based annotation: Creating a lightweight, portable knowledge layer over biomedicine

Illinois Data Bank Metadata Documentation

The Data Curation Profiles Toolkit: Interview Worksheet

RADAR Project. Data Archival and Publication as a Service. Matthias Razum FIZ Karlsruhe RESEARCH DATA REPOSITORIUM. Zurich, December 15, 2014

Jisc Research Data Shared Service. Looking at the past, looking to the future

Building a Data Catalog

Caring for research data and what about software? Peter Doorn, director DANS

Core Technology Development Team Meeting

EUDAT. Towards a pan-european Collaborative Data Infrastructure

Transcription:

Update on Dataverse Image credit: David Bygott (CC-BY-NC-SA) 2014 Dryad-Dataverse Community Meeting Mercè Crosas, Elizabeth Quigley & Eleni Castro Data Science > IQSS > Harvard University

Introduction to Dataverse Software framework for publishing, citing and preserving research data (open source on github for others to install) Provides incentives for researchers to share: Recognition & credit via data citations Control over data & branding Fulfill Data Management Plan requirements Harvard Dataverse (open to all repository instance at Harvard) currently has: 700 Dataverses > 1 Million Downloads 53, 857 Datasets 739,326 Files 2

Who s Using Dataverse? Worldwide Dataverse Installations Harvard Dataverse (open to all) Types of Dataverses (across all research domains) Institutions (ODUM, MIT, OCUL, ) Projects (IFPRI, PSI, COMPLETE, ) Journals (AJPS, Open Health Data, ) Researchers (Jonathan McDowell, Eric Dunipace, ) 3

Journals Working With Dataverse Option A. Journals include Dataverse as a Recommended Repository Option B. Authors Contribute Directly to a Journal Dataverse Option C. Seamless Integration btw Journal + Dataverse (e.g., OJS) 4

OJS-Dataverse Integration OJS Journal Citation to Data Journal Dataverse Citation to Article Details/Updates: Integrating w/ PKP s Open Journal Systems (Data Deposit API). Pilot with ~ 50 journals + expanding outreach. OJS Dataverse plugin now available with latest OJS release. Future: Embed Dataverse widget into journal article. http://projects.iq.harvard.edu/ojs-dvn 5

Dataverse Milestones 1999-2006: Virtual Data Center (VDC) 2006: Coding of Dataverse begins (initial focus on Social Sciences data) November 2009: Review of Economics and Statistics (RESTAT) Integration with Dataverse 2011: Expanded to include Astronomy & Astrophysics data Fall 2012: OJS & Dataverse API Integration March 2012: Dataverse 3.0 Released April-May 2013: First Usability Testing of Dataverse 2013: Expanded to include Biomedical data October 2013: Dataverse 4.0 development begins October 2013: User centered design process integrated October 2013-Present Dataverse 4.0 Development 6

How our users have influenced 4.0: Over 50 usability testing sessions Users from various disciplines participated User feature requests and reported issues via support tickets System Usability Scale Task completion rates Prototype review sessions with partners Metadata reviews with discipline/domain experts 7

Overview of Dataverse 4.0 Try our Beta site: http://dataverse-demo.iq.harvard.edu/ 8

Rigorous Data Publishing Workflows Upload Draft Dataset Note: A Published Dataset cannot be deleted (only deaccessioned, if legally needed). Publish Version 1 Authors, Title, Year, DOI, Repository, V1 Published Dataset v1 Publish Version 1.1: small metadata change; citation doesn t change. Published Dataset v1.1 Publish Version 2: File change (automatic); big metadata change; or citation changes. Authors, Title, Year, DOI, Repository, UNF, V2 Published Dataset v2 See: Altman, M., & King, G. (2007) doi:10.1045/march2007-altman 9

Expanding Metadata Support Metadata Schema Version 3.6 Version 4.0 DDI (General & Social Science)* X (v2.1) X (v.2.5) Simple Dublin Core X X Dublin Core Terms X DataCite 3.0 X Virtual Observatory (Astrophysics)** X ISA-Tab (Biomedical)*** X * Including variable level metadata found in tabular data files. ** Automatically extracts relevant metadata from the header FITS files. *** Controlled vocabulary maps to ontologies/taxonomies (OBI, NCBI, ). 10

Biomedical Metadata 11

Enhanced Faceted Search 12

Enhanced Faceted Search 13

Enhanced Faceted Search 14

Expanded Advanced Search Ability to search on specific dataset metadata fields across various domains 15

Integrated with Dataverse & Zelig For users at all statistical levels Explore data, view descriptive statistics, and estimate statistical models for files in datasets 16

Integrated with Dataverse & Zelig For users at all statistical levels Explore data, view descriptive statistics, and estimate statistical models for files in datasets 17

Integrated with Dataverse & Zelig For users at all statistical levels Explore data, view descriptive statistics, and estimate statistical models for files in datasets 18

What to Expect After 4.0 19

WorldMap Integration 1. Upload a file containing geographic data into Dataverse 2. Easily visualize the data on the WorldMap system 3. WorldMap layer embedded into dataset in Dataverse 20

WorldMap Integration 1. Upload a file containing geographic data into Dataverse 2. Easily visualize the data on the WorldMap system 3. WorldMap layer embedded into dataset in Dataverse 21

DataTags: Sharing Sensitive Data Demo: http://datatags.org/ 22

23

Full Interview 24

Collaborations & More Biomedical Metadata + Tools HSPH: TB Genomics & Molecular Epidemiology Data Harvard Stem Cell Institute FAIRport Data Astronomy Metadata + Tools Seamless Astronomy Group Harvard-Smithsonian Center for Astrophysics Also working on: Integration w/: ORCID, Open Science Framework, DataUp, DataBridge, Support for large-scale datasets with efficient data storage. 25

Data Science Team Researchers Statistical Innovation http://datascience.iq.harvard.edu Data Curation & Stewardship Information Scientists Data Science Applications and Tools Tool Building & Computer Science Software Engineers Thank you! Contact: ecastro@fas.harvard.edu; equigley@iq.harvard.edu