Dataverse 4.0 & Beyond. Eleni Castro > Ins/tute for Quan/ta/ve Social Science (IQSS), Harvard University

Similar documents
Update on Dataverse Dryad-Dataverse Community Meeting. Mercè Crosas, Elizabeth Quigley & Eleni Castro. Data Science > IQSS > Harvard University

Helping Journals to Upgrade Data Publications for Reusable Research

Dataverse and DataTags

DATAVERSE FOR JOURNALS

Demos: DMP Assistant and Dataverse

Science Panel Discussion presentation: "A Data Sharing Story"

Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard

Commi&ng to Data Quality

DATA SHARING FOR BETTER SCIENCE

Dataverse: Modular Storage and Migration to the Cloud

Welcome to the CyVerse Data Store. Manage and share your data across all CyVerse pla8orms

The Open Monolith. Keeping Your Codebase (and Your Headaches) CON3449. Matthew sbgrid.

Queen s University Library. Research Data Management (RDM) Workflow

Securing Dataverse with an Adapted Command Design Pattern. Gustavo Durand, Michael Bar-Sinai, Merce Crosas SecDev - September 26, 2017

Click to edit Master title style

Astronomy Dataverse: enabling astronomer data publishing.

A Data Sharing System

Making Research Data Public: Why, What, and How. Fall 2016

Metadata Zoo Dataset Metadata Rebecca Koskela Execu4ve Director, DataONE

Specific requirements on the da ra metadata schema

Ag Data Commons: Harnessing the Power of Digital Agriculture Cynthia Parr USDA ARS National Agricultural Library

The Avalon Media System

Integrating Selenium with Confluence and JIRA

SHARING YOUR RESEARCH DATA VIA

CoG: The NEW ESGF WEB USER INTERFACE

Working with Islandora

Dublin Core Metadata for Research Data Lessons Learned in a Real-World Scenario with datorium

ZB MED Information Center Life Sciences

The OpenAIRE Infrastructure

Curation module in action - its preliminary findings on VLO metadata quality

CAREER PATH FOR THE NEXT GENERATION RECORDS MANAGER

Dataverse Usability Evaluation: Findings & Recommendations. Presented by Eric Gibbs Lin Lin Elizabeth Quigley

globus online The Galaxy Project and Globus Online

FREYA Connected Open Identifiers for Discovery, Access and Use of Research Resources

Taylor & Francis Online. A User Guide.

Basics in good research data management (RDM) for reviewing DMPs

Data publication and discovery with Globus

5/23/18. Atomized individual items vs. Organized collec=ons (1/2) Atomized individual items vs. Organized collec=ons (2/2)

System Modeling Environment

Core Technology Development Team Meeting

DataONE Cyberinfrastructure. Ma# Jones Dave Vieglais Bruce Wilson

Horizon Societies of Symbiotic Robot-Plant Bio-Hybrids as Social Architectural Artifacts. Deliverable D4.1

EUDAT. A European Collaborative Data Infrastructure. Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT

TUTORIAL: Creating html s

RAD, Rules, and Compatibility: What's Coming in Kuali Rice 2.0

LIBER Webinar: A Data Citation Roadmap for Scholarly Data Repositories

Digital Cura+on Planning at Michigan State University

WE HAVE SOME GREAT EARLY ADOPTERS

Managed So*ware Installa1on with Munki

The Final Updates. Philippe Rocca-Serra Alejandra Gonzalez-Beltran, Susanna-Assunta Sansone, Oxford e-research Centre, University of Oxford, UK

Open data and analytics for a sustainable energy future. Version 0.2 January 23 rd, 2017

MicroStrategy Desktop

Package dataverse. June 15, 2017

Best Practice Guidelines for the Development and Evaluation of Digital Humanities Projects

Building on to the Digital Preservation Foundation at Harvard Library. Andrea Goethals ABCD-Library Meeting June 27, 2016

An innova(on developed by eosurgical

RDM, a view from Vancouver

Making data publication a first class research output

Facilitate Open Science Training for European Research

Crowdsourcing Codebook Enhancements A DDI-based Approach

Crea%ng and U%lizing Linked Open Sta%s%cal Data for the Development of Advanced Analy%cs Services E. Kalampokis, A. Karamanou, A. Nikolov, P.

Implementation of Open-World, Integrative, Transparent, Collaborative Research Data Platforms: the University of Things (UoT)

GOOGLE SHEETS TUTORIAL

Using Persistent Identifiers at

Fair data and open data: differences and consequences

OpenAIRE. Fostering the social and technical links that enable Open Science in Europe and beyond

Data Curation: Technical Challenges Facing Repositories. Brianna Marshall Jan. 9, 2014

TUTORIAL: CREATING S IN CONSTANT CONTACT

Release Notes December 2016

Welcome! Presenters: STFC January 10, 2019

Strategies for Selecting the Right Open Source Framework for Cross-Browser Testing

Survey123 Deep Dive. Presented by: Sue Enyedy-Goldner Fall 2018

Challenges on Developing Tools for Exploi=ng Linked Open Data Cubes. Kalampokis, Roberts, Karamanou, Tambouris, Tarabanis, Hermans

Paving the Rocky Road Toward Open and FAIR in the Field Sciences

AFTER TULANE. Lifelong Learning, Research and Produc4vity a8er Gradua4on. Rudolph Matas Library, Tulane University Health Sciences Center

DRS Update. HL Digital Preservation Services & Library Technology Services Created 2/2017, Updated 4/2017

Integra(ve Genomics Viewer IGV. Tom Carroll MRC Clinical Sciences Centre

Data Workflow Workshop

Clocker. Deploying Complex Applica3ons on Docker using Apache Brooklyn

Reproducibility and FAIR Data in the Earth and Space Sciences

Pilot integration of an electronic lab notebook and an open source research data repository as part of a modular biomedical research data platform

BEXIS Release Notes

re3data.org - Making research data repositories visible and discoverable

When the Need for an Ins/tu/onal Repository Gives Rise to a Federa/on

Building on Existing Communities: the Virtual Astronomical Observatory (and NIST)

Core Technology Development Team Meeting

DOIs for Research Data

Towards Open Innovation with Open Data Service Platform

CONTENTdm Users Group Meeting, May 2014 CONTENTdm Users Group Meeting, May 2014

From Continuous Integration To Continuous Delivery With Jenkins

Optional Thesis Deposit

SobekCM Digital Repository : A Retrospective

Your Open Science and Research Publishing Platform. 1st SciShops Summer School

ACCESS Health Indonesia. ACCESS Global Mee.ng February 10-13, 2014 Goa, India

The DOI Identifier. Drexel University. From the SelectedWorks of James Gross. James Gross, Drexel University. June 4, 2012

Developing Web Applications with Geocoding and Routing Services Using ArcGIS Online. Deelesh Mandloi Dmitry Kudinov Brad Niemand

Why Was Arbil Written

Management User Guide

Supporting Data Stewardship Throughout the Data Life Cycle in the Solid Earth Sciences

TUTORIAL: Creating html s

Transcription:

Dataverse 4.0 & Beyond ì Eleni Castro > Ins/tute for Quan/ta/ve Social Science (IQSS), Harvard University

2 Data Science Team Data Cura/on & Stewardship Informa/on Scien/sts Researchers Sta/s/cal Innova/on Data Science Applica/ons and Tools Tool Building & Computer Science SoCware Engineers Find out more: h$p://datascience.iq.harvard.edu

3 What is Dataverse? SoCware framework for publishing, ci/ng and preserving research data (open source on github for others to install) Provides incen8ves for researchers to share: Recogni/on & credit via data cita/ons Control over data & branding Fulfill Data Management Plan requirements Harvard Dataverse (open to all, repository instance at Harvard) currently has: 700 Dataverses > 1 Million Downloads 53,857 Datasets 739,326 Files

4 Who is using Dataverse? Worldwide Dataverse Installa8ons Ins8tu8ons can setup/host their own Dataverse installa/on (OCUL, UoA, etc) and within them can have dataverses for a variety of users (across all research domains): Researchers, Projects, Journals (OJS Dataverse integra/on), etc.

5 Streamlined Workflows Based on extensive con/nuous usability tes/ng: improved account crea/on process, dataverse setup (incl. customiza/ons), and dataset (prev. study) crea/on.

Featured Dataverses 6

7 Improved File Upload & Handling Select mul/ple files, Drag- n- Drop, Dropbox, File Previews, and extra handling for csv, tsv and excel files (no control card needed).

8 Rigorous Data Publishing Workflows Upload DraE Dataset Note: A Published Dataset cannot be deleted (only deaccessioned, with reason included (i.e., legal)). Publish Version 1 Authors, Title, Year, DOI, Repository, V1 Published Dataset v1 Publish Version 1.1: small metadata change (not cita/on); files not changed. Published Dataset v1.1 Publish Version 2: File change (automa/c); big metadata change (cita/on metadata). Authors, Title, Year, DOI, Repository, UNF, V2 See: Altman, M., & King, G. (2007) doi:10.1045/march2007- altman Published Dataset v2

9 Expanding Metadata Support Metadata Schema Version 3.6 Version 4.0 DDI (General & Social Science)* X (v2.1) X (v.2.5) Simple Dublin Core X X Dublin Core Terms X DataCite 3.0 X Virtual Observatory (Astronomy)** X ISA- Tab (Biomedical)*** X * Including variable level metadata found in tabular data files. ** Automa/cally extracts relevant metadata from the header FITS files. *** Controlled vocabulary maps to ontologies/taxonomies (OBI, NCBI, ).

Astronomy Metadata: Certain values (e.g., Type, Facility, Instrument, etc) automa/cally extracted from FITS file header. 10

Biomedical Metadata 11

Enhanced Faceted Search 12

13 Expanded Advanced Search Ability to search on specific dataverses, dataset metadata fields across various domains, and files (variables).

14 Visualize & Analyze Data: TwoRavens Integrated with Dataverse & Zelig (sta/s/cal socware) From beginners up to advanced stats users Explore data, view descrip/ve sta/s/cs, and es/mate sta/s/cal models for files in datasets

15 WorldMap Integration 1. Upload a file containing geographic data into Dataverse 2. Easily visualize the data on the WorldMap system. 3. WorldMap layer embedded into dataset in Dataverse Read more on: Data Science Blog.

16 After 4.0 ì ì Sharing Privacy Sensi/ve Data ì ì Secure Dataverse DataTags (ques/onnaires based on privacy laws) ORCID Integra/on (API) Longer- Term Large- scale datasets (efficient storage) Ensuring long- term preserva/on for more file formats (e.g., Archivema/ca)

17 Get Involved: Dataverse Community Let us know your thoughts on Dataverse 4.0 Beta in the Dataverse Google Group. Sign up to par/cipate in usability tes/ng of Dataverse 4.0 Beta by filling out this form. Contribute to our code or scripts: GitHub Pull Requests. Read our Data Science Blog for any upcoming updates and no/fica/ons. Credit: FlickrCommons

18 Thank You! Eleni Castro, Research Coordinator IQSS, Harvard University ecastro@fas.harvard.edu Dataverse 4.0 Demo: hqp://dataverse- demo.iq.harvard.edu/ Dataverse Twiqer: @thedataorg