Astronomy Dataverse: enabling astronomer data publishing.

Similar documents
Update on Dataverse Dryad-Dataverse Community Meeting. Mercè Crosas, Elizabeth Quigley & Eleni Castro. Data Science > IQSS > Harvard University

The Role of Repositories and Journals in the Astronomy Research Lifecycle

Science Panel Discussion presentation: "A Data Sharing Story"

The Smithsonian/NASA Astrophysics Data System

Helping Journals to Upgrade Data Publications for Reusable Research

Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard

Demos: DMP Assistant and Dataverse

DATAVERSE FOR JOURNALS

Dataverse and DataTags

Linking Literature to Astronomy Data with DOIs

Software + Services for Data Storage, Management, Discovery, and Re-Use

DATA SHARING FOR BETTER SCIENCE

Building on Existing Communities: the Virtual Astronomical Observatory (and NIST)

DRS Update. HL Digital Preservation Services & Library Technology Services Created 2/2017, Updated 4/2017

Fair data and open data: differences and consequences

Data management Backgrounds and steps to implementation; A pragmatic approach.

Securing Dataverse with an Adapted Command Design Pattern. Gustavo Durand, Michael Bar-Sinai, Merce Crosas SecDev - September 26, 2017

THE NATIONAL DATA SERVICE(S) & NDS CONSORTIUM A Call to Action for Accelerating Discovery Through Data Services we can Build Ed Seidel

Dataverse: Modular Storage and Migration to the Cloud

Data Citation and Scholarship

Case Study: CyberSKA - A Collaborative Platform for Data Intensive Radio Astronomy

Spitzer Heritage Archive

Dataverse 4.0 & Beyond. Eleni Castro > Ins/tute for Quan/ta/ve Social Science (IQSS), Harvard University

PERSISTENT IDENTIFIERS FOR THE UK: SOCIAL AND ECONOMIC DATA

The Virtual Observatory and the IVOA

The Data Curation Profiles Toolkit: Interview Worksheet


Building on to the Digital Preservation Foundation at Harvard Library. Andrea Goethals ABCD-Library Meeting June 27, 2016

PDS, DOIs, and the Literature. Anne Raugh, University of Maryland Edwin Henneken, Harvard-Smithsonian Center for Astrophysics

Data Centres in the Virtual Observatory Age

The Open Monolith. Keeping Your Codebase (and Your Headaches) CON3449. Matthew sbgrid.

SHARING YOUR RESEARCH DATA VIA

Introducing the Springer Nature Data Support Services

The Materials Data Facility

Data Curation Profile Human Genomics

A Data Sharing System

Data Discovery - Introduction

Data Curation Profile Architectural History / Epigraphy

NDSA Web Archiving Survey

Reflections on Three Decades in Internet Time

Sustainable Governance for Long-Term Stewardship of Earth Science Data

OUR VISION To be a global leader of computing research in identified areas that will bring positive impact to the lives of citizens and society.

SEAD Data Services. Jim Best Practices in Data Infrastructure Workshop. Cooperative agreement #OCI

The IPAC Research Archives. Steve Groom IPAC / Caltech

CREATING DIGITAL REPOSITORIES PRESENTED BY CHAMA MPUNDU MFULA CHIEF LIBRARIAN NATIONAL ASSEMBLY OF ZAMBIA

DOI for Astronomical Data Centers: ESO. Hainaut, Bordelon, Grothkopf, Fourniol, Micol, Retzlaff, Sterzik, Stoehr [ESO] Enke, Riebe [AIP]

Carelyn Campbell, Ben Blaiszik, Laura Bartolo. November 1, 2016

How to share research data

The DataBridge: A Social Network for Long Tail Science Data!

National Data Sharing and Accessibility Policy-2012 (NDSAP-2012)

Metadata Ingestion and Processinng

Semantics at ADS Labs. Rahul Dave, Alberto Accomazzi Sponsored by Microsoft Research and ADS

UC Irvine LAUC-I and Library Staff Research

The e-depot in practice. Barbara Sierman Digital Preservation Officer Madrid,

FAIR-aligned Scientific Repositories: Essential Infrastructure for Open and FAIR Data

For Attribution: Developing Data Attribution and Citation Practices and Standards

DataBridge: CREATING BRIDGES TO FIND DARK DATA. Vol. 3, No. 5 July 2015 RENCI WHITE PAPER SERIES. The Team

Research Data Repository Interoperability Primer

The DOI Identifier. Drexel University. From the SelectedWorks of James Gross. James Gross, Drexel University. June 4, 2012

Indiana University Research Technology and the Research Data Alliance

Glossary: AAS American Astronomical Society AURA Association of Universities for Research in Astronomy

Dryad Curation Manual, Summer 2009

Abstract. Introduction. The Virtual Astronomy Multimedia Project

Checklist and guidance for a Data Management Plan, v1.0

Copyright is owned by the Author of the thesis. Permission is given for a copy to be downloaded by an individual for the purpose of research and

Reproducibility and FAIR Data in the Earth and Space Sciences

Data Curation Practices at the Oak Ridge National Laboratory Distributed Active Archive Center

The Canadian CyberSKA Project

DOIs for Research Data

Data management and discovery

National Aeronautics and Space Administration Jet Propulsion Laboratory California Institute of Technology WISE Archive.

LASDA: an archiving system for managing and sharing large scientific data

INDEPTH Network Introduction to NADA

Data Curation Profile Plant Genomics

Collaborative Editing and Linking of Astronomy Vocabularies Using Semantic Mediawiki

Welcome to the Pure International Conference. Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018

Mendeley Institutional Edition Group Owner Guide

I data set della ricerca ed il progetto EUDAT

SKA Regional Centre Activities in Australasia

Army Data Services Layer (ADSL) Data Mediation Providing Data Interoperability and Understanding in a

Inge Van Nieuwerburgh OpenAIRE NOAD Belgium. Tools&Services. OpenAIRE EUDAT. can be reused under the CC BY license

PrototypeofaDiscoveryToolforQuerying Heterogeneous Services

NASA and International Open Standards and the Future of Space Weather Studies

How can CLARIN archive and curate my resources?

Introduction to Bibliometrics and Tools for Organizing References. Uta Grothkopf ESO Library

Persistent Identifier the data publishing perspective. Sünje Dallmeier-Tiessen, CERN 1

A VO-friendly, Community-based Authorization Framework

Advanced Multi-Beam Spect rom et er for t he GBT

NOW ON. Mike Takats Thomson Reuters April 30, 2013


Executive Committee Meeting

DCC Disciplinary Metadata

Data Curation Profile Food Technology and Processing / Food Preservation

B2FIND and Metadata Quality

MAPR DATA GOVERNANCE WITHOUT COMPROMISE

VAPE Virtual observatory Aided Publishing for Education

Improving a Trustworthy Data Repository with ISO 16363

Design and Implementation of the Japanese Virtual Observatory (JVO) system Yuji SHIRASAKI National Astronomical Observatory of Japan

Data Citation. DataONE Community Engagement & Outreach Working Group

B2SAFE metadata management

Transcription:

Astronomy Dataverse: enabling astronomer data publishing http://theastrodata.org

Harvard-Smithsonian Center for Astrophysics

References: Nielsen, M. The Future of Science http://michaelnielsen.org/blog/the-future-of-science-2/

in the past data were hidden... 1660 Robert Hooke pre published anagram: ceiiinosssttuv ut tensio, sic vis as the tension, so the force References: Nielsen, M. The Future of Science http://michaelnielsen.org/blog/the-future-of-science-2/

in the present data live in papers References: McLaughlin et al. 2006; http://adsabs.harvard.edu/abs/2006apjs..166..249m

Tables, Tables in tar file FITS Files Code in tar file References: McLaughlin et al. 2006; http://adsabs.harvard.edu/abs/2006apjs..166..249m

Tables, Tables in tar file VizieR FITS Files Code in tar file References: McLaughlin et al. 2006; http://adsabs.harvard.edu/abs/2006apjs..166..249m

Tables, Tables in tar file VizieR FITS Files!"#$"%&'()*+*($,(-./0('*+*1 Code in tar file References: McLaughlin et al. 2006; http://adsabs.harvard.edu/abs/2006apjs..166..249m

And now for a remix... Consider Minard s charting of the demise of Napoleon s army on its roundtrip to Moscow... except instead of losing soldiers, we ask about losing data behind or in a paper... References: Charles Minard (1781-1870) (see upload log) [Public domain], via Wikimedia Commons

Losses from Data to Literature Raw data:! might already be in a telescope archive! linkage partially fixed by post-pub curation Theoretical data; Analysis codes and logs; Processed data:! Reduced data; mosaics; References: Charles Minard (1781-1870) (see upload log) [Public domain], via Wikimedia Commons

Losses (and some Gains) from Literature to Archives: Data still leaks:! data products that are not machined tables;! data in tar files;! data from external websites (linked as footnote URLs). Recovery: Post-publication curation creates or captures:! SIMBAD objects; big archive data references;! large machined tables captured by CDS. References: Charles Minard (1781-1870) (see upload log) [Public domain], via Wikimedia Commons

in the future data live... Refined data sets are published by scientists in long lived repositories; Scientist s data linked in ADS & are searchable Scientist s data is reused & cited, giving credit for that work.

http://theastrodata.org

The Dataverse Network (DVN) Project was built originally for managing Social Science Data; Collaboration between the Harvard/CfA Seamless Astronomy team and the DVN team to reuse this framework for Astronomy Data. Institutional support from Harvard Library for DVN infrastructure and training for Astronomy.

Gives ownership and recognition to data owner Generates a persistent data citation Converts data sets to a preservable and verifiable format Distributes data to the public, but also supports restricted access Indexes all metadata for quick data discovery Supports subsetting and analysis for (some) data files Can be branded as your web site. Inter-operates with other systems using standards

Network Institutional, CfA Dataverse Scientist/Project Study Data+Metadata

We are: Metadata mapping between the Data Documentation Initiative (DDI) standard used by DVN and Astronomy s VO standards; Conducting Data Interviews with Astronomers to deduce their needs; Working with NASA-SAO ADS to expose data publications; Professional Outreach Training for CfA astronomers to use platform; Working on the DVN API for search & up/downloading of data products; Working with VAO to expose internal data products to VO indexing and search.

http://figshare.com?

Why DVN? Open Source (Java) Software Stack Instantiate new Dataverse Networks: Societal, Publishing, Institutional needs. Copy our CfA work to new Astronomy DVN. Built in DVN Universe search and linking.

Why DVN? Domain Specific Metadata/Data Formats; Use Astronomy Controlled Vocabularies for Curation; Hook up DVN to VO and other Software tools. Reuse DVN API for Astronomy specific software tools

Why DVN? Friends Work with DVN developers to evolve software: Metadata/Data format support. Link Dataverse Studies NASA-ADS American Astronomical Society Publications (ApJ, AJ...)

Virtual Observatory Plugin to DVN Index individual datatypes in a published data study; Expose services for datatypes; Manage publication registration to VO.

this problem References: Ton Zijlstra; http://www.flickr.com/photos/tonz/2463875144/

http://theastrodata.org