PDS, DOIs, and the Literature Anne Raugh, University of Maryland Edwin Henneken, Harvard-Smithsonian Center for Astrophysics
A Brief Introduction to DOIs History DOI = Digital Identifier for an Object Began c. 1998; ISO standard (26324) as of 2010 The International DOI Foundation (IDF) supports DOI resolution, metadata definition, interoperability. doi.org Benefits Permanent identifier When a resource moves, the metadata is updated Clear, permanent linkage to metadata Searchable metadata database to resolve queries to object identification and location
DOIs in the Literature DOIs assigned to articles, chapters, books DOIs included in reference style sheets Aggregators, like the CrossRef, integrate metadata from the DataCite and other databases for subscribers to mine
DOIs for Data Sets DataCite was founded in 2009 to provide DOI services for research data. Metadata schema, now version 4.1 Public metadata database available for searching Citable data become legitimate contributions to scholarly communication, paving the way for new metrics and publication models that recognize and reward data sharing. (datacite.org/mission.html, 28 May 2018) Contributor Roles development
Data Sets in the Literature Publishing Data Pre-digital Post-digital Mentioning Data Informal, inconsistent and often incomplete identification Acknowledging Data Usually more complete, but inconsistent style - hard to recognize programmatically Citing Data Formal, but rare
Data Sets and DOIs Permanent, formal identifier for the specific data set Resolution service to lead users directly to the associated data Metadata to support broad searching through the literature Standardized reference information
PDS and Data Sets No data set is accepted for archiving in the PDS until it passes a full, formal, external peer review for both PDS standards compliance as well as scientific completeness, usability, and significance. PDS is committed to maintain that data set in a usable form for as long as there is a PDS. PDS archive standards include extensive metadata to document file structure, provenance, observing circumstances, and science context.
PDS and DOIs Metadata collection The DataCite metadata schema and the PDS4 metadata do not yet align. They will. Coining a new DOI The DataCite database enables reserving DOIs in advance of publication PDS records and updates metadata as part of data review and acceptance DOIs are available for use in manuscripts prepared in parallel with data Subsequent versions New version = New DOI The DOI metadata includes documented relationships with other DOIs.
PDS, DOIs, and the Literature DOIs provide the stable link needed for citation PDS4 IM and infrastructure (can) support DOI generation PDS4 infrastructure can support DOI resolution Through formal citation, data sets can be linked to papers and conversely Enables reproducibility for associated analysis Supports and documents the value of archive development work Metrics can be calculated for data sets as for journal articles Career credit for data preparers Impact statistics for ROI of funding programs
Next Steps Analysis papers without data set citations should be problematic. Where an established archive has rigorous review policies for acceptance, data sets from these sources should be acknowledged as refereed publications. Similarly, institutions and employers should recognize data set publication by these rigorous archives as being a career achievement consistent with publication of a refereed paper.
For More Information: Astrophysics Data System (ADS) DataCite International DOI Foundation Planetary Data System https://ui.adsabs.harvard.edu/ https://datacite.org/ https://www.doi.org/ https://pds.nasa.gov/ Questions?