Every Bit Counts. Publication and Citation of Data in the Earth Sciences MG&G Data Systems Advisory Committee Meeting 2009 Jens Klump et al.

Authors: Jens Klump (1), Robert Huber (2), Jan Brase (3), Michael Diepenbroek (2), Hannes Grobe (4), Beate Hildenbrand (5), Heinke Höck (6), Michael Lautenschlager (6), Uwe Schindler (2), Irina Sens (3) and Joachim Wächter (1). Affiliations: (1) GFZ Potsdam (proposed WDC-TERRA); (2) WDC-MARE, Univ. Bremen; (3) TIB Hannover (Nat. Lib. Sci. and Tech., Germany); (4) WDC-MARE, AWI Bremerhaven; (5) WDC-RSAT, DLR-DFD Oberpfaffenhofen; (6) WDC-Climate, MPI-MET Hamburg

Why Data Publication? Data-driven research has become an important part of science. Scientific communication still emulates paper-based media. Most data remain inaccessible and are at risk of being lost.

Data publication today

Use of Published Data No citation of the data source. The data source needs to be deduced from the paper. No Metadata. Often, the source of data is not acknowledged.

Data in the publication process today [Diagram with the labels: Library, Publication, Private Files, Manuscript, Data, Metadata. After Helly et al. (2003).]

The consequences Most data remain underutilised because they are not accessible. Unnecessary duplication. Research results cannot be verified. Falsification of results. Calls to make data accessible and to share data were welcomed but produced no results.

Why are data not made accessible? Data publication is hampered by structural barriers in the publication process: Journals do not devote space to data tables due to economic constraints and have no interest in archiving data. Authors do not receive professional recognition for publishing data because the datasets cannot be cited in a reliable way. Data are not cited because their location (URL), in many cases, is transient.

Necessary steps Data need to be citable to be valuable. Reputation is the currency of science. Authors will only prepare data for publication if the effort is worthwhile. Data publication is labour intensive. Data must be accessible. Access through persistent identifiers and long-term archives. Intellectual property rights need to be secured. Authors need full control of their publications.

Project Publication and Citation of Scientific Primary Data Funded by the German Science Foundation. Project partners: WDC-MARE (Bremen/Bremerhaven), WDC-Climate (Hamburg), GFZ Potsdam (proposed WDC-TERRA), WDC-RSAT (Oberpfaffenhofen). Implementation of services for the publication of data. DOI registration agency at the German National Library for Science and Technology (TIB Hannover). To date 18 DOI registration agents. Inclusion of data publications into library catalogues.

STD-DOI System Architecture

Digital Preservation & Trust Creation of digital information continues to accelerate! Practical digital preservation/curation efforts are just starting. Who can guarantee the long-term availability, authenticity and integrity of digital information? Who is trustworthy? Which institutions, approaches and technologies can be trusted? Evaluation and audit methods have been developed and are now in an ISO standardisation process.

Data Publication as a Supplement to Literature TIBORDER catalogue of the German National Library of Science and Technology. doi:10.1594/gfz.sddb.1043 at the ICDP Scientific Drilling Database.
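
A DOI citation string like the one on this slide becomes a working link once it is handed to a DOI resolver. The following is a minimal sketch under assumptions: the helper name `doi_to_url` is hypothetical, and the public resolver at https://doi.org/ is not named in the slides.

```python
from urllib.parse import quote

RESOLVER = "https://doi.org/"  # assumed public DOI resolver, not named in the slides


def doi_to_url(doi: str) -> str:
    """Turn a DOI citation string into a resolver URL (hypothetical helper)."""
    doi = doi.strip()
    # Drop an optional "doi:" prefix, case-insensitively.
    if doi.lower().startswith("doi:"):
        doi = doi[4:].strip()
    # A DOI starts with the directory indicator "10." and has a prefix/suffix split.
    if not doi.startswith("10.") or "/" not in doi:
        raise ValueError(f"not a DOI: {doi!r}")
    # Percent-encode characters that are unsafe in a URL path, keeping the slash.
    return RESOLVER + quote(doi, safe="/")


print(doi_to_url("doi:10.1594/gfz.sddb.1043"))
# prints https://doi.org/10.1594/gfz.sddb.1043
```

Because the citation only carries the identifier, the location behind it can change without breaking the reference; only the resolver entry has to be updated.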

Data Publication as Independent Work

DOI metadata The STD-DOI metadata are mainly Dublin Core elements, plus data-specific elements. The metadata are transmitted to the National Library via a web service (HTTP/SOAP) and incorporated into the library catalogue. The metadata may contain references to other objects through the element <RelatedIdentifier>; relation types include iscited, isparent, ischild and isduplicate.
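
To make the shape of such a record concrete, here is a rough sketch that assembles a small XML record with a few Dublin Core style fields and one `<RelatedIdentifier>` link. It is not the actual STD-DOI schema: apart from `<RelatedIdentifier>` and the relation name `iscited`, the element names, the `relation` attribute, the function `build_record` and all identifiers are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET


def build_record(doi, title, creators, related):
    """Build a small XML metadata record (element names are illustrative only)."""
    record = ET.Element("resource")
    ET.SubElement(record, "identifier").text = doi
    ET.SubElement(record, "title").text = title
    for name in creators:
        ET.SubElement(record, "creator").text = name
    # Each link stores its relation type in an (assumed) "relation" attribute.
    for relation, target in related:
        link = ET.SubElement(record, "RelatedIdentifier", relation=relation)
        link.text = target
    return record


# All values below are made-up placeholders, not real registrations.
record = build_record(
    doi="10.9999/example.dataset",
    title="Example dataset",
    creators=["Doe, J."],
    related=[("iscited", "10.9999/example.paper")],
)
print(ET.tostring(record, encoding="unicode"))
```

A record built this way could then be serialised and passed to whatever transmission service the registration agency provides; that step is left out here because the slides do not describe the service interface.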

Links to other sources The element <RelatedIdentifier> can be used to point to other electronic objects: Point to the literature where the data set is interpreted. Point to samples from which the data were derived. Point to other datasets that belong to the same collection of datasets. These links can be used by machines (e.g. data portals) to make search suggestions and thus aid discovery of data, literature and samples, or to support other added-value services.
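
How a data portal might turn such links into search suggestions can be sketched in a few lines. This is only an illustration under assumptions: the functions `build_index` and `suggest` are hypothetical, the identifiers are made-up placeholders, and the rule "suggest whatever shares a linked object" is one simple choice among many.

```python
from collections import defaultdict

# (source identifier, relation, target identifier); all values are made-up placeholders.
LINKS = [
    ("doi:10.9999/dataset.a", "iscited", "doi:10.9999/paper.1"),
    ("doi:10.9999/dataset.b", "iscited", "doi:10.9999/paper.1"),
    ("doi:10.9999/dataset.a", "ischild", "doi:10.9999/collection.x"),
]


def build_index(links):
    """Map each identifier to the identifiers it is linked with, in either direction."""
    index = defaultdict(set)
    for source, _relation, target in links:
        index[source].add(target)
        index[target].add(source)
    return index


def suggest(identifier, index):
    """Suggest objects that share at least one linked object with `identifier`."""
    suggestions = set()
    for neighbour in index.get(identifier, ()):
        suggestions |= index[neighbour]
    suggestions.discard(identifier)
    return sorted(suggestions)


index = build_index(LINKS)
print(suggest("doi:10.9999/dataset.a", index))
# prints ['doi:10.9999/dataset.b']
```

Here the two datasets are suggested to each other because the same paper is linked to both, which is the kind of discovery aid the slide describes.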

Information Discovery [Figure with callouts: link to publication; citation of data; IGSN points to sample.]

Technical Questions Granularity: What size of dataset should be included in the catalogue? Which child elements of a collection of data objects should be identifiable through a DOI? Quality control: technical QC (syntactic), peer review (semantic). Persistence: Data must be available forever.

How can I participate? Point of contact: TIB Hannover. Roles: Data Creator -> Author; Data Centre -> Data Publication Agent; Library -> Publication Agent for Grey Literature. The STD-DOI service is part of the TIB Hannover infrastructure. The STD-DOI consortium is open to new members.

Summary Data publication is a good idea, but it needs persistent identifiers, long-term archives, and incentives for authors to publish their data. Data publications can, and should, be cited. Links pointing to literature, samples, and related data can be used to aid discovery. The challenge is to integrate this into our scientific culture; the technology is there. You are welcome to participate!