An introduction to data publications

Similar documents
Data Publication. HGF Alliance: Remote Sensing and Earth System Dynamics

Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices, and Funder Requirements

re3data.org Registry of Research Data Repositories Peter Schirmbacher Humboldt-Universität zu Berlin ETD Hong Kong, September 25.

ISMTE Best Practices Around Data for Journals, and How to Follow Them" Brooks Hanson Director, Publications, AGU

Reproducibility and FAIR Data in the Earth and Space Sciences

re3data.org Registry of Research Data Repositories

Every Bit Counts. Publication and Citation of Data in the Earth Sciences MG&G Data Systems Advisory Committee Meeting 2009 Jens Klump et al.

COALITION ON PUBLISHING DATA IN THE EARTH AND SPACE SCIENCES: A MODEL TO ADVANCE LEADING DATA PRACTICES IN SCHOLARLY PUBLISHING. Source: NSF.

FAIR-aligned Scientific Repositories: Essential Infrastructure for Open and FAIR Data

Pilot Implementation: Publication and Citation of Scientific Primary Data

INTEGRATION OPTIONS FOR PERSISTENT IDENTIFIERS IN OSGEO PROJECT REPOSITORIES

CODATA: Data Citation Workshop Perspectives from Editors and Publishers. Brooks Hanson Director, Publications, AGU

Making research data repositories visible and discoverable. Robert Ulrich Karlsruhe Institute of Technology

Using digital library techniques - Registration of scientific primary data -

Archivierung und Publikation von Forschungsdaten mit RADAR

re3data.org - Making research data repositories visible and discoverable

DOIs for Research Data

Reproducibility and Reuse of Scientific Code Evolving the Role and Capabilities of Publishers

DATAVERSE FOR JOURNALS

The DOI Identifier. Drexel University. From the SelectedWorks of James Gross. James Gross, Drexel University. June 4, 2012

Taylor & Francis Online. A User Guide.

RADAR A Repository for Long Tail Data

CODE AND DATA MANAGEMENT. Toni Rosati Lynn Yarmey

National Data Sharing and Accessibility Policy-2012 (NDSAP-2012)

Welcome to the Pure International Conference. Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018

GEOSS Data Management Principles: Importance and Implementation

Striving for efficiency

Institutional Repository using DSpace. Yatrik Patel Scientist D (CS)

WE HAVE SOME GREAT EARLY ADOPTERS

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal

Robin Wilson Director. Digital Identifiers Metadata Services

Science Europe Consultation on Research Data Management

Developing a Research Data Policy

PERSISTENT IDENTIFIERS FOR THE UK: SOCIAL AND ECONOMIC DATA

Specific requirements on the da ra metadata schema

State of the Art in Data Citation

The DataCite Metadata Schema. Frauke Ziedorn Workshop: Metadata and Persistent Identifiers for Social and Economic Data 7th May 2012

PDS, DOIs, and the Literature. Anne Raugh, University of Maryland Edwin Henneken, Harvard-Smithsonian Center for Astrophysics

Quality Assured (QA) data

Services to Make Sense of Data. Patricia Cruse, Executive Director, DataCite Council of Science Editors San Diego May 2017

National Snow and Ice Data Center. Plan for Reassessing the Levels of Service for Data at the NSIDC DAAC

Dataset Documentation Reference Guide for Pure Users

AN INFORMATION SYSTEM FOR RESEARCH DATA IN MATERIAL SCIENCE

National Snow and Ice Data Center. Plan for Reassessing the Levels of Service for Data at the NSIDC DAAC

Paving the Rocky Road Toward Open and FAIR in the Field Sciences

Callicott, Burton B, Scherer, David, Wesolek, Andrew. Published by Purdue University Press. For additional information about this book

Data Curation Profile Human Genomics

Trusted Digital Archives

For many years, the creation and dissemination

Arctic Data Center: Call for Synthesis Working Group Proposals Due May 23, 2018

Introduction to Data Management for Ocean Science Research

Technical documentation. SIOS Data Management Plan

Roy Lowry, Gwen Moncoiffe and Adam Leadbetter (BODC) Cathy Norton and Lisa Raymond (MBLWHOI Library) Ed Urban (SCOR) Peter Pissierssens (IODE Project

Advancing code and data publication and peer review. Erika Pastrana, PhD Executive Editor, Nature Journals ALPSP_Sept 2018

RADAR Project. Data Archival and Publication as a Service. Matthias Razum FIZ Karlsruhe RESEARCH DATA REPOSITORIUM. Zurich, December 15, 2014

For Attribution: Developing Data Attribution and Citation Practices and Standards

RADAR. Establishing a generic Research Data Repository: RESEARCH DATA REPOSITORY. Dr. Angelina Kraft

Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard

RN Workshop Series on Innovations in Scholarly Communication: plementing the Benefits of OAI (OAI3)

Copernicus Space Component. Technical Collaborative Arrangement. between ESA. and. Enterprise Estonia

Review of Implementation of the Global Research Council Action Plan towards Open Access to Publications

Data Management Checklist

Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation

Data Curation Handbook Steps

bwfdm Communities - a Research Data Management Initiative in the State of Baden-Wuerttemberg

5/16/2018. Researcher Challenges with Data Use. AGU s position statement on data affirms that

Introducing the Springer Nature Data Support Services

The State of Arctic Data the IPY experience

Data Curation Profile Movement of Proteins

Data Citation and Scholarship

EUDAT-B2FIND A FAIR and Interdisciplinary Discovery Portal for Research Data

InfoSci -Databases Platform

Bengkel Kelestarian Jurnal Pusat Sitasi Malaysia. Digital Object Identifier Way Forward. 12 Januari 2017

COAR Interoperability Roadmap. Uppsala, May 21, 2012 COAR General Assembly

Hello, I m Melanie Feltner-Reichert, director of Digital Library Initiatives at the University of Tennessee. My colleague. Linda Phillips, is going

HRA Open User Guide for Awardees

ODC and future EIDA/ EPOS-S plans within EUDAT2020. Luca Trani and the EIDA Team Acknowledgements to SURFsara and the B2SAFE team

Climate Risk & National Security: People, not Polar Bears

Data and information sharing WMO global systems

28 September PI: John Chip Breier, Ph.D. Applied Ocean Physics & Engineering Woods Hole Oceanographic Institution

The Common Framework for Earth Observation Data. US Group on Earth Observations Data Management Working Group

Report to Plenary on item 3.1

EUDAT & SeaDataCloud

Writing a Data Management Plan A guide for the perplexed

file:///g:/help/index.html DSPACE HELP Browse Search Advanced Search Subject Category Search Communities Collections Sign on to DSpace Submit

Research Elsevier

DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM

DOIs for Scientists. Kirsten Sachs Bibliothek & Dokumentation, DESY

Objectives of the Webometrics Ranking of World's Universities (2016)

Research Data Management: lessons learned - and still to learn

CrossRef tools for small publishers

JCOMM Observing Programme Support Centre

SESAR, IGSN, & a vision for a Repository Portal and Hosted Collection Management

The CEDA Archive: Data, Services and Infrastructure

Towards repository interoperability

Introduction to INEXDA s Metadata Schema

The Institutional Repository and Minho University OA Policy. Eloy Rodrigues

INFORMATION NOTE. United Nations/Germany International Conference

Dryad Curation Manual, Summer 2009

CMIP5 Datenmanagement erste Erfahrungen

Transcription:

An introduction to data publications Kirsten Elger Deutsches GeoForschungsZentrum GFZ, Potsdam, kirsten.elger@gfz-potsdam.de

Research Data Research data are essential for scientific research Many datasets, e.g. observational data, are irreplaceable With the advent of the internet, there is a significant change in the way to collect, manage, and archive research datasets

Observations on: meteorology, geomagnetism, auroral phenomena, ocean currents, tides, structure and motion of ice and atmospheric electricity So extensive and dangerous a work Eleven nations established 14 principal research stations across the Polar Regions. 12 were in the Arctic, along with at least 13 auxiliary stations. Over 700 men incurred the dangers of Arctic service to establish and relieve these stations between 1881 and 1884.

Geological field work in 1995 GPS values

data publication in 1995

and after the end of the project? the bad case: the phd student/ postdoc takes the data with him or her (on a floppy disc/ CD) and, years later, throws everything away Slightly better: data submission (in digital or analogue form) to a computer of the department, with or without data description (depending on the time and motivation of the respective scientist What happens when the professor or lab PI retires? Who takes care of the hard drives with the old data? Who takes care of paper copies of maps or other datasets? How long may rock samples be archived after the scientist left?

Research Data Today Thanks to the internet many datasets are available online very fast data access, even to large datasets online access to journal articles online-only journals are coming of age real-time data

Real-time data example: climate station in Alaska (air, surface, shallow ground temperatures) Quelle: Permafrost Lab, UAF, Fairbanks http://permafrost.gi.alaska.edu/

GEOFON earthquake information service GEOFON Live Seismograms

NOAA (National Ocean and Athmosphere Administration): Synoptic meterological records of the first IPY ín digital form (surface air temp, sea level pressure 1-year time series) extensive documentary image collection Overview on IPY reports Posters Online available for download: www.arctic.noaa.gov/aro/ipy-1

as a consequence With the advent of the digital era and the internet, data sets increasingly grow in size and complexity Data reuse and data mining are becoming more and more important Metadata portals (with automatically generated standardised metadata) are more and more important for data discoverygetting There is an incrasing number of data repositories and for all types of research data There is increasing expectation by the scientific community, funding agencies and the public to make publicly-funded research results and data free and open accessible without any constraints.

Politics 2003 Berliner Erklärung über den offenen Zugang zu wissenschaftlichem Wissen: Open Access- Veröffentlichungen umfassen originäre wissenschaftliche Forschungsergebnisse ebenso wie Ursprungsdaten, Metadaten, Schwerpunktinitiative Digitale Information der Allianz der deutschen Wissenschaftsorganisationen: Die Verfügbarkeit und Nachnutzung digitaler Informationen schließt den möglichst kostenfreien und offenen Zugang zu Forschungsdaten ein.. Digitale Agenda der Bundesregierung 2013-2017

Helmholtz Open Science Open science, the unrestricted access to scientific publications and cultural heritage, is an ongoing and future trend in the scientific landscape worldwide. Research publications and other digital objects such as research data and scientific software will thus be publicly available on the internet. The Helmholtz Association was one of the initial signatories of the Berlin Declaration on Open Access to Knowledge in the Sciences and Humanities in 2003. This commitment towards open access was then formally approved by its Assembly of Members (assembly of the directors of the Helmholtz Centres): Publications from the Helmholtz Association shall in future, without exception, be available free of charge, as far as no conflicting agreement with publishers or others exists. (Resolution of the Assembly of Members, 27 September 2004).

Obstacles of sharing too much work with no benefit data publications were deleted from reference lists by journal editors they mis-interpret or mis-use my data someone will publish MY data before me Do I have to share ALL my data? www.aukeherrema.nl

Domains of research data PRIVATE DOMAIN SHARED DOMAIN PERMANENT DOMAIN PUBLIC DOMAIN Think about data sharing from the beginning on!

Intelligent Openness (Royal Soc. London 2012) The practice of science: Open inquiry is at the heart of the scientific enterprise. Publication of scientific theories - and of the experimental and observational data on which they are based - permits others to identify errors, to support, reject or refine theories and to reuse data for further understanding and knowledge. Science s powerful capacity for self-correction comes from this openness to scrutiny and challenge How to make intelligent openness standard? data must be accessible and readily located Data must be intelligible for those who wish to scrutinize them They must be assessable so that judgments can be made about their reliability and the competence of those who created them They must be usable by others For data to meet these requirements it must be supported by explanatory metadata (data about data) Science as an open enterprise (2012) The Royal Society Science Policy Centre report 02/12 ISBN: 978-0-85403-962-3

There is a need for. Researchers willingness to publish their data Technical solutions to facilitate data availability, access and reuse Recognition and credits for data producers

Data publication with DOI persistent citable with metadata

DataCite and Digital Object Identifiers (DOI) for Data STD DOI "Publikation und Zitierbarkeit von Primärdaten" (DFG Project 2004-2009, Partner: TIB, DKRZ, PANGAEA, DLR, GFZ) DOI for research data DataCite

What is a DOI Digital Object Identifier A unique and permanent identifier for digital objects Signpost to the URL with the dataset and its description = landing page Persistent = long term data access guaranteed by the publisher With metadata

Metadata and Metadata Metadata for data discovery: example DOI landing page title citation description/ abstract download data files Keywords standardised metadata related work spatial coverage

Metadata and Metadata Metadata for data discovery author, title, description, keywords, spatial/temporal domain,... Structural metadata (for reuse): formats, methodology, sources Definition of data labels

Metadata and Metadata metadata for data discovery author, title, description, keywords, spatial/temporal domain,... structural metadata (for reuse) formats, methodology, sources, processing steps, administrative metadata metadata related to the use, management, and encoding processes of digital objects over a period of time Includes technical metadata: versions, checksum, timestamp,

A comprehensive data description is essential for data reuse and should always be available before a DOI registration There are different possibilities for data publication

Examples for data publication 1 data supplements to scientific articles Links to datasets Link to original article with data description

Examples for data publication 2: Data Journals Peer-reviewed articles with the description of datasets or collections, etc.

3. Data Reports GFZ examples Institutional Report Series have long traditions as important sources of information. Today: persistently online accessible and citable with DOI GFZ: Data Reports Flexible format enhanced data description standardised templates for each discipline, internal review Project-specific design if required

Coalition on Publishing Data in the Earth and Space Sciences GOAL OPEN DATA in the EARTH and SPACE SCIENCES STATEMENT OF COMMITMENT To promote metadata information and domain standards, [ ], to help simplify and standardize deposition and reuse. To promote referencing of data sets using the Joint Declaration of Data Citation Principles, in which citations of data sets should be included within reference lists. To include in research papers concise statements indicating where data reside and clarifying availability. To promote and implement links to data sets in publications and corresponding links to journals in data facilities via persistent identifiers. (January 2015)

SIGNATURES (Nov 2015) additional signatures welcome

Conclusions Data are increasingly recognized as part of the scholarly record, data citation is coming of age. Data publications with assigned DOI provide citable and persistent access to research data. There is a growing number of data repositories to store and access data (institutional, domain specific, general). Data description is essential for reuse

Next step International Geo Sample Number IGSN unique identifier for physical objects