Research Elsevier

Similar documents
Linking data and publications the past, present, and future. Dr. Hylke Koers, Head of Content Innovation, Elsevier

Linking data and publications the past, present, and future. Dr. Hylke Koers, Head of Content Innovation, Elsevier

Data Publication Services WG. Chairs: Adrian Burton (ANDS) Hylke Koers (Elsevier) - presenter

Some Ideas on Making Research Data Discoverable and Usable: It s the Metadata, Stupid!

FAIR-aligned Scientific Repositories: Essential Infrastructure for Open and FAIR Data

CODATA: Data Citation Workshop Perspectives from Editors and Publishers. Brooks Hanson Director, Publications, AGU

Universidad de Extremadura 22 October Elsevier s approach to Open Access and latest developments. Edward Wedel-Larsen. Research Solutions

Reproducibility and FAIR Data in the Earth and Space Sciences

Persistent Identifier the data publishing perspective. Sünje Dallmeier-Tiessen, CERN 1

The Role of Repositories and Journals in the Astronomy Research Lifecycle

Services to Make Sense of Data. Patricia Cruse, Executive Director, DataCite Council of Science Editors San Diego May 2017

Data Curation Profile Human Genomics

Welcome to the Pure International Conference. Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018

Roy Lowry, Gwen Moncoiffe and Adam Leadbetter (BODC) Cathy Norton and Lisa Raymond (MBLWHOI Library) Ed Urban (SCOR) Peter Pissierssens (IODE Project

The Materials Data Facility

DOIs for Research Data

Making Sense of Data: What You Need to know about Persistent Identifiers, Best Practices, and Funder Requirements

INSTITUTIONAL REPOSITORY SERVICES

DATAVERSE FOR JOURNALS

Data publication and discovery with Globus

How to make your data open

Perspectives on Open Data in Science Open Data in Science: Challenges & Opportunities for Europe

Data Discovery - Introduction

Open Science, FAIR data and effective data management

Introducing the Springer Nature Data Support Services

Enabling Open Science: Data Discoverability, Access and Use. Jo McEntyre Head of Literature Services

Callicott, Burton B, Scherer, David, Wesolek, Andrew. Published by Purdue University Press. For additional information about this book

EUDAT. Towards a pan-european Collaborative Data Infrastructure

ISMTE Best Practices Around Data for Journals, and How to Follow Them" Brooks Hanson Director, Publications, AGU

THE NATIONAL DATA SERVICE(S) & NDS CONSORTIUM A Call to Action for Accelerating Discovery Through Data Services we can Build Ed Seidel

DEVELOPING, ENABLING, AND SUPPORTING DATA AND REPOSITORY CERTIFICATION

The Data Curation Profiles Toolkit: Interview Worksheet

The Science International Accord on Open Data in a Big Data World and the IUCr's response

Advancing code and data publication and peer review. Erika Pastrana, PhD Executive Editor, Nature Journals ALPSP_Sept 2018

Making data publication a first class research output

Web of Science. Platform Release Nina Chang Product Release Date: March 25, 2018 EXTERNAL RELEASE DOCUMENTATION

Web of Science. Platform Release Nina Chang Product Release Date: December 10, 2017 EXTERNAL RELEASE DOCUMENTATION

GEOSS Data Management Principles: Importance and Implementation

Adding Research Datasets to the UWA Research Repository

European Cloud Initiative: implementation status. Augusto BURGUEÑO ARJONA European Commission DG CNECT Unit C1: e-infrastructure and Science Cloud

RADAR A Repository for Long Tail Data

SHARING YOUR RESEARCH DATA VIA

OpenAIRE. Fostering the social and technical links that enable Open Science in Europe and beyond

TEXT MINING: THE NEXT DATA FRONTIER

Feed the Future Innovation Lab for Peanut (Peanut Innovation Lab) Data Management Plan Version:

OpenAIRE From Pilot to Service The Open Knowledge Infrastructure for Europe

COALITION ON PUBLISHING DATA IN THE EARTH AND SPACE SCIENCES: A MODEL TO ADVANCE LEADING DATA PRACTICES IN SCHOLARLY PUBLISHING. Source: NSF.

Linking datasets with user commentary, annotations and publications: the CHARMe project

Data Curation Profile Movement of Proteins

Dataverse and DataTags

Institutional Repository using DSpace. Yatrik Patel Scientist D (CS)

ZB MED Information Center Life Sciences

NOW ON. Mike Takats Thomson Reuters April 30, 2013

The DOI Identifier. Drexel University. From the SelectedWorks of James Gross. James Gross, Drexel University. June 4, 2012

Indiana University Research Technology and the Research Data Alliance

Slide 1 & 2 Technical issues Slide 3 Technical expertise (continued...)

Elsevier publishing partner for the journals in Thailand

Data Curation Profile Cornell University, Biophysics

Embracing research data. Lucie Boudova, Elsevier

Data Curation Profile Plant Genomics

Data management Backgrounds and steps to implementation; A pragmatic approach.

Putting Open Access into Practice

Digital The Harold B. Lee Library

Data Citation and Scholarship

PDS, DOIs, and the Literature. Anne Raugh, University of Maryland Edwin Henneken, Harvard-Smithsonian Center for Astrophysics

Data Curation Handbook Steps

University at Buffalo's NEES Equipment Site. Data Management. Jason P. Hanley IT Services Manager

Interoperability Framework Recommendations

UC Irvine LAUC-I and Library Staff Research

Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard

Data Curation Profile Botany / Plant Taxonomy

A Brief Introduction to the Data Curation Profiles

State of the Art in Data Citation

Increasing research efficiency. Lucia Franco 15 April 2010

Integrating Data with Publications: Greater Interactivity and Challenges for Long-Term Preservation of the Scientific Record

The iplant Data Commons

ScienceDirect Lithuanian Road Show March 2013

The Smithsonian/NASA Astrophysics Data System

Data management issues and how Societies can contribute

Big Data infrastructure and tools in libraries

ProQuest Dissertations and Theses Overview. Austin McLean and Marlene Coles CGS Summer Workshop, July 2017

Evolving the digital library for digital scholarship enablement

Making research data repositories visible and discoverable. Robert Ulrich Karlsruhe Institute of Technology

Data Publishing and Data Linking Introducing SCHOLIX

ClinVar. Jennifer Lee, PhD, NCBI/NLM/NIH ClinVar

VI-SEEM Data Repository. Presented by: Panayiotis Charalambous

Lessons Learned. Implementing Rosetta in the Harold B. Lee Library

Scientific Research Data Management Policy

Publishing data?! How do you do that then?

Data Management Plans. Sarah Jones Digital Curation Centre, Glasgow

Data Management Checklist

Supporting Data Stewardship Throughout the Data Life Cycle in the Solid Earth Sciences

CrossRef tools for small publishers

Discovery to Delivery with WorldCat Local

Data Curation Profile Plant Genetics / Corn Breeding

Implementation of the CoreTrustSeal

CORE: Improving access and enabling re-use of open access content using aggregations

Paving the Rocky Road Toward Open and FAIR in the Field Sciences

DATA SHARING FOR BETTER SCIENCE

NRF Open Access Statement

Transcription:

Research Data @ Elsevier From generation through sharing and publishing to discovery IJsbrand Jan Aalbersberg SVP Journal and Data Solutions NDS, Boulder - June 12, 2014 Contributors: Anita de Waard Hylke Koers

Outline Research Data Current Status @ Elsevier Researcher Data Workflow Research Data Needs Experiments 2

Elsevier data activities started early on, with supplementary files and outward entity linking Accepting supplemental files for over a decade: Growth of articles with such files of 25-35% per year Recent snapshot showed 50% of such files contain data Suggesting in GfA to post data at repositories: When appropriate data repositories were available When supported by community and editorial boards Signing Brussels Declaration on data in 2007: Raw research data should be made freely available. Publishers encourage public posting of such data. Data sets submitted with paper should (wherever possible) be made freely accessible. Entity-linking from data identifiers in article text: Author-indicated (initially) or text-mined (sometimes) Examples: GENBANK, PDB, TAIR, mostly Life Sciences 3

% articles with supp. data (estimate) Supplemental files in research articles What kind of content is submitted as supplementary material? Methods, videos, text, references, model output, etc. Raw, experimental data Most prevalent file types: DOC PDF ZIP XLS Combination of data and other material 4

Elsevier data activities started early on, with supplementary files and outward entity linking Accepting supplemental files for over a decade: Linear growth of articles with such files of 30-40% per year Recent snapshot showed 50% of such files contain data Suggesting in GfA to post data at repositories: When appropriate data repositories were available When supported by community and editorial boards Signing Brussels Declaration on data in 2007: Raw research data should be made freely available. Publishers encourage public posting of such data. Data sets submitted with paper should (wherever possible) be made freely accessible. Entity-linking from data identifiers in article text: Author-indicated (initially) or text-mined (sometimes) Examples: GENBANK, PDB, TAIR, mostly Life Sciences 5

Over time, data sets grew in importance and availability, however they were difficult to find 6

Research data has a data discovery problem Do you think it is useful to link underlying research data with formal literature? Researcher survey, 1202 respondents (PARSE.insight 2010) 7

We started a project to use the published articles to better discover associated data sets Started collaboration with data repositories now ~40 USER SD Article SD Server articles link Repository Server data sets Based on image-based linking deep link to data set SD article asks for a data set image from repository If data available, repository shows image and link If no data available, repository shows 1-pixel transparent image (which is de-facto invisible for the user) 8

Image link displayed inside published article Deep-links to data set associated to this specific article http://www.sciencedirect.com/science/article/pii/s0370157304002753 9

Making data actionable or visualize it inside the article, increases the chance of (re-)use Bringing data functionality inside the SD article SD Article SD Server articles USER RD app Research Data Server data sets Functionality is specific to data repository / data set Application does link to repository for full functionality Data set can be interacted with in context of article Research Data Server can also be supplemental file Examples: GenBank, PDB, PANGAEA, MINT 10

Repository application inside published article Offers full Google Maps functionality here the pins deep-link to data at PANGAEA http://www.sciencedirect.com/science/article/pii/s0025322703003724 11

Deep link directly goes to associated dataset http://www.sciencedirect.com/science/article/pii/s0025322703003724 12

Application can reveal data underlying plots Interactive plots show the data behind figures (data also available for download) http://www.sciencedirect.com/science/article/pii/s0011916414001520 13

Executable papers allow immediate validation Dara and code are added, such that input files can be adapted and code can be rerun http://www.sciencedirect.com/science/article/pii/s0097849313000484 14

Up to 40 data repository linking partners http://www.elsevier.com/databaselinking 15

Current linking approach isn t scalable; an increase in repositories requires standards Data Rep,,, X Publisher Distribution Platform Data Rep 16

Current linking approach isn t scalable; an increase in repositories requires standards Data Rep,,, Data Rep X Publisher Distribution Platform,,, Publisher Distribution Platform 17

This especially holds when publishers start to support data posting at variety of repositories Publisher Submisson Platform,,, X Publisher Submission Platform Data Rep X,,, Data Rep Publisher Distribution Platform,,, Publisher Distribution Platform 18

Requires solution for article-data connections, and for data submission interoperability Publisher Submisson Platform Data Rep Publisher Distribution Platform,,, HUB? HUB?,,,,,, Publisher Submission Platform Data Rep Publisher Distribution Platform 19

Looking at Researcher Data Workflow shows that also other standards are highly needed Main Task Activities Needs Experiment Publish Re-use Plan, measure, record, analyze, annotate, store, archive, preserve, Prepare, post, submit, get reviewed, publish, get cited, get credit, Curate, search, access, analyze, Workflow and analysis tools, ELN, standards, metadata Public hosting, data space, standards, metadata Standards, metadata, analysis tools Publishers (and others) operate in all task areas Effective interoperable infrastructure needs standards Generic, discipline-specific, and data and metadata RDA, WDS, CODATA, Force11 now data citation 20

Such standards are also required to move the data from bits and bytes to fully (re-) usable Useful & Re-usable Trusted Reproducible Discoverable Comprehensible Archived Accessible Preserved Elsevier initiatives: Executable papers Microarticles Data articles Data linking Data integration Supplem. files Standard Cmt Pilot projects: Urban Legend Moonrocks 21

Urban Legend with CMU How can we make a standard neuroscience wet lab store and share their data? Incorporate structured workflows into the daily practice of a typical electrophysiology lab (the Urban Lab at CMU) What does it take? Where are points of conflict? 1-year pilot, funded by Elsevier CMU: Shreejoy Tripathy, manage/user test Elsevier: development, UI, project management 22

Urban Legend Components de Waard, A., Burton, S. et al., 2013 23

Urban Legend Data Entry App 24

Urban Legend Data Dashboard 25

Moonrocks with IEDA How can we scale up data curation? Build a database for lunar geochemistry Leapfrog & improve curation time Determine best practices and challenges Estimate costs 1-year pilot, funded by Elsevier 26

Moonrocks Data Entry 27

Urban Legend Data Dashboard 28

1) Interoperable architecture for subm and disc 2) Metadata and standards for value re-use Useful & Trusted Reproducible Discoverable Comprehensible Archived Accessible Preserved Re-usable Elsevier initiatives: Executable papers Microarticles Data articles Data linking Data integration Supplem. files Standard Cmt Pilot projects: Urban Legend Moonrocks 29