State of the Art in Ethno/ Scientific Data Management
1 State of the Art in Ethno/ Scientific Data Management Ruth Duerr This work is licensed under a Creative Commons Attribution v4.0 License.
2 Overview
- Data management layers
- Data lifecycle
- Levels of data
- Citation and the data landscape
3 Renear, A. H., Sacchi, S., & Wickett, K. M. (2010). Definitions of dataset in the scientific and technical literature. Proceedings of the American Society for Information Science and Technology, 47(1), 1-4.
4 Data Management Layers
Layers, with Examples and Implication for PI:
- Curation. Examples: Future JHU Data Archive and other DCS instances, NSIDC Sea Ice Index, ICPSR comparison sets. Implication for PI: new query capabilities; cross-disciplinary re-use possible.
- Preservation. Examples: JHU Data Archive, Portico, ICPSR. Implication for PI: ability to use own data in the future (e.g. 5 yrs); data sharing.
- Archiving. Examples: CUAHSI, FigShare, Dataverse. Implication for PI: system provides identifiers for sharing, references, fixity, backups, etc.
- Storage. Examples: server in lab, project website, Amazon S3. Implication for PI: responsible for restore, sharing, staffing.
6 Data Management Layers
Layers, with Examples and Implication for PI:
- Curation. Examples: NSIDC Sea Ice Index, ICPSR comparison sets, Protein Data Bank. Implication for PI: new query capabilities; cross-disciplinary re-use.
- Preservation. Examples: JHU Data Archive, Portico, ICPSR. Implication for PI: ability to use own data in the future (e.g. 5 yrs); data sharing.
- Archiving. Examples: CUAHSI, FigShare, Dataverse. Implication for PI: system provides identifiers for sharing, references, fixity, backups, etc.
- Storage. Examples: server in lab, project website, Amazon S3. Implication for PI: responsible for restore, sharing, staffing.
7 The Remote Sensing Record
- Satellite-based passive microwave sensors have been measuring sea ice since 1972
- Consistent collection of data started in 1978 with the SMMR series of instruments
- Why passive microwave?
  - Distinguishing sea ice from ocean is straightforward
  - Passive microwave works through clouds and in the dark
- Initial user base was cryospheric scientists
Figure: Arctic sea ice concentration in April 2004, calculated from data measured by the Special Sensor Microwave/Imager (SSM/I) on the Defense Meteorological Satellite Program (DMSP) satellite. The image is centered over the North Pole, with continents shown in green. Image courtesy of Florence Fetterer and Ken Knowles, National Snow and Ice Data Center, University of Colorado, Boulder, CO.
Evolution of sea ice data products at NSIDC, presented by Ruth Duerr, March 10, 2015, RDA 5th Plenary, San Diego
10 I2S2 Partners, Idealized Scientific Research Activity Lifecycle Model [2011]
11 Levels of data
- There may have been several steps in processing data to create the data products that are actually used by a particular community
- In many cases the "raw" data may only be useful to specialists (e.g., algorithm developers)
- Typically, more processed data are useful to different and often broader audiences
- NASA defined processing levels for their satellite data decades ago
- Now other fields are beginning to think about reusing this concept for their own types of data
12 Data levels for NASA data vs textual criticism
Level 0
- NASA: Unprocessed instrument data at full resolution.
- Text criticism: Unprocessed text images.
Level 1
- NASA: Unprocessed instrument data at full resolution, time referenced, and annotated with ancillary information, including radiometric and geometric calibration coefficients and georeferencing parameters (i.e., platform ephemeris) computed and appended but not applied to the Level 0 data.
- Text criticism: Unprocessed text images, annotated with the identification of the hardware and software used, any configuration or calibration information, the time and place of the scanning, the organization or persons conducting the imaging, and a [nondescriptive] identification of the object imaged.
Level 2
- NASA: Derived environmental variables (e.g., ocean wave height, soil moisture, ice concentration) at the same resolution and location as the Level 1 source data.
- Text criticism: Derived representation of text content and structure, mapped to locations in the Level 1 source data.
Level 3
- NASA: Variables mapped on uniform space-time grid scales, usually with some completeness and consistency properties (e.g., missing points interpolated, complete regions mosaicked together from multiple orbits).
- Text criticism: Representation of textual content and structure mapped on to (perhaps multiple) carriers with described structure (e.g., physical bibliography), and expansion of abbreviations, interpolation of missing text.
Renear, A. H., Dolan, M., Trainor, K. and Cragin, M. H. (2009), Towards a cross-disciplinary notion of data level in data curation. Proc. Am. Soc. Info. Sci. Tech., 46: 1-8. doi: /meet
13 A brief history of data citation Data citation used to be common practice What!!?
14 A brief history of data citation Data was in the literature! In Books and Technical Reports
15 A brief history of data citation Data was in the literature! and Journals
16 A brief history of data citation
- That started changing with the advent of digital data
- At first because the publications were still paper
- Why would you want to make your data less accessible to the computers needed to analyze it?
- Now how do you represent a multi-dimensional data set in a two-dimensional medium?
- Later because often the data was voluminous
17 A brief history of data citation
- Digital data repositories came into being in the final decades of the 20th century
- Many collocated with existing data centers (e.g., World Data Centers set up during the International Geophysical Year 1957/8)
- Many have been promoting data citation for decades
18 A brief history of data citation By 2013, many groups had been working on data citation guidelines and principles for many years. Adapted from a slide by Maryann Martone.
19 A brief history of data citation Photo: Flickr Paul Uhlir...a plea to come together
20 Joint Declaration of Data Citation Principles
- Importance: Data should be considered legitimate, citable products of research. Data citations should be accorded the same importance in the scholarly record as citations of other research objects, such as publications.
- Credit and Attribution: Data citations should facilitate giving scholarly credit and normative and legal attribution to all contributors to the data, recognizing that a single style or mechanism of attribution may not be applicable to all data.
- Evidence: In scholarly literature, whenever and wherever a claim relies upon data, the corresponding data should be cited.
- Unique Identification: A data citation should include a persistent method for identification that is machine actionable, globally unique, and widely used by a community.
- Access: Data citations should facilitate access to the data themselves and to such associated metadata, documentation, code, and other materials, as are necessary for both humans and machines to make informed use of the referenced data.
- Persistence: Unique identifiers, and metadata describing the data, and its disposition, should persist -- even beyond the lifespan of the data they describe.
- Specificity and Verifiability: Data citations should facilitate identification of, access to, and verification of the specific data that support a claim. Citations or citation metadata should include information about provenance and fixity sufficient to facilitate verifying that the specific time slice, version and/or granular portion of data retrieved subsequently is the same as was originally cited.
- Interoperability and Flexibility: Data citation methods should be sufficiently flexible to accommodate the variant practices among communities, but should not differ so much that they compromise interoperability of data citation practices across communities.
Data Citation Synthesis Group. (2014). Joint Declaration of Data Citation Principles.
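The Specificity and Verifiability principle asks that citation metadata carry enough fixity information to check that data retrieved later is the same as the data originally cited. The following is a minimal sketch of that idea in Python, not something taken from the slides: the `DataCitation` class, its field names, the choice of SHA-256 as the fixity algorithm, the sample bytes, and the identifier `doi:10.0000/example` are all assumptions made for illustration.

```python
import hashlib
from dataclasses import dataclass


def sha256_of(data: bytes) -> str:
    """Return the SHA-256 hex digest used here as the fixity value."""
    return hashlib.sha256(data).hexdigest()


@dataclass(frozen=True)
class DataCitation:
    """Hypothetical citation record: descriptive fields plus a fixity hash."""
    creators: str
    year: int
    title: str
    version: str
    repository: str
    identifier: str      # placeholder persistent identifier, not a real DOI
    fixity_sha256: str   # digest of the data as originally cited

    def text(self) -> str:
        """Render a human-readable citation string."""
        return (f"{self.creators} ({self.year}). {self.title}. "
                f"Version {self.version}. {self.repository}. {self.identifier}")

    def verify(self, retrieved: bytes) -> bool:
        """True if the retrieved bytes match the originally cited bytes."""
        return sha256_of(retrieved) == self.fixity_sha256


# Made-up sample bytes standing in for the cited data file.
original = b"id,value\n1,10\n2,20\n"
citation = DataCitation(
    creators="Doe, J. and R. Roe",
    year=2001,
    title="The FOO Gridded Time Series Data Set",
    version="3.2",
    repository="The FOO Data Center",
    identifier="doi:10.0000/example",
    fixity_sha256=sha256_of(original),
)
```

Calling `citation.verify(...)` on an unchanged copy returns `True`, while any altered copy fails the check. A real repository would also need to decide which serialization of the data the hash is computed over.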
21 Data Citation Implementer's Group
Work in 4 areas:
- NISO JATS.
- Identifiers and associated metadata.
- Common repository interfaces.
- Putting together and analyzing some exemplar journal workflows with suggestions on how the editorial process can deal with data citations, to provide context and analysis of commonality for the other tasks.
22 Data Citation in the NISO-JATS DTD
- NISO-JATS is an open standard for representing full-text articles in XML
- Widely used, particularly (but not only) in the life sciences
- Technical Workshop: June 2014, London
  - 18 (publishers, JATS users, and JATS committee reps)
- Workshop Goals
  - JATS recommendations to support structured data citations according to the F11 Data Citation Principles
  - Decide adoption and implementation strategy by publishers
23 Implications of NISO-JATS support for data citation
- Enabling the citation of data to be treated with the same respect as article citations
- Journals empowered to structure the citation of data in machine-actionable form, ultimately supporting development of new applications and processes
- Agreements on implementation best practice will become important as uptake grows (Data Citation Principles!)
For more info:
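To make "machine-actionable form" concrete, here is a rough sketch of a structured data citation built with Python's standard `xml.etree.ElementTree`. The slides do not reproduce the workshop's recommendations, so the element and attribute names below (`element-citation` with `publication-type="data"`, `data-title`, `source`, `version`, `pub-id`) reflect my understanding of later JATS versions and should be checked against the JATS tag library. The function name and the DOI-like value are hypothetical.

```python
import xml.etree.ElementTree as ET


def data_citation_element(surname, given_names, year, data_title,
                          source, version, doi):
    """Build a JATS-style citation element for a data set (illustrative only)."""
    cit = ET.Element("element-citation", {"publication-type": "data"})
    group = ET.SubElement(cit, "person-group", {"person-group-type": "author"})
    name = ET.SubElement(group, "name")
    ET.SubElement(name, "surname").text = surname
    ET.SubElement(name, "given-names").text = given_names
    ET.SubElement(cit, "year").text = str(year)
    ET.SubElement(cit, "data-title").text = data_title   # title of the data set
    ET.SubElement(cit, "source").text = source           # repository holding it
    ET.SubElement(cit, "version").text = version
    # The persistent identifier is what makes the citation machine-actionable.
    ET.SubElement(cit, "pub-id", {"pub-id-type": "doi"}).text = doi
    return cit


citation_xml = ET.tostring(
    data_citation_element(
        surname="Doe",
        given_names="J.",
        year=2001,
        data_title="The FOO Gridded Time Series Data Set",
        source="The FOO Data Center",
        version="3.2",
        doi="10.0000/example",  # placeholder, not a registered DOI
    ),
    encoding="unicode",
)
```

Because each part of the citation sits in its own element, a downstream tool can pull out the identifier or the repository name without parsing free text.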
25 Data Citation Implementer's Group
26 Moving Forward
The Research Data Alliance also has several working groups working on data citation:
- Data bibliometrics
- Data services
- Data Workflows, in conjunction with the Force 11 group
- Cost recovery for data centers
- Dynamic data citation
Data Citation: ESIP, AGU, and NSIDC, Feb. 2014, Ruth Duerr
27 Moving Forward in Earth Sciences
- Brooks Hanson (AGU) and Kerstin Lehnert (IEDA) held a publishers' round table
- Statement of Commitment from Earth and Space Science Publishers and Data Facilities
- Data management policies
- Training/Ethics, e.g., for NSF program managers
- Ongoing collaboration between publishers and data centers
- Index of data facilities
- Seeking endorsement at AGU winter meeting
- Nancy Ritchey (NCDC) session on defining what data publication means for data centers
28 Moving Forward
- Integrating Domain Repositories into the National Data Infrastructure
- How does each of the proposed infrastructures interact with existing repositories?
  - Shared Access Research Ecosystem (SHARE) (Clifford Lynch) (Libraries)
  - Clearinghouse for the Open Research of the United States (CHORUS) (Publishers)
  - National Data Service (supercomputer centers)
29 Making Dynamic Data Citeable
Data Citation: Data + Means-of-access
- Data: time-stamped & versioned (aka history)
- Researcher creates working-set via some interface
- Access: assign PID to QUERY, enhanced with
  - Time-stamping for re-execution against versioned system
  - Re-writing for normalization, unique-sort, mapping to history
  - Hashing result-set: verifying identity/correctness
- PID leads to a query-specific landing page
S. Pröll, A. Rauber. Scalable Data Citation in Dynamic Large Databases: Model and Reference Implementation. In IEEE Intl. Conf. on Big Data 2013 (IEEE BigData2013),
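The normalization, unique-sort and hashing steps can be sketched in a few lines of Python. This is a simplified illustration under my own assumptions, not the reference implementation from the cited paper: `normalize_query` only collapses whitespace and drops a trailing semicolon, `result_set_hash` assumes the first column of each row is a unique record identifier, and SHA-256 is an arbitrary choice of hash.

```python
import hashlib


def normalize_query(sql: str) -> str:
    """Naive normalization: drop a trailing ';' and collapse whitespace.

    A real query-rewriting module would do far more (for example, canonical
    column order); this only makes trivially different spellings compare equal.
    """
    return " ".join(sql.strip().rstrip(";").split())


def result_set_hash(rows) -> str:
    """Hash a result set after sorting it on the unique record id.

    Sorting first means the hash does not depend on the order in which
    the database happened to return the rows.
    """
    digest = hashlib.sha256()
    for row in sorted(rows, key=lambda r: r[0]):  # r[0]: assumed unique id
        digest.update(repr(row).encode("utf-8"))
        digest.update(b"\n")
    return digest.hexdigest()


# Made-up rows of the form (record_id, value).
rows_a = [(2, "b"), (1, "a"), (3, "c")]
rows_b = [(1, "a"), (2, "b"), (3, "c")]   # same rows, different order
rows_c = [(1, "a"), (2, "B"), (3, "c")]   # one value changed
```

Here `rows_a` and `rows_b` hash to the same value, while `rows_c` does not. That is the property needed to verify that a re-executed query returned exactly the cited subset.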
30 Making Dynamic Data Citeable
Building blocks of supporting dynamic data citation:
- Uniquely identifiable data records (for unique sort)
- Versioned data, marking changes as insertion/deletion
- Time stamps of data insertion / deletions
- Query language for constructing subsets
Add modules:
- Persistent query store: queries, timestamp, hash, metadata including creator of subset
- Query rewriting module
- PID assignment to queries
- Landing page design, citation text
Stable across data source migrations (e.g. diff. DBMS), scalable, machine-actionable
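These building blocks can be put together in a small, self-contained sketch using Python's built-in `sqlite3` module. Everything specific here is an assumption for illustration: the table and column names, integer logical timestamps in place of real clock times, the `hypothetical-pid:` prefix, and passing the subset condition as a raw SQL fragment, which a production system would not do without validation.

```python
import hashlib
import sqlite3
import uuid

db = sqlite3.connect(":memory:")
db.executescript("""
CREATE TABLE obs (id INTEGER, value REAL, inserted INTEGER, deleted INTEGER);
CREATE TABLE query_store (pid TEXT PRIMARY KEY, sql TEXT, as_of INTEGER,
                          hash TEXT, creator TEXT);
""")

# A row version is visible at time t if it was inserted at or before t
# and not yet marked deleted at t.
AS_OF_FILTER = "inserted <= :as_of AND (deleted IS NULL OR deleted > :as_of)"


def run(where, as_of):
    """Execute the subset query against the data as it stood at `as_of`."""
    sql = (f"SELECT id, value FROM obs WHERE ({where}) AND {AS_OF_FILTER} "
           "ORDER BY id")  # unique sort on the record id
    return db.execute(sql, {"as_of": as_of}).fetchall()


def digest(rows):
    """Fixity value for a sorted result set."""
    return hashlib.sha256(repr(rows).encode("utf-8")).hexdigest()


def cite(where, as_of, creator):
    """Store query, timestamp, hash and creator; return a new PID."""
    pid = f"hypothetical-pid:{uuid.uuid4()}"
    db.execute("INSERT INTO query_store VALUES (?, ?, ?, ?, ?)",
               (pid, where, as_of, digest(run(where, as_of)), creator))
    return pid


def resolve(pid):
    """Re-execute the stored query as of its timestamp and verify the hash."""
    where, as_of, stored = db.execute(
        "SELECT sql, as_of, hash FROM query_store WHERE pid = ?",
        (pid,)).fetchone()
    rows = run(where, as_of)
    return rows, digest(rows) == stored


# Time 1: two records exist and a subset is cited.
db.execute("INSERT INTO obs VALUES (1, 10.0, 1, NULL)")
db.execute("INSERT INTO obs VALUES (2, 20.0, 1, NULL)")
pid = cite("value >= 10", as_of=1, creator="example user")

# Time 2: record 2 is corrected (old version marked deleted, new one
# inserted) and record 3 is added.
db.execute("UPDATE obs SET deleted = 2 WHERE id = 2 AND deleted IS NULL")
db.execute("INSERT INTO obs VALUES (2, 25.0, 2, NULL)")
db.execute("INSERT INTO obs VALUES (3, 30.0, 2, NULL)")

cited_rows, unchanged = resolve(pid)
current_rows = run("value >= 10", as_of=2)
```

After the later changes, `resolve(pid)` still returns the two rows that existed at time 1 and reports a matching hash, while `current_rows` reflects the corrected and extended data. Only the query, timestamp and hash are stored, not a copy of the subset.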
31 Data Citation Deployment
- Researcher uses workbench to identify subset of data
- Upon executing selection ("download") user gets
  - Data (package, access API, ...)
  - PID (e.g. DOI) (Query is time-stamped and stored)
  - Hash value computed over the data for local storage
  - Recommended citation text (e.g. BibTeX)
- This is an important advantage over traditional approaches relying on, e.g., storing a list of identifiers!!!
- PID resolves to landing page
  - Provides detailed metadata, link to parent data set, subset, ...
  - Option to retrieve original data OR current version OR changes
- Upon activating PID associated with a data citation
  - Query is re-executed against time-stamped and versioned DB
  - Results as above are returned
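The "recommended citation text (e.g. BibTeX)" step might look roughly like the sketch below. The function name `subset_bibtex`, the choice of the `@misc` entry type, the citation key, the `example.org` URL and the access date are all placeholders of my own. The slides do not specify a BibTeX layout.

```python
def subset_bibtex(key, author, title, year, pid_url, accessed):
    """Format a BibTeX @misc entry for a cited data subset (illustrative)."""
    fields = {
        "author": author,
        "title": title,
        "year": str(year),
        # The PID resolves to the query-specific landing page.
        "howpublished": f"\\url{{{pid_url}}}",
        "note": f"Accessed {accessed}",
    }
    body = ",\n".join(f"  {name} = {{{value}}}"
                      for name, value in fields.items())
    return f"@misc{{{key},\n{body}\n}}"


entry = subset_bibtex(
    key="doe2001foo_subset",
    author="Doe, J. and Roe, R.",
    title="The FOO Gridded Time Series Data Set, Version 3.2 (subset)",
    year=2001,
    pid_url="https://example.org/pid/hypothetical-subset",  # placeholder PID
    accessed="YYYY-MM-DD",  # placeholder access date
)
```

Handing the user a ready-made entry alongside the data lowers the effort of citing the exact subset rather than the parent data set as a whole.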
32 ESIP View of Dynamic Citation
ESIP has had guidelines for citation of dynamic data for many years:
Doe, J. and R. Roe. 2001, updated daily. The FOO Gridded Time Series Data Set. Version 3.2. Oct Sep. 2008, 84 N, 75 W; 44 N, 10 W. The FOO Data Center Accessed 1 May
The question is: can a reproducible subset identifier be generated to replace the red bit?
More informationThe State of Arctic Data the IPY experience
The State of Arctic Data the IPY experience Mark A. Parsons,Taco de Bruin, Scott Tomlinson, Øystein Godøy, Helen Campbell, Julie Leclert, Ellsworth LeDrew, David Carlson, and the IPY data community. 22
More informationData Curation Profile Plant Genomics
Data Curation Profile Plant Genomics Profile Author Institution Name Contact J. Carlson Purdue University J. Carlson, jrcarlso@purdue.edu Date of Creation October 27, 2009 Date of Last Update Version 1.0
More informationDescription Cross-domain Task Force Research Design Statement
Description Cross-domain Task Force Research Design Statement Revised 8 November 2004 This document outlines the research design to be followed by the Description Cross-domain Task Force (DTF) of InterPARES
More informationThe Research Data Alliance Creating the culture and technology for an international data infrastructure
The Research Data Alliance Creating the culture and technology for an international data infrastructure Mark A. Parsons Managing Director, RDA/United States Rensselaer Polytechnic Institute!! AGU Town
More informationUniversity at Buffalo's NEES Equipment Site. Data Management. Jason P. Hanley IT Services Manager
University at Buffalo's NEES Equipment Site Data Management Jason P. Hanley IT Services Manager Structural Engineering and Earthquake Simulation Laboratory, Department of Civil, Structural and Environmental
More informationFrom Sensor to Archive: Observational Data Services at NCAR s Earth Observing Laboratory
From Sensor to Archive: Observational Data Services at NCAR s Earth Observing Laboratory Mike Daniels Computing, Data and Software Facility NCAR/Earth Observing Laboratory NSF Aircraft operated by EOL
More informationScalable Data Citation in Dynamic, Large Databases: Model and Reference Implementation
Scalable Data Citation in Dynamic, Large Databases: Model and Reference Implementation Stefan Pröll SBA Research Vienna, Austria sproell@sba-research.org Andreas Rauber Technical University of Vienna Vienna,
More informationThe International Journal of Digital Curation Issue 1, Volume
92 Digital Archive Policies Issue 1, Volume 2 2007 Digital Archive Policies and Trusted Digital Repositories MacKenzie Smith, MIT Libraries Reagan W. Moore, San Diego Supercomputer Center June 2007 Abstract
More informationLinking data and publications the past, present, and future. Dr. Hylke Koers, Head of Content Innovation, Elsevier
Linking data and publications the past, present, and future Dr. Hylke Koers, Head of Content Innovation, Elsevier BioCADDIE webinar January 8, 2015 Ease of access Open Access 2 The issue: data is important,
More informationData Management Plans. Sarah Jones Digital Curation Centre, Glasgow
Data Management Plans Sarah Jones Digital Curation Centre, Glasgow sarah.jones@glasgow.ac.uk Twitter: @sjdcc Data Management Plan (DMP) workshop, e-infrastructures Austria, Vienna, 17 November 2016 What
More informationCheshire 3 Framework White Paper: Implementing Support for Digital Repositories in a Data Grid Environment
Cheshire 3 Framework White Paper: Implementing Support for Digital Repositories in a Data Grid Environment Paul Watry Univ. of Liverpool, NaCTeM pwatry@liverpool.ac.uk Ray Larson Univ. of California, Berkeley
More informationData Management Checklist
Data Management Checklist Managing research data throughout its lifecycle ensures its long-term value and prevents data from falling into digital obsolescence. Proper data management is a key prerequisite
More informationINTAROS Integrated Arctic Observation System
INTAROS Integrated Arctic Observation System A project funded by EC - H2020-BG-09-2016 Coordinator: Stein Sandven Nansen Environmental and Remote Sensing Center, Norway Overall objective: to develop an
More informationThe Materials Data Facility
The Materials Data Facility Ben Blaiszik (blaiszik@uchicago.edu), Kyle Chard (chard@uchicago.edu) Ian Foster (foster@uchicago.edu) materialsdatafacility.org What is MDF? We aim to make it simple for materials
More informationGSAW The Earth Observing System (EOS) Ground System: Leveraging an Existing Operational Ground System Infrastructure to Support New Missions
GSAW 2016 The Earth Observing System (EOS) Ground System: Leveraging an Existing Operational Ground System Infrastructure to Support New Missions David Hardison NASA Goddard Space Flight Center Johnny
More informationData Curation Profile Movement of Proteins
Data Curation Profile Movement of Proteins Profile Author Institution Name Contact J. Carlson Purdue University J. Carlson, jrcarlso@purdue.edu Date of Creation July 14, 2010 Date of Last Update July 14,
More informationWelcome to the Pure International Conference. Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018
0 Welcome to the Pure International Conference Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018 1 Mendeley Data Use Synergies with Pure to Showcase Additional Research Outputs Nikhil Joshi Solutions
More informationNISO STS (Standards Tag Suite) Differences Between ISO STS 1.1 and NISO STS 1.0. Version 1 October 2017
NISO STS (Standards Tag Suite) Differences Between ISO STS 1.1 and NISO STS 1.0 Version 1 October 2017 1 Introduction...1 1.1 Four NISO STS Tag Sets...1 1.2 Relationship of NISO STS to ISO STS...1 1.3
More informationSoftware + Services for Data Storage, Management, Discovery, and Re-Use
Software + Services for Data Storage, Management, Discovery, and Re-Use CODATA 22 Conference Stellenbosch, South Africa 25 October 2010 Alex D. Wade Director Scholarly Communication Microsoft External
More informationDIAS_Satellite_MODIS_SurfaceReflectance dataset
DIAS_Satellite_MODIS_SurfaceReflectance dataset 1. IDENTIFICATION INFORMATION DOI Metadata Identifier DIAS_Satellite_MODIS_SurfaceReflectance dataset doi:10.20783/dias.273 [http://doi.org/10.20783/dias.273]
More informationHow to share research data
How to share research data University Library, UiT Fall 2018 Research data @ UiT Learning objectives Part 1 Why share data? What are the important criteria in choosing a data archive? Part 2 Learn about
More informationBig Data infrastructure and tools in libraries
Line Pouchard, PhD Purdue University Libraries Research Data Group Big Data infrastructure and tools in libraries 08/10/2016 DATA IN LIBRARIES: THE BIG PICTURE IFLA/ UNIVERSITY OF CHICAGO BIG DATA: A VERY
More informationData Publication. HGF Alliance: Remote Sensing and Earth System Dynamics
HGF Alliance: Remote Sensing and Earth System Dynamics Data Publication Kirsten Elger & Roland Bertelmann, GFZ German Research Centre for Geosciences, Potsdam 1 Why are we speaking about research data?
More informationRoy Lowry, Gwen Moncoiffe and Adam Leadbetter (BODC) Cathy Norton and Lisa Raymond (MBLWHOI Library) Ed Urban (SCOR) Peter Pissierssens (IODE Project
Roy Lowry, Gwen Moncoiffe and Adam Leadbetter (BODC) Cathy Norton and Lisa Raymond (MBLWHOI Library) Ed Urban (SCOR) Peter Pissierssens (IODE Project Office) Linda Pikula (IODE GEMIM/NOAA Library) Data
More informationEnabling Precise Identification and Citabilityof Dynamic Data. Recommendations of the RDA Working Group on Data Citation
Enabling Precise Identification and Citabilityof Dynamic Data Recommendations of the RDA Working Group on Data Citation Andreas Rauber Vienna University of Technology Favoritenstr. 9-11/188 1040 Vienna,
More informationSession Two: OAIS Model & Digital Curation Lifecycle Model
From the SelectedWorks of Group 4 SundbergVernonDhaliwal Winter January 19, 2016 Session Two: OAIS Model & Digital Curation Lifecycle Model Dr. Eun G Park Available at: https://works.bepress.com/group4-sundbergvernondhaliwal/10/
More informationCarelyn Campbell, Ben Blaiszik, Laura Bartolo. November 1, 2016
Carelyn Campbell, Ben Blaiszik, Laura Bartolo November 1, 2016 Data Landscape Collaboration Tools (e.g. Google Drive, DropBox, Sharepoint, Github, MatIN) Data Sharing Communities (e.g. Dryad, FigShare,
More information