The State of Arctic Data the IPY experience

Similar documents
An Overview of International Data Coordination Activities

Global Cryosphere Watch. GCW web portal Structure and demonstration

Sustaining Arctic Observing Networks (SAON)

Growing Variety and Volume of Remote Sensing and In Situ Data

Developing the ICSU World Data System (WDS)

WP4: Data Forum. Øystein Godøy, Boris Radosavljević, Boris Biskaborn, Anna Irrgang

Technical documentation. SIOS Data Management Plan

Chair, IASC-SAON Arctic Data Committee (ADC) Co-Lead, IARPC Arctic Data Sub-Team Co-Lead GEO Cold Regions Initiative

Technical documentation. D2.4 KPI Specification

A distributed network of digital heritage information

For Attribution: Developing Data Attribution and Citation Practices and Standards

Indiana University Research Technology and the Research Data Alliance

Creating a Digital Preservation Network with Shared Stewardship and Cost

EUDAT Towards a Collaborative Data Infrastructure

Conducting a Self-Assessment of a Long-Term Archive for Interdisciplinary Scientific Data as a Trustworthy Digital Repository

EUDAT. A European Collaborative Data Infrastructure. Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT

University of Bath. Publication date: Document Version Publisher's PDF, also known as Version of record. Link to publication

Engaging and Connecting Faculty:

OAI-ORE. A non-technical introduction to: (

Sustainable Governance for Long-Term Stewardship of Earth Science Data

Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan

International Oceanographic Data and Information Exchange - Ocean Data Portal (IODE ODP)

National Data Sharing and Accessibility Policy-2012 (NDSAP-2012)

University of Texas Arlington Data Governance Program Charter

SCICEX Data Stewardship: FY2012 Report

The Sunshine State Digital Network

Multi-disciplinary Interoperability: the EuroGEOSS Operating Capacities

Lessons learnt from building the International Virtual Observatory Alliance. Françoise Genova

Improving a Trustworthy Data Repository with ISO 16363

Earth Observation Imperative

Arctic Action Overview

SPACE for SDGs a Global Partnership

DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM

EUDAT-B2FIND A FAIR and Interdisciplinary Discovery Portal for Research Data

LEHMAN COLLEGE OF THE CITY UNIVERSITY OF NEW YORK. Department of Economics and Business. Curriculum Change

SEXTANT 1. Purpose of the Application

FAIR-aligned Scientific Repositories: Essential Infrastructure for Open and FAIR Data

Proposal for Implementing Linked Open Data on Libraries Catalogue

Data Curation Profile Movement of Proteins

Paving the Rocky Road Toward Open and FAIR in the Field Sciences

Data Citation Then and Now

Registry Interchange Format: Collections and Services (RIF-CS) explained

Development of GFCS: The WMO contribution and the need for many partners. David Grimes President, World Meteorological Organization

National Snow and Ice Data Center. Plan for Reassessing the Levels of Service for Data at the NSIDC DAAC

National Snow and Ice Data Center. Plan for Reassessing the Levels of Service for Data at the NSIDC DAAC

Data Curation Profile Human Genomics

Data Management Checklist

Assessing the FAIRness of Datasets in Trustworthy Digital Repositories: a 5 star scale

Deliverable 6.4. Initial Data Management Plan. RINGO (GA no ) PUBLIC; R. Readiness of ICOS for Necessities of integrated Global Observations

The Semantic Institution: An Agenda for Publishing Authoritative Scholarly Facts. Leslie Carr

DOIs for Research Data

Reproducibility and FAIR Data in the Earth and Space Sciences

Towards FAIRness: some reflections from an Earth Science perspective

28 September PI: John Chip Breier, Ph.D. Applied Ocean Physics & Engineering Woods Hole Oceanographic Institution

James Hardiman Library. Digital Scholarship Enablement Strategy

Introduction to Data Management for Ocean Science Research

Interoperability in Science Data: Stories from the Trenches

Collaborations and Partnerships. John Broome CODATA-International

Certification. F. Genova (thanks to I. Dillo and Hervé L Hours)

Corso di Biblioteche Digitali

Corso di Biblioteche Digitali

ISO RM standards. Hans Hofman DLM Forum Budapest, 6 October 2005

Citizen Science and Nontraditional Partner Monitoring: Memorandum of Understanding

MANAGEMENT OF IPY 2007/08 NATIONAL DATA

CODATA: Data Citation Workshop Perspectives from Editors and Publishers. Brooks Hanson Director, Publications, AGU

Interoperability ~ An Introduction

Interoperability Between GRDC's Data Holding And The GEOSS Infrastructure

WHAT ISINTEROPERABILITY? (AND HOW DO WE MEASURE IT?) INSPIRE Conference 2011 Edinburgh, UK

CONFERENCE OF EUROPEAN STATISTICIANS ACTIVITIES ON CLIMATE CHANGE-RELATED STATISTICS

JCOMM Observing Programme Support Centre

Defense Coastal/Estuarine Research Program (DCERP) SERDP RC DCERP Data Policy Version 2.0

Dataverse and DataTags

Resolution adopted by the General Assembly. [on the report of the Second Committee (A/56/561/Add.2)]

IASC Subsidiary Bodies. [Sub-Working Group on Emergency Telecommunications] Work Plan for 2010

Infrastructure PA Stephen Lecce

Linked Data and Libraries

School of Engineering & Computational Sciences

The Common Framework for Earth Observation Data. US Group on Earth Observations Data Management Working Group

Digital repositories as research infrastructure: a UK perspective

eresearch Collaboration across the Pacific:

Global Framework for Climate Services

Architecture Subgroup Report GEO4DOC 4.1(2) - REV 2.0 UPDATED - POST EO SUMMIT II 13 May 2004

Data, Information and Customer Support Standing Committee (DICSSC) Report IICWG-18

The Open Archives Initiative in Practice:

BHL-EUROPE: Biodiversity Heritage Library for Europe. Jana Hoffmann, Henning Scholz

Linking datasets with user commentary, annotations and publications: the CHARMe project

World Summit on the Information Society (WSIS) and the Digital Divide

Cheshire 3 Framework White Paper: Implementing Support for Digital Repositories in a Data Grid Environment

EPOS: European Plate Observing System

GEO Update and Priorities for 2014

Google indexed 3,3 billion of pages. Google s index contains 8,1 billion of websites

Fedora, Islandora, & Samvera: Requirements and Gaps. David Wilcox,

The Semantic Web DEFINITIONS & APPLICATIONS

CREATING SMART TRANSPORT SERVICES BY FACILITATING THE RE-USE OF OPEN GIS DATA

Basic Requirements for Research Infrastructures in Europe

e-infrastructures and Data Management

GNSSN. Global Nuclear Safety and Security Network

The Global Research Council

Institutional Repository using DSpace. Yatrik Patel Scientist D (CS)

Statistics and Open Data

Transcription:

The State of Arctic Data the IPY experience Mark A. Parsons,Taco de Bruin, Scott Tomlinson, Øystein Godøy, Helen Campbell, Julie Leclert, Ellsworth LeDrew, David Carlson, and the IPY data community. 22 nd International CODATA Conference Stellenbosch, Cape Town,South Africa 25 October 2010 This presentation is licensed by Mark A. Parsons under a Creative Commons Attribution-Share Alike 3.0 License

In fifty years time the data resulting from IPY2007-2008 may be seen as the most important single outcome of the programme. A Framework for the IPY (ICSU 2004)

Our vision: Data are open, linked, useful, and safe. Copyright: Christian Morel

A pragmatic assessment Data sharing and publication open and linked Interoperability across systems, data, and standards linked and useful Sustainable preservation and stewardship of diverse data safe Governance and conduct of the virtual organization that coordinates data access and stewardship around the globe. practicality

Open Open Data is a philosophy and practice requiring that certain data are freely available to everyone, without restrictions from copyright, patents or other mechanisms of control. Wikipedia

IPY Data Policy http://www.ipy.org/subcommittees/final_ipy_data_policy.pdf Special Cases: Human subjects Intellectual property of LTK Where data release may cause harm Data generated by IPY Data used by IPY the IPY Joint Committee requires that IPY data, including operational data delivered in real time, are made available fully, freely and on the shortest feasible timescale. 6

CC Zero waiver + norms waive rights public domain + attribution /citation through community norms, not a contract http://polarcommons.org/ethics-and-norms-of-data-sharing.php

Data sharing and publication Data should be accessible soon after collection (online wherever possible) in a discovery portal such as the GCMD. Data users should provide fair and formal credit to providers.

Linked The term Linked Data is used to describe a method of exposing, sharing, and connecting data via dereferenceable URIs on the Web. Wikipedia

Linked Data Tim Berners-Lee s 4 steps emphasize that it s about relationships. 1. Use URIs as names for things 2. Use HTTP URIs so that people can look up those names. 3. When someone looks up a URI, provide useful information, using standards (RDF, SPARQL). 4. Include links to other URIs. so that they can discover more things. Current practice registries and catalogs (e.g. GCMD) The Economist 10

A Union Catalog catalog 1 catalog 2 catalog 5 catalog 3 (Mirror) catalog 6 (Mirror) catalog 4 catalog 8 catalog 7 catalog 9 courtesy P. Pulsifer 12

An initial IPY Union Catalog (using OAI-PMH) UK Norway DAMOCLES NSIDC Canada Sweden CADIS GCMD 20

Interoperability discovery Metadata should be readily interchangeable between different polar data systems to enable data discovery across multiple portals.

Useful

To the social scientist, the world is: A complex, involved narrative with many players The narrative describes a network of interactions between human and non-human elements (including data)

Interoperable (Geospatial and Semantic)

An Arctic Spatial Data Infrastructure The result of two IPY GeoNorth conferences. Approved by all Senior Arctic Officials of the Arctic Council last fall Planning a task force, consisting of representatives from national mapping agencies and the Working Groups of the Arctic Council GeoNorth3 this summer in Tromsø Similar work already underway in the Antarctic under the auspices of the Standing Committee on Antarctic Geospatial Information

Semantic Sea Ice Interoperability Initiative A new collaborative project funded by the US National Science Foundation working to improve semantic interoperability of Arctic data. Builds from the International Polar Year and the IPY Data and Information Service Creating a sea ice ontology incorporating scientific knowledge and some elements of traditional knowledge and linked Working to create a community of semantic practitioners in the Arctic. Seeking to establish Arctic interoperability in regional and global data systems (e.g. GEOSS, WIS, SAON) PIs: Mark Parsons, Ruth Duerr, Siri Jodha Singh Khalsa NSIDC Peter Fox, Deborah McGuinness RPI Please talk to me to learn more Fresh fallen snow accumulates in bands on the open water, while the blue and grey melt ponds and puddles cover parts of an ice floe in the background. Image courtesy of The Hidden Ocean, Arctic 2005 Exploration. 19

Interoperability usability Data from different projects, disciplines, and data centers should be easily understood and used in conjunction with each other in standard tools and analysis frameworks Data should be well described so to be useful for a broad audience.

Safe Safe from hackers, from obsolescence, from undocumented change, from loss, and from the ravages of time.

Preservation All IPY data must be archived in their simplest, useful form and be accompanied by a complete metadata description. An IPY Data and Information Service (IPYDIS) should help projects identify appropriate long-term archives and data centers, but it is the responsibility of individual IPY projects to make arrangements with long-term archives to ensure the preservation of their data. It must be recognized that data preservation and access should not be afterthoughts and need to be considered while data collection plans are developed. IPY Data Policy

Preservation All raw IPY data should be preserved and well stewarded in long-term archives following the OAIS Reference Model. Data should be accompanied by complete documentation to enable preservation and stewardship.

Coordination and Governance Identify, evolve, or develop a sustained virtual organization to enable effective international collaboration on data sharing, interoperability, and preservation.

Some initial recommendations for scientists and data centers Investigators must publish their IPY data immediately. The scientific community needs to recognize the value of good data through citation, consideration of data publication in promotion and tenure review, and by training young scientists in data management. Data centers must develop partnerships with other data centers in other countries and other disciplines to enhance data accessibility and interoperability. Data centers should partner with their scientific community to explicitly meet their needs, provide easy submission tools, and make the data more useful and integrated with other data.

Some initial summary recommendations for national and international sponsors International sponsors must lead an aggressive initiative to ensure all IPY data are in secure archives by June 2012. IASC must develop an effective and pragmatic data strategy to ensure active pan-arctic data sharing and collaboration. ICSU and WMO must continue to lead the global discussion to harmonize data policies around as much rapid openness as possible, while recognizing legitimate, moral restrictions. Funding agencies also need consistent and enforced policy. Funding agencies must support data archiving and insist that data they fund be archived and accessible. Agencies must also create new archives where appropriate ones do not exist. Agencies should take advantage of the interdisciplinary use cases generated by IPY science questions to support basic and applied research on improving interdisciplinary data management and interoperability.

Find the report at http://ipydis.org/documents/ or Google state of polar data image NY Times Thank you parsonsm@nsidc.org