Sustainable Governance for Long-Term Stewardship of Earth Science Data

Similar documents
Conducting a Self-Assessment of a Long-Term Archive for Interdisciplinary Scientific Data as a Trustworthy Digital Repository

Improving a Trustworthy Data Repository with ISO 16363

Certification. F. Genova (thanks to I. Dillo and Hervé L Hours)

Enabling Collaboration for Digital Preservation

Trust and Certification: the case for Trustworthy Digital Repositories. RDA Europe webinar, 14 February 2017 Ingrid Dillo, DANS, The Netherlands

GEOSS Data Management Principles: Importance and Implementation

Agenda. Bibliography

Growing Variety and Volume of Remote Sensing and In Situ Data

The President s Spectrum Policy Initiative

National Snow and Ice Data Center. Plan for Reassessing the Levels of Service for Data at the NSIDC DAAC

The library s role in promoting the sharing of scientific research data

Data Curation Handbook Steps

Federal Statistics, Multiple Data Sources, and Privacy Protection: Next Steps

National Snow and Ice Data Center. Plan for Reassessing the Levels of Service for Data at the NSIDC DAAC

Introduction to Data Management for Ocean Science Research

Competency Definition

The Center for Research Libraries. Archive Profile Inter-university Consortium for Political and Social Research (ICPSR)

The US National Near-Earth Object Preparedness Strategy and Action Plan

CISER Data Archive Collection Policy

Ensuring Proper Storage for Earth Science Data: The USGS Process to Certify Trusted Digital Repositories

Writing a Data Management Plan A guide for the perplexed

University of British Columbia Library. Persistent Digital Collections Implementation Plan. Final project report Summary version

DRS Policy Guide. Management of DRS operations is the responsibility of staff in Library Technology Services (LTS).

Data Curation Practices at the Oak Ridge National Laboratory Distributed Active Archive Center

Future Core Ground Segment Scenarios

Applying Archival Science to Digital Curation: Advocacy for the Archivist s Role in Implementing and Managing Trusted Digital Repositories

Digital Preservation at NARA

Oh My, How the Archive has Grown: Library of Congress Challenges and Strategies for Managing Selective Harvesting on a Domain Crawl Scale

Survey of Research Data Management Practices at the University of Pretoria

Research Infrastructures and Horizon 2020

UNT Libraries TRAC Audit Checklist

National Science and Technology Council. Interagency Working Group on Digital Data

Tools for Reusing Earth Science Software

Applying Mitigation. to Build Resilient Communities

Response to RFI: Public Access to Digital Data Resulting From Federally Funded Scientific Research Office of Science and Technology Policy

COPPUL Digital Preservation Network: Update for ACCOLEDS. Corey Davis COPPUL DPN Manager

National Earthquake Hazards Reduction Program

The UNISDR Private Sector Alliance for Disaster Resilient Societies

Data Governance Central to Data Management Success

Views on the Framework for Improving Critical Infrastructure Cybersecurity

Feed the Future Innovation Lab for Peanut (Peanut Innovation Lab) Data Management Plan Version:

Paving the Rocky Road Toward Open and FAIR in the Field Sciences

Denver Regional Data Summit 2009

University of Maryland Libraries: Digital Preservation Policy

Controlling the Project Pipeline: Managing Constraints to Maximize Outputs

Annual Report for the Utility Savings Initiative

Can a Consortium Build a Viable Preservation Repository?

COALITION ON PUBLISHING DATA IN THE EARTH AND SPACE SCIENCES: A MODEL TO ADVANCE LEADING DATA PRACTICES IN SCHOLARLY PUBLISHING. Source: NSF.

How the SBIR/STTR Programs Help Grow Your Businesses

A Climate Monitoring architecture for space-based observations

Checklist and guidance for a Data Management Plan, v1.0

ESTABLISHMENT OF AN OFFICE OF FORENSIC SCIENCES AND A FORENSIC SCIENCE BOARD WITHIN THE DEPARTMENT OF JUSTICE

Columbia Gorge Discovery Center and Museum

DRI: Preservation Planning Case Study Getting Started in Digital Preservation Digital Preservation Coalition November 2013 Dublin, Ireland

Pascal Gilles H-EOP-GT. Meeting ESA-FFG-Austrian Actors ESRIN, 24 th May 2016

DATA SHARING FOR BETTER SCIENCE

Cyber Partnership Blueprint: An Outline

Survey of research data management practices at the University of Pretoria, South Africa: October 2009 March 2010

Business Model for Global Platform for Big Data for Official Statistics in support of the 2030 Agenda for Sustainable Development

Defining Computer Security Incident Response Teams

Homeland Security Institute. Annual Report. pursuant to. Homeland Security Act of 2002

Resolution adopted by the General Assembly. [on the report of the Second Committee (A/56/561/Add.2)]

The Space Generation Congress 2010: Perspectives from University Students and Young Professionals in the Space Sector

DSA WDS Partnership Working Group Catalogue of Common Requirements

Stewarding NOAA s Data: How NCEI Allocates Stewardship Resources

UPU UNIVERSAL POSTAL UNION. CA C 4 SDPG AHG DRM Doc 3. Original: English COUNCIL OF ADMINISTRATION. Committee 4 Development Cooperation

Commission for Environmental Cooperation (CEC) Sponsored Workshop on Environmental Assistance Programs and Resources for Automotive OEMs and Suppliers

The NIH Big Data to Knowledge Initiative: Raising the Prominence of Data

<goals> 10/15/11% From production to preservation to access to use: OAIS, TDR, and the FDLP

Innovations in Federal Statistics: Combining Data Sources While Protecting Privacy

Federal STI Managers Group Presentation to Board on Research Data and Information Ellen Herbst Director, NTIS CENDI Chair

The Data Management Plan: Putting policy into practice Suzanne Clarke Director, Information Resources

For Attribution: Developing Data Attribution and Citation Practices and Standards

Data Stewardship NOAA s Programs for Archive, Access, and Producing Climate Data Records

Data Archival and Dissemination Tools to Support Your Research, Management, and Education

National Policy and Guiding Principles

Marine Transportation System Resilience: A Federal Agency Perspective

Guidelines for Depositors

Storage Infrastructure Optimization

Edinburgh DataShare: Tackling research data in a DSpace institutional repository

Management s Response to the Auditor General s Review of Management and Oversight of the Integrated Business Management System (IBMS)

EGM, 9-10 December A World that Counts: Mobilising the Data Revolution for Sustainable Development. 9 December 2014 BACKGROUND

Tool Repositories, ESCHER and continuing the legacy of the CHESS ITR

CABINET PLANNING SYSTEM PROCUREMENT

Fair data and open data: differences and consequences

The State of Arctic Data the IPY experience

Canadian Anti-Spam Legislation (CASL)

Frequently Asked Questions

RDM through a UK lens - New Roles for Librarians?

IT Transformation Through ESPCs

Institutional Repository using DSpace. Yatrik Patel Scientist D (CS)

EUROPEAN ICT PROFESSIONAL ROLE PROFILES VERSION 2 CWA 16458:2018 LOGFILE

NCEI Customer Engagement Initiatives to Guide Opportunities for Innovation

Research Infrastructures and Horizon 2020

WELCOME TO THE COSA MEMBER WEBINAR

NATIONAL OCEANIC AND ATMOSPHERIC ADMINISTRATION S SCIENTIFIC DATA STEWARDSHIP PROGRAM

Scientific Data Policy of European X-Ray Free-Electron Laser Facility GmbH

The U.S. Government s Role in Standards and Conformity Assessment

Colorado Charter School Institute. Website Design & Development Request for Proposal

Resolution adopted by the General Assembly on 14 December [without reference to a Main Committee (A/61/L.44 and Add.1)]

Transcription:

Sustainable Governance for Long-Term Stewardship of Earth Science Data Robert R. Downs and Robert S. Chen NASA Socioeconomic Data and Applications Center (SEDAC) Center for International Earth Science Information Network (CIESIN) The Earth Institute, Columbia University Prepared for presentation to the Earth and Space Science Informatics Workshop August 4, 2010 George Mason University Fairfax, Virginia

Abstract In addition to technical infrastructure and capabilities, the long-term management of Earth science data requires organizational sustainability to provide continuing stewardship to address the risks to scientific data and support their use by future communities. Providing sustainable infrastructure for the preservation of scientific data requires organizational commitments, capacity, structures, and plans for data stewardship that are consistent with the missions of the organizations that accept the responsibility to serve in data stewardship roles. Alternative approaches to attaining organizational sustainability for interdisciplinary human dimensions and polar data are discussed in terms of recent recommendations for organizational sustainability 2

Where Do Old Data Go? Scientific data are often collected through projects or missions with limited lifetimes Funding received to collect data for a specific period Limited lifetime of instrument or platform or limited time period for data collection and processing Data may be at risk after project or mission ends Funds for project activities expire and project team disperses Resources often not available to clean up and properly document and archive project data Those most knowledgeable about the data become inaccessible, e.g., due to changes in responsibilities, jobs, retirement, illness, or death Computer and information systems are replaced and not all contents are migrated Active archives lack sufficient resources to track, access, and archive important datasets 3

Organizational Needs for Sustainable Stewardship of Scientific Data Organizational commitment to long-term scientific data stewardship and associated costs Capacity to manage scientific data on an ongoing basis Organizational structure for allocating resources and staff to plan and perform scientific data stewardship Plans for data stewardship that are consistent with the mission of the organization 4

Traditional Organizational Models for Sustainable Data Stewardship Government repositories Government agency or institution mission includes data stewardship Budget allocated based on mission needs; sometimes includes cost recovery, user fees Data management capacity deployed and operated by government or contractors Examples: Library of Congress, National Archives and Records Administration, Smithsonian Institution, USGS, NOAA, US Census Bureau Non-government repositories and networks Data managed by public or private nongovernmental institutions, including universities Funding from a mix of sources, including depositor, membership, and user fees; grants and contracts; gifts and endowments; volunteer and educational activities; publication and other value-added services Examples: University depositories, museum collections, ICPSR, IRIS 5

Concerns about Traditional Organizational Models Government repositories Missions and budgets subject to change based on legislative or executive decision making, which may put short-term issues ahead of long-term needs Longevity of agencies not guaranteed only a few Federal agencies with more than a century of successful experience in records management and retention Holdings could be subject to loss or corruption due to political influence Non-government repositories and networks Missions and budgets not necessarily sustainable due to changes in economic, social or political conditions and user needs and expectations Traditional funding models at risk, e.g., support for physical infrastructure vs. digital infrastructure, open access vs. subscription models, ability to subsidize preservation through educational or other value-added activities 6

Examples of Organizational Longevity Harvard, 1636 William & Mary, 1693 Yale, 1701 U. Penn, 1740 Princeton, 1746 Columbia, 1754 Brown, 1764 Dartmouth, 1769 UNC, 1795 U. Maryland*, 1859 MIT, 1865 Purdue, 1869 Stanford, 1891 Treasury, 1789 Lib. of Congress, 1800 Smithsonian Inst., 1846 NAS, 1863 AMNH, 1869 Commerce Dept., 1903 NY Public Library, 1911 ICSU, 1931 NARA, 1934 NSF, 1950 NASA, 1958 World Data Ctrs, 1958 NSSDC, 1966 CODATA, 1966 SEDAC, 1993 1550 1600 1650 1700 1750 1800 1850 1900 1950 2000 2050 Year Based on Downs and Chen (2010), Journal of Digital Information http://journals.tdl.org/jodi/article/view/753 7

Potential Alternatives to Traditional Models Hybrid government-private model Establishment of public-private institutions or consortia with combination of government support and private infrastructure and independence Development of business models that generate resources from combinations of sources, e.g., endowments, gifts, grants, subscriptions, fees, royalties, value-added support, micropayments (itunes), etc. Information Commons approach Development of communities that share scientific data and agree to serve as data stewards in an organized, voluntary framework Business model based on combination of voluntary efforts, distributed organizational and institutional support, and other gifts, endowments, etc. Not yet clear how an information commons can meet the needs for longterm organizational commitment, data management capacity, and plans 8

Polar Information Commons (PIC) International initiative launched in June 2010 to provide long-term stewardship of polar science data Core approach is based on the Science Commons Protocol for Implementing Open Access Data: http://www.sciencecommons.org/projects/ publishing/open-access-data-protocol/ Data owners/originators use the Creative Commons Zero (CC0) license to place their data in the public domain or use CC-BY-3.0 where copyrights apply The PIC has developed a digital label (the PIC badge ) and a set of community norms to facilitate but not require data use and reuse http://www.polarcommons.org 9

PIC Stewardship Approach Polar data sources expose their data to the world through the PIC badge and open protocols Data centers monitor new PIC data and assess and acquire important data for formal archiving, curation, and access The PIC community encourages compliance with PIC norms through peer pressure, brand identity, arrangements with journals and funding The PIC community provides useful tools for PIC data quality, search, access, documentation, attribution, 10

Key Challenges for the PIC Can the PIC develop sustainable collaboration between: Scientists and scientific bodies to support the commons (content, tools, and rights)? Universities and other digital data repositories to support selection and appraisal and long-term institutional commitments to knowledge preservation and dissemination? Governments and other governmental bodies to provide resources and requirements (carrots and sticks)? The broader public to provide volunteer labor, infrastructure, and other support? How can a reliable virtual organizational framework and infrastructure for the PIC be developed and maintained over the long term? 11

Summary Long-term data stewardship requires a long-term commitment to data management and preservation Data stewardship must be consistent with long-term organizational objectives A long-term but flexible business model is needed to ensure that resources are available in the long term to support data stewardship activities An organization s history is important for understanding its ability to adapt and survive but is not a blueprint for its future! New organizational approaches and models are needed in the new digital age the status quo won t last much longer! 12