DataONE: Open Persistent Access to Earth Observational Data

Similar documents
DataONE Enabling Cyberinfrastructure for the Biological, Environmental and Earth Sciences

EarthCube and Cyberinfrastructure for the Earth Sciences: Lessons and Perspective from OpenTopography

Introduction to Data Management for Ocean Science Research

Data Curation Practices at the Oak Ridge National Laboratory Distributed Active Archive Center

Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21)

Growing Variety and Volume of Remote Sensing and In Situ Data

Earth Observation Imperative

ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development

Ada L. Benavides, Deputy Chief South Pacific Division Regional Integration Team. May 5, US Army Corps of Engineers BUILDING STRONG

Indiana University Research Technology and the Research Data Alliance

Arctic Data Center: Call for Synthesis Working Group Proposals Due May 23, 2018

Conducting a Self-Assessment of a Long-Term Archive for Interdisciplinary Scientific Data as a Trustworthy Digital Repository

Integrated Water Resources Science and Services (IWRSS)

DataONE Cyberinfrastructure. Ma# Jones Dave Vieglais Bruce Wilson

Metadata Zoo Dataset Metadata Rebecca Koskela Execu4ve Director, DataONE

Opportunities for collaboration in Big Data between US and EU

Case Study: CyberSKA - A Collaborative Platform for Data Intensive Radio Astronomy

Data Symposium 2012 SeWHIP & CTSI John W. Cobb, Ph.D. Milwaukee, WI March 1, 2012

MAJOR RESEARCH EQUIPMENT $240,450,000 AND FACILITIES CONSTRUCTION

ArcGIS Solutions for Community Resilience. Matthew S Deal

How to use Water Data to Produce Knowledge: Data Sharing with the CUAHSI Water Data Center

National Science and Technology Council. Interagency Working Group on Digital Data

The Science and Technology Roadmap to Support the Implementation of the Sendai Framework for Disaster Risk Reduction

National Strategy for CBRNE Standards

Jeffery S. Horsburgh. Utah Water Research Laboratory Utah State University

National Earthquake Risk Reduction Program in Haiti

Ag-Analytics Data Platform

UAE National Space Policy Agenda Item 11; LSC April By: Space Policy and Regulations Directory

Introduction to FREE National Resources for Scientific Computing. Dana Brunson. Jeff Pummill

UC Irvine LAUC-I and Library Staff Research

Applying Mitigation. to Build Resilient Communities

NeAT Business Plan Component Data Integration and Annotation Services in Biodiversity (DIAS-B) 1. Service Description

Big Data infrastructure and tools in libraries

Competency Definition

2011 NNI Environment, Health, and Safety Research Strategy

Towards a Canadian Integrated Ocean Observing System

Digital repositories as research infrastructure: a UK perspective

Electronic Records Archives: Philadelphia Federal Executive Board

Curriculum Guide for Doctor of Philosophy degree program with a specialization in ENVIRONMENTAL PUBLIC HEALTH

Development of a Protected Areas Database for Jamaica

Outreach and Partnerships for Promoting and Facilitating Private Sector Emergency Preparedness

EPA Near-port Community Capacity Building: Tools and Technical Assistance for Collaborative Solutions

NASA's GLOBAL CHANGE MASTER DIRECTORY: FOSTERING COLLABORATIONS FOR EARTH SCIENCE INFORMATION AND DATA RETRIEVAL

Charter for the System Interoperability and Data Synchronization Requirements Team

YOUR REGION FOR BUSINESS. Presentation to the Markham Development Services Committee February 12, 2018

National Data Sharing and Accessibility Policy-2012 (NDSAP-2012)

DEVELOPING, ENABLING, AND SUPPORTING DATA AND REPOSITORY CERTIFICATION

Enabling Collaboration for Digital Preservation

USE CASE STUDY. Leveraging Data Through Partnerships The United States Agency for International Development (USAID)

Introduction to Grid Computing

THE GLOBUS PROJECT. White Paper. GridFTP. Universal Data Transfer for the Grid

DataONE. Promoting Data Stewardship Through Best Practices

Interoperability ~ An Introduction

March 21, 2016 MEMORANDUM FOR THE HEADS OF EXECUTIVE DEPARTMENTS AND AGENCIES. Building National Capabilities for Long-Term Drought Resilience

spatial metadata automation

INSPIRE in a nutshell, and overview of the European Union Location Framework

Status Spring Irge Olga Aujouannet Director, Global Policy Affairs

UCLA RESEARCH INFORMATICS STRATEGIC PLAN Taking Action June, 2013

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E

Resolution adopted by the General Assembly. [without reference to a Main Committee (A/62/L.30 and Add.1)]

Library Board of Directors / Board of County Commissioners 21 May 2015

GRIDS INTRODUCTION TO GRID INFRASTRUCTURES. Fabrizio Gagliardi

Emily Vuxton and Lauren Leuck U.S. Army Corps of Engineers Institute for Water Resources (IWR) Alexandria, VA

The library s role in promoting the sharing of scientific research data

Building Resilience to Disasters for Sustainable Development: Visakhapatnam Declaration and Plan of Action

Geoffrey Fox Community Grids Laboratory Indiana University

einfrastructures Concertation Event

Earth Observation, Climate and Space for Smarter Government

SEAD Data Services. Jim Best Practices in Data Infrastructure Workshop. Cooperative agreement #OCI

Nile River Awareness Kit (NRAK) CD-ROM User s Manual

Big Data Retos y Oportunidades

How UAE is Driving Smart Sustainable Cities: key Achievements and Future Considerations

CHAMELEON: A LARGE-SCALE, RECONFIGURABLE EXPERIMENTAL ENVIRONMENT FOR CLOUD RESEARCH

DataDryad.org and the interoperability continuum.

Design patterns for data-driven research acceleration

Data Management at NIST

Interoperability in Science Data: Stories from the Trenches

Summary of Data Management Principles

Data and information sharing WMO global systems

20-Year Sustainability vision and goals

Comparison of Different Existing Approaches to Accreditation and Assessment

The U.S. National Spatial Data Infrastructure

Wade Sheldon. Georgia Coastal Ecosystems LTER University of Georgia CUAHSI Virtual Workshop Field Data Management Solutions

GOVERNMENT IT: FOCUSING ON 5 TECHNOLOGY PRIORITIES


Infrastructure PA Stephen Lecce

LEGAZPI CITY: DISASTER RESPONSE AND RESILIENCY INITIATIVES

Federal STI Managers Group Presentation to Board on Research Data and Information Ellen Herbst Director, NTIS CENDI Chair

UAE Space Policy Efforts Towards Long Term Sustainability of Space Activities Agenda Item 4; COPUOS June 2017 By: Space Policy and

Research Infrastructures and Horizon 2020

INTAROS Integrated Arctic Observation System

Building a National Address Database. Presented by Steve Lewis, Department of Transportation Mark Lange, Census Bureau July 13, 2017

INTRODUCING RESILIENT LOS ANGELES

AGENCY: National Weather Service, National Oceanic and Atmospheric Administration, U.S.

DOE OFFICE OF INDIAN ENERGY Program Overview May 5, Chris Deschene, Director

A framework for community safety and resilience

GEO Update and Priorities for 2014

Stephanie Stuckey Chief Resilience Officer

Paving the Rocky Road Toward Open and FAIR in the Field Sciences

Principles for a National Space Industry Policy

Transcription:

Open Persistent Access to al Robert J. Sandusky, UIC University of Illinois at Chicago The Net Partners Update: ONE and the Conservancy December 14, 2009

Outline NSF s Net Program ONE Introduction Motivating Challenges Drives Science ONE Overview Who Scope Virtual Organization Cyberinfrastructure Architecture What s Happened So Far Year 1 Goals

NSF s Net Program Each Net project will: Provide reliable digital preservation, access, integration and support for analysis Adapt to changes in technologies and user needs/expectations Be on the leading edge in research in computer science and cyberinfrastructure Be a component for interoperable data preservation and access in the Net Partners NSF: Provides $20 million for 5 years Expects self-sustaining virtual organizations, viable for many decades 3

ONE Introduction: Environmental Challenges

ONE Overview: Environmental Challenges Smith, Knapp, Collins. In press.

ONE Overview: Environmental Challenges

ONE Overview: Environmental Challenges Health of ecological services affect human well-being Support processes: Nutrient cycling Soil formation Provisioning: Food production Fresh water Wood / fiber Fuel Regulation of: Climate Flood Disease Security: Personal safety Resource access From disaster Basic materials: Adequate livelihood Sufficient food Shelter / goods Health: Clean air / water Strength Social relations: Social cohesion Freedom of choice and action Opportunities to achieve Adapted from Millenium Ecosystem Assessment

ONE Scope: Who PI: William K. Michener, University of New Mexico Co-PIs: Robert Cook, Oak Ridge National Laboratory (ORNL) Michael Frame, U.S. Geological Survey National Biological Information Infrastructure (USGS NBII) Stephanie Hampton, National Center for Ecological Analysis and Synthesis (NCEAS) Kathleen Smith, National Ecological Synthesis Center (NESCent) California Digital Library Co-Investigators from: California Digital Library University of California - Davis University of Southampton CSIRO, Australia Cornell University Ecological Society of America Keystone Center NCSA University of Illinois at Chicago University of Kansas University of Manchester University of Michigan University of Southern CA University of Tennesse, Knoxville University of Edinburgh Utah State University UNM, NCEAS, NESCent, ORNL

ONE Scope: Biological e.g., Gene, Organism, Population, Species, Community, Biome, Ecosystem Environmental e.g., Atmospheric, Chemical, Ecological, Hydrological, Oceanographic, Physical Social e.g., Land use, human population Economic e.g., trade, ecosystem services, resource extraction

ONE Scope: Halpern et al, 2008, A Global Map of Human Impact on Marine Ecosystems. 319, 15 February, 2008, 948.

ONE Scope: Objectives ONE strategic objectives 1. Engage the broadest possible community 2. Create an informatics literate populace 3. Build an extensive data resource 4. Build infrastructure to support the full data life cycle 5. Ensure financial support and sustainability 6. Provide responsive governance and management Universal access to data about life on earth and the environment that sustains it

NSF Engagement, Coordination and Management Net Partners Principal Investigator Leadership Team Director Development & Operations Director Community Engagement & Outreach R&D Core Cyberinfrastructure Team R&D

NSF Engagement, Coordination and Management External Advisory Committee Net Partners Director Development & Operations Principal Investigator Executive Director ONE Office Leadership Team Director Community Engagement & Outreach R&D CI Operations Core CI Team R&D Operations DIUG Education and Outreach Team

NSF Engagement, Coordination and Management External Advisory Committee Net Partners Director Development & Operations Principal Investigator Executive Director ONE Office Leadership Team Director Community Engagement & Outreach R&D CI Operations Core CI Team R&D Operations DIUG Education and Outreach Team Federated security Distributed storage preservation, metadata, and interoperability Scientific workflows integration and semantics Exploration, Visualization, Analysis Usability and assessment Cyberinfrastructure & Research Working Groups

NSF Engagement, Coordination and Management External Advisory Committee Net Partners Director Development & Operations Principal Investigator Executive Director ONE Office Leadership Team Director Community Engagement & Outreach R&D CI Operations Core CI Team R&D Operations DIUG Education and Outreach Team Federated security Distributed storage preservation, metadata, and interoperability Scientific workflows integration and semantics Exploration, Visualization, Analysis Usability and assessment Cyberinfrastructure & Research Working Groups Sociocultural barriers to data sharing and preservation Community engagement and education Citizen science and public outreach Long-term sustainability and governance Exploration, Visualization, Analysis Usability and assessment Engagement & Research Working Groups

Cyberinfrastructure Objectives Support synthesis in earth observation sciences Support full lifecycle of scientific process acquisition and management preservation discovery and access integration analysis and visualization Process management and preservation Evolve to accommodate technology change

ONE CI Design Goals Distributed data management at distributed nodes Replication and caching for preservation and performance Software must provide benefits for scientists today Support and adapt existing community software efforts Evolution of software and standards Emphasize Free and Open Source Software

ONE Cyberinfrastructure Coordinating Member Nodes Nodes retain complete metadata diverse institutions catalog subset serve local of all community data perform basic indexing provide network-wide resources for services managing their data ensure data availability (preservation) provide replication services Flexible, scalable, sustainable network

ONE Deployment

ONE CI Components

Node Design Member nodes Geographically Distributed Nodes observing institutions Libraries contributing capacity; levering repositories Existing disciplinary repositories Government agencies Authoritative repository for many datasets Diversity tolerant (less tightly coordinated) Freedom to try new tools, methods, and leapfrog forward Location of replicated data Coordinating nodes Completely replicated Complete metadata catalog Tightly coordinated, stable service platform Provide centralized services

ONE Service API Federated Identity and Authorization Services Object Management Services Discovery and Usage Services Preservation Services Network Services

Service API for Interoperability Common access methods for different clients Mechanism to map heterogeneous services Provide interface between nodes and service requests Simplicity of construction Lightweight Ease of implementation Implementations are hidden from service consumers

Investigator Toolkit Suite of software tools for researchers Emphasize Free and Open Source, but support commercial General analysis frameworks (e.g., R, MATLAB) Domain-specific tools (e.g., GARP, Phylocom) Organized using scientific workflows (e.g., Kepler, Taverna) Portals (e.g., myexperiemnt, VegBank) Supports the scientific lifecycle management and preservation query and access analysis and visualization Process management and preservation Communication via the Service API

ONE Cyberinfrastructure

ONE Cyberinfrastructure

ONE Cyberinfrastructure

ONE Cyberinfrastructure

ONE Cyberinfrastructure

Where We Are Meeting regularly Core cyberinfrastructure team Input from working group leaders Defining architecture, APIs Beginning first prototype Acquiring CN hardware Community engagement team Leadership team One working group constituted and active Building the organization One director hired, others in the pipeline External advisory board Programmers hired, others in the pipeline Working groups writing charters, identifying members

Year 1 Goals for Cyberinfrastructure Launch 3 Coordinating Nodes: ORNL, UNM, UCSB Launch 3 Member Nodes, drawn from: Dryad at UNC Distributed Active Archive Center at ORNL Knowledge Biocomplexity Water Resource Center (CDL) National Biological Information Infrastructure (NBII) Clearing House and metadata replication Interoperable metadata search and data retrieval Basic logging, health and heartbeat

Discussion Questions, comments? ONE http://www.dataone.org/ E-mail sandusky@uic.edu Acknowledgements Sustainable Digital Preservation and Access Network Partners (Net); Program Solicitation NSF 07-601