Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation

Similar documents
An Approach to Software Preservation

Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan

The EOC Geoservice: Standardized Access to Earth Observation Data Sets and Value Added Products ABSTRACT

An overview of the OAIS and Representation Information

Scientific Data Curation and the Grid

The CEDA Archive: Data, Services and Infrastructure

From Open Data to Data- Intensive Science through CERIF

Initial Operating Capability & The INSPIRE Community Geoportal

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal

INSPIRE & Environment Data in the EU

eresearch Collaboration across the Pacific:

Reducing Consumer Uncertainty

The OAIS Reference Model: current implementations

Data is the new Oil (Ann Winblad)

Long-term digital preservation of UNSWorks

The Scottish Spatial Data Infrastructure (SSDI)

SAFER the GIGAS Effect

GeoDCAT-AP Representing geographic metadata by using the "DCAT application profile for data portals in Europe"

EarthLookCZ as Czech way to GMES

The European Commission s science and knowledge service. Joint Research Centre

Digital repositories as research infrastructure: a UK perspective

Towards a joint service catalogue for e-infrastructure services

INSPIRE status report

The GeoPortal Cookbook Tutorial

SEXTANT 1. Purpose of the Application

Earth Science Community view on Digital Repositories

Project Information. Advanced Climate Research Infrastructure for Data (ACRID)

Linking datasets with user commentary, annotations and publications: the CHARMe project

Reducing Consumer Uncertainty Towards a Vocabulary for User-centric Geospatial Metadata

The UK Marine Environmental Data and Information Network MEDIN

EUDAT-B2FIND A FAIR and Interdisciplinary Discovery Portal for Research Data

EUDAT Data Services & Tools for Researchers and Communities. Dr. Per Öster Director, Research Infrastructures CSC IT Center for Science Ltd

Uniform Resource Management

Enabling Efficient Discovery of and Access to Spatial Data Services. CHARVAT, Karel, et al. Abstract

German EO missions: TerraSAR-X, TanDEM-X, EnMAP, Firebird

The ESA CASPAR Scientific Testbed and the combined approach with GENESI-DR

Jeffery S. Horsburgh. Utah Water Research Laboratory Utah State University

DCC Disciplinary Metadata

EUDAT- Towards a Global Collaborative Data Infrastructure

Metadata Models for Experimental Science Data Management

Show me the data. The pilot UK Research Data Registry. 26 February 2014

EUDAT. Towards a pan-european Collaborative Data Infrastructure

spatial metadata automation

Technical documentation. SIOS Data Management Plan

Networking European Digital Repositories

Networking European Digital Repositories

Towards FAIRness: some reflections from an Earth Science perspective

Earth Observation Payload Data Ground Systems Infrastructure Evolution LTDP SAFE. EO Collection and EO Product metadata separation Trade-Off

METAINFORMATION INFRASTRUCTURE FOR GEOSPATIAL INFORMATION

GeoDCAT-AP: Use cases and open issues

Webservice-energy.org GEO Community Portal & Spatial Data Infrastructure for Energy

Introduction to INSPIRE. Network Services

RDDS: Metadata Development

From the INSPIRE Engine Room

GENeric European Sustainable Information Space for Environment.

Future Core Ground Segment Scenarios

PortalU, a Tool to Support the Implementation of the Shared Environmental Information System (SEIS) in Germany

GEOSS Data Management Principles: Importance and Implementation

EUDAT & SeaDataCloud

EUDAT. Towards a pan-european Collaborative Data Infrastructure

CCSDS STANDARDS A Reference Model for an Open Archival Information System (OAIS)

Striving for efficiency

RADAR. Establishing a generic Research Data Repository: RESEARCH DATA REPOSITORY. Dr. Angelina Kraft

Software Requirements Specification for the Names project prototype

Science Europe Consultation on Research Data Management

The geospatial metadata catalogue. FOSS4G Barcelona. Jeroen Ticheler. Founder and chair. Director

MY DEWETRA IPAFLOODS REPORT

Bruce Wright, John Ward, Malcolm Field, Met Office, United Kingdom

Technical documentation. D2.4 KPI Specification

The Photon and Neutron Data Initiative PaN-data

Deliverable 6.4. Initial Data Management Plan. RINGO (GA no ) PUBLIC; R. Readiness of ICOS for Necessities of integrated Global Observations

Compass INSPIRE Services. Compass INSPIRE Services. White Paper Compass Informatics Limited Block 8, Blackrock Business

Making research data repositories visible and discoverable. Robert Ulrich Karlsruhe Institute of Technology

Session Two: OAIS Model & Digital Curation Lifecycle Model

Developing data catalogue extensions for metadata harvesting in GIS

INSPIRE overview and possible applications for IED and E-PRTR e- Reporting Alexander Kotsev

Toward Horizon 2020: INSPIRE, PSI and other EU policies on data sharing and standardization

Trusted Digital Repositories. A systems approach to determining trustworthiness using DRAMBORA

Pascal Gilles H-EOP-GT. Meeting ESA-FFG-Austrian Actors ESRIN, 24 th May 2016

Global Cryosphere Watch. GCW web portal Structure and demonstration

Data Replication: Automated move and copy of data. PRACE Advanced Training Course on Data Staging and Data Movement Helsinki, September 10 th 2013

Leveraging metadata standards in ArcGIS to support Interoperability. Aleta Vienneau and Marten Hogeweg

RDF and Digital Libraries

EUDAT. Towards a pan-european Collaborative Data Infrastructure - A Nordic Perspective? -

Extending the Implementation of PREMIS to Geospatial Resources in the Stanford Digital Repository: An Exploration

European Cloud Initiative: implementation status. Augusto BURGUEÑO ARJONA European Commission DG CNECT Unit C1: e-infrastructure and Science Cloud

The digital preservation technological context

Sharing geographic data across the GEF IW Portfolio: IW:LEARN Web-based GIS

Registry Interchange Format: Collections and Services (RIF-CS) explained

DEVELOPING, ENABLING, AND SUPPORTING DATA AND REPOSITORY CERTIFICATION

UKOLN involvement in the ARCO Project. Manjula Patel UKOLN, University of Bath

Metadata Workshop 3 March 2006 Part 1

Linking library data: contributions and role of subject data. Nuno Freire The European Library

(Geo)DCAT-AP Status, Usage, Implementation Guidelines, Extensions

NEODC Service Level Agreement (SLA): NDG Discovery Client Service

Relation between Geospatial information projects related to GBIF

/// INTEROPERABILITY BETWEEN METADATA STANDARDS: A REFERENCE IMPLEMENTATION FOR METADATA CATALOGUES

IR on metadata Change proposal(s) on the Resource Locator element

The Value of Metadata

An Introduction to Digital Preservation

Transcription:

Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation INSPIRE 2010, KRAKOW Dr. Arif Shaon, Dr. Andrew Woolf (e-science, Science and Technology Facilities Council, UK) 3 May, 2010

Science and Technology Facilities Council Provide large-scale scientific facilities for UK Science, particularly in physics and astronomy E-Science Centre at RAL and DL Provides advanced IT development and services to the STFC Science Programme Strong interest in Digital Curation of our science data; R&D Programme: DCC, CASPAR Home of two key UK environmental data centres: British Atmospheric Data Centre (BADC) and NERC Earth Observational Data Centre (NEODC) Active contributor in the international arena of Environmental Informatics, e.g. OGC(OWS 6), INSPIRE and ESA

Long-term Preservation of Spatial Information: Motivation for INSPIRE The INSPIRE directive requires global availability and uniform accessibility of heterogeneous environmental datasets across Europe through common Implementation Rules (IRs) Metadata standards(e.g. ISO 19115) and Interoperable Data Infrastructures (e.g. INSPIRE SDI). Interoperability does not always guarantee sustainability over the long-term. A key question to be answered- What happens to the data when a data provider ceases to exist? A phenomenal deluge of spatial data over the last decade Triggered by the growing concerns over environmental problems, such as global climate changes Ensuring sustained access to these data is becoming more difficult. Efficient long-term preservation is required for both current and historical spatial data exposed through INSPIRE.

Long-term Preservation of Spatial Information: Main Challenges Environmental data inherit the preservation challenges inherent to all digital information. Existing preservation approaches and standards, such as the OAIS Reference Model should be also applicable to environmental data. Environmental data adds to these: Highly structured and complex data models ( feature types ) that require special knowledge for accurate interpretation. Static data being replaced with dynamic web services, such as the OGC web services. Existing preservation approaches would need to be tailored to handle these added complexities. The work presented explored the applicability of the OAIS Reference model to the preservation of environmental data.

The Current State of Play A relatively underexplored area until recently, when ESA announced the Long Term Digital Preservation (LTDP) initiative for their increasingly voluminous Earth Observation datasets. Other notable relevant initiatives: The National Geospatial Digital Archive (NGDA) project funded under the National Digital Information Infrastructure and Preservation Program (NDIIPP): approach specific to US-based data The Geospatial Electronic Records (GER) project: new metadata format introduced is incompatible with ISO 19115 - the metadata format required by European law and INSPIRE for describing European environmental data. Some exploratory work by the Digital Preservation Coalition (DPC)

The OAIS Reference Model A widely adopted ISO Standard for long-term preservation of digital objects Defines a information model that needs to be captured for effective preservation Aids data management, e.g. Provenance history, versioning info Facilitates data discovery, e.g. Keywords, abstract Information needed to render data in future, e.g. Software, hardware

Preservation Aspects of INSPIRE What is missing: ISO19115 is not curation-aware Insufficient RI Data annotation is not captured Metadata Curation SDI What already exists: ISO 19115 -Good for resource discovery Controlled vocabulary for semantic metadata validation Data Preservation Ad-hoc approaches to data management and storage Not considered in this project

A Preservation-focused INSPIRE SDI Annotate Data Verify Authenticity, Quality Metadata Curation View RI, Annotation etc. RI Registry RI Re-render Raw Data Data Preservation

A Preservation Profile of ISO Extends MD_ApplicationSchemaInformation Refines the LI_Lineage used and to create a 19115 particular feature view of a LI_ProcessStep source geospatial elements dataset Extends DQ_Elements that Adds information about the provide Captures mapping detailed between account provenance a source of the data and its application schema quality history, assurance such change measures of ownership applied and/or preservation to a dataset body Adds information about applications/software/services Extends MD_Usage elements required to effectively apply the mapping Adds Records fixity of information, all major events such as checksum (including Captures preservation-related) and annotation digital signature made to to that Defines additional data specific verify have data affected and RI (e.g. unauthorised its context data formats, storage the data, modification such as major media), mainly in the form of web-accessible resources (e.g. to platform Adds data value change to data and preservation URL). certification process during its life In the OAIS, this is Preservation Enables data providers to cycle record RI in other formats than ISO Descriptive Information 19115. Essential for verifying the provenance of a dataset as well as the reliability of its preservation in the future The OAIS defines this information as Preservation Descriptive Information

A Prototype Preservation-aware Geo-Portal Implemented a web-based portal that demonstrates the underlying functions of a preservation-aware SDI Based on GeoNetwork a widely adopted open source and standards-based catalogue service, also used for the INSPIRE GeoPortal. Key features: Recording, editing, searching and viewing metadata in the Preservation profile of ISO 19115 Versioning of metadata Annotation of both Data and Metadata through an intuitive and user friendly wizard; captures annotation context in Xpath for metadata records

A Prototype Preservation-aware Geo-Portal (Annotation Wizard)

A Prototype Preservation-aware Geo-Portal (Annotation Wizard)

Conclusions & Future Directions Long-term preservation of both current and historical environmental data exposed through INSPIRE is highly important for e.g. monitoring and analysing climate change. Awareness is growing in Europe with the emergence of ESA LTDP, albeit not addressed in the current INSPIRE directive. The work presented investigates the requirements for a preservation-aware SDI for INSPIRE and presents a preservation profile of ISO 19115 that outlines the metadata requirements. Future work would need to focus on the implementation of efficient and interoperable data preservation solutions for the INSPIRE data repositories.

References National Geospatial Digital Archive (NGDA) Project - http://www.digitalpreservation.gov/partners/ngda/ngda.ht ml National Digital Information Infrastructure and Preservation Program (NDIIPP) - http://www.digitalpreservation.gov/library/ Geospatial Electronic Records (GER) project - http://www.ciesin.columbia.edu/ger/ Open Geospatial Consortium (OGC) Web Services Specifications - http://www.opengeospatial.org/standards Open Archival Information System (OAIS) Reference Model - http://public.ccsds.org/publications/archive/650x0b1.pdf

Questions?