The ESA CASPAR Scientific Testbed and the combined approach with GENESI-DR


S. Albani (ACS c/o ESA-ESRIN), Sergio.Albani@esa.int
PV2009, 1-3 December 2009, Madrid

SUMMARY
- ESA, CASPAR and Long Term Data Preservation
- ESA CASPAR testbed
- CASPAR & GENESI-DR combined approach

ESA and EO introduction
- ESA users worldwide have access to ~4 PB of EO data
- EO data provide global coverage of the Earth
- Data volumes are increasing dramatically
- Large requirements for accessing historical archives
- This unique dataset has to be preserved!
- ESA is promoting a European EO LTDP Strategy
- ESA is involved in several international preservation activities

CASPAR: Cultural, Artistic and Scientific knowledge for Preservation, Access and Retrieval
- CASPAR is an Integrated Project co-financed by the EU within the Sixth Framework Programme (Priority IST-2005-2.5.10, "Access to and preservation of cultural and scientific resources").
- CASPAR has built a framework to support the end-to-end preservation lifecycle for digital information, based on the OAIS reference model, with a strong focus on the preservation of the knowledge associated with the data.
- Duration: April 2006 to November 2009

ESA role in CASPAR
ESA participation in CASPAR was mainly driven by the interest in:
- consolidating and extending the validity of the OAIS reference model, already adopted in several internal initiatives (e.g. SAFE);
- developing preservation techniques/tools covering not only the data but also the knowledge associated with them, in order to maintain the scientific capabilities of ES data users.
CASPAR Scientific Testbed:
- ESA: user and data/infrastructure provider
- ACS: technical side of the testbed implementation

Testbed focus
- Testbed scenarios have been implemented taking into account the current ESA archives and the European EO LTDP Common Guidelines
- Strong focus on:
  - knowledge management and preservation
  - data accessibility/usability
  - preservation of higher level data, processing capabilities and science applications
- Archived data (in the form of AIPs) shall contain all the elements necessary to be accessed, used, understood and processed to obtain mission products to be delivered to users (in the form of DIPs)
- Provide and maintain mission product generation capability (systematic or through ordering) from AIPs to DIPs, including the processing chains
- Allow information extraction from low-level EO products and information preservation by supporting chainable information-based services
- Adopt a common standard reference model for the archives (ISO 14721, the OAIS standard)

Testbed goals
Development of a complete preservation system built 100% on CASPAR components (ESA CASPAR System):
- supporting data providers in preserving the users' capability to process data using appropriate knowledge
- providing basic archiving features such as Ingest, Access and Retrieve
AND:
- Knowledge preservation
- RepInfo creation and appropriate browsing
- User Communities profiling
- OAIS compliance
- On-demand generation of data

Testbed activities
The ESA testbed has covered:
- the setup of the framework in ESA-ESRIN;
- the definition and collection of a significant sample of a whole processing chain dataset;
- the conversion of data from the native format to an OAIS-compliant format;
- the analysis of ontologies to describe and preserve scientific workflows (e.g. the applicability of CIDOC CRM to scientific data);
- the generation of appropriate Representation Information, Descriptive Information, Knowledge Modules and Scientific Community profiles;
- the implementation of a 100% CASPAR-based archiving system;
- the ingestion and the retrieval (through profile-based access) of data and related RepInfo;
- coping with some long term data preservation problems by using only CASPAR components, methodology and tools.

Testbed Dataset
- The dataset selected by ESA for the CASPAR scientific testbed consists of data from GOME (Global Ozone Monitoring Experiment), a sensor on board the ESA ERS-2 (European Remote Sensing) satellite
- Goal: preservation of the ability to process GOME data from L1B to L1C
[Diagram labels: GOME L1 products (L1B, L1C); L1B->L1C processor; L1B->L1C source code; documents: readme_1st.doc, ERS-Products.pdf, readme.doc, ProductSpecification.pdf, release_l01.doc, PSD.pdf, user_manual.pdf, license.doc, howtouse_l01.doc, disclaimer.pdf; background knowledge: The Ozone, The ERS-2 satellite, The GOME sensor, The C Bible, The OS Bible]

Ingested AIP (OAIS compliant)
- Content Information
  - Data Object
  - Representation Information: the RepInfo provided are contained in the manifest file and in the schemas
- Descriptive Information: metadata is extracted from the product and contained in the manifest file
- Preservation Description Information
  - Reference: the filename itself
  - Provenance: the principal investigator who recorded the data and the information concerning its storage, handling and migration
  - Context: empty
  - Fixity: a Cyclic Redundancy Check (CRC) code for a file
- AIP Packaging Information: no packaging restrictions
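
The AIP layout on this slide can be illustrated with a small data structure. This is only a sketch under stated assumptions: the class and function names (`AIP`, `PDI`, `build_aip`, `verify_fixity`) are hypothetical and are not the CASPAR packaging API, the CRC is computed with Python's standard `zlib.crc32` as a stand-in for whatever CRC variant the testbed used, and it assumes the reading that Reference is the filename and Context is left empty.

```python
from __future__ import annotations

import zlib
from dataclasses import dataclass


@dataclass
class PDI:
    """Preservation Description Information (field names are illustrative)."""
    reference: str       # assumed: the filename itself
    provenance: str      # e.g. principal investigator and handling history
    context: str         # left empty in this sketch
    fixity_crc32: int    # CRC of the data object


@dataclass
class AIP:
    """Hypothetical in-memory view of an Archival Information Package."""
    data_object: bytes
    representation_info: list[str]    # e.g. manifest and schema names
    descriptive_info: dict[str, str]  # metadata extracted from the product
    pdi: PDI
    packaging_info: str = "no packaging restrictions"


def build_aip(filename: str, payload: bytes, rep_info: list[str],
              metadata: dict[str, str], provenance: str) -> AIP:
    """Assemble an AIP and record a CRC-32 of the payload as fixity."""
    pdi = PDI(reference=filename, provenance=provenance, context="",
              fixity_crc32=zlib.crc32(payload))
    return AIP(data_object=payload, representation_info=list(rep_info),
               descriptive_info=dict(metadata), pdi=pdi)


def verify_fixity(aip: AIP) -> bool:
    """Recompute the CRC and compare it with the stored fixity value."""
    return zlib.crc32(aip.data_object) == aip.pdi.fixity_crc32
```

A fixity check of this kind only detects accidental corruption; it does not by itself establish authenticity.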

Ingest phase
[Diagram labels: Data Producer; GOME L1B data SIP; L1 Processor SIP; Level 1B AIP; Level 1C Proxy AIP; Level 1 Docs AIP; Processor Executable AIP; Processor Source Code AIP; Processor Docs AIP; CASPAR components: PACK, FIND, PDS, RepInfo REG, KM]

Search and Retrieve phase
[Diagram labels: GOME Expert; GOME User; Level 1B AIP; Level 1C Proxy AIP; Level 1C AIP On Demand; Level 1 Docs AIP; Processor Executable AIP; Processor Source Code AIP; Processor Help Docs AIP; RepInfo; Additional RepInfo; CASPAR components: FIND, PDS, KM, Registry, PACK]
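
The slide distinguishes a GOME Expert from a GOME User, who receive different amounts of RepInfo. One way to picture profile-based retrieval is as a walk over a dependency graph that returns only the knowledge a user does not already have. The sketch below is an illustration, not the CASPAR Knowledge Manager or Registry interface: the graph contents, the item names and the function `missing_repinfo` are all hypothetical.

```python
from __future__ import annotations

# Hypothetical dependency graph: each item lists the knowledge needed to use it.
REPINFO_DEPENDENCIES = {
    "GOME L1B product": ["L1B product specification", "GOME sensor description"],
    "L1B product specification": [],
    "GOME sensor description": ["ERS-2 satellite description"],
    "ERS-2 satellite description": [],
}


def missing_repinfo(item: str, known: set[str],
                    graph: dict[str, list[str]]) -> list[str]:
    """Return the RepInfo a user lacks for `item`, in depth-first order.

    Anything in `known` is skipped together with its own dependencies,
    on the assumption that the user already understands it.
    """
    missing: list[str] = []
    seen: set[str] = set()

    def visit(node: str) -> None:
        for dep in graph.get(node, []):
            if dep in known or dep in seen:
                continue
            seen.add(dep)
            missing.append(dep)
            visit(dep)

    visit(item)
    return missing
```

With an empty profile the function returns every dependency of the product; with a profile that already covers the product specification and the sensor description it returns nothing.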

Underlying ontology

Preservation process (update phase)
[Diagram labels: CASPAR GOME L1 Dataset (L1 products, L1B->L1C processor, L1B->L1C processor source code, documents); User Community (uses GOME data); CASPAR components: POM, FIND, PDS, PDI & Alert. Events chain: OS or lib change; alert notified; get processor source code; processor recompiling; processor recompiled; processor reingested (new processor ingestion); RepInfo update; docs & links updated; notification to users]
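
The events chain on this slide can be read as: a change in the operating system or a library triggers an alert, the processor is rebuilt from its preserved source and re-ingested, and the users are notified. The sketch below illustrates that reading only; the ordering of the steps is an assumption drawn from the diagram labels, and `PreservedProcessor`, `handle_change` and the `recompile` callback are hypothetical names, not CASPAR components.

```python
from dataclasses import dataclass, field


@dataclass
class PreservedProcessor:
    """Hypothetical record of a preserved processor and who uses it."""
    name: str
    depends_on: set                      # e.g. {"FFTW", "OS"}
    version: int = 1
    subscribers: list = field(default_factory=list)


def handle_change(processor, changed_component, recompile):
    """React to a change in the environment.

    If the changed component is not a dependency, nothing happens.
    Otherwise the processor is recompiled (via the caller-supplied
    `recompile` callback), its version is bumped to stand in for
    re-ingestion, and one notification per subscriber is returned.
    """
    if changed_component not in processor.depends_on:
        return []
    recompile(processor)      # stand-in for fetching source and rebuilding
    processor.version += 1    # stand-in for re-ingesting the new processor
    return [
        f"{user}: {processor.name} updated to v{processor.version} "
        f"after change in {changed_component}"
        for user in processor.subscribers
    ]
```

Passing the rebuild step in as a callback keeps the sketch independent of any particular build system.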

Testbed Validation
- Change in Software (new release of the FFTW library needed to compile the processor)
- Change in Environment (migration from an obsolete Linux operating system to the more widely used Sun Solaris)

ESA CASPAR System DEMO
http://caspar-nas.esrin.esa.int:9999/caspar-demo2

Benefits and major outcomes
- Framework validation (CASPAR components are suitable for preservation of ES data)
- Lessons learnt:
  - preservation of knowledge associated with data
  - preservation not only of data but also of data processing
  - best practices to cope with long term data preservation problems by using the OAIS model
  - real applications
- Main outcomes:
  - development of a framework built 100% on CASPAR components (the ESA CASPAR System is available for further enhancement/testing and for users and data owners/providers willing to see a practical approach to preservation using CASPAR solutions)
  - demonstration of the suitability of CASPAR solutions for applications in the Earth Science field (in the ESA EO Ground Segments infrastructure)
  - integration with GENESI-DR

GENESI-DR: Ground European Network for Earth Science Interoperations - Digital Repositories
- GENESI-DR is a federation of Digital Repositories (DR) dedicated to Earth Science
- GENESI-DR provides users/applications with open access to different European Earth Science Digital Repositories through the same interface.

CASPAR & GENESI-DR
[Diagram labels: GENESI-DR; Webportal; External Application; CASPAR Application Interface; CASPAR; repositories and partners: ENEA, KSAT, CENIA, NILU, CNES, ISPL, EGEE, Infoterra, JRC, ESA, CNR]
- CASPAR will benefit from the GENESI-DR services to validate its data preservation framework more completely in the Earth Science domain
- The GENESI-DR Research Infrastructure will demonstrate its ability to adopt the data preservation and curation mechanisms defined in CASPAR.
- The integration of the ESA CASPAR System in the GENESI-DR infrastructure will promote the CASPAR preservation model in a wide community, sharing the ESA CASPAR experience with other ES stakeholders
- We are evaluating how to evolve CASPAR and GENESI-DR to respond to new requirements in the ES community

CASPAR & GENESI-DR approach
[Diagram labels: GENESI-DR; Ozone Processing and Profiles Validation services; ozone profiles; GOME L1B data; L1C data; L1B->L1C processor; CASPAR DR]
1. GENESIfication of a CASPAR-based DR (ESA CASPAR System)
2. Development of services accessible through GENESI-DR to estimate vertical profiles of ozone or generate L1C data, using processing software and data both preserved in CASPAR
3. Allowing users to preserve their processing results in CASPAR
4. Returning profile-based Representation Information to GENESI users
5. Defining a strategy for propagating CASPAR features to other interested GENESI DRs

CASPAR & GENESI-DR DEMO

THANK YOU!!!
ESA CASPAR TEAM: Luigi Fusco, Sergio Albani, Pasquale Renna
ACS CASPAR TEAM: Ugo Di Giammatteo, Fulvio Marelli, Marco Fulcoli, Alessio d'Innocenti
ESA GENESI-DR TEAM: Roberto Cossu, Eliana Li Santi
www.esa.int www.acsys.it www.genesi-dr.eu www.casparpreserves.eu