High Performance Data Efficient Interoperability for Scientific Data

Size: px
Start display at page:

Download "High Performance Data Efficient Interoperability for Scientific Data"

Transcription

1 High Performance Data Efficient Interoperability for Scientific Data Alex Ip 1, Andrew Turner 1, Dr. David Lescinsky 1 1 Geoscience Australia, Canberra, Australia

2 Problem: Legacy Data Formats holding us back GA s geoscientific data is currently held in several legacy formats, some of them proprietary Data is not readily interoperable each different format requires custom development.

3 Metadata Tedious but Vital Metadata is of varying quality and accessibility usually held separately to data. Discovery is hit-or-miss. Metadata is sometimes deduced from filename, context or actual data values Need Further Information!

4 Modern Container Formats (e.g. NetCDF/HDF) Transparency and interoperability of text-based file formats, combined with efficiency of binary formats Encapsulated metadata ensures that data and metadata never become separated Internal indexing for rapid subsetting to support large-scale parallel processing Compression and chunking for efficient storage and retrieval Robust, well-proven software libraries for most languages and environments Able to be consumed via web services (e.g. OPeNDAP) using identical code for direct file access

5 INTEROPERABILITY Container Format Options Which one? NetCDF-CF Highly Interoperable Extensive Toolsets Available Large User Base NetCDF4 Simple Standardised Proven (at large scale) HDF5 Self-Describing Flexible & Extensible Efficient & Scalable Robust Well Supported NetCDF Climate & Forecasting conventions (NetCDF-CF) provides enhanced interoperability by further constraining NetCDF4/HDF5, allowing us to leverage existing tools Network Common Data Form v4 (NetCDF4) facilitates the use of data in HDF5 by providing conventions to constrain structure and metadata. Hierarchical Data Format v5 (HDF5) Flexible, proven, domain-agnostic scientific data format

6 What GA has been doing with its geophysics data GA has been working to convert geophysics data (magnetics, radiometrics, & gravity) to netcdf GA already delivers much of its geophysics data (mostly gridded) from the NCI NERDIP infrastructure NCI has helped GA achieve compliance with relevant data & metadata standards and conventions, e.g: netcdf, CF (Climate & Forecasting), ACDD (Attribute Convention for Data Discovery), etc. GA now has systems to synchronise metadata between its catalogue and the data files Have implemented Jupyter Notebook demonstrators for geophysics data

7 Gridded Data vs. Un-gridded (Point & Line) Gridded data is relatively straightforward in HDF/NetCDF (e.g. Earth observation, climate model output, etc) Un-gridded point and flight-line (trajectory) geophysics data is new for netcdf. Approach based on existing NOAA approaches to oceanographic & atmospheric data. Structure of point/line netcdf is very simple each pointwise attribute is a separate variable. Able to be consumed via OPeNDAP web services Metadata requires careful attention. Need to standardise naming conventions for possible inclusion in CF convention

8 Representing Geoscientific Data in NetCDF4/HDF5 netcdf P452MAG { dimensions: variables: sample = ; line = 114 ; line_index = 115 ; int index_lines(line_index) ; int LINE(line) ; index_lines:my_standard_name = "index_lines" ; index_lines:long_name = "zero based index of the first sample in the line, and then one past the end of the last line" ; index_lines:units = "1" ; index_lines:_fillvalue = ; index_lines:_storage = "contiguous" ; index_lines:_endianness = "little" ; LINE:my_standard_name = "line_number" ; LINE:long_name = "line identification number" ; LINE:units = "1" ; LINE:_FillValue = ; LINE:_Storage = "contiguous" ; LINE:_Endianness = "little" ; short bearing(line) ; bearing:my_standard_name = "aircraft_bearing" ; bearing:units = "degrees" ; bearing:_fillvalue = s ; bearing:_storage = "contiguous" ; bearing:_endianness = "little" ; int datecode(line) ; datecode:my_standard_name = "date" ; datecode:long_name = "local date on which the line was flown" ; datecode:units = "yyyymmdd" ; datecode:_fillvalue = ; datecode:_storage = "contiguous" ; datecode:_endianness = "little" ; short flight(line) ; flight:my_standard_name = "flight_number" ; flight:long_name = "flight identification number" ; flight:units = "1" ; flight:_fillvalue = s ; flight:_storage = "contiguous" ; flight:_endianness = "little" ;

9 Making Geoscientific Data Available with Web Services Using OPeNDAP: Data is served remotely in full or as subset Data can be delivered in several formats including ASCII or binary

10 Accessing geoscientific Data in NetCDF4/HDF5 Data can now be rapidly accessed not only in-situ, but also via OPeNDAP from the NCI directly into environments including Python, R or MATLAB

11 Case Study Airborne Electromagnetic (AEM) Data Subsurface conductivity section visualisation in Jupyter Notebook by Neil Symington Originally tab-delimited text, now netcdf. Provides sub-second data reads c.f. minutes. Raw AEM data or derived inversions both handled.

12 Case Study Airborne Magnetic & Radiometric Survey Line Datasets Trajectory (line) data encoded as netcdf, with line groupings and full metadata. Fully-automated discovery of survey line datasets via CSW, data subset retrieval for area of interest via OPeNDAP web service.

13 Case Study Ground Gravity Survey Point Datasets Currently only available as CSV dumps from GA database ground gravity survey point datasets processed into netcdf with full metadata Both raw data and analysis-ready adjusted values contained in the same file NetCDF permits encoding of point-based metadata as enumerated types (e.g. data quality string, instrument, etc) No sexy visualisations (yet) translation work was only completed last week.

14 FAIR Data Principles and GA s netcdf Findable Datasets are catalogued in GA s ISO19115 metadata catalogue. Metadata is harvested by the NCI, ANDS and data.gov.au, amongst others. Accessible Data is published from the NCI via web services (or via HTTP download) Interoperable netcdf is self-describing, open format, with libraries implemented in many different programming environments. Reusable Data has embedded metadata, with links to full metadata.

15 Summary NetCDF-CF is a highly suitable container format for multiple types of geoscientific data GA is in the process of translating several geoscientific data types as part of a pilot program with the NCI NERDIP program Proposed controlled vocabularies will be presented to communities of practice for approval, before being submitted for inclusion in the CF convention Layered, standards-based architecture will speed the development of future-proof systems Improving data interoperability will help break down silos and drive exciting new, transdisciplinary science.

16 Thank you! Acknowledgements: Ross C Brodie, Yvette Poudjomdjomani, Neil Symington Geoscience Australia Kelsey Druken, Lesley Wyborn National Computational Infrastructure Edward King, Matt Paget - CSIRO Phone: Web: alex.ip@ga.gov.au Address: Cnr Jerrabomberra Avenue and Hindmarsh Drive, Symonston ACT 2609 Postal Address: GPO Box 378, Canberra ACT 2601

Implementing a Data Quality Strategy to simplify access to data

Implementing a Data Quality Strategy to simplify access to data IN43D-07 AGU Fall Meeting 2016 Implementing a Quality Strategy to simplify access to data Kelsey Druken, Claire Trenham, Ben Evans, Clare Richards, Jingbo Wang, & Lesley Wyborn National Computational Infrastructure,

More information

Clare Richards, Benjamin Evans, Kate Snow, Chris Allen, Jingbo Wang, Kelsey A Druken, Sean Pringle, Jon Smillie and Matt Nethery. nci.org.

Clare Richards, Benjamin Evans, Kate Snow, Chris Allen, Jingbo Wang, Kelsey A Druken, Sean Pringle, Jon Smillie and Matt Nethery. nci.org. The important role of HPC and data-intensive infrastructure facilities in supporting a diversity of Virtual Research Environments (VREs): working with Climate Clare Richards, Benjamin Evans, Kate Snow,

More information

Implementing a Data Quality Strategy to simplify access to data

Implementing a Data Quality Strategy to simplify access to data Implementing a Quality Strategy to simplify access to data Kelsey Druken Implementing a Quality Strategy to simplify access to data Kelsey Druken, Claire Trenham, Lesley Wyborn, Ben Evans National Computational

More information

HDF Product Designer: A tool for building HDF5 containers with granule metadata

HDF Product Designer: A tool for building HDF5 containers with granule metadata The HDF Group HDF Product Designer: A tool for building HDF5 containers with granule metadata Lindsay Powers Aleksandar Jelenak, Joe Lee, Ted Habermann The HDF Group Data Producer s Conundrum 2 HDF Features

More information

The Changing Role of Data Stewardship in Creating Trustworthy, Transdisciplinary High Performance Data Platforms for the Future

The Changing Role of Data Stewardship in Creating Trustworthy, Transdisciplinary High Performance Data Platforms for the Future AGU Fall Meeting 2016 IN31-G The Changing Role of Data Stewardship in Creating Trustworthy, Transdisciplinary High Performance Data Platforms for the Future Clare Richards, Ben Evans, Lesley Wyborn, Jingbo

More information

Making data access easier with OPeNDAP. James Gallapher (OPeNDAP TM ) Duan Beckett (BoM) Kate Snow (NCI) Robert Davy (CSIRO) Adrian Burton (ARDC)

Making data access easier with OPeNDAP. James Gallapher (OPeNDAP TM ) Duan Beckett (BoM) Kate Snow (NCI) Robert Davy (CSIRO) Adrian Burton (ARDC) Making data access easier with OPeNDAP James Gallapher (OPeNDAP TM ) Duan Beckett (BoM) Kate Snow (NCI) Robert Davy (CSIRO) Adrian Burton (ARDC) Outline Introduction and trajectory (James Gallapher) OPeNDAP

More information

Technical documentation. SIOS Data Management Plan

Technical documentation. SIOS Data Management Plan Technical documentation SIOS Data Management Plan SIOS Data Management Plan Page: 2/10 SIOS Data Management Plan Page: 3/10 Versions Version Date Comment Responsible 0.3 2017 04 19 Minor modifications

More information

Supporting positioning in Australia through open access multi-gnss data.

Supporting positioning in Australia through open access multi-gnss data. Session: GNSS Networks, Processing and Calibrations Supporting positioning in Australia through open access multi-gnss data. Ryan Ruddick, Nicholas Brown, Brandon Owen, Ted Zhou and Bart Thomas Overview

More information

Ocean Color Data Formats and Conventions:

Ocean Color Data Formats and Conventions: Ocean Color Data Formats and Conventions: NASA's perspective Sean Bailey NASA Goddard Space Flight Center 07 May 2013 International Ocean Color Science Meeting Darmstadt, Germany 1 The Big Picture The

More information

NetCDF-4: : Software Implementing an Enhanced Data Model for the Geosciences

NetCDF-4: : Software Implementing an Enhanced Data Model for the Geosciences NetCDF-4: : Software Implementing an Enhanced Data Model for the Geosciences Russ Rew, Ed Hartnett, and John Caron UCAR Unidata Program, Boulder 2006-01-31 Acknowledgments This work was supported by the

More information

Catalog-driven, Reproducible Workflows for Ocean Science

Catalog-driven, Reproducible Workflows for Ocean Science Catalog-driven, Reproducible Workflows for Ocean Science Rich Signell, USGS, Woods Hole, MA, USA Filipe Fernandes, Centro Universidade Monte Serrat, Santos, Brazil. 2015 Boston Light Swim, Aug 15, 7:00am

More information

BIG DATA CHALLENGES A NOAA PERSPECTIVE

BIG DATA CHALLENGES A NOAA PERSPECTIVE BIG DATA CHALLENGES A NOAA PERSPECTIVE Dr. Edward J. Kearns NASA Examiner, Science and Space Branch, OMB/EOP and Chief (acting), Remote Sensing and Applications Division National Climatic Data Center National

More information

The Common Framework for Earth Observation Data. US Group on Earth Observations Data Management Working Group

The Common Framework for Earth Observation Data. US Group on Earth Observations Data Management Working Group The Common Framework for Earth Observation Data US Group on Earth Observations Data Management Working Group Agenda USGEO and BEDI background Concise summary of recommended CFEOD standards today Full document

More information

These notes are designed to provide an introductory-level knowledge appropriate to understanding the basics of digital data formats.

These notes are designed to provide an introductory-level knowledge appropriate to understanding the basics of digital data formats. A brief guide to binary data Mike Sandiford, March 2001 These notes are designed to provide an introductory-level knowledge appropriate to understanding the basics of digital data formats. The problem

More information

IMOS/AODN ocean portal: tools for data delivery. Roger Proctor, Peter Blain, Sebastien Mancini IMOS

IMOS/AODN ocean portal: tools for data delivery. Roger Proctor, Peter Blain, Sebastien Mancini IMOS IMOS/AODN ocean portal: tools for data delivery Roger Proctor, Peter Blain, Sebastien Mancini IMOS Data from IMOS: The six Nodes Bluewater and Climate Node open ocean focus Five Regional Nodes continental

More information

Introduction to NetCDF

Introduction to NetCDF Introduction to NetCDF NetCDF is a set of software libraries and machine-independent data formats that support the creation, access, and sharing of array-oriented scientific data. First released in 1989.

More information

Uniform Resource Locator Wide Area Network World Climate Research Programme Coupled Model Intercomparison

Uniform Resource Locator Wide Area Network World Climate Research Programme Coupled Model Intercomparison Glossary API Application Programming Interface AR5 IPCC Assessment Report 4 ASCII American Standard Code for Information Interchange BUFR Binary Universal Form for the Representation of meteorological

More information

NetCDF and Scientific Data Durability. Russ Rew, UCAR Unidata ESIP Federation Summer Meeting

NetCDF and Scientific Data Durability. Russ Rew, UCAR Unidata ESIP Federation Summer Meeting NetCDF and Scientific Data Durability Russ Rew, UCAR Unidata ESIP Federation Summer Meeting 2009-07-08 For preserving data, is format obsolescence a non-issue? Why do formats (and their access software)

More information

Big Data Pragmaticalities Experiences from Time Series Remote Sensing

Big Data Pragmaticalities Experiences from Time Series Remote Sensing Big Data Pragmaticalities Experiences from Time Series Remote Sensing Edward King Remote Sensing & Software Team Leader 3 September 2013 MARINE & ATMOSPHERIC RESEARCH Overview Remote sensing (RS) and RS

More information

7C.2 EXPERIENCE WITH AN ENHANCED NETCDF DATA MODEL AND INTERFACE FOR SCIENTIFIC DATA ACCESS. Edward Hartnett*, and R. K. Rew UCAR, Boulder, CO

7C.2 EXPERIENCE WITH AN ENHANCED NETCDF DATA MODEL AND INTERFACE FOR SCIENTIFIC DATA ACCESS. Edward Hartnett*, and R. K. Rew UCAR, Boulder, CO 7C.2 EXPERIENCE WITH AN ENHANCED NETCDF DATA MODEL AND INTERFACE FOR SCIENTIFIC DATA ACCESS Edward Hartnett*, and R. K. Rew UCAR, Boulder, CO 1 INTRODUCTION TO NETCDF AND THE NETCDF-4 PROJECT The purpose

More information

European Marine Data Exchange

European Marine Data Exchange European Marine Data Exchange By Dick M.A. Schaap MARIS (NL) EU SeaDataNet Technical Coordinator EU EMODnet Ingestion Coordinator Noordzeedagen 2018 - October 2018 Acquisition of ocean and marine data

More information

CSIRO and the Open Data Cube

CSIRO and the Open Data Cube CSIRO and the Open Data Cube Dr Robert Woodcock, Matt Paget, Peter Wang, Alex Held CSIRO Overview The challenge The Earth Observation Data Deluge Integrated science needs Data volume, rate of growth and

More information

netcdf-ld SKOS: demonstrating Linked Data vocabulary use within netcdf-compliant files

netcdf-ld SKOS: demonstrating Linked Data vocabulary use within netcdf-compliant files : demonstrating Linked Data vocabulary use within netcdf-compliant files Nicholas Car Data Architect Geoscience Australia nicholas.car@ga.gov.au Prepared for ISESS2017 conference (http://www.isess2017.org/)

More information

SCSODC: Integrating Ocean Data for Visualization Sharing and Application

SCSODC: Integrating Ocean Data for Visualization Sharing and Application IOP Conference Series: Earth and Environmental Science OPEN ACCESS SCSODC: Integrating Ocean Data for Visualization Sharing and Application To cite this article: C Xu et al 2014 IOP Conf. Ser.: Earth Environ.

More information

Data Access and Analysis with Distributed, Federated Data Servers in climateprediction.net

Data Access and Analysis with Distributed, Federated Data Servers in climateprediction.net Data Access and Analysis with Distributed, Federated Data Servers in climateprediction.net Neil Massey 1 neil.massey@comlab.ox.ac.uk Tolu Aina 2, Myles Allen 2, Carl Christensen 1, David Frame 2, Daniel

More information

Deliverable 6.4. Initial Data Management Plan. RINGO (GA no ) PUBLIC; R. Readiness of ICOS for Necessities of integrated Global Observations

Deliverable 6.4. Initial Data Management Plan. RINGO (GA no ) PUBLIC; R. Readiness of ICOS for Necessities of integrated Global Observations Ref. Ares(2017)3291958-30/06/2017 Readiness of ICOS for Necessities of integrated Global Observations Deliverable 6.4 Initial Data Management Plan RINGO (GA no 730944) PUBLIC; R RINGO D6.5, Initial Risk

More information

eresearch Collaboration across the Pacific:

eresearch Collaboration across the Pacific: eresearch Collaboration across the Pacific: Marine Systems and Australian Marine Science Craig Johnson University of Tasmania Outline Introduce the Australian Ocean Network Possibilities for trans-pacific

More information

Toward the Development of a Comprehensive Data & Information Management System for THORPEX

Toward the Development of a Comprehensive Data & Information Management System for THORPEX Toward the Development of a Comprehensive Data & Information Management System for THORPEX Mohan Ramamurthy, Unidata Steve Williams, JOSS Jose Meitin, JOSS Karyn Sawyer, JOSS UCAR Office of Programs Boulder,

More information

Adapting Software to NetCDF's Enhanced Data Model

Adapting Software to NetCDF's Enhanced Data Model Adapting Software to NetCDF's Enhanced Data Model Russ Rew UCAR Unidata EGU, May 2010 Overview Background What is netcdf? What is the netcdf classic data model? What is the netcdf enhanced data model?

More information

Lidar Radar Open Software Environment LROSE and the Python ARM Radar Toolkit Py-ART

Lidar Radar Open Software Environment LROSE and the Python ARM Radar Toolkit Py-ART Lidar Radar Open Software Environment LROSE and the Python ARM Radar Toolkit Py-ART Joe VanAndel and Mike Dixon Earth Observing Laboratory (EOL) National Center for Atmospheric Research (NCAR) Scott Collis

More information

Big Data Earth Observation Standardization elements Codrina Ilie TERRASIGNA TF7/SG5

Big Data Earth Observation Standardization elements Codrina Ilie TERRASIGNA TF7/SG5 Big Data Earth Observation Standardization elements Codrina Ilie TERRASIGNA TF7/SG5 1 Earth Observation standardization intro 2 directions: 1. standardization of the Ground Segment Services: Heterogeneous

More information

Python: Working with Multidimensional Scientific Data. Nawajish Noman Deng Ding

Python: Working with Multidimensional Scientific Data. Nawajish Noman Deng Ding Python: Working with Multidimensional Scientific Data Nawajish Noman Deng Ding Outline Scientific Multidimensional Data Ingest and Data Management Analysis and Visualization Extending Analytical Capabilities

More information

Online intercomparison of models and observations using OGC and community standards

Online intercomparison of models and observations using OGC and community standards Online intercomparison of models and observations using OGC and community standards Alastair Gemmell * Jon Blower Keith Haines Adit Santokhee Reading e-science e Centre, Environmental Systems Science Centre,

More information

The Integrated Data Viewer A Tool for Scientific Analysis and Visualization

The Integrated Data Viewer A Tool for Scientific Analysis and Visualization The Integrated Data Viewer A Tool for Scientific Analysis and Visualization Don Murray Unidata Program Center Overview What is the Integrated Data Viewer (IDV)? IDV features Web enabled features Client/Server

More information

Reducing Consumer Uncertainty

Reducing Consumer Uncertainty Spatial Analytics Reducing Consumer Uncertainty Towards an Ontology for Geospatial User-centric Metadata Introduction Cooperative Research Centre for Spatial Information (CRCSI) in Australia Communicate

More information

The C3S Climate Data Store and its upcoming use by CAMS

The C3S Climate Data Store and its upcoming use by CAMS Atmosphere The C3S Climate Data Store and its upcoming use by CAMS Miha Razinger, ECMWF thanks to Angel Alos, Baudouin Raoult, Cedric Bergeron and the CDS contractors Atmosphere What are C3S and CDS? The

More information

Ocean, Atmosphere & Climate Model Assessment for Everyone

Ocean, Atmosphere & Climate Model Assessment for Everyone Ocean, Atmosphere & Climate Model Assessment for Everyone Rich Signell USGS Woods Hole, MA Unidata 2014 DeSouza Award Presentation Boulder, CO : Sep 15, 2014 2 US Integrated Ocean Observing System (IOOS

More information

NetCDF Metadata Guidelines for FY 2011 IOC NOAA Climate Data Records

NetCDF Metadata Guidelines for FY 2011 IOC NOAA Climate Data Records NetCDF Metadata Guidelines for FY 2011 IOC NOAA Climate Data Records This document provides guidance on a recommended set of netcdf metadata attributes to be implemented for the FY 2011 Initial Operating

More information

Analysis Methods in Atmospheric and Oceanic Science

Analysis Methods in Atmospheric and Oceanic Science Analysis Methods in Atmospheric and Oceanic Science AOSC 652 HDF & NetCDF files; Regression; File Compression & Data Access Week 11, Day 1 Today: Data Access for Projects; HDF & NetCDF Wed: Multiple Linear

More information

Working with Scientific Data in ArcGIS Platform

Working with Scientific Data in ArcGIS Platform Working with Scientific Data in ArcGIS Platform Sudhir Raj Shrestha sshrestha@esri.com Hong Xu hxu@esri.com Esri User Conference, San Diego, CA. July 11, 2017 What we will cover today Scientific Multidimensional

More information

Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan

Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan Metadata for Data Discovery: The NERC Data Catalogue Service Steve Donegan Introduction NERC, Science and Data Centres NERC Discovery Metadata The Data Catalogue Service NERC Data Services Case study:

More information

Matlab stoqstoolbox for accessing in situ measurements from STOQS

Matlab stoqstoolbox for accessing in situ measurements from STOQS Matlab stoqstoolbox for accessing in situ measurements from STOQS MBARI Summer Internship Project 2012 Francisco Lopez Castejon (Mentor: Mike McCann) INDEX Abstract... 4 Introduction... 5 In situ measurement

More information

EarthCube and Cyberinfrastructure for the Earth Sciences: Lessons and Perspective from OpenTopography

EarthCube and Cyberinfrastructure for the Earth Sciences: Lessons and Perspective from OpenTopography EarthCube and Cyberinfrastructure for the Earth Sciences: Lessons and Perspective from OpenTopography Christopher Crosby, San Diego Supercomputer Center J Ramon Arrowsmith, Arizona State University Chaitan

More information

Time Series Analytics with Simple Relational Database Paradigms Ben Leighton, Julia Anticev, Alex Khassapov

Time Series Analytics with Simple Relational Database Paradigms Ben Leighton, Julia Anticev, Alex Khassapov Time Series Analytics with Simple Relational Database Paradigms Ben Leighton, Julia Anticev, Alex Khassapov LAND AND WATER & CSIRO IMT SCIENTIFIC COMPUTING Energy Use Data Model (EUDM) endeavours to deliver

More information

The netcdf- 4 data model and format. Russ Rew, UCAR Unidata NetCDF Workshop 25 October 2012

The netcdf- 4 data model and format. Russ Rew, UCAR Unidata NetCDF Workshop 25 October 2012 The netcdf- 4 data model and format Russ Rew, UCAR Unidata NetCDF Workshop 25 October 2012 NetCDF data models, formats, APIs Data models for scienbfic data and metadata - classic: simplest model - - dimensions,

More information

S-100 Product Specification Roll Out Implementation Plan. Introduction

S-100 Product Specification Roll Out Implementation Plan. Introduction S-100 Product Specification Roll Out Implementation Plan Introduction This intent of this plan is to provide status, challenges, timelines, and strategies for the suite of S-100 products under development

More information

Dataset Interoperability Working Group

Dataset Interoperability Working Group Dataset Interoperability Working Group Co-Chairs: Charlie Zender and Peter Leonard Ed Armstrong, Mary Jo Brodzik, Joe Glassy, Aleksander Jelenak, Siri Jodha Khalsa, Wenli Yang; Steve Berrick, Chris Lynnes,

More information

The CEDA Archive: Data, Services and Infrastructure

The CEDA Archive: Data, Services and Infrastructure The CEDA Archive: Data, Services and Infrastructure Kevin Marsh Centre for Environmental Data Archival (CEDA) www.ceda.ac.uk with thanks to V. Bennett, P. Kershaw, S. Donegan and the rest of the CEDA Team

More information

Dataset Interoperability Recommendations for Earth Science

Dataset Interoperability Recommendations for Earth Science Status of this RFC Dataset Interoperability Recommendations for Earth Science This RFC provides information to the NASA Earth Science community. This RFC does not specify an Earth Science Data Systems

More information

Towards a pan-european infrastructure for marine and ocean data management + Importance of standards

Towards a pan-european infrastructure for marine and ocean data management + Importance of standards Towards a pan-european infrastructure for marine and ocean data management + Importance of standards By Dick M.A. Schaap MARIS Technical Coordinator SeaDataNet & ODIP Coordinator EMODnet Bathymetry Münster

More information

Fair data and open data: differences and consequences

Fair data and open data: differences and consequences Fair data and open data: differences and consequences 1. To share or not to share: what is fair? Alex Burdorf, Erasmus MC Rotterdam 2. Data sharing: consequences for informed consent Marie-José Bonthuis,

More information

GEOSS Data Management Principles: Importance and Implementation

GEOSS Data Management Principles: Importance and Implementation GEOSS Data Management Principles: Importance and Implementation Alex de Sherbinin / Associate Director / CIESIN, Columbia University Gregory Giuliani / Lecturer / University of Geneva Joan Maso / Researcher

More information

Interoperability in Science Data: Stories from the Trenches

Interoperability in Science Data: Stories from the Trenches Interoperability in Science Data: Stories from the Trenches Karen Stocks University of California San Diego Open Data for Open Science Data Interoperability Microsoft escience Workshop 2012 Interoperability

More information

Data Management Components for a Research Data Archive

Data Management Components for a Research Data Archive Data Management Components for a Research Data Archive Steven Worley and Bob Dattore Scientific Computing Division Computational and Information Systems Laboratory National Center for Atmospheric Research

More information

OGC at KNMI: Current use and plans

OGC at KNMI: Current use and plans OGC at KNMI: Current use and plans 4th Workshop on the use of GIS/OGC standards in meteorology 4 th of March 2013, Reading 1. Ernst de Vreede 2. Maarten Plieger Contents 1. ADAGUC 2. Internal applications

More information

SciSpark 201. Searching for MCCs

SciSpark 201. Searching for MCCs SciSpark 201 Searching for MCCs Agenda for 201: Access your SciSpark & Notebook VM (personal sandbox) Quick recap. of SciSpark Project What is Spark? SciSpark Extensions scitensor: N-dimensional arrays

More information

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal Heinrich Widmann, DKRZ DI4R 2016, Krakow, 28 September 2016 www.eudat.eu EUDAT receives funding from the European Union's Horizon 2020

More information

Developing data catalogue extensions for metadata harvesting in GIS

Developing data catalogue extensions for metadata harvesting in GIS University of Bergen Department of Informatics Developing data catalogue extensions for metadata harvesting in GIS Author: André Mossige Long master thesis June 2018 Acknowledgements I would like to thank

More information

Leveraging metadata standards in ArcGIS to support Interoperability. David Danko and Aleta Vienneau

Leveraging metadata standards in ArcGIS to support Interoperability. David Danko and Aleta Vienneau Leveraging metadata standards in ArcGIS to support Interoperability David Danko and Aleta Vienneau Leveraging Metadata Standards in ArcGIS for Interoperability Why metadata and metadata standards? Overview

More information

Open Software Standards for Next- Generation Community Satellite Software Packages June 2017

Open Software Standards for Next- Generation Community Satellite Software Packages June 2017 Atmospheric and Environmental Research www.aer.com Lexington, MA 2017 IMAP/ CSPP Users Group Meeting Open Software Standards for Next- Generation Community Satellite Software Packages June 2017 David Hogan

More information

The Determination of Telescope and Antenna Invariant Point (IVP)

The Determination of Telescope and Antenna Invariant Point (IVP) The Determination of Telescope and Antenna Invariant Point (IVP) John Dawson, Gary Johnston, and Bob Twilley Minerals and Geohazards Division, Geoscience Australia, Cnr Jerrabomberra Ave and Hindmarsh

More information

HDF- A Suitable Scientific Data Format for Satellite Data Products

HDF- A Suitable Scientific Data Format for Satellite Data Products HDF- A Suitable Scientific Data Format for Satellite Data Products Sk. Sazid Mahammad, Debajyoti Dhar and R. Ramakrishnan Data Products Software Division Space Applications Centre, ISRO, Ahmedabad 380

More information

Bruce Wright, John Ward, Malcolm Field, Met Office, United Kingdom

Bruce Wright, John Ward, Malcolm Field, Met Office, United Kingdom The Met Office s Logical Store Bruce Wright, John Ward, Malcolm Field, Met Office, United Kingdom Background are the lifeblood of the Met Office. However, over time, the organic, un-governed growth of

More information

Scientific and Multidimensional Raster Support in ArcGIS

Scientific and Multidimensional Raster Support in ArcGIS Scientific and Multidimensional Raster Support in ArcGIS Sudhir Raj Shrestha sshrestha@esri.com Brief breakdown Scientific Multidimensional data Ingesting Scientific MultiDim Data in ArcGIS Ingesting and

More information

Leveraging metadata standards in ArcGIS to support Interoperability. Aleta Vienneau and Marten Hogeweg

Leveraging metadata standards in ArcGIS to support Interoperability. Aleta Vienneau and Marten Hogeweg Leveraging metadata standards in ArcGIS to support Interoperability Aleta Vienneau and Marten Hogeweg Leveraging metadata standards in ArcGIS to support Interoperability Overview of metadata standards

More information

Web-enabled Physical Samples: Curating and Publishing Physical Samples in CSIRO

Web-enabled Physical Samples: Curating and Publishing Physical Samples in CSIRO Web-enabled Physical Samples: Curating and Publishing Physical Samples in CSIRO Anusuriya Devaraju Web-enabled Physical Samples: Curating and Publishing Physical Samples in CSIRO Anusuriya Devaraju, Jens

More information

Data Centre NetCDF Implementation Pilot

Data Centre NetCDF Implementation Pilot Data Centre NetCDF Implementation Pilot Peter Miu EUMETSAT User Conference Oslo, Sep. 2011 Splinter Session, Facilitating Data Access and Utilisation Slide: 1 EUM/OPS/VWG/11/2600 V.1 What is this Pilot

More information

Data discovery and access via the SeaDataNet CDI system

Data discovery and access via the SeaDataNet CDI system Data discovery and access via the SeaDataNet CDI system Central dataproducts and data services on distributed data. Peter Thijsse MARIS CLIPC IS-ENES workshop, KNMI, November 2014 Outline 1. Introduction

More information

Towards a pan-european infrastructure for marine and ocean data management + Importance of standards

Towards a pan-european infrastructure for marine and ocean data management + Importance of standards Towards a pan-european infrastructure for marine and ocean data management + Importance of standards By Dick M.A. Schaap Technical Coordinator SeaDataNet & Coordinator EMODnet Bathymetry Hydrography Day,

More information

CF-netCDF and CDM. Ethan Davis, John Caron, Ben Domenico, Stefano Nativi* UCAR Unidata Univ of Florence*

CF-netCDF and CDM. Ethan Davis, John Caron, Ben Domenico, Stefano Nativi* UCAR Unidata Univ of Florence* CF-netCDF and CDM Ethan Davis, John Caron, Ben Domenico, Stefano Nativi* UCAR Unidata Univ of Florence* OGC in MetOcean, Toulouse, France, November 2009 CF-netCDF and CDM CF-netCDF CDM/netCDF-java TDS

More information

Air Quality Community Experiences and Perspectives on International Interoperability Standards

Air Quality Community Experiences and Perspectives on International Interoperability Standards Air Quality Community Experiences and Perspectives on International Interoperability Standards Erin Robinson, Stefan Falke, Rudolf Husar, David McCabe, Frank Lindsay, Chris Lynnes, Greg Leptoukh, Beate

More information

Dataset-XML - A New CDISC Standard

Dataset-XML - A New CDISC Standard Dataset-XML - A New CDISC Standard Lex Jansen Principal Software Developer @ SAS CDISC XML Technologies Team Single Day Event CDISC Tools and Optimization September 29, 2014, Cary, NC Agenda Dataset-XML

More information

The Logical Data Store

The Logical Data Store Tenth ECMWF Workshop on Meteorological Operational Systems 14-18 November 2005, Reading The Logical Data Store Bruce Wright, John Ward & Malcolm Field Crown copyright 2005 Page 1 Contents The presentation

More information

WP4: Data Forum. Øystein Godøy, Boris Radosavljević, Boris Biskaborn, Anna Irrgang

WP4: Data Forum. Øystein Godøy, Boris Radosavljević, Boris Biskaborn, Anna Irrgang WP4: Data Forum Øystein Godøy, Boris Radosavljević, Boris Biskaborn, Anna Irrgang Motivation INTERACT research stations generate data and metadata Long term monitoring Short term process studies External

More information

Data Formats. for Data Science. Valerio Maggio Data Scientist and Researcher Fondazione Bruno Kessler (FBK) Trento, Italy.

Data Formats. for Data Science. Valerio Maggio Data Scientist and Researcher Fondazione Bruno Kessler (FBK) Trento, Italy. Data Formats for Data Science Valerio Maggio Data Scientist and Researcher Fondazione Bruno Kessler (FBK) Trento, Italy @leriomaggio About me kidding, that s me!-) Post Doc Researcher @ FBK Complex Data

More information

Reducing Consumer Uncertainty Towards a Vocabulary for User-centric Geospatial Metadata

Reducing Consumer Uncertainty Towards a Vocabulary for User-centric Geospatial Metadata Meeting Host Supporting Partner Meeting Sponsors Reducing Consumer Uncertainty Towards a Vocabulary for User-centric Geospatial Metadata 105th OGC Technical Committee Palmerston North, New Zealand Dr.

More information

NetCDF and HDF5. NASA Earth Science Data Systems Working Group October 20, 2010 New Orleans. Ed Hartnett, Unidata/UCAR, 2010

NetCDF and HDF5. NASA Earth Science Data Systems Working Group October 20, 2010 New Orleans. Ed Hartnett, Unidata/UCAR, 2010 NetCDF and HDF5 NASA Earth Science Data Systems Working Group October 20, 2010 New Orleans Ed Hartnett, Unidata/UCAR, 2010 Unidata Mission: To provide the data services, tools, and cyberinfrastructure

More information

Dataset Interoperability (formerly HDF5 Conventions) Working Group

Dataset Interoperability (formerly HDF5 Conventions) Working Group Dataset Interoperability (formerly HDF5 Conventions) Working Group Co-Chairs: Charlie Zender and Peter Leonard Ed Armstrong, Sean Bailey, Bill Emanual, Fan Feng, Doug Fowler, Ted Haberman, Beth Huffer,

More information

Oceanic Observatory for the Iberian Shelf

Oceanic Observatory for the Iberian Shelf Oceanic Observatory for the Iberian Shelf B.Vila Barcelona, 26th September 2016 Objectives: The Project Improve the oceanic observation at the North Western Iberian coast (meteorological, oceanographical

More information

HDF Product Designer Documentation

HDF Product Designer Documentation HDF Product Designer Documentation Release 1.6.0 The HDF Group Dec 01, 2017 Contents 1 Contents: 3 1.1 Getting Started.............................................. 3 1.2 Usage...................................................

More information

cdo Data Processing (and Production) Luis Kornblueh, Uwe Schulzweida, Deike Kleberg, Thomas Jahns, Irina Fast

cdo Data Processing (and Production) Luis Kornblueh, Uwe Schulzweida, Deike Kleberg, Thomas Jahns, Irina Fast cdo Data Processing (and Production) Luis Kornblueh, Uwe Schulzweida, Deike Kleberg, Thomas Jahns, Irina Fast Max-Planck-Institut für Meteorologie, DKRZ September 24, 2014 MAX-PLANCK-GESELLSCHAFT Data

More information

WM2015 Conference, March 15 19, 2015, Phoenix, Arizona, USA

WM2015 Conference, March 15 19, 2015, Phoenix, Arizona, USA OECD NEA Radioactive Waste Repository Metadata Management (RepMet) Initiative (2014-2018) 15614 Claudio Pescatore*, Alexander Carter** *OECD Nuclear Energy Agency 1 (claudio.pescatore@oecd.org) ** Radioactive

More information

Increasing dataset quality metadata presence: Quality focused metadata editor and catalogue queriables.

Increasing dataset quality metadata presence: Quality focused metadata editor and catalogue queriables. Increasing dataset quality metadata presence: Quality focused metadata editor and catalogue queriables. Alaitz Zabala (UAB), Joan Masó (CREAF), Lucy Bastin (ASTON), Fabrizio Papeschi (CNR), Eva Sevillano

More information

Index Introduction Setting up an account Searching and accessing Download Advanced features

Index Introduction Setting up an account Searching and accessing Download Advanced features ESGF Earth System Grid Federation Tutorial Index Introduction Setting up an account Searching and accessing Download Advanced features Index Introduction IT Challenges of Climate Change Research ESGF Introduction

More information

What is Gluent? The Gluent Data Platform

What is Gluent? The Gluent Data Platform What is Gluent? The Gluent Data Platform The Gluent Data Platform provides a transparent data virtualization layer between traditional databases and modern data storage platforms, such as Hadoop, in the

More information

Open Data Standards for Administrative Data Processing

Open Data Standards for Administrative Data Processing University of Pennsylvania ScholarlyCommons 2018 ADRF Network Research Conference Presentations ADRF Network Research Conference Presentations 11-2018 Open Data Standards for Administrative Data Processing

More information

DATA FORMATS FOR DATA SCIENCE Remastered

DATA FORMATS FOR DATA SCIENCE Remastered Budapest BI FORUM 2016 DATA FORMATS FOR DATA SCIENCE Remastered Valerio Maggio @leriomaggio Data Scientist and Researcher Fondazione Bruno Kessler (FBK) Trento, Italy WhoAmI Post Doc Researcher @ FBK Interested

More information

And now for something completely different

And now for something completely different And now for something completely different (data management?) HYCOM Data Management & Services Ashwanth Srinivasan (RSMAS) Steve Hankin (PMEL) A community of of contributors, including Peter Peter Cornillon,

More information

BENEFITS OF INTRA-VEHICLE DISTRIBUTED NETWORK ARCHITECTURE

BENEFITS OF INTRA-VEHICLE DISTRIBUTED NETWORK ARCHITECTURE 2011 NDIA GROUND VEHICLE SYSTEMS ENGINEERING AND TECHNOLOGY SYMPOSIUM VEHICLE ELECTRONICS AND ARCHITECTURE (VEA) MINI-SYMPOSIUM AUGUST 9-11 DEARBORN, MICHIGAN BENEFITS OF INTRA-VEHICLE DISTRIBUTED NETWORK

More information

The CEDA Web Processing Service for rapid deployment of earth system data services

The CEDA Web Processing Service for rapid deployment of earth system data services The CEDA Web Processing Service for rapid deployment of earth system data services Stephen Pascoe Ag Stephens Phil Kershaw Centre of Environmental Data Archival 1 1 Overview of CEDA-WPS History first implementation

More information

The NCAR Community Data Portal

The NCAR Community Data Portal The NCAR Community Data Portal http://cdp.ucar.edu/ QuickTime and a TIFF (Uncompressed) decompressor are needed to see this picture. QuickTime and a TIFF (Uncompressed) decompressor are needed to see this

More information

Pangeo. A community-driven effort for Big Data geoscience

Pangeo. A community-driven effort for Big Data geoscience Pangeo A community-driven effort for Big Data geoscience !2 What would you like to have and why? Pangeo s vision for scientific computing in the big-data era Pangeo s Website pangeo-data.org !3 Hello!

More information

The NCI High Performance Computing (HPC) and High Performance Data (HPD) Platform to Support the Analysis of Petascale Environmental Data Collections

The NCI High Performance Computing (HPC) and High Performance Data (HPD) Platform to Support the Analysis of Petascale Environmental Data Collections ESSI 2015-8273 The NCI High Performance Computing (HPC) and High Performance Data (HPD) Platform to Support the Analysis of Petascale Environmental Data Collections Ben Evans 1, Lesley Wyborn 1, Tim Pugh

More information

Compass INSPIRE Services. Compass INSPIRE Services. White Paper Compass Informatics Limited Block 8, Blackrock Business

Compass INSPIRE Services. Compass INSPIRE Services. White Paper Compass Informatics Limited Block 8, Blackrock Business Compass INSPIRE Services White Paper 2010 Compass INSPIRE Services Compass Informatics Limited Block 8, Blackrock Business Park, Carysfort Avenue, Blackrock, County Dublin, Ireland Contact Us: +353 1 2104580

More information

Towards a joint service catalogue for e-infrastructure services

Towards a joint service catalogue for e-infrastructure services Towards a joint service catalogue for e-infrastructure services Dr British Library 1 DI4R 2016 Workshop Joint service catalogue for research 29 September 2016 15/09/15 Goal A framework for creating a Catalogue

More information

HDF Product Designer Documentation

HDF Product Designer Documentation HDF Product Designer Documentation Release 1.5.0 The HDF Group May 31, 2017 Contents 1 Contents: 3 1.1 Getting Started.............................................. 3 1.2 Usage...................................................

More information

Research Data Repository Interoperability Primer

Research Data Repository Interoperability Primer Research Data Repository Interoperability Primer The Research Data Repository Interoperability Working Group will establish standards for interoperability between different research data repository platforms

More information

Geo Seas A pan European infrastructure for the management of marine geological and geophysical data

Geo Seas A pan European infrastructure for the management of marine geological and geophysical data Geo Seas A pan European infrastructure for the management of marine geological and geophysical data Colin Graham (BGS), Dick Schaap (MARIS), Paolo Diviacco (OGS) & Helen Glaves (BGS) Integrated Infrastructure

More information

(Towards) A metadata model for atmospheric data resources

(Towards) A metadata model for atmospheric data resources (Towards) A metadata model for atmospheric data resources Anne De Rudder and Jean-Christopher Lambert Belgian Institute for Space Aeronomy (IASB-BIRA), Brussels The context EU FP7 Ground-based atmospheric

More information

Cross-mission Analysis Through Space Physics Data Facility (SPDF) Services

Cross-mission Analysis Through Space Physics Data Facility (SPDF) Services Presented at the Fall 2012 AGU meeting Paper SM43A-2231 Cross-mission Analysis Through Space Physics Data Facility (SPDF) Services R.M. Candey, D. Bilitza, R. Chimiak, J.F. Cooper, L.N. Garcia, B. Harris,

More information