Bruce Wright, John Ward, Malcolm Field, Met Office, United Kingdom

Similar documents
The Logical Data Store

Compass INSPIRE Services. Compass INSPIRE Services. White Paper Compass Informatics Limited Block 8, Blackrock Business

The European Commission s science and knowledge service. Joint Research Centre

cdo Data Processing (and Production) Luis Kornblueh, Uwe Schulzweida, Deike Kleberg, Thomas Jahns, Irina Fast

GeoDCAT-AP Representing geographic metadata by using the "DCAT application profile for data portals in Europe"

The CEDA Archive: Data, Services and Infrastructure

Understanding users workflows

INSPIRE & Environment Data in the EU

Uniform Resource Locator Wide Area Network World Climate Research Programme Coupled Model Intercomparison

Deliverable D3.12. Contract number: OJEU 2010/S Deliverable: D3.12 Author: Igor Antolovic Date: Version: Final

TIGGE and the EU Funded BRIDGE project

GML Application Schema for Meteorological Objects

Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan

Quality - The Key to Successful SOA. Charitha Kankanamge WSO2 February 2011

The GeoPortal Cookbook Tutorial

Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation

Exadata Database Machine: 12c Administration Workshop Ed 2

Exadata Database Machine: 12c Administration Workshop Ed 2 Duration: 5 Days

Factsheet of Public Services Infrastructure (PSi) Updated on: 1st Sep 03

Anne Fouilloux. Fig. 1 Use of observational data at ECMWF since CMA file structure.

INSPIRE: The ESRI Vision. Tina Hahn, GIS Consultant, ESRI(UK) Miguel Paredes, GIS Consultant, ESRI(UK)

Exdata Database Machine: 12c Administration Workshop Ed 2

Implementation and Use of OGC/HMA/WMO/ISO & Inspire Standards in EUMETSAT EO Portal

GENeric European Sustainable Information Space for Environment.

The Value of Metadata

Leveraging OGC Services in ArcGIS Server. Satish Sankaran, Esri Yingqi Tang, Esri

PRODUCT BROCHURE ERDAS APOLLO MANAGING AND SERVING GEOSPATIAL INFORMATION

NOAA NextGen IT/Web Services (NGITWS)

ERDAS APOLLO Managing and Serving Geospatial Information

Carmenta Server Product Description

International Civil Aviation Organization THE SECOND MEETING OF SYSTEM WIDE INFORMATION MANAGEMENT TASK FORCE (SWIM TF/2)

C3S Data Portal: Setting the scene

EMC Documentum xdb. High-performance native XML database optimized for storing and querying large volumes of XML content

Novel System Architectures for Semantic Based Sensor Networks Integraion

Introduction to SDIs (Spatial Data Infrastructure)

Jeffery S. Horsburgh. Utah Water Research Laboratory Utah State University

Technical documentation. SIOS Data Management Plan

Exadata Database Machine: 12c Administration Workshop Ed 2

Oracle Fusion Middleware

Oracle - Exadata Database Machine: 12c

Jim Mains Director of Business Strategy and Media Services Media Solutions Group, EMC Corporation

GEO-SPATIAL METADATA SERVICES ISRO S INITIATIVE

FDO Data Access Technology at a Glance

IMS CLDB and EnviDB. Universal & Reliable Climate Database Management System. IMS CLDB and EnviDB Climatological and Integrated Environmental Database

An overview of the OAIS and Representation Information

ORACLE DATABASE LIFECYCLE MANAGEMENT PACK

XML Applications. Introduction Jaana Holvikivi 1

(9A05803) WEB SERVICES (ELECTIVE - III)

The EU-funded BRIDGE project

Implementing a Data Quality Strategy to simplify access to data

eresearch Collaboration across the Pacific:

Extending SOA Infrastructure for Semantic Interoperability

A service oriented approach for geographical data sharing

Oracle Spatial Technologies: An Update. Xavier Lopez Director, Spatial Technologies Oracle Corporation

Exadata Database Machine: 12c Administration Workshop Ed 1

International Oceanographic Data and Information Exchange - Ocean Data Portal (IODE ODP)

May 2012 Oracle Spatial User Conference

SEXTANT 1. Purpose of the Application

Copernicus Space Component. Technical Collaborative Arrangement. between ESA. and. Enterprise Estonia

- C3Grid Stephan Kindermann, DKRZ. Martina Stockhause, MPI-M C3-Team

DT-ICT : Big data solutions for energy

The VITO Earth Observation LTDA Facility

SERVO - ACES Abstract

Laserfiche Product Suite 2011

Robin Wilson Director. Digital Identifiers Metadata Services

Version 3 Updated: 10 March Distributed Oceanographic Match-up Service (DOMS) User Interface Design

Standards, GML and AIXM. Dr. David Burggraf Vice President Galdos Systems Inc

NOTICE. This document is only one section of a larger document. All sections together collectively form the NNEW Documentation.

Land Administration and Management: Big Data, Fast Data, Semantics, Graph Databases, Security, Collaboration, Open Source, Shareable Information

Python: Working with Multidimensional Scientific Data. Nawajish Noman Deng Ding

IBM Spectrum Protect Version Introduction to Data Protection Solutions IBM

AWIPS Technology Infusion Darien Davis NOAA/OAR Forecast Systems Laboratory Systems Development Division April 12, 2005

Implementing the Army Net Centric Data Strategy in a Service Oriented Environment

Project Proposal: OSLC4MBSE - OMG SE and OSLC working group as part of the OMG SE DSIG. OSLC for Model-Based Systems Engineering Interoperability

Using WIS to support WIGOS Steve Foreman WIS Branch

Implementing a Data Quality Strategy to simplify access to data

ASTROWEB ASTROINFORMATICS PROJECT AND COMPARISON OF THE WEB-GIS PROTOCOL STANDARDS

USING ADMINISTRATOR FEATURES

Oracle Fusion Middleware

THE ENVIRONMENTAL OBSERVATION WEB AND ITS SERVICE APPLICATIONS WITHIN THE FUTURE INTERNET Project introduction and technical foundations (I)

The TIDB2 Meteo Experience

Development of Java Plug-In for Geoserver to Read GeoRaster Data. 1. Baskar Dhanapal CoreLogic Global Services Private Limited, Bangalore

Easily Managing Hybrid IT with Transformation Technology

How Insurers are Realising the Promise of Big Data

Data Partnerships to Improve Health Frequently Asked Questions. Glossary...9

DataView Suite. The Face to your Test Data. Call: » L5, 14 Martin Place, Sydney, NSW,

IBM Tivoli Storage Manager Version Introduction to Data Protection Solutions IBM

April Oracle Spatial User Conference

First experiences of using WC(P)S at ECMWF

The UK Marine Environmental Data and Information Network MEDIN

The OAIS Reference Model: current implementations

Regarding the quality attributes, the architecture of the system must be:

Integrated Aeronautical Information database

DRS Policy Guide. Management of DRS operations is the responsibility of staff in Library Technology Services (LTS).

ARCHITECTURE OF SPATIAL DATA INFRASTRUCTURE (SDI) (DRAFT)

ArcGIS 9.2 Works as a Complete System

The Common Framework for Earth Observation Data. US Group on Earth Observations Data Management Working Group

IBM. IBM i2 Enterprise Insight Analysis Understanding the Deployment Patterns. Version 2 Release 1 BA

Global Cryosphere Watch. GCW web portal Structure and demonstration

Fusion Registry 9 SDMX Data and Metadata Management System

Transcription:

The Met Office s Logical Store Bruce Wright, John Ward, Malcolm Field, Met Office, United Kingdom Background are the lifeblood of the Met Office. However, over time, the organic, un-governed growth of data management solutions has led complex, incoherent and (often) undocumented IT systems. This information silo approach has resulted in a number of organisational issues: Inconsistent, locally processed data; Proliferation of data & data access mechanisms; Poor access to enterprise information assets. The business drivers for change are: Cost efficiency; Improved agility in developing new solutions; Improved consistency of information. The concept of a Logical Store (LDS) is the means by which these issues will be addressed. Logical Store concept The concept of the Logical Store (LDS) is that of a single, logical repository for all core (shared) enterprise meteorological information, delivering information that is: Consistent; Uniquely identifiable; Spatially-enabled (facilitating spatial manipulation and querying); Accessible through a set of common interfaces Managed in a standard way The LDS forms a key part of the future information architecture (figure 1), ingesting data from and serving data to all the other components. Products Observations Collection & Processing NWP Supplementary Models Product Generation Dissemination Logical Store Corporate Functions Fig. 1 High-level Information Architecture The LDS is composed of a small number of high-level components which can be characterised by their interaction with each other and other external entities (figure 2). Key logical components: Information Lifecycle administers the data though it s lifecycle, from ingestion to deletion, on behalf of the data owner; Discovery Portal provides the facility to search for information; Public Interface provides the access to the information in a standard way; 77

Shared Model provides a set standard solutions for the internal management of the data; Catalogue Records hold metadata, both static and a limited amount harvested from the data. The latter two are only exposed through the first three interfaces and can be regarded as under the covers. Server-side applications consoles Client applications API APIs Web browsers consoles Clients Private data APIs Information Lifecycle Public interface Discovery Portal Infrastructure Services Private data APIs Private data APIs Security Applicationspecific data model Shared Model Catalogue Records LDS Boundary Models Storage Service Physical Infrastructure Infrastructure Fig. 2 LDS component diagram LDS Public Interface The Public Interface is the key to the LDS concept 1 ; it should: Hide the complexity of: bases & archives; Formats & codes; Interfaces to different data types. De-couple the client application from the data store. Provide: A single way to access all data in the LDS; Using a standard request (metadata). As such it should provide controlled, secure read access to the LDS, through a set of well-defined, common interfaces, using a range of selection criteria, and delivering both data and products (and metadata). The capability of getting a resource can be expressed in terms of: Select set (a set of data, rather than a physical file (e.g. an NWP model run, all the UK climate observations); Subset: Geospatial extent (e.g. latitude-longitude box); Temporal extent (e.g. validity time); Parameter / attribute (e.g. screen temperature) Domain restriction (e.g. temperature > 28degC) 1 Note: In the development of the LDS, it is also the intention to: Rationalise and consolidate data stores; Take advantage of new data management technologies. 78

Process: Transform (i.e. isomorphic (reversible) process, such as temperature conversion from Kelvin to degrees Celsius); Derive (i.e. non-isomorphic (irreversible) process, such as computation of dewpoint temperature from temperature, humidity and pressure); Re-project (i.e. computation of the co-ordinates of the data points of features in a new projection given the co-ordinates in an initial projection - re-projection of raster (gridded) data usually also implies a regridding) Interpolate/re-grid; Filter (i.e. sub-sample data to, for example, reduce the resolution by a factor). Format (Change the data into a particular format different from that of the underlying data, this includes the transformation of the data to specific precision, number format and packing (or compression), the combination of the appropriate data and the derivation of the required metadata. There should be a restricted range of forms for the returned data, including: PP/Fieldsfile, netcdf, GRIB, BUFR,, CSV) Return Information. The solution will need to support access from a range of clients, including a web client, Java, C and FORTRAN. The Public Interface will make use of web services, as they: Use an HTTP Transport for messages (like web pages), providing: A high level interoperability; Clear and simple client-server interaction. Use as a standard form for the request and response, facilitating: Self-describing data; Implementation of metadata standards; Use a standard schema (e.g. GML). In particular, it is intended to use OGC 2 -compliant web services, such as the Web Feature Service (WFS), which characterises the data as a set of (real world) features and make use of Geography Markup Language (GML). The possible risks are the poor performance (especially for voluminous data), and difficulty in getting the new technology to work. Work in already completed or in progress Some work has already taken place on: Consistent use of Oracle RDBMS to hold a range of data types; Standard Java interfaces to the database; Web Services with for data exchange; Draft Met Office Metadata Standard building on the ISO191xx, WMO standards and CF Convention; Standard components for deriving best climatological observation values from our archive. Currently, various other initiatives are underway: 1. An operational lightning location database (figure 3 shows the demonstrator), which will: Store direct to Oracle base (data and products); Provide Web Services interface (initially using bespoke, but probably ultimately as a Web Feature Service). 2 Open Geospatial Consortium; see: http://www.opengeospatial.org/ 79

XSLT / jython BPEL XPath Oracle 10g base XSQL & XMDB SQL base Component Build request Specify message traverse path Route message based on content Web Service component HTTP Request Client System MetDB BUFR Files Call MDB() MetDB Component Transform to expected response type Normalised Message router Response XSLT / jython Fig. 3 Lightning location Web Service demonstrator 2. Investigations into the use of the Oracle 10g base cluster functionality, with a 3-node Dell cluster using Oracle RAC (Real Application Cluster) (figure 4). Corporate Ethernet LAN backbone (CDN) Cluster interconnect switches Gigabit links to other systems (e.g. data sources) VMWare ESX Application/Ingestion Server (Virtual Servers created as required) Cluster interconnect switches Oracle Server Nodes Storage Array Fig. 4 Hardware architecture for the Oracle database cluster 3. A proof of concept exercise looking at the use of an Oracle RDBMS the data management and provision for the NWP process, will investigate: Extraction of observations directly to supercomputer from database; Storing NWP forecast output direct from supercomputer into database; Providing application data access using current (FORTRAN-based) APIs. 80

Approaches to be adopted The Logical Store will not be implemented using a Big Bang approach, as this is very high risk, requires a huge amount of effort and it would take along time before the benefit was achieved. Instead, the approach will be Piecemeal, focussing on specific data types to: Prove the approach (technical solution); Demonstrate end-to-end capability; Address existing problems; Provide quick wins. However, the work will form a part of a longer term plan. Use will also be made of two other approaches: Wrappering to interface to existing data stores (for the present): Where migration costs are high (e.g. the archive); To provide simple migration paths to use LDS. Proxy interfaces, i.e. provide access to the LDS via existing (legacy) interfaces for widely used interfaces: To allow partial/gradual data migration; To allow gradual application migration. Other parallel activities These are a number of external activities that will influence the development of the Logical Store; two are: SIMDAT (EU co-funded project to promote use of GRID technology), which will: Involve collaboration with ECMWF, DWD, MeteoFrance & EUMETSAT to deliver a meteorological scenario for the Future Weather Information System; Be developing a catalogue for managing distributed data. DEWS Demonstrating Environmental Web Services (a DTI co-funded collaborative project): Using leading edge technology in real scenarios for health & marine; With academic (BADC, ESSC) & commercial (Lost Wax, BMT, Intel) input. Contact Information For further information, please contact either: Bruce Wright (bruce.wright@metoffice.gov.uk); or: John Ward (john.c.ward@metoffice.gov.uk). 81