The Logical Data Store

Similar documents
Bruce Wright, John Ward, Malcolm Field, Met Office, United Kingdom

Uniform Resource Locator Wide Area Network World Climate Research Programme Coupled Model Intercomparison

cdo Data Processing (and Production) Luis Kornblueh, Uwe Schulzweida, Deike Kleberg, Thomas Jahns, Irina Fast

TIGGE and the EU Funded BRIDGE project

The CEDA Archive: Data, Services and Infrastructure

The TIDB2 Meteo Experience

GeoDCAT-AP Representing geographic metadata by using the "DCAT application profile for data portals in Europe"

INSPIRE & Environment Data in the EU

Deliverable D3.12. Contract number: OJEU 2010/S Deliverable: D3.12 Author: Igor Antolovic Date: Version: Final

Compass INSPIRE Services. Compass INSPIRE Services. White Paper Compass Informatics Limited Block 8, Blackrock Business

The European Commission s science and knowledge service. Joint Research Centre

Implementation and Use of OGC/HMA/WMO/ISO & Inspire Standards in EUMETSAT EO Portal

Using WIS to support WIGOS Steve Foreman WIS Branch

IMS CLDB and EnviDB. Universal & Reliable Climate Database Management System. IMS CLDB and EnviDB Climatological and Integrated Environmental Database

XML Applications. Introduction Jaana Holvikivi 1

AWIPS Technology Infusion Darien Davis NOAA/OAR Forecast Systems Laboratory Systems Development Division April 12, 2005

Some Big Data Challenges

Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan

Overview of Open Services for Lifecycle Collaboration (OSLC)

The MEG Metadata Schemas Registry Schemas and Ontologies: building a Semantic Infrastructure for GRIDs and digital libraries Edinburgh, 16 May 2003

GML Application Schema for Meteorological Objects

Understanding users workflows

Introduction to SDIs (Spatial Data Infrastructure)

Jeffery S. Horsburgh. Utah Water Research Laboratory Utah State University

Enterprise Geographic Information Servers. Dr David Maguire Director of Products Kevin Daugherty ESRI

EMC Documentum xdb. High-performance native XML database optimized for storing and querying large volumes of XML content

Data Centre NetCDF Implementation Pilot

Copernicus Space Component. Technical Collaborative Arrangement. between ESA. and. Enterprise Estonia

Distributed Data Management with Storage Resource Broker in the UK

The EU-funded BRIDGE project

Oracle 10g and IPv6 IPv6 Summit 11 December 2003

Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation

A ONE-STOP SERVICE HUB INTEGRATING ESSENTIAL WEATHER AND GEOPHYSICAL INFORMATION ON A GIS PLATFORM. Hong Kong Observatory

IBM Spectrum Protect Version Introduction to Data Protection Solutions IBM

Exadata Database Machine: 12c Administration Workshop Ed 2

implement INSPIRE: "Concrete steps to synergies between the public and the private sector"

C3S Data Portal: Setting the scene

Exadata Database Machine: 12c Administration Workshop Ed 2 Duration: 5 Days

Implementing a Data Quality Strategy to simplify access to data

IBM Tivoli Storage Manager Version Introduction to Data Protection Solutions IBM

International Oceanographic Data and Information Exchange - Ocean Data Portal (IODE ODP)

OGC at KNMI: Current use and plans

eresearch Collaboration across the Pacific:

ArcGIS 9.2 Works as a Complete System

GENeric European Sustainable Information Space for Environment.


Anne Fouilloux. Fig. 1 Use of observational data at ECMWF since CMA file structure.

Exdata Database Machine: 12c Administration Workshop Ed 2

Achieving the digital thread through PLM and ALM integration using OSLC

<Insert Picture Here> Enterprise Data Management using Grid Technology

Current Progress of Grid Project in KMA

NOAA NextGen IT/Web Services (NGITWS)

Extension of INSPIRE Download Services TG for Observation Data

Driving Interoperability with CMIS

The challenges of the ECMWF graphics packages

Yajing (Phillis)Tang. Walt Wells

The GeoPortal Cookbook Tutorial

Elements of sustained data management solutions for climate

IT Infrastructure for BIM and GIS 3D Data, Semantics, and Workflows

SEXTANT 1. Purpose of the Application

INSPIRE: The ESRI Vision. Tina Hahn, GIS Consultant, ESRI(UK) Miguel Paredes, GIS Consultant, ESRI(UK)

Distributed Online Data Access and Analysis

A data-driven framework for archiving and exploring social media data

International Civil Aviation Organization THE SECOND MEETING OF SYSTEM WIDE INFORMATION MANAGEMENT TASK FORCE (SWIM TF/2)

Vlad Vinogradsky

Desarrollo de una herramienta de visualización de datos oceanográficos: Modelos y Observaciones

Jim Mains Director of Business Strategy and Media Services Media Solutions Group, EMC Corporation

Data Management Glossary

Robin Wilson Director. Digital Identifiers Metadata Services

Hydrographic Services and Standards Committee. Dr Vasily Smolyanitsky, JCOMM ETSI chair

National and Regional FDR Data Management in Southeast Asia

Striving for efficiency

Integration of INSPIRE & SDMX data infrastructures for the 2021 population and housing census

CMIS An Industry Effort to Define a Service-Based Interoperability Standard for Content Management

Online intercomparison of models and observations using OGC and community standards

Technical documentation. SIOS Data Management Plan

Introduction to INSPIRE. Network Services

Workshop 4.4: Lessons Learned and Best Practices from GI-SDI Projects II

Hospital System Lowers IT Costs After Epic Migration Flatirons Digital Innovations, Inc. All rights reserved.

Technical Capabilities of WMO and its Partners in Support of GFCS Implementation: the Climate Services Information System

Land Administration and Management: Big Data, Fast Data, Semantics, Graph Databases, Security, Collaboration, Open Source, Shareable Information

April Oracle Spatial User Conference

Implementing a Data Quality Strategy to simplify access to data

C3Grid ISO Metadata Profile

S-100 Product Specification Roll Out Implementation Plan. Introduction

Oracle Spatial Technologies: An Update. Xavier Lopez Director, Spatial Technologies Oracle Corporation

WP4: Data Forum. Øystein Godøy, Boris Radosavljević, Boris Biskaborn, Anna Irrgang

Data Sheet: Archiving Altiris Server Management Suite 7.0 Essential server management: Discover, provision, manage, and monitor

Factsheet of Public Services Infrastructure (PSi) Updated on: 1st Sep 03

Novel System Architectures for Semantic Based Sensor Networks Integraion

Project Proposal: OSLC4MBSE - OMG SE and OSLC working group as part of the OMG SE DSIG. OSLC for Model-Based Systems Engineering Interoperability

The EC Presenting a multi-terabyte dataset MWF via ER the web

Sviluppi e priorità europee nel settore delle smart grids. M. de Nigris

Extending SOA Infrastructure for Semantic Interoperability

Observations and Measurements as a basis for semantic reconciliation between GRIB and netcdf... and some other ideas.

INSPIRE in a nutshell, and overview of the European Union Location Framework

High Performance Computing Data Management. Philippe Trautmann BDM High Performance Computing Global Research

WCS2.1 Met-Ocean. 16 th Workshop on Operational Systems. Crown copyright Met Office

Oracle - Exadata Database Machine: 12c

GEO-SPATIAL METADATA SERVICES ISRO S INITIATIVE

Transcription:

Tenth ECMWF Workshop on Meteorological Operational Systems 14-18 November 2005, Reading The Logical Data Store Bruce Wright, John Ward & Malcolm Field Crown copyright 2005 Page 1

Contents The presentation covers the following sections Background Logical Data Store (LDS) LDS Public Interface Work in already completed or in progress Implementation approach & parallel activities Questions and answers Crown copyright 2005 Page 2

Current situation and the way forward Organic un-governed growth has lead to: complex, incoherent, undocumented IT systems striking resemblance to spaghetti and meatballs The information silo results in: inconsistent, locally processed data proliferation of data & data access mechanisms poor access to enterprise information assets Crown copyright 2005 Page 3

Current data flows Crown copyright 2005 Page 4

Current situation and the way forward Organic un-governed growth has lead to: complex, incoherent, undocumented IT systems striking resemblance to spaghetti and meatballs The information silo results in: inconsistent, locally processed data proliferation of data & data access mechanisms poor access to enterprise information assets The business drivers for change are: cost efficiency improved agility improved consistency The LDS (Logical Data Store) is the means by which these issues will be addressed Crown copyright 2005 Page 5

Logical Data Store concept Vision A single, logical repository for all core (shared) enterprise meteorological datasets and products Aims Consistent meteorological data Uniquely identifiable Spatially-enabled (facilitating spatial manipulation & querying) Accessible through a set of common interfaces Managed in a standard way Crown copyright 2005 Page 6

Information architecture Data Products Observations Collection & Processing NWP Best Forecast Selection Supplementary Models Product Generation Dissemination Logical Data Store Crown copyright 2005 Page 7

Logical Data Store Server-side Server-side Server-side applications applications applications Management consoles Client applications Client Client applications applications API API API Web browsers Web Web browsers browsers Management consoles Clients Private data Private data Private APIs data APIs APIs Data Lifecycle Management Private data APIs Public interface Private data APIs Data Discovery Portal Infrastructure Management Security LDS Services Applicationspecific data model Shared Data Model Catalogue Records Infrastructure Infrastructure Infrastructure Infrastructure Crown copyright 2005 Page 8

Key to Logical Data Store concept The Public Interface Hides the complexity of: Databases & archives Formats & codes Interfaces to different data types De-couples the client application from the data store Provides: A single way to access all data in the LDS Using a standard request (metadata) Note: In the development of the LDS, we also intend to: Rationalise and consolidate data stores Take advantage of new data management technologies Crown copyright 2005 Page 9

LDS Public Interface Client applications Client Client applications applications API API API Provision of data and products This refers to a set of data and not a physical file (e.g. an NWP model run, all the UK climate observations) Public interface Private data APIs Security Shared Data Model Infrastructure Get resource Dataset Subset Return Process Format Return Standard form Support for range of clients: Web browser, Java, C, Fortran, Open GIS client Geospatial extent (e.g. lat-long box) Temporal extent (e.g. validity time) Parameter / attribute (e.g. Temp.) Domain restriction (e.g. T > 28C) Transform (e.g. units) Derive (e.g. T d from T, q, p) Re-project (e.g. lat-long to National Grid) Interpolate (re-grid) Filter (sub-sample) Change from Standard to: PP/Fieldsfile, NetCDF, GRIB, BUFR, CSV Crown copyright 2005 Page 10

Proposed solution for LDS Public Interface Web Services Use an HTTP Transport for messages (like web pages) Highly interoperable Clear and simple client-server interaction Use XML as a standard form for the request and response Self-describing data Implement metadata standard Can use a standard schema (e.g. GML) Possible Issues: Performance? (esp. for voluminous data) Will it work? (new technology risk) Crown copyright 2005 Page 11

Development already completed Consistent use of Oracle RDBMS to hold a range of data types Standard Java interfaces Web Services with XML for data exchange Draft Met Office Metadata Standard building on the ISO191xx, WMO standards and CF Convention Standard components for deriving best climatological observation values from our archive Crown copyright 2005 Page 12

Investigations underway Lightning location database operational demonstrator Store direct to Oracle Database (data and products) Provide Web Services interface (probably as Web Feature Service) Crown copyright 2005 Page 13

Lightning location Web Service demonstrator Oracle 10g Database XSLT / jython BPEL XPath XSQL & XMDB XML Data SQL Database Binding Component Build request Specify message traverse path Route message based on content Web Service Binding component HTTP Request Client System MetDB MetDB Binding Component XML Transform to expected response type XML XML Response BUFR Files MDB() Normalised Message router XSLT / jython Crown copyright 2005 Page 14

Investigations underway Lightning location database operational demonstrator Store direct to Oracle Database (data and products) Provide Web Services interface (probably as Web Feature Service) NWP use of Oracle RDBMS proof of concept Pull observations directly to supercomputer from database Store forecast output direct from supercomputer into database Provide data access using current (Fortran-based) APIs Crown copyright 2005 Page 15

Investigations underway Lightning location database operational demonstrator Store direct to Oracle Database (data and products) Provide Web Services interface (probably as Web Feature Service) NWP use of Oracle RDBMS proof of concept Pull observations directly to supercomputer from database Store forecast output direct from supercomputer into database Provide data access using current (Fortran-based) APIs Oracle 10g Database cluster functionality investigations 3-node Dell cluster using Oracle RAC (Real Application Cluster) Crown copyright 2005 Page 16

Proposed hardware architecture Corporate Ethernet LAN backbone (CDN) Cluster interconnect switches Gigabit links to other systems (e.g. data sources) VMWare ESX Application/Ingestion Server (Virtual Servers created as required) Cluster interconnect switches Oracle Database Server Nodes Storage Array Crown copyright 2005 Page 17

Approaches to be adopted X NOT Big Bang Piecemeal Very high risk Huge amount of effort Long time before getting any benefit Focus of specific data types, to: Prove the approach (technical solution) Demonstrate end-to-end capability Address existing problems Provide quick wins But, as part of a long term Roadmap Wrapping Interface to existing data stores (for the present): Where migration costs are high (e.g. Archive) To provide simple migration paths to use LDS FFREAD Proxy LDS Provide temporary proxy interfaces For those widely used To allow partial/gradual data migration To allow gradual application migration Crown copyright 2005 Page 18

Other parallel activities SIMDAT EU co-funded project to promote the use of GRID technology Developing catalogue for managing distributed data Collaboration with ECMWF, DWD, MeteoFrance & EUMETSAT to deliver a meteorological scenario for the Future Weather Information System DEWS Developing Environmental Web Services DTI co-funded collaborative project Using leading edge technology in real scenarios: health & marine Academic (BADC, ESSC) & commercial (Lost Wax, BMT, IBM) input Crown copyright 2005 Page 19

Questions & Answers Email: bruce.wright@metoffice.gov.uk Crown copyright 2005 Page 20