Gridded data from many sources

Similar documents
Fimex Introduction. Heiko Klein Meteorologisk institutt met.no

NetCDF-4: : Software Implementing an Enhanced Data Model for the Geosciences

WRF-NMM Standard Initialization (SI) Matthew Pyle 8 August 2006

cdo Data Processing (and Production) Luis Kornblueh, Uwe Schulzweida, Deike Kleberg, Thomas Jahns, Irina Fast

Python: Working with Multidimensional Scientific Data. Nawajish Noman Deng Ding

The EC Presenting a multi-terabyte dataset MWF via ER the web

COM INTRO 2017: GRIB Decoding - Solutions to practicals. Solution to Practical 1: using grib_dump and grib_ls

Unified Model Intercomparisons for Volcanic Ash Transport Modeling

COM INTRO 2016: GRIB Decoding - Solutions to practicals. Solution to Practical 1: using grib_dump and grib_ls

Observation feedback archiving in MARS. Acknowledgement:

Interpolation. Computer User Training Course Paul Dando. User Support. ECMWF 25 February 2016

The netcdf- 4 data model and format. Russ Rew, UCAR Unidata NetCDF Workshop 25 October 2012

HPC Performance Advances for Existing US Navy NWP Systems

Existing Solutions. Operating data services: Climate Explorer ECA&D climate4impact.eu data.knmi.nl

Meteorology and Python

IPSL Boot Camp Part 5:

NetCDF and Scientific Data Durability. Russ Rew, UCAR Unidata ESIP Federation Summer Meeting

You will create some icons yourself, but some are supplied for you. If you are at ECMWF then you can copy the icons from the command line like this:

A mass-conservative version of the semi- Lagrangian semi-implicit HIRLAM using Lagrangian vertical coordinates

Development of an information service system based on GOOGLE graphical interfaces. Instruction for the use of the MOON-VOS portal Interface

iris-grib Documentation

PyNGL & PyNIO Geoscience Visualization & Data IO Modules

MATLAB & Practical Application on Climate Variability Studies EXERCISES

Writing NetCDF Files: Formats, Models, Conventions, and Best Practices. Overview

Introduction to NetCDF

NetCDF and HDF5. NASA Earth Science Data Systems Working Group October 20, 2010 New Orleans. Ed Hartnett, Unidata/UCAR, 2010

VEGETATION Geometrical Image Quality

LaTiS! Data Access Service Architecture"

How Can We Use NetCDF Extractor V.2.0?

The Logical Data Store

Metview FLEXTRA Tutorial. Meteorological Visualisation Section Operations Department ECMWF

The most recent CMS-Wave code developed is Version 3.2. Several new capabilities and advanced features in this version include:

OGC at KNMI: Current use and plans

MIR. ECMWF s New Interpolation Package. P. Maciel, T. Quintino, B. Raoult, M. Fuentes, S. Villaume ECMWF. ECMWF March 9, 2016

Lab: Scientific Computing Tsunami-Simulation

GrADS for Beginners. Laura Mariotti

Data and information sharing WMO global systems

XIOS and I/O Where are we?

v STWAVE Analysis SMS 11.2 Tutorial Requirements Map Module STWAVE Cartesian Grid Module Scatter Module Prerequisites Time minutes

Uniform Resource Locator Wide Area Network World Climate Research Programme Coupled Model Intercomparison

Geographic Information System and its Application in Hydro-Meteorology Exercises using SavGIS

FMS: the Flexible Modeling System

RegCM-ROMS Tutorial: Introduction to ROMS Ocean Model

McIDAS-V Tutorial Using and Creating Formulas updated September 2013 (software version 1.4)

Unified Model Performance on the NEC SX-6

Online Trajectory Module in COSMO - A short user guide

Sea Level CCI project Phase II. System Engineering Status

Interpolation. Introduction and basic concepts. Computer User Training Course Paul Dando. User Support Section.

Update on NAWIPS/GEMPAK Migration to AWIPS II

Porting and Optimizing the COSMOS coupled model on Power6

fixnc Documentation Release Nikolay Koldunov

Ocean, Atmosphere & Climate Model Assessment for Everyone

ITACS : Interactive Tool for Analysis of the Climate System

Working with Scientific Data in ArcGIS Platform

v CMS-Wave Analysis SMS 12.2 Tutorial Prerequisites Requirements Time Objectives

Technical documentation. SIOS Data Management Plan

Introduction to INSPIRE. Network Services

Bruce Wright, John Ward, Malcolm Field, Met Office, United Kingdom

netcdf-ld SKOS: demonstrating Linked Data vocabulary use within netcdf-compliant files

The Jülich WCS interface for emissions, model products, and model evaluation

Adapting Software to NetCDF's Enhanced Data Model

B+ Tree Review. CSE332: Data Abstractions Lecture 10: More B Trees; Hashing. Can do a little better with insert. Adoption for insert

ECMWF point database: providing direct access to any model output grid-point values

Gridded Data Speedwell Derived Gridded Products

About the SPEEDY model (from Miyoshi PhD Thesis):

SciSpark 201. Searching for MCCs

Analysis Methods in Atmospheric and Oceanic Science

Implementing the RDA data citation recommendations in the distributed Infrastructure of the Virtual Atomic and Molecular Data Centre

Extreme I/O Scaling with HDF5

Linux Kernel Evolution. OpenAFS. Marc Dionne Edinburgh

16 th Annual CESM Workshop s Software Engineering Working Group. Parallel Analysis of GeOscience Data Status and Future

7C.2 EXPERIENCE WITH AN ENHANCED NETCDF DATA MODEL AND INTERFACE FOR SCIENTIFIC DATA ACCESS. Edward Hartnett*, and R. K. Rew UCAR, Boulder, CO

Data Management Plans Are they working for the Australian Antarctic program? Dave Connell Australian Antarctic Data Centre

3.Data Abstraction. Prof. Tulasi Prasad Sariki SCSE, VIT, Chennai 1 / 26

Scaling MongoDB. Percona Webinar - Wed October 18th 11:00 AM PDT Adamo Tonete MongoDB Senior Service Technical Service Engineer.

In this exercise, you ll create a netcdf raster layer using the variable tmin. You will change the display by selecting a different time step.

PGS hyperbeam - rapid scenario-testing of velocity models to optimize depth imaging

File Formats and Pre-Processing

Format specification for the SMET Weather Station Meteorological Data Format version 1.1

Two Phase Commit Protocol. Distributed Systems. Remote Procedure Calls (RPC) Network & Distributed Operating Systems. Network OS.

Metview Macro Language

Measuring Turbulence with Lidars

CS103 Handout 42 Spring 2017 May 31, 2017 Practice Final Exam 1

Surfaces. Science B44. Lecture 11 Surfaces. Surfaces. 1. What problem do surfaces solve? 2. How are surfaces discovered

Keeping Up With The Linux Kernel. Marc Dionne AFS and Kerberos Workshop Pittsburgh

A data model of the Climate and Forecast metadata conventions (CF-1.6) with a software implementation (cf-python v2.1)

Package cmsaf. August 6, 2018

OPeNDAP: Accessing HYCOM (and other data) remotely

Introduction of new WDCGG website. Seiji MIYAUCHI Meteorological Agency

04-Atmospheric_Data_Formats

Real-world sorting (not on exam)

HYSPLIT model description and operational set up for benchmark case study

11A.3 INVESTIGATION OF USING HDF5 ARCHIVAL INFORMATION PACKAGES (AIP) TO STORE NASA ECS DATA

Intro to CMIP, the WHOI CMIP5 community server, and planning for CMIP6

COSC 6374 Parallel Computation. Scientific Data Libraries. Edgar Gabriel Fall Motivation

JAXA Himawari Monitor Aerosol Products. JAXA Earth Observation Research Center (EORC) August 2018

Data Mining. Dr. Raed Ibraheem Hamed. University of Human Development, College of Science and Technology Department of Computer Science

CESM Projects Using ESMF and NUOPC Conventions

Ecography. Supplementary material

SeaSonde Radial Site Release 6 SeaDisplay Ver6 Application Guide Apr 23, 2009 Copyright CODAR Ocean Sensors, Ltd

Transcription:

Gridded data from many sources A data-user's perspective Heiko Klein 26.09.2014

Background MET used legacy format (felt) for gridded data since ~1980s -Index 2d fields -«unique» parameter table 2012 decided to use netcdf / CF for all gridded data internally -Grib still allowed as exchange format -Grib used in atmospheric models since autumn 2013: all new products are in netcdf since spring 2014: felt no longer needed in post-production 2 26.09.2014 Norwegian Meteorological Institute

Model-input: e.g. Regional ocean models Need surface fields, usually U10, V10, (T, P) Usually not 100% overlap between atmospheric regional model and ocean model merging of several atmospheric models required Model expects unified input Tasks: -Rename variable names (Uair, Vair, Tair, Pair) -Use unique units -Remove scaling-factors (in particular, U/V scaling factors change when changing projection loss of precicision) -Interpolate datasets / take care of land-sea mask -Join dataset => Heavy preprocessing required, independent of input-data 3 26.09.2014 Norwegian Meteorological Institute

Model-input: e.g. Atmos. Dispersion models Needs model-level parameters: U, V, T, Q, CLWC, P, + surface (u10, v10, precip.,...) Some derived parameters (w, deaccumulated precip.) Model-domain = atmospheric model-domain -Many domains (local, regional, global), but usually not changing too often (~ once a year) Emergency-models response-time < 10minutes Tasks: -Rename variable names -Precalculation of derived parameters (on demand) -Hard-coding of model-domain (Fortran) => Preprocessing should be simple (Fortran), otherwise storing of specialized dataset for emergency model? 4 26.09.2014 Norwegian Meteorological Institute

Model input: grib / netcdf Models which require heavy preprocessing of data usually don't care Emergency models need to read data as is, often can read many formats (grib and netcdf) -Netcdf-advantage due to index, but Coordinate-systems are tough to read in Fortran Usually semi hard-coded dimensions Rely upon stable product no changes to dimensions and attributes, only change data -Grib seems often simpler but is it really? 5 26.09.2014 Norwegian Meteorological Institute

Real world, bad example of grib atmopl09190000_003.grib latfirst latlast shortname datadate 40 0 r 20140919 0-5 r 20140919 40 0 t 20140919 0-5 t 20140919 haven't found a general reader which can handle it good enough Netcdf -> grib conversion simple Grib -> netcdf conversion can be impossible (What belongs together?) 6 26.09.2014 Norwegian Meteorological Institute

Visualization 7 26.09.2014 Maps, time-series, vert. cross-sections/sondes, wave-spectra Products must show up within 2s on a meteorological workstation -IO: ~50 fields / s Pre-defined units and legends based on standard_name or variable-name Simple derived parameters are calculated on demand (variable-name based) Tasks: -Rename variable-names to those expected for presentation (or define several presentations) -Index data (grib, not netcdf) -Startup-delay for time-series: Extract time-series for all stations: ~minutes

Chunking? Maps float air_temperature_2m(time, height1, y, x) ; air_temperature_2m:_chunksizes = 1, 1, 929, 719 ; Time-Series (not used at MET) float air_temperature_2m(time, height1, y, x) ; air_temperature_2m:_chunksizes = 96, 1, 1, 1 ; Maps (not used at MET) float wave_spectra(time, angle, freq, lon, lat) ; wave_spectra:_chunksizes = 1, 1, 1, 929, 719 ; Wave-Spectra float wave_spectra(time, angle, freq, lon, lat) ; wave_spectra:_chunksizes = 1, 36, 36, 1, 1 ; => chunking can increase performance, but chunking is also a «specialized» product 8 26.09.2014

standard_name Very useful as documentation Good (enough) for discovery metadata Good first guess to add visualization hints but needs often more tuning No way to add non-standard_names «metno_name» Often files with wrong standard_names Long standardization process (~3 weeks discussion, 3 weeks silence = eventual acceptance, 3-12 month until it appears on web-page) too long for people working in the field (I have a list of ~1500 standard_names which I don't dare to send in. I have local grib-extensions which no grib-reader can read.) 9 26.09.2014

standard_name Several variables in one file might have same standard_name (e.g. pressure-level/model-level/surface) Not precise enough often only correct together with dimensions / cell_method / bounds Usage of standard_name as variable name: -air_temperature_ml -air_temperature_6hour_max 10 26.09.2014

Visualization grib / netcdf Map-visualization works well, grib requires indexing -Usage of libraries to handle data-conversion and CF-coordinate system (C++: fimex,cdo; Java: netcdf-java) External tables (grib) / configuration (netcdf/ncml) needed in most cases Netcdf can be used for most specialized, preprocessed products, e.g. wave-spectra (chunking), pre-calculated crosssections -High-performance web-servers (http://yr.no) need still very special solutions (caching, parallelization, pre-calculations) 11 26.09.2014