The Earth System Grid: A Visualisation Solution. Gary Strand

Similar documents
The Earth System Grid Discovery and Semantic Web Technologies

The Earth System Grid: Supporting the Next Generation of Climate Modeling Research

The Virtual Observatory

Monitoring the Earth System Grid with MDS4

Ontologies and The Earth System Grid

Building a Global Data Federation for Climate Change Science The Earth System Grid (ESG) and International Partners

The NCAR Community Data Portal

The Virtual Solar-Terrestrial Observatory ++

Introduction to Grid Computing

Data Management Components for a Research Data Archive

By Ian Foster. Zhifeng Yun

Chapter 4:- Introduction to Grid and its Evolution. Prepared By:- NITIN PANDYA Assistant Professor SVBIT.

SciDAC's Earth System Grid Center for Enabling Technologies Semiannual Progress Report October 1, 2010 through March 31, 2011

Climate Data Management using Globus

A Distributed Media Service System Based on Globus Data-Management Technologies1

The Community Data Portal and the WMO WIS

Knowledge-based Grids

THE GLOBUS PROJECT. White Paper. GridFTP. Universal Data Transfer for the Grid

A Simple Mass Storage System for the SRB Data Grid

Lawrence Berkeley National Laboratory Lawrence Berkeley National Laboratory

Transitioning NCAR MSS to HPSS

Grid Technologies & Applications: Architecture & Achievements

LIGO Virtual Data. Realizing. UWM: Bruce Allen, Scott Koranda. Caltech: Kent Blackburn, Phil Ehrens, Albert. Lazzarini, Roy Williams

SDS: A Scalable Data Services System in Data Grid

Uniform Resource Locator Wide Area Network World Climate Research Programme Coupled Model Intercomparison

Grid Data Management in Action: Experience in Running and Supporting Data Management Services in the EU DataGrid Project

The EC Presenting a multi-terabyte dataset MWF via ER the web

Index Introduction Setting up an account Searching and accessing Download Advanced features

Introduction to The Storage Resource Broker

Zhengyang Liu University of Virginia. Oct 29, 2012

Web Enabled Collaborative Climate Visualization in the Earth System Grid

ESGF IdEA: Iden-ty, En-tlement and Access Management

ALICE Grid Activities in US

Engagement With Scientific Facilities

Lawrence Berkeley National Laboratory Recent Work

Pegasus Workflow Management System. Gideon Juve. USC Informa3on Sciences Ins3tute

Data Grid Services: The Storage Resource Broker. Andrew A. Chien CSE 225, Spring 2004 May 26, Administrivia

Database Assessment for PDMS

The Materials Data Facility

The Grid Architecture

Astrophysics and the Grid: Experience with EGEE

Introduction to Grid Computing

Grid-BGC: A Grid-Enabled Terrestrial Carbon Cycle Modeling System

Data Access and Analysis with Distributed, Federated Data Servers in climateprediction.net

A Replica Location Grid Service Implementation

The NOAA Operational Model Archive and Distribution System (NOMADS)

Grid Programming: Concepts and Challenges. Michael Rokitka CSE510B 10/2007

NCAR Globally Accessible Data Environment (GLADE) Updated: 15 Feb 2017

HEP Grid Activities in China

Cloud Computing. Up until now

SC17 - Overview

and the GridKa mass storage system Jos van Wezel / GridKa

Coordinating Parallel HSM in Object-based Cluster Filesystems

Kepler Scientific Workflow and Climate Modeling

Grid services. Enabling Grids for E-sciencE. Dusan Vudragovic Scientific Computing Laboratory Institute of Physics Belgrade, Serbia

glite Grid Services Overview

GPFS Experiences from the Argonne Leadership Computing Facility (ALCF) William (Bill) E. Allcock ALCF Director of Operations

A Metadata Catalog Service for Data Intensive Applications

Based on: Grid Intro and Fundamentals Review Talk by Gabrielle Allen Talk by Laura Bright / Bill Howe

Introduction to FREE National Resources for Scientific Computing. Dana Brunson. Jeff Pummill

Web Services for Visualization

DIRAC data management: consistency, integrity and coherence of data

Combining Virtual Organization and Local Policies for Automated Configuration of Grid Services

Globus GTK and Grid Services

Indiana University s Lustre WAN: The TeraGrid and Beyond

Implementation of Geospatial Product Virtualization in Grid Environment

The National Fusion Collaboratory

Metadata Models for Experimental Science Data Management

The Integration of Grid Technology with OGC Web Services (OWS) in NWGISS for NASA EOS Data

- C3Grid Stephan Kindermann, DKRZ. Martina Stockhause, MPI-M C3-Team

Diagnostics and Exploratory Analysis Infrastructure for ACME Workflow

Globus Platform Services for Data Publication. Greg Nawrocki University of Chicago & Argonne National Lab GeoDaRRS August 7, 2018

Grid Computing. MCSN - N. Tonellotto - Distributed Enabling Platforms

HPSS Treefrog Summary MARCH 1, 2018

APIs - what are they, really? Web API, Programming libraries, third party APIs etc

Grid Portal Architectures for Scientific Applications

Navigational Data Management. Joshua Stillerman, Martin Greenwald, John Wright MIT Plasma Science and Fusion Center

NARCCAP: North American Regional Climate Change Assessment Program. Seth McGinnis, NCAR

Day 1 : August (Thursday) An overview of Globus Toolkit 2.4

30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy

Grid Computing Initiative at UI: A Preliminary Result

Regular Forum of Lreis. Speechmaker: Gao Ang

Progress in building the International Lattice Data Grid

Distributing BaBar Data using the Storage Resource Broker (SRB)

Pangeo. A community-driven effort for Big Data geoscience

Storage Virtualization. Eric Yen Academia Sinica Grid Computing Centre (ASGC) Taiwan

THE EUCLID ARCHIVE SYSTEM: A DATA-CENTRIC APPROACH TO BIG DATA

The EHRI GraphQL API IEEE Big Data Workshop on Computational Archival Science

Replica Selection in the Globus Data Grid

Toward Scalable Monitoring on Large-Scale Storage for Software Defined Cyberinfrastructure

irods usage at CC-IN2P3: a long history

The data grid: Towards an architecture for the distributed management and analysis of large scientific datasets

Distributed Data Management on the Grid. Mario Lassnig

MONitoring Agents using a Large Integrated Services Architecture. Iosif Legrand California Institute of Technology

Managing HPC Active Archive Storage with HPSS RAIT at Oak Ridge National Laboratory

Design patterns for data-driven research acceleration

Mitigating Risk of Data Loss in Preservation Environments

Globus Online and HPSS. KEK, Tsukuba Japan October 16 20, 2017 Guangwei Che

ACME Exploratory Analysis and Classic Diagnostics Viewer

The ASCI/DOD Scalable I/O History and Strategy Run Time Systems and Scalable I/O Team Gary Grider CCN-8 Los Alamos National Laboratory LAUR

Transcription:

The Earth System Grid: A Visualisation Solution Gary Strand

Introduction

Acknowledgments PI s Ian Foster (ANL) Don Middleton (NCAR) Dean Williams (LLNL) ESG Development Team Veronika Nefedova (ANL) Ann Chervenak (ISI/USC) Carl Kesselman (ISI/USC) David Bernholdt (ORNL) Kasidit Chanchio (ORNL) Line Pouchard (ORNL) Alex Sim (LBNL) Arie Shoshani (LBNL) Bob Drach (LLNL) Dave Brown (NCAR) Gary Strand (NCAR) Jose Garcia (NCAR) Luca Cinquini (NCAR) Peter Fox (NCAR)

Current Practices o Scientist (or others) wants a visualisation o Visualisation person gets appropriate data after verifying with data manager as to the name, location, total size, etc. o Data moved to local machine that has visualisation tools o Visualization created on local machine o Hopefully, someone remembers to archive the visualisation

Simple Vis Example

Problems in the process o What if the data cannot be found (e.g. we have 1.2 million files, 73 TB of data), or the data manager is unavailable? o What if there isn t enough disk space or sufficient other resources? o What if a better visualisation tool is located elsewhere? o What if the visualisation should be shared? o What if the visualisation is lost? o ESG is part of the answers to these questions

What is ESG? LBNL: Climate storage facility ANL: Computational grids, & grid-based applications LLNL: Model diagnostics & inter-comparison USC/ISI: Computational grids, & grid-based applications NCAR: Climate change predication and scenarios LANL: Next generation coupled models & computing ORNL: Climate storage & computational resources

ESG Architecture LBNL HPSS! High Performance! Storage System! disk! ANL SRM! Storage Resource! Management! gridftp! server! NCAR opendapg! server! gridftp! Striped! server! CAS! Community Authorization Services! Tomcat servlet engine! MyProxy! server! disk! LLNL MCS client! RLS client! MyProxy client! CAS client! SRM! Storage Resource! Management! gridftp! server! GRAM! gatekeeper! gridftp! gridftp! server! ORNL ISI MCS! Metadata Cataloguing Services! SOAP" SRM! Storage Resource! Management! gridftp! gridftp! server! SRM! Storage Resource! Management! RLS! Replica Location Services! RMI" disk! MSS! Mass Storage System! disk! HPSS! High Performance! Storage System!

Solutions o What happens when data cannot be found, or the data manager is unavailable? Metadata catalogue service (MCS) Replica location service (RLS)

MCS and RLS and Metadata Services ESG CLIENTS API! & USER INTERFACES! PUBLISHING! ANALYSIS & VISUALIZATION! SEARCH & DISCOVERY! ADMINISTRATION! BROWSING & DISPLAY! METADATA! EXTRACTION! METADATA! ANNOTATION! HIGH LEVEL METADATA SERVICES! METADATA & DATA! REGISTRATION! METADATA! BROWSING! METADATA! QUERY! METADATA! AGGREGATION! METADATA! VALIDATION! METADATA! DISPLAY! METADATA! DISCOVERY! METADATA ACCESS! (update, insert, delete, query)! CORE METADATA SERVICES! SERVICE TRANSLATION! LIBRARY! METADATA HOLDINGS! Data &! Metadata! Catalog! Dublin Core! Database! mirror! Dublin Core! XML Files! COARDS! Database! COMMENTS! XML Files!

Solutions (contd.) o What if there isn t enough disk space or sufficient other resources? Hierarchical Resource Manager (HRM)

HRM

Solutions (contd.) o What What if a better visualization tool is located elsewhere? Distributed visualization

CDAT Example of an ESG Script Access The next-generation language, Python, is used to access the Earth System Grid (ESG) at LLNL Import cdms, vcs db = cdms.open( ldap://localhost:389/database=demo,ou=pcmdi,o=llnl,c=us ) f = db.open( ncep_reanalysis_mo ) ds = f( ts ) x=vcs.init( ) x.plot(ds)

CDAT: Example of an ESG GUI Client Access

Solutions (contd.) o What if the visualization should be shared? Access Grid plus Visualisation Tool

Collaborative Environments Science Portals + AccessGrid: University of Michigan (Knoop, Hardin) Vegetation & Ecosystem Mapping Program (VEMAP) NCAR/SCD VETS/KEG Argonne National Labs

Conclusions " Visualisation can require as many services and resources as the initial computation " Many sites do not offer sufficient resources for the visualisations earth sciences require " ESG provides, and will provide, the tools that enable visualisation on a grander scale

Conclusions (contd.) " ESG tools enable better data access, better data knowledge, and the processes of collaboration for the needs of investigating, visualising, and learning