Use Hierarchical Storage and Analysis to Exploit Intrinsic Parallelism

Size: px
Start display at page:

Download "Use Hierarchical Storage and Analysis to Exploit Intrinsic Parallelism"

Transcription

1 Use Hierarchical Storage and Analysis to Exploit Intrinsic Parallelism Charlie Zender1 Pedro Vicente1, Wenshan Wang1 1 Departments of Earth System Science and Computer Science, UC Irvine AGU Fall Meeting San Francisco, CA December 9-13, 2013 Seminar on Web

2 Model Evaluation Background 1. Evaluation of models by data (or other models) 2. Dataset-level operators simplify evaluation 3. Hierarchical groups are a powerful extension

3 Parallelism Already Exploited A. Replicate arithmetic over variables: T, p, q... (NCO uses OpenMP, also suitable for MPI). B. MPI & Collective I/O (PnetCDF, netcdf4 API): C. Compile scripts into (independent) Basic Blocks for batch scheduling (SWAMP, Wang & Zender)

4 What remains exploitable? Scientific method uses replication to build statistical significance and to correct mistakes. This translates into experimental ensembles of data. Larger ensembles facilitate more detailed questions and robust conclusions. Intrinsic Parallelism Often we pose the same questions of each ensemble, and thus of each member of each ensemble... Francis Bacon ( )

5 Flat File Group File (aka, a namespace) /var1, /var2,... /varn var1, var2,... varn group_name... varn

6 cesm ecmwf giss Contain different namespaces Inherit shared properties Advantageous because...datasets often use same variable, coordinate names

7 cesm run1 run2 Hierarchical Groups Nested groups can contain ensemble simulations, or measurement replications runj Multiple nested groups for Model Intercomparison Projects (CMIP5, PCMIP, AeroCom...)

8 giss cesm ecmwf run1 run1 run2 run2 run2 runj runj runj run1

9 What s New? nces: Ensemble Statistics + Runs 1,2,...J Runs J+1. Unlimited, ragged F Files E(f) Ensembles M(e,f) Members (ragged) V(e) Variables D(e,v) Dimensions (sharable)

10 Exploit Name Parallelism Non-Parallel: Exploit Parallelism: mdl=addfile(mdl.nc) ncdiff mdl.nc obs.nc out.nc Model mdl_var1=mdl->var1 mdl_var2=mdl->var2 obs=addfile(obs.nc) obs_var1=obs->var1 Obs. obs_var2=obs->var2 var1=mdl_var1-obs_var1 var2=mdl_var2-obs_var2 out=addfile(out.nc) Math out->var1=var1 out->var2=var2 Output

11 Exploit Namespace Parallelism Non-Parallel: obs=addfile(obs.nc) obs_var1=obs->var1 obs_var2=obs->var2 do j=1,8 mdl=addfile("mdl_"+j+".nc") mdl_var1=mdl->var1 mdl_var2=mdl->var2 var1(j)=mdl_var1-obs_var1 var2(j)=mdl_var2-obs_var2 end do out=addfile(out.nc) out->var1=var1 out->var2=var2 Exploit Parallelism: ncdiff mdl.nc obs.nc out.nc

12 Exploit Hierarchical Parallelism Non-Parallel: for i in 'cesm ecmwf giss'; do for j in ' '; do var1(i,j)=mdl_var1(i,j)-obs_var1 var2(i,j)=mdl_var2(i,j)-obs_var2... end do end do Exploit Parallelism: ncdiff cmip5.nc obs.nc out.nc Variable Parallelism: for i in 'cesm ecmwf giss'; do for j in ' '; do ncdiff $i_$j.nc obs.nc $i_$j_obs.nc done done

13 Exploit Intrinsic Parallelism Ensembles members (Run1, Run2, ) Or.HDF,.h5, DAP, SFTP... Ensembles means nces cmip_*.nc cmip_avg.nc ncdiff cmip_avg.nc obs.nc bias.nc Tropical hyperslab Area weight ncwa -w area -d latitude,-23.,23. \ -v tas bias.nc tropics.nc Weighted Averager Ts(t,x,y) Tropical-mean Ts biases of all CMIP5 models

14 Exploit Intrinsic Parallelism Disparate datasets in one suitcase : BCC-CSM CanAM CMCC CNRM CFS ACCESS CSIRO EC-EARTH GFDL FIO-ESM BNU-ESM CESM INM-CM4 IPSL-CM5 FGOAKS MIROC HadGEM2 MPI-ESM MRI-ESM GISS GEOS CCSM4 NorESM1 NICAM

15 Types of Intrinsic Parallelism math model.nc observ.nc out.nc ncdiff cmip5.nc modis.nc out.nc Type Name (e.g., multiple variables in flat file) Namespace (group namespaces, single level) Hierarchical Namespace (nested group hierarchy) Value/Challenge Simple, easy, netcdf3 Single ensemble Aggregator (ncecat) Multiple ensembles Aggregator (ncecat)

16 Exploit Intrinsic Parallelism math model.nc observ.nc out.nc ncdiff cmip5.nc modis.nc out.nc Hierarchical Storage and Analysis can exploit intrinsic parallelism of replication and other parallelisms. Use more Bacon in your analysis.

CMIP5 Update. Karl E. Taylor. Program for Climate Model Diagnosis and Intercomparison (PCMDI) Lawrence Livermore National Laboratory

CMIP5 Update. Karl E. Taylor. Program for Climate Model Diagnosis and Intercomparison (PCMDI) Lawrence Livermore National Laboratory CMIP5 Update Karl E. Taylor Program for Climate Model Diagnosis and Intercomparison () Lawrence Livermore National Laboratory Presented to the WCRP Working Group on Coupled Modelling Hamburg, Germany 24

More information

part two: variable to variable relationship v Correct wrong unit and other issues v Resolve some common issues v Run a regional diagnostics

part two: variable to variable relationship v Correct wrong unit and other issues v Resolve some common issues v Run a regional diagnostics Outline v Describe the benchmark system structure v Benchmark Scoring System v Install the package v Run the package v Add a new benchmark data v Add a new variable v Add a new model v Add a new diagnostic

More information

netcdf Operators [NCO]

netcdf Operators [NCO] [NCO] http://nco.sourceforge.net/ 1 Introduction and History Suite of Command Line Operators Designed to operate on netcdf/hdf files Each is a stand alone executable Very efficient for specific tasks Available

More information

16 th Annual CESM Workshop s Software Engineering Working Group. Parallel Analysis of GeOscience Data Status and Future

16 th Annual CESM Workshop s Software Engineering Working Group. Parallel Analysis of GeOscience Data Status and Future 16 th Annual CESM Workshop s Software Engineering Working Group Parallel Analysis of GeOscience Data Status and Future Jeff Daily PI: Karen Schuchardt, in collaboration with Colorado State University s

More information

IPSL Boot Camp Part 5:

IPSL Boot Camp Part 5: IPSL Boot Camp Part 5: CDO and NCO Sabine Radanovics, Jérôme Servonnat March 24, 2016 1 / 33 Group exercise Suppose... We have Tasks 30 years climate model simulation 1 file per month, 6 hourly data netcdf

More information

ExArch: Climate analytics on distributed exascale data archives Martin Juckes, V. Balaji, B.N. Lawrence, M. Lautenschlager, S. Denvil, G. Aloisio, P.

ExArch: Climate analytics on distributed exascale data archives Martin Juckes, V. Balaji, B.N. Lawrence, M. Lautenschlager, S. Denvil, G. Aloisio, P. ExArch: Climate analytics on distributed exascale data archives Martin Juckes, V. Balaji, B.N. Lawrence, M. Lautenschlager, S. Denvil, G. Aloisio, P. Kushner, D. Waliser, S. Pascoe, A. Stephens, P. Kershaw,

More information

ExArch, Edinburgh, March 2014

ExArch, Edinburgh, March 2014 ExArch: Climate analytics on distributed exascale data archives Martin Juckes, V. Balaji, B.N. Lawrence, M. Lautenschlager, S. Denvil, G. Aloisio, P. Kushner, D. Waliser, S. Pascoe, A. Stephens, P. Kershaw,

More information

Neil Berg October 18 th, The wonderful world of NCO

Neil Berg October 18 th, The wonderful world of NCO Neil Berg October 18 th, 2013 The wonderful world of NCO NetCDF Operators Q: What is NCO? A: Collection of command-line based tools specifically for analyzing, processing, viewing, and manipulating netcdf

More information

Data Reference Syntax Governing Standards within Climate Research Data archived in the ESGF

Data Reference Syntax Governing Standards within Climate Research Data archived in the ESGF Data Reference Syntax Governing Standards within Climate Research Data archived in the ESGF Michael Kolax Swedish Meteorological and Hydrological Institute Motivation for a DRS within CMIP5 In CMIP5 the

More information

Dataset Interoperability (formerly HDF5 Conventions) Working Group

Dataset Interoperability (formerly HDF5 Conventions) Working Group Dataset Interoperability (formerly HDF5 Conventions) Working Group Co-Chairs: Charlie Zender and Peter Leonard Ed Armstrong, Sean Bailey, Bill Emanual, Fan Feng, Doug Fowler, Ted Haberman, Beth Huffer,

More information

Adapting Software to NetCDF's Enhanced Data Model

Adapting Software to NetCDF's Enhanced Data Model Adapting Software to NetCDF's Enhanced Data Model Russ Rew UCAR Unidata EGU, May 2010 Overview Background What is netcdf? What is the netcdf classic data model? What is the netcdf enhanced data model?

More information

7C.2 EXPERIENCE WITH AN ENHANCED NETCDF DATA MODEL AND INTERFACE FOR SCIENTIFIC DATA ACCESS. Edward Hartnett*, and R. K. Rew UCAR, Boulder, CO

7C.2 EXPERIENCE WITH AN ENHANCED NETCDF DATA MODEL AND INTERFACE FOR SCIENTIFIC DATA ACCESS. Edward Hartnett*, and R. K. Rew UCAR, Boulder, CO 7C.2 EXPERIENCE WITH AN ENHANCED NETCDF DATA MODEL AND INTERFACE FOR SCIENTIFIC DATA ACCESS Edward Hartnett*, and R. K. Rew UCAR, Boulder, CO 1 INTRODUCTION TO NETCDF AND THE NETCDF-4 PROJECT The purpose

More information

Climate Models: Challenges for Fortran Development Tools

Climate Models: Challenges for Fortran Development Tools Climate Models: Challenges for Fortran Development Tools Mariano Méndez III-LIDI, Facultad de Informática Universidad Nacional de La Plata 50 y 120, 1900, La Plata, Argentina Fernando G. Tinetti III-LIDI,

More information

Identifier Infrastructure Usage for Global Climate Reporting

Identifier Infrastructure Usage for Global Climate Reporting Identifier Infrastructure Usage for Global Climate Reporting IoT Week 2017, Geneva Tobias Weigel Deutsches Klimarechenzentrum (DKRZ) World Data Center for Climate (WDCC) Scientific driver: Global climate

More information

Convergence of model frameworks and data frameworks

Convergence of model frameworks and data frameworks Convergence of model frameworks and data frameworks V. Balaji Princeton University and NOAA/GFDL PRISM Community Meeting CERFACS Toulouse FRANCE 16 November 2005 The routine use of Earth System models

More information

This version is the same as NetCDF Extractor V.2.0, but it has an API for plotting contour and heat map graphs.

This version is the same as NetCDF Extractor V.2.0, but it has an API for plotting contour and heat map graphs. What is NetCDF Extractor V..? This version is the same as NetCDF Extractor V..0, but it has an API for plotting contour and heat map graphs. For applying this tool, please following these steps: Step :

More information

Enhancement for bitwise identical reproducibility of Earth system modeling on the C-Coupler platform

Enhancement for bitwise identical reproducibility of Earth system modeling on the C-Coupler platform Geosci. Model Dev. Discuss., 8, 2403 243, www.geosci-model-dev-discuss.net/8/2403// doi:.194/gmdd-8-2403- Author(s). CC Attribution 3.0 License. This discussion paper is/has been under review for the journal

More information

The Earth System Modeling Framework (and Beyond)

The Earth System Modeling Framework (and Beyond) The Earth System Modeling Framework (and Beyond) Fei Liu NOAA Environmental Software Infrastructure and Interoperability http://www.esrl.noaa.gov/nesii/ March 27, 2013 GEOSS Community ESMF is an established

More information

HTAP2 Data Analysis Logistics

HTAP2 Data Analysis Logistics HTAP2 Data Analysis Logistics Michael Schulz, Jan Griesfeller EMEP-MSCW Norwegian Meteorological Institute Martin Schultz, Michael Decker, Snehal Waychal FZ Julich 1 05/12/2013 HTAP meeting San Francisco

More information

NetCDF-4: : Software Implementing an Enhanced Data Model for the Geosciences

NetCDF-4: : Software Implementing an Enhanced Data Model for the Geosciences NetCDF-4: : Software Implementing an Enhanced Data Model for the Geosciences Russ Rew, Ed Hartnett, and John Caron UCAR Unidata Program, Boulder 2006-01-31 Acknowledgments This work was supported by the

More information

Uniform Resource Locator Wide Area Network World Climate Research Programme Coupled Model Intercomparison

Uniform Resource Locator Wide Area Network World Climate Research Programme Coupled Model Intercomparison Glossary API Application Programming Interface AR5 IPCC Assessment Report 4 ASCII American Standard Code for Information Interchange BUFR Binary Universal Form for the Representation of meteorological

More information

PyCordexer. A RegCM output format converter according to CORDEX archive specifications

PyCordexer. A RegCM output format converter according to CORDEX archive specifications PyCordexer A RegCM output format converter according to CORDEX archive specifications December 2014 2 PyCordexer The PyCordexer scripts have been developed to ease the RegCM Model User in converting variables

More information

NARCCAP: North American Regional Climate Change Assessment Program. Seth McGinnis, NCAR

NARCCAP: North American Regional Climate Change Assessment Program. Seth McGinnis, NCAR NARCCAP: North American Regional Climate Change Assessment Program Seth McGinnis, NCAR mcginnis@ucar.edu NARCCAP: North American Regional Climate Change Assessment Program Nest highresolution regional

More information

CDI-pio & XIOS I/O servers compatibility with HR climate models Eric Maisonnave, Irina Fast, Thomas Jahns, Joachim Biercamp, Stéphane Sénési, Yann

CDI-pio & XIOS I/O servers compatibility with HR climate models Eric Maisonnave, Irina Fast, Thomas Jahns, Joachim Biercamp, Stéphane Sénési, Yann CDI-pio & XIOS I/O servers compatibility with HR climate models Eric Maisonnave, Irina Fast, Thomas Jahns, Joachim Biercamp, Stéphane Sénési, Yann Meurdesoif, Uwe Fladrich TR/CMGC/17/52 Abstract I/O performance

More information

UNIVERSITY OF CALIFORNIA, IRVINE. Compilation, Locality Optimization, and Managed Distributed Execution of Scientific Dataflows DISSERTATION

UNIVERSITY OF CALIFORNIA, IRVINE. Compilation, Locality Optimization, and Managed Distributed Execution of Scientific Dataflows DISSERTATION UNIVERSITY OF CALIFORNIA, IRVINE Compilation, Locality Optimization, and Managed Distributed Execution of Scientific Dataflows DISSERTATION submitted in partial satisfaction of the requirements for the

More information

Updated ranking of GCM-RCM results for five Scandinavian locations

Updated ranking of GCM-RCM results for five Scandinavian locations METreport No. /0 ISSN -0 Climate Updated ranking of GCM-RCM results for five Scandinavian locations Oskar Landgren, Jan Erik Haugen METreport Title Updated ranking of GCM-RCM results for five Scandinavian

More information

Kepler Scientific Workflow and Climate Modeling

Kepler Scientific Workflow and Climate Modeling Kepler Scientific Workflow and Climate Modeling Ufuk Turuncoglu Istanbul Technical University Informatics Institute Cecelia DeLuca Sylvia Murphy NOAA/ESRL Computational Science and Engineering Dept. NESII

More information

Parallel I/O Performance Study and Optimizations with HDF5, A Scientific Data Package

Parallel I/O Performance Study and Optimizations with HDF5, A Scientific Data Package Parallel I/O Performance Study and Optimizations with HDF5, A Scientific Data Package MuQun Yang, Christian Chilan, Albert Cheng, Quincey Koziol, Mike Folk, Leon Arber The HDF Group Champaign, IL 61820

More information

Implementing a Data Quality Strategy to simplify access to data

Implementing a Data Quality Strategy to simplify access to data IN43D-07 AGU Fall Meeting 2016 Implementing a Quality Strategy to simplify access to data Kelsey Druken, Claire Trenham, Ben Evans, Clare Richards, Jingbo Wang, & Lesley Wyborn National Computational Infrastructure,

More information

Joachim Biercamp Deutsches Klimarechenzentrum (DKRZ) With input from Peter Bauer, Reinhard Budich, Sylvie Joussaume, Bryan Lawrence.

Joachim Biercamp Deutsches Klimarechenzentrum (DKRZ) With input from Peter Bauer, Reinhard Budich, Sylvie Joussaume, Bryan Lawrence. Joachim Biercamp Deutsches Klimarechenzentrum (DKRZ) With input from Peter Bauer, Reinhard Budich, Sylvie Joussaume, Bryan Lawrence. The ESiWACE project has received funding from the European Union s Horizon

More information

CMIP5 Datenmanagement erste Erfahrungen

CMIP5 Datenmanagement erste Erfahrungen CMIP5 Datenmanagement erste Erfahrungen Dr. Michael Lautenschlager Deutsches Klimarechenzentrum Helmholtz Open Access Webinare zu Forschungsdaten Webinar 18-17.01./28.01.14 CMIP5 Protocol +Timeline Taylor

More information

Working with JavaScript

Working with JavaScript Working with JavaScript Creating a Programmable Web Page for North Pole Novelties 1 Objectives Introducing JavaScript Inserting JavaScript into a Web Page File Writing Output to the Web Page 2 Objectives

More information

PDRMIP data available

PDRMIP data available PDRMIP data available Experiments CanESM2 MPI-ESM NorESM1 Core Base Co2x2 Ch4x3 Solar Bcx10 x Sulx5 x Regional bcx10asia x x x sulx10asia x x x sulx10eur x x x sulred x x x x x x x x sulasiared x x x x

More information

Parallel processing large data

Parallel processing large data Parallel processing large data Overview of presentation Traditional parallel processing and parallelising data analysis Parallel processing on JASMIN / LOTUS Examples of running parallel code (on LOTUS)

More information

Dataset Interoperability Recommendations for Earth Science

Dataset Interoperability Recommendations for Earth Science Status of this RFC Dataset Interoperability Recommendations for Earth Science This RFC provides information to the NASA Earth Science community. This RFC does not specify an Earth Science Data Systems

More information

Index Introduction Setting up an account Searching and accessing Download Advanced features

Index Introduction Setting up an account Searching and accessing Download Advanced features ESGF Earth System Grid Federation Tutorial Index Introduction Setting up an account Searching and accessing Download Advanced features Index Introduction IT Challenges of Climate Change Research ESGF Introduction

More information

cdo Data Processing (and Production) Luis Kornblueh, Uwe Schulzweida, Deike Kleberg, Thomas Jahns, Irina Fast

cdo Data Processing (and Production) Luis Kornblueh, Uwe Schulzweida, Deike Kleberg, Thomas Jahns, Irina Fast cdo Data Processing (and Production) Luis Kornblueh, Uwe Schulzweida, Deike Kleberg, Thomas Jahns, Irina Fast Max-Planck-Institut für Meteorologie, DKRZ September 24, 2014 MAX-PLANCK-GESELLSCHAFT Data

More information

NetCDF and HDF5. NASA Earth Science Data Systems Working Group October 20, 2010 New Orleans. Ed Hartnett, Unidata/UCAR, 2010

NetCDF and HDF5. NASA Earth Science Data Systems Working Group October 20, 2010 New Orleans. Ed Hartnett, Unidata/UCAR, 2010 NetCDF and HDF5 NASA Earth Science Data Systems Working Group October 20, 2010 New Orleans Ed Hartnett, Unidata/UCAR, 2010 Unidata Mission: To provide the data services, tools, and cyberinfrastructure

More information

XIOS and I/O Where are we?

XIOS and I/O Where are we? Y. Meurdesoif, M.H. Nguyen, R. Lacroix, A. Caubel, O.Abramkina, Y. Wang, J. Dérouillat U t + 2Ω U =. XIOS and I/O Where are we? 25/01/17 1 Short reminder : IS-ENES 1 Achievement v Was focused on : Flexibility

More information

Introduction to the ClimValDiagTool

Introduction to the ClimValDiagTool Introduction to the ClimValDiagTool K. Gottschaldt & V. Eyring, 13. 2. 2013 1. General Info 2. Access miklip.dkrz.de 3. Get the code 4. Prepare data 5. Walk through an example 6. Modify the example 7.

More information

CESM (Community Earth System Model) Performance Benchmark and Profiling. August 2011

CESM (Community Earth System Model) Performance Benchmark and Profiling. August 2011 CESM (Community Earth System Model) Performance Benchmark and Profiling August 2011 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Intel, Dell,

More information

Dataset Interoperability Working Group

Dataset Interoperability Working Group Dataset Interoperability Working Group Co-Chairs: Charlie Zender and Peter Leonard Ed Armstrong, Mary Jo Brodzik, Joe Glassy, Aleksander Jelenak, Siri Jodha Khalsa, Wenli Yang; Steve Berrick, Chris Lynnes,

More information

NCL variable based on a netcdf variable model

NCL variable based on a netcdf variable model NCL variable based on a netcdf variable model netcdf files self describing (ideally) all info contained within file no external information needed to determine file contents portable [machine independent]

More information

CERA GUI Usage. Revision History. Contents

CERA GUI Usage. Revision History. Contents CERA GUI Usage Revision History Revision Author Scope February-2017 DKRZ Data management Public release Contents Introduction...2 Intended Audience...2 Revision History...2 Interface...2 Browse...4 Search...6

More information

NCO User Guide. by Charlie Zender Departments of Earth System Science and Computer Science University of California, Irvine

NCO User Guide. by Charlie Zender Departments of Earth System Science and Computer Science University of California, Irvine NCO User Guide A suite of netcdf operators Edition 4.7.7, for NCO Version 4.7.7-beta01 September 2018 by Charlie Zender Departments of Earth System Science and Computer Science University of California,

More information

Metview s new Python interface

Metview s new Python interface Metview s new Python interface Workshop on developing Python frameworks for earth system sciences. ECMWF, 2018 Iain Russell Development Section, ECMWF Thanks to Sándor Kertész Fernando Ii Stephan Siemen

More information

FMS: the Flexible Modeling System

FMS: the Flexible Modeling System FMS: the Flexible Modeling System Coupling Technologies for Earth System Modeling Toulouse FRANCE V. Balaji balaji@princeton.edu Princeton University 15 December 2010 Balaji (Princeton University) Flexible

More information

NetCDF-4: A New Data Model, Programming Interface, and Format Using HDF5

NetCDF-4: A New Data Model, Programming Interface, and Format Using HDF5 NetCDF-4: A New Data Model, Programming Interface, and Format Using HDF5 Russ Rew, Ed Hartnett, John Caron UCAR Unidata Program Center Mike Folk, Robert McGrath, Quincey Kozial NCSA and The HDF Group,

More information

Writing NetCDF Files: Formats, Models, Conventions, and Best Practices. Overview

Writing NetCDF Files: Formats, Models, Conventions, and Best Practices. Overview Writing NetCDF Files: Formats, Models, Conventions, and Best Practices Russ Rew, UCAR Unidata June 28, 2007 1 Overview Formats, conventions, and models NetCDF-3 limitations NetCDF-4 features: examples

More information

Introduction to NetCDF

Introduction to NetCDF Introduction to NetCDF NetCDF is a set of software libraries and machine-independent data formats that support the creation, access, and sharing of array-oriented scientific data. First released in 1989.

More information

Experiences with Porting CESM to ARCHER

Experiences with Porting CESM to ARCHER Experiences with Porting CESM to ARCHER ARCHER Technical Forum Webinar, 25th February, 2015 Gavin J. Pringle 25 February 2015 ARCHER Technical Forum Webinar Overview of talk Overview of the associated

More information

Implementing a Data Quality Strategy to simplify access to data

Implementing a Data Quality Strategy to simplify access to data Implementing a Quality Strategy to simplify access to data Kelsey Druken Implementing a Quality Strategy to simplify access to data Kelsey Druken, Claire Trenham, Lesley Wyborn, Ben Evans National Computational

More information

Documentation of the chemistry-transport model. [version 2017r4] July 25, How to install required libraries under GNU/Linux

Documentation of the chemistry-transport model. [version 2017r4] July 25, How to install required libraries under GNU/Linux Documentation of the chemistry-transport model [version 2017r4] July 25, 2018. How to install required libraries under GNU/Linux Contents 1 pnetcdf and NetCDF4 formats 2 1.1 Problems with NetCDF4 files..........................

More information

In-situ processing of big raster data with command line tools *

In-situ processing of big raster data with command line tools * In-situ processing of big raster data with command line tools * R.A. Rodriges Zalipynis National Research University Higher School of Economics, Moscow, Russia Explosive growth of raster data volumes in

More information

A simple OASIS interface for CESM E. Maisonnave TR/CMGC/11/63

A simple OASIS interface for CESM E. Maisonnave TR/CMGC/11/63 A simple OASIS interface for CESM E. Maisonnave TR/CMGC/11/63 Index Strategy... 4 Implementation... 6 Advantages... 6 Current limitations... 7 Annex 1: OASIS3 interface implementation on CESM... 9 Annex

More information

NCO User s Guide. by Charlie Zender Department of Earth System Science University of California, Irvine

NCO User s Guide. by Charlie Zender Department of Earth System Science University of California, Irvine NCO User s Guide A suite of netcdf operators Edition 4.0.5, for NCO Version 4.0.5 September 2010 by Charlie Zender Department of Earth System Science University of California, Irvine Copyright c 1995 2010

More information

An Evolutionary Path to Object Storage Access

An Evolutionary Path to Object Storage Access An Evolutionary Path to Object Storage Access David Goodell +, Seong Jo (Shawn) Kim*, Robert Latham +, Mahmut Kandemir*, and Robert Ross + *Pennsylvania State University + Argonne National Laboratory Outline

More information

Efficient clustered server-side data analysis workflows using SWAMP

Efficient clustered server-side data analysis workflows using SWAMP Earth Sci Inform (2009) 2:141 155 DOI 10.1007/s12145-009-0021-z RESEARCH ARTICLE Efficient clustered server-side data analysis workflows using SWAMP Daniel L. Wang Charles S. Zender Stephen F. Jenks Received:

More information

WRF Software Architecture. John Michalakes, Head WRF Software Architecture Michael Duda Dave Gill

WRF Software Architecture. John Michalakes, Head WRF Software Architecture Michael Duda Dave Gill WRF Software Architecture John Michalakes, Head WRF Software Architecture Michael Duda Dave Gill Outline Introduction Computing Overview WRF Software Overview Introduction WRF Software Characteristics

More information

DEPARTMENT OF COMPUTER SCIENCE

DEPARTMENT OF COMPUTER SCIENCE Department of Computer Science 1 DEPARTMENT OF COMPUTER SCIENCE Office in Computer Science Building, Room 279 (970) 491-5792 cs.colostate.edu (http://www.cs.colostate.edu) Professor L. Darrell Whitley,

More information

NetCDF and Scientific Data Durability. Russ Rew, UCAR Unidata ESIP Federation Summer Meeting

NetCDF and Scientific Data Durability. Russ Rew, UCAR Unidata ESIP Federation Summer Meeting NetCDF and Scientific Data Durability Russ Rew, UCAR Unidata ESIP Federation Summer Meeting 2009-07-08 For preserving data, is format obsolescence a non-issue? Why do formats (and their access software)

More information

PRISM Project for Integrated Earth System Modelling An Infrastructure Project for Climate Research in Europe funded by the European Commission

PRISM Project for Integrated Earth System Modelling An Infrastructure Project for Climate Research in Europe funded by the European Commission PRISM Project for Integrated Earth System Modelling An Infrastructure Project for Climate Research in Europe funded by the European Commission under Contract EVR1-CT2001-40012 The VTK_Mapper Application

More information

Working with Scientific Data in ArcGIS Platform

Working with Scientific Data in ArcGIS Platform Working with Scientific Data in ArcGIS Platform Sudhir Raj Shrestha sshrestha@esri.com Hong Xu hxu@esri.com Esri User Conference, San Diego, CA. July 11, 2017 What we will cover today Scientific Multidimensional

More information

CESM Workflow Refactor Project Land Model and Biogeochemistry Working Groups 2015 Winter Meeting CSEG & ASAP/CISL

CESM Workflow Refactor Project Land Model and Biogeochemistry Working Groups 2015 Winter Meeting CSEG & ASAP/CISL CESM Workflow Refactor Project Land Model and Biogeochemistry Working Groups 2015 Winter Meeting Alice Bertini Sheri Mickelson CSEG & ASAP/CISL CESM Workflow Refactor Project Who s involved? Joint project

More information

Intro to CMIP, the WHOI CMIP5 community server, and planning for CMIP6

Intro to CMIP, the WHOI CMIP5 community server, and planning for CMIP6 Intro to CMIP, the WHOI CMIP5 community server, and planning for CMIP6 Caroline Ummenhofer, PO Overview - Background on IPCC & CMIP - WHOI CMIP5 server - Available model output - How to access files -

More information

The EHRI GraphQL API IEEE Big Data Workshop on Computational Archival Science

The EHRI GraphQL API IEEE Big Data Workshop on Computational Archival Science The EHRI GraphQL API IEEE Big Data Workshop on Computational Archival Science 13/12/2017 Mike Bryant CONNECTING COLLECTIONS The EHRI Project The main objective of EHRI is to support the Holocaust research

More information

SciSpark 201. Searching for MCCs

SciSpark 201. Searching for MCCs SciSpark 201 Searching for MCCs Agenda for 201: Access your SciSpark & Notebook VM (personal sandbox) Quick recap. of SciSpark Project What is Spark? SciSpark Extensions scitensor: N-dimensional arrays

More information

What NetCDF users should know about HDF5?

What NetCDF users should know about HDF5? What NetCDF users should know about HDF5? Elena Pourmal The HDF Group July 20, 2007 7/23/07 1 Outline The HDF Group and HDF software HDF5 Data Model Using HDF5 tools to work with NetCDF-4 programs files

More information

NEXTGenIO Performance Tools for In-Memory I/O

NEXTGenIO Performance Tools for In-Memory I/O NEXTGenIO Performance Tools for In- I/O holger.brunst@tu-dresden.de ZIH, Technische Universität Dresden 22 nd -23 rd March 2017 Credits Intro slides by Adrian Jackson (EPCC) A new hierarchy New non-volatile

More information

netcdf-ld SKOS: demonstrating Linked Data vocabulary use within netcdf-compliant files

netcdf-ld SKOS: demonstrating Linked Data vocabulary use within netcdf-compliant files : demonstrating Linked Data vocabulary use within netcdf-compliant files Nicholas Car Data Architect Geoscience Australia nicholas.car@ga.gov.au Prepared for ISESS2017 conference (http://www.isess2017.org/)

More information

NetCDF = Network Common Data Form

NetCDF = Network Common Data Form msftmyz_omon_mpi-esm LR_historical_r1i1p1_185001-200512.nc prc_amon_mpi-esm-lr_historical_r1i1p1_185001-200512.nc ps_amon_mpi-esm-lr_historical_r1i1p1_185001-200512.nc psl_amon_mpi-esm-lr_historical_r1i1p1_185001-200512.nc

More information

Matlab Advanced Programming. Matt Wyant University of Washington

Matlab Advanced Programming. Matt Wyant University of Washington Matlab Advanced Programming Matt Wyant University of Washington Matlab as a programming Language Strengths (as compared to C/C++/Fortran) Fast to write -no type declarations needed Memory allocation/deallocation

More information

Production Petascale Climate Data Replication at NCI Lustre and our engagement with the Earth Systems Grid Federation (ESGF)

Production Petascale Climate Data Replication at NCI Lustre and our engagement with the Earth Systems Grid Federation (ESGF) Joseph Antony, Andrew Howard, Jason Andrade, Ben Evans, Claire Trenham, Jingbo Wang Production Petascale Climate Data Replication at NCI Lustre and our engagement with the Earth Systems Grid Federation

More information

The netcdf- 4 data model and format. Russ Rew, UCAR Unidata NetCDF Workshop 25 October 2012

The netcdf- 4 data model and format. Russ Rew, UCAR Unidata NetCDF Workshop 25 October 2012 The netcdf- 4 data model and format Russ Rew, UCAR Unidata NetCDF Workshop 25 October 2012 NetCDF data models, formats, APIs Data models for scienbfic data and metadata - classic: simplest model - - dimensions,

More information

Reproducibility and Replication in Climate Science

Reproducibility and Replication in Climate Science May 9 2018 Reproducibility and Replication in Climate Science Gavin Schmidt, NASA GISS For the National Academies of Sciences, Engineering, and Medicine, Committee on Reproducibility and Replicability

More information

High Performance Ocean Modeling using CUDA

High Performance Ocean Modeling using CUDA using CUDA Chris Lupo Computer Science Cal Poly Slide 1 Acknowledgements Dr. Paul Choboter Jason Mak Ian Panzer Spencer Lines Sagiv Sheelo Jake Gardner Slide 2 Background Joint research with Dr. Paul Choboter

More information

A recipe for fast(er) processing of netcdf files with Python and custom C modules

A recipe for fast(er) processing of netcdf files with Python and custom C modules A recipe for fast(er) processing of netcdf files with Python and custom C modules Ramneek Maan Singh a, Geoff Podger a, Jonathan Yu a a CSIRO Land and Water Flagship, GPO Box 1666, Canberra ACT 2601 Email:

More information

Reports on user support, training, and integration of NEMO and EC-Earth community models Milestone MS6

Reports on user support, training, and integration of NEMO and EC-Earth community models Milestone MS6 Reports on user support, training, and integration of NEMO and EC-Earth community models Milestone MS6 This project has received funding from the European Union s Horizon 2020 Research and Innovation Programme

More information

ESMValTool v2.0 Technical Overview

ESMValTool v2.0 Technical Overview ESMValTool v2.0 Technical Overview Mattia Righi Version 16.11.2018 www.dlr.de Folie 2 Outline Structure Installation Configuration Running ESMValTool Output directory structure Backend (aka preprocessor)

More information

The NCAR Community Data Portal

The NCAR Community Data Portal The NCAR Community Data Portal http://cdp.ucar.edu/ QuickTime and a TIFF (Uncompressed) decompressor are needed to see this picture. QuickTime and a TIFF (Uncompressed) decompressor are needed to see this

More information

Standards and business models transformations

Standards and business models transformations Standards and business models transformations Inspire Conference 2017 by Jean Michel Zigna, with support of Elisabeth Lambert, Tarek Habib, Tony Jolibois and Sylvain Marty Collecte Localisation Satellite

More information

Graham vs legacy systems

Graham vs legacy systems New User Seminar Graham vs legacy systems This webinar only covers topics pertaining to graham. For the introduction to our legacy systems (Orca etc.), please check the following recorded webinar: SHARCNet

More information

HPC Input/Output. I/O and Darshan. Cristian Simarro User Support Section

HPC Input/Output. I/O and Darshan. Cristian Simarro User Support Section HPC Input/Output I/O and Darshan Cristian Simarro Cristian.Simarro@ecmwf.int User Support Section Index Lustre summary HPC I/O Different I/O methods Darshan Introduction Goals Considerations How to use

More information

climate4impact.eu Christian Pagé, CERFACS

climate4impact.eu Christian Pagé, CERFACS IS-ENES2 1 st General Assembly 11-13 th June 2014 UPC Campus, Barcelona, Spain Status of infrastructure climate4impact.eu Christian Pagé, CERFACS Working teams and institutions CERFACS: Christian Pagé

More information

Distilling Regional Climate Model Data from NARCCAP for Use in Impacts Analysis

Distilling Regional Climate Model Data from NARCCAP for Use in Impacts Analysis Distilling Regional Climate Model Data from NARCCAP for Use in Impacts Analysis Seth McGinnis IMAGe CISL NCAR mcginnis@ucar.edu 2013-09-09 v.5 Outline Introduction Overview of NARCCAP Supporting impacts

More information

Orientation to NCAR, CISL and the Outreach Services Group

Orientation to NCAR, CISL and the Outreach Services Group Orientation to NCAR, CISL and the Outreach Services Group Dr. Richard Loft Director, SIParCS Program Director, Technology Development Div. Computational and Information Systems Laboratory loft@ucar.edu

More information

Data Issues for next generation HPC

Data Issues for next generation HPC Data Issues for next generation HPC Bryan Lawrence National Centre for Atmospheric Science National Centre for Earth Observation Rutherford Appleton Laboratory Caveats: Due to time, discussion is limited

More information

2. COURSE DESIGNATION: 3. COURSE DESCRIPTIONS:

2. COURSE DESIGNATION: 3. COURSE DESCRIPTIONS: College of San Mateo Official Course Outline 1. COURSE ID: CIS 278 TITLE: (CS1) Programming Methods: C++ C-ID: COMP 122 Units: 4.0 units Hours/Semester: 48.0-54.0 Lecture hours; 48.0-54.0 Lab hours; and

More information

Compiling the uncompilable: A case for shell. script compilation

Compiling the uncompilable: A case for shell. script compilation Compiling the uncompilable: A case for shell script compilation Daniel L. Wang,1, Department of Electrical Engineering and Computer Science Charles S. Zender, Department of Earth System Science Stephen

More information

Database Engineering. Percona Live, Amsterdam, September, 2015

Database Engineering. Percona Live, Amsterdam, September, 2015 Database Engineering Percona Live, Amsterdam, 2015 September, 2015 engineering, not administration 2 yesterday s DBA gatekeeper master builder superhero siloed specialized 3 engineering quantitative interdisciplinary

More information

The CIME Case Control System

The CIME Case Control System The CIME Case Control System An Object Oriented Python Data Driven Workflow Control System for Earth System Models Jim Edwards 22 nd Annual Community Earth System Model Workshop Boulder, CO 19-22 June

More information

Bash command shell language interpreter

Bash command shell language interpreter Principles of Programming Languages Bash command shell language interpreter Advanced seminar topic Louis Sugy & Baptiste Thémine Presentation on December 8th, 2017 Table of contents I. General information

More information

Clare Richards, Benjamin Evans, Kate Snow, Chris Allen, Jingbo Wang, Kelsey A Druken, Sean Pringle, Jon Smillie and Matt Nethery. nci.org.

Clare Richards, Benjamin Evans, Kate Snow, Chris Allen, Jingbo Wang, Kelsey A Druken, Sean Pringle, Jon Smillie and Matt Nethery. nci.org. The important role of HPC and data-intensive infrastructure facilities in supporting a diversity of Virtual Research Environments (VREs): working with Climate Clare Richards, Benjamin Evans, Kate Snow,

More information

Diagnostics and Exploratory Analysis Infrastructure for ACME Workflow

Diagnostics and Exploratory Analysis Infrastructure for ACME Workflow Diagnostics and Exploratory Analysis Infrastructure for ACME Workflow ORNL: Brian Smith, John Harney, Brian Jewell LLNL: Jeffrey Painter, James McEnerney, ORNL is managed by UT-Battelle for the US Department

More information

HDF5: An Introduction. Adam Carter EPCC, The University of Edinburgh

HDF5: An Introduction. Adam Carter EPCC, The University of Edinburgh HDF5: An Introduction Adam Carter EPCC, The University of Edinburgh What is HDF5? Hierarchical Data Format (version 5) From www.hdfgroup.org: HDF5 is a unique technology suite that makes possible the management

More information

Exploiting Weather & Climate Data at Scale (WP4)

Exploiting Weather & Climate Data at Scale (WP4) Exploiting Weather & Climate Data at Scale (WP4) Julian Kunkel 1 Bryan N. Lawrence 2,3 Jakob Luettgau 1 Neil Massey 4 Alessandro Danca 5 Sandro Fiore 5 Huang Hu 6 1 German Climate Computing Center (DKRZ)

More information

A Distributed Data- Parallel Execu3on Framework in the Kepler Scien3fic Workflow System

A Distributed Data- Parallel Execu3on Framework in the Kepler Scien3fic Workflow System A Distributed Data- Parallel Execu3on Framework in the Kepler Scien3fic Workflow System Ilkay Al(ntas and Daniel Crawl San Diego Supercomputer Center UC San Diego Jianwu Wang UMBC WorDS.sdsc.edu Computa3onal

More information

From the latency to the throughput age. Prof. Jesús Labarta Director Computer Science Dept (BSC) UPC

From the latency to the throughput age. Prof. Jesús Labarta Director Computer Science Dept (BSC) UPC From the latency to the throughput age Prof. Jesús Labarta Director Computer Science Dept (BSC) UPC ETP4HPC Post-H2020 HPC Vision Frankfurt, June 24 th 2018 To exascale... and beyond 2 Vision The multicore

More information

Day 3: Diagnostics and Output

Day 3: Diagnostics and Output Day 3: Diagnostics and Output Adam Phillips Climate Variability Working Group Liaison CGD/NCAR Thanks to Dennis Shea, Andrew Gettelman, and Christine Shields for their assistance Outline Day 3: Diagnostics

More information

Common Infrastructure for Modeling Earth (CIME) and MOM6. Mariana Vertenstein CESM Software Engineering Group

Common Infrastructure for Modeling Earth (CIME) and MOM6. Mariana Vertenstein CESM Software Engineering Group Common Infrastructure for Modeling Earth (CIME) and MOM6 Mariana Vertenstein CESM Software Engineering Group Outline What is CIME? New CIME coupling infrastructure and MOM6 CESM2/DART Data Assimilation

More information