What Can a Small Country Do? The MeteoSwiss Implementation of the COSMO Suite on the Cray XT4

Size: px
Start display at page:

Download "What Can a Small Country Do? The MeteoSwiss Implementation of the COSMO Suite on the Cray XT4"

Transcription

1 Eidgenössisches Departement des Innern EDI Bundesamt für Meteorologie und Klimatologie MeteoSchweiz What Can a Small Country Do? The MeteoSwiss Implementation of the COSMO Suite on the Cray XT4 13th ECMWF Workshop on the Use of HPC in Meteorology November 2008

2 MeteoSwiss Model Setup ECMWF IFS (global) 25km, 91 layers 2 x 240h per day + 2 x 78h per day ECMWF IFS COSMO-7 COSMO-7 (regional) 6.6km, 60 layers, 393 x 338 grid points 2 x 72h per day COSMO-2 COSMO-2 (local) 2.2km, 60 layers, 520 x 350 grid points 8 x 24h per day 2

3 Computing Facilities Swiss National Supercomputing Centre (CSCS) in Manno, Ticino 3

4 HPC Platforms Production system: Cray XT4 buin 16 service nodes (AMD Opteron dual core, 2.6 GHz) 448 compute nodes (AMD Opteron dual core, 2.6 GHz) Special purpose purchase of CSCS to guarantee the high level of availability needed by MeteoSwiss Reserved usage by MeteoSwiss during operational time slots Failover system: Cray XT3 palu 32 service nodes (AMD Opteron dual core, 2.6 GHz) 1664 compute nodes (AMD Opteron dual core, 2.6 GHz) Shared machine, also used for development UNICOS/lc operating system (Linux on service nodes, Catamount on compute nodes) Lustre parallel file system 4

5 COSMO Scalability on the CRAY XT4 Code: FORTRAN90, MPI-only parallelization Compiler: Portland Group 24h COSMO-2 forecast NX NY NIO #Cores t in s Speed-Up Measured Theoretical Operational setting: #Cores 390 Gflops sustained (9 % of peak) (Data: J.-G. Piccinali, CSCS) 5

6 Data Flow ECMWF CSCS Cray Compute Nodes Cray Service Nodes fieldextra ASCII & GRIB products Metview IFS INT2LM COSMO IDL Plots trajek LPDM Concentrations OBS 6

7 Hardware and Communications 7

8 Queuing PBS Pro workload management system YOD parallel execution library under Catamount Cray Compute Nodes advance reservation Permitted exclusively for MeteoSwiss operators Used for operational computations Cray Service Nodes post-processing queue Permitted solely for MeteoSwiss operators Used for operational post-processing high-priority queue Permitted solely for MeteoSwiss operators and further special MeteoSwiss users Higher priority than normal queue Used to run model test chains Backup in case of advance reservation failure normal queue Permitted for all users 8

9 Production Scheme Assimilation cycle UTC h Forecasts COSMO-7 COSMO h.. +72h 9

10 Production Scheme UTC h 12 UTC cycle 06 UTC cycle 09 UTC cycle 15 UTC cycle 18 UTC cycle 21 UTC cycle 00 UTC cycle 03 UTC cycle 3h 3h 3h 3h 3h 3h 3h 3h Long production cycle: Elapsed time in min COSMO-7 assimilation COSMO-7 forecast COSMO-7 TC products COSMO-2 assimilation COSMO-2 forecast COSMO-2 TC products 3h assimilation (21 UTC) 0-24h forecast (00 UTC) and TC products 3h assimilation (21 UTC) 0-24h forecast (00 UTC) and TC products 25-72h forecast (00 UTC) and TC products 10

11 Production Scheme UTC h 12 UTC cycle 06 UTC cycle 09 UTC cycle 15 UTC cycle 18 UTC cycle 21 UTC cycle 00 UTC cycle 03 UTC cycle 3h 3h 3h 3h 3h 3h 3h 3h Short production cycle: Elapsed time in min COSMO-7 assimilation COSMO-7 forecast COSMO-2 assimilation COSMO-2 forecast COSMO-2 TC products 3h assimilation (00 UTC) 0-24h forecast (03 UTC) 3h assimilation (00 UTC) 0-24h forecast (03 UTC) and TC products 25-54h forecast (03 UTC) 11

12 The COSMO Package Steering software, comprising ~50 shell and Ruby scripts Running on Cray service node Key features Starter with task schedule list input, running as cron job Concurrent run of supervising spy and task processing Time-critical and non-time-critical part of task execution Concurrent model integration, post-processing, and dissemination in time-critical part Inter-process-communication via files 12

13 The COSMO Package Time Time-critical (TC) Non-time-critical (NTC) 13

14 The COSMO Package lm_starter.rb lm_for Root script Task start.taskrunning lm_spy Task supervision lm_integ Task processing <pid>.run <pid>.run lm_bufget <pid>.end <pid>.end lm_arget <pid>.run <pid>.end lm_jobctrl. rb TC <pid>.run lm_lm2lm <pid>.run lm_f <pid>.run Task supervision Information: periodically collect all messages for log file and mail messages Control: periodically check disk and abnormal exit of package modules, using <pid>.run/<pid>.end file mechanism, check for.taskend Final actions: task status, post-mortem actions, and housekeeping 14

15 The COSMO Package Root script.taskends lm_spy lm_integ <pid>.run lm_bufput NTC lm_arput lm_acct lm_integ.post 15

16 The COSMO Package lm_jobctrl. rb TC lm_lm2lm lm_f Job control and queue handling suite.nqs lm2lm.nqs lm_f.nqs.fromlm_f reservation post-processing queue lm_postproc_f TC lm_plot_f TC lm_fieldextra_f TC Sliced processing, using model output list lm_plot.kern plot_xx_f_tc.nqs fldxtr_f_tc.nqs lm_loops lm_diss TC lm_mhs lm_scp lm_rcp lm_tlisubmit Synchronization point TC 16

17 Reliability Statistics 6 months period Apr-Sep 2008: 366 times +72h COSMO-7 production: (96% reliability) #Total #CSCS problems #COSMO package delay TC end > 12h h < delay TC end <12h h < delaytc end < 5h TC end OK, but missing products times +24h COSMO-2 production: (97% reliability) #Total #CSCS problems #COSMO package delay TC end > 3h h < delay TC end <3h TC end OK, but missing products

18 Many thanks for your attention! Questions? 18

An update on the COSMO- GPU developments

An update on the COSMO- GPU developments An update on the COSMO- GPU developments COSMO User Workshop 2014 X. Lapillonne, O. Fuhrer, A. Arteaga, S. Rüdisühli, C. Osuna, A. Roches and the COSMO- GPU team Eidgenössisches Departement des Innern

More information

Adapting Numerical Weather Prediction codes to heterogeneous architectures: porting the COSMO model to GPUs

Adapting Numerical Weather Prediction codes to heterogeneous architectures: porting the COSMO model to GPUs Adapting Numerical Weather Prediction codes to heterogeneous architectures: porting the COSMO model to GPUs O. Fuhrer, T. Gysi, X. Lapillonne, C. Osuna, T. Dimanti, T. Schultess and the HP2C team Eidgenössisches

More information

COSMO Software: fieldextra

COSMO Software: fieldextra Eidgenössisches Departement des Innern EDI Bundesamt für Meteorologie und Klimatologie MeteoSchweiz COSMO Software: fieldextra / MeteoSwiss Offenbach, COSMO GM, September 2016 Core development team Petra

More information

Sami Saarinen Peter Towers. 11th ECMWF Workshop on the Use of HPC in Meteorology Slide 1

Sami Saarinen Peter Towers. 11th ECMWF Workshop on the Use of HPC in Meteorology Slide 1 Acknowledgements: Petra Kogel Sami Saarinen Peter Towers 11th ECMWF Workshop on the Use of HPC in Meteorology Slide 1 Motivation Opteron and P690+ clusters MPI communications IFS Forecast Model IFS 4D-Var

More information

PLAN-E Workshop Switzerland. Welcome! September 8, 2016

PLAN-E Workshop Switzerland. Welcome! September 8, 2016 PLAN-E Workshop Switzerland Welcome! September 8, 2016 The Swiss National Supercomputing Centre Driving innovation in computational research in Switzerland Michele De Lorenzi (CSCS) PLAN-E September 8,

More information

On the 2m temperature and dew point diagnostics in the COSMO model

On the 2m temperature and dew point diagnostics in the COSMO model Eidgenössisches Departement des Innern EDI Bundesamt für Meteorologie und Klimatologie MeteoSchweiz On the 2m temperature and dew point diagnostics in the COSMO model Matteo Buzzi and M.W. Rotach MeteoSwiss

More information

Research Collection. WebParFE A web interface for the high performance parallel finite element solver ParFE. Report. ETH Library

Research Collection. WebParFE A web interface for the high performance parallel finite element solver ParFE. Report. ETH Library Research Collection Report WebParFE A web interface for the high performance parallel finite element solver ParFE Author(s): Paranjape, Sumit; Kaufmann, Martin; Arbenz, Peter Publication Date: 2009 Permanent

More information

Update on Cray Activities in the Earth Sciences

Update on Cray Activities in the Earth Sciences Update on Cray Activities in the Earth Sciences Presented to the 13 th ECMWF Workshop on the Use of HPC in Meteorology 3-7 November 2008 Per Nyberg nyberg@cray.com Director, Marketing and Business Development

More information

Physical parametrizations and OpenACC directives in COSMO

Physical parametrizations and OpenACC directives in COSMO Physical parametrizations and OpenACC directives in COSMO Xavier Lapillonne Eidgenössisches Departement des Innern EDI Bundesamt für Meteorologie und Klimatologie MeteoSchweiz Name (change on Master slide)

More information

The challenges of new, efficient computer architectures, and how they can be met with a scalable software development strategy.! Thomas C.

The challenges of new, efficient computer architectures, and how they can be met with a scalable software development strategy.! Thomas C. The challenges of new, efficient computer architectures, and how they can be met with a scalable software development strategy! Thomas C. Schulthess ENES HPC Workshop, Hamburg, March 17, 2014 T. Schulthess!1

More information

The Red Storm System: Architecture, System Update and Performance Analysis

The Red Storm System: Architecture, System Update and Performance Analysis The Red Storm System: Architecture, System Update and Performance Analysis Douglas Doerfler, Jim Tomkins Sandia National Laboratories Center for Computation, Computers, Information and Mathematics LACSI

More information

Porting SLURM to the Cray XT and XE. Neil Stringfellow and Gerrit Renker

Porting SLURM to the Cray XT and XE. Neil Stringfellow and Gerrit Renker Porting SLURM to the Cray XT and XE Neil Stringfellow and Gerrit Renker Background Cray XT/XE basics Cray XT systems are among the largest in the world 9 out of the top 30 machines on the top500 list June

More information

NWP Test suite: Present Status. COSMO General Meeting, 7-10 Sept 2015, Wroclaw, Poland : NWP Test suite session

NWP Test suite: Present Status. COSMO General Meeting, 7-10 Sept 2015, Wroclaw, Poland : NWP Test suite session NWP Test suite: Present Status GOAL Build up a software environment to perform carefully-controlled and rigorous testing: Calculation of verification statistics for any COSMO model test version Offer necessary

More information

HTCondor on Titan. Wisconsin IceCube Particle Astrophysics Center. Vladimir Brik. HTCondor Week May 2018

HTCondor on Titan. Wisconsin IceCube Particle Astrophysics Center. Vladimir Brik. HTCondor Week May 2018 HTCondor on Titan Wisconsin IceCube Particle Astrophysics Center Vladimir Brik HTCondor Week May 2018 Overview of Titan Cray XK7 Supercomputer at Oak Ridge Leadership Computing Facility Ranked #5 by TOP500

More information

Approaches to I/O Scalability Challenges in the ECMWF Forecasting System

Approaches to I/O Scalability Challenges in the ECMWF Forecasting System Approaches to I/O Scalability Challenges in the ECMWF Forecasting System PASC 16, June 9 2016 Florian Rathgeber, Simon Smart, Tiago Quintino, Baudouin Raoult, Stephan Siemen, Peter Bauer Development Section,

More information

Scalable Computing at Work

Scalable Computing at Work CRAY XT4 DATASHEET Scalable Computing at Work Cray XT4 Supercomputer Introducing the latest generation massively parallel processor (MPP) system from Cray the Cray XT4 supercomputer. Building on the success

More information

The Cray Rainier System: Integrated Scalar/Vector Computing

The Cray Rainier System: Integrated Scalar/Vector Computing THE SUPERCOMPUTER COMPANY The Cray Rainier System: Integrated Scalar/Vector Computing Per Nyberg 11 th ECMWF Workshop on HPC in Meteorology Topics Current Product Overview Cray Technology Strengths Rainier

More information

Cray RS Programming Environment

Cray RS Programming Environment Cray RS Programming Environment Gail Alverson Cray Inc. Cray Proprietary Red Storm Red Storm is a supercomputer system leveraging over 10,000 AMD Opteron processors connected by an innovative high speed,

More information

NCEP HPC Transition. 15 th ECMWF Workshop on the Use of HPC in Meteorology. Allan Darling. Deputy Director, NCEP Central Operations

NCEP HPC Transition. 15 th ECMWF Workshop on the Use of HPC in Meteorology. Allan Darling. Deputy Director, NCEP Central Operations NCEP HPC Transition 15 th ECMWF Workshop on the Use of HPC Allan Darling Deputy Director, NCEP Central Operations WCOSS NOAA Weather and Climate Operational Supercomputing System CURRENT OPERATIONAL CHALLENGE

More information

Deutscher Wetterdienst

Deutscher Wetterdienst Accelerating Work at DWD Ulrich Schättler Deutscher Wetterdienst Roadmap Porting operational models: revisited Preparations for enabling practical work at DWD My first steps with the COSMO on a GPU First

More information

What SMT can do for You. John Hague, IBM Consultant Oct 06

What SMT can do for You. John Hague, IBM Consultant Oct 06 What SMT can do for ou John Hague, IBM Consultant Oct 06 100.000 European Centre for Medium Range Weather Forecasting (ECMWF): Growth in HPC performance 10.000 teraflops sustained 1.000 0.100 0.010 VPP700

More information

NVIDIA Update and Directions on GPU Acceleration for Earth System Models

NVIDIA Update and Directions on GPU Acceleration for Earth System Models NVIDIA Update and Directions on GPU Acceleration for Earth System Models Stan Posey, HPC Program Manager, ESM and CFD, NVIDIA, Santa Clara, CA, USA Carl Ponder, PhD, Applications Software Engineer, NVIDIA,

More information

Deutscher Wetterdienst

Deutscher Wetterdienst Porting Operational Models to Multi- and Many-Core Architectures Ulrich Schättler Deutscher Wetterdienst Oliver Fuhrer MeteoSchweiz Xavier Lapillonne MeteoSchweiz Contents Strong Scalability of the Operational

More information

OPERATING SYSTEM. Functions of Operating System:

OPERATING SYSTEM. Functions of Operating System: OPERATING SYSTEM Introduction: An operating system (commonly abbreviated to either OS or O/S) is an interface between hardware and user. OS is responsible for the management and coordination of activities

More information

A PCIe Congestion-Aware Performance Model for Densely Populated Accelerator Servers

A PCIe Congestion-Aware Performance Model for Densely Populated Accelerator Servers A PCIe Congestion-Aware Performance Model for Densely Populated Accelerator Servers Maxime Martinasso, Grzegorz Kwasniewski, Sadaf R. Alam, Thomas C. Schulthess, Torsten Hoefler Swiss National Supercomputing

More information

BC-EPS Generating boundary values for the COSMO-DE-EPS

BC-EPS Generating boundary values for the COSMO-DE-EPS MetOpSys: BC-EPS 1 BC-EPS Generating boundary values for the COSMO-DE-EPS Helmut Frank Helmut.Frank@dwd.de including modified slides from S. Theis, T. Hanisch, D. Majewski MetOpSys: BC-EPS 2 Introduction

More information

The Effect of Page Size and TLB Entries on Application Performance

The Effect of Page Size and TLB Entries on Application Performance The Effect of Page Size and TLB Entries on Application Performance Neil Stringfellow CSCS Swiss National Supercomputing Centre June 5, 2006 Abstract The AMD Opteron processor allows for two page sizes

More information

The next generation supercomputer. Masami NARITA, Keiichi KATAYAMA Numerical Prediction Division, Japan Meteorological Agency

The next generation supercomputer. Masami NARITA, Keiichi KATAYAMA Numerical Prediction Division, Japan Meteorological Agency The next generation supercomputer and NWP system of JMA Masami NARITA, Keiichi KATAYAMA Numerical Prediction Division, Japan Meteorological Agency Contents JMA supercomputer systems Current system (Mar

More information

ACCRE High Performance Compute Cluster

ACCRE High Performance Compute Cluster 6 중 1 2010-05-16 오후 1:44 Enabling Researcher-Driven Innovation and Exploration Mission / Services Research Publications User Support Education / Outreach A - Z Index Our Mission History Governance Services

More information

CRAY XK6 REDEFINING SUPERCOMPUTING. - Sanjana Rakhecha - Nishad Nerurkar

CRAY XK6 REDEFINING SUPERCOMPUTING. - Sanjana Rakhecha - Nishad Nerurkar CRAY XK6 REDEFINING SUPERCOMPUTING - Sanjana Rakhecha - Nishad Nerurkar CONTENTS Introduction History Specifications Cray XK6 Architecture Performance Industry acceptance and applications Summary INTRODUCTION

More information

Regression Testing on Petaflop Computational Resources. CUG 2010, Edinburgh Mike McCarty Software Developer May 27, 2010

Regression Testing on Petaflop Computational Resources. CUG 2010, Edinburgh Mike McCarty Software Developer May 27, 2010 Regression Testing on Petaflop Computational Resources CUG 2010, Edinburgh Mike McCarty Software Developer May 27, 2010 Additional Authors Troy Baer (NICS) Lonnie Crosby (NICS) Outline What is NICS and

More information

HPC Technology Update Challenges or Chances?

HPC Technology Update Challenges or Chances? HPC Technology Update Challenges or Chances? Swiss Distributed Computing Day Thomas Schoenemeyer, Technology Integration, CSCS 1 Move in Feb-April 2012 1500m2 16 MW Lake-water cooling PUE 1.2 New Datacenter

More information

ECMWF's Next Generation IO for the IFS Model and Product Generation

ECMWF's Next Generation IO for the IFS Model and Product Generation ECMWF's Next Generation IO for the IFS Model and Product Generation Future workflow adaptations Tiago Quintino, B. Raoult, S. Smart, A. Bonanni, F. Rathgeber, P. Bauer ECMWF tiago.quintino@ecmwf.int ECMWF

More information

REQUEST FOR A SPECIAL PROJECT

REQUEST FOR A SPECIAL PROJECT REQUEST FOR A SPECIAL PROJECT 2018 2020 MEMBER STATE: Germany, Greece, Italy This form needs to be submitted via the relevant National Meteorological Service. Principal Investigator 1 Amalia Iriza (NMA,Romania)

More information

OBTAINING AN ACCOUNT:

OBTAINING AN ACCOUNT: HPC Usage Policies The IIA High Performance Computing (HPC) System is managed by the Computer Management Committee. The User Policies here were developed by the Committee. The user policies below aim to

More information

OpenFOAM Scaling on Cray Supercomputers Dr. Stephen Sachs GOFUN 2017

OpenFOAM Scaling on Cray Supercomputers Dr. Stephen Sachs GOFUN 2017 OpenFOAM Scaling on Cray Supercomputers Dr. Stephen Sachs GOFUN 2017 Safe Harbor Statement This presentation may contain forward-looking statements that are based on our current expectations. Forward looking

More information

Moab Workload Manager on Cray XT3

Moab Workload Manager on Cray XT3 Moab Workload Manager on Cray XT3 presented by Don Maxwell (ORNL) Michael Jackson (Cluster Resources, Inc.) MOAB Workload Manager on Cray XT3 Why MOAB? Requirements Features Support/Futures 2 Why Moab?

More information

Compute Node Linux: Overview, Progress to Date & Roadmap

Compute Node Linux: Overview, Progress to Date & Roadmap Compute Node Linux: Overview, Progress to Date & Roadmap David Wallace Cray Inc ABSTRACT: : This presentation will provide an overview of Compute Node Linux(CNL) for the CRAY XT machine series. Compute

More information

Introduction to ECMWF resources:

Introduction to ECMWF resources: Introduction to ECMWF resources: Computing and archive services. and how to access them Paul Dando User Support Paul.Dando@ecmwf.int advisory@ecmwf.int University of Reading - 23 January 2014 ECMWF Slide

More information

Large Scale Visualization on the Cray XT3 Using ParaView

Large Scale Visualization on the Cray XT3 Using ParaView Large Scale Visualization on the Cray XT3 Using ParaView Cray User s Group 2008 May 8, 2008 Kenneth Moreland David Rogers John Greenfield Sandia National Laboratories Alexander Neundorf Technical University

More information

ECMWF New Users Metview Tutorial

ECMWF New Users Metview Tutorial ECMWF New Users Metview Tutorial Author: Date: URL: Iain Russell 06-Mar-2014 08:43 https://software.ecmwf.int/wiki/display/metv/ecmwf+new+users+metview+tutorial 1 of 12 Table of Contents 1 Preparation

More information

PBS PROFESSIONAL VS. MICROSOFT HPC PACK

PBS PROFESSIONAL VS. MICROSOFT HPC PACK PBS PROFESSIONAL VS. MICROSOFT HPC PACK On the Microsoft Windows Platform PBS Professional offers many features which are not supported by Microsoft HPC Pack. SOME OF THE IMPORTANT ADVANTAGES OF PBS PROFESSIONAL

More information

Cray XD1 Supercomputer Release 1.3 CRAY XD1 DATASHEET

Cray XD1 Supercomputer Release 1.3 CRAY XD1 DATASHEET CRAY XD1 DATASHEET Cray XD1 Supercomputer Release 1.3 Purpose-built for HPC delivers exceptional application performance Affordable power designed for a broad range of HPC workloads and budgets Linux,

More information

Using Quality of Service for Scheduling on Cray XT Systems

Using Quality of Service for Scheduling on Cray XT Systems Using Quality of Service for Scheduling on Cray XT Systems Troy Baer HPC System Administrator National Institute for Computational Sciences, University of Tennessee Outline Introduction Scheduling Cray

More information

ECMWF s Next Generation IO for the IFS Model

ECMWF s Next Generation IO for the IFS Model ECMWF s Next Generation IO for the Model Part of ECMWF s Scalability Programme Tiago Quintino, B. Raoult, P. Bauer ECMWF tiago.quintino@ecmwf.int ECMWF January 14, 2016 ECMWF s HPC Targets What do we do?

More information

Our new HPC-Cluster An overview

Our new HPC-Cluster An overview Our new HPC-Cluster An overview Christian Hagen Universität Regensburg Regensburg, 15.05.2009 Outline 1 Layout 2 Hardware 3 Software 4 Getting an account 5 Compiling 6 Queueing system 7 Parallelization

More information

HPCF Cray Phase 2. User Test period. Cristian Simarro User Support. ECMWF April 18, 2016

HPCF Cray Phase 2. User Test period. Cristian Simarro User Support. ECMWF April 18, 2016 HPCF Cray Phase 2 User Test period Cristian Simarro User Support advisory@ecmwf.int ECMWF April 18, 2016 Content Introduction Upgrade timeline Changes Hardware Software Steps for the testing on CCB Possible

More information

Deutscher Wetterdienst. Ulrich Schättler Deutscher Wetterdienst Research and Development

Deutscher Wetterdienst. Ulrich Schättler Deutscher Wetterdienst Research and Development Deutscher Wetterdienst COSMO, ICON and Computers Ulrich Schättler Deutscher Wetterdienst Research and Development Contents Problems of the COSMO-Model on HPC architectures POMPA and The ICON Model Outlook

More information

GPU Consideration for Next Generation Weather (and Climate) Simulations

GPU Consideration for Next Generation Weather (and Climate) Simulations GPU Consideration for Next Generation Weather (and Climate) Simulations Oliver Fuhrer 1, Tobias Gisy 2, Xavier Lapillonne 3, Will Sawyer 4, Ugo Varetto 4, Mauro Bianco 4, David Müller 2, and Thomas C.

More information

Design and Evaluation of a 2048 Core Cluster System

Design and Evaluation of a 2048 Core Cluster System Design and Evaluation of a 2048 Core Cluster System, Torsten Höfler, Torsten Mehlan and Wolfgang Rehm Computer Architecture Group Department of Computer Science Chemnitz University of Technology December

More information

Batch Scheduling on XT3

Batch Scheduling on XT3 Batch Scheduling on XT3 Chad Vizino Pittsburgh Supercomputing Center Overview Simon Scheduler Design Features XT3 Scheduling at PSC Past Present Future Back to the Future! Scheduler Design

More information

Parallel & Cluster Computing. cs 6260 professor: elise de doncker by: lina hussein

Parallel & Cluster Computing. cs 6260 professor: elise de doncker by: lina hussein Parallel & Cluster Computing cs 6260 professor: elise de doncker by: lina hussein 1 Topics Covered : Introduction What is cluster computing? Classification of Cluster Computing Technologies: Beowulf cluster

More information

2014 LENOVO. ALL RIGHTS RESERVED.

2014 LENOVO. ALL RIGHTS RESERVED. 2014 LENOVO. ALL RIGHTS RESERVED. Parallel System description. Outline p775, p460 and dx360m4, Hardware and Software Compiler options and libraries used. WRF tunable parameters for scaling runs. nproc_x,

More information

Findings from real petascale computer systems with meteorological applications

Findings from real petascale computer systems with meteorological applications 15 th ECMWF Workshop Findings from real petascale computer systems with meteorological applications Toshiyuki Shimizu Next Generation Technical Computing Unit FUJITSU LIMITED October 2nd, 2012 Outline

More information

The Architecture and the Application Performance of the Earth Simulator

The Architecture and the Application Performance of the Earth Simulator The Architecture and the Application Performance of the Earth Simulator Ken ichi Itakura (JAMSTEC) http://www.jamstec.go.jp 15 Dec., 2011 ICTS-TIFR Discussion Meeting-2011 1 Location of Earth Simulator

More information

Practical Scientific Computing

Practical Scientific Computing Practical Scientific Computing Performance-optimized Programming Preliminary discussion: July 11, 2008 Dr. Ralf-Peter Mundani, mundani@tum.de Dipl.-Ing. Ioan Lucian Muntean, muntean@in.tum.de MSc. Csaba

More information

Cray XT3 for Science

Cray XT3 for Science HPCx Annual Seminar 2006 Cray XT3 for Science David Tanqueray Cray UK Limited dt@cray.com Topics Cray Introduction The Cray XT3 Cray Roadmap Some XT3 Applications Page 2 Supercomputing is all we do Sustained

More information

PROGRAMMING MODEL EXAMPLES

PROGRAMMING MODEL EXAMPLES ( Cray Inc 2015) PROGRAMMING MODEL EXAMPLES DEMONSTRATION EXAMPLES OF VARIOUS PROGRAMMING MODELS OVERVIEW Building an application to use multiple processors (cores, cpus, nodes) can be done in various

More information

Advanced Software for the Supercomputer PRIMEHPC FX10. Copyright 2011 FUJITSU LIMITED

Advanced Software for the Supercomputer PRIMEHPC FX10. Copyright 2011 FUJITSU LIMITED Advanced Software for the Supercomputer PRIMEHPC FX10 System Configuration of PRIMEHPC FX10 nodes Login Compilation Job submission 6D mesh/torus Interconnect Local file system (Temporary area occupied

More information

Evaluating the Performance and Energy Efficiency of the COSMO-ART Model System

Evaluating the Performance and Energy Efficiency of the COSMO-ART Model System Evaluating the Performance and Energy Efficiency of the COSMO-ART Model System Joseph Charles & William Sawyer (CSCS), Manuel F. Dolz (UHAM), Sandra Catalán (UJI) EnA-HPC, Dresden September 1-2, 2014 1

More information

Introduction to High Performance Computing at UEA. Chris Collins Head of Research and Specialist Computing ITCS

Introduction to High Performance Computing at UEA. Chris Collins Head of Research and Specialist Computing ITCS Introduction to High Performance Computing at UEA. Chris Collins Head of Research and Specialist Computing ITCS Introduction to High Performance Computing High Performance Computing at UEA http://rscs.uea.ac.uk/hpc/

More information

Lecture 3: Intro to parallel machines and models

Lecture 3: Intro to parallel machines and models Lecture 3: Intro to parallel machines and models David Bindel 1 Sep 2011 Logistics Remember: http://www.cs.cornell.edu/~bindel/class/cs5220-f11/ http://www.piazza.com/cornell/cs5220 Note: the entire class

More information

Managing complex cluster architectures with Bright Cluster Manager

Managing complex cluster architectures with Bright Cluster Manager Managing complex cluster architectures with Bright Cluster Manager Christopher Huggins www.clustervision.com 1 About ClusterVision Specialists in Compute, Storage & Database Clusters (Tailor-Made, Turn-Key)

More information

The Earth Simulator System

The Earth Simulator System Architecture and Hardware for HPC Special Issue on High Performance Computing The Earth Simulator System - - - & - - - & - By Shinichi HABATA,* Mitsuo YOKOKAWA and Shigemune KITAWAKI The Earth Simulator,

More information

The TIDB2 Meteo Experience

The TIDB2 Meteo Experience The TIDB2 Meteo Experience Experience with the TIDB2 database interface in managing meteorological observation and forecast data João Simões ECMWF, IM (Portugal) Maria Monteiro - IM (Portugal) António

More information

Introduction to Abel/Colossus and the queuing system

Introduction to Abel/Colossus and the queuing system Introduction to Abel/Colossus and the queuing system November 14, 2018 Sabry Razick Research Infrastructure Services Group, USIT Topics First 7 slides are about us and links The Research Computing Services

More information

The Hopper System: How the Largest* XE6 in the World Went From Requirements to Reality! Katie Antypas, Tina Butler, and Jonathan Carter

The Hopper System: How the Largest* XE6 in the World Went From Requirements to Reality! Katie Antypas, Tina Butler, and Jonathan Carter The Hopper System: How the Largest* XE6 in the World Went From Requirements to Reality! Katie Antypas, Tina Butler, and Jonathan Carter CUG 2011, May 25th, 2011 1 Requirements to Reality Develop RFP Select

More information

Porting and Optimisation of UM on ARCHER. Karthee Sivalingam, NCAS-CMS. HPC Workshop ECMWF JWCRP

Porting and Optimisation of UM on ARCHER. Karthee Sivalingam, NCAS-CMS. HPC Workshop ECMWF JWCRP Porting and Optimisation of UM on ARCHER Karthee Sivalingam, NCAS-CMS HPC Workshop ECMWF JWCRP Acknowledgements! NCAS-CMS Bryan Lawrence Jeffrey Cole Rosalyn Hatcher Andrew Heaps David Hassell Grenville

More information

Introduction to HPC Using zcluster at GACRC

Introduction to HPC Using zcluster at GACRC Introduction to HPC Using zcluster at GACRC Georgia Advanced Computing Resource Center University of Georgia Zhuofei Hou, HPC Trainer zhuofei@uga.edu Outline What is GACRC? What is HPC Concept? What is

More information

Joachim Biercamp Deutsches Klimarechenzentrum (DKRZ) With input from Peter Bauer, Reinhard Budich, Sylvie Joussaume, Bryan Lawrence.

Joachim Biercamp Deutsches Klimarechenzentrum (DKRZ) With input from Peter Bauer, Reinhard Budich, Sylvie Joussaume, Bryan Lawrence. Joachim Biercamp Deutsches Klimarechenzentrum (DKRZ) With input from Peter Bauer, Reinhard Budich, Sylvie Joussaume, Bryan Lawrence. The ESiWACE project has received funding from the European Union s Horizon

More information

Just on time to face new challenges with NEC super-computer at Meteo-France

Just on time to face new challenges with NEC super-computer at Meteo-France Just on time to face new challenges with NEC super-computer at Meteo-France Agenda of the procurement Presentation of the first phase Installation phase (power supply, air cooling) Use of a porting machine

More information

Unifying Heterogeneous Resources Moab Con Scott Jackson Engineering

Unifying Heterogeneous Resources Moab Con Scott Jackson Engineering Unifying Heterogeneous Resources Moab Con 2009 Scott Jackson Engineering Overview Introduction Heterogeneous Resources w/in the Cluster Disparate Clusters -- Multiple Resource Managers Disparate Clusters

More information

Shared Object-Based Storage and the HPC Data Center

Shared Object-Based Storage and the HPC Data Center Shared Object-Based Storage and the HPC Data Center Jim Glidewell High Performance Computing BOEING is a trademark of Boeing Management Company. Computing Environment Cray X1 2 Chassis, 128 MSPs, 1TB memory

More information

cdo Data Processing (and Production) Luis Kornblueh, Uwe Schulzweida, Deike Kleberg, Thomas Jahns, Irina Fast

cdo Data Processing (and Production) Luis Kornblueh, Uwe Schulzweida, Deike Kleberg, Thomas Jahns, Irina Fast cdo Data Processing (and Production) Luis Kornblueh, Uwe Schulzweida, Deike Kleberg, Thomas Jahns, Irina Fast Max-Planck-Institut für Meteorologie, DKRZ September 24, 2014 MAX-PLANCK-GESELLSCHAFT Data

More information

News from the consortium

News from the consortium Federal Department of Home Affairs FDHA Federal Office of Meteorology and Climatology MeteoSwiss News from the consortium Swiss COSMO User Workshop 1st November 2012 COSMO users for NWP by 2012 Members

More information

Technical Computing Suite supporting the hybrid system

Technical Computing Suite supporting the hybrid system Technical Computing Suite supporting the hybrid system Supercomputer PRIMEHPC FX10 PRIMERGY x86 cluster Hybrid System Configuration Supercomputer PRIMEHPC FX10 PRIMERGY x86 cluster 6D mesh/torus Interconnect

More information

The EU-funded BRIDGE project

The EU-funded BRIDGE project from Newsletter Number 117 Autumn 2008 COMPUTING The EU-funded BRIDGE project doi:10.21957/t8axr71gg0 This article appeared in the Computing section of ECMWF Newsletter No. 117 Autumn 2008, pp. 29-32.

More information

Anne Fouilloux. Fig. 1 Use of observational data at ECMWF since CMA file structure.

Anne Fouilloux. Fig. 1 Use of observational data at ECMWF since CMA file structure. ODB (Observational Database) and its usage at ECMWF Anne Fouilloux Abstract ODB stands for Observational DataBase and has been developed at ECMWF since mid-1998 by Sami Saarinen. The main goal of ODB is

More information

Outline. March 5, 2012 CIRMMT - McGill University 2

Outline. March 5, 2012 CIRMMT - McGill University 2 Outline CLUMEQ, Calcul Quebec and Compute Canada Research Support Objectives and Focal Points CLUMEQ Site at McGill ETS Key Specifications and Status CLUMEQ HPC Support Staff at McGill Getting Started

More information

Metview and Python - what they can do for each other

Metview and Python - what they can do for each other Metview and Python - what they can do for each other Workshop on Python for Earth System Sciences, ECMWF Iain Russell, Fernando Ii, Sándor Kertész, Stephan Siemen Development Section, ECMWF ECMWF November

More information

Bright Cluster Manager Advanced HPC cluster management made easy. Martijn de Vries CTO Bright Computing

Bright Cluster Manager Advanced HPC cluster management made easy. Martijn de Vries CTO Bright Computing Bright Cluster Manager Advanced HPC cluster management made easy Martijn de Vries CTO Bright Computing About Bright Computing Bright Computing 1. Develops and supports Bright Cluster Manager for HPC systems

More information

Introduction to High-Performance Computing (HPC)

Introduction to High-Performance Computing (HPC) Introduction to High-Performance Computing (HPC) Computer components CPU : Central Processing Unit CPU cores : individual processing units within a Storage : Disk drives HDD : Hard Disk Drive SSD : Solid

More information

ReFrame: A Regression Testing Framework Enabling Continuous Integration of Large HPC Systems

ReFrame: A Regression Testing Framework Enabling Continuous Integration of Large HPC Systems ReFrame: A Regression Testing Framework Enabling Continuous Integration of Large HPC Systems HPC Advisory Council 2018 Victor Holanda, Vasileios Karakasis, CSCS Apr. 11, 2018 ReFrame in a nutshell Regression

More information

CS500 SMARTER CLUSTER SUPERCOMPUTERS

CS500 SMARTER CLUSTER SUPERCOMPUTERS CS500 SMARTER CLUSTER SUPERCOMPUTERS OVERVIEW Extending the boundaries of what you can achieve takes reliable computing tools matched to your workloads. That s why we tailor the Cray CS500 cluster supercomputer

More information

IFS migrates from IBM to Cray CPU, Comms and I/O

IFS migrates from IBM to Cray CPU, Comms and I/O IFS migrates from IBM to Cray CPU, Comms and I/O Deborah Salmond & Peter Towers Research Department Computing Department Thanks to Sylvie Malardel, Philippe Marguinaud, Alan Geer & John Hague and many

More information

Use of Common Technologies between XT and Black Widow

Use of Common Technologies between XT and Black Widow Use of Common Technologies between XT and Black Widow CUG 2006 This Presentation May Contain Some Preliminary Information, Subject To Change Agenda System Architecture Directions Software Development and

More information

File Systems for HPC Machines. Parallel I/O

File Systems for HPC Machines. Parallel I/O File Systems for HPC Machines Parallel I/O Course Outline Background Knowledge Why I/O and data storage are important Introduction to I/O hardware File systems Lustre specifics Data formats and data provenance

More information

Using EasyBuild and Continuous Integration for Deploying Scientific Applications on Large Scale Production Systems

Using EasyBuild and Continuous Integration for Deploying Scientific Applications on Large Scale Production Systems Using EasyBuild and Continuous Integration for Deploying Scientific Applications on Large HPC Advisory Council Swiss Conference Guilherme Peretti-Pezzi, CSCS April 11, 2017 Table of Contents 1. Introduction:

More information

Shared Services Canada Environment and Climate Change Canada HPC Renewal Project

Shared Services Canada Environment and Climate Change Canada HPC Renewal Project Shared Services Canada Environment and Climate Change Canada HPC Renewal Project CUG 2017 Redmond, WA, USA Deric Sullivan Alain St-Denis & Luc Corbeil May 2017 Background: SSC's HPC Renewal for ECCC Environment

More information

Sun Lustre Storage System Simplifying and Accelerating Lustre Deployments

Sun Lustre Storage System Simplifying and Accelerating Lustre Deployments Sun Lustre Storage System Simplifying and Accelerating Lustre Deployments Torben Kling-Petersen, PhD Presenter s Name Principle Field Title andengineer Division HPC &Cloud LoB SunComputing Microsystems

More information

MM5 Modeling System Performance Research and Profiling. March 2009

MM5 Modeling System Performance Research and Profiling. March 2009 MM5 Modeling System Performance Research and Profiling March 2009 Note The following research was performed under the HPC Advisory Council activities AMD, Dell, Mellanox HPC Advisory Council Cluster Center

More information

SLURM Operation on Cray XT and XE

SLURM Operation on Cray XT and XE SLURM Operation on Cray XT and XE Morris Jette jette@schedmd.com Contributors and Collaborators This work was supported by the Oak Ridge National Laboratory Extreme Scale Systems Center. Swiss National

More information

University at Buffalo Center for Computational Research

University at Buffalo Center for Computational Research University at Buffalo Center for Computational Research The following is a short and long description of CCR Facilities for use in proposals, reports, and presentations. If desired, a letter of support

More information

COSMO software fieldextra

COSMO software fieldextra Eidgenössisches Departement des Innern EDI Bundesamt für Meteorologie und Klimatologie MeteoSchweiz COSMO software fieldextra / MeteoSwiss Jerusalem, COSMO GM, September 2017 Fieldextra Core development

More information

Scaling to Petaflop. Ola Torudbakken Distinguished Engineer. Sun Microsystems, Inc

Scaling to Petaflop. Ola Torudbakken Distinguished Engineer. Sun Microsystems, Inc Scaling to Petaflop Ola Torudbakken Distinguished Engineer Sun Microsystems, Inc HPC Market growth is strong CAGR increased from 9.2% (2006) to 15.5% (2007) Market in 2007 doubled from 2003 (Source: IDC

More information

Preparing a weather prediction and regional climate model for current and emerging hardware architectures.

Preparing a weather prediction and regional climate model for current and emerging hardware architectures. Preparing a weather prediction and regional climate model for current and emerging hardware architectures. Oliver Fuhrer (MeteoSwiss), Tobias Gysi (Supercomputing Systems AG), Xavier Lapillonne (C2SM),

More information

High Performance Computing (HPC) Using zcluster at GACRC

High Performance Computing (HPC) Using zcluster at GACRC High Performance Computing (HPC) Using zcluster at GACRC On-class STAT8060 Georgia Advanced Computing Resource Center University of Georgia Zhuofei Hou, HPC Trainer zhuofei@uga.edu Outline What is GACRC?

More information

Status of the COSMO GPU version

Status of the COSMO GPU version Federal Department of Home Affairs FDHA Federal Office of Meteorology and Climatology MeteoSwiss Status of the COSMO GPU version Xavier Lapillonne Contributors in 2015 (Thanks!) Alon Shtivelman Andre Walser

More information

Big changes coming to ECMWF Product Generation system

Big changes coming to ECMWF Product Generation system Big changes coming to ECMWF Product Generation system European Working Group on Operational meteorological Workstations (EGOWS): 15-17 October 2018 Marta Gutierrez ECMWF Forecast Department Marta.Gutierrez@ecmwf.int

More information

HPC Architectures. Types of resource currently in use

HPC Architectures. Types of resource currently in use HPC Architectures Types of resource currently in use Reusing this material This work is licensed under a Creative Commons Attribution- NonCommercial-ShareAlike 4.0 International License. http://creativecommons.org/licenses/by-nc-sa/4.0/deed.en_us

More information