Using Cartesius and Lisa. Zheng Meyer-Zhao - Consultant Clustercomputing

Similar documents
Using Cartesius & Lisa

Introduction to Joker Cyber Infrastructure Architecture Team CIA.NMSU.EDU

Introduction to the NCAR HPC Systems. 25 May 2018 Consulting Services Group Brian Vanderwende

Graham vs legacy systems

Duke Compute Cluster Workshop. 3/28/2018 Tom Milledge rc.duke.edu

How to run applications on Aziz supercomputer. Mohammad Rafi System Administrator Fujitsu Technology Solutions

Introduction to GALILEO

HPC Introductory Course - Exercises

Quick Start Guide. by Burak Himmetoglu. Supercomputing Consultant. Enterprise Technology Services & Center for Scientific Computing

An Introduction to Gauss. Paul D. Baines University of California, Davis November 20 th 2012

Introduction to BioHPC

How to Use a Supercomputer - A Boot Camp

The cluster system. Introduction 22th February Jan Saalbach Scientific Computing Group

High Performance Computing (HPC) Club Training Session. Xinsheng (Shawn) Qin

Introduction to PICO Parallel & Production Enviroment

Introduction to HPC Using zcluster at GACRC

Introduction to GALILEO

Knights Landing production environment on MARCONI

Duke Compute Cluster Workshop. 11/10/2016 Tom Milledge h:ps://rc.duke.edu/

UF Research Computing: Overview and Running STATA

High Performance Computing (HPC) Using zcluster at GACRC

Quick Start Guide. by Burak Himmetoglu. Supercomputing Consultant. Enterprise Technology Services & Center for Scientific Computing

Introduction to HPC Using zcluster at GACRC

Introduction to HPC Resources and Linux

Introduction to HPC Using zcluster at GACRC

Introduction to UNIX. SURF Research Boot Camp April Jeroen Engelberts Consultant Supercomputing

Before We Start. Sign in hpcxx account slips Windows Users: Download PuTTY. Google PuTTY First result Save putty.exe to Desktop

Using a Linux System 6

For Dr Landau s PHYS8602 course

Name Department/Research Area Have you used the Linux command line?

HPC system startup manual (version 1.20)

NBIC TechTrack PBS Tutorial

Introduction to HPC2N

Introduction to High-Performance Computing (HPC)

Using Compute Canada. Masao Fujinaga Information Services and Technology University of Alberta

Introduction to GALILEO

Introduction to High-Performance Computing (HPC)

High Performance Computing Cluster Basic course

Migrating from Zcluster to Sapelo

New User Seminar: Part 2 (best practices)

Minnesota Supercomputing Institute Regents of the University of Minnesota. All rights reserved.

Introduction to GACRC Teaching Cluster

Slurm basics. Summer Kickstart June slide 1 of 49

XSEDE New User Training. Ritu Arora November 14, 2014

Introduction to GACRC Teaching Cluster PHYS8602

Image Sharpening. Practical Introduction to HPC Exercise. Instructions for Cirrus Tier-2 System

Introduction to High Performance Computing Using Sapelo2 at GACRC

UoW HPC Quick Start. Information Technology Services University of Wollongong. ( Last updated on October 10, 2011)

Minnesota Supercomputing Institute Regents of the University of Minnesota. All rights reserved.

Introduction to HPC Using the New Cluster at GACRC

Slurm and Abel job scripts. Katerina Michalickova The Research Computing Services Group SUF/USIT October 23, 2012

Introduction to GACRC Teaching Cluster

Our Workshop Environment

Slurm and Abel job scripts. Katerina Michalickova The Research Computing Services Group SUF/USIT November 13, 2013

Using Sapelo2 Cluster at the GACRC

Introduction to HPCC at MSU

SARA Overview. Walter Lioen Group Leader Supercomputing & Senior Consultant

Duke Compute Cluster Workshop. 10/04/2018 Tom Milledge rc.duke.edu

Lab 1 Introduction to UNIX and C

Introduction to RCC. January 18, 2017 Research Computing Center

Introduction to High Performance Computing and an Statistical Genetics Application on the Janus Supercomputer. Purpose

OBTAINING AN ACCOUNT:

CRUK cluster practical sessions (SLURM) Part I processes & scripts

Batch Systems & Parallel Application Launchers Running your jobs on an HPC machine

A Hands-On Tutorial: RNA Sequencing Using High-Performance Computing

Introduction to Discovery.

Using the IBM Opteron 1350 at OSC. October 19-20, 2010

Using the computational resources at the GACRC

Introduction to High Performance Computing at UEA. Chris Collins Head of Research and Specialist Computing ITCS

Our new HPC-Cluster An overview

Linux Essentials. Smith, Roderick W. Table of Contents ISBN-13: Introduction xvii. Chapter 1 Selecting an Operating System 1

Introduction to Cheyenne. 12 January, 2017 Consulting Services Group Brian Vanderwende

CSC Supercomputing Environment

KISTI TACHYON2 SYSTEM Quick User Guide

Introduction to RCC. September 14, 2016 Research Computing Center

Introduction to Unix Environment: modules, job scripts, PBS. N. Spallanzani (CINECA)

Joint High Performance Computing Exchange (JHPCE) Cluster Orientation.

Introduction. Kevin Miles. Paul Henderson. Rick Stillings. Essex Scales. Director of Research Support. Systems Engineer.

Our Workshop Environment

New User Tutorial. OSU High Performance Computing Center

A Brief Introduction to The Center for Advanced Computing

Introduction to CINECA Computer Environment

Shark Cluster Overview

Introduction to HPC Using zcluster at GACRC On-Class GENE 4220

Sherlock for IBIIS. William Law Stanford Research Computing

INTRODUCTION TO GPU COMPUTING WITH CUDA. Topi Siro

GUI Installation Manual

Linux Tutorial. Ken-ichi Nomura. 3 rd Magics Materials Software Workshop. Gaithersburg Marriott Washingtonian Center November 11-13, 2018

Submitting and running jobs on PlaFRIM2 Redouane Bouchouirbat

Outline. March 5, 2012 CIRMMT - McGill University 2

Introduction to the Cluster

HPC Workshop. Nov. 9, 2018 James Coyle, PhD Dir. Of High Perf. Computing

Introduction to Discovery.

Introduction to BioHPC

Introduction to HPC Using zcluster at GACRC

Sharpen Exercise: Using HPC resources and running parallel applications

Introduction to the ITA computer system

PARALLEL COMPUTING IN R USING WESTGRID CLUSTERS STATGEN GROUP MEETING 10/30/2017

ACEnet for CS6702 Ross Dickson, Computational Research Consultant 29 Sep 2009

Introduction to BioHPC New User Training

Transcription:

Zheng Meyer-Zhao - zheng.meyer-zhao@surfsara.nl Consultant Clustercomputing

Outline SURFsara About us What we do Cartesius and Lisa Architectures and Specifications File systems Funding Hands-on Logging in Module environment Batch system 2

About SURFsara SURFsara offers an integrated ICT research infrastructure and provides services in the areas of computing, data storage, visualization, networking, cloud and e-science. In 1971, SARA was founded as an Amsterdam computing center by the two Amsterdam universities (UvA and VU) and the current CWI Independent as of 1995 Founded Vancis in 2008 offering ICT services and ICT products to enterprises, universities, and educational and healthcare institutions As from 1 January 2013, SARA from then on SURFsara forms part of the SURF Foundation First supercomputer in The Netherlands in 1984 (Control Data Cyber 205). Hosting the national supercomputer(s) ever since. 3

SURFsara Super- and Clustercomputing 4

Supercomputers / Cartesius / Lisa What is a Supercomputer? A fast computer A large computer (memory/storage) An expensive computer (millions of for hardware, electricity and man power) Why/when do you need a Super? If your task would take months/years on a normal PC If your task requires more space (memory/storage) than available in PC Why do you, SURFsara, own two Supercomputers? Historic reasons Cartesius via NWO Lisa via UvA, VU, FOM, CWI and NWO What is the difference? Cartesius larger blocks (capability computing fewer large scale jobs) Lisa smaller blocks (capacity computing more small(er) scale jobs) Cartesius expensive expensive Lisa cheaper, but still expensive 5

Lisa / Cartesius Nodes Lisa: Total cores 7856 Total memory 26 TB Total peak pf 149 TFlop/sec Disk space 100 TB for the home file systems OS Debian Linux AMD64 OS Network Ethernet / Infiniband Cartesius: Total cores 47776 CPU + 132 GPU (incl. KNL, broadwell) Total memory 130 TB Total peak pf 1.843 PFlop/sec Disk space 180 TB home, 7.7PB project + scratch OS BullX Network Infiniband 56 Gbps, 3µs latency 6

Working on Supercomputers nowadays So, how does working on a supercomputer look like nowadays? 7

Is NOT like: 8

Also NOT like: 9

Typically it is like: 10

General Architecture 11

Cartesius & Lisa File systems (1) /home/user User home directory (quota - currently 200GB) Backed up Meant for storage of important files (sources, scripts, input and output data) Not the fastest file system /scratch Cartesius: /scratch-local & /scratch-shared (quota currently 8 TB) Lisa: /scratch (quota depends on disk size) Not backed up Meant for temporary storage (during running of a job and shortly thereafter) The fastest file systems on Cartesius & Lisa 12

Cartesius & Lisa File systems (2) /archive Connected to the tape robot (quota virtually unlimited) Backed up Meant for long term storage of files, zipped, tarred, combined into small number of files Slow especially when retrieving old data Not available to worker nodes /project Only available on Cartesius! For special projects requiring lots of space (quota as much as needed/possible) Not backed up Meant for special projects Comparable in speed with /scratch 13

Cartesius & Lisa Funding model Compute time and storage is paid for by: NWO Fully subsidized application through ISAAC peer reviewed administration and reporting involved free Your university or institute (RCCS) application through local owner or chair Take a look at the SURFsara website: https://userinfo.surfsara.nl/systems/cartesius/account 14

Questions? 15

Hands-on: Let s play! Contents Username and password Install Terminal software Login to Cartesius Look at some scripts Submit a few jobs wait for the result Analyze the results (Copy back output/results locally) 16

Hands-on : Preparation Username and Password If you don t have an sdemo account yet, please fill in the form at http://bit.ly/2j7uiut Install Terminal software For Windows users only: Install MobaXterm (installer edition) from http://mobaxterm.mobatek.net/... and, run it Linux users: Run a terminal Max OS-X users: Find Terminal, run it Install XQuartz (http://www.xquartz.org) 17

Logging in on Cartesius Windows, MobaXterm: Start a new session Hostname is cartesius.surfsara.nl (all lowercase) Username is sdemoxxx use the address received by mail Linux and mac: Start a new terminal; do (replace XXX with your actual username) ssh sdemoxxx@cartesius.surfsara.nl Use the password received by e-mail 18

Exercise 1: warm up Go to the Hands-on page at http://bit.ly/2ic6ubt Are you familiar with the following commands? 19

Exercise 2: software modules We have a lot of software installed. 1. in your browser, go to http://userinfo.surfsara.nl then systems -> Cartesius -> software, and look for programs that you use. Is there anything you already use? or would like to use? 2. on Lisa/Cartesius, use module avail module avail modulename module load modulename module unload modulename to see what software we have installed, and to load/unload a module. 20

Modules Why modules? Why modules? Environment variables are set for you, like: $PATH $LD_LIBRARY_PATH Multiple versions of software LOTS of software already available and optimized, by us, for you! 21

Exercise 3: running a job (1) 1. Use the nano text editor to create a file mycode with the following contents: 2. Create another file job.sh with the contents below: 22

Exercise 3: running a job (2) 3. submit the job with sbatch job.sh 4. use squeue u sdemoxxx to find out what your job is doing. 5. once your job is finished, use cat slurm-nnnnn.out to see the output. Discuss. 23

Lisa / Cartesius batch commands To submit a job Lisa: qsub <jobscript> Cartesius: sbatch <jobscript> Job submission returns a unique jobid. To view submitted jobs (of yourself) Lisa: qstat u <username> Cartesius: squeue u <username> To cancel submitted jobs Lisa: qdel <jobid> Cartesius: scancel <jobid> 24

Exercise 4: Run serial code on multiple cores Why? - One thin node on Cartesius has 24 cores - It costs 24 SBUs per hour even if you only use one core We want to use the computing resources efficiently. How? 25

Batch for loop `$i` is the value of `i` in the current loop `&` makes sure that each process runs at backgroud simultaneously `wait` tells the script to wait until all processes are finished View matlab/matrixmul.sh and submit the job to Cartesius 26

Exercise 5: running a wave equation 1. When logged in, go to the directory wave ; 2. Type module load openmpi module load c/intel module load hdf5 make to compile the program. 3. run the wave model with sbatch run_wave 4. use scp sdemonnn@cartesius.surfsara.nl:usingcartesius/wave/wave.h5.gif./ to copy the result animation to your own system 5. view the file in a web browser 27

That is all for today! If you have any questions, please contact us at helpdesk@surfsara.nl 28