Using the Yale HPC Clusters
1 Using the Yale HPC Clusters
Robert Bjornson
Yale Center for Research Computing, Yale University
Feb 2017
2 What is the Yale Center for Research Computing?
- Independent center under the Provost's office
- Created to support your research computing needs
- Focus is on high performance computing and storage
- 15 staff, including applications specialists and system engineers
- Available to consult with and educate users
- Manage compute clusters and support users
- Located at 160 St. Ronan St, at the corner of Edwards and St. Ronan
Robert Bjornson (Yale), Using the Yale HPC Clusters, Feb 2017
3 What is a cluster?
A cluster usually consists of a hundred to a thousand rack-mounted computers, called nodes. It has one or two login nodes that are externally accessible, but most of the nodes are compute nodes, which are only accessed from a login node via a batch queueing system (BQS).
The CPUs used in clusters may be similar to the CPU in your desktop computer, but in other respects cluster nodes are rather different:
- Linux operating system
- Command-line oriented
- Many cores (cpus) per node
- No monitors, no CD/DVD drives, no audio or video cards
- Very large distributed file system(s)
- Connected internally by a fast network
4 Why use a cluster?
Clusters are very powerful and useful, but it may take some time to get used to them. Here are some reasons to make that effort:
- Don't want to tie up your own machine for many hours or days
- Have many long-running jobs to run
- Want to run in parallel to get results quicker
- Need more disk space
- Need more memory
- Want to use software installed on the cluster
- Want to access data stored on the cluster
5 Limitations of clusters
Clusters are not the answer to all large-scale computing problems. Some of the limitations of clusters are:
- Cannot run Windows or Mac programs
- Not for persistent services (DBs or web servers)
- Not ideal for interactive, graphical tasks
- Jobs that run for weeks can be a problem (unless checkpointed)
6 Summary of Yale Clusters (Dec 2016)

                 Omega         Grace     Louise/Farnam   Ruddle
Role             FAS           FAS       LS/Med          YCGA
Total nodes
Cores/node       8             20        mostly 20
Mem/node         36 GB/48 GB   128 GB    GB
Network          QDR IB        FDR IB    10 Gb EN
File system      Lustre        GPFS      GPFS            NAS + GPFS
Batch queueing   Torque        LSF       Torque/Slurm    Torque
Duo MFA?         No            No        No/Yes          Yes

Details on each cluster here:
7 Setting up an account
Accounts are free of charge to Yale researchers. Request an account at:
After your account has been approved and created, you will receive an email describing how to access your account. This may involve setting up ssh keys or using a secure login application. Details vary by cluster.
If you need help setting up or using ssh, send an email to: hpc@yale.edu.
8 Ssh to a login node
To access any of the clusters, you must use ssh. From a Mac or Linux machine, you simply use the ssh command:
laptop$ ssh netid@omega.hpc.yale.edu
laptop$ ssh netid@grace.hpc.yale.edu
laptop$ ssh netid@louise.hpc.yale.edu
laptop$ ssh netid@farnam.hpc.yale.edu
laptop$ ssh netid@ruddle.hpc.yale.edu
From a Windows machine we recommend using PuTTY. For more information on using PuTTY see: secure-shell-for-microsoft-windows
9 Ssh key pairs
- We use key pairs instead of passwords to log into clusters
- Keys are generated as a pair: public (id_rsa.pub) and private (id_rsa)
- You should always use a passphrase to protect your private key!
- You can freely give the public key to anyone
- NEVER give the private key to anyone!
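Such a pair can be generated with ssh-keygen; a minimal sketch (the file name demo_key and the empty passphrase are only for this non-interactive demo — for real cluster keys, accept the default ~/.ssh path and always set a passphrase):

```shell
# Generate an RSA key pair into ./demo_key (private) and ./demo_key.pub (public).
# -N "" sets an empty passphrase for demo purposes only.
ssh-keygen -t rsa -b 4096 -f ./demo_key -N "" -q

# The .pub file is the one you may share; the private key never leaves your machine.
ls demo_key demo_key.pub
```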
10 Sshing to Farnam and Ruddle
Farnam and Ruddle have an additional level of ssh security, using Multi-Factor Authentication (MFA). We use Duo, the same MFA as other secure Yale sites.
Example:
ssh
Enter passphrase for key /home/bjornson/.ssh/id_rsa :
Duo two-factor login for rdb9
Enter a passcode or select one of the following options:
1. Duo Push to XXX-XXX
2. Phone call to XXX-XXX
3. SMS passcodes to XXX-XXX-9022
Passcode or option (1-3): 1
Success. Logging you in...
11 More about MFA
- Don't have a smartphone? Get a hardware device from the ITS Helpdesk
- Register another phone, e.g. your home or office phone, as a backup
See:
12 Running jobs on a cluster
Two ways to run jobs on a cluster:
Interactive:
- you request an allocation
- system grants you one or more nodes
- you are logged onto one of those nodes
- you run commands
- you exit and the system automatically releases the nodes
Batch:
- you write a job script containing commands
- you submit the script
- system grants you one or more nodes
- your script is automatically run on one of the nodes
- your script terminates and the system releases the nodes
- system sends a notification via email
13 Interactive vs. Batch
Interactive jobs:
- like a remote session
- require an active connection
- for development, debugging, or interactive environments like R and Matlab
Batch jobs:
- non-interactive
- can run many jobs simultaneously
- your best choice for production computing
14 General overview of BQS
Each BQS supports:
- Interactive allocation
- Batch submission
- Specifying the resources you need for your job
- Listing running and pending jobs
- Cancelling jobs
- Different node partitions/queues
- Priority rules
For information specific to each cluster and BQS:
15 Interactive allocations
Torque (omega):  qsub -I -q fas_devel
Torque (louise): qsub -I -q general
Torque (ruddle): qsub -I -q interactive
LSF (grace):     bsub -Is -q interactive bash
Slurm (farnam):  srun -p interactive --pty bash
You'll be logged into a compute node and can run your commands. To exit, type exit or ctrl-d.
16 Torque: Example of an interactive job
laptop$ ssh
Last login: Thu Nov 6 10:48: from akw105.cs.yale.internal
[ snipped ascii art, etc ]
login-0-0$ qsub -I -q fas_devel
qsub: waiting for job rocks.omega.hpc.yale.internal to start
qsub: job rocks.omega.hpc.yale.internal ready
compute-34-15$ cd ~/workdir
compute-34-15$ module load Apps/R/3.0.3
compute-34-15$ R --slave -f compute.r
compute-34-15$ exit
login-0-0$
17 LSF: Example of an interactive job
laptop$ ssh
Last login: Thu Nov 6 10:47: from akw105.cs.yale.internal
[ snipped ascii art, etc ]
grace1$ bsub -Is -q interactive bash
Job <358314> is submitted to queue .
<<Waiting for dispatch...>>
<<Starting on c01n01>>
c01n01$ cd ~/workdir
c01n01$ module load Apps/R/3.0.3
c01n01$ R --slave -f compute.r
c01n01$ exit
grace1$
18 Slurm: Example of an interactive job
farnam-0:~ $ srun -p interactive --pty bash
c01n01$ module load Apps/R
c01n01$ R --slave -f compute.r
c01n01$ exit
farnam-0:~ $
19 Batch jobs
Create a script wrapping your job:
- declares the resources required
- contains the command(s) to run
20 Example batch script (Torque)
#!/bin/bash
#PBS -q fas_normal
#PBS -m abe -M
cd $PBS_O_WORKDIR
module load Apps/R/3.0.3
R --slave -f compute.r
21 Example of a non-interactive/batch job (Torque)
login-0-0$ qsub batch.sh
rocks.omega.hpc.yale.internal
-bash-4.1$ qstat -1 -n
rocks.omega.hpc.yale.internal:
                                                  Req'd   Req'd       Elap
Job ID  Username  Queue     Jobname   SessID NDS TSK Memory  Time   S  Time
        sw464     fas_norm  batch.sh                         :00:00 R  00:00:34  compute-34-15/2
22 Example batch script (LSF)
The same script in LSF:
#!/bin/bash
#BSUB -q shared
# automatically in submission dir
module load Apps/R/3.0.3
R --slave -f compute.r
23 Example of a batch job (LSF)
[rdb9@grace0 ~]$ bsub < batch.sh
Job <113409> is submitted to queue <shared>.
[rdb9@grace0 ~]$ bjobs
JOBID  USER  STAT  QUEUE   FROM_HOST  EXEC_HOST  JOB_NAME   SUBMIT_TIME
       rdb9  RUN   shared  grace0     c13n12     *name;date  Sep 7 11:18
Note the use of <, required for LSF.
24 Slurm: Example of a batch script
The same script in Slurm:
#!/bin/bash
#SBATCH -p debug
#SBATCH -t 00:30:00
#SBATCH -J testjob
module load Apps/R
R --slave -f compute.r
25 Slurm: Example of a batch job
$ sbatch test.sh
Submitted batch job 42
$ squeue -j 42
JOBID  PARTITION  NAME    USER  ST  TIME  NODES  NODELIST(REASON)
42     general    my_job  rdb9  R   0:03  1      c13n10
The script runs in the current directory. Output goes to slurm-jobid.out by default, or use -o.
26 Copy scripts and data to the cluster
On Linux and Mac, use scp and rsync to copy scripts and data files between your local computer and the cluster:
laptop$ scp -r ~/workdir
laptop$ rsync -av ~/workdir
Cyberduck is a good graphical tool for Mac or Windows. For Windows, WinSCP is available from the Yale Software Library. Other Windows options include:
- Bitvise ssh
- Cygwin (a Linux emulator for Windows)
Graphical copy tools usually need configuration when used with Duo. See:
27 Summary of Torque commands (Omega/Louise/Ruddle)
Submit a job:                      qsub [opts] SCRIPT
Status of a job:                   qstat JOBID
Status of a user's jobs:           qstat -u NETID
Detailed status of a user's jobs:  qstat -1 -n -u NETID
Cancel a job:                      qdel JOBID
Get queue info:                    qstat -Q
Get node info:                     pbsnodes NODE
28 Summary of qsub options
Queue:                -q QUEUE
Process count:        -l nodes=n:ppn=m
Wall clock limit:     -l walltime=dd:hh:mm:ss
Memory limit:         -l mem=jgb
Interactive job:      -I
Interactive/X11 job:  -I -X
Job name:             -N NAME
Examples:
login-0-0$ qsub -I -q fas_normal -l mem=32gb,walltime=24:00:00
login-0-0$ qsub -q fas_long -l mem=32gb,walltime=3:00:00:00 job.sh
29 Summary of LSF commands (Grace)
Submit a job:             bsub [opts] <SCRIPT
Status of a job:          bjobs JOBID
Status of a user's jobs:  bjobs -u NETID
Cancel a job:             bkill JOBID
Get queue info:           bqueues
Get node info:            bhosts
30 Summary of LSF bsub options (Grace)
Queue:                     -q QUEUE
Process count:             -n P
Process count (one node):  -n P -R span[hosts=1]
Wall clock limit:          -W HH:MM
Memory limit:              -M M
Interactive job:           -Is bash
Interactive/X11 job:       -Is -XF bash
Job name:                  -J NAME
Examples:
grace1$ bsub -Is -q shared -M -W 24:00     # 32gb for 1 day
grace1$ bsub -q long -M -W 72:00 < job.sh  # 32gb for 3 days
31 Summary of Slurm commands (Farnam)
Submit a batch job:         sbatch [opts] SCRIPT
Submit an interactive job:  srun -p interactive --pty [opts] bash
Status of a job:            squeue -j JOBID
Status of a user's jobs:    squeue -u NETID
Cancel a job:               scancel JOBID
Get queue info:             squeue -p PART
Get node info:              sinfo -N
32 Summary of Slurm sbatch/srun options (Farnam)
Partition:         -p QUEUE
Process count:     -c CPUS
Node count:        -N NODES
Wall clock limit:  -t HH:MM or DD-HH
Memory limit:      --mem-per-cpu M (MB)
Interactive job:   srun --pty bash
Job name:          -J NAME
Examples:
farnam1$ srun --pty -p general --mem-per-cpu -t 24:00 bash  # 32gb for 1 day
farnam1$ sbatch -p general --mem-per-cpu -t 72:00 job.sh    # 32gb for 3 days
33 Controlling memory usage
It's crucial to understand the memory required by your program.
- Nodes (and thus memory) are often shared
- Jobs have default memory limits that you can override
- Each BQS specifies this differently
To specify 8 cores on one node with 8 GB RAM for each core:
Torque: qsub -lnodes=1:ppn=8 -lmem=64gb t.sh
LSF:    bsub -n 8 -M t.sh
Slurm:  sbatch -c 8 --mem-per-cpu=8g t.sh
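Note that Torque's -lmem is the total across all cores, while Slurm's --mem-per-cpu is per core, which is why the same request reads 64gb in one and 8g in the other. A quick shell sanity check of that arithmetic:

```shell
# Torque wants the job total; Slurm wants per-core memory.
cores=8
mem_per_cpu_gb=8
total_gb=$((cores * mem_per_cpu_gb))
echo "request: -c ${cores} --mem-per-cpu=${mem_per_cpu_gb}g (total ${total_gb} GB)"
# prints: request: -c 8 --mem-per-cpu=8g (total 64 GB)
```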
34 Controlling walltime
- Each job has a default maximum walltime
- The job is killed if that is exceeded
- You can specify a longer walltime to prevent being killed
- You can specify a shorter walltime to get resources faster
To specify a walltime limit of 2 days:
Torque: qsub -lwalltime=2:00:00:00 t.sh
LSF:    bsub -W 48:00 t.sh
Slurm:  sbatch -t 2- t.sh
35 Submission Queues
Regardless of which cluster you are on, you will submit jobs to a particular queue, depending on:
- the job's requirements
- your access rights to particular queues
- queue rules
Each cluster has its own queues, with specific rules, such as maximum walltime, cores, nodes, jobs, etc.
36 Modules
Software is set up with modules:
$ module avail                 find modules
$ module avail name            find a particular module
$ module load name             use a module
$ module list                  show loaded modules
$ module unload name           unload a module
$ module purge                 unload all
$ module save collection       save loaded modules as a collection
$ module restore collection    restore a collection
$ module describe collection   list modules in a collection
37 Run your program/script
When you're finally ready to run your script, you may have some trouble determining the correct command line, especially if you want to pass arguments to the script. Here are some examples:
compute-20-1$ python compute.py input.dat
compute-20-1$ R --slave -f compute.r --args input.dat
compute-20-1$ matlab -nodisplay -nosplash -nojvm < compute.m
compute-20-1$ math -script compute.m
compute-20-1$ MathematicaScript -script compute.m input.dat
You can often get help from the command itself using:
compute-20-1$ matlab -help
compute-20-1$ python -h
compute-20-1$ R --help
38 Running graphical programs on compute nodes
Two different ways:
X11 forwarding:
- easy setup: ssh -Y to the cluster, then qsub -Y / bsub -XF / srun --x11
- works fine for most applications
- bogs down for very rich graphics
Remote desktop (VNC):
- more setup: allocate a node, start a VNC server there, connect via ssh tunnels
- works very well for rich graphics
39 Cluster Filesystems
Each cluster has a number of different filesystems for you to use, with different rules and performance characteristics. It is very important to understand the differences. Generally, each cluster will have:
- Home: backed up, small quota; for scripts, programs, documents, etc.
- Scratch: not backed up, automatically purged; for temporary files.
- Project: not backed up; for longer-term storage.
- Local HD (/tmp): for local scratch files.
- RAMDISK (/dev/shm): for local scratch files.
- Storage@Yale: university-wide storage (active and archive).
Consider using the local HD or ramdisk for intermediate files. Also consider avoiding files by using pipes.
For more info:
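"Avoiding files by using pipes" means chaining commands so intermediate results never touch the shared filesystem at all; a tiny illustration with made-up demo data:

```shell
# Write a small demo input file.
printf 'b\na\nb\n' > demo_input.txt

# Instead of:  sort demo_input.txt > sorted.tmp; uniq sorted.tmp
# pipe the two steps, so no intermediate file hits the filesystem:
sort demo_input.txt | uniq
# prints:
# a
# b
```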
40 Large datasets (e.g. genomes)
Please do not install your own copies of popular files (e.g. genome references). We have a number of references installed, and can install others. For example, on Ruddle: /home/bioinfo/genomes
If you don't find what you need, please ask us, and we will install it.
41 Wait, where is the Parallelism?
Qsub/bsub/sbatch can allocate multiple cores and nodes, but the script runs on one core on one node, sequentially. Simply allocating more nodes or cores DOES NOT make jobs faster.
How do we use multiple cores to increase speed? Two classes of parallelism:
- a single job parallelized (somehow)
- lots of independent sequential jobs
Some options:
- Submit many batch jobs simultaneously (not good)
- Use job arrays, or SimpleQueue (much better)
- Submit a parallel version of your program (great if you have one)
42 Job Arrays
Useful when you have many nearly identical, independent jobs to run. Starts many copies of your script, distinguished by a task id.
Submit jobs like this:
Torque: qsub -t 1-100 ...
Slurm:  sbatch --array=1-100 ...
LSF:    bsub -J "job [1-100]" ...
Your batch script uses an environment variable:
./mycommand -i input.${PBS_ARRAYID} -o output.${PBS_ARRAYID}
Other BQSs are similar.
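${PBS_ARRAYID} is set by Torque when each array task starts; setting it by hand shows what one task's command line expands to (mycommand and the input/output file names are the slide's placeholders, not real programs):

```shell
# Torque sets PBS_ARRAYID for each array task at runtime;
# here we fake task 3 just to show the variable expansion.
PBS_ARRAYID=3
echo ./mycommand -i input.${PBS_ARRAYID} -o output.${PBS_ARRAYID}
# prints: ./mycommand -i input.3 -o output.3
```

Slurm's equivalent variable is SLURM_ARRAY_TASK_ID.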
43 SimpleQueue
Useful when you have many similar, independent jobs to run. Automatically schedules jobs onto a single PBS allocation.
Advantages:
- Handles startup, shutdown, errors
- Only one batch job to keep track of
- Keeps track of the status of individual jobs
- More flexible than job arrays
44 Using SimpleQueue
1. Create a file containing the list of commands to run (jobs.list):
cd ~/mydir/myrun; prog arg1 arg2 -o job1.out
cd ~/mydir/myrun; prog arg1 arg2 -o job2.out
...
2. Create the launch script:
module load Tools/SimpleQueue
sqcreatescript -q queuename -n 4 jobs.list > run.sh
3. Submit the launch script:
qsub run.sh     # Omega, Ruddle
bsub < run.sh   # Grace
sbatch run.sh   # Farnam
For more info, see
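Writing jobs.list by hand gets tedious past a handful of entries; a loop can generate it (prog and its arguments are the slide's placeholders):

```shell
# Generate a 100-line jobs.list, one command per job.
for i in $(seq 1 100); do
  echo "cd ~/mydir/myrun; prog arg1 arg2 -o job${i}.out"
done > jobs.list

wc -l jobs.list
# prints: 100 jobs.list
```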
45 Parallel-enabled programs
Many modern programs are able to use multiple cpus on one node. Typically you specify something like -p 20 or -t 20 (see the man page). You must allocate a matching number of cores:
Torque: -lnodes=1:ppn=20
LSF:    -n 20
Slurm:  -c 20
#!/bin/bash
#SBATCH -c 20
myapp -t 20
46 MPI-enabled programs
Some programs use MPI to run across many nodes, and impressive speedups are possible. MPI programs are compiled and run under MPI, and MPI must cooperate with the BQS.
#!/bin/bash
#SBATCH -N 4 -n 20 --mem-per-cpu 4G
module load MPI/OpenMPI
mpirun ./mpiprog ...
47 Best Practices
Start slowly:
- Run your program on a small dataset interactively
- In another ssh session, watch the program with top; track memory usage
- Check outputs
- Only then, scale up the dataset and convert to a batch run
Input/Output:
- Think about input and output files, their number and size
- Should you use the local or ram filesystem?
Memory:
- Use top or /usr/bin/time -a to monitor usage
- Specify memory and walltime requirements
Be considerate! Clusters are shared resources:
- Don't run programs on the login nodes. Use a compute node.
- Don't submit a huge number of jobs. Use SimpleQueue or a job array.
- Don't do heavy IO to /home.
- Keep an eye on quotas. Don't fill filesystems.
48 Plug for scripting languages
Learning the basics of a scripting language is a great investment. It is very useful for automating lots of day-to-day activities:
- Parsing data files
- Converting file formats
- Verifying collections of files
- Creating processing pipelines
- Summarizing, etc.
Good choices:
- Python (strongly recommended)
- Bash
- Perl (if your lab uses it)
- R (if you do a lot of statistics or graphing)
49 To get help
Send an email to: hpc@yale.edu
Read documentation at:
Come to office hours: office-hours-getting-help
50 Resources
Very useful table of equivalent BQS commands:
Recommended books:
- Learning Python, Mark Lutz
- Python Cookbook, Alex Martelli
- Bioinformatics Programming Using Python, Mitchell Model
- Learning Perl, Randal Schwartz
User Guide of High Performance Computing Cluster in School of Physics Prepared by Sue Yang (xue.yang@sydney.edu.au) This document aims at helping users to quickly log into the cluster, set up the software
More informationOBTAINING AN ACCOUNT:
HPC Usage Policies The IIA High Performance Computing (HPC) System is managed by the Computer Management Committee. The User Policies here were developed by the Committee. The user policies below aim to
More informationIntroduction to Discovery.
Introduction to Discovery http://discovery.dartmouth.edu March 2014 The Discovery Cluster 2 Agenda Resource overview Logging on to the cluster with ssh Transferring files to and from the cluster The Environment
More informationIntroduction to HPC Using zcluster at GACRC
Introduction to HPC Using zcluster at GACRC Georgia Advanced Computing Resource Center University of Georgia Zhuofei Hou, HPC Trainer zhuofei@uga.edu Outline What is GACRC? What is HPC Concept? What is
More informationA Brief Introduction to The Center for Advanced Computing
A Brief Introduction to The Center for Advanced Computing May 1, 2006 Hardware 324 Opteron nodes, over 700 cores 105 Athlon nodes, 210 cores 64 Apple nodes, 128 cores Gigabit networking, Myrinet networking,
More informationIntroduction to GACRC Teaching Cluster PHYS8602
Introduction to GACRC Teaching Cluster PHYS8602 Georgia Advanced Computing Resource Center (GACRC) EITS/University of Georgia Zhuofei Hou zhuofei@uga.edu 1 Outline GACRC Overview Computing Resources Three
More informationIntroduction to HPC Using the New Cluster at GACRC
Introduction to HPC Using the New Cluster at GACRC Georgia Advanced Computing Resource Center University of Georgia Zhuofei Hou, HPC Trainer zhuofei@uga.edu Outline What is GACRC? What is the new cluster
More informationIntroduction to BioHPC
Introduction to BioHPC New User Training [web] [email] portal.biohpc.swmed.edu biohpc-help@utsouthwestern.edu 1 Updated for 2015-06-03 Overview Today we re going to cover: What is BioHPC? How do I access
More informationIntroduction to High Performance Computing (HPC) Resources at GACRC
Introduction to High Performance Computing (HPC) Resources at GACRC Georgia Advanced Computing Resource Center University of Georgia Zhuofei Hou, HPC Trainer zhuofei@uga.edu 1 Outline GACRC? High Performance
More informationIntroduction to GACRC Teaching Cluster
Introduction to GACRC Teaching Cluster Georgia Advanced Computing Resource Center (GACRC) EITS/University of Georgia Zhuofei Hou zhuofei@uga.edu 1 Outline GACRC Overview Computing Resources Three Folders
More informationSherlock for IBIIS. William Law Stanford Research Computing
Sherlock for IBIIS William Law Stanford Research Computing Overview How we can help System overview Tech specs Signing on Batch submission Software environment Interactive jobs Next steps We are here to
More informationIntroduction to RCC. September 14, 2016 Research Computing Center
Introduction to HPC @ RCC September 14, 2016 Research Computing Center What is HPC High Performance Computing most generally refers to the practice of aggregating computing power in a way that delivers
More informationIntroduction to GALILEO
November 27, 2016 Introduction to GALILEO Parallel & production environment Mirko Cestari m.cestari@cineca.it Alessandro Marani a.marani@cineca.it SuperComputing Applications and Innovation Department
More informationSlurm and Abel job scripts. Katerina Michalickova The Research Computing Services Group SUF/USIT October 23, 2012
Slurm and Abel job scripts Katerina Michalickova The Research Computing Services Group SUF/USIT October 23, 2012 Abel in numbers Nodes - 600+ Cores - 10000+ (1 node->2 processors->16 cores) Total memory
More informationQuick Guide for the Torque Cluster Manager
Quick Guide for the Torque Cluster Manager Introduction: One of the main purposes of the Aries Cluster is to accommodate especially long-running programs. Users who run long jobs (which take hours or days
More informationUsing Cartesius and Lisa. Zheng Meyer-Zhao - Consultant Clustercomputing
Zheng Meyer-Zhao - zheng.meyer-zhao@surfsara.nl Consultant Clustercomputing Outline SURFsara About us What we do Cartesius and Lisa Architectures and Specifications File systems Funding Hands-on Logging
More informationIntroduction to High-Performance Computing (HPC)
Introduction to High-Performance Computing (HPC) Computer components CPU : Central Processing Unit CPU cores : individual processing units within a Storage : Disk drives HDD : Hard Disk Drive SSD : Solid
More informationXSEDE New User Tutorial
October 20, 2017 XSEDE New User Tutorial Jay Alameda National Center for Supercomputing Applications XSEDE Training Survey Please complete a short on line survey about this module at http://bit.ly/xsedesurvey.
More informationIntroduction to RCC. January 18, 2017 Research Computing Center
Introduction to HPC @ RCC January 18, 2017 Research Computing Center What is HPC High Performance Computing most generally refers to the practice of aggregating computing power in a way that delivers much
More informationJune Workshop Series June 27th: All About SLURM University of Nebraska Lincoln Holland Computing Center. Carrie Brown, Adam Caprez
June Workshop Series June 27th: All About SLURM University of Nebraska Lincoln Holland Computing Center Carrie Brown, Adam Caprez Setup Instructions Please complete these steps before the lessons start
More informationFor Dr Landau s PHYS8602 course
For Dr Landau s PHYS8602 course Shan-Ho Tsai (shtsai@uga.edu) Georgia Advanced Computing Resource Center - GACRC January 7, 2019 You will be given a student account on the GACRC s Teaching cluster. Your
More informationICS-ACI System Basics
ICS-ACI System Basics Adam W. Lavely, Ph.D. Fall 2017 Slides available: goo.gl/ss9itf awl5173 ICS@PSU 1 Contents 1 Overview 2 HPC Overview 3 Getting Started on ACI 4 Moving On awl5173 ICS@PSU 2 Contents
More informationIntroduction to SLURM & SLURM batch scripts
Introduction to SLURM & SLURM batch scripts Anita Orendt Assistant Director Research Consulting & Faculty Engagement anita.orendt@utah.edu 16 Feb 2017 Overview of Talk Basic SLURM commands SLURM batch
More informationComputing with the Moore Cluster
Computing with the Moore Cluster Edward Walter An overview of data management and job processing in the Moore compute cluster. Overview Getting access to the cluster Data management Submitting jobs (MPI
More informationChoosing Resources Wisely Plamen Krastev Office: 38 Oxford, Room 117 FAS Research Computing
Choosing Resources Wisely Plamen Krastev Office: 38 Oxford, Room 117 Email:plamenkrastev@fas.harvard.edu Objectives Inform you of available computational resources Help you choose appropriate computational
More informationLogging in to the CRAY
Logging in to the CRAY 1. Open Terminal Cray Hostname: cray2.colostate.edu Cray IP address: 129.82.103.183 On a Mac 2. type ssh username@cray2.colostate.edu where username is your account name 3. enter
More informationChoosing Resources Wisely. What is Research Computing?
Choosing Resources Wisely Scott Yockel, PhD Harvard - Research Computing What is Research Computing? Faculty of Arts and Sciences (FAS) department that handles nonenterprise IT requests from researchers.
More informationXSEDE New User Tutorial
May 13, 2016 XSEDE New User Tutorial Jay Alameda National Center for Supercomputing Applications XSEDE Training Survey Please complete a short on-line survey about this module at http://bit.ly/hamptonxsede.
More informationCRUK cluster practical sessions (SLURM) Part I processes & scripts
CRUK cluster practical sessions (SLURM) Part I processes & scripts login Log in to the head node, clust1-headnode, using ssh and your usual user name & password. SSH Secure Shell 3.2.9 (Build 283) Copyright
More informationExercises: Abel/Colossus and SLURM
Exercises: Abel/Colossus and SLURM November 08, 2016 Sabry Razick The Research Computing Services Group, USIT Topics Get access Running a simple job Job script Running a simple job -- qlogin Customize
More informationHow to run a job on a Cluster?
How to run a job on a Cluster? Cluster Training Workshop Dr Samuel Kortas Computational Scientist KAUST Supercomputing Laboratory Samuel.kortas@kaust.edu.sa 17 October 2017 Outline 1. Resources available
More informationNBIC TechTrack PBS Tutorial. by Marcel Kempenaar, NBIC Bioinformatics Research Support group, University Medical Center Groningen
NBIC TechTrack PBS Tutorial by Marcel Kempenaar, NBIC Bioinformatics Research Support group, University Medical Center Groningen 1 NBIC PBS Tutorial This part is an introduction to clusters and the PBS
More informationA Brief Introduction to The Center for Advanced Computing
A Brief Introduction to The Center for Advanced Computing February 8, 2007 Hardware 376 Opteron nodes, over 890 cores Gigabit networking, Myrinet networking, Infiniband networking soon Hardware: nyx nyx
More informationIntroduction to High Performance Computing at UEA. Chris Collins Head of Research and Specialist Computing ITCS
Introduction to High Performance Computing at UEA. Chris Collins Head of Research and Specialist Computing ITCS Introduction to High Performance Computing High Performance Computing at UEA http://rscs.uea.ac.uk/hpc/
More informationNew User Tutorial. OSU High Performance Computing Center
New User Tutorial OSU High Performance Computing Center TABLE OF CONTENTS Logging In... 3-5 Windows... 3-4 Linux... 4 Mac... 4-5 Changing Password... 5 Using Linux Commands... 6 File Systems... 7 File
More informationA Brief Introduction to The Center for Advanced Computing
A Brief Introduction to The Center for Advanced Computing November 10, 2009 Outline 1 Resources Hardware Software 2 Mechanics: Access Transferring files and data to and from the clusters Logging into the
More informationKnights Landing production environment on MARCONI
Knights Landing production environment on MARCONI Alessandro Marani - a.marani@cineca.it March 20th, 2017 Agenda In this presentation, we will discuss - How we interact with KNL environment on MARCONI
More informationSupercomputing environment TMA4280 Introduction to Supercomputing
Supercomputing environment TMA4280 Introduction to Supercomputing NTNU, IMF February 21. 2018 1 Supercomputing environment Supercomputers use UNIX-type operating systems. Predominantly Linux. Using a shell
More informationRHRK-Seminar. High Performance Computing with the Cluster Elwetritsch - II. Course instructor : Dr. Josef Schüle, RHRK
RHRK-Seminar High Performance Computing with the Cluster Elwetritsch - II Course instructor : Dr. Josef Schüle, RHRK Overview Course I Login to cluster SSH RDP / NX Desktop Environments GNOME (default)
More informationIntroduction to SLURM on the High Performance Cluster at the Center for Computational Research
Introduction to SLURM on the High Performance Cluster at the Center for Computational Research Cynthia Cornelius Center for Computational Research University at Buffalo, SUNY 701 Ellicott St Buffalo, NY
More informationIntroduction to the Cluster
Introduction to the Cluster Advanced Computing Center for Research and Education http://www.accre.vanderbilt.edu Follow us on Twitter for important news and updates: @ACCREVandy The Cluster We will be
More informationIntroduction Workshop 11th 12th November 2013
Introduction Workshop 11th 12th November Lecture II: Access and Batchsystem Dr. Andreas Wolf Gruppenleiter Hochleistungsrechnen Hochschulrechenzentrum Overview Access and Requirements Software packages
More informationIntroduction to HPC Resources and Linux
Introduction to HPC Resources and Linux Burak Himmetoglu Enterprise Technology Services & Center for Scientific Computing e-mail: bhimmetoglu@ucsb.edu Paul Weakliem California Nanosystems Institute & Center
More informationGetting started with the CEES Grid
Getting started with the CEES Grid October, 2013 CEES HPC Manager: Dennis Michael, dennis@stanford.edu, 723-2014, Mitchell Building room 415. Please see our web site at http://cees.stanford.edu. Account
More informationThe RWTH Compute Cluster Environment
The RWTH Compute Cluster Environment Tim Cramer 29.07.2013 Source: D. Both, Bull GmbH Rechen- und Kommunikationszentrum (RZ) The RWTH Compute Cluster (1/2) The Cluster provides ~300 TFlop/s No. 32 in TOP500
More informationHow to Use a Supercomputer - A Boot Camp
How to Use a Supercomputer - A Boot Camp Shelley Knuth Peter Ruprecht shelley.knuth@colorado.edu peter.ruprecht@colorado.edu www.rc.colorado.edu Outline Today we will discuss: Who Research Computing is
More informationIntroduction to SLURM & SLURM batch scripts
Introduction to SLURM & SLURM batch scripts Anita Orendt Assistant Director Research Consulting & Faculty Engagement anita.orendt@utah.edu 6 February 2018 Overview of Talk Basic SLURM commands SLURM batch
More informationIntroduction to CINECA Computer Environment
Introduction to CINECA Computer Environment Today you will learn... Basic commands for UNIX environment @ CINECA How to submitt your job to the PBS queueing system on Eurora Tutorial #1: Example: launch
More informationName Department/Research Area Have you used the Linux command line?
Please log in with HawkID (IOWA domain) Macs are available at stations as marked To switch between the Windows and the Mac systems, press scroll lock twice 9/27/2018 1 Ben Rogers ITS-Research Services
More informationMinnesota Supercomputing Institute Regents of the University of Minnesota. All rights reserved.
Minnesota Supercomputing Institute Introduction to MSI Systems Andrew Gustafson The Machines at MSI Machine Type: Cluster Source: http://en.wikipedia.org/wiki/cluster_%28computing%29 Machine Type: Cluster
More information