HP Storage and UMCG

Similar documents
Deploying (community) codes. Martin Čuma Center for High Performance Computing University of Utah

Writing Easyconfig Files: The Basics

HPC at UZH: status and plans

Stable Cray Support in EasyBuild 2.7. Petar Forai

INTRODUCTION TO THE CLUSTER

Combining CVMFS, Nix, Lmod, and EasyBuild at Compute Canada. Bart Oldeman, McGill HPC, Calcul Québec, Compute Canada

Genius Quick Start Guide

Guillimin HPC Users Meeting February 11, McGill University / Calcul Québec / Compute Canada Montréal, QC Canada

Introduction to High-Performance Computing (HPC)

Introduction to High-Performance Computing (HPC)

Lmod. Robert McLay. Jan. 11, The Texas Advanced Computing Center

EasyBuild on Cray Linux Environment (WIP) Petar Forai

An introduction to checkpointing. for scientifc applications

Rhinoback Online Backup. In-File Delta

Using EasyBuild and Continuous Integration for Deploying Scientific Applications on Large Scale Production Systems

The Why and How of HPC-Cloud Hybrids with OpenStack

Restoring data from a backup

Supercomputing environment TMA4280 Introduction to Supercomputing

Introduction to PICO Parallel & Production Enviroment

The cluster system. Introduction 22th February Jan Saalbach Scientific Computing Group

Cerebro Quick Start Guide

My operating system is old but I don't care : I'm using NIX! B.Bzeznik BUX meeting, Vilnius 22/03/2016

New High Performance Computing Cluster For Large Scale Multi-omics Data Analysis. 28 February 2018 (Wed) 2:30pm 3:30pm Seminar Room 1A, G/F

Backup and Restore Operations

HP Data Protector A disaster recovery support for Microsoft Windows 7 and Windows Server 2008 R2

Site presentation: CSCS

UAntwerpen, 24 June 2016

AWS Administration. Suggested Pre-requisites Basic IT Knowledge

Introduction to the NCAR HPC Systems. 25 May 2018 Consulting Services Group Brian Vanderwende

Course 6231A: Maintaining a Microsoft SQL Server 2008 Database

Sap Content Server For Windows Installation >>>CLICK HERE<<<

Simplifying the contribution process for both contributors & maintainers

Running Galaxy in an HPC environment requirements, challenges and some solutions : the LIFEPORTAL

FILE / RMAN BACKUP PRODUCTS MANUAL EBOOK

Course 6231A: Maintaining a Microsoft SQL Server 2008 Database

Introduction to Discovery.

Modules. Help users managing their shell environment. Xavier Delaruelle UST4HPC May 15th 2018, Villa Clythia, Fréjus

The modules covered in this course are:

MongoDB in AWS (MongoDB as a DBaaS)

FUJITSU Cloud Service K5 CF Service Functional Overview

Texas A&M AgriLife Research Procedures

Graham vs legacy systems

Replace Single Server or Cluster

Introduction to Discovery.

Groom synchronization with nzbackup

Challenges in making Lustre systems reliable

s390 zlinux at Citi Presented by Doctor P. Robinson June 5, 2013 Hillgang Citi Managing zlinux in a Heterogenous Enterprise

Disaster Recovery System Administration Guide for Cisco Unified Communications Manager Release 8.0(2)

Windows Server 2016 MCSA Bootcamp

Maintaining a Microsoft SQL Server 2008 Database (Course 6231A)

Python ecosystem for scientific computing with ABINIT: challenges and opportunities. M. Giantomassi and the AbiPy group

Quelling the Clamor for Containers. Vanessa Borcherding Director, Scientific Computing Unit Weill Cornell Medicine

The safer, easier way to help you pass any IT exams. Exam : Administering Microsoft SQL Server 2012 Databases.

What is Research Computing?

Introduction to Abel/Colossus and the queuing system

Modules and Software. Daniel Caunt Harvard FAS Research Computing

Choosing Resources Wisely. What is Research Computing?

Cisco Unified CM Disaster Recovery System

SQL SERVER DBA TRAINING IN BANGALORE

ExpressCluster X SingleServerSafe for Windows. Quick Start Guide for Microsoft SQL Server. (Installation & Configuration Guide)

Design Patterns for the Cloud. MCSN - N. Tonellotto - Distributed Enabling Platforms 68

This option lets you reset the password that you use to log in if you do not remember it. To change the password,

OUR CUSTOMER TERMS CLOUD SERVICES - INFRASTRUCTURE

Task-based distributed processing for radio-interferometric imaging with CASA

TOSS - A RHEL-based Operating System for HPC Clusters

Modern Scientific Software Management using EasyBuild & co

Redundancy. Cisco Unified Communications Manager Redundancy Groups CHAPTER

High Performance Computing Cluster Basic course

BACKUP RECOVERY MANAGEMENT

Google Cloud Platform for Systems Operations Professionals (CPO200) Course Agenda

EasyBuild + Nix + ComputeCanada. Bart Oldeman, McGill HPC, Calcul Québec, Compute Canada

HPC Introductory Course - Exercises

Manual Backup Sql Server Express 2005 Automatically

Introduction to Computer Systems and Operating Systems

Choosing Resources Wisely Plamen Krastev Office: 38 Oxford, Room 117 FAS Research Computing

ECM583 Special Topics in Computer Systems

ExpressCluster X R3 WAN Edition for Windows

Overview. What are community packages? Who installs what? How to compile and install? Setup at FSU RCC. Using RPMs vs regular install

Operating Systems, Fall

ReFrame: A Regression Testing Framework Enabling Continuous Integration of Large HPC Systems

Running Applications on The Sheffield University HPC Clusters

Wong Tze Chuan General Manager. Gadget Wearable Tech (M) Sdn Bhd

An introduction to checkpointing. for scientific applications

Upgrading the Server Software

Introduction to Joker Cyber Infrastructure Architecture Team CIA.NMSU.EDU

Welcome! Considering a Warm Disaster Recovery Site?

High Performance Computing Cluster Advanced course

Linux Clusters Institute:

ExpressCluster X 2.0 for Linux

Experiences with HP SFS / Lustre in HPC Production

Perceptive Intelligent Capture

CloudShell 7.1 GA. Installation Guide. Release Date: September Document Version: 2.0

Microsoft Windows Operating System Fundamentals. Version: 10.0

SUSE Linux Enterprise Server 12 Modules

CS/Math 471: Intro. to Scientific Computing

Using a Linux System 6

Vienna Scientific Cluster: Problems and Solutions

Install your scientific software stack easily with Spack

File systems: management 1

Course Content of MCSA ( Microsoft Certified Solutions Associate )

Transcription:

HP Storage and Computing @ UMCG Pieter Neerincx Genomics Coordination Center UMCG SURF-DTL SIG Compute for life science reseh April 22 2015 Utrecht 1

Topics Expectation Management Shared lab / kitchen / cluster = shared responsibility Disaster Recovery Failover vs. Fallback Data Management Dependency Management

Disaster Recovery Failover vs. Fallback PBS/Torque PBS/Torque SLURM scheduler scheduler scheduler 10 nodes 5 nodes 5 nodes UI servers 10 nodes storage homes storage HA storage HP tmp storage HP tmp storage HP tmp GPFS GPFS Lustre

Data Management: Why There are 10 kinds of people: Those who have lost data and those who will loose data Make backups Backup window Restore window Costs

Data Management: Why Traceability Reproducibility Continuity December 2015: All reseh @ UMCG ISO9001 certified Volatile CPUs Mem Network Non-volatile Storage

user 1 home

user 1 home -- Never ++++ Never ++ Never +++++ Daily: 3+ months old Quota Backup Auto Clean

home user 1 SFTP UI Nodes

Data Manager Policy differs per group DM reviews documented data when moving to / No DM One dedicated DM Everybody is also DM Everybody is also DM, but you are not allowed to review your own documented data

Dependency Management Runtime environment modules / Lmod / other implementations Modifies environment Analysis scripts No hardcoded paths to software No environment variables for software defined (only used) Portable Deploytime (Download-Decompress-Compile-Install-time) EasyBuild

Runtime DepMan: modules Lmod (Lua implementation of modules) Texas Advanced Computing Center $> module avail GATK ----------------------- ///modules/ ----------------------- GATK/2.7-4-g6f46d11 GATK/2.8-1-g932cd3a $> module load GATK/2.7-4-g6f46d11 $> module list Currently Loaded Modulefiles: 1) /jdk/1.7.0_25 2) /R/3.0.2 3) /GATK/2.7-4-g6f46d11

Runtime DepMan: modules $> module show GATK/2.7-4-g6f46d11 --------------------------------------------------------------- ///modules//gatk/2.7-4-g6f46d11: module-whatis Sets GATK environment. prereq jdk/1.7.0_25 setenv GATK_HOME ////GATK-2.7-4-g6f46d11/ --------------------------------------------------------------- $> java -jar ${GATK_HOME}/GenomeAnalysisTK.jar --version GATK-2.7-4-g6f46d11

Deploytime DepMan: EasyBuild HPC UGent hpcugent.github.io/easybuild/ EasyBuild Framework (Python) EasyBlocks EasyConfigs

Deploytime DepMan: EasyBuild Toolchain example: goolf-1.7.20 (GCC OpenMPI OpenBlas LAPACK FFTW) Install example: eb BWA-0.7.12-goolf-1.7.20.eb

Deploytime DepMan: EasyBuild EasyBuild automates Fetch sources Decompress Configure Compile Install Generate module file No root access required Large collection of EasyConfigs shared by community

home user 1 1. Modify/upload your personal configs/preferences You

home user 1 1. Modify/upload your personal configs/preferences You 2. Perform experiment

home user 1 1. Modify/upload your personal configs/preferences You 2. Perform experiment 3. Generate raw data

home user 1 1. Modify/upload your personal configs/preferences You 2. Perform experiment 3. Generate raw data 4. Document raw data www.nature.com/scientificdata/

home user 1 1. Modify/upload your personal configs/preferences You 2. Perform experiment 3. Generate raw data 4. Document raw data 5. Upload documented raw data

home user 1 1. Modify/upload your personal configs/preferences DM 6. Contact Data Manager You 2. Perform experiment 3. Generate raw data 4. Document raw data 5. Upload documented raw data

home user 1 1. Modify/upload your personal configs/preferences DM 6. Contact Data Manager You 7. Move or copy documented raw data 2. Perform experiment 3. Generate raw data 4. Document raw data 5. Upload documented raw data

user 1 home

user 1 home 8. Copy raw data You

user 1 home 8. Copy raw data You 9. Analyze data

user 1 home 8. Copy raw data You 9. Analyze data 10. Generate tmp data 11. Generate final results

user 1 home 8. Copy raw data You 9. Analyze data 10. Generate tmp data 11. Generate final results 12. Document final results

user 1 home 8. Copy raw data You 13. Contact Data Manager DM 9. Analyze data 10. Generate tmp data 11. Generate final results 12. Document final results

user 1 home 14. Move documented final results 8. Copy raw data You 13. Contact Data Manager DM 9. Analyze data 10. Generate tmp data 11. Generate final results 12. Document final results

user 1 home 14. Move documented final results 8. Copy raw data You 13. Contact Data Manager DM 9. Analyze data 15. Cleanup 10. Generate tmp data 11. Generate final results 12. Document final results

user 1 home

www.molgenis.org

www.molgenis.org

?