Bright Cluster Manager Advanced HPC cluster management made easy. Martijn de Vries CTO Bright Computing
Transcription
1 Bright Cluster Manager Advanced HPC cluster management made easy Martijn de Vries CTO Bright Computing
2 About Bright Computing
Bright Computing:
1. Develops and supports Bright Cluster Manager for HPC systems and server farms
2. Incorporated in USA (HQ in San Jose, California)
3. Development office in Amsterdam, NL
4. Backed by ING Bank as shareholder and investor
5. Sells through a rapidly growing network of resellers and OEMs world-wide
6. Customers and resellers in US, Canada, Brazil, Europe, Middle-East, India, Singapore, Japan
7. Installations in Academia, Government, Industry, ranging from 4 node to TOP500 systems
3 Customers: Academia, Government, Industry
4 The Commonly Used Toolkit Approach
Most HPC cluster management solutions use the toolkit approach (Linux distro + tools)
- Examples: Rocks, PCM, OSCAR, UniCluster, CMU, bullx, etc.
- Tools typically used: Ganglia, Cacti, Nagios, Cfengine, System Imager, Puppet, Cobbler, Hobbit, Big Brother, Zabbix, Groundwork, etc.
Issues with the toolkit approach:
- Tools rarely designed to work together
- Tools rarely designed for HPC
- Tools rarely designed to scale
- Each tool has its own command line interface and GUI
- Each tool has its own daemon and database
- Roadmap dependent on developers of the tools
Making a collection of unrelated tools work together:
- Requires a lot of expertise and scripting
- Rarely leads to a really easy-to-use and scalable solution
5 About Bright Cluster Manager
Bright Cluster Manager takes a much more fundamental & integrated approach:
- Designed and written from the ground up
- Single cluster management daemon provides all functionality
- Single, central database for configuration and monitoring data
- Single CLI and GUI for ALL cluster management functionality
Which makes Bright Cluster Manager:
- Extremely easy to use
- Extremely scalable
- Secure & reliable
- Complete
- Flexible
6 Architecture (diagram: CMDaemon)
7 Bright Cluster Manager Elements (diagram)
- Interfaces: Cluster Management GUI, Cluster Management Shell
- Security: SSL / SOAP / X509 / IPtables
- Cluster Management Daemon: Provisioning, Monitoring, Automation, Health Checks, Management
- Workload managers: PBS Pro, Torque, Maui/MOAB, Grid Engine, SLURM, LSF*
- User environment: Compilers, Libraries, Debuggers, Profilers
- Operating systems: SLES / RHEL / CentOS / SL / Oracle EL, ScaleMP vSMP
- Hardware: CPU, GPU, Memory, Disk, Ethernet, Interconnect, IPMI / iLO, PDU
8 HPC User Environment
Let users focus on performing computations
Rich collection of HPC software:
- Compilers (GNU, Intel*, Portland*, Open64, etc.)
- Parallel middleware (MPI libraries, threading libraries, OpenMP, Global Arrays, etc.)
- Mathematical libraries (ACML, MKL*, LAPACK, BLAS, etc.)
- Development tools (debuggers, profilers, etc.)
- Environment modules
Intel Cluster Ready Compliant:
- Compliant applications run out of the box
9 Management Interface
Graphical User Interface (GUI)
- Offers administrator full cluster control
- Standalone desktop application
- Manages multiple clusters simultaneously
- Runs on Linux, Windows, MacOS X*
- Built on top of Mozilla XUL engine
Cluster Management Shell (CMSH)
- All GUI functionality also available through Cluster Management Shell
- Interactive and scriptable in batch mode
12 Cluster Management Shell (CMSH)
Features:
- Modular interface
- Command completion using tab key
- Command line history
- Output redirection to file or shell command
- Scriptable in batch mode
- Support for looping over objects
Example:
  [demo]% device
  [demo->device]% status
  demo... [ UP ]
  node [ UP ]
  node [ UP ]
17 Node Provisioning
Image based
- Slave node image is a directory on the head node
- Unlimited number of images can be created
- Software changes for the slave nodes are made inside the image(s) on the head node
- Provisioning system ensures that changes are propagated to the slave nodes
Nodes always boot over the network
- Slave nodes PXE boot into Node Installer, which:
  - Identifies node (switch port or MAC based)
  - Configures BMC
  - Partitions disks (if any) and creates file systems
  - Installs or updates software image
  - Pivots the root from NFS to the local file system
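The identification step can be pictured as a lookup against the nodes the head node already knows. The sketch below is illustrative only: the `NodeRecord` schema, the `identify_node` function, and the choice to try the MAC address before the switch port are all assumptions, not Bright Cluster Manager's actual data model or precedence rules.

```python
from dataclasses import dataclass
from typing import Iterable, Optional, Tuple


@dataclass(frozen=True)
class NodeRecord:
    """One known node; both identifiers are optional (hypothetical schema)."""
    hostname: str
    mac: Optional[str] = None
    switch_port: Optional[Tuple[str, int]] = None  # (switch name, port number)


def identify_node(records: Iterable[NodeRecord],
                  mac: str,
                  switch_port: Optional[Tuple[str, int]] = None) -> Optional[str]:
    """Return the hostname of the booting node, or None if it is unknown."""
    records = list(records)
    wanted_mac = mac.lower()
    # Assumption: try the MAC address first, then fall back to the switch port.
    for record in records:
        if record.mac is not None and record.mac.lower() == wanted_mac:
            return record.hostname
    if switch_port is not None:
        for record in records:
            if record.switch_port == switch_port:
                return record.hostname
    return None
```

Switch-port matching is what lets a replaced node (new MAC, same rack position) come up under the same name without manual intervention.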
20 Architecture: Monitoring (diagram: CMDaemon, BMC, BMC, BMC)
25 Bright Cluster Manager for GPGPU
26 GPU Development Environment
- CUDA & OpenCL redistribution rights
- Current and previous versions of CUDA & OpenCL
- Easy switching between CUDA & OpenCL versions
- CUDA driver automatically compiled at boot time
- Support for new Fermi architecture:
  - Native 64-bit GPU support
  - Multiple copy engine support
  - ECC reporting
  - Concurrent kernel execution
  - Fermi HW debugging support in cuda-gdb
27 GPU Monitoring
31 Cluster Health Checking
Goal: provide a problem-free environment for running jobs
Hardware & software health
Three types of health check:
1. Health checks before jobs are run
   - Halt workload manager a few (milli)seconds before job is executed
   - Check health of each reserved node
   - If unhealthy, take node offline and inform system administrator
   - Hand job back to workload manager
2. Frequently scheduled health checks
   - Run health check when node is not used
   - Run health check through queuing system
3. Hardware burn-in environment
   - Most thorough health check
   - Requires reboot
All types are extensible
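The pre-job check flow can be sketched as follows. This is a minimal illustration, not the product's implementation: `prejob_check`, the `HealthCheck` signature, and the `offline` / `notify` parameters are hypothetical names chosen for the example.

```python
from typing import Callable, Iterable, List, Set

# A health check takes a node name and returns True when the node passes.
HealthCheck = Callable[[str], bool]


def prejob_check(reserved: Iterable[str],
                 checks: List[HealthCheck],
                 offline: Set[str],
                 notify: Callable[[str], None]) -> bool:
    """Check every node reserved for a job just before the job starts.

    Unhealthy nodes are added to `offline` and reported through `notify`.
    Returns True when all nodes passed; False means the job should be
    handed back to the workload manager to be scheduled elsewhere.
    """
    all_healthy = True
    for node in reserved:
        failed = [check.__name__ for check in checks if not check(node)]
        if failed:
            offline.add(node)                                # take node offline
            notify(f"{node} failed: {', '.join(failed)}")    # inform administrator
            all_healthy = False
    return all_healthy
```

Because checks are plain callables, adding a site-specific check is a matter of appending another function to the list, which mirrors the "all types are extensible" point.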
36 Scalability
Cluster management software should not be the limiting factor for cluster size.
Philosophy used for Bright Cluster Manager:
- All tasks performed by the master node should be offloadable to dedicated nodes.
- If the master node cannot handle a task as a result of cluster size, the task can be placed on one or more dedicated nodes.
- For example: multiple dedicated load-balanced provisioning nodes may be assigned in a cluster.
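One simple way to picture load balancing across provisioning nodes is a least-loaded selection with a per-node slot limit. The slide does not say which balancing strategy is used, so the function below (`pick_provisioning_node`, with its `active` and `max_slots` parameters) is an assumed sketch only.

```python
from typing import Dict, Optional


def pick_provisioning_node(active: Dict[str, int], max_slots: int) -> Optional[str]:
    """Pick the provisioning node with the fewest active transfers.

    `active` maps node name -> number of transfers currently running.
    Returns None when every node is already at `max_slots`, in which
    case the request would have to wait in a queue.
    """
    candidates = {name: count for name, count in active.items() if count < max_slots}
    if not candidates:
        return None
    # Iterate names in sorted order so ties are resolved deterministically.
    return min(sorted(candidates), key=candidates.get)
```

Adding a dedicated provisioning node then amounts to adding one more key to `active`; the head node can remain in the map or be left out entirely.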
37 Image Based Provisioning
- Software image (or "image") is a directory on the head node
- Image contains full Linux file tree (/bin, /usr, ...)
- Software is not installed on nodes directly, but rather into the image
- After the image has been changed, changes can be propagated to the compute nodes
- Propagating image changes to nodes can be done in two ways:
  1. Rebooting nodes
  2. Using device imageupdate in CMSH, or Update Node in GUI
- The latter allows nodes to be updated without reboot
- Some changes do require reboot (e.g. kernel update)
38 Provisioning Process
- Node Installer submits provisioning request to head node
- Head node will queue request until a provisioning slot becomes available on one of the provisioning nodes (possibly just head node itself)
- Provisioning node will connect to compute node to provision software image to local file system
- Two install modes:
  - FULL: Re-partition hard drives, transfer image from scratch
  - SYNC: Only transfer differences between image and local disk
- Default install mode is SYNC
- Disk setup mismatch triggers FULL install mode
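The two install modes can be illustrated with a small sketch. The function names, the string representation of a disk layout, and the use of per-file checksums to detect differences are assumptions made for the example; the slides do not describe how differences are actually detected.

```python
from typing import Dict, List


def choose_install_mode(image_disk_setup: str,
                        node_disk_setup: str,
                        default: str = "SYNC") -> str:
    """Return "FULL" when the node's disk setup differs from what the image
    expects; otherwise return the default mode (SYNC)."""
    if image_disk_setup != node_disk_setup:
        return "FULL"
    return default


def files_to_transfer(image: Dict[str, str],
                      local: Dict[str, str],
                      mode: str) -> List[str]:
    """List the paths that must be copied to the node.

    `image` and `local` map path -> checksum (hypothetical representation).
    FULL copies everything; SYNC copies only new or changed files.
    Removal of stale local files is deliberately left out of this sketch.
    """
    if mode == "FULL":
        return sorted(image)
    return sorted(path for path, digest in image.items() if local.get(path) != digest)
```

For a node that is already close to the image, SYNC reduces the transfer to the handful of files that changed, which is why it makes sense as the default.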
40 Changing Software Images
Installing/updating RPMs:
  rpm --root=/cm/images/default-image -i myapp.rpm
  yum --installroot=/cm/images/default-image install myapp
  yum --installroot=/cm/images/default-image update
Installing software from source:
  make DESTDIR=/cm/images/default-image install
- Note that not all Makefiles support $DESTDIR
- Usage example from Makefile:
  install -m644 file-example $(DESTDIR)/etc/file
Making changes manually:
  chroot /cm/images/default-image
  cd /usr/src/myapp; make install
  emacs /cm/images/default-image/etc/file
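What DESTDIR and --installroot have in common is that every target path is re-rooted under the image directory. The helpers below are a hypothetical sketch for scripting such changes: `staged_path` and `yum_in_image` are names invented here, and only the `--installroot` flag and the image path are taken from the slide.

```python
from pathlib import PurePosixPath
from typing import List

IMAGE = "/cm/images/default-image"  # image directory used on the slide


def staged_path(destdir: str, target: str) -> str:
    """Mimic $(DESTDIR)/etc/file: prefix an absolute target with destdir.

    An empty destdir leaves the target unchanged, as an unset DESTDIR would.
    """
    if not destdir:
        return target
    return str(PurePosixPath(destdir) / target.lstrip("/"))


def yum_in_image(image: str, *args: str) -> List[str]:
    """Build the argument list for running yum against a software image."""
    return ["yum", f"--installroot={image}", *args]
```

For instance, `staged_path(IMAGE, "/etc/file")` gives the path edited with emacs in the manual example, and `yum_in_image(IMAGE, "install", "myapp")` can be passed to `subprocess.run` on the head node.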
41 Cloud Bursting (in development)
- Allow clusters to be extended with cloud resources
- Cluster can grow or shrink based on workload and policies
- Integrated interface to public cloud providers
- Unsolved problem: how to deal with local storage?
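Since the feature is described as still in development, the following is purely a hypothetical illustration of what a "grow or shrink based on workload and policies" rule could look like. The function `desired_cloud_nodes` and all of its parameters are invented for this sketch.

```python
def desired_cloud_nodes(queued_jobs: int,
                        idle_local_nodes: int,
                        jobs_per_node: int = 1,
                        max_cloud_nodes: int = 10) -> int:
    """Return how many cloud nodes a simple backlog-based policy would want.

    Jobs that idle local nodes can absorb do not count toward the backlog;
    the result is capped by a policy limit. A result of 0 means the cluster
    can shrink back to local resources only.
    """
    backlog = max(0, queued_jobs - idle_local_nodes * jobs_per_node)
    needed = -(-backlog // jobs_per_node)  # ceiling division
    return min(needed, max_cloud_nodes)
```

A policy like this says nothing about the local storage question raised on the slide; data placement would need to be handled separately.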
42 Looking for challenging and exciting jobs in HPC?
Windows Azure Windows Azure Services - At Different Levels SaaS eg : MS Office 365 Paas eg : Azure SQL Database, Azure websites, Azure Content Delivery Network (CDN), Azure BizTalk Services, and Azure
More informationWhat s new in HTCondor? What s coming? HTCondor Week 2018 Madison, WI -- May 22, 2018
What s new in HTCondor? What s coming? HTCondor Week 2018 Madison, WI -- May 22, 2018 Todd Tannenbaum Center for High Throughput Computing Department of Computer Sciences University of Wisconsin-Madison
More informationCluster Computing. Resource and Job Management for HPC 16/08/2010 SC-CAMP. ( SC-CAMP) Cluster Computing 16/08/ / 50
Cluster Computing Resource and Job Management for HPC SC-CAMP 16/08/2010 ( SC-CAMP) Cluster Computing 16/08/2010 1 / 50 Summary 1 Introduction Cluster Computing 2 About Resource and Job Management Systems
More informationVMware vsphere 6.0 / 6.5 Infrastructure Deployment Boot Camp
Title: Summary: Length: Overview: VMware vsphere 6.0 / 6.5 Infrastructure Deployment Boot Camp Class formats available: Live In-Classroom Training (LICT) Mixed class with Classroom and Online Instruction
More informationCentre de Calcul de l Institut National de Physique Nucléaire et de Physique des Particules. Singularity overview. Vanessa HAMAR
Centre de Calcul de l Institut National de Physique Nucléaire et de Physique des Particules Singularity overview Vanessa HAMAR Disclaimer } The information in this presentation was compiled from different
More informationThe Why and How of HPC-Cloud Hybrids with OpenStack
The Why and How of HPC-Cloud Hybrids with OpenStack OpenStack Australia Day Melbourne June, 2017 Lev Lafayette, HPC Support and Training Officer, University of Melbourne lev.lafayette@unimelb.edu.au 1.0
More information1 Bull, 2011 Bull Extreme Computing
1 Bull, 2011 Bull Extreme Computing Table of Contents Overview. Principal concepts. Architecture. Scheduler Policies. 2 Bull, 2011 Bull Extreme Computing SLURM Overview Ares, Gerardo, HPC Team Introduction
More informationCisco UCS Diagnostics User Guide for B-Series Servers, Release 2.0
First Published: 2018-03-13 Americas Headquarters Cisco Systems, Inc. 170 West Tasman Drive San Jose, CA 95134-1706 USA http://www.cisco.com Tel: 408 526-4000 800 553-NETS (6387) Fax: 408 527-0883 2018
More informationSTAR-CCM+ Performance Benchmark and Profiling. July 2014
STAR-CCM+ Performance Benchmark and Profiling July 2014 Note The following research was performed under the HPC Advisory Council activities Participating vendors: CD-adapco, Intel, Dell, Mellanox Compute
More information