The ClusterStor Engineered Solution for HPC. Dr. Torben Kling Petersen, PhD Principal Solutions Architect - HPC

Similar documents
Xyratex ClusterStor6000 & OneStor

Sun Lustre Storage System Simplifying and Accelerating Lustre Deployments

Xyratex ClusterStor TM The Future of HPC Storage. Ken Claffey, Alan Poston & Torben Kling-Petersen. Networked Storage Solutions

A ClusterStor update. Torben Kling Petersen, PhD. Principal Architect, HPC

The Genesis HyperMDC is a scalable metadata cluster designed for ease-of-use and quick deployment.

Altos R320 F3 Specifications. Product overview. Product views. Internal view

Network Storage Appliance

HPE Scalable Storage with Intel Enterprise Edition for Lustre*

Dell Fluid Data solutions. Powerful self-optimized enterprise storage. Dell Compellent Storage Center: Designed for business results

Genesis HyperMDC 200D

STAR-CCM+ Performance Benchmark and Profiling. July 2014

Cisco UCS S3260 System Storage Management

GW2000h w/gw175h/q F1 specifications

Cisco UCS S3260 System Storage Management

Building dense NVMe storage

IOPStor: Storage Made Easy. Key Business Features. Key Business Solutions. IOPStor IOP5BI50T Network Attached Storage (NAS) Page 1 of 5

Altos T310 F3 Specifications

Cisco HyperFlex HX220c Edge M5

NEC Express5800/R120h-2M System Configuration Guide

Architecting High Performance Computing Systems for Fault Tolerance and Reliability

Introduction to NetApp E-Series E2700 with SANtricity 11.10

SUN CUSTOMER READY HPC CLUSTER: REFERENCE CONFIGURATIONS WITH SUN FIRE X4100, X4200, AND X4600 SERVERS Jeff Lu, Systems Group Sun BluePrints OnLine

Cisco MCS 7845-H1 Unified CallManager Appliance

Cisco HyperFlex HX220c M4 Node

Cisco HyperFlex HX220c M4 and HX220c M4 All Flash Nodes

CyberStore DSS. Multi Award Winning. Broadberry. CyberStore DSS. Open-E DSS v7 based Storage Appliances. Powering these organisations

Cisco HyperFlex HX220c M4 and HX220c M4 All Flash Nodes

Density Optimized System Enabling Next-Gen Performance

SMART SERVER AND STORAGE SOLUTIONS FOR GROWING BUSINESSES

HUAWEI Tecal X6000 High-Density Server

Acer AW2000h w/aw170h F2 Specifications

15 Jun 2012 Kevin Chang

Cisco MCS 7835-H2 Unified Communications Manager Appliance

LAMMPS-KOKKOS Performance Benchmark and Profiling. September 2015

NEC Express5800/R120h-2E System Configuration Guide

Slide 0 Welcome to this Web Based Training session introducing the ETERNUS DX80 S2, DX90 S2, DX410 S2 and DX440 S2 storage systems from Fujitsu.

Acer AR320 F1 specifications

SGI Overview. HPC User Forum Dearborn, Michigan September 17 th, 2012

Cisco UCS S3260 System Storage Management

Intel Select Solutions for Professional Visualization with Advantech Servers & Appliances

HUAWEI Tecal X8000 High-Density Rack Server

Cold Storage: The Road to Enterprise Ilya Kuznetsov YADRO

VX3000-E Unified Network Storage

Cisco UCS C200 M2 High-Density Rack-Mount Server

NEC Express5800/R120h-1M System Configuration Guide

The Oracle Database Appliance I/O and Performance Architecture

Cisco MCS 7825-I1 Unified CallManager Appliance

HUAWEI TECHNOLOGIES CO., LTD. HUAWEI FusionServer X6000 High-Density Server

Sugon TC6600 blade server

NEC Express5800/R120h-2M System Configuration Guide

Huawei RH1288 V3 Rack Server Datasheet. Overview. Check its price: Click Here. Quick Specs. Product Details

STOR-LX. Features. Linux-Based Unified Storage Appliance

Full Featured with Maximum Flexibility for Expansion

Configuring a Single Oracle ZFS Storage Appliance into an InfiniBand Fabric with Multiple Oracle Exadata Machines

Product Introduction of Inspur Server NF5280M4

Suggested use: infrastructure applications, collaboration/ , web, and virtualized desktops in a workgroup or distributed environments.

Cisco UCS C210 M2 General-Purpose Rack-Mount Server

HostEngine 5URP24 Computer User Guide

VSTOR Vault Mass Storage at its Best Reliable Mass Storage Solutions Easy to Use, Modular, Scalable, and Affordable

SUN ZFS STORAGE 7X20 APPLIANCES

LEVERAGING A PERSISTENT HARDWARE ARCHITECTURE

Acer AR320 F2 Specifications

NEC Express5800/R120e-1M System Configuration Guide

January 28 29, 2014San Jose. Engineering Workshop

High Performance Computing

1. ALMA Pipeline Cluster specification. 2. Compute processing node specification: $26K

Supports up to four 3.5-inch SAS/SATA drives. Drive bays 1 and 2 support NVMe SSDs. A size-converter

NEC Express5800/R120d-2M Configuration Guide

Essentials. Expected Discontinuance Q2'15 Limited 3-year Warranty Yes Extended Warranty Available

Achieve Optimal Network Throughput on the Cisco UCS S3260 Storage Server

Acer AT310 F2 Specifications

Flexible General-Purpose Server Board in a Standard Form Factor

Ref : IT/GOV/ th July Tender for Procurement of Blade Enclosure, Server & Storage

VX1800 Series Unified Network Storage

An Oracle White Paper December Accelerating Deployment of Virtualized Infrastructures with the Oracle VM Blade Cluster Reference Configuration

Lot # 10 - Servers. 1. Rack Server. Rack Server Server

PART-I (B) (TECHNICAL SPECIFICATIONS & COMPLIANCE SHEET) Supply and installation of High Performance Computing System

DESCRIPTION GHz, 1.536TB shared memory RAM, and 20.48TB RAW internal storage teraflops About ScaleMP

REQUEST FOR PROPOSAL FOR PROCUREMENT OF

DELL POWEREDGE SERVERS

Cisco UCS C210 M1 General-Purpose Rack-Mount Server

NEC Express5800/R120h-1E System Configuration Guide

Next Generation Computing Architectures for Cloud Scale Applications

NAMD Performance Benchmark and Profiling. January 2015

Intel Xeon E v4, Windows Server 2016 Standard, 16GB Memory, 1TB SAS Hard Drive and a 3 Year Warranty

Virtual Security Server

Cisco and Cloudera Deliver WorldClass Solutions for Powering the Enterprise Data Hub alerts, etc. Organizations need the right technology and infrastr

The advantages of architecting an open iscsi SAN

Data Sheet Fujitsu Server PRIMERGY CX250 S2 Dual Socket Server Node

HP solutions for mission critical SQL Server Data Management environments

Dell PowerEdge Servers Portfolio Guide

<Insert Picture Here> Exadata Hardware Configurations and Environmental Information

SCHOOL OF PHYSICAL, CHEMICAL AND APPLIED SCIENCES

NEC Express5800/R120h-1E System Configuration Guide

Altair OptiStruct 13.0 Performance Benchmark and Profiling. May 2015

Intel Server. Performance. Reliability. Security. INTEL SERVER SYSTEMS

John Fragalla TACC 'RANGER' INFINIBAND ARCHITECTURE WITH SUN TECHNOLOGY. Presenter s Name Title and Division Sun Microsystems

Design a Remote-Office or Branch-Office Data Center with Cisco UCS Mini

DELL STORAGE NX WINDOWS NAS SERIES CONFIGURATION GUIDE

Intel Server. Performance. Reliability. Security. INTEL SERVER SYSTEMS FEATURE-RICH INTEL SERVER SYSTEMS FOR PERFORMANCE, RELIABILITY AND SECURITY

Transcription:

The ClusterStor Engineered Solution for HPC Dr. Torben Kling Petersen, PhD Principal Solutions Architect - HPC

Forward Looking Statement The following information contains, or may be deemed to contain, "forwardlooking statements" (as defined in the U.S. Private Securities Litigation Reform Act of 1995) which reflect our current views with respect to future events and financial performance. We use words such as "anticipates," "believes," "plans," "expects," "future,"' "intends," "may," "will," "should," "estimates," "predicts," "potential," "continue" and similar expressions to identify these forward-looking statements. All forward-looking statements address matters that involve risks and uncertainties. Accordingly, you should not rely on forward-looking statements, as there are or will be important factors that could cause our actual results, as well as those of the markets we serve, levels of activity, performance, achievements and prospects to differ materially from the results predicted or implied by these forward-looking statements. These risks, uncertainties and other factors include, among others, those identified in "Risk Factors," "Management's Discussion and Analysis of Financial Condition and Results of Operations'' and elsewhere in the company s 20-F filed with the SEC. Xyratex Ltd undertakes no obligation to publicly update or review any forward-looking statements, whether as a result of new information, future developments or otherwise. 2

An Integrated Lustre Solution Why bother?? Because Lustre can be hard and painful Task Typical Lustre ClusterStor Configuration and Ordering Multiple discreet building blocks, myriad of part numbers and options, complex configurator Simplified configurator model with Single Scalable Storage Unit (SSU) part number Software Installation >5 Days Factory Installed Out of Box Experience Performance Tuning >5 Days to get system up and running 2-4 Weeks balancing OSS, networking, RAID and JBOD systems Out of box to configured storage cluster in <4 hours Immediate (Factory Set) Client Setup 2-5 Days Immediate (Factory Set) Storage Expansion 3-5 Weeks <5 Hours Acceptance Testing Standard (weeks/months) Shortened (Days/Weeks) 3

Is it important?? While Lustre grows in the Top500 it needs to grow beyond the list Footprint in automotive, aerospace and manufacturing still small Footprint in atmospherics almost non-existent While present in the energy sector, this is not growing Finance??, Pharmaceutical??, Data warehousing?? For acceptance by industry several things are needed HSM (on the roadmap since many years but. ) Back-ups and snapshots Windows connectivity. Graphical management tools Training?? Custom builds wont work. Industry buys appliances. Support important (global 24/7 level 1-3 support required) Standard components produced in volume (user replaceable) 4

Appliance Architecture Balanced approach TESTING TESTING TESTING TESTING Xyratex Integrated TESTING Xyratex Customized Networking TESTING Hardware are OS TESTING Integrated TESTING Support Power TESTING TESTING Xyratex ClusterStor TESTING r Optimized Manager TESTING ager Lustre TESTING TESTING TESTING TESTED!! 5

Scalable Storage Unit SSU HA-OSS enclosure

ClusterStor Storage Module Enclosure Layout 5U Height 84 x 3.5 drives (16.8 drives per U) 3.5 SAS SSDs and HDDs @ 7200 RPM Two SBB Embedded Server Modules Fully Redundant (1+1) Power Gold standard 2.8 kw PSUs with built in UPS batteries (Metis) Industry leading performance/capacity city per kwh Two PSUs power all disks, 2x Servers, fans and the UPS Dual Cable Assemblies Redundant SAS and Power connections to drawers Single Power & Connectivity Midplane Rear mounted fan assemblies 7 Xyratex Patent Pending

Metis Xyratex Metis technology adds long-hold power supply capabilities to OneStor storage enclosures 764W. Removes the need for cache holdup batteries in I\O controllers and external UPS During AC failure the CPU complex remains powered for short period of time, enabling the dump of volatile memory to a non-volatile store Frees up space in the rack or I\O controller canister for adding other functions Eliminates the need for NVRAM cards 8

Front to Back cooling Air enters at the front Flows between the drives via controlled air gaps Most airflow is directed to PSUs Embedded servers Some airflow passes through to fan modules cooling any Metis batteries fitted. Air exits top vents in PSUs and embedded servers (inverted) Air exits via fans at the rear. 9

CS-2584 OSS Custom enclosure / standard Lustre design OSS Configuration - 4 OST s per server PCI-e based heartbeat OSS- Configuration 4 OST s per server 3 rows of 14 drives per drawer Hardwired SAS 3 rows of 14 drives per drawer 42 Physical drives per tray 42 Physical drives per tray Parity Drives Parity Drives 40 Drives 40 Drives 10 Four RAID-6 (8+2) RAID Arrays Four RAID-6 (8+2) RAID Arrays OSS Linux Ext Journals (SSD) Two Global Hot spares

ClusterStor 3000 ESM

ClusterStor 3000 OSS Embedded Server Module 12 Intel Nehalem micro-architecture Jasper Forest CPU Quad core (Xeon C5518) with RAID enhancements (XOR/P+Q) Fully integrated DMA engine 48W Max TDP low voltage SKU Integrated DDR3 ECC memory controllers Integrated PCIe Gen 2 buses Boot Options USB, SATA, Network Boot (PXE) Management Xyratex Unified System Management/SES IPMI version 2.0 Serial-over-LAN, ikvm and Remote Media Mellanox Connect-X2 HCA providing a single QDR IB port SAS 2.0 Expansion and Drive Side interfaces The entire hot swappable server uses less than 200W

ClusterStor 3000 OSS Module Major Component Detail 13

Integrated Solution

15 Factory built, cabled, installed, tuned and tested

ClusterStor Software Features

ClusterStor Stack Overview Native Lustre Client Protocol ClusterStor Manager SMB/CIFS Export NFS Export Management Interface/ ClusterStor Manager System Monitoring Agents/ GEM Data Protection Layer/ RAID 6 Xyratex Optimized Lustre 2.x Enterprise Linux CTBD Module Replication Engine Flash / Cache Module 17

18 GEM (General Enclosure Management) Building Blocks

GEM Overview High Availability Hardware and Software Advanced Firmware Updates Software Integrity Checking Logging - Volatile and Persistent Power Monitoring and control Adaptive Cooling Drive Power Control Advanced CLI Automatic enclosure plug and play Diagnostics Extensive status information of all services at all times SES 2.xx Compliant Tracking the standard Ethernet Support (when available) Telnet, SSH Based on 4 th Generation codebase Xyratex Portability Framework (XPF) 19

ClusterStor Manager Provides extensive end-to-end management of the entire Lustre cluster Automated Installation and Maintenance 20 Up and running in hours vs. days/weeks No need to go offline for updates Integrated Cluster and block storage management Health Monitoring and Notification Health status of entire storage cluster Error logging at every layer (Lustre, RAID, Enclosure..) E-mail alerts & single button bug submission and support via Bugzilla (OEM and/or XRTX) Root cause diagnostics and error message interpretation Leverages proven Open Source Infrastructure LMT- Lustre management Yaci cluster provisioning Puppet cluster configuration LNET LLNL GUI for Lustre monitoring Advanced analytics and data mining facility The means to optimize data I/O Ganglia/Cerebro scalable cluster monitoring Powerman power management Pacemaker fail over clustering DRBD configuration database failover

Installing and Managing ClusterStor

22 Installing a cluster

23 Installing a cluster

24 Installing a cluster

25 Installing a cluster

26 Installing a cluster

27 Installing a cluster

28 Managing the solution

29 Managing the solution

30 Managing the solution

31 Managing the solution

32 Managing the solution

33 Managing the solution

34 Managing the solution

35 Managing the solution

36 Managing the solution

37 Managing the solution

38 Managing the solution

39 Managing the solution

Xyratex ClusterStor Breaking new ground Up to 1.5 PB/rack Up to 672 disks per rack (exp rack) Supports 1, 2 or 3 TB SAS disks More than 20 GB/s throughput per rack Supports full QDR IB or 10 GbE fabrics All active components redundant and hot swappable Engineered, balanced solution for extreme density and performance Dedicated GUI based management utility Lustre 2.0/2.1 based solution Active development of new features and fixes Energy optimized design Less than 20 kw/rack at full performance 40

41 The End!!