Terabit Networking with JASMIN

Similar documents
JASMIN Petascale storage and terabit networking for environmental science

JASMIN Overview. UKMO Visit 24/11/2014. Matt Pritchard

The Impact of Hyper- converged Infrastructure on the IT Landscape

VIRTUAL CLUSTER SWITCHING SWITCHES AS A CLOUD FOR THE VIRTUAL DATA CENTER. Emil Kacperek Systems Engineer Brocade Communication Systems.

TITLE. the IT Landscape

Vmware VCXN610. VMware Certified Implementation Expert (R) Network Virtualization.

The CEDA Archive: Data, Services and Infrastructure

Mellanox Virtual Modular Switch

Arista 7020R Series: Q&A

Network Configuration Example

THE OPEN DATA CENTER FABRIC FOR THE CLOUD

Highest Levels of Scalability Simplified Network Manageability Maximum System Productivity

SwitchX Virtual Protocol Interconnect (VPI) Switch Architecture

Arista 7050X Series: Q&A

Introduction: PURPOSE BUILT HARDWARE. ARISTA WHITE PAPER HPC Deployment Scenarios

InfiniBand Switch System Family. Highest Levels of Scalability, Simplified Network Manageability, Maximum System Productivity

V.I.B.E. Virtual. Integrated. Blade. Environment. Harveenpal Singh. System-x PLM

<Insert Picture Here> Exadata Hardware Configurations and Environmental Information

Arista 7320X: Q&A. Product Overview. 7320X: Q&A Document What are the 7320X series?

RESEARCH DATA DEPOT AT PURDUE UNIVERSITY

Deploying Data Center Switching Solutions

N-Series Switches IDEAL FOR DATA CENTER NETWORKS AND HIGH-END CAMPUS NETWORKS

The Next Opportunity in the Data Centre

VxRack FLEX Technical Deep Dive: Building Hyper-converged Solutions at Rackscale. Kiewiet Kritzinger DELL EMC CPSD Snr varchitect

Arista 7010 Series: Q&A

DDN About Us Solving Large Enterprise and Web Scale Challenges

Networking Terminology Cheat Sheet

Arista 7060X, 7060X2, 7260X and 7260X3 series: Q&A

Cisco UCS Network Performance Optimisation and Best Practices for VMware

VMware Validated Design for Micro-Segmentation Reference Architecture Guide

Dell EMC. VxBlock Systems for VMware NSX 6.3 Architecture Overview

VMware vsan Network Design-OLD November 03, 2017

DELL EMC READY BUNDLE FOR VIRTUALIZATION WITH VMWARE AND FIBRE CHANNEL INFRASTRUCTURE

Dell EMC Networking VxRail Networking Quick Guide

Dell EMC. VxBlock Systems for VMware NSX 6.2 Architecture Overview

Benefits of 25, 40, and 50GbE Networks for Ceph and Hyper- Converged Infrastructure John F. Kim Mellanox Technologies

Welcome. Questions? Please contact or call

CloudEngine 6800 Series Data Center Switches

Arista 7050X Series: Q&A

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

Architecting Data Center Networks in the era of Big Data and Cloud

Architecture and Design. VMware Validated Design 4.0 VMware Validated Design for Micro-Segmentation 4.0

Nimble Storage SmartStack Getting Started Guide Cisco UCS and VMware ESXi5

IPv6 Best Operational Practices of Network Functions Virtualization (NFV) With Vmware NSX. Jeremy Duncan Tachyon Dynamics

Ten things hyperconvergence can do for you

DELL EMC ISILON F800 AND H600 I/O PERFORMANCE

Architecting Storage for Semiconductor Design: Manufacturing Preparation

GUIDE. Optimal Network Designs with Cohesity

Hochverfügbarkeit in Campusnetzen

DELL EMC TECHNICAL SOLUTION BRIEF. ARCHITECTING A DELL EMC HYPERCONVERGED SOLUTION WITH VMware vsan. Version 1.0. Author: VICTOR LAMA

Verified Scalability Guide for Cisco APIC, Release 3.0(1k) and Cisco Nexus 9000 Series ACI-Mode Switches, Release 13.0(1k)

Cloud Networking (VITMMA02) Server Virtualization Data Center Gear

CloudEngine Series Data Center Switches

Huawei CloudFabric and VMware Collaboration Innovation Solution in Data Centers

IBM Hortonworks Design Guide 14-Sep-17 v1

Verified Scalability Guide for Cisco APIC, Release 3.0(1k) and Cisco Nexus 9000 Series ACI-Mode Switches, Release 13.0(1k)

Brocade Ethernet Fabrics

Scaling to Petaflop. Ola Torudbakken Distinguished Engineer. Sun Microsystems, Inc

John Fragalla TACC 'RANGER' INFINIBAND ARCHITECTURE WITH SUN TECHNOLOGY. Presenter s Name Title and Division Sun Microsystems

vstart 50 VMware vsphere Solution Specification

Introduction to Spine-Leaf Networking Designs

Dell DCPPE-200. Dell PowerEdge Professional. Download Full version :

Dell EMC. VxBlock and Vblock Systems 540 Architecture Overview

Emerging Technologies for HPC Storage

IronPOD System 400 Series System Overview

LSW6600 are the industry's highest performance 1U stackable data center switch, featuring with 1.28Tbps

DELL EMC READY BUNDLE FOR VIRTUALIZATION WITH VMWARE AND ISCSI INFRASTRUCTURE

DELL EMC SCALEIO. Networking Best Practices and Design Considerations ABSTRACT

IBM Cloud for VMware Solutions NSX Edge Services Gateway Solution Architecture

BROCADE CLOUD-OPTIMIZED NETWORKING: THE BLUEPRINT FOR THE SOFTWARE-DEFINED NETWORK

NetApp HCI with Mellanox SN2010 Switch Quick Cabling Guide

VMware Virtual SAN Routed Network Deployments with Brocade

Deep Dive QFX5100 & Virtual Chassis Fabric Washid Lootfun Sr. System Engineer

Create a pfsense router for your private lab network template

Apstra Operating System AOS

Store Process Analyze Collaborate Archive Cloud The HPC Storage Leader Invent Discover Compete

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

Dell EMC Networking vsan vsphere Networking Quick Guide using Dell OS 9

Shared Object-Based Storage and the HPC Data Center

NETWORK ARCHITECTURES AND CONVERGED CLOUD COMPUTING. Wim van Laarhoven September 2010

Rack-Level I/O Consolidation with Cisco Nexus 5000 Series Switches

GCN Lead Greece Cyprus & Malta GLOBAL SPONSORS

The Impact of Hyper- converged Infrastructure on the IT Landscape

Video Surveillance EMC Storage with Honeywell Digital Video Manager

Modern hyperconverged infrastructure. Karel Rudišar Systems Engineer, Vmware Inc.

vsphere Design and Deploy Fast Track v6 Additional Slides

Tiered IOPS Storage for Service Providers Dell Platform and Fibre Channel protocol. CloudByte Reference Architecture

General Questions. Section Specific Questions

FlexPod Express with VMware vsphere 6.0U2, NetApp E-Series, and Cisco UCS Mini

DELL EMC VXRACK FLEX FOR HIGH PERFORMANCE DATABASES AND APPLICATIONS, MULTI-HYPERVISOR AND TWO-LAYER ENVIRONMENTS

Cloud Thinking in the Enterprise

Mellanox InfiniBand Solutions Accelerate Oracle s Data Center and Cloud Solutions

VMware and Arista Network Virtualization Reference Design Guide for VMware vsphere Environments

MS425 SERIES. 40G fiber aggregation switches designed for large enterprise and campus networks. Datasheet MS425 Series

VMware Cloud Provider Platform

Building a Phased Plan for End-to-End FCoE. Shaun Walsh VP Corporate Marketing Emulex Corporation

Getting Started with Linux on Cumulus Networks

MULTI-STAGE CLOS ARCHITECTURES

VMware Virtual SAN Technology

Transcription:

Terabit Networking with JASMIN Jonathan Churchill JASMIN Infrastructure Manager Research Infrastructure Group Scientific Computing Department STFC Rutherford Appleton Labs

Terabit Networking with JASMIN What is JASMIN? Why is it needed? 3 year Growth Pains Network (re)design #3 Criteria Network Design Issues ECMP CLOS Architecture Advantages and Disadvantages VXLAN JASMIN Future Expansion

JASMIN s Purpose CEDA data storage & services Curated data archive Archive management services Archive access services (HTTP, FTP, Helpdesk,...) Data intensive scientific computing Global / regional datasets & models High spatial, temporal resolution Private cloud Flexible access to high-volume & complex data for climate & earth observation communities Online workspaces Services for sharing & collaboration

JASMIN is a world leading, unique hybrid of: 16PB high performance storage (~250GByte/s) High-performance computing (~4,000 cores) Non-blocking Networking (> 3Tbit/sec), and Optical Private Network WAN s Coupled with cloud hosting capabilities To address one of NERC s most strategically important challenges: the improvement of predictive environmental science. Prof. Duncan Wingham, NERC Chief Exec.

Panasas Storage Parallel file system (cf Lustre, GPFS, pnfs etc) Single Namespace 140GB/sec benchmarked (95 shelves PAS14) Access via PanFS client/nfs/cifs Posix filesystem out of the box. Mounted on Physical machines and VMs 103 shelves PAS11 + 101 shelves PAS14 + 40 Shelves PAS16 Each shelf connected at 10Gb (20Gb PAS14) 2,684 Blades JASMIN - Largest single realm in the world One Management Console TCO: Big Capital, Small Recurrent but JASMIN2 /TB < GPFS/Lustre offerings

Three year growth pains 172.16.X.0/21 = 2,000 IPs 130.246.X.0/21 Flat Overlaid L2 160->240 Ports @ 10Gb

4 x VMware Clusters vjasmin 156 cores, 1.2TB 40x 10Gb 385 x 10Gb Panasas Storage 20PBytes 15.1 PB (usable) Overview 12.5M,38 Racks, 850Amps, 25 tonnes, 3Terabit/s bandwidth Lotus HPC Cluster 468 x 10Gb 32x 10Gb NetApp+Dell 1010TB + (VM VMDK images) A network : 1,100 Ports @ 10GbE vcloud 208-1648 cores, 1.5TB 12.8TB MPI network (10Gb low latency eth) 144-234 hosts, 2.2K-3.6K cores. RHEL6, Platform LSF, MPI 40x 10Gb LightPaths @ 1&2Gb/s and 10Gb/s: Leeds, UKMO, Archer, (KNMI), CEMS-ISIC

Floor Plan Network distributed ~30m x ~20m JASMIN 1 JASMIN 4,5 (2016 20) ) JASMIN 3 JASMIN 2 Science DMZ

Network Design Criteria Non-Blocking (No network contention) Low Latency ( < 20uS MPI. Preferably < 10uS) Small latency spread. Converged (IP storage, SAN storage, Compute, MPI) 700-1100 Ports @ 10Gb Expansion to 1,600 ports and beyond wo forklift. Easy to manage and configure Cheap later on: Replaces JASMIN1 240 ports in place.

Cabling Costs 1,000 Fibre Connections = 400-600K JASMIN1+2 700-1100 10Gb Connections Compute Rack Storage Rack Storage Rack Storage Rack Network Rack Storage Rack Storage Rack Storage Rack Compute Rack 312 Twinax Fully Populated ToR 6x S4810 ToR Switches 48x Active Optic QSFP ToR e.g Force10 S4810P 48 x 10Gb SFP+ 4x 40Gb QSFP+ 1:1 Contention ToR 20x S4810 ToR Switches 80x Active Optic QSFP Lots of core 40Gb ports needed. MLAG to 72 Ports?... Chassis switch? expansion/cost

Mellanox SX1036 1,104 x 10GbE Ports CLOS L3 ECMP OSPF Mellanox SX1024 768 Ports max. no expansion so 12 spines Max 36 leaf switches :1,728 Ports @ 10GbE Non-Blocking. Zero Contention (48x10Gb = 12x 40Gb uplinks) Low Latency (250nS L3 / per switch/router). 7-10uS MPI Cheap!.. (ish)

Four routed ECMP hopshops Fast

ECMP CLOS L3 Advantages Massive scale High performance Low latency with fixed switches Standards based supports multiple vendors Very small blast radius upon network failures Small isolated subnets Deterministic latency with a fixed spine and leaf Pay as you grow start small and increment https://www.nanog.org/sites/default/files/monday.general.hanks.multistage.10.pdf

ECMP CLOS L3 Issues Managing scale: #s of IPs, subnets, VLANs, Cables Monitoring Routed L3 network: Reqs dynamic OSPF routing (100 s routes per switch) No L2 between switches (VMware: SAN s, vmotion) Reqs: DHCP Relay, VXLAN Complex traceroute seen by users.

IP and Subnet Management

Subnet / IP Management 2x /21 Panasas Storage 4x /24 Internet Connects 55x /26 Fabric Subnets 264x /30 Inter switch links.. 400 VMs & Growing quickly 288 Servers 2,244 Storage Blades 5,000 IPs & Growing Another 1,000 this month! ~260 VLAN IDs

Monitoring / Visualisation Complex Cacti >30 Fabric Switches >50 Mgmt Switches 100 s links to monitor Nagios bloat

Need for VXLAN 24 hosts per L2 switch L2 subnets differ per switch No switch to switch vmotion http://crankypotato.com/?p=598/

VXLAN ESXi MultiCast PIM IGMP Snooping MTU 50 Byte overhead VMs no MTU 9000 IP Storage ESXi Routing ESXi Auto deploy DHCP No Panasas Mounts http://crankypotato.com/?p=598/

JASMIN Future Expansion?? Or 3 Tier CLOS 4x 100Gb Fabric Links?? 30-80PB on Disk by 2020 (Demand for 300PB) Reqs > 2-3K 10GbE ports (more likely 20Gb or 40Gb) Standards based OSPF, ECMP, L3 Needs automation / SDN to manage.

Terabit Networking with JASMIN What is JASMIN? Why is it needed? 3 year Growth Pains Network (re)design #3 Criteria Network Design Issues ECMP CLOS Architecture Advantages and Disadvantages VXLAN JASMIN Future Expansion

Questions? Contact: jonathan.churchill@stfc.ac.uk http://www.stfc.ac.uk/scd/ LinkedIn