Kubernetes for Stateful Workloads Benchmarks

Similar documents
Fast and Easy Persistent Storage for Docker* Containers with Storidge and Intel

Nutanix Tech Note. Virtualizing Microsoft Applications on Web-Scale Infrastructure

EsgynDB Enterprise 2.0 Platform Reference Architecture

Robin Systems: Container-based Virtualization for Databases and Data-centric applications

Hyper-Convergence De-mystified. Francis O Haire Group Technology Director

Falling Out of the Clouds: When Your Big Data Needs a New Home

IaaS Vendor Comparison

IOmark- VM. HP HP ConvergedSystem 242- HC StoreVirtual Test Report: VM- HC b Test Report Date: 27, April

ROBIN SYSTEMS. Containerizing Oracle: Not Thinking About It Yet? You Should Be!!!

Benefits of Multi-Node Scale-out Clusters running NetApp Clustered Data ONTAP. Silverton Consulting, Inc. StorInt Briefing

Modernize Your Backup and DR Using Actifio in AWS

VMware Virtual SAN Technology

Hyper-Converged Infrastructure: Providing New Opportunities for Improved Availability

Hewlett Packard Enterprise HPE GEN10 PERSISTENT MEMORY PERFORMANCE THROUGH PERSISTENCE

ReDefine Enterprise Storage

Four Steps to Unleashing The Full Potential of Your Database

Running Databases in Containers.

AWS Storage Gateway. Amazon S3. Amazon EFS. Amazon Glacier. Amazon EBS. Amazon EC2 Instance. storage. File Block Object. Hybrid integrated.

Copyright 2018 Dell Inc.

Copyright 2012 EMC Corporation. All rights reserved.

Cloud Acceleration. Performance comparison of Cloud vendors. Tobias Deml DOAG2017

Modern hyperconverged infrastructure. Karel Rudišar Systems Engineer, Vmware Inc.

FIVE REASONS YOU SHOULD RUN CONTAINERS ON BARE METAL, NOT VMS

DELL EMC TEST DRIVE. Build Confidence and Close More Deals EXPLORE TEST DRIVES BY PRODUCT

Best Practices for Deploying Hadoop Workloads on HCI Powered by vsan

Virtual SAN and vsphere w/ Operations Management

How to Keep UP Through Digital Transformation with Next-Generation App Development

HCI: Hyper-Converged Infrastructure

Springpath Data Platform

NoSQL Databases MongoDB vs Cassandra. Kenny Huynh, Andre Chik, Kevin Vu

Data Protection for Cisco HyperFlex with Veeam Availability Suite. Solution Overview Cisco Public

State of OpenShift on Bare Metal

Modernize Without. Compromise. Modernize Without Compromise- All Flash. All-Flash Portfolio. Haider Aziz. System Engineering Manger- Primary Storage

Agenda. AWS Database Services Traditional vs AWS Data services model Amazon RDS Redshift DynamoDB ElastiCache

Aerospike Scales with Google Cloud Platform

HyperFlex. Simplifying your Data Center. Steffen Hellwig Data Center Systems Engineer June 2016

VxRack SDDC Deep Dive: Inside VxRack SDDC Powered by VMware Cloud Foundation. Harry Meier GLOBAL SPONSORS

NetApp HCI QoS and Mixed Workloads

New HPE 3PAR StoreServ 8000 and series Optimized for Flash

Workspace & Storage Infrastructure for Service Providers

Storage Designed to Support an Oracle Database. White Paper

Consider Hyperconverged Infrastructure

The intelligence of hyper-converged infrastructure. Your Right Mix Solution

MongoDB on Kaminario K2

Presented by Nanditha Thinderu

Mission-critical Workload Performance Testing of Different Hyperconverged Approaches on the Cisco Unified Computing System Platform (UCS)

Cisco HyperFlex Deep Dive and Performance

Introduction to Database Services

Rok: Decentralized storage for the cloud native world

DriveScale-DellEMC Reference Architecture

70-247: Configuring and Deploying a Private Cloud with System Center 2012

A scalability comparison study of data management approaches for smart metering systems

Today s trends in the storage world. Jacint Juhasz Storage Infrastructure Architect

Why Datrium DVX is Best for VDI

Sheldon D Paiva, Nimble Storage Nick Furnell, Transform Medical

IOmark- VM. IBM IBM FlashSystem V9000 Test Report: VM a Test Report Date: 5, December

Data Movement & Tiering with DMF 7

Cloud Meets Big Data For VMware Environments

What is Dell EMC Cloud for Microsoft Azure Stack?

MidoNet Scalability Report

Identifying Performance Bottlenecks with Real- World Applications and Flash-Based Storage

Virtualized SQL Server Performance and Scaling on Dell EMC XC Series Web-Scale Hyper-converged Appliances Powered by Nutanix Software

Running VMware vsan Witness Appliance in VMware vcloudair First Published On: April 26, 2017 Last Updated On: April 26, 2017

HPE Strategy for VMware Cloud Foundation

Nutanix White Paper. Hyper-Converged Infrastructure for Enterprise Applications. Version 1.0 March Enterprise Applications on Nutanix

Virtual Desktop Infrastructure (VDI) Bassam Jbara

2014 VMware Inc. All rights reserved.

Lane Levere( Sr. Systems Engineer

Adapted from: TRENDS AND ATTRIBUTES OF HORIZONTAL AND VERTICAL COMPUTING ARCHITECTURES

IOmark- VDI. IBM IBM FlashSystem V9000 Test Report: VDI a Test Report Date: 5, December

Certified Reference Design for VMware Cloud Providers

Virtuozzo Hyperconverged Platform Uses Intel Optane SSDs to Accelerate Performance for Containers and VMs

Accelerating Big Data: Using SanDisk SSDs for Apache HBase Workloads

Next-Generation Cloud Platform

Deploying and Managing Dell Big Data Clusters with StackIQ Cluster Manager

TPC-E testing of Microsoft SQL Server 2016 on Dell EMC PowerEdge R830 Server and Dell EMC SC9000 Storage

All-Flash High-Performance SAN/NAS Solutions for Virtualization & OLTP

Modernize with all-flash

ELASTIC DATA PLATFORM

RED HAT CEPH STORAGE ROADMAP. Cesar Pinto Account Manager, Red Hat Norway

Mission-Critical Databases in the Cloud. Oracle RAC in Microsoft Azure Enabled by FlashGrid Software.

Pivot3 Acuity with Microsoft SQL Server Reference Architecture

Key to A Successful Exadata POC

Deep Dive on SimpliVity s OmniStack A Technical Whitepaper

Real-Time & Big Data GIS: Best Practices. Josh Joyner Adam Mollenkopf

Why Converged Infrastructure?

Oracle Real Application Clusters (RAC) 12c Release 2 What s Next?

Optimizing the Data Center with an End to End Solutions Approach

Cloud Storage with AWS: EFS vs EBS vs S3 AHMAD KARAWASH

Architecture of a Real-Time Operational DBMS

On BigFix Performance: Disk is King. How to get your infrastructure right the first time! Case Study: IBM Cloud Development - WW IT Services

QLIK INTEGRATION WITH AMAZON REDSHIFT

Big Data analytics in insurance

The age of Big Data Big Data for Oracle Database Professionals

Cisco Container Platform

Virtualizing Oracle on VMware

EMC XTREMCACHE ACCELERATES VIRTUALIZED ORACLE

Lenovo Software Defined Infrastructure Solutions. Aleš Simončič Technical Sales Manager, Lenovo South East Europe

Flash Storage Complementing a Data Lake for Real-Time Insight

Configuring and Deploying a Private Cloud DURATION: Days

Transcription:

Kubernetes for Stateful Workloads Benchmarks Baremetal Like Performance for For Big Data, Databases And AI/ML Executive Summary Customers are actively evaluating stateful workloads for containerization and management using Kubernetes but have not been able to find adequate solutions to get baremetal performance for data-heavy applications. The results in a recent CIO survey highlighted the confidence in Kubernetes as a viable platform for stateful applications. Robin completed the below performance, scale and key workload-specific benchmark for containerized Stateful workloads that are managed using Kubernetes. Performance and Scale are the key parameters for CIOs in making a choice for modernizing their environment data infrastructure. Storage Performance The first parameter that the customers consider when containerizing data-intensive applications is the storage performance of an application. This benchmark showcases that Robin delivers near baremetal like performance when compared to baremetal. Using Robin entails gaining multiple beneficial features such as consolidation of multiple apps on fewer resources or within the same server, cost savings with better hardware utilization, and an addition of a host of features such as snapshots, cloning, backup, scale and upgrade with 1-click. Packaging more applications within the same hardware resources without performance degradation is critical to large enterprises that use big data, databases, and AI/ML apps across physical and cloud environments. Robin enables automonitoring resource allocation across on-premise and public cloud depending on the needs of the applications. Robin recalibrates the resource allocation as and when needed.

Environment Common environment details used shown below. Tool Used Vdbench Command Used [ john@server01]# vdbench -f confg io=seq random bs=16k 32k 64k 128k rw=0 100 rate=max etime=60000 thr=8 lun1=/dev/ sdx Workloads Sequential and random workloads SSD Benchmark Below are the test environment details within which the tests were performed. SSD Device : ata-samsung_mz7km960hahp-00005 Memory : 16GB CPU : 4 cores SSD Test Results I/O Type Sequential Write Sequential Read Random Write Random Read Block Size 16k 32k 64k 128k 16k 32k 64k 128k 16k 32k 64k 128k 16k 32k 64k 128k Robin 434.0 463.4 478.9 491.7 487.7 507.7 519.5 524.7 425.3 455.8 477.5 488.9 477.7 494.1 500.4 501.9 HDD Benchmark Below are the test environment details within which the tests were performed. HDD Device : ATA- HGST_HUS724020AL Memory : 16GB CPU : 4 cores HDD Test Results I/O Type Sequential Write Sequential Read Random Write Random Read Block Size 16k 32k 64k 128k 16k 32k 64k 128k 16k 32k 64k 128k 16k 32k 64k 128k Robin 109.7 115.8 121.0 121.2 153.7 153.7 152.7 153.8 2.4 4.6 9.2 15.7 4.4 8.5 16.0 28.4

Application Workload Performance The earlier section shows that Robin offers near baremetal device performance. Now, at the Application level, we chose SQL and NoSQL applications to showcase that Robin is inline with bare-metal application performance. Stateful application or workloads have unique infrastructure requirements and Robin brings to the table a host of features that help manage the enterprise environment in an efficient way with near bare-metal performance. Oracle SLOB Tests were performed with different replication factors with Robin Hyper Converged Kubernetes platform. The benchmark results show that even with application protection enabled using Robin replication factor, the performance is near bare-metal. Test Environment Number of nodes : 3 Memory : 128GB/Node CPU : 20 cores/node Oracle SLOB Test Results All data is from AWR reports DB Size: 50GB Test duration: 2 hrs SSD Baremetal Robin Replication-1 Robin Replication-2 Robin Replication-3 IOPS MB/s IOPS MB/s IOPS MB/s IOPS MB/s 100% Write 6436.90 75.2 5872.10 69.6 5156.10 67.5 4982.9 65.2 100% Read 47158.40 368.6 46401.20 362.6 46419.0 363.1 46446.50 363 OLTP, 70% Reads, 30% Writes 9521.7 96.3 10640.8 107.6 9902.4 102.2 9450.1 99.2 HDD Baremetal Robin Replication-1 Robin Replication-2 Robin Replication-3 IOPS MB/s IOPS MB/s IOPS MB/s IOPS MB/s 100% Write 26.1 4.1 510.1 7.1 497.6 6.9 499.6 7 100% Read 290.3 2.3 310.1 2.5 309.8 2.5 306.4 2.5 OLTP, 70% Reads, 30% Writes 269.1 3.3 353.8 4.1 353.8 4.0 343.8 3.9

Cassandra YCSB Results : Tests were performed processing 1 Billion records with different workloads, on a 3 node setup using Robin Hyper Converged Kubernetes platform. The benchmark results show the performance is near baremetal even with such a huge amount of data. Test Environment: Number of nodes : 3 Memory : 128GB/Node CPU : 20 cores/node Cassandra YCSB run Test Results YCSB Benchmark for Cassandra (1 Billion Records) Workload Type #Records Cassandra Replication OPS Baremetal Robin Workload A LOAD 100000000(1B) 3 78150 77762 Workload A RUN 100000000(1B) 3 77727 75954 Workload B RUN 100000000(1B) 3 62074 64743 Workload C RUN 100000000(1B) 3 89114 86217 Workload D RUN 100000000(1B) 3 88049 85264 Workload E RUN 100000000(1B) 3 79612 77104

Application Scaling Scaling to meet the demand is a fundamental requirement for cloud native applications and this need is a complex one to meet in the case of distributed or stateful applications. ROBIN is architected to scale horizontally with ease. Bringing up data-heavy application clusters is a huge effort and takes a lot of time - hardware, software installation, cluster configuration, and performance tuning - A repetitive cycle for each cloudera installation - As opposed to this Robin was able to bring up a 500 node cluster in less than one hour - bringing the time-line down from many weeks to 54 minutes. Application Scaling - Cloudera To showcase the high level of scalability with Robin, we brought up a Cloudera cluster with 500 nodes. # of application : 1 # of cloudera nodes : 500 491 DataNodes 1 Gateway Node 2 Name Nodes 1 CM Server 3 zookeepers 1 node for utilities 1 node for hive Total cores : 6400 configured Memory : 47.65 TiB Storage configured 1.45 PiB 100 physical servers used App creation time : 54 mins Cloudera Version: CDH-5.12.0