MongoDB on Kaminario K2

Similar documents
Functional Testing of SQL Server on Kaminario K2 Storage

Performance Testing of SQL Server on Kaminario K2 Storage

A Kaminario Reference Architecture: Reference Architecture for Running SQL Server on ESXi

Accelerated Application Development

Achieving SQL Server High Availability with Kaminario K2

Accelerating SAP Performance and Scalability with Kaminario K2

Four Steps to Unleashing The Full Potential of Your Database

Discover the all-flash storage company for the on-demand world

HPE Nimble Storage HF20 Adaptive Dual Controller 10GBASE-T 2-port Configure-to-order Base Array (Q8H72A)

Managed Service Providers

Dell EMC All-Flash solutions are powered by Intel Xeon processors. Learn more at DellEMC.com/All-Flash

FLASHARRAY//M Business and IT Transformation in 3U

MODERNISE WITH ALL-FLASH. Intel Inside. Powerful Data Centre Outside.

Hyper-Convergence De-mystified. Francis O Haire Group Technology Director

HCI: Hyper-Converged Infrastructure

SQL Server Backup and Restore using Kaminario K2 Snapshots

FLASHARRAY//M Smart Storage for Cloud IT

Why Datrium DVX is Best for VDI

Ten things hyperconvergence can do for you

IOmark- VDI. IBM IBM FlashSystem V9000 Test Report: VDI a Test Report Date: 5, December

Nutanix Tech Note. Virtualizing Microsoft Applications on Web-Scale Infrastructure

SolidFire and Ceph Architectural Comparison

REFERENCE ARCHITECTURE Microsoft SQL Server 2016 Data Warehouse Fast Track. FlashStack 70TB Solution with Cisco UCS and Pure Storage FlashArray//X

ECONOMICAL, STORAGE PURPOSE-BUILT FOR THE EMERGING DATA CENTERS. By George Crump

Upgrade to Microsoft SQL Server 2016 with Dell EMC Infrastructure

Veritas NetBackup on Cisco UCS S3260 Storage Server

Copyright 2012 EMC Corporation. All rights reserved.

A High-Performance Storage and Ultra- High-Speed File Transfer Solution for Collaborative Life Sciences Research

Accelerate Database Performance and Reduce Response Times in MongoDB Humongous Environments with the LSI Nytro MegaRAID Flash Accelerator Card

Dell Fluid Data solutions. Powerful self-optimized enterprise storage. Dell Compellent Storage Center: Designed for business results

EMC Virtual Infrastructure for Microsoft Exchange 2010 Enabled by EMC Symmetrix VMAX, VMware vsphere 4, and Replication Manager

EMC Backup and Recovery for Microsoft Exchange 2007 SP1. Enabled by EMC CLARiiON CX4-120, Replication Manager, and VMware ESX Server 3.

Software-defined Storage: Fast, Safe and Efficient

Comparison of Storage Protocol Performance ESX Server 3.5

FLASHARRAY AT A GLANCE

IBM Spectrum Protect Plus

Veeam Availability Solution for Cisco UCS: Designed for Virtualized Environments. Solution Overview Cisco Public

THE SUMMARY. CLUSTER SERIES - pg. 3. ULTRA SERIES - pg. 5. EXTREME SERIES - pg. 9

TPC-E testing of Microsoft SQL Server 2016 on Dell EMC PowerEdge R830 Server and Dell EMC SC9000 Storage

HPE SimpliVity 380. Simplyfying Hybrid IT with HPE Wolfgang Privas Storage Category Manager

SolidFire and Pure Storage Architectural Comparison

Emulex LPe16000B 16Gb Fibre Channel HBA Evaluation

How Flash Storage is Changing the Economics of SaaS Businesses

EMC Integrated Infrastructure for VMware. Business Continuity

EMC XTREMCACHE ACCELERATES VIRTUALIZED ORACLE

The storage challenges of virtualized environments

SUN ZFS STORAGE APPLIANCE

Evaluation Report: HP StoreFabric SN1000E 16Gb Fibre Channel HBA

VMware vsan 6.6. Licensing Guide. Revised May 2017

W H I T E P A P E R. Comparison of Storage Protocol Performance in VMware vsphere 4

FlashArray//m. Business and IT Transformation in 3U. Transform Your Business. All-Flash Storage for Every Workload.

IOmark- VM. IBM IBM FlashSystem V9000 Test Report: VM a Test Report Date: 5, December

StorMagic SvSAN 6.1. Product Announcement Webinar and Live Demonstration. Mark Christie Senior Systems Engineer

NetApp AFF. Datasheet. Leading the future of flash

SUN ZFS STORAGE APPLIANCE

SolidFire. Petr Slačík Systems Engineer NetApp NetApp, Inc. All rights reserved.

HPE Synergy HPE SimpliVity 380

Accelerating Microsoft SQL Server 2016 Performance With Dell EMC PowerEdge R740

Data Protection for Cisco HyperFlex with Veeam Availability Suite. Solution Overview Cisco Public

Hedvig as backup target for Veeam

NetApp HCI QoS and Mixed Workloads

HPE SimpliVity. The new powerhouse in hyperconvergence. Boštjan Dolinar HPE. Maribor Lancom

Webinar Series: Triangulate your Storage Architecture with SvSAN Caching. Luke Pruen Technical Services Director

Software-defined Shared Application Acceleration

Lenovo Validated Designs

The Data-Protection Playbook for All-flash Storage KEY CONSIDERATIONS FOR FLASH-OPTIMIZED DATA PROTECTION

Introducing HPE SimpliVity 380

Nimble Storage Adaptive Flash

SQL Server 2017 for your Mission Critical applications

How Architecture Design Can Lower Hyperconverged Infrastructure (HCI) Total Cost of Ownership (TCO)

Quest DR Series Disk Backup Appliances

Veritas Access. Installing Veritas Access in VMWare ESx environment. Who should read this paper? Veritas Pre-Sales, Partner Pre-Sales

NetApp AFF A700 Performance with Microsoft SQL Server 2014

EMC Virtual Infrastructure for Microsoft Applications Data Center Solution

PERFORMANCE CHARACTERIZATION OF MICROSOFT SQL SERVER USING VMWARE CLOUD ON AWS PERFORMANCE STUDY JULY 2018

Cisco HyperFlex HX220c M4 and HX220c M4 All Flash Nodes

EMC Backup and Recovery for Microsoft SQL Server

Fujitsu PRIMEFLEX for VMware vsan 20,000 User Mailbox Exchange 2016 Mailbox Resiliency Storage Solution

OpenVMS Storage Updates

Microsoft SQL Server HA and DR with DVX

IBM Emulex 16Gb Fibre Channel HBA Evaluation

DELL EMC DATA DOMAIN OPERATING SYSTEM

Virtual Desktop Infrastructure (VDI) Bassam Jbara

How Composable Storage will Define the Economics of the SaaS Business

EMC CLARiiON CX3-80. Enterprise Solutions for Microsoft SQL Server 2005

São Paulo. August,

SurFS Product Description

VMWARE VSAN LICENSING GUIDE - MARCH 2018 VMWARE VSAN 6.6. Licensing Guide

2014 VMware Inc. All rights reserved.

Hitachi Virtual Storage Platform Family

Microsoft SQL Server in a VMware Environment on Dell PowerEdge R810 Servers and Dell EqualLogic Storage

Flash In the Data Center

Pivot3 Acuity with Microsoft SQL Server Reference Architecture

VxRail: Level Up with New Capabilities and Powers GLOBAL SPONSORS

Cisco HyperFlex HX220c M4 and HX220c M4 All Flash Nodes

Accelerating Enterprise Search with Fusion iomemory PCIe Application Accelerators

High performance and functionality

THESUMMARY. ARKSERIES - pg. 3. ULTRASERIES - pg. 5. EXTREMESERIES - pg. 9

Microsoft Exchange Server 2010 workload optimization on the new IBM PureFlex System

FlashStack 70TB Solution with Cisco UCS and Pure Storage FlashArray

Transcription:

MongoDB on Kaminario K2 June 2016 Table of Contents 2 3 3 4 7 10 12 13 13 14 14 Executive Summary Test Overview MongoPerf Test Scenarios Test 1: Write-Simulation of MongoDB Write Operations Test 2: Write-Simulation of MongoDB Read & Write Operations Data Reduction K2 s Data Protection Native Backup and Replication Configuration Details Hardware Configurations Array, Servers MongoDB Virtual Machine Software Configurations

Executive Summary MongoDB is the leading NoSQL database that provides agility and the freedom to scale. It is widely used by a wide range of businesses to support the infrastructure for modern applications and environments where relational databases cannot deliver the agility, flexibility and scalability of various types of data. As such, MongoDB also powers companies that provide software-as-a-service (SaaS) where scalability is a must. The Kaminario K2 all-flash array is the perfect storage infrastructure to match the characteristics of MongoDB. The K2 is the leading all-flash array (AFA) for modern technologies such as MongoDB, allowing applications and services based on MongoDB to be deployed faster, dramatically reducing the time from development to production which directly impacts business productivity. The K2 is the only on-premises shared storage infrastructure that can match MongoDB s scalability and flexibility, with a unique storage architecture that can both scale-up and scale-out. While serving all the storage needs of the business, the K2 is capable of delivering high performance and consistent latency for mixed type of workloads simultaneously. The latency remains consistently low as the K2 scales, while IOPS and throughput grow linearly. MongoDB-based applications are guaranteed to get the best performance possible with predictability and consistency for all use-cases. K2 s feature-rich storage array has native snapshots which allow instant creation of additional production-like environments, a robust K-RAID scheme that ensures data is available at all times and inline data-reduction features such as deduplication and compression that offloads operations from MongoDB servers to the storage. Kaminario offers an all-flash array which is more cost-efficient than HDD based or hybrid arrays. This paper describes a set of tests that demonstrate how well MongoDB works with the K2 all-flash storage array. The tests include performance benchmarks, compression algorithms comparison and the use of K2 s native snapshots with MongoDB. 2

Test Overview In order to show the advantages of using MongoDB with Kaminario K2, we performed a set of tests using Mongoperf, an official MongoDB utility to check disk I/O performance. In addition, MongoDB implementations can benefit from different Kaminario K2 capabilities, another set of tests were done to show a proven proof to that. MongoPerf Test Scenarios Mongoperf has 2 different options of workloads 1. Write A MongoDB simulation of writes operations 2. Mixed- A MongoDB simulation of read and writes operations Each Mongoperf test is constructed of two phases: 1. Inflation of the test file 2. Simulating MongoDB like operations on the test file In both workloads we used a 1TB file to run the Mongoperf utility. After the inflation of the file, we continued running those operations for more than 1 hour to demonstrate consistency. The mmf parameter was set to false in order to simulate direct work with the storage without using the memory mapped file option. 3

Test 1: Write-Simulation of MongoDB Write Operations At this test we run Mongoperf with the next set of parameters: { } nthreads:16, filesizemb:1000000, mmf:false, r:false, w:true, recsizekb:4 Phase 1: Inflation of a 1 TB test file Inflation of a 1TB test file took ~8 minutes while the K2 provided average throughput of 2.25GB/sec, IOPS an average of 4.5K and latency at an average of 0.8ms, as can be seen in figure 1. Figure 1: K2 GUI during phase 1 of test 1 4

Phase 2: Simulating MongoDB like operations As seen in figure 2, during the entire test run the K2 demonstrated consistent performance with very low latency at an average of 0.1ms. Mongoperf write scenario produces 4K random I/O that consists of 100% writes. Throughput during this phase is of 200MB/sec and average of 51,000 IOPS. Figure 2: K2 GUI during phase 2 test 1 From the Mongoperf output, we can see that even after an hour of running the test, performance is consistent with over 50,000 operations/sec, as is evident from figure 3. Figure 3: Mongoperf output 5

The following GUI output captures both phases: Figure 4: K2 GUI during both phases of test 1 As seen, K2 provided consistent performance during both phases of the test and all with very low latency. 6

Test 2: Write Simulation of MongoDB Read & Write Operations At this test we run Mongoperf with the next set of parameters: { } nthreads:16, filesizemb:1000000, mmf:false, r:true, w:true, recsizekb:4 Phase 1: Inflation of a 1 TB test file Same as in the write test, the inflation of the 1TB test file took ~8 minutes while the K2 provided average throughput of 2.25GB/sec, IOPS averaged at 4.5K and latency at the average of 0.8ms as can be seen in figure 5. Figure 5: K2 GUI during phase 1 test 2 7

Phase 2: Simulating MongoDB like operations The second phase is no different from the first phase of the first test described previously. K2 demonstrated consistent performance with very low latency of under 0.2ms. Mongoperf produced read and write I/O. The I/O size is still the same with random 4K I/O but the read and write percentage is of 50% each. The average throughput during this phase is an average of 175MB/sec and IOPS running at the average of 44,000. Figure 6: K2 GUI during phase 2 test 2 From the Mongoperf output, we can see that the operations/sec were consistent for over an hour, with a slight increase of 5% up to 45,000 operations/sec, as can be seen in figure 7. Figure 7: Mongoperf output 8

Similar to the first test, and as can be seen in figure 8, K2 provided consistent performance during both phases of the test and again with very low latency. Figure 8: K2 GUI during both phases of test 9

Data Reduction One of the benefits MongoDB has when using Kaminario K2 as its storage is getting a built-in always on inline compression capability. MongoDB 3.2 has a built-in compression mechanism as part of its default storage engine of WiredTiger. WiredTiger storage engine has 3 different compression levels: 1. None MongoDB is not compressing the data at all 2. Snappy MongoDB s default compression level. MongoDB uses Google s Snappy compression/decompression library which provides lower compression rate but has little performance cost. 3. Zlib zlib compression is a software library used for data compression. It is a compression component used in software platforms as Linux, MAC OS X and ios. Zlib compression provides a high compression rate but has a high performance cost as well. A series of tests were conducted to check how K2 s inline compression compresses a typical MongoDB database and if K2 is able to compress the data even more after applying MongoDB different compression options. For these set of tests we used a customer database at the size of 212GB and imported it to 3 different mount points (3 different LUNs in K2): 1. Imported the database with the parameter wiredtigercollectionblockcompressor none 2. Imported the database with the parameter wiredtigercollectionblockcompressor snappy 3. Imported the database with the parameter wiredtigercollectionblockcompressor zlib The results are shown in figures 9 and 10 below: Figure 9: K2 GUI - Volumes (LUNs) view Figure 10: Linux output of mount points and sizes 10

In order to understand the physical size of each LUN, using the K2 CLI we can check the data reduction ratio of each LUN in the system: Figure 9: K2 GUI - Volumes (LUNs) view As seen, Kaminario K2 native compression is able to compress even more an already highly compressed database as the zlib compression. The next table shows actual compression ratios and the real physical size of each LUN in K2: K2 Compression Specifications Database Size (GB) Compression Ratio Physical Capacity (GB) No Compression 212 3.7:1 57.3 Snappy Compression 42 1.5:1 28 Zlib Compression 18 1.2:1 15 To conclude, the K2 is able to gain even more data reduction on top of MongoDB s compression methods. In addition, K2 s native deduplication further reduces the physical footprint to achieve the most cost efficient all-flash array. 11

K2 s Data Protection, Native Backup and Replication K-RAID The K2 all-flash array has a unique dual-parity RAID scheme K-RAID. It allows losing up to three SSDs within a single SSD shelf while still keeping the data intact. One of the core advantages of using the K2 for MongoDB deployments is the resilience it offers, reducing the need to create replica-sets thus further making a MongoDB deployment more cost efficient when using the K2 for its storage, and making it easy to plan the scale of MongoDB without having to plan for storage. Native Snapshots Data is always in risk to hazards from viruses, malware etc. but in most cases it is human error that might corrupt data. K2 s native snapshots feature allows taking thousands of point-in-time copies of the data, with no capacity overhead. Achieving application-consistent snapshots with MongoDB and K2 is accomplished with K2 s RESTful API platform: 1. Connect to MongoDB and request the instance to flush all pending write operations to the disk and lock the database to further writes (fsynclock()) 2. Connect to the K2 array and take a snapshot of the MongoDB LUNs. This operation takes tens of milliseconds, regardless the number of LUNs or their size. 3. After the completion of the snapshot operation, connect to MongoDB and unlock the database (fsyncunlock()). K2 s snapshots make it easy to create up-to-date copies for environments such as QA, Dev-Test and more, giving them the ability to run on the same all-flash storage with the consistency and predictable performance that comes along but with no added cost of capacity. In order to be able to take a consistent application snapshot, MongoDB storage engine should run with journaling enabled. Native Replication Based on K2 s native snapshots, K2 s native replication feature enable customers to replicate applications-consistent copies of data to remote sites. This allows increased data protection in the scenario of a disaster, in addition to having all sites working on the most up-to-date production copy. 12

Configuration Details Hardware Configurations Arrays, Servers The environment for testing consists of a dual K-Block K2 array. A K-Block is the building block of the K2 AFA, each K-Block consists of 2 active/active controllers with one or more SSD shelves. In this test environment, one shelf was used per K-Block. An out-of-box configuration was used for the K2 array and no tuning was performed for any of the specific tests. This document describes different set of tests that required 2 different set of volumes: 1. 1 volume of 1.5TB was configured in a non-deduplication volume-group for mongoperf tests 2. 3 different volumes in a non-deduplication volume-group for the compression tests a. 250GB LUN for non-compressed MongoDB database b. 150GB LUN for Snappy MongoDB database c. 150GB LUN for zlib MongoDB database All volumes were mapped to a cluster of two (2) ESXi 5.5U2 hosts and connected as RDM disks to a MongoDB virtual machine. ESXi configuration: Supermicro 12 X 2.499 GHz Intel Xeon E5-2640 CPUs for a total of 24 hyperthreaded cores 256 GB of memory Two Dual port + one Quad Port 8 GB QLogic FC HBA used to connect to a dual K-Block K2 array through two redundant 48 Ports Brocade FC Switch. 13

MongoDB Virtual Machine Attribute OS Specification CentOS Linux release 7.1.1503 (Core) vcpu 12 vmemory Virtual SCSI Controller 0 (OS) Virtual Disk (OS) VMDK Virtual SCSI Controller 1 Virtual Disk (MongoPerf) RDM Virtual SCSI Controller 2 Virtual Disk (MongoDB No Compression) RDD Virtual Disk (MongoDB Snappy) RDM Virtual Disk (MongoDB zlib) RDM Virtual CD/DVD/Floppy Drives Installed Applications 8,192 MB LSI Logical Parallel 40 GB VMware Paravirtual 1.5 TB VMware Paravirtual 250 GB 150 GB 150 GB Removed MongoDB DB Version v3.2.3 Git Version: b326ba837cf6f49d65c2f85e1b70f6f31ece7937 OpenSSL version: OpenSSL 1.0.1e-fips 11 Feb 2013 Allocator: tcmalloc Modules: Enterprise Build Environment: Distmod: rhel70 Distarch: x86_64 Target_arch: x86_64 Software Configurations No additional software was installed on the MongoDB virtual machine except of the database itself. The same database engine with the same database schema were used to test the MongoDB data reduction tests on the Kaminario K2. An XFS filesystem was chosen to accommodate the MongoDB data files. 14

About MongoDB MongoDB makes development simple and beautiful. For tens of thousands of organizations, MongoDB provides agility and the freedom to scale. Fortune 500 enterprises, startups, hospitals, governments and organizations of all kinds use MongoDB because it is the best database for modern applications. Through simplicity, MongoDB changes what it means to build. Through openness, MongoDB elevates what it means to work with a software company. About Kaminario Contact Contact a business development representative to answer any questions you may have. Schedule a Demo Schedule a demo with an engineer and learn if Kaminario s solution works for you. Kaminario, the leading all-flash storage company, is redefining the future of modern data centers. Its unique solution enables organizations to succeed in today s on-demand world and prepares them to seamlessly handle tomorrow s innovations. Only Kaminario K2 delivers the agility, scalability, performance and economics a data center requires to deal with today s cloud-first, dynamic world and provide real-time data access -- anywhere, anytime. Hundreds of customers rely on the Kaminario K2 all-flash array to power their mission critical applications and safeguard their digital ecosystem. Headquartered in Needham, MA, Kaminario works with an extensive network of resellers and distributors, globally. Request a Quote Request a quote for your application from our business development team. For more information, visit www.kaminario.com Kaminario, the Kaminario logo and SPEAR are registered trademarks of Kaminario Inc. K-RAID, Perpetual Array and HealthShield are trademarks of Kaminario, Inc. Product specifications and performance are subject to change without notice. 2016 Kaminario All Rights Reserved DOC_100059_00 15