Forget IOPS: A Proper Way to Characterize & Test Storage Performance Peter Murray SwiftTest

Similar documents
Flash vs. Disk Storage: Testing Workloads is Key

Storage validation at GoDaddy Best practices from the world s #1 web hosting provider

Load Dynamix Enterprise 5.2

Getting it Right: Testing Storage Arrays The Way They ll be Used

Storage Performance Validation for Panzura

Considerations to Accurately Measure Solid State Storage Systems

NAS for Server Virtualization Dennis Chapman Senior Technical Director NetApp

PRESENTATION TITLE GOES HERE

Oracle IaaS, a modern felhő infrastruktúra

Nutanix Tech Note. Virtualizing Microsoft Applications on Web-Scale Infrastructure

What is QES 2.1? Agenda. Supported Model. Live demo

Webinar Series: Triangulate your Storage Architecture with SvSAN Caching. Luke Pruen Technical Services Director

Best Practices for Validating the Performance of Data Center Infrastructure. Henry He Ixia

How Scalable is your SMB?

Introducing Tegile. Company Overview. Product Overview. Solutions & Use Cases. Partnering with Tegile

Session 201-B: Accelerating Enterprise Applications with Flash Memory

vsan 6.6 Performance Improvements First Published On: Last Updated On:

Paperspace. Architecture Overview. 20 Jay St. Suite 312 Brooklyn, NY Technical Whitepaper

Flashed-Optimized VPSA. Always Aligned with your Changing World

Testing All-Flash-Arrays with WorkloadWisdom

Enterprise Architectures The Pace Accelerates Camberley Bates Managing Partner & Analyst

Solid State Storage is Everywhere Where Does it Work Best?

Enabling SSD and Flash for Virtualized Environments. Vikram Joshi, Co-founder and CTO

LEVERAGING FLASH MEMORY in ENTERPRISE STORAGE

FlashGrid Software Enables Converged and Hyper-Converged Appliances for Oracle* RAC

IOmark- VM. IBM IBM FlashSystem V9000 Test Report: VM a Test Report Date: 5, December

Citrix XenServer 7.1 Feature Matrix

IBM Spectrum NAS. Easy-to-manage software-defined file storage for the enterprise. Overview. Highlights

Understanding Data Locality in VMware vsan First Published On: Last Updated On:

HPE SimpliVity. The new powerhouse in hyperconvergence. Boštjan Dolinar HPE. Maribor Lancom

Designing Next Generation FS for NVMe and NVMe-oF

SoftNAS Cloud Performance Evaluation on Microsoft Azure

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

Deep Learning Performance and Cost Evaluation

Decoupling Datacenter Studies from Access to Large-Scale Applications: A Modeling Approach for Storage Workloads

HCI: Hyper-Converged Infrastructure

The Fastest And Most Efficient Block Storage Software (SDS)

How Flash-Based Storage Performs on Real Applications Session 102-C

IOmark- VM. HP MSA P2000 Test Report: VM a Test Report Date: 4, March

Identifying Performance Bottlenecks with Real- World Applications and Flash-Based Storage

Hybrid Storage Architecture Marries Performance and Efficiency

SoftNAS Cloud Performance Evaluation on AWS

Next-Generation NVMe-Native Parallel Filesystem for Accelerating HPC Workloads

IOmark- VM. HP HP ConvergedSystem 242- HC StoreVirtual Test Report: VM- HC b Test Report Date: 27, April

Warsaw. 11 th September 2018

Nimble Storage Adaptive Flash

Hypervisors and Server Flash Satyam Vaghani

Hedvig as backup target for Veeam

Isilon Performance. Name

Red Hat Gluster Storage performance. Manoj Pillai and Ben England Performance Engineering June 25, 2015

Software Defined Storage at the Speed of Flash. PRESENTATION TITLE GOES HERE Carlos Carrero Rajagopal Vaideeswaran Symantec

What s New in VMware vsphere 4.1 Performance. VMware vsphere 4.1

VirtualWisdom Performance Probes are VirtualWisdom Fibre Channel SAN Performance Probe

Object storage platform How it can help? Martin Lenk, Specialist Senior Systems Engineer Unstructured Data Solution, Dell EMC

Storage Protocol Offload for Virtualized Environments Session 301-F

AN ALTERNATIVE TO ALL- FLASH ARRAYS: PREDICTIVE STORAGE CACHING

VMWARE VREALIZE OPERATIONS MANAGEMENT PACK FOR. Nutanix. User Guide

Exploring Cloud Security, Operational Visibility & Elastic Datacenters. Kiran Mohandas Consulting Engineer

VMware vsan Ready Nodes

Virtualization. Michael Tsai 2018/4/16

Tom Sas HP. Author: SNIA - Data Protection & Capacity Optimization (DPCO) Committee

All-Flash High-Performance SAN/NAS Solutions for Virtualization & OLTP

IOmark-VM. Datrium DVX Test Report: VM-HC b Test Report Date: 27, October

Modern hyperconverged infrastructure. Karel Rudišar Systems Engineer, Vmware Inc.

TALK THUNDER SOFTWARE FOR BARE METAL HIGH-PERFORMANCE SOFTWARE FOR THE MODERN DATA CENTER WITH A10 DATASHEET YOUR CHOICE OF HARDWARE

The Benefits of Solid State in Enterprise Storage Systems. David Dale, NetApp

SAN Acceleration Using Nexenta Connect View Edition with Third- Party SAN Storage

Hitachi Virtual Storage Platform Family

Store Process Analyze Collaborate Archive Cloud The HPC Storage Leader Invent Discover Compete

StarWind Storage Appliance

Network Design Considerations for Grid Computing

MySQL Performance Optimization and Troubleshooting with PMM. Peter Zaitsev, CEO, Percona

SNIA Developers Conference - Growth of the iscsi RDMA (iser) Ecosystem

VxRail: Level Up with New Capabilities and Powers GLOBAL SPONSORS

NetVault Backup Client and Server Sizing Guide 2.1

SolidFire. Petr Slačík Systems Engineer NetApp NetApp, Inc. All rights reserved.

ONTAP 9 Cluster Administration. Course outline. Authorised Vendor e-learning. Guaranteed To Run. DR Digital Learning. Module 1: ONTAP Overview

"Software-defined storage Crossing the right bridge"

DDN. DDN Updates. DataDirect Neworks Japan, Inc Nobu Hashizume. DDN Storage 2018 DDN Storage 1

Deep Learning Performance and Cost Evaluation

Verron Martina vspecialist. Copyright 2012 EMC Corporation. All rights reserved.

VirtualWisdom â ProbeNAS Brief

DataON and Intel Select Hyper-Converged Infrastructure (HCI) Maximizes IOPS Performance for Windows Server Software-Defined Storage

Hyper-Convergence De-mystified. Francis O Haire Group Technology Director

Fast and Easy Persistent Storage for Docker* Containers with Storidge and Intel

Isilon Scale Out NAS. Morten Petersen, Senior Systems Engineer, Isilon Division

JANUARY 28, 2014, SAN JOSE, CA. Microsoft Lead Partner Architect OS Vendors: What NVM Means to Them

Scale-out Storage Solution and Challenges Mahadev Gaonkar igate

Accelerate Applications Using EqualLogic Arrays with directcache

IOmark- VDI. IBM IBM FlashSystem V9000 Test Report: VDI a Test Report Date: 5, December

MICHAËL BORGERS, SYSTEM ENGINEER MATTHIAS SPELIER, SYSTEM ENGINEER. Software Defined Datacenter

BEST PRACTICES FOR OPTIMIZING YOUR LINUX VPS AND CLOUD SERVER INFRASTRUCTURE

Understanding Data Locality in VMware Virtual SAN

All-Flash High-Performance SAN/NAS Solutions for Virtualization & OLTP

Emulex LPe16000B 16Gb Fibre Channel HBA Evaluation

Comparing Performance of Solid State Devices and Mechanical Disks

REFERENCE ARCHITECTURE Microsoft SQL Server 2016 Data Warehouse Fast Track. FlashStack 70TB Solution with Cisco UCS and Pure Storage FlashArray//X

Scaling Data Center Application Infrastructure. Gary Orenstein, Gear6

Software Defined Storage. Reality or BS?

As storage networking technology

Transcription:

Forget IOPS: A Proper Way to Characterize & Test Storage Performance Peter Murray peter@swifttest.com SwiftTest

Storage Performance Validation Rely on vendor IOPS claims Test in production and pray Validate with freeware tools (Iometer, IOZone) Validate with Workload Models ssss The Evolution of Storage Performance Validation 2013 Storage Developer Conference. Insert Your Company Evolution Name. All Rights Reserved. 2

What is Storage Validation? FC iscsi SMB NFS OBJECT HTTP-S Validation Appliance Workload Emulations Storage Under Test

Storage Infrastructure Performance Your Customers Care! Product Evaluations & Vendor Bakeoffs New Feature & Technology Evaluations Performance Impact of Virtualization Optimal Storage Configuration or Protocol Impact of Infrastructure Changes Troubleshooting NFS, SMB, FC, iscsi 4

Common Enterprise IT Challenges Validate new hybrid SSD architecture & design to reduce storage costs Choose most cost effective vendor from short list Validate that proposed storage system can meet performance needs Configuration planning for private cloud migration Troubleshoot poor performance of enterprise BI application Common Requirements Emulate our application workloads with high fidelity Test on target system 5

Validating New Storage Products & Upgrades Validation Appliance Workload (File, Block, Object) Storage Pilot Candidate(s) Switch Solution A Solution B Current Production Solution (to validate OS/Firmware upgrades, configuration changes) 6

Agenda What are IOPS? Why IOPS are insufficient for storage testing What is metadata? Why metadata is critical for storage testing What are workloads? Why workloads are the essential to storage testing 7

IOPS 8

IOPS Definition IOPS [Storage System]: I/O Operations per Second IOPS/W and MBps/W are the units normally used to report these quantities IOPS and read/write loads are not specified in this definition Care must be taken when comparing different storage systems that the I/O load used is as identical as possible Comparisons of these measurements when I/O sizes and load mix are different may not be very meaningful From the 2012 SNIA Dictionary. Used with permission. All rights reserved. 9

Origin of IOPS Originally a measurement for local hard disk performance Enabled a common way to test Write/Read disk performance Iometer most common IOPS measurement tool Written by Intel, maintained by the Open Source community Designed to test drives, expanded to test NAS performance Widely used to compare overall system performance Commonly cited in vendor literature Uses host s TCP stack to communicate Designed for Write/Read IO testing Tests include only the metadata required to perform IO 10

IOPS: One Size Does Not Fit All One IOP requires one or more request-response pairs Depends on block/chunk size 1KB 1 Request, 1 response 4KB 1 Request, 3 Responses Larger block sizes require more responses Larger block sizes mean a lower IOPS rate There is no standard size 4KB often cited, but varies vendor dependent 11

IOPS Vary in Real World Application Traffic Real storage IO does not use a fixed block/chunk size Block/chunk size is protocol version and application dependent Block sizes have increased in newer protocol versions Write/Read operations may be a small fraction of storage traffic May comprise less than 10% All remaining traffic is related to metadata RDMA Write/Read operations may erode the usefulness of isolated W/R IOPS measurement Less time required for data transfer Metadata still needs to be processed

IOPS 13

Metadata Metadata operations enable IO Information about data Operations needed to access data Location Extent File name Access Attributes, locking, etc.

Metadata Usage The amount of metadata required to perform IO varies widely Determined by access method Often in excess of 50% of storage traffic; sometimes 90%+ Block/chunk vs. file Both require metadata File access requires more metadata

Reducing Metadata s Impact Metadata impacts performance Based on application type Higher usage: Web apps, Office apps Lower usage: Database, email Caching with RAM, Flash or SSDs helps, but expensive Metadata performance on HDDs is slow Caching doesn t eliminate Metadata, only speeds processing

IOPS 17

Workloads Defined IOPS, metadata and access patterns that reflect an application Each application has an unique signature: Write vs. read % Random vs. sequential access IO vs. metadata % Data compressibility Block/chunk size Metadata command frequency Use of asynchronous/compound commands Workloads are intrinsic to characterizing storage performance

The Importance of Workloads Workload testing enables engineers to: Understand how an application operates in a given environment: Server Cluster With Deduplication/Compression With varying network configurations and conditions Understand overload and failure conditions Degraded application performance vs. application failure Network issues: Active-active network failover Active-passive network failover

IOPS 20

The Workload Model Access Patterns File System I/O Parameters Load Properties

Access Patterns Access Patterns I/O Mix Write/Read % Metadata %

File System File System File System Hierarchy File system depth Files per folder Number of subfolders File size distribution

I/O Parameters I/O Parameters Block/Chunk Size Read/Write Direction

Load Properties Load Properties Concurrent Users Actions/Sec Load variability/time

Step 1 Creating a Production Workload Model PRODUCTION STATS (Workload Analyzers, Netstat, NFSstat, etc) PRE-BUILT TEST SUITES PACKET CAPTURES (PCAP, etc) ACCURATE, REALISTIC WORKLOAD MODEL

Workload Analysis Command Mix Representative Command Mix I/O Percentage Write Read Metadata Metadata commands Command percentages

Workload Analysis I/O Mix I/O Mix Chunk Size Read/Write Ratio Hot Spots

Workload Analysis Load Parameters Load Parameters Temporality Burstiness Overload conditions

Workload Re-creation Challenges Difficult for vendors and operators to vet against real-world service and application conditions at scale: Big data: database intensive Virtualized environments: VDI, general VM workloads Infinite permutations: bare-metal, virtual, Linux, Windows nodes with applications on top in multi-tenant cloud environment Server to storage traffic in the data center is a mix of OS, hypervisor and application behaviors Complex workload emulation is difficult and time consuming Competency in generating detailed, scalable emulations is highly desirable

Workload Emulation Requirements A solution to model different data center conditions Flexibility: allow customers to model a wide spectrum of conditions ranging from exact conversation to blended cloud application mix Efficiency: without tedious study, analysis and test configuration synthesis A system to analyze/fingerprint traffic in a standardized way Framework that decomposes the characteristics of traffic into quantifiable workload dimensions Generate specific or generalized workloads

Workload Simulation Doesn t Require Dozens of Servers / VMs to Drive Go Daddy In Production SwiftTest Simulation Total NFS ops ~65K ~66K Avg. Latency Read Write Metadata Ops Op-Mix Reads Writes Metadata Ops Getattr Lookup Access 1.5 ms 10 ms 0.5 ms 0.5 ms 5% 1% 94% 62% 11% 17% 1.4 ms 11.5 ms 0.6 ms 0.4 ms 5% 1% 94% 63% 13% 14% Avg. CPU Utilization 81% 80% Max. Disk Utilization 55% 54%

Customer Example: Workloads Modeled from Production Protocol File Action Distribution NFSv3 Getattr 55% Lookup 11% Access 17% Read 5% Write 10% Protocol File Action Distribution SMB2 Query path info 27% Query file info 25% Read 10% Write 5% Set file info 6% NT create 10% Close 9% Protocol File Size Distribution Protocol File Size Distribution NFSv3 8192 100000 2000000 SMB2 4096 100000 1000000 10000000

Step 2 Run Workload Model Against Target FC iscsi SMB NFS OBJECT HTTP-S Validation Appliance Workload Emulations Storage Under Test

Step 3 Analyze Results for Better Decisions Analytics Insight Block vs File Boot storm handling Limits testing Failure modes Effects of flash, dedupe, tiering, scaleout, etc.

Results Analysis Performance Performance Response times Throughput

Results Analysis Command Mix Expected Command Mix Execution status Attempts Successes Errors Aborts

Summary IOPS measurement alone cannot characterize real application storage performance Inclusion of metadata is essential Workload modeling and purposebuild load generation appliances are the way to emulate applications The more complete the emulation, the deeper the understanding