Storage validation at GoDaddy Best practices from the world s #1 web hosting provider

Similar documents
Forget IOPS: A Proper Way to Characterize & Test Storage Performance Peter Murray SwiftTest

Flash vs. Disk Storage: Testing Workloads is Key

Storage Performance Validation for Panzura

NAS for Server Virtualization Dennis Chapman Senior Technical Director NetApp

Load Dynamix Enterprise 5.2

"Software-defined storage Crossing the right bridge"

HP Simply StoreIT: Cut through the confusion of storage

Discover the all-flash storage company for the on-demand world

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

MODERNISE WITH ALL-FLASH. Intel Inside. Powerful Data Centre Outside.

Today s trends in the storage world. Jacint Juhasz Storage Infrastructure Architect

EMC STORAGE STRATEGY. Copyright 2015 EMC Corporation. All rights reserved.

Hitachi Virtual Storage Platform Family

Nimble Storage vs HPE 3PAR: A Comparison Snapshot

Hyper-Convergence De-mystified. Francis O Haire Group Technology Director

Considerations to Accurately Measure Solid State Storage Systems

Solid Access Technologies, LLC

HPE SimpliVity. The new powerhouse in hyperconvergence. Boštjan Dolinar HPE. Maribor Lancom

NEC Express5800 R320f Fault Tolerant Servers & NEC ExpressCluster Software

Take control of storage performance

WHITE PAPER Software-Defined Storage IzumoFS with Cisco UCS and Cisco UCS Director Solutions

McKesson mixes SSDs with HDDs for Optimal Performance and ROI. Bob Fine, Dir., Product Marketing

HPE Storage news. Mauro Colombo Hybrid IT sales & presales manager 23 rd May 2018

Introducing Tegile. Company Overview. Product Overview. Solutions & Use Cases. Partnering with Tegile

THE SUMMARY. CLUSTER SERIES - pg. 3. ULTRA SERIES - pg. 5. EXTREME SERIES - pg. 9

StarWind Virtual SAN Free

Enterprise power with everyday simplicity

VMware vsphere Clusters in Security Zones

Native vsphere Storage for Remote and Branch Offices

VMware Virtual SAN Technology

The Impact of Hyper- converged Infrastructure on the IT Landscape

Verron Martina vspecialist. Copyright 2012 EMC Corporation. All rights reserved.

Technical White Paper: IntelliFlash Architecture

vsan Security Zone Deployment First Published On: Last Updated On:

Optimizing the Data Center with an End to End Solutions Approach

3/26/2018. Hyperconvergence. CreekPointe, Inc. Introductions Hyperconvergance Defined Advantages Use Cases Q&A Close. Mike Clarke, CreekPointe Inc.

Balakrishnan Nair. Senior Technology Consultant Back Up & Recovery Systems South Gulf. Copyright 2011 EMC Corporation. All rights reserved.

Clouds, Convergence & Consolidation

HPE Nimble Storage Partner Sales Guide

HCI: Hyper-Converged Infrastructure

Webinar Series: Triangulate your Storage Architecture with SvSAN Caching. Luke Pruen Technical Services Director

Cisco and EMC Release Joint Validated Design for Desktop Virtualization with Citrix XenDesktop. Daniel Chiu Business Development Manager, EMC

SoftNAS Cloud Data Management Products for AWS Add Breakthrough NAS Performance, Protection, Flexibility

EMC & VMWARE STRATEGIC FORUM NEW YORK MARCH David Goulden President & COO. Copyright 2013 EMC Corporation. All rights reserved.

BIG DATA READY WITH ISILON JEUDI 19 NOVEMBRE Bertrand OUNANIAN: Advisory System Engineer

Copyright 2012 EMC Corporation. All rights reserved.

Enterprise power with everyday simplicity

TITLE. the IT Landscape

NetApp Clustered Data ONTAP 8.2 Storage QoS Date: June 2013 Author: Tony Palmer, Senior Lab Analyst

Agenda. AWS Database Services Traditional vs AWS Data services model Amazon RDS Redshift DynamoDB ElastiCache

Системы хранения IBM. Новые возможности

vsan Remote Office Deployment January 09, 2018

São Paulo. August,

Boost Performance and Extend NAS Life

Ten things hyperconvergence can do for you

Get More Out of Storage with Data Domain Deduplication Storage Systems

The Failure of SSDs. Adam Leventhal Senior Staff Engineer Sun Microsystems / Fishworks

Copyright 2012 EMC Corporation. All rights reserved.

Offloaded Data Transfers (ODX) Virtual Fibre Channel for Hyper-V. Application storage support through SMB 3.0. Storage Spaces

AN ALTERNATIVE TO ALL- FLASH ARRAYS: PREDICTIVE STORAGE CACHING

Nutanix Tech Note. Virtualizing Microsoft Applications on Web-Scale Infrastructure

powered by Cloudian and Veritas

DEDICATED SERVERS WITH EBS

Maximizing your Storage Capacity: Data reduction techniques and performance metrics

Storage for High-Performance Computing Gets Enterprise Ready

HPE Synergy HPE SimpliVity 380

LEVERAGING FLASH MEMORY in ENTERPRISE STORAGE

Microsoft Applications on Nutanix

A Cloud WHERE PHYSICAL ARE TOGETHER AT LAST

Flashed-Optimized VPSA. Always Aligned with your Changing World

Markus Kujala, Systems Engineering Manager

IBM Storwize V7000 TCO White Paper:

SoftNAS Cloud Performance Evaluation on Microsoft Azure

FlexPod. The Journey to the Cloud. Technical Presentation. Presented Jointly by NetApp and Cisco

The World s Fastest Backup Systems

INTEL NEXT GENERATION TECHNOLOGY - POWERING NEW PERFORMANCE LEVELS

ADVANCED DATA REDUCTION CONCEPTS

Real world observations from a Midsize Enterprise company transitioning to a Virtualized Environment

Overview and Current Topics in Solid State Storage

Modernize Your Backup and DR Using Actifio in AWS

Cisco SAN Analytics and SAN Telemetry Streaming

Modernize Your IT with Dell EMC Storage and Data Protection Solutions

Transforming Storage. An EMC Overview. Sasho Tasevski Sr. Technology consultant

5 Fundamental Strategies for Building a Data-centered Data Center

Leveraging the power of Flash to Enable IT as a Service

Next Generation Data Center : Future Trends and Technologies

NEXT GENERATION UNIFIED STORAGE

IBM FlashSystem. IBM FLiP Tool Wie viel schneller kann Ihr IBM i Power Server mit IBM FlashSystem 900 / V9000 Storage sein?

VioStor NVR + Turbo NAS. Surveillance Storage Expansion NVR NAS

1 Quantum Corporation 1

NEXT GENERATION UNIFIED STORAGE

Take Back Lost Revenue by Activating Virtuozzo Storage Today

When, Where & Why to Use NoSQL?

SurFS Product Description

Reduce costs and enhance user access with Lenovo Client Virtualization solutions

Preserving the World s Most Important Data. Yours. SYSTEMS AT-A-GLANCE: KEY FEATURES AND BENEFITS

Software Defined Storage

RUNNING PETABYTE-SIZED CLUSTERS

It s Time for Mass Scale VDI Adoption

Condusiv s V-locity Server Boosts Performance of SQL Server 2012 by 55%

Transcription:

Storage validation at GoDaddy Best practices from the world s #1 web hosting provider Julia Palmer Storage and Backup Manager Justin Richardson Senior Storage Engineer

Agenda and Benefits Agenda Faster, better, cheaper storage challenges Evolution of our validation process Insights and benefits Whay will you learn? How do we test new storage technology and achieve: Storage cost reduction Confidence in user experience (performance and 99.999 storage availability) Freedom to innovate!

GoDaddy key facts GoDaddy enables individuals and businesses to establish, maintain and evolve an online presence Nearly $1.3 billion (B) in sales 12 million customers worldwide, World s largest Web Hosting Provider, with over 5.6 million hosting accounts 99.9% uptime guarantee for Web hosting Key storage infrastructure highlights 31+ Pbytes, 30% annual growth 95% unstructured (NAS), 5% structured (SAN) Distributed over 5 data centers on 3 continents 5 storage administrators (more than 6PB per admin) 3 storage engineers

Storage Technology infrastructure challenges 99.999% uptime for storage services. Whatever we design must work in production. Minimal downtime, planned or unplanned. Cost reduction Provide Storage customers with better performance while reducing our costs. Innovate! Find FASTER and BETTER and CHEAPER technologies. Leverage new features, protocols, hardware, tools. Go fast! Speed up process from idea to production release Ensure efficient engineering processes for POC and Pilots Automate validation and QA

Our topic today Faster, Better and Cheaper storage services? How? Storage Engineer

Storage technology innovations! Cool storage technology? NFS4.1, SMB 3.0 Unified Storage SSDs Deduplication Virtualization Caching/Tiering Compression Object Storage Storage Engineer Commodity Hardware

Storage technology innovations Would it work in production? How can I find out? Storage Engineer

Storage technology innovations requirement #1 Storage validation process with a validation solution!!! Storage Engineer

Stage 0: Storage validation process evolution Stage 0 If it ain t broke don t fix it! Engineering process is risk averse. Difficult to investigate new solutions primarily reactive to problems. High costs! Over-spec solution in order to accommodate the unknown High priced, premium branded product Cannot keep up with changes in demand Limited insight hindering ability to innovate

Stage 1: Storage validation process evolution Stage 1 Test in Production and Pray Introduction of new storage solutions to production in controlled environment with slow customer ramp up Design based on vendor specs Load-related failures may take 12 months to happen Highly visible failures (potentially thousands of customers disrupted) 12 months wasted CIO directive: No more testing on live customers!!!

Stage 2: Storage validation process evolution Stage 2 Validation process with freeware tools Offline validation of candidate solutions with mix of tools: IOMeter, IOZone, Dbench, Fstress, etc Could not scale up to our high density production loads so could not address the large scale failure issue Lack of realism: inability to emulate meta-data and other file system calls. Required lots of high end servers Cumbersome to use

12

Stage 3: Storage validation process evolution Stage 3 Validation process with SwiftTest Appliance Rapid evaluation of storage configurations Find out in days not months! Failure is not an option it is a requirement. Precise and measurable findings! Test solution Realistic emulation of our production workloads (~90%+ fidelity, metadata calls, etc.) Validates against full scale operation loads Single solution for designing, running and analyzing tests Flexible, lightweight lab - sandbox for designing, running and analyzing different HW, SW and firmware options

GoDaddy storage validation process 1 to 2 weeks Pass/Fail 1 2 3 4 5 Investigate Procure Pilot and Install Apply Workloads with SwiftTest Analyze and Report Release Pilot in Production Technology Costs (BOM) Vendors POC with Vendor/VAR Full scale unit Modify Scale back density? Increase performance? Real customers!

What do we use it for? Typical use cases. New feature evaluation (e.g., dedup, compression) Protocol evaluation (e.g., NFS v4, SMB3, iscsi, etc.) Vendor product bakeoffs New architecture / technology evaluation (e.g., SSD caching, tiering) Density testing (how many customers could we deploy on one box?) Routine change management OS upgrades Firmware upgrades Etc

Storage Engineering team is Knowledgeable with storage technologies Eager to investigate and test the leading edge Comfortable with storage protocols and file systems Always challenging hardware and software vendors to stand by their specs Focused on complete picture cost, performance, capacity, availability

GoDaddy load generation lab Test Application (File, Block, Object storage emulation) Storage Pilot Candidate(s) MAN SwiftTest Appliance 10Gb switch Used for validating OS/Firmware, configuration updates Solution A Solution B Current Production Solution

How we analyzed our production workload We used vendor provided performance analyzer, netstat, and nfsstat utilities to obtain the workload characteristics. For example: Bandwidth utilized NFS Latency READ % WRITE % Metadata % GETATTR % LOOKUP % ACCESS %

Our production workloads vs. our SwiftTest simulation: virtually identical! Workload GoDaddy Production Total NFS ops ~65K Avg. Latency Read Write Other (Metadata Ops) 1.5 ms 10 ms 0.5 ms 0.5 ms SwiftTest Simulation ~66K 1.4 ms 11.5 ms 0.6 ms 0.4 ms Op-Mix Reads Writes Other Ops Getattr Lookup Access 5% 1% 94% 62% 11% 17% 5% 1% 94% 63% 13% 14% Avg. CPU Utilization 81% Max. Disk Utilization 55% 80% 54%

Actual production workload vs. SwiftTest emulation 80000 70000 60000 50000 40000 30000 20000 10000 0 Production IOPs 1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 Total IOPs 80000 70000 60000 50000 40000 30000 20000 10000 0 SwiftTest generated IOPs 1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 Total IOPs 1400 1200 1000 800 600 400 200 0 Production READs/WRITEs 1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 READs WRITEs SwiftTest generated READs/WRITES 1400 1200 1000 800 600 400 200 0 1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 READs WRITEs

Types of validations conducted SSDs Caching / Tiering New disk / OS configs Compression Commodity Hardware Deduplication

Validating new hardware configuration Configuration Cost Latency @ 35k IOPS Latency @ 70k IOPS Current Production 15k RPM Drives Baseline New config. A 7.2k RPM Drives 45% decrease New config. B Tiered HDD with SSD 15% decrease 9ms 20ms 15ms 10ms 30ms 18ms

Validating caching, tiering and SSDs technologies Pass / Fail Findings Solution A PASS Cache offloaded ~45% of READS from disks Able to test cache warm-up time Solution B FAIL Software issue: software not capable of pushing the high IO load on full SSD arrays Solution C FAIL Latency issues when SSDs were trying to flush writes to slow disk and deal with random reads Solution D PASS Full SSD array was able to push SSDs to required throughput.

Validating commodity hardware + open source storage software solutions Vendor-A (production reference) Faster Cheaper Read Latency 10ms baseline Open Compute solution 1 8ms 23% reduction Better + Features + Support + Mature/stable - Support - Features - Stability unknown Open Compute solution 2 3ms 56% reduction - Support - Features - Stability unknown

Perfect storage is as rare as a perfect body. I would know. 25

Benefits: Go Daddy cost reduction - ROI roadmap Cost per GB Vendor A Baseline Vendor C 56% drop Vendor B 23% drop OS & Disk Change 56% drop Every transition validated with our process and SwiftTest Compression 35% drop Dedupe/Compression 62% drop Whitebox 67% drop Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4

Summary: Benefits of storage validation Time to Validate Weeks to test in lab, not 12 months in production Price/performance analysis Pick most cost effective solution for work load Performance Testing Confidence in user experience Freedom to Innovate Insight we need Availability Confidence in 5-9 s under Load

Contact us: Julia@godaddy.com jbrichardson@godaddy.com