LEVERAGING FLASH MEMORY in ENTERPRISE STORAGE

Similar documents
The Benefits of Solid State in Enterprise Storage Systems. David Dale, NetApp

Overview and Current Topics in Solid State Storage

Overview and Current Topics in Solid State Storage

Storage Performance Management Overview. Brett Allison, IntelliMagic, Inc.

WHAT HAPPENS WHEN THE FLASH INDUSTRY GOES TO TLC? Luanne M. Dauber, Pure Storage

Notes & Lessons Learned from a Field Engineer. Robert M. Smith, Microsoft

SNIA Tutorial 1 A CASE FOR FLASH STORAGE HOW TO CHOOSE FLASH STORAGE FOR YOUR APPLICATIONS

Facing an SSS Decision? SNIA Efforts to Evaluate SSS Performance. Ray Lucchesi Silverton Consulting, Inc.

Tiered File System without Tiers. Laura Shepard, Isilon

Trends in Data Protection and Restoration Technologies. Mike Fishman, EMC 2 Corporation

How To Get The Most Out Of Flash Deployments

Scaling Data Center Application Infrastructure. Gary Orenstein, Gear6

TAGITM Winter Summit 2017 Evaluating All Flash Storage for the Enterprise

Adrian Proctor Vice President, Marketing Viking Technology

Virtualization Practices:

Mark Rogov, Dell EMC Chris Conniff, Dell EMC. Feb 14, 2018

ADVANCED DEDUPLICATION CONCEPTS. Thomas Rivera, BlueArc Gene Nagle, Exar

Planning For Persistent Memory In The Data Center. Sarah Jelinek/Intel Corporation

ADVANCED DATA REDUCTION CONCEPTS

Virtualization Practices: Providing a Complete Virtual Solution in a Box

Apples to Apples, Pears to Pears in SSS performance Benchmarking. Esther Spanjer, SMART Modular

HCI: Hyper-Converged Infrastructure

Deploying Public, Private, and Hybrid. Storage Cloud Environments

Felix Xavier CloudByte Inc.

Tom Sas HP. Author: SNIA - Data Protection & Capacity Optimization (DPCO) Committee

Felix Xavier CloudByte Inc.

Copyright 2012 EMC Corporation. All rights reserved.

ASPECTS OF DEDUPLICATION. Dominic Kay, Oracle Mark Maybee, Oracle

The File Systems Evolution. Christian Bandulet, Sun Microsystems

Analysis and Optimization. Carl Waldspurger Irfan Ahmad CloudPhysics, Inc.

PRESENTATION TITLE GOES HERE. Understanding Architectural Trade-offs in Object Storage Technologies

Trends in Worldwide Media and Entertainment Storage

The Role of WAN Optimization in Cloud Infrastructures. Josh Tseng, Riverbed

Storage Virtualization II Effective Use of Virtualization - focusing on block virtualization -

Architectural Principles for Networked Solid State Storage Access

Optimizing Tier-1 Enterprise Storage for Solid State Memory

A Promise Kept: Understanding the Monetary and Technical Benefits of STaaS Implementation. Mark Kaufman, Iron Mountain

Everything You Wanted To Know About Storage (But Were Too Proud To Ask) The Basics

SAS: Today s Fast and Flexible Storage Fabric. Rick Kutcipal President, SCSI Trade Association Product Planning and Architecture, Broadcom Limited

Application Recovery. Andreas Schwegmann / HP

Introducing Tegile. Company Overview. Product Overview. Solutions & Use Cases. Partnering with Tegile

AN ALTERNATIVE TO ALL- FLASH ARRAYS: PREDICTIVE STORAGE CACHING

Everything You Wanted To Know About Storage: Part Teal The Buffering Pod

Multi-Cloud Storage: Addressing the Need for Portability and Interoperability

Use Cases for iscsi and FCoE: Where Each Makes Sense

Storage Virtualization II. - focusing on block virtualization -

Deploy a High-Performance Database Solution: Cisco UCS B420 M4 Blade Server with Fusion iomemory PX600 Using Oracle Database 12c

Trends in Data Protection and Restoration Technologies. Jason Iehl, NetApp

Effective Storage Tiering for Databases

VMware Virtual SAN Technology

Nimble Storage vs HPE 3PAR: A Comparison Snapshot

How to create a synthetic workload test. Eden Kim, CEO Calypso Systems, Inc.

Data Deduplication Methods for Achieving Data Efficiency

HP VMA-series Memory Arrays

The next step in Software-Defined Storage with Virtual SAN

Flash In the Data Center

The Data-Protection Playbook for All-flash Storage KEY CONSIDERATIONS FOR FLASH-OPTIMIZED DATA PROTECTION

Understanding Your Virtual Workload. Irfan Ahmad CTO CloudPhysics

Storage in combined service/product data infrastructures. Craig Dunwoody CTO, GraphStream Incorporated

Enterprise Class Mass Storage: Innovative Approaches to Disk-based Active Archive Storage

Restoration Technologies

Interoperable Cloud Storage with the CDMI Standard. Mark Carlson, SNIA TC and Oracle Chair, SNIA Cloud Storage TWG

Cold Storage: The Road to Enterprise Ilya Kuznetsov YADRO

Testing. Michael Ault, IBM Oracle FlashSystem Consulting Manager

Method to Establish a High Availability and High Performance Storage Array in a Green Environment

Jeff Dodson / Avago Technologies

Discover the all-flash storage company for the on-demand world

Copyright 2012 EMC Corporation. All rights reserved.

Extending RDMA for Persistent Memory over Fabrics. Live Webcast October 25, 2018

Create a Smarter and More Economic Cloud Storage Architecture. November 7, 2018

SAS: Today s Fast and Flexible Storage Fabric

EVERYTHING YOU WANTED TO KNOW ABOUT STORAGE, BUT WERE TOO PROUD TO ASK Part Cyan Storage Management. September 28, :00 am PT

iscsi : A loss-less Ethernet fabric with DCB Jason Blosil, NetApp Gary Gumanow, Dell

Ron Emerick, Oracle Corporation

IBM FlashSystem. IBM FLiP Tool Wie viel schneller kann Ihr IBM i Power Server mit IBM FlashSystem 900 / V9000 Storage sein?

IOmark- VDI. IBM IBM FlashSystem V9000 Test Report: VDI a Test Report Date: 5, December

High Availability Using Fault Tolerance in the SAN. Wendy Betts, IBM Mark Fleming, IBM

DDN. DDN Updates. Data DirectNeworks Japan, Inc Shuichi Ihara. DDN Storage 2017 DDN Storage

UCS Invicta: A New Generation of Storage Performance. Mazen Abou Najm DC Consulting Systems Engineer

IBM System Storage Reference Architecture featuring IBM FlashSystem for SAP landscapes, incl. SAP HANA

Nimble Storage Adaptive Flash

Webinar Series: Triangulate your Storage Architecture with SvSAN Caching. Luke Pruen Technical Services Director

SRM: Can You Get What You Want? John Webster, Evaluator Group.

DAHA AKILLI BĐR DÜNYA ĐÇĐN BĐLGĐ ALTYAPILARIMIZI DEĞĐŞTĐRECEĞĐZ

Automated Storage Tiering on Infortrend s ESVA Storage Systems

SAP Applications on IBM XIV System Storage

PRESENTATION TITLE GOES HERE

Nutanix Tech Note. Virtualizing Microsoft Applications on Web-Scale Infrastructure

IBM System Storage DS8800 Overview

DDN. DDN Updates. DataDirect Neworks Japan, Inc Nobu Hashizume. DDN Storage 2018 DDN Storage 1

Design Considerations for Using Flash Memory for Caching

Optimizing the Data Center with an End to End Solutions Approach

Lenovo SAN Manager. Rapid Tier and Read Cache. David Vestal, WW Product Marketing. June Lenovo.com/systems

Interoperable Cloud Storage with the CDMI Standard. Mark Carlson, SNIA TC and Oracle Co-Chair, SNIA Cloud Storage TWG

New Approach to Unstructured Data

COS 318: Operating Systems. Storage Devices. Jaswinder Pal Singh Computer Science Department Princeton University

Non-Volatile Memory Cache Enhancements: Turbo-Charging Client Platform Performance

StorMagic SvSAN 6.1. Product Announcement Webinar and Live Demonstration. Mark Christie Senior Systems Engineer

IOmark- VM. IBM IBM FlashSystem V9000 Test Report: VM a Test Report Date: 5, December

COS 318: Operating Systems. Storage Devices. Vivek Pai Computer Science Department Princeton University

Transcription:

LEVERAGING FLASH MEMORY in ENTERPRISE STORAGE Luanne Dauber, Pure Storage Author: Matt Kixmoeller, Pure Storage

SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA unless otherwise noted. Member companies and individual members may use this material in presentations and literature under the following conditions: Any slide or slides used must be reproduced in their entirety without modification The SNIA must be acknowledged as the source of any material used in the body of any document containing material from these presentations. This presentation is a project of the SNIA Education Committee. Neither the author nor the presenter is an attorney and nothing in this presentation is intended to be, or should be construed as legal advice or an opinion of counsel. If you need legal advice or a legal opinion please contact your attorney. The information presented herein represents the author's personal opinion and current understanding of the relevant issues involved. The author, the presenter, and the SNIA do not assume any responsibility or liability for damages arising out of any reliance on or use of this information. NO WARRANTIES, EXPRESS OR IMPLIED. USE AT YOUR OWN RISK. 2

Abstract This session is for Storage Administrators and Application Architects seeking to understand how to best take advantage of flash memory in enterprise storage environments. The relative advantages of flash tiering, caching and all-flash approaches will be considered, across the dimensions of performance, cost, reliability and predictability. 3

Agenda The Storage I/O Crisis How to Evaluate Flash Solutions Approaches for Leveraging Flash PCI cards Flash caching in arrays Flash tiering in arrays All-flash arrays Analyzing Flash ROI 4

Moore s Law vs. Newton s Law Moore s Law: 58% CAGR HDD Areal Density: 40-100% CAGR 10 HDD Latency (Seek Time) -3% CAGR ms 5 Sources: Intel, IDEMA, IBM Research 0 2006 2008 2011 2013 5

Rotating Disk Challenges Deconstructing a Random Read Typical Resulting Duty Pattern: 1 Spin disk ~2ms 3 Read Data ~0.04ms Data Data Data Data S E E K S E E K S E E K 2 Actuate arm ~4ms Time Disks spend >95% of their time seeking and rotating, not delivering data!

Randomization: The I/O Blender Traditional Architecture Virtualized / Consolidated Architecture Single-workload Serialized Cached Optimized Multi-workload No cross-vm optimization Highly randomized 7

Summary: The Storage I/O Crisis Performance Demands Randomization: The I/O Blender IOPS / TB + + Multi-core CPUs Data growth Instant user expectations Virtualization Data consolidation Cloud architectures 95% time seeking/rotating IOPS/TB dropping 8

Understanding Your Application s I/O Fingerprint 1Ks-10Ks IOPS I/O Load 10Ks-100Ks IOPS Small Block <8K Block Size Large Block 10sK - Ms Sequential, Predictable, Cacheable Locality of Access Random, Unpredictable Write- Centric Access Pattern Read- Centric 10s of ms, Variability OK Latency Sensitivity <1-3 ms, Consistency Key 9

How to Evaluate Flash Alternatives Cost: $/GB? $/GB: Managed + Operations team $/GB: Operational + Datacenter space + Power / Cooling $/GB: Protected + Snapshot / replication software + Snapshot / replication copies + Backup software and media Performance Read IOPS Write IOPS Latency Bandwidth $/IOP Protection How to make HA? How to backup? How to manage drive loss? Power & Size Rackspace / floor space Power consumption Cooling $/GB: Usable + Cost of RAID parity + Over-provisioning waste + Management software $/GB: Raw + Cost of raw disk / flash Operational Simplicity Makes operations more or less complex? Increases TBs/admin managed? Integration Fits in current architecture? Requires process change? 10

Flash Architecture Approaches Server-Attach PCI Flash Array with Flash Cache Array with Flash Tier All-Flash LUN or Array PCI card in application server Host CPU typically used for flash management 100s of GB Flash as a read and/or write cache in array, or to cache FS metadata All data persisted to spinning disk Typically <1% - 5% of total capacity Sub-LUN/FS tiering, where a LUN or FS is spread across flash and disk Hot blocks/files moved to flash, cold left on disk Typically 1% - 10% of total capacity 100% flash LUN or array May or may not use DRAM for read/write caching 11

Host-Based PCI Flash Overview: Host-based PCI flash cards, typically 100s of GB in size Either non-raided, or two cards mirrored in the server Either non-ha, or application clustered across multiple servers Leverages host CPU for flash management Server-Attach PCI Flash High-level Benefits: Highest-performance flash architecture possible Eliminates the cost/burden of shared storage Ultra-small footprint Potentially allows for the reduction of host DRAM High-level Challenges: Very high-cost, creates islands of expensive flash in every server Requires a re-architecture for most enterprises Expensive and difficult to protect (HA, backup) Requires mirroring for HA across multiple servers Requires app-level or external replication for DR 12

Array with Flash Cache Overview: Expansion of array s DRAM cache with flash Implemented either via controller-connected PCI flash cards, or SSDs in drive bays Typically read-cache only, although some implementations of write cache as well Cache page size / caching scheme varies by array (typically 4-16K) Array with Flash Cache High-level Benefits: Expands array s cache buffer from <<1% of total storage to 1-5% 10 latency improvement on IOs which hit the flash cache (10+ms <1ms) Off-loading these IOs from disk reduces the load on disk and also makes the disk perform better Result: depends completely on cacheability of I/O stream, but typically 30-80% performance improvement High-level Challenges: Improvement depends completely on cacheability of the I/O stream, results vary Flash as a cache heavily exercised: requires SLC flash Very expensive: $50-150/GB list prices typical from major vendors Cache requires time to warm to see the performance benefit, often doesn t persist across re-boots or HA events Minimal benefit for truly random I/O streams: random I/O can t be cached, it is random 13

Array with Flash Tier Overview: Creation of multiple tiers of storage (flash, enterprise FC/SAS HDD, SATA HDD) with the ability to spread a LUN / FS across them Typically implemented via 3.5 or 2.5 SSDs in the drive bays/shelves Generally 5-10% Flash, 90-95% disk in typical implementations Back-end storage is virtualized, and blocks/files are migrated up to flash or down to disk, based upon access patterns Chunk size for migration varies by vendor: typically MBs to GBs large Frequency varies by vendor: typically daily during low I/O windows Array with Flash Tier High-level Benefits: Allows one to leverage flash for IOPs, HDDs for capacity Less cost than an all-flash solution, blends the economics of flash and HDD Suitable for MLC and SLC implementations of flash Potentially allows for significantly less disk / footprint / power if disk has been over-built for performance High-level Challenges: Success depends on the predictability/randomness of the I/O stream Only works well if the hot blocks are consistent Current implementation block sizes are large; a single hot block could pin a whole MB or GB chunk in flash Not a real-time adaptable solution, chunks only promoted/demoted once daily Difficult to setup and manage is it worth the complexity of another tier to manage? 14

All-Flash LUN or Array Overview: Creation of a LUN on 100% flash Typically implemented via 3.5 or 2.5 SSDs in the drive bays/shelves Suitable for MLC or SLC flash implementations High-level Benefits: 10x+ potential performance improvement vs. 1-2x for cached/tiered solutions Much more consistent latency: no cache-miss penalty, all IOs at the speed of flash (typically <1ms) Dramatically smaller size, allows for reduction of storage footprint by 4-5x Flash fast enough to eliminate the need for significant DRAM caching in the array RAID re-build times greatly improved due to the speed of flash (hours 10s of minutes) All-Flash LUN or Array High-level Challenges: Cost: flash varies from $20/GB - $150/GB depending on the vendor and type (MLC vs. SLC) Connectivity: most flash arrays have limited connectivity options compared to their more mature disk array alternatives HA & DR: varies by vendor, but the HA and DR models of these arrays are less mature than existing disk array alternatives 15

Understanding Flash ROI Cost: $/GB? Flash Impacts $/GB: Managed + Operations team $60 / GB 1-2x Reduction in management cost (performance troubleshooting) $/GB: Operational + Datacenter space + Power / Cooling $45 / GB 1-2x 2-5x reduction in space 2-5x reduction in power $/GB: Protected + Snapshot / replication software + Snapshot / replication copies + Backup software and media $30 / GB 2-3x 10x+ $/GB: Usable + Cost of RAID parity + Over-provisioning waste + Management software $10 / GB 2x 2-3x reduction in disk over-provisioning for performance $/GB: Raw + Cost of raw disk / flash $5 / GB 5-10x the raw cost Source: averaged / anonymized customer examples 16

Summary Datacenters are facing a storage I/O crisis + + Increasing Demand Randomization Shrinking IOPS/TB Understanding your application's I/O fingerprint is key to choosing the best flash strategy for your environment Flash should be evaluated on several dimensions, it s not all about performance Performance Cost Power & Size Integration Protection Operations Simplicity Look at the full picture (beyond $/GB raw!) to build your flash ROI 17

Q&A / Feedback Please send any questions or comments on this presentation to SNIA: tracksolidstate@snia.org 18