Benefits of Storage Capacity Optimization Methods (COMs) And. Performance Optimization Methods (POMs)

Similar documents
Energy Efficiency Metrics for Storage Herb Tanzer, Chuck Paridon Hewlett Packard Company

Product/Family, Best Foot Forward (BFF), Test Points and Qualification Ranges

Product/Family, Best Foot Forward (BFF), Test Points and Qualification Ranges

SNIA Emerald SNIA Emerald Power Efficiency Measurement Specification. SNIA Emerald Program

Live System Under Test (SUT): Power Meter Connection &

Intro to Capacity Optimization Methods (COMs)

COM Verification. PRESENTATION TITLE GOES HERE Alan G. Yoder, Ph.D. SNIA Technical Council Huawei Technologies, LLC

for the SNIA Emerald TM Power Efficiency Measurement Specification Version 2.0 Revision 2

What has Changed? SNIA Emerald TM Power Efficiency Measurement Specification V2.0.2 to V Version 1.0 Revision 2

IOmark- VM. HP MSA P2000 Test Report: VM a Test Report Date: 4, March

User Guide for the SNIA Emerald Power Efficiency Measurement Specification V3.0.2

SNIA Green Storage Initiative Green Technical Working Group PRESENTATION TITLE GOES HERE. SNIA Emerald TM

VDBENCH Overview. Steven Johnson. SNIA Emerald TM Training. SNIA Emerald Power Efficiency Measurement Specification, for use in EPA ENERGY STAR

GREEN STORAGE PRODUCTS: Efficiency with ENERGY STAR & Beyond Patrick Stanko Oracle Tape Development, Oracle Chair of the SNIA Green TWG

Storage Systems Can Now Get ENERGY STAR Labels and Why You Should Care

SNIA VDBENCH Rules of Thumb

LEVERAGING FLASH MEMORY in ENTERPRISE STORAGE

vsan All Flash Features First Published On: Last Updated On:

The term "physical drive" refers to a single hard disk module. Figure 1. Physical Drive

Cold Storage: The Road to Enterprise Ilya Kuznetsov YADRO

GREEN STORAGE II: Metrics and Measurement. Patrick B. Chu, Ph.D., Seagate Technology Erik Riedel, Ph.D., Seagate Technology

ENERGY STAR Program Requirements Product Specification for Data Center Storage. Eligibility Criteria Version 1.0

NVMFS: A New File System Designed Specifically to Take Advantage of Nonvolatile Memory

PRESENTATION TITLE GOES HERE

IOmark- VDI. IBM IBM FlashSystem V9000 Test Report: VDI a Test Report Date: 5, December

IOmark-VM. VMware VSAN Intel Servers + VMware VSAN Storage SW Test Report: VM-HC a Test Report Date: 16, August

Increasing Performance of Existing Oracle RAC up to 10X

Getting the Most Out of SSDs IT System Optimization Best Practices

Power Management in Storage Systems

Nové možnosti storage virtualizace s řešením IBM Storwize V7000

I/O Characterization of Commercial Workloads

IOmark- VM. IBM IBM FlashSystem V9000 Test Report: VM a Test Report Date: 5, December

IOmark-VM. Datrium DVX Test Report: VM-HC b Test Report Date: 27, October

Automated Storage Tiering on Infortrend s ESVA Storage Systems

Method to Establish a High Availability and High Performance Storage Array in a Green Environment

GREEN STORAGE PRODUCTS: Efficiency with ENERGY STAR & Beyond

IOmark- VM. HP HP ConvergedSystem 242- HC StoreVirtual Test Report: VM- HC b Test Report Date: 27, April

Introducing Tegile. Company Overview. Product Overview. Solutions & Use Cases. Partnering with Tegile

New HPE 3PAR StoreServ 8000 and series Optimized for Flash

All-Flash High-Performance SAN/NAS Solutions for Virtualization & OLTP

Configuring Short RPO with Actifio StreamSnap and Dedup-Async Replication

NetVault Backup Client and Server Sizing Guide 2.1

SAN Storage Array Workbook September 11, 2012

SNIA Emerald Power Efficiency Measurement Specification

White Paper. EonStor GS Family Best Practices Guide. Version: 1.1 Updated: Apr., 2018

NAS for Server Virtualization Dennis Chapman Senior Technical Director NetApp

Getting it Right: Testing Storage Arrays The Way They ll be Used

SNIA Green Storage Initiative SNIA Emerald Program Overview PRESENTATION TITLE GOES HERE

The HP 3PAR Get Virtual Guarantee Program

HPE Storage Update The All Flash Datacenter 3PAR

Considerations to Accurately Measure Solid State Storage Systems

Scale-out Data Deduplication Architecture

What is QES 2.1? Agenda. Supported Model. Live demo

Introduction to NetApp E-Series E2700 with SANtricity 11.10

Planning for Easy Tier with IBM System Storage Storwize V7000 and SAN Volume Controller

SNIA Emerald Power Efficiency Measurement Specification

Green Storage Technologies, CAPEX and OPEX. Alan G. Yoder, Ph.D. NetApp

NetVault Backup Client and Server Sizing Guide 3.0

The storage challenges of virtualized environments

Scale-out Object Store for PB/hr Backups and Long Term Archive April 24, 2014

ADVANCED DEDUPLICATION CONCEPTS. Thomas Rivera, BlueArc Gene Nagle, Exar

vsan 6.6 Performance Improvements First Published On: Last Updated On:

IBM N Series. Store the maximum amount of data for the lowest possible cost. Matthias Rettl Systems Engineer NetApp Austria GmbH IBM Corporation

A DEDUPLICATION-INSPIRED FAST DELTA COMPRESSION APPROACH W EN XIA, HONG JIANG, DA N FENG, LEI T I A N, M I N FU, YUKUN Z HOU

Testing All-Flash-Arrays with WorkloadWisdom

Performance Analysis in the Real World of Online Services

Microsoft Exchange Server 2010 workload optimization on the new IBM PureFlex System

ADVANCED DATA REDUCTION CONCEPTS

Assessing and Comparing HP Parallel SCSI and HP Small Form Factor Enterprise Hard Disk Drives in Server Environments

Flash In the Data Center

Системы хранения IBM. Новые возможности

- SLED: single large expensive disk - RAID: redundant array of (independent, inexpensive) disks

SolidFire and Pure Storage Architectural Comparison

A More Sustainable Approach to Enterprise Data Storage: Reducing Power Consumption with InfiniBox. Executive Brief

All-Flash High-Performance SAN/NAS Solutions for Virtualization & OLTP

vsan Space Efficiency Technologies First Published On: Last Updated On:

Demartek Evaluation Accelerated Business Results with Seagate Enterprise Performance HDDs

Easily Recreate the Qtier SSD Tier

White Paper Features and Benefits of Fujitsu All-Flash Arrays for Virtualization and Consolidation ETERNUS AF S2 series

Purity: building fast, highly-available enterprise flash storage from commodity components

SAP Applications on IBM XIV System Storage

IBM Storwize V5000 disk system

Identifying Performance Bottlenecks with Real- World Applications and Flash-Based Storage

UCS Invicta: A New Generation of Storage Performance. Mazen Abou Najm DC Consulting Systems Engineer

SSD Architecture Considerations for a Spectrum of Enterprise Applications. Alan Fitzgerald, VP and CTO SMART Modular Technologies

HGST: Market Creator to Market Leader

Disruptive Forces Affecting the Future

Technologies for Green Storage. Alan G. Yoder, Ph.D. NetApp

IBM EXAM QUESTIONS & ANSWERS

DAHA AKILLI BĐR DÜNYA ĐÇĐN BĐLGĐ ALTYAPILARIMIZI DEĞĐŞTĐRECEĞĐZ

Backup and archiving need not to create headaches new pain relievers are around

IBM FlashSystem 9100 NVMe-Accelerated, Multi-Cloud Enabled

Technical White Paper ETERNUS AF/DX Optimization Features

SoftNAS Cloud Performance Evaluation on AWS

SNIA Emerald Power Efficiency Measurement Specification

AN ALTERNATIVE TO ALL- FLASH ARRAYS: PREDICTIVE STORAGE CACHING

Lenovo SAN Manager. Rapid Tier and Read Cache. David Vestal, WW Product Marketing. June Lenovo.com/systems

ChunkStash: Speeding Up Storage Deduplication using Flash Memory

IT Certification Exams Provider! Weofferfreeupdateserviceforoneyear! h ps://

OpenVMS Storage Updates

Transcription:

Benefits of Storage Capacity Optimization Methods (COMs) And Performance Optimization Methods (POMs) Herb Tanzer & Chuck Paridon Storage Product & Storage Performance Architects Hewlett Packard Enterprise Source: Software Defined Storage Energy Efficiency Features 1

Topics Green Storage Metrics Capacity Optimization Methods (COMs) Features Overview Test Characterization Test vehicle description Space and power savings Performance impact Benefits: CAPEX, OPEX Performance Optimization Methods (POMs): brief 2

Green Storage Metrics Metrics are based on SNIA Emerald and EPA ENERGY STAR definitions of storage power efficiency IDLE: Capacity (Idle GB/Watt) ACTIVE: Transactional Performance (IOPs/Watt) and Streaming Performance or Bandwidth (MBPS/Watt) Note: Active workloads use the Oracle VdBench IO generator Data is collected in 1-minute intervals for the measurement period (2 hours idle & 30 minutes active) and averaged These metrics are used for Acquisition decisions Manufacturer runs Emerald benchmark workloads and measures metric values in the lab Customer compares per taxonomy type and by peak metric value TGG s DCsP uses the same metrics, but measure operational datacenter workloads Operational effectiveness (productivity) is measured as Capacity or Performance per kwh 3

Capacity Optimization Methods (COMs) COMs are software techniques that allow more data to be stored on less capacity (i.e., use fewer disks and less power) COM Description Potential Estimated Space Savings RAID 5/6, Erasure Encoding Thin Provisioning Deduplication Compression Delta Snapshots Data integrity w/o using full copies Virtualize allocation of Storage Eliminate duplicate files or data chunks Reduction of data sets within files or blocks Reconstruct different versions of a file w/o multiple full copies When replacing Mirroring (RAID 1): ~40% Take from 30% utilization (legacy) to ~ 80% Can be 25-40% primary, up to 50% secondary 2:1 compression is done routinely Large savings possible when change delta is small 4

The status of COM testing Currently, the SNIA Emerald Specification defines simple pass/fail tests that verify the existence and activation of particular COMs Currently, Storage ENERGY STAR requires that for the eligible categories: Parity RAID is enabled in the System Under Test (SUT) 1 out of 4 specific COMS is available as a selectable feature and needs to be verified for existence, but they are to be disabled during idle & active metric testing There has been some estimation of active COM benefits, but more test characterization is needed to establish valid methods to accurately account for their capacity-saving efficiency in idle & active testing Some early characterization (tested and estimated) of COM benefits is presented here 5

Test Vehicle: COM High-mid storage array Dual controller node pairs Integrated disk devices Expansion disk enclosures We used 160 SFF SAS drives 450GB x 15k 6

COM Testing Procedure 1. Create Baseline Configuration Raid 1 - basis for comparing Parity Raid, Compression (2:1 ratio) Raid 5 - basis for comparing Data dedup and Thin Provisioning (2:1 dedup, OP) 2. Record the Baseline Characteristics Usable Capacity Power Consumption Active IO rate 3. Observe/Record the Optimized Characteristics from Enabling the COM 4. Adjust the Physical Capacity to Create Equivalent Usable Capacity w/baseline Note: could also hold physical capacity constant & optimize the logical capacity 5. Record the Physically Optimized Characteristics 7

Dedupe Enabled Emerald Script File (Example) ################## # # Single Lun Dedup Tests - fill ################## dedupratio=5 dedupunit=16k # concatenatesds=yes sd=sd1,lun=\\.\physicaldrive10 #Hotband workload wd=hotwd_uniform,skew=6,sd=sd*,seekpct=rand,rdpct=50 8

Dedupe Enabled Storage Array Screen Shot (Example) 9

Energy Efficient Storage: SNIA Emerald Program Specific COM Examples (70/30 R/W random workload) Based on Physical Capacity Reduction COM Type Raid Parity Data Compression Baseline Config Raid 1 Mirrored and Striped Uncompressed Baseline Capacity (GB) 36,000 (160 450 GB disks) 36,000 (160 450 GB disks) Baseline Power (W) 2360(I) 3134(B) 2360(I) 3134(B) Baseline (GB/W) 15.25 15.25 Baseline Perf. (IOPs) 36,010 36,010 Baseline (IO/W) 11.49 11.49 Optimized Config Raid 5 (3+1 Data + Parity) Compressed (2:1) Optimized Capacity (GB) 36,000 (107 450 GB disks) 36,000 (80 450 GB disks) Opt. Power (W) 1564(I) 2200(B) 1464(I) 2030(B) Optimized (GB/W) 23.02 24.59 Delta (GB/W) +51% +61% Opt. Perf (IOPs) 17,040 8,235 Optimized (IO/W) 7.75 4.06 Delta (IO/W) (33%) (64%) 10

Energy Efficient Storage: SNIA Emerald Program Specific COM Examples (70/30 R/W random) Based on Physical Capacity Reduction COM Type Thin Provisioning TP + Data Dedup Baseline Config Raid 5 (3+1 Data + Parity) Full Provisioned No Dedup Baseline Capacity (GB) 54,000 (160 450 GB disks) 54,000 (160 450 GB disks) Baseline Power (W) 2360(I) 3134(B) 2360(I) 3134(B) Baseline (GB/W) 22.88 22.88 Baseline Perf. (IOPs) 25,250 25,250 Baseline (IO/W) 8.06 8.06 Optimized Config Thin Provisioning (50% written) TP+Deduped Patterns (2:1) Optimized Capacity (GB) 54,000 (50% written) 54,000 (50% written-dedup 2:1) Opt. Power (W) 1464(I) 2030(B) (80 disks) 1016(I) 1478(B) (40 disks) Optimized (GB/W) 36.88 53.15 Delta (GB/W) +61% +132% Opt. Perf (IOPs) 12,560 4,710 (2:1 Dedup Ratio) Optimized (IO/W) 6.19 3.19 Delta (IO/W) (23)% (60%) 11

Summary of COM Benefits COM Feature Physical capacity savings Idle / active power savings IOPs (70/30 R-W Workload) Parity RAID 33% 34% / 30% 0.47 X Compression 50% 38% / 35% 0.25 X Thin Provision 50% 38% / 35% 0.50 X Thin Provision + Deduplication Snapshot 75% 57% / 68% 0.20X not tested Note: capacity savings = fewer drives 12

CAPEX Savings Estimated COM Feature Fewer drives Cost savings* Tested 3.1 kw Cost savings Scaled to 10kW 100kW Parity RAID 53 $34.5K $110K -- $1100K Compression 80 $49.3K $157K -- $1570K Thin Provision 80 $49.3K $157K -- $1570K Thin Provision + Deduplication 120 $71.3K $228K -- $2280K *Estimated customer costs for disk drive hardware: $549 for a 450GB 15K SAS drive $2710 for an empty drive enclosure 13

OPEX Savings Estimated COM Feature Less installed power Cost savings* Tested 3.1kW Cost savings* Scaled to 10kW 100kW Parity RAID 934W $497/mo $1585/mo -- $15850/mo Compression 1104W $587/mo $1874/mo -- $18740/mo Thin Provision 1104W $587/mo $1874/mo -- $18740/mo Thin Provision + Deduplication 1656W $881/mo $2811/mo -- $28110/mo *It costs $5,320 per month per 10-kW installation for data center power, cooling, and space (based on a first-order approximation for a typical data center by Patel and Shah at HP Labs) 14

Summary For the most part, the COMs can deliver capacity and power efficiency, but at a performance penalty [I.e., the idle Capacity/Power metric will improve, but when subjected to an active workload, the performance/power metric will be reduced] Note: these are COMs not POMs (Performance Optimization Methods) 15

References 1. SNIA Emerald program, SNIA Emerald Power Efficiency Measurement Specification, http://snia.org/tech_activities/standards/curr_standards/emerald 2. U.S. Environmental Protection Agency, ENERGY STAR Program for Data Center Storage Specification, www.energystar.gov/index.cfm?c=new_specs.enterprise_storage 3. The Green Grid, White Paper #58 - The Green Grid Data Center Storage Productivity Metrics (DCsP) http://www.thegreengrid.org/en/global/content/white-papers/wp58-dcsp 4. C. Patel and A. Shah, Cost Model for Planning, Development and Operation of a Data Center, HP Labs (2005) 16

POM Backup Slides www.thegreengrid.org 17

Workload with Hot Band IO Patterns Is asymmetric; comprised of several different IO streams, some of which contain hot spots or regions of more intense IO demand Results in varying degrees of logical block address re-referencing in certain regions of the overall IO space that can be either contained with the cache of an array, or can be placed on storage devices that deliver a higher IO rate Hot bands concentrate 54% of the IO in 32% of the space Therefore the opportunity exists to speed up a fraction of the total capacity IO Profile % of workload Read/Write Percentage IO Size (KiB) Write Stream 1 5 0/100 See Write Stream 5 0/100 See 2 Write Stream 5 0/100 See 3 Read Stream 5 100/0 See 1 Read Stream 5 100/0 See 2 Read Stream 5 100/0 See 3 Read Stream 5 100/0 See 4 Read Stream 5 100/0 See 5 Uniform 6 50/50 See Random Hot Band 1 28 70/30 See Hot Band 2 14 70/30 See Hot Band 3 7 70/30 See Hot Band 4 5 70/30 See Access Pattern Usable Address Range Sequential 0-100% Sequential 0-100% Sequential 0-100% Sequential 0-100% Sequential 0-100% Sequential 0-100% Sequential 0-100% Sequential 0-100% Random 0-100% Random 10-18% Re. SNIA Emerald Specification Random 32-40 % Random 55-63 % Random 80-88 % 18

Hot Band enables the benefits of Storage Caching and Device Tiering The Hot Band workload, when run on High End Storage (typically) demonstrates the performance/power advantage of two product features Array Based Cache Device Tiering Although the initial goal was solely cache focused, there is also a benefit of implementing faster tiers in the product via Device Tiering The hot band workload is included in the Emerald Specification, thus enabling these two features to improve the active metrics 19

Hot Band Primary Active Metric Comparison Configuration Tier Type Power Consumption (Modeled) IOPS IOPS/Watt Large Array (Initial) Large Array (Cache Assist) Large Array (Cache + Tiered) 15k RPM 7491 Watts 18410 2.457 15k RPM 7491 Watts 39900 5.326 15k + SSD 7283 Watts 42870 5.886 Overall Primary Metric Improvement ~ 140% (2.40x) 20

COM Backup Slides www.thegreengrid.org 21

Snapshot COM Benefits Due to the lack of an active snapshot workload, we present the following idle capacity analysis Most snapshot products consist of two components A base file or image created at a specific Point In Time (PIT) A delta file - the enumeration of the changes to the base file until another PIT The combination of these enables the recreation of many versions of the file or image in a space-efficient manner. Consider the case where a 100 GB file has 50 % of it s contents change in a week. Only 150 GB is required to store both files 25% physical capacity savings 22

Thank you www.thegreengrid.org 23