Planning for Easy Tier with IBM System Storage Storwize V7000 and SAN Volume Controller

Similar documents
IBM System Storage Reference Architecture featuring IBM FlashSystem for SAP landscapes, incl. SAP HANA

Implementing IBM Easy Tier with IBM Real-time Compression IBM Redbooks Solution Guide

IBM System Storage SAN Volume Controller IBM Easy Tier enhancements in release

IBM System Storage DS8870 Release R7.3 Performance Update

IBM System Storage DS8700 and DS8800 Performance with Easy Tier 2 nd Generation

How Smarter Systems Deliver Smarter Economics and Optimized Business Continuity

Stellar performance for a virtualized world

Microsoft Exchange Server 2010 workload optimization on the new IBM PureFlex System

DS8880 High Performance Flash Enclosure Gen2

IBM Storwize V7000: For your VMware virtual infrastructure

DS8880 High-Performance Flash Enclosure Gen2

IBM Storwize V5000 disk system

IBM System Storage DS8800 Overview

IBM System Storage SAN Volume Controller IBM Easy Tier in release

SAS workload performance improvements with IBM XIV Storage System Gen3

EMC XTREMCACHE ACCELERATES ORACLE

IBM i Virtual I/O Performance in an IBM System Storage SAN Volume Controller with IBM System Storage DS8000 Environment

IBM System Storage DS6800

IBM Tivoli Storage Productivity Center Version Storage Tier Reports. Authors: Mike Lamb Patrick Leahy Balwant Rai Jackson Shea

Technical Note P/N REV A01 March 29, 2007

Executive Brief June 2014

Lenovo SAN Manager - Provisioning and Mapping Volumes

Evaluation Report: Improving SQL Server Database Performance with Dot Hill AssuredSAN 4824 Flash Upgrades

IBM FlashSystem V840 and Comprestimator IBM Redbooks Solution Guide

Migration from a TS7740 to a TS7700T considerations

IBM System Storage LTO Ultrium 6 Tape Drive Performance White Paper

Dell Reference Configuration for Large Oracle Database Deployments on Dell EqualLogic Storage

IBM High IOPS SSD PCIe Adapters IBM System x at-a-glance guide

IBM Storwize V7000 TCO White Paper:

1 Revisions. Storage Layout, DB, and OS performance tuning guideline for SAP - V4.4. IBM System Storage layout & performance guideline for SAP

Lenovo SAN Manager Rapid RAID Rebuilds and Performance Volume LUNs

Lenovo SAN Manager. Rapid Tier and Read Cache. David Vestal, WW Product Marketing. June Lenovo.com/systems

IBM FlashSystem. IBM FLiP Tool Wie viel schneller kann Ihr IBM i Power Server mit IBM FlashSystem 900 / V9000 Storage sein?

SAP Applications on IBM XIV System Storage

IBM System Storage DS8700 Performance with Easy Tier

IBM XIV Storage System

IBM FlashSystem V MTM 9846-AC3, 9848-AC3, 9846-AE2, 9848-AE2, F, F. Quick Start Guide IBM GI

Virtualisation, tiered storage, space management How does it all fit together?

All-Flash Storage Solution for SAP HANA:

Using the Storage Tier Advisor Tool (STAT) and Easy Tier TM on the IBM Storwize V7000

LINUX IO performance tuning for IBM System Storage

High performance and functionality

IBM Virtualization Engine TS7700 Series Best Practices. Usage with Linux on System z 1.0

A Comparative Study of Microsoft Exchange 2010 on Dell PowerEdge R720xd with Exchange 2007 on Dell PowerEdge R510

Copyright 2012 EMC Corporation. All rights reserved.

The Deployment of SAS Enterprise Business Intelligence Solution in a large IBM POWER5 Environment

EMC XTREMCACHE ACCELERATES VIRTUALIZED ORACLE

IBM FlashSystem V Quick Start Guide IBM GI

IBM TotalStorage Enterprise Storage Server Model RAID 5 and RAID 10 Configurations Running Oracle Database Performance Comparisons

Using EMC FAST with SAP on EMC Unified Storage

Optimizing the Data Center with an End to End Solutions Approach

Reduce Latency and Increase Application Performance Up to 44x with Adaptec maxcache 3.0 SSD Read and Write Caching Solutions

LSI and HGST accelerate database applications with Enterprise RAID and Solid State Drives

Continuous Availability with the IBM DB2 purescale Feature IBM Redbooks Solution Guide

IBM System Storage N3000 Express series Modular Disk Storage Systems

IBM TotalStorage Enterprise Storage Server Model 800

Nimble Storage vs HPE 3PAR: A Comparison Snapshot

Virtualization of the MS Exchange Server Environment

Vendor: IBM. Exam Code: Exam Name: IBM System Storage DS8000 Technical Solutions V3. Version: Demo

Exam : Title : High-End Disk for Open Systems V2. Version : DEMO

C Q&As IBM Certified Specialist - Enterprise Storage Sales V4

LEVERAGING FLASH MEMORY in ENTERPRISE STORAGE

LEVERAGING EMC FAST CACHE WITH SYBASE OLTP APPLICATIONS

IBM FlashSystem storage

SAS Connectivity Card (CIOv) for IBM BladeCenter IBM Redbooks Product Guide

Question No: 1 Which tool should a sales person use to find the CAPEX and OPEX cost of an IBM FlashSystem V9000 compared to other flash vendors?

Designing a Reference Architecture for Virtualized Environments Using IBM System Storage N series IBM Redbooks Solution Guide

Системы хранения IBM. Новые возможности

Exam : Title : High-End Disk Solutions for Open Systems Version 1. Version : DEMO

EMC XTREMCACHE ACCELERATES MICROSOFT SQL SERVER

Deploy a High-Performance Database Solution: Cisco UCS B420 M4 Blade Server with Fusion iomemory PX600 Using Oracle Database 12c

VEXATA FOR ORACLE. Digital Business Demands Performance and Scale. Solution Brief

IBM System Storage DS5020 Express

Speaker Notes. IBM Software Group Rational software. Exporting records from ClearQuest

InfoSphere Warehouse with Power Systems and EMC CLARiiON Storage: Reference Architecture Summary

EMC VMAX 400K SPC-2 Proven Performance. Silverton Consulting, Inc. StorInt Briefing

IBM Real-time Compression and ProtecTIER Deduplication

Implementing IBM CICS JSON Web Services for Mobile Applications IBM Redbooks Solution Guide

Lenovo RAID Introduction Reference Information

The next step in Software-Defined Storage with Virtual SAN

Storwize/IBM Technical Validation Report Performance Verification

IBM Corporation IBM Corporation

IBM DS8880F All-flash Data Systems

Management Update: Storage Management TCO Considerations

SPC BENCHMARK 1 EXECUTIVE SUMMARY IBM CORPORATION IBM STORWIZE V7000 (SSDS) SPC-1 V1.12

... IBM Power Systems with IBM i single core server tuning guide for JD Edwards EnterpriseOne

Dell PowerEdge R720xd with PERC H710P: A Balanced Configuration for Microsoft Exchange 2010 Solutions

IBM System Storage EXP3500 Express

Assessing performance in HP LeftHand SANs

Best practices. Starting and stopping IBM Platform Symphony Developer Edition on a two-host Microsoft Windows cluster. IBM Platform Symphony

White Paper. EonStor GS Family Best Practices Guide. Version: 1.1 Updated: Apr., 2018

EMC Integrated Infrastructure for VMware. Business Continuity

IBM EXAM QUESTIONS & ANSWERS

EMC Disk Tiering Technology Review

... IBM Advanced Technical Skills IBM Oracle International Competency Center September 2013

ServeRAID-BR10il SAS/SATA Controller v2 for IBM System x IBM System x at-a-glance guide

IBM System Storage DS4800

PowerVault MD3 SSD Cache Overview

Oracle Flash Storage System QoS Plus Operation and Best Practices ORACLE WHITE PAPER OCTOBER 2016

Roadmap for Enterprise System SSD Adoption

Transcription:

Planning for Easy Tier with IBM System Storage Storwize V7000 and SAN Volume Controller May 2013 Nick Clayton Carlos Fuente Document WP102295 Systems and Technology Group 2013, International Business Machines Corporation

Notices, Disclaimer and Trademarks Copyright 2013 by International Business Machines Corporation. No part of this document may be reproduced or transmitted in any form without written permission from IBM Corporation. Product data has been reviewed for accuracy as of the date of initial publication. Product data is subject to change without notice. This information may include technical inaccuracies or typographical errors. IBM may make improvements and/or changes in the product(s) and/or programs(s) at any time without notice. References in this document to IBM products, programs, or services does not imply that IBM intends to make such products, programs or services available in all countries in which IBM operates or does business. THE INFORMATION PROVIDED IN THIS DOCUMENT IS DISTRIBUTED "AS IS" WITHOUT ANY WARRANTY, EITHER EXPRESS OR IMPLIED. IBM EXPRESSLY DISCLAIMS ANY WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE OR NON-INFRINGEMENT. IBM shall have no responsibility to update this information. IBM products are warranted according to the terms and conditions of the agreements (e.g., IBM Customer Agreement, Statement of Limited Warranty, International Program License Agreement, etc.) Under which they are provided. IBM is not responsible for the performance or interoperability of any non- IBM products discussed herein. The performance data contained herein was obtained in a controlled, isolated environment. Actual results that may be obtained in other operating environments may vary significantly. While IBM has reviewed each item for accuracy in a specific situation, there is no guarantee that the same or similar results will be obtained elsewhere. Statements regarding IBM s future direction and intent are subject to change or withdraw without notice, and represent goals and objectives only. The provision of the information contained herein is not intended to, and does not grant any right or license under any IBM patents or copyrights. Inquiries regarding patent or copyright licenses should be made, in writing, to: IBM Director of Licensing IBM Corporation North Castle Drive Armonk, NY 10504-1785 U.S.A. IBM, Easy Tier, FlashCopy, System Storage, Storage Tier Advisor Tool, Storwize V7000, San Volume Controller are trademarks of International Business Machines Corporation in the United States, other countries, or both. Other company, products or service names may be trademarks or service marks of others. Document: WP102295 2013 IBM Corporation Page 2 of 19

Acknowledgements The authors would like to thank the following colleagues for their comments and insight: Phil Bryan Lawrence Chiu Chip Jarvis Bruce McNuttt Brian Sherman Yan Xu We would also like to thank Fiona Pudge for her work providing the V7000 configuration and pricing data. A Note to the Reader This White Paper assumes a familiarity with the general concepts of Enterprise Disk Storage Systems,the Storwize V7000 product line and the SAN Volume Controller. Readers unfamiliar with these topics should consult the References section at the end of this paper. Document: WP102295 2013 IBM Corporation Page 3 of 19

Table of Contents Acknowledgements... 3 A Note to the Reader... 3 Table of Contents... 4 Table of Figures... 4 1 Introduction... 5 1.1 Audience... 5 2 Overview... 6 2.1 Solid State Storage in Storwize V7000 and SVC... 6 2.2 Easy Tier... 7 2.3 Disk Magic... 7 3 Drive Selection with Easy Tier... 8 3.1 Drive selection using a single tier... 8 3.2 Skew and percentage of hot data... 10 3.3 Choosing an optimized configuration based on skew... 12 3.4 Further Application acceleration... 13 3.5 Easy Tier Data Movement... 14 4 Sample TCO benefits with Easy Tier... 15 4.1 Typical 50TB mixed environment... 15 4.2 High performance environment... 16 5 Conclusions... 18 6 References... 19 Table of Figures Figure 1. Storwize V7000 SPC-1 benchmark response time curves... 6 Figure 2. Example maximum access densities by drive type... 9 Figure 3. Skew for several workloads... 10 Figure 4. Skew for several workloads showing hottest 20% of data... 11 Figure 5. Skew for System i workload... 11 Figure 6 Effect of different percentage of SSDs in a 2-tier configuration... 12 Figure 7 Drive Selection with a specific skew curve... 12 Figure 8 Example of Easy Tier data movement... 14 Figure 9 Three year costs for 50TB mixed environment... 15 Figure 10 Yearly environmental costs for 50TB mixed environment... 16 Figure 11 Three year costs for high performance environment... 17 Figure 12 Yearly environmental costs for high performance environment... 17 Document: WP102295 2013 IBM Corporation Page 4 of 19

1 Introduction Over the last two years, Easy Tier has helped numerous of IBM s clients using SAN Volume Controller and IBM Storwize V7000 to achieve dramatic gains in performance and storage efficiency for their production environments. This paper aims to provide an overview of the Easy Tier functionality and show how typical configurations can help improve performance and Total Cost of Ownership for real world production workloads. 1.1 Audience This technical paper was developed to assist IBM Business Partners, field sales representatives, technical specialists, and IBM s clients in understanding the performance characteristics of the Storwize V7000 and SVC when used in an Easy Tier environment. Document: WP102295 2013 IBM Corporation Page 5 of 19

2 Overview IBM Storwize V7000 and Storwize V7000 Unified are virtualized storage systems designed to consolidate workloads into a single storage system for simplicity of management, reduced cost, highly scalable capacity, performance and high availability. They offer improved efficiency and flexibility through built-in solid state drive (SSD) optimization, thin provisioning, Real-Time compression and non-disruptive migration of data from existing storage. They can also virtualize and reuse existing disk systems offering a greater potential return on investment. 2.1 Solid State Storage in Storwize V7000 and SVC SAN Volume Controller and Storwize V7000 come with a broad range of options for integrating solid state storage into a storage environment. Storwize V7000 can be ordered with industry standard SSDs with configurations starting as small as two 200GB drives, growing to many TB s of SSD RAID storage using 200GB or 400GB drive options. Data can be protected with any of the supported RAID levels, but it is likely that RAID-5 will be the best choice for most environments as is generally the case with traditional HDDs. IBM has submitted SPC-1 benchmarks for the Storwize V7000 both with and without SSDs and a link to these results can be found in the references section. The response time and throughput information from these benchmarks is shown in Figure 1 below. This shows both the significant response time improvements possible when using SSDs and the ability of the Storwize V7000 to drive significantly more IO than is possible purely using traditional HDDs. 12 10 Response time (ms) 8 6 4 2 0 0 20,000 40,000 60,000 80,000 100,000 120,000 140,000 IOs per second Storwize V7000 Storwize V7000 with SSDs Figure 1. Storwize V7000 SPC-1 benchmark response time curves SVC supports up to four 200GB or 400GB SSDs in the latest CG8 node, which can be protected using RAID-10 and provides a very convenient way of adding small amounts of solid state storage to a large environment. Up to 1.6TB of usable RAID-protected capacity can be added to an I/O group or up to 6.4TB for a clustered system. Document: WP102295 2013 IBM Corporation Page 6 of 19

Both SVC and Storwize V7000 can integrate storage from external storage systems into storage pools. This includes the family of IBM FlashSystem products developed by Texas Memory Systems. Solid State Storage can also be integrated into the external storage systems, such as the IBM System Storage DS8870. This storage can be presented as Tier 0 storage and integrated into the storage pools. Any of these forms of storage, from IBM or other vendors, can benefit from the advanced virtualization capabilities of SVC and Storwize V7000 including features like Easy Tier. All this is accessible from the easy to use management UI, or from the CLI.. 2.2 Easy Tier Storwize V7000 and SVC both support Easy Tier, IBM's industry leading technology for optimizing use of solid state storage. This technology was first released for the DS8000, and has since been integrated into Storwize V7000 following its release in 2010. Easy Tier provides a very easy to configure means of integrating solid state storage into existing data center environments and optimizing the use of that storage, providing maximum return of investment. Many applications exhibit a very significant skew in the distribution of I/O workload: a small fraction of the storage is responsible for a disproportionately large fraction of the total I/O workload of an environment. This paper will present data from real environments that illustrate this distribution. Easy Tier acts to identify this skew and to automatically place data to take advantage of it. By moving the 'hottest' data onto solid state storage, the workload on the remainder of the storage is significantly reduced. By servicing the majority of the application workload from the fastest storage, Easy Tier acts to accelerate application performance, and increase overall server utilization. This can reduce costs in servers and application licenses. Easy Tier also presents a wide range of opportunities for reducing the cost of the remainder of the storage, without losing the performance advantage. This reduced cost can come from both lower acquisition cost, by using devices that cost less $/GB, and also from savings in terms of power and rack space that contribute significantly to the hidden costs in a data center. This paper will present sample calculations to demonstrate this. 2.3 Disk Magic One tool available for Easy Tier analysis is Disk Magic 1, a performance modeling tool used by IBM that can help predict the expected performance of storage subsystems. It has been enhanced to include support for the new features of Easy Tier. Using data from existing systems, models can be created to predict the performance of Easy Tier in Storwize V7000 and SVC configurations. 1 Disk Magic is a registered trademark of IntelliMagic, Inc. Document: WP102295 2013 IBM Corporation Page 7 of 19

3 Drive Selection with Easy Tier While the operation of Easy Tier is extremely simple the number of drive combinations possible in an Easy Tier environment provides a much greater degree of choice than when considering a single tier of storage. The key questions we might ask for an environment are: 1) What size drives would I need to satisfy the workload if using a single tier of enterprise drives and striping the data over the available drives? This provides a good view as to how active or inactive the overall workload is. 2) What is the skew of my data and how much SSD capacity would be needed to handle say 50%, (or even 80% or 99%) of the random workload? 3) Given particular capacity of SSD storage, what size of Enterprise drives should be used to handle the workload remaining on the Enterprise tier? In some cases it may even be that enterprise drives are not required and a combination of SSD and Nearline drives is appropriate. 4) Is there an opportunity to provide significantly better backend performance in the target environment to accelerate application workloads and is the workload type (cache hit ratios / percentage sequential workload etc) such that SSD storage will provide significant application acceleration? For environments already running on an Easy Tier capable disk subsystem we can answer these questions with a good degree of accuracy as we can use the monitoring capability of Easy Tier to understand the environment in detail. In case the workloads are on other disk subsystems we would need to use a combination of overall workload performance data and to compare the environment against the range of workloads for which we have collected real world performance data to provide an estimate of how the workload skew might be distributed. In the sections below we will discuss each of these four points in turn. 3.1 Drive selection using a single tier Sizing disk subsystems in a single-tier environment is a well understood process; where, given a particular capacity and workload requirement, tools such as Disk Magic are used to determine the optimal drive size and other configuration factors to meet both of these requirements. The key metric that Disk Magic uses to determine the optimal drive size is the access density of an application. This compares the peak IO workload that the application generations to the total amount of data the application needs to access. We need to remember that the SVC and Storwize V7000 are able to perform wide striping of data using striped mode VDisks, which can significantly improve balance of workload and allow for larger drives than might have been used in a traditional disk subsystem with volumes assigned to a subset of the available backend resources. For example, analysis in customer environments have shown that implementing wide striping of data using SVC would have allowed for equal or better performance with twice the drive size as was being used in a traditionally configured environment. Document: WP102295 2013 IBM Corporation Page 8 of 19

The table below shows some example access densities (IOPS/GB) that can be supported for a particular workload on different drive types based on Disk Magic modelling. We have chosen a 16kB average IO size with 80% read, 15% sequential IO and a 50% cache hit ratio as a reasonably representative workload. Access density (IOPS/GB) Drive Type 16kB workload 200GB SSD 60 400GB SSD 30 146GB 15K 2.00 300GB 15K 1.00 450GB 10K 0.48 600GB 10K 0.36 900GB 10K 0.24 1TB 7.2K 0.14 2TB 7.2K 0.07 3TB 7.2K 0.03 Figure 2. Example maximum access densities by drive type Note : RAID6 is assumed for the Nearline drives and RAID5 for Enterprise SAS drives and SSD. For example, if we had a workload doing 50,000 IOPS on 60TB of data then this would be doing 50,000/60,000 = 0.83 IOPS/GB and so we might choose a single tier configuration using 300GB 15K drives. This would be a relatively active workload in typical mixed workload commercial environment. The main point here is the comparison of the capability of the different drive types, which span a wide range and show a dramatically higher access density for SSD compared to other tiers. Real application workloads are a complex mix of IO sizes but the 16KB IO size is a representative average and much more realistic than some other benchmarks such as 512byte 100% read workloads. Document: WP102295 2013 IBM Corporation Page 9 of 19

3.2 Skew and percentage of hot data When implementing Easy Tier one of the key characteristics of the workload is the skew of the random and small sized IO activity in the environment. By this we mean how evenly or unevenly the IO activity is spread over the capacity. The skew curve of a number of real workloads (Figure 3) shows that in most environments the random IO activity is concentrated on a very small percentage of the capacity and a significant percentage of capacity has very little random IO workload. The skew is in the range of 5% of allocated capacity receives 50% (SVC #4) to 87% (SVC#2, V7000#1) of small IO accesses. 100 90 80 Percentage of small IO workload 70 60 50 40 30 20 10 0 0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95 100 Percentage of allocated capacity SVC #1 SVC #2 V7000 #1 SVC #3 SVC #4 Figure 3. Skew for several workloads In typical environments somewhere between 2 5% of SSD can result in the movement of 50% of the small/random IO workload from enterprise drives to SSDs as shown in the zoomed graph in Figure 4 which shows only the hottest 20% of the workload. This would potentially allow for a doubling of the enterprise drive size compared to a single-tier configuration to provide the same capacity and equivalent or better performance. In some environments this can be significantly more and we can see up to 80% of the small random IO moved to SSDs for 5% or even less of SSD capacity. Document: WP102295 2013 IBM Corporation Page 10 of 19

100 90 80 Percentage of small IO workload 70 60 50 40 30 20 10 0 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Percentage of allocated capacity SVC #1 SVC #2 V7000 #1 SVC #3 SVC #4 Figure 4. Skew for several workloads showing hottest 20% of data If the environment is one where there is a significant amount of very small granularity striping, such as Oracle or DB2 tablespace striping, then the skew of the workload may be significantly reduced. This is also true of at least some System i environments. In these cases there may be less benefit from smaller amounts of SSDs and it may not be economical to implement an Easy Tier solution. Figure 5 shows the skew for a System i environment. It is clear to see that System i is very effective in spreading the workload over the extents although there is some noticeable skew for the small random write activity. 100 90 80 70 Percentage of workload 60 50 40 30 20 10 0 0 10 20 30 40 50 60 70 80 90 100 Percentage of extents % small reads % small writes %MB Figure 5. Skew for System i workload Document: WP102295 2013 IBM Corporation Page 11 of 19

3.3 Choosing an optimized configuration based on skew Once we understand the skew of workload we can look to see how moving a percentage of data to SSDs can enable a use of larger or slower drives for the remainder of the workload based on the relative performance and capacity of the different drive types. Figure 6 gives some examples of the effect of adding SSDs to a configuration. It assumes that 2 5% of SSDs will result in 50% of the small random workload on SSDs and 5 10% of the SSDs will result in 75 80% of the small random workload on SSDs which is a reasonable assumption for the skewed workloads shown in Figure 4. Single-tier drive size driven 2 5% SSDs 5 10% SSDs by performance requirements 900 GB 1TB 2TB 600 GB 900 GB / 1TB 1TB / 2TB 450 GB 900 GB 1TB 300 GB 600 GB 900 GB 146 GB 300 GB 600 GB Figure 6 Effect of different percentage of SSDs in a 2-tier configuration The values in the table above are somewhat conservative and if the skew of a workload is known then it is possible to take a more accurate view of the percentage of SSDs which would allow a particular drive size. Figure 7 shows, for a particular skew curve, how much SSD capacity would be required to allow different sizes of enterprise drive to be used in the configuration. If we assume 146GB drives would be required for a single tier solution then the intersection between the skew curve and the lines representing the other drives shows what capacity of SSDs would be needed to use this particular drive size. 100 90 80 Percentage of small IO workload 70 60 50 40 30 20 10 0 0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95 100 Percentage of allocated capacity Workload Skew Single tier 146GB drives Two tier with 300GB drives Two tier with 450GB drives Two tier with 600GB drives Two tier with 900GB drives Two tier with 2TB drives Figure 7 Drive Selection with a specific skew curve This shows that only 3% SSDs would be required in this configuration to move to 300GB drives. 450GB drives would be possible with 6% SSDs and 10% SSDs would allow 900GB drives. Document: WP102295 2013 IBM Corporation Page 12 of 19

The analysis method here looks at the breakdown of the average workload and concentrates on the backend random IOPS. It does not take into account the performance limits for peak workloads (e.g. sequential workloads with high MB/s) so care should be taken to understand this when moving to a much larger drive size. Disk Magic can help with this more detailed modeling. 3.4 Further Application acceleration There are situations where an application requires SSD performance for an entire environment rather than simply to optimize the price, power, and footprint of their environment. We have seen that it is possible to move almost the entire active workload for an environment onto SSDs with a relatively small percentage of SSDs. One particular client is running with Easy Tier and with 20% of their database environment on SSDs and has equivalent performance to that of a 100% SSD solution. Looking at the skew information from a range of clients we can see that adding more SSDs to a configuration can very quickly result in the majority of the small/random workload being moved to these drives. However, there are diminishing returns so unless there is workload expansion requirement or an application and business benefit for significantly improved read miss performance the higher percentages of SSDs may not provide the best return on investment. Document: WP102295 2013 IBM Corporation Page 13 of 19

3.5 Easy Tier Data Movement One of the other questions often asked regarding Easy Tier is how much the workload changes over time and hence how much work Easy Tier has to do on a daily basis in order to optimize the configuration. The chart below shows the number of extents moved per 15 minute period for an Easy Tier environment once the initial promotion of data to SSDs has completed. This configuration is using a 512MB extent size so the daily movement is typically around 200GB out of a total capacity of 140TB which is less than 0.2% of data being moved. This very small percentage of data is all that needs to be moved to maintain an optimized configuration. 35 30 25 Extent moves 20 15 k 10 5 0 00:00:00 02:00:00 04:00:00 06:00:00 08:00:00 10:00:00 12:00:00 14:00:00 16:00:00 18:00:00 20:00:00 22:00:00 00:00:00 Time 01/10/2012 02/10/2012 03/10/2012 04/10/2012 04/10/2012 Figure 8 Example of Easy Tier data movement The chart also shows the fact that Easy Tier is using a randomized interval for the analysis and subsequent movement of data so that it does not happen at the same time each day. In fact on the 4 th November there were two periods of data movement as the Easy Tier analysis was performed twice. Document: WP102295 2013 IBM Corporation Page 14 of 19

4 Sample TCO benefits with Easy Tier As well as providing improved performance using a multiple tier configuration, Easy Tier can also provide benefits both in terms of acquisition costs and ongoing datacentre costs for power, cooling and floor space. This section provides two different examples of storage requirements, and shows the benefits Easy Tier can provide compared to a standard single tier configuration using Storwize V7000 disk subsystems to meet the requirement. In these examples we have chosen relatively conservative assumptions regarding the skew and as can be seen in section 3.2 it is likely that Easy Tier could provide even greater savings in many environments. 4.1 Typical 50TB mixed environment The first example is a configuration of 50TB usable capacity with performance requirements that can be met with 300GB 15K RPM drives. Two alternative configurations are considered using Easy Tier. The first is a configuration with 5% SSDs and 95% 600GB 10K RPM drives and the second is a configuration with 10% SSDs and 90% 900GB 10K RPM drives. Both of these alternative configurations would easily meet the performance requirements of the environment given the typical skew levels shown earlier in this paper and would generally provide a better average response time given the percentage of random workload that is moved to the SSDs by Easy Tier. The chart below shows the comparative costs of the three configurations over three years, including hardware, software and maintenance. As can be seen the Easy Tier configuration with 5% SSD is the most cost effective configuration although the 10% SSD solution is likely to provide lower response times and has a similar cost to the base configuration with 300GB drives. 1.1 1.0 0.9 0.8 0.7 Comparative cost 0.6 0.5 0.4 0.3 0.2 0.1 0.0 100% 300GB 5% SSD 95% 600GB 10% SSD 90% 900GB Configuration Hardware Software Maintenance Figure 9 Three year costs for 50TB mixed environment Document: WP102295 2013 IBM Corporation Page 15 of 19

If we look at the datacenter costs for power, cooling and floor space for this configuration we can again see that both the Easy Tier configurations are providing a significant reduction compared to the traditional enterprise drive configuration with around a 40% saving in environmental costs. 1 0.9 0.8 0.7 Comparative yearly costs 0.6 0.5 0.4 0.3 0.2 0.1 0 100% 300GB 5% SSD 95% 600GB 10% SSD 90% 900GB Configuration Floorspace Power/cooling Figure 10 Yearly environmental costs for 50TB mixed environment 4.2 High performance environment The second example is a high performance environment. The base configuration requires a full Storwize V7000 configuration of 240 146GB 15K RPM drives in a RAID10 configuration. For the sake of simplicity we have assumed that this matches the requirements both from a performance and capacity perspective. Two alternative configurations were considered. The first is an Easy Tier configuration with 20% SSDs and 80% 600GB 10K RPM drives in a RAID5 configuration and the second is a SSD only configuration with 100% 400GB SSDs also in RAID5. Both the SSD and Easy Tier configurations would be expected to give significantly improved performance compared to the base 146GB configuration. In the Easy Tier configuration we might expect 95% or more of the random IOs to be on the SSDs and so the performance could be very similar to the SSD only configuration. Document: WP102295 2013 IBM Corporation Page 16 of 19

The chart below shows the comparative costs of the three configurations over three years, including hardware, software and maintenance. As can be seen the Easy Tier configuration is the most cost effective configuration and while the SSD only configuration has a significantly higher maximum performance capability it also is more expensive than either of the other configurations. 1.75 1.50 1.25 Comparative cost 1.00 0.75 0.50 0.25 0.00 146GB drives 20% SSD 80% 600GB drives 100% SSD Configuration Hardware Software Maintenance Figure 11 Three year costs for high performance environment If we look at the datacentre costs for power, cooling and floorspace for this configuration we can again see that the Easy Tier configuration is providing a significant reduction compared to the traditional enterprise drive configuration with around 75% saving in environmental costs. What is perhaps somewhat surprising is that with this typical skew level the combination of 20% SSDs with larger enterprise drives is also more efficient from a facilities perspective than the 100% SSD solution. 1 0.9 0.8 0.7 Comparative yearly costs 0.6 0.5 0.4 0.3 0.2 0.1 0 146GB drives 20% SSD 80% 600GB drives 100% SSD Configuration Floorspace Power/cooling Figure 12 Yearly environmental costs for high performance environment Document: WP102295 2013 IBM Corporation Page 17 of 19

5 Conclusions With Easy Tier an automated solution is made available for automatically managing data placement for performance in a variety of different configurations. Typical real world workloads shown in this paper have a significantly uneven distribution of the IO activity over the capacity which enables significant performance benefits from relatively small percentages of SSDs. We have also shown that Easy Tier can provide a reduction in both acquisition costs and the ongoing datacenter costs by enabling the use of larger enterprise drives along with SSDs. This paper reflects performance and cost of storage in early 2013. The reader must remember though that the capabilities and cost of flash storage are improving markedly every quarter. The opportunities described here for flash storage will only increase as the technology develops. In particular we expect to see an increasing opportunity for 100% SSD systems to be deployed in the near future, based on a number of factors: Use of technology such as IBM's Real-time compression to reduce the cost of SSD storage Cost savings that can be realized outside of the storage management environment through the use of SSD storage, by improving server and application performance and reducing costs their. There are also further opportunities to reduce application development and maintenance costs, through the benefits of reduced response time that SSD storage provide The creation of new business applications that take advantage of SSD storage and its unique capabilities to solve latent business problems in novel ways Realizing the full benefits needs new co-ordination within organizations, since some of the benefits can only be achieved by reaching out from the storage management domain into other parts of the IT organization, and into other business units. However the trend is set in terms of increasing use of SSD storage in all data centers. Document: WP102295 2013 IBM Corporation Page 18 of 19

6 References The following redbooks and whitepapers contain information about the architecture and implementation of Easy Tier on the SVC and Storwize V7000 as well as performance information with benchmark workloads. Implementing the IBM System Storage SAN Volume Controller V6.3 http://www.redbooks.ibm.com/abstracts/sg247933.html?open IBM System Storage SAN Volume Controller Best Practices and Performance Guidelines http://www.redbooks.ibm.com/abstracts/sg247521.html?open Using the Storage Tier Advisory Tool (STAT) and Easy Tier on the IBM Storwize V7000 http://www-03.ibm.com/support/techdocs/atsmastr.nsf/webindex/wp101852 SAP and IBM demonstrate capability of candling high billing volume in a Telecommunications Scenario http://www-03.ibm.com/support/techdocs/atsmastr.nsf/webindex/wp101984 Microsoft SQL Server 2008 R2 with IBM System x3850 X5 and IBM Storwize V7000 Easy Tier http://www-03.ibm.com/support/techdocs/atsmastr.nsf/webindex/wp101988 Performance benefits of IBM Storwize V7000 with IBM Easy Tier for Oracle ASM http://www-03.ibm.com/support/techdocs/atsmastr.nsf/webindex/wp101990 The following whitepapers provide information about Easy Tier functionality as implemented on the on the DS8000. IBM System Storage DS8700 and DS8800 Performance with Easy Tier 3 nd Generation. http://www-03.ibm.com/support/techdocs/atsmastr.nsf/webindex/wp102024 IBM System Storage DS8700 and DS8800 Performance with Easy Tier 2 nd Generation. http://www-03.ibm.com/support/techdocs/atsmastr.nsf/webindex/wp101961 IBM System Storage DS8700 Performance with Easy Tier http://www-03.ibm.com/support/techdocs/atsmastr.nsf/webindex/wp101675 The SPC-1 benchmarks referenced in this whitepaper can be found here: http://www.storageperformance.org/results/benchmark_results_spc1/#spc1 Document: WP102295 2013 IBM Corporation Page 19 of 19