Dell EMC Unity: Data Reduction Analysis

Similar documents
DELL EMC UNITY: DATA REDUCTION

White Paper Effects of the Deduplication/Compression Function in Virtual Platforms ETERNUS AF series and ETERNUS DX S4/S3 series

Surveillance Dell EMC Storage with Bosch Video Recording Manager

Surveillance Dell EMC Storage with Cisco Video Surveillance Manager

VMware vstorage APIs FOR ARRAY INTEGRATION WITH EMC VNX SERIES FOR SAN

Surveillance Dell EMC Storage with Verint Nextiva

Dell EMC SAN Storage with Video Management Systems

MIGRATING TO DELL EMC UNITY WITH SAN COPY

Surveillance Dell EMC Storage with Cisco Video Surveillance Manager

Surveillance Dell EMC Storage with Digifort Enterprise

Dell Compellent Storage Center and Windows Server 2012/R2 ODX

Surveillance Dell EMC Storage with Milestone XProtect Corporate

Surveillance Dell EMC Storage with Milestone XProtect Corporate

Video Surveillance EMC Storage with Honeywell Digital Video Manager

Dell EMC Unity Family

Surveillance Dell EMC Storage with FLIR Latitude

DELL EMC UNITY: BEST PRACTICES GUIDE

EMC Virtual Infrastructure for Microsoft Exchange 2010 Enabled by EMC Symmetrix VMAX, VMware vsphere 4, and Replication Manager

Dell EMC UnityVSA Cloud Edition with VMware Cloud on AWS

EMC STORAGE FOR MILESTONE XPROTECT CORPORATE

Surveillance Dell EMC Storage with IndigoVision Control Center

DELL EMC UNITY: COMPRESSION FOR FILE Achieving Savings In Existing File Resources A How-To Guide

EMC Integrated Infrastructure for VMware. Business Continuity

Dell EMC Storage with the Avigilon Control Center System

Surveillance Dell EMC Storage with Synectics Digital Recording System

Understanding Virtual System Data Protection

EMC VSPEX END-USER COMPUTING

Dell EMC Ready Architectures for VDI

Video Surveillance EMC Storage with LENSEC Perspective VMS

Virtualizing SQL Server 2008 Using EMC VNX Series and VMware vsphere 4.1. Reference Architecture

Upgrade to Microsoft SQL Server 2016 with Dell EMC Infrastructure

vsan Disaster Recovery November 19, 2017

Dell EMC Unity Family Dell EMC Unity All Flash, Unity Hybrid, UnityVSA. Unity shut down and restart the system (power down and power up)

Dell EMC vsan Ready Nodes for VDI

Dell EMC Storage with the Avigilon Control Center System

Dell EMC Storage with Panasonic Video Insight

EMC VNX2 Deduplication and Compression

CLOUDIQ OVERVIEW. The Quick and Smart Method for Monitoring Unity Systems ABSTRACT

Scale out a 13th Generation XC Series Cluster Using 14th Generation XC Series Appliance

Dell SC Series Snapshots and SQL Server Backups Comparison

EMC CLARiiON Backup Storage Solutions

VMware vsphere Data Protection 5.8 TECHNICAL OVERVIEW REVISED AUGUST 2014

Video Surveillance EMC Storage with Digifort Enterprise

The Virtual Desktop Infrastructure Storage Behaviors and Requirements Spencer Shepler Microsoft

White Paper Features and Benefits of Fujitsu All-Flash Arrays for Virtualization and Consolidation ETERNUS AF S2 series

EMC Virtual Architecture for Microsoft SharePoint Server Reference Architecture

DELL EMC UNITY: COMPRESSION

EMC Unity Family. Monitoring System Performance. Version 4.2 H14978 REV 03

Storage Considerations for VMware vcloud Director. VMware vcloud Director Version 1.0

Dell EMC Ready Solution for VMware vcloud NFV 3.0 OpenStack Edition Platform

VMware vsphere 5.0 STORAGE-CENTRIC FEATURES AND INTEGRATION WITH EMC VNX PLATFORMS

WHITE PAPER. Optimizing Virtual Platform Disk Performance

Dell EMC Ready Architectures for VDI

EMC Backup and Recovery for Microsoft SQL Server

Dell EMC Microsoft Storage Spaces Direct Ready Nodes for VDI

EMC XTREMCACHE ACCELERATES VIRTUALIZED ORACLE

EMC VSPEX END-USER COMPUTING

INTEGRATED INFRASTRUCTURE FOR VIRTUAL DESKTOPS ENABLED BY EMC VNXE3300, VMWARE VSPHERE 4.1, AND VMWARE VIEW 4.5

TPC-E testing of Microsoft SQL Server 2016 on Dell EMC PowerEdge R830 Server and Dell EMC SC9000 Storage

IOmark- VM. HP HP ConvergedSystem 242- HC StoreVirtual Test Report: VM- HC b Test Report Date: 27, April

Dell EMC Ready System for VDI on VxRail

Dell EMC Best Practices for Running VMware ESXi 6.5 or Later Clusters on XC Series Appliances and XC Core Systems

DELL EMC UNITY: VIRTUALIZATION INTEGRATION

Dell EMC Storage Manager Scalability Solutions

Virtualized SQL Server Performance and Scaling on Dell EMC XC Series Web-Scale Hyper-converged Appliances Powered by Nutanix Software

IOmark- VDI. IBM IBM FlashSystem V9000 Test Report: VDI a Test Report Date: 5, December

VMware VAAI Integration. VMware vsphere 5.0 VAAI primitive integration and performance validation with Dell Compellent Storage Center 6.

Dell EMC SAP HANA Appliance Backup and Restore Performance with Dell EMC Data Domain

Dell EMC Unity: Performance Analysis Deep Dive. Keith Snell Performance Engineering Midrange & Entry Solutions Group

EMC Performance Optimization for VMware Enabled by EMC PowerPath/VE

DELL EMC UNITY: REPLICATION TECHNOLOGIES

Dell EMC. Converged Technology Extensions for Storage Product Guide

Video Surveillance EMC Storage with Godrej IQ Vision Ultimate

Video Surveillance EMC Storage with LenSec Perspective VMS

Data center requirements

Dell EMC Ready System for VDI on XC Series

Changing default password of root user for idrac9 by using Dell EMC License Manager

Dell EMC. VxRack System FLEX Architecture Overview

Managing Data Growth and Storage with Backup Exec 2012

EMC Virtual Infrastructure for Microsoft Applications Data Center Solution

Dell EMC SC Series Storage and SMI-S Integration with Microsoft SCVMM

EMC VSPEX FOR VIRTUALIZED MICROSOFT EXCHANGE 2013 WITH MICROSOFT HYPER-V

vsan All Flash Features First Published On: Last Updated On:

EMC Backup and Recovery for Microsoft Exchange 2007

EMC VSPEX WITH EMC XTREMSF AND EMC XTREMSW CACHE

Nutanix Tech Note. Virtualizing Microsoft Applications on Web-Scale Infrastructure

EMC Backup and Recovery for Microsoft Exchange 2007 SP1. Enabled by EMC CLARiiON CX4-120, Replication Manager, and VMware ESX Server 3.

EMC VSPEX END-USER COMPUTING

Best Practices for Deploying a Mixed 1Gb/10Gb Ethernet SAN using Dell Storage PS Series Arrays

Configuring a Microsoft Windows Server 2012/R2 Failover Cluster with Storage Center

EMC CLARiiON CX3-40. Reference Architecture. Enterprise Solutions for Microsoft Exchange Enabled by MirrorView/S

Vblock System 540. with VMware Horizon View 6.1 Solution Architecture

Setting Up the Dell DR Series System on Veeam

Microsoft SQL Server for Common Workload on Dell EMC VxRack FLEX

Surveillance Dell EMC Storage with Synectics Digital Recording System

Accelerating Microsoft SQL Server 2016 Performance With Dell EMC PowerEdge R740

Surveillance Dell EMC Storage with Genetec Security Center

Dell Technologies IoT Solution Surveillance with Genetec Security Center

INTEGRATING PURE AND COHESITY

Dell EMC. Converged Technology Extension for Isilon Storage Product Guide

Transcription:

Dell EMC Unity: Data Reduction Analysis Data reduction on application-specific datasets Abstract This document analyzes Dell EMC Unity data reduction ratios for various application-specific data types to help administrators accurately estimate space savings. February 2019 Dell EMC Technical White Paper

Revisions Revisions Date Description January 2019 Initial release February 2019 Updates to section 2.3 Acknowledgements Author: Darin Schmitz The information in this publication is provided as is. Dell Inc. makes no representations or warranties of any kind with respect to the information in this publication, and specifically disclaims implied warranties of merchantability or fitness for a particular purpose. Use, copying, and distribution of any software described in this publication requires an applicable software license. 2019 Dell Inc. or its subsidiaries. All Rights Reserved. Dell, EMC, Dell EMC and other trademarks are trademarks of Dell Inc. or its subsidiaries. Other trademarks may be trademarks of their respective owners. Published in the USA [2/19/2019] [Technical White Paper] [H17574.1] Dell believes the information in this document is accurate as of its publication date. The information is subject to change without notice. 2 Dell EMC Unity: Data Reduction Analysis H17574.1

Table of contents Table of contents Revisions... 2 Acknowledgements... 2 Table of contents... 3 Executive summary... 4 1 Introduction... 5 1.1 Environment and setup... 5 1.2 About the test data... 6 1.3 Performance reporting... 6 2 Analyzed data types... 7 2.1 VMware vsphere virtual machines... 7 2.2 Microsoft Hyper-V virtual machines... 7 2.3 File share... 8 2.4 ISO share... 8 2.5 Virtual desktops... 9 2.6 Microsoft SQL Server... 9 3 Conclusion... 10 A Technical support and resources... 11 A.1 Related resources... 11 3 Dell EMC Unity: Data Reduction Analysis H17574.1

Executive summary Executive summary Data reduction techniques such as compression and deduplication within storage arrays have been indispensable to help reduce storage consumption. Unfortunately, due to the variability of data types stored within any given storage array, data reduction savings can greatly fluctuate between environments, making it difficult for administrators to estimate storage needs. This document explores potential savings using real-world data reduction ratios on a Dell EMC Unity array. Data was gathered from common applications such as VMware virtual machines, file-share data, Microsoft SQL Server databases, Microsoft Hyper-V virtual machines, and more. 4 Dell EMC Unity: Data Reduction Analysis H17574.1

Introduction 1 Introduction When exploring compression and deduplication savings in a storage array, it is important to understand that the types of data stored have a large impact on how reducible the data is. For example, while a volume full of ASCII text documents may have a high reduction ratio, another volume full of MP4 video files may produce very little savings since the audio and video codecs have already reduced the file sizes. Note: The results presented in this document should only be used as a general guideline. Every environment is different and results may vary based on a number of factors including dataset, workload, or environment. Review the resources outlined in section 1.3 for details and best practice guidelines. For assistance with sizing or designing a system, see your Dell EMC representative. 1.1 Environment and setup The environment used in this document is composed of a Dell EMC Unity 550F array with 15 x 7 TB flash drives. This array is connected by Fibre Channel to a separate non-unity array. The configuration simulates migration of data from a different block-based platform to a Dell EMC Unity array to examine potential realworld savings. Enabling Advanced Deduplication on the LUNs in Dell EMC Unisphere 5 Dell EMC Unity: Data Reduction Analysis H17574.1

Introduction 1.2 About the test data To best gauge data reduction savings, this analysis uses real-world data instead of artificially generated data from prevailing test tools. The data is copied to the Dell EMC Unity array from another block-based array. This array stores data for a mid-sized department of technical employees in an existing Fortune 500 company. Due to the size of the department, the data represents a large variety of data types, with several years worth of active and historical data accumulated. 1.3 Performance reporting This analysis measures only the data reduction ratios for the various data types. Array performance is out of scope for this document and is not detailed. For more information regarding data reduction, refer to the following documents: Dell EMC Unity: Data Reduction Dell EMC Unity: Best Practices Guide 6 Dell EMC Unity: Data Reduction Analysis H17574.1

Analyzed data types 2 Analyzed data types This analysis takes data reduction measurements from numerous volumes totaling over 35 TB of allocated space and consuming several terabytes of data. The volumes are grouped with similar data types and are separated into the six categories outlined in the following subsections. 2.1 VMware vsphere virtual machines The virtual machine (VM) datastores tested consisted of various operating systems and applications. The storage-reduction ranges for all VMware VM volumes were as follows: Small savings = 1.66:1 (39.89%) Average savings = 2.13:1 (51.00%) Large savings = 2.97:1 (66.33%) Small savings: The datastore storing VM templates and ISOs produced the smallest reduction ratio. This result is likely caused by the datastore containing a very small number of VM templates stored alongside a high number of operating system ISO images. It is worth nothing that while the ISO format itself is not compressed, it is common for data contained within ISO images to already be compacted. Average savings: The average savings figure represents the average reduction ratio and savings percentage across all ten of the VMware datastores analyzed. Large savings: The largest savings observed was on the datastore where Microsoft Windows Server VMs were cloned five times from a single template and then powered on. This favorable reduction ratio was achieved because all VMs were cloned from the same template. Another notable observation regards VMDK formats. To analyze the difference between VMDK formats, one datastore contained five virtual machines in the thick format, and another datastore contained five virtual machines in the eager zeroed thick (EZT) format. Interestingly, this made an almost imperceptible difference in the reduction ratios. The thick-formatted VMs only had a reduction savings of 300 MB more than the EZTformatted virtual machines (46.2 GB compared to 46.5 GB). 2.2 Microsoft Hyper-V virtual machines For this analysis, the Hyper-V VM formats were stored in two Cluster Shared Volumes (CSV), each containing a different format of virtual hard disks (VHDs). The first volume stored five VMs with fixed VHDs, and the second stored five VMs with dynamically expanding VHDs. The storage-reduction ranges were as follows: Fixed-format VHDs = 1.84:1 (45.78%) Dynamic-format VHDs = 1.85:1 (46.08%) Much like the previous observations of the VMware virtual disk formats, the Hyper-V disk formats also produced a negligible reduction in savings. 7 Dell EMC Unity: Data Reduction Analysis H17574.1

Analyzed data types 2.3 File share There were three volumes analyzed for file-share data. The first volume analyzed was a 6 TB Windows Storage Server 2016 volume formatted with NTFS. This volume housed multiple SMB file shares, and contained unstructured departmental data such as user directories, a public directory, departmental projects, and a video share. The second volume was an exact file-copy replica of the first volume but with one difference: It had the Windows Server 2016 Data Deduplication role enabled and set to the General purpose file server setting. This tested the effect of software-based deduplication on the array-based data-reduction ratios. The third volume contained a dedicated application share that stored transactional log files that were output from a specialized Java application. The storage-reduction ranges were as follows: Small savings = 1.06:1 (5.69%) (Windows Data Deduplication role enabled) Average savings = 1.29:1 (22.76%) Large savings = 3.14:1 (68.19%) The average savings ratio is what most administrators may expect to see with traditional Microsoft Windows user shares. Within each of the directories, there were a wide variety of Microsoft Office files, downloaded zip files, executable files, JPG images, Adobe PDFs, and other file types. It is important to remember that many modern file types store data in a reduced format, and many files saved from the internet are also likely to be pre-reduced to save web-server bandwidth. The small savings ratio was achieved on the volume stored with the Windows Data Deduplication role enabled. Since this data was deduplicated first by the operating system, the array-based deduplication produced very little additional savings. With the large savings ratio, the share stored an archive of proprietary application log files. These log files contained a large sum of highly reducible data due to the structured format of the output. As with many transactional files, data was written sequentially with numerous timestamped filenames. Note: It is not a best practice to use array-based deduplication in addition to software deduplication. The results presented in this section are made for illustrative purposes only. 2.4 ISO share This NTFS volume contained a directory purely consisting of ISOs to analyze the data reduction ratios when measured by themselves. As noted previously, ISOs usually contain compacted data and the low reduction levels were consistent with expectations. ISO share = 1.1:1 (9.17%) 8 Dell EMC Unity: Data Reduction Analysis H17574.1

Analyzed data types 2.5 Virtual desktops This volume contained VMware virtual desktops running Windows 10 and stored them as full clones. Virtual desktops stored as linked clones were not analyzed because deduplication would have been non-productive. Virtual desktops = 1.52:1 (34.29%) This data reduced well because the full-clone VMs contain a lot of duplicate data such as the operating system, Microsoft Office applications, and other productivity tools. However, it is worth noting that within virtual desktops there can still be a lot of unique data in the user profiles. 2.6 Microsoft SQL Server The Microsoft SQL Server data was divided into two volumes based on Microsoft best practices. One volume was used to store the database files, and the other volume was used to store the transaction logs. Database volume = 1.49:1 (32.96%) Logs volume = 12.9:1 (92.25%) Reducing database volumes can produce storage savings, though this falls outside the performance considerations of whether database volumes should have deduplication enabled on them or not. While actual database reduction levels can vary wildly based on the data being stored, the transaction logs were reduced significantly. 9 Dell EMC Unity: Data Reduction Analysis H17574.1

Conclusion 3 Conclusion This document demonstrated data reduction with several different data types, and almost all of them had beneficial savings. Although these savings can be difficult to predict due to the natural variability of data, these test results can help administrators make educated guesses regarding data reduction. However, until data is actually stored on the Dell EMC Unity array, the actual data reductions will be unknown. Overall pool savings from the array where all volumes have Advanced Deduplication enabled In this analysis, the pool-level savings had a definite impact on the bottom-line consumption, but it is important to remember that data reduction is not appropriate for all volumes. Since it is highly unlikely that data reduction will be enabled on every volume in an array, the pool-level results presented are for illustrative purposes only. 10 Dell EMC Unity: Data Reduction Analysis H17574.1

Technical support and resources A Technical support and resources Dell EMC Support is focused on meeting customer needs with proven services and support. Storage technical documents and videos on Dell.com provide expertise that helps to ensure customer success on Dell EMC storage platforms. A.1 Related resources Refer to the following referenced or related resources: Dell EMC Unity: Introduction to the Platform Dell EMC Unity: Best Practices Guide Dell EMC Unity: Data Reduction 11 Dell EMC Unity: Data Reduction Analysis H17574.1