Condusiv s V-locity Server Boosts Performance of SQL Server 2012 by 55%

Similar documents
Condusiv s V-locity VM Accelerates Exchange 2010 over 60% on Virtual Machines without Additional Hardware

Condusiv s V-locity VM Boosts Virtual Machine Performance Over 50% Without Additional Hardware

I D C T E C H N O L O G Y S P O T L I G H T

Vembu VMBackup. True 15-Minute RTO for Mission-Critical VM Systems with Vembu VMBackup Replication. Author Dr. Jack Fegreus.

WHITE PAPER. Optimizing Virtual Platform Disk Performance

Accelerate Applications Using EqualLogic Arrays with directcache

Benefits of Automatic Data Tiering in OLTP Database Environments with Dell EqualLogic Hybrid Arrays

EMC XTREMCACHE ACCELERATES MICROSOFT SQL SERVER

Vembu OffsiteDR Server

A Comparative Study of Microsoft Exchange 2010 on Dell PowerEdge R720xd with Exchange 2007 on Dell PowerEdge R510

Using Synology SSD Technology to Enhance System Performance Synology Inc.

Dell PowerEdge R720xd with PERC H710P: A Balanced Configuration for Microsoft Exchange 2010 Solutions

EMC XTREMCACHE ACCELERATES VIRTUALIZED ORACLE

Nutanix Tech Note. Virtualizing Microsoft Applications on Web-Scale Infrastructure

Scalable Shared Databases for SQL Server 2005

IBM Emulex 16Gb Fibre Channel HBA Evaluation

Microsoft SQL Server in a VMware Environment on Dell PowerEdge R810 Servers and Dell EqualLogic Storage

Performance Testing of SQL Server on Kaminario K2 Storage

RACKSPACE ONMETAL I/O V2 OUTPERFORMS AMAZON EC2 BY UP TO 2X IN BENCHMARK TESTING

Extreme Storage Performance with exflash DIMM and AMPS

Virtual Switch Acceleration with OVS-TC

Implementing SQL Server 2016 with Microsoft Storage Spaces Direct on Dell EMC PowerEdge R730xd

Reduce Costs & Increase Oracle Database OLTP Workload Service Levels:

EMC VMAX 400K SPC-2 Proven Performance. Silverton Consulting, Inc. StorInt Briefing

QLogic 16Gb Gen 5 Fibre Channel for Database and Business Analytics

Consolidating OLTP Workloads on Dell PowerEdge R th generation Servers

Gen 6 Fibre Channel Evaluation of Products from Emulex and Brocade

Virtualized SQL Server Performance and Scaling on Dell EMC XC Series Web-Scale Hyper-converged Appliances Powered by Nutanix Software

Evaluation Report: HP StoreFabric SN1000E 16Gb Fibre Channel HBA

EMC XTREMCACHE ACCELERATES ORACLE

Ultra-Low Latency Down to Microseconds SSDs Make It. Possible

vsan 6.6 Performance Improvements First Published On: Last Updated On:

Emulex LPe16000B Gen 5 Fibre Channel HBA Feature Comparison

Pivot3 Acuity with Microsoft SQL Server Reference Architecture

CIS 601 Graduate Seminar. Dr. Sunnie S. Chung Dhruv Patel ( ) Kalpesh Sharma ( )

Testing the effects of file fragmentation on your SAN s performance

Aerospike Scales with Google Cloud Platform

V-locity Endpoint Installation Guide

Configuring Short RPO with Actifio StreamSnap and Dedup-Async Replication

Nimble Storage Adaptive Flash

QLE10000 Series Adapter Provides Application Benefits Through I/O Caching

Increasing Performance of Existing Oracle RAC up to 10X

How VembuHIVE, a backup repository as a file system is changing the dynamics of data protection

Four-Socket Server Consolidation Using SQL Server 2008

Evaluation Report: Improving SQL Server Database Performance with Dot Hill AssuredSAN 4824 Flash Upgrades

FlashGrid Software Enables Converged and Hyper-Converged Appliances for Oracle* RAC

Technical Paper. Performance and Tuning Considerations for SAS on Fusion-io ION Accelerator

An Oracle White Paper April 2010

Was ist dran an einer spezialisierten Data Warehousing platform?

HIGH PERFORMANCE SANLESS CLUSTERING THE POWER OF FUSION-IO THE PROTECTION OF SIOS

QLogic/Lenovo 16Gb Gen 5 Fibre Channel for Database and Business Analytics

Low Latency Evaluation of Fibre Channel, iscsi and SAS Host Interfaces

WHITE PAPER Diskeeper: Improving the Performance of SAN Storage

PRESERVE DATABASE PERFORMANCE WHEN RUNNING MIXED WORKLOADS

Identifying Performance Bottlenecks with Real- World Applications and Flash-Based Storage

Virtuozzo Containers

HP ProLiant BladeSystem Gen9 vs Gen8 and G7 Server Blades on Data Warehouse Workloads

Database Solutions Engineering. Best Practices for Deploying SSDs in an Oracle OLTP Environment using Dell TM EqualLogic TM PS Series

On the Radar: Condusiv Technologies

When, Where & Why to Use NoSQL?

Cloud Meets Big Data For VMware Environments

NetApp AFF A300 Review

Lenovo Database Configuration for Microsoft SQL Server TB

HP SmartCache technology

Memory-Based Cloud Architectures

Best Practices for Deploying a Mixed 1Gb/10Gb Ethernet SAN using Dell Storage PS Series Arrays

Maximizing VMware ESX Performance Through Defragmentation of Guest Systems

The Server-Storage Performance Gap

Migrate from Netezza Workload Migration

Accelerating storage performance in the PowerEdge FX2 converged architecture modular chassis

PowerVault MD3 SSD Cache Overview

Copyright 2012 EMC Corporation. All rights reserved.

Deploy a High-Performance Database Solution: Cisco UCS B420 M4 Blade Server with Fusion iomemory PX600 Using Oracle Database 12c

Evaluation of the Chelsio T580-CR iscsi Offload adapter

Virtuozzo Hyperconverged Platform Uses Intel Optane SSDs to Accelerate Performance for Containers and VMs

Executive Briefing. All SAN storage

EMC Integrated Infrastructure for VMware. Business Continuity

The Impact of Disk Fragmentation on Servers. By David Chernicoff

SQL Server Performance on AWS. October 2018

Migrate from Netezza Workload Migration

Enhancing Analysis-Based Design with Quad-Core Intel Xeon Processor-Based Workstations

QLIKVIEW SCALABILITY BENCHMARK WHITE PAPER

BENEFITS AND BEST PRACTICES FOR DEPLOYING SSDS IN AN OLTP ENVIRONMENT USING DELL EQUALLOGIC PS SERIES

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

V-locity Overview with Questions and Answers

Impact of Dell FlexMem Bridge on Microsoft SQL Server Database Performance

Strategic Briefing Paper Big Data

Netronome 25GbE SmartNICs with Open vswitch Hardware Offload Drive Unmatched Cloud and Data Center Infrastructure Performance

Got Isilon? Need IOPS? Get Avere.

FlashStack 70TB Solution with Cisco UCS and Pure Storage FlashArray

Performance Report: Multiprotocol Performance Test of VMware ESX 3.5 on NetApp Storage Systems

The Impact of SSD Selection on SQL Server Performance. Solution Brief. Understanding the differences in NVMe and SATA SSD throughput

Best Practices for Deploying a Mixed 1Gb/10Gb Ethernet SAN using Dell EqualLogic Storage Arrays

ADDENDUM TO: BENCHMARK TESTING RESULTS UNPARALLELED SCALABILITY OF ITRON ENTERPRISE EDITION ON SQL SERVER

Dell PowerEdge R910 SQL OLTP Virtualization Study Measuring Performance and Power Improvements of New Intel Xeon E7 Processors and Low-Voltage Memory

Automated Netezza Migration to Big Data Open Source

Accelerating Big Data: Using SanDisk SSDs for Apache HBase Workloads

Clustered Storage for High Avaliability Clouds

EMC Business Continuity for Microsoft Applications

White Paper All-Flash Arrays for Improving Database Performance to Open Up Potential for New Applications ETERNUS AF series

Transcription:

openbench Labs Executive Briefing: May 20, 2013 Condusiv s V-locity Server Boosts Performance of SQL Server 2012 by 55% Optimizing I/O for Increased Throughput and Reduced Latency on Physical Servers 01

Executive Overview V -locity Server increased SQL Server 2012 transaction processing rate by an average of 55% and improved response time by 33%. Why Read This Document? For this briefing, openbench Labs tested the ability of V-locity Server to optimize I/O in a SQL Server environment. Using SQL Server 2012, we tested a mix of a high volume of light-weight SQL select transaction processing (TP) queries, combined with heavy-weight background update queries. For the SQL Server benchmark testing, openbench simulated 1 to 32 daemon processes (1 daemon generates the equivalent of 70 normally-queued user processes) issuing queries non-stop. When a real application user interacts with SQL Server, there is lag between queries issued. In our test scenario, however, the daemon process issued queries without lag that is, no think-time, type-time, or pause-time between query activity. In a series of tests, openbench Labs measured the ability of V-locity Server s IntelliMemory to offload I/O on read operations through dynamic caching, in order to boost throughput and reduce latency. In addition we examined the ability of IntelliWrite technology to prevent unnecessary split I/Os, using its intelligence to extend current database files and create new log files as single contiguous collections of logical blocks. In a test of SQL Server query processing, openbench Labs benchmark findings revealed that V-locity, on a server running SQL Server, enabled higher transaction per second (TPS) rates and improved response time by reducing I/O processing on storage devices. What s more, in a SAN- or NAS-based storage environment, V-locity Server reduced I/O stress on multiple systems sharing storage resources. Overall, V-locity Server can improve scalability by reducing average response time and enabling SQL Server to support more users. Snapshot of Findings 1) In SQL Server 2012 processing tests involving 1 to 32 daemon processes, V-locity Server improved transaction performance by an average of 55%, before the I/O subsystem of our server began to physically limit the disk IOPS rate. 2) By improving the average response time of transactions by 33%, V-locity Server enabled SQL Server to scale enough to handle the transactions of 62% more daemon processes than the database could sustain without V-locity Server. 3) V-locity Server prevented 96% log file fragments from being created during a typical series of stress tests, contributing to the increase in transaction processing rate and improved response time. 02

Executive Briefing: Jack Fegreus May 20, 2013 Condusiv s V-locity Server Boosts Performance of SQL Server 2012 by 55% Optimizing I/O for Increased Throughput and Reduced Latency on Physical Servers With the release of SQL Server 2012, Microsoft is positioning SQL Server as a database for use with both large structured and unstructured (big data) applications. Whether with traditional data warehouse scenarios, or as a link between unstructured data platforms such as Hadoop, sites with large databases with large numbers of rows are now a key target, which makes SQL Server 2012 a good way for CIOs to initiate big data pilot projects. Under Test: Server I/O Acceleration Condusiv Technologies V-locity Server 1) V-locity s read optimization (IntelliMemory ) technology reduces disk I/O requests by predictively caching active data within the server to increase local IOPS performance and reduces overhead on shared storage devices by reducing physical read I/O requests. 2) V-locity s write optimization (IntelliWrite ) technology provides continuous I/O write optimization using dynamic intelligence when creating or extending files to behave sequentially, eliminating split and other unnecessary I/Os for greater sequential throughput performance on both writes and subsequent reads. 3) By optimizing I/O processing on a server running V-locity, less I/O traffic is directed at an underlying SAN- or NAS-based storage system. Moreover, with IntelliWrite and IntelliMemory, both TP throughput and average response time were more predictable and scalable as the TP rate increased 55% and response time was improved by 33% over all test loads of 1 to 32 daemon processes on SQL Server 2012. 4) V-locity s advanced I/O optimization technology is compatible with all advanced storage features, such as replication, de-duplication, thin provisioning, and snapshots. To provide a test case and example of a large mission-critical application that could not be fully cached by SQL Server 2012, and would be highly dependent on disk access rates for essential performance, openbench labs created a 30GB instance of the ANSI AS3AP benchmark database and populated it with 32.5 million rows of data. For IT operations, optimizing I/O of transaction processing applications is complicated by the difference in the rate that CPU and memory performance have advanced versus the rate that disk access times have improved. Attempts to deal with these issues often create CPU bottlenecks as processing stalls waiting for the delivery of data. With I/O access time highly dependent on disk drive mechanics, IT frequently turns to costly hardware solutions involving solid-state drives (SSDs). Such hardware solutions, however, suffer from dependence on a hardware platform, under-utilization of resources, lack of flexibility with changing workloads, and high costs. In this analysis, openbench Labs examines the ability of Condusiv Technologies V-locity Server to maximize I/O performance in a high-tp SQL Server environment. By efficiently optimizing the way data is read from and written to disk for Windows systems, V-locity Server optimized the processing of database transactions sent from another application that was used to generate multiple SQL Server client requests. 03

Running a specific collection of SQL transactions for up to 32 daemon processes (1 daemon generating the equivalent of 70 normally-queued user processes) with randomly-generated values, such as indices and row numbers, we initiated transaction loads for varyingly-sized groups of users. Over multiple load tests, we measured an average increase in SQL Server transactions of 55% with V-locity Server. Removing Barriers to I/O Performance To optimize I/O, V-locity Server has been designed to resolve two very important issues: 1) V-locity eliminates nearly all unnecessary I/O operations at the source when writing a file, which eliminates all unnecessary I/O operations on subsequent reads. 2) V-locity caches frequentlyaccessed data to keep read requests from traveling the full distance to storage and back. U sing sophisticated pattern analysis to determine which blocks will be placed in cache, IntelliMemory offloads a significant portion of I/O read operations from logical disks. IntelliWrite Technology V-locity Server solves the important issue of unnecessary I/O generation with IntelliWrite technology. IntelliWrite prevents the Windows OS from storing files as disjointed block sets in its logical block space representation of a logical storage volume. In a SAN- or NAS-based environment, the problem of unnecessary I/O is compounded as multiple systems share the same set of storage resources, which results in highly random I/O behavior pushed down to the disk subsystem. The generation of unnecessary I/Os not only slows the speed of the originating system, it also slows other systems sharing the same storage arrays. To solve this, IntelliWrite adds more intelligence to the way the Windows OS preallocates file space in order to continuously restructure writes in a coherent manner, store files as contiguous sets of blocks, and prevent performance penalties. Moreover, when a file is accessed and modified in the future, IntelliWrite will automatically restructure that file for optimal I/O performance. IntelliMemory Technology Optimizing writes and eliminating unnecessary I/O operations does not resolve all important data access issues, especially when reading data. To provide a full spectrum of I/O optimization, V-locity Server implements IntelliMemory, a highly efficient data caching solution that leverages available server memory to provide faster access to data and dramatically improved throughput. Using sophisticated pattern analysis to determine which blocks will be placed in cache, IntelliMemory offloads a significant portion of I/O read operations from logical disks. More importantly, by offloading physical I/O, V-locity Server improves performance on all systems sharing storage resources via a SAN or NAS. 04

SQL Stress Test The Test To assess the performance capabilities of V-locity Server, we configured a test scenario using three servers. The following is an overview of the test environment and process: 1) A Dell 1950 PowerEdge server, with a quad-core CPU and 8GB RAM, was used to run Windows Server 2012, V-locity Server and SQL Server 2012. 2) We set up an independent logical disk for the ANSI AS3AP benchmark database. We configured a 30GB instance of the ANSI AS3AP benchmark database, which had 32.5 million rows in data tables. With 8GB of RAM on the server, the entire database could not fit within the SQL Server cache. More importantly, SQL Server claimed 3.5GB of RAM; however it utilized less than 2GB as a workspace. As a result, a significant amount of I/O was directed at the disk subsystem. For our tests, we configured the database with 32.5 million rows, which translated into a 30GB database. 3) We configured SQL Server to use a maximum of 3MB of server memory. 4) To generate both a batch and an interactive transaction processing load of SQL queries, we configured two servers with 2 CPUs running Windows Server 2008 R2 and a SQL load generator, which used a random number generator to modify scripted shell transactions directed at the target database server. 5) We generated queries for each daemon user and each batch process by inserting random data into fixed templates such that each TP query accessed one row and each batch processing task accessed at least 100 rows at a time. 6) We launched all batch transactions in stress mode without pausing for user think time or keyboard typing time. 05

SQL Transactions per Second (TPS) The Results Our goal was to assess the ability of V-locity Server to create an environment in which our SQL Server 2012 benchmark would be able to reach maximum performance and scalability. To achieve our goal, we first created a set of jobs that represented a fixed level of batch production processing. In particular, three simultaneous batch jobs independently accessed and updated a minimum of 100 database rows at a time as rapidly as possible. To our base level of production processing, we added a sequence of TP daemon processes to represent interactive users, who would typically be creating and placing online orders. To maximize application stress on our system, we introduced no pauses for queuing time and forced each TP daemon to continuously launch simple queries that accessed one row of data per transaction. V-locity Server TP Performance Improvement ANSI AS3AP Benchmark Database Schema OLTP Daemons u OPENBENCH LABS SQL Server OLTP Load ANSI AS3AP Benchmark Database Schema V-locity Server running V-locity Server not running w v OLTP Daemons V-locity Server running V-locity Server not running Number of Daemon Processes V-locity Server did not change the internal scalability of our SQL benchmark. Instead, V-locity created a more responsive environment in which the benchmark could leverage storage and memory resources as it ran. In particular, We measured a 73% lower initial response time 22 ms vs. 38 ms for our benchmark and a base transaction rateu of 44 TPS, which was twice that without V-locity Server. We added daemons to increase the transaction rate to 200 TPS, which is typical of high-volume TP applications. More importantly, as we increased the TPS ratev to 200 TPS, daemon load TPS rates averaged 89% higher with V-locity Server running. At the other end of the load spectrum, our 100 ms limit on response time provided a large enough window to reach the maximum TPS rate that our physical configuration could sustain either with or without V-locity Server. When we reached those limitsw 417 TPS running 30 daemons without V-locity and 677 TPS running 55 users with V-locity we could add fewer than five users without exceeding the average response time limit, at which point the TPS rate plummeted. 06

To test scalability, we added five TP daemons at a time, until the average query response time for each daemon exceeded 100 ms. As a result, the number of queries processed and the average query response time measured on each daemon load cycle provided a measure of the ability of V-locity Server to create an optimal environment in which to run an application driven by SQL Server 2012. We began by running our stress tests on SQL Server 2012 with the default memory configuration, which in theory allowed SQL Server to utilize all of the memory provisioned in our Dell PowerEdge server. On this test, we measured the amount of memory claimed by SQL Server 2012 and the amount of memory that SQL Server 2012 utilized. In all of these tests, SQL Server 2012 conservatively claimed about 3GB of memory and utilized less than 2GB of the 8GB that were physically installed. Next, we reset the maximum memory for SQL Server from its default value to 3GB. When we made this change, overall TP throughput improved between 5% to 10% on nearly all of our test loads. As a result, we imposed a 3GB memory limit for all further testing. The Results: A Closer Look The following table highlights the test results: I/O Activity 1 daemon* 5 daemons 10 daemons 15 daemons 20 daemons 25 daemons 30 daemons 32 daemons 35 daemons 40 daemons 45 daemons 50 daemons 55 daemons 58 daemons Disk Reads Cache Reads I/O Latency Log File Fragments V-locity Server Performance: SQL Server Stress 1 to 58 Daemon Loads Applied to a 30GB Database TPS Rate With V-locity 044* 176 281 383 450 543 602 544 513 532 544 571 677 601*** 60,466-58% 31,568-42% 1.2 ms. TPS Rate Without V-locity 020 0103 188 315 330 411 417 320** TPS Rate Delta 22 73 93 82 120 132 185 224 Response Time With V-locity 22 ms. 28 ms. 37 ms. 42 ms. 45 ms. 50 ms. 50 ms. 61 ms. 67 ms. 70 ms. 80 ms. 89 ms. 97 ms. 98*** ms. V-locity Server Database I/O Caching (IntelliMemory) N.A. N.A. 6 ms. Response Time Without V-locity 38 ms. 43 ms. 53 ms. 64 ms. 65 ms. 72 ms. 82 ms. 96** ms. V-locity Server Database Log File Fragment Prevention (IntelliWrite) 768 Prevented 803 Fragmented *One daemon generates the equivalent of 70 normally-queued user processes **Without V-locity Server running, we reached the benchmark s maximum response time at 32 daemon processes ***With V-locity Server running, we were able to reach 58 daemon processes Response Time Delta (16 ms.) (15 ms.) (16 ms.) (22 ms.) (20 ms.) (22 ms.) (21 ms.) (35 ms.) 07

When we ran SQL Server with V-locity Server, IntelliMemory came into play for reads not cached by SQL Server. In particular, IntelliMemory cached 42% of the physical reads associated with running SQL Server. By lowering the number of physical I/Os, V-locity improved the overall IOPS performance of the server s entire I/O subsystem, which included faster writes to log files which in turn reduced log file wait time. For workloads of 1 to 32 daemons (approximately 70 normally-queued users per daemon), V-locity Server improved SQL Server TP performance by an average of 55%. Similarly, V-locity Server improved transaction response time by an average of 33%. With the average transaction response time on the initial single-user test reduced by more than 100% in a V-locity Server environment, we were able to complete tests on which the average query response time was less than 100 ms for workloads that included up to 58 daemons without V-locity the system maxed out at 32 daemons. What s more, we were able to scale workloads and process close to the maximum number of transactions that our physical server was capable of handling while keeping the average transaction response time under 100 ms. Bottom Line On a server running SQL Server 2012, V-locity Server created an environment that enabled up to 55% higher TPS rates, improved transaction response time by 33%, and enabled SQL to process 62% more transactions at peak transaction rates. As a result, IT has a powerful tool to maximize the ROI associated with any business application initiative driven by SQL Server at the back end. Westborough, Mass.-based openbench Labs was founded in 2005 by Dr. Jack Fegreus. openbench Labs is a trusted IT industry source, providing hands-on evaluation and certification of Information Technology products and services. openbench Labs enjoys a unique position in the information technology sector. As the premier independent test lab and provider of third-party validation services, OBL has worked with virtually every major vendor and evaluated the most important products and technologies to appear over the past decade. 2013 OpenBench. All Rights Reserved. V-locity, IntelliWrite, InvisiTasking and IntelliMemory are trademarks or registered trademarks of Condusiv Technologies Corporation. All other trademarks are the property of their respective owners. 08