PERFORMANCE STUDY OCTOBER 2017 ORACLE MONSTER VIRTUAL MACHINE PERFORMANCE. VMware vsphere 6.5

Similar documents
Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

PERFORMANCE CHARACTERIZATION OF MICROSOFT SQL SERVER USING VMWARE CLOUD ON AWS PERFORMANCE STUDY JULY 2018

W H I T E P A P E R. Comparison of Storage Protocol Performance in VMware vsphere 4

Microsoft Exchange Server 2010 Performance on VMware vsphere 5

Managing Performance Variance of Applications Using Storage I/O Control

Configuration Maximums VMware Infrastructure 3: ESX Server 3.5 Update 2, ESX Server 3i version 3.5 Update 2, VirtualCenter 2.

Performance of vsphere 6.7 Scheduling Options

What s New in VMware vsphere 4.1 Performance. VMware vsphere 4.1

Performance of Virtual Desktops in a VMware Infrastructure 3 Environment VMware ESX 3.5 Update 2

W H I T E P A P E R. What s New in VMware vsphere 4: Performance Enhancements

Adobe Acrobat Connect Pro 7.5 and VMware ESX Server

Performance Scaling. When deciding how to implement a virtualized environment. with Dell PowerEdge 2950 Servers and VMware Virtual Infrastructure 3

Eliminate the Complexity of Multiple Infrastructure Silos

CLOUD PROVIDER POD RELEASE NOTES

Comparison of Storage Protocol Performance ESX Server 3.5

IBM Emulex 16Gb Fibre Channel HBA Evaluation

Storage Considerations for VMware vcloud Director. VMware vcloud Director Version 1.0

CLOUD PROVIDER POD. for VMware. Release Notes. VMware Cloud Provider Pod January 2019 Check for additions and updates to these release notes

Configuration Maximums VMware vsphere 5.0

VVD for Cloud Providers: Scale and Performance Guidelines. October 2018

What s New in VMware vsphere 5.1 Platform

Maintaining End-to-End Service Levels for VMware Virtual Machines Using VMware DRS and EMC Navisphere QoS

What s New in VMware vsphere Flash Read Cache TECHNICAL MARKETING DOCUMENTATION

Compatibility Matrixes for VMware vcenter Site Recovery Manager 4.0 and Later

VMWARE EBOOK. Easily Deployed Software-Defined Storage: A Customer Love Story

A Performance Characterization of Microsoft SQL Server 2005 Virtual Machines on Dell PowerEdge Servers Running VMware ESX Server 3.

Certified Reference Design for VMware Cloud Providers

How to Use a Tomcat Stack on vcloud to Develop Optimized Web Applications. A VMware Cloud Evaluation Reference Document

Pexip Infinity Server Design Guide

Veritas Access. Installing Veritas Access in VMWare ESx environment. Who should read this paper? Veritas Pre-Sales, Partner Pre-Sales

Evaluation Report: HP StoreFabric SN1000E 16Gb Fibre Channel HBA

vrealize Business System Requirements Guide

Cloud Pod Architecture with VMware Horizon 6.1

IOmark- VDI. IBM IBM FlashSystem V9000 Test Report: VDI a Test Report Date: 5, December

IOmark- VM. IBM IBM FlashSystem V9000 Test Report: VM a Test Report Date: 5, December

VMWARE VSPHERE FEATURE COMPARISON

WHAT S NEW IN PERFORMANCE?

Performance & Scalability Testing in Virtual Environment Hemant Gaidhani, Senior Technical Marketing Manager, VMware

EMC XTREMCACHE ACCELERATES VIRTUALIZED ORACLE

Performance Implications of Storage I/O Control Enabled NFS Datastores in VMware vsphere 5.0

Mobile Secure Desktop Implementation with Pivot3 HOW-TO GUIDE

NAS for Server Virtualization Dennis Chapman Senior Technical Director NetApp

CLOUD PROVIDER POD RELEASE NOTES

VMWARE HORIZON 6 ON HYPER-CONVERGED INFRASTRUCTURES. Horizon 6 version 6.2 VMware vsphere 6U1 / VMware Virtual SAN 6U1 Supermicro TwinPro 2 4 Nodes

Accelerate Database Performance and Reduce Response Times in MongoDB Humongous Environments with the LSI Nytro MegaRAID Flash Accelerator Card

VMware VMFS Volume Management VMware Infrastructure 3

10 QUESTIONS, 10 ANSWERS. Get to know VMware Cloud on AWS The Best-in-Class Hybrid Cloud Service

Recommendations for Aligning VMFS Partitions

Table of Contents. HCIBench Tool Architecture

VMware vrealize Suite and vcloud Suite

VMware vfabric Data Director Installation Guide

EMC Backup and Recovery for Microsoft SQL Server

EMC Virtual Infrastructure for Microsoft Applications Data Center Solution

Surveillance Dell EMC Storage with Digifort Enterprise

Number of Hosts: 2 Uniform Hosts [yes/no]: yes Total sockets/core/threads in test: 4/32/64

A Kaminario Reference Architecture: Reference Architecture for Running SQL Server on ESXi

vcloud Automation Center Reference Architecture vcloud Automation Center 5.2

Configuring iscsi in a VMware ESX Server 3 Environment B E S T P R A C T I C E S

Windows Server 2012: Server Virtualization

Dell PowerEdge R910 SQL OLTP Virtualization Study Measuring Performance and Power Improvements of New Intel Xeon E7 Processors and Low-Voltage Memory

Installing VMware vsphere 5.1 Components

Cisco HyperFlex Solutions

VMware Virtual SAN Backup Using VMware vsphere Data Protection Advanced SEPTEMBER 2014

Surveillance Dell EMC Storage with FLIR Latitude

VMware Infrastructure 3 Primer Update 2 and later for ESX Server 3.5, ESX Server 3i version 3.5, VirtualCenter 2.5

VMware vsphere 4. The Best Platform for Building Cloud Infrastructures

EXPLORING MONITORING AND ANALYTICS VMware Horizon

What s New in VMware vsphere 5.1 Platform

Preparing Virtual Machines for Cisco APIC-EM

Preparing Virtual Machines for Cisco APIC-EM

VMware VMmark V2.5.2 Results

88X + PERFORMANCE GAINS USING IBM DB2 WITH BLU ACCELERATION ON INTEL TECHNOLOGY

Emulex LPe16000B 16Gb Fibre Channel HBA Evaluation

T E C H N I C A L S A L E S S O L U T I O N S

Performance Testing of SQL Server on Kaminario K2 Storage

Functional Testing of SQL Server on Kaminario K2 Storage

Surveillance Dell EMC Storage with Cisco Video Surveillance Manager

AMD: WebBench Virtualization Performance Study

Large Page Performance ESX Server 3.5 and ESX Server 3i v3.5

Maximizing Six-Core AMD Opteron Processor Performance with RHEL

7 Things ISVs Must Know About Virtualization

Entuity Network Monitoring and Analytics 10.5 Server Sizing Guide

VMware vcenter. Update Manager 5.0 Performance and Best Practices. Performance Study TECHNICAL WHITE PAPER

Virtualizing SQL Server 2008 Using EMC VNX Series and VMware vsphere 4.1. Reference Architecture

2 to 4 Intel Xeon Processor E v3 Family CPUs. Up to 12 SFF Disk Drives for Appliance Model. Up to 6 TB of Main Memory (with GB LRDIMMs)

Dedicated Hosted Cloud with vcloud Director

WHITE PAPER SEPTEMBER VMWARE vsphere AND vsphere WITH OPERATIONS MANAGEMENT. Licensing, Pricing and Packaging

EMC XTREMCACHE ACCELERATES MICROSOFT SQL SERVER

Accelerating Microsoft SQL Server 2016 Performance With Dell EMC PowerEdge R740

KVM Virtualized I/O Performance

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

vrealize Hyperic Supported Configurations and System Requirements vrealize Hyperic 5.8.4

VMWare Horizon View 6 VDI Scalability Testing on Cisco 240c M4 HyperFlex Cluster System

VMware vsphere 5.0 Evaluation Guide

What s New in VMware vsphere Storage Appliance 5.1 TECHNICAL MARKETING DOCUMENTATION V 1.0/UPDATED JULY 2012

Setup for Microsoft Cluster Service Update 1 Release for ESX Server 3.5, ESX Server 3i version 3.5, VirtualCenter 2.5

Exchange Server 2007 Performance Comparison of the Dell PowerEdge 2950 and HP Proliant DL385 G2 Servers

Notes Section Notes for Workload. Configuration Section Configuration. mailserver olio dvdstorea dvdstoreb dvdstorec

Getting Started with ESXi Embedded

LAMP Performance on Dell PowerEdge 1855 Blade Servers Running VMware ESX Server

Transcription:

PERFORMANCE STUDY OCTOBER 2017 ORACLE MONSTER VIRTUAL MACHINE PERFORMANCE VMware vsphere 6.5

Table of Contents Executive Summary...3 Introduction...3 Test Environment... 4 Test Workload... 5 Virtual Machine Configuration... 5 Generational Monster VM Comparison... 5 Performance Gain from Hyper-Threading... 6 NUMA Performance...7 Best Practices... 9 CPU... 9 Memory... 9 Storage... 9 Oracle Database... 10 Conclusion... 10 References... 10 PERFORMANCE STUDY 2

Executive Summary The ability to run extremely large virtual machines on vsphere 6.5 aligns closely with the needs of large databases in the enterprise today. Specifically, Oracle database instances in some cases require large amounts of RAM, many processors, and high-speed flash-based storage to achieve the performance needed. Tests were run in the VMware lab using monster virtual machines running Oracle databases to study their performance. Tests were run across a range of server generations and a flash storage array to illustrate how much performance has increased with each new generation of processor, the performance benefit of Hyper- Threading, and NUMA effects. The results of these tests show that the performance of Oracle databases on large virtual machines is excellent. Introduction In 2005, VMware ESX 3.0 was released with support for virtual machines with 4 vcpu and 16 GB of memory. This capability was an early step toward the massive capacity of monster VMs in vsphere 6.5 that had a maximum of 128 vcpus and 6 TB of RAM when it was released at the end of 2016. The capability of vsphere virtual machines has grown as servers have increased capacity and as large application needs have increased. The tests conducted in VMware labs shows how the performance of large Oracle databases running in monster VMs has scaled along with new server generations. Combining the key elements of vsphere, Oracle database, IBM FlashSystem storage array, and Intel server processors creates a test environment for demonstrating what is possible within the enterprise today. vsphere 6.5 is the latest version of the industry-leading virtualization and management platform. In addition to its ability to host large virtual machines, it also includes the tools to manage and effectively run both small and large virtualization environments. Oracle provides one of the industry-leading database platforms for the enterprise. As a leader in this environment, its databases are used by many large organizations as the basis for information management at the heart of their operations. The IBM FlashSystem A9000 storage array used in the testing environment is an example of leading-edge storage performance. It integrates the extreme performance of IBM FlashCore technology, highly parallel architecture, and comprehensive data reduction into one powerful solution. IBM FlashSystem A9000 is designed to accelerate workloads to help reduce costs, increase revenue, meet service level agreements, and improve customer satisfaction. Performance testing with large virtual machines using databases on the latest server and storage technology was done with an open source database testing tool. Five generations of server hardware were used for the tests to illustrate how performance has improved over time relative to each new server processor. PERFORMANCE STUDY 3

Test Environment Tests were conducted across five generations of servers. All servers were four-socket systems and configured with high performance Intel E7 and Platinum Xeon processors. SERVERS TESTED INTEL XEON MODEL Total Cores/Threads GHz Codename Release Date E7-4870 40 / 80 2.40 Westmere-EX 2011 E7-4890 v2 60 / 120 2.80 Ivy-Bridge-EX 2014 E7-8890 v3 72 / 144 2.49 Haswell-EX 2015 E7-8890 v4 96 / 192 2.20 Broadwell-EX 2016 Platinum 8180 112 / 224 2.50 Skylake-SP 2017 Table 1. Types of Intel Xeon Processors tested The servers were all connected via Fibre Channel to an IBM FlashSystem A9000 high performance, all flash array. All monster virtual machines were on this storage and simply moved between the different servers for each set of tests. Virtual machines were powered down and cold migrated between the hosts to allow them to take advantage of the features of each processor to the fullest extent. Figure 1. Testbed configuration PERFORMANCE STUDY 4

Test Workload The test workload used was the open-source DVD Store 3 [1]. DVD Store simulates an online store that allows customers to log on, search for DVDs, read customer reviews, rate the helpfulness of reviews, and purchase DVDs. Many common database features are used to support these operations including tables, indexes, triggers, stored procedures, transactions, foreign keys, simple queries, and complex multi-join queries. A client program is included that generates load against the database by simulating the actions of users on the site. Each simulated user thread does the full set of steps for a user from logon through purchase. The workload reports throughput in orders per minute (OPM). For each test configuration, the number of user threads was increased with each iteration until the throughput reached its maximum level. It was found in these tests that the maximum throughput corresponded to host CPU utilization of between 90 and 99 percent. DVD Store 3 allows for any size database to be created, and, for these tests, the on-disk size was approximately 200 GB. Virtual Machine Configuration The configuration of the virtual machines was kept constant except for the number of vcpus, which were varied depending on the specific test. VMXNET3 virtual network adapter and paravirtual SCSI (PVSCSI) adapters were used. Data and log virtual disks were assigned to separate paravirtual adapters. Memory was configured to be 256 GB. Red Hat Enterprise Linux (RHEL) 7.1 was used as the operating system, and Oracle Database 12.0.1.2 was installed. The Oracle-provided installer RPM was used and the Oracle best practices for installing were followed according to their documentation. Huge pages were configured for RHEL and enabled in Oracle. The Oracle sga_target parameter was set to 100 GB. This ensured that a large amount of the database would in the buffer cache, but not all so that some disk accesses would occur during the tests. Generational Monster VM Comparison The graph below shows the results of running a monster VM with vcpus equal to physical cores across five generations of Intel-based, four-socket servers. The earliest generation server was based on the Intel E7-4870 processor that was originally released in 2011 and was codenamed Westmere-EX. Each processor has 10 cores, which means that the server has a total of 40 cores. The virtual machine, when run on this server, was configured with 40 vcpus to match the physical core count. The virtual NUMA configuration was set to match the underlying physical server with 10 vcpus per socket. The same approach was used for the other servers and virtual machines in the test, up to the latest generation server with 112 cores and a virtual machine with 112 vcpus. The results show that performance increased over 4x from the Westmere- to Skylake-based servers. This shows that the performance of monster virtual machines scales with more vcpus and on newer servers with higher core counts. PERFORMANCE STUDY 5

Generational Monster Oracle DB VM Performance 4.5 Relative Orders Per Minute (OPM) 4 3.5 3 2.5 2 1.5 1 0.5 0 Westmere 40 vcpu VM Ivy-Bridge 60 vcpu VM Haswell 72 vcpu VM Broadwell 96 vcpu VM Skylake 112 vcpu VM Figure 2. Oracle DB VM performance increased 4x from Westmere to Skylake servers Performance Gain from Hyper-Threading With Hyper-Threading enabled, there are two execution threads sharing a single physical core. Because they are sharing a single physical core, the performance gain is not doubled but is usually in the 10 to 30 percent range. The 112-core Skylake-based server was used to measure the performance gain from Hyper-Threading in these tests. The performance of a single 112-vCPU virtual machine was compared against the combined performance of two 112-vCPU virtual machines. Using two 112-vCPU virtual machines means that there are 224 vcpus active and all 224 execution threads on the server are active. The gain in performance due to hyper threading was 27%, as shown in the graph below. PERFORMANCE STUDY 6

Performance Gain Using HyperThreads on 4 Socket Intel Skylake-SP Server Relative Orders Per Minute (OPM) 1.4 1.2 1 0.8 0.6 1 x 112 vcpu VM 2 x 112 vcpu VMs Figure 3. Performance gain was 11% using Hyper-Threads NUMA Performance Today s servers use Non-Uniform Memory Access (NUMA) in their architecture. This means that each processor has memory that is directly connected to it. The combination of a processor and its memory is known as a NUMA node. Performance can be optimized by reducing the number of remote memory accesses for a process from the local NUMA node it is running on. A best practice for virtual machines is to right-size them to the amount of CPU and memory that is needed, and to keep the NUMA node boundaries in mind when doing this sizing. In other words, assign the amount of vcpus and memory that are needed for the VM, but limit this to what is really needed. If the size of the VM can be kept to equal or less than the size of the NUMA node, then performance can be optimized, as remote NUMA node memory access is limited or eliminated. This aspect of NUMA performance was measured on the 4-socket (or 4 NUMA node) Broadwell server. Four 24- vcpu virtual machines were tested and compared against the performance of a single 96-vCPU VM. The results in the chart below show that the total throughput of the four VMs was 15% higher than the throughput of the single VM. PERFORMANCE STUDY 7

Relative Orders Per Minute (OPM) 1.2 1.1 1 0.9 0.8 Performance Advantage of 1 VM Per NUMA Node vs 1 VM with 4 Virtual NUMA Nodes 1 x 96 vcpu VM 4 x 24 vcpu VMs Figure 4. Total throughput of four VMs is 15% higher than the throughput of one VM, where the logical total of vcpus in each case is 96 PERFORMANCE STUDY 8

Best Practices The VMware-published Oracle Databases on VMware Best Practices Guide [1] is the best source for complete details on best practices. In this section, the key best practices for large virtual machines and Oracle databases are summarized, but for more details the full best practices guide should be consulted. CPU The latest generation server CPUs provide the best performance. Whenever possible, use the latest generation processors to get the best performance for database virtual machines. Right-sizing of vcpus and memory for the needed load is important to achieve best performance. Monster virtual machines can provide great performance, so assign the database virtual machine all the vcpus that it needs. However, overprovisioning virtual machines is counterproductive to achieving best overall performance. Keeping the physical NUMA node sizes in mind when sizing virtual machines is a good guide. Limit the number of physical NUMA nodes to a minimum that a virtual machine will span to get the best performance. Do not use CPU affinity. While it is possible to achieve some small gains in performance with CPU affinity in some cases, it is more likely to cause problems and performance issues than it is to provide significant benefit. Enable Hyper-Threading on the host. It has been found that Hyper-Threading can improve host performance by 10 to 30 percent with many database workloads. Memory Size memory based on NUMA node size. Overallocation of memory to a virtual machine beyond what is needed can result in crossing physical NUMA node boundaries unnecessarily. Assign the needed memory, but don t over allocate. Set a memory reservation for the virtual machine equal to the Oracle SGA. This will ensure that the memory needed for the buffer cache of the database is reserved and will ensure good performance. Use large memory pages. Large memory pages are more efficient and improve the performance of databases due to their heavy usage of memory. In Linux, large pages are referred to as huge pages. Storage Use dedicated LUNs or datastores for data and log virtual disks. Ensure that the storage being used for the data and log virtual disks is not shared with other virtual machines or applications. Configure all database virtual machines with only paravirtual SCSI adapters. Use VMFS for all vsphere datastores. Tests have shown that VMFS and RDM provide essentially the same performance, but VMFS is much easier to manage and maintain. High performance all-flash storage should be used when possible. Oracle databases greatly benefit from the low latency performance of high-end storage arrays. The IBM FlashSystem A9000 used in these tests maintained low latency performance during all tests, enabling excellent performance for each VM configuration. PERFORMANCE STUDY 9

Oracle Database Follow Oracle best practices within the virtual machine. Oracle inside a properly configured virtual machine behaves like Oracle on a bare metal system. Evaluate the use of Oracle s NUMA support before using it. The default value for Oracle s NUMA support is disabled, and in testing with the DVD Store it was found that this default value of disabled provided the best performance. It is important to note here that NUMA can be enabled or disabled also at the virtual machine and physical server levels. The best combination for performance is to leave it enabled for virtual machine and physical server levels, while leaving it disabled for Oracle. These are the default settings. Conclusion Testing shows that running Oracle databases on monster virtual machines provides excellent performance. By using established best practices and using the latest server and storage technology, virtual databases provide the needed performance capacity for enterprises of any size. References [1] Todd Muirhead. (2017, April) DVD Store 3. https://github.com/dvdstore/ds3 [2] VMware, Inc. (2016, May) Oracle Databases on VMware Best Practices. https://www.vmware.com/content/dam/digitalmarketing/vmware/en/pdf/solutions/vmware-oracledatabases-on-vmware-best-practices-guide.pdf PERFORMANCE STUDY 10

About the Author Todd Muirhead is a performance engineer at VMware and has worked with many different databases, servers, and storage systems to develop best practices and performance guides. He is also the co-creator and maintainer of the DVD Store open source benchmark. Acknowledgements The author would like to thank David Morse for valuable conversations on database performance; Dave Jaffe, for his partnership on the DVD Store Benchmark; and many colleagues in the performance engineering organization for their support and feedback. VMware, Inc. 3401 Hillview Avenue Palo Alto CA 94304 USA Tel 877-486-9273 Fax 650-427-5001 www.vmware.com Copyright 2017 VMware, Inc. All rights reserved. This product is protected by U.S. and international copyright and intellectual property laws. VMware products are covered by one or more patents listed at http://www.vmware.com/go/patents. VMware is a registered trademark or trademark of VMware, Inc. in the United States and/or other jurisdictions. All other marks and names mentioned herein may be trademarks of their respective companies. Comments on this document: https://communities.vmware.com/docs/doc-35166