Virtualization and the Metrics of Performance & Capacity Management

Similar documents
Performance & Scalability Testing in Virtual Environment Hemant Gaidhani, Senior Technical Marketing Manager, VMware

Microsoft SQL Server in a VMware Environment on Dell PowerEdge R810 Servers and Dell EqualLogic Storage

Chapter 5 C. Virtual machines

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

vsphere 4 The Best Platform for Business-Critical Applications Gaetan Castelein Sr Product Marketing Manager VMware, Inc.

Virtual SQL Servers. Actual Performance. 2016

Virtualization of the MS Exchange Server Environment

BlackBerry AtHoc Networked Crisis Communication Capacity Planning Guidelines. AtHoc SMS Codes

Cloud & Datacenter EGA

PRESENTATION TITLE GOES HERE

davidklee.net gplus.to/kleegeek linked.com/a/davidaklee

Reference Architecture for Dell VIS Self-Service Creator and VMware vsphere 4

Amazon EC2 Deep Dive. Michael #awssummit

Public Cloud Leverage For IT/Business Alignment Business Goals Agility to speed time to market, adapt to market demands Elasticity to meet demand whil

Comparison of Storage Protocol Performance ESX Server 3.5

vnetwork Future Direction Howie Xu, VMware R&D November 4, 2008

Emulex LPe16000B 16Gb Fibre Channel HBA Evaluation

VMware vsphere Beginner s Guide

What s New in VMware vsphere 4.1 Performance. VMware vsphere 4.1

W H I T E P A P E R. Comparison of Storage Protocol Performance in VMware vsphere 4

How it can help your organisation

A Performance Characterization of Microsoft SQL Server 2005 Virtual Machines on Dell PowerEdge Servers Running VMware ESX Server 3.

Quantifying Load Imbalance on Virtualized Enterprise Servers

Key metrics for effective storage performance and capacity reporting

Extremely Fast Distributed Storage for Cloud Service Providers

Application-Specific Configuration Selection in the Cloud: Impact of Provider Policy and Potential of Systematic Testing

Nested Virtualization and Server Consolidation

End to End SLA for Enterprise Multi-Tenant Applications

Maailman paras palvelinjärjestelmä. Tommi Salli Distinguished Engineer

Power your planet. Optimizing the Enterprise Data Center POWER7 Powers a Smarter Infrastructure

VMware Cloud on AWS Technical Deck VMware, Inc.

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

CS370 Operating Systems

PAC094 Performance Tips for New Features in Workstation 5. Anne Holler Irfan Ahmad Aravind Pavuluri

HP ProLiant BladeSystem Gen9 vs Gen8 and G7 Server Blades on Data Warehouse Workloads

BERLIN. 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved

The Oracle Database Appliance I/O and Performance Architecture

vstart 50 VMware vsphere Solution Specification

Real-time Monitoring, Inventory and Change Tracking for. Track. Report. RESOLVE!

The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into

VMware vsphere with ESX 4 and vcenter

SR-IOV Support for Virtualization on InfiniBand Clusters: Early Experience

Virtualizing Oracle on VMware

Windows Servers In Microsoft Azure

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

W H I T E P A P E R. What s New in VMware vsphere 4: Performance Enhancements

How Symantec Backup solution helps you to recover from disasters?

Data Path acceleration techniques in a NFV world

Fast and Easy Persistent Storage for Docker* Containers with Storidge and Intel

CONTAINER CLOUD SERVICE. Managing Containers Easily on Oracle Public Cloud

Deploying Liferay Digital Experience Platform in Amazon Web Services

INNOV-4: Fun With Virtualization. Or, How I learned to love computers that don t really exist...

FIVE REASONS YOU SHOULD RUN CONTAINERS ON BARE METAL, NOT VMS

A Comparative Study of Microsoft Exchange 2010 on Dell PowerEdge R720xd with Exchange 2007 on Dell PowerEdge R510

PERFORMANCE STUDY OCTOBER 2017 ORACLE MONSTER VIRTUAL MACHINE PERFORMANCE. VMware vsphere 6.5

LATEST INTEL TECHNOLOGIES POWER NEW PERFORMANCE LEVELS ON VMWARE VSAN

Veritas Dynamic Multi-Pathing for VMware 6.0 Chad Bersche, Principal Technical Product Manager Storage and Availability Management Group

监控您的 SmartCloud 刘鹤 IBM 软件部

Virtualization with VMware ESX and VirtualCenter SMB to Enterprise

Gavin Payne Senior Consultant.

PERFORMANCE CHARACTERIZATION OF MICROSOFT SQL SERVER USING VMWARE CLOUD ON AWS PERFORMANCE STUDY JULY 2018

RACKSPACE ONMETAL I/O V2 OUTPERFORMS AMAZON EC2 BY UP TO 2X IN BENCHMARK TESTING

Lecture 09: VMs and VCS head in the clouds

Self-driving Datacenter: Analytics

Measuring VMware Environments

VMware Virtual SAN Technology

Lightstreamer. The Streaming-Ajax Revolution. Product Insight

TALK THUNDER SOFTWARE FOR BARE METAL HIGH-PERFORMANCE SOFTWARE FOR THE MODERN DATA CENTER WITH A10 DATASHEET YOUR CHOICE OF HARDWARE

Spring 2017 :: CSE 506. Introduction to. Virtual Machines. Nima Honarmand

Key Considerations for Improving Performance And Virtualization in Microsoft SQL Server Environments

Migration Strategies from vsphere to Linux and OpenStack via a shared virtualized network

Using MySQL in a Virtualized Environment. Scott Seighman Systems Engineer Sun Microsystems

Service Mesh and Microservices Networking

Oracle Solaris 11: No-Compromise Virtualization

EMC Virtual Infrastructure for Microsoft Applications Data Center Solution

Virtual Infrastructure Plug-in Metric Reference Manual 13c Release 1 ( )

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

Session 201-B: Accelerating Enterprise Applications with Flash Memory

VIRTUAL APPLIANCES. Frequently Asked Questions (FAQ)

Software Defined Storage at the Speed of Flash. PRESENTATION TITLE GOES HERE Carlos Carrero Rajagopal Vaideeswaran Symantec

Virtual Business Servers technical description

Free up rack space by replacing old servers and storage

Live Migration of Virtualized Edge Networks: Analytical Modeling and Performance Evaluation

Webinar Series: Triangulate your Storage Architecture with SvSAN Caching. Luke Pruen Technical Services Director

Taming the Multi-Cloud With Simplicity and Openness. Minh Dang Cisco Systems Vietnam 2018 January

Understanding VMware Capacity

TrueSight 10 Architecture & Scalability Q&A Best Practice Webinar 8/18/2015

Performance and Scalability of Server Consolidation

Virtual Leverage: Server Consolidation in Open Source Environments. Margaret Lewis Commercial Software Strategist AMD

Microsoft Applications on Nutanix

BUYING SERVER HARDWARE FOR A SCALABLE VIRTUAL INFRASTRUCTURE

Pivot3 Acuity with Microsoft SQL Server Reference Architecture

TIPS TO. ELIMINATE LATENCY in your virtualized environment

IaaS Vendor Comparison

PARDA: Proportional Allocation of Resources for Distributed Storage Access

Identifying Performance Bottlenecks with Real- World Applications and Flash-Based Storage

Veritas CommandCentral Supporting the Virtual Enterprise. April 2009

Real-Time Cache Management for Multi-Core Virtualization

INTEL NEXT GENERATION TECHNOLOGY - POWERING NEW PERFORMANCE LEVELS

Building Private Cloud Infrastructure

Transcription:

23 S September t b 2011 Virtualization and the Metrics of Performance & Capacity Management Has the world changed? Mark Preston

Agenda Reality Check. General Observations Traditional metrics for a non-virtual environment How virtualization scheduling/dispatching of shared environments can change the paradigm Case Study Conclusions Discussion 2

Reality Check Application users don t care about system utilization as long as response time is met Consumers purchase units of computing to support functionality, not CPU cycles Internal Clusters, SAN, Networking Cloud - Ex: AWS; One EC2 Compute Unit provides the equivalent CPU capacity of a 1.0-1.2 1 GHz 2007 Opteron or 2007 Xeon processor. This is also the equivalent to an early-2006 1.7 GHz Xeon processor Virtualized environments exhibit a range of latency & neighborhood h sensitivity Shared environment interference effects are real Consumers of IT are realizing that in a virtual world utilization based capacity planning is suspect, if not useless in some cases 3

Reality Check System building blocks incur latency Virtualization is not a free technology (adds another layer) In the drive to reduce enterprise cost host are over-subscribed Over-subscribed host and shared resources (Multi-CPU, I/O consolidation) require scheduling, which incurs overhead The application team doesn t have control of the neighborhood Let s reword the opening Reality Check Application admin s don t care about system utilization as long as response time is met 4

General Observations The neighborhood is affected by many factors Hypervisor, Over-subscription, Load synchronicity y Underlying HW virtualization (AMD-V, Intel VT-x, sriov, mriov) Some classes of applications are better suited for virtualization than others Financial trading systems, Infrastructure Monitoring Less compatible Transaction processing Depends on workload Web facing Highly compatible Virtualization is best suited for dissimilar applications Multi-threaded applications (databases) prefer bare metal Intra-tier sensitivity may produce unexpected application effects A Virtualized DB, sensitive to virtualization latency, may result in low Web tier utilization and Server is too busy messages Cloud adds both on-site and connection latency 5

CPU Utilization Traditional Metrics CPU is generally considered to be a proxy for all other performance/capacity vectors Memory Network I/O Assumption - Interference effects from other workloads do not occur for the CPU vector Free Memory An indication of goodness Virtualization i can over-commit memory ccnuma can add latency - local L1 cache vs. remote inter-socket L2/L3 cache Storage Throughput Typically specified as IOPs, MB/s Assumption - Interference effects from other workloads do not occur for the CPU vector 6

How Does Virtualization ti Change Things? Shared-occupancy host environments exhibit not-easily- visible interference effects Resource scheduler / dispatcher design sharpens this interference effect CPU busy and storage throughput at the guest level is no longer a reliable indicator of workload performance Usage and throughput metrics are increasingly decoupled 7

Virtualization ti adds layers and latency App App App App 2 vcpu/mem vfs, vfa, vnic 2 vcpu/mem vfs, vfa, vnic 4 vcpu/mem vfs, vfa, vnic 4 vcpu/mem vfs, vfa, vnic 12 vcpus Inc creasing Lat tency Virtualization Scheduling, Load Balancing & Bandwidth Management pcpu pcpu pcpu pcpu pnic pnic pfa pfa Cache Cache Cache pmemory Over Sub bscription 4 pcpus 2 pnics 2 pfas Switch SAN 8

Each Layer Incurs Latency Your Data Your Application Designs, Lessons and Advice from Building Large Distributed Systems Numbers Everyone Should Know Jeff Dean Google Fellow 9

Virtualization exhibits the classic latency curve Source: VMware ESX 3: Ready Time Observations - Feb 2004 10

Storage also follows the traditional latency curve 11

Capacity Planning with Traditional Metrics %CPU < 60-70%..The world is happy!. I think Traditional allocation of resources Produces an ESX host with apparently significant capacity 12

Case Study Our application provides: Infrastructure discovery Event correlation Service assurance Deployed across entire enterprise Composed of multiple tiers / components Collect data across thousands of monitored resources Ingest and processing of events Console functionality for operation centers/admins Some components exhibit synchronous behavior Reporting elements can be resource hogs Users characterized application as Unusable 13

Environment Capacity Audit N P bl F d No Problem Found Host CPU Utilization Memory Utilization Storage Throughput ESX Host A 16-20% ~12.5% 30MB/s ESX Host B 19-25% ~28% 3MB/s ESX Host C ~10% ~12.5% 22MB/s for App A kb/s for App of Interest 14

Performance Analysis Significant waiting for resources Performance Test Traditional utilization metrics ti indicate significant capacity Virtualization metrics indicate significant ifi waiting i for CPU resources %Rdy Ready-to-Run but no physical CPU resource free 5% - Could be sign of trouble 10% - There is a problem 10-15% - In this case the application was unusable 15

Environment Performance Audit Host CPU Utilization Memory Utilization Storage Throughput CPU %Rdy Latency ESX Host A 16-20% ~12.5% 30MB/s 10% ESX Host B 19-25% ~28% 3MB/s 35-55% ESX Host C ~10% ~12.5% 22MB/s for App A kb/s for App of Interest 22-35% 16

Resolution Starting point Performance team brought in after many months of application team trying to make application usable Existing infrastructure - ESX 3.5 + vmware tools + multi-core ESC host + SAN Followed a structured use-case driven performance test, analysis, recommendations process Initial relatively short timeframe solution ESX 4.1 + vmware tools + same multi-core ESX host + SAN Result better but still judged Not Usable Final solution ESX 4.1 + vmware tools + new 24-core based server cluster + new dedicated VM neighborhood layout + dedicated one-to-one vcpu-to-pcpu + load- balanced SAN Although one step away from a physical environment, virtualization still provides ility benefits 17

Performance issues resolved %Rdy reduced to less than 5%! CPU utilization has virtually quadrupled! %CPU, a dependant metric, has increased to acceptable levels 18

Latency can be cumulative Web Business Intelligence DB DB. Inc creasing Lat tency From 2 vcpu/mem an overall 2 service vcpu/memperspective 4 vcpu/mem the impact 4 vcpu/mem of %Rdy vfs, vfa, vnic vfs, vfa, vnic vfs, vfa, vnic vfs, vfa, vnic is application, neighborhood and path dependent Virtualization Scheduling, Load Balancing & Bandwidth Management From a host perspective the impact of %Rdy can be summation dependent For sequential task the cumulative delay of %Rdy is most likely unacceptable pcpu pcpu pcpu pcpu pnic pnic pfa pfa Cache Cache Cache pmemory Careful latency driven selection and layout of VMs is critical to service performance Switch SAN 12 vcpus Over Sub bscription 4 pcpus 2 pnics 2 pfas 19

Summary Old metrics can be misleading Low CPU utilization may be an indication of significant waiting for CPU resources Low storage throughput may be an indication of significant waiting for SAN resources Review your application for suitability Financial trading systems, Infrastructure Monitoring Less compatible Transaction processing Depends on workload Web facing Highly compatible Adopt /augment new metrics for virtualized environments CPU For vmware ESX control for %Rdy Storage Latency is unique to class, architecture and technology of storage Similar to BTM, Control for Latency 20