Fit for Purpose Platform Positioning and Performance Architecture

Similar documents
Fit For Purpose Platform Architecture Selection and Design

Leading-edge Technology on System z

Fit For Purpose. Joe Temple IBM 8/9/2011 Session Time Horizon ISV Support. Scale. Deployment Model

White Paper. Major Performance Tuning Considerations for Weblogic Server

Copyright 2018, Oracle and/or its affiliates. All rights reserved.

Leveraging Power7 with IBM Middleware solutions

Mellanox InfiniBand Solutions Accelerate Oracle s Data Center and Cloud Solutions

IBM Systems: Helping the world use less servers

Adaptive SMT Control for More Responsive Web Applications

Performance and Energy Efficiency of the 14 th Generation Dell PowerEdge Servers

Private Cloud Database Consolidation Name, Title

Power your planet. Optimizing the Enterprise Data Center POWER7 Powers a Smarter Infrastructure

Accelerate Applications Using EqualLogic Arrays with directcache

Optimize OLAP & Business Analytics Performance with Oracle 12c In-Memory Database Option

Real-time Session Performance

The Role of Database Aware Flash Technologies in Accelerating Mission- Critical Databases

Ensuring Business Continuity with the IBM Fit For Purpose Methodology

vsan 6.6 Performance Improvements First Published On: Last Updated On:

Deploy a High-Performance Database Solution: Cisco UCS B420 M4 Blade Server with Fusion iomemory PX600 Using Oracle Database 12c

Evaluating Hyperconverged Full Stack Solutions by, David Floyer

Optimizing the IT/Server Ecosystem for Cloud Computing

Oracle Exadata X7. Uwe Kirchhoff Oracle ACS - Delivery Senior Principal Service Delivery Engineer

... IBM Advanced Technical Skills IBM Oracle International Competency Center September 2013

5 Myths in Systems Performance

z Processor Consumption Analysis Part 1, or What Is Consuming All The CPU?

Copyright 2011, Oracle and/or its affiliates. All rights reserved.

<Insert Picture Here> MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure

Oracle WebLogic Server Multitenant:

IBM s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice at IBM s sole discretion.

Taking your next integration or BPM project to the cloud WebSphere Integration User Group, 12 July 2012 IBM Hursley

Oracle Exadata: Strategy and Roadmap

IBM Education Assistance for z/os V2R2

Gaining Insights into Multicore Cache Partitioning: Bridging the Gap between Simulation and Real Systems

Reduce Costs & Increase Oracle Database OLTP Workload Service Levels:

1. Which programming language is used in approximately 80 percent of legacy mainframe applications?

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. reserved. Insert Information Protection Policy Classification from Slide 8

AMD Opteron Processors In the Cloud

vsphere 4 The Best Platform for Business-Critical Applications Gaetan Castelein Sr Product Marketing Manager VMware, Inc.

IBM Integration Bus v9.0 System Administration: Course Content By Yuvaraj C Panneerselvam

QLIKVIEW SCALABILITY BENCHMARK WHITE PAPER

Exadata X3 in action: Measuring Smart Scan efficiency with AWR. Franck Pachot Senior Consultant

Oracle Platform Performance Baseline Oracle 12c on Hitachi VSP G1000. Benchmark Report December 2014

Workload Thinking for zenterprise Fit for Purpose!

Future-ready IT Systems with Performance Prediction using Analytical Models

Oracle Performance on M5000 with F20 Flash Cache. Benchmark Report September 2011

<Insert Picture Here> Maximizing Database Performance: Performance Tuning with DB Time

Extreme Performance Platform for Real-Time Streaming Analytics

Joe Butler, Principal Engineer, Director Cloud Services Lab. Nov , OpenStack Summit Paris.

End to End Analysis on System z IBM Transaction Analysis Workbench for z/os. James Martin IBM Tools Product SME August 10, 2015

Isilon Performance. Name

Optimize Your Heterogeneous SOA Infrastructure

Live Migration: Even faster, now with a dedicated thread!

This presentation is for informational purposes only and may not be incorporated into a contract or agreement.

Forget IOPS: A Proper Way to Characterize & Test Storage Performance Peter Murray SwiftTest

Maximizing offload to ziip processors with DB2 9 for z/os native SQL stored procedures

Infor Lawson on IBM i 7.1 and IBM POWER7+

Performance and Scalability of Server Consolidation

vsphere 4 The Best Platform for Business-Critical Applications

Crescando: Predictable Performance for Unpredictable Workloads

Moving Databases to Oracle Cloud: Performance Best Practices

Oracle Database 12c: RAC Administration Ed 1

Oracle Autonomous Database

Was ist dran an einer spezialisierten Data Warehousing platform?

Capacity Planning for Application Design

Best Practices. Deploying Optim Performance Manager in large scale environments. IBM Optim Performance Manager Extended Edition V4.1.0.

HPC and IT Issues Session Agenda. Deployment of Simulation (Trends and Issues Impacting IT) Mapping HPC to Performance (Scaling, Technology Advances)

Background Heterogeneous Architectures Performance Modeling Single Core Performance Profiling Multicore Performance Estimation Test Cases Multicore

RACKSPACE ONMETAL I/O V2 OUTPERFORMS AMAZON EC2 BY UP TO 2X IN BENCHMARK TESTING

How Architecture Design Can Lower Hyperconverged Infrastructure (HCI) Total Cost of Ownership (TCO)

VERITAS Storage Foundation 4.0 for Oracle

Oracle Database 10g Resource Manager. An Oracle White Paper October 2005

Optimize Oracle Business Intelligence Analytics with Oracle 12c In-Memory Database option

WebSphere MQ for Windows V6.0 - Performance Evaluations. Version 1.1. August Peter Toghill

Implementing SQL Server 2016 with Microsoft Storage Spaces Direct on Dell EMC PowerEdge R730xd

Session 1079: Using Real Application Testing to Successfully Migrate to Exadata - Best Practices and Customer Case Studies

Netezza The Analytics Appliance

IBM Cognitive Systems Cognitive Infrastructure for the digital business transformation

Architecting For Availability, Performance & Networking With ScaleIO

PureSystems: Changing The Economics And Experience Of IT

Using Automatic Workload Repository for Database Tuning: Tips for Expert DBAs. Kurt Engeleiter Product Manager

Oracle Database 11g Direct NFS Client Oracle Open World - November 2007

Oracle Database 12c R2: RAC Administration Ed 2

Oracle 1Z0-054 Exam Questions and Answers (PDF) Oracle 1Z0-054 Exam Questions 1Z0-054 BrainDumps

Oracle Database 10g The Self-Managing Database

Optimize Oracle Business Intelligence Analytics with Oracle 12c In-Memory Database Option. Kai Yu Oracle Solutions Engineering Dell Inc

The Modern Mainframe At the Heart of Your Business

Chapter 03. Authors: John Hennessy & David Patterson. Copyright 2011, Elsevier Inc. All rights Reserved. 1

Copyright 2014, Oracle and/or its affiliates. All rights reserved.

QLE10000 Series Adapter Provides Application Benefits Through I/O Caching

SoftNAS Cloud Performance Evaluation on AWS

Storage Optimization with Oracle Database 11g

Oracle #1 RDBMS Vendor

Lenovo Software Defined Infrastructure Solutions. Aleš Simončič Technical Sales Manager, Lenovo South East Europe

A New Metric for Analyzing Storage System Performance Under Varied Workloads

IBM DB2 Analytics Accelerator use cases

The Oracle Database Appliance I/O and Performance Architecture

FIVE BEST PRACTICES FOR ENSURING A SUCCESSFUL SQL SERVER MIGRATION

HITACHI UNIFIED COMPUTE PLATFORM

Oracle EXAM - 1Z Oracle Database 11g: Performance Tuning. Buy Full Product.

A Performance Characterization of Microsoft SQL Server 2005 Virtual Machines on Dell PowerEdge Servers Running VMware ESX Server 3.

Transcription:

Fit for Purpose Platform Positioning and Performance Architecture Joe Temple IBM Monday, February 4, 11AM-12PM Session Number 12927 Insert Custom Session QR if Desired.

Fit for Purpose Categorized Workload Types Mixed Workload Type 1 Scales up Updates to shared data and work queues Complex virtualization Business Intelligence with heavy data sharing and ad hoc queries Parallel Data Structures Type 3 Scales well on clusters XML parsing Buisness intelligence with Structured Queries HPC applications Application Function Data Structure Usage Pattern SLA Integration Scale Highly Threaded Type 2 Scales well on large SMP Web application servers Single instance of an ERP system Some partitioned databases Small Discrete Type 4 Limited scaling needs HTTP servers File and print FTP servers Small end user apps Black are design factors Blue are local factors

These do not define workload Insert Custom Session QR if Desired. Languages c/c++,cobol,fortran, JAVA, etc. Middleware Oracle, DB2, Websphere, MQ, Tuxedo,CICS, Encina,etc. Workload Type OLTP, Analytics, Business Applications, Infrastructure Mixed/Consolidated, Highly Threaded, Parallel, Small Discrete Workload types can be used for positioning machines, but are not enough to guide platform selection 3

Workload Definition A workload consists of a workload type plus local factors: Usage Pattern Load Variability Scale Size of load Service Level Response or turnaround expectation at load Desired Efficiency Target utilization level Integration Connections and shared data impacts 4

x86 Performance Degrades As I/O Demand Increases Run multiple virtual machines on x86 server Each virtual machine has an average I/O rate x86 processor utilization is consumed as I/O rate increases Intel CPU As IO Load Increases 50 45 CPU CPU% utilization 40 35 30 25 20 15 Excess CPU cycles spent on processing I/O Saturation No IO 256KB IO 512KB IO 1MB IO 1.5MB IO 10 5 0 0 5 10 15 20 25 Source: CPO Internal Study. 12-core Westmere EP with KVM. FB at 22 tps with varying IO per transaction. VMs 05. What System z Can Do That x86-based Systems Can t 5

Scaling Matters Productive Nodes 12 11 10 9 8 7 6 5 4 3 2 1 1.78 DB2 for z/os Near Linear Scalability 1.69 3.48 Perfect Linear Performance Productive Nodes 5.24 2.44 6.86 9.98 1 2 3 4 5 6 7 8 9 10 11 12 Nodes in Cluster DB2 for z/os OLTP result (ITG 03) Oracle RAC Poor Scalability Oracle RAC is inefficient by design Network based lock and buffer management Scaling RAC requires complex tuning and partitioning Application partition awareness makes it difficult to add or remove nodes Published studies demonstrate difficult or poor scalability Dell (shown in chart): Poor scalability despite using InfiniBand for RAC interconnect CERN: Four month team effort to tune RAC, change database, change application Insight Technology: Even a simple application on two node RAC requires complex tuning and partitioning to scale Oracle RAC characteristics as shown in Dell RAC InfiniBand Study http://www.dell.com/downloads/global/power/ps2q07-20070279-mahmood.pdf CERN (European Organization for Nuclear Research) http://www.oracleracsig.org/pls/apex/rac_sig.download_my_file?p_file=1001900 Insight Technology http://www.insight-tec.com/en/mailmagazine/vol136.html

Cache Working Set Matters 1000$ Cache(Hierarchy(up(to(node(memory( 100$ Latency((nSec)( 10$ 1$ Sandy$Bridge$ p7+$ite$ p780$$ zec12$ 0.1$ 0.01$ 0.1$ 1$ 10$ 100$ Capacity((MB)( 7

Queuing and Load Variability Matter!Wait!Time!v!U+liza+on!by!variability! 10" 9" 8" 7" 6" 5" 4" 3" 2" 1" 0" 0" 0.2" 0.4" 0.6" 0.8" 1" Very"Low" Low" Moderate" High" Very"High" 8

Response (me and Consolida(on ma/er 10" 9" 8" Response'Teim' 7" 6" 5" 4" 3" 2" 1" Sandy"Bridge" Westmere"EX" P780" p7+"p6"mode" Power"7+" z196" zec12" 0" 0" 0.5" 1" 1.5" 2" 2.5" 3" 3.5" N' Each machine par,,on is three dedicated cores, number of loads varies

Modeling and benchmarks There is enough data in the machine specification to make an architectural performance model We know that distributed on line loads usually have high variability However, the resulting model has relatively low precision It is better to use measurements Traditional measurement of maximum throughput metrics will not help enhance the model. It simply replaces it with another low precision model. We should measure single thread speed Single thread Saturation Scaling with increased thread count (related to saturation) Interval data of the usage and possibly throughput pattern 10

Performance architecture involves requirements as well as comparisons How is response time defined? Completion of a single thread of work? Completion of many threads of work? What response time is required and what fraction of the peaks need to be covered? There is a trade off between peak coverage, cost and utilization efficiency. Feasibility can become and issue if the SLA is too tight. Is aggregate throughput meaningful to users or is the preferred metric the number of loads contained in the machine while meeting the SLA? 11

There is a design tradeoff between throughput and capacity 300" Max$Thruput$v$Thread$Capacity$ 250" 200" 150" 100" 50" SB" WM"EX" P460" P7+" ZEC12" 0" 0" 20" 40" 60" 80" 100" Here: Throughput is Clock * SMT muttiplier/threads per core * total threads Thread Capacity is Clock * SMT muttiplier/threads per core * cache/thread It is best to replace is Clock * SMT muttiplier/threads per core by measured thread speed Throughput can t be faster that thread speed * Threads Thread capacity is how much work can stack on a single thread which is related to both the thread speed and the cache available. 12

Virtual Machine Density and the Tradeoff As VM s per core of the workload increases the importance of aggregate throughput decreases As the size of a virtual machine increases The importance of its internal throughput rate increases. Increased density favors favors z; increased VM size favors Power Intel is favored when resources can be aggregate without scaling penalties. Power and z are favored when resources can be shared without scaling penalties. 13

Do you need a deep dive to understand workload fit? Workload fit involves more than determining the workload type and a throughput ratio rule of thumb. Operational considerations will change the relative capacity of machines Throughput ratios do not generally take operational tradeoffs into consideration An Performance Architecture workshop can provide such a deep dive. The objectives of the workshop are to build a model which produces characteristic curves Response time v Throughput Response time v Load Count or VM count Response time v utilization Throughput v utilization Scaling The workshop can work with machine specs and assumed usage patterns in lieu of data but collection of data will yield better results 14

Fit for purpose thinking comes down to: Know the legacy, workload, and costs Know the current IT Environment Understand the workload Examine costs Workload analysis gets technical fast, and real cost analysis is a deep dive.

Eagle Engagements Free of Charge total cost of ownership study that helps customers evaluate the lowest cost option among alternative approaches. The study usually requires one day for an on-site visit and is specifically tailored to a customer s enterprise. The study can be focused on at least one of the areas below : Fit For Purpose Platform Selection Private Cloud Implementation Enterprise Server Issues We conduct Eagle studies for System z, POWER, and PureSystems accounts Over 300 customer studies since the formation of the TCO Eagle team in 2007 Engage our Eagle-Eyed Experts! Start by requesting your IBM Contact to send an email to eagletco@us.ibm.com For deep workload analysis workshop use the same link and ask for Joe Temple Will be ramping up capability for workload deep dives in the coming quarters.