WHITE PAPER AGILOFT SCALABILITY AND REDUNDANCY

Similar documents
An Oracle White Paper December Accelerating Deployment of Virtualized Infrastructures with the Oracle VM Blade Cluster Reference Configuration

MyCloud Computing Business computing in the cloud, ready to go in minutes

IBM InfoSphere Streams v4.0 Performance Best Practices

Performance Pack. Benchmarking with PlanetPress Connect and PReS Connect

Storage Consolidation with the Dell PowerVault MD3000i iscsi Storage

Demartek September Intel 10GbE Adapter Performance Evaluation for FCoE and iscsi. Introduction. Evaluation Environment. Evaluation Summary

Assessing performance in HP LeftHand SANs

Silicon House. Phone: / / / Enquiry: Visit:

Dell PowerEdge R720xd with PERC H710P: A Balanced Configuration for Microsoft Exchange 2010 Solutions

Lenovo Database Configuration Guide

Design a Remote-Office or Branch-Office Data Center with Cisco UCS Mini

Cisco Prime Home 6.X Minimum System Requirements: Standalone and High Availability

RIGHTNOW A C E

IBM s Data Warehouse Appliance Offerings

Microsoft Office SharePoint Server 2007

Design a Remote-Office or Branch-Office Data Center with Cisco UCS Mini

IBM Emulex 16Gb Fibre Channel HBA Evaluation

Enhancing Oracle VM Business Continuity Using Dell Compellent Live Volume

Enterprise print management in VMware Horizon

Intel Enterprise Edition Lustre (IEEL-2.3) [DNE-1 enabled] on Dell MD Storage

Drecom Accelerates Anytime, Anywhere Gaming

Adobe Acrobat Connect Pro 7.5 and VMware ESX Server

vsphere Installation and Setup Update 2 Modified on 10 JULY 2018 VMware vsphere 6.5 VMware ESXi 6.5 vcenter Server 6.5

ADDENDUM TO: BENCHMARK TESTING RESULTS UNPARALLELED SCALABILITY OF ITRON ENTERPRISE EDITION ON SQL SERVER

The advantages of architecting an open iscsi SAN

Sugon TC6600 blade server

Maximum Availability Architecture: Overview. An Oracle White Paper July 2002

<Insert Picture Here> Introducing Oracle WebLogic Server on Oracle Database Appliance

A Rational software Whitepaper 05/25/03. IBM Rational Rapid Developer Scalability and Performance Benchmark

Sun Lustre Storage System Simplifying and Accelerating Lustre Deployments

RACKSPACE ONMETAL I/O V2 OUTPERFORMS AMAZON EC2 BY UP TO 2X IN BENCHMARK TESTING

Whitepaper / Benchmark

An Oracle White Paper Released April 2008

WHITE PAPER: BEST PRACTICES. Sizing and Scalability Recommendations for Symantec Endpoint Protection. Symantec Enterprise Security Solutions Group

Oracle Database 12c: JMS Sharded Queues

Evaluation Report: HP StoreFabric SN1000E 16Gb Fibre Channel HBA

An Oracle White Paper September, Oracle Real User Experience Insight Server Requirements

Parallels Virtuozzo Containers

IBM BladeCenter S Competitive Summary

Executive Briefing. All SAN storage

Minimum Requirements for Cencon 4 with Microsoft R SQL 2008 R2 Enterprise

Benefits of an Exclusive Multimaster Deployment of Oracle Directory Server Enterprise Edition

Virtualization of the MS Exchange Server Environment

Vendor: Citrix. Exam Code: 1Y Exam Name: Designing Citrix XenDesktop 7.6 Solutions. Version: Demo

Intel Cluster Ready Allowed Hardware Variances

IBM Scale Out Network Attached Storage (SONAS) using the Acuo Universal Clinical Platform

Improving Blade Economics with Virtualization

Accelerating Microsoft SQL Server 2016 Performance With Dell EMC PowerEdge R740

SharePoint 2010 Technical Case Study: Microsoft SharePoint Server 2010 Enterprise Intranet Collaboration Environment

AMD: WebBench Virtualization Performance Study

HP SAS benchmark performance tests

High-Performance Lustre with Maximum Data Assurance

System Requirements. v7.5. May 10, For the most recent version of this document, visit kcura's Documentation Site.

Pexip Infinity Server Design Guide

SGI Overview. HPC User Forum Dearborn, Michigan September 17 th, 2012

Patriot Hardware and Systems Software Requirements

Installing VMware vsphere 5.1 Components

Best Practices for Setting BIOS Parameters for Performance

A Comparative Study of Microsoft Exchange 2010 on Dell PowerEdge R720xd with Exchange 2007 on Dell PowerEdge R510

Copyright 2009 by Scholastic Inc. All rights reserved. Published by Scholastic Inc. PDF0090 (PDF)

Hardware & System Requirements

SCALING UP VS. SCALING OUT IN A QLIKVIEW ENVIRONMENT

Reference Architecture for a Virtualized SharePoint 2010 Document Management Solution A Dell Technical White Paper

Cisco Integration Platform

Episode Engine. Best Practices - Deployment. Single Engine Deployment

White Paper. Low Cost High Availability Clustering for the Enterprise. Jointly published by Winchester Systems Inc. and Red Hat Inc.

An Oracle White Paper. Released April 2013

Data Sheet FUJITSU Server PRIMERGY CX400 S2 Multi-Node Server Enclosure

Vendor: EMC. Exam Code: E Exam Name: Cloud Infrastructure and Services Exam. Version: Demo

A Performance Characterization of Microsoft SQL Server 2005 Virtual Machines on Dell PowerEdge Servers Running VMware ESX Server 3.

NEC Express5800/ft series

What's New in vsan 6.2 First Published On: Last Updated On:

High Availability Without the Cluster (or the SAN) Josh Sekel IT Manager, Faculty of Business Brock University

SGI Origin 400. The Integrated Workgroup Blade System Optimized for SME Workflows

On BigFix Performance: Disk is King. How to get your infrastructure right the first time! Case Study: IBM Cloud Development - WW IT Services

HP ProLiant DL580 G5. HP ProLiant BL680c G5. IBM p570 POWER6. Fujitsu Siemens PRIMERGY RX600 S4. Egenera BladeFrame PB400003R.

Fast and Easy Persistent Storage for Docker* Containers with Storidge and Intel

Database Solutions Engineering. Best Practices for running Microsoft SQL Server and Microsoft Hyper-V on Dell PowerEdge Servers and Storage

Lenovo Database Configuration

Backup Appliances. Geir Aasarmoen og Kåre Juvkam

Lenovo Database Configuration

TPC-E testing of Microsoft SQL Server 2016 on Dell EMC PowerEdge R830 Server and Dell EMC SC9000 Storage

MidoNet Scalability Report

Market Focus Complete Solution Certified Solution Purpose Built Patented Technologies Industry experts Right Size Rasilient s Mission

Competitive Power Savings with VMware Consolidation on the Dell PowerEdge 2950

Dell Solution for JD Edwards EnterpriseOne with Windows and SQL 2000 for 50 Users Utilizing Dell PowerEdge Servers And Dell Services

ORACLE IDENTITY MANAGER SIZING GUIDE. An Oracle White Paper March 2007

Deploy a High-Performance Database Solution: Cisco UCS B420 M4 Blade Server with Fusion iomemory PX600 Using Oracle Database 12c

Implementing SQL Server 2016 with Microsoft Storage Spaces Direct on Dell EMC PowerEdge R730xd

QLogic TrueScale InfiniBand and Teraflop Simulations

Virtuozzo Containers

DocuShare 6.6 Customer Expectation Setting

An Oracle White Paper September Oracle Utilities Meter Data Management Demonstrates Extreme Performance on Oracle Exadata/Exalogic

BENEFITS AND BEST PRACTICES FOR DEPLOYING SSDS IN AN OLTP ENVIRONMENT USING DELL EQUALLOGIC PS SERIES

Database Solutions Engineering. Best Practices for Deploying SSDs in an Oracle OLTP Environment using Dell TM EqualLogic TM PS Series

Cloud Services. Introduction

CiscoWorks NCM 1.7 Support Matrix

Extremely Fast Distributed Storage for Cloud Service Providers

Functional Testing of SQL Server on Kaminario K2 Storage

Acano solution. White Paper on Virtualized Deployments. Simon Evans, Acano Chief Scientist. March B

Transcription:

WHITE PAPER AGILOFT SCALABILITY AND REDUNDANCY

Table of Contents Introduction 3 Performance on Hosted Server 3 Figure 1: Real World Performance 3 Benchmarks 3 System configuration used for benchmarks 3 Figure 2a: New tickets per minute on dual E5440 processors 3 Figure 2b: New tickets per minute on dual E5530 processors 4 Figure 3a: Effect of the number of records on database search performance 4 Figure 3b: Effect of the number of records on file search performance 4 Figure 4a: Number of logins per minute to the End User Interface 5 Figure 4b: New logins per minute to the Power User interface on E5440 processors 5 Figure 5: Effect of the number of logged in users 5 System Requirements 6 Load Balancing 6 Separate database and J2EE servers 6 Multiple database servers 6 Multiple application servers 6 Hyper-Threading Performance 6 Server Redundancy 7 Infrastructure Redundancy 7 Performance Analysis 7 Conclusions 8

INTRODUCTION This paper reviews Agiloft scalability from the perspective of actual use on a hosted server and benchmarks on standard hardware. This analysis is followed by a discussion of the load that is placed on the system by typical users and a brief summary. The benchmark section is focused on operations that are common in production use, such as logging into the system, creating or updating a record and searching for an existing record, and analyzes how scalability increases with the number of CPU cores. It does not address the time taken to execute infrequent administrative actions such as creating a custom table, since these are intrinsically single threaded and are almost unaffected by the number of processors. PERFORMANCE ON HOSTED SERVER Figure 1: Real World Performance Avg Load 9am 5pm Peak Load 9am 5pm Hosted Server 0.68 1.17 with 1512 users These numbers were obtained from a server configured with dual Xeon X5355 processors, RAID 10 SATA hard drives and 24G RAM with 41 hosted knowledgebases (KB) and an average of ~36 users per KB for a total of 1483 users. The 41 KB s provide a broad cross-section of use patterns and the load remains low throughout the business day. Performance typically starts to degrade once the load reaches a number between 2.5 and 3.5 so this server could support a significant number of additional users. BENCHMARKS System configuration used for benchmarks Database: MySQL Hardware: Dual Xeon E5440 processors, 24 G RAM, 3 Seagate SAS drives OS: Suse Linux 11 Agiloft Release: 2008_03-12507-127-r82842 The Nehalem server processors, released in Q1 2009, are expected to increase performance by 40-80%. SAS hard drives provide ~15% better raw server performance for most operations, but except on high volume systems, this is not noticeable for users accessing the system over the Internet because server-side performance is dwarfed by the 1-2 second Internet delay for each user transaction. Agiloft also supports MS SQL and it provides the same performance as MySQL to within 15%. On average MySQL is slightly faster than MS SQL for most tests/loading conditions. Figure 2a: New tickets per minute on dual E5440 processors New Records 1600 1400 1200 1000 800 600 400 All customers on this particular server are located in North America, so the load is highest between 5am and 5pm PST. A machine that served an even distribution of international customers could support about twice as many users without increasing the peak load. 200 0 0 1 2 3 4 5 6 7 8 Active Cores

This test was run on the default Demo KB. Performance will increase or decrease depending upon the size of the ticket, the complexity of the KB structure and the number of active business rules. The time required to edit an existing ticket is typically the same as the time required to create a new one. Customers can run the same test on their hardware by selecting Setup/Performance Test from the admin console. As indicated by figure 3a, the number of records in the database does not have a measurable effect on performance, so a system with dual E5440 processors (8 cores) can generate over 80,000 records per hour until it runs out of disk space. Figure 2b: New tickets per minute on dual E5530 processors 3500 With hyper-threading turned on, the number of records per minute jumped to 3870.As indicated by figure 3a, the number of records in the database does not have a measurable effect on performance, so a system with dual E5530 processors (8 cores) system can generate over 232,000 records per hour until it runs out of disk space. Figure 3a: Effect of the number of records on database search performance Number of records in database 10,000 < 1 second 1,000,000 < 1 second Time to find text string in database 10,000,000 < 1 second (see video) These numbers reflect the fact that Agiloft incorporates a database-independent full text search engine. 3000 There are operations available only to system 2500 administrators, such as performing a mass update of all tickets in the knowledgebase, that will vary New Records 2000 1500 1000 linearly with the number of tickets in the database. Figure 3b: Effect of the number of records on file search performance 500 Number of records in database Time to find text string in attached file 0 0 1 2 3 4 5 6 7 8 10,000 < 1 second Active Cores 1,000,000 < 1 second 10,000,000 < 1 second This test illustrates the performance gains that are available from an alternate hardware configuration of dual i7 processors and dual Intel X25-E SSD hard drives. These numbers reflect the fact that Agiloft incorporates a database-independent full text search engine that indexes attached files.

Figure 4a: Number of logins per minute to the End User Interface Number of active processor cores Number of end user logins per minute 1 375 2 700 4 1150 6 1469 8 1750 These numbers reflect the number of unique users that may login to the Agiloft end user interface per minute. Comparing with figure 2, we see that when a user logs into the system, it imposes a load comparable to creating a single ticket. As detailed in figure 5, any number of users may be logged into Agiloft and unless they are actively doing something, they do not impose a measurable load on the system. Figure 4b: New logins per minute to the Power User interface on E5440 processors Number of active processor cores Number of power user logins per minute 1 153 2 284 4 466 6 597 8 712 These numbers reflect the number of unique users that may login to the Agiloft power user interface per minute. As detailed in figure 5, any number of users may be logged into Agiloft and unless they are actively doing something, they do not impose a measurable load on the system. The number of new logins per minute on dual E5530 processors exceeds 1,500. Figure 5: Effect of the number of logged in users Number of logged in users 100 < 1 second 1,000 < 1 second 10,000 < 1 second 100,000 < 1 second Time to open a new ticket These numbers were obtained with a browser on the same local network as the server with one active user. When connecting over the Internet a delay of 1 to 2 seconds for network connectivity is typical. While the presence of a logged-in user does not affect performance, users who are actively creating or editing tickets will affect performance once the number of requests approaches the rate per CPU shown in figure 2. If 1,000 users were creating new tickets at the rate of one per user per hour, there would not be a significant performance impact. 1,000 users who were each attempting to create new tickets at the rate of one per user per minute would bring an 8-core system to a crawl. Although the number of logged in users does not directly affect performance unless they are active, it does affect the amount of RAM required by JBoss. Each logged in user uses about 5K of RAM, so a server that is intended to support 100,000 loggedin users would be using 5G RAM to hold the login information in memory. Naturally, this requirement only applies to the number of logged-in users. Users who exist in the database but are not logged in do not use any RAM.

SYSTEM REQUIREMENTS Agiloft is built on J2EE architecture for scalability and has been tuned through years of production use. Provided the system is configured with at least 8G of RAM, very little time is spent on I/O and performance depends primarily upon the CPU. Because the system is J2EE based, integer performance and the amount of cache in the CPU have a strong impact on performance and recent generations of the Intel chipset are recommended over Opteron or Sparc processors. LOAD BALANCING Load balancing can be applied at the database level, the application server level and by splitting the J2EE and database servers across different machines. Separate database and J2EE servers About 55% of CPU resources are spent on Java processes and 45% on database processes. By placing the application server and database on separate machines, it is therefore possible to significantly increase scalability. However, it is important to distinguish between scalability for peak loads and performance under typical loads. Splitting the database and application server across different machines will increase scalability but it also increases the communication delay between the database and application server, so while this method will improve performance under peak-load conditions when all cores are active concurrently, performance under typical load conditions will decrease slightly. Multiple database servers The database can be configured to distribute reads among multiple systems, but of course writes must be applied to all systems concurrently. As in the example above, this can increase scalability under peak loads, particularly if the system has an unusually heavy load from SQL read statements, but the introduction of the load balancer overhead will decrease performance under typical loads. An alternative to deploying multiple database servers is to configure the machine with SSD hard drives. These provide read times several times faster than traditional hard drives and seek times an order of magnitude faster, at a price point that is significantly lower than an additional server. Such a configuration not only improves scalability under peak loads, but also improves performance under typical loads. Multiple application servers The J2EE architecture can be leveraged to cluster the application server across multiple machines and JSP server instances can also be distributed across multiple machines. However, these both carry a high communications overhead since the cache of each application server or JSP server instance must be synchronized. This is only practicable if the application servers are closely coupled by at least a 1Gps connection and in practice; any performance gain from CPU utilization is swamped by the connection overhead under typical load conditions. If the servers were connected over a WAN, performance would suffer severely. For the above reasons, this method of load balancing is no longer supported. HYPER-THREADING PERFORMANCE For customers running on the Intel i7 architecture, the re-introduction of hyper-threading provides a compelling alternative to load balancing for distributing the CPU load. Whereas prior generations of hyper-threading did little to improve performance, the increased cache sizes and improved architecture of the i7 processors have radically changed the situation. For example, by turning on hyper-threading on a single i7 920 CPU, the number of new tickets that may be created per minute jumps from 833 to 1,200. A single E5530 processor, with hyperthreading turned on and an SSD hard drive, was benchmarked at 2,134 records per minute. Dual E5530 processors, with SSD hard drives were benchmarked at 3,870 records per minute.

SERVER REDUNDANCY For high availability configurations, DRDB provides full redundancy by mirroring the hard drives across multiple machines while heartbeat activates the necessary services on the secondary server if the primary dies. The IP address of the primary server is transferred to the secondary so that it transparently replaces the primary machine and the total service interruption time in the event of catastrophic hardware failure is less than 3 minutes. We have used this configuration with our hosted service for several years and provide preconfigured systems or instructions for configuring your own Linux hardware at no additional charge. This methodology can be used if the secondary machine is located at a different facility; however DRDB proxy should be used in this case. INFRASTRUCTURE REDUNDANCY In addition to the application server redundancy detailed above, the infrastructure at the co-location facility is configured for high-availability as detailed below: The Ethernet drop to the central switch is redundant and configured for hot-swap failover in the event that the co-location facility s router should fail. The server blades are redundant and are connected via ISCSI to a hot-swap redundant file servers. The file servers are configured with redundant power supplies and RAID 10 hard drives with hotswap spares. The blade server and file servers use EEC RAM. The firewalls are redundant and configured for hotswap failover. he central switch is redundant and configured for hot-swap failover. All components are subject to active monitoring and connected to remote power supplies. PERFORMANCE ANALYSIS System resources are only used when the user actually does something, such as creating or editing a record. As is typical for HTML based web products, the overhead from passive users is almost zero. For example, when users are filling out a web form, they are entirely passive from the perspective of the system and only become active on clicking the Finish button. This is important because it means that performance is not based on how many users there are, but how many tickets those users are creating/editing per minute. The average member of a sales team or support staff works on 3-8 records per day. If we assume that all users are at the top end of this range and work on 8 records per day, we would expect a sales/ support staff with 1,000 full time agents to generate or edit 8,000 records per day, significantly less than our entry level system can handle in half an hour. Of course, it is necessary to add a safety factor for sudden peak loads, and ideally CPU utilization should be less than 25%, so we recommend quad CPU configurations for such customers. This analysis is based on typical configurations, but the system supports creation of an unlimited number of business rules, each of which runs in the foreground or as a scheduled background task to call external scripts, generate emails, update related records and carry out other operations necessary to automate the business. The overhead from such processes can be significant, so for very large companies with highly sophisticated business rules, we support the IBM P-Series with up to 64 processors. Although few enterprises will actually need more than a couple of machines configured with dual Intel X5560 CPU s, support for the P-Series guarantees that plenty of headroom is available if the need arises. Another aspect of scalability is the size of knowledgebase that can be supported while

maintaining good performance. For common operations such as finding records containing certain text and editing them, scalability is almost unlimited. The following video demonstrates searching a table containing over 10,000,000 user records for some text (17 such records are found), editing one of them and saving the result. From usage analysis on our ASP servers, we know that about 50% of CPU resources are spent submitting new records such as tickets and emails or editing existing ones, 20% is spent generating charts/reports, 15% is spent searching for information and 15% is spent on miscellaneous tasks such as changing tabs, logging in, viewing history information, creating saved searches, changing views, etc. Subject to the points raised in the Performance Analysis section, Agiloft can handle creation of over 3,800 tickets per minute on a system configured with dual E5530 CPU s, and can handle creation of over 1,320 tickets per minute on a system configured with dual Xeon E5440 CPU s, regardless of the number of logged in users or existing tickets. For very high load systems, capable of handling over 3,000 tickets per minute, we therefore recommend dual i7 CPU s, such as E5530 s coupled with SSD hard drives. For high availability, we recommend use of a hot-swap redundant server with replication and failover provided by DRDB and heartbeat. CONCLUSIONS As detailed in figures 2a and 2b, for systems with two or more CPU cores, throughput increases by a factor of 1.5 to 1.9 each time that the number of active CPU cores is doubled. This is within 25% of the theoretical maximum scalability of 2 and indicates that the system is free of bottlenecks. As detailed in figures 3a, 3b, and 5, testing with over 10,000,000 records and 100,000 active users demonstrates that the number of records and the number of concurrent users does not place a measurable load on the system. Performance is only affected by the number of records that are actually being edited at any one time. ABOUT AGILOFT Over 3 million users at organizations ranging from small enterprises to U.S Government agencies and Fortune 100 companies depend on Agiloft s top rated product suites for Contract Management, Service Desk, Custom Workflow, and more. Agiloft specializes in automating processes that are too complex for competing vendors. Our best practice templates and agile technology ensure rapid deployment and a fully extensible system. For more information, visit https://www.agiloft.com. Agiloft, Inc. 460 Seaport Court, Suite 200 Redwood City, CA 94063 United States 650-587-8615 1-888-727-2209 info@agiloft.com https://www.agiloft.com