Benchmark Performance Results for Pervasive PSQL v11. A Pervasive PSQL White Paper September 2010

Similar documents
SCALING UP VS. SCALING OUT IN A QLIKVIEW ENVIRONMENT

Performance Pack. Benchmarking with PlanetPress Connect and PReS Connect

Why Parallel Architecture

Understanding the performance of an X user environment

Sage 300 ERP. Compatibility Guide Version Revised: Oct 1, Version 6.0 Compatibility Guide i

CA Single Sign-On. Performance Test Report R12

Multithreading: Exploiting Thread-Level Parallelism within a Processor

Entuity Network Monitoring and Analytics 10.5 Server Sizing Guide

Parallel Computing. Parallel Computing. Hwansoo Han

Software within building physics and ground heat storage. HEAT3 version 7. A PC-program for heat transfer in three dimensions Update manual

System recommendations for version 17.1

QLIKVIEW SCALABILITY BENCHMARK WHITE PAPER

Evaluation Report: Improving SQL Server Database Performance with Dot Hill AssuredSAN 4824 Flash Upgrades

System recommendations for version 17.1

Sage ERP Accpac. Compatibility Guide Version 6.0. Revised: November 18, 2010

IBM InfoSphere Streams v4.0 Performance Best Practices

Cube Base Reference Guide Cube Base CUBE BASE VERSION 6.4.4

Evaluation Report: HP StoreFabric SN1000E 16Gb Fibre Channel HBA

Sage ERP Accpac. Compatibility Guide Version 6.0. Revised: February 2, Version 6.0 Compatibility Guide

Accelerating Microsoft SQL Server 2016 Performance With Dell EMC PowerEdge R740

LS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance

Virtualizing SQL Server 2008 Using EMC VNX Series and VMware vsphere 4.1. Reference Architecture

What s New in Pervasive PSQL

Sage ERP MAS 200 SQL Version 4.50 Supported Platform Matrix Revised as of July 24, 2014

Terminal Services Scalability Study

Comparing Software versus Hardware RAID Performance

Oracle Database 12c: JMS Sharded Queues

CS 31: Intro to Systems Threading & Parallel Applications. Kevin Webb Swarthmore College November 27, 2018

Enhancing Analysis-Based Design with Quad-Core Intel Xeon Processor-Based Workstations

PAC094 Performance Tips for New Features in Workstation 5. Anne Holler Irfan Ahmad Aravind Pavuluri

SAS Enterprise Miner Performance on IBM System p 570. Jan, Hsian-Fen Tsao Brian Porter Harry Seifert. IBM Corporation

Difference Engine: Harnessing Memory Redundancy in Virtual Machines (D. Gupta et all) Presented by: Konrad Go uchowski

Parallelism and Concurrency. COS 326 David Walker Princeton University

TPC-E testing of Microsoft SQL Server 2016 on Dell EMC PowerEdge R830 Server and Dell EMC SC9000 Storage

X300 Seven User Environment

Four-Socket Server Consolidation Using SQL Server 2008

Best Practices for Setting BIOS Parameters for Performance

Virtuozzo Hyperconverged Platform Uses Intel Optane SSDs to Accelerate Performance for Containers and VMs

Sage ERP Accpac. Compatibility Guide Versions 5.5 and 5.6. Revised: November 18, Compatibility Guide for Supported Versions

Microsoft SQL Server in a VMware Environment on Dell PowerEdge R810 Servers and Dell EqualLogic Storage

Two hours - online. The exam will be taken on line. This paper version is made available as a backup

White Paper. Executive summary

Low Latency Evaluation of Fibre Channel, iscsi and SAS Host Interfaces

SQL Server 2005 on a Dell Scalable Enterprise Foundation

The Arrival of Affordable In-Memory Database Management Systems

Performance of DB2 Enterprise-Extended Edition on NT with Virtual Interface Architecture

RIGHTNOW A C E

Scaling PortfolioCenter on a Network using 64-Bit Computing

ARTSYL DOCALPHA INSTALLATION GUIDE

Parallel Computer Architecture

Sage 100 Premium Version 2017 Supported Platform Matrix Created as of October 28, 2016

CSCI-GA Multicore Processors: Architecture & Programming Lecture 10: Heterogeneous Multicore

Large-Scale Network Simulation Scalability and an FPGA-based Network Simulator

QLE10000 Series Adapter Provides Application Benefits Through I/O Caching

CS 590: High Performance Computing. Parallel Computer Architectures. Lab 1 Starts Today. Already posted on Canvas (under Assignment) Let s look at it

Intel Enterprise Edition Lustre (IEEL-2.3) [DNE-1 enabled] on Dell MD Storage

CIS 601 Graduate Seminar. Dr. Sunnie S. Chung Dhruv Patel ( ) Kalpesh Sharma ( )

docalpha Installation Guide

Accelerating Enterprise Search with Fusion iomemory PCIe Application Accelerators

Lessons learned implementing a multithreaded SMB2 server in OneFS

Sage MAS 90 Extended Enterprise Suite Version 1.4 Supported Platform Matrix Revised as of March 11, 2010

CUDA GPGPU Workshop 2012

Utilizing the IOMMU scalably

Advanced RDMA-based Admission Control for Modern Data-Centers

Parallel Programming Multicore systems

Operating System Performance and Large Servers 1

Cisco Prime Home 6.X Minimum System Requirements: Standalone and High Availability

SAP SD Benchmark with DB2 and Red Hat Enterprise Linux 5 on IBM System x3850 M2

Performance of Multicore LUP Decomposition

SUPERMICRO, VEXATA AND INTEL ENABLING NEW LEVELS PERFORMANCE AND EFFICIENCY FOR REAL-TIME DATA ANALYTICS FOR SQL DATA WAREHOUSE DEPLOYMENTS

HP and CATIA HP Workstations for running Dassault Systèmes CATIA

MobiLink 10 Performance. A whitepaper from Sybase ianywhere. Author: Graham Hurst. Date: March 2009

Free up rack space by replacing old servers and storage

The Central Processing Unit

Accelerating String Matching Algorithms on Multicore Processors Cheng-Hung Lin

AMD: WebBench Virtualization Performance Study

Technical Brief: Specifying a PC for Mascot

W H I T E P A P E R U n l o c k i n g t h e P o w e r o f F l a s h w i t h t h e M C x - E n a b l e d N e x t - G e n e r a t i o n V N X

IBM and HP 6-Gbps SAS RAID Controller Performance

Lowering Cost per Bit With 40G ATCA

Performance Optimizations via Connect-IB and Dynamically Connected Transport Service for Maximum Performance on LS-DYNA

STAR Watch Statewide Technology Assistance Resources Project A publication of the Western New York Law Center,Inc.

The dark powers on Intel processor boards

Data Protection at Cloud Scale. A reference architecture for VMware Data Protection using CommVault Simpana IntelliSnap and Dell Compellent Storage

Improving Performance and Power of Multi-Core Processors with Wonderware and System Platform 3.0

Trend Micro Core Protection Module 10.6 SP1 System Requirements

Profiling Grid Data Transfer Protocols and Servers. George Kola, Tevfik Kosar and Miron Livny University of Wisconsin-Madison USA

Enabling Efficient and Scalable Zero-Trust Security

32 Hyper-Threading on SMP Systems

L-Series 30 User Environment

Performance Baseline of Hitachi Data Systems UCP for Oracle

Benchmark Tests of Asterisk as a B2BUA

One Solution. training guide. Getting started with Clarity Professional v5

L-series 30-user environment

Scaling PostgreSQL on SMP Architectures

A Comparative Performance Evaluation of Different Application Domains on Server Processor Architectures

WHITE PAPER AGILOFT SCALABILITY AND REDUNDANCY

Benefits of Automatic Data Tiering in OLTP Database Environments with Dell EqualLogic Hybrid Arrays

IBM Emulex 16Gb Fibre Channel HBA Evaluation

Whitepaper / Benchmark

Transcription:

Benchmark Performance Results for Pervasive PSQL v11 A Pervasive PSQL White Paper September 2010

Table of Contents Executive Summary... 3 Impact Of New Hardware Architecture On Applications... 3 The Design of Pervasive PSQL v11... 4 Configurations... 6 Memory Caches... 7 Benchmark Testing... 7 Atomics Testing... 8 Conclusion... 8 2

Executive Summary Advances in hardware technology previously meant advances in processing speed. Existing applications just ran faster with no extra effort to enhance them. Today, advances in computing technology mean increased parallelism and not increased clock speeds. Multiple cores provide the avenue for increased parallelism, but applications must be written with parallelized code to take advantage of multiple cores simultaneously. Pervasive PSQL v11 Server is specifically designed to increase scalability and performance on multi-core machines in multiclient environments. This paper discusses the performance results from benchmark tests on Pervasive PSQL v11 Server. Test comparisons are shown between Pervasive PSQL v11 and the latest release in the 10.x family, Pervasive PSQL v10 SP3. On multi-core machines with multiple clients accessing data, the performance of Pervasive PSQL v11 significantly exceeds that of Pervasive PSQL v10 SP3, sometimes over 300%. Impact Of New Hardware Architecture On Applications Multi-core machines are now the norm. When you upgrade your singlecore machines, your applications will have to function with a different hardware architecture. And for many applications, the multicore architecture dramatically changes how the application performs. As we explore this deeper, consider the following performance trend when Pervasive PSQL v10 Server moves from a single-core machine to a multi-core machine. Figure 1. Performance of Pervasive PSQL v10 Server on Single-core and Multi-core Machines 3

As you can see, performance decreases with more client sessions. This behavior is not limited just to Pervasive PSQL v10 Server. Any complex multi-client application not specifically designed to take advantage of multi-core architecture and many applications originated during the single-core era will likely experience performance drops on multi-core machines. Why? The technical reasons are complex and thoroughly explained in the white paper The Multi-core Dilemma by Dan Woods, CTO of CITO Research. But a brief summary here will clarify why it may seem counterintuitive that multithreaded applications would run slower on multi-core systems. In a multithreaded application that shares data, the synchronization between threads consumes a lot of system resources on a multi-core machine. In fact, shared data is the bane of parallel computing. If multiple threads access the same data, access must be synchronized. And when you synchronize access to the data, that portion of code cannot be executed by more than one thread and therefore it is not concurrent. This section of the code then becomes the single door that everyone must line up to go through. Caching does not improve this problem; it exacerbates it. If multiple cores or processors have caches that point to the same data and one core modifies the data, the cached data on the other core is no longer valid, and the caches must be synchronized. The accumulated overhead for all the synchronization of these operations is significant; ultimately it means that performance on multiple cores can be worse than on a single core machine that does not require such synchronization. Where possible, each core should work on separate data; otherwise, there is overhead associated with synchronization and that overhead can slow down performance significantly. The Design of Pervasive PSQL v11 Pervasive PSQL v11 has been architected to provide parallel threads performing similar activities. The gains in increased parallel processing improve the throughput to the point that multiple processors are engaged. The result is that performance of the database engine increases in multi-core environments with multiple clients accessing a central server. Pervasive PSQL v11 also provides enhancements to the low-level synchronizations mechanisms in the transactional interface. Multiple users can read the same cached file pages simultaneously and their operations can proceed on independent server CPUs. Non-user activity such as checkpoints and log management can also use additional server CPUs. 4

The scalability of Pervasive PSQL v11 has been enhanced through architecture designs made specifically for multi-core hardware. For example, multiple users accessing independent files can proceed on independent server CPUs. The database engine can also handle higher user loads with less overhead, resulting in steadier throughput. How do those design changes improve performance? Let s compare the performance of Pervasive PSQL v11 on multi-core machines with that of Pervasive PSQL v10. Just as in Figure 1, the results are based on the transactional interface accessing 16 files fully cached in memory. 5

Figure 2. Performance of Pervasive PSQL v11 Server and Pervasive PSQL v10 Server on Multicore Machines As you can see, the architecture changes in Pervasive PSQL v11 provide a significant increase in performance. And the best news: your multiclient application can benefit from this increased performance without requiring you to recompile or rearchitect your code. Configurations The performance tests discussed in this paper were conducted using the configurations described in the following table. Table 1. Configurations Used for Benchmark Tests Server Machine Processors Dual Intel Xeon CPU E5420 @ 2.50 GHz, 4 Cores Server Machine Total Cores 8 * Server Machine Total Memory 16 GB Server Machine Operating System Microsoft 2008 Enterprise Server (Service Pack 2), 64-bit Server Software Pervasive PSQL Server Settings Pervasive PSQL v11, Pervasive PSQL v10 SP3 Installed with defaults Client Machine Processors Single Intel Core2 Quad CPU Q9400 @ 2.66 GHz, 4 Core (Two physical machines running against Pervasive PSQL Server with client sessions split evenly between the client machines.) Client Machine Total Cores 4 (each machine) * Client Machine Total Memory 4 GB (each machine) Client Machine Operating System Microsoft XP (Service Pack 3) 32-bit Client Software Pervasive PSQL v11 Client to Pervasive PSQL v11 Server Pervasive PSQL v10 SP3 Client to Pervasive PSQL v10 SP3 Server Pervasive PSQL Client Settings Installed with defaults * Since commodity hardware today is typically 4 or 8 core machines, Pervasive Software chose such machines as configurations realistic to what our customers will experience. As the number of cores increases in commodity machines, Pervasive PSQL will 6

continue to be optimized based on such changes. Memory Caches Pervasive PSQL contains two primary caches, Level1 and Level2, termed L1 and L2 respectively. The L1 cache is a fixed size based on the Cache Allocation Size setting. It does not expand or contract based on database operations. The benchmark tests show results for data fully cached in L1. Benchmark Testing To demonstrate the effect of different client session loads, Pervasive Software ran a test harness with a standard installation of Pervasive PSQL on the server and on the client (see Configuration section above). The test harness simulates a processing load similar to the industry standard TPC-B benchmark. A single transaction performs a READ and UPDATE on a record in each of three files, then performs an INSERT of a record to a fourth file. For details about TCP-B from the Transaction Processing Performance Council, see http://www.tpc.org/tpcb/default.asp. Figure 3. Performance Results for Transactional Interface (Btrieve) Four Files Fully Cached in L1 Sixteen Files Fully Cached in L1 Percent increase in performance of Pervasive PSQL v11 by client session: 8 10% 16 86% 32 206% 64 297% 96 347% 8 6% 16 77% 32 191% 64 302% 96 328% 7

Figure 4. Performance Results for Relational Interface (ODBC) Four Tables Fully Cached in L1 Sixteen Tables Fully Cached in L1 Percent increase in performance of Pervasive PSQL v11 by client session: 8 79% 16 136% 32 240% 64 292% 96 326% 8 69% 16 116% 32 232% 64 284% 96 304% Atomics Testing In addition to the TCP-B benchmark tests, atomics testing of READ and UPDATE actions were performed. The atomics results on multi-core machines are similar to the TCP-B results. They show that Pervasive PSQL v11 outperforms Pervasive PSQL v10 SP3. If you would like to receive the results of the atomics tests, e-mail database@pervasive.com or telephone 1 800 287 4383. Conclusion The numbers speak for themselves. Pervasive PSQL v11 Server demonstrates increased performance over Pervasive PSQL v10 Server on multi-core machines. The majority of increases are over 100%, with some exceeding 300%. The more data files involved, the better the performance as the increased parallel processing in Pervasive PSQL v11 engages multiple processors. Multiple users can access independent files as well as read the same cached file pages simultaneously while such operations proceed on independent server CPUs. In contrast, the performance of Pervasive PSQL v10 Server decreases on multi-core machines as client sessions increase. Any complex multiuser application that originated during the single-core era will likely experience performance drops on multi-core machines. The numerous 8

design considerations required for optimal performance on multi-core hardware simply could not have been anticipated at the time. Multi-core support is a primary feature of Pervasive PSQL v11 because the new multi-core hardware dramatically changes how most applications perform. That support is of primary importance to you as you transition your multi-client applications into multi-core environments. 9