Improving Altibase Performance with Solarflare 10GbE Server Adapters and OpenOnload

Similar documents
OpenOnload. Dave Parry VP of Engineering Steve Pope CTO Dave Riddoch Chief Software Architect

A Low Latency Solution Stack for High Frequency Trading. High-Frequency Trading. Solution. White Paper

Solarflare and OpenOnload Solarflare Communications, Inc.

10Gb Ethernet: The Foundation for Low-Latency, Real-Time Financial Services Applications and Other, Latency-Sensitive Applications

FPGA Augmented ASICs: The Time Has Come

Oracle Exadata: Strategy and Roadmap

RoCE vs. iwarp Competitive Analysis

Mellanox InfiniBand Solutions Accelerate Oracle s Data Center and Cloud Solutions

Application Acceleration Beyond Flash Storage

Network Design Considerations for Grid Computing

QuickSpecs. HP Z 10GbE Dual Port Module. Models

PCI Express x8 Single Port SFP+ 10 Gigabit Server Adapter (Intel 82599ES Based) Single-Port 10 Gigabit SFP+ Ethernet Server Adapters Provide Ultimate

Chelsio Communications. Meeting Today s Datacenter Challenges. Produced by Tabor Custom Publishing in conjunction with: CUSTOM PUBLISHING

Pivot3 Acuity with Microsoft SQL Server Reference Architecture

Implementing SQL Server 2016 with Microsoft Storage Spaces Direct on Dell EMC PowerEdge R730xd

IBM WebSphere MQ Low Latency Messaging Software Tested With Arista 10 Gigabit Ethernet Switch and Mellanox ConnectX

Introduction to OpenOnload Building Application Transparency and Protocol Conformance into Application Acceleration Middleware

Architecture of a Real-Time Operational DBMS

ARISTA: Improving Application Performance While Reducing Complexity

DataON and Intel Select Hyper-Converged Infrastructure (HCI) Maximizes IOPS Performance for Windows Server Software-Defined Storage

Spark Over RDMA: Accelerate Big Data SC Asia 2018 Ido Shamay Mellanox Technologies

TPC-E testing of Microsoft SQL Server 2016 on Dell EMC PowerEdge R830 Server and Dell EMC SC9000 Storage

FlashGrid Software Enables Converged and Hyper-Converged Appliances for Oracle* RAC

CLUSTERING HIVEMQ. Building highly available, horizontally scalable MQTT Broker Clusters

Meltdown and Spectre Interconnect Performance Evaluation Jan Mellanox Technologies

Performance Optimizations via Connect-IB and Dynamically Connected Transport Service for Maximum Performance on LS-DYNA

Diffusion TM 5.0 Performance Benchmarks

Extremely Fast Distributed Storage for Cloud Service Providers

Microsoft SQL Server in a VMware Environment on Dell PowerEdge R810 Servers and Dell EqualLogic Storage

Benefits of 25, 40, and 50GbE Networks for Ceph and Hyper- Converged Infrastructure John F. Kim Mellanox Technologies

Chelsio 10G Ethernet Open MPI OFED iwarp with Arista Switch

HCI: Hyper-Converged Infrastructure

Netronome 25GbE SmartNICs with Open vswitch Hardware Offload Drive Unmatched Cloud and Data Center Infrastructure Performance

Looking ahead with IBM i. 10+ year roadmap

Memcached Design on High Performance RDMA Capable Interconnects

Solace JMS Broker Delivers Highest Throughput for Persistent and Non-Persistent Delivery

SR-IOV Support for Virtualization on InfiniBand Clusters: Early Experience

Oracle Exadata X7. Uwe Kirchhoff Oracle ACS - Delivery Senior Principal Service Delivery Engineer

The rcuda middleware and applications

Oracle TimesTen Scaleout: Revolutionizing In-Memory Transaction Processing

Oracle TimesTen In-Memory Database 18.1

Persistent Memory. High Speed and Low Latency. White Paper M-WP006

Future Routing Schemes in Petascale clusters

Cavium FastLinQ 25GbE Intelligent Ethernet Adapters vs. Mellanox Adapters

Much Faster Networking

InfiniBand Networked Flash Storage

Sun Lustre Storage System Simplifying and Accelerating Lustre Deployments

EMC Business Continuity for Microsoft SharePoint Server (MOSS 2007)

Intel Enterprise Edition Lustre (IEEL-2.3) [DNE-1 enabled] on Dell MD Storage

TIBCO, HP and Mellanox High Performance Extreme Low Latency Messaging

SNIA Developers Conference - Growth of the iscsi RDMA (iser) Ecosystem

Accelerating Hadoop Applications with the MapR Distribution Using Flash Storage and High-Speed Ethernet

The NE010 iwarp Adapter

Reduce Costs & Increase Oracle Database OLTP Workload Service Levels:

Whitepaper / Benchmark

Next-Generation Cloud Platform

Birds of a Feather Presentation

Scaling Internet TV Content Delivery ALEX GUTARIN DIRECTOR OF ENGINEERING, NETFLIX

Upgrade to Microsoft SQL Server 2016 with Dell EMC Infrastructure

Oracle Database Mobile Server, Version 12.2

NVMe over Universal RDMA Fabrics

Intel PRO/1000 PT and PF Quad Port Bypass Server Adapters for In-line Server Appliances

Dell EMC Ready Bundle for HPC Digital Manufacturing Dassault Systѐmes Simulia Abaqus Performance

A-GEAR 10Gigabit Ethernet Server Adapter X520 2xSFP+

Advanced Computer Networks. End Host Optimization

A Plugin-based Approach to Exploit RDMA Benefits for Apache and Enterprise HDFS

Introduction to Ethernet Latency

N V M e o v e r F a b r i c s -

MySQL & NoSQL: The Best of Both Worlds

LAMMPS and WRF on iwarp vs. InfiniBand FDR

Performance and Scalability with Griddable.io

TrafficDB: HERE s High Performance Shared-Memory Data Store Ricardo Fernandes, Piotr Zaczkowski, Bernd Göttler, Conor Ettinoffe, and Anis Moussa

IBM Power 9 надежная платформа для развертывания облаков. Ташкент. Юрий Кондратенко Cross-Brand Sales Specialist

LAMMPS, LS- DYNA, HPL, and WRF on iwarp vs. InfiniBand FDR

DriveScale-DellEMC Reference Architecture

LS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance

MongoDB on Kaminario K2

打造 Linux 下的高性能网络 北京酷锐达信息技术有限公司技术总监史应生.

VPI / InfiniBand. Performance Accelerated Mellanox InfiniBand Adapters Provide Advanced Data Center Performance, Efficiency and Scalability

Learn Your Alphabet - SRIOV, NPIV, RoCE, iwarp to Pump Up Virtual Infrastructure Performance

Solarflare X2522 network adapter Quick Start Guide

Accelerating Microsoft SQL Server 2016 Performance With Dell EMC PowerEdge R740

Scaling to Petaflop. Ola Torudbakken Distinguished Engineer. Sun Microsystems, Inc

Evaluation of the Chelsio T580-CR iscsi Offload adapter

Fast Lanes and Speed Bumps on the Road To Higher-Performance, Higher-Speed Networking

S. Narravula, P. Balaji, K. Vaidyanathan, H.-W. Jin and D. K. Panda. The Ohio State University

NVMFS: A New File System Designed Specifically to Take Advantage of Nonvolatile Memory

Solace Message Routers and Cisco Ethernet Switches: Unified Infrastructure for Financial Services Middleware

Apache Spark Graph Performance with Memory1. February Page 1 of 13

Oracle Enterprise Architecture. Software. Hardware. Complete. Oracle Exalogic.

Use of the Internet SCSI (iscsi) protocol

NEC Express5800 R320f Fault Tolerant Servers & NEC ExpressCluster Software

Advances of parallel computing. Kirill Bogachev May 2016

QLE10000 Series Adapter Provides Application Benefits Through I/O Caching

Oracle Database 11g Direct NFS Client Oracle Open World - November 2007

Comparing Ethernet & Soft RoCE over 1 Gigabit Ethernet

RACKSPACE ONMETAL I/O V2 OUTPERFORMS AMAZON EC2 BY UP TO 2X IN BENCHMARK TESTING

Data Center & Cloud Computing DATASHEET. PCI Express x8 Single Port SFP+ 10 Gigabit Server Adapter. Flexibility and Scalability in Data Centers

Data Modeling and Databases Ch 14: Data Replication. Gustavo Alonso, Ce Zhang Systems Group Department of Computer Science ETH Zürich

Oracle Database Exadata Cloud Service Exadata Performance, Cloud Simplicity DATABASE CLOUD SERVICE

Transcription:

Improving Altibase Performance with Solarflare 10GbE Server Adapters and OpenOnload Summary As today s corporations process more and more data, the business ramifications of faster and more resilient database management systems can be staggering. Tens of processing hours can result in the loss of hundreds of millions of dollars and system outages can cause severe revenue loss, security threats, and eroded customer confidence leading to additional lost business. The Altibase HDB database addresses these needs with higher performance and availability. To demonstrate significantly improved performance using Solarflare 10GbE server adapters and OpenOnload application acceleration middleware, Altibase conducted benchmark and replication performance tests. The results showed that database benchmark performance increased 38% using Solarflare SFN7122F 10GbE server adapters and the Linux kernel network driver compared with the performance using 1GbE server adapters. When using OpenOnload with the SFN7122F adapters, database benchmark performance increased an additional 20%. High availability performance testing with asynchronous replication and the Solarflare SFN7122F 10GbE server adapters and the Linux kernel network driver resulted in a 21% increase compared with the performance using 1GbE server adapters. When using OpenOnload with the SFN7122F adapters, asynchronous replication performance increased an additional 27%. Additional asynchronous replication testing verified that OpenOnload performance also scales with parallel replication. The following Solution Brief documents the testing configurations and methodologies. The benchmarks conclusively demonstrated that the performance of Altibase HDB s asynchronous replication improved drastically when using Solarflare SFN7122F 10GbE server adapters with OpenOnload application acceleration middleware. Background Altibase Since 1999, Altibase has pioneered in-memory databases and in-memory computing with a rich installation base consisting of over 550 global enterprise clients and thousands of mission critical deployments. As the go-to source for mission critical in-memory for the fastest databases (IMDBMS or MMDBMS), Altibase is the only company to offer a feature rich, ACID compliant database that combines the unparalleled speed of in-memory computing along with the economical scalability of traditional disks in a single unified engine.

2 Altibase HDB The Altibase HDB is a hybrid database that combines an in-memory database and on-disk database in a single uniform engine and interface, enabling real-time access to large volumes of data while simultaneously simplifying and revolutionizing high-intensity data processing. While Altibase HDB is SQL standards compliant and adheres to all fundamental principles of traditional relational databases, its unique hybrid architecture allows users to enhance performance significantly when needed. HDB is several times faster than the leading in-memory only databases and 10 times faster than conventional on-disk databases. HDB combines several key services such as high-availability, fault tolerance and load balancing with its replication feature that maintains an up-to-date backup of the database on an active server. In the event the primary server becomes unavailable, services immediately resume from an identical database on an alternate server for a non-stop operating environment. The replication is a TCP/IP based service, supports up to 32-way replication, and is available with a choice of two modes of replication: asynchronous and synchronous. Asynchronous Mode Replication Asynchronous mode focuses on high performance and is the default mode of Altibase HDB replication. The term asynchronous signifies that the local server does not wait until a remote server is done applying a transaction. Figure 1 below illustrates the normal flow of tasks performed in asynchronous mode. Application DML (1) Query Processor Query Processor Storage Manager Storage Manager (2) (3) Sender XLOG Receiver (4) Transaction Log XSN Meta XSN Meta Transaction Log Local HDB Server Remote HDB Server Figure 1. Asynchronous Mode Replication Task Flow.

3 Asynchronous Mode Replication Task Flow (previous page) 1. An application transaction occurs on a local server (master transaction), and a DML statement is executed on a table that is a target for replication. 2. Altibase HDB applies the changes to the data from the master transaction and creates the transaction log records. 3. The sender thread collects transaction logs recorded by the master transaction, converts them into XLOGs, records the XSN in the replication metadata table, and sends XLOGs out to the receiver thread on the remote server. On the local server, Altibase HDB commits the master transaction, and continues processing other transactions without waiting for any information related to the replication. 4. The receiver thread receives the XLOGs and commits the replicated transaction to its database instance on the remote server. Synchronous Mode Replication While asynchronous mode focuses on high performance, synchronous mode puts the focus on data consistency. Synchronous mode keeps all replication targets synchronized by updating all nodes as part of a single atomic transaction. In synchronous mode, when a master transaction occurs on a local server, the local server commits the transaction only after it has received confirmation that each of the change logs has been properly applied on the remote server(s). Synchronous mode guarantees 100% data integrity and is recommended if high availability of a database is the main focus, and an Active-Active architecture implementation is planned. Figure 2 illustrates the normal flow tasks performed in synchronous mode replication. Application DML (1) Query Processor Query Processor Storage Manager Storage Manager (2) (6) Sender (3) ACK XLOG (5) Receiver (4) Transaction Log XSN Meta XSN Meta Transaction Log Local HDB Server Remote HDB Server Figure 2. Synchronous Mode Replication Task Flow.

4 Solarflare Solarflare is the leading provider of application-intelligent networking I/O software and hardware that accelerate, monitor and secure network data, and is the pioneer in high-performance, lowlatency 10/40GbE server networking solutions. The company s products are widely used in scale-out server environments such as database, grid, electronic trading, high performance computing, cloud, virtualization and big data. Solarflare Flareon Server I/O Adapters Solarflare Flareon 10/40GbE PCIe 3.0 server I/O adapters deliver industry-leading message rates with the lowest latency and jitter over standard Ethernet along with low CPU-utilization, enabling the industry s best performance and scalability for enterprise data center environments including database applications. Figure 3. Solarflare Flareon Ultra SFN7122F 10GbE PCIe 3.0 Server I/O Adapter. OpenOnload OpenOnload application acceleration middleware is an accelerated network stack that provides acceleration of TCP- and UDP-based applications. OpenOnload comprises a user-level shared library that implements the protocol stack and a supporting kernel module. By operating in user space, OpenOnload dramatically reduces CPU interrupts, data copies and context switching resulting in reduced latency and higher message rates. OpenOnload is binary compatible with the industry standard BSD Sockets API, thereby providing acceleration with no need to run a new protocol such as RDMA, RoCE or iwarp, on the wire. OpenOnload is ideal for applications that benefit from lower latency (with decreasing jitter) and higher throughput, including financial services, geosciences, biosciences, weather and other HPC applications, and high performance databases. Figure 4. OpenOnload.

5 Test Objective Altibase conducted a series of performance tests with two separate test beds to validate that: Solarflare Flareon 10GbE server adapters and OpenOnload can improve benchmark and replication performance with Altibase HDB, and Solarflare OpenOnload performance improvement can scale with the parallel replication option Test Bed A Table 1. System Under Test Component Server OS Network Adapters Description Intel Xeon E5 processor@2.4ghz and 6 cores CentOS release 6.3 (final) with Linux kernel 2.6.32-279.el6.x86_64 Altibase HDB version 6.3.1.2.2 1 Gigabit adapter Solarflare Flareon Ultra SFN7122F 10GbE SFP+ server adapter with OpenOnload Configuration - The SUT was configured as both the active database and the clients - Logging is to disk Test Codes - Standard BMT - AT-P26-Replication - AT-P55-ReplicationSync Standard BMT Benchmark The BMT benchmark tests the performance characteristics of Altibase HDB on the testing environment. The BMT test consists of a simple C program that performs a user provided number of SELECT, INSERT, UPDATE and DELETE operations against the respective database.

6 Schema create table tb_test1 ( c1 integer primary key, c2 char(500), c3 integer, c4 char(500), c5 char(500), c6 char(500), c7 char(500), c8 char(500), c9 integer, c10 date ); Create index TB_TEST1_IDX on TB_TEST1 (c1, c3); Test Bed A Test Results Standard BMT Benchmark For this benchmark: - 8 clients were used - INSERT and DELETE of tables were performed - Transaction: Commit/10 statements - Results are in TPS (Transactions per Second) - 1G: 1Gigabit Ethernet adapter - 10G: Solarflare SFN7122F 10GbE adapter - Onload: SFN7122F with OpenOnload 1G 10G Onload Ratio 10G vs. 1G Ratio Onload vs. 1G Insert 45,119 60,127 69,105 133% 153% Delete 52,180 72,113 86,493 138% 166% 90,000 80,000 70,000 Insert Delete 60,000 50,000 40,000 30,000 20,000 10,000 0 1G 10G Figure 5. Test bed A Standard BMT Benchmark Results. Onload

7 As shown in the results, the Flareon SFN7122F network adapter showed a performance increase of 33% for INSERT and 38% for DELETE compared with using a 1 Gigabit Ethernet adapter. When OpenOnload kernel bypass middleware was used, the Flareon SFN7122F performance improvement increased to 53% for INSERT and 66% for DELETE compared with the performance using a 1 Gigabit Ethernet adapter. Replication Test Codes One of Altibase HDB s key features is its built in replication engine. This feature can be utilized both for load balancing and high availability purposes. AT-P26-Replication Schema create table tb_test1 ( c1 integer primary key, c2 char(10), c3 integer, c4 char(255), c5 char(255), c6 char(255), c7 char(10), c8 char(10), c9 integer, c10 date ); Primary key(c1, c2); T1_idx on t1(c10); For this benchmark: - 8 clients were used - INSERT and DELETE of tables were performed - Transaction: Commit/10 statements - Results are in TPS (Transactions per Second) - 1G: 1Gigabit Ethernet adapter - 10G: Solarflare SFN7122F 10GbE adapter - Onload: SFN7122F with OpenOnload 1G 10G Onload Insert TPS 81,623 83,847 82,974 Replication TPS 36,372 43,841 55,776 Replication Performance Ratio 1G vs. 10G 45% 100% 52% 121% 67% 1G vs. Onload 100% 153%

8 Client 7 Insert TPS Replication TPS 90,000 80,000 70,000 60,000 50,000 40,000 30,000 20,000 10,000 0 1G 10G Onload AT-P55-Replication Figure 6. Test bed A AT-P26-Replications Standby Server Test. As shown in the results, the Flareon SFN7122F 10GbE server adapters provided a 21% asynchronous replication performance improvement compared with using 1GbE server adapters. When using OpenOnload with the SFN7122F adapters, performance improvement increased to 53% compared with 1GbE server adapters. Schema create table t1( c1 integer, c2 char(10), c3 integer, c4 char(255), c5 char(255), c6 char(255), c7 char(10), c8 char(10), c9 integer, c10 date ); Primary key(c1, c2); T1_iex on t1(c10); For this benchmark: - Data: 10 tables with 100,000 records per table - Transaction: Commit / 50000 record insert - 1G: 1 Gigabit Ethernet adapter - 10G: Solarflare SFN7122F 10GbE adapter - Onload: SFN7122F with OpenOnload

9 Thread 1G 10G Onload Ratio Onload vs 1G 1 8 82,386 90,090 82,339 163,949 87,958 163,618 107% 182% Table 2. Test bed A AT-P55-Replication Test Results. Unit: RPS (replicated records number per second) There was a large performance improvement when using the SFN7122F 10GbE server adapter. But there were no observable performance improvements when using the SFN7122F with OpenOnload. This implies that the performance increase was due to increased bandwidth. Test Bed B Target Server Server OS Description Intel Xeon E5-2605 processor @ 2.4GHz and 8 cores with 64GB memory CentOS 2.6.32 Altibase HDB version 6.5.1.0.0 Client Server OS Network Solarflare Intel Description Intel Xeon E5603 processor @ 2.53GHz and 16 cores with 64GB memory Linux 2.6.18 Description Flareon Ultra SFN7122F 10GbE SFP+ server adapter with OpenOnload 1GbE Adapter Table 3. Test bed B System Under Test. Configuration - 1 transaction consists of one commit per 10 insert records - 96 clients Test Codes - AT-P26-Replication

10 AT-P26-Replication Schema create table tb_test1 ( c1 integer primary key, c2 char(10), c3 integer, c4 char(10), c5 char(10), c6 char(10), c7 char(10), c8 char(10), c9 integer, c10 date ); Test Bed B Test Results Replication performance improvement rate No Parallel (10G w/o Onload) 100%(base) No Parallel (10G with Onload) 132% 6 Parallel (10G w/o Onload) 152% 6 Parallel (10G with Onload) 198% Replication Performance TPS 12,000 10,000 8,000 6,000 4,000 2,000 0 No Parallel w/o Onload No Parallel with Onload 6 Parallel w/o Onload 6 Parallel with Onload In Test bed B, the test results showed that OpenOnload performance improvement for asynchronous replication scales with increased parallel replication option processing. Without the parallel replication option, replication performance with OpenOnload increased by 32% compared with using the kernel driver. With 6 parallel replication threads, performance with OpenOnload increased by 30% compared with using the Linux kernel driver. SF-114775-CD Issue 1 AB_SB_091815 Copyright 2015 Solarflare Communications, Inc. All rights reserved.