Database Acceleration Solution Using FPGAs and Integrated Flash Storage

Similar documents
Oracle Exadata: Strategy and Roadmap

Concurrent execution of an analytical workload on a POWER8 server with K40 GPUs A Technology Demonstration

LegUp: Accelerating Memcached on Cloud FPGAs

Accelerating Business Analytics with Flash Storage and FPGAs

QLIK INTEGRATION WITH AMAZON REDSHIFT

GPU ACCELERATED DATABASE MANAGEMENT SYSTEMS

An NVMe-based FPGA Storage Workload Accelerator

SDA: Software-Defined Accelerator for general-purpose big data analysis system

Agenda. AWS Database Services Traditional vs AWS Data services model Amazon RDS Redshift DynamoDB ElastiCache

Data Analytics using MapReduce framework for DB2's Large Scale XML Data Processing

SSDs that Think. Noam Mizrahi Vice President, Technology and Architecture CTO Office, Marvell

Fast, In-Memory Analytics on PPDM. Calgary 2016

New Approach to Unstructured Data

Data Transformation and Migration in Polystores

When MPPDB Meets GPU:

The Evolution of Big Data Platforms and Data Science

Nutanix Tech Note. Virtualizing Microsoft Applications on Web-Scale Infrastructure

A Brief Introduction of TiDB. Dongxu (Edward) Huang CTO, PingCAP

Hewlett Packard Enterprise HPE GEN10 PERSISTENT MEMORY PERFORMANCE THROUGH PERSISTENCE

Was ist dran an einer spezialisierten Data Warehousing platform?

Thomas Lin, Naif Tarafdar, Byungchul Park, Paul Chow, and Alberto Leon-Garcia

QLE10000 Series Adapter Provides Application Benefits Through I/O Caching

Cloud Computing & Visualization

stec Host Cache Solution

SDACCEL DEVELOPMENT ENVIRONMENT. The Xilinx SDAccel Development Environment. Bringing The Best Performance/Watt to the Data Center

OPERATIONALIZING MACHINE LEARNING USING GPU ACCELERATED, IN-DATABASE ANALYTICS

WHITEPAPER. MemSQL Enterprise Feature List

Hadoop 2.x Core: YARN, Tez, and Spark. Hortonworks Inc All Rights Reserved

Impala. A Modern, Open Source SQL Engine for Hadoop. Yogesh Chockalingam

SoftFlash: Programmable Storage in Future Data Centers Jae Do Researcher, Microsoft Research

OpenPOWER Performance

EMC XTREMCACHE ACCELERATES MICROSOFT SQL SERVER

In-Memory Computing EXASOL Evaluation

Apache Spark Graph Performance with Memory1. February Page 1 of 13

Four Steps to Unleashing The Full Potential of Your Database

FLASHARRAY//M Business and IT Transformation in 3U

BlueDBM: An Appliance for Big Data Analytics*

Accelerating Real-Time Big Data. Breaking the limitations of captive NVMe storage

STATE OF MODERN APPLICATIONS IN THE CLOUD

NVMe SSDs with Persistent Memory Regions

1

HPE SimpliVity 380. Simplyfying Hybrid IT with HPE Wolfgang Privas Storage Category Manager

Lenovo Database Configuration for Microsoft SQL Server TB

HiTune. Dataflow-Based Performance Analysis for Big Data Cloud

Onto Petaflops with Kubernetes

SUPERMICRO, VEXATA AND INTEL ENABLING NEW LEVELS PERFORMANCE AND EFFICIENCY FOR REAL-TIME DATA ANALYTICS FOR SQL DATA WAREHOUSE DEPLOYMENTS

cstore_fdw Columnar store for analytic workloads Hadi Moshayedi & Ben Redman

Cloud Computing with FPGA-based NVMe SSDs

Building dense NVMe storage

ASPERA HIGH-SPEED TRANSFER. Moving the world s data at maximum speed

Computational Storage: Acceleration Through Intelligence & Agility

Toward timely, predictable and cost-effective data analytics. Renata Borovica-Gajić DIAS, EPFL

CIS 601 Graduate Seminar. Dr. Sunnie S. Chung Dhruv Patel ( ) Kalpesh Sharma ( )

How Flash-Based Storage Performs on Real Applications Session 102-C

EMC XTREMCACHE ACCELERATES VIRTUALIZED ORACLE

Fusion iomemory PCIe Solutions from SanDisk and Sqrll make Accumulo Hypersonic

2011 IBM Research Strategic Initiative: Workload Optimized Systems

XPU A Programmable FPGA Accelerator for Diverse Workloads

Percona Server for MySQL 8.0 Walkthrough

Scaling DreamFactory

Pivot3 Acuity with Microsoft SQL Server Reference Architecture

GPU Accelerated Data Processing Speed of Thought Analytics at Scale

Hardware NVMe implementation on cache and storage systems

5/24/ MVP SQL Server: Architecture since 2010 MCT since 2001 Consultant and trainer since 1992

Aerospike Scales with Google Cloud Platform

Data 101 Which DB, When. Joe Yong Azure SQL Data Warehouse, Program Management Microsoft Corp.

Microservices without the Servers: AWS Lambda in Action

Oracle Autonomous Database

Validating the NetApp Virtual Storage Tier in the Oracle Database Environment to Achieve Next-Generation Converged Infrastructures

HCI: Hyper-Converged Infrastructure

Introduction to the Active Everywhere Database

BIG DATA COURSE CONTENT

IBM s Data Warehouse Appliance Offerings

IaaS Vendor Comparison

Recent Innovations in Data Storage Technologies Dr Roger MacNicol Software Architect

Merging Enterprise Applications with Docker* Container Technology

Next-Generation Cloud Platform

Future of Database. - Journey to the Cloud. Juan Loaiza Senior Vice President Oracle Database Systems

FPGA Implementation of Erasure Codes in NVMe based JBOFs

Introduction to Database Services

NAND Interleaving & Performance

Accelerating Hadoop Applications with the MapR Distribution Using Flash Storage and High-Speed Ethernet

ArcGIS Enterprise: Architecting Your Deployment

Accelerating Data Center Workloads with FPGAs

GRVI Phalanx Update: Plowing the Cloud with Thousands of RISC-V Chickens. Jan Gray

DATABASE SCALE WITHOUT LIMITS ON AWS

MySQL & NoSQL: The Best of Both Worlds

Nutanix White Paper. Hyper-Converged Infrastructure for Enterprise Applications. Version 1.0 March Enterprise Applications on Nutanix

Providing Multi-tenant Services with FPGAs: Case Study on a Key-Value Store

Zhang Tianfei. Rosen Xu

PG-Strom v2.0 Release Technical Brief (17-Apr-2018) PG-Strom Development Team

Be Fast, Cheap and in Control with SwitchKV Xiaozhou Li

Accelerating Microsoft SQL Server 2016 Performance With Dell EMC PowerEdge R740

High Performance Packet Processing with FlexNIC

OPENSTACK: THE OPEN CLOUD

Heckaton. SQL Server's Memory Optimized OLTP Engine

Modern Data Warehouse The New Approach to Azure BI

Enabling Flexible Network FPGA Clusters in a Heterogeneous Cloud Data Center

Cloud & container monitoring , Lars Michelsen Check_MK Conference #4

Analytics with IMS and QMF

Transcription:

Database Acceleration Solution Using FPGAs and Integrated Flash Storage HK Verma, Xilinx Inc. August 2017 1

FPGA Analytics in Flash Storage System In-memory or Flash storage based DB reduce disk access penalty, but do not ease CPU compute" bottleneck FPGA Analytics accelerators overcome the CPU bottleneck, enable new workloads FPGAs bring compute closer to flash storage devices Server Bottleneck Storage In-memory system does not ease the CPU Bottleneck, FPGA reduces the bottleneck August 2017 2

FPGA Accelerator between Server and Storage FPGA connected to SSD using PCIe Integrated FPGA & SSD board Tools, methodology, platforms, acceleration design blocks available for customers to implement target acceleration logic on FPGA Tighter integration with storage reduces CPU overheads, adds new functionality in storage Acceleration solution available on Amazon AWS as F1 instance to try and deploy Reduce CPU Usage, add new functionalities in storage solutions by using FPGA Acceleration August 2017 3

Database acceleration offload to FPGAs Big Data Application 10s-100s 0f GigaBytes TeraBytes Stored in Database or Text files RDBMS or NoSQL such as PostgreSQL Text Files like csv Key-value Database like RocksDB Identify acceleration components from the big data applications Use Xilinx tools, methodology to build an offload engine Release and try it on cloud such as Amazon AWS F1 Application Interface Prepare Data Blocks Use instruction as metadata of the Data Blocks to program for target functionality Stream into FPGA using Data Mover Gather the results from FPGA and produce the final result Standard API for an efficient data mover Common across different types of application Data DMA, Kernel executions synchronized using OpenCL events to ensure maximal sustained data rate over PCIe FPGA and flash storage integrated solutions offering application specific offloads FPGA Data Mover Data Block August 2017 4..... Enqueue Compute Block Units Data Block Xilinx SDx Runtime Xilinx Platform Processing Unit Block... Processing Unit Block Processing Unit Block... Processing Unit Block Data Block..... Data Block Merge Block PU results Flash Storage Device Discrete or FPGA integrated controller solution Direct FPGA connection or connected through PCIe

FPGA Implementation Methodology for Database Acceleration Efficient implementation of massively parallel processing units (PU) Provides 10-25x performance improvement over CPUs across many applications Accelerated functionality uses less than half of Xeon power Innovative technology offering 10-50x compute efficiency improvement over CPUs August 2017 5

Database TPCH Query 6 & 1 Acceleration Processing Units in C HLS Native support of Date, Decimal SQL types Complete query logic on FPGA Combine many parallel PU results in HLS for a Compute Unit Kernels Query 6 - KU60 can fit up to 64 Units, KU115 128 Units Application pulls data from PostgreSQL, and sends to FPGA platform accelerating Query 6 and Query 1 of TPCH benchmark processing 40GB database size Query implemented on FPGA device using C HLS and massively parallel instantiations of Query Units Code and methodology available now CPU query implementation is from default PostgreSQL results on Xeon E5 platform Query 1 - KU60 can fit up to 16 Units, KU115 32 Units TPCH Query demo with PostgreSQL, text data available on Nimbix cloud August 2017 6

SQL Query Engine on Xilinx FPGA PostgreSQL interface SQL engine with SQL processing opcodes that processes Postgres storage blocks Customers can offload existing SQL query on FPGA Suitable for large data with scan, filter, aggregate, hash aggregate processing PostgreSQL UDF (User Defined Function) PostgreSQL UDF exposed as a SQL command User calls this query for qualified scanning and aggregation FPGA offload on the PostgreSQL data PostgreSQL Generic SQL Client side plugin to insert FPGA offload computation into a Postgres SQL query plan Automatic generation of SQL Engine opcodes for the given SQL query Process any user SQL query without reprogramming FPGA Provides a solution to offload existing customer SQL commands to an FPGA PostgreSQL Amazon AWS F1 image (AFI) using Xilinx FPGAs releasing shortly!! August 2017 7

Summary A system architecture integrating FPGAs and flash storage proposed and available on cloud for customers to try it out Hardware programming in FPGAs enable tighter integration with flash storage devices FPGAs overcome CPU compute bottleneck FPGAs have superior energy efficiency and performance Proposed efficient method to build massively parallel Processing Unit architecture on FPGAs Custom hardware integration of parallel units reduces inefficiencies with parallelization Standard API based data mover for Xilinx platforms with SDx runtime libraries to move data from Application user memory to FPGA device memory Application designers need to build efficient application interface August 2017 8