HyPer on Cloud 9. Thomas Neumann. February 10, Technische Universität München

Size: px
Start display at page:

Download "HyPer on Cloud 9. Thomas Neumann. February 10, Technische Universität München"

Transcription

1 HyPer on Cloud 9 Thomas Neumann Technische Universität München February 10, 2016

2 HyPer HyPer is the main-memory database system developed in our group a very fast database system with ACID transactions quite comprehensive SQL support (SQL92 plus many SQL99 features) queries are compiled into machine code using LLVM very little overhead, excellent performance You can try it out (or download a demo) online: We use it for teaching, too. (Easy to use web interface for students). Thomas Neumann HyPer on Cloud 9 2 / 13

3 Hyper in the Cloud Currently HyPer is primarily run on a single dedicated node but of course we looked at running HyPer in the cloud Wolf Rödiger will talk about distributed processing next I will concentrate on issues we found during experiments some of them quite surprising; could be relevant for your project, too not all of them have perfect solutions yet Thomas Neumann HyPer on Cloud 9 3 / 13

4 Costs of Virtualization Cloud applications usually run on virtualized machined shields the application from the actual hardware simplified management, allows for easy migration, etc. several different techniques to do that (containers, hypervisors, etc.) what does that actually cost? in-memory processing is CPU bound in an ideal world virtualization would be (nearly) free Thomas Neumann HyPer on Cloud 9 4 / 13

5 Thomas Neumann HyPer on Cloud 9 5 / 13

6 Thomas Neumann HyPer on Cloud 9 6 / 13

7 Thomas Neumann HyPer on Cloud 9 7 / 13

8 Thomas Neumann HyPer on Cloud 9 8 / 13

9 Thomas Neumann HyPer on Cloud 9 9 / 13

10 Cooperative Behavior Cloud infrastructure relies upon cooperative behavior virtualized applications utilize ideal resources combined system looks more powerful than it really is hogging resources is bad for cloud processing Main resource for HyPer (and probably most other DBMSs): main memory not only for the real data, but also for intermediate results some queries have large intermediate results allocate a lot of memory Thomas Neumann HyPer on Cloud 9 10 / 13

11 Allocating and Releasing Memory Memory allocation is a multi-step process: 1. pages are created in the page table, but (usually) not physically assigned 2. upon access, pages are assigned and zeroed out 3. upon release, page table is modified again Problem: this does not scale a lot of locking and overhead within the kernel costs more than a factor 2 performance with large queries on large hosts even more pronounced on Windows, there freeing pages is surprisingly expensive Solutions? never release memory? (i.e., do your own memory management) best performance, but hogs resources non-portable techniques? (madvise etc.) Unsatisfying. Thomas Neumann HyPer on Cloud 9 11 / 13

12 Durability For ACID, we must 1) not report commit until the WAL record hits durable media, and 2) avoid data corruption during writes. difficult to guarantee even on bare metal often requires battery-backed RAID controllers to avoid all issues (including tearing) How do we guarantee that on virtualized hardware? nearly impossible the (virtualized) hardware lies to us an inconveniently timed power failure can lead to a disaster Microsoft installed special hooks for their cloud DBMS, but that is not publicly usable other vendors probably, too, but not generic solutions exists Thomas Neumann HyPer on Cloud 9 12 / 13

13 Conclusion Databases in the cloud are an interesting topic many effects that we see are implementation artifacts choice of virtualization product has a large impact many open issues concerning both performance and correctness but apparently many people are happy with living dangerously Still many open research questions. Thomas Neumann HyPer on Cloud 9 13 / 13

14 HyPer on Cloud 9 Wolf Rödiger Technische Universität München

15 Traditional Data Warehouse OLTP Extract Transform Load OLAP Database Data Warehouse Wolf Rödiger HyPer on Cloud 9 2 / 28

16 HyPer: Hybrid OLTP & OLAP HyPer OLTP & OLAP on the same data at the same time Efficient snapshotting Data-centric code generation OLTP OLAP > 100,000 TPC-C TX/s Best-of-breed response times Real-time analytics fork() OLTP OLAP snapshot a* OLTP process a copy on update OLAP Wolf Rödiger HyPer on Cloud 9 3 / 28

17 Wishlist for HyPer in the Cloud Scale query performance with the cluster size Sustain transaction performance Scale capacity to process larger workloads Elastically add servers to the cluster Provide high availability Wolf Rödiger HyPer on Cloud 9 4 / 28

18 1st Design: Full Replication via Redo Log Multicasting primary HyPer persistent storage OLTP multicast redo log redo log snapshot secondaries OLAP load balancing Tobias Mühlbauer, Wolf Rödiger, Angelika Reiser, Alfons Kemper, Thomas Neumann, ScyPer: Elastic OLAP Throughput on Transactional Data, DanaC Wolf Rödiger HyPer on Cloud 9 5 / 28

19 Scale query throughput, sustain transaction performance 150k OLTP throughput OLAP throughput 60k transactions/s 100k 50k 40k 20k queries/h number of nodes 0 Wolf Rödiger HyPer on Cloud 9 6 / 28

20 Elasticity: Materialized snapshots and multicasting primary HyPer persistent storage OLTP 3. subscribe to multicast 2. apply redo log 1. load snapshot secondaries new server OLAP 4. receive queries Wolf Rödiger HyPer on Cloud 9 7 / 28

21 High Availability: Secondaries can replace the primary primary HyPer persistent storage 3. start multicast 1. replay redo log secondaries 2. accept OLTP OLTP OLAP new primary Wolf Rödiger HyPer on Cloud 9 8 / 28

22 Multi-Tenancy: Flexible database-as-a-service deployment secondary primary primary Tenant 1 Tenant 1 Tenant 2 Tenant 4 Tenant 3 Tenant 1 Tenant 4 Tenant 4 Tenant 2 Tenant 3 secondary secondary secondary Wolf Rödiger HyPer on Cloud 9 9 / 28

23 1st Design: Full Replication via Redo Log Multicasting Scales query throughput with the cluster size Sustains transaction performance Allows to elastically add servers to the cluster Provides high availability Limited to workloads that fit into a single server Same query response times as a single server Wolf Rödiger HyPer on Cloud 9 10 / 28

24 2nd Design: Query Processing on Fragmented Relations Lineitem Lineitem 11 Customer Customer 11 Orders Orders 11 Lineitem Lineitem 21 Customer Customer 21 Orders Orders 21 Lineitem Lineitem 31 Customer Customer 31 Orders Orders 31 Lineitem Lineitem 41 Customer Customer 41 Orders Orders 41 Lineitem Lineitem 51 Customer Customer 51 Orders Orders 51 Lineitem Lineitem 61 Customer Customer 61 Orders Orders 61 Fragment relations across servers to use the combined capacity of the cluster, making room for larger workloads Wolf Rödiger, Tobias Mühlbauer, Alfons Kemper, Thomas Neumann, High-Speed Query Processing over High-Speed Networks, VLDB Wolf Rödiger HyPer on Cloud 9 11 / 28

25 Main Challenges 128 GB 59.7 GB/s CPU CPU 0 CPU CPU 1 10 cores 16 GB/s QPI QPI 10 cores 59.7 GB/s 128 GB 16 GB/s PCIe GB/s host 0 host 1 4 GB/s Infiniband 4 QDR HCA host 2 host 3 host 4 host 5 Challenge 1: Network is a bottleneck to query processing Challenge 2: Utilize all cores and all servers Wolf Rödiger HyPer on Cloud 9 12 / 28

26 Challenge 1: Network is a Bottleneck Network is bottleneck for distributed processing Manual schema-tuning is time-consuming and workload-dependent TPC-H speed-up (SF 100) Faster network hardware is not enough, software 0 has to change TCP/IP (40 Gb/s InfiniBand) TCP/IP (1 Gb/s Ethernet) number of servers Wolf Rödiger HyPer on Cloud 9 13 / 28

27 Remote Direct Memory Access Zero-copy network communication: High network throughput Almost no CPU cost Less memory bus traffic Less CPU cost, higher throughput than TCP RDMA achieves full speed at only 4 % CPU load Recently used to implement a distributed radix join* Application Buffer Operating System sender Application Buffer Operating System receiver HCA HCA *Barthels et al., Rack-Scale In-Memory Join Processing using RDMA, SIGMOD Wolf Rödiger HyPer on Cloud 9 14 / 28

28 Network Scheduling Uncoordinated network communication causes switch contention Communicate in a strict round-robin fashion Synchronize via low-latency RDMA operations Network scheduling improves throughput by 40 % for an 8-server InfiniBand cluster throughput in GB/s all-to-all scheduling number of servers Wolf Rödiger HyPer on Cloud 9 15 / 28

29 Challenge 2: Utilize all cores and all servers Exchange partitions tuples for joins and aggregations Traditionally used to parallelize for multiple cores and servers But: Exchange does not scale well due to inflexible design TPC-H speed-up (SF 300) HyPer ( operators) Vectorwise ( operators) 6 (1) 30 (5) 60 (10) 90 (15) 120 (20) number of cores (per server) Wolf Rödiger HyPer on Cloud 9 16 / 28

30 Classic Exchange Fixed degree of parallelism An needs a buffer for every other : #buffers per server = servers cores 2 2,400 buffers 1 GB of main memory per server Also: Each join key value is processed by a specific operator Heavy hitters are assigned to a single Wolf Rödiger HyPer on Cloud 9 17 / 28

31 Decoupled Exchange Communicate indirectly via communication multiplexers Address servers not threads: Decreases memory consumption Reduces negative impact of heavy hitters Improves applicability of broadcast optimization Local load balancing via work stealing* important for good scalability CM CM CM *Leis et al., Morsel-driven parallelism: a NUMA-aware query evaluation framework for the many-core age, SIGMOD Wolf Rödiger HyPer on Cloud 9 18 / 28

32 Hybrid Parallelism = RDMA-based Communication + Decoupled Exchange Operators server 0 NUMA socket 1 parallelized query pipeline parallelized query pipeline 7. produce tuples 6. deserialize tuples NUMA socket 0 per thread next operator per thread 1. consume tuples prev. operator RDMA Key NUMA node retain count ID last message bytes used content incoming message 2. partition + serialize tuples 5a. receive local or 5b. steal work communication multiplexer receive queue NUMA socket 1 receive queue NUMA socket 0 per server network scheduler per server RDMA RDMA Key RDMA Key Key NUMA NUMA node NUMA node node retain retain count retain count count ID 3. send message ID ID when full last message last message bytes used used bytes used content content 4. reuse message outgoing messages message pool send queues per socket Wolf Rödiger HyPer on Cloud 9 19 / 28

33 Hybrid parallelism scales in the number of cores... TPC-H speed-up (SF 300) HyPer (hybrid parallelism) HyPer () Vectorwise () 6 (1) 30 (5) 60 (10) 90 (15) 120 (20) number of cores (per server) Wolf Rödiger HyPer on Cloud 9 20 / 28

34 ... and in the number of servers in the cluster TPC-H speed-up (SF 100) RDMA (40 Gb/s InfiniBand) + scheduling TCP/IP (40 Gb/s InfiniBand) TCP/IP (1 Gb/s Ethernet) number of servers Wolf Rödiger HyPer on Cloud 9 21 / 28

35 How do we compare? HyPer (partitioned) 20,739 HyPer (chunked) 16,090 Vectorwise (partitioned) 3,856 MemSQL (partitioned) Impala (chunked) Spark SQL (chunked) k 10k 15k 20k 25k TPC-H queries per hour (SF 100) Note: MemSQL takes several seconds for query compilation, but these compilation times are not included in this experiment. Wolf Rödiger HyPer on Cloud 9 22 / 28

36 2nd Design: Query Processing on Fragmented Relations Utilizes the combined capacity of the cluster Reduces query response times Scales query throughput with the cluster size? How can we add servers with minimal disruption?? How can we survive server failures?? How can we sustain the transaction performance? Wolf Rödiger HyPer on Cloud 9 23 / 28

37 Outlook: Elasticity via Highest Random Weight Hashing e5fa4 4f2b3 1c1fb 7448d 8798a ff5e c13ff 7d5d9 a3db5 24e7a f6f9e Orders 3 Orders 5 Orders 2 Orders 6 new server 553b6 021e7 360d0 Orders 1 Orders 4 452e2 56f9b 62d4b Redistribute data when servers are added/removed Avoids reshuffling the whole database, balances load evenly Mukherjee et al., Distributed Architecture of Oracle Database In-memory, VLDB Wolf Rödiger HyPer on Cloud 9 24 / 28

38 Outlook: High availability via HDFS-style replication Orders PART 5 Orders PART 1 Orders PART 4 8 Orders PART 2 5 Orders PART 1 2 Orders PART 3 Orders PART 5 Orders PART 2 Orders PART 3 Orders PART 2 Orders PART 3 Orders PART 5 Orders PART 4 Orders PART 4 Orders PART 1 Replication factor of x allows for x 1 server failures Wolf Rödiger HyPer on Cloud 9 25 / 28

39 Outlook: Sustain transaction performance OLAP Distributed transactions are overly expensive: Two-phase commit Global locking Deadlock detection H-Store model: partitioned execution of transactions Works well for TPC-C Drawbacks: Requires schema tuning Not for all workloads OLTP OLTP OLTP Wolf Rödiger HyPer on Cloud 9 26 / 28

40 Outlook: Hot/cold approach for distributed transactions primary HyPer HOT OLTP multicast redo log secondaries COLD COLD 7 7 HOT HOT COLD COLD 4 4 COLD COLD 1 1 COLD COLD 8 8 HOT HOT COLD COLD 5 5 COLD COLD 2 2 HOT HOT COLD COLD 9 9 COLD COLD 6 6 COLD COLD 3 3 OLAP distributed queries on global TX-consistent snapshots Wolf Rödiger HyPer on Cloud 9 27 / 28

41 HyPer on Cloud 9 Scale capacity to process larger workloads Reduce query response times Scale query throughput with the cluster size Elastically add servers to the cluster Provide high availability Sustain transaction performance Wolf Rödiger HyPer on Cloud 9 28 / 28

42 Backup Slides Wolf Rödiger HyPer on Cloud 9 1 / 5

43 Keeping secondaries up-to-date Logical log: transaction identifier and parameters Physical log: insert, update and delete operations Replay log on secondaries Logical log excludes: Readers Aborts Redo logging Undo logging Physical log also excludes: Read operations Costly transaction logic execution time 800 ms 600 ms 400 ms 200 ms 0 ms writers readers aborts normal execution 23% faster logical log replaying 56% faster physical log replaying Wolf Rödiger HyPer on Cloud 9 2 / 5

44 Redo Log Multicasting Independent of cluster size UDP unreliable, PGM reliable TPC-C transactions 60,000 log entries/s 1,500 B physical log entry/tx 250 B logical log entry/tx 1 GbE vs. InfiniBand 1 GbE needs: Group commits LZ4 compression ( 50 %) InfiniBand (IPoIB) is sufficient; PGM is CPU-bound 1 GbE PGM Bandwidth [Mbit/s] 675 Packets [1,000/s] 43 Latency [µs] InfiniBand 4 QDR PGM Bandwidth [Mbit/s] 1,832 Packets [1,000/s] 112 Latency [µs] 13.5 Wolf Rödiger HyPer on Cloud 9 3 / 5

45 Guaranteeing correct results 3. read a 1. read a 2. read a 1. fork() LSN 101 LSN 100 LSN 103 LSN write a=a* a* a a* a Order-preserving serializability violation Diverging distributed reads Logical time defined by log sequence number (LSN) Global TX-consistent snapshots avoid consistency problems Wolf Rödiger HyPer on Cloud 9 4 / 5

46 NUMA one socket interleaved NUMA-aware 0 1k 2k 3k 4k 5k TPC-H queries per hour (SF 100) NUMA-aware allocation of message buffers improves TPC-H performance by a factor of 2 for a 4-socket server Our communication multiplexer provides NUMA-local message buffers to the decoupled operators Wolf Rödiger HyPer on Cloud 9 5 / 5

HyPer-sonic Combined Transaction AND Query Processing

HyPer-sonic Combined Transaction AND Query Processing HyPer-sonic Combined Transaction AND Query Processing Thomas Neumann Technische Universität München December 2, 2011 Motivation There are different scenarios for database usage: OLTP: Online Transaction

More information

HyPer-sonic Combined Transaction AND Query Processing

HyPer-sonic Combined Transaction AND Query Processing HyPer-sonic Combined Transaction AND Query Processing Thomas Neumann Technische Universität München October 26, 2011 Motivation - OLTP vs. OLAP OLTP and OLAP have very different requirements OLTP high

More information

Main-Memory Databases 1 / 25

Main-Memory Databases 1 / 25 1 / 25 Motivation Hardware trends Huge main memory capacity with complex access characteristics (Caches, NUMA) Many-core CPUs SIMD support in CPUs New CPU features (HTM) Also: Graphic cards, FPGAs, low

More information

Accelerating Analytical Workloads

Accelerating Analytical Workloads Accelerating Analytical Workloads Thomas Neumann Technische Universität München April 15, 2014 Scale Out in Big Data Analytics Big Data usually means data is distributed Scale out to process very large

More information

Administration Naive DBMS CMPT 454 Topics. John Edgar 2

Administration Naive DBMS CMPT 454 Topics. John Edgar 2 Administration Naive DBMS CMPT 454 Topics John Edgar 2 http://www.cs.sfu.ca/coursecentral/454/johnwill/ John Edgar 4 Assignments 25% Midterm exam in class 20% Final exam 55% John Edgar 5 A database stores

More information

Data Blocks: Hybrid OLTP and OLAP on Compressed Storage using both Vectorization and Compilation

Data Blocks: Hybrid OLTP and OLAP on Compressed Storage using both Vectorization and Compilation Data Blocks: Hybrid OLTP and OLAP on Compressed Storage using both Vectorization and Compilation Harald Lang 1, Tobias Mühlbauer 1, Florian Funke 2,, Peter Boncz 3,, Thomas Neumann 1, Alfons Kemper 1 1

More information

User Perspective. Module III: System Perspective. Module III: Topics Covered. Module III Overview of Storage Structures, QP, and TM

User Perspective. Module III: System Perspective. Module III: Topics Covered. Module III Overview of Storage Structures, QP, and TM Module III Overview of Storage Structures, QP, and TM Sharma Chakravarthy UT Arlington sharma@cse.uta.edu http://www2.uta.edu/sharma base Management Systems: Sharma Chakravarthy Module I Requirements analysis

More information

HAWQ: A Massively Parallel Processing SQL Engine in Hadoop

HAWQ: A Massively Parallel Processing SQL Engine in Hadoop HAWQ: A Massively Parallel Processing SQL Engine in Hadoop Lei Chang, Zhanwei Wang, Tao Ma, Lirong Jian, Lili Ma, Alon Goldshuv Luke Lonergan, Jeffrey Cohen, Caleb Welton, Gavin Sherry, Milind Bhandarkar

More information

Crescando: Predictable Performance for Unpredictable Workloads

Crescando: Predictable Performance for Unpredictable Workloads Crescando: Predictable Performance for Unpredictable Workloads G. Alonso, D. Fauser, G. Giannikis, D. Kossmann, J. Meyer, P. Unterbrunner Amadeus S.A. ETH Zurich, Systems Group (Funded by Enterprise Computing

More information

Memory-Based Cloud Architectures

Memory-Based Cloud Architectures Memory-Based Cloud Architectures ( Or: Technical Challenges for OnDemand Business Software) Jan Schaffner Enterprise Platform and Integration Concepts Group Example: Enterprise Benchmarking -) *%'+,#$)

More information

High Performance Transactions in Deuteronomy

High Performance Transactions in Deuteronomy High Performance Transactions in Deuteronomy Justin Levandoski, David Lomet, Sudipta Sengupta, Ryan Stutsman, and Rui Wang Microsoft Research Overview Deuteronomy: componentized DB stack Separates transaction,

More information

Mark Falco Oracle Coherence Development

Mark Falco Oracle Coherence Development Achieving the performance benefits of Infiniband in Java Mark Falco Oracle Coherence Development 1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy

More information

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. reserved. Insert Information Protection Policy Classification from Slide 8

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. reserved. Insert Information Protection Policy Classification from Slide 8 The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material,

More information

SCHISM: A WORKLOAD-DRIVEN APPROACH TO DATABASE REPLICATION AND PARTITIONING

SCHISM: A WORKLOAD-DRIVEN APPROACH TO DATABASE REPLICATION AND PARTITIONING SCHISM: A WORKLOAD-DRIVEN APPROACH TO DATABASE REPLICATION AND PARTITIONING ZEYNEP KORKMAZ CS742 - PARALLEL AND DISTRIBUTED DATABASE SYSTEMS UNIVERSITY OF WATERLOO OUTLINE. Background 2. What is Schism?

More information

Morsel- Drive Parallelism: A NUMA- Aware Query Evaluation Framework for the Many- Core Age. Presented by Dennis Grishin

Morsel- Drive Parallelism: A NUMA- Aware Query Evaluation Framework for the Many- Core Age. Presented by Dennis Grishin Morsel- Drive Parallelism: A NUMA- Aware Query Evaluation Framework for the Many- Core Age Presented by Dennis Grishin What is the problem? Efficient computation requires distribution of processing between

More information

Data Blocks: Hybrid OLTP and OLAP on compressed storage

Data Blocks: Hybrid OLTP and OLAP on compressed storage Data Blocks: Hybrid OLTP and OLAP on compressed storage Ben Brümmer Technische Universität München Fürstenfeldbruck, 26. November 208 Ben Brümmer 26..8 Lehrstuhl für Datenbanksysteme Problem HDD/Archive/Tape-Storage

More information

In-Memory Columnar Databases - Hyper (November 2012)

In-Memory Columnar Databases - Hyper (November 2012) 1 In-Memory Columnar Databases - Hyper (November 2012) Arto Kärki, University of Helsinki, Helsinki, Finland, arto.karki@tieto.com Abstract Relational database systems are today the most common database

More information

VOLTDB + HP VERTICA. page

VOLTDB + HP VERTICA. page VOLTDB + HP VERTICA ARCHITECTURE FOR FAST AND BIG DATA ARCHITECTURE FOR FAST + BIG DATA FAST DATA Fast Serve Analytics BIG DATA BI Reporting Fast Operational Database Streaming Analytics Columnar Analytics

More information

Nutanix Tech Note. Virtualizing Microsoft Applications on Web-Scale Infrastructure

Nutanix Tech Note. Virtualizing Microsoft Applications on Web-Scale Infrastructure Nutanix Tech Note Virtualizing Microsoft Applications on Web-Scale Infrastructure The increase in virtualization of critical applications has brought significant attention to compute and storage infrastructure.

More information

The mixed workload CH-BenCHmark. Hybrid y OLTP&OLAP Database Systems Real-Time Business Intelligence Analytical information at your fingertips

The mixed workload CH-BenCHmark. Hybrid y OLTP&OLAP Database Systems Real-Time Business Intelligence Analytical information at your fingertips The mixed workload CH-BenCHmark Hybrid y OLTP&OLAP Database Systems Real-Time Business Intelligence Analytical information at your fingertips Richard Cole (ParAccel), Florian Funke (TU München), Leo Giakoumakis

More information

Track Join. Distributed Joins with Minimal Network Traffic. Orestis Polychroniou! Rajkumar Sen! Kenneth A. Ross

Track Join. Distributed Joins with Minimal Network Traffic. Orestis Polychroniou! Rajkumar Sen! Kenneth A. Ross Track Join Distributed Joins with Minimal Network Traffic Orestis Polychroniou Rajkumar Sen Kenneth A. Ross Local Joins Algorithms Hash Join Sort Merge Join Index Join Nested Loop Join Spilling to disk

More information

CSE 544: Principles of Database Systems

CSE 544: Principles of Database Systems CSE 544: Principles of Database Systems Anatomy of a DBMS, Parallel Databases 1 Announcements Lecture on Thursday, May 2nd: Moved to 9am-10:30am, CSE 403 Paper reviews: Anatomy paper was due yesterday;

More information

Storage Optimization with Oracle Database 11g

Storage Optimization with Oracle Database 11g Storage Optimization with Oracle Database 11g Terabytes of Data Reduce Storage Costs by Factor of 10x Data Growth Continues to Outpace Budget Growth Rate of Database Growth 1000 800 600 400 200 1998 2000

More information

Huge market -- essentially all high performance databases work this way

Huge market -- essentially all high performance databases work this way 11/5/2017 Lecture 16 -- Parallel & Distributed Databases Parallel/distributed databases: goal provide exactly the same API (SQL) and abstractions (relational tables), but partition data across a bunch

More information

Rethinking Distributed Query Execution on High-Speed Networks

Rethinking Distributed Query Execution on High-Speed Networks Rethinking Distributed Query Execution on High-Speed Networks Abdallah Salama *, Carsten Binnig, Tim Kraska, Ansgar Scherp *, Tobias Ziegler Brown University, Providence, RI, USA * Kiel University, Germany

More information

Performance and Scalability with Griddable.io

Performance and Scalability with Griddable.io Performance and Scalability with Griddable.io Executive summary Griddable.io is an industry-leading timeline-consistent synchronized data integration grid across a range of source and target data systems.

More information

Be Fast, Cheap and in Control with SwitchKV. Xiaozhou Li

Be Fast, Cheap and in Control with SwitchKV. Xiaozhou Li Be Fast, Cheap and in Control with SwitchKV Xiaozhou Li Goal: fast and cost-efficient key-value store Store, retrieve, manage key-value objects Get(key)/Put(key,value)/Delete(key) Target: cluster-level

More information

The End of a Myth: Distributed Transactions Can Scale

The End of a Myth: Distributed Transactions Can Scale The End of a Myth: Distributed Transactions Can Scale Erfan Zamanian, Carsten Binnig, Tim Harris, and Tim Kraska Rafael Ketsetsides Why distributed transactions? 2 Cheaper for the same processing power

More information

CSE 190D Database System Implementation

CSE 190D Database System Implementation CSE 190D Database System Implementation Arun Kumar Topic 6: Transaction Management Chapter 16 of Cow Book Slide ACKs: Jignesh Patel 1 Transaction Management Motivation and Basics The ACID Properties Transaction

More information

CIS 601 Graduate Seminar. Dr. Sunnie S. Chung Dhruv Patel ( ) Kalpesh Sharma ( )

CIS 601 Graduate Seminar. Dr. Sunnie S. Chung Dhruv Patel ( ) Kalpesh Sharma ( ) Guide: CIS 601 Graduate Seminar Presented By: Dr. Sunnie S. Chung Dhruv Patel (2652790) Kalpesh Sharma (2660576) Introduction Background Parallel Data Warehouse (PDW) Hive MongoDB Client-side Shared SQL

More information

Systems Infrastructure for Data Science. Web Science Group Uni Freiburg WS 2014/15

Systems Infrastructure for Data Science. Web Science Group Uni Freiburg WS 2014/15 Systems Infrastructure for Data Science Web Science Group Uni Freiburg WS 2014/15 Lecture X: Parallel Databases Topics Motivation and Goals Architectures Data placement Query processing Load balancing

More information

Architectural challenges for building a low latency, scalable multi-tenant data warehouse

Architectural challenges for building a low latency, scalable multi-tenant data warehouse Architectural challenges for building a low latency, scalable multi-tenant data warehouse Mataprasad Agrawal Solutions Architect, Services CTO 2017 Persistent Systems Ltd. All rights reserved. Our analytics

More information

IX: A Protected Dataplane Operating System for High Throughput and Low Latency

IX: A Protected Dataplane Operating System for High Throughput and Low Latency IX: A Protected Dataplane Operating System for High Throughput and Low Latency Adam Belay et al. Proc. of the 11th USENIX Symp. on OSDI, pp. 49-65, 2014. Presented by Han Zhang & Zaina Hamid Challenges

More information

Stateless Network Functions:

Stateless Network Functions: Stateless Network Functions: Breaking the Tight Coupling of State and Processing Murad Kablan, Azzam Alsudais, Eric Keller, Franck Le University of Colorado IBM Networks Need Network Functions Firewall

More information

Migrating Oracle Databases To Cassandra

Migrating Oracle Databases To Cassandra BY UMAIR MANSOOB Why Cassandra Lower Cost of ownership makes it #1 choice for Big Data OLTP Applications. Unlike Oracle, Cassandra can store structured, semi-structured, and unstructured data. Cassandra

More information

Scaling Without Sharding. Baron Schwartz Percona Inc Surge 2010

Scaling Without Sharding. Baron Schwartz Percona Inc Surge 2010 Scaling Without Sharding Baron Schwartz Percona Inc Surge 2010 Web Scale!!!! http://www.xtranormal.com/watch/6995033/ A Sharding Thought Experiment 64 shards per proxy [1] 1 TB of data storage per node

More information

Advanced Computer Networks. End Host Optimization

Advanced Computer Networks. End Host Optimization Oriana Riva, Department of Computer Science ETH Zürich 263 3501 00 End Host Optimization Patrick Stuedi Spring Semester 2017 1 Today End-host optimizations: NUMA-aware networking Kernel-bypass Remote Direct

More information

Concurrency Control In Distributed Main Memory Database Systems. Justin A. DeBrabant

Concurrency Control In Distributed Main Memory Database Systems. Justin A. DeBrabant In Distributed Main Memory Database Systems Justin A. DeBrabant debrabant@cs.brown.edu Concurrency control Goal: maintain consistent state of data ensure query results are correct The Gold Standard: ACID

More information

White Paper. Major Performance Tuning Considerations for Weblogic Server

White Paper. Major Performance Tuning Considerations for Weblogic Server White Paper Major Performance Tuning Considerations for Weblogic Server Table of Contents Introduction and Background Information... 2 Understanding the Performance Objectives... 3 Measuring your Performance

More information

Oracle Database Exadata Cloud Service Exadata Performance, Cloud Simplicity DATABASE CLOUD SERVICE

Oracle Database Exadata Cloud Service Exadata Performance, Cloud Simplicity DATABASE CLOUD SERVICE Oracle Database Exadata Exadata Performance, Cloud Simplicity DATABASE CLOUD SERVICE Oracle Database Exadata combines the best database with the best cloud platform. Exadata is the culmination of more

More information

Morsel-Driven Parallelism: A NUMA-Aware Query Evaluation Framework for the Many-Core Age

Morsel-Driven Parallelism: A NUMA-Aware Query Evaluation Framework for the Many-Core Age Morsel-Driven Parallelism: A NUMA-Aware Query Evaluation Framework for the Many-Core Age Viktor Leis Peter Boncz Alfons Kemper Thomas Neumann Technische Universität München {leis,kemper,neumann}@in.tum.de

More information

Networking for Data Acquisition Systems. Fabrice Le Goff - 14/02/ ISOTDAQ

Networking for Data Acquisition Systems. Fabrice Le Goff - 14/02/ ISOTDAQ Networking for Data Acquisition Systems Fabrice Le Goff - 14/02/2018 - ISOTDAQ Outline Generalities The OSI Model Ethernet and Local Area Networks IP and Routing TCP, UDP and Transport Efficiency Networking

More information

Anti-Caching: A New Approach to Database Management System Architecture. Guide: Helly Patel ( ) Dr. Sunnie Chung Kush Patel ( )

Anti-Caching: A New Approach to Database Management System Architecture. Guide: Helly Patel ( ) Dr. Sunnie Chung Kush Patel ( ) Anti-Caching: A New Approach to Database Management System Architecture Guide: Helly Patel (2655077) Dr. Sunnie Chung Kush Patel (2641883) Abstract Earlier DBMS blocks stored on disk, with a main memory

More information

Hardware Transactional Memory on Haswell

Hardware Transactional Memory on Haswell Hardware Transactional Memory on Haswell Viktor Leis Technische Universität München 1 / 15 Introduction transactional memory is a very elegant programming model transaction { transaction { a = a 10; c

More information

Gavin Payne Senior Consultant.

Gavin Payne Senior Consultant. Gavin Payne Senior Consultant gavin@coeo.com Getting Started with SQL Server and Virtualisation Designing successful virtual database environments Monitoring a virtual database environments Summary The

More information

How VoltDB does Transactions

How VoltDB does Transactions TEHNIL NOTE How does Transactions Note to readers: Some content about read-only transaction coordination in this document is out of date due to changes made in v6.4 as a result of Jepsen testing. Updates

More information

Be Fast, Cheap and in Control with SwitchKV Xiaozhou Li

Be Fast, Cheap and in Control with SwitchKV Xiaozhou Li Be Fast, Cheap and in Control with SwitchKV Xiaozhou Li Raghav Sethi Michael Kaminsky David G. Andersen Michael J. Freedman Goal: fast and cost-effective key-value store Target: cluster-level storage for

More information

Evaluating Cloud Storage Strategies. James Bottomley; CTO, Server Virtualization

Evaluating Cloud Storage Strategies. James Bottomley; CTO, Server Virtualization Evaluating Cloud Storage Strategies James Bottomley; CTO, Server Virtualization Introduction to Storage Attachments: - Local (Direct cheap) SAS, SATA - Remote (SAN, NAS expensive) FC net Types - Block

More information

Conceptual Modeling on Tencent s Distributed Database Systems. Pan Anqun, Wang Xiaoyu, Li Haixiang Tencent Inc.

Conceptual Modeling on Tencent s Distributed Database Systems. Pan Anqun, Wang Xiaoyu, Li Haixiang Tencent Inc. Conceptual Modeling on Tencent s Distributed Database Systems Pan Anqun, Wang Xiaoyu, Li Haixiang Tencent Inc. Outline Introduction System overview of TDSQL Conceptual Modeling on TDSQL Applications Conclusion

More information

Data Transformation and Migration in Polystores

Data Transformation and Migration in Polystores Data Transformation and Migration in Polystores Adam Dziedzic, Aaron Elmore & Michael Stonebraker September 15th, 2016 Agenda Data Migration for Polystores: What & Why? How? Acceleration of physical data

More information

Accelerate MySQL for Demanding OLAP and OLTP Use Cases with Apache Ignite. Peter Zaitsev, Denis Magda Santa Clara, California April 25th, 2017

Accelerate MySQL for Demanding OLAP and OLTP Use Cases with Apache Ignite. Peter Zaitsev, Denis Magda Santa Clara, California April 25th, 2017 Accelerate MySQL for Demanding OLAP and OLTP Use Cases with Apache Ignite Peter Zaitsev, Denis Magda Santa Clara, California April 25th, 2017 About the Presentation Problems Existing Solutions Denis Magda

More information

Oracle Exadata: Strategy and Roadmap

Oracle Exadata: Strategy and Roadmap Oracle Exadata: Strategy and Roadmap - New Technologies, Cloud, and On-Premises Juan Loaiza Senior Vice President, Database Systems Technologies, Oracle Safe Harbor Statement The following is intended

More information

Copyright 2011, Oracle and/or its affiliates. All rights reserved.

Copyright 2011, Oracle and/or its affiliates. All rights reserved. The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material,

More information

MULTI-THREADED QUERIES

MULTI-THREADED QUERIES 15-721 Project 3 Final Presentation MULTI-THREADED QUERIES Wendong Li (wendongl) Lu Zhang (lzhang3) Rui Wang (ruiw1) Project Objective Intra-operator parallelism Use multiple threads in a single executor

More information

EMC XTREMCACHE ACCELERATES VIRTUALIZED ORACLE

EMC XTREMCACHE ACCELERATES VIRTUALIZED ORACLE White Paper EMC XTREMCACHE ACCELERATES VIRTUALIZED ORACLE EMC XtremSF, EMC XtremCache, EMC Symmetrix VMAX and Symmetrix VMAX 10K, XtremSF and XtremCache dramatically improve Oracle performance Symmetrix

More information

IBM POWER8 100 GigE Adapter Best Practices

IBM POWER8 100 GigE Adapter Best Practices Introduction IBM POWER8 100 GigE Adapter Best Practices With higher network speeds in new network adapters, achieving peak performance requires careful tuning of the adapters and workloads using them.

More information

CSE 344 Final Review. August 16 th

CSE 344 Final Review. August 16 th CSE 344 Final Review August 16 th Final In class on Friday One sheet of notes, front and back cost formulas also provided Practice exam on web site Good luck! Primary Topics Parallel DBs parallel join

More information

RACKSPACE ONMETAL I/O V2 OUTPERFORMS AMAZON EC2 BY UP TO 2X IN BENCHMARK TESTING

RACKSPACE ONMETAL I/O V2 OUTPERFORMS AMAZON EC2 BY UP TO 2X IN BENCHMARK TESTING RACKSPACE ONMETAL I/O V2 OUTPERFORMS AMAZON EC2 BY UP TO 2X IN BENCHMARK TESTING EXECUTIVE SUMMARY Today, businesses are increasingly turning to cloud services for rapid deployment of apps and services.

More information

Current Topics in OS Research. So, what s hot?

Current Topics in OS Research. So, what s hot? Current Topics in OS Research COMP7840 OSDI Current OS Research 0 So, what s hot? Operating systems have been around for a long time in many forms for different types of devices It is normally general

More information

B.H.GARDI COLLEGE OF ENGINEERING & TECHNOLOGY (MCA Dept.) Parallel Database Database Management System - 2

B.H.GARDI COLLEGE OF ENGINEERING & TECHNOLOGY (MCA Dept.) Parallel Database Database Management System - 2 Introduction :- Today single CPU based architecture is not capable enough for the modern database that are required to handle more demanding and complex requirements of the users, for example, high performance,

More information

Database Architectures

Database Architectures Database Architectures CPS352: Database Systems Simon Miner Gordon College Last Revised: 4/15/15 Agenda Check-in Parallelism and Distributed Databases Technology Research Project Introduction to NoSQL

More information

Data Modeling and Databases Ch 14: Data Replication. Gustavo Alonso, Ce Zhang Systems Group Department of Computer Science ETH Zürich

Data Modeling and Databases Ch 14: Data Replication. Gustavo Alonso, Ce Zhang Systems Group Department of Computer Science ETH Zürich Data Modeling and Databases Ch 14: Data Replication Gustavo Alonso, Ce Zhang Systems Group Department of Computer Science ETH Zürich Database Replication What is database replication The advantages of

More information

Jyotheswar Kuricheti

Jyotheswar Kuricheti Jyotheswar Kuricheti 1 Agenda: 1. Performance Tuning Overview 2. Identify Bottlenecks 3. Optimizing at different levels : Target Source Mapping Session System 2 3 Performance Tuning Overview: 4 What is

More information

S. Narravula, P. Balaji, K. Vaidyanathan, H.-W. Jin and D. K. Panda. The Ohio State University

S. Narravula, P. Balaji, K. Vaidyanathan, H.-W. Jin and D. K. Panda. The Ohio State University Architecture for Caching Responses with Multiple Dynamic Dependencies in Multi-Tier Data- Centers over InfiniBand S. Narravula, P. Balaji, K. Vaidyanathan, H.-W. Jin and D. K. Panda The Ohio State University

More information

Schema-Agnostic Indexing with Azure Document DB

Schema-Agnostic Indexing with Azure Document DB Schema-Agnostic Indexing with Azure Document DB Introduction Azure DocumentDB is Microsoft s multi-tenant distributed database service for managing JSON documents at Internet scale Multi-tenancy is an

More information

What s New in MySQL 5.7 Geir Høydalsvik, Sr. Director, MySQL Engineering. Copyright 2015, Oracle and/or its affiliates. All rights reserved.

What s New in MySQL 5.7 Geir Høydalsvik, Sr. Director, MySQL Engineering. Copyright 2015, Oracle and/or its affiliates. All rights reserved. What s New in MySQL 5.7 Geir Høydalsvik, Sr. Director, MySQL Engineering Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes

More information

PostgreSQL Cluster. Mar.16th, Postgres XC Write Scalable Cluster

PostgreSQL Cluster. Mar.16th, Postgres XC Write Scalable Cluster Postgres XC: Write Scalable PostgreSQL Cluster NTT Open Source Software Center EnterpriseDB Corp. Postgres XC Write Scalable Cluster 1 What is Postgres XC (or PG XC)? Wit Write scalable lbl PostgreSQL

More information

Building Durable Real-time Data Pipeline

Building Durable Real-time Data Pipeline Building Durable Real-time Data Pipeline Apache BookKeeper at Twitter @sijieg Twitter Background Layered Architecture Agenda Design Details Performance Scale @Twitter Q & A Publish-Subscribe Online services

More information

Spark Over RDMA: Accelerate Big Data SC Asia 2018 Ido Shamay Mellanox Technologies

Spark Over RDMA: Accelerate Big Data SC Asia 2018 Ido Shamay Mellanox Technologies Spark Over RDMA: Accelerate Big Data SC Asia 2018 Ido Shamay 1 Apache Spark - Intro Spark within the Big Data ecosystem Data Sources Data Acquisition / ETL Data Storage Data Analysis / ML Serving 3 Apache

More information

Distributed Databases

Distributed Databases Distributed Databases Chapter 22, Part B Database Management Systems, 2 nd Edition. R. Ramakrishnan and Johannes Gehrke 1 Introduction Data is stored at several sites, each managed by a DBMS that can run

More information

Survey of ETSI NFV standardization documents BY ABHISHEK GUPTA FRIDAY GROUP MEETING FEBRUARY 26, 2016

Survey of ETSI NFV standardization documents BY ABHISHEK GUPTA FRIDAY GROUP MEETING FEBRUARY 26, 2016 Survey of ETSI NFV standardization documents BY ABHISHEK GUPTA FRIDAY GROUP MEETING FEBRUARY 26, 2016 VNFaaS (Virtual Network Function as a Service) In our present work, we consider the VNFaaS use-case

More information

CMU SCS CMU SCS Who: What: When: Where: Why: CMU SCS

CMU SCS CMU SCS Who: What: When: Where: Why: CMU SCS Carnegie Mellon Univ. Dept. of Computer Science 15-415/615 - DB s C. Faloutsos A. Pavlo Lecture#23: Distributed Database Systems (R&G ch. 22) Administrivia Final Exam Who: You What: R&G Chapters 15-22

More information

Software-Defined Data Infrastructure Essentials

Software-Defined Data Infrastructure Essentials Software-Defined Data Infrastructure Essentials Cloud, Converged, and Virtual Fundamental Server Storage I/O Tradecraft Greg Schulz Server StorageIO @StorageIO 1 of 13 Contents Preface Who Should Read

More information

CPS 512 midterm exam #1, 10/7/2016

CPS 512 midterm exam #1, 10/7/2016 CPS 512 midterm exam #1, 10/7/2016 Your name please: NetID: Answer all questions. Please attempt to confine your answers to the boxes provided. If you don t know the answer to a question, then just say

More information

Big and Fast. Anti-Caching in OLTP Systems. Justin DeBrabant

Big and Fast. Anti-Caching in OLTP Systems. Justin DeBrabant Big and Fast Anti-Caching in OLTP Systems Justin DeBrabant Online Transaction Processing transaction-oriented small footprint write-intensive 2 A bit of history 3 OLTP Through the Years relational model

More information

Low Overhead Concurrency Control for Partitioned Main Memory Databases. Evan P. C. Jones Daniel J. Abadi Samuel Madden"

Low Overhead Concurrency Control for Partitioned Main Memory Databases. Evan P. C. Jones Daniel J. Abadi Samuel Madden Low Overhead Concurrency Control for Partitioned Main Memory Databases Evan P. C. Jones Daniel J. Abadi Samuel Madden" Banks" Payment Processing" Airline Reservations" E-Commerce" Web 2.0" Problem:" Millions

More information

April Copyright 2013 Cloudera Inc. All rights reserved.

April Copyright 2013 Cloudera Inc. All rights reserved. Hadoop Beyond Batch: Real-time Workloads, SQL-on- Hadoop, and the Virtual EDW Headline Goes Here Marcel Kornacker marcel@cloudera.com Speaker Name or Subhead Goes Here April 2014 Analytic Workloads on

More information

A Comparison of Memory Usage and CPU Utilization in Column-Based Database Architecture vs. Row-Based Database Architecture

A Comparison of Memory Usage and CPU Utilization in Column-Based Database Architecture vs. Row-Based Database Architecture A Comparison of Memory Usage and CPU Utilization in Column-Based Database Architecture vs. Row-Based Database Architecture By Gaurav Sheoran 9-Dec-08 Abstract Most of the current enterprise data-warehouses

More information

Messaging Overview. Introduction. Gen-Z Messaging

Messaging Overview. Introduction. Gen-Z Messaging Page 1 of 6 Messaging Overview Introduction Gen-Z is a new data access technology that not only enhances memory and data storage solutions, but also provides a framework for both optimized and traditional

More information

Database Replication in Tashkent. CSEP 545 Transaction Processing Sameh Elnikety

Database Replication in Tashkent. CSEP 545 Transaction Processing Sameh Elnikety Database Replication in Tashkent CSEP 545 Transaction Processing Sameh Elnikety Replication for Performance Expensive Limited scalability DB Replication is Challenging Single database system Large, persistent

More information

The Google File System

The Google File System The Google File System Sanjay Ghemawat, Howard Gobioff and Shun Tak Leung Google* Shivesh Kumar Sharma fl4164@wayne.edu Fall 2015 004395771 Overview Google file system is a scalable distributed file system

More information

management systems Elena Baralis, Silvia Chiusano Politecnico di Torino Pag. 1 Distributed architectures Distributed Database Management Systems

management systems Elena Baralis, Silvia Chiusano Politecnico di Torino Pag. 1 Distributed architectures Distributed Database Management Systems atabase Management Systems istributed database istributed architectures atabase Management Systems istributed atabase Management Systems ata and computation are distributed over different machines ifferent

More information

Protecting Mission-Critical Application Environments The Top 5 Challenges and Solutions for Backup and Recovery

Protecting Mission-Critical Application Environments The Top 5 Challenges and Solutions for Backup and Recovery White Paper Business Continuity Protecting Mission-Critical Application Environments The Top 5 Challenges and Solutions for Backup and Recovery Table of Contents Executive Summary... 1 Key Facts About

More information

StarWind Storage Appliance

StarWind Storage Appliance #1 HyperConverged Appliance for SMB and ROBO StarWind Storage Appliance Value Proposition For SMBs and Enterprises who are looking for high performance primary storage for server virtualization, VDI, database

More information

LATEST INTEL TECHNOLOGIES POWER NEW PERFORMANCE LEVELS ON VMWARE VSAN

LATEST INTEL TECHNOLOGIES POWER NEW PERFORMANCE LEVELS ON VMWARE VSAN LATEST INTEL TECHNOLOGIES POWER NEW PERFORMANCE LEVELS ON VMWARE VSAN Russ Fellows Enabling you to make the best technology decisions November 2017 EXECUTIVE OVERVIEW* The new Intel Xeon Scalable platform

More information

A Distributed Hash Table for Shared Memory

A Distributed Hash Table for Shared Memory A Distributed Hash Table for Shared Memory Wytse Oortwijn Formal Methods and Tools, University of Twente August 31, 2015 Wytse Oortwijn (Formal Methods and Tools, AUniversity Distributed of Twente) Hash

More information

Hadoop Virtualization Extensions on VMware vsphere 5 T E C H N I C A L W H I T E P A P E R

Hadoop Virtualization Extensions on VMware vsphere 5 T E C H N I C A L W H I T E P A P E R Hadoop Virtualization Extensions on VMware vsphere 5 T E C H N I C A L W H I T E P A P E R Table of Contents Introduction... 3 Topology Awareness in Hadoop... 3 Virtual Hadoop... 4 HVE Solution... 5 Architecture...

More information

The Google File System

The Google File System The Google File System Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung SOSP 2003 presented by Kun Suo Outline GFS Background, Concepts and Key words Example of GFS Operations Some optimizations in

More information

TPC-E testing of Microsoft SQL Server 2016 on Dell EMC PowerEdge R830 Server and Dell EMC SC9000 Storage

TPC-E testing of Microsoft SQL Server 2016 on Dell EMC PowerEdge R830 Server and Dell EMC SC9000 Storage TPC-E testing of Microsoft SQL Server 2016 on Dell EMC PowerEdge R830 Server and Dell EMC SC9000 Storage Performance Study of Microsoft SQL Server 2016 Dell Engineering February 2017 Table of contents

More information

Scalability of web applications

Scalability of web applications Scalability of web applications CSCI 470: Web Science Keith Vertanen Copyright 2014 Scalability questions Overview What's important in order to build scalable web sites? High availability vs. load balancing

More information

Last Class Carnegie Mellon Univ. Dept. of Computer Science /615 - DB Applications

Last Class Carnegie Mellon Univ. Dept. of Computer Science /615 - DB Applications Last Class Carnegie Mellon Univ. Dept. of Computer Science 15-415/615 - DB Applications C. Faloutsos A. Pavlo Lecture#23: Concurrency Control Part 3 (R&G ch. 17) Lock Granularities Locking in B+Trees The

More information

Implementing SQL Server 2016 with Microsoft Storage Spaces Direct on Dell EMC PowerEdge R730xd

Implementing SQL Server 2016 with Microsoft Storage Spaces Direct on Dell EMC PowerEdge R730xd Implementing SQL Server 2016 with Microsoft Storage Spaces Direct on Dell EMC PowerEdge R730xd Performance Study Dell EMC Engineering October 2017 A Dell EMC Performance Study Revisions Date October 2017

More information

Lenovo Database Configuration

Lenovo Database Configuration Lenovo Database Configuration for Microsoft SQL Server OLTP on Flex System with DS6200 Reduce time to value with pretested hardware configurations - 20TB Database and 3 Million TPM OLTP problem and a solution

More information

FOEDUS: OLTP Engine for a Thousand Cores and NVRAM

FOEDUS: OLTP Engine for a Thousand Cores and NVRAM FOEDUS: OLTP Engine for a Thousand Cores and NVRAM Hideaki Kimura HP Labs, Palo Alto, CA Slides By : Hideaki Kimura Presented By : Aman Preet Singh Next-Generation Server Hardware? HP The Machine UC Berkeley

More information

Everything You Need to Know About MySQL Group Replication

Everything You Need to Know About MySQL Group Replication Everything You Need to Know About MySQL Group Replication Luís Soares (luis.soares@oracle.com) Principal Software Engineer, MySQL Replication Lead Copyright 2017, Oracle and/or its affiliates. All rights

More information

MySQL Database Scalability

MySQL Database Scalability MySQL Database Scalability Nextcloud Conference 2016 TU Berlin Oli Sennhauser Senior MySQL Consultant at FromDual GmbH oli.sennhauser@fromdual.com 1 / 14 About FromDual GmbH Support Consulting remote-dba

More information

<Insert Picture Here> MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure

<Insert Picture Here> MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure Mario Beck (mario.beck@oracle.com) Principal Sales Consultant MySQL Session Agenda Requirements for

More information

Recoverability. Kathleen Durant PhD CS3200

Recoverability. Kathleen Durant PhD CS3200 Recoverability Kathleen Durant PhD CS3200 1 Recovery Manager Recovery manager ensures the ACID principles of atomicity and durability Atomicity: either all actions in a transaction are done or none are

More information

VERITAS Storage Foundation 4.0 TM for Databases

VERITAS Storage Foundation 4.0 TM for Databases VERITAS Storage Foundation 4.0 TM for Databases Powerful Manageability, High Availability and Superior Performance for Oracle, DB2 and Sybase Databases Enterprises today are experiencing tremendous growth

More information

CSE 544 Principles of Database Management Systems

CSE 544 Principles of Database Management Systems CSE 544 Principles of Database Management Systems Alvin Cheung Fall 2015 Lecture 5 - DBMS Architecture and Indexing 1 Announcements HW1 is due next Thursday How is it going? Projects: Proposals are due

More information