HyPer on Cloud 9. Thomas Neumann. February 10, Technische Universität München
|
|
- Mervin Flowers
- 5 years ago
- Views:
Transcription
1 HyPer on Cloud 9 Thomas Neumann Technische Universität München February 10, 2016
2 HyPer HyPer is the main-memory database system developed in our group a very fast database system with ACID transactions quite comprehensive SQL support (SQL92 plus many SQL99 features) queries are compiled into machine code using LLVM very little overhead, excellent performance You can try it out (or download a demo) online: We use it for teaching, too. (Easy to use web interface for students). Thomas Neumann HyPer on Cloud 9 2 / 13
3 Hyper in the Cloud Currently HyPer is primarily run on a single dedicated node but of course we looked at running HyPer in the cloud Wolf Rödiger will talk about distributed processing next I will concentrate on issues we found during experiments some of them quite surprising; could be relevant for your project, too not all of them have perfect solutions yet Thomas Neumann HyPer on Cloud 9 3 / 13
4 Costs of Virtualization Cloud applications usually run on virtualized machined shields the application from the actual hardware simplified management, allows for easy migration, etc. several different techniques to do that (containers, hypervisors, etc.) what does that actually cost? in-memory processing is CPU bound in an ideal world virtualization would be (nearly) free Thomas Neumann HyPer on Cloud 9 4 / 13
5 Thomas Neumann HyPer on Cloud 9 5 / 13
6 Thomas Neumann HyPer on Cloud 9 6 / 13
7 Thomas Neumann HyPer on Cloud 9 7 / 13
8 Thomas Neumann HyPer on Cloud 9 8 / 13
9 Thomas Neumann HyPer on Cloud 9 9 / 13
10 Cooperative Behavior Cloud infrastructure relies upon cooperative behavior virtualized applications utilize ideal resources combined system looks more powerful than it really is hogging resources is bad for cloud processing Main resource for HyPer (and probably most other DBMSs): main memory not only for the real data, but also for intermediate results some queries have large intermediate results allocate a lot of memory Thomas Neumann HyPer on Cloud 9 10 / 13
11 Allocating and Releasing Memory Memory allocation is a multi-step process: 1. pages are created in the page table, but (usually) not physically assigned 2. upon access, pages are assigned and zeroed out 3. upon release, page table is modified again Problem: this does not scale a lot of locking and overhead within the kernel costs more than a factor 2 performance with large queries on large hosts even more pronounced on Windows, there freeing pages is surprisingly expensive Solutions? never release memory? (i.e., do your own memory management) best performance, but hogs resources non-portable techniques? (madvise etc.) Unsatisfying. Thomas Neumann HyPer on Cloud 9 11 / 13
12 Durability For ACID, we must 1) not report commit until the WAL record hits durable media, and 2) avoid data corruption during writes. difficult to guarantee even on bare metal often requires battery-backed RAID controllers to avoid all issues (including tearing) How do we guarantee that on virtualized hardware? nearly impossible the (virtualized) hardware lies to us an inconveniently timed power failure can lead to a disaster Microsoft installed special hooks for their cloud DBMS, but that is not publicly usable other vendors probably, too, but not generic solutions exists Thomas Neumann HyPer on Cloud 9 12 / 13
13 Conclusion Databases in the cloud are an interesting topic many effects that we see are implementation artifacts choice of virtualization product has a large impact many open issues concerning both performance and correctness but apparently many people are happy with living dangerously Still many open research questions. Thomas Neumann HyPer on Cloud 9 13 / 13
14 HyPer on Cloud 9 Wolf Rödiger Technische Universität München
15 Traditional Data Warehouse OLTP Extract Transform Load OLAP Database Data Warehouse Wolf Rödiger HyPer on Cloud 9 2 / 28
16 HyPer: Hybrid OLTP & OLAP HyPer OLTP & OLAP on the same data at the same time Efficient snapshotting Data-centric code generation OLTP OLAP > 100,000 TPC-C TX/s Best-of-breed response times Real-time analytics fork() OLTP OLAP snapshot a* OLTP process a copy on update OLAP Wolf Rödiger HyPer on Cloud 9 3 / 28
17 Wishlist for HyPer in the Cloud Scale query performance with the cluster size Sustain transaction performance Scale capacity to process larger workloads Elastically add servers to the cluster Provide high availability Wolf Rödiger HyPer on Cloud 9 4 / 28
18 1st Design: Full Replication via Redo Log Multicasting primary HyPer persistent storage OLTP multicast redo log redo log snapshot secondaries OLAP load balancing Tobias Mühlbauer, Wolf Rödiger, Angelika Reiser, Alfons Kemper, Thomas Neumann, ScyPer: Elastic OLAP Throughput on Transactional Data, DanaC Wolf Rödiger HyPer on Cloud 9 5 / 28
19 Scale query throughput, sustain transaction performance 150k OLTP throughput OLAP throughput 60k transactions/s 100k 50k 40k 20k queries/h number of nodes 0 Wolf Rödiger HyPer on Cloud 9 6 / 28
20 Elasticity: Materialized snapshots and multicasting primary HyPer persistent storage OLTP 3. subscribe to multicast 2. apply redo log 1. load snapshot secondaries new server OLAP 4. receive queries Wolf Rödiger HyPer on Cloud 9 7 / 28
21 High Availability: Secondaries can replace the primary primary HyPer persistent storage 3. start multicast 1. replay redo log secondaries 2. accept OLTP OLTP OLAP new primary Wolf Rödiger HyPer on Cloud 9 8 / 28
22 Multi-Tenancy: Flexible database-as-a-service deployment secondary primary primary Tenant 1 Tenant 1 Tenant 2 Tenant 4 Tenant 3 Tenant 1 Tenant 4 Tenant 4 Tenant 2 Tenant 3 secondary secondary secondary Wolf Rödiger HyPer on Cloud 9 9 / 28
23 1st Design: Full Replication via Redo Log Multicasting Scales query throughput with the cluster size Sustains transaction performance Allows to elastically add servers to the cluster Provides high availability Limited to workloads that fit into a single server Same query response times as a single server Wolf Rödiger HyPer on Cloud 9 10 / 28
24 2nd Design: Query Processing on Fragmented Relations Lineitem Lineitem 11 Customer Customer 11 Orders Orders 11 Lineitem Lineitem 21 Customer Customer 21 Orders Orders 21 Lineitem Lineitem 31 Customer Customer 31 Orders Orders 31 Lineitem Lineitem 41 Customer Customer 41 Orders Orders 41 Lineitem Lineitem 51 Customer Customer 51 Orders Orders 51 Lineitem Lineitem 61 Customer Customer 61 Orders Orders 61 Fragment relations across servers to use the combined capacity of the cluster, making room for larger workloads Wolf Rödiger, Tobias Mühlbauer, Alfons Kemper, Thomas Neumann, High-Speed Query Processing over High-Speed Networks, VLDB Wolf Rödiger HyPer on Cloud 9 11 / 28
25 Main Challenges 128 GB 59.7 GB/s CPU CPU 0 CPU CPU 1 10 cores 16 GB/s QPI QPI 10 cores 59.7 GB/s 128 GB 16 GB/s PCIe GB/s host 0 host 1 4 GB/s Infiniband 4 QDR HCA host 2 host 3 host 4 host 5 Challenge 1: Network is a bottleneck to query processing Challenge 2: Utilize all cores and all servers Wolf Rödiger HyPer on Cloud 9 12 / 28
26 Challenge 1: Network is a Bottleneck Network is bottleneck for distributed processing Manual schema-tuning is time-consuming and workload-dependent TPC-H speed-up (SF 100) Faster network hardware is not enough, software 0 has to change TCP/IP (40 Gb/s InfiniBand) TCP/IP (1 Gb/s Ethernet) number of servers Wolf Rödiger HyPer on Cloud 9 13 / 28
27 Remote Direct Memory Access Zero-copy network communication: High network throughput Almost no CPU cost Less memory bus traffic Less CPU cost, higher throughput than TCP RDMA achieves full speed at only 4 % CPU load Recently used to implement a distributed radix join* Application Buffer Operating System sender Application Buffer Operating System receiver HCA HCA *Barthels et al., Rack-Scale In-Memory Join Processing using RDMA, SIGMOD Wolf Rödiger HyPer on Cloud 9 14 / 28
28 Network Scheduling Uncoordinated network communication causes switch contention Communicate in a strict round-robin fashion Synchronize via low-latency RDMA operations Network scheduling improves throughput by 40 % for an 8-server InfiniBand cluster throughput in GB/s all-to-all scheduling number of servers Wolf Rödiger HyPer on Cloud 9 15 / 28
29 Challenge 2: Utilize all cores and all servers Exchange partitions tuples for joins and aggregations Traditionally used to parallelize for multiple cores and servers But: Exchange does not scale well due to inflexible design TPC-H speed-up (SF 300) HyPer ( operators) Vectorwise ( operators) 6 (1) 30 (5) 60 (10) 90 (15) 120 (20) number of cores (per server) Wolf Rödiger HyPer on Cloud 9 16 / 28
30 Classic Exchange Fixed degree of parallelism An needs a buffer for every other : #buffers per server = servers cores 2 2,400 buffers 1 GB of main memory per server Also: Each join key value is processed by a specific operator Heavy hitters are assigned to a single Wolf Rödiger HyPer on Cloud 9 17 / 28
31 Decoupled Exchange Communicate indirectly via communication multiplexers Address servers not threads: Decreases memory consumption Reduces negative impact of heavy hitters Improves applicability of broadcast optimization Local load balancing via work stealing* important for good scalability CM CM CM *Leis et al., Morsel-driven parallelism: a NUMA-aware query evaluation framework for the many-core age, SIGMOD Wolf Rödiger HyPer on Cloud 9 18 / 28
32 Hybrid Parallelism = RDMA-based Communication + Decoupled Exchange Operators server 0 NUMA socket 1 parallelized query pipeline parallelized query pipeline 7. produce tuples 6. deserialize tuples NUMA socket 0 per thread next operator per thread 1. consume tuples prev. operator RDMA Key NUMA node retain count ID last message bytes used content incoming message 2. partition + serialize tuples 5a. receive local or 5b. steal work communication multiplexer receive queue NUMA socket 1 receive queue NUMA socket 0 per server network scheduler per server RDMA RDMA Key RDMA Key Key NUMA NUMA node NUMA node node retain retain count retain count count ID 3. send message ID ID when full last message last message bytes used used bytes used content content 4. reuse message outgoing messages message pool send queues per socket Wolf Rödiger HyPer on Cloud 9 19 / 28
33 Hybrid parallelism scales in the number of cores... TPC-H speed-up (SF 300) HyPer (hybrid parallelism) HyPer () Vectorwise () 6 (1) 30 (5) 60 (10) 90 (15) 120 (20) number of cores (per server) Wolf Rödiger HyPer on Cloud 9 20 / 28
34 ... and in the number of servers in the cluster TPC-H speed-up (SF 100) RDMA (40 Gb/s InfiniBand) + scheduling TCP/IP (40 Gb/s InfiniBand) TCP/IP (1 Gb/s Ethernet) number of servers Wolf Rödiger HyPer on Cloud 9 21 / 28
35 How do we compare? HyPer (partitioned) 20,739 HyPer (chunked) 16,090 Vectorwise (partitioned) 3,856 MemSQL (partitioned) Impala (chunked) Spark SQL (chunked) k 10k 15k 20k 25k TPC-H queries per hour (SF 100) Note: MemSQL takes several seconds for query compilation, but these compilation times are not included in this experiment. Wolf Rödiger HyPer on Cloud 9 22 / 28
36 2nd Design: Query Processing on Fragmented Relations Utilizes the combined capacity of the cluster Reduces query response times Scales query throughput with the cluster size? How can we add servers with minimal disruption?? How can we survive server failures?? How can we sustain the transaction performance? Wolf Rödiger HyPer on Cloud 9 23 / 28
37 Outlook: Elasticity via Highest Random Weight Hashing e5fa4 4f2b3 1c1fb 7448d 8798a ff5e c13ff 7d5d9 a3db5 24e7a f6f9e Orders 3 Orders 5 Orders 2 Orders 6 new server 553b6 021e7 360d0 Orders 1 Orders 4 452e2 56f9b 62d4b Redistribute data when servers are added/removed Avoids reshuffling the whole database, balances load evenly Mukherjee et al., Distributed Architecture of Oracle Database In-memory, VLDB Wolf Rödiger HyPer on Cloud 9 24 / 28
38 Outlook: High availability via HDFS-style replication Orders PART 5 Orders PART 1 Orders PART 4 8 Orders PART 2 5 Orders PART 1 2 Orders PART 3 Orders PART 5 Orders PART 2 Orders PART 3 Orders PART 2 Orders PART 3 Orders PART 5 Orders PART 4 Orders PART 4 Orders PART 1 Replication factor of x allows for x 1 server failures Wolf Rödiger HyPer on Cloud 9 25 / 28
39 Outlook: Sustain transaction performance OLAP Distributed transactions are overly expensive: Two-phase commit Global locking Deadlock detection H-Store model: partitioned execution of transactions Works well for TPC-C Drawbacks: Requires schema tuning Not for all workloads OLTP OLTP OLTP Wolf Rödiger HyPer on Cloud 9 26 / 28
40 Outlook: Hot/cold approach for distributed transactions primary HyPer HOT OLTP multicast redo log secondaries COLD COLD 7 7 HOT HOT COLD COLD 4 4 COLD COLD 1 1 COLD COLD 8 8 HOT HOT COLD COLD 5 5 COLD COLD 2 2 HOT HOT COLD COLD 9 9 COLD COLD 6 6 COLD COLD 3 3 OLAP distributed queries on global TX-consistent snapshots Wolf Rödiger HyPer on Cloud 9 27 / 28
41 HyPer on Cloud 9 Scale capacity to process larger workloads Reduce query response times Scale query throughput with the cluster size Elastically add servers to the cluster Provide high availability Sustain transaction performance Wolf Rödiger HyPer on Cloud 9 28 / 28
42 Backup Slides Wolf Rödiger HyPer on Cloud 9 1 / 5
43 Keeping secondaries up-to-date Logical log: transaction identifier and parameters Physical log: insert, update and delete operations Replay log on secondaries Logical log excludes: Readers Aborts Redo logging Undo logging Physical log also excludes: Read operations Costly transaction logic execution time 800 ms 600 ms 400 ms 200 ms 0 ms writers readers aborts normal execution 23% faster logical log replaying 56% faster physical log replaying Wolf Rödiger HyPer on Cloud 9 2 / 5
44 Redo Log Multicasting Independent of cluster size UDP unreliable, PGM reliable TPC-C transactions 60,000 log entries/s 1,500 B physical log entry/tx 250 B logical log entry/tx 1 GbE vs. InfiniBand 1 GbE needs: Group commits LZ4 compression ( 50 %) InfiniBand (IPoIB) is sufficient; PGM is CPU-bound 1 GbE PGM Bandwidth [Mbit/s] 675 Packets [1,000/s] 43 Latency [µs] InfiniBand 4 QDR PGM Bandwidth [Mbit/s] 1,832 Packets [1,000/s] 112 Latency [µs] 13.5 Wolf Rödiger HyPer on Cloud 9 3 / 5
45 Guaranteeing correct results 3. read a 1. read a 2. read a 1. fork() LSN 101 LSN 100 LSN 103 LSN write a=a* a* a a* a Order-preserving serializability violation Diverging distributed reads Logical time defined by log sequence number (LSN) Global TX-consistent snapshots avoid consistency problems Wolf Rödiger HyPer on Cloud 9 4 / 5
46 NUMA one socket interleaved NUMA-aware 0 1k 2k 3k 4k 5k TPC-H queries per hour (SF 100) NUMA-aware allocation of message buffers improves TPC-H performance by a factor of 2 for a 4-socket server Our communication multiplexer provides NUMA-local message buffers to the decoupled operators Wolf Rödiger HyPer on Cloud 9 5 / 5
HyPer-sonic Combined Transaction AND Query Processing
HyPer-sonic Combined Transaction AND Query Processing Thomas Neumann Technische Universität München December 2, 2011 Motivation There are different scenarios for database usage: OLTP: Online Transaction
More informationHyPer-sonic Combined Transaction AND Query Processing
HyPer-sonic Combined Transaction AND Query Processing Thomas Neumann Technische Universität München October 26, 2011 Motivation - OLTP vs. OLAP OLTP and OLAP have very different requirements OLTP high
More informationMain-Memory Databases 1 / 25
1 / 25 Motivation Hardware trends Huge main memory capacity with complex access characteristics (Caches, NUMA) Many-core CPUs SIMD support in CPUs New CPU features (HTM) Also: Graphic cards, FPGAs, low
More informationAccelerating Analytical Workloads
Accelerating Analytical Workloads Thomas Neumann Technische Universität München April 15, 2014 Scale Out in Big Data Analytics Big Data usually means data is distributed Scale out to process very large
More informationAdministration Naive DBMS CMPT 454 Topics. John Edgar 2
Administration Naive DBMS CMPT 454 Topics John Edgar 2 http://www.cs.sfu.ca/coursecentral/454/johnwill/ John Edgar 4 Assignments 25% Midterm exam in class 20% Final exam 55% John Edgar 5 A database stores
More informationData Blocks: Hybrid OLTP and OLAP on Compressed Storage using both Vectorization and Compilation
Data Blocks: Hybrid OLTP and OLAP on Compressed Storage using both Vectorization and Compilation Harald Lang 1, Tobias Mühlbauer 1, Florian Funke 2,, Peter Boncz 3,, Thomas Neumann 1, Alfons Kemper 1 1
More informationUser Perspective. Module III: System Perspective. Module III: Topics Covered. Module III Overview of Storage Structures, QP, and TM
Module III Overview of Storage Structures, QP, and TM Sharma Chakravarthy UT Arlington sharma@cse.uta.edu http://www2.uta.edu/sharma base Management Systems: Sharma Chakravarthy Module I Requirements analysis
More informationHAWQ: A Massively Parallel Processing SQL Engine in Hadoop
HAWQ: A Massively Parallel Processing SQL Engine in Hadoop Lei Chang, Zhanwei Wang, Tao Ma, Lirong Jian, Lili Ma, Alon Goldshuv Luke Lonergan, Jeffrey Cohen, Caleb Welton, Gavin Sherry, Milind Bhandarkar
More informationCrescando: Predictable Performance for Unpredictable Workloads
Crescando: Predictable Performance for Unpredictable Workloads G. Alonso, D. Fauser, G. Giannikis, D. Kossmann, J. Meyer, P. Unterbrunner Amadeus S.A. ETH Zurich, Systems Group (Funded by Enterprise Computing
More informationMemory-Based Cloud Architectures
Memory-Based Cloud Architectures ( Or: Technical Challenges for OnDemand Business Software) Jan Schaffner Enterprise Platform and Integration Concepts Group Example: Enterprise Benchmarking -) *%'+,#$)
More informationHigh Performance Transactions in Deuteronomy
High Performance Transactions in Deuteronomy Justin Levandoski, David Lomet, Sudipta Sengupta, Ryan Stutsman, and Rui Wang Microsoft Research Overview Deuteronomy: componentized DB stack Separates transaction,
More informationMark Falco Oracle Coherence Development
Achieving the performance benefits of Infiniband in Java Mark Falco Oracle Coherence Development 1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy
More information1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. reserved. Insert Information Protection Policy Classification from Slide 8
The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material,
More informationSCHISM: A WORKLOAD-DRIVEN APPROACH TO DATABASE REPLICATION AND PARTITIONING
SCHISM: A WORKLOAD-DRIVEN APPROACH TO DATABASE REPLICATION AND PARTITIONING ZEYNEP KORKMAZ CS742 - PARALLEL AND DISTRIBUTED DATABASE SYSTEMS UNIVERSITY OF WATERLOO OUTLINE. Background 2. What is Schism?
More informationMorsel- Drive Parallelism: A NUMA- Aware Query Evaluation Framework for the Many- Core Age. Presented by Dennis Grishin
Morsel- Drive Parallelism: A NUMA- Aware Query Evaluation Framework for the Many- Core Age Presented by Dennis Grishin What is the problem? Efficient computation requires distribution of processing between
More informationData Blocks: Hybrid OLTP and OLAP on compressed storage
Data Blocks: Hybrid OLTP and OLAP on compressed storage Ben Brümmer Technische Universität München Fürstenfeldbruck, 26. November 208 Ben Brümmer 26..8 Lehrstuhl für Datenbanksysteme Problem HDD/Archive/Tape-Storage
More informationIn-Memory Columnar Databases - Hyper (November 2012)
1 In-Memory Columnar Databases - Hyper (November 2012) Arto Kärki, University of Helsinki, Helsinki, Finland, arto.karki@tieto.com Abstract Relational database systems are today the most common database
More informationVOLTDB + HP VERTICA. page
VOLTDB + HP VERTICA ARCHITECTURE FOR FAST AND BIG DATA ARCHITECTURE FOR FAST + BIG DATA FAST DATA Fast Serve Analytics BIG DATA BI Reporting Fast Operational Database Streaming Analytics Columnar Analytics
More informationNutanix Tech Note. Virtualizing Microsoft Applications on Web-Scale Infrastructure
Nutanix Tech Note Virtualizing Microsoft Applications on Web-Scale Infrastructure The increase in virtualization of critical applications has brought significant attention to compute and storage infrastructure.
More informationThe mixed workload CH-BenCHmark. Hybrid y OLTP&OLAP Database Systems Real-Time Business Intelligence Analytical information at your fingertips
The mixed workload CH-BenCHmark Hybrid y OLTP&OLAP Database Systems Real-Time Business Intelligence Analytical information at your fingertips Richard Cole (ParAccel), Florian Funke (TU München), Leo Giakoumakis
More informationTrack Join. Distributed Joins with Minimal Network Traffic. Orestis Polychroniou! Rajkumar Sen! Kenneth A. Ross
Track Join Distributed Joins with Minimal Network Traffic Orestis Polychroniou Rajkumar Sen Kenneth A. Ross Local Joins Algorithms Hash Join Sort Merge Join Index Join Nested Loop Join Spilling to disk
More informationCSE 544: Principles of Database Systems
CSE 544: Principles of Database Systems Anatomy of a DBMS, Parallel Databases 1 Announcements Lecture on Thursday, May 2nd: Moved to 9am-10:30am, CSE 403 Paper reviews: Anatomy paper was due yesterday;
More informationStorage Optimization with Oracle Database 11g
Storage Optimization with Oracle Database 11g Terabytes of Data Reduce Storage Costs by Factor of 10x Data Growth Continues to Outpace Budget Growth Rate of Database Growth 1000 800 600 400 200 1998 2000
More informationHuge market -- essentially all high performance databases work this way
11/5/2017 Lecture 16 -- Parallel & Distributed Databases Parallel/distributed databases: goal provide exactly the same API (SQL) and abstractions (relational tables), but partition data across a bunch
More informationRethinking Distributed Query Execution on High-Speed Networks
Rethinking Distributed Query Execution on High-Speed Networks Abdallah Salama *, Carsten Binnig, Tim Kraska, Ansgar Scherp *, Tobias Ziegler Brown University, Providence, RI, USA * Kiel University, Germany
More informationPerformance and Scalability with Griddable.io
Performance and Scalability with Griddable.io Executive summary Griddable.io is an industry-leading timeline-consistent synchronized data integration grid across a range of source and target data systems.
More informationBe Fast, Cheap and in Control with SwitchKV. Xiaozhou Li
Be Fast, Cheap and in Control with SwitchKV Xiaozhou Li Goal: fast and cost-efficient key-value store Store, retrieve, manage key-value objects Get(key)/Put(key,value)/Delete(key) Target: cluster-level
More informationThe End of a Myth: Distributed Transactions Can Scale
The End of a Myth: Distributed Transactions Can Scale Erfan Zamanian, Carsten Binnig, Tim Harris, and Tim Kraska Rafael Ketsetsides Why distributed transactions? 2 Cheaper for the same processing power
More informationCSE 190D Database System Implementation
CSE 190D Database System Implementation Arun Kumar Topic 6: Transaction Management Chapter 16 of Cow Book Slide ACKs: Jignesh Patel 1 Transaction Management Motivation and Basics The ACID Properties Transaction
More informationCIS 601 Graduate Seminar. Dr. Sunnie S. Chung Dhruv Patel ( ) Kalpesh Sharma ( )
Guide: CIS 601 Graduate Seminar Presented By: Dr. Sunnie S. Chung Dhruv Patel (2652790) Kalpesh Sharma (2660576) Introduction Background Parallel Data Warehouse (PDW) Hive MongoDB Client-side Shared SQL
More informationSystems Infrastructure for Data Science. Web Science Group Uni Freiburg WS 2014/15
Systems Infrastructure for Data Science Web Science Group Uni Freiburg WS 2014/15 Lecture X: Parallel Databases Topics Motivation and Goals Architectures Data placement Query processing Load balancing
More informationArchitectural challenges for building a low latency, scalable multi-tenant data warehouse
Architectural challenges for building a low latency, scalable multi-tenant data warehouse Mataprasad Agrawal Solutions Architect, Services CTO 2017 Persistent Systems Ltd. All rights reserved. Our analytics
More informationIX: A Protected Dataplane Operating System for High Throughput and Low Latency
IX: A Protected Dataplane Operating System for High Throughput and Low Latency Adam Belay et al. Proc. of the 11th USENIX Symp. on OSDI, pp. 49-65, 2014. Presented by Han Zhang & Zaina Hamid Challenges
More informationStateless Network Functions:
Stateless Network Functions: Breaking the Tight Coupling of State and Processing Murad Kablan, Azzam Alsudais, Eric Keller, Franck Le University of Colorado IBM Networks Need Network Functions Firewall
More informationMigrating Oracle Databases To Cassandra
BY UMAIR MANSOOB Why Cassandra Lower Cost of ownership makes it #1 choice for Big Data OLTP Applications. Unlike Oracle, Cassandra can store structured, semi-structured, and unstructured data. Cassandra
More informationScaling Without Sharding. Baron Schwartz Percona Inc Surge 2010
Scaling Without Sharding Baron Schwartz Percona Inc Surge 2010 Web Scale!!!! http://www.xtranormal.com/watch/6995033/ A Sharding Thought Experiment 64 shards per proxy [1] 1 TB of data storage per node
More informationAdvanced Computer Networks. End Host Optimization
Oriana Riva, Department of Computer Science ETH Zürich 263 3501 00 End Host Optimization Patrick Stuedi Spring Semester 2017 1 Today End-host optimizations: NUMA-aware networking Kernel-bypass Remote Direct
More informationConcurrency Control In Distributed Main Memory Database Systems. Justin A. DeBrabant
In Distributed Main Memory Database Systems Justin A. DeBrabant debrabant@cs.brown.edu Concurrency control Goal: maintain consistent state of data ensure query results are correct The Gold Standard: ACID
More informationWhite Paper. Major Performance Tuning Considerations for Weblogic Server
White Paper Major Performance Tuning Considerations for Weblogic Server Table of Contents Introduction and Background Information... 2 Understanding the Performance Objectives... 3 Measuring your Performance
More informationOracle Database Exadata Cloud Service Exadata Performance, Cloud Simplicity DATABASE CLOUD SERVICE
Oracle Database Exadata Exadata Performance, Cloud Simplicity DATABASE CLOUD SERVICE Oracle Database Exadata combines the best database with the best cloud platform. Exadata is the culmination of more
More informationMorsel-Driven Parallelism: A NUMA-Aware Query Evaluation Framework for the Many-Core Age
Morsel-Driven Parallelism: A NUMA-Aware Query Evaluation Framework for the Many-Core Age Viktor Leis Peter Boncz Alfons Kemper Thomas Neumann Technische Universität München {leis,kemper,neumann}@in.tum.de
More informationNetworking for Data Acquisition Systems. Fabrice Le Goff - 14/02/ ISOTDAQ
Networking for Data Acquisition Systems Fabrice Le Goff - 14/02/2018 - ISOTDAQ Outline Generalities The OSI Model Ethernet and Local Area Networks IP and Routing TCP, UDP and Transport Efficiency Networking
More informationAnti-Caching: A New Approach to Database Management System Architecture. Guide: Helly Patel ( ) Dr. Sunnie Chung Kush Patel ( )
Anti-Caching: A New Approach to Database Management System Architecture Guide: Helly Patel (2655077) Dr. Sunnie Chung Kush Patel (2641883) Abstract Earlier DBMS blocks stored on disk, with a main memory
More informationHardware Transactional Memory on Haswell
Hardware Transactional Memory on Haswell Viktor Leis Technische Universität München 1 / 15 Introduction transactional memory is a very elegant programming model transaction { transaction { a = a 10; c
More informationGavin Payne Senior Consultant.
Gavin Payne Senior Consultant gavin@coeo.com Getting Started with SQL Server and Virtualisation Designing successful virtual database environments Monitoring a virtual database environments Summary The
More informationHow VoltDB does Transactions
TEHNIL NOTE How does Transactions Note to readers: Some content about read-only transaction coordination in this document is out of date due to changes made in v6.4 as a result of Jepsen testing. Updates
More informationBe Fast, Cheap and in Control with SwitchKV Xiaozhou Li
Be Fast, Cheap and in Control with SwitchKV Xiaozhou Li Raghav Sethi Michael Kaminsky David G. Andersen Michael J. Freedman Goal: fast and cost-effective key-value store Target: cluster-level storage for
More informationEvaluating Cloud Storage Strategies. James Bottomley; CTO, Server Virtualization
Evaluating Cloud Storage Strategies James Bottomley; CTO, Server Virtualization Introduction to Storage Attachments: - Local (Direct cheap) SAS, SATA - Remote (SAN, NAS expensive) FC net Types - Block
More informationConceptual Modeling on Tencent s Distributed Database Systems. Pan Anqun, Wang Xiaoyu, Li Haixiang Tencent Inc.
Conceptual Modeling on Tencent s Distributed Database Systems Pan Anqun, Wang Xiaoyu, Li Haixiang Tencent Inc. Outline Introduction System overview of TDSQL Conceptual Modeling on TDSQL Applications Conclusion
More informationData Transformation and Migration in Polystores
Data Transformation and Migration in Polystores Adam Dziedzic, Aaron Elmore & Michael Stonebraker September 15th, 2016 Agenda Data Migration for Polystores: What & Why? How? Acceleration of physical data
More informationAccelerate MySQL for Demanding OLAP and OLTP Use Cases with Apache Ignite. Peter Zaitsev, Denis Magda Santa Clara, California April 25th, 2017
Accelerate MySQL for Demanding OLAP and OLTP Use Cases with Apache Ignite Peter Zaitsev, Denis Magda Santa Clara, California April 25th, 2017 About the Presentation Problems Existing Solutions Denis Magda
More informationOracle Exadata: Strategy and Roadmap
Oracle Exadata: Strategy and Roadmap - New Technologies, Cloud, and On-Premises Juan Loaiza Senior Vice President, Database Systems Technologies, Oracle Safe Harbor Statement The following is intended
More informationCopyright 2011, Oracle and/or its affiliates. All rights reserved.
The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material,
More informationMULTI-THREADED QUERIES
15-721 Project 3 Final Presentation MULTI-THREADED QUERIES Wendong Li (wendongl) Lu Zhang (lzhang3) Rui Wang (ruiw1) Project Objective Intra-operator parallelism Use multiple threads in a single executor
More informationEMC XTREMCACHE ACCELERATES VIRTUALIZED ORACLE
White Paper EMC XTREMCACHE ACCELERATES VIRTUALIZED ORACLE EMC XtremSF, EMC XtremCache, EMC Symmetrix VMAX and Symmetrix VMAX 10K, XtremSF and XtremCache dramatically improve Oracle performance Symmetrix
More informationIBM POWER8 100 GigE Adapter Best Practices
Introduction IBM POWER8 100 GigE Adapter Best Practices With higher network speeds in new network adapters, achieving peak performance requires careful tuning of the adapters and workloads using them.
More informationCSE 344 Final Review. August 16 th
CSE 344 Final Review August 16 th Final In class on Friday One sheet of notes, front and back cost formulas also provided Practice exam on web site Good luck! Primary Topics Parallel DBs parallel join
More informationRACKSPACE ONMETAL I/O V2 OUTPERFORMS AMAZON EC2 BY UP TO 2X IN BENCHMARK TESTING
RACKSPACE ONMETAL I/O V2 OUTPERFORMS AMAZON EC2 BY UP TO 2X IN BENCHMARK TESTING EXECUTIVE SUMMARY Today, businesses are increasingly turning to cloud services for rapid deployment of apps and services.
More informationCurrent Topics in OS Research. So, what s hot?
Current Topics in OS Research COMP7840 OSDI Current OS Research 0 So, what s hot? Operating systems have been around for a long time in many forms for different types of devices It is normally general
More informationB.H.GARDI COLLEGE OF ENGINEERING & TECHNOLOGY (MCA Dept.) Parallel Database Database Management System - 2
Introduction :- Today single CPU based architecture is not capable enough for the modern database that are required to handle more demanding and complex requirements of the users, for example, high performance,
More informationDatabase Architectures
Database Architectures CPS352: Database Systems Simon Miner Gordon College Last Revised: 4/15/15 Agenda Check-in Parallelism and Distributed Databases Technology Research Project Introduction to NoSQL
More informationData Modeling and Databases Ch 14: Data Replication. Gustavo Alonso, Ce Zhang Systems Group Department of Computer Science ETH Zürich
Data Modeling and Databases Ch 14: Data Replication Gustavo Alonso, Ce Zhang Systems Group Department of Computer Science ETH Zürich Database Replication What is database replication The advantages of
More informationJyotheswar Kuricheti
Jyotheswar Kuricheti 1 Agenda: 1. Performance Tuning Overview 2. Identify Bottlenecks 3. Optimizing at different levels : Target Source Mapping Session System 2 3 Performance Tuning Overview: 4 What is
More informationS. Narravula, P. Balaji, K. Vaidyanathan, H.-W. Jin and D. K. Panda. The Ohio State University
Architecture for Caching Responses with Multiple Dynamic Dependencies in Multi-Tier Data- Centers over InfiniBand S. Narravula, P. Balaji, K. Vaidyanathan, H.-W. Jin and D. K. Panda The Ohio State University
More informationSchema-Agnostic Indexing with Azure Document DB
Schema-Agnostic Indexing with Azure Document DB Introduction Azure DocumentDB is Microsoft s multi-tenant distributed database service for managing JSON documents at Internet scale Multi-tenancy is an
More informationWhat s New in MySQL 5.7 Geir Høydalsvik, Sr. Director, MySQL Engineering. Copyright 2015, Oracle and/or its affiliates. All rights reserved.
What s New in MySQL 5.7 Geir Høydalsvik, Sr. Director, MySQL Engineering Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes
More informationPostgreSQL Cluster. Mar.16th, Postgres XC Write Scalable Cluster
Postgres XC: Write Scalable PostgreSQL Cluster NTT Open Source Software Center EnterpriseDB Corp. Postgres XC Write Scalable Cluster 1 What is Postgres XC (or PG XC)? Wit Write scalable lbl PostgreSQL
More informationBuilding Durable Real-time Data Pipeline
Building Durable Real-time Data Pipeline Apache BookKeeper at Twitter @sijieg Twitter Background Layered Architecture Agenda Design Details Performance Scale @Twitter Q & A Publish-Subscribe Online services
More informationSpark Over RDMA: Accelerate Big Data SC Asia 2018 Ido Shamay Mellanox Technologies
Spark Over RDMA: Accelerate Big Data SC Asia 2018 Ido Shamay 1 Apache Spark - Intro Spark within the Big Data ecosystem Data Sources Data Acquisition / ETL Data Storage Data Analysis / ML Serving 3 Apache
More informationDistributed Databases
Distributed Databases Chapter 22, Part B Database Management Systems, 2 nd Edition. R. Ramakrishnan and Johannes Gehrke 1 Introduction Data is stored at several sites, each managed by a DBMS that can run
More informationSurvey of ETSI NFV standardization documents BY ABHISHEK GUPTA FRIDAY GROUP MEETING FEBRUARY 26, 2016
Survey of ETSI NFV standardization documents BY ABHISHEK GUPTA FRIDAY GROUP MEETING FEBRUARY 26, 2016 VNFaaS (Virtual Network Function as a Service) In our present work, we consider the VNFaaS use-case
More informationCMU SCS CMU SCS Who: What: When: Where: Why: CMU SCS
Carnegie Mellon Univ. Dept. of Computer Science 15-415/615 - DB s C. Faloutsos A. Pavlo Lecture#23: Distributed Database Systems (R&G ch. 22) Administrivia Final Exam Who: You What: R&G Chapters 15-22
More informationSoftware-Defined Data Infrastructure Essentials
Software-Defined Data Infrastructure Essentials Cloud, Converged, and Virtual Fundamental Server Storage I/O Tradecraft Greg Schulz Server StorageIO @StorageIO 1 of 13 Contents Preface Who Should Read
More informationCPS 512 midterm exam #1, 10/7/2016
CPS 512 midterm exam #1, 10/7/2016 Your name please: NetID: Answer all questions. Please attempt to confine your answers to the boxes provided. If you don t know the answer to a question, then just say
More informationBig and Fast. Anti-Caching in OLTP Systems. Justin DeBrabant
Big and Fast Anti-Caching in OLTP Systems Justin DeBrabant Online Transaction Processing transaction-oriented small footprint write-intensive 2 A bit of history 3 OLTP Through the Years relational model
More informationLow Overhead Concurrency Control for Partitioned Main Memory Databases. Evan P. C. Jones Daniel J. Abadi Samuel Madden"
Low Overhead Concurrency Control for Partitioned Main Memory Databases Evan P. C. Jones Daniel J. Abadi Samuel Madden" Banks" Payment Processing" Airline Reservations" E-Commerce" Web 2.0" Problem:" Millions
More informationApril Copyright 2013 Cloudera Inc. All rights reserved.
Hadoop Beyond Batch: Real-time Workloads, SQL-on- Hadoop, and the Virtual EDW Headline Goes Here Marcel Kornacker marcel@cloudera.com Speaker Name or Subhead Goes Here April 2014 Analytic Workloads on
More informationA Comparison of Memory Usage and CPU Utilization in Column-Based Database Architecture vs. Row-Based Database Architecture
A Comparison of Memory Usage and CPU Utilization in Column-Based Database Architecture vs. Row-Based Database Architecture By Gaurav Sheoran 9-Dec-08 Abstract Most of the current enterprise data-warehouses
More informationMessaging Overview. Introduction. Gen-Z Messaging
Page 1 of 6 Messaging Overview Introduction Gen-Z is a new data access technology that not only enhances memory and data storage solutions, but also provides a framework for both optimized and traditional
More informationDatabase Replication in Tashkent. CSEP 545 Transaction Processing Sameh Elnikety
Database Replication in Tashkent CSEP 545 Transaction Processing Sameh Elnikety Replication for Performance Expensive Limited scalability DB Replication is Challenging Single database system Large, persistent
More informationThe Google File System
The Google File System Sanjay Ghemawat, Howard Gobioff and Shun Tak Leung Google* Shivesh Kumar Sharma fl4164@wayne.edu Fall 2015 004395771 Overview Google file system is a scalable distributed file system
More informationmanagement systems Elena Baralis, Silvia Chiusano Politecnico di Torino Pag. 1 Distributed architectures Distributed Database Management Systems
atabase Management Systems istributed database istributed architectures atabase Management Systems istributed atabase Management Systems ata and computation are distributed over different machines ifferent
More informationProtecting Mission-Critical Application Environments The Top 5 Challenges and Solutions for Backup and Recovery
White Paper Business Continuity Protecting Mission-Critical Application Environments The Top 5 Challenges and Solutions for Backup and Recovery Table of Contents Executive Summary... 1 Key Facts About
More informationStarWind Storage Appliance
#1 HyperConverged Appliance for SMB and ROBO StarWind Storage Appliance Value Proposition For SMBs and Enterprises who are looking for high performance primary storage for server virtualization, VDI, database
More informationLATEST INTEL TECHNOLOGIES POWER NEW PERFORMANCE LEVELS ON VMWARE VSAN
LATEST INTEL TECHNOLOGIES POWER NEW PERFORMANCE LEVELS ON VMWARE VSAN Russ Fellows Enabling you to make the best technology decisions November 2017 EXECUTIVE OVERVIEW* The new Intel Xeon Scalable platform
More informationA Distributed Hash Table for Shared Memory
A Distributed Hash Table for Shared Memory Wytse Oortwijn Formal Methods and Tools, University of Twente August 31, 2015 Wytse Oortwijn (Formal Methods and Tools, AUniversity Distributed of Twente) Hash
More informationHadoop Virtualization Extensions on VMware vsphere 5 T E C H N I C A L W H I T E P A P E R
Hadoop Virtualization Extensions on VMware vsphere 5 T E C H N I C A L W H I T E P A P E R Table of Contents Introduction... 3 Topology Awareness in Hadoop... 3 Virtual Hadoop... 4 HVE Solution... 5 Architecture...
More informationThe Google File System
The Google File System Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung SOSP 2003 presented by Kun Suo Outline GFS Background, Concepts and Key words Example of GFS Operations Some optimizations in
More informationTPC-E testing of Microsoft SQL Server 2016 on Dell EMC PowerEdge R830 Server and Dell EMC SC9000 Storage
TPC-E testing of Microsoft SQL Server 2016 on Dell EMC PowerEdge R830 Server and Dell EMC SC9000 Storage Performance Study of Microsoft SQL Server 2016 Dell Engineering February 2017 Table of contents
More informationScalability of web applications
Scalability of web applications CSCI 470: Web Science Keith Vertanen Copyright 2014 Scalability questions Overview What's important in order to build scalable web sites? High availability vs. load balancing
More informationLast Class Carnegie Mellon Univ. Dept. of Computer Science /615 - DB Applications
Last Class Carnegie Mellon Univ. Dept. of Computer Science 15-415/615 - DB Applications C. Faloutsos A. Pavlo Lecture#23: Concurrency Control Part 3 (R&G ch. 17) Lock Granularities Locking in B+Trees The
More informationImplementing SQL Server 2016 with Microsoft Storage Spaces Direct on Dell EMC PowerEdge R730xd
Implementing SQL Server 2016 with Microsoft Storage Spaces Direct on Dell EMC PowerEdge R730xd Performance Study Dell EMC Engineering October 2017 A Dell EMC Performance Study Revisions Date October 2017
More informationLenovo Database Configuration
Lenovo Database Configuration for Microsoft SQL Server OLTP on Flex System with DS6200 Reduce time to value with pretested hardware configurations - 20TB Database and 3 Million TPM OLTP problem and a solution
More informationFOEDUS: OLTP Engine for a Thousand Cores and NVRAM
FOEDUS: OLTP Engine for a Thousand Cores and NVRAM Hideaki Kimura HP Labs, Palo Alto, CA Slides By : Hideaki Kimura Presented By : Aman Preet Singh Next-Generation Server Hardware? HP The Machine UC Berkeley
More informationEverything You Need to Know About MySQL Group Replication
Everything You Need to Know About MySQL Group Replication Luís Soares (luis.soares@oracle.com) Principal Software Engineer, MySQL Replication Lead Copyright 2017, Oracle and/or its affiliates. All rights
More informationMySQL Database Scalability
MySQL Database Scalability Nextcloud Conference 2016 TU Berlin Oli Sennhauser Senior MySQL Consultant at FromDual GmbH oli.sennhauser@fromdual.com 1 / 14 About FromDual GmbH Support Consulting remote-dba
More information<Insert Picture Here> MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure
MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure Mario Beck (mario.beck@oracle.com) Principal Sales Consultant MySQL Session Agenda Requirements for
More informationRecoverability. Kathleen Durant PhD CS3200
Recoverability Kathleen Durant PhD CS3200 1 Recovery Manager Recovery manager ensures the ACID principles of atomicity and durability Atomicity: either all actions in a transaction are done or none are
More informationVERITAS Storage Foundation 4.0 TM for Databases
VERITAS Storage Foundation 4.0 TM for Databases Powerful Manageability, High Availability and Superior Performance for Oracle, DB2 and Sybase Databases Enterprises today are experiencing tremendous growth
More informationCSE 544 Principles of Database Management Systems
CSE 544 Principles of Database Management Systems Alvin Cheung Fall 2015 Lecture 5 - DBMS Architecture and Indexing 1 Announcements HW1 is due next Thursday How is it going? Projects: Proposals are due
More information