Finding the Pitfalls in Query Performance

Size: px
Start display at page:

Download "Finding the Pitfalls in Query Performance"

Transcription

1 Finding the Pitfalls in Query Performance M.L. Kersten P. Koutsourakis Y. Zhang CWI, MonetDB Solutions EU H2020 project ACTiCLOUD

2 The Challenge MonetDB Mar-18 Which system is relatively better? Postgres 10.6 MonetDB Jul-17 Postgres 11 SQLite

3 The Solution, TPC-H? Q4 Q5 Q6 MonetDB Mar-18 Q3 Q7 Q15 Q1 Q2 Q9 Q8 Q13 Q14 Q16 Postgres 10.6 Q10 Q12 Q17 Q11 Q19 Q18 Q21 Q22 MonetDB Jul-17 Q20 Postgres 11 SQLite

4 TPC-H is a collection of random points? MonetDB Mar-18 Postgres 10.6 Query space MonetDB Jul-17 Postgres 11 SQLite

5 TPC-H may miss clarifying queries Query tells a better story for MonetDB MonetDB Mar-18 Postgres 10.6 MonetDB Jul-17 Postgres 11 SQLite

6 TPC-H may miss discriminative queries Postgres is relatively better than MonetDB MonetDB Mar-18 Postgres 10.6 MonetDB Jul-17 Postgres 11 SQLite

7 TPC-H may miss discriminative queries MonetDB Mar-18 MonetDB Degrades on A query Postgres 10.6 MonetDB Jul-17 Postgres 11 SQLite

8 TPC-H may miss discriminative queries MonetDB Mar-18 Postgres 10.6 SQLite is superior MonetDB Jul-17 Postgres 11 SQLite

9 The Challenge Which system is better on a benchmark? MonetDB Mar-18 What queries perform relatively better on system A than system B? Postgres 10.6 MonetDB Jul-17 Postgres 11 SQLite

10 The Challenge MonetDB Mar-18 SQL scalpel Find the discriminative queries Postgres 10.6 MonetDB Jul-17 Postgres 11 SQLite

11 Assumptions SQL scalpel Find the discriminative queries The database schema and data distribution are given A collection of business inspired queries are available No direct access to the Database/DBMS/Platform

12 The Solution, TPC-H as a start Q1 Q2 Q10 Q3 Q9 Q11 Q4 Q8 Q12 Q19 Q5 Q7 Q13 Q18 Q6 Q14 Q17 Q21 Q15 Q16 Q22 MonetDB Mar-18 Postgres 10.6 MonetDB Jul-17 Q20 Postgres 11 SQLite

13 The Solution, TPC-H as a start -- Query 6 Q3 select Q2 Q8 Q14 sum(l_extendedprice * l_discount) as revenue from Q1 Q9 Q13 lineitem Q10 Q12 Q17 where l_shipdate >= date ' Q18 Q11 and l_shipdate < date ' ' + interval '1' year and Q19 Q21 l_discount between and and l_quantity < 24; Q20 Q4 Q5 Q7 Q6 Q15 Q16 Q22 SQLite MonetDB Mar-18 Postgres 10.6 MonetDB Jul-17 Postgres 11

14 The Solution, TPC-H as a start -- Query 6 Q3 Q4 Q5 Q7 select Q2 Q8 sum(l_extendedprice * l_discount) as revenue from Q1 Q9 Q13 lineitem Q10 Q12 where l_shipdate >= date ' Q18 Q11 Q19 Q6 Q14 Q17 Q21 Q15 Q16 Q22 MonetDB Mar-18 Postgres 10.6 MonetDB Jul-17 Q20 Postgres 11 SQLite

15 The Solution, TPC-H as a start -- Query 6 Q3 Q4 Q5 Q7 select Q2 Q8 sum(l_extendedprice * l_discount) as revenue from Q1 Q9 Q13 lineitem Q10 Q12 where l_discount between Q18 Q and and l_quantity < 24; Q19 Q6 Q14 Q17 Q21 Q15 Q16 Q22 MonetDB Mar-18 Postgres 10.6 MonetDB Jul-17 Q20 Postgres 11 SQLite

16 The Solution, TPC-H as a start -- Query 6 Q3 Q4 Q5 Q7 select Q2 Q8 sum(l_extendedprice * l_discount) as revenue from Q1 Q9 Q13 lineitem Q10 Q12 where l_shipdate < date ' ' Q18 Q11 + interval '1' year Q19 Q6 Q14 Q17 Q21 Q15 Q16 Q22 MonetDB Mar-18 Postgres 10.6 MonetDB Jul-17 Q20 Postgres 11 SQLite

17 The Solution, TPC-H as a start -- Query 6 Q3 select Q2 Q8 Q14 sum(l_extendedprice * l_discount) as revenue from Q1 Q9 Q13 lineitem Q10 Q12 Q17 where l_shipdate >= date ' Q18 Q11 and l_shipdate < date ' ' + interval '1' year and Q19 Q21 l_discount between and and l_quantity < 24; Q20 Q4 Q5 Q7 Q6 Q15 Q16 Q22 SQLite MonetDB Mar-18 Postgres 10.6 MonetDB Jul-17 Postgres 11

18 SQLscalpel compiles the query into a grammar -- Query 6 select sum(l_extendedprice * l_discount) as revenue from lineitem where l_shipdate >= date ' and l_shipdate < date ' ' + interval '1' year and l_discount between and and l_quantity < 24; tpch_q6: SELECT ${projection} FROM ${table} WHERE ${pred} ${predlist}* projection: sum(l_extendedprice * l_discount) as revenue table: lineitem pred: l_shipdate >= date ' ' l_shipdate < date ' ' + interval '1' year l_discount between and l_quantity < 24 predlist: AND ${pred}

19 SQLscalpel enumerates templates SELECT ${projection} FROM ${table} WHERE ${pred} SELECT ${projection} FROM ${table} WHERE ${pred} AND ${pred} SELECT ${projection} FROM ${table} WHERE ${pred} AND ${pred} AND ${pred} SELECT ${projection} FROM ${table} WHERE ${pred} AND ${pred} AND ${pred} AND ${pred} tpch_q6: SELECT ${projection} FROM ${table} WHERE ${pred} ${predlist}* projection: sum(l_extendedprice * l_discount) as revenue table: lineitem pred: l_shipdate >= date ' ' l_shipdate < date ' ' + interval '1' year l_discount between and l_quantity < 24 predlist: AND ${pred}

20

21 Architecture Maintain a query pool Pick promising candidates Keep a workflow database SQL scalpel Support analysis

22 SQLscalpel architecture Maintain a query pool Pick promising candidates Keep a workflow database Support analysis SQL scalpel SQLscalpel server Script parser Grammar Template pool Task Generator SQLscalpel client Task Executor SQLscalpel database

23 SQLscalpel architecture Maintain a query pool Pick promising candidates Keep a workflow database Support analysis Strategies: Baseline Random Alter a lexical term Expand a query Prune a query Ditch manually

24 SQLscalpel architecture Maintain a query pool Pick promising candidates Keep a workflow database Support analysis Strategies: FCFS Manual steering Simulated annealing Biased terms

25

26

27

28

29

30

31 SQLscalpel pitfalls Handle SQL dialects Easily generate erroneous queries Brings the target down using Cartesian results tpch_q6: SELECT ${projection} FROM ${table} WHERE ${pred} ${predlist}* projection: sum(l_extendedprice * l_discount) as revenue table: lineitem lineitem, regions pred: l_shipdate >= date ' ' l_shipdate < date ' ' + interval '1' year l_discount between and l_quantity < 24 predlist: AND ${pred}

32 SQLscalpel prototype is up and running: Full-stack infrastructure Drivers for MonetDB, MonetDBlite, SQLite, Postgres Functional enhancements planned: Multi-party public/private projects Built-in forum to share the story

33

34 Divergence Q1: SELECT count(*) FROM nation WHERE nation.n_name='brazil' Q2: SELECT count(*) FROM nation WHERE nation.n_name='brazil AND nation.n_regionkey=1 T A (Q 2 ) T B (Q 2 ) T A (Q 1 ) T B (Q 1 ) T A (Q 2 ) T B (Q 2 ) T B (Q 1 ) T A (Q 1 )

35 Divergence Q1: SELECT count(*) FROM nation WHERE nation.n_name='brazil' Q2: SELECT count(*) FROM nation WHERE nation.n_name='brazil AND nation.n_regionkey=1 < 1 System A is better than B on Q 2 against Q1 T A (Q 2 ) T B (Q 2 ) T B (Q 1 ) T A (Q 1 ) = 1 > 1 System B is better than A on Q 2 against Q1

Vectorized Postgres (VOPS extension) Konstantin Knizhnik Postgres Professional

Vectorized Postgres (VOPS extension) Konstantin Knizhnik Postgres Professional Vectorized Postgres (VOPS extension) Konstantin Knizhnik Postgres Professional Why Postgres is slow on OLAP queries? 1. Unpacking tuple overhead (heap_deform_tuple) 2. Interpretation overhead (invocation

More information

6.830 Problem Set 2 (2017)

6.830 Problem Set 2 (2017) 6.830 Problem Set 2 1 Assigned: Monday, Sep 25, 2017 6.830 Problem Set 2 (2017) Due: Monday, Oct 16, 2017, 11:59 PM Submit to Gradescope: https://gradescope.com/courses/10498 The purpose of this problem

More information

Just In Time Compilation in PostgreSQL 11 and onward

Just In Time Compilation in PostgreSQL 11 and onward Just In Time Compilation in PostgreSQL 11 and onward Andres Freund PostgreSQL Developer & Committer Email: andres@anarazel.de Email: andres.freund@enterprisedb.com Twitter: @AndresFreundTec anarazel.de/talks/2018-09-07-pgopen-jit/jit.pdf

More information

MOIRA A Goal-Oriented Incremental Machine Learning Approach to Dynamic Resource Cost Estimation in Distributed Stream Processing Systems

MOIRA A Goal-Oriented Incremental Machine Learning Approach to Dynamic Resource Cost Estimation in Distributed Stream Processing Systems MOIRA A Goal-Oriented Incremental Machine Learning Approach to Dynamic Resource Cost Estimation in Distributed Stream Processing Systems Daniele Foroni, C. Axenie, S. Bortoli, M. Al Hajj Hassan, R. Acker,

More information

Challenges in Query Optimization. Doug Inkster, Ingres Corp.

Challenges in Query Optimization. Doug Inkster, Ingres Corp. Challenges in Query Optimization Doug Inkster, Ingres Corp. Abstract Some queries are inherently more difficult than others for a query optimizer to generate efficient plans. This session discusses the

More information

When and How to Take Advantage of New Optimizer Features in MySQL 5.6. Øystein Grøvlen Senior Principal Software Engineer, MySQL Oracle

When and How to Take Advantage of New Optimizer Features in MySQL 5.6. Øystein Grøvlen Senior Principal Software Engineer, MySQL Oracle When and How to Take Advantage of New Optimizer Features in MySQL 5.6 Øystein Grøvlen Senior Principal Software Engineer, MySQL Oracle Program Agenda Improvements for disk-bound queries Subquery improvements

More information

Orri Erling (Program Manager, OpenLink Virtuoso), Ivan Mikhailov (Lead Developer, OpenLink Virtuoso).

Orri Erling (Program Manager, OpenLink Virtuoso), Ivan Mikhailov (Lead Developer, OpenLink Virtuoso). Orri Erling (Program Manager, OpenLink Virtuoso), Ivan Mikhailov (Lead Developer, OpenLink Virtuoso). Business Intelligence Extensions for SPARQL Orri Erling and Ivan Mikhailov OpenLink Software, 10 Burlington

More information

35 Database benchmarking 25/10/17 12:11 AM. Database benchmarking

35 Database benchmarking 25/10/17 12:11 AM. Database benchmarking Database benchmarking 1 Database benchmark? What is it? A database benchmark is a sample database and a group of database applications able to run on several different database systems in order to measure

More information

GPU-Accelerated Analytics on your Data Lake.

GPU-Accelerated Analytics on your Data Lake. GPU-Accelerated Analytics on your Data Lake. Data Lake Data Swamp ETL Hell DATA LAKE 0001010100001001011010110 >>>>>>>>>>>>>>>>>>>>>> >>>>>>>> >>>>>> >>>>>>>>>>>>>>>>> >>> >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>>>>>

More information

Midterm Review. March 27, 2017

Midterm Review. March 27, 2017 Midterm Review March 27, 2017 1 Overview Relational Algebra & Query Evaluation Relational Algebra Rewrites Index Design / Selection Physical Layouts 2 Relational Algebra & Query Evaluation 3 Relational

More information

CSIT115/CSIT815 Data Management and Security Assignment 2

CSIT115/CSIT815 Data Management and Security Assignment 2 School of Computing and Information Technology Session: Autumn 2016 University of Wollongong Lecturer: Janusz R. Getta CSIT115/CSIT815 Data Management and Security Assignment 2 Scope This assignment consists

More information

Jignesh M. Patel. Blog:

Jignesh M. Patel. Blog: Jignesh M. Patel Blog: http://bigfastdata.blogspot.com Go back to the design Query Cache from Processing for Conscious 98s Modern (at Algorithms Hardware least for Hash Joins) 995 24 2 Processor Processor

More information

Efficient in-memory query execution using JIT compiling. Han-Gyu Park

Efficient in-memory query execution using JIT compiling. Han-Gyu Park Efficient in-memory query execution using JIT compiling Han-Gyu Park 2012-11-16 CONTENTS Introduction How DCX works Experiment(purpose(at the beginning of this slide), environment, result, analysis & conclusion)

More information

ADMS/VLDB, August 27 th 2018, Rio de Janeiro, Brazil OPTIMIZING GROUP-BY AND AGGREGATION USING GPU-CPU CO-PROCESSING

ADMS/VLDB, August 27 th 2018, Rio de Janeiro, Brazil OPTIMIZING GROUP-BY AND AGGREGATION USING GPU-CPU CO-PROCESSING ADMS/VLDB, August 27 th 2018, Rio de Janeiro, Brazil 1 OPTIMIZING GROUP-BY AND AGGREGATION USING GPU-CPU CO-PROCESSING OPTIMIZING GROUP-BY AND AGGREGATION USING GPU-CPU CO-PROCESSING MOTIVATION OPTIMIZING

More information

TPC-H Benchmark Set. TPC-H Benchmark. DDL for TPC-H datasets

TPC-H Benchmark Set. TPC-H Benchmark. DDL for TPC-H datasets TPC-H Benchmark Set TPC-H Benchmark TPC-H is an ad-hoc and decision support benchmark. Some of queries are available in the current Tajo. You can download the TPC-H data generator here. DDL for TPC-H datasets

More information

Columnstore and B+ tree. Are Hybrid Physical. Designs Important?

Columnstore and B+ tree. Are Hybrid Physical. Designs Important? Columnstore and B+ tree Are Hybrid Physical Designs Important? 1 B+ tree 2 C O L B+ tree 3 B+ tree & Columnstore on same table = Hybrid design 4? C O L C O L B+ tree B+ tree ? C O L C O L B+ tree B+ tree

More information

Anorexic Plan Diagrams

Anorexic Plan Diagrams Anorexic Plan Diagrams E0 261 Jayant Haritsa Computer Science and Automation Indian Institute of Science JAN 2014 Plan Diagram Reduction 1 Query Plan Selection Core technique Query (Q) Query Optimizer

More information

Basic operators: selection, projection, cross product, union, difference,

Basic operators: selection, projection, cross product, union, difference, CS145 Lecture Notes #6 Relational Algebra Steps in Building and Using a Database 1. Design schema 2. Create schema in DBMS 3. Load initial data 4. Repeat: execute queries and updates on the database Database

More information

When MPPDB Meets GPU:

When MPPDB Meets GPU: When MPPDB Meets GPU: An Extendible Framework for Acceleration Laura Chen, Le Cai, Yongyan Wang Background: Heterogeneous Computing Hardware Trend stops growing with Moore s Law Fast development of GPU

More information

On-Disk Bitmap Index Performance in Bizgres 0.9

On-Disk Bitmap Index Performance in Bizgres 0.9 On-Disk Bitmap Index Performance in Bizgres 0.9 A Greenplum Whitepaper April 2, 2006 Author: Ayush Parashar Performance Engineering Lab Table of Contents 1.0 Summary...1 2.0 Introduction...1 3.0 Performance

More information

Efficiency Analysis of the access method with the cascading Bloom filter to the data warehouse on the parallel computing platform

Efficiency Analysis of the access method with the cascading Bloom filter to the data warehouse on the parallel computing platform Journal of Physics: Conference Series PAPER OPEN ACCESS Efficiency Analysis of the access method with the cascading Bloom filter to the data warehouse on the parallel computing platform To cite this article:

More information

Recommending Materialized Views and Indexes with the IBM DB2 Design Advisor

Recommending Materialized Views and Indexes with the IBM DB2 Design Advisor Recommending Materialized Views and Indexes with the IBM DB2 Design Advisor Daniel C. Zilio et al Proceedings of the International Conference on Automatic Computing (ICAC 04) Rolando Blanco CS848 - Spring

More information

vsql: Verifying Arbitrary SQL Queries over Dynamic Outsourced Databases

vsql: Verifying Arbitrary SQL Queries over Dynamic Outsourced Databases vsql: Verifying Arbitrary SQL Queries over Dynamic Outsourced Databases Yupeng Zhang, Daniel Genkin, Jonathan Katz, Dimitrios Papadopoulos and Charalampos Papamanthou Verifiable Databases client SQL database

More information

Optimizer Standof. MySQL 5.6 vs MariaDB 5.5. Peter Zaitsev, Ovais Tariq Percona Inc April 18, 2012

Optimizer Standof. MySQL 5.6 vs MariaDB 5.5. Peter Zaitsev, Ovais Tariq Percona Inc April 18, 2012 Optimizer Standof MySQL 5.6 vs MariaDB 5.5 Peter Zaitsev, Ovais Tariq Percona Inc April 18, 2012 Thank you Ovais Tariq Ovais Did a lot of heavy lifing for this presentation He could not come to talk together

More information

MySQL 8.0: Common Table Expressions

MySQL 8.0: Common Table Expressions MySQL 8.0: Common Table Expressions Guilhem Bichot Lead Software Engineer MySQL Optimizer Team, Oracle Who With MySQL AB / Sun / Oracle since 2002 Coder on the Server from 4.0 to 8.0 Replication, on-line

More information

Towards Comprehensive Testing Tools

Towards Comprehensive Testing Tools Towards Comprehensive Testing Tools Redefining testing mechanisms! Kuntal Ghosh (Software Engineer) PGCon 2017 26.05.2017 1 Contents Motivation Picasso Visualizer Picasso Art Gallery for PostgreSQL 10

More information

Whitepaper. Big Data implementation: Role of Memory and SSD in Microsoft SQL Server Environment

Whitepaper. Big Data implementation: Role of Memory and SSD in Microsoft SQL Server Environment Whitepaper Big Data implementation: Role of Memory and SSD in Microsoft SQL Server Environment Scenario Analysis of Decision Support System with Microsoft Windows Server 2012 OS & SQL Server 2012 and Samsung

More information

On-Stack Replacement for Program Generators and Source-to-Source Compilers

On-Stack Replacement for Program Generators and Source-to-Source Compilers Abstract On-Stack Replacement for Program Generators and Source-to-Source Compilers On-stack-replacement (OSR) describes the ability to replace currently executing code with a different version, either

More information

Technical Report - Distributed Database Victor FERNANDES - Université de Strasbourg /2000 TECHNICAL REPORT

Technical Report - Distributed Database Victor FERNANDES - Université de Strasbourg /2000 TECHNICAL REPORT TECHNICAL REPORT Distributed Databases And Implementation of the TPC-H Benchmark Victor FERNANDES DESS Informatique Promotion : 1999 / 2000 Page 1 / 29 TABLE OF CONTENTS ABSTRACT... 3 INTRODUCTION... 3

More information

Infrastructure at your Service. In-Memory-Pläne für den 12.2-Optimizer: Teuer oder billig?

Infrastructure at your Service. In-Memory-Pläne für den 12.2-Optimizer: Teuer oder billig? Infrastructure at your Service. In-Memory-Pläne für den 12.2-Optimizer: Teuer oder billig? About me Infrastructure at your Service. Clemens Bleile Senior Consultant Oracle Certified Professional DB 11g,

More information

High Volume In-Memory Data Unification

High Volume In-Memory Data Unification 25 March 2017 High Volume In-Memory Data Unification for UniConnect Platform powered by Intel Xeon Processor E7 Family Contents Executive Summary... 1 Background... 1 Test Environment...2 Dataset Sizes...

More information

Perm Integrating Data Provenance Support in Database Systems

Perm Integrating Data Provenance Support in Database Systems Perm Integrating Data Provenance Support in Database Systems Boris Glavic Database Technology Group Department of Informatics University of Zurich glavic@ifi.uzh.ch Gustavo Alonso Systems Group Department

More information

Successful Upgrade Secrets: Preventing Performance Problems with Database Replay

Successful Upgrade Secrets: Preventing Performance Problems with Database Replay Successful Upgrade Secrets: Preventing Performance Problems with Database Replay Prabhaker Gongloor (GP), Leonidas Galanis, Karl Dias Database Manageability Oracle Corporation Oracle s Complete Enterprise

More information

Statisticum: Data Statistics Management in SAP HANA

Statisticum: Data Statistics Management in SAP HANA Statisticum: Data Statistics Management in SAP HANA Anisoara Nica, Reza Sherkat, Mihnea Andrei, Xun Cheng Martin Heidel, Christian Bensberg, Heiko Gerwens SAP SE firstname.lastname@sap.com ABSTRACT We

More information

Dr. Tom Hicks. Computer Science Department Trinity University

Dr. Tom Hicks. Computer Science Department Trinity University Dr. Tom Hicks Computer Science Department Trinity University 1 1 About Design With Reuse 2 Software Reuse Why Do We Care About Reuse? Historically: In Most Engineering Disciplines, Systems are Designed

More information

Simba: Towards Building Interactive Big Data Analytics Systems. Feifei Li

Simba: Towards Building Interactive Big Data Analytics Systems. Feifei Li Simba: Towards Building Interactive Big Data Analytics Systems Feifei Li Complex Operators over Rich Data Types Integrated into System Kernel For Example: SELECT k-means from Population WHERE k=5 and feature=age

More information

arxiv: v1 [cs.db] 23 Mar 2017

arxiv: v1 [cs.db] 23 Mar 2017 Flare: Native Compilation for Heterogeneous Workloads in Apache Spark Grégory M. Essertel, Ruby Y. Tahboub, James M. Decker, Kevin J. Brown 2, Kunle Olukotun 2, Tiark Rompf Purdue University, 2 Stanford

More information

Comparison of Database Cloud Services

Comparison of Database Cloud Services Comparison of Database Cloud Services Benchmark Testing Overview ORACLE WHITE PAPER SEPTEMBER 2016 Table of Contents Table of Contents 1 Disclaimer 2 Preface 3 Introduction 4 Cloud OLTP Workload 5 Cloud

More information

Jayant Haritsa. Database Systems Lab Indian Institute of Science Bangalore, India

Jayant Haritsa. Database Systems Lab Indian Institute of Science Bangalore, India Jayant Haritsa Database Systems Lab Indian Institute of Science Bangalore, India Query Execution Plans SQL, the standard database query interface, is a declarative language Specifies only what is wanted,

More information

Index Merging. Abstract. 1. Introduction

Index Merging. Abstract. 1. Introduction Index Merging Surajit Chaudhuri and Vivek Narasayya 1 Microsoft Research, Redmond Email: {surajitc, viveknar}@microsoft.com http://research.microsoft.com/~{surajitc, viveknar} Abstract Indexes play a vital

More information

HYRISE In-Memory Storage Engine

HYRISE In-Memory Storage Engine HYRISE In-Memory Storage Engine Martin Grund 1, Jens Krueger 1, Philippe Cudre-Mauroux 3, Samuel Madden 2 Alexander Zeier 1, Hasso Plattner 1 1 Hasso-Plattner-Institute, Germany 2 MIT CSAIL, USA 3 University

More information

Relational Database Systems 2 1. System Architecture

Relational Database Systems 2 1. System Architecture Relational Database Systems 2 1. System Architecture Silke Eckstein Andreas Kupfer Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.de Organizational Information

More information

Who we are: Database Research - Provenance, Integration, and more hot stuff. Boris Glavic. Department of Computer Science

Who we are: Database Research - Provenance, Integration, and more hot stuff. Boris Glavic. Department of Computer Science Who we are: Database Research - Provenance, Integration, and more hot stuff Boris Glavic Department of Computer Science September 24, 2013 Hi, I am Boris Glavic, Assistant Professor Hi, I am Boris Glavic,

More information

<Insert Picture Here> DBA s New Best Friend: Advanced SQL Tuning Features of Oracle Database 11g

<Insert Picture Here> DBA s New Best Friend: Advanced SQL Tuning Features of Oracle Database 11g DBA s New Best Friend: Advanced SQL Tuning Features of Oracle Database 11g Peter Belknap, Sergey Koltakov, Jack Raitto The following is intended to outline our general product direction.

More information

Self-Tuning Database Systems: The AutoAdmin Experience

Self-Tuning Database Systems: The AutoAdmin Experience Self-Tuning Database Systems: The AutoAdmin Experience Surajit Chaudhuri Data Management and Exploration Group Microsoft Research http://research.microsoft.com/users/surajitc surajitc@microsoft.com 5/10/2002

More information

Spark-GPU: An Accelerated In-Memory Data Processing Engine on Clusters

Spark-GPU: An Accelerated In-Memory Data Processing Engine on Clusters 1 Spark-GPU: An Accelerated In-Memory Data Processing Engine on Clusters Yuan Yuan, Meisam Fathi Salmi, Yin Huai, Kaibo Wang, Rubao Lee and Xiaodong Zhang The Ohio State University Paypal Inc. Databricks

More information

Algebricks: A Data Model-Agnostic Compiler Backend for Big Data Languages

Algebricks: A Data Model-Agnostic Compiler Backend for Big Data Languages Algebricks: A Data Model-Agnostic Compiler Backend for Big Data Languages Vinayak Borkar 2* Yingyi Bu 1 E. Preston Carman, Jr. 3 Nicola Onose 2* Till Westmann 4 Pouria Pirzadeh 1 Michael J. Carey 1 Vassilis

More information

Lecture #14 ADVANCED DATABASE SYSTEMS. // // Spring 2018

Lecture #14 ADVANCED DATABASE SYSTEMS. // // Spring 2018 Lecture #14 ADVANCED DATABASE SYSTEMS Networking @Andy_Pavlo // 15-721 // Spring 2018 2 COURSE ANNOUNCEMENTS Mid-Term: Wednesday March 7 th @ 3:00pm Project #2: Monday March 12 th @ 11:59pm Project #3

More information

Configuration changes such as conversion from a single instance to RAC, ASM, etc.

Configuration changes such as conversion from a single instance to RAC, ASM, etc. Today, enterprises have to make sizeable investments in hardware and software to roll out infrastructure changes. For example, a data center may have an initiative to move databases to a low cost computing

More information

Comparison of Database Cloud Services

Comparison of Database Cloud Services Comparison of Database Cloud Services Testing Overview ORACLE WHITE PAPER SEPTEMBER 2016 Table of Contents Table of Contents 1 Disclaimer 2 Preface 3 Introduction 4 Cloud OLTP Workload 5 Cloud Analytic

More information

NewSQL Databases MemSQL and VoltDB Experimental Evaluation

NewSQL Databases MemSQL and VoltDB Experimental Evaluation NewSQL Databases MemSQL and VoltDB Experimental Evaluation João Oliveira 1 and Jorge Bernardino 1,2 1 ISEC, Polytechnic of Coimbra, Rua Pedro Nunes, Coimbra, Portugal 2 CISUC Centre for Informatics and

More information

MCSE Productivity. A Success Guide to Prepare- Core Solutions of Microsoft SharePoint Server edusum.com

MCSE Productivity. A Success Guide to Prepare- Core Solutions of Microsoft SharePoint Server edusum.com 70-331 MCSE Productivity A Success Guide to Prepare- Core Solutions of Microsoft SharePoint Server 2013 edusum.com Table of Contents Introduction to 70-331 Exam on Core Solutions of Microsoft SharePoint

More information

Schema Tuning. Tuning Schemas : Overview

Schema Tuning. Tuning Schemas : Overview Administração e Optimização de Bases de Dados 2012/2013 Schema Tuning Bruno Martins DEI@Técnico e DMIR@INESC-ID Tuning Schemas : Overview Trade-offs among normalization / denormalization Overview When

More information

Implement a Data Warehouse with Microsoft SQL Server

Implement a Data Warehouse with Microsoft SQL Server Implement a Data Warehouse with Microsoft SQL Server 20463D; 5 days, Instructor-led Course Description This course describes how to implement a data warehouse platform to support a BI solution. Students

More information

Oracle 11g Partitioning new features and ILM

Oracle 11g Partitioning new features and ILM Oracle 11g Partitioning new features and ILM H. David Gnau Sales Consultant NJ Mark Van de Wiel Principal Product Manager The following is intended to outline our general product

More information

Accelerate Applications Using EqualLogic Arrays with directcache

Accelerate Applications Using EqualLogic Arrays with directcache Accelerate Applications Using EqualLogic Arrays with directcache Abstract This paper demonstrates how combining Fusion iomemory products with directcache software in host servers significantly improves

More information

20463C-Implementing a Data Warehouse with Microsoft SQL Server. Course Content. Course ID#: W 35 Hrs. Course Description: Audience Profile

20463C-Implementing a Data Warehouse with Microsoft SQL Server. Course Content. Course ID#: W 35 Hrs. Course Description: Audience Profile Course Content Course Description: This course describes how to implement a data warehouse platform to support a BI solution. Students will learn how to create a data warehouse 2014, implement ETL with

More information

Avoiding Sorting and Grouping In Processing Queries

Avoiding Sorting and Grouping In Processing Queries Avoiding Sorting and Grouping In Processing Queries Outline Motivation Simple Example Order Properties Grouping followed by ordering Order Property Optimization Performance Results Conclusion Motivation

More information

JITing PostgreSQL using LLVM

JITing PostgreSQL using LLVM JITing PostgreSQL using LLVM Andres Freund PostgreSQL Developer & Committer Email: andres@anarazel.de Email: andres.freund@enterprisedb.com Twitter: @AndresFreundTec anarazel.de/talks/fosdem-2018-02-03/jit.pdf

More information

Implementing a Data Warehouse with Microsoft SQL Server

Implementing a Data Warehouse with Microsoft SQL Server Course 20463C: Implementing a Data Warehouse with Microsoft SQL Server Page 1 of 6 Implementing a Data Warehouse with Microsoft SQL Server Course 20463C: 4 days; Instructor-Led Introduction This course

More information

Hands-on Lab: Working with SQL using Hive

Hands-on Lab: Working with SQL using Hive Hands-n Lab: Wrking with SQL using Hive Wrking with SQL using Hive Hands-n Lab Overview Apache Hadp and its map-reduce framewrk have becme very ppular fr its rbust, scalable distributed prcessing. While

More information

In-Memory Data Management

In-Memory Data Management In-Memory Data Management Martin Faust Research Assistant Research Group of Prof. Hasso Plattner Hasso Plattner Institute for Software Engineering University of Potsdam Agenda 2 1. Changed Hardware 2.

More information

Take Me to SSD: A Hybrid Block-Selection Method on HDFS based on Storage Type

Take Me to SSD: A Hybrid Block-Selection Method on HDFS based on Storage Type Take Me to SSD: A Hybrid Block-Selection Method on HDFS based on Storage Type Minkyung Kim Yonsei University 50 Yonsei-ro, Seodaemun-gu Seoul, Korea +82 2 2123 7757 goodgail@cs.yonsei.ac.kr Mincheol Shin

More information

Essential SQLAlchemy. An Overview of SQLAlchemy. Rick Copeland Author, Essential SQLAlchemy Predictix, LLC

Essential SQLAlchemy. An Overview of SQLAlchemy. Rick Copeland Author, Essential SQLAlchemy Predictix, LLC Essential SQLAlchemy An Overview of SQLAlchemy Rick Copeland Author, Essential SQLAlchemy Predictix, LLC SQLAlchemy Philosophy SQL databases behave less like object collections the more size and performance

More information

Shark: SQL and Rich Analytics at Scale. Michael Xueyuan Han Ronny Hajoon Ko

Shark: SQL and Rich Analytics at Scale. Michael Xueyuan Han Ronny Hajoon Ko Shark: SQL and Rich Analytics at Scale Michael Xueyuan Han Ronny Hajoon Ko What Are The Problems? Data volumes are expanding dramatically Why Is It Hard? Needs to scale out Managing hundreds of machines

More information

Overtaking CPU DBMSes with a GPU in Whole-Query Analytic Processing

Overtaking CPU DBMSes with a GPU in Whole-Query Analytic Processing Overtaking CPU DBMSes with a GPU in Whole-Query Analytic Processing Adnan Agbaria David Minor Natan Peterfreund Eyal Rozenberg Ofer Rosenberg - now at Intel - now at GE Research - now a post-doc at CWI

More information

Flare: Optimizing Apache Spark for Scale-Up Architectures and Medium-Size Data

Flare: Optimizing Apache Spark for Scale-Up Architectures and Medium-Size Data Flare: Optimizing Apache Spark for Scale-Up Architectures and Medium-Size Data Paper # 387 Abstract In recent years, Apache Spark has become the de facto standard for big data processing. Spark has enabled

More information

Microsoft Implementing a Data Warehouse with Microsoft SQL Server 2014

Microsoft Implementing a Data Warehouse with Microsoft SQL Server 2014 1800 ULEARN (853 276) www.ddls.com.au Microsoft 20463 - Implementing a Data Warehouse with Microsoft SQL Server 2014 Length 5 days Price $4290.00 (inc GST) Version D Overview Please note: Microsoft have

More information

TECHNOLOGY: Testing Performing Through Changes

TECHNOLOGY: Testing Performing Through Changes TECHNOLOGY: Testing Performing Through Changes By Arup Nanda Measure the impact of changes on SQL workload with SQL performance analyzer. Here is a not-so-uncommon scenario: a query is running slowly.

More information

Best Practices for Decision Support Systems with Microsoft SQL Server 2012 using Dell EqualLogic PS Series Storage Arrays

Best Practices for Decision Support Systems with Microsoft SQL Server 2012 using Dell EqualLogic PS Series Storage Arrays Best Practices for Decision Support Systems with Microsoft SQL Server 2012 using Dell EqualLogic PS Series A Dell EqualLogic Reference Architecture Dell Storage Engineering July 2013 Revisions Date July

More information

Pathfinder/MonetDB: A High-Performance Relational Runtime for XQuery

Pathfinder/MonetDB: A High-Performance Relational Runtime for XQuery Introduction Problems & Solutions Join Recognition Experimental Results Introduction GK Spring Workshop Waldau: Pathfinder/MonetDB: A High-Performance Relational Runtime for XQuery Database & Information

More information

Review. Relational Query Optimization. Query Optimization Overview (cont) Query Optimization Overview. Cost-based Query Sub-System

Review. Relational Query Optimization. Query Optimization Overview (cont) Query Optimization Overview. Cost-based Query Sub-System Review Relational Query Optimization R & G Chapter 12/15 Implementation of single Relational Operations Choices depend on indexes, memory, stats, Joins Blocked nested loops: simple, exploits extra memory

More information

CSC317/MCS9317. Database Performance Tuning. Class test

CSC317/MCS9317. Database Performance Tuning. Class test CSC317/MCS9317 Database Performance Tuning Class test 7 October 2015 Please read all instructions (including these) carefully. The test time is approximately 120 minutes. The test is close book and close

More information

Analyzing the Behavior of a Distributed Database. Query Optimizer. Deepali Nemade

Analyzing the Behavior of a Distributed Database. Query Optimizer. Deepali Nemade Analyzing the Behavior of a Distributed Database Query Optimizer A Project Report Submitted in partial fulfilment of the requirements for the Degree of Master of Engineering in Computer Science and Engineering

More information

IBPS SO Examination 2013 IT Officer Professional Knowledge Question Paper

IBPS SO Examination 2013 IT Officer Professional Knowledge Question Paper IBPS SO Examination 2013 IT Officer Professional Knowledge Question Paper 1. The tracks on a disk which can be accused without repositioning the R/W heads is (A) Surface (B) Cylinder (C) Cluster 2. Which

More information

Beyond EXPLAIN. Query Optimization From Theory To Code. Yuto Hayamizu Ryoji Kawamichi. 2016/5/20 PGCon Ottawa

Beyond EXPLAIN. Query Optimization From Theory To Code. Yuto Hayamizu Ryoji Kawamichi. 2016/5/20 PGCon Ottawa Beyond EXPLAIN Query Optimization From Theory To Code Yuto Hayamizu Ryoji Kawamichi 2016/5/20 PGCon 2016 @ Ottawa Historically Before Relational Querying was physical Need to understand physical organization

More information

Performance Tuning BI on SAP NetWeaver Using DB2 for i5/os and i5 Navigator

Performance Tuning BI on SAP NetWeaver Using DB2 for i5/os and i5 Navigator Performance Tuning BI on SAP NetWeaver Using DB2 for i5/os and i5 Navigator Susan Bestgen System i ERP SAP team System i TM with DB2 for i5/os TM leads the industry in SAP BI-D and SAP BW benchmark certifications

More information

Afterburner: The Case for In-Browser Analytics

Afterburner: The Case for In-Browser Analytics Afterburner: The Case for In-Browser Analytics Kareem El Gebaly and Jimmy Lin David R. Cheriton School of Computer Science University of Waterloo, Ontario, Canada {kareem.elgebaly, jimmylin@uwaterloo.ca

More information

Session 1079: Using Real Application Testing to Successfully Migrate to Exadata - Best Practices and Customer Case Studies

Session 1079: Using Real Application Testing to Successfully Migrate to Exadata - Best Practices and Customer Case Studies Session 1079: Using Real Application Testing to Successfully Migrate to Exadata - Best Practices and Customer Case Studies Prabhaker Gongloor (GP) Product Management Director, Database Manageability, Oracle

More information

Histogram Support in MySQL 8.0

Histogram Support in MySQL 8.0 Histogram Support in MySQL 8.0 Øystein Grøvlen Senior Principal Software Engineer MySQL Optimizer Team, Oracle February 2018 Program Agenda 1 2 3 4 5 Motivating example Quick start guide How are histograms

More information

Lotus Technical Night School XPages and RDBMS

Lotus Technical Night School XPages and RDBMS Lotus Technical Night School XPages and RDBMS Note: Information regarding potential future products is intended to outline our general product direction and it should not be relied on in making a purchasing

More information

Variance Aware Optimization of Parameterized Queries

Variance Aware Optimization of Parameterized Queries Variance Aware Optimization of Parameterized Queries Surajit Chaudhuri Hongrae Lee Vivek Narasayya Microsoft Research University of British Columbia Microsoft Research ABSTRACT Parameterized queries are

More information

a linear algebra approach to olap

a linear algebra approach to olap a linear algebra approach to olap Rogério Pontes December 14, 2015 Universidade do Minho data warehouse ETL OLTP OLAP ETL Warehouse OLTP Data Mining ETL OLTP Data Marts 2 olap Online analytical processing

More information

Mysql Workbench Import Sql No Database. Selected >>>CLICK HERE<<<

Mysql Workbench Import Sql No Database. Selected >>>CLICK HERE<<< Mysql Workbench Import Sql No Database Selected Mar 14, 2015. I tried several Versions of Workbench, with 6.2.5 it was possible again to Export my databases. ERROR 1046 (3D000) at line 22: No database

More information

Canvas Data Utilities Documentation

Canvas Data Utilities Documentation Canvas Data Utilities Documentation Release 0.0.1a Kajigga Dev Mar 07, 2017 Contents 1 CanvasData Utilities 3 1.1 Module Usage.............................................. 3 1.2 Config File................................................

More information

Implementing a SQL Data Warehouse

Implementing a SQL Data Warehouse Implementing a SQL Data Warehouse 20767B; 5 days, Instructor-led Course Description This 4-day instructor led course describes how to implement a data warehouse platform to support a BI solution. Students

More information

Improving Query Plans. CS157B Chris Pollett Mar. 21, 2005.

Improving Query Plans. CS157B Chris Pollett Mar. 21, 2005. Improving Query Plans CS157B Chris Pollett Mar. 21, 2005. Outline Parse Trees and Grammars Algebraic Laws for Improving Query Plans From Parse Trees To Logical Query Plans Syntax Analysis and Parse Trees

More information

JIT-Compiling SQL Queries in PostgreSQL Using LLVM

JIT-Compiling SQL Queries in PostgreSQL Using LLVM JIT-Compiling SQL Queries in PostgreSQL Using LLVM Dmitry Melnik *, Ruben Buchatskiy, Roman Zhuykov, Eugene Sharygin Institute for System Programming of the Russian Academy of Sciences (ISP RAS) * dm@ispras.ru

More information

A Comparison of Memory Usage and CPU Utilization in Column-Based Database Architecture vs. Row-Based Database Architecture

A Comparison of Memory Usage and CPU Utilization in Column-Based Database Architecture vs. Row-Based Database Architecture A Comparison of Memory Usage and CPU Utilization in Column-Based Database Architecture vs. Row-Based Database Architecture By Gaurav Sheoran 9-Dec-08 Abstract Most of the current enterprise data-warehouses

More information

Processing Analytical Queries over Encrypted Data

Processing Analytical Queries over Encrypted Data Processing Analytical Queries over Encrypted Data The MIT Faculty has made this article openly available. Please share how this access benefits you. Your story matters. Citation As Published Publisher

More information

Monthly SEO Report. Example Client 16 November 2012 Scott Lawson. Date. Prepared by

Monthly SEO Report. Example Client 16 November 2012 Scott Lawson. Date. Prepared by Date Monthly SEO Report Prepared by Example Client 16 November 212 Scott Lawson Contents Thanks for using TrackPal s automated SEO and Analytics reporting template. Below is a brief explanation of the

More information

Benchmark TPC-H 100.

Benchmark TPC-H 100. Benchmark TPC-H 100 vs Benchmark TPC-H Transaction Processing Performance Council (TPC) is a non-profit organization founded in 1988 to define transaction processing and database benchmarks and to disseminate

More information

Implementing a Data Warehouse with Microsoft SQL Server 2014

Implementing a Data Warehouse with Microsoft SQL Server 2014 Course 20463D: Implementing a Data Warehouse with Microsoft SQL Server 2014 Page 1 of 5 Implementing a Data Warehouse with Microsoft SQL Server 2014 Course 20463D: 4 days; Instructor-Led Introduction This

More information

Sql Server 2008 Query Table Schema Management Studio Create

Sql Server 2008 Query Table Schema Management Studio Create Sql Server 2008 Query Table Schema Management Studio Create using SQL Server Management Studio or Transact-SQL by creating a new table and in Microsoft SQL Server 2016 Community Technology Preview 2 (CTP2).

More information

Federation with Db2 11.1

Federation with Db2 11.1 Federation with Db2 11.1 Gerhard Paulus, ids-system GmbH 1 Copyright 2018 Gerhard Paulus, ids-system GmbH Agenda What is federation? Types of federation in Db2 Federation with Db2 before and with Db2 11.1

More information

Automated Benchmark Management. Lisa Nguyen, Ben Hermann

Automated Benchmark Management. Lisa Nguyen, Ben Hermann Automated Benchmark Management Lisa Nguyen, Ben Hermann Metaprogramming Metaprogramming A programming technique in which programs take other programs as their data. Source code generation Program analysis

More information

EsgynDB Enterprise 2.0 Platform Reference Architecture

EsgynDB Enterprise 2.0 Platform Reference Architecture EsgynDB Enterprise 2.0 Platform Reference Architecture This document outlines a Platform Reference Architecture for EsgynDB Enterprise, built on Apache Trafodion (Incubating) implementation with licensed

More information

Robust Optimization of Database Queries

Robust Optimization of Database Queries Robust Optimization of Database Queries Jayant Haritsa Database Systems Lab Indian Institute of Science July 2011 Robust Query Optimization (IASc Mid-year Meeting) 1 Database Management Systems (DBMS)

More information

Principles of Data Management. Lecture #9 (Query Processing Overview)

Principles of Data Management. Lecture #9 (Query Processing Overview) Principles of Data Management Lecture #9 (Query Processing Overview) Instructor: Mike Carey mjcarey@ics.uci.edu Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1 Today s Notable News v Midterm

More information

Hunting PostgreSQL Bugs with SQLsmith

Hunting PostgreSQL Bugs with SQLsmith Hunting PostgreSQL Bugs with SQLsmith andreas.seltenreich@credativ.de November 2, 2016 andreas.seltenreich@credativ.de PGConf.EU 2016 1 / 26 Outline Motivation Testing Methodology Analysis of Bugs Uncovered

More information