class 13 scans vs indexes prof. Stratos Idreos
|
|
- Anabel Cameron
- 5 years ago
- Views:
Transcription
1 class 13 scans vs indexes prof. Stratos Idreos
2 b-tree - dynamic tree - always balanced 35,50 35, 12,20 50, 1,2,3 12,15,17 20, Stratos Idreos 2 /24
3 select from R where A<v and. (secondary) index vs scan: the eternal battle A A Stratos Idreos 3 /24
4 select from R where A<v and. (secondary) index vs scan: the eternal battle A A Just having indexes in the system is or can be useless or even bad for performance Knowing when to use an index is key Stratos Idreos 3 /24
5 select from R where A<v and. Primary index vs secondary vs scan? (secondary) index vs scan: the eternal battle A A Just having indexes in the system is or can be useless or even bad for performance Knowing when to use an index is key Stratos Idreos 3 /24
6 applications sql algorithms/operators database kernel design/implement numerous possible algorithms + data representations choose the best data source, algorithms and path for each query data data data Stratos Idreos 4 /24
7 scan secondary index scan Stratos Idreos 5 /24
8 scan secondary index scan Stratos Idreos 5 /24
9 A B C a1 a2 a3 a4 a5 b1 b2 b3 b4 b5 c1 c2 c3 c4 c5 A a5 a3 a2 a1 a4 secondary index on A values out of order with base data a query that select on A and then needs B intermediate out of order Stratos Idreos 6 /24
10 A B C A a1 a2 a3 a4 a5 b1 b2 b3 b4 b5 c1 c2 c3 c4 c a5 a3 a2 a1 a4 secondary index on A values out of order with base data a query that select on A and then needs B intermediate out of order Stratos Idreos 6 /24
11 A B C A a1 a2 a3 a4 a5 b1 b2 b3 b4 b5 c1 c2 c3 c4 c a5 a3 a2 a1 a4 secondary index on A values out of order with base data a query that select on A and then needs B A a5 a3 a2 a1 a4 select Stratos Idreos B b1 b2 b3 b4 b5 intermediate out of order /24
12 covering index: contains all columns needed for a set of queries A A B no need to go to base data but Stratos Idreos 7 /24
13 A A Vs. random access to traverse the tree & need to sort result sequential access pattern but needs to access all data Stratos Idreos 8 /24
14 Stratos Idreos 9 /24
15 the standard solution 1) maintain statistics, 2) optimizer chooses access path depending on estimated selectivity Stratos Idreos 10/24
16 what is wrong with that the standard solution 1) maintain statistics, 2) optimizer chooses access path depending on estimated selectivity Stratos Idreos 10/24
17 Execution time (sec) Index Scan Full Scan Result selectivity (%) Normalized execution time Original Tuned Q1 Q3 Q5 Q7 Q9 Q11 Q13 Q16 Q19 Q22 TPC-H Query Stratos Idreos 11/24
18 Execution time (sec) Index Scan Full Scan Result selectivity (%) Normalized execution time Original Tuned Q1 Q3 Q5 Q7 Q9 Q11 Q13 Q16 Q19 Q22 TPC-H Query ROBUSTNESS Stratos Idreos 11/24
19 can we just recompute the statistics? Execution time (sec) basic stats per column for pair Full Scan 200 Index Scan Optimizer decision Avg. statistics collection Result selectivity (%) Result selectivity (%) Result selectivity (%) Stratos Idreos 12/24
20 2012, somewhere in Germany if I keep 30 data systems researchers trapped in a castle for a week, we might be able to define robust query processing and find a few solutions Stratos Idreos 13/24
21 robust query processing (best definition to date by Goetz) graceful degradation when the environment changes 14 Stratos Idreos 14/24
22 Renata Borovica University of Melbourne Marcin Zukowski Snowflake Campbell Fraser Google Can we avoid bad access path selection (secondary index vs scan) when we have stale (or no) statistics? index response time scan Stratos Idreos selectivity 15/24
23 select min(a) from R where B<10 and C<80 logical plan optimizer mid query reoptimization physical plan execution Stratos Idreos 16/24
24 Index Scan Cost ~ TS cost TS cost Switch Scan Smooth Scan Table Scan SWITCH SCAN while index probing switch to scan if cardinality > estimation good: avoids worst case bad: performance cliff SMOOTH SCAN goal avoid performance cliff close to optimal X EC Cardinality Design smooth scan Stratos Idreos 17/24
25 Stratos Idreos 18/24
26 Stratos Idreos 18/24
27 Read: Access Path Selection in Main-Memory Optimized Data Systems: Should I Scan or Should I Probe? Mike. Kester, Manos Athanassoulis, and Stratos Idreos ACM SIGMOD International Conference on Management of Data, 2017 Browse: Smooth Scan: Statistics-Oblivious Access Paths Renata Borovica, Stratos Idreos, Anastasia Ailamaki, Marcin Zukowski and Campbell Fraser IEEE International Conference on Data Engineering (ICDE), 2015 Extra: Efficient mid-query re-optimization of sub-optimal query execution plans Navin Kabra and David DeWitt ACM SIGMOD International Conference on Management of Data, 1998 Stratos Idreos 19/24
28 class 13 scans vs indexes DATA SYSTEMS prof. Stratos Idreos
Smooth Scan: Statistics-Oblivious Access Paths. Renata Borovica-Gajic Stratos Idreos Anastasia Ailamaki Marcin Zukowski Campbell Fraser
Smooth Scan: Statistics-Oblivious Access Paths Renata Borovica-Gajic Stratos Idreos Anastasia Ailamaki Marcin Zukowski Campbell Fraser Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 Q10 Q11 Q12 Q13 Q14 Q16 Q18 Q19 Q21 Q22
More informationclass 20 updates 2.0 prof. Stratos Idreos
class 20 updates 2.0 prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ UPDATE table_name SET column1=value1,column2=value2,... WHERE some_column=some_value INSERT INTO table_name VALUES
More informationclass 9 fast scans 1.0 prof. Stratos Idreos
class 9 fast scans 1.0 prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ 1 pass to merge into 8 sorted pages (2N pages) 1 pass to merge into 4 sorted pages (2N pages) 1 pass to merge into
More informationSmooth Scan: Robust Access Path Selection without Cardinality Estimation
The VLDB Journal manuscript No. (will be inserted by the editor) Smooth Scan: Robust Access Path Selection without Cardinality Estimation Renata Borovica-Gajic Stratos Idreos Anastasia Ailamaki Marcin
More informationSmooth Scan: Statistics-Oblivious Access Paths
Smooth : Statistics-Oblivious Access Paths Renata Borovica-Gajic, Stratos Idreos, Anastasia Ailamaki, Marcin Zukowski and Campbell Fraser École Polytechnique Fédérale de Lausanne Harvard University Snowflake
More informationData Systems that are Easy to Design, Tune and Use. Stratos Idreos
Data Systems that are Easy to Design, Tune and Use data systems that are easy to: (years) (months) design & build set-up & tune (hours/days) use e.g., adapt to new applications, new hardware, spin off
More informationclass 17 updates prof. Stratos Idreos
class 17 updates prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ UPDATE table_name SET column1=value1,column2=value2,... WHERE some_column=some_value INSERT INTO table_name VALUES (value1,value2,value3,...)
More informationSmooth Scan: robust access path selection without cardinality estimation
The VLDB Journal (28) 27:52 545 https://doi.org/.7/s778-8-57-8 REGULAR PAPER Smooth Scan: robust access path selection without cardinality estimation Renata Borovica-Gajic Stratos Idreos 2 Anastasia Ailamaki
More informationbasic db architectures & layouts
class 4 basic db architectures & layouts prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ videos for sections 3 & 4 are online check back every week (1-2 sections weekly) there is a schedule
More informationclass 5 column stores 2.0 prof. Stratos Idreos
class 5 column stores 2.0 prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ worth thinking about what just happened? where is my data? email, cloud, social media, can we design systems
More informationclass 10 b-trees 2.0 prof. Stratos Idreos
class 10 b-trees 2.0 prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ CS Colloquium HV Jagadish Prof University of Michigan 10/6 Stratos Idreos /29 2 CS Colloquium Magdalena Balazinska
More informationclass 12 b-trees 2.0 prof. Stratos Idreos
class 12 b-trees 2.0 prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ A B C A B C clustered/primary index on A Stratos Idreos /26 2 A B C A B C clustered/primary index on A pos C pos
More informationSQL & intro to db architectures
class 3 SQL & intro to db architectures prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ welcome brave cs165 students! 35+62 Stratos Idreos 2 /55 guest lecture Laura Haas Data Systems
More informationclass 17 updates prof. Stratos Idreos
class 17 updates prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ early/late tuple reconstruction, tuple-at-a-time, vectorized or bulk processing, intermediates format, pushing selects
More informationRobust Performance in Database Query Processing
Report from Dagstuhl Seminar 17222 Robust Performance in Database Query Processing Edited by Renata Borovica-Gajic 1, Goetz Graefe 2, and Allison Lee 3 1 The University of Melbourne, AU, renata.borovica@unimelb.edu.au
More informationclass 6 more about column-store plans and compression prof. Stratos Idreos
class 6 more about column-store plans and compression prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ query compilation an ancient yet new topic/research challenge query->sql->interpet
More informationToward timely, predictable and cost-effective data analytics. Renata Borovica-Gajić DIAS, EPFL
Toward timely, predictable and cost-effective data analytics Renata Borovica-Gajić DIAS, EPFL Big data proliferation Big data is when the current technology does not enable users to obtain timely, cost-effective,
More informationdata systems 101 prof. Stratos Idreos class 2
class 2 data systems 101 prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS265/ big data V s (it is not about size only) volume velocity variety veracity actually none of that is really new
More informationcomplex plans and hybrid layouts
class 7 complex plans and hybrid layouts prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ essential column-stores features virtual ids late tuple reconstruction (if ever) vectorized execution
More informationdata systems 101 prof. Stratos Idreos class 2
class 2 data systems 101 prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS265/ 2 classes per week - OH/Labs every day 1 presentation/discussion lead - 2 reviews each week research (or systems)
More informationclass 8 b-trees prof. Stratos Idreos
class 8 b-trees prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ I spend a lot of time debugging am I doing something wrong? maybe but probably not 1. learn to use gdb 2. after spending
More informationclass 11 b-trees prof. Stratos Idreos
class 11 b-trees prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ Midway check-in: Two design docs tmr (Canvas) & tests on Sunday Next weekend: Lab marathon for midway check-in & tests
More informationModels & Intro to DB Architectures
class 3 Models & Intro to DB Architectures prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ welcome brave cs165 students! 42+44 Stratos Idreos 2 /49 NO LAPTOP/PHONE POLICY class is based
More informationNoDB: Querying Raw Data. --Mrutyunjay
NoDB: Querying Raw Data --Mrutyunjay Overview Introduction Motivation NoDB Philosophy: PostgreSQL Results Opportunities NoDB in Action: Adaptive Query Processing on Raw Data Ioannis Alagiannis, Renata
More informationcolumn-stores basics
class 3 column-stores basics prof. HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS265/ Goetz Graefe Google Research guest lecture Justin Levandoski Microsoft Research projects option 1: systems project (now
More informationfrom bits to systems
class 2 from bits to systems prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ today logistics, goals, etc big data & systems (cont d) designing a data system algorithm: what can go wrong
More informationcolumn-stores basics
class 3 column-stores basics prof. HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS265/ project description is now online First background info will be given this Friday and detailed lecture on Feb 21 Basic Readings
More informationModels & Intro to DB Architectures
class 3 Models & Intro to DB Architectures prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ welcome brave cs165 students! Stratos Idreos 2 /55 NO LAPTOP/PHONE POLICY class is based on
More informationOverview of Data Exploration Techniques. Stratos Idreos, Olga Papaemmanouil, Surajit Chaudhuri
Overview of Data Exploration Techniques Stratos Idreos, Olga Papaemmanouil, Surajit Chaudhuri data exploration not always sure what we are looking for (until we find it) data has always been big volume
More informationAccess Path Selection in Main-Memory Optimized Data Systems: Should I Scan or Should I Probe?
Access Path Selection in Main-Memory Optimized Data Systems: Should I Scan or Should I Probe? Michael S. Kester Manos Athanassoulis Stratos Idreos Harvard University {kester, manos, stratos}@seas.harvard.edu
More informationQuery processing on raw files. Vítor Uwe Reus
Query processing on raw files Vítor Uwe Reus Outline 1. Introduction 2. Adaptive Indexing 3. Hybrid MapReduce 4. NoDB 5. Summary Outline 1. Introduction 2. Adaptive Indexing 3. Hybrid MapReduce 4. NoDB
More informationAccess Path Selection in Main-Memory Optimized Data Systems
Access Path Selection in Main-Memory Optimized Data Systems Should I Scan or Should I Probe? Manos Athanassoulis Harvard University Talk at CS265, February 16 th, 2018 1 Access Path Selection SELECT x
More informationSandor Heman, Niels Nes, Peter Boncz. Dynamic Bandwidth Sharing. Cooperative Scans: Marcin Zukowski. CWI, Amsterdam VLDB 2007.
Cooperative Scans: Dynamic Bandwidth Sharing in a DBMS Marcin Zukowski Sandor Heman, Niels Nes, Peter Boncz CWI, Amsterdam VLDB 2007 Outline Scans in a DBMS Cooperative Scans Benchmarks DSM version VLDB,
More informationMid-Query Re-Optimization
Mid-Query Re-Optimization Navin Kabra David J. DeWitt Ivan Terziev University of Pennsylvania Query Optimization Ideally an optimal plan should be found. However, this is not the case especially for complex
More informationA Self-Designing Key-Value Store
A Self-Designing Key-Value Store Niv Dayan, Wilson Qin, Manos Athanassoulis, Stratos Idreos http://daslab.seas.harvard.edu/crimsondb/ storage is cheaper inserts & updates price per GB workload time storage
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY Database Systems: Fall 2008 Quiz II
Department of Electrical Engineering and Computer Science MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.830 Database Systems: Fall 2008 Quiz II There are 14 questions and 11 pages in this quiz booklet. To receive
More informationAdaptivity. Luca Schroeder & Thomas Lively
Adaptivity Luca Schroeder & Thomas Lively H2O: A Hands-free Adaptive Store. Ioannis Alagiannis, Stratos Idreos and Anastassia Ailamaki ACM SIGMOD International Conference on Data Management, 2014 Three
More informationColumn Store Internals
Column Store Internals Sebastian Meine SQL Stylist with sqlity.net sebastian@sqlity.net Outline Outline Column Store Storage Aggregates Batch Processing History 1 History First mention of idea to cluster
More informationDatabase Management System
Database Management System Lecture Join * Some materials adapted from R. Ramakrishnan, J. Gehrke and Shawn Bowers Today s Agenda Join Algorithm Database Management System Join Algorithms Database Management
More informationHOW INDEX TO STORE DATA DATA
Stratos Idreos HOW INDEX DATA TO STORE DATA ALGORITHMS data structure decisions define the algorithms that access data INDEX DATA ALGORITHMS unordered [7,4,2,6,1,3,9,10,5,8] INDEX DATA ALGORITHMS unordered
More informationReview. Support for data retrieval at the physical level:
Query Processing Review Support for data retrieval at the physical level: Indices: data structures to help with some query evaluation: SELECTION queries (ssn = 123) RANGE queries (100
More informationLecture #14 Optimizer Implementation (Part I)
15-721 ADVANCED DATABASE SYSTEMS Lecture #14 Optimizer Implementation (Part I) Andy Pavlo / Carnegie Mellon University / Spring 2016 @Andy_Pavlo // Carnegie Mellon University // Spring 2017 2 TODAY S AGENDA
More informationNotes. Some of these slides are based on a slide set provided by Ulf Leser. CS 640 Query Processing Winter / 30. Notes
uery Processing Olaf Hartig David R. Cheriton School of Computer Science University of Waterloo CS 640 Principles of Database Management and Use Winter 2013 Some of these slides are based on a slide set
More informationAdvanced Database Systems
Lecture IV Query Processing Kyumars Sheykh Esmaili Basic Steps in Query Processing 2 Query Optimization Many equivalent execution plans Choosing the best one Based on Heuristics, Cost Will be discussed
More information2017 & 2018 ACM SIGMOD Most Reproducible Paper Award
STRATOS IDREOS Assistant Professor in Computer Science Ph.D.: June 2010 Harvard John A. Paulson School of Engineering and Applied Sciences Citations > 2800 stratos.seas.harvard.edu stratos@seas.harvard.edu
More informationPLP: Page Latch free
PLP: Page Latch free Shared everything OLTP Ippokratis Pandis Pınar Tözün Ryan Johnson Anastasia Ailamaki IBM Almaden Research Center École Polytechnique Fédérale de Lausanne University of Toronto OLTP
More informationCARNEGIE MELLON UNIVERSITY DEPT. OF COMPUTER SCIENCE DATABASE APPLICATIONS
CARNEGIE MELLON UNIVERSITY DEPT. OF COMPUTER SCIENCE 15-415 DATABASE APPLICATIONS C. Faloutsos Indexing and Hashing 15-415 Database Applications http://www.cs.cmu.edu/~christos/courses/dbms.s00/ general
More informationScaling up analytical queries with column-stores
Scaling up analytical queries with column-stores Ioannis Alagiannis, Manos Athanassoulis, Anastasia Ailamaki Ecole Polytechnique Fédérale de Lausanne Lausanne, VD, Switzerland {ioannis.alagiannis, manos.athanassoulis,
More informationclass 10 fast scans 2.0 prof. Stratos Idreos
class 10 fast scans 2.0 prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ always want to minimize data movement - computation & utilize all resources! registers on chip cache on board
More informationCourse Outline. Performance Tuning and Optimizing SQL Databases Course 10987B: 4 days Instructor Led
Performance Tuning and Optimizing SQL Databases Course 10987B: 4 days Instructor Led About this course This four-day instructor-led course provides students who manage and maintain SQL Server databases
More informationOutline. Database Management and Tuning. Outline. Join Strategies Running Example. Index Tuning. Johann Gamper. Unit 6 April 12, 2012
Outline Database Management and Tuning Johann Gamper Free University of Bozen-Bolzano Faculty of Computer Science IDSE Unit 6 April 12, 2012 1 Acknowledgements: The slides are provided by Nikolaus Augsten
More informationPS2 out today. Lab 2 out today. Lab 1 due today - how was it?
6.830 Lecture 7 9/25/2017 PS2 out today. Lab 2 out today. Lab 1 due today - how was it? Project Teams Due Wednesday Those of you who don't have groups -- send us email, or hand in a sheet with just your
More informationDASlab: The Data Systems Laboratory
DASlab: The Data Systems Laboratory at Harvard SEAS Stratos Idreos Harvard University http://daslab.seas.harvard.edu ABSTRACT DASlab is a new laboratory at the Harvard School of Engineering and Applied
More informationParallel DBMS. Prof. Yanlei Diao. University of Massachusetts Amherst. Slides Courtesy of R. Ramakrishnan and J. Gehrke
Parallel DBMS Prof. Yanlei Diao University of Massachusetts Amherst Slides Courtesy of R. Ramakrishnan and J. Gehrke I. Parallel Databases 101 Rise of parallel databases: late 80 s Architecture: shared-nothing
More informationDatabase System Concepts
Chapter 13: Query Processing s Departamento de Engenharia Informática Instituto Superior Técnico 1 st Semester 2008/2009 Slides (fortemente) baseados nos slides oficiais do livro c Silberschatz, Korth
More informationAUTOMATED physical design has been the burning issue
EDIC RESEARCH PROPOSAL 1 A Trustworthy Physical Designer for Databases in the Cloud Renata Borovica DIAS, I&C, EPFL Abstract Configuring a state-of-the-art database management system (DBMS) to provide
More informationRobustness in Automatic Physical Database Design
Robustness in Automatic Physical Database Design Kareem El Gebaly David R. Cheriton School of Computer Science University of Waterloo Technical Report CS-2007-29 Robustness in Automatic Physical Database
More informationOrdered Indices To gain fast random access to records in a file, we can use an index structure. Each index structure is associated with a particular search key. Just like index of a book, library catalog,
More informationAdaptive Cache Mode Selection for Queries over Raw Data
Adaptive Cache Mode Selection for Queries over Raw Data Tahir Azim, Azqa Nadeem and Anastasia Ailamaki École Polytechnique Fédérale de Lausanne TU Delft RAW Labs SA tahir.azim@epfl.ch, A.Nadeem@student.tudelft.nl,
More informationChapter 12: Query Processing. Chapter 12: Query Processing
Chapter 12: Query Processing Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Chapter 12: Query Processing Overview Measures of Query Cost Selection Operation Sorting Join
More informationIndexing. Week 14, Spring Edited by M. Naci Akkøk, , Contains slides from 8-9. April 2002 by Hector Garcia-Molina, Vera Goebel
Indexing Week 14, Spring 2005 Edited by M. Naci Akkøk, 5.3.2004, 3.3.2005 Contains slides from 8-9. April 2002 by Hector Garcia-Molina, Vera Goebel Overview Conventional indexes B-trees Hashing schemes
More informationHolds in Polaris. Presented at: OHPUG 2014 By: Wes Osborn
Holds in Polaris Presented at: OHPUG 2014 By: Wes Osborn Topics Statistics Requests Fulfillment Troubleshooting Statistics How much of an impact are holds on the system? Statistics - Circulation Holds
More informationIntroduction to Database Systems CSE 414. Lecture 26: More Indexes and Operator Costs
Introduction to Database Systems CSE 414 Lecture 26: More Indexes and Operator Costs CSE 414 - Spring 2018 1 Student ID fname lname Hash table example 10 Tom Hanks Index Student_ID on Student.ID Data File
More informationA Comparison of Memory Usage and CPU Utilization in Column-Based Database Architecture vs. Row-Based Database Architecture
A Comparison of Memory Usage and CPU Utilization in Column-Based Database Architecture vs. Row-Based Database Architecture By Gaurav Sheoran 9-Dec-08 Abstract Most of the current enterprise data-warehouses
More informationVisualization of HashStash with Qt
Visualization of HashStash with Qt Zhe Zhao Department of Computer Science Brown University Providence, RI zhe_zhao at brown.edu May 8, 2015 Abstract HashStash is a new abstraction for the multi-query
More informationTriangle SQL Server User Group Adaptive Query Processing with Azure SQL DB and SQL Server 2017
Triangle SQL Server User Group Adaptive Query Processing with Azure SQL DB and SQL Server 2017 Joe Sack, Principal Program Manager, Microsoft Joe.Sack@Microsoft.com Adaptability Adapt based on customer
More informationIntroduction Alternative ways of evaluating a given query using
Query Optimization Introduction Catalog Information for Cost Estimation Estimation of Statistics Transformation of Relational Expressions Dynamic Programming for Choosing Evaluation Plans Introduction
More informationJoin Processing for Flash SSDs: Remembering Past Lessons
Join Processing for Flash SSDs: Remembering Past Lessons Jaeyoung Do, Jignesh M. Patel Department of Computer Sciences University of Wisconsin-Madison $/MB GB Flash Solid State Drives (SSDs) Benefits of
More informationI am: Rana Faisal Munir
Self-tuning BI Systems Home University (UPC): Alberto Abelló and Oscar Romero Host University (TUD): Maik Thiele and Wolfgang Lehner I am: Rana Faisal Munir Research Progress Report (RPR) [1 / 44] Introduction
More informationChapter 13: Query Processing
Chapter 13: Query Processing! Overview! Measures of Query Cost! Selection Operation! Sorting! Join Operation! Other Operations! Evaluation of Expressions 13.1 Basic Steps in Query Processing 1. Parsing
More informationHolistic Indexing in Main-memory Column-stores
Holistic Indexing in Main-memory Column-stores Eleni Petraki CWI Amsterdam petraki@cwi.nl Stratos Idreos Harvard University stratos@seas.harvard.edu Stefan Manegold CWI Amsterdam manegold@cwi.nl ABSTRACT
More informationPERFORMANCE TUNING SQL SERVER ON CRAPPY HARDWARE 3/1/2019 1
PERFORMANCE TUNING SQL SERVER ON CRAPPY HARDWARE 3/1/2019 1 FEEDBACK FORMS PLEASE FILL OUT AND PASS TO YOUR HELPER BEFORE YOU LEAVE THE SESSION MONICA RATHBUN Consultant Denny Cherry & Associates Consulting
More informationNew Bucket Join Algorithm for Faster Join Query Results
The International Arab Journal of Information Technology, Vol. 12, No. 6A, 2015 701 New Bucket Algorithm for Faster Query Results Hemalatha Gunasekaran 1 and ThanushkodiKeppana Gowder 2 1 Department Of
More informationAutomated Physical Designers: What You See is (Not) What You Get
Automated Physical Designers: What You See is (Not) What You Get Renata Borovica Ioannis Alagiannis Anastasia Ailamaki École Polytechnique Fédérale de Lausanne 1015 Lausanne, Switzerland name.surname@epfl.ch
More informationDatabase Management and Tuning
Database Management and Tuning Index Tuning Johann Gamper Free University of Bozen-Bolzano Faculty of Computer Science IDSE Unit 4 Acknowledgements: The slides are provided by Nikolaus Augsten and have
More informationOutline. Database Management and Tuning. What is an Index? Key of an Index. Index Tuning. Johann Gamper. Unit 4
Outline Database Management and Tuning Johann Gamper Free University of Bozen-Bolzano Faculty of Computer Science IDSE Unit 4 1 2 Conclusion Acknowledgements: The slides are provided by Nikolaus Augsten
More informationIntroduction to Database Systems CSE 344
Introduction to Database Systems CSE 344 Lecture 6: Basic Query Evaluation and Indexes 1 Announcements Webquiz 2 is due on Tuesday (01/21) Homework 2 is posted, due week from Monday (01/27) Today: query
More informationHow Achaeans Would Construct Columns in Troy. Alekh Jindal, Felix Martin Schuhknecht, Jens Dittrich, Karen Khachatryan, Alexander Bunte
How Achaeans Would Construct Columns in Troy Alekh Jindal, Felix Martin Schuhknecht, Jens Dittrich, Karen Khachatryan, Alexander Bunte Number of Visas Received 1 0,75 0,5 0,25 0 Alekh Jens Health Level
More informationFrom Routing to Traffic Engineering
1 From Routing to Traffic Engineering Robert Soulé Advanced Networking Fall 2016 2 In the beginning B Goal: pair-wise connectivity (get packets from A to B) Approach: configure static rules in routers
More informationChapter 12: Query Processing
Chapter 12: Query Processing Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Overview Chapter 12: Query Processing Measures of Query Cost Selection Operation Sorting Join
More information! A relational algebra expression may have many equivalent. ! Cost is generally measured as total elapsed time for
Chapter 13: Query Processing Basic Steps in Query Processing! Overview! Measures of Query Cost! Selection Operation! Sorting! Join Operation! Other Operations! Evaluation of Expressions 1. Parsing and
More informationChapter 13: Query Processing Basic Steps in Query Processing
Chapter 13: Query Processing Basic Steps in Query Processing! Overview! Measures of Query Cost! Selection Operation! Sorting! Join Operation! Other Operations! Evaluation of Expressions 1. Parsing and
More informationLeveraging Query Parallelism In PostgreSQL
Leveraging Query Parallelism In PostgreSQL Get the most from intra-query parallelism! Dilip Kumar (Principle Software Engineer) Rafia Sabih (Software Engineer) 2013 EDB All rights reserved. 1 Overview
More informationShark. Hive on Spark. Cliff Engle, Antonio Lupher, Reynold Xin, Matei Zaharia, Michael Franklin, Ion Stoica, Scott Shenker
Shark Hive on Spark Cliff Engle, Antonio Lupher, Reynold Xin, Matei Zaharia, Michael Franklin, Ion Stoica, Scott Shenker Agenda Intro to Spark Apache Hive Shark Shark s Improvements over Hive Demo Alpha
More information[MS10987A]: Performance Tuning and Optimizing SQL Databases
[MS10987A]: Performance Tuning and Optimizing SQL Databases Length : 4 Days Audience(s) : IT Professionals Level : 300 Technology : Microsoft SQL Server Delivery Method : Instructor-led (Classroom) Course
More informationSpanner A distributed database system
Presented by Yue Xia Spanner A distributed database system Background - Developed by Google initially as a key-value storage system - Developers want traditional database features like query language -
More informationData Storage. Query Performance. Index. Data File Types. Introduction to Data Management CSE 414. Introduction to Database Systems CSE 414
Introduction to Data Management CSE 414 Unit 4: RDBMS Internals Logical and Physical Plans Query Execution Query Optimization Introduction to Database Systems CSE 414 Lecture 16: Basics of Data Storage
More informationDatabase System Concepts, 6 th Ed. Silberschatz, Korth and Sudarshan See for conditions on re-use
Chapter 11: Indexing and Hashing Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Chapter 12: Indexing and Hashing Basic Concepts Ordered Indices B + -Tree Index Files Static
More informationDBMS Data Loading: An Analysis on Modern Hardware. Adam Dziedzic, Manos Karpathiotakis*, Ioannis Alagiannis, Raja Appuswamy, Anastasia Ailamaki
DBMS Data Loading: An Analysis on Modern Hardware Adam Dziedzic, Manos Karpathiotakis*, Ioannis Alagiannis, Raja Appuswamy, Anastasia Ailamaki Data loading: A necessary evil Volume => Expensive 4 zettabytes
More informationData Modeling and Databases Ch 10: Query Processing - Algorithms. Gustavo Alonso Systems Group Department of Computer Science ETH Zürich
Data Modeling and Databases Ch 10: Query Processing - Algorithms Gustavo Alonso Systems Group Department of Computer Science ETH Zürich Transactions (Locking, Logging) Metadata Mgmt (Schema, Stats) Application
More informationData Modeling and Databases Ch 9: Query Processing - Algorithms. Gustavo Alonso Systems Group Department of Computer Science ETH Zürich
Data Modeling and Databases Ch 9: Query Processing - Algorithms Gustavo Alonso Systems Group Department of Computer Science ETH Zürich Transactions (Locking, Logging) Metadata Mgmt (Schema, Stats) Application
More informationAdvanced Databases: Parallel Databases A.Poulovassilis
1 Advanced Databases: Parallel Databases A.Poulovassilis 1 Parallel Database Architectures Parallel database systems use parallel processing techniques to achieve faster DBMS performance and handle larger
More informationDatabase Group Research Overview. Immanuel Trummer
Database Group Research Overview Immanuel Trummer Talk Overview User Query Data Analysis Result Processing Talk Overview Fact Checking Query User Data Vocalization Data Analysis Result Processing Query
More informationIntro to DB CHAPTER 12 INDEXING & HASHING
Intro to DB CHAPTER 12 INDEXING & HASHING Chapter 12: Indexing and Hashing Basic Concepts Ordered Indices B+-Tree Index Files B-Tree Index Files Static Hashing Dynamic Hashing Comparison of Ordered Indexing
More informationRe- op&mizing Data Parallel Compu&ng
Re- op&mizing Data Parallel Compu&ng Sameer Agarwal Srikanth Kandula, Nicolas Bruno, Ming- Chuan Wu, Ion Stoica, Jingren Zhou UC Berkeley A Data Parallel Job can be a collec/on of maps, A Data Parallel
More informationCOMP102: Introduction to Databases, 14
COMP102: Introduction to Databases, 14 Dr Muhammad Sulaiman Khan Department of Computer Science University of Liverpool U.K. 8 March, 2011 Physical Database Design: Some Aspects Specific topics for today:
More informationAn Initial Study of Overheads of Eddies
An Initial Study of Overheads of Eddies Amol Deshpande University of California Berkeley, CA USA amol@cs.berkeley.edu Abstract An eddy [2] is a highly adaptive query processing operator that continuously
More informationSTRATOS IDREOS Curriculum Vitae Mar 2018
STRATOS IDREOS Curriculum Vitae Mar 2018 Assistant Professor in Computer Science Ph.D.: June 2010 Harvard John A. Paulson School of Engineering and Applied Sciences Publications > 50 http://stratos.seas.harvard.edu/
More informationChapter 9. Cardinality Estimation. How Many Rows Does a Query Yield? Architecture and Implementation of Database Systems Winter 2010/11
Chapter 9 How Many Rows Does a Query Yield? Architecture and Implementation of Database Systems Winter 2010/11 Wilhelm-Schickard-Institut für Informatik Universität Tübingen 9.1 Web Forms Applications
More informationCMSC424: Database Design. Instructor: Amol Deshpande
CMSC424: Database Design Instructor: Amol Deshpande amol@cs.umd.edu Databases Data Models Conceptual representa1on of the data Data Retrieval How to ask ques1ons of the database How to answer those ques1ons
More information