Exercise 9 Solution proposal
|
|
- William Berry
- 6 years ago
- Views:
Transcription
1 Prof. Dr.-Ing. Dr. h. c. T. Härder Computer Science Department Databases and Information Systems University of Kaiserslautern Exercise 9 Solution proposal Documentation of the lecture: (July 13, 2011, 3.30 pm, ) Exercise 9.1 k-d Tree a) Please store the following tuples into a 3-d-Tree (k-d-tree with k=3) and keys formed by Lastname, Firstname, and Age Lastname Firstname Age 1. Müller Horst Maier Klaus Müller Kurt Gerber Anton Frank Klaus Paul Ludwig 24 Müller Horst 44 Maier Klaus 30 Müller Kurt 38 Gerber Anton 17 Frank Klaus 50 Paul Ludwig 24 b) For comparison, please store the tuples in reverse order (6 -> 1) Paul Ludwig 24 Frank Klaus 50 Gerber Anton 17 Müller Kurt 38 Müller Horst 44 Maier Klaus 30 c) Please search in the trees built in a) and b) for the following tuples, respectively: 1. Müller 2. Gerber, Anton 1
2 Exercise 9.2 Grid File Please get familiar with the dynamic reorganization for inserts and deletions in grid files. Please assume an empty grid file and buckets can hold 3 records. For the relation CAR (ID. BRAND, COLOR), BRAND and COLOR shall be indexed using a grid file. Please sketch all necessary structures (directory, search space, and buckets) after every insertion of the tuples listed below. Thereafter, please delete the first 4 entries and re-draw the aforementioned structures. ID BRAND, COLOR a) KL-PP 1 OPEL GRAY b) PS-A17 FORD BLUE c) KL-CX 33 OPEL BLUE d) KIB-AM 13 BMW RED e) SB-F16 AUDI GREEN f) KL-DZ 12 ALFA SILVER g) ZW-AL 43 VW WHITE h) HOM-C 1 FIAT VIOLET i) SLS-AF 47 AUDI AQUAMARINE j) KL-DM 31 ALFA BLACK k) VW-FS 40 SAAB BROWN Insert a, b, c a, b, c Inserting d requires a split for >=, Insert e b, d, e a, c Inserting f requires a split for >= Rot, Insert g, h, i b, e, i Red a, c, g d, f, h 2
3 Inserting j requires a split for >= BMW, Red BMW B4 e, i, j a, c, g d, f, h b, B4 Inserting k requires a split for >= Rot Red BMW B5 e, i, j a, c, k g, d, f, h B4 B5 B4 b, After deleting the first 4 entries, the structure has the following buckets> and B5 are not merged, because new insertions would again require a split. and B5 are merged iff, and can be merged, too.. Red B5 e, i, j k g, B5 f, h A merge is performed, if> 1. buckets are siblings in the tree 2. a bucket is empty or the dimension discriminator disappears, which is hard to check. 3
4 Exercise 9.3 R-Tree Please assume the following arrangement of 2-dimensional objects. Please store them into an R-tree with M=2: F1 - F F1 F4 3 2 F3 F F Structure of entries: x-coor. bottom, left x-voor. top, right y-coor. bottom, left y-coor. top, right 1. discriminator: x-coor bottom, left, 2. discriminator: x-coor. top, right After inserting all object areas. 0.5; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; 5 Please perform the following search operations by utilizing the R-tree: a) Select all objects that contain the point (6, 3.5). The search returns all possible candidate as well as objects F3 and F5. Because the surrounding rectangle must not be completely equal to the requested object, for all objects, the containment criteria must be checked. By doing so, only F3 remains as solution. b) Which objects are completely included by a rectangle defined by (3.5, 0) and (8, 5.5)? 4
5 This query can be solely answered using the R-tree entries and results in F3 and F4 c) Please determine all objects that are overlapping with the rectangle given by (2, 2), (4, 4): Here, two candidates are returned (F1 and F3). For each of them, it must be made sure that they are in the requested frame. For both of them, this is true. Exercise 9.4 R-Tree - Search Procedure Let there be the following arrangement of areas a.) Please insert them in a sequence according to their number into an R-tree with m=3 (maximum number of entries in a node) and m=2 (minimal number of entries in a node) After inserting areas 1, 2, 3 7; 8 After inserting area 4 3; 8 7; 8 5
6 After inserting areas 5, 6, 7, 8, 9 (For inserting F8, F3 is moved) 0; 7 2; 7 0; 4 4; 9 1; 7 4; 5 2; 3 4; 6 1; 3 7; 8 8; 9 After inserting all areas 0; 7 4; 9 0; 4 1; 7 0; 4 4; 9 7; 9 7; 8 8; 9 1; 3 0; 1 1; 7 4; 5 2; 3 4; 6 1; 3 6; 9 b.) Please define the following functions FIND_ALL_COMPLETELY_CONTAINED_AREAS( Rectangle ) This function completely work on the R-tree, because actual objects are out of interest. They are completely specified by their surrounding rectangles. FIND_ALL_PARTIALLY_CONTAINED_AREAS( Rectangle ) The R-tree only returns the areas that probably overlap with the specified rectangle. For every candidate, a separate check is necessary. c.) Please test your functions with the rectangle given by (3,1) and (7,4). Completely: 3, 8 Partially: 12, 4 (and 3, 8 completely) d.) Which improvements allows for a simpler search? The R*-tree provides simple search by restricting the search space using a more sophisticated splitting of (sub-)areas (Here, split at x=3 and x=7). As a drawback, tree maintenance becomes much more expensive compared to the R-tree. 6
7 Exercise 9.5 Translating SQL to a Record-Oriented Interface a) Please write for the following SQL query a program using the operations of the record-oriented interface: SELECT ENO, ENAME, SAL FROM EMP WHERE AGE > 45 AND SAL > AND DNO >= K50 AND DNO <= K60 We make the following assumptions for query processing using an index scan: For every key value of a query interval, the TIDs of the corresponding records are read from the index pages to directly access the records in the data pages. Consequently, each page is read only once for each value. Because of buffer replacements, a page might be replaced before it is re-used for accessing the next value. Doing query processing this way has the advantage that the records are returned in the order specified by the index attribute. This approach has the disadvantage that some pages might probably be read several times. If sorting of records is not required, it would be cheaper to get all TIDs for all values of the interval first. Thereafter, they would be sorted according to their page number (e.g., by using merge sort). Consequently, for each consecutive access, each data page is only read once and the movements on the hard disks are minimized. Please develop an optimal program for accessing the records using an index scan, if: 1. Indexes are available for AGE, SAL, and DNO. Which influence on optimization has the existence of an index with clustering for the three attributes? OPEN SCAN on EMP (index scan on the index with the highest selectivity and exploiting the start condition) FETCH TUPLE (SCB, Next,...) (with both predicates, for which no index scan was chosen, as search condition) CLOSE SCAN (if the stop condition for the index scan is reached. Hence, if DNO > K60, for an index scan on DNO) For clustering, the selection of an appropriate index does not only depend on the selectivity, but the number of page accesses must be considered, too. We implicitly made this assumption before, because we did not assume any clustering. Therefore, for low selectivities, more page accesses must be expected. 2. We have indexes for AGE and SAL and a clustered index for ENO. The index on ENO, where no selection is performed, is only beneficial, if an ORDER BY ENO is specified. Hence, everything remains unchanged with the exception that the index on ENO is not used. 3. For all attributes of the predicate, indexes without clustering can be used and the table EMP is stored in a segment with high locality (space usage b > 0.9) Probably, a simple table scan is cheaper, because no index information must be read, OPEN SCAN on EMP (table scan with start condition BOS) FETCH TUPLE (SCB, Next,...) (with all predicates as search conditions (SSA)) CLOSE SCAN 7
8 (if the stop condition EOS is reached) b) In which way can the following SQL query be answered using the record-oriented interface? SELECT X.ENAME, Y.ENAME FROM EMP X, EMP Y WHERE X.MNO = Y.ENO AND X.SAL > Y.SAL Please discuss the possible solutions, if 1. Indexes for MNO, ENO, and SAL are available. 2. No additional access paths can be exploited. Please describe the possible solutions using the operations of the record-oriented interface. 1.) Index scans on I EMP (ENO) and I EMP (MNO): If Y.ENO=X.MNO, then TID accesses for (1+n) records are necessary for checking X.SAL>Y.SAL. The index on SAL cannot be exploited. 2.) Nested-loops join on a physical table with Y.ENO as condition for the inner loop. Then, on the average, a test can be skipped after finishing half of the iterations! In this case, it would also be interesting to have a look at probable generalized access paths for X.MNO and Y.ENO. The evaluation of the SAL conditions requires accesses to all X-records (except for those that only represent the k highest bosses (k>=1)). For Y records, only manager records are accessed. c) In which way would you evaluate the following SQL query, which requires a partitioning of the intermediate results by the attribute values: SELECT DNO FROM EMP WHERE JOB = PROGRAMMER GROUP BY DNO HAVING COUNT (*) > 10 Please provide a program for the record-oriented interface, if 1. there is an index on JOB. 2. there is an index on DNO. Which of the strategies is more efficient? 1.) - index scan over table EMP using the index JOB - count the occurrence of each DNO - finally return those that satisfy the HAVING condition 2.) Works only, if there is a clustered index on DNO. For every DNO, all EMP records must be read to eval- 8
9 uate the WHERE condition, if the number of values exceeds 10. For this approach, more page accesses are required compared to a sequential access to EMP (table scan on JOB= PROGRAMMER as search condition) and an internal evaluation of the condition including statistics gathering. If clustering on DNO is assumed and if many small departments exist, some page accesses might be saved. 9
10. Record-Oriented DB Interface
10 Record-Oriented DB Interface Theo Härder wwwhaerderde Goals - Design principles for record-oriented and navigation on logical access paths - Development of a scan technique and a Main reference: Theo
More informationIndices. We consider B-Trees only
We consider B-Trees only key attributes: a 1,..., a n data attributes: d 1,..., d m Often: one special data attribute holding the TID of a tuple Some notions: simple/complex key unique/non-unique index
More informationData on External Storage
Advanced Topics in DBMS Ch-1: Overview of Storage and Indexing By Syed khutubddin Ahmed Assistant Professor Dept. of MCA Reva Institute of Technology & mgmt. Data on External Storage Prg1 Prg2 Prg3 DBMS
More information8. Secondary and Hierarchical Access Paths
8. Secondary and Access Paths Theo Härder www.haerder.de Main reference: Theo Härder, Erhard Rahm: Datenbanksysteme Konzepte und Techniken der Implementierung, Springer, 2, Chapter 8. Patrick O Neil, Elizabeth
More informationUniversity of Waterloo Midterm Examination Sample Solution
1. (4 total marks) University of Waterloo Midterm Examination Sample Solution Winter, 2012 Suppose that a relational database contains the following large relation: Track(ReleaseID, TrackNum, Title, Length,
More informationTemporal Relations 369 / 482
Temporal Relations and Table Functions Temporal Relations The query optimizer might introduce temporal relations: a relations just for the query allows for reusing intermediate results related: temporary
More information192 Chapter 14. TotalCost=3 (1, , 000) = 6, 000
192 Chapter 14 5. SORT-MERGE: With 52 buffer pages we have B> M so we can use the mergeon-the-fly refinement which costs 3 (M + N). TotalCost=3 (1, 000 + 1, 000) = 6, 000 HASH JOIN: Now both relations
More informationCSIT5300: Advanced Database Systems
CSIT5300: Advanced Database Systems L11: Physical Database Design Dr. Kenneth LEUNG Department of Computer Science and Engineering The Hong Kong University of Science and Technology Hong Kong SAR, China
More informationImplementing Relational Operators: Selection, Projection, Join. Database Management Systems, R. Ramakrishnan and J. Gehrke 1
Implementing Relational Operators: Selection, Projection, Join Database Management Systems, R. Ramakrishnan and J. Gehrke 1 Readings [RG] Sec. 14.1-14.4 Database Management Systems, R. Ramakrishnan and
More informationTotalCost = 3 (1, , 000) = 6, 000
156 Chapter 12 HASH JOIN: Now both relations are the same size, so we can treat either one as the smaller relation. With 15 buffer pages the first scan of S splits it into 14 buckets, each containing about
More informationDatabase Management Systems (COP 5725) Homework 3
Database Management Systems (COP 5725) Homework 3 Instructor: Dr. Daisy Zhe Wang TAs: Yang Chen, Kun Li, Yang Peng yang, kli, ypeng@cise.uf l.edu November 26, 2013 Name: UFID: Email Address: Pledge(Must
More informationAdvanced Database Systems
Lecture IV Query Processing Kyumars Sheykh Esmaili Basic Steps in Query Processing 2 Query Optimization Many equivalent execution plans Choosing the best one Based on Heuristics, Cost Will be discussed
More informationQuery Processing with Indexes. Announcements (February 24) Review. CPS 216 Advanced Database Systems
Query Processing with Indexes CPS 216 Advanced Database Systems Announcements (February 24) 2 More reading assignment for next week Buffer management (due next Wednesday) Homework #2 due next Thursday
More informationHash-Based Indexing 165
Hash-Based Indexing 165 h 1 h 0 h 1 h 0 Next = 0 000 00 64 32 8 16 000 00 64 32 8 16 A 001 01 9 25 41 73 001 01 9 25 41 73 B 010 10 10 18 34 66 010 10 10 18 34 66 C Next = 3 011 11 11 19 D 011 11 11 19
More informationOutline. Database Management and Tuning. Index Data Structures. Outline. Index Tuning. Johann Gamper. Unit 5
Outline Database Management and Tuning Johann Gamper Free University of Bozen-Bolzano Faculty of Computer Science IDSE Unit 5 1 2 Conclusion Acknowledgements: The slides are provided by Nikolaus Augsten
More informationQUERY OPTIMIZATION FOR DATABASE MANAGEMENT SYSTEM BY APPLYING DYNAMIC PROGRAMMING ALGORITHM
QUERY OPTIMIZATION FOR DATABASE MANAGEMENT SYSTEM BY APPLYING DYNAMIC PROGRAMMING ALGORITHM Wisnu Adityo NIM 13506029 Information Technology Department Institut Teknologi Bandung Jalan Ganesha 10 e-mail:
More informationTemporal Aggregation and Join
TSDB15, SL05 1/49 A. Dignös Temporal and Spatial Database 2014/2015 2nd semester Temporal Aggregation and Join SL05 Temporal aggregation Span, instant, moving window aggregation Aggregation tree, balanced
More informationNew Requirements. Advanced Query Processing. Top-N/Bottom-N queries Interactive queries. Skyline queries, Fast initial response time!
Lecture 13 Advanced Query Processing CS5208 Advanced QP 1 New Requirements Top-N/Bottom-N queries Interactive queries Decision making queries Tolerant of errors approximate answers acceptable Control over
More informationChapter 18 Strategies for Query Processing. We focus this discussion w.r.t RDBMS, however, they are applicable to OODBS.
Chapter 18 Strategies for Query Processing We focus this discussion w.r.t RDBMS, however, they are applicable to OODBS. 1 1. Translating SQL Queries into Relational Algebra and Other Operators - SQL is
More informationDatenbanksysteme II: Caching and File Structures. Ulf Leser
Datenbanksysteme II: Caching and File Structures Ulf Leser Content of this Lecture Caching Overview Accessing data Cache replacement strategies Prefetching File structure Index Files Ulf Leser: Implementation
More informationIndexing. Week 14, Spring Edited by M. Naci Akkøk, , Contains slides from 8-9. April 2002 by Hector Garcia-Molina, Vera Goebel
Indexing Week 14, Spring 2005 Edited by M. Naci Akkøk, 5.3.2004, 3.3.2005 Contains slides from 8-9. April 2002 by Hector Garcia-Molina, Vera Goebel Overview Conventional indexes B-trees Hashing schemes
More informationStorage and Indexing
CompSci 516 Data Intensive Computing Systems Lecture 5 Storage and Indexing Instructor: Sudeepa Roy Duke CS, Spring 2016 CompSci 516: Data Intensive Computing Systems 1 Announcement Homework 1 Due on Feb
More informationChapter 12: Query Processing. Chapter 12: Query Processing
Chapter 12: Query Processing Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Chapter 12: Query Processing Overview Measures of Query Cost Selection Operation Sorting Join
More informationDatenbanksysteme II: Implementing Joins. Ulf Leser
Datenbanksysteme II: Implementing Joins Ulf Leser Content of this Lecture Nested loop and blocked nested loop Sort-merge join Hash-based join strategies Index join Ulf Leser: Implementation of Database
More informationSome Practice Problems on Hardware, File Organization and Indexing
Some Practice Problems on Hardware, File Organization and Indexing Multiple Choice State if the following statements are true or false. 1. On average, repeated random IO s are as efficient as repeated
More informationCS2 Current Technologies Lecture 3: SQL - Joins and Subqueries
T E H U N I V E R S I T Y O H F R G E D I N B U CS2 Current Technologies Lecture 3: SQL - Joins and Subqueries Chris Walton (cdw@dcs.ed.ac.uk) 11 February 2002 Multiple Tables 1 Redundancy requires excess
More informationChapter 11: Indexing and Hashing
Chapter 11: Indexing and Hashing Basic Concepts Ordered Indices B + -Tree Index Files B-Tree Index Files Static Hashing Dynamic Hashing Comparison of Ordered Indexing and Hashing Index Definition in SQL
More information(Storage System) Access Methods Buffer Manager
6.830 Lecture 5 9/20/2017 Project partners due next Wednesday. Lab 1 due next Monday start now!!! Recap Anatomy of a database system Major Components: Admission Control Connection Management ---------------------------------------(Query
More informationPhysical Design. Elena Baralis, Silvia Chiusano Politecnico di Torino. Phases of database design D B M G. Database Management Systems. Pag.
Physical Design D B M G 1 Phases of database design Application requirements Conceptual design Conceptual schema Logical design ER or UML Relational tables Logical schema Physical design Physical schema
More informationSystems Infrastructure for Data Science. Web Science Group Uni Freiburg WS 2014/15
Systems Infrastructure for Data Science Web Science Group Uni Freiburg WS 2014/15 Lecture II: Indexing Part I of this course Indexing 3 Database File Organization and Indexing Remember: Database tables
More informationDatabase System Concepts
Chapter 13: Query Processing s Departamento de Engenharia Informática Instituto Superior Técnico 1 st Semester 2008/2009 Slides (fortemente) baseados nos slides oficiais do livro c Silberschatz, Korth
More informationQuery Processing. Introduction to Databases CompSci 316 Fall 2017
Query Processing Introduction to Databases CompSci 316 Fall 2017 2 Announcements (Tue., Nov. 14) Homework #3 sample solution posted in Sakai Homework #4 assigned today; due on 12/05 Project milestone #2
More informationOverview of Storage and Indexing
Overview of Storage and Indexing Chapter 8 How index-learning turns no student pale Yet holds the eel of science by the tail. -- Alexander Pope (1688-1744) Database Management Systems 3ed, R. Ramakrishnan
More informationOverview of Implementing Relational Operators and Query Evaluation
Overview of Implementing Relational Operators and Query Evaluation Chapter 12 Motivation: Evaluating Queries The same query can be evaluated in different ways. The evaluation strategy (plan) can make orders
More informationQUERY OPTIMIZATION. CS 564- Spring ACKs: Jeff Naughton, Jignesh Patel, AnHai Doan
QUERY OPTIMIZATION CS 564- Spring 2018 ACKs: Jeff Naughton, Jignesh Patel, AnHai Doan WHAT IS THIS LECTURE ABOUT? What is a query optimizer? Generating query plans Cost estimation of query plans 2 ARCHITECTURE
More informationChapter 11: Indexing and Hashing
Chapter 11: Indexing and Hashing Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Chapter 11: Indexing and Hashing Basic Concepts Ordered Indices B + -Tree Index Files B-Tree
More informationChapter 13: Query Processing
Chapter 13: Query Processing! Overview! Measures of Query Cost! Selection Operation! Sorting! Join Operation! Other Operations! Evaluation of Expressions 13.1 Basic Steps in Query Processing 1. Parsing
More informationIntro to DB CHAPTER 12 INDEXING & HASHING
Intro to DB CHAPTER 12 INDEXING & HASHING Chapter 12: Indexing and Hashing Basic Concepts Ordered Indices B+-Tree Index Files B-Tree Index Files Static Hashing Dynamic Hashing Comparison of Ordered Indexing
More informationCost Models. the query database statistics description of computational resources, e.g.
Cost Models An optimizer estimates costs for plans so that it can choose the least expensive plan from a set of alternatives. Inputs to the cost model include: the query database statistics description
More informationChapter 17. Disk Storage, Basic File Structures, and Hashing. Records. Blocking
Chapter 17 Disk Storage, Basic File Structures, and Hashing Records Fixed and variable length records Records contain fields which have values of a particular type (e.g., amount, date, time, age) Fields
More informationRemember. 376a. Database Design. Also. B + tree reminders. Algorithms for B + trees. Remember
376a. Database Design Dept. of Computer Science Vassar College http://www.cs.vassar.edu/~cs376 Class 14 B + trees, multi-key indices, partitioned hashing and grid files B and B + -trees are used one implementation
More informationCSIT5300: Advanced Database Systems
CSIT5300: Advanced Database Systems L08: B + -trees and Dynamic Hashing Dr. Kenneth LEUNG Department of Computer Science and Engineering The Hong Kong University of Science and Technology Hong Kong SAR,
More informationCSIT5300: Advanced Database Systems
CSIT5300: Advanced Database Systems E11: Exercises on Query Optimization Dr. Kenneth LEUNG Department of Computer Science and Engineering The Hong Kong University of Science and Technology Hong Kong SAR,
More informationChapter 12: Query Processing
Chapter 12: Query Processing Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Overview Chapter 12: Query Processing Measures of Query Cost Selection Operation Sorting Join
More informationAnnouncements. Reading Material. Today. Different File Organizations. Selection of Indexes 9/24/17. CompSci 516: Database Systems
CompSci 516 Database Systems Lecture 9 Index Selection and External Sorting Announcements Private project threads created on piazza Please use these threads (and not emails) for all communications on your
More informationAnnouncement. Reading Material. Overview of Query Evaluation. Overview of Query Evaluation. Overview of Query Evaluation 9/26/17
Announcement CompSci 516 Database Systems Lecture 10 Query Evaluation and Join Algorithms Project proposal pdf due on sakai by 5 pm, tomorrow, Thursday 09/27 One per group by any member Instructor: Sudeepa
More informationAccess Methods. Basic Concepts. Index Evaluation Metrics. search key pointer. record. value. Value
Access Methods This is a modified version of Prof. Hector Garcia Molina s slides. All copy rights belong to the original author. Basic Concepts search key pointer Value record? value Search Key - set of
More information! A relational algebra expression may have many equivalent. ! Cost is generally measured as total elapsed time for
Chapter 13: Query Processing Basic Steps in Query Processing! Overview! Measures of Query Cost! Selection Operation! Sorting! Join Operation! Other Operations! Evaluation of Expressions 1. Parsing and
More informationChapter 13: Query Processing Basic Steps in Query Processing
Chapter 13: Query Processing Basic Steps in Query Processing! Overview! Measures of Query Cost! Selection Operation! Sorting! Join Operation! Other Operations! Evaluation of Expressions 1. Parsing and
More informationChapter 12: Indexing and Hashing. Basic Concepts
Chapter 12: Indexing and Hashing! Basic Concepts! Ordered Indices! B+-Tree Index Files! B-Tree Index Files! Static Hashing! Dynamic Hashing! Comparison of Ordered Indexing and Hashing! Index Definition
More informationIndexing and Hashing
C H A P T E R 1 Indexing and Hashing This chapter covers indexing techniques ranging from the most basic one to highly specialized ones. Due to the extensive use of indices in database systems, this chapter
More informationCompSci 516: Database Systems
CompSci 516 Database Systems Lecture 9 Index Selection and External Sorting Instructor: Sudeepa Roy Duke CS, Fall 2017 CompSci 516: Database Systems 1 Announcements Private project threads created on piazza
More informationChapter 12: Indexing and Hashing
Chapter 12: Indexing and Hashing Basic Concepts Ordered Indices B+-Tree Index Files B-Tree Index Files Static Hashing Dynamic Hashing Comparison of Ordered Indexing and Hashing Index Definition in SQL
More informationLecture 15: The Details of Joins
Lecture 15 Lecture 15: The Details of Joins (and bonus!) Lecture 15 > Section 1 What you will learn about in this section 1. How to choose between BNLJ, SMJ 2. HJ versus SMJ 3. Buffer Manager Detail (PS#3!)
More informationWhy Is This Important? Overview of Storage and Indexing. Components of a Disk. Data on External Storage. Accessing a Disk Page. Records on a Disk Page
Why Is This Important? Overview of Storage and Indexing Chapter 8 DB performance depends on time it takes to get the data from storage system and time to process Choosing the right index for faster access
More informationQuery Processing. Debapriyo Majumdar Indian Sta4s4cal Ins4tute Kolkata DBMS PGDBA 2016
Query Processing Debapriyo Majumdar Indian Sta4s4cal Ins4tute Kolkata DBMS PGDBA 2016 Slides re-used with some modification from www.db-book.com Reference: Database System Concepts, 6 th Ed. By Silberschatz,
More informationQUERY OPTIMIZATION [CH 15]
Spring 2017 QUERY OPTIMIZATION [CH 15] 4/12/17 CS 564: Database Management Systems; (c) Jignesh M. Patel, 2013 1 Example SELECT distinct ename FROM Emp E, Dept D WHERE E.did = D.did and D.dname = Toy EMP
More informationFundamentals of Database Systems
Fundamentals of Database Systems Assignment: 4 September 21, 2015 Instructions 1. This question paper contains 10 questions in 5 pages. Q1: Calculate branching factor in case for B- tree index structure,
More informationDatabase Applications (15-415)
Database Applications (15-415) DBMS Internals- Part V Lecture 15, March 15, 2015 Mohammad Hammoud Today Last Session: DBMS Internals- Part IV Tree-based (i.e., B+ Tree) and Hash-based (i.e., Extendible
More informationINTELLIGENT DATABASE GROUP. Foundations of Information Systems. 5 DBMS Architecture. Prof. Dr.-Ing. Wolfgang Lehner
Prof. Dr.-Ing. Wolfgang Lehner INTELLIGENT DATABASE GROUP 5 DBMS Architecture What is in the Lecture?. Database Usage Query Programming Design 2. Database Architecture Indexes Transactions Query Processing
More informationOverview of Query Evaluation. Chapter 12
Overview of Query Evaluation Chapter 12 1 Outline Query Optimization Overview Algorithm for Relational Operations 2 Overview of Query Evaluation DBMS keeps descriptive data in system catalogs. SQL queries
More informationChapter 11: Indexing and Hashing
Chapter 11: Indexing and Hashing Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Chapter 11: Indexing and Hashing Basic Concepts Ordered Indices B + -Tree Index Files B-Tree
More informationCSC 261/461 Database Systems Lecture 19
CSC 261/461 Database Systems Lecture 19 Fall 2017 Announcements CIRC: CIRC is down!!! MongoDB and Spark (mini) projects are at stake. L Project 1 Milestone 4 is out Due date: Last date of class We will
More informationQuery Execution [15]
CSC 661, Principles of Database Systems Query Execution [15] Dr. Kalpakis http://www.csee.umbc.edu/~kalpakis/courses/661 Query processing involves Query processing compilation parsing to construct parse
More informationIntroduction to Query Processing and Query Optimization Techniques. Copyright 2011 Ramez Elmasri and Shamkant Navathe
Introduction to Query Processing and Query Optimization Techniques Outline Translating SQL Queries into Relational Algebra Algorithms for External Sorting Algorithms for SELECT and JOIN Operations Algorithms
More informationStep 4: Choose file organizations and indexes
Step 4: Choose file organizations and indexes Asst. Prof. Dr. Kanda Saikaew (krunapon@kku.ac.th) Dept of Computer Engineering Khon Kaen University Overview How to analyze users transactions to determine
More informationExamples of Physical Query Plan Alternatives. Selected Material from Chapters 12, 14 and 15
Examples of Physical Query Plan Alternatives Selected Material from Chapters 12, 14 and 15 1 Query Optimization NOTE: SQL provides many ways to express a query. HENCE: System has many options for evaluating
More informationOutline. Query Types
Outline Database Tuning Nikolaus Augsten nikolaus.augsten@sbg.ac.at Department of Computer Sciences University of Salzburg http://dbresearch.uni-salzburg.at 1 Examples SS 2017/18 Version May 14, 2018 Adapted
More informationMidterm Review. March 27, 2017
Midterm Review March 27, 2017 1 Overview Relational Algebra & Query Evaluation Relational Algebra Rewrites Index Design / Selection Physical Layouts 2 Relational Algebra & Query Evaluation 3 Relational
More informationSomething to think about. Problems. Purpose. Vocabulary. Query Evaluation Techniques for large DB. Part 1. Fact:
Query Evaluation Techniques for large DB Part 1 Fact: While data base management systems are standard tools in business data processing they are slowly being introduced to all the other emerging data base
More informationDatabase Management System
Database Management System Lecture Join * Some materials adapted from R. Ramakrishnan, J. Gehrke and Shawn Bowers Today s Agenda Join Algorithm Database Management System Join Algorithms Database Management
More informationCS330. Query Processing
CS330 Query Processing 1 Overview of Query Evaluation Plan: Tree of R.A. ops, with choice of alg for each op. Each operator typically implemented using a `pull interface: when an operator is `pulled for
More informationQUERY PROCESSING & OPTIMIZATION CHAPTER 19 (6/E) CHAPTER 15 (5/E)
QUERY PROCESSING & OPTIMIZATION CHAPTER 19 (6/E) CHAPTER 15 (5/E) 2 LECTURE OUTLINE Query Processing Methodology Basic Operations and Their Costs Generation of Execution Plans 3 QUERY PROCESSING IN A DDBMS
More informationAnnouncements. Reading Material. Recap. Today 9/17/17. Storage (contd. from Lecture 6)
CompSci 16 Intensive Computing Systems Lecture 7 Storage and Index Instructor: Sudeepa Roy Announcements HW1 deadline this week: Due on 09/21 (Thurs), 11: pm, no late days Project proposal deadline: Preliminary
More informationOutline. Database Management and Tuning. Outline. Join Strategies Running Example. Index Tuning. Johann Gamper. Unit 6 April 12, 2012
Outline Database Management and Tuning Johann Gamper Free University of Bozen-Bolzano Faculty of Computer Science IDSE Unit 6 April 12, 2012 1 Acknowledgements: The slides are provided by Nikolaus Augsten
More informationModern Database Systems Lecture 1
Modern Database Systems Lecture 1 Aristides Gionis Michael Mathioudakis T.A.: Orestis Kostakis Spring 2016 logistics assignment will be up by Monday (you will receive email) due Feb 12 th if you re not
More informationOverview of Query Evaluation. Overview of Query Evaluation
Overview of Query Evaluation Chapter 12 Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1 Overview of Query Evaluation v Plan: Tree of R.A. ops, with choice of alg for each op. Each operator
More informationTradeoffs in Processing Multi-Way Join Queries via Hashing in Multiprocessor Database Machines
Tradeoffs in Processing Multi-Way Queries via Hashing in Multiprocessor Database Machines Donovan A. Schneider David J. DeWitt Computer Sciences Department University of Wisconsin This research was partially
More informationDatabase Applications (15-415)
Database Applications (15-415) DBMS Internals- Part V Lecture 13, March 10, 2014 Mohammad Hammoud Today Welcome Back from Spring Break! Today Last Session: DBMS Internals- Part IV Tree-based (i.e., B+
More informationCSE 544 Principles of Database Management Systems. Magdalena Balazinska Fall 2007 Lecture 7 - Query execution
CSE 544 Principles of Database Management Systems Magdalena Balazinska Fall 2007 Lecture 7 - Query execution References Generalized Search Trees for Database Systems. J. M. Hellerstein, J. F. Naughton
More informationChapter 12: Indexing and Hashing
Chapter 12: Indexing and Hashing Database System Concepts, 5th Ed. See www.db-book.com for conditions on re-use Chapter 12: Indexing and Hashing Basic Concepts Ordered Indices B + -Tree Index Files B-Tree
More informationUniversity of Massachusetts Amherst Department of Computer Science Prof. Yanlei Diao
University of Massachusetts Amherst Department of Computer Science Prof. Yanlei Diao CMPSCI 445 Midterm Practice Questions NAME: LOGIN: Write all of your answers directly on this paper. Be sure to clearly
More informationCMPUT 391 Database Management Systems. Query Processing: The Basics. Textbook: Chapter 10. (first edition: Chapter 13) University of Alberta 1
CMPUT 391 Database Management Systems Query Processing: The Basics Textbook: Chapter 10 (first edition: Chapter 13) Based on slides by Lewis, Bernstein and Kifer University of Alberta 1 External Sorting
More informationOverview of Storage and Indexing
Overview of Storage and Indexing Chapter 8 Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1 Data on External Storage Disks: Can retrieve random page at fixed cost But reading several consecutive
More informationCost-based Query Sub-System. Carnegie Mellon Univ. Dept. of Computer Science /615 - DB Applications. Last Class.
Cost-based Query Sub-System Carnegie Mellon Univ. Dept. of Computer Science 15-415/615 - DB Applications Queries Select * From Blah B Where B.blah = blah Query Parser Query Optimizer C. Faloutsos A. Pavlo
More informationPS2 out today. Lab 2 out today. Lab 1 due today - how was it?
6.830 Lecture 7 9/25/2017 PS2 out today. Lab 2 out today. Lab 1 due today - how was it? Project Teams Due Wednesday Those of you who don't have groups -- send us email, or hand in a sheet with just your
More informationIntroduction to Spatial Database Systems
Introduction to Spatial Database Systems by Cyrus Shahabi from Ralf Hart Hartmut Guting s VLDB Journal v3, n4, October 1994 Data Structures & Algorithms 1. Implementation of spatial algebra in an integrated
More informationModule 9: Selectivity Estimation
Module 9: Selectivity Estimation Module Outline 9.1 Query Cost and Selectivity Estimation 9.2 Database profiles 9.3 Sampling 9.4 Statistics maintained by commercial DBMS Web Forms Transaction Manager Lock
More informationPhysical Level of Databases: B+-Trees
Physical Level of Databases: B+-Trees Adnan YAZICI Computer Engineering Department METU (Fall 2005) 1 B + -Tree Index Files l Disadvantage of indexed-sequential files: performance degrades as file grows,
More informationHash table example. B+ Tree Index by Example Recall binary trees from CSE 143! Clustered vs Unclustered. Example
Student Introduction to Database Systems CSE 414 Hash table example Index Student_ID on Student.ID Data File Student 10 Tom Hanks 10 20 20 Amy Hanks ID fname lname 10 Tom Hanks 20 Amy Hanks Lecture 26:
More informationUniversity of Waterloo Midterm Examination Solution
University of Waterloo Midterm Examination Solution Winter, 2011 1. (6 total marks) The diagram below shows an extensible hash table with four hash buckets. Each number x in the buckets represents an entry
More informationLecture 14. Lecture 14: Joins!
Lecture 14 Lecture 14: Joins! Lecture 14 Announcements: Two Hints You may want to do Trigger activity for project 2. We ve noticed those who do it have less trouble with project! Seems like we re good
More informationCSE 530A. B+ Trees. Washington University Fall 2013
CSE 530A B+ Trees Washington University Fall 2013 B Trees A B tree is an ordered (non-binary) tree where the internal nodes can have a varying number of child nodes (within some range) B Trees When a key
More informationAlgorithms for Query Processing and Optimization. 0. Introduction to Query Processing (1)
Chapter 19 Algorithms for Query Processing and Optimization 0. Introduction to Query Processing (1) Query optimization: The process of choosing a suitable execution strategy for processing a query. Two
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY Database Systems: Fall 2015 Quiz I
Department of Electrical Engineering and Computer Science MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.830 Database Systems: Fall 2015 Quiz I There are 12 questions and 13 pages in this quiz booklet. To receive
More informationSystem R Optimization (contd.)
System R Optimization (contd.) Instructor: Sharma Chakravarthy sharma@cse.uta.edu The University of Texas @ Arlington Database Management Systems, S. Chakravarthy 1 Optimization Criteria number of page
More informationCSCI-6421 Final Exam York University Fall Term 2004
6 December 2004 CS-6421 Final Exam p. 1 of 7 CSCI-6421 Final Exam York University Fall Term 2004 Due: 6pm Wednesday 15 December 2004 Last Name: First Name: Instructor: Parke Godfrey Exam Duration: take
More information2, 3, 5, 7, 11, 17, 19, 23, 29, 31
148 Chapter 12 Indexing and Hashing implementation may be by linking together fixed size buckets using overflow chains. Deletion is difficult with open hashing as all the buckets may have to inspected
More informationWhat happens. 376a. Database Design. Execution strategy. Query conversion. Next. Two types of techniques
376a. Database Design Dept. of Computer Science Vassar College http://www.cs.vassar.edu/~cs376 Class 16 Query optimization What happens Database is given a query Query is scanned - scanner creates a list
More informationChapter 11: Indexing and Hashing
Chapter 11: Indexing and Hashing Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Chapter 12: Indexing and Hashing Basic Concepts Ordered Indices B + -Tree Index Files B-Tree
More information