Adaptive Query Processing on Prefix Trees Wolfgang Lehner
|
|
- Erin May
- 5 years ago
- Views:
Transcription
1 Adaptive Query Processing on Prefix Trees Wolfgang Lehner Fachgruppentreffen, TU München Prof. Dr.-Ing. Wolfgang Lehner
2 > Challenges for Database Systems Three things are important in the database world: performance, performance and performance. Bruce Lindsey M. Stonebraker Extreme data Extreme performance Dynamo Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 2
3 > Challenges for Database Systems Extreme Performance Flexibility in Database Systems + during deployment time (schema definition) - during database lifetime (schema evolution) - during query runtime (scheduling, ) Extreme Flexibility Extreme Data Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 3
4 Database System > Flexibility from feet applications data comes first, schema comes second querying Web-Tables role-based object models Open Data platforms Flexibility is cross-cutting alternative query model (drill-beyond) record management logging an persistency HW/SW DB-CoDesign (MV)CC domain-specific data models Demand flexibility statistical operators query processing data model- related storage model- related Provide flexibility operating system & hardware StorageClassMemory MegaCore systems TeraByte-Capacity GPUs FPGAs/FPPAs Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 4
5 > Example 1: Data-driven Schema Design GSM GSM TriBand GSM, GPRS, Bluetooth, FM Dimensions/Weight Dimensions/Weight/ Form Factor Dimensions/Weight/ Form Factor GSM, UMTS, Bluetooth, USB, WiFi Dimensions/Weight GSM, UMTS, HSPA+, Bluetooth, FM, USB, WiFi, agps, HDMI Dimensions/Weight DotMatrix Display DotMatrix Display 2.0, QVGA, Color 3.5, , Touch 4.3, , Touch Standby/Talk Time Standby/Talk Time Standby/Talk Time Battery Capacity Battery Capacity #Names, #SMS #Names, #SMS Memory/Card Slot Memory Memory/Storage/Ca rd Slot Polyphonic Ringtones MP3 Ringtones/MP3 Player/Browser Browser/Exchange Browser/Exchange 3MP Camera 3MP Camera Front/Rear Camera/HD Video CPU, OS CPU, GPU, OS Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 5
6 > Example 2: Web Data Management Web Data: A Multitude of Data Models and Sources Example 1: Queries spanning millions of data sources with as many schemata Example 2: Augment local datasets with (external) third-party information select from cust, where nations.gdp > $1B Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 6
7 > Example 3: Energy-Adaptive High-Speed Computing Platform SFB912: HAEC Highly Adaptive Energy Efficient Computing Optical Interconnect adaptive analog/digital circuits for e/o transceiver embedded polymer waveguide packaging technologies (e.g. 3D stacking of Si/III-V hybrids) 90 coupling of laser Radio Interconnect on-chip/on-package antenna arrays analog/digital beamsteering and interference minimization 100Gb/s GHz channel 3D routing & flow management Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 7
8 > QPPT Overview 3 Composed Operators Construction of composed operators, like multi-way-select-join-group-sort Avoids Intermediate Materialization 2 Cooperative Operators Operators optimize their output for the next operator 1 Intermediate Indexed Tables Operators exchange clustered es 3 aggregate 1 Prefix Treebased Indexes CIDR In-Memory Database Cluster Indexes Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 8
9 > Prefix Trees Prefix Trees Unbalanced Balanced read/write performance Main memory-optimized Efficient parallel access B+-Trees Balanced Low update performance Disk-optimized Low scalability Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 9
10 > Index Landscape KISS-Tree 32bit key Buzzard NUMA optimized Clustered Column Store Scan optimized Adaptive Radix Tree (ART) Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 10
11 > KISS-Tree Overview Properties Specialized version for 32bit keys Latch-free updates Order-preserving 2-3 memory accesses per key Comparable fast to reported order-preserving in-memory es for read access BUT: High update performance Heterogeneous in-memory structure Combination of direct and indirect addressing Takes advantage of virtual memory management Enables different compression mechanisms DAEMON 2012 Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 11
12 > Level 1: Virtual Level Decimal Key: Split key in 3 fragments 16bit (f 1 ) 5 10bit (f 2 ) 6bit (f 3 ) Calculate corresponding 2nd node address from the first fragment Direct addressing No memory required for level 1 Level 2 Node Node Node Node Node Node Node Node Requirement for Level 2: All nodes have to be stored sequentially in memory Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 12
13 > Level 1: Virtual Level Decimal Key: Split key in 3 fragments 16bit (f 1 ) 10bit (f 2 ) 6bit (f 3 ) Level 2 Virtual Memory Address 4 KB 4 KB 4 KB 4 KB 4 KB 4 KB 4 KB 4 KB Page Directory Memory Management Unit (MMU) Hardware Operating System 4 KB Physical Memory Address 4 KB 4 KB 4 KB 4 KB 4 KB 4 KB 4 KB 4 KB Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 13
14 > Level 2: On-demand Level Decimal Key: bit (f 1 ) 10bit (f 2 ) 6bit (f 3 ) Change over to indirect addressing 1024 buckets per node containing a compact pointer to the 3 rd level node 256 MB maximum Level 2 Virtual Memory Address 4 KB 4 KB 4 KB 4 KB 4 KB 4 KB 4 KB 4 KB 32 bit = 1024 pointer to L3 leaf nodes = potential L3 leaf nodes of a size between 2 0 and 2 6 Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 14
15 > Level 2: On-demand Level KB 10bit (f 2 ) 32bit Number of existing elements in L3 leaf node (compression) 6bit 32bit 26bit Position of L3 leaf node in corresponding memory segment Allocation of consecutive virtual memory segments for each of the 64 node sizes possible on the third level Each segment consists of a maximum of 2 26 blocks of the respective node size blocks of size 1 pre-allocated 2 26 blocks of node size 64 pre-allocated Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 15
16 > Level 3: Compression Decimal Key: bit (f 1 ) 10bit (f 2 ) 6bit (f 3 ) possible node sizes Bitmap indicates which bucket is in use only existing values are stored 64bit Bitmask s Read-copy-updates Compare-and-swap No in-place updates possible (lost updates) Thread-Local Memory Management Subsystem Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 16
17 > Duplicate Handling Efficient duplicate handling necessary for query processing Scanning a linked list results in random memory accesses Page Size Limit Growing Segment Size 64Byte 128Byte 4KB Physical Memory (4KB Pages) Hardware Prefetching Boundary Page boundaries are a barrier for hardware prefetchers store values sequentially in 4KB blocks blocks grow exponentially until reaching 4KB trade-off between scan performance and memory consumption Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 17
18 > Prefix Tree Performance Insert/Update Performance Lookup Performance Performance depends on the number of memory accesses per key KISS-Tree: 3 memory accesses per key Latency Hiding with Batch Updates/Lookups KISS-Tree performs better than hash table implementations Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 18
19 > Cooperative Operators Each operator adjusts its output to the requirements of the successive operator traditional operators mostly build internal es anyway skip plain tuple exchange overhead aggregate provide high-quality input to each operator date.datekey lineorder.orderdate supplier.suppkey lineorder.suppkey part.partkey lineorder.partkey date supplier part lineorder Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 19
20 > Composed Operators (1) Materialization is costly; thus try to completly avoid it! aggregate integrate operators that materialize large intermediate results date.datekey lineorder.orderdate lineorder.suppkey supplier.suppkey part.partkey lineorder.partkey date supplier part lineorder Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 20
21 > Composed Operators (2) Materialization is costly; thus try to completly avoid it! 3-way-select-join aggregate date.datekey lineorder.orderdate supplier.suppkey 3 lineorder.partkey date supplier part lineorder Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 21
22 > Evaluation (1) Star Schema Benchmark (SSB) Scale Factor = 15 Intel i7-3960x 15 MB LLC 3.3 GHz (Turbo Mode disabled) 32 GB Main Memory Ubuntu Single-threaded execution Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 22
23 Execution Time [ms] > Evaluation (2) MonetDB Commercial DBMS DexterDB Q1.1 Q1.2 Q1.3 Q2.1 Q2.2 Q2.3 Q3.1 Q3.2 Q3.3 Q3.4 Q4.1 Q4.2 Q4.3 (*) : we are in touch with MonetDB; performance numbers are under investigation Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 23
24 > Evaluation (3) SSB Query 1.1 Single Join Select-Join operator avoids intermediate result materialization Batched KISS-Tree Lookups/Inserts SSB Query Joins over 5 Tables Multi-way join operator avoids intermediate result materialization Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 24
25 > Adaptive QPPT (1) Flexible Query Execution on Adaptive Hardware aggregate date.datekey lineorder.orderdate supplier.suppkey 3 lineorder.partkey date supplier part lineorder Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 25
26 part lineorder supplier date lineorder > Adaptive QPPT (2) Parallel Running Query 3 Board 1 Board reduce hops by wireless point-to-point interconnects operator data / data operator shipping adaptive hardware components (e.g., trade cache for compute units) Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 26
27 CMOS (industry focus) > Comprehensive Approach: 9 Paths More-Shots-on-Goal Information Processing Devices & Circuits Materials & Functions H HAEC: Highly Adaptive Energy-Efficient Computing A Silicon Nanowires B Carbon F Orchestration G Resilience C Organic D Biomolecular Assembly E Chemical I Biological Systems Prof. Dr.-Ing. Wolfgang Lehner Adaptive Query Processing on Prefix Trees 27
28 Adaptive Query Processing on Prefix Trees Wolfgang Lehner Fachgruppentreffen, TU München Prof. Dr.-Ing. Wolfgang Lehner
Main-Memory Centric Data Management Open Problems and Some Solutions
Main-Memory Centric Data Management Open Problems and Some Solutions Wolfgang Lehner Technische Universität Dresden Prof. Dr.-Ing. Wolfgang Lehner > The Reality: The Petabyte Age New Realities TB disks
More informationThe Tactile Internet Driving 5G
All rights reserved The Tactile Internet Driving 5G Gerhard P. Fettweis Vodafone Chair Professor 3D Chip-Stacks Saxony Abu Dhabi TU Dresden TI Dresden Masdar Institute TSVs inductive/capacitive optical
More informationPerformance in the Multicore Era
Performance in the Multicore Era Gustavo Alonso Systems Group -- ETH Zurich, Switzerland Systems Group Enterprise Computing Center Performance in the multicore era 2 BACKGROUND - SWISSBOX SwissBox: An
More informationIndexing. Jan Chomicki University at Buffalo. Jan Chomicki () Indexing 1 / 25
Indexing Jan Chomicki University at Buffalo Jan Chomicki () Indexing 1 / 25 Storage hierarchy Cache Main memory Disk Tape Very fast Fast Slower Slow (nanosec) (10 nanosec) (millisec) (sec) Very small Small
More informationFile System Implementation. Jin-Soo Kim Computer Systems Laboratory Sungkyunkwan University
File System Implementation Jin-Soo Kim (jinsookim@skku.edu) Computer Systems Laboratory Sungkyunkwan University http://csl.skku.edu Implementing a File System On-disk structures How does file system represent
More informationStorage hierarchy. Textbook: chapters 11, 12, and 13
Storage hierarchy Cache Main memory Disk Tape Very fast Fast Slower Slow Very small Small Bigger Very big (KB) (MB) (GB) (TB) Built-in Expensive Cheap Dirt cheap Disks: data is stored on concentric circular
More informationMain-Memory Databases 1 / 25
1 / 25 Motivation Hardware trends Huge main memory capacity with complex access characteristics (Caches, NUMA) Many-core CPUs SIMD support in CPUs New CPU features (HTM) Also: Graphic cards, FPGAs, low
More informationKISS-Tree: Smart Latch-Free In-Memory Indexing on Modern Architectures
KISS-Tree: Smart Latch-Free In-Memory Indexing on Modern Architectures Thomas Kissinger, Benjamin Schlegel, Dirk Habich, Wolfgang Lehner Database Technology Group Technische Universität Dresden 006 Dresden,
More informationFILE SYSTEMS. CS124 Operating Systems Winter , Lecture 23
FILE SYSTEMS CS124 Operating Systems Winter 2015-2016, Lecture 23 2 Persistent Storage All programs require some form of persistent storage that lasts beyond the lifetime of an individual process Most
More informationFile System Implementation
File System Implementation Jinkyu Jeong (jinkyu@skku.edu) Computer Systems Laboratory Sungkyunkwan University http://csl.skku.edu SSE3044: Operating Systems, Fall 2016, Jinkyu Jeong (jinkyu@skku.edu) Implementing
More informationCS3600 SYSTEMS AND NETWORKS
CS3600 SYSTEMS AND NETWORKS NORTHEASTERN UNIVERSITY Lecture 11: File System Implementation Prof. Alan Mislove (amislove@ccs.neu.edu) File-System Structure File structure Logical storage unit Collection
More informationB.H.GARDI COLLEGE OF ENGINEERING & TECHNOLOGY (MCA Dept.) Parallel Database Database Management System - 2
Introduction :- Today single CPU based architecture is not capable enough for the modern database that are required to handle more demanding and complex requirements of the users, for example, high performance,
More informationCSE 4/521 Introduction to Operating Systems. Lecture 23 File System Implementation II (Allocation Methods, Free-Space Management) Summer 2018
CSE 4/521 Introduction to Operating Systems Lecture 23 File System Implementation II (Allocation Methods, Free-Space Management) Summer 2018 Overview Objective: To discuss how the disk is managed for a
More informationAdvanced Databases: Parallel Databases A.Poulovassilis
1 Advanced Databases: Parallel Databases A.Poulovassilis 1 Parallel Database Architectures Parallel database systems use parallel processing techniques to achieve faster DBMS performance and handle larger
More informationKathleen Durant PhD Northeastern University CS Indexes
Kathleen Durant PhD Northeastern University CS 3200 Indexes Outline for the day Index definition Types of indexes B+ trees ISAM Hash index Choosing indexed fields Indexes in InnoDB 2 Indexes A typical
More informationFILE SYSTEMS, PART 2. CS124 Operating Systems Fall , Lecture 24
FILE SYSTEMS, PART 2 CS124 Operating Systems Fall 2017-2018, Lecture 24 2 Last Time: File Systems Introduced the concept of file systems Explored several ways of managing the contents of files Contiguous
More informationRAID in Practice, Overview of Indexing
RAID in Practice, Overview of Indexing CS634 Lecture 4, Feb 04 2014 Slides based on Database Management Systems 3 rd ed, Ramakrishnan and Gehrke 1 Disks and Files: RAID in practice For a big enterprise
More informationData Modeling and Databases Ch 10: Query Processing - Algorithms. Gustavo Alonso Systems Group Department of Computer Science ETH Zürich
Data Modeling and Databases Ch 10: Query Processing - Algorithms Gustavo Alonso Systems Group Department of Computer Science ETH Zürich Transactions (Locking, Logging) Metadata Mgmt (Schema, Stats) Application
More informationChe-Wei Chang Department of Computer Science and Information Engineering, Chang Gung University
Che-Wei Chang chewei@mail.cgu.edu.tw Department of Computer Science and Information Engineering, Chang Gung University Chapter 10: File System Chapter 11: Implementing File-Systems Chapter 12: Mass-Storage
More informationIndexing: Overview & Hashing. CS 377: Database Systems
Indexing: Overview & Hashing CS 377: Database Systems Recap: Data Storage Data items Records Memory DBMS Blocks blocks Files Different ways to organize files for better performance Disk Motivation for
More informationColumn Stores vs. Row Stores How Different Are They Really?
Column Stores vs. Row Stores How Different Are They Really? Daniel J. Abadi (Yale) Samuel R. Madden (MIT) Nabil Hachem (AvantGarde) Presented By : Kanika Nagpal OUTLINE Introduction Motivation Background
More informationData Modeling and Databases Ch 9: Query Processing - Algorithms. Gustavo Alonso Systems Group Department of Computer Science ETH Zürich
Data Modeling and Databases Ch 9: Query Processing - Algorithms Gustavo Alonso Systems Group Department of Computer Science ETH Zürich Transactions (Locking, Logging) Metadata Mgmt (Schema, Stats) Application
More informationAdvanced Database Systems
Lecture IV Query Processing Kyumars Sheykh Esmaili Basic Steps in Query Processing 2 Query Optimization Many equivalent execution plans Choosing the best one Based on Heuristics, Cost Will be discussed
More informationDatenbanksysteme II: Caching and File Structures. Ulf Leser
Datenbanksysteme II: Caching and File Structures Ulf Leser Content of this Lecture Caching Overview Accessing data Cache replacement strategies Prefetching File structure Index Files Ulf Leser: Implementation
More informationChapter 8 Main Memory
COP 4610: Introduction to Operating Systems (Spring 2014) Chapter 8 Main Memory Zhi Wang Florida State University Contents Background Swapping Contiguous memory allocation Paging Segmentation OS examples
More informationFile System Internals. Jin-Soo Kim Computer Systems Laboratory Sungkyunkwan University
File System Internals Jin-Soo Kim (jinsookim@skku.edu) Computer Systems Laboratory Sungkyunkwan University http://csl.skku.edu Today s Topics File system implementation File descriptor table, File table
More informationSome Practice Problems on Hardware, File Organization and Indexing
Some Practice Problems on Hardware, File Organization and Indexing Multiple Choice State if the following statements are true or false. 1. On average, repeated random IO s are as efficient as repeated
More informationHyPer-sonic Combined Transaction AND Query Processing
HyPer-sonic Combined Transaction AND Query Processing Thomas Neumann Technische Universität München December 2, 2011 Motivation There are different scenarios for database usage: OLTP: Online Transaction
More informationMemory Management. Dr. Yingwu Zhu
Memory Management Dr. Yingwu Zhu Big picture Main memory is a resource A process/thread is being executing, the instructions & data must be in memory Assumption: Main memory is infinite Allocation of memory
More informationChapter 8: Main Memory
Chapter 8: Main Memory Chapter 8: Memory Management Background Swapping Contiguous Memory Allocation Segmentation Paging Structure of the Page Table Example: The Intel 32 and 64-bit Architectures Example:
More informationHyPer-sonic Combined Transaction AND Query Processing
HyPer-sonic Combined Transaction AND Query Processing Thomas Neumann Technische Universität München October 26, 2011 Motivation - OLTP vs. OLAP OLTP and OLAP have very different requirements OLTP high
More informationNPTEL Course Jan K. Gopinath Indian Institute of Science
Storage Systems NPTEL Course Jan 2012 (Lecture 39) K. Gopinath Indian Institute of Science Google File System Non-Posix scalable distr file system for large distr dataintensive applications performance,
More informationExternal Sorting Implementing Relational Operators
External Sorting Implementing Relational Operators 1 Readings [RG] Ch. 13 (sorting) 2 Where we are Working our way up from hardware Disks File abstraction that supports insert/delete/scan Indexing for
More informationIndexing. Week 14, Spring Edited by M. Naci Akkøk, , Contains slides from 8-9. April 2002 by Hector Garcia-Molina, Vera Goebel
Indexing Week 14, Spring 2005 Edited by M. Naci Akkøk, 5.3.2004, 3.3.2005 Contains slides from 8-9. April 2002 by Hector Garcia-Molina, Vera Goebel Overview Conventional indexes B-trees Hashing schemes
More informationChapter 8: Memory-Management Strategies
Chapter 8: Memory-Management Strategies Chapter 8: Memory Management Strategies Background Swapping Contiguous Memory Allocation Segmentation Paging Structure of the Page Table Example: The Intel 32 and
More informationDATABASE PERFORMANCE AND INDEXES. CS121: Relational Databases Fall 2017 Lecture 11
DATABASE PERFORMANCE AND INDEXES CS121: Relational Databases Fall 2017 Lecture 11 Database Performance 2 Many situations where query performance needs to be improved e.g. as data size grows, query performance
More informationJignesh M. Patel. Blog:
Jignesh M. Patel Blog: http://bigfastdata.blogspot.com Go back to the design Query Cache from Processing for Conscious 98s Modern (at Algorithms Hardware least for Hash Joins) 995 24 2 Processor Processor
More informationColumn-Stores vs. Row-Stores. How Different are they Really? Arul Bharathi
Column-Stores vs. Row-Stores How Different are they Really? Arul Bharathi Authors Daniel J.Abadi Samuel R. Madden Nabil Hachem 2 Contents Introduction Row Oriented Execution Column Oriented Execution Column-Store
More informationHYDRAstor: a Scalable Secondary Storage
HYDRAstor: a Scalable Secondary Storage 7th USENIX Conference on File and Storage Technologies (FAST '09) February 26 th 2009 C. Dubnicki, L. Gryz, L. Heldt, M. Kaczmarczyk, W. Kilian, P. Strzelczak, J.
More informationWhy Is This Important? Overview of Storage and Indexing. Components of a Disk. Data on External Storage. Accessing a Disk Page. Records on a Disk Page
Why Is This Important? Overview of Storage and Indexing Chapter 8 DB performance depends on time it takes to get the data from storage system and time to process Choosing the right index for faster access
More informationSchema-Agnostic Indexing with Azure Document DB
Schema-Agnostic Indexing with Azure Document DB Introduction Azure DocumentDB is Microsoft s multi-tenant distributed database service for managing JSON documents at Internet scale Multi-tenancy is an
More informationVoldemort. Smruti R. Sarangi. Department of Computer Science Indian Institute of Technology New Delhi, India. Overview Design Evaluation
Voldemort Smruti R. Sarangi Department of Computer Science Indian Institute of Technology New Delhi, India Smruti R. Sarangi Leader Election 1/29 Outline 1 2 3 Smruti R. Sarangi Leader Election 2/29 Data
More informationOperating Systems. Week 9 Recitation: Exam 2 Preview Review of Exam 2, Spring Paul Krzyzanowski. Rutgers University.
Operating Systems Week 9 Recitation: Exam 2 Preview Review of Exam 2, Spring 2014 Paul Krzyzanowski Rutgers University Spring 2015 March 27, 2015 2015 Paul Krzyzanowski 1 Exam 2 2012 Question 2a One of
More informationCHAPTER 8 - MEMORY MANAGEMENT STRATEGIES
CHAPTER 8 - MEMORY MANAGEMENT STRATEGIES OBJECTIVES Detailed description of various ways of organizing memory hardware Various memory-management techniques, including paging and segmentation To provide
More informationInformation Systems (Informationssysteme)
Information Systems (Informationssysteme) Jens Teubner, TU Dortmund jens.teubner@cs.tu-dortmund.de Summer 2018 c Jens Teubner Information Systems Summer 2018 1 Part IX B-Trees c Jens Teubner Information
More informationCOMP 530: Operating Systems File Systems: Fundamentals
File Systems: Fundamentals Don Porter Portions courtesy Emmett Witchel 1 Files What is a file? A named collection of related information recorded on secondary storage (e.g., disks) File attributes Name,
More informationChapter 8: Main Memory. Operating System Concepts 9 th Edition
Chapter 8: Main Memory Silberschatz, Galvin and Gagne 2013 Chapter 8: Memory Management Background Swapping Contiguous Memory Allocation Segmentation Paging Structure of the Page Table Example: The Intel
More informationFile Systems: Fundamentals
File Systems: Fundamentals 1 Files! What is a file? Ø A named collection of related information recorded on secondary storage (e.g., disks)! File attributes Ø Name, type, location, size, protection, creator,
More informationDisks & Files. Yanlei Diao UMass Amherst. Slides Courtesy of R. Ramakrishnan and J. Gehrke
Disks & Files Yanlei Diao UMass Amherst Slides Courtesy of R. Ramakrishnan and J. Gehrke DBMS Architecture Query Parser Query Rewriter Query Optimizer Query Executor Lock Manager for Concurrency Access
More informationOutlook. File-System Interface Allocation-Methods Free Space Management
File System Outlook File-System Interface Allocation-Methods Free Space Management 2 File System Interface File Concept File system is the most visible part of an OS Files storing related data Directory
More informationFile System Internals. Jo, Heeseung
File System Internals Jo, Heeseung Today's Topics File system implementation File descriptor table, File table Virtual file system File system design issues Directory implementation: filename -> metadata
More informationFile Systems: Fundamentals
1 Files Fundamental Ontology of File Systems File Systems: Fundamentals What is a file? Ø A named collection of related information recorded on secondary storage (e.g., disks) File attributes Ø Name, type,
More informationHICAMP Bitmap. A Space-Efficient Updatable Bitmap Index for In-Memory Databases! Bo Wang, Heiner Litz, David R. Cheriton Stanford University DAMON 14
HICAMP Bitmap A Space-Efficient Updatable Bitmap Index for In-Memory Databases! Bo Wang, Heiner Litz, David R. Cheriton Stanford University DAMON 14 Database Indexing Databases use precomputed indexes
More informationEFFICIENTLY ENABLING CONVENTIONAL BLOCK SIZES FOR VERY LARGE DIE- STACKED DRAM CACHES
EFFICIENTLY ENABLING CONVENTIONAL BLOCK SIZES FOR VERY LARGE DIE- STACKED DRAM CACHES MICRO 2011 @ Porte Alegre, Brazil Gabriel H. Loh [1] and Mark D. Hill [2][1] December 2011 [1] AMD Research [2] University
More informationCPSC 421 Database Management Systems. Lecture 11: Storage and File Organization
CPSC 421 Database Management Systems Lecture 11: Storage and File Organization * Some material adapted from R. Ramakrishnan, L. Delcambre, and B. Ludaescher Today s Agenda Start on Database Internals:
More informationArchitecture and Implementation of Database Systems (Winter 2014/15)
Jens Teubner Architecture & Implementation of DBMS Winter 2014/15 1 Architecture and Implementation of Database Systems (Winter 2014/15) Jens Teubner, DBIS Group jens.teubner@cs.tu-dortmund.de Winter 2014/15
More informationRack-scale Data Processing System
Rack-scale Data Processing System Jana Giceva, Darko Makreshanski, Claude Barthels, Alessandro Dovis, Gustavo Alonso Systems Group, Department of Computer Science, ETH Zurich Rack-scale Data Processing
More informationClassifying Information Stored in Memory! Memory Management in a Uniprogrammed System! Segments of a Process! Processing a User Program!
Memory Management in a Uniprogrammed System! A! gets a fixed segment of (usually highest )"! One process executes at a time in a single segment"! Process is always loaded at "! Compiler and linker generate
More informationSegmentation with Paging. Review. Segmentation with Page (MULTICS) Segmentation with Page (MULTICS) Segmentation with Page (MULTICS)
Review Segmentation Segmentation Implementation Advantage of Segmentation Protection Sharing Segmentation with Paging Segmentation with Paging Segmentation with Paging Reason for the segmentation with
More informationOS impact on performance
PhD student CEA, DAM, DIF, F-91297, Arpajon, France Advisor : William Jalby CEA supervisor : Marc Pérache 1 Plan Remind goal of OS Reproducibility Conclusion 2 OS : between applications and hardware 3
More informationCIS 601 Graduate Seminar. Dr. Sunnie S. Chung Dhruv Patel ( ) Kalpesh Sharma ( )
Guide: CIS 601 Graduate Seminar Presented By: Dr. Sunnie S. Chung Dhruv Patel (2652790) Kalpesh Sharma (2660576) Introduction Background Parallel Data Warehouse (PDW) Hive MongoDB Client-side Shared SQL
More informationChapter 12: Query Processing. Chapter 12: Query Processing
Chapter 12: Query Processing Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Chapter 12: Query Processing Overview Measures of Query Cost Selection Operation Sorting Join
More informationPathfinder/MonetDB: A High-Performance Relational Runtime for XQuery
Introduction Problems & Solutions Join Recognition Experimental Results Introduction GK Spring Workshop Waldau: Pathfinder/MonetDB: A High-Performance Relational Runtime for XQuery Database & Information
More informationFile System Implementation
File System Implementation Last modified: 16.05.2017 1 File-System Structure Virtual File System and FUSE Directory Implementation Allocation Methods Free-Space Management Efficiency and Performance. Buffering
More informationL9: Storage Manager Physical Data Organization
L9: Storage Manager Physical Data Organization Disks and files Record and file organization Indexing Tree-based index: B+-tree Hash-based index c.f. Fig 1.3 in [RG] and Fig 2.3 in [EN] Functional Components
More informationGoals of Memory Management
Memory Management Goals of Memory Management Allocate available memory efficiently to multiple processes Main functions Allocate memory to processes when needed Keep track of what memory is used and what
More informationAnnouncements. Reading Material. Recap. Today 9/17/17. Storage (contd. from Lecture 6)
CompSci 16 Intensive Computing Systems Lecture 7 Storage and Index Instructor: Sudeepa Roy Announcements HW1 deadline this week: Due on 09/21 (Thurs), 11: pm, no late days Project proposal deadline: Preliminary
More informationDistributed Filesystem
Distributed Filesystem 1 How do we get data to the workers? NAS Compute Nodes SAN 2 Distributing Code! Don t move data to workers move workers to the data! - Store data on the local disks of nodes in the
More informationBuffer Management for XFS in Linux. William J. Earl SGI
Buffer Management for XFS in Linux William J. Earl SGI XFS Requirements for a Buffer Cache Delayed allocation of disk space for cached writes supports high write performance Delayed allocation main memory
More informationChapter 7: Main Memory. Operating System Concepts Essentials 8 th Edition
Chapter 7: Main Memory Operating System Concepts Essentials 8 th Edition Silberschatz, Galvin and Gagne 2011 Chapter 7: Memory Management Background Swapping Contiguous Memory Allocation Paging Structure
More informationHeckaton. SQL Server's Memory Optimized OLTP Engine
Heckaton SQL Server's Memory Optimized OLTP Engine Agenda Introduction to Hekaton Design Consideration High Level Architecture Storage and Indexing Query Processing Transaction Management Transaction Durability
More informationChapter 17: Parallel Databases
Chapter 17: Parallel Databases Introduction I/O Parallelism Interquery Parallelism Intraquery Parallelism Intraoperation Parallelism Interoperation Parallelism Design of Parallel Systems Database Systems
More informationTrafficDB: HERE s High Performance Shared-Memory Data Store Ricardo Fernandes, Piotr Zaczkowski, Bernd Göttler, Conor Ettinoffe, and Anis Moussa
TrafficDB: HERE s High Performance Shared-Memory Data Store Ricardo Fernandes, Piotr Zaczkowski, Bernd Göttler, Conor Ettinoffe, and Anis Moussa EPL646: Advanced Topics in Databases Christos Hadjistyllis
More informationChapter 14: File-System Implementation
Chapter 14: File-System Implementation Directory Implementation Allocation Methods Free-Space Management Efficiency and Performance Recovery 14.1 Silberschatz, Galvin and Gagne 2013 Objectives To describe
More informationMeet the Walkers! Accelerating Index Traversals for In-Memory Databases"
Meet the Walkers! Accelerating Index Traversals for In-Memory Databases Onur Kocberber Boris Grot, Javier Picorel, Babak Falsafi, Kevin Lim, Parthasarathy Ranganathan Our World is Data-Driven! Data resides
More informationDisks, Memories & Buffer Management
Disks, Memories & Buffer Management The two offices of memory are collection and distribution. - Samuel Johnson CS3223 - Storage 1 What does a DBMS Store? Relations Actual data Indexes Data structures
More informationJoin Processing for Flash SSDs: Remembering Past Lessons
Join Processing for Flash SSDs: Remembering Past Lessons Jaeyoung Do, Jignesh M. Patel Department of Computer Sciences University of Wisconsin-Madison $/MB GB Flash Solid State Drives (SSDs) Benefits of
More informationCHAPTER 11: IMPLEMENTING FILE SYSTEMS (COMPACT) By I-Chen Lin Textbook: Operating System Concepts 9th Ed.
CHAPTER 11: IMPLEMENTING FILE SYSTEMS (COMPACT) By I-Chen Lin Textbook: Operating System Concepts 9th Ed. File-System Structure File structure Logical storage unit Collection of related information File
More informationHuge market -- essentially all high performance databases work this way
11/5/2017 Lecture 16 -- Parallel & Distributed Databases Parallel/distributed databases: goal provide exactly the same API (SQL) and abstractions (relational tables), but partition data across a bunch
More informationOperating Systems Memory Management. Mathieu Delalandre University of Tours, Tours city, France
Operating Systems Memory Management Mathieu Delalandre University of Tours, Tours city, France mathieu.delalandre@univ-tours.fr 1 Operating Systems Memory Management 1. Introduction 2. Contiguous memory
More informationBest Practices for Setting BIOS Parameters for Performance
White Paper Best Practices for Setting BIOS Parameters for Performance Cisco UCS E5-based M3 Servers May 2013 2014 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public. Page
More informationPreview. Memory Management
Preview Memory Management With Mono-Process With Multi-Processes Multi-process with Fixed Partitions Modeling Multiprogramming Swapping Memory Management with Bitmaps Memory Management with Free-List Virtual
More informationProfiling and Debugging OpenCL Applications with ARM Development Tools. October 2014
Profiling and Debugging OpenCL Applications with ARM Development Tools October 2014 1 Agenda 1. Introduction to GPU Compute 2. ARM Development Solutions 3. Mali GPU Architecture 4. Using ARM DS-5 Streamline
More informationChapter 8: Main Memory
Chapter 8: Main Memory Silberschatz, Galvin and Gagne 2013 Chapter 8: Memory Management Background Swapping Contiguous Memory Allocation Segmentation Paging Structure of the Page Table Example: The Intel
More informationChapter Outline. Chapter 2 Distributed Information Systems Architecture. Layers of an information system. Design strategies.
Prof. Dr.-Ing. Stefan Deßloch AG Heterogene Informationssysteme Geb. 36, Raum 329 Tel. 0631/205 3275 dessloch@informatik.uni-kl.de Chapter 2 Distributed Information Systems Architecture Chapter Outline
More informationUsing space-filling curves for multidimensional
Using space-filling curves for multidimensional indexing Dr. Bisztray Dénes Senior Research Engineer 1 Nokia Solutions and Networks 2014 In medias res Performance problems with RDBMS Switch to NoSQL store
More informationChapter 3 - Memory Management
Chapter 3 - Memory Management Luis Tarrataca luis.tarrataca@gmail.com CEFET-RJ L. Tarrataca Chapter 3 - Memory Management 1 / 222 1 A Memory Abstraction: Address Spaces The Notion of an Address Space Swapping
More informationAdvanced Database Systems
Lecture II Storage Layer Kyumars Sheykh Esmaili Course s Syllabus Core Topics Storage Layer Query Processing and Optimization Transaction Management and Recovery Advanced Topics Cloud Computing and Web
More informationThe Google File System
The Google File System Sanjay Ghemawat, Howard Gobioff and Shun Tak Leung Google* Shivesh Kumar Sharma fl4164@wayne.edu Fall 2015 004395771 Overview Google file system is a scalable distributed file system
More informationFile System Internals. Jin-Soo Kim Computer Systems Laboratory Sungkyunkwan University
File System Internals Jin-Soo Kim (jinsookim@skku.edu) Computer Systems Laboratory Sungkyunkwan University http://csl.skku.edu Today s Topics File system implementation File descriptor table, File table
More informationNear Memory Key/Value Lookup Acceleration MemSys 2017
Near Key/Value Lookup Acceleration MemSys 2017 October 3, 2017 Scott Lloyd, Maya Gokhale Center for Applied Scientific Computing This work was performed under the auspices of the U.S. Department of Energy
More informationVirtual Memory 1. To do. q Segmentation q Paging q A hybrid system
Virtual Memory 1 To do q Segmentation q Paging q A hybrid system Address spaces and multiple processes IBM OS/360 Split memory in n parts (possible!= sizes) A process per partition Program Code Heap Operating
More informationOrdered Indices To gain fast random access to records in a file, we can use an index structure. Each index structure is associated with a particular search key. Just like index of a book, library catalog,
More informationComputer Architecture. Lecture 8: Virtual Memory
Computer Architecture Lecture 8: Virtual Memory Dr. Ahmed Sallam Suez Canal University Spring 2015 Based on original slides by Prof. Onur Mutlu Memory (Programmer s View) 2 Ideal Memory Zero access time
More informationA Disseminated Distributed OS for Hardware Resource Disaggregation Yizhou Shan
LegoOS A Disseminated Distributed OS for Hardware Resource Disaggregation Yizhou Shan, Yutong Huang, Yilun Chen, and Yiying Zhang Y 4 1 2 Monolithic Server OS / Hypervisor 3 Problems? 4 cpu mem Resource
More information! Parallel machines are becoming quite common and affordable. ! Databases are growing increasingly large
Chapter 20: Parallel Databases Introduction! Introduction! I/O Parallelism! Interquery Parallelism! Intraquery Parallelism! Intraoperation Parallelism! Interoperation Parallelism! Design of Parallel Systems!
More informationChapter 20: Parallel Databases
Chapter 20: Parallel Databases! Introduction! I/O Parallelism! Interquery Parallelism! Intraquery Parallelism! Intraoperation Parallelism! Interoperation Parallelism! Design of Parallel Systems 20.1 Introduction!
More informationChapter 20: Parallel Databases. Introduction
Chapter 20: Parallel Databases! Introduction! I/O Parallelism! Interquery Parallelism! Intraquery Parallelism! Intraoperation Parallelism! Interoperation Parallelism! Design of Parallel Systems 20.1 Introduction!
More informationDatenbanksysteme II: Modern Hardware. Stefan Sprenger November 23, 2016
Datenbanksysteme II: Modern Hardware Stefan Sprenger November 23, 2016 Content of this Lecture Introduction to Modern Hardware CPUs, Cache Hierarchy Branch Prediction SIMD NUMA Cache-Sensitive Skip List
More informationDatabase Technology Database Architectures. Heiko Paulheim
Database Technology Database Architectures Today So far, we have treated Database Systems as a black box We can define a schema...and write data into it...and read data from it Today Opening the black
More information