Data Systems that are Easy to Design, Tune and Use. Stratos Idreos
|
|
- Tracy Gaines
- 5 years ago
- Views:
Transcription
1 Data Systems that are Easy to Design, Tune and Use
2 data systems that are easy to: (years) (months) design & build set-up & tune (hours/days) use e.g., adapt to new applications, new hardware, spin off alternative designs fast, create truly tailored solutions, enable more applications, bring development/set-up cost down (start-ups/scientific labs), etc
3 declarative interface ask what you want db system the system decides how to best store and access data
4 too many preparation options lead to complex installation schema storage load indexing query timeline expert users - idle time - workload knowledge!=exploration, easy
5 users/applications declarative interface ask what you want DBA db system
6 DBA1 users/applications DBA2 need to choose the proper system & workloads/ applications change rapidly data system 1 data system 2
7 be able to query the data immediately & with good performance schema X X X storage load indexing query
8 be able to query the data immediately & with good performance schema X X X storage load indexing query raw data explore data and gain knowledge immediately
9 design space it all starts with how we store data every bit matters from static to dynamic designs
10
11 idle time workload knowledge external tools human control
12 initialization querying indexing idle time workload knowledge external tools human control
13 initialization querying indexing idle time workload knowledge external tools human control
14 initialization querying indexing idle time workload every query is treated as an advice knowledge external tools on how data should be stored human control
15 initialization querying indexing continuous, lightweight actions to co-locate relevant data idle time workload every query is treated as an advice knowledge external tools on how data should be stored human control
16 MonetDB Presorted Sel. Crack Sid. Crack MySQL Presorted Response time (milli secs) TPC-H Query 15 normal MonetDB selection cracking Query sequence
17 MonetDB Presorted Sel. Crack Sid. Crack MySQL Presorted Response time (milli secs) TPC-H Query 15 normal MonetDB selection cracking presorted MonetDB preparation cost 3-14 minutes Query sequence
18 MonetDB Presorted Sel. Crack Sid. Crack MySQL Presorted Response time (milli secs) presorted MonetDB preparation cost 3-14 minutes TPC-H Query Query sequence normal MonetDB selection cracking MonetDB with sideways cracking
19 MonetDB Presorted Sel. Crack Sid. Crack MySQL Presorted Response time (milli secs) presorted MonetDB preparation cost 3-14 minutes TPC-H Query Query sequence normal MonetDB selection cracking MonetDB with sideways cracking
20 MonetDB Presorted Sel. Crack Sid. Crack MySQL Presorted Response time (milli secs) presorted MonetDB preparation cost 3-14 minutes TPC-H Query Query sequence normal MonetDB selection cracking MonetDB with sideways cracking
21 MonetDB Presorted Sel. Crack Sid. Crack MySQL Presorted Response time (milli secs) presorted MonetDB preparation cost 3-14 minutes TPC-H Query Query sequence normal MonetDB selection cracking MonetDB with sideways cracking
22 query plan scan
23 query plan scan db
24 query plan scan loading
25 query plan scan files loading access raw data adaptively on-the-fly
26 query plan scan files access raw data adaptively on-the-fly cache loading
27 selective parsing file indexing file splitting online statistics query plan scan files access raw data adaptively on-the-fly cache loading
28 basics (CIDR07) databases with adaptive storage updates (SIGMOD07) >1 columns >1 columns (SIGMOD09) storagerestrictions (SIGMOD09) robustness robustne (PVLDB12) algorithms (PVLDB11) benchmarking (TPCTC10) concurrency control (PVLDB12) adaptive storage (SIGMOD14) time-series (SIGMOD14) multi-cores (SIGMOD15) encryption (SIGMOD16) 12
29 design space
30 design space data system X
31 design space data system X adaptive data system Y
32 dai data 2.5 grows today y 2 [IB data systems are nearly everywhere
33 dai data 2.5 grows today y 2 [IB data systems are nearly everywhere continuous need for new and tailored data systems
34 dai data 2.5 grows today y 2 [IB data systems are nearly everywhere continuous need for new and tailored data systems tomorrow
35 daily data data* daily skills data years [IBMbigdata] years [StratosGuess] data system use, set-up, tune, design
36 e.g., modern main-memory optimized systems: first ideas in 80s, first advanced architectures in 90s, first rather complete designs in early 2000s, mainstream industry adoption still limited indexing, limited cost based optimizations,
37 disk memory flash easily utilize past concepts
38 self-designing data systems data+queries+hardware data system
39 self-designing data systems data+queries+hardware data system easy to design
40 1. write/extend modules in a high level language (optimizations) 2. modules= storage/execution/data flow 3. try out >>1 designs (sets of modules) ACM SIGMOD Blog, June 15
41 H20, ACM SIGMOD14, + evolutionary prototype row-store hybrid store column-store key-value store automatically choose the right layout using genetic algorithms + generate tailored code
42 H20, ACM SIGMOD14, + evolutionary prototype row-store hybrid store column-store key-value store automatically choose the right layout using genetic algorithms + generate tailored code construct access methods out of basic components (and their tradeoffs) e.g., scan*, tree*, bloom filters, bitmaps, hash tables, etc. RUM conjecture, EDBT16
43 SQL/correctness=expertise+knowledge!=everyone can explore data systems that are easy to use dbtouch cidr2013/icde2014 show me something interesting DATA Queriosity bigdata2015
44 data systems today allow us to answer queries fast db data systems tomorrow should allow us to find fast which queries to ask db explore
45 thank you! Martin Kersten Stefan Manegold Goetz Graefe Harumi Kuno Anastasia Ailamaki Themis Palpanas Eleni Petraki Ioannis Alagiannis Miguel Branco Renata Borovica Erietta Liarou Felix Halim Ronald Yap Panos Karras Holger Pirk Kostas Zoumpatianos Manos Athanassoulis Lukas Maas Abdul Wasay Mike Kester Dhruv Gupta
Overview of Data Exploration Techniques. Stratos Idreos, Olga Papaemmanouil, Surajit Chaudhuri
Overview of Data Exploration Techniques Stratos Idreos, Olga Papaemmanouil, Surajit Chaudhuri data exploration not always sure what we are looking for (until we find it) data has always been big volume
More information2017 & 2018 ACM SIGMOD Most Reproducible Paper Award
STRATOS IDREOS Assistant Professor in Computer Science Ph.D.: June 2010 Harvard John A. Paulson School of Engineering and Applied Sciences Citations > 2800 stratos.seas.harvard.edu stratos@seas.harvard.edu
More informationDASlab: The Data Systems Laboratory
DASlab: The Data Systems Laboratory at Harvard SEAS Stratos Idreos Harvard University http://daslab.seas.harvard.edu ABSTRACT DASlab is a new laboratory at the Harvard School of Engineering and Applied
More informationSTRATOS IDREOS Curriculum Vitae Mar 2018
STRATOS IDREOS Curriculum Vitae Mar 2018 Assistant Professor in Computer Science Ph.D.: June 2010 Harvard John A. Paulson School of Engineering and Applied Sciences Publications > 50 http://stratos.seas.harvard.edu/
More informationQuery processing on raw files. Vítor Uwe Reus
Query processing on raw files Vítor Uwe Reus Outline 1. Introduction 2. Adaptive Indexing 3. Hybrid MapReduce 4. NoDB 5. Summary Outline 1. Introduction 2. Adaptive Indexing 3. Hybrid MapReduce 4. NoDB
More informationNoDB: Querying Raw Data. --Mrutyunjay
NoDB: Querying Raw Data --Mrutyunjay Overview Introduction Motivation NoDB Philosophy: PostgreSQL Results Opportunities NoDB in Action: Adaptive Query Processing on Raw Data Ioannis Alagiannis, Renata
More informationclass 13 scans vs indexes prof. Stratos Idreos
class 13 scans vs indexes prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ b-tree - dynamic tree - always balanced 35,50 35, 12,20 50, 1,2,3 12,15,17 20, Stratos Idreos 2 /24 select from
More informationclass 17 updates prof. Stratos Idreos
class 17 updates prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ early/late tuple reconstruction, tuple-at-a-time, vectorized or bulk processing, intermediates format, pushing selects
More informationclass 17 updates prof. Stratos Idreos
class 17 updates prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ UPDATE table_name SET column1=value1,column2=value2,... WHERE some_column=some_value INSERT INTO table_name VALUES (value1,value2,value3,...)
More informationclass 6 more about column-store plans and compression prof. Stratos Idreos
class 6 more about column-store plans and compression prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ query compilation an ancient yet new topic/research challenge query->sql->interpet
More informationsystems & research project
class 4 systems & research project prof. HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS265/ index index knows order about the data data filtering data: point/range queries index data A B C sorted A B C initial
More informationNoDB: Efficient Query Execution on Raw Data Files
NoDB: Efficient Query Execution on Raw Data Files Ioannis Alagiannis Renata Borovica-Gajic Miguel Branco Stratos Idreos Anastasia Ailamaki Ecole Polytechnique Fédérale de Lausanne {ioannisalagiannis, renataborovica,
More informationSTUDY ON POTENTIAL CAPABILITIES OF A NODB SYSTEM
STUDY ON POTENTIAL CAPABILITIES OF A NODB SYSTEM Y. Jayanta Singh 1 and L. Kananbala Devi 2 Department of Computer Science & Engineering and Information Technology, Don Bosco College of Engineering and
More informationBenchmarking Adaptive Indexing
Benchmarking Adaptive Indexing Goetz Graefe 2, Stratos Idreos 1, Harumi Kuno 2, and Stefan Manegold 1 1 CWI Amsterdam The Netherlands first.last@cwi.nl 2 Hewlett-Packard Laboratories Palo Alto, CA first.last@hp.com
More informationFrom Auto-tuning One Size Fits All to Self-designed and Learned Data-intensive Systems
From Auto-tuning One Size Fits All to Self-designed and Learned Data-intensive Systems ABSTRACT Stratos Idreos Harvard University We survey new opportunities to design data systems, data structures and
More informationDBMS Data Loading: An Analysis on Modern Hardware. Adam Dziedzic, Manos Karpathiotakis*, Ioannis Alagiannis, Raja Appuswamy, Anastasia Ailamaki
DBMS Data Loading: An Analysis on Modern Hardware Adam Dziedzic, Manos Karpathiotakis*, Ioannis Alagiannis, Raja Appuswamy, Anastasia Ailamaki Data loading: A necessary evil Volume => Expensive 4 zettabytes
More informationclass 5 column stores 2.0 prof. Stratos Idreos
class 5 column stores 2.0 prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ worth thinking about what just happened? where is my data? email, cloud, social media, can we design systems
More informationCitation for published version (APA): Ydraios, E. (2010). Database cracking: towards auto-tunning database kernels
UvA-DARE (Digital Academic Repository) Database cracking: towards auto-tunning database kernels Ydraios, E. Link to publication Citation for published version (APA): Ydraios, E. (2010). Database cracking:
More informationScaling up analytical queries with column-stores
Scaling up analytical queries with column-stores Ioannis Alagiannis, Manos Athanassoulis, Anastasia Ailamaki Ecole Polytechnique Fédérale de Lausanne Lausanne, VD, Switzerland {ioannis.alagiannis, manos.athanassoulis,
More informationA Comparison of Memory Usage and CPU Utilization in Column-Based Database Architecture vs. Row-Based Database Architecture
A Comparison of Memory Usage and CPU Utilization in Column-Based Database Architecture vs. Row-Based Database Architecture By Gaurav Sheoran 9-Dec-08 Abstract Most of the current enterprise data-warehouses
More informationThe mixed workload CH-BenCHmark. Hybrid y OLTP&OLAP Database Systems Real-Time Business Intelligence Analytical information at your fingertips
The mixed workload CH-BenCHmark Hybrid y OLTP&OLAP Database Systems Real-Time Business Intelligence Analytical information at your fingertips Richard Cole (ParAccel), Florian Funke (TU München), Leo Giakoumakis
More informationSmooth Scan: Statistics-Oblivious Access Paths. Renata Borovica-Gajic Stratos Idreos Anastasia Ailamaki Marcin Zukowski Campbell Fraser
Smooth Scan: Statistics-Oblivious Access Paths Renata Borovica-Gajic Stratos Idreos Anastasia Ailamaki Marcin Zukowski Campbell Fraser Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 Q10 Q11 Q12 Q13 Q14 Q16 Q18 Q19 Q21 Q22
More informationMerging What s Cracked, Cracking What s Merged: Adaptive Indexing in Main-Memory Column-Stores
Merging What s Cracked, Cracking What s Merged: Adaptive Indexing in Main-Memory Column-Stores Stratos Idreos Stefan Manegold Harumi Kuno Goetz Graefe CWI, Amsterdam {stratosidreos, stefanmanegold}@cwinl
More informationThe Periodic Table of Data Structures
The Periodic Table of Data Structures Stratos Idreos Kostas Zoumpatianos Manos Athanassoulis Niv Dayan Brian Hentschel Michael S. Kester Demi Guo Lukas Maas Wilson Qin Abdul Wasay Yiyou Sun Harvard University
More informationHolistic Indexing in Main-memory Column-stores
Holistic Indexing in Main-memory Column-stores Eleni Petraki CWI Amsterdam petraki@cwi.nl Stratos Idreos Harvard University stratos@seas.harvard.edu Stefan Manegold CWI Amsterdam manegold@cwi.nl ABSTRACT
More informationclass 20 updates 2.0 prof. Stratos Idreos
class 20 updates 2.0 prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ UPDATE table_name SET column1=value1,column2=value2,... WHERE some_column=some_value INSERT INTO table_name VALUES
More informationAccess Path Selection in Main-Memory Optimized Data Systems
Access Path Selection in Main-Memory Optimized Data Systems Should I Scan or Should I Probe? Manos Athanassoulis Harvard University Talk at CS265, February 16 th, 2018 1 Access Path Selection SELECT x
More informationAnastasia Ailamaki. Performance and energy analysis using transactional workloads
Performance and energy analysis using transactional workloads Anastasia Ailamaki EPFL and RAW Labs SA students: Danica Porobic, Utku Sirin, and Pinar Tozun Online Transaction Processing $2B+ industry Characteristics:
More informationDesigning Database Operators for Flash-enabled Memory Hierarchies
Designing Database Operators for Flash-enabled Memory Hierarchies Goetz Graefe Stavros Harizopoulos Harumi Kuno Mehul A. Shah Dimitris Tsirogiannis Janet L. Wiener Hewlett-Packard Laboratories, Palo Alto,
More informationAdaptivity. Luca Schroeder & Thomas Lively
Adaptivity Luca Schroeder & Thomas Lively H2O: A Hands-free Adaptive Store. Ioannis Alagiannis, Stratos Idreos and Anastassia Ailamaki ACM SIGMOD International Conference on Data Management, 2014 Three
More informationcomplex plans and hybrid layouts
class 7 complex plans and hybrid layouts prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ essential column-stores features virtual ids late tuple reconstruction (if ever) vectorized execution
More informationJignesh M. Patel. Blog:
Jignesh M. Patel Blog: http://bigfastdata.blogspot.com Go back to the design Query Cache from Processing for Conscious 98s Modern (at Algorithms Hardware least for Hash Joins) 995 24 2 Processor Processor
More informationMonetDB/DataCell: leveraging the column-store database technology for efficient and scalable stream processing Liarou, E.
UvA-DARE (Digital Academic Repository) MonetDB/DataCell: leveraging the column-store database technology for efficient and scalable stream processing Liarou, E. Link to publication Citation for published
More informationbasic db architectures & layouts
class 4 basic db architectures & layouts prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ videos for sections 3 & 4 are online check back every week (1-2 sections weekly) there is a schedule
More informationcolumn-stores basics
class 3 column-stores basics prof. HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS265/ project description is now online First background info will be given this Friday and detailed lecture on Feb 21 Basic Readings
More informationdata systems 101 prof. Stratos Idreos class 2
class 2 data systems 101 prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS265/ 2 classes per week - OH/Labs every day 1 presentation/discussion lead - 2 reviews each week research (or systems)
More informationToward timely, predictable and cost-effective data analytics. Renata Borovica-Gajić DIAS, EPFL
Toward timely, predictable and cost-effective data analytics Renata Borovica-Gajić DIAS, EPFL Big data proliferation Big data is when the current technology does not enable users to obtain timely, cost-effective,
More informationAdaptive Cache Mode Selection for Queries over Raw Data
Adaptive Cache Mode Selection for Queries over Raw Data Tahir Azim, Azqa Nadeem and Anastasia Ailamaki École Polytechnique Fédérale de Lausanne TU Delft RAW Labs SA tahir.azim@epfl.ch, A.Nadeem@student.tudelft.nl,
More informationcolumn-stores basics
class 3 column-stores basics prof. HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS265/ Goetz Graefe Google Research guest lecture Justin Levandoski Microsoft Research projects option 1: systems project (now
More informationDATABASE CRACKING: Fancy Scan, not Poor Man s Sort! Don. Holger Pirk Eleni Petraki Strato Idreos
DATABASE CRACKING: Fancy Scan, not Poor Man s Sort! Hardware Folks Cracking Folks Don Holger Pirk Eleni Petraki Strato Idreos Stefan Manegold Martin Kersten EVALUATING RANGE PREDICATES COMPLEXITY ON PAPER
More informationIntroduction to K2View Fabric
Introduction to K2View Fabric 1 Introduction to K2View Fabric Overview In every industry, the amount of data being created and consumed on a daily basis is growing exponentially. Enterprises are struggling
More informationJoin Processing for Flash SSDs: Remembering Past Lessons
Join Processing for Flash SSDs: Remembering Past Lessons Jaeyoung Do, Jignesh M. Patel Department of Computer Sciences University of Wisconsin-Madison $/MB GB Flash Solid State Drives (SSDs) Benefits of
More informationConfiguration changes such as conversion from a single instance to RAC, ASM, etc.
Today, enterprises have to make sizeable investments in hardware and software to roll out infrastructure changes. For example, a data center may have an initiative to move databases to a low cost computing
More informationDATABASES AND THE CLOUD. Gustavo Alonso Systems Group / ECC Dept. of Computer Science ETH Zürich, Switzerland
DATABASES AND THE CLOUD Gustavo Alonso Systems Group / ECC Dept. of Computer Science ETH Zürich, Switzerland AVALOQ Conference Zürich June 2011 Systems Group www.systems.ethz.ch Enterprise Computing Center
More informationDesign Tradeoffs of Data Access Methods
Design Tradeoffs of Data Access Methods Manos Athanassoulis manos@seas.harvard.edu Harvard University Stratos Idreos stratos@seas.harvard.edu Harvard University ABSTRACT Database researchers and practitioners
More informationclass 9 fast scans 1.0 prof. Stratos Idreos
class 9 fast scans 1.0 prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ 1 pass to merge into 8 sorted pages (2N pages) 1 pass to merge into 4 sorted pages (2N pages) 1 pass to merge into
More informationclass 12 b-trees 2.0 prof. Stratos Idreos
class 12 b-trees 2.0 prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ A B C A B C clustered/primary index on A Stratos Idreos /26 2 A B C A B C clustered/primary index on A pos C pos
More informationIn-Memory Data Management Jens Krueger
In-Memory Data Management Jens Krueger Enterprise Platform and Integration Concepts Hasso Plattner Intitute OLTP vs. OLAP 2 Online Transaction Processing (OLTP) Organized in rows Online Analytical Processing
More informationWelcome to CO 572: Advanced Databases
Welcome to CO 572: Advanced Databases Holger Pirk Holger Pirk Welcome to CO 572: Advanced Databases 1 / 41 Purpose of this Lecture Figuring stu out What you know This should mostly be revision (tell me
More informationData Modeling and Databases Ch 10: Query Processing - Algorithms. Gustavo Alonso Systems Group Department of Computer Science ETH Zürich
Data Modeling and Databases Ch 10: Query Processing - Algorithms Gustavo Alonso Systems Group Department of Computer Science ETH Zürich Transactions (Locking, Logging) Metadata Mgmt (Schema, Stats) Application
More informationBest Practices for MySQL Scalability. Peter Zaitsev, CEO, Percona Percona Technical Webinars May 1, 2013
Best Practices for MySQL Scalability Peter Zaitsev, CEO, Percona Percona Technical Webinars May 1, 2013 About the Presentation Look into what is MySQL Scalability Identify Areas which impact MySQL Scalability
More informationData Modeling and Databases Ch 9: Query Processing - Algorithms. Gustavo Alonso Systems Group Department of Computer Science ETH Zürich
Data Modeling and Databases Ch 9: Query Processing - Algorithms Gustavo Alonso Systems Group Department of Computer Science ETH Zürich Transactions (Locking, Logging) Metadata Mgmt (Schema, Stats) Application
More informationCourse Outline. Performance Tuning and Optimizing SQL Databases Course 10987B: 4 days Instructor Led
Performance Tuning and Optimizing SQL Databases Course 10987B: 4 days Instructor Led About this course This four-day instructor-led course provides students who manage and maintain SQL Server databases
More informationclass 8 b-trees prof. Stratos Idreos
class 8 b-trees prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ I spend a lot of time debugging am I doing something wrong? maybe but probably not 1. learn to use gdb 2. after spending
More informationCrescando: Predictable Performance for Unpredictable Workloads
Crescando: Predictable Performance for Unpredictable Workloads G. Alonso, D. Fauser, G. Giannikis, D. Kossmann, J. Meyer, P. Unterbrunner Amadeus S.A. ETH Zurich, Systems Group (Funded by Enterprise Computing
More informationAn Integrated Approach to Performance Monitoring for Autonomous Tuning
IEEE International Conference on Data Engineering An Integrated Approach to Performance Monitoring for Autonomous Tuning Alexander Thiem Ingres Germany GmbH alexander.thiem@ingres.com Kai-Uwe Sattler Ilmenau
More informationVisualization of HashStash with Qt
Visualization of HashStash with Qt Zhe Zhao Department of Computer Science Brown University Providence, RI zhe_zhao at brown.edu May 8, 2015 Abstract HashStash is a new abstraction for the multi-query
More informationdata systems 101 prof. Stratos Idreos class 2
class 2 data systems 101 prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS265/ big data V s (it is not about size only) volume velocity variety veracity actually none of that is really new
More informationAN ELASTIC MULTI-CORE ALLOCATION MECHANISM FOR DATABASE SYSTEMS
1 AN ELASTIC MULTI-CORE ALLOCATION MECHANISM FOR DATABASE SYSTEMS SIMONE DOMINICO 1, JORGE A. MEIRA 2, MARCO A. Z. ALVES 1, EDUARDO C. DE ALMEIDA 1 FEDERAL UNIVERSITY OF PARANÁ, BRAZIL 1, UNIVERSITY OF
More informationOracle Data Warehousing Pushing the Limits. Introduction. Case Study. Jason Laws. Principal Consultant WhereScape Consulting
Oracle Data Warehousing Pushing the Limits Jason Laws Principal Consultant WhereScape Consulting Introduction Oracle is the leading database for data warehousing. This paper covers some of the reasons
More informationArchitecture-Conscious Database Systems
Architecture-Conscious Database Systems 2009 VLDB Summer School Shanghai Peter Boncz (CWI) Sources Thank You! l l l l Database Architectures for New Hardware VLDB 2004 tutorial, Anastassia Ailamaki Query
More informationHow Achaeans Would Construct Columns in Troy. Alekh Jindal, Felix Martin Schuhknecht, Jens Dittrich, Karen Khachatryan, Alexander Bunte
How Achaeans Would Construct Columns in Troy Alekh Jindal, Felix Martin Schuhknecht, Jens Dittrich, Karen Khachatryan, Alexander Bunte Number of Visas Received 1 0,75 0,5 0,25 0 Alekh Jens Health Level
More informationSelf Tuning Databases
Sections Self Tuning Databases Projects from Microsoft and IBM 1. Microsoft s Database Research Group 1. Situation and Vision 2. The AutoAdmin Project 3. Strategies for Index Selection 4. Components of
More informationThe Case for Heterogeneous HTAP
The Case for Heterogeneous HTAP Raja Appuswamy, Manos Karpathiotakis, Danica Porobic, and Anastasia Ailamaki Data-Intensive Applications and Systems Lab EPFL 1 HTAP the contract with the hardware Hybrid
More informationHOW INDEX TO STORE DATA DATA
Stratos Idreos HOW INDEX DATA TO STORE DATA ALGORITHMS data structure decisions define the algorithms that access data INDEX DATA ALGORITHMS unordered [7,4,2,6,1,3,9,10,5,8] INDEX DATA ALGORITHMS unordered
More informationStoring and Processing Temporal Data in a Main Memory Column Store
Storing and Processing Temporal Data in a Main Memory Column Store Martin Kaufmann (supervised by Prof. Dr. Donald Kossmann) SAP AG, Walldorf, Germany and Systems Group, ETH Zürich, Switzerland martin.kaufmann@inf.ethz.ch
More informationArrays are a very commonly used programming language construct, but have limited support within relational databases. Although an XML document or
Performance problems come in many flavors, with many different causes and many different solutions. I've run into a number of these that I have not seen written about or presented elsewhere and I want
More informationCut Me Some Slack : Latency-Aware Live Migration for Databases. Sean Barker, Yun Chi, Hyun Jin Moon, Hakan Hacigumus, and Prashant Shenoy
Cut Me Some Slack : Latency-Aware Live Migration for s Sean Barker, Yun Chi, Hyun Jin Moon, Hakan Hacigumus, and Prashant Shenoy University of Massachusetts Amherst NEC Laboratories America Department
More informationQuery Processing on Multi-Core Architectures
Query Processing on Multi-Core Architectures Frank Huber and Johann-Christoph Freytag Department for Computer Science, Humboldt-Universität zu Berlin Rudower Chaussee 25, 12489 Berlin, Germany {huber,freytag}@dbis.informatik.hu-berlin.de
More informationTosska SQL Tuning Expert Pro for Oracle
Tosska SQL Tuning Expert Pro for Oracle Intelligent SQL tuning without touching your source code It is not another SQL Tuning tool There have been already a lot of SQL tuning products in the market providing
More informationH2O: A Hands-free Adaptive Store
H2O: A Hands-free Adaptive Store Ioannis Alagiannis Stratos Idreos Anastasia Ailamaki Ecole Polytechnique Fédérale de Lausanne {ioannis.alagiannis, anastasia.ailamaki}@epfl.ch Harvard University stratos@seas.harvard.edu
More informationNoSQL database and its business applications
COSC 657 Db. Management Systems Professor: RAMESH K. Student: BUER JIANG Research paper NoSQL database and its business applications The original purpose has been contemporary web-expand dbs. The movement
More informationDatabase Management Systems
DATABASE CONCEPTS & APPLICATIONS Database Management Systems A Database Management System (DBMS) is a software package designed to store and manage databases through database applications. User Database
More informationOpen Data Standards for Administrative Data Processing
University of Pennsylvania ScholarlyCommons 2018 ADRF Network Research Conference Presentations ADRF Network Research Conference Presentations 11-2018 Open Data Standards for Administrative Data Processing
More informationIn-Memory Data Management
In-Memory Data Management Martin Faust Research Assistant Research Group of Prof. Hasso Plattner Hasso Plattner Institute for Software Engineering University of Potsdam Agenda 2 1. Changed Hardware 2.
More informationBridging the Processor/Memory Performance Gap in Database Applications
Bridging the Processor/Memory Performance Gap in Database Applications Anastassia Ailamaki Carnegie Mellon http://www.cs.cmu.edu/~natassa Memory Hierarchies PROCESSOR EXECUTION PIPELINE L1 I-CACHE L1 D-CACHE
More informationWeaving Relations for Cache Performance
Weaving Relations for Cache Performance Anastassia Ailamaki Carnegie Mellon David DeWitt, Mark Hill, and Marios Skounakis University of Wisconsin-Madison Memory Hierarchies PROCESSOR EXECUTION PIPELINE
More informationWeaving Relations for Cache Performance
Weaving Relations for Cache Performance Anastassia Ailamaki Carnegie Mellon Computer Platforms in 198 Execution PROCESSOR 1 cycles/instruction Data and Instructions cycles
More informationPerformance in the Multicore Era
Performance in the Multicore Era Gustavo Alonso Systems Group -- ETH Zurich, Switzerland Systems Group Enterprise Computing Center Performance in the multicore era 2 BACKGROUND - SWISSBOX SwissBox: An
More informationPS2 out today. Lab 2 out today. Lab 1 due today - how was it?
6.830 Lecture 7 9/25/2017 PS2 out today. Lab 2 out today. Lab 1 due today - how was it? Project Teams Due Wednesday Those of you who don't have groups -- send us email, or hand in a sheet with just your
More informationBIS Database Management Systems.
BIS 512 - Database Management Systems http://www.mis.boun.edu.tr/durahim/ Ahmet Onur Durahim Learning Objectives Database systems concepts Designing and implementing a database application Life of a Query
More informationMIS Database Systems.
MIS 335 - Database Systems http://www.mis.boun.edu.tr/durahim/ Ahmet Onur Durahim Learning Objectives Database systems concepts Designing and implementing a database application Life of a Query in a Database
More informationLecture #14 Optimizer Implementation (Part I)
15-721 ADVANCED DATABASE SYSTEMS Lecture #14 Optimizer Implementation (Part I) Andy Pavlo / Carnegie Mellon University / Spring 2016 @Andy_Pavlo // Carnegie Mellon University // Spring 2017 2 TODAY S AGENDA
More informationclass 11 b-trees prof. Stratos Idreos
class 11 b-trees prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ Midway check-in: Two design docs tmr (Canvas) & tests on Sunday Next weekend: Lab marathon for midway check-in & tests
More informationUpgrading Databases. without losing your data, your performance or your mind. Charity
Upgrading Databases without losing your data, your performance or your mind Charity Majors @mipsytipsy Upgrading Databases without losing your data, your performance or your mind Charity Majors @mipsytipsy
More informationSQL Tuning Reading Recent Data Fast
SQL Tuning Reading Recent Data Fast Dan Tow singingsql.com Introduction Time is the key to SQL tuning, in two respects: Query execution time is the key measure of a tuned query, the only measure that matters
More informationIBM DB2 11 DBA for z/os Certification Review Guide Exam 312
Introduction IBM DB2 11 DBA for z/os Certification Review Guide Exam 312 The purpose of this book is to assist you with preparing for the IBM DB2 11 DBA for z/os exam (Exam 312), one of the two required
More informationI am: Rana Faisal Munir
Self-tuning BI Systems Home University (UPC): Alberto Abelló and Oscar Romero Host University (TUD): Maik Thiele and Wolfgang Lehner I am: Rana Faisal Munir Research Progress Report (RPR) [1 / 44] Introduction
More informationLow Friction Data Warehousing WITH PERSPECTIVE ILM DATA GOVERNOR
Low Friction Data Warehousing WITH PERSPECTIVE ILM DATA GOVERNOR Table of Contents Foreword... 2 New Era of Rapid Data Warehousing... 3 Eliminating Slow Reporting and Analytics Pains... 3 Applying 20 Years
More informationAutomated Physical Designers: What You See is (Not) What You Get
Automated Physical Designers: What You See is (Not) What You Get Renata Borovica Ioannis Alagiannis Anastasia Ailamaki École Polytechnique Fédérale de Lausanne 1015 Lausanne, Switzerland name.surname@epfl.ch
More informationRAID in Practice, Overview of Indexing
RAID in Practice, Overview of Indexing CS634 Lecture 4, Feb 04 2014 Slides based on Database Management Systems 3 rd ed, Ramakrishnan and Gehrke 1 Disks and Files: RAID in practice For a big enterprise
More informationHybrid Storage for Data Warehousing. Colin White, BI Research September 2011 Sponsored by Teradata and NetApp
Hybrid Storage for Data Warehousing Colin White, BI Research September 2011 Sponsored by Teradata and NetApp HYBRID STORAGE FOR DATA WAREHOUSING Ever since the advent of enterprise data warehousing some
More informationCS317 File and Database Systems
CS317 File and Database Systems Lecture 9 Intro to Physical DBMS Design October 22, 2017 Sam Siewert Reminders Assignment #4 Due Friday, Monday Late Assignment #3 Returned Assignment #5, B-Trees and Physical
More informationParallel In-situ Data Processing with Speculative Loading
Parallel In-situ Data Processing with Speculative Loading Yu Cheng, Florin Rusu University of California, Merced Outline Background Scanraw Operator Speculative Loading Evaluation SAM/BAM Format More than
More informationIntroduction to Azure DocumentDB. Jeff Renz, BI Architect RevGen Partners
Introduction to Azure DocumentDB Jeff Renz, BI Architect RevGen Partners Thank You Presenting Sponsors Gain insights through familiar tools while balancing monitoring and managing user created content
More informationOracle Application Express Schema Design Guidelines Presenter: Flavio Casetta, Yocoya.com
Oracle Application Express Schema Design Guidelines Presenter: Flavio Casetta, Yocoya.com about me Flavio Casetta Founder of Yocoya.com Editor of blog OracleQuirks.blogspot.com 25+ years in the IT 10+
More information[MS10987A]: Performance Tuning and Optimizing SQL Databases
[MS10987A]: Performance Tuning and Optimizing SQL Databases Length : 4 Days Audience(s) : IT Professionals Level : 300 Technology : Microsoft SQL Server Delivery Method : Instructor-led (Classroom) Course
More informationTraditional RDBMS Wisdom is All Wrong -- In Three Acts. Michael Stonebraker
Traditional RDBMS Wisdom is All Wrong -- In Three Acts Michael Stonebraker The Stonebraker Says Webinar Series The first three acts: 1. Why main memory is the answer for OLTP Recording available at VoltDB.com
More informationKey to A Successful Exadata POC
BY UMAIR MANSOOB Who Am I Oracle Certified Administrator from Oracle 7 12c Exadata Certified Implementation Specialist since 2011 Oracle Database Performance Tuning Certified Expert Oracle Business Intelligence
More informationCOLUMN DATABASES A NDREW C ROTTY & ALEX G ALAKATOS
COLUMN DATABASES A NDREW C ROTTY & ALEX G ALAKATOS OUTLINE RDBMS SQL Row Store Column Store C-Store Vertica MonetDB Hardware Optimizations FACULTY MEMBER VERSION EXPERIMENT Question: How does time spent
More information