Load Shedding in a Data Stream Manager
|
|
- Eric Marshall
- 6 years ago
- Views:
Transcription
1 Load Shedding in a Data Stream Manager Nesime Tatbul, Uur U Çetintemel, Stan Zdonik Brown University Mitch Cherniack Brandeis University Michael Stonebraker M.I.T.
2 The Overload Problem Push-based data sources High and unpredictable data rates Problem: Load > Capacity during spikes Load Shedding : eliminating excess load by dropping data 2
3 Aurora Data Stream Management System Aurora Query Network input data streams QoS QoS output to applications QoS Stream Query Algebra (SQuAl) Quality of Service Run-time Operation: Operator Scheduling Storage Management Performance Monitoring Load Shedding 3
4 Load Shedding by Inserting Drops QoS QoS two types of drops: Random Drop Drop k % Semantic Drop Filter P(value) 4
5 Quality of Service Loss-tolerance QoS utility Value-based QoS utility % delivery values 5
6 Assumptions Processor is the main scarce resource Operators that produce new attribute values are not considered (e.g., aggregate, map) Load Shedder operates independently from Scheduler 6
7 Problem Statement N: query network I : set of input streams C : processing capacity when Load(N(I)) > C, transform N to N such that Load(N (I)) < C Utility(N(I))-Utility(N (I)) is minimized key questions: when to shed load? where to shed load? how much load to shed? which tuples to drop? 7
8 Talk Outline Introduction Technical Overview Experimental Results Related Work Conclusions and Future Work 8
9 Algorithmic Overview evaluate the Query Network Load IF Load > Capacity insert Drops to the Query Network ELSE IF Load < Capacity AND Drops in the Query Network remove Drops from the Query Network read stats, network modify network System Catalog look up Drop Insertion Plans Queries Statistics 9
10 Load Evaluation Load Coefficients for each input stream R i Cost 1 Sel 1 Cost 2 Sel 2 Cost n Sel n n j 1 L = Sel Cost j= 1 k= 1 i k j Total Load at run time (CPU cycles per tuple) can be pre-computed! m i= 1 L i R i (CPU cycles per time unit) 10
11 Load Shedding Road Map (LSRM) Excess Load Drop Insertion Plan QoS Cursors 10% shed less! cursor 20% shed more! 300% 11
12 Constructing the LSRM 1. identify potential drop locations 2. sort the drop locations 3. apply the drop locations in order: insert one unit of drop if semantic drop, determine the filter predicate create an LSRM entry 12
13 Drop Locations utility utility % delivery 4 utility % delivery 7 8 early drops save more cycles drops before splits cause more utility loss % delivery 13
14 Best Drop Location Goal: maximize cycle gain, minimize utility loss subnetwork R Drop x % L subnetwork Cycle gain: R ( x L D) if x > 0 Gx ( ) = 0 otherwise Loss/Gain Ratio D cycles/tuple Utility loss: U(x) x du ( x) / dx negative slope of U ( x) = dg( x)/ dx R L the smaller, the better! 14
15 From Values to % Delivery when, where, how much? have the same answer for Random and Semantic Load Shedding Trick: translation between QoS functions Assumption: drop the lowest utility values first utility relative frequency utility values values %delivery 15
16 Determining the Filter Predicate utility utility relative frequency %delivery values values drop 20% lowest utility value interval drop values < 25 Filter Predicate: value 25 16
17 Talk Outline Introduction Technical Overview Experimental Results Related Work Conclusions and Future Work 17
18 Setup: Experimental Study streams: uniformly distributed integer values networks: a mix of filters and unions QoS: : value-based QoS,, range utilities chosen from a Zipf distribution Metrics used: percent loss in total average utility on loss-tolerance QoS (Tuple Utility Loss) value-based QoS (Value Utility Loss) 18
19 Random-LS beats Admission Control Input-Uniform Random-LS Load ~20% Load ~300% 19
20 Semantic-LS beats Admission Control Input-Uniform Semantic-LS Load ~20% Load ~300% 20
21 Semantic-LS beats Random-LS r=0.15 r=0.20 r=
22 Performance for Networks with Sharing theta=0.99 theta=0.5 theta=
23 Talk Outline Introduction Technical Overview Experimental Results Related Work Conclusions and Future Work 23
24 Relevance to Existing Work Congestion control in networks Multimedia streaming Approximate query answering Data stream processing sampling, shedding on aggregates STREAM [MW+03, BDM03] approximate join processing [DGR03] adjusting rates to windowed joins [KNV03] 24
25 Talk Outline Introduction Technical Overview Experimental Results Related Work Conclusions and Future Work 25
26 Highlights An end-to to-end solution for detecting and resolving overload in data stream systems Utility loss minimization for networks of queries which share resources and processing semantic utility as well as quantity-based utility Static drop insertion plans, dynamic instantiation 26
27 Future Directions Handling complex operators joins and aggregates Other resource limitations memory - windowed operators bandwidth - Aurora* power - at sensor level 27
28 More Information Aurora Web Page: Demo 28
Systems Infrastructure for Data Science. Web Science Group Uni Freiburg WS 2012/13
Systems Infrastructure for Data Science Web Science Group Uni Freiburg WS 2012/13 Data Stream Processing Topics Model Issues System Issues Distributed Processing Web-Scale Streaming 3 System Issues Architecture
More informationStaying FIT: Efficient Load Shedding Techniques for Distributed Stream Processing
Staying FIT: Efficient Load Shedding Techniques for Distributed Stream Processing Nesime Tatbul Uğur Çetintemel Stan Zdonik Talk Outline Problem Introduction Approach Overview Advance Planning with an
More informationBig Data. Donald Kossmann & Nesime Tatbul Systems Group ETH Zurich
Big Data Donald Kossmann & Nesime Tatbul Systems Group ETH Zurich Data Stream Processing Overview Introduction Models and languages Query processing systems Distributed stream processing Real-time analytics
More informationStaying FIT: Efficient Load Shedding Techniques for Distributed Stream Processing
Staying FIT: Efficient Load Shedding Techniques for Distributed Stream Processing Nesime Tatbul ETH Zurich tatbul@inf.ethz.ch Uğur Çetintemel Brown University ugur@cs.brown.edu Stan Zdonik Brown University
More informationAurora: a new model and architecture for data stream management
The VLDB Journal (2003) / Digital Object Identifier (DOI) 10.1007/s00778-003-0095-z Aurora: a new model and architecture for data stream management Daniel J. Abadi 1, Don Carney 2, Uǧur Çetintemel 2, Mitch
More informationAurora: a new model and architecture for data stream management
The VLDB Journal (2003) 12: 120 139 / Digital Object Identifier (DOI) 10.1007/s00778-003-0095-z Aurora: a new model and architecture for data stream management Daniel J. Abadi 1, Don Carney 2,Uǧur Çetintemel
More informationUDP Packet Monitoring with Stanford Data Stream Manager
UDP Packet Monitoring with Stanford Data Stream Manager Nadeem Akhtar #1, Faridul Haque Siddiqui #2 # Department of Computer Engineering, Aligarh Muslim University Aligarh, India 1 nadeemalakhtar@gmail.com
More informationThe Aurora and Borealis Stream Processing Engines
The Aurora and Borealis Stream Processing Engines Uğur Çetintemel 1, Daniel Abadi 2, Yanif Ahmad 1, Hari Balakrishnan 2, Magdalena Balazinska 2, Mitch Cherniack 3, Jeong-Hyon Hwang 1, Wolfgang Lindner
More informationDealing with Overload in Distributed Stream Processing Systems
Dealing with Overload in Distributed Stream Processing Systems Nesime Tatbul Stan Zdonik Brown University, Department of Computer Science, Providence, RI USA E-mail: {tatbul, sbz}@cs.brown.edu Abstract
More informationSystems Infrastructure for Data Science. Web Science Group Uni Freiburg WS 2012/13
Systems Infrastructure for Data Science Web Science Group Uni Freiburg WS 2012/13 Data Stream Processing Topics Model Issues System Issues Distributed Processing Web-Scale Streaming 3 Data Streams Continuous
More informationMonitoring Streams A New Class of Data Management Applications
Monitoring Streams A New Class of Data Management Applications Don Carney dpc@csbrownedu Uğur Çetintemel ugur@csbrownedu Mitch Cherniack Brandeis University mfc@csbrandeisedu Christian Convey cjc@csbrownedu
More informationLoad Shedding for Aggregation Queries over Data Streams
Load Shedding for Aggregation Queries over Data Streams Brian Babcock Mayur Datar Rajeev Motwani Department of Computer Science Stanford University, Stanford, CA 94305 {babcock, datar, rajeev}@cs.stanford.edu
More informationDSMS Benchmarking. Morten Lindeberg University of Oslo
DSMS Benchmarking Morten Lindeberg University of Oslo Agenda Introduction DSMS Recap General Requirements Metrics Example: Linear Road Example: StreamBench 30. Sep. 2009 INF5100 - Morten Lindeberg 2 Introduction
More informationOutline. Introduction. Introduction Definition -- Selection and Join Semantics The Cost Model Load Shedding Experiments Conclusion Discussion Points
Static Optimization of Conjunctive Queries with Sliding Windows Over Infinite Streams Ahmed M. Ayad and Jeffrey F.Naughton Presenter: Maryam Karimzadehgan mkarimzadehgan@cs.uwaterloo.ca Outline Introduction
More informationRetrospective on Aurora
The VLDB Journal manuscript No. (will be inserted by the editor) Retrospective on Aurora Hari Balakrishnan 3, Magdalena Balazinska 3, Don Carney 2, Uğur Çetintemel 2, Mitch Cherniack 1, Christian Convey
More informationDesign Issues for Second Generation Stream Processing Engines
Design Issues for Second Generation Stream Processing Engines Daniel J. Abadi, Yanif Ahmad, Magdalena Balazinska, Uğur Çetintemel, Mitch Cherniack, Jeong-Hyon Hwang, Wolfgang Lindner, Anurag S. Maskey,
More informationDATA STREAMS AND DATABASES. CS121: Introduction to Relational Database Systems Fall 2016 Lecture 26
DATA STREAMS AND DATABASES CS121: Introduction to Relational Database Systems Fall 2016 Lecture 26 Static and Dynamic Data Sets 2 So far, have discussed relatively static databases Data may change slowly
More informationAbstract of Load Shedding Techniques for Data Stream Management Systems by Emine Nesime Tatbul, Ph.D., Brown University, May 2007.
Abstract of Load Shedding Techniques for Data Stream Management Systems by Emine Nesime Tatbul, Ph.D., Brown University, May 2007. In recent years, we have witnessed the emergence of a new class of applications
More informationAbstract of Load Management Techniques for Distributed Stream Processing by Ying Xing, Ph.D., Brown University, May 2006.
Abstract of Load Management Techniques for Distributed Stream Processing by Ying Xing, Ph.D., Brown University, May 2006. Distributed and parallel computing environments are becoming inexpensive and commonplace.
More informationStreaming Data Integration: Challenges and Opportunities. Nesime Tatbul
Streaming Data Integration: Challenges and Opportunities Nesime Tatbul Talk Outline Integrated data stream processing An example project: MaxStream Architecture Query model Conclusions ICDE NTII Workshop,
More informationutility utility utility message % output value delay
QoS-Driven Load Shedding on Data Streams? Nesime Tatbul Brown University Department of Computer Science Providence, RI 02912, USA tatbul@cs.brown.edu Abstract. In this thesis, we are working on the optimized
More informationRetrospective on Aurora
The VLDB Journal (2004) 13: 370 383 / Digital Object Identifier (DOI) 10.1007/s00778-004-0133-5 Retrospective on Aurora Hari Balakrishnan 3, Magdalena Balazinska 3, Don Carney 2,Uğur Çetintemel 2, Mitch
More informationData Streams. Building a Data Stream Management System. DBMS versus DSMS. The (Simplified) Big Picture. (Simplified) Network Monitoring
Building a Data Stream Management System Prof. Jennifer Widom Joint project with Prof. Rajeev Motwani and a team of graduate students http://www-db.stanford.edu/stream stanfordstreamdatamanager Data Streams
More informationRUNTIME OPTIMIZATION AND LOAD SHEDDING IN MAVSTREAM: DESIGN AND IMPLEMENTATION BALAKUMAR K. KENDAI. Presented to the Faculty of the Graduate School of
RUNTIME OPTIMIZATION AND LOAD SHEDDING IN MAVSTREAM: DESIGN AND IMPLEMENTATION by BALAKUMAR K. KENDAI Presented to the Faculty of the Graduate School of The University of Texas at Arlington in Partial
More informationclass 17 updates prof. Stratos Idreos
class 17 updates prof. Stratos Idreos HTTP://DASLAB.SEAS.HARVARD.EDU/CLASSES/CS165/ early/late tuple reconstruction, tuple-at-a-time, vectorized or bulk processing, intermediates format, pushing selects
More informationSimulation of Quality of Service (QoS) Graph-Based Load-Shedding Algorithm in. Aurora System Using CSIM. Xiaolan Hu.
Simulation of Quality of Service (QoS) Graph-Based Load-Shedding Algorithm in Aurora System Using CSM Xiaolan Hu Department of Computer Science Brown University Submitted in Partial Fulfillment of the
More informationQuery Processing. Introduction to Databases CompSci 316 Fall 2017
Query Processing Introduction to Databases CompSci 316 Fall 2017 2 Announcements (Tue., Nov. 14) Homework #3 sample solution posted in Sakai Homework #4 assigned today; due on 12/05 Project milestone #2
More informationEvaluation of Relational Operations: Other Techniques. Chapter 14 Sayyed Nezhadi
Evaluation of Relational Operations: Other Techniques Chapter 14 Sayyed Nezhadi Schema for Examples Sailors (sid: integer, sname: string, rating: integer, age: real) Reserves (sid: integer, bid: integer,
More informationStorage-centric load management for data streams with update semantics
Research Collection Report Storage-centric load management for data streams with update semantics Author(s): Moga, Alexandru; Botan, Irina; Tatbul, Nesime Publication Date: 2009-03 Permanent Link: https://doi.org/10.3929/ethz-a-006733707
More informationOperator Scheduling in a Data Stream Manager
Operator Scheduling in a Data Stream Manager Don Carney, Uğur Çetintemel, Alex Rasin, Stan Zdonik Mitch Cherniack, Mike Stonebraker ± Department of Computer Science, Brown University Department of Computer
More informationC-Store: A column-oriented DBMS
Presented by: Manoj Karthick Selva Kumar C-Store: A column-oriented DBMS MIT CSAIL, Brandeis University, UMass Boston, Brown University Proceedings of the 31 st VLDB Conference, Trondheim, Norway 2005
More informationStreamGlobe Adaptive Query Processing and Optimization in Streaming P2P Environments
StreamGlobe Adaptive Query Processing and Optimization in Streaming P2P Environments A. Kemper, R. Kuntschke, and B. Stegmaier TU München Fakultät für Informatik Lehrstuhl III: Datenbanksysteme http://www-db.in.tum.de/research/projects/streamglobe
More informationQuery Processing over Data Streams. Formula for a Database Research Project. Following the Formula
Query Processing over Data Streams Joint project with Prof. Rajeev Motwani and a group of graduate students stanfordstreamdatamanager Formula for a Database Research Project Pick a simple but fundamental
More informationA Cost Model for Adaptive Resource Management in Data Stream Systems
A Cost Model for Adaptive Resource Management in Data Stream Systems Michael Cammert, Jürgen Krämer, Bernhard Seeger, Sonny Vaupel Department of Mathematics and Computer Science University of Marburg 35032
More informationTAG: A TINY AGGREGATION SERVICE FOR AD-HOC SENSOR NETWORKS
TAG: A TINY AGGREGATION SERVICE FOR AD-HOC SENSOR NETWORKS SAMUEL MADDEN, MICHAEL J. FRANKLIN, JOSEPH HELLERSTEIN, AND WEI HONG Proceedings of the Fifth Symposium on Operating Systems Design and implementation
More informationParser: SQL parse tree
Jinze Liu Parser: SQL parse tree Good old lex & yacc Detect and reject syntax errors Validator: parse tree logical plan Detect and reject semantic errors Nonexistent tables/views/columns? Insufficient
More informationAccommodating Bursts in Distributed Stream Processing Systems
Accommodating Bursts in Distributed Stream Processing Systems Distributed Real-time Systems Lab University of California, Riverside {drougas,vana}@cs.ucr.edu http://www.cs.ucr.edu/~{drougas,vana} Stream
More informationA Cost-Space Approach to Distributed Query Optimization in Stream Based Overlays
A Cost-Space Approach to Distributed Query Optimization in Stream Based Overlays Jeffrey Shneidman, Peter Pietzuch, Matt Welsh, Margo Seltzer, Mema Roussopoulos Systems Research Group Harvard University
More informationLecture 21 11/27/2017 Next Lecture: Quiz review & project meetings Streaming & Apache Kafka
Lecture 21 11/27/2017 Next Lecture: Quiz review & project meetings Streaming & Apache Kafka What problem does Kafka solve? Provides a way to deliver updates about changes in state from one service to another
More information1. General. 2. Stream. 3. Aurora. 4. Conclusion
1. General 2. Stream 3. Aurora 4. Conclusion 1. Motivation Applications 2. Definition of Data Streams 3. Data Base Management System (DBMS) vs. Data Stream Management System(DSMS) 4. Stream Projects interpreting
More informationProviding Resiliency to Load Variations in Distributed Stream Processing
Providing Resiliency to Load Variations in Distributed Stream Processing Ying Xing, Jeong-Hyon Hwang, Uğur Çetintemel, and Stan Zdonik Department of Computer Science, Brown University {yx, jhhwang, ugur,
More informationA Comparison of Stream-Oriented High-Availability Algorithms
A Comparison of Stream-Oriented High-Availability Algorithms Jeong-Hyon Hwang Magdalena Balazinska Alexander Rasin Brown University MIT Brown University jhhwang@cs.brown.edu mbalazin@lcs.mit.edu alexr@cs.brown.edu
More informationQuerying Data with Transact SQL
Course 20761A: Querying Data with Transact SQL Course details Course Outline Module 1: Introduction to Microsoft SQL Server 2016 This module introduces SQL Server, the versions of SQL Server, including
More informationLoad Shedding for Aggregation Queries over Data Streams
Load Shedding for Aggregation Queries over Data Streams Brian Babcock Mayur Datar Rajeev Motwani Department of Computer Science Stanford University, Stanford, CA 94305 {babcock, datar, rajeev}@cs.stanford.edu
More informationCAS CS 460/660 Introduction to Database Systems. Query Evaluation II 1.1
CAS CS 460/660 Introduction to Database Systems Query Evaluation II 1.1 Cost-based Query Sub-System Queries Select * From Blah B Where B.blah = blah Query Parser Query Optimizer Plan Generator Plan Cost
More informationNetwork Awareness in Internet-Scale Stream Processing
Network Awareness in Internet-Scale Stream Processing Yanif Ahmad, Uğur Çetintemel John Jannotti, Alexander Zgolinski, Stan Zdonik {yna,ugur,jj,amz,sbz}@cs.brown.edu Dept. of Computer Science, Brown University,
More informationBrown University Providence, RI 02912
AD Award Number: DAMD17-02-2-0048 TITLE: Monitoring and Mining Data Streams PRINCIPAL INVESTIGATOR: Stan B. Zdonik, Ph.D. CONTRACTING ORGANIZATION: Brown University Providence, RI 02912 REPORT DATE: October
More information7. Query Processing and Optimization
7. Query Processing and Optimization Processing a Query 103 Indexing for Performance Simple (individual) index B + -tree index Matching index scan vs nonmatching index scan Unique index one entry and one
More informationChallenges in Data Stream Processing
Università degli Studi di Roma Tor Vergata Dipartimento di Ingegneria Civile e Ingegneria Informatica Challenges in Data Stream Processing Corso di Sistemi e Architetture per Big Data A.A. 2016/17 Valeria
More informationProfile-Driven Data Management
Profile-Driven Data Management Mitch Cherniack Department of Computer Science Brandeis University mfc@cs.brandeis.edu Joint work: Mike Franklin (Berkeley), Stan Zdonik (Brown) The State of the Web Large
More informationEvaluation of Relational Operations
Evaluation of Relational Operations Chapter 12, Part A Database Management Systems, R. Ramakrishnan and J. Gehrke 1 Relational Operations We will consider how to implement: Selection ( ) Selects a subset
More informationTuple Routing Strategies for Distributed Eddies
Tuple Routing Strategies for Distributed Eddies Feng Tian David J. DeWitt Department of Computer Sciences University of Wisconsin, Madison Madison, WI, 53706 {ftian, dewitt}@cs.wisc.edu Abstract Many applications
More informationC-STORE: A COLUMN- ORIENTED DBMS
C-STORE: A COLUMN- ORIENTED DBMS MIT CSAIL, Brandeis University, UMass Boston And Brown University Proceedings Of The 31st VLDB Conference, Trondheim, Norway, 2005 Presented By: Udit Panchal Timeline of
More information[AVNICF-MCSASQL2012]: NICF - Microsoft Certified Solutions Associate (MCSA): SQL Server 2012
[AVNICF-MCSASQL2012]: NICF - Microsoft Certified Solutions Associate (MCSA): SQL Server 2012 Length Delivery Method : 5 Days : Instructor-led (Classroom) Course Overview Participants will learn technical
More informationNew Data Types and Operations to Support. Geo-stream
New Data Types and Operations to Support s Yan Huang Chengyang Zhang Department of Computer Science and Engineering University of North Texas GIScience 2008 Outline Motivation Introduction to Motivation
More informationLOAD SHEDDING IN DATA STREAM SYSTEMS
Chapter 7 LOAD SHEDDING IN DATA STREAM SYSTEMS Brian Babcock Department of Computer Science Stanford University Mayur Datar Google, Inc. datar@cs.stanford.edu Rajeev Motwani Department of Computer Science
More informationData Replication in the Quality Space
Data Replication in the Quality Space Yicheng Tu April 1, 2008 At Commnet USF Roadmap Introduction Static data replication Dynamic data replication Experimental (simulation) results Summary Data Replication
More informationCh 5 : Query Processing & Optimization
Ch 5 : Query Processing & Optimization Basic Steps in Query Processing 1. Parsing and translation 2. Optimization 3. Evaluation Basic Steps in Query Processing (Cont.) Parsing and translation translate
More informationCOMP 249 Advanced Distributed Systems Multimedia Networking. Performance of Multimedia Delivery on the Internet Today
COMP 249 Advanced Distributed Systems Multimedia Networking Performance of Multimedia Delivery on the Internet Today Kevin Jeffay Department of Computer Science University of North Carolina at Chapel Hill
More informationDESIGN AND IMPLEMENTATION OF SCHEDULING STRATEGIES AND THEIR EVALUAION IN MAVSTREAM. Sharma Chakravarthy Supervising Professor.
DESIGN AND IMPLEMENTATION OF SCHEDULING STRATEGIES AND THEIR EVALUAION IN MAVSTREAM The members of the Committee approve the master s thesis of Vamshi Krishna Pajjuri Sharma Chakravarthy Supervising Professor
More informationQuery optimization. Elena Baralis, Silvia Chiusano Politecnico di Torino. DBMS Architecture D B M G. Database Management Systems. Pag.
Database Management Systems DBMS Architecture SQL INSTRUCTION OPTIMIZER MANAGEMENT OF ACCESS METHODS CONCURRENCY CONTROL BUFFER MANAGER RELIABILITY MANAGEMENT Index Files Data Files System Catalog DATABASE
More informationQuery Optimization. Introduction to Databases CompSci 316 Fall 2018
Query Optimization Introduction to Databases CompSci 316 Fall 2018 2 Announcements (Tue., Nov. 20) Homework #4 due next in 2½ weeks No class this Thu. (Thanksgiving break) No weekly progress update due
More informationDatabase Applications (15-415)
Database Applications (15-415) DBMS Internals- Part VI Lecture 17, March 24, 2015 Mohammad Hammoud Today Last Two Sessions: DBMS Internals- Part V External Sorting How to Start a Company in Five (maybe
More informationTuning SQL without the Tuning Pack. John Larkin JP Morgan Chase
Tuning SQL without the Tuning Pack John Larkin JP Morgan Chase Who am I Originally a mainframe COBOL programmer DBA for the last 23 years, the last 15 with Oracle. UNIX (Solaris, Aix, Windows, Linux) Recently
More informationR & G Chapter 13. Implementation of single Relational Operations Choices depend on indexes, memory, stats, Joins Blocked nested loops:
Relational Query Optimization R & G Chapter 13 Review Implementation of single Relational Operations Choices depend on indexes, memory, stats, Joins Blocked nested loops: simple, exploits extra memory
More informationRethinking the Design of Distributed Stream Processing Systems
Rethinking the Design of Distributed Stream Processing Systems Yongluan Zhou Karl Aberer Ali Salehi Kian-Lee Tan EPFL, Switzerland National University of Singapore Abstract In this paper, we present a
More informationLoad Shedding. Recommended Reading
Comp. by: RPushparaj Stage: Revises1 ChapterID: 000000A808 Date:11/6/09 Time:18:23:58 Load Shedding L 45 Routing and Maintenance Structure Recommended Reading 1. Aberer K., Datta A., Hauswirth M., and
More informationSchema for Examples. Query Optimization. Alternative Plans 1 (No Indexes) Motivating Example. Alternative Plans 2 With Indexes
Schema for Examples Query Optimization (sid: integer, : string, rating: integer, age: real) (sid: integer, bid: integer, day: dates, rname: string) Similar to old schema; rname added for variations. :
More informationCopyright 2016 Ramez Elmasri and Shamkant B. Navathe
CHAPTER 19 Query Optimization Introduction Query optimization Conducted by a query optimizer in a DBMS Goal: select best available strategy for executing query Based on information available Most RDBMSs
More informationImplementation of Relational Operations. Introduction. CS 186, Fall 2002, Lecture 19 R&G - Chapter 12
Implementation of Relational Operations CS 186, Fall 2002, Lecture 19 R&G - Chapter 12 First comes thought; then organization of that thought, into ideas and plans; then transformation of those plans into
More informationQuery Optimization. Schema for Examples. Motivating Example. Similar to old schema; rname added for variations. Reserves: Sailors:
Query Optimization Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1 Schema for Examples (sid: integer, sname: string, rating: integer, age: real) (sid: integer, bid: integer, day: dates,
More informationAccelerating Analytical Workloads
Accelerating Analytical Workloads Thomas Neumann Technische Universität München April 15, 2014 Scale Out in Big Data Analytics Big Data usually means data is distributed Scale out to process very large
More informationIncremental Evaluation of Sliding-Window Queries over Data Streams
Incremental Evaluation of Sliding-Window Queries over Data Streams Thanaa M. Ghanem 1 Moustafa A. Hammad 2 Mohamed F. Mokbel 3 Walid G. Aref 1 Ahmed K. Elmagarmid 1 1 Department of Computer Science, Purdue
More informationParallel SQL and Streaming Expressions in Apache Solr 6. Shalin Shekhar Lucidworks Inc.
Parallel SQL and Streaming Expressions in Apache Solr 6 Shalin Shekhar Mangar @shalinmangar Lucidworks Inc. Introduction Shalin Shekhar Mangar Lucene/Solr Committer PMC Member Senior Solr Consultant with
More informationScheduling Strategies for Processing Continuous Queries Over Streams
Department of Computer Science and Engineering University of Texas at Arlington Arlington, TX 76019 Scheduling Strategies for Processing Continuous Queries Over Streams Qingchun Jiang, Sharma Chakravarthy
More informationCHAPTER 3 EFFECTIVE ADMISSION CONTROL MECHANISM IN WIRELESS MESH NETWORKS
28 CHAPTER 3 EFFECTIVE ADMISSION CONTROL MECHANISM IN WIRELESS MESH NETWORKS Introduction Measurement-based scheme, that constantly monitors the network, will incorporate the current network state in the
More informationOne-Pass Streaming Algorithms
One-Pass Streaming Algorithms Theory and Practice Complaints and Grievances about theory in practice Disclaimer Experiences with Gigascope. A practitioner s perspective. Will be using my own implementations,
More informationRASC: Dynamic Rate Allocation for Distributed Stream Processing Applications
RASC: Dynamic Rate Allocation for Distributed Stream Processing Applications Yannis Drougas, Vana Kalogeraki Department of Computer Science and Engineering University of California-Riverside, Riverside,
More informationQoS Management of Real-Time Data Stream Queries in Distributed. environment.
QoS Management of Real-Time Data Stream Queries in Distributed Environments Yuan Wei, Vibha Prasad, Sang H. Son Department of Computer Science University of Virginia, Charlottesville 22904 {yw3f, vibha,
More informationDifferentiated Services
1 Differentiated Services QoS Problem Diffserv Architecture Per hop behaviors 2 Problem: QoS Need a mechanism for QoS in the Internet Issues to be resolved: Indication of desired service Definition of
More informationIntroduction to Data Management CSE 344. Lectures 8: Relational Algebra
Introduction to Data Management CSE 344 Lectures 8: Relational Algebra CSE 344 - Winter 2016 1 Announcements Homework 3 is posted Microsoft Azure Cloud services! Use the promotion code you received Due
More informationThe Raindrop Engine: Continuous Query Processing at WPI
The Raindrop Engine: Continuous Query Processing at WPI Elke A. Rundensteiner Database Systems Research Lab, WPI 2003 This talk is based on joint work with students at DSRG lab. Invited Talk at Database
More informationOutline. q Database integration & querying. q Peer-to-Peer data management q Stream data management q MapReduce-based distributed data management
Outline n Introduction & architectural issues n Data distribution n Distributed query processing n Distributed query optimization n Distributed transactions & concurrency control n Distributed reliability
More informationStorm. Distributed and fault-tolerant realtime computation. Nathan Marz Twitter
Storm Distributed and fault-tolerant realtime computation Nathan Marz Twitter Storm at Twitter Twitter Web Analytics Before Storm Queues Workers Example (simplified) Example Workers schemify tweets and
More informationImplementation of Relational Operations
Implementation of Relational Operations Module 4, Lecture 1 Database Management Systems, R. Ramakrishnan 1 Relational Operations We will consider how to implement: Selection ( ) Selects a subset of rows
More informationCSE 544 Principles of Database Management Systems. Magdalena Balazinska Fall 2007 Lecture 7 - Query execution
CSE 544 Principles of Database Management Systems Magdalena Balazinska Fall 2007 Lecture 7 - Query execution References Generalized Search Trees for Database Systems. J. M. Hellerstein, J. F. Naughton
More informationQuerying Sliding Windows over On-Line Data Streams
Querying Sliding Windows over On-Line Data Streams Lukasz Golab School of Computer Science, University of Waterloo Waterloo, Ontario, Canada N2L 3G1 lgolab@uwaterloo.ca Abstract. A data stream is a real-time,
More informationEECS 647: Introduction to Database Systems
EECS 647: Introduction to Database Systems Instructor: Luke Huan Spring 2009 External Sorting Today s Topic Implementing the join operation 4/8/2009 Luke Huan Univ. of Kansas 2 Review DBMS Architecture
More informationLoad Shedding for Window Joins on Multiple Data Streams
Load Shedding for Window Joins on Multiple Data Streams Yan-Nei Law Bioinformatics Institute 3 Biopolis Street, Singapore 138671 lawyn@bii.a-star.edu.sg Carlo Zaniolo Computer Science Dept., UCLA Los Angeles,
More informationStreaming Live Media over a Peer-to-Peer Network
Streaming Live Media over a Peer-to-Peer Network Technical Report Stanford University Deshpande, Hrishikesh Bawa, Mayank Garcia-Molina, Hector Presenter: Kang, Feng Outline Problem in media streaming and
More informationApplication of wavelet filtering to image compression
Application of wavelet filtering to image compression LL3 HL3 LH3 HH3 LH2 HL2 HH2 HL1 LH1 HH1 Fig. 9.1 Wavelet decomposition of image. Application to image compression Application to image compression
More informationRelational Query Optimization
Relational Query Optimization Module 4, Lectures 3 and 4 Database Management Systems, R. Ramakrishnan 1 Overview of Query Optimization Plan: Tree of R.A. ops, with choice of alg for each op. Each operator
More informationEvaluation of Relational Operations. Relational Operations
Evaluation of Relational Operations Chapter 14, Part A (Joins) Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1 Relational Operations v We will consider how to implement: Selection ( )
More informationOperator Implementation Wrap-Up Query Optimization
Operator Implementation Wrap-Up Query Optimization 1 Last time: Nested loop join algorithms: TNLJ PNLJ BNLJ INLJ Sort Merge Join Hash Join 2 General Join Conditions Equalities over several attributes (e.g.,
More informationReview. Relational Query Optimization. Query Optimization Overview (cont) Query Optimization Overview. Cost-based Query Sub-System
Review Relational Query Optimization R & G Chapter 12/15 Implementation of single Relational Operations Choices depend on indexes, memory, stats, Joins Blocked nested loops: simple, exploits extra memory
More informationCHAPTER 5 ANT-FUZZY META HEURISTIC GENETIC SENSOR NETWORK SYSTEM FOR MULTI - SINK AGGREGATED DATA TRANSMISSION
CHAPTER 5 ANT-FUZZY META HEURISTIC GENETIC SENSOR NETWORK SYSTEM FOR MULTI - SINK AGGREGATED DATA TRANSMISSION 5.1 INTRODUCTION Generally, deployment of Wireless Sensor Network (WSN) is based on a many
More informationRelational Query Optimization. Highlights of System R Optimizer
Relational Query Optimization Chapter 15 Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1 Highlights of System R Optimizer v Impact: Most widely used currently; works well for < 10 joins.
More informationAnalytical and Experimental Evaluation of Stream-Based Join
Analytical and Experimental Evaluation of Stream-Based Join Henry Kostowski Department of Computer Science, University of Massachusetts - Lowell Lowell, MA 01854 Email: hkostows@cs.uml.edu Kajal T. Claypool
More informationRouting in a Delay Tolerant Network
Routing in a Delay Tolerant Network Vladislav Marinov Jacobs University Bremen March 31st, 2008 Vladislav Marinov Routing in a Delay Tolerant Network 1 Internet for a Remote Village Dial-up connection
More informationSemantic Multicast for Content-based Stream Dissemination
Semantic Multicast for Content-based Stream Dissemination Olga Papaemmanouil Brown University Uğur Çetintemel Brown University Stream Dissemination Applications Clients Data Providers Push-based applications
More information