Preference Queries over Sets
|
|
- Allen Hutchinson
- 5 years ago
- Views:
Transcription
1 Preference Queries over Sets Xi Zhang Jan Chomicki SUNY at Buffalo April 15, 2011 Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
2 Tuple Preferences vs Set Preferences Tuple Preferences Well known preferences: top-k, skyline etc. Set Preferences Preferences between sets of tuples. Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
3 Motivating Example Alice is buying 3 books as gifts. Title Genre Rating Price Vendor a 1 sci-fi 5.0 $15.00 Amazon a 2 biography 4.8 $20.00 B&N a 3 sci-fi 4.5 $25.00 Amazon a 4 romance 4.4 $10.00 B&N a 5 sci-fi 4.3 $15.00 Amazon a 6 romance 4.2 $12.00 B&N a 7 biography 4.0 $18.00 Amazon a 8 sci-fi 3.5 $18.00 Amazon Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
4 Motivating Example Alice is buying 3 books as gifts. Title Genre Rating Price Vendor a 1 sci-fi 5.0 $15.00 Amazon a 2 biography 4.8 $20.00 B&N a 3 sci-fi 4.5 $25.00 Amazon a 4 romance 4.4 $10.00 B&N a 5 sci-fi 4.3 $15.00 Amazon a 6 romance 4.2 $12.00 B&N a 7 biography 4.0 $18.00 Amazon a 8 sci-fi 3.5 $18.00 Amazon She has the following wishes... (C1) Spend as little money as possible. (C2) Get one sci-fi book. (C3) Prioritize (C2) over (C1) Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
5 Motivating Example Alice is buying 3 books as gifts. Title Genre Rating Price Vendor a 1 sci-fi 5.0 $15.00 Amazon a 2 biography 4.8 $20.00 B&N a 3 sci-fi 4.5 $25.00 Amazon a 4 romance 4.4 $10.00 B&N a 5 sci-fi 4.3 $15.00 Amazon a 6 romance 4.2 $12.00 B&N a 7 biography 4.0 $18.00 Amazon a 8 sci-fi 3.5 $18.00 Amazon She has the following wishes... (C1) Spend as little money as possible. (C2) Get one sci-fi book. (C3) Prioritize (C2) over (C1) the cheapest 3 books Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
6 Tuple Preference - Winnow (Chomicki [Cho03]) Tuple Preference: t 1 is preferred to t 2 t 1 C t 2 ô t 1.rating sci-fi ^ t 1.price t 2.price Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
7 Tuple Preference - Winnow (Chomicki [Cho03]) Tuple Preference: t 1 is preferred to t 2 t 1 C t 2 ô t 1.rating sci-fi ^ t 1.price t 2.price Winnow Operator ω C : return all tuples in a database relation r for which there is no preferred tuple in r ω C prq tt P r Dt 1 P r.t 1 C tu Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
8 Tuple Preference - Winnow (Chomicki [Cho03]) Tuple Preference: t 1 is preferred to t 2 t 1 C t 2 ô t 1.rating sci-fi ^ t 1.price t 2.price Winnow Operator ω C : return all tuples in a database relation r for which there is no preferred tuple in r ω C prq tt P r Dt 1 P r.t 1 C tu Set Preference - tuple set tt 1, t 2,..., t k u is preferred to tuple set tt 1 1, t1 2,..., t1 k u Fixed cardinality (k) assumption Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
9 Profile-based Set Preference k-subsets: subsets of relation r, with fixed cardinality k Set Pref. Quantities of Interest Desired Value or Order (C1) total cost (C2) # of sci-fi books 1 (C3) total cost, # of sci-fi books (C2) (C1) Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
10 Profile-based Set Preference k-subsets: subsets of relation r, with fixed cardinality k Set Pref. Quantities of Interest Desired Value or Order (C1) total cost (C2) # of sci-fi books 1 (C3) total cost, # of sci-fi books (C2) (C1) features F 1, F 2,..., F m Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
11 Profile-based Set Preference k-subsets: subsets of relation r, with fixed cardinality k Set Pref. Quantities of Interest Desired Value or Order (C1) total cost (C2) # of sci-fi books 1 (C3) total cost, # of sci-fi books (C2) (C1) features F 1, F 2,..., F m preferences over profiles profile xf 1, F 2,..., F m y Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
12 Features and Set Preferences Features: mini-sql Queries F 1 SELECT sum(price) FROM $S F 2 SELECT count(title) FROM $S WHERE genre= sci-fi Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
13 Features and Set Preferences Features: mini-sql Queries F 1 SELECT sum(price) FROM $S F 2 SELECT count(title) FROM $S WHERE genre= sci-fi Set Preferences s 1 Ï C1 s 2 ô F 1ps 1q F 1ps 2q. s 1 Ï C2 s 2 ô F 2ps 1q 1 ^ F 2ps 2q 1. s 1 Ï C3 s 2 ô pf 2ps 1q 1 ^ F 2ps 2q 1q _pf 2ps 1q 1 ^ F 2ps 2q 1 ^ F 1ps 1q _pf 2ps 1q 1 ^ F 2ps 2q 1 ^ F 1ps 1q F 1ps 2qq F 1ps 2qq. Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
14 Additive Features Assume additive features in this talk. Efficient optimizations exist for additive features. Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
15 Additive Features Assume additive features in this talk. Efficient optimizations exist for additive features. Additive Feature A feature F is additive iff for every subset s of relation r, and every t P r s Fps Y ttuq Fpsq fptq Fpttuq fptq Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
16 Computing the Best Sets Naive Algorithm Generate all k-subsets of relation r and compute their profiles. Run the winnow operator over all the profiles and get the best profiles. Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
17 Computing the Best Sets Naive Algorithm Generate all k-subsets of relation r and compute their profiles. Run the winnow operator over all the profiles and get the best profiles. Too many candidate k-subsets! Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
18 Computing the Best Sets Naive Algorithm Generate all k-subsets of relation r and compute their profiles. Run the winnow operator over all the profiles and get the best profiles. Too many candidate k-subsets! Example k 3, r 1000 ñ candidate subsets Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
19 Superpreference Optimization Goal: Generate as few candidate k-subsets as possible Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
20 Superpreference Optimization Goal: Generate as few candidate k-subsets as possible Superpreference Find a superpreference ( ) over the relation r, such that t 1 t 2 ô s 1 Y tt 1 u Ï C s 1 Y tt 2 u. for every (k-1)-subset s 1 of r containing neither t 1 nor t 2. Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
21 Superpreference Optimization Goal: Generate as few candidate k-subsets as possible Superpreference Find a superpreference ( ) over the relation r, such that t 1 t 2 ô s 1 Y tt 1 u Ï C s 1 Y tt 2 u. for every (k-1)-subset s 1 of r containing neither t 1 nor t 2. Pruning Condition Let coverptq tt 1 P r t 1 tu, i.e. all tuples preferred to t. Dt P s, coverptq s ñ s is not a best subset. Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
22 Superpreference Optimization Goal: Generate as few candidate k-subsets as possible Superpreference Find a superpreference ( ) over the relation r, such that t 1 t 2 ô s 1 Y tt 1 u Ï C s 1 Y tt 2 u. for every (k-1)-subset s 1 of r containing neither t 1 nor t 2. Pruning Condition Let coverptq tt 1 P r t 1 tu, i.e. all tuples preferred to t. Dt P s, coverptq s ñ s is not a best subset. Systematic Construction Possible if all the features are additive. Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
23 Example - Superpreference Set preference: (C5)X(C6) (C5) Alice wants to spend as little money as possible on sci-fi books. (C6) Alice wants the average rating of books to be as high as possible. Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
24 Example - Superpreference Set preference: (C5)X(C6) (C5) Alice wants to spend as little money as possible on sci-fi books. (C6) Alice wants the average rating of books to be as high as possible. Features: F 5 SELECT sum(price) FROM $S WHERE genre= sci-fi F 6 SELECT avg(rating) FROM $S Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
25 Example - Superpreference Set preference: (C5)X(C6) (C5) Alice wants to spend as little money as possible on sci-fi books. (C6) Alice wants the average rating of books to be as high as possible. Features: F 5 SELECT sum(price) FROM $S WHERE genre= sci-fi F 6 SELECT avg(rating) FROM $S Profile preference: s 1 Ï C s 2 F 5 ps 1 q F 5 ps 2 q ^ F 6 ps 1 q F 6 ps 2 q Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
26 Example - Superpreference Set preference: (C5)X(C6) (C5) Alice wants to spend as little money as possible on sci-fi books. (C6) Alice wants the average rating of books to be as high as possible. Features: F 5 SELECT sum(price) FROM $S WHERE genre= sci-fi F 6 SELECT avg(rating) FROM $S Profile preference: s 1 Ï C s 2 F 5 ps 1 q F 5 ps 2 q ^ F 6 ps 1 q F 6 ps 2 q Superpreference formula (assuming price 0) t 1 t 2 t 1.rating t 2.rating ^ t 2.genre sci-fi ^pt 1.price t 2.price _ t 1.genre sci-fi q. Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
27 M-relation Goal Avoid redundancy in generating profiles Book: Title Genre Rating Price Vendor a 1 sci-fi 5.0 $15.00 Amazon a 2 biography 4.8 $20.00 B&N a 3 sci-fi 4.5 $25.00 Amazon a 4 romance 4.4 $10.00 B&N a 5 sci-fi 4.3 $15.00 Amazon a 6 romance 4.2 $12.00 B&N a 7 biography 4.0 $18.00 Amazon a 8 sci-fi 3.5 $18.00 Amazon a 9 romance 4.0 $20.00 Amazon a 10 history 4.0 $19.00 Amazon Redundancy Example profile Γ pta 1, a 2, a 7 uq = profile Γ pta 1, a 2, a 9 uq = x15.00, 13.8y Exchangeable Tuples a 7, a 9 For any 2-subset s of Bookzta 7, a 9 u Profile Γ tf 5, F 6u F 5 SELECT sum(price) FROM $S WHERE genre= sci-fi F 6 SELECT sum(rating) FROM $S profile Γ ps Y ta 7 uq profile Γ ps Y ta 9 uq Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
28 M-relation Generation Book: Title Genre Rating Price Vendor a 7 biography 4.0 $18.00 Amazon a 8 sci-fi 3.5 $18.00 Amazon a 9 romance 4.0 $20.00 Amazon a 10 history 4.0 $19.00 Amazon M-relation: A 5 A 6 A cnt m 7,9,10 $ m 8 $ Profile Γ tf 5, F 6 u F 5 SELECT sum(price) FROM $S WHERE genre= sci-fi F 6 SELECT sum(rating) FROM $S M-relation Generation SQL SELECT CASE WHEN r.genre= sci-fi THEN r.price ELSE 0 END AS A 5, r.rating AS A 6, count(*) AS A cnt FROM r GROUP BY A 5, A 6 Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
29 Set Preference via M-relation Set preference over the original relations ñ set preference over its M-relation Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
30 Conclusions and Future Work Conclusions A formal logic + SQL framework for specifying restricted set preferences and implementing set preference queries. Effective optimizations yielding improvements of several orders of magnitude. Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
31 Conclusions and Future Work Conclusions A formal logic + SQL framework for specifying restricted set preferences and implementing set preference queries. Effective optimizations yielding improvements of several orders of magnitude. Future Work Query optimization for non-additive features. Set preference elicitation. Embedding best-subset generation in relational query languages. Additional set ranking or browsing techniques for result navigation. Relaxing the fixed cardinality assumption: Superpreference depends on it assumption while M-relation does not. Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
32 Thank you! Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
33 Dataset and Set Preferences Dataset 8000 book quotes from Amazon Schema: xtitle, genre, rating, price, vendory Features F 5 SELECT sum(price) FROM $S WHERE genre= sci-fi F 6 SELECT sum(rating) FROM $S F 7 SELECT sum(rating) FROM $S WHERE genre= sci-fi F 8 SELECT sum(price) FROM $S F 9 SELECT count(title) FROM $S WHERE genre= sci-fi and price < F 10 SELECT sum(rating) FROM $S WHERE rating >= 4.0 Set Preferences Set Pref. Name Profile Schema Γ Profile Pref. Formula C SP1 xf 5, F 6 y F 5 ps 1 q F 5 ps 2 q ^ F 6 ps 1 q F 6 ps 2 q SP2 xf 9, F 10 y F 9 ps 1 q F 9 ps 2 q ^ F 10 ps 1 q F 10 ps 2 q SP3 xf 11, F 12 y F 11 ps 1 q F 11 ps 2 q ^ F 12 ps 1 q F 12 ps 2 q Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
34 Performance of Different Algorithms NAIVE SUPER MREL SM MS 1 Set Pref 1 # of sets in Generation (g) 1e+009 1e+008 1e+007 1e K Input Size (n) Generate Percentage e-005 SUPER MREL SM MS NAIVE SUPER MREL SM MS 1 Set Pref 2 # of sets in Generation (g) 1e+009 1e+008 1e+007 1e K Input Size (n) Generate Percentage e-005 SUPER MREL SM MS Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
35 Performance of Different Algorithms Set Pref 3 Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
36 Related Work Guha et al. [GGK 03] Problem: find an optimal subset of tuples Set property: aggrpaq parameter Set preference: objective function min { max Binshtok et al. [BBS 07] Problem: find a optimal subset of items Set property: predicate Set preference: TCP-net or scoring function Consider subsets of any cardinality, in the fixed-cardinality case, it is subsumed by our framework desjardins and Wagstaff [dw05] Consider fixed-cardinality set preference Consider two set features: diversity and depth Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
37 References I Maxim Binshtok, Ronen I. Brafman, Solomon Eyal Shimony, Ajay Mani, and Craig Boutilier. Computing optimal subsets. In AAAI, pages , Jan Chomicki. Preference formulas in relational queries. ACM Trans. Database Syst., 28(4): , Marie desjardins and Kiri Wagstaff. DD-pref: A language for expressing preferences over sets. In AAAI, pages , Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
38 References II Sudipto Guha, Dimitrios Gunopulos, Nick Koudas, Divesh Srivastava, and Michail Vlachos. Efficient approximation of optimization queries under parametric aggregation constraints. In VLDB, pages , Xi Zhang, Jan Chomicki (SUNY at Buffalo) Preference Queries over Sets April 15, / 21
Semantic Optimization of Preference Queries
Semantic Optimization of Preference Queries Jan Chomicki University at Buffalo http://www.cse.buffalo.edu/ chomicki 1 Querying with Preferences Find the best answers to a query, instead of all the answers.
More informationLearning Optimal Subsets with Implicit User Preferences
Learning Optimal Subsets with Implicit User Preferences Yunsong Guo and Carla Gomes Department of Computer Science Cornell University {guoys, gomes}@cs.cornell.edu Abstract We study the problem of learning
More informationFoundations of Preferences in Database Systems
Foundations of Preferences in Database Systems Werner Kießling Institute of Computer Science University of Augsburg W. Kießling, 28th Intern. Conf. on Very Large Databases (VLDB 2002), Hong Kong, 21.08.02
More informationComputing Optimal Subsets
Computing Optimal Subsets Maxim Binshtok Ben-Gurion University maximbi@cs.bgu.ac.il Ronen I. Brafman Ben-Gurion University brafman@cs.bgu.ac.il Solomon E. Shimony Ben-Gurion University shimony@cs.bgu.ac.il
More informationLogical Foundations of Preference Queries
Logical Foundations of Preference Queries Jan Chomicki University at Buffalo chomicki@buffalo.edu Abstract The notion of preference plays an increasing role in today s information systems. In particular,
More information3. Relational Data Model 3.5 The Tuple Relational Calculus
3. Relational Data Model 3.5 The Tuple Relational Calculus forall quantification Syntax: t R(P(t)) semantics: for all tuples t in relation R, P(t) has to be fulfilled example query: Determine all students
More informationData Integration: Logic Query Languages
Data Integration: Logic Query Languages Jan Chomicki University at Buffalo Datalog Datalog A logic language Datalog programs consist of logical facts and rules Datalog is a subset of Prolog (no data structures)
More informationPrivacy Breaches in Privacy-Preserving Data Mining
1 Privacy Breaches in Privacy-Preserving Data Mining Johannes Gehrke Department of Computer Science Cornell University Joint work with Sasha Evfimievski (Cornell), Ramakrishnan Srikant (IBM), and Rakesh
More informationImplementation of Skyline Sweeping Algorithm
Implementation of Skyline Sweeping Algorithm BETHINEEDI VEERENDRA M.TECH (CSE) K.I.T.S. DIVILI Mail id:veeru506@gmail.com B.VENKATESWARA REDDY Assistant Professor K.I.T.S. DIVILI Mail id: bvr001@gmail.com
More informationRelational Algebra and Calculus
and Calculus Marek Rychly mrychly@strathmore.edu Strathmore University, @ilabafrica & Brno University of Technology, Faculty of Information Technology Advanced Databases and Enterprise Systems 7 September
More informationImproving Privacy And Data Utility For High- Dimensional Data By Using Anonymization Technique
Improving Privacy And Data Utility For High- Dimensional Data By Using Anonymization Technique P.Nithya 1, V.Karpagam 2 PG Scholar, Department of Software Engineering, Sri Ramakrishna Engineering College,
More informationAdvanced Data Management Technologies
ADMT 2017/18 Unit 13 J. Gamper 1/42 Advanced Data Management Technologies Unit 13 DW Pre-aggregation and View Maintenance J. Gamper Free University of Bozen-Bolzano Faculty of Computer Science IDSE Acknowledgements:
More informationTransformation and aggregation preprocessing for top-k recommendation GAP rules induction
Transformation and aggregation preprocessing for top-k recommendation GAP rules induction Marta Vomlelova, Michal Kopecky, Peter Vojtas Faculty of Mathematics and Physics Charles University in Prague Malostranske
More informationDD-PREF: A Language for Expressing Preferences Over Sets
DD-PREF: A Language for Expressing Preferences Over Sets Marie desjardins University of Maryland Baltimore County Computer Science and Electrical Engineering Department Hilltop Circle, Baltimore, MD 22
More information6 February 2014 CSE-3421M Test #1 w/ answers p. 1 of 14. CSE-3421M Test #1. Design
6 February 2014 CSE-3421M Test #1 w/ answers p. 1 of 14 CSE-3421M Test #1 Design Sur / Last Name: Given / First Name: Student ID: Instructor: Parke Godfrey Exam Duration: 75 minutes Term: Winter 2014 Answer
More informationQuery Processing & Optimization
Query Processing & Optimization 1 Roadmap of This Lecture Overview of query processing Measures of Query Cost Selection Operation Sorting Join Operation Other Operations Evaluation of Expressions Introduction
More informationCP nets: representing and reasoning with preferences of multiple agents
CP nets: representing and reasoning with preferences of multiple agents F. Rossi and K. B. Venable Dept. of Mathematics University of Padova Padova Italy frossikvenable @math.unipd.it Abstract We introduce
More informationRelational Databases
Relational Databases Jan Chomicki University at Buffalo Jan Chomicki () Relational databases 1 / 49 Plan of the course 1 Relational databases 2 Relational database design 3 Conceptual database design 4
More informationInconsistency-tolerant logics
Inconsistency-tolerant logics CS 157 Computational Logic Autumn 2010 Inconsistent logical theories T 1 = { p(a), p(a) } T 2 = { x(p(x) q(x)), p(a), q(a) } Definition: A theory T is inconsistent if T has
More informationDatabase Systems CSE 414
Database Systems CSE 414 Lecture 15-16: Basics of Data Storage and Indexes (Ch. 8.3-4, 14.1-1.7, & skim 14.2-3) 1 Announcements Midterm on Monday, November 6th, in class Allow 1 page of notes (both sides,
More informationRELATIVE QUERY RESULTS RANKING FOR ONLINE USERS IN WEB DATABASES
RELATIVE QUERY RESULTS RANKING FOR ONLINE USERS IN WEB DATABASES Pramod Kumar Ghadei Research Scholar, Sathyabama University, Chennai, Tamil Nadu, India pramod-ghadei@yahoo.com Dr. S. Sridhar Research
More informationIterative Modification and Incremental Evaluation of Preference Queries
Iterative Modification and Incremental Evaluation of Preference Queries Jan Chomicki Dept. of Computer Science and Engineering, University at Buffalo, Buffalo, NY 14260-2000 chomicki@cse.buffalo.edu Abstract.
More informationAdvanced Databases. Lecture 4 - Query Optimization. Masood Niazi Torshiz Islamic Azad university- Mashhad Branch
Advanced Databases Lecture 4 - Query Optimization Masood Niazi Torshiz Islamic Azad university- Mashhad Branch www.mniazi.ir Query Optimization Introduction Transformation of Relational Expressions Catalog
More informationChapter 3: Relational Model
Chapter 3: Relational Model Structure of Relational Databases Relational Algebra Tuple Relational Calculus Domain Relational Calculus Extended Relational-Algebra-Operations Modification of the Database
More informationDatabase Systems External Sorting and Query Optimization. A.R. Hurson 323 CS Building
External Sorting and Query Optimization A.R. Hurson 323 CS Building External sorting When data to be sorted cannot fit into available main memory, external sorting algorithm must be applied. Naturally,
More informationGeneric Preferences over Subsets of Structured Objects
Journal of Artificial Intelligence Research 34 (2009) 133-164 Submitted 07/08; published 03/09 Generic Preferences over Subsets of Structured Objects Maxim Binshtok Ronen I. Brafman Computer Science Department
More informationKeyword search in relational databases. By SO Tsz Yan Amanda & HON Ka Lam Ethan
Keyword search in relational databases By SO Tsz Yan Amanda & HON Ka Lam Ethan 1 Introduction Ubiquitous relational databases Need to know SQL and database structure Hard to define an object 2 Query representation
More informationData Mining Lecture 2: Recommender Systems
Data Mining Lecture 2: Recommender Systems Jo Houghton ECS Southampton February 19, 2019 1 / 32 Recommender Systems - Introduction Making recommendations: Big Money 35% of Amazons income from recommendations
More informationCS34800 Information Systems. The Relational Model Prof. Walid Aref 29 August, 2016
CS34800 Information Systems The Relational Model Prof. Walid Aref 29 August, 2016 1 Chapter: The Relational Model Structure of Relational Databases Relational Algebra Tuple Relational Calculus Domain Relational
More informationRecommender Systems - Introduction. Data Mining Lecture 2: Recommender Systems
Recommender Systems - Introduction Making recommendations: Big Money 35% of amazons income from recommendations Netflix recommendation engine worth $ Billion per year And yet, Amazon seems to be able to
More informationArnab Bhattacharya, Shrikant Awate. Dept. of Computer Science and Engineering, Indian Institute of Technology, Kanpur
Arnab Bhattacharya, Shrikant Awate Dept. of Computer Science and Engineering, Indian Institute of Technology, Kanpur Motivation Flight from Kolkata to Los Angeles No direct flight Through Singapore or
More informationANU MLSS 2010: Data Mining. Part 2: Association rule mining
ANU MLSS 2010: Data Mining Part 2: Association rule mining Lecture outline What is association mining? Market basket analysis and association rule examples Basic concepts and formalism Basic rule measurements
More informationAn Incremental Anytime Algorithm for Multi-Objective Query Optimization
An Incremental Anytime Algorithm for Multi-Objective Query Optimization Immanuel Trummer and Christoph Koch {firstname}.{lastname}@epfl.ch École Polytechnique Fédérale de Lausanne ABSTRACT Query plans
More informationRoute Planning for Bicycles Exact Constrained Shortest Paths made Practical via Contraction Hierarchy
Route Planning for Bicycles Exact Constrained Shortest Paths made Practical via Contraction Hierarchy ICAPS 2012 Route Planning for Bicycles - Exact Constrained Shortest Path made Practical via Contraction
More informationmodern database systems lecture 5 : top-k retrieval
modern database systems lecture 5 : top-k retrieval Aristides Gionis Michael Mathioudakis spring 2016 announcements problem session on Monday, March 7, 2-4pm, at T2 solutions of the problems in homework
More informationINF 212 ANALYSIS OF PROG. LANGS SQL AND SPREADSHEETS. Instructors: Crista Lopes Copyright Instructors.
INF 212 ANALYSIS OF PROG. LANGS SQL AND SPREADSHEETS Instructors: Crista Lopes Copyright Instructors. Data-Centric Programming Focus on data Interactive data: SQL Dataflow: Spreadsheets Dataflow: Iterators,
More informationCan We Trust SQL as a Data Analytics Tool?
Can We Trust SQL as a Data nalytics Tool? SQL The query language for relational databases International Standard since 1987 Implemented in all systems (free and commercial) $30B/year business Most common
More informationHow Achaeans Would Construct Columns in Troy. Alekh Jindal, Felix Martin Schuhknecht, Jens Dittrich, Karen Khachatryan, Alexander Bunte
How Achaeans Would Construct Columns in Troy Alekh Jindal, Felix Martin Schuhknecht, Jens Dittrich, Karen Khachatryan, Alexander Bunte Number of Visas Received 1 0,75 0,5 0,25 0 Alekh Jens Health Level
More informationTransactional Information Systems:
Transactional Information Systems: Theory, Algorithms, and the Practice of Concurrency Control and Recovery Gerhard Weikum and Gottfried Vossen Teamwork is essential. It allows you to blame someone else.
More informationThe Multi-Relational Skyline Operator
The Multi-Relational Skyline Operator Wen Jin 1 Martin Ester 1 Zengjian Hu 1 Jiawei Han 2 1 School of Computing Science Simon Fraser University, Canada {wjin,ester,zhu}@cs.sfu.ca 2 Department of Computer
More informationLecture Query evaluation. Combining operators. Logical query optimization. By Marina Barsky Winter 2016, University of Toronto
Lecture 02.03. Query evaluation Combining operators. Logical query optimization By Marina Barsky Winter 2016, University of Toronto Quick recap: Relational Algebra Operators Core operators: Selection σ
More informationQuery Optimization. Chapter 13
Query Optimization Chapter 13 What we want to cover today Overview of query optimization Generating equivalent expressions Cost estimation 432/832 2 Chapter 13 Query Optimization OVERVIEW 432/832 3 Query
More informationSKYLINE QUERIES FOR MULTI-CRITERIA DECISION SUPPORT SYSTEMS SATYAVEER GOUD GUDALA. B.E., Osmania University, 2009 A REPORT
SKYLINE QUERIES FOR MULTI-CRITERIA DECISION SUPPORT SYSTEMS by SATYAVEER GOUD GUDALA B.E., Osmania University, 2009 A REPORT submitted in partial fulfillment of the requirements for the degree MASTER OF
More informationNavigating and Exploring RDF Data using Formal Concept Analysis
Navigating and Exploring RDF Data using Formal Concept Analysis Mehwish Alam and Amedeo Napoli LORIA (CNRS Inria Nancy Grand Est Université de Lorraine) BP 239, 54506 Vandoeuvre-les-Nancy, France {mehwish.alam,amedeo.napoli@loria.fr}
More informationBachelor in Information Technology (BIT) O Term-End Examination
No. of Printed Pages : 6 I CSI-14 I Bachelor in Information Technology (BIT) O Term-End Examination cn Cn1 June, 2010 CD cp CSI-14 : DATA ANALYSIS AND DATABASE DESIGN Time : 3 hours Maximum Marks : 75
More informationEvaluation of Conditional Preference Queries
Evaluation of Conditional Preference Queries Fabíola S.F. Pereira, Sandra de Amo Universidade Federal de Uberlândia, Brazil deamo@ufu.br fabfernandes@comp.ufu.br Abstract. The need for incorporating preference
More informationCOMPUSOFT, An international journal of advanced computer technology, 4 (6), June-2015 (Volume-IV, Issue-VI)
ISSN:2320-0790 Duplicate Detection in Hierarchical XML Multimedia Data Using Improved Multidup Method Prof. Pramod. B. Gosavi 1, Mrs. M.A.Patel 2 Associate Professor & HOD- IT Dept., Godavari College of
More informationPrefJoin: An Efficient Preference-aware Join Operator
: An Efficient Preference-aware Join Operator Mohamed E. Khalefa Mohamed F. Mobel Justin J. Levandosi Department of Computer Science and Engineering, University of Minnesota {halefa,mobel,justin}@cs.umn.edu
More informationSPARQL Extensions with Preferences: a Survey Olivier Pivert, Olfa Slama, Virginie Thion
SPARQL Extensions with Preferences: a Survey Olivier Pivert, Olfa Slama, Virginie Thion 31 st ACM Symposium on Applied Computing Pisa, Italy April 4-8, 2016 Outline 1 Introduction 2 3 4 Outline Introduction
More informationRelational Query Optimization
Relational Query Optimization Chapter 15 Ramakrishnan & Gehrke (Sections 15.1-15.6) CPSC404, Laks V.S. Lakshmanan 1 What you will learn from this lecture Cost-based query optimization (System R) Plan space
More informationCS246: Mining Massive Datasets Jure Leskovec, Stanford University
CS246: Mining Massive Datasets Jure Leskovec, Stanford University http://cs246.stanford.edu [Kumar et al. 99] 2/13/2013 Jure Leskovec, Stanford CS246: Mining Massive Datasets, http://cs246.stanford.edu
More informationProUD: Probabilistic Ranking in Uncertain Databases
Proc. 20th Int. Conf. on Scientific and Statistical Database Management (SSDBM'08), Hong Kong, China, 2008. ProUD: Probabilistic Ranking in Uncertain Databases Thomas Bernecker, Hans-Peter Kriegel, Matthias
More informationTemporal Dependencies
Temporal Dependencies Jef Wijsen University of Mons-Hainaut SYNONYMS None DEFINITION Static integrity constraints involve only the current database state. Temporal integrity constraints involve current,
More informationRIPPLE: A Scalable Framework for Distributed Processing of Rank Queries
RIPPLE: A Scalable Framework for Distributed Processing of Rank Queries George Tsatsanifos (NTUA) Dimitris Sacharidis ( Athena RC) Timos Sellis (RMIT) 17th International Conference on Extending Database
More informationAgenda. Discussion. Database/Relation/Tuple. Schema. Instance. CSE 444: Database Internals. Review Relational Model
Agenda CSE 444: Database Internals Review Relational Model Lecture 2 Review of the Relational Model Review Queries (will skip most slides) Relational Algebra SQL Review translation SQL à RA Needed for
More informationAutomatically Synthesizing SQL Queries from Input-Output Examples
Automatically Synthesizing SQL Queries from Input-Output Examples Sai Zhang University of Washington Joint work with: Yuyin Sun Goal: making it easier for non-expert users to write correct SQL queries
More informationData Integration: Schema Mapping
Data Integration: Schema Mapping Jan Chomicki University at Buffalo and Warsaw University March 8, 2007 Jan Chomicki (UB/UW) Data Integration: Schema Mapping March 8, 2007 1 / 13 Data integration Data
More informationMining Generalised Emerging Patterns
Mining Generalised Emerging Patterns Xiaoyuan Qian, James Bailey, Christopher Leckie Department of Computer Science and Software Engineering University of Melbourne, Australia {jbailey, caleckie}@csse.unimelb.edu.au
More informationChapter 11: Query Optimization
Chapter 11: Query Optimization Chapter 11: Query Optimization Introduction Transformation of Relational Expressions Statistical Information for Cost Estimation Cost-based optimization Dynamic Programming
More informationData Integration: Schema Mapping
Data Integration: Schema Mapping Jan Chomicki University at Buffalo and Warsaw University March 8, 2007 Jan Chomicki (UB/UW) Data Integration: Schema Mapping March 8, 2007 1 / 13 Data integration Jan Chomicki
More informationRelational Database Model. III. Introduction to the Relational Database Model. Relational Database Model. Relational Terminology.
III. Introduction to the Relational Database Model Relational Database Model In 1970, E. F. Codd published A Relational Model of Data for Large Shared Data Banks in CACM. In the early 1980s, commercially
More informationMotivation and Basics Flat networks Hierarchy by dominating sets Hierarchy by clustering Adaptive node activity. Topology Control
Topology Control Andreas Wolf (0325330) 17.01.2007 1 Motivation and Basics 2 Flat networks 3 Hierarchy by dominating sets 4 Hierarchy by clustering 5 Adaptive node activity Options for topology control
More informationVision Document 2.0 Online Book Store Phase-II. Vamsi Krishna Mummaneni
Vision Document 2.0 Online Book Store Phase-II Submitted in partial fulfillment of the requirements of the degree of Master of Software Engineering Vamsi Krishna Mummaneni CIS 895 MSE Project Kansas State
More informationTHE RELATIONAL MODEL. University of Waterloo
THE RELATIONAL MODEL 1-1 List of Slides 1 2 The Relational Model 3 Relations and Databases 4 Example 5 Another Example 6 What does it mean? 7 Example Database 8 What can we do with it? 9 Variables and
More informationQuery Evaluation and Optimization
Query Evaluation and Optimization Jan Chomicki University at Buffalo Jan Chomicki () Query Evaluation and Optimization 1 / 21 Evaluating σ E (R) Jan Chomicki () Query Evaluation and Optimization 2 / 21
More informationRank-aware XML Data Model and Algebra: Towards Unifying Exact Match and Similar Match in XML
Proceedings of the 7th WSEAS International Conference on Multimedia, Internet & Video Technologies, Beijing, China, September 15-17, 2007 253 Rank-aware XML Data Model and Algebra: Towards Unifying Exact
More informationChristofides Algorithm
2. compute minimum perfect matching of odd nodes 2. compute minimum perfect matching of odd nodes 2. compute minimum perfect matching of odd nodes 3. find Eulerian walk node order 2. compute minimum perfect
More informationHYRISE In-Memory Storage Engine
HYRISE In-Memory Storage Engine Martin Grund 1, Jens Krueger 1, Philippe Cudre-Mauroux 3, Samuel Madden 2 Alexander Zeier 1, Hasso Plattner 1 1 Hasso-Plattner-Institute, Germany 2 MIT CSAIL, USA 3 University
More informationCSE 344 Midterm Exam
CSE 344 Midterm Exam February 9, 2015 Question 1 / 10 Question 2 / 39 Question 3 / 16 Question 4 / 28 Question 5 / 12 Total / 105 The exam is closed everything except for 1 letter-size sheet of notes.
More informationContents Contents Introduction Basic Steps in Query Processing Introduction Transformation of Relational Expressions...
Contents Contents...283 Introduction...283 Basic Steps in Query Processing...284 Introduction...285 Transformation of Relational Expressions...287 Equivalence Rules...289 Transformation Example: Pushing
More informationSeq2SQL: Generating Structured Queries from Natural Language Using Reinforcement Learning
Seq2SQL: Generating Structured Queries from Natural Language Using Reinforcement Learning V. Zhong, C. Xiong, R. Socher Salesforce Research arxiv: 1709.00103 Reviewed by : Bill Zhang University of Virginia
More informationADVANCED DATABASES ; Spring 2015 Prof. Sang-goo Lee (11:00pm: Mon & Wed: Room ) Advanced DB Copyright by S.-g.
4541.564; Spring 2015 Prof. Sang-goo Lee (11:00pm: Mon & Wed: Room 301-203) ADVANCED DATABASES Copyright by S.-g. Lee Review - 1 General Info. Text Book Database System Concepts, 6 th Ed., Silberschatz,
More informationRelational Algebra. Procedural language Six basic operators
Relational algebra Relational Algebra Procedural language Six basic operators select: σ project: union: set difference: Cartesian product: x rename: ρ The operators take one or two relations as inputs
More informationLecture 16. The Relational Model
Lecture 16 The Relational Model Lecture 16 Today s Lecture 1. The Relational Model & Relational Algebra 2. Relational Algebra Pt. II [Optional: may skip] 2 Lecture 16 > Section 1 1. The Relational Model
More informationInteractive Query Refinement
Interactive Query Refinement Chaitanya Mishra University of Toronto cmishra@cs.toronto.edu Nick Koudas University of Toronto koudas@cs.toronto.edu ABSTRACT We investigate the problem of refining SQL queries
More informationTwo hours UNIVERSITY OF MANCHESTER SCHOOL OF COMPUTER SCIENCE
COMP 62421 Two hours UNIVERSITY OF MANCHESTER SCHOOL OF COMPUTER SCIENCE Querying Data on the Web Date: Wednesday 24th January 2018 Time: 14:00-16:00 Please answer all FIVE Questions provided. They amount
More informationf xx (x, y) = 6 + 6x f xy (x, y) = 0 f yy (x, y) = y In general, the quantity that we re interested in is
1. Let f(x, y) = 5 + 3x 2 + 3y 2 + 2y 3 + x 3. (a) Final all critical points of f. (b) Use the second derivatives test to classify the critical points you found in (a) as a local maximum, local minimum,
More informationChapter 2: Intro to Relational Model
Non è possibile visualizzare l'immagine. Chapter 2: Intro to Relational Model Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Example of a Relation attributes (or columns)
More informationSQLite, Firefox, and our small IMDB movie database. CS3200 Database design (sp18 s2) Version 1/17/2018
SQLite, Firefox, and our small IMDB movie database CS3200 Database design (sp18 s2) Version 1/17/2018 1 Overview This document covers 2 issues: How to install SQLite manager in Firefox browser: - SQLite
More informationReverse Engineering Aggregation Queries
Reverse Engineering Aggregation Queries Wei Chit Tan Meihui Zhang Hazem Elmeleegy 2 Divesh Srivastava 3 Singapore University of Technology and Design, 2 Turn Inc., 3 AT&T Labs-Research weichit tan@mymail.sutd.edu.sg,
More informationDIRA : A FRAMEWORK OF DATA INTEGRATION USING DATA QUALITY
DIRA : A FRAMEWORK OF DATA INTEGRATION USING DATA QUALITY Reham I. Abdel Monem 1, Ali H. El-Bastawissy 2 and Mohamed M. Elwakil 3 1 Information Systems Department, Faculty of computers and information,
More informationWhy SQL? SQL is a very-high-level language. Database management system figures out best way to execute query
Basic SQL Queries 1 Why SQL? SQL is a very-high-level language Say what to do rather than how to do it Avoid a lot of data-manipulation details needed in procedural languages like C++ or Java Database
More informationManaging Inconsistencies in Collaborative Data Management
Managing Inconsistencies in Collaborative Data Management Eric Kao Logic Group Computer Science Department Stanford University Talk given at HP Labs on November 9, 2010 Structured Data Public Sources Company
More informationKeyword query interpretation over structured data
Keyword query interpretation over structured data Advanced Methods of Information Retrieval Elena Demidova SS 2018 Elena Demidova: Advanced Methods of Information Retrieval SS 2018 1 Recap Elena Demidova:
More informationA Study on Reverse Top-K Queries Using Monochromatic and Bichromatic Methods
A Study on Reverse Top-K Queries Using Monochromatic and Bichromatic Methods S.Anusuya 1, M.Balaganesh 2 P.G. Student, Department of Computer Science and Engineering, Sembodai Rukmani Varatharajan Engineering
More informationDatabase Applications (15-415)
Database Applications (15-415) DMS Internals- Part X Lecture 21, April 7, 2015 Mohammad Hammoud Last Session: DMS Internals- Part IX Query Optimization Today Today s Session: DMS Internals- Part X Query
More informationInstructor: Craig Duckett. Lecture 03: Tuesday, April 3, 2018 SQL Sorting, Aggregates and Joining Tables
Instructor: Craig Duckett Lecture 03: Tuesday, April 3, 2018 SQL Sorting, Aggregates and Joining Tables 1 Assignment 1 is due LECTURE 5, Tuesday, April 10 th, 2018 in StudentTracker by MIDNIGHT MID-TERM
More informationCSE 544 Principles of Database Management Systems. Magdalena Balazinska Winter 2015 Lecture 17 Database Systems as a Cloud Service
CSE 544 Principles of Database Management Systems Magdalena Balazinska Winter 2015 Lecture 17 Database Systems as a Cloud Service Final Project Presentations Presentation Logistics Where: CSE 403 When
More information5/13/2009. Introduction. Introduction. Introduction. Introduction. Introduction
Applying Collaborative Filtering Techniques to Movie Search for Better Ranking and Browsing Seung-Taek Park and David M. Pennock (ACM SIGKDD 2007) Two types of technologies are widely used to overcome
More informationCS 377 Database Systems
CS 377 Database Systems Relational Data Model Li Xiong Department of Mathematics and Computer Science Emory University 1 Outline Relational Model Concepts Relational Model Constraints Relational Database
More informationInformation Systems (Informationssysteme)
Information Systems (Informationssysteme) Jens Teubner, TU Dortmund jensteubner@cstu-dortmundde Summer 2018 c Jens Teubner Information Systems Summer 2018 1 Part IV Database Design c Jens Teubner Information
More informationAnnouncements. Outline UNIQUE. (Inner) joins. (Inner) Joins. Database Systems CSE 414. WQ1 is posted to gradebook double check scores
Announcements Database Systems CSE 414 Lectures 4: Joins & Aggregation (Ch. 6.1-6.4) WQ1 is posted to gradebook double check scores WQ2 is out due next Sunday HW1 is due Tuesday (tomorrow), 11pm HW2 is
More informationChapter 2: Relational Model
Chapter 2: Relational Model Database System Concepts, 5 th Ed. See www.db-book.com for conditions on re-use Chapter 2: Relational Model Structure of Relational Databases Fundamental Relational-Algebra-Operations
More informationCPSC 310: Database Systems / CSPC 603: Database Systems and Applications Final Exam Fall 2005
CPSC 310: Database Systems / CSPC 603: Database Systems and Applications Final Exam Fall 2005 Name: Instructions: 1. This is a closed book exam. Do not use any notes or books, other than your three 8.5-by-11
More informationData Warehousing and Data Mining. Announcements (December 1) Data integration. CPS 116 Introduction to Database Systems
Data Warehousing and Data Mining CPS 116 Introduction to Database Systems Announcements (December 1) 2 Homework #4 due today Sample solution available Thursday Course project demo period has begun! Check
More informationReDRIVE: Result-Driven Database Exploration through Recommendations
ReDRIVE: Result-Driven Database Exploration through Recommendations Marina Drosou Computer Science Department University of Ioannina, Greece mdrosou@cs.uoi.gr Evaggelia Pitoura Computer Science Department
More informationSpace-efficient Online Approximation of Time Series Data: Streams, Amnesia, and Out-of-order
Space-efficient Online Approximation of Time Series Data: Streams, Amnesia, and Out-of-order Luca Foschini joint work with Sorabh Gandhi and Subhash Suri University of California Santa Barbara ICDE 2010
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY Database Systems: Fall 2015 Quiz I
Department of Electrical Engineering and Computer Science MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.830 Database Systems: Fall 2015 Quiz I There are 12 questions and 13 pages in this quiz booklet. To receive
More informationFastQRE: Fast Query Reverse Engineering
FastQRE: Fast Query Reverse Engineering Dmitri V. Kalashnikov AT&T Labs Research dvk@research.att.com Laks V.S. Lakshmanan University of British Columbia laks@cs.ubc.ca Divesh Srivastava AT&T Labs Research
More informationDiscovering Periodic Patterns in Database Audit Trails
Vol.29 (DTA 2013), pp.365-371 http://dx.doi.org/10.14257/astl.2013.29.76 Discovering Periodic Patterns in Database Audit Trails Marcin Zimniak 1, Janusz R. Getta 2, and Wolfgang Benn 1 1 Faculty of Computer
More information