Keyword Search in Databases

Size: px
Start display at page:

Download "Keyword Search in Databases"

Transcription

1 + Databases and Information Retrieval Integration TIETS42 Keyword Search in Databases Autumn 2016 Kostas Stefanidis

2 + Ranking Results of Keyword Search Keyword-based search is very popular! It allows user to discover information without knowing the structure of data or any query language Goal: Enable IR-style keyword search over DBMSs Examples: Movies database, Online shopping, Why ranking: Too many results may match a keyword query Users are interested in a few results 2

3 + Ranking Results of Keyword Search Basic idea in relational databases: locate tuples in the database that contain query keywords and can be joined together idm title genre year director m1 Dracula thriller 1992 F. F. Coppola m2 Twelve Monkeys thriller 1996 T. Gilliam m3 Seven thriller 1996 D. Fincher m4 Schindler s List drama 1993 S. Spielberg m5 Picking up the Pieces comed y 2000 A. Arau Movies Play idm m1 m2 m3 m4 m5 ida a1 a2 a2 a3 a4 3 ida name gender dob a1 G. Oldman male 1958 a2 B. Pitt male 1963 a3 L. Neeson male 1952 a4 W. Allen male 1935 Actors

4 + Keyword Search in Relational Databases Q = {thriller, B. Pitt} idm title genre year director m1 Dracula thriller 1992 F. F. Coppola m2 Twelve Monkeys thriller 1996 T. Gilliam m3 Seven thriller 1996 D. Fincher m4 Schindler s List drama 1993 S. Spielberg m5 Play Movies idm m1 m2 m3 m4 m5 Picking up the Pieces ida a1 a2 a2 a3 a4 comedy 2000 A. Arau ida name gender dob a1 G. Oldman male 1958 a2 B. Pitt male 1963 a3 L. Neeson male 1952 a4 W. Allen male 1935 Actors 4 m2, Twelve Monkeys, thriller, 1996, T. Gilliam a2, B. Pitt, male, 1963 m2, a2 m3, Seven, thriller, 1996, D. Fincher a2, B. Pitt, male, 1963 query result: joining trees of tuples (JTTs) total minimal m3, a2

5 + Ranking Results of Keyword Search Given the abundance of available information, exploring the contents of a database is a complex procedure A huge volume of data may be returned Results may be vague The need to rank results arises 5

6 + Ranking Results of Keyword Search Rank JTTs based on their relevance to the query Relevance based on the JTT size (e.g., Hristidis et al. [VLDB2002], Agrawal et al. [ICDE 2002]) The smaller the size of JTT, the smaller the number of joins, thus the largest its relevance Relevance based on the importance of its tuples e.g., assign scores to JTTs based on the prestige of their tuples (Bhalotia et al. [ICDE 2002]) or adapt IR-style document relevance ranking (Hristidis et al. [VLDB 2003]) Exploit user preferences in ranking keyword search results e.g., Koutrika et al. [ICDE 2006], Stefanidis et al. [EDBT 2010] 6

7 + Keyword Search in Relational Databases Schema-based keyword search Use the schema of the database Graph-based keyword search Materialize the database as a directed graph 7

8 + How to compute keyword search results Discover [VLDB 2002] Use a database schema based approach to retrieve JTTs that answer a query 8

9 + Keyword Query Processing Q = {thriller, B. Pitt} Results: m2, Twelve Monkeys, thriller, 1996, T. Gilliam a2, B. Pitt, male, 1963 m2, a2 idm title genre year director m1 Dracula thriller 1992 F. F. Coppola m2 Twelve Monkeys thriller 1996 T. Gilliam m3 Seven thriller 1996 D. Fincher m4 Schindler s List drama 1993 S. Spielberg m5 Movies Picking up the Pieces comedy 2000 A. Arau m3, Seven, thriller, 1996, D. Fincher a2, B. Pitt, male, 1963 m3, a2 idm m1 m2 m3 m4 m5 ida a1 a2 a2 a3 a4 ida name gender dob a1 G. Oldman male 1958 a2 B. Pitt male 1963 a3 L. Neeson male 1952 a4 W. Allen male 1935 Actors These JTTs are produced using the schema level tree: Movies {thriller} Play {} Actors {B. Pitt} Such trees are called joining trees of tuple sets (JTSs) Play Construct JTSs as an intermediate step of the computation of JTTs

10 + Algorithm Sketch Given a query Q, the algorithm constructs the JTSs with size up to s Compute all possible tuple sets R i X R ix = {t t R i and w x X, t contains w x and w y Q\X, t does not contain w y } Select randomly a query keyword w z Locate all tuple sets R ix, for which w z X These are the initial JTSs with only one node Expand trees either by adding a tuple set that contains at least another query keyword or a tuple set for which X = {} (free tuple set) These trees can be further expanded Movies {thriller} - Play {} - Actors {B. Pitt}

11 + Algorithm Sketch Given a query Q, the algorithm constructs the JTSs with size up to s Compute all possible tuple sets R i X Select randomly a query keyword w z Locate all tuple sets R ix, for which w z X Expand trees either by adding a tuple set that contains at least another query keyword or a tuple set for which X = {} (free tuple set) These trees can be further expanded Movies {thriller} - Play {} - Actors {B. Pitt} JTSs that contain all query keywords are returned JTSs of the form R ix R j {} R iy, where an edge (R j R i ) exists in the schema graph, are pruned JTTs produced by them have more than one occurrence of the same tuple for every instance of the database

12 + Reusability Opportunities Each JTS corresponds to a SQL statement JTS1: O Smith C O Miller JTS2: O Smith C N C O Miller Execution Plan JTS1 O Smith C O Miller JTS2 O Smith C N C O Miller 12

13 + Reuse Common Sub-expressions Execution Plan JTS1 O Smith C O Miller JTS2 O Smith C N C O Miller Optimized Execution Plan Temp O Smith C JTS1 Temp O Miller JTS2 Temp N C O Miller 13

14 + How to compute keyword search results DBXplorer [ICDE 2002] Use a database schema based approach to retrieve JTTs that answer a query 14

15 + How to compute keyword search results DBXplorer [ICDE 2002] Publish: index the database keywords (Symbol Table S) For each keyword, keep the columns that the keyword appears For each keyword, keep the tuples that contain the keyword Search: Look at S to identify the tables, and columns/rows containing the query keywords Identify and enumerate all possible joins Generate an SQL statement for each join 15

16 + How to compute keyword search results Banks [ICDE 2002] Model the database as a graph to retrieve JTTs that answer a query 16

17 + Basic Model Model the database as a graph Nodes tuples Edges references between tuples Foreign key (edges are directed) ProgressiveSk:Skyline Queries Yufei:ProgressiveSk MBR:Topology in R trees PaperId:PaperName paper AuthorID:PaperId writes AuthorId Yufei Tao Papadias Sellis author 17

18 + Answer Model Query: set of keywords {k 1,, k n } For each ki, find the set of nodes Si containing/matching ki Query example: {Papadias, Sellis} Answer: rooted and directed trees with nodes with matching keywords Root nodes with some significance, e.g., use entities, not relationships Ranking based on proximity and prestige 18

19 + Example Q = {Papadias, Sellis} Writes Paper Topological relations in R trees Writes Author Dimitris Papadias Timos Sellis Author Goal: Find sets of (closely) connected tuples that match all given keywords 19

20 + Edges Directionality Directions may lead to missing answers Q ={DBXplorer, ObjectRank} BANKS CitedBy Cites Cites Cited DBXplorer Cited ObjectRank 20

21 + Edges Directionality Add backward edges Q ={DBXplorer, ObjectRank} BANKS CitedBy Cites Cites Cited DBXplorer Cited ObjectRank 21

22 + Weights Weights of forward edges Use the database schema Weights of backward edges Number of edges pointing to the node (in-degree) Weights of nodes Node in-degree Nodes with so many references are of a higher prestige Combine nodes and edges weights 22

23 + How to compute keyword search results Symbol Table: index the database keywords For each keyword, keep the nodes that contain the keyword/matching nodes Search: Backward Expanding Search Algorithm Assume sets S ki with nodes containing keyword ki Idea: find nodes from which a forward path exists to at least one node from each S ki 23

24 + Search Backward Expanding Search Algorithm Run concurrently single source shortest path algorithm from each node matching a keyword Create an iterator for each node containing a keyword Traverse the graph edges in reverse direction Do best-first search across iterators Output an answer when its root has been reached from each keyword Assumption: The graph fits in memory Answer trees may not be generated in relevance order 24

25 + Example Q ={Yufei, Papadias} PaperId:PaperName Yufei:ProgressiveSk ProgressiveSk:Skyline Queries paper AuthorID:PaperId writes Yufei Tao Dimitris Papadias AuthorId author Iterators 25

26 + Ranking This tree is output Better Root Missed 26

27 + Ranking First generate the results, then rank them High computational cost Better solution: use a heap, order based on the relevance of the trees Return the highest ranked tree from the heap 27

28 + Plain text coexists with structured data Enable IR-style keyword search over databases 28

29 + Example Complaints Database Schema Products prodid manufacturer model Complaints prodid custid date comments Customers custid name occupation example from Vagelis Hristidis

30 Example - Complaints Database Data Complaints tupleid prodid custid date comments c1 p121 c disk crashed after just one week of moderate use on an IBM Netvista X41 c2 p131 c lower-end IBM Netvista caught fire, starting apparently with disk c3 p131 c IBM Netvista unstable with Maxtor HD Customers tupleid custid name occupation u1 c3232 John Smith u2 c3131 Jack Lucas u3 c3143 John Mayer Software Engineer Architect Student Products tupleid prodid manufacturer model p1 p121 Maxtor D540X p2 p131 IBM Netvista p3 p141 Tripplite Smart 700VA

31 Example Keyword Query [Maxtor Netvista] Complaints tupleid prodid custid date comments c1 p121 c disk crashed after just one week of moderate use on an IBM Netvista X41 c2 p131 c lower-end IBM Netvista caught fire, starting apparently with disk c3 p131 c IBM Netvista unstable with Maxtor HD Customers tupleid custid name occupation u1 c3232 John Smith u2 c3131 Jack Lucas u3 c3143 John Mayer Software Engineer Architect Student Products tupleid prodid manufacturer model p1 p121 Maxtor D540X p2 p131 IBM Netvista p3 p141 Tripplite Smart 700VA

32 + Semantics Keywords in tuples connected through primary foreign key relationships Score of a result tree computed with an IR-style technique 32

33 Example Keyword Query [Maxtor Netvista] Complaints tupleid prodid custid date comments c1 p121 c disk crashed after just one week of moderate use on an IBM Netvista X41 c2 p131 c lower-end IBM Netvista caught fire, starting apparently with disk c3 p131 c IBM Netvista unstable with Maxtor HD Customers tupleid custid name occupation u1 c3232 John Smith u2 c3131 Jack Lucas u3 c3143 John Mayer Software Engineer Architect Student Products tupleid prodid manufacturer model p1 p121 Maxtor D540X p2 p131 IBM Netvista p3 p141 Tripplite Smart 700VA Results: (1) c3, (2) p2 c3, (3) p1 c1 (2) ranked higher than (3): score for c3 is higher than that of c1

34 + Keyword Query Result AND semantics Every query keywords appears in the result tree OR semantics Some query keywords might be missing from the result tree Score of a result tree T : a T Score(a)/size(T) For Score(a) use IR ranking functions 34

35 Example Keyword Query [Maxtor Netvista] Complaints Customers tupleid prodid custid date comments c1 p121 c disk crashed after just one week of moderate use on an IBM Netvista X41 c2 p131 c lower-end IBM Netvista caught fire, starting apparently with disk Score(p1 c1) = (1+1/3)/2 = 4/6 c3 p131 c IBM Netvista unstable with Maxtor HD tupleid custid name occupation u1 c3232 John Smith u2 c3131 Jack Lucas Score(p2 c3) = (1+4/3)/2 = 7/6 u3 c3143 John Mayer Software Engineer Architect Student Products Score(c3) = 4/3 tupleid prodid manufacturer model p1 p121 Maxtor D540X p2 p131 IBM Netvista p3 p141 Tripplite Smart 700VA score 1/3 1/3 4/3 score Results: (1) c3, (2) p2 c3, (3) p1 c1

36 + Questions? 36

Personalized Keyword Search Drawbacks found ANNE JERONEN, ARMAND NOUBISIE, YUDIT PONCE

Personalized Keyword Search Drawbacks found ANNE JERONEN, ARMAND NOUBISIE, YUDIT PONCE Personalized Keyword Search Drawbacks found 28..26 ANNE JERONEN, ARMAND NOUBISIE, YUDIT PONCE Introduction Personalized keyword search Drawbacks Suggested solution Introduction Georgia Koutrika, Alkis

More information

Keyword search in relational databases. By SO Tsz Yan Amanda & HON Ka Lam Ethan

Keyword search in relational databases. By SO Tsz Yan Amanda & HON Ka Lam Ethan Keyword search in relational databases By SO Tsz Yan Amanda & HON Ka Lam Ethan 1 Introduction Ubiquitous relational databases Need to know SQL and database structure Hard to define an object 2 Query representation

More information

Personalized Keyword Search Contributions

Personalized Keyword Search Contributions Personalized Keyword Search Contributions Introduction Georgia Koutrika, Alkis Simitsis, Yannis E. Ioannidis: Précis: The Essence of a Query Answer. ICDE 26 Kostas Stefanidis, Marina Drosou, Evaggelia

More information

Fast Contextual Preference Scoring of Database Tuples

Fast Contextual Preference Scoring of Database Tuples Fast Contextual Preference Scoring of Database Tuples Kostas Stefanidis Department of Computer Science, University of Ioannina, Greece Joint work with Evaggelia Pitoura http://dmod.cs.uoi.gr 2 Motivation

More information

Keyword Search in Databases

Keyword Search in Databases Keyword Search in Databases Wei Wang University of New South Wales, Australia Outline Based on the tutorial given at APWeb 2006 Introduction IR Preliminaries Systems Open Issues Dr. Wei Wang @ CSE, UNSW

More information

SPARK: Top-k Keyword Query in Relational Database

SPARK: Top-k Keyword Query in Relational Database SPARK: Top-k Keyword Query in Relational Database Wei Wang University of New South Wales Australia 20/03/2007 1 Outline Demo & Introduction Ranking Query Evaluation Conclusions 20/03/2007 2 Demo 20/03/2007

More information

Roadmap. Roadmap. Ranking Web Pages. PageRank. Roadmap. Random Walks in Ranking Query Results in Semistructured Databases

Roadmap. Roadmap. Ranking Web Pages. PageRank. Roadmap. Random Walks in Ranking Query Results in Semistructured Databases Roadmap Random Walks in Ranking Query in Vagelis Hristidis Roadmap Ranking Web Pages Rank according to Relevance of page to query Quality of page Roadmap PageRank Stanford project Lawrence Page, Sergey

More information

Implementation of Skyline Sweeping Algorithm

Implementation of Skyline Sweeping Algorithm Implementation of Skyline Sweeping Algorithm BETHINEEDI VEERENDRA M.TECH (CSE) K.I.T.S. DIVILI Mail id:veeru506@gmail.com B.VENKATESWARA REDDY Assistant Professor K.I.T.S. DIVILI Mail id: bvr001@gmail.com

More information

Querying Wikipedia Documents and Relationships

Querying Wikipedia Documents and Relationships Querying Wikipedia Documents and Relationships Huong Nguyen Thanh Nguyen Hoa Nguyen Juliana Freire School of Computing and SCI Institute, University of Utah {huongnd,thanhh,thanhhoa,juliana}@cs.utah.edu

More information

Intranet Search. Exploiting Databases for Document Retrieval. Christoph Mangold Universität Stuttgart

Intranet Search. Exploiting Databases for Document Retrieval. Christoph Mangold Universität Stuttgart Intranet Search Exploiting Databases for Document Retrieval Christoph Mangold Universität Stuttgart 2 /6 The Big Picture: Assume. there is a glueing problem with product P7 Has this happened before? Is

More information

Extending Keyword Search to Metadata in Relational Database

Extending Keyword Search to Metadata in Relational Database DEWS2008 C6-1 Extending Keyword Search to Metadata in Relational Database Jiajun GU Hiroyuki KITAGAWA Graduate School of Systems and Information Engineering Center for Computational Sciences University

More information

Databases and Information Retrieval Integration TIETS42. Kostas Stefanidis Autumn 2016

Databases and Information Retrieval Integration TIETS42. Kostas Stefanidis Autumn 2016 + Databases and Information Retrieval Integration TIETS42 Autumn 2016 Kostas Stefanidis kostas.stefanidis@uta.fi http://www.uta.fi/sis/tie/dbir/index.html http://people.uta.fi/~kostas.stefanidis/dbir16/dbir16-main.html

More information

Keyword query interpretation over structured data

Keyword query interpretation over structured data Keyword query interpretation over structured data Advanced Methods of IR Elena Demidova Materials used in the slides: Jeffrey Xu Yu, Lu Qin, Lijun Chang. Keyword Search in Databases. Synthesis Lectures

More information

Keyword query interpretation over structured data

Keyword query interpretation over structured data Keyword query interpretation over structured data Advanced Methods of Information Retrieval Elena Demidova SS 2018 Elena Demidova: Advanced Methods of Information Retrieval SS 2018 1 Recap Elena Demidova:

More information

Keyword Search over Hybrid XML-Relational Databases

Keyword Search over Hybrid XML-Relational Databases SICE Annual Conference 2008 August 20-22, 2008, The University Electro-Communications, Japan Keyword Search over Hybrid XML-Relational Databases Liru Zhang 1 Tadashi Ohmori 1 and Mamoru Hoshi 1 1 Graduate

More information

International Journal of Advance Engineering and Research Development. Performance Enhancement of Search System

International Journal of Advance Engineering and Research Development. Performance Enhancement of Search System Scientific Journal of Impact Factor(SJIF): 3.134 International Journal of Advance Engineering and Research Development Volume 2,Issue 7, July -2015 Performance Enhancement of Search System Ms. Uma P Nalawade

More information

Effective Top-k Keyword Search in Relational Databases Considering Query Semantics

Effective Top-k Keyword Search in Relational Databases Considering Query Semantics Effective Top-k Keyword Search in Relational Databases Considering Query Semantics Yanwei Xu 1,2, Yoshiharu Ishikawa 1, and Jihong Guan 2 1 Graduate School of Information Science, Nagoya University, Japan

More information

MovieNet: A Social Network for Movie Enthusiasts

MovieNet: A Social Network for Movie Enthusiasts MovieNet: A Social Network for Movie Enthusiasts 445 Course Project MovieNet is a social network for movie enthusiasts, containing a database of movies, actors/actresses, directors, etc., and a social

More information

Preferences in Databases. Representation Composition

Preferences in Databases. Representation Composition Preferences in Databases Representation Composition & Application Georgia Koutrika (1), Evaggelia Pitoura (2), Kostas Stefanidis (2) (1) Stanford University, (2) University of Ioannina Introduction 2 Preferences

More information

MovieNet: A Social Network for Movie Enthusiasts

MovieNet: A Social Network for Movie Enthusiasts MovieNet: A Social Network for Movie Enthusiasts 445 Course Project Yanlei Diao UMass Amherst Overview MovieNet is a social network for movie enthusiasts, containing a database of movies, actors/actresses,

More information

Volume 2, Issue 11, November 2014 International Journal of Advance Research in Computer Science and Management Studies

Volume 2, Issue 11, November 2014 International Journal of Advance Research in Computer Science and Management Studies Volume 2, Issue 11, November 2014 International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online at: www.ijarcsms.com

More information

PACOKS: Progressive Ant-Colony-Optimization-Based Keyword Search over Relational Databases

PACOKS: Progressive Ant-Colony-Optimization-Based Keyword Search over Relational Databases PACOKS: Progressive Ant-Colony-Optimization-Based Keyword Search over Relational Databases Ziyu Lin 1(B), Qian Xue 1, and Yongxuan Lai 2 1 Department of Computer Science, Xiamen University, Xiamen, China

More information

SPARK2: Top-k Keyword Query in Relational Databases

SPARK2: Top-k Keyword Query in Relational Databases TKDE SPECIAL IUE: KEYWORD SEARCH ON STRUCTURED DATA, 20 SPARK2: Top-k Keyword Query in Relational Databases Yi Luo, Wei Wang, Member, IEEE, Xuemin Lin, Xiaofang Zhou, Senior Member, IEEE Jianmin Wang,

More information

Keyword Search in External Memory Graph Representations of Data

Keyword Search in External Memory Graph Representations of Data Keyword Search in External Memory Graph Representations of Data B. Tech. Seminar Report Submitted in partial fulfillment of the requirements for the degree of Bachelor of Technology by Avin Mittal Roll

More information

Hierarchical Result Views for Keyword Queries over Relational Databases

Hierarchical Result Views for Keyword Queries over Relational Databases Hierarchical Result Views for Keyword Queries over Relational Databases Shiyuan Wang Department of Computer Science, UC Santa Barbara Santa Barbara, CA, USA sywang@cs.ucsb.edu Oliver Po NEC Laboratories

More information

A System for Query-Specific Document Summarization

A System for Query-Specific Document Summarization A System for Query-Specific Document Summarization Ramakrishna Varadarajan, Vagelis Hristidis. FLORIDA INTERNATIONAL UNIVERSITY, School of Computing and Information Sciences, Miami. Roadmap Need for query-specific

More information

Lecture #14 Optimizer Implementation (Part I)

Lecture #14 Optimizer Implementation (Part I) 15-721 ADVANCED DATABASE SYSTEMS Lecture #14 Optimizer Implementation (Part I) Andy Pavlo / Carnegie Mellon University / Spring 2016 @Andy_Pavlo // Carnegie Mellon University // Spring 2017 2 TODAY S AGENDA

More information

Précis: The Essence of a Query Answer *

Précis: The Essence of a Query Answer * Précis: The Essence of a Query Answer * Georgia Koutrika University of Athens koutrika@di.uoa.gr Alkis Simitsis Nat. Tech. Univ. of Athens asimi@dblab.ntua.gr Yannis Ioannidis University of Athens yannis@di.uoa.gr

More information

Efficient Keyword Search Across Heterogeneous Relational Databases

Efficient Keyword Search Across Heterogeneous Relational Databases Efficient Keyword Search Across Heterogeneous Relational Databases Mayssam Sayyadian 1, Hieu LeKhac 2, AnHai Doan 1, Luis Gravano 3 1 University of Wisconsin-Madison 2 University of Illinois-Urbana 3 Columbia

More information

EFFICIENT APPROACH FOR DETECTING HARD KEYWORD QUERIES WITH MULTI-LEVEL NOISE GENERATION

EFFICIENT APPROACH FOR DETECTING HARD KEYWORD QUERIES WITH MULTI-LEVEL NOISE GENERATION EFFICIENT APPROACH FOR DETECTING HARD KEYWORD QUERIES WITH MULTI-LEVEL NOISE GENERATION B.Mohankumar 1, Dr. P. Marikkannu 2, S. Jansi Rani 3, S. Suganya 4 1 3 4Asst Prof, Department of Information Technology,

More information

Searching Databases with Keywords

Searching Databases with Keywords Shan Wang et al.: Searching Databases with Keywords 1 Searching Databases with Keywords Shan Wang and Kun-Long Zhang School of Information, Renmin University of China, Beijing, 100872, P.R. China E-mail:

More information

IJESRT. Scientific Journal Impact Factor: (ISRA), Impact Factor: 2.114

IJESRT. Scientific Journal Impact Factor: (ISRA), Impact Factor: 2.114 [Saranya, 4(3): March, 2015] ISSN: 2277-9655 IJESRT INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY A SURVEY ON KEYWORD QUERY ROUTING IN DATABASES N.Saranya*, R.Rajeshkumar, S.Saranya

More information

Top-k Keyword Search Over Graphs Based On Backward Search

Top-k Keyword Search Over Graphs Based On Backward Search Top-k Keyword Search Over Graphs Based On Backward Search Jia-Hui Zeng, Jiu-Ming Huang, Shu-Qiang Yang 1College of Computer National University of Defense Technology, Changsha, China 2College of Computer

More information

Keyword Join: Realizing Keyword Search in P2P-based Database Systems

Keyword Join: Realizing Keyword Search in P2P-based Database Systems Keyword Join: Realizing Keyword Search in P2P-based Database Systems Bei Yu, Ling Liu 2, Beng Chin Ooi 3 and Kian-Lee Tan 3 Singapore-MIT Alliance 2 Georgia Institute of Technology, 3 National University

More information

CONSTRAINTS AND UPDATES CHAPTER 3 (6/E) CHAPTER 5 (5/E)

CONSTRAINTS AND UPDATES CHAPTER 3 (6/E) CHAPTER 5 (5/E) 1 CONSTRAINTS AND UPDATES CHAPTER 3 (6/E) CHAPTER 5 (5/E) QUESTION Film title genre year director runtime budget gross The Company Men drama 2010 John Wells 104 15,000,000 4,439,063 Steven Lincoln biography

More information

MET CS 669 Database Design and Implementation for Business Term Project: Online DVD Rental Business

MET CS 669 Database Design and Implementation for Business Term Project: Online DVD Rental Business MET CS 669 Database Design and Implementation for Business Term Project: Online DVD Rental Business Objective Create an initial design for the database schema for an online DVD rental business that is

More information

Effective Keyword Search in Relational Databases for Lyrics

Effective Keyword Search in Relational Databases for Lyrics Effective Keyword Search in Relational Databases for Lyrics Navin Kumar Trivedi Assist. Professor, Department of Computer Science & Information Technology Divya Singh B.Tech (CSe) Scholar Pooja Pandey

More information

A FRAMEWORK FOR PROCESSING KEYWORD-BASED QUERIES IN RELATIONAL DATABASES

A FRAMEWORK FOR PROCESSING KEYWORD-BASED QUERIES IN RELATIONAL DATABASES A FRAMEWORK FOR PROCESSING KEYWORD-BASED QUERIES IN RELATIONAL DATABASES 1 EYAS EL-QAWASMEH, 1 OSSAMA ABU-EID, 2 ABDALLAH ALASHQUR 1 Jordan University of Science and Technology, Jordan 2 Applied Science

More information

Effective Searching of RDF Knowledge Bases

Effective Searching of RDF Knowledge Bases Effective Searching of RDF Knowledge Bases Shady Elbassuoni Joint work with: Maya Ramanath and Gerhard Weikum RDF Knowledge Bases Annie Hall is a 1977 American romantic comedy directed by Woody Allen and

More information

A Graph Method for Keyword-based Selection of the top-k Databases

A Graph Method for Keyword-based Selection of the top-k Databases This is the Pre-Published Version A Graph Method for Keyword-based Selection of the top-k Databases Quang Hieu Vu 1, Beng Chin Ooi 1, Dimitris Papadias 2, Anthony K. H. Tung 1 hieuvq@nus.edu.sg, ooibc@comp.nus.edu.sg,

More information

Supporting Fuzzy Keyword Search in Databases

Supporting Fuzzy Keyword Search in Databases I J C T A, 9(24), 2016, pp. 385-391 International Science Press Supporting Fuzzy Keyword Search in Databases Jayavarthini C.* and Priya S. ABSTRACT An efficient keyword search system computes answers as

More information

Graph-Based Synopses for Relational Data. Alkis Polyzotis (UC Santa Cruz)

Graph-Based Synopses for Relational Data. Alkis Polyzotis (UC Santa Cruz) Graph-Based Synopses for Relational Data Alkis Polyzotis (UC Santa Cruz) Data Synopses Data Query Result Data Synopsis Query Approximate Result Problem: exact answer may be too costly to compute Examples:

More information

MAINTAIN TOP-K RESULTS USING SIMILARITY CLUSTERING IN RELATIONAL DATABASE

MAINTAIN TOP-K RESULTS USING SIMILARITY CLUSTERING IN RELATIONAL DATABASE INTERNATIONAL JOURNAL OF RESEARCH IN COMPUTER APPLICATIONS AND ROBOTICS ISSN 2320-7345 MAINTAIN TOP-K RESULTS USING SIMILARITY CLUSTERING IN RELATIONAL DATABASE Syamily K.R 1, Belfin R.V 2 1 PG student,

More information

ADVANCED DATABASE SYSTEMS. Lecture #15. Optimizer Implementation (Part // // Spring 2018

ADVANCED DATABASE SYSTEMS. Lecture #15. Optimizer Implementation (Part // // Spring 2018 Lecture #15 ADVANCED DATABASE SYSTEMS Optimizer Implementation (Part I) @Andy_Pavlo // 15-721 // Spring 2018 2 Background Implementation Design Decisions Optimizer Search Strategies 3 QUERY OPTIMIZATION

More information

Administrivia. CS 133: Databases. Cost-based Query Sub-System. Goals for Today. Midterm on Thursday 10/18. Assignments

Administrivia. CS 133: Databases. Cost-based Query Sub-System. Goals for Today. Midterm on Thursday 10/18. Assignments Administrivia Midterm on Thursday 10/18 CS 133: Databases Fall 2018 Lec 12 10/16 Prof. Beth Trushkowsky Assignments Lab 3 starts after fall break No problem set out this week Goals for Today Cost-based

More information

CONSTRAINTS AND UPDATES CHAPTER 3 (6/E) CHAPTER 5 (5/E)

CONSTRAINTS AND UPDATES CHAPTER 3 (6/E) CHAPTER 5 (5/E) 1 CONSTRAINTS AND UPDATES CHAPTER 3 (6/E) CHAPTER 5 (5/E) 3 LECTURE OUTLINE Constraints in Relational Databases Update Operations 4 SATISFYING INTEGRITY CONSTRAINTS Constraints are restrictions on the

More information

PAPER SRT-Rank: Ranking Keyword Query Results in Relational Databases Using the Strongly Related Tree

PAPER SRT-Rank: Ranking Keyword Query Results in Relational Databases Using the Strongly Related Tree 2398 PAPER SRT-Rank: Ranking Keyword Query Results in Relational Databases Using the Strongly Related Tree In-Joong KIM, Student Member, Kyu-Young WHANG a), and Hyuk-Yoon KWON, Nonmembers SUMMARY A top-k

More information

Interactive keyword-based access to large-scale structured datasets

Interactive keyword-based access to large-scale structured datasets Interactive keyword-based access to large-scale structured datasets 2 nd Keystone Summer School 20 July 2016 Dr. Elena Demidova University of Southampton 1 Overview Keyword-based access to structured data

More information

Information Retrieval Overview

Information Retrieval Overview Roadmap Information Retrieval Overview Vagelis Hristidis School of Computer Science Florida International University COP 6727 What is IR? Matching Models Evaluation of Results Digital Libraries vs. IR

More information

Semantic Search Focus: IR on Structured Data

Semantic Search Focus: IR on Structured Data Semantic Search Focus: IR on Structured Data 8th European Summer School on Information Retrieval Duc Thanh Tran Institute AIFB, KIT, Germany Tran@aifb.uni-karlsruhe.de http://sites.google.com/site/kimducthanh

More information

2. E/R Design Considerations

2. E/R Design Considerations 2. E/R Design Considerations 32 What you will learn in this section Relationships cont d: multiplicity, multi-way Design considerations Conversion to SQL 33 Multiplicity of E/R Relationships Multiplicity

More information

Relational Model, Key Constraints

Relational Model, Key Constraints Relational Model, Key Constraints PDBM 6.1 Dr. Chris Mayfield Department of Computer Science James Madison University Jan 23, 2019 What is a data model? Notation for describing data or information Structure

More information

Integrating and Querying Source Code of Programs Working on a Database

Integrating and Querying Source Code of Programs Working on a Database Integrating and Querying Source Code of Working on a Database Carlos Garcia-Alvarado University of Houston Dept. of Computer Science Houston, TX, USA Carlos Ordonez University of Houston Dept. of Computer

More information

Contextual Database Preferences

Contextual Database Preferences Evaggelia Pitoura Dept. of Computer Science University of Ioannina, Greece pitoura@cs.uoi.gr Contextual Database Preferences Kostas Stefanidis Dept. of Computer Science and Engineering Chinese University

More information

Fast Contextual Preference Scoring of Database Tuples

Fast Contextual Preference Scoring of Database Tuples Fast Contextual Preference Scoring of Database Tuples Kostas Stefanidis Department of Computer Science University of Ioannina GR-45 Ioannina, Greece kstef@cs.uoi.gr Evaggelia Pitoura Department of Computer

More information

Keyword Join: Realizing Keyword Search for Information Integration

Keyword Join: Realizing Keyword Search for Information Integration Keyword Join: Realizing Keyword Search for Information Integration Bei YU, Ling LIU 2, Beng Chin OOI,3 and Kian-Lee TAN,3 Singapore-MIT Alliance, National University of Singapore 2 College of Computing,

More information

In-class activities: Sep 25, 2017

In-class activities: Sep 25, 2017 In-class activities: Sep 25, 2017 Activities and group work this week function the same way as our previous activity. We recommend that you continue working with the same 3-person group. We suggest that

More information

Outline. Quick Introduction to Database Systems. Data Manipulation Tasks. What do they all have in common? CSE142 Wi03 G-1

Outline. Quick Introduction to Database Systems. Data Manipulation Tasks. What do they all have in common? CSE142 Wi03 G-1 Outline Quick Introduction to Database Systems Why do we need a different kind of system? What is a database system? Separating the what the how: The relational data model Querying the databases: SQL May

More information

Bidirectional Expansion For Keyword Search on Graph Databases

Bidirectional Expansion For Keyword Search on Graph Databases Bidirectional Expansion For Keyword Search on Graph Databases Varun Kacholia Shashank Pandit Soumen Chakrabarti S. Sudarshan Rushi Desai Hrishikesh Karambelkar Indian Institute of Technology, Bombay varunk@acm.org

More information

Keyword search in databases: the power of RDBMS

Keyword search in databases: the power of RDBMS Keyword search in databases: the power of RDBMS 1 Introduc

More information

Approaches. XML Storage. Storing arbitrary XML. Mapping XML to relational. Mapping the link structure. Mapping leaf values

Approaches. XML Storage. Storing arbitrary XML. Mapping XML to relational. Mapping the link structure. Mapping leaf values XML Storage CPS 296.1 Topics in Database Systems Approaches Text files Use DOM/XSLT to parse and access XML data Specialized DBMS Lore, Strudel, exist, etc. Still a long way to go Object-oriented DBMS

More information

10.1 Physical Design: Introduction. 10 Physical schema design. Physical Design: I/O cost. Physical Design: I/O cost.

10.1 Physical Design: Introduction. 10 Physical schema design. Physical Design: I/O cost. Physical Design: I/O cost. 10 Physical schema design 10.1 Introduction Motivation Disk technology RAID 10.2 Index structures in DBS Indexing concept Primary and Secondary indexes 10.3. ISAM and B + -Trees 10.4. SQL and indexes Criteria

More information

arxiv: v1 [cs.db] 22 Apr 2011

arxiv: v1 [cs.db] 22 Apr 2011 EMBANKS: Towards Disk Based Algorithms For Keyword-Search In Structured Databases Submitted in partial fulfillment of the requirements for the degree of arxiv:1104.4384v1 [cs.db] 22 Apr 2011 Bachelor of

More information

Database Management Systems Introduction to DBMS

Database Management Systems Introduction to DBMS Database Management Systems Introduction to DBMS D B M G 1 Introduction to DBMS Data Base Management System (DBMS) A software package designed to store and manage databases We are interested in internal

More information

Ontology Based Prediction of Difficult Keyword Queries

Ontology Based Prediction of Difficult Keyword Queries Ontology Based Prediction of Difficult Keyword Queries Lubna.C*, Kasim K Pursuing M.Tech (CSE)*, Associate Professor (CSE) MEA Engineering College, Perinthalmanna Kerala, India lubna9990@gmail.com, kasim_mlp@gmail.com

More information

Databases - Relations in Databases. (N Spadaccini 2010) Relations in Databases 1 / 16

Databases - Relations in Databases. (N Spadaccini 2010) Relations in Databases 1 / 16 Databases - Relations in Databases (N Spadaccini 2010) Relations in Databases 1 / 16 Re-capping - data model A data model is a precise, conceptual description of the data stored in a database. The relational

More information

Refinement of keyword queries over structured data with ontologies and users

Refinement of keyword queries over structured data with ontologies and users Refinement of keyword queries over structured data with ontologies and users Advanced Methods of IR Elena Demidova SS 2014 Materials used in the slides: Sandeep Tata and Guy M. Lohman. SQAK: doing more

More information

Database Management Systems

Database Management Systems Database Management Systems Distributed Databases Doug Shook What does it mean to be distributed? Multiple nodes connected by a network Data on the nodes is logically related The nodes do not need to be

More information

Review Problems. Computer Science E-66 Harvard University David G. Sullivan, Ph.D. Tree-Based Index Structure Problem

Review Problems. Computer Science E-66 Harvard University David G. Sullivan, Ph.D. Tree-Based Index Structure Problem Review Problems Computer Science E-66 Harvard University David G. Sullivan, Ph.D. Tree-Based Index Structure Problem Consider the following tree-based index structure, in which the keys are a person's

More information

9/23/2009 CONFERENCES CONTINUOUS NEAREST NEIGHBOR SEARCH INTRODUCTION OVERVIEW PRELIMINARY -- POINT NN QUERIES

9/23/2009 CONFERENCES CONTINUOUS NEAREST NEIGHBOR SEARCH INTRODUCTION OVERVIEW PRELIMINARY -- POINT NN QUERIES CONFERENCES Short Name SIGMOD Full Name Special Interest Group on Management Of Data CONTINUOUS NEAREST NEIGHBOR SEARCH Yufei Tao, Dimitris Papadias, Qiongmao Shen Hong Kong University of Science and Technology

More information

Effective Keyword Search over (Semi)-Structured Big Data Mehdi Kargar

Effective Keyword Search over (Semi)-Structured Big Data Mehdi Kargar Effective Keyword Search over (Semi)-Structured Big Data Mehdi Kargar School of Computer Science Faculty of Science University of Windsor How Big is this Big Data? 40 Billion Instagram Photos 300 Hours

More information

Physical DB Issues, Indexes, Query Optimisation. Database Systems Lecture 13 Natasha Alechina

Physical DB Issues, Indexes, Query Optimisation. Database Systems Lecture 13 Natasha Alechina Physical DB Issues, Indexes, Query Optimisation Database Systems Lecture 13 Natasha Alechina In This Lecture Physical DB Issues RAID arrays for recovery and speed Indexes and query efficiency Query optimisation

More information

Keyword Search over RDF Graphs. Elisa Menendez

Keyword Search over RDF Graphs. Elisa Menendez Elisa Menendez emenendez@inf.puc-rio.br Summary Motivation Keyword Search over RDF Process Challenges Example QUIOW System Next Steps Motivation Motivation Keyword search is an easy way to retrieve information

More information

Probabilistic/Uncertain Data Management

Probabilistic/Uncertain Data Management Probabilistic/Uncertain Data Management 1. Dalvi, Suciu. Efficient query evaluation on probabilistic databases, VLDB Jrnl, 2004 2. Das Sarma et al. Working models for uncertain data, ICDE 2006. Slides

More information

5/13/2009. Introduction. Introduction. Introduction. Introduction. Introduction

5/13/2009. Introduction. Introduction. Introduction. Introduction. Introduction Applying Collaborative Filtering Techniques to Movie Search for Better Ranking and Browsing Seung-Taek Park and David M. Pennock (ACM SIGKDD 2007) Two types of technologies are widely used to overcome

More information

QUERY OPTIMIZATION FOR DATABASE MANAGEMENT SYSTEM BY APPLYING DYNAMIC PROGRAMMING ALGORITHM

QUERY OPTIMIZATION FOR DATABASE MANAGEMENT SYSTEM BY APPLYING DYNAMIC PROGRAMMING ALGORITHM QUERY OPTIMIZATION FOR DATABASE MANAGEMENT SYSTEM BY APPLYING DYNAMIC PROGRAMMING ALGORITHM Wisnu Adityo NIM 13506029 Information Technology Department Institut Teknologi Bandung Jalan Ganesha 10 e-mail:

More information

Information Retrieval Using Keyword Search Technique

Information Retrieval Using Keyword Search Technique Information Retrieval Using Keyword Search Technique Dhananjay A. Gholap, Dr.Gumaste S. V Department of Computer Engineering, Sharadchandra Pawar College of Engineering, Dumbarwadi, Otur, Pune, India ABSTRACT:

More information

Dr. Lyn Mathis Page 1

Dr. Lyn Mathis Page 1 CSIS 3222, Fall 2008, Chapter 1, 3, 4, 5 (through p. 128) Name (Six Pages) Part I: MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. (3 points each)

More information

Event Stores (I) [Source: DB-Engines.com, accessed on August 28, 2016]

Event Stores (I) [Source: DB-Engines.com, accessed on August 28, 2016] Event Stores (I) Event stores are database management systems implementing the concept of event sourcing. They keep all state changing events for an object together with a timestamp, thereby creating a

More information

Query Evaluation Strategies

Query Evaluation Strategies Introduction to Search Engine Technology Term-at-a-Time and Document-at-a-Time Evaluation Ronny Lempel Yahoo! Research (Many of the following slides are courtesy of Aya Soffer and David Carmel, IBM Haifa

More information

ISSN Vol.08,Issue.18, October-2016, Pages:

ISSN Vol.08,Issue.18, October-2016, Pages: ISSN 2348 2370 Vol.08,Issue.18, October-2016, Pages:3571-3578 www.ijatir.org Efficient Prediction of Difficult Keyword Queries Over Data Bases SHALINI ATLA 1, DEEPTHI JANAGAMA 2 1 PG Scholar, Dept of CSE,

More information

SECTION 1 DBMS LAB 1.0 INTRODUCTION 1.1 OBJECTIVES 1.2 INTRODUCTION TO MS-ACCESS. Structure Page No.

SECTION 1 DBMS LAB 1.0 INTRODUCTION 1.1 OBJECTIVES 1.2 INTRODUCTION TO MS-ACCESS. Structure Page No. SECTION 1 DBMS LAB DBMS Lab Structure Page No. 1.0 Introduction 05 1.1 Objectives 05 1.2 Introduction to MS-Access 05 1.3 Database Creation 13 1.4 Use of DBMS Tools/ Client-Server Mode 15 1.5 Forms and

More information

Keywords Machine learning, Pattern matching, Query processing, NLP

Keywords Machine learning, Pattern matching, Query processing, NLP Volume 7, Issue 3, March 2017 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Ratatta: Chatbot

More information

ROCHESTER INSTITUTE OF TECHNOLOGY. SQL Tool Kit

ROCHESTER INSTITUTE OF TECHNOLOGY. SQL Tool Kit ROCHESTER INSTITUTE OF TECHNOLOGY SQL Tool Kit Submitted by: Chandni Pakalapati Advisor: Dr.Carlos Rivero 1 Table of Contents 1. Abstract... 3 2. Introduction... 4 3. System Overview... 5 4. Implementation...

More information

Ranked Search on Data Graphs

Ranked Search on Data Graphs Ranked Search on Data Graphs Ramakrishna R. Varadarajan Doctoral Dissertation Defense FLORIDA INTERNATIONAL UNIVERSITY, School of Computing and Information Sciences, Miami. Roadmap Problem Statement &

More information

Qunits: queried units for database search

Qunits: queried units for database search Qunits: queried units for database search Arnab Nandi Computer Science, EECS University of Michigan, Ann Arbor arnab@umich.edu H.V. Jagadish Computer Science, EECS University of Michigan, Ann Arbor jag@umich.edu

More information

Department of Computer Engineering, Sharadchandra Pawar College of Engineering, Dumbarwadi, Otur, Pune, Maharashtra, India

Department of Computer Engineering, Sharadchandra Pawar College of Engineering, Dumbarwadi, Otur, Pune, Maharashtra, India Volume 5, Issue 6, June 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Information Retrieval

More information

Principles of Dataspaces

Principles of Dataspaces Principles of Dataspaces Seminar From Databases to Dataspaces Summer Term 2007 Monika Podolecheva University of Konstanz Department of Computer and Information Science Tutor: Prof. M. Scholl, Alexander

More information

Administration Naive DBMS CMPT 454 Topics. John Edgar 2

Administration Naive DBMS CMPT 454 Topics. John Edgar 2 Administration Naive DBMS CMPT 454 Topics John Edgar 2 http://www.cs.sfu.ca/coursecentral/454/johnwill/ John Edgar 4 Assignments 25% Midterm exam in class 20% Final exam 55% John Edgar 5 A database stores

More information

The functions performed by a typical DBMS are the following:

The functions performed by a typical DBMS are the following: MODULE NAME: Database Management TOPIC: Introduction to Basic Database Concepts LECTURE 2 Functions of a DBMS The functions performed by a typical DBMS are the following: Data Definition The DBMS provides

More information

Social Data Exploration

Social Data Exploration Social Data Exploration Sihem Amer-Yahia DR CNRS @ LIG Sihem.Amer-Yahia@imag.fr Big Data & Optimization Workshop 12ème Séminaire POC LIP6 Dec 5 th, 2014 Collaborative data model User space (with attributes)

More information

A Survey on Representation, Composition and Application of Preferences in Database Systems

A Survey on Representation, Composition and Application of Preferences in Database Systems A Survey on Representation, Composition and Application of Preferences in Database Systems KOSTAS STEFANIDIS Chinese University of Hong Kong, Hong Kong GEORGIA KOUTRIKA IBM Almaden Research Center, USA

More information

BASIC SQL CHAPTER 4 (6/E) CHAPTER 8 (5/E)

BASIC SQL CHAPTER 4 (6/E) CHAPTER 8 (5/E) 1 BASIC SQL CHAPTER 4 (6/E) CHAPTER 8 (5/E) 2 LECTURE OUTLINE SQL Data Definition and Data Types Specifying Constraints in SQL Basic Retrieval Queries in SQL Set Operations in SQL 3 BASIC SQL Structured

More information

Database Management Systems MIT Introduction By S. Sabraz Nawaz

Database Management Systems MIT Introduction By S. Sabraz Nawaz Database Management Systems MIT 22033 Introduction By S. Sabraz Nawaz Recommended Reading Database Management Systems 3 rd Edition, Ramakrishnan, Gehrke Murach s SQL Server 2008 for Developers Any book

More information

Enumerated Attributes for Relational Databases

Enumerated Attributes for Relational Databases Enumerated Attributes for Relational Databases Phyllis Jones and Joel Jones Department of Computer Science University of Alabama pecj@cs.ua.edu jones@cs.ua.edu Aliases Add-a-bead Problem How do we enforce

More information

Hash-Based Indexing 165

Hash-Based Indexing 165 Hash-Based Indexing 165 h 1 h 0 h 1 h 0 Next = 0 000 00 64 32 8 16 000 00 64 32 8 16 A 001 01 9 25 41 73 001 01 9 25 41 73 B 010 10 10 18 34 66 010 10 10 18 34 66 C Next = 3 011 11 11 19 D 011 11 11 19

More information

QUERY RECOMMENDATION SYSTEM USING USERS QUERYING BEHAVIOR

QUERY RECOMMENDATION SYSTEM USING USERS QUERYING BEHAVIOR International Journal of Emerging Technology and Innovative Engineering QUERY RECOMMENDATION SYSTEM USING USERS QUERYING BEHAVIOR V.Megha Dept of Computer science and Engineering College Of Engineering

More information

Searching of Nearest Neighbor Based on Keywords using Spatial Inverted Index

Searching of Nearest Neighbor Based on Keywords using Spatial Inverted Index Searching of Nearest Neighbor Based on Keywords using Spatial Inverted Index B. SATYA MOUNIKA 1, J. VENKATA KRISHNA 2 1 M-Tech Dept. of CSE SreeVahini Institute of Science and Technology TiruvuruAndhra

More information

Personalized Keyword Search Related Works ANNE JERONEN, ARMAND NOUBISIE, YUDIT PONCE

Personalized Keyword Search Related Works ANNE JERONEN, ARMAND NOUBISIE, YUDIT PONCE Personalized Keyword Search Related Works ANNE JERONEN, ARMAND NOUBISIE, YUDIT PONCE Introduction Georgia Koutrika, Alkis Simitsis, Yannis E. Ioannidis: Précis: The Essence of a Query Answer. ICDE 2006

More information

BASIC SQL CHAPTER 4 (6/E) CHAPTER 8 (5/E)

BASIC SQL CHAPTER 4 (6/E) CHAPTER 8 (5/E) 1 BASIC SQL CHAPTER 4 (6/E) CHAPTER 8 (5/E) 2 CHAPTER 4 OUTLINE SQL Data Definition and Data Types Specifying Constraints in SQL Basic Retrieval Queries in SQL Set Operations in SQL 3 BASIC SQL Structured

More information