Databases & Information Retrieval

Maya Ramanath

Further reading:
- Combining Database and Information-Retrieval Techniques for Knowledge Discovery. G. Weikum, G. Kasneci, M. Ramanath and F. M. Suchanek, CACM, April 2009.
- DB & IR: Both Sides Now. G. Weikum, keynote at SIGMOD 2007.

DB and IR: Different Motivations

Both deal with large amounts of information, but:

               DB                             IR
Applications   online reservation, banking    libraries
Emphasis       data consistency, efficiency   result quality, user satisfaction
Data           structured records             unstructured text
Queries        precise                        interpretations vary
Results        exact match / all results      ranked / top-k results

Why Combine Now?

The applications drive the need: structured and unstructured data must be managed in an integrated manner.

- Healthcare example: "Find young patients in central Europe who have been reported, in the last two weeks, to have symptoms of tropical virus diseases and an indication of anomalies."
- Newspaper archives, product catalogues, etc.

Integrating DB & IR

(Figure: a 2x2 grid of query type against data type.)

- Structured queries / Boolean-match results (SQL) over structured data (relational): DB Systems
- Unstructured queries / ranked results (keywords/top-k) over unstructured data (text): IR Systems
- Unstructured queries over structured data: top-k processing, keyword search on graphs, effective query interfaces, ranking for structured data
- Structured queries over unstructured data: query processing for text search, extracting entities and relationships, ranking for entities

Modules

1. Top-k Processing
2. Query Processing and Interfaces
3. Keyword Search on Graphs
4. Entity and Relationship Extraction
5. Ranking and Structured Data

1. Top-k Processing (1/2)

Structured data, with scores in multiple dimensions. Return the top-k objects.

Color                 Mileage               Service
BMW X1         0.9    Honda City     0.8    Tata Nano      0.7
Honda City     0.8    Maruti Swift   0.6    Maruti Swift   0.6
Maruti Swift   0.6    Tata Nano      0.3    Honda City     0.3
Tata Nano      0.1    BMW X1         0.1    BMW X1         0.1

Score(O) = \sum_{i \in \{color, mileage, service\}} S_i(O)
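The slide does not say how the top-k objects are computed. The sketch below uses one well-known option, a threshold-style scan in the spirit of Fagin's Threshold Algorithm. It assumes each dimension is a list sorted by descending score, that the aggregate is the plain sum from the slide, and that the three lists map to color, mileage and service in the order they appear. The function name `threshold_top_k` is hypothetical.

```python
import heapq


def threshold_top_k(ranked_lists, k):
    """Return (top-k [(object, total score)], number of sorted-access rounds)."""
    # Random-access tables: dimension -> {object: score}.
    lookup = {dim: dict(entries) for dim, entries in ranked_lists.items()}
    totals = {}
    best = []
    max_depth = max(len(entries) for entries in ranked_lists.values())
    for depth in range(max_depth):
        # The threshold is the best total any still-unseen object could reach.
        threshold = 0.0
        for entries in ranked_lists.values():
            if depth >= len(entries):
                continue
            obj, score = entries[depth]
            threshold += score
            if obj not in totals:
                # Random access: fetch this object's score in every dimension.
                totals[obj] = sum(table.get(obj, 0.0) for table in lookup.values())
        best = heapq.nlargest(k, totals.items(), key=lambda item: item[1])
        # Stop once the k-th best seen object is at least as good as the threshold.
        if len(best) == k and best[-1][1] >= threshold:
            return best, depth + 1
    return best, max_depth


# Ranked lists as they appear on the slide (assumed order: color, mileage, service).
cars = {
    "color": [("BMW X1", 0.9), ("Honda City", 0.8), ("Maruti Swift", 0.6), ("Tata Nano", 0.1)],
    "mileage": [("Honda City", 0.8), ("Maruti Swift", 0.6), ("Tata Nano", 0.3), ("BMW X1", 0.1)],
    "service": [("Tata Nano", 0.7), ("Maruti Swift", 0.6), ("Honda City", 0.3), ("BMW X1", 0.1)],
}

top2, rounds = threshold_top_k(cars, 2)
print([name for name, _ in top2], rounds)
```

With k = 2 the scan can stop after three of the four rows, because no unseen object could beat the two best totals found so far.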

1. Top-k Processing (2/2)

Top-k joins. Example: return the best house-school pair.

Houses   Rating   Location
H1       0.9      L1
H2       0.8      L2
H3       0.6      L3
H4       0.1      L3

Schools  Rating   Location
S1       0.4      L2
S2       0.2      L2
S3       0.8      L3
S4       0.1      L3
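A minimal sketch of the house-school example, under two assumptions the slide leaves open: that pairs join on equal location, and that a pair's score is the sum of the two ratings. It materialises every joining pair, which a real top-k join operator would try to avoid. The helper name `top_k_pairs` is hypothetical.

```python
import heapq
from collections import defaultdict

# (name, rating, location) rows copied from the slide.
houses = [("H1", 0.9, "L1"), ("H2", 0.8, "L2"), ("H3", 0.6, "L3"), ("H4", 0.1, "L3")]
schools = [("S1", 0.4, "L2"), ("S2", 0.2, "L2"), ("S3", 0.8, "L3"), ("S4", 0.1, "L3")]


def top_k_pairs(houses, schools, k):
    """Join on location and return the k best (house, school, score) triples."""
    schools_by_location = defaultdict(list)
    for name, rating, location in schools:
        schools_by_location[location].append((name, rating))
    # Assumed scoring function: sum of the house rating and the school rating.
    pairs = (
        (house, school, house_rating + school_rating)
        for house, house_rating, location in houses
        for school, school_rating in schools_by_location.get(location, [])
    )
    return heapq.nlargest(k, pairs, key=lambda pair: pair[2])


best_pairs = top_k_pairs(houses, schools, 2)
print([(house, school) for house, school, _ in best_pairs])
```

The best-rated house, H1, appears in no result because no school shares its location. A join cannot simply pair the top entry of each list.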

2. Query Processing and Interfaces (1/3)

Given: a database of text documents and a text-centric task, e.g. extract information about disease outbreaks.

Strategies:
- Scan all documents: very expensive
- Filter promising documents: affects recall

Develop cost models and execution strategies appropriate for this setting.
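The trade-off between the two strategies can be illustrated with a toy sketch. Everything here is an assumption made for illustration: the four invented sentences, the regular-expression "extractor", and the keyword filter. The slide does not prescribe any of them.

```python
import re

# Invented toy documents; only the first two mention an outbreak.
DOCUMENTS = [
    "Officials confirmed a cholera outbreak in Haiti last week.",
    "A dengue epidemic in Brazil has strained hospitals.",
    "The stock market closed higher on Friday.",
    "The local team won the championship.",
]

# Hypothetical extractor: "<disease> outbreak|epidemic in <place>".
PATTERN = re.compile(r"(\w+) (?:outbreak|epidemic) in (\w+)")


def extract(document):
    """Return the set of (disease, place) pairs found in one document."""
    return set(PATTERN.findall(document))


def run(documents, keep=lambda document: True):
    """Run the extractor on the documents that pass `keep`.

    Returns the extracted facts and how many documents reached the extractor.
    """
    facts, processed = set(), 0
    for document in documents:
        if keep(document):
            processed += 1
            facts |= extract(document)
    return facts, processed


scan_facts, scan_cost = run(DOCUMENTS)
filter_facts, filter_cost = run(DOCUMENTS, keep=lambda document: "outbreak" in document)
recall = len(filter_facts & scan_facts) / len(scan_facts)
print(scan_cost, filter_cost, recall)
```

The filter sends one document to the extractor instead of four, but misses the "epidemic" sentence, so recall against the full scan drops to one half. A cost model would weigh exactly this kind of saving against the loss.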

2. Query Processing and Interfaces (2/3)

Querying with typed keywords

- Keyword querying: easy to use
- Structured queries: precise
- Find the middle ground

Instead of the keyword query
    german has won nobel award
or the structured query
    q(x) :- GERMAN(x), haswonprize(x,y), NOBEL_PRIZE(y)
use typed keywords:
    german, has won (nobel award)
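To make the structured form concrete, the sketch below evaluates the conjunctive query q(x) :- GERMAN(x), haswonprize(x,y), NOBEL_PRIZE(y) over a tiny hand-made fact base. The three relations and their contents are illustrative assumptions, not data from the slides.

```python
# Unary relations (types) and one binary relation, as plain Python sets.
GERMAN = {"Günter Grass", "Angela Merkel"}
NOBEL_PRIZE = {"Nobel Prize in Literature", "Nobel Prize in Physics"}
HAS_WON_PRIZE = {
    ("Günter Grass", "Nobel Prize in Literature"),
    ("Marie Curie", "Nobel Prize in Physics"),
}


def q():
    """Evaluate q(x) :- GERMAN(x), haswonprize(x, y), NOBEL_PRIZE(y)."""
    return sorted(
        {x for (x, y) in HAS_WON_PRIZE if x in GERMAN and y in NOBEL_PRIZE}
    )


print(q())
```

Every atom of the query has to hold at once, which is what makes the structured form precise. A typed-keyword interface would have to map "german" and "nobel award" to the two types and "has won" to the relation before a query like this could be run.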

2. Query Processing and Interfaces (3/3)

Does the output have to be a boring list of ranked results? Nope!

3. Keyword Search on Graphs (1/3)

Lots of graphs around:
- Relational DB (tuples + foreign keys)
- XML data (elements/sub-elements/id/idrefs)
- RDF (graph-structured knowledge bases)

Easy to query with keywords, instead of SQL/XQuery/SPARQL.

Results are the top-k interconnections between the keywords.

3. Keyword Search on Graphs (2/3)

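3. Keyword Search on Graphs (3/3)

Query: Einstein, Bohr

(Figure: example graph. Edges recoverable from the labels:)
- Einstein isa vegetarian
- Tom Cruise isa vegetarian
- Tom Cruise bornin 1962
- Einstein won Nobel Prize
- Bohr won Nobel Prize
- Bohr diedin 1962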
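As a rough illustration of "top-k interconnections", the sketch below enumerates simple paths between the two query keywords and ranks shorter paths first. The edge list is a reading of the slide's figure labels and may not match the original drawing exactly. Treating edges as undirected and ranking by path length are assumptions made here, not the ranking model of any particular system.

```python
# Labelled edges (subject, label, object) as read off the slide's figure.
EDGES = [
    ("Einstein", "isa", "vegetarian"),
    ("Tom Cruise", "isa", "vegetarian"),
    ("Tom Cruise", "bornin", "1962"),
    ("Einstein", "won", "Nobel Prize"),
    ("Bohr", "won", "Nobel Prize"),
    ("Bohr", "diedin", "1962"),
]


def build_adjacency(edges):
    """Undirected adjacency map: node -> list of neighbouring nodes."""
    adjacency = {}
    for subject, _label, obj in edges:
        adjacency.setdefault(subject, []).append(obj)
        adjacency.setdefault(obj, []).append(subject)
    return adjacency


def top_k_connections(edges, source, target, k):
    """Return the k shortest simple paths between two keyword nodes."""
    adjacency = build_adjacency(edges)
    found = []

    def extend(path):
        node = path[-1]
        if node == target:
            found.append(list(path))
            return
        for neighbour in adjacency.get(node, []):
            if neighbour not in path:  # keep paths simple (no repeated nodes)
                path.append(neighbour)
                extend(path)
                path.pop()

    extend([source])
    return sorted(found, key=len)[:k]


connections = top_k_connections(EDGES, "Einstein", "Bohr", 2)
for path in connections:
    print(" - ".join(path))
```

On this graph the shared Nobel Prize ranks above the longer chain through vegetarian, Tom Cruise and 1962, which matches the intuition that the tighter connection is the more informative answer. Exhaustive path enumeration only works for toy graphs.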

4. Entity and Relationship Extraction (1/2)

Information Extraction (or Knowledge Harvesting)

Source text:
- Bill Gates was the founder of Microsoft and later its CEO.
- Apple was established on April 1, 1976 by Steve Jobs, Steve Wozniak, and Ronald Wayne.
- Infosys was founded on 2 July 1981 by seven entrepreneurs: N. R. Narayana Murthy, Nandan Nilekani, ...

Extracted facts:

Company     Founder
Microsoft   Bill Gates
Apple       Steve Jobs
Apple       Steve Wozniak
Infosys     N. R. Narayana Murthy
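A minimal sketch of rule-based extraction for the first two example sentences. The two regular expressions are hand-written assumptions that cover only these phrasings. They are not the extraction rules of any real system, and they would need extending for the Infosys sentence.

```python
import re

SENTENCES = [
    "Bill Gates was the founder of Microsoft and later its CEO.",
    "Apple was established on April 1, 1976 by Steve Jobs, Steve Wozniak, and Ronald Wayne.",
]

# Rule 1: "<Person> was the founder of <Company>"
FOUNDER_OF = re.compile(
    r"(?P<founder>[A-Z]\w*(?: [A-Z]\w*)*) was the founder of (?P<company>[A-Z]\w*)"
)
# Rule 2: "<Company> was established|founded on <date> by <Person>, <Person>, and <Person>."
FOUNDED_BY = re.compile(
    r"(?P<company>[A-Z]\w*) was (?:established|founded) on .+? by (?P<founders>.+?)\.?$"
)
# Splits "A, B, and C" or "A and B" into individual names.
NAME_SEPARATOR = re.compile(r",\s*(?:and\s+)?|\s+and\s+")


def extract_founders(sentence):
    """Return a list of (company, founder) facts found in one sentence."""
    facts = []
    match = FOUNDER_OF.search(sentence)
    if match:
        facts.append((match.group("company"), match.group("founder")))
    match = FOUNDED_BY.search(sentence)
    if match:
        for name in NAME_SEPARATOR.split(match.group("founders")):
            facts.append((match.group("company"), name.strip()))
    return facts


facts = [fact for sentence in SENTENCES for fact in extract_founders(sentence)]
for company, founder in facts:
    print(company, "|", founder)
```

Rules like these are precise on the phrasings they were written for and brittle on everything else.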

4. Entity and Relationship Extraction (2/2)

How to build a knowledge base of facts?
- Structurize Wikipedia
- Construct rules for extraction

How do I acquire all the facts in the world?
- Extract everything
- Don't stop extracting

5. Ranking and Structured Data

Not the same as top-k processing.

Given: data with structure in it
- Relational tables (flat)
- XML (trees/graphs)
- Text documents consisting of entities

Task: rank the results of queries posed as SQL, XQuery, or typed keywords.

Questions?

A Keyword-based Structured Query Language

A Keyword-based Structured Query Language Expressive and Flexible Access to Web-Extracted Data : A Keyword-based Structured Query Language Department of Computer Science and Engineering Indian Institute of Technology Delhi 22th September 2011

More information

Databases and Information Retrieval Integration TIETS42. Kostas Stefanidis Autumn 2016

Databases and Information Retrieval Integration TIETS42. Kostas Stefanidis Autumn 2016 + Databases and Information Retrieval Integration TIETS42 Autumn 2016 Kostas Stefanidis kostas.stefanidis@uta.fi http://www.uta.fi/sis/tie/dbir/index.html http://people.uta.fi/~kostas.stefanidis/dbir16/dbir16-main.html

More information

Effective Searching of RDF Knowledge Bases

Effective Searching of RDF Knowledge Bases Effective Searching of RDF Knowledge Bases Shady Elbassuoni Joint work with: Maya Ramanath and Gerhard Weikum RDF Knowledge Bases Annie Hall is a 1977 American romantic comedy directed by Woody Allen and

More information

SAS Certification Handout #7: Ch

SAS Certification Handout #7: Ch SAS Certification Handout #7: Ch. 19-21 /************ Ch. 19 ********************/ /* Consider a mailing list example, partial from http://mlb.mlb.com/team/ 1---+----10---+----20---+ Kansas City Royals

More information

NAGA: Searching and Ranking Knowledge. Gjergji Kasneci, Fabian M. Suchanek, Georgiana Ifrim, Maya Ramanath, and Gerhard Weikum

NAGA: Searching and Ranking Knowledge. Gjergji Kasneci, Fabian M. Suchanek, Georgiana Ifrim, Maya Ramanath, and Gerhard Weikum NAGA: Searching and Ranking Knowledge Gjergji Kasneci, Fabian M. Suchanek, Georgiana Ifrim, Maya Ramanath, and Gerhard Weikum MPI I 2007 5 001 March 2007 Authors Addresses Gjergji Kasneci Max-Planck-Institut

More information

Tools and Infrastructure for Supporting Enterprise Knowledge Graphs

Tools and Infrastructure for Supporting Enterprise Knowledge Graphs Tools and Infrastructure for Supporting Enterprise Knowledge Graphs Sumit Bhatia, Nidhi Rajshree, Anshu Jain, and Nitish Aggarwal IBM Research sumitbhatia@in.ibm.com, {nidhi.rajshree,anshu.n.jain}@us.ibm.com,nitish.aggarwal@ibm.com

More information

From ER Diagrams to the Relational Model. Rose-Hulman Institute of Technology Curt Clifton

From ER Diagrams to the Relational Model. Rose-Hulman Institute of Technology Curt Clifton From ER Diagrams to the Relational Model Rose-Hulman Institute of Technology Curt Clifton Review Entity Sets and Attributes Entity set: collection of things in the DB Attribute: property of an entity calories

More information

CS490W: Web Information Search & Management. CS-490W Web Information Search and Management. Luo Si. Department of Computer Science Purdue University

CS490W: Web Information Search & Management. CS-490W Web Information Search and Management. Luo Si. Department of Computer Science Purdue University CS490W: Web Information Search & Management CS-490W Web Information Search and Management Luo Si Department of Computer Science Purdue University Overview Web: Growth of the Web The world produces between

More information

CS-490WIR Web Information Retrieval and Management. Luo Si

CS-490WIR Web Information Retrieval and Management. Luo Si CS490W: Web Information Retrieval & Management CS-490WIR Web Information Retrieval and Management Luo Si Department of Computer Science Purdue University Overview Web: Growth of the Web The world produces

More information

XML RETRIEVAL. Introduction to Information Retrieval CS 150 Donald J. Patterson

XML RETRIEVAL. Introduction to Information Retrieval CS 150 Donald J. Patterson Introduction to Information Retrieval CS 150 Donald J. Patterson Content adapted from Manning, Raghavan, and Schütze http://www.informationretrieval.org OVERVIEW Introduction Basic XML Concepts Challenges

More information

Presented by: Dimitri Galmanovich. Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Paşca, Warren Shen, Gengxin Miao, Chung Wu

Presented by: Dimitri Galmanovich. Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Paşca, Warren Shen, Gengxin Miao, Chung Wu Presented by: Dimitri Galmanovich Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Paşca, Warren Shen, Gengxin Miao, Chung Wu 1 When looking for Unstructured data 2 Millions of such queries every day

More information

Information Governance: What About Business Applications & Structured Data? Webinar for ACC Hosted by Navigant March 19, 2015

Information Governance: What About Business Applications & Structured Data? Webinar for ACC Hosted by Navigant March 19, 2015 Information Governance: What About Business Applications & Structured Data? Webinar for ACC Hosted by Navigant March 19, 2015 Introduction» Introduction of Pamela Strong Legal Technology Solutions (LTS)

More information

Informa/on Retrieval. Text Search. CISC437/637, Lecture #23 Ben CartereAe. Consider a database consis/ng of long textual informa/on fields

Informa/on Retrieval. Text Search. CISC437/637, Lecture #23 Ben CartereAe. Consider a database consis/ng of long textual informa/on fields Informa/on Retrieval CISC437/637, Lecture #23 Ben CartereAe Copyright Ben CartereAe 1 Text Search Consider a database consis/ng of long textual informa/on fields News ar/cles, patents, web pages, books,

More information

SPARK: Top-k Keyword Query in Relational Database

SPARK: Top-k Keyword Query in Relational Database SPARK: Top-k Keyword Query in Relational Database Wei Wang University of New South Wales Australia 20/03/2007 1 Outline Demo & Introduction Ranking Query Evaluation Conclusions 20/03/2007 2 Demo 20/03/2007

More information

Multiple-Choice. 1. Which of the following is equivalent to a table? (3 pts.) a. record b. relation c. relationship d. constraint e.

Multiple-Choice. 1. Which of the following is equivalent to a table? (3 pts.) a. record b. relation c. relationship d. constraint e. Database Design, CSCI 340, Spring 2016 2 nd Exam, April 1 Multiple-Choice 1. Which of the following is equivalent to a table? (3 pts.) a. record b. relation c. relationship d. constraint e. schema 2. Which

More information

Chapter 6. Foundations of Business Intelligence: Databases and Information Management VIDEO CASES

Chapter 6. Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Chapter 6 Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Case 1a: City of Dubuque Uses Cloud Computing and Sensors to Build a Smarter, Sustainable City Case 1b:

More information

Information Retrieval

Information Retrieval https://vvtesh.sarahah.com/ Information Retrieval Venkatesh Vinayakarao Term: Aug Dec, 2018 Indian Institute of Information Technology, Sri City My whole life, I ve been a seeker, searching for something.

More information

DB Project. Database Systems Spring 2013

DB Project. Database Systems Spring 2013 DB Project Database Systems Spring 2013 1 Database project YAGO Yet Another Great Ontology 2 About YAGO * A huge semantic knowledge base knowledge base a special kind of DB for knowledge management (e.g.,

More information

Introduction to Databases CS348

Introduction to Databases CS348 Introduction to Databases CS348 University of Waterloo Winter 2007 University of Waterloo () Introduction to Databases 1 / 20 Course Outline Why do we use Databases? How do we use a DBMS? Functionality

More information

Information Retrieval

Information Retrieval Multimedia Computing: Algorithms, Systems, and Applications: Information Retrieval and Search Engine By Dr. Yu Cao Department of Computer Science The University of Massachusetts Lowell Lowell, MA 01854,

More information

Keyword query interpretation over structured data

Keyword query interpretation over structured data Keyword query interpretation over structured data Advanced Methods of Information Retrieval Elena Demidova SS 2018 Elena Demidova: Advanced Methods of Information Retrieval SS 2018 1 Recap Elena Demidova:

More information

Introduction to Information Retrieval. Lecture Outline

Introduction to Information Retrieval. Lecture Outline Introduction to Information Retrieval Lecture 1 CS 410/510 Information Retrieval on the Internet Lecture Outline IR systems Overview IR systems vs. DBMS Types, facets of interest User tasks Document representations

More information

A Comparative Study Weighting Schemes for Double Scoring Technique

A Comparative Study Weighting Schemes for Double Scoring Technique , October 19-21, 2011, San Francisco, USA A Comparative Study Weighting Schemes for Double Scoring Technique Tanakorn Wichaiwong Member, IAENG and Chuleerat Jaruskulchai Abstract In XML-IR systems, the

More information

OKKAM-based instance level integration

OKKAM-based instance level integration OKKAM-based instance level integration Paolo Bouquet W3C RDF2RDB This work is co-funded by the European Commission in the context of the Large-scale Integrated project OKKAM (GA 215032) RoadMap Using the

More information

Directory Search Engines Searching the Yahoo Directory

Directory Search Engines Searching the Yahoo Directory Searching on the WWW Directory Oriented Search Engines Often looking for some specific information WWW has a growing collection of Search Engines to aid in locating information The Search Engines return

More information

Chapter 2. Architecture of a Search Engine

Chapter 2. Architecture of a Search Engine Chapter 2 Architecture of a Search Engine Search Engine Architecture A software architecture consists of software components, the interfaces provided by those components and the relationships between them

More information

Forensic and Log Analysis GUI

Forensic and Log Analysis GUI Forensic and Log Analysis GUI David Collett I am not representing my Employer April 2005 1 Introduction motivations and goals For sysadmins Agenda log analysis basic investigations, data recovery For forensics

More information

GRAPH DB S & APPLICATIONS

GRAPH DB S & APPLICATIONS GRAPH DB S & APPLICATIONS DENIS VRDOLJAK GUNNAR KLEEMANN UC BERKELEY SCHOOL OF INFORMATION BERKELEY DATA SCIENCE GROUP, LLC PRESENTATION ROAD MAP Intro Background Examples Our Work Graph Databases Intro

More information

Information Retrieval CSCI

Information Retrieval CSCI Information Retrieval CSCI 4141-6403 My name is Anwar Alhenshiri My email is: anwar@cs.dal.ca I prefer: aalhenshiri@gmail.com The course website is: http://web.cs.dal.ca/~anwar/ir/main.html 5/6/2012 1

More information

CSE 344 Midterm. Friday, February 8, 2013, 9:30-10:20. Question Points Score Total: 100

CSE 344 Midterm. Friday, February 8, 2013, 9:30-10:20. Question Points Score Total: 100 CSE 344 Midterm Friday, February 8, 2013, 9:30-10:20 Name: Question Points Score 1 60 2 30 3 10 Total: 100 This exam is open book and open notes but NO laptops or other portable devices. You have 50 minutes;

More information

Peter T. Wood. Third Alberto Mendelzon International Workshop on Foundations of Data Management

Peter T. Wood. Third Alberto Mendelzon International Workshop on Foundations of Data Management Languages Languages query School of Computer Science and Information Systems Birkbeck, University of London ptw@dcs.bbk.ac.uk Third Alberto Mendelzon International Workshop on Foundations of Data Management

More information

Computer Basics. Computer Technology

Computer Basics. Computer Technology Computer Basics Computer Technology What is a Computer Information Processor Input Output Processing Storage Are physical parts like monitor, mouse, keyboard essential? Computer History Abacus 3,000 B.C.

More information

Shortest paths on large graphs: Systems, Algorithms, Applications

Shortest paths on large graphs: Systems, Algorithms, Applications Shortest paths on large graphs: Systems, Algorithms, Applications Andrey Gubichev TU München January 2012 Andrey Gubichev Shortest paths on large graphs 1 / 53 Outline Introduction Systems Algorithms Applications

More information

TriAD: A Distributed Shared-Nothing RDF Engine based on Asynchronous Message Passing

TriAD: A Distributed Shared-Nothing RDF Engine based on Asynchronous Message Passing TriAD: A Distributed Shared-Nothing RDF Engine based on Asynchronous Message Passing Sairam Gurajada, Stephan Seufert, Iris Miliaraki, Martin Theobald Databases & Information Systems Group ADReM Research

More information

Database Optimization Techniques Applied to a Database Contain Soy and Corn Based Products to Ensure a Quick Search in this Database

Database Optimization Techniques Applied to a Database Contain Soy and Corn Based Products to Ensure a Quick Search in this Database Bulletin UASVM Horticulture, 69(2)/2012 Print ISSN 1843-5254; Electronic ISSN 1843-5394 Database Optimization Techniques Applied to a Database Contain Soy and Corn Based Products to Ensure a Quick Search

More information

Outline. Morning program Preliminaries Semantic matching Learning to rank Entities

Outline. Morning program Preliminaries Semantic matching Learning to rank Entities 112 Outline Morning program Preliminaries Semantic matching Learning to rank Afternoon program Modeling user behavior Generating responses Recommender systems Industry insights Q&A 113 are polysemic Finding

More information

RESOURCE DISCOVERY PAST WORK AND FUTURE PLANS

RESOURCE DISCOVERY PAST WORK AND FUTURE PLANS RESOURCE DISCOVERY PAST WORK AND FUTURE PLANS Mandy Stewart Resource Discovery Research and Projects Manager May 2013 The Implementation of Primo Primo was a 1 st step in implementing new search and navigation

More information

DEC Computer Technology LESSON 6: DATABASES AND WEB SEARCH ENGINES

DEC Computer Technology LESSON 6: DATABASES AND WEB SEARCH ENGINES DEC. 1-5 Computer Technology LESSON 6: DATABASES AND WEB SEARCH ENGINES Monday Overview of Databases A web search engine is a large database containing information about Web pages that have been registered

More information

Multimedia Information Systems

Multimedia Information Systems Multimedia Information Systems Samson Cheung EE 639, Fall 2004 Lecture 6: Text Information Retrieval 1 Digital Video Library Meta-Data Meta-Data Similarity Similarity Search Search Analog Video Archive

More information

the internet premier source for finding BIOS Updates

the internet premier source for finding BIOS Updates the internet premier source for finding BIOS Updates Media Kit 2011 About Wim s BIOS Wim s BIOS (http://www.wimsbios.com) is the internet premier source for finding BIOS Upgrades. Started in 1996, the

More information

Large Scale Semantic Annotation, Indexing, and Search at The National Archives Diana Maynard Mark Greenwood

Large Scale Semantic Annotation, Indexing, and Search at The National Archives Diana Maynard Mark Greenwood Large Scale Semantic Annotation, Indexing, and Search at The National Archives Diana Maynard Mark Greenwood University of Sheffield, UK 1 Burning questions you may have... In the last 3 years, which female

More information

Keyword search in relational databases. By SO Tsz Yan Amanda & HON Ka Lam Ethan

Keyword search in relational databases. By SO Tsz Yan Amanda & HON Ka Lam Ethan Keyword search in relational databases By SO Tsz Yan Amanda & HON Ka Lam Ethan 1 Introduction Ubiquitous relational databases Need to know SQL and database structure Hard to define an object 2 Query representation

More information

$533,151 grant to Wayne State University

$533,151 grant to Wayne State University Source: Thinkstock July 03, 2017 - The National Institute of Mental Health and the National Institutes of Health (NIH) awarded a four-year $533,151 grant to Wayne State University (https://research.wayne.edu/news.php?id=24103)

More information

December 4, BigData 2017 Enterprise Knowledge Graphs for Large Scale Analytics 1/47

December 4, BigData 2017 Enterprise Knowledge Graphs for Large Scale Analytics 1/47 December 4, 2017 BigData 2017 Enterprise Knowledge Graphs for Large Scale Analytics 1/47 Knowledge Graphs Analytics Knowledge Graph Analytics Finding Entities of Interest Entity Search and Recommendation

More information

Multi-agent and Semantic Web Systems: RDF Data Structures

Multi-agent and Semantic Web Systems: RDF Data Structures Multi-agent and Semantic Web Systems: RDF Data Structures Fiona McNeill School of Informatics 31st January 2013 Fiona McNeill Multi-agent Semantic Web Systems: RDF Data Structures 31st January 2013 0/25

More information

CS50 Quiz Review. November 13, 2017

CS50 Quiz Review. November 13, 2017 CS50 Quiz Review November 13, 2017 Info http://docs.cs50.net/2017/fall/quiz/about.html 48-hour window in which to take the quiz. You should require much less than that; expect an appropriately-scaled down

More information

Querying Wikipedia Documents and Relationships

Querying Wikipedia Documents and Relationships Querying Wikipedia Documents and Relationships Huong Nguyen Thanh Nguyen Hoa Nguyen Juliana Freire School of Computing and SCI Institute, University of Utah {huongnd,thanhh,thanhhoa,juliana}@cs.utah.edu

More information

Introduction to Information Retrieval

Introduction to Information Retrieval Introduction to Information Retrieval http://informationretrieval.org IIR 10: XML Retrieval Hinrich Schütze, Christina Lioma Center for Information and Language Processing, University of Munich 2010-07-12

More information

Indexing and Query Processing. What will we cover?

Indexing and Query Processing. What will we cover? Indexing and Query Processing CS 510 Winter 2007 1 What will we cover? Key concepts and terminology Inverted index structures Organization, creation, maintenance Compression Distribution Answering queries

More information

Keyword query interpretation over structured data

Keyword query interpretation over structured data Keyword query interpretation over structured data Advanced Methods of IR Elena Demidova Materials used in the slides: Jeffrey Xu Yu, Lu Qin, Lijun Chang. Keyword Search in Databases. Synthesis Lectures

More information

Ranked Retrieval. Evaluation in IR. One option is to average the precision scores at discrete. points on the ROC curve But which points?

Ranked Retrieval. Evaluation in IR. One option is to average the precision scores at discrete. points on the ROC curve But which points? Ranked Retrieval One option is to average the precision scores at discrete Precision 100% 0% More junk 100% Everything points on the ROC curve But which points? Recall We want to evaluate the system, not

More information

Knowledge Graphs: In Theory and Practice

Knowledge Graphs: In Theory and Practice Knowledge Graphs: In Theory and Practice Nitish Aggarwal, IBM Watson, USA, Sumit Bhatia, IBM Research, India Saeedeh Shekarpour, Knoesis Research Centre Ohio, USA Amit Sheth, Knoesis Research Centre Ohio,

More information

Knowledge Graphs: In Theory and Practice

Knowledge Graphs: In Theory and Practice Knowledge Graphs: In Theory and Practice Sumit Bhatia 1 and Nitish Aggarwal 2 1 IBM Research, New Delhi, India 2 IBM Watson, San Jose, CA sumitbhatia@in.ibm.com, nitish.aggarwal@ibm.com November 10, 2017

More information

A B2B Search Engine. Abstract. Motivation. Challenges. Technical Report

A B2B Search Engine. Abstract. Motivation. Challenges. Technical Report Technical Report A B2B Search Engine Abstract In this report, we describe a business-to-business search engine that allows searching for potential customers with highly-specific queries. Currently over

More information

Intranet Search. Exploiting Databases for Document Retrieval. Christoph Mangold Universität Stuttgart

Intranet Search. Exploiting Databases for Document Retrieval. Christoph Mangold Universität Stuttgart Intranet Search Exploiting Databases for Document Retrieval Christoph Mangold Universität Stuttgart 2 /6 The Big Picture: Assume. there is a glueing problem with product P7 Has this happened before? Is

More information

eresearch A Max Planck Perspective Kurt Mehlhorn Max-Planck-Institut für Informatik Interim Head, Max Planck Digital Library

eresearch A Max Planck Perspective Kurt Mehlhorn Max-Planck-Institut für Informatik Interim Head, Max Planck Digital Library eresearch A Max Planck Perspective Kurt Mehlhorn Max-Planck-Institut für Informatik Interim Head, Max Planck Digital Library 1 Overview The MPG eresearch, a Definition Max Planck Digital Library (MPDL):

More information

itrails: Pay-as-you-go Information Integration in Dataspaces

itrails: Pay-as-you-go Information Integration in Dataspaces itrails: Pay-as-you-go Information Integration in Dataspaces Marcos Vaz Salles Jens Dittrich Shant Karakashian Olivier Girard Lukas Blunschi ETH Zurich VLDB 2007 Outline Motivation itrails Experiments

More information

Database Administration. Database Administration CSCU9Q5. The Data Dictionary. 31Q5/IT31 Database P&A November 7, Overview:

Database Administration. Database Administration CSCU9Q5. The Data Dictionary. 31Q5/IT31 Database P&A November 7, Overview: Database Administration CSCU9Q5 Slide 1 Database Administration Overview: Data Dictionary Data Administrator Database Administrator Distributed Databases Slide 2 The Data Dictionary A DBMS must provide

More information

WHITE PAPER DATA DICATIONARY DEFINING AND USING A COMMON LANGUAGE

WHITE PAPER DATA DICATIONARY DEFINING AND USING A COMMON LANGUAGE DATA DICATIONARY DEFINING AND USING A COMMON LANGUAGE TABLE OF CONTENTS INTRODUCTION 3 DEFINED 3 NEED FOR 3 USES OF 3 ANATOMY OF 4 BENEFITS OF USING 4 2 P a g e INTRODUCTION The success of any business

More information

CSE 444 Final Exam. August 21, Question 1 / 15. Question 2 / 25. Question 3 / 25. Question 4 / 15. Question 5 / 20.

CSE 444 Final Exam. August 21, Question 1 / 15. Question 2 / 25. Question 3 / 25. Question 4 / 15. Question 5 / 20. CSE 444 Final Exam August 21, 2009 Name Question 1 / 15 Question 2 / 25 Question 3 / 25 Question 4 / 15 Question 5 / 20 Total / 100 CSE 444 Final, August 21, 2009 Page 1 of 10 Question 1. B+ trees (15

More information

Next Generation DWH Modeling. An overview of DWH modeling methods

Next Generation DWH Modeling. An overview of DWH modeling methods Next Generation DWH Modeling An overview of DWH modeling methods Ronald Kunenborg www.grundsatzlich-it.nl Topics Where do we stand today Data storage and modeling through the ages Current data warehouse

More information

CS 4460 Intro. to Information Visualization September 15, 2017 John Stasko

CS 4460 Intro. to Information Visualization September 15, 2017 John Stasko Case Study: Jigsaw CS 4460 Intro. to Information Visualization September 15, 2017 John Stasko Learning Objectives Become familiar with investigative analysis process carried out by various types of analysts

More information

Information Retrieval (Part 1)

Information Retrieval (Part 1) Information Retrieval (Part 1) Fabio Aiolli http://www.math.unipd.it/~aiolli Dipartimento di Matematica Università di Padova Anno Accademico 2008/2009 1 Bibliographic References Copies of slides Selected

More information

Digital Libraries: Interoperability

Digital Libraries: Interoperability Digital Libraries: Interoperability RAFFAELLA BERNARDI UNIVERSITÀ DEGLI STUDI DI TRENTO P.ZZA VENEZIA, ROOM: 2.05, E-MAIL: BERNARDI@DISI.UNITN.IT Contents 1 Interoperability...............................................

More information

COMP718: Ontologies and Knowledge Bases

COMP718: Ontologies and Knowledge Bases 1/35 COMP718: Ontologies and Knowledge Bases Lecture 9: Ontology/Conceptual Model based Data Access Maria Keet email: keet@ukzn.ac.za home: http://www.meteck.org School of Mathematics, Statistics, and

More information

Database Intro. INFO/CSE 100, Spring 2005 Fluency in Information Technology.

Database Intro. INFO/CSE 100, Spring 2005 Fluency in Information Technology. Database Intro INFO/CSE 100, Spring 2005 Fluency in Information Technology http://www.cs.washington.edu/100 5/20/05 fit100-21-databases 2005 University of Washington 1 Reading Readings and References»

More information

BI/DWH Test specifics

BI/DWH Test specifics BI/DWH Test specifics Jaroslav.Strharsky@s-itsolutions.at 26/05/2016 Page me => TestMoto: inadequate test scope definition? no problem problem cold be only bad test strategy more than 16 years in IT more

More information

Technology In Action, Complete, 14e (Evans et al.) Chapter 11 Behind the Scenes: Databases and Information Systems

Technology In Action, Complete, 14e (Evans et al.) Chapter 11 Behind the Scenes: Databases and Information Systems Technology In Action, Complete, 14e (Evans et al.) Chapter 11 Behind the Scenes: Databases and Information Systems 1) A is a collection of related data that can be stored, sorted, organized, and queried.

More information

Transaction Management Exercises KEY

Transaction Management Exercises KEY Transaction Management Exercises KEY I/O and CPU activities can be and are overlapped to minimize (disk and processor) idle time and to maximize throughput (units of work per time unit). This motivates

More information

LinkedMDB. The first linked data source dedicated to movies

LinkedMDB. The first linked data source dedicated to movies Oktie Hassanzadeh Mariano Consens University of Toronto April 20th, 2009 Madrid, Spain Presentation at the Linked Data On the Web (LDOW) 2009 Workshop LinkedMDB 2 The first linked data source dedicated

More information

Assignment 1. Assignment 2. Relevance. Performance Evaluation. Retrieval System Evaluation. Evaluate an IR system

Assignment 1. Assignment 2. Relevance. Performance Evaluation. Retrieval System Evaluation. Evaluate an IR system Retrieval System Evaluation W. Frisch Institute of Government, European Studies and Comparative Social Science University Vienna Assignment 1 How did you select the search engines? How did you find the

More information

Project Revision. just links to Principles of Information and Database Management 198:336 Week 13 May 2 Matthew Stone

Project Revision.  just links to Principles of Information and Database Management 198:336 Week 13 May 2 Matthew Stone Project Revision Principles of Information and Database Management 198:336 Week 13 May 2 Matthew Stone Email just links to mdstone@cs Link to code (on the web) Link to writeup (on the web) Link to project

More information

Data Classification. The Foundation for Intelligent Information Management. Infostructure Associates Leveraging Information for Organizational Success

Data Classification. The Foundation for Intelligent Information Management. Infostructure Associates Leveraging Information for Organizational Success Data Classification The Foundation for Intelligent Information Management David Hill Principal Wayne Kernochan President Infostructure Associates Leveraging Information for Organizational Success SWC Legal

More information
