Databases and Information Retrieval Integration TIETS42. Kostas Stefanidis Autumn 2016

1 + Databases and Information Retrieval Integration TIETS42 Autumn 2016 Kostas Stefanidis

2 + DB & IR Integration: Databases and information retrieval are two areas that have developed separately. They have focused on different application areas and have emphasized different methodologies.

3 + DB & IR Integration: In databases, we pose queries over data with a particular schema, we use an algebra, and we care about the accuracy of the query results. In information retrieval, we focus on queries expressed as keywords, the queries are applied to free-text documents, and we care about how to rank the query results based on statistics and probabilities.

4 + DB & IR Integration: Nowadays, many applications require the concurrent management of structured and unstructured data, so integrating these two worlds has become necessary.

5 + Databases and Information Retrieval Integration TIETS42 Autumn 2016 Kostas Stefanidis

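6 + Adding Ranking to DB OR Adding Semantics to IR. Diagram (search type versus data type): Unstructured search (keywords) over structured data (records): keyword search on databases. Unstructured search (keywords) over unstructured data (documents): IR systems, search engines. Structured search (SQL, XQuery) over structured data (records): database systems. Structured search (SQL, XQuery) over unstructured data (documents): querying entities.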

7 + DB & IR Differences. Databases: structured data; structured querying; soundness and completeness; the user is expected to be aware of the underlying structure of the data or of a query language. IR: unstructured data; unstructured querying; high precision and recall; no expectations of the user.
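The IR-side goals of precision and recall can be made concrete with a small sketch. The function name and the document identifiers are illustrative assumptions, not part of the course material.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for one query.

    retrieved: documents returned by the system
    relevant:  documents that are actually relevant
    """
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    # Precision: fraction of the returned documents that are relevant.
    precision = hits / len(retrieved) if retrieved else 0.0
    # Recall: fraction of the relevant documents that were returned.
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical example: 4 documents returned, 3 documents relevant, 2 in common.
p, r = precision_recall(["d1", "d2", "d3", "d4"], ["d1", "d3", "d5"])
```

A sound and complete database query corresponds to the special case where both values are 1: everything returned satisfies the query, and nothing that satisfies it is missing.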

8 + Why DB & IR Integration? DB and IR have evolved as separate communities. Their focus is on very different application areas, e.g., (DB) accounting and reservation systems versus (IR) library and patent information. As a result, they have different methodological paradigms: (DB) precise querying over schematized data, based on logic and algebra; (IR) keyword search and ranking over text and uncertain data, based on statistics and probability theory.

9 + Why DB & IR Integration? Today, many applications require managing both structured and unstructured data. This raises the question of how to integrate the DB and IR worlds at both the foundational and the software-system level. The next slides present tenets, from different viewpoints, on why DB & IR integration is desirable.

10 + Too many answers. Example: searches over travel portals or product catalogs suffer from the too-many-answers problem. What about tightening the query conditions? This may produce too few or even no results. Note also that interactive reformulation and browsing is time-consuming and may irritate customers/users. For large result sets, ranking is needed, based on data and/or workload statistics or on user profiles.
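A minimal sketch of how ranking addresses the too-many-answers problem: instead of tightening the filter until few rows remain, keep the k best rows under a scoring function. The hotel records and the scoring formula are made-up assumptions for illustration only; a real system would derive the score from data statistics, workload statistics, or a user profile.

```python
import heapq


def top_k(records, score, k):
    """Return the k highest-scoring records without sorting the whole result."""
    return heapq.nlargest(k, records, key=score)


# Hypothetical result set of a broad query on a travel portal.
hotels = [
    {"name": "A", "price": 120, "rating": 4.5},
    {"name": "B", "price": 80, "rating": 3.9},
    {"name": "C", "price": 200, "rating": 4.8},
    {"name": "D", "price": 95, "rating": 4.2},
]


def score(hotel):
    # Assumed trade-off: reward rating, penalize price.
    return hotel["rating"] - hotel["price"] / 100.0


best_two = top_k(hotels, score, 2)
```

The query conditions stay loose, so the user never ends up with an empty result, yet only the most promising answers are shown.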

11 + Text matching. Because of misspellings, spelling variants, etc., there is a need to add text-matching functionality to DB systems, and in particular approximate matching. E.g., record linkage for matching entities must reconcile "Hector Garcia-Molina" and "Garcia-Molina, H.". Intuitively, approximate matching by similarity measures requires ranking.
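The record-linkage example can be sketched as follows. This is one possible similarity measure under stated assumptions, not the method used in the course: names are reduced to a (last name, first initial) key, last names are compared with `difflib.SequenceMatcher` from the Python standard library, and the 0.8/0.2 weights are arbitrary.

```python
from difflib import SequenceMatcher


def name_key(name):
    """Normalize 'First Last' or 'Last, F.' to (last name, first initial)."""
    name = name.strip().lower()
    if "," in name:
        last, first = [part.strip() for part in name.split(",", 1)]
    else:
        parts = name.split()
        if not parts:
            return "", ""
        first, last = parts[0], parts[-1]
    return last, first[:1]


def similarity(a, b):
    """Similarity in [0, 1]; the weights are an illustrative assumption."""
    last_a, init_a = name_key(a)
    last_b, init_b = name_key(b)
    last_sim = SequenceMatcher(None, last_a, last_b).ratio()
    init_sim = 1.0 if init_a == init_b else 0.0
    return 0.8 * last_sim + 0.2 * init_sim


def rank_matches(query, candidates):
    """Order candidate records from most to least similar to the query."""
    return sorted(candidates, key=lambda c: similarity(query, c), reverse=True)


ranked = rank_matches(
    "Hector Garcia-Molina",
    ["Ullman, J.", "Garcia-Molina, H.", "Garcia, M."],
)
```

Because the measure returns a degree of similarity rather than a yes/no answer, the natural output is a ranked list of candidate matches, which is exactly why approximate matching brings ranking into the database.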

12 + Heterogeneity. Typically, applications access multiple databases, often with a run-time choice of the data sources, and there is no unified global schema. Even if the sources contain structured, exact data records and have an explicit schema, the application has to cope with the heterogeneity of the underlying schema names, XML tags, or RDF properties. Queries need to be schema-agnostic or tolerant to schema relaxation.

13 + Information extraction. Textual information (natural-language sentences) contains named entities and relationships between them. Information-extraction techniques (pattern matching, statistical learning) are used to locate the entities. Potentially, this yields large knowledge bases whose facts carry some uncertainty. Querying the extracted facts therefore needs ranking.

14 + Information extraction. Querying the extracted facts: use keywords rather than sophisticated expressions in SQL or XQuery. If the extracted data are organized in graph structures, we need to determine when keyword occurrences are interconnected in a meaningful way and to compute answers efficiently in ranked order (new, or not so new, research problems).
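A simplified sketch of keyword search over a data graph: a node counts as an answer root if it can reach at least one occurrence of every keyword, and roots are ranked by the total distance to the nearest occurrences. The graph, the node labels, and this particular scoring are assumptions for illustration; published graph-based approaches use more refined answer definitions and far more efficient algorithms.

```python
from collections import deque


def bfs_distances(graph, start):
    """Hop distance from start to every node reachable from it."""
    dist = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for neighbour in graph.get(node, ()):
            if neighbour not in dist:
                dist[neighbour] = dist[node] + 1
                queue.append(neighbour)
    return dist


def ranked_roots(graph, text, keywords):
    """Rank nodes that connect all keywords by total distance (smaller is better)."""
    per_keyword = []
    for keyword in keywords:
        nearest = {}  # node -> distance to the closest node matching this keyword
        for node, label in text.items():
            if keyword.lower() in label.lower():
                for other, d in bfs_distances(graph, node).items():
                    if d < nearest.get(other, float("inf")):
                        nearest[other] = d
        per_keyword.append(nearest)
    scores = {
        node: sum(nearest[node] for nearest in per_keyword)
        for node in text
        if all(node in nearest for nearest in per_keyword)
    }
    return sorted(scores.items(), key=lambda item: (item[1], item[0]))


# Hypothetical extracted facts: papers linked to their authors (undirected edges).
graph = {
    "paper1": ["author_a", "author_b"],
    "author_a": ["paper1"],
    "author_b": ["paper1", "paper2"],
    "paper2": ["author_b"],
}
text = {
    "paper1": "Keyword search in databases",
    "author_a": "Alice",
    "author_b": "Bob",
    "paper2": "Skyline queries",
}
answers = ranked_roots(graph, text, ["alice", "bob", "skyline"])
```

If some keyword matches no node, or its matches are not connected to the others, the result is empty, which reflects the requirement that keyword occurrences be interconnected.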

15 + Structured IR. Structured IR goes beyond keyword search by understanding attributes, XML tags and metadata. Application areas: digital libraries, enterprise intranets, e-science portals, and business-oriented Web sites. Example: the faceted search paradigm, which gives access to information organized according to multiple dimensions (ranking in multiple ways) and allows users to explore a collection of information by applying multiple filters. It is used by Internet merchant sites for product search, result refinement, and interactive exploration.
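The faceted search paradigm can be sketched in a few lines: apply the filters the user has selected, then report, for each remaining dimension, how many items fall under each value. The product catalog and the function names are hypothetical.

```python
from collections import Counter


def apply_filters(items, filters):
    """Keep the items whose facet values match every selected filter."""
    return [
        item for item in items
        if all(item.get(facet) == value for facet, value in filters.items())
    ]


def facet_counts(items, facets):
    """For each facet, count how many items carry each value."""
    return {
        facet: Counter(item[facet] for item in items if facet in item)
        for facet in facets
    }


# Hypothetical product catalog of an Internet merchant.
products = [
    {"name": "Laptop X", "brand": "Acme", "category": "laptop", "color": "black"},
    {"name": "Laptop Y", "brand": "Bolt", "category": "laptop", "color": "silver"},
    {"name": "Phone Z", "brand": "Acme", "category": "phone", "color": "black"},
]

laptops = apply_filters(products, {"category": "laptop"})
refinements = facet_counts(laptops, ["brand", "color"])
```

The counts tell the user which further filters are available and how much each would narrow the result, which supports the interactive exploration mentioned on the slide.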

16 + Search-result personalization. Take care of the user's information needs: better search precision/recall and higher user satisfaction. Exploit user preferences; profiling (the user's long-term history of queries, clicks and data usage); and contextual profiling (the user's short-term behavior in the context of the current task). Personalization is already present in Web, news and blog search, and there is enormous potential for individualizing.

17 + Different Views of the Coin. About the need for structure: DB emphasizes relaxation of structure, while IR emphasizes adding structure to information. (The Web community takes a mix of structured and unstructured data for granted.) About the need for named entities: DB emphasizes approximate matching and ranking, while IR emphasizes adding relationships between entities.

18 + DB & IR Integration: Learning outcomes. After completing the course, students are expected to: know the basic concepts and techniques for the integration of databases and information retrieval; be able to handle contemporary research issues and problems on the topic; be able to perform a comparative assessment of existing works.

19 + DB & IR Integration: 24 Oct to 16 Dec (8 weeks), in two parts. 1st part (4 weeks): all lectures will be given by the instructor. 2nd part (4 weeks): lectures (in the form of assignments) will mostly be given by the students.

20 + DB & IR Integration: 1st part (4 weeks), all lectures will be given by the instructor. Introduction to big data and the need for data exploration, to the techniques that will be presented in the lectures, and to the structure/organization of the course. For this part, algorithmic exercises or extensions of the presented approaches will be given to the students on a weekly basis (each student will work on his/her own).

21 + DB & IR Integration: 1st part (4 weeks), all lectures will be given by the instructor. Top-k and skyline queries: rank aggregation, top-k algorithms, skylines. Keyword-based search: schema-based and graph-based approaches in databases. Preferential search: preference representation and composition, preferential query processing. Recommender systems: collaborative filtering, content-based recommendations.
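As a preview of the first lecture topic, here is a naive sketch of a skyline query. It assumes that smaller values are better in every dimension and compares all pairs of points, which is quadratic; it illustrates the definition only and is not one of the optimized algorithms covered in the lectures. The hotel data are made up.

```python
def dominates(p, q):
    """p dominates q if p is no worse in every dimension and better in at least one."""
    return all(a <= b for a, b in zip(p, q)) and any(a < b for a, b in zip(p, q))


def skyline(points):
    """Return the points that no other point dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


# Hypothetical hotels as (price, distance to the beach in km); lower is better.
hotels = [(50, 8), (80, 3), (120, 1), (90, 5), (130, 2)]
best = skyline(hotels)
```

Unlike a top-k query, a skyline needs no scoring function: it returns every point that represents a possible best trade-off, here the cheap-but-far, mid-range, and expensive-but-close hotels.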

22 + DB & IR Integration: 2nd part (4 weeks), lectures (in the form of assignments) will mostly be given by the students. Students will form groups (at most 4 students per group: TBD). Each group will be assigned a project. Each project will be associated with two research papers. Each week, each group will give a short presentation.

23 + DB & IR Integration: 2nd part (4 weeks), lectures (in the form of assignments) will mostly be given by the students. Each week, each group will give a short presentation (~10-15 mins). 1st week: briefly describe the topic and the solutions of the project's papers. 2nd week: describe the main disadvantages/drawbacks of the solutions given by the original authors. 3rd week: present ideas from other related papers published after the project's papers; search for upcoming papers related to the project. 4th week: extend the ideas of the project (students' contributions).

24 + DB & IR Integration: 2nd part (4 weeks), lectures (in the form of assignments) will mostly be given by the students, plus one assignment per week from the instructor, related to one of the projects.

25 + DB & IR Integration: Grades. The final grade will be determined 30% by the assignments of the first part, 20% by the assignments of the second part, and 50% by the presentations of the project.

26 + Course Projects. Project 1: Top-k join tuples. Project 2: Preference integration in databases. Project 3: Personalized keyword search. Project 4: Contextual recommendations. Project 5: Recommend packages. Project 6: Recommendations for groups. Project 7: Diversity in recommender systems. Project 8: Efficient diverse search. Project 9: Frameworks based on different definitions of diversity. Project 10: Tags for search. Project 11: Interactive data exploration.

27 + Where, When. When: Monday, Thursday, Friday: (24 Oct Dec 2016). Where: Pinni B0016. Instructor: Kostas Stefanidis. Course web page:

Course Design Document: IS202 Data Management. Version 4.5

Course Design Document: IS202 Data Management. Version 4.5 Course Design Document: IS202 Data Management Version 4.5 Friday, October 1, 2010 Table of Content 1. Versions History... 4 2. Overview of the Data Management... 5 3. Output and Assessment Summary... 6

More information

Introduction to Databases Fall-Winter 2009/10. Syllabus

Introduction to Databases Fall-Winter 2009/10. Syllabus Introduction to Databases Fall-Winter 2009/10 Syllabus Werner Nutt Syllabus Lecturer Werner Nutt, nutt@inf.unibz.it, Room TRA 2.01 Office hours: Thursday, 16:00 18:00 (If you want to meet up with me, send

More information

Introduction to Databases Fall-Winter 2010/11. Syllabus

Introduction to Databases Fall-Winter 2010/11. Syllabus Introduction to Databases Fall-Winter 2010/11 Syllabus Werner Nutt Syllabus Lecturer Werner Nutt, nutt@inf.unibz.it, Room POS 2.09 Office hours: Tuesday, 14:00 16:00 and by appointment (If you want to

More information

CS 525 Advanced Database Organization - Spring 2017 Mon + Wed 1:50-3:05 PM, Room: Stuart Building 111

CS 525 Advanced Database Organization - Spring 2017 Mon + Wed 1:50-3:05 PM, Room: Stuart Building 111 CS 525 Advanced Database Organization - Spring 2017 Mon + Wed 1:50-3:05 PM, Room: Stuart Building 111 Instructor: Boris Glavic, Stuart Building 226 C, Phone: 312 567 5205, Email: bglavic@iit.edu Office

More information

Can you name one application that does not need any data? Can you name one application that does not need organized data?

Can you name one application that does not need any data? Can you name one application that does not need organized data? Introduction Why Databases? Can you name one application that does not need any data? No, a program itself is data Can you name one application that does not need organized data? No, programs = algorithms

More information

Chapter 1: Introduction

Chapter 1: Introduction Chapter 1: Introduction Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Outline The Need for Databases Data Models Relational Databases Database Design Storage Manager Query

More information

LIS 2680: Database Design and Applications

LIS 2680: Database Design and Applications School of Information Sciences - University of Pittsburgh LIS 2680: Database Design and Applications Summer 2012 Instructor: Zhen Yue School of Information Sciences, University of Pittsburgh E-mail: zhy18@pitt.edu

More information

Modern Database Systems CS-E4610

Modern Database Systems CS-E4610 Modern Database Systems CS-E4610 Aristides Gionis Michael Mathioudakis Spring 2017 what is a database? a collection of data what is a database management system?... a.k.a. database system software to store,

More information

Overview of the Class and Introduction to DB schemas and queries. Lois Delcambre

Overview of the Class and Introduction to DB schemas and queries. Lois Delcambre Overview of the Class and Introduction to DB schemas and queries Lois Delcambre 1 CS 386/586 Introduction to Databases Instructor: Lois Delcambre lmd@cs.pdx.edu 503 725-2405 TA: TBA Office Hours: Immediately

More information

Blended Learning Outline: Developer Training for Apache Spark and Hadoop (180404a)

Blended Learning Outline: Developer Training for Apache Spark and Hadoop (180404a) Blended Learning Outline: Developer Training for Apache Spark and Hadoop (180404a) Cloudera s Developer Training for Apache Spark and Hadoop delivers the key concepts and expertise need to develop high-performance

More information

San José State University Computer Science Department CS157A: Introduction to Database Management Systems Sections 5 and 6, Fall 2015

San José State University Computer Science Department CS157A: Introduction to Database Management Systems Sections 5 and 6, Fall 2015 San José State University Computer Science Department CS157A: Introduction to Database Management Systems Sections 5 and 6, Fall 2015 Course and Contact Information Instructor: Ron Gutman Office Location:

More information

CS317 File and Database Systems

CS317 File and Database Systems CS317 File and Database Systems http://dilbert.com/strips/comic/1995-10-11/ Lecture 5 More SQL and Intro to Stored Procedures September 24, 2017 Sam Siewert SQL Theory and Standards Completion of SQL in

More information

DEC Computer Technology LESSON 6: DATABASES AND WEB SEARCH ENGINES

DEC Computer Technology LESSON 6: DATABASES AND WEB SEARCH ENGINES DEC. 1-5 Computer Technology LESSON 6: DATABASES AND WEB SEARCH ENGINES Monday Overview of Databases A web search engine is a large database containing information about Web pages that have been registered

More information

20464 Developing Microsoft SQL Server Databases

20464 Developing Microsoft SQL Server Databases Course Overview This 5-day instructor-led course introduces SQL Server 2014 and describes logical table design, indexing and query plans. It also focuses on the creation of database objects including views,

More information

Introduction to Database S ystems Systems CSE 444 Lecture 1 Introduction CSE Summer

Introduction to Database S ystems Systems CSE 444 Lecture 1 Introduction CSE Summer Introduction to Database Systems CSE 444 Lecture 1 Introduction 1 Staff Instructor: Hal Perkins CSE 548, perkins@cs.washington.edu Office hours: labs tba, office drop-ins and appointments welcome TA: David

More information

COAP 3110 INTERACTIVE SITE DEVELOPMENT

COAP 3110 INTERACTIVE SITE DEVELOPMENT COAP 3110 INTERACTIVE SITE DEVELOPMENT http://wwwai.wu-wien.ac.at/~hahsler/webster/coap3110/ Instructor Michael Hahsler Tel. 31336/6081 0699 100 00 598 E-mail: hahsler@ai.wu-wien.ac.at 1 Course description

More information

CS157a Fall 2018 Sec3 Home Page/Syllabus

CS157a Fall 2018 Sec3 Home Page/Syllabus CS157a Fall 2018 Sec3 Home Page/Syllabus Introduction to Database Management Systems Instructor: Chris Pollett Office: MH 214 Phone Number: (408) 924 5145 Email: chris@pollett.org Office Hours: MW 4:30-5:45pm

More information

CSE 132A. Database Systems Principles

CSE 132A. Database Systems Principles CSE 132A Database Systems Principles Prof. Victor Vianu 1 Data Management An evolving, expanding field: Classical stand-alone databases (Oracle, DB2, SQL Server) Computer science is becoming data-centric:

More information

COMP-421: Database Systems. Joseph D silva McConnel Engg. 102

COMP-421: Database Systems. Joseph D silva McConnel Engg. 102 COMP-421: Database Systems Joseph D silva joseph.dsilva@mail.mcgill.ca McConnel Engg. 102 Class: Names and Numbers Mondays, Wednesdays 10:05-11:25 Lecturer: TAs: Joseph D silva joseph.dsilva@mail.mcgill.ca

More information

Elementary IR: Scalable Boolean Text Search. (Compare with R & G )

Elementary IR: Scalable Boolean Text Search. (Compare with R & G ) Elementary IR: Scalable Boolean Text Search (Compare with R & G 27.1-3) Information Retrieval: History A research field traditionally separate from Databases Hans P. Luhn, IBM, 1959: Keyword in Context

More information

Course and Contact Information. Course Description. Course Objectives

Course and Contact Information. Course Description. Course Objectives San Jose State University College of Science Department of Computer Science CS157A, Introduction to Database Management Systems, Sections 1 and 2, Fall2016 Course and Contact Information Instructor: Dr.

More information

Introduction to Database Systems CSE 444. Lecture #1 March 26, 2007

Introduction to Database Systems CSE 444. Lecture #1 March 26, 2007 Introduction to Database Systems CSE 444 Lecture #1 March 26, 2007 1 About Me Dan Suciu: Joined the department in 2000 Before that: Bell Labs, AT&T Labs Research: Past: XML and semi-structured data: Query

More information

What you have learned so far. Interoperability. Ontology heterogeneity. Being serious about the semantic web

What you have learned so far. Interoperability. Ontology heterogeneity. Being serious about the semantic web What you have learned so far Interoperability Introduction to the Semantic Web Tutorial at ISWC 2010 Jérôme Euzenat Data can be expressed in RDF Linked through URIs Modelled with OWL ontologies & Retrieved

More information

Jennifer Widom. Stanford University

Jennifer Widom. Stanford University Principled Research in Database Systems Stanford University What Academics Give Talks About Other people s papers Thesis and new results Significant research projects The research field BIG VISION Other

More information

CSE 544 Principles of Database Management Systems

CSE 544 Principles of Database Management Systems CSE 544 Principles of Database Management Systems Lecture 1 - Introduction and the Relational Model 1 Outline Introduction Class overview Why database management systems (DBMS)? The relational model 2

More information

Database Systems (INFR10070) Dr Paolo Guagliardo. University of Edinburgh. Fall 2016

Database Systems (INFR10070) Dr Paolo Guagliardo. University of Edinburgh. Fall 2016 Database Systems (INFR10070) Dr Paolo Guagliardo University of Edinburgh Fall 2016 Databases are everywhere Electronic commerce, websites (e.g., Wordpress blogs) Banking applications, booking systems,

More information

Databases TDA357/DIT620. Niklas Broberg

Databases TDA357/DIT620. Niklas Broberg Databases TDA357/DIT620 Niklas Broberg niklas.broberg@chalmers.se 1 What s a database anyway? 2 A database is Structured Persistant Changable Digital True to integrity constraints 3 DBMS Database == Data

More information

TITLE OF COURSE SYLLABUS, SEMESTER, YEAR

TITLE OF COURSE SYLLABUS, SEMESTER, YEAR TITLE OF COURSE SYLLABUS, SEMESTER, YEAR Instructor Contact Information Jennifer Weller Jweller2@uncc.edu Office Hours Time/Location of Course Mon 9-11am MW 8-9:15am, BINF 105 Textbooks Needed: none required,

More information

CSE 344 JANUARY 3 RD - INTRODUCTION

CSE 344 JANUARY 3 RD - INTRODUCTION CSE 344 JANUARY 3 RD - INTRODUCTION COURSE FORMAT Lectures Location: SIG 134 Please attend Sections: Content: exercises, tutorials, questions, new materials (occasionally) Locations: see web Please attend

More information

CMPUT 391 Database Management Systems. Fall Semester 2006, Section A1, Dr. Jörg Sander. Introduction

CMPUT 391 Database Management Systems. Fall Semester 2006, Section A1, Dr. Jörg Sander. Introduction CMPUT 391 Database Management Systems Fall Semester 2006, Section A1, Dr. Jörg Sander Introduction University of Alberta 1 Objectives of Lecture 1 Get a rough initial idea about the content of the course:

More information

Information Retrieval

Information Retrieval Information Retrieval CSC 375, Fall 2016 An information retrieval system will tend not to be used whenever it is more painful and troublesome for a customer to have information than for him not to have

More information

SEMANTIC WEB POWERED PORTAL INFRASTRUCTURE

SEMANTIC WEB POWERED PORTAL INFRASTRUCTURE SEMANTIC WEB POWERED PORTAL INFRASTRUCTURE YING DING 1 Digital Enterprise Research Institute Leopold-Franzens Universität Innsbruck Austria DIETER FENSEL Digital Enterprise Research Institute National

More information

20762B: DEVELOPING SQL DATABASES

20762B: DEVELOPING SQL DATABASES ABOUT THIS COURSE This five day instructor-led course provides students with the knowledge and skills to develop a Microsoft SQL Server 2016 database. The course focuses on teaching individuals how to

More information

SQL Server Development 20762: Developing SQL Databases in Microsoft SQL Server Upcoming Dates. Course Description.

SQL Server Development 20762: Developing SQL Databases in Microsoft SQL Server Upcoming Dates. Course Description. SQL Server Development 20762: Developing SQL Databases in Microsoft SQL Server 2016 Learn how to design and Implement advanced SQL Server 2016 databases including working with tables, create optimized

More information

Writing Queries Using Microsoft SQL Server 2008 Transact- SQL

Writing Queries Using Microsoft SQL Server 2008 Transact- SQL Writing Queries Using Microsoft SQL Server 2008 Transact- SQL Course 2778-08; 3 Days, Instructor-led Course Description This 3-day instructor led course provides students with the technical skills required

More information

Introduction to Databases

Introduction to Databases Introduction to Databases Syllabus Web Page http://www.cs.northwestern.edu/~pdinda/db (Note: may change with reorganization of ECE/CS) Instructor Peter A. Dinda Technological Institute L463 847-467-7859

More information

Course and Contact Information. Course Description. Course Objectives

Course and Contact Information. Course Description. Course Objectives San Jose State University College of Science Department of Computer Science CS157A, Introduction to Database Management Systems, Sections 1 and 2, Fall2017 Course and Contact Information Instructor: Dr.

More information

Introduction to Database Systems CSE 444. Lecture 1 Introduction

Introduction to Database Systems CSE 444. Lecture 1 Introduction Introduction to Database Systems CSE 444 Lecture 1 Introduction 1 About Me: General Prof. Magdalena Balazinska (magda) At UW since January 2006 PhD from MIT Born in Poland Grew-up in Poland, Algeria, and

More information

Fundamentals of Databases

Fundamentals of Databases Fundamentals of Databases College of Computers and Information Technology Taif University Saudi Arabia s.zahrani@tu.edu.sa/ admin@c2learn.com Fundamentals of Databases Dr. Salha M. Alzahrani 1 Fundamentals

More information

SRM UNIVERSITY FACULTY OF ENGINEERING AND TECHNOLOGY SCHOOL OF COMPUTING DEPARTMENT OF CSE COURSE PLAN

SRM UNIVERSITY FACULTY OF ENGINEERING AND TECHNOLOGY SCHOOL OF COMPUTING DEPARTMENT OF CSE COURSE PLAN SRM UNIVERSITY FACULTY OF ENGINEERING AND TECHNOLOGY SCHOOL OF COMPUTING DEPARTMENT OF CSE COURSE PLAN Course Code : CS0304 Course Title : Data Base Management Systems Semester : VI Course Time : Dec 2012-

More information

Microsoft. [MS20762]: Developing SQL Databases

Microsoft. [MS20762]: Developing SQL Databases [MS20762]: Developing SQL Databases Length : 5 Days Audience(s) : IT Professionals Level : 300 Technology : Microsoft SQL Server Delivery Method : Instructor-led (Classroom) Course Overview This five-day

More information

745: Advanced Database Systems

745: Advanced Database Systems 745: Advanced Database Systems Yanlei Diao University of Massachusetts Amherst Outline Overview of course topics Course requirements Database Management Systems 1. Online Analytical Processing (OLAP) vs.

More information

ITM DEVELOPMENT (ITMD)

ITM DEVELOPMENT (ITMD) ITM Development (ITMD) 1 ITM DEVELOPMENT (ITMD) ITMD 361 Fundamentals of Web Development This course will cover the creation of Web pages and sites using HTML, CSS, Javascript, jquery, and graphical applications

More information

Information Retrieval CSCI

Information Retrieval CSCI Information Retrieval CSCI 4141-6403 My name is Anwar Alhenshiri My email is: anwar@cs.dal.ca I prefer: aalhenshiri@gmail.com The course website is: http://web.cs.dal.ca/~anwar/ir/main.html 5/6/2012 1

More information

Developing SQL Databases

Developing SQL Databases Course 20762B: Developing SQL Databases Page 1 of 9 Developing SQL Databases Course 20762B: 4 days; Instructor-Led Introduction This four-day instructor-led course provides students with the knowledge

More information

Writing Queries Using Microsoft SQL Server 2008 Transact-SQL. Overview

Writing Queries Using Microsoft SQL Server 2008 Transact-SQL. Overview Writing Queries Using Microsoft SQL Server 2008 Transact-SQL Overview The course has been extended by one day in response to delegate feedback. This extra day will allow for timely completion of all the

More information

What s a database anyway?

What s a database anyway? Lecture 1 Databases TDA357/DIT620 Pablo Picazo pablop@chalmers.se What s a database anyway? Structured Persistant Changable Digital A database is True to integrity constraints DBMS Database == Data collection

More information

Course Outline Faculty of Computing and Information Technology

Course Outline Faculty of Computing and Information Technology Course Outline Faculty of Computing and Information Technology Title Code Instructor Name Credit Hours Prerequisite Prerequisite Skill/Knowledge/Understanding Category Course Goals Statement of Course

More information

EECS 647: Introduction to Database Systems

EECS 647: Introduction to Database Systems EECS 647: Introduction to Database Systems Instructor: Luke Huan Spring 2009 Queries for Today What is a database? What is a database management system? Why take a database course? Who will teach? How

More information

PROJECT PERIODIC REPORT

PROJECT PERIODIC REPORT PROJECT PERIODIC REPORT Grant Agreement number: 257403 Project acronym: CUBIST Project title: Combining and Uniting Business Intelligence and Semantic Technologies Funding Scheme: STREP Date of latest

More information

Database Technology Introduction. Heiko Paulheim

Database Technology Introduction. Heiko Paulheim Database Technology Introduction Outline The Need for Databases Data Models Relational Databases Database Design Storage Manager Query Processing Transaction Manager Introduction to the Relational Model

More information

Chapter 2 Introduction to Relational Models

Chapter 2 Introduction to Relational Models CMSC 461, Database Management Systems Spring 2018 Chapter 2 Introduction to Relational Models These slides are based on Database System Concepts book and slides, 6th edition, and the 2009 CMSC 461 slides

More information

CS425 Fall 2016 Boris Glavic Chapter 1: Introduction

CS425 Fall 2016 Boris Glavic Chapter 1: Introduction CS425 Fall 2016 Boris Glavic Chapter 1: Introduction Modified from: Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Textbook: Chapter 1 1.2 Database Management System (DBMS)

More information

Information Management (IM)

Information Management (IM) 1 2 3 4 5 6 7 8 9 Information Management (IM) Information Management (IM) is primarily concerned with the capture, digitization, representation, organization, transformation, and presentation of information;

More information

Information Retrieval

Information Retrieval Multimedia Computing: Algorithms, Systems, and Applications: Information Retrieval and Search Engine By Dr. Yu Cao Department of Computer Science The University of Massachusetts Lowell Lowell, MA 01854,

More information

CSC 261/461 Database Systems Lecture 19

CSC 261/461 Database Systems Lecture 19 CSC 261/461 Database Systems Lecture 19 Fall 2017 Announcements CIRC: CIRC is down!!! MongoDB and Spark (mini) projects are at stake. L Project 1 Milestone 4 is out Due date: Last date of class We will

More information

CSE 303: Database. Teaching Staff. Lecture 01. Lectures: 1 st half - from a user s perspective. Lectures: 2 nd half - understanding how it works

CSE 303: Database. Teaching Staff. Lecture 01. Lectures: 1 st half - from a user s perspective. Lectures: 2 nd half - understanding how it works CSE 303: Database Lecture 01 Instructors: Teaching Staff 2016 Introductory Lecture Ashikur Rahman Prof. ASM Latiful Hoque Office: CSE 117 Office: CSE 118 1 2 Lectures: 1 st half - from a user s perspective

More information

SRM UNIVERSITY. : Batch1: TP1102 Batch2: TP406

SRM UNIVERSITY. : Batch1: TP1102 Batch2: TP406 1 SRM UNIVERSITY FACULTY OF ENGINEERING AND TECHNOLOGY SCHOOL OF COMPUTING DEPARTMENT OF COMPUTERSCIENCE AND ENGINEERING COURSE PLAN Course Code Course Title Semester : 15CS424E : SEMANTIC WEB : V Course

More information

: Semantic Web (2013 Fall)

: Semantic Web (2013 Fall) 03-60-569: Web (2013 Fall) University of Windsor September 4, 2013 Table of contents 1 2 3 4 5 Definition of the Web The World Wide Web is a system of interlinked hypertext documents accessed via the Internet

More information

CS 572: Information Retrieval. Lecture 1: Course Overview and Introduction 11 January 2016

CS 572: Information Retrieval. Lecture 1: Course Overview and Introduction 11 January 2016 CS 572: Information Retrieval Lecture 1: Course Overview and Introduction 11 January 2016 1/11/2016 CS 572: Information Retrieval. Spring 2016 1 Lecture Plan What is IR? (the big questions) Course overview

More information

INDE499B: Information Systems Course Review Autumn 2000

INDE499B: Information Systems Course Review Autumn 2000 INDE499B: Information Systems Course Review Autumn 2000 WHAT did we cover? HOW did we cover it? WHY did we cover it? And the exam, what s on it INDE499B: Information Systems Course Review Autumn 2000 WHAT

More information

COMP-421: Database Systems. Joseph D silva McConnel Engg. 102

COMP-421: Database Systems. Joseph D silva McConnel Engg. 102 COMP-421: Database Systems Joseph D silva joseph.dsilva@mail.mcgill.ca McConnel Engg. 102 Class: Names and Numbers Mondays, Wednesdays 11:35-12:55 Lecturer: Joseph D silva joseph.dsilva@mail.mcgill.ca

More information

Essay Question: Explain 4 different means by which constrains are represented in the Conceptual Data Model (CDM).

Essay Question: Explain 4 different means by which constrains are represented in the Conceptual Data Model (CDM). Question 1 Essay Question: Explain 4 different means by which constrains are represented in the Conceptual Data Model (CDM). By specifying participation conditions By specifying the degree of relationship

More information
