Wrap-Up: Data Sharing and the Web

Similar documents
Information Integration

Data Integration Systems

Wide Area Query Systems The Hydra of Databases

745: Advanced Database Systems

CSE 544 Principles of Database Management Systems

DEC Computer Technology LESSON 6: DATABASES AND WEB SEARCH ENGINES

CMPSCI 645 Database Design & Implementation

Dataspaces: A New Abstraction for Data Management. Mike Franklin, Alon Halevy, David Maier, Jennifer Widom

LIS 2680: Database Design and Applications

Course: Database Management Systems. Lê Thị Bảo Thu

COMP.3090/3100 Database I & II. Textbook

CMPUT 391 Database Management Systems. Fall Semester 2006, Section A1, Dr. Jörg Sander. Introduction

Mini-Project, Exam, Etc

Outline. Database Management Systems (DBMS) Database Management and Organization. IT420: Database Management and Organization

Who we are: Database Research - Provenance, Integration, and more hot stuff. Boris Glavic. Department of Computer Science

Introduction to Data Management CSE 344. Lecture 1: Introduction

DATABASE MANAGEMENT SYSTEMS

Elementary IR: Scalable Boolean Text Search. (Compare with R & G )

CS 347 Parallel and Distributed Data Processing

State of the Art and Trends in Search Engine Technology. Gerhard Weikum

COMP390 (Design &) Implementation

CMPT 354 Database Systems I. Spring 2012 Instructor: Hassan Khosravi

CSC 4710 / CSC 6710 Database Systems. Rao Casturi

COMP390 (Design &) Implementation

Data Analytics at Logitech Snowflake + Tableau = #Winning

CS 405G: Introduction to Database Systems. Lecture 1: Introduction

Why is Mariposa Important? Mariposa: A wide-area distributed database. Outline. Motivation: Assumptions. Motivation

Introduction to Data Management. Lecture #2 (Big Picture, Cont.)

CSE 344 JANUARY 3 RD - INTRODUCTION

Big Trend in Business Intelligence: Data Mining over Big Data Web Transaction Data. Fall 2012

Where do these data come from? What technologies do they use?? Whatever they use, they need models (schemas, metadata, )

CSC 261/461 Database Systems. Fall 2017 MW 12:30 pm 1:45 pm CSB 601

Introduction to Data Management. Lecture #2 (Big Picture, Cont.) Instructor: Chen Li

Administrivia. CS186 Class Wrap-Up. News. News (cont) Top Decision Support DBs. Lessons? (from the survey and this course)

CIS 4930/6930 Spring 2014 Introduction to Data Science /Data Intensive Computing. University of Florida, CISE Department Prof.

Databases and Information Retrieval Integration TIETS42. Kostas Stefanidis Autumn 2016

CSE 132A. Database Systems Principles

Contents. Introduction

Database Management System Implementation. Who am I? Who is the teaching assistant? TR, 10:00am-11:20am NTRP B 140 Instructor: Dr.

II. Data Models. Importance of Data Models. Entity Set (and its attributes) Data Modeling and Data Models. Data Model Basic Building Blocks

Oracle Database 12c: New Features For Administrators

What s a database anyway?

Introduction to Database Systems CSE 444. Lecture #1 March 26, 2007

Lesson 3 Transcript: Part 1 of 2 - Tools & Scripting

B.H.GARDI COLLEGE OF MASTER OF COMPUTER APPLICATION. Ch. 1 :- Introduction Database Management System - 1

A Brief History of Big Data

Fall Principles of Knowledge Discovery in Databases. University of Alberta

Database Systems. Sven Helmer. Database Systems p. 1/567

P2P (Semantic) Mediation. Georges Gardarin Dragan Florin, Laurent Yeh PRiSM Lab. University of Versailles, France

Business Analytics in the Oracle 12.2 Database: Analytic Views. Event: BIWA 2017 Presenter: Dan Vlamis and Cathye Pendley Date: January 31, 2017

Oracle Database 18c and Autonomous Database

CMU SCS CMU SCS Who: What: When: Where: Why: CMU SCS

Learn Well Technocraft

Chuck Cartledge, PhD. 19 January 2018

Fundamentals of Database Systems

G64DBS Database Systems. G64DBS Module. Recommended Textbook. Assessment. Recommended Textbook. Recommended Textbook.

CSC 261/461 Database Systems Lecture 20. Spring 2017 MW 3:25 pm 4:40 pm January 18 May 3 Dewey 1101

Introduction to CS 4604

Dr. Angelika Reiser Chair for Database Systems (I3)

COMP390 (Design &) Implementation

What is the Future of PostgreSQL?

Introduction. Random things to do after this course. Course roadmap. CPS 116 Introduction to Database Systems

DOWNLOAD OR READ : RELATIONAL DATABASE DESIGN AND IMPLEMENTATION FOURTH EDITION PDF EBOOK EPUB MOBI

Kepware Whitepaper. IIoT Protocols to Watch. Aron Semle, R&D Lead. Introduction

CSE544 Database Architecture

Introduction to Data Management CSE 344

CS152 Computer Architecture and Engineering. Lecture 9 Performance Dave Patterson. John Lazzaro. www-inst.eecs.berkeley.

Specific Objectives Contents Teaching Hours 4 the basic concepts 1.1 Concepts of Relational Databases

data parallelism Chris Olston Yahoo! Research

purequery Deep Dive Part 2: Data Access Development Dan Galvin Galvin Consulting, Inc.

Performance Measurement (as seen by the customer)

High Availability- Disaster Recovery 101

MIS Database Systems.

CSE 417 Practical Algorithms. (a.k.a. Algorithms & Computational Complexity)

Databases 1. Daniel POP

BIS Database Management Systems.

Something to think about. Problems. Purpose. Vocabulary. Query Evaluation Techniques for large DB. Part 1. Fact:

COMP3311 Database Systems

Distributed Data Management

Part I What are Databases?

InfoSphere Warehouse V9.5 Exam.

Introduction and Overview

Advanced Database Management Systems

IBM EXAM - C DB Fundamentals. Buy Full Product.

Hype Cycle for Data Warehousing, 2003

Introduction to Database S ystems Systems CSE 444 Lecture 1 Introduction CSE Summer

Announcements. JSon Data Structures. JSon Syntax. JSon Semantics: a Tree! JSon Primitive Datatypes. Introduction to Database Systems CSE 414

Carnegie Mellon Univ. Dept. of Computer Science /615 - DB Applications. Administrivia Final Exam. Administrivia Final Exam

Carnegie Mellon Univ. Dept. of Computer Science /615 - DB Applications. Administrivia. Administrivia. Faloutsos/Pavlo CMU /615

Pre-Discussion. XQuery: An XML Query Language. Outline. 1. The story, in brief is. Other query languages. XML vs. Relational Data

CSE 344 MAY 7 TH EXAM REVIEW

Introduction to Data Management CSE 344

What we already know. Late Days. CSE 444: Database Internals. Lecture 3 DBMS Architecture

Department of Accounting & Law, School of Business. State University of New York at Albany. Acc 682 Analysis & Design of Accounting Databases

SOFTWARE DEVELOPMENT: DATA SCIENCE

: Semantic Web (2013 Fall)

Event semantics in asynchronous distributed event middleware

Announcements. Teams formed (sit with your group from now on) Project Deliverables

Netwrix Virtual. Customer Summit 2016

Syllabus DATABASE I Introduction to Database (INLS523)

Transcription:

Wrap-Up: Data Sharing and the Web Zachary G. Ives University of Pennsylvania April 16, 2003

Administrivia I Reminder: Monday is your project presentation About 5-7 minutes each Slides are allowed (but not required); demos welcome (but also not required) What did you do? What were the hard problems? How are you solving them? How are you evaluating your work? Take-home final exam also distributed Monday Will likely be 3-4 essay questions; open-book, open-notes Due Friday May 2 before 6PM EST 2

Administrivia II Project reports due Friday May 2 5-10 pages for implementation project; 10-15 pages for survey paper Structured like a conference paper Remember to answer: What problem you re addressing Why it s a problem How you re attacking it How you re going to show you succeeded 3

Where Have We Been? I Our themes: Data sharing using semantics Web and Internet The central characters: Data integration What s does it look like? What does it do? vs. warehousing? Decentralized variants Peer data management systems 4

Where Have We Been? II Processing queries Query optimization What were the two systems? Query execution Basic principles Adaptive variants Inter-query: SITs Intra-query: mid-query re-optimization, eddies Query processing in a distributed context Mariposa, Tukwila Answering queries using views 5

Where Have We Been? III Other ways of sharing data: Publish-subscribe: XFilter IR-style querying IR/DB hybrids: WHIRL, extended XML-QL Difference-based models: Change detection in semistructured data Heraclitus Harmony Semantic Web Groupware Matching schemas and data 6

How Much Real-World Impact? I Data integration: Systems available from Nimble Technology, Enosys Markets, IBM, BEA, Progressive Software, Query optimization: Every DBMS uses System-R or Volcano-derived optimizer Query execution: Basic techniques common to every DBMS Adaptive query processing: Only a few techniques (e.g., SITs) currently available Answering queries using views: Data integration systems; DB2, Oracle, likely SQL Server 7

How Much Real-World Impact? II Publish-subscribe: Not too much use (exception: RSS, available from Netscape/AOL, CNET, etc.) IR querying: Google is partly based on this! DB hybrids: DB2 Text Extender, SQL Server text ext. Synchronization/change detection: CVS, diff3, Unison Groupware: SMTP, Exchange server, Domino, Groove, Semantic Web: The jury is still out 8

What s Likely to Be Adopted Soon? Adaptive query processing Autonomic computing is the big buzzword Much like JIT compiling Industry may actually be driving most of the adaptive work in the future: Adapting to new configurations, new hardware Adapting to workloads Self-healing (failover) 9

What Are Today s Hottest Research Topics? I XML!! Everyone does papers on XML processing, XML indices, XML validation, XML constraints, XML compression, and so on Data streams!!!! The new hot topic, though it has many meanings Sensor data; stock data; astronomy data Mining streams; statistics for streams; querying streams over sliding windows; etc. Traditional stuff: optimization, execution, indexing 10

What Are Today s Hottest Research Topics? II Peer-to-peer architectures Using P2P for scalability reasons Semantic translation Mapping schemas; manipulating schemas Querying in this context Semantic Web Architecture-aware DBs Cache-aware algorithms; scheduler-aware algorithms 11

In Summary I hope you found this course to be less dull than average Databases doesn t have to mean payroll systems or web backends I hope you learned something that s useful regardless of whether you want to be a DB researcher How to read and analyze technical papers How to present How to write... And a little about what goes on inside the DBMS Thanks for being a fabulous class! 12