Jennifer Widom. Stanford University
|
|
- Buck Potter
- 5 years ago
- Views:
Transcription
1 Principled Research in Database Systems Stanford University
2 What Academics Give Talks About Other people s papers Thesis and new results Significant research projects The research field BIG VISION Other people s papers Grad Student Assistant Professor Associate Professor Full Professor 2
3 Recipe for Database Research Project Pick a simple but fundamental assumption underlying traditional database systems Drop it Must reconsider all aspects of data management and query processing Many Ph.D. theses Prototype from scratch 3
4 Recipe for Database Research Project Pick a simple but fundamental assumption underlying traditional database systems Drop it Example simple but fundamental assumptions Structure of data (schema) declared in advance Drop: Persistent (typically stable, disk-resident) data sets Drop: Data elements contain known values Drop: Semistructured data Data streams Uncertain data 4
5 Recipe for Database Research Project Must reconsider all aspects of data management and query processing Reconsidering all aspects Data model Foundations Query language Storage and indexing structures Query processing and optimization Concurrency control, recovery Application and user interfaces Implementation 5
6 Principled Database Systems Research To develop a new type of database system: Data Model Query Language System Consider all of them In this order Solid foundations first, then implementation 6
7 In Reality Applications Data Model Query Language System As research evolves, always revisit all three Solid foundations first, then implementation 7
8 Disclaimer This principled approach works for me Other s mileage may vary 8
9 Bulk of Talk Challenges of developing new data models and query languages that are 1) Well-defined 2) Understandable 3) Sufficiently expressive (and not more) 4) Similar to existing models & languages 5) Implementable 9
10 Bulk of Talk Challenges of developing new (1) data models and (2) query languages that are 1) Well-defined 2) Understandable 3) Sufficiently expressive (and not more) 4) Similar to existing models & languages 5) Implementable 10
11 Creating a New Data Model Examples from my own experience Semistructured data Data streams Uncertain data Harder Hardest Hard 11
12 Data Model Challenge #1 Semistructured Data No schema fixed in advance Data is self-describing Irregularity is OK Motivated by Data Integration, Web 12
13 The Object Exchange Model (OEM) 1 DBGroup Member 2 Name Name Project Member Project Member Member Project 6 Project Office Name Age Office Age Office Title Title 7 "Clark" 8 "Smith" "Gates 252" "Jones" "Lore" 16 "Tsimmis" Building Room Building Room "CIS" "411" "Gates"
14 XML Element tags 1 DBGroup Root element Subelements Member 2 Name Name Project Member Project Member Member Project 6 Project Office Name Age Office Age Office Title Title 7 "Clark" 8 "Smith" "Gates 252" "Jones" "Lore" 16 "Tsimmis" Building Room Building Room Text Values 17 "CIS" 18 "411" 19 "Gates" Attributes 14
15 Data Model Challenge #2 Data Streams Back to fixed-schema relational But data arrives as continuous, unbounded streams Data may (or may not) be timestamped; order may (or may not) be relevant 15
16 Model for Data Streams A data stream is an unbounded sequence of [tuple timestamp] pairs Temperature Sensor 1: [(72) 2:05] [(75) 2:20] [(74) 2:21] [(74) 2:24] [(81) 2:45] Temperature Sensor 2: [(73) 2:03] [(76) 2:20] [(73) 2:22] [(75) 2:22] [(79) 2:40] 16
17 Model for Data Streams A data stream is an unbounded sequence of [tuple timestamp] pairs Temperature Sensor 1: [(72) 2:05] [(75) 2:20] [(74) 2:21] [(74) 2:24] [(81) 2:45] Temperature Sensor 2: [(73) 2:03] [(76) 2:20] [(73) 2:22] [(75) 2:22] [(79) 2:40] Duplicate timestamps in streams? If yes, is order relevant? 17
18 Model for Data Streams A data stream is an unbounded sequence of [tuple timestamp] pairs Temperature Sensor 1: [(72) 2:05] [(75) 2:20] [(74) 2:21] [(74) 2:24] [(81) 2:45] Temperature Sensor 2: [(73) 2:03] [(76) 2:20] [(73) 2:22] [(75) 2:22] [(79) 2:40] Are timestamps coordinated across streams? Duplicates? Order relevant? 18
19 Model for Data Streams A data stream is an unbounded sequence of [tuple timestamp] pairs Temperature Sensor 1: [(72) 2:05] [(75) 2:20] [(74) 2:21] [(74) 2:24] [(81) 2:45] Temperature Sensor 2: [(76) 2:20] [(76) 2:20] [(73) 2:22] [(75) 2:22] [(79) 2:40] What about absolute duplicates? 19
20 Model for Data Streams A data stream is an unbounded sequence of [tuple timestamp] pairs Temperature Sensor 1: [(72) 2:05] [(75) 2:20] [(74) 2:21] [(74) 2:24] [(81) 2:45] Temperature Sensor 2: [(73) 2:03] [(76) 2:20] [(73) 2:22] [(75) 2:22] [(79) 2:40] Sample Query (continuous) Average discrepancy between sensors Result depends heavily on details of model 20
21 Data Model Challenge #3 Uncertain Data Tuples may have alternative possible values Tuple presence may be uncertain Uncertainty may have associated confidence (probability) values 21
22 Uncertain Data: Semantics Tuples may have alternative possible values Tuple presence may be uncertain Uncertainty may have associated confidence (probability) values An uncertain database represents a set of possible certain databases (possible-instances, possible-worlds) With (optionally) a probability for each one 22
23 Uncertain Data: Simple Example Jennifer attends a conference on Monday Hector attends on Monday, Tuesday, or not at all person day person Jennifer Monday Jennifer Monday person day Hector Monday Hector day Tuesday Jennifer Instance1 Instance2 Instance3 Monday Representation: person day Jennifer Monday Hector Monday Tuesday? 23
24 Models for Uncertain Data A representation scheme (hereafter model) for uncertain data is well-defined if we know how to map any database in the model to its set of possible-instances A model is complete if every set of possibleinstances can be represented Alternative values +? s (+ confidences) model is well-defined but not complete 24
25 Incompleteness Example person day Jennifer Monday Hector Tuesday Instance1 person day Jennifer Monday Instance2 person day Hector Tuesday Instance3 person Jennifer Hector day Monday Tuesday?? Generates 4 th instance: empty relation 25
26 Completeness vs. Closure All sets-of-instances Op2 Representable sets-of-instances Op1 Completeness blue=pink Closure arrow stays in blue An incomplete model may still be interesting if it s expressive enough and closed under all useful operations 26
27 More on Operations and Closure Relational operation Op on uncertain database D D implementation of Op D Closure D always exists possible instances D 1, D 2,, D n Op on each instance representation of instances Op(D 1 ), Op(D 2 ),, Op(D n ) 27
28 Non-Closure Example person Jennifer Hector day Monday Monday day Monday weather rain shine person Jennifer Hector weather rain shine rain shine Four possible-instances Should be two Various solutions Most obvious & general: add constraints of some type 28
29 Original Challenge Develop new data models [and query languages] that are 1) Well-defined 2) Understandable 3) Sufficiently expressive (and not more) 4) Similar to existing models & languages 5) Implementable 29
30 Original Challenge Develop new data models [and query languages] that are 1) Well-defined 2) Understandable 3) Sufficiently expressive (and not more) 4) Similar to existing models & languages 5) Implementable 30
31 Exploring Space of Models Only complete model Closure properties Relative expressiveness Only understandable models What a mess! R Possible models 31
32 Fairy-Tale Ending Added lineage to simple uncertainty model We wanted lineage in our model & system anyway Representation became complete 32
33 Principled DB Systems Research Data Model Query Language System 33
34 Query Language Design Notoriously difficult to publish But potential for huge long-term impact Semantics can be surprisingly tricky 34
35 Query Language Design Warm-up SQL Examples from my own experience Active databases Semistructured data Data streams Uncertain data Hardest Hardest Hardest Hardest 35
36 SQL: A Dubious Beginning Good: SQL is a declarative language Semantics independent of execution model Bad: Not everyone thinks of it that way For some, only understanding is execution model 36
37 SQL: A Dubious Beginning Good: SQL is a declarative language Semantics independent of execution model Bad: Not everyone thinks of it that way For some, only understanding is execution model Example: Add 5 to smallest values in table T UPDATE T SET A = A+5 WHERE A <= ALL (SELECT A FROM T) 37
38 SQL: A Dubious Beginning Good: SQL is a declarative language Semantics independent of execution model Bad: Not everyone thinks of it that way For some, only understanding is execution model Example: Group-By (non)restrictions SELECT day,count(*) FROM Attends GROUP BY day 38
39 SQL: A Dubious Beginning Good: SQL is a declarative language Semantics independent of execution model Bad: Not everyone thinks of it that way For some, only understanding is execution model Example: Group-By (non)restrictions SELECT day,count(*),person FROM Attends GROUP BY day 39
40 SQL: A Dubious Beginning Good: SQL is a declarative language Semantics independent of execution model Bad: Not everyone thinks of it that way For some, only understanding is execution model Query Language System 40
41 Query Language Challenge #1 Active Databases Install active rules (triggers) in the database WHEN action occurs IF condition holds THEN perform additional actions Insert/Delete/Update SQL 41
42 Tricky Semantics Example Trigger 1: WHEN X makes sale > 500 THEN increase X s salary by 1000 Trigger 2: WHEN average salary increases > 10% THEN increase everyone s salary by 500 Inserts: Sale(Mary,600) Sale(Mary,800) Sale(Mary,550) How many increases for Mary? If each causes average > 10%, how many global raises? What if global raise causes average > 10%? 42
43 The Lure of the System Transition tables, Conflicts, Confluence, We finished our rule system ages ago Write Code! 43
44 Happy Ending Semantics got defined At least on our side of the SF Bay System got built Even SQL triggers standard ended up reasonable Primarily through restrictions 44
45 Query Language Challenge #2 Semistructured Data Query: SELECT Student WHERE Advisor= Widom <Student> <ID> 123 </ID> <Name> Susan </Name> <Major> CS </Major> </Student> <Student> </Student> Error? Empty result? Warning? 45
46 Semistructured Data Query: SELECT Student WHERE Advisor= Widom <Student> <ID> 123 </ID> <Name> Susan </Name> <Major> CS </Major> </Student> <Student> </Student> Lore Empty result Warning 46
47 Semistructured Data Query: SELECT Student WHERE Advisor= Widom <Student> <ID> 123 </ID> <Name> Susan </Name> <Major> CS </Major> </Student> <Student> </Student> XQuery Empty result 47
48 Semistructured Data Query: SELECT Student WHERE Advisor= Widom <Student> <ID> 123 </ID> <Advisor> Garcia </Advisor> <Advisor> Widom </Advisor> </Student> <Student> </Student> Lore Implicit 48
49 Semistructured Data Query: SELECT Student WHERE Advisor= Widom <Student> <ID> 123 </ID> <Advisor> Garcia </Advisor> <Advisor> Widom </Advisor> </Student> <Student> </Student> XQuery Implicit Variety of = 49
50 Query Language Challenge #3 Data Streams 50
51 Data Streams Temperature Sensor: [(72) 2:00] [(74) 2:00] [(76) 2:00] [(60) 8:00] [(58) 8:00] [(56) 8:00] Query (continuous): Average of most recent three readings 51
52 Data Streams Temperature Sensor: [(72) 2:00] [(74) 2:00] [(76) 2:00] [(60) 8:00] [(58) 8:00] [(56) 8:00] Query (continuous): Average of most recent three readings System A: 74, 58 52
53 Data Streams Temperature Sensor: [(72) 2:00] [(74) 2:00] [(76) 2:00] [(60) 8:00] [(58) 8:00] [(56) 8:00] Query (continuous): Average of most recent three readings System A: 74, 58 System B: 74, 70, 64.7, 58 53
54 Original Challenge Develop new [data models and] query languages that are 1) Well-defined 2) Understandable 3) Sufficiently expressive (and not more) 4) Similar to existing models & languages 5) Implementable 54
55 Original Challenge Develop new [data models and] query languages that are 1) Well-defined 2) Understandable 3) Sufficiently expressive (and not more) 4) Similar to existing models & languages 5) Implementable No happy ending (yet) 55
56 The It s Just SQL Trap Tables: Attends(person,day) Weather(day,temp) Query: Hector chill factor SELECT W.temp FROM Attends A, Weather W WHERE A.day = W.day AND A.person= Hector person day Jennifer Monday 0.8 Tuesday 0.2 Hector Monday 0.3 Tuesday 0.7 day temp Monday [60,65] Tuesday [62,70] 56
57 The It s Just SQL Trap Two weighted temperature ranges? Single expected value across (weighted) days/ranges? What if Hector tuple has a?? SELECT W.temp FROM Attends A, Weather W WHERE A.day = W.day AND A.person= Hector person day Jennifer Monday 0.8 Tuesday 0.2 Hector Monday 0.3 Tuesday 0.7 day temp Monday [60,65] Tuesday [62,70] 57
58 Original Challenge Develop new [data models and] query languages that are 1) Well-defined 2) Understandable 3) Sufficiently expressive (and not more) 4) Similar to existing models & languages 5) Implementable 58
59 Original Challenge Develop new [data models and] query languages that are Just defining SQL over a new data model can be tricky and complex (and fun) 1) Well-defined 2) Understandable Semistructured data 3) Sufficiently Data expressive streams (and not more) 4) Similar to Uncertain existing models data & languages 5) Implementable <Insert future model here> 59
60 Just SQL on Data Streams Stream Orders(cust,item) Table Items(item,store,price) SELECT * FROM Orders O, Items I WHERE O.item = I.item Is result a stream, relation, or something else? Is output ordering relevant for duplicate items? What happens if table is updated? 60
61 Original Challenge Develop new [data models and] query languages that are 1) Well-defined 2) Understandable 3) Sufficiently expressive (and not more) 4) Similar to existing models & languages 5) Implementable 61
62 Exploit Relational Whenever Possible Uncertain data semantics of query Q possible instances D D 1, D 2,, D n (implementation) Q on each instance Result representation of instances Q(D 1 ), Q(D 2 ),, Q(D n ) 30 years of refinement 62
63 Exploit Relational Whenever Possible Semantics of stream queries Streams Window Relations Relational Istream / Dstream 30 years of refinement 63
64 Exploit Relational Whenever Possible Active databases: transition tables Lore: semantics based on OQL 3 years of refinement 64
65 The Research Triple Data Model Query Language System Impact 65
66 Current Research Data TRIO Uncertainty Lineage 66
67 Current Research Data TRIO Uncertainty Lineage TRIO TRIO Data Uncertainty Data Uncertainty Lineage Lineage 67
68 Current Research Data TRIO Uncertainty Data TRIO DUO Uncertainty Lineage QUATRO TRIO Data Uncertainty Lineage Lineage 68
69 Current Research Data TRIO Uncertainty Data Lineage DUO Uncertainty Lineage Data QUATRO Lineage Uncertainty Entity Resolution 69
70 DUO: A System for Data and Lineage Lineage: Where data came from and how it evolved over time Critical for scientific data applications But useful for many others too Everyone wants it, nobody knows quite what it is Data Model Query Language System 70
71 DUO Features Lineage on query results Lineage to external data sources Processing-based lineage Rich constructs for querying lineage User-defined lineage creation and modification Lineage-assisted error detection and correction Efficient lineage management compaction, approximation 71
72 QUATRO Entity-Resolution Match & merge multiple data records representing same real-world entities Deduplication, record linkage, reference reconciliation, Premises Uncertainty + Lineage + DBMS Ideal platform for Entity-Resolution Data + Uncertainty + Lineage + E-R (Ideal?) platform for Data Integration 1. Trio-ER 2. Quatro 72
73 Loop until fixed-point: Trio-ER SELECT Combine * FROM Hotel h1 ERJ Hotel h2 ON EXISTS [h1.zip = h2.zip AND (StrComp(h1.URL,h2.URL) > 0.95 OR StrComp(h1.name,h2.name) > 0.95)] Entity-Resolution Data Model Query Language System 73
74 Principled Database Systems Research Data Model Query Language System Impact 74
Representation Formalisms for Uncertain Data
Representation Formalisms for Uncertain Data Jennifer Widom with Anish Das Sarma Omar Benjelloun Alon Halevy and other participants in the Trio Project !! Warning!! This work is preliminary and in flux
More informationQuery Processing over Data Streams. Formula for a Database Research Project. Following the Formula
Query Processing over Data Streams Joint project with Prof. Rajeev Motwani and a group of graduate students stanfordstreamdatamanager Formula for a Database Research Project Pick a simple but fundamental
More informationDatabase Systems. Sven Helmer. Database Systems p. 1/567
Database Systems Sven Helmer Database Systems p. 1/567 Chapter 1 Introduction and Motivation Database Systems p. 2/567 Introduction What is a database system (DBS)? Obviously a system for storing and managing
More informationDatabases and Information Retrieval Integration TIETS42. Kostas Stefanidis Autumn 2016
+ Databases and Information Retrieval Integration TIETS42 Autumn 2016 Kostas Stefanidis kostas.stefanidis@uta.fi http://www.uta.fi/sis/tie/dbir/index.html http://people.uta.fi/~kostas.stefanidis/dbir16/dbir16-main.html
More informationIntroduction. Example Databases
Introduction Example databases Overview of concepts Why use database systems Example Databases University Data: departments, students, exams, rooms,... Usage: creating exam plans, enter exam results, create
More informationOutline. Quick Introduction to Database Systems. Data Manipulation Tasks. What do they all have in common? CSE142 Wi03 G-1
Outline Quick Introduction to Database Systems Why do we need a different kind of system? What is a database system? Separating the what the how: The relational data model Querying the databases: SQL May
More informationChapter 2 Introduction to Relational Models
CMSC 461, Database Management Systems Spring 2018 Chapter 2 Introduction to Relational Models These slides are based on Database System Concepts book and slides, 6th edition, and the 2009 CMSC 461 slides
More informationIntroduction to Data Management. Lecture #1 (Course Trailer )
Introduction to Data Management Lecture #1 (Course Trailer ) Instructor: Mike Carey mjcarey@ics.uci.edu Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1 Today s Topics v Welcome to one
More informationDatabases TDA357/DIT620. Niklas Broberg
Databases TDA357/DIT620 Niklas Broberg niklas.broberg@chalmers.se 1 What s a database anyway? 2 A database is Structured Persistant Changable Digital True to integrity constraints 3 DBMS Database == Data
More informationIntroduction to Data Management. Lecture #1 (Course Trailer )
Introduction to Data Management Lecture #1 (Course Trailer ) Instructor: Mike Carey mjcarey@ics.uci.edu Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1 Today s Topics! Welcome to my biggest
More informationIntroduction to Data Management. Lecture #1 (Course Trailer ) Instructor: Chen Li
Introduction to Data Management Lecture #1 (Course Trailer ) Instructor: Chen Li 1 Today s Topics v Welcome to one of my biggest classes ever! v Read (and live by) the course wiki page: http://www.ics.uci.edu/~cs122a/
More informationCS 245: Database System Principles
CS 245: Database System Principles Notes 01: Introduction Peter Bailis CS 245 Notes 1 1 This course pioneered by Hector Garcia-Molina All credit due to Hector All mistakes due to Peter CS 245 Notes 1 2
More informationCourse Book Academic Year
Nawroz University College of Computer and IT Department of Computer Science Stage: Third Course Book Academic Year 2015-2016 Subject Advanced Database No. of Hours No. of Units 6 Distribution of Marks
More informationIntroduction to Data Management CSE 344. Lecture 1: Introduction
Introduction to Data Management CSE 344 Lecture 1: Introduction CSE 344 - Winter 2014 1 Staff Instructor: Sudeepa Roy sudeepa@cs.washington.edu Office hours: Wednesdays, 3:30-4:20, in CSE 344 (my office)
More informationOverview of the Class and Introduction to DB schemas and queries. Lois Delcambre
Overview of the Class and Introduction to DB schemas and queries Lois Delcambre 1 CS 386/586 Introduction to Databases Instructor: Lois Delcambre lmd@cs.pdx.edu 503 725-2405 TA: TBA Office Hours: Immediately
More informationIntroduction to Databases Fall-Winter 2009/10. Syllabus
Introduction to Databases Fall-Winter 2009/10 Syllabus Werner Nutt Syllabus Lecturer Werner Nutt, nutt@inf.unibz.it, Room TRA 2.01 Office hours: Thursday, 16:00 18:00 (If you want to meet up with me, send
More informationIntroduction to Database Systems. Fundamental Concepts
Introduction to Database Systems Fundamental Concepts Werner Nutt 1 A DBMS Presents Programmers and Users with a Simplified Environment Database System Users/Programmers Queries / Application Programs
More informationCSCI1270 Introduction to Database Systems
CSCI1270 Introduction to Database Systems with thanks to Prof. George Kollios, Boston University Prof. Mitch Cherniack, Brandeis University Prof. Avi Silberschatz, Yale University 1.1 What is a Database
More informationCMPSCI 645 Database Design & Implementation
Welcome to CMPSCI 645 Database Design & Implementation Instructor: Gerome Miklau Overview of Databases Gerome Miklau CMPSCI 645 Database Design & Implementation UMass Amherst Jan 19, 2010 Some slide content
More informationData Streams. Building a Data Stream Management System. DBMS versus DSMS. The (Simplified) Big Picture. (Simplified) Network Monitoring
Building a Data Stream Management System Prof. Jennifer Widom Joint project with Prof. Rajeev Motwani and a team of graduate students http://www-db.stanford.edu/stream stanfordstreamdatamanager Data Streams
More informationCourse Introduction. CSC343 - Introduction to Databases Manos Papagelis
Course Introduction CSC343 - Introduction to Databases Manos Papagelis Thanks to Ryan Johnson, John Mylopoulos, Arnold Rosenbloom and Renee Miller for material in these slides Overview 2 What is a database?
More informationDataspaces: A New Abstraction for Data Management. Mike Franklin, Alon Halevy, David Maier, Jennifer Widom
Dataspaces: A New Abstraction for Data Management Mike Franklin, Alon Halevy, David Maier, Jennifer Widom Today s Agenda Why databases are great. What problems people really have Why databases are not
More informationBBM371- Data Management. Lecture 1: Course policies, Introduction to DBMS
BBM371- Data Management Lecture 1: Course policies, Introduction to DBMS 26.09.2017 Today Introduction About the class Organization of this course Introduction to Database Management Systems (DBMS) About
More informationCS425 Fall 2016 Boris Glavic Chapter 1: Introduction
CS425 Fall 2016 Boris Glavic Chapter 1: Introduction Modified from: Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Textbook: Chapter 1 1.2 Database Management System (DBMS)
More informationPANDA A System for Provenance and Data. Jennifer Widom Stanford University
PANDA A System for Provenance and Data Stanford University Example: Sales Prediction Workflow CustList 1 Europe CustList 2... Dedup Split Union Predict ItemAgg CustList n-1 CustList n USA Catalog Items
More informationChapter 1: Introduction
Chapter 1: Introduction Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Outline The Need for Databases Data Models Relational Databases Database Design Storage Manager Query
More informationCSE 132A. Database Systems Principles
CSE 132A Database Systems Principles Prof. Victor Vianu 1 Data Management An evolving, expanding field: Classical stand-alone databases (Oracle, DB2, SQL Server) Computer science is becoming data-centric:
More informationAdministration Naive DBMS CMPT 454 Topics. John Edgar 2
Administration Naive DBMS CMPT 454 Topics John Edgar 2 http://www.cs.sfu.ca/coursecentral/454/johnwill/ John Edgar 4 Assignments 25% Midterm exam in class 20% Final exam 55% John Edgar 5 A database stores
More informationDatabases. Jörg Endrullis. VU University Amsterdam
Databases Jörg Endrullis VU University Amsterdam Databases A database (DB) is a collection of data with a certain logical structure a specific semantics a specific group of users Databases A database (DB)
More informationRelational model continued. Understanding how to use the relational model. Summary of board example: with Copies as weak entity
COS 597A: Principles of Database and Information Systems Relational model continued Understanding how to use the relational model 1 with as weak entity folded into folded into branches: (br_, librarian,
More informationIntroduction to Database Systems CSE 444. Lecture 1 Introduction
Introduction to Database Systems CSE 444 Lecture 1 Introduction 1 About Me: General Prof. Magdalena Balazinska (magda) At UW since January 2006 PhD from MIT Born in Poland Grew-up in Poland, Algeria, and
More informationSRM UNIVERSITY FACULTY OF ENGINEERING AND TECHNOLOGY SCHOOL OF COMPUTING DEPARTMENT OF CSE COURSE PLAN
SRM UNIVERSITY FACULTY OF ENGINEERING AND TECHNOLOGY SCHOOL OF COMPUTING DEPARTMENT OF CSE COURSE PLAN Course Code : CS0304 Course Title : Data Base Management Systems Semester : VI Course Time : Dec 2012-
More informationTrio: A System for Data, Uncertainty, and Lineage. Jennifer Widom Stanford University
Trio: A System for Data, Uncertainty, and Lineage Jennifer Widom Stanford University Motivation Many applications involve data that is uncertain (approximate, probabilistic, inexact, incomplete, imprecise,
More informationIntroduction. Who wants to study databases?
Introduction Example databases Overview of concepts Why use database systems Who wants to study databases? What is the use of all the courses I have taken so far? This course shows very concrete how CS
More informationTrio: A System for Data, Uncertainty, and Lineage. Search stanford trio
Trio: A System for Data, Uncertainty, and Lineage Search stanford trio 2 Stanford Report: March 06 Trio 1. Data Student #123 is majoring in Econ: (123,Econ) Major 2. Uncertainty Student #123 is majoring
More informationSystems Infrastructure for Data Science. Web Science Group Uni Freiburg WS 2012/13
Systems Infrastructure for Data Science Web Science Group Uni Freiburg WS 2012/13 Data Stream Processing Topics Model Issues System Issues Distributed Processing Web-Scale Streaming 3 Data Streams Continuous
More informationIntroduction to Database Systems CSE 444. Lecture #1 March 26, 2007
Introduction to Database Systems CSE 444 Lecture #1 March 26, 2007 1 About Me Dan Suciu: Joined the department in 2000 Before that: Bell Labs, AT&T Labs Research: Past: XML and semi-structured data: Query
More informationDatabase Systems ( 資料庫系統 ) Practicum in Database Systems ( 資料庫系統實驗 ) 9/20 & 9/21, 2006 Lecture #1
Database Systems ( 資料庫系統 ) Practicum in Database Systems ( 資料庫系統實驗 ) 9/20 & 9/21, 2006 Lecture #1 1 Course Goals First course in database systems. Main Course (3 units) - Learn Use a relational database
More informationWhere Are We? Next Few Lectures. Integrity Constraints Motivation. Constraints in E/R Diagrams. Keys in E/R Diagrams
Where Are We? Introduction to Data Management CSE 344 Lecture 15: Constraints We know quite a bit about using a DBMS Start with real-world problem, design ER diagram From ER diagram to relations -> conceptual
More informationIntroduction to Databases Fall-Winter 2010/11. Syllabus
Introduction to Databases Fall-Winter 2010/11 Syllabus Werner Nutt Syllabus Lecturer Werner Nutt, nutt@inf.unibz.it, Room POS 2.09 Office hours: Tuesday, 14:00 16:00 and by appointment (If you want to
More informationIntroduction to Data Management. Lecture #1 (The Course Trailer )
Introduction to Data Management Lecture #1 (The Course Trailer ) Instructor: Mike Carey mjcarey@ics.uci.edu Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1 Today s Topics v Welcome to
More informationRelational Database Design. Announcements. Database (schema) design. CPS 216 Advanced Database Systems. DB2 accounts have been set up
Relational Database Design CPS 216 Advanced Database Systems Announcements 2 DB2 accounts have been set up Let me know if you have not received an email from me regarding your account Recitation session
More informationProject Assignment 2 (due April 6 th, 2016, 4:00pm, in class hard-copy please)
Virginia Tech. Computer Science CS 4604 Introduction to DBMS Spring 2016, Prakash Project Assignment 2 (due April 6 th, 2016, 4:00pm, in class hard-copy please) Reminders: a. Out of 100 points. Contains
More informationExploiting Predicate-window Semantics over Data Streams
Exploiting Predicate-window Semantics over Data Streams Thanaa M. Ghanem Walid G. Aref Ahmed K. Elmagarmid Department of Computer Sciences, Purdue University, West Lafayette, IN 47907-1398 {ghanemtm,aref,ake}@cs.purdue.edu
More information745: Advanced Database Systems
745: Advanced Database Systems Yanlei Diao University of Massachusetts Amherst Outline Overview of course topics Course requirements Database Management Systems 1. Online Analytical Processing (OLAP) vs.
More informationWhat s a database anyway?
Lecture 1 Databases TDA357/DIT620 Pablo Picazo pablop@chalmers.se What s a database anyway? Structured Persistant Changable Digital A database is True to integrity constraints DBMS Database == Data collection
More informationIntroduction to Database S ystems Systems CSE 444 Lecture 1 Introduction CSE Summer
Introduction to Database Systems CSE 444 Lecture 1 Introduction 1 Staff Instructor: Hal Perkins CSE 548, perkins@cs.washington.edu Office hours: labs tba, office drop-ins and appointments welcome TA: David
More informationCS 4604: Introduction to Database Management Systems. B. Aditya Prakash Lecture #2: The Relational Model and Relational Algebra
CS 4604: Introduction to Database Management Systems B. Aditya Prakash Lecture #2: The Relational Model and Relational Algebra Course Outline Weeks 1 4: Query/ Manipulation Languages and Data Modeling
More informationA REVIEW OF BASIC KNOWLEDGE OF DATABASE SYSTEM
A REVIEW OF BASIC KNOWLEDGE OF DATABASE SYSTEM 1. Relational model I still remember it is my first time to read such an academic article in detail. I have to say that it s really a hard time for me at
More informationCan you name one application that does not need any data? Can you name one application that does not need organized data?
Introduction Why Databases? Can you name one application that does not need any data? No, a program itself is data Can you name one application that does not need organized data? No, programs = algorithms
More informationUDP Packet Monitoring with Stanford Data Stream Manager
UDP Packet Monitoring with Stanford Data Stream Manager Nadeem Akhtar #1, Faridul Haque Siddiqui #2 # Department of Computer Engineering, Aligarh Muslim University Aligarh, India 1 nadeemalakhtar@gmail.com
More informationChapter 1: Introduction
Chapter 1: Introduction Slides are slightly modified by F. Dragan Database System Concepts, 5th Ed. See www.db-book.com for conditions on re-use Chapter 1: Introduction Purpose of Database Systems View
More informationChapter 1: Introduction. Chapter 1: Introduction
Chapter 1: Introduction Database System Concepts, 5th Ed. See www.db-book.com for conditions on re-use Chapter 1: Introduction Purpose of Database Systems View of Data Database Languages Relational Databases
More informationCredits. Principles of Database Management Systems. Isn t Implementing a Database System Simple? Megatron 3000 Implementation Details
Principles of Database Management Systems (Tietokannanhallintajärjestelmät) Pekka Kilpeläinen Fall 2001 Credits Based on Stanford CS 245 lecture notes by original authors Hector Garcia- Molina, Jeff Ullman
More informationSYLLABUS ADMIN DATABASE SYSTEMS I WEEK 2 THE ENTITY-RELATIONSHIP MODEL. Assignment #2 changed. A2Q1 moved to A3Q1
DATABASE SYSTEMS I WEEK 2 THE ENTITY-RELATIONSHIP MODEL Class Time and Location: Tue 14:30-16:20 AQ3005 Thu 14:30-15:20 AQ3003 Course Website: http://www.cs.sfu.ca/cc/354/rfrank/ Instructor: Richard Frank,
More informationLegacy SQL is a terrible language -- Date paper in 1985
Legacy SQL is a terrible language -- Date paper in 1985 Animals (name, species, age, feeding_time, cid, kid) Original idea (1974) Block Block Block e.g. find the name of Freddies s keeper select name from
More informationData Integration Systems
Data Integration Systems Haas et al. 98 Garcia-Molina et al. 97 Levy et al. 96 Chandrasekaran et al. 2003 Zachary G. Ives University of Pennsylvania January 13, 2003 CIS 650 Data Sharing and the Web Administrivia
More informationSQL: Part III. Announcements. Constraints. CPS 216 Advanced Database Systems
SQL: Part III CPS 216 Advanced Database Systems Announcements 2 Reminder: Homework #1 due in 12 days Reminder: reading assignment posted on Web Reminder: recitation session this Friday (January 31) on
More informationCAS CS 460/660 Introduction to Database Systems. Fall
CAS CS 460/660 Introduction to Database Systems Fall 2017 1.1 About the course Administrivia Instructor: George Kollios, gkollios@cs.bu.edu MCS 283, Mon 2:30-4:00 PM and Tue 1:00-2:30 PM Teaching Fellows:
More informationDatabases 1. Daniel POP
Databases 1 Daniel POP Week 1 Getting around New things will be thought both in lectures and in labs; don t miss them; they all matter for final examination! FinalGrade = (LectureGrade>=5 and LabGrade>=5)?
More informationDatabase Management Systems (CPTR 312)
Database Management Systems (CPTR 312) Preliminaries Me: Raheel Ahmad Ph.D., Southern Illinois University M.S., University of Southern Mississippi B.S., Zakir Hussain College, India Contact: Science 116,
More informationDatabase Management Systems Paper Solution
Database Management Systems Paper Solution Following questions have been asked in GATE CS exam. 1. Given the relations employee (name, salary, deptno) and department (deptno, deptname, address) Which of
More informationProbabilistic/Uncertain Data Management
Probabilistic/Uncertain Data Management 1. Dalvi, Suciu. Efficient query evaluation on probabilistic databases, VLDB Jrnl, 2004 2. Das Sarma et al. Working models for uncertain data, ICDE 2006. Slides
More informationChapter 1 Introduction
Chapter 1 Introduction Contents The History of Database System Overview of a Database Management System (DBMS) Three aspects of database-system studies the state of the art Introduction to Database Systems
More informationRelational Algebra Homework 0 Due Tonight, 5pm! R & G, Chapter 4 Room Swap for Tuesday Discussion Section Homework 1 will be posted Tomorrow
Relational Algebra R & G, Chapter 4 By relieving the brain of all unnecessary work, a good notation sets it free to concentrate on more advanced problems, and, in effect, increases the mental power of
More informationSchema-Agnostic Indexing with Azure Document DB
Schema-Agnostic Indexing with Azure Document DB Introduction Azure DocumentDB is Microsoft s multi-tenant distributed database service for managing JSON documents at Internet scale Multi-tenancy is an
More informationDatabase Management System. Fundamental Database Concepts
Database Management System Fundamental Database Concepts CONTENTS Basics of DBMS Purpose of DBMS Applications of DBMS Views of Data Instances and Schema Data Models Database Languages Responsibility of
More informationCS157a Fall 2018 Sec3 Home Page/Syllabus
CS157a Fall 2018 Sec3 Home Page/Syllabus Introduction to Database Management Systems Instructor: Chris Pollett Office: MH 214 Phone Number: (408) 924 5145 Email: chris@pollett.org Office Hours: MW 4:30-5:45pm
More informationCS143: Relational Model
CS143: Relational Model Book Chapters (4th) Chapters 1.3-5, 3.1, 4.11 (5th) Chapters 1.3-7, 2.1, 3.1-2, 4.1 (6th) Chapters 1.3-6, 2.105, 3.1-2, 4.5 Things to Learn Data model Relational model Database
More informationChapter 1: Introduction
Chapter 1: Introduction Chapter 1: Introduction Purpose of Database Systems Database Languages Relational Databases Database Design Data Models Database Internals Database Users and Administrators Overall
More informationCS145 Introduction. About CS145 Relational Model, Schemas, SQL Semistructured Model, XML
CS145 Introduction About CS145 Relational Model, Schemas, SQL Semistructured Model, XML 1 Content of CS145 Design of databases. E/R model, relational model, semistructured model, XML, UML, ODL. Database
More informationAnnouncements (January 20) Relational Database Design. Database (schema) design. Entity-relationship (E/R) model. ODL (Object Definition Language)
Announcements (January 20) 2 Relational Database Design Review for Codd paper due tonight via email Follow instructions on course Web site Reading assignment for next week (Ailamaki et al., VLDB 2001)
More informationD.Hemavathi & R.Venkatalakshmi, Assistant Professor, SRM University, Kattankulathur
DATABASE SYSTEMS IT 0303 5 TH Semester D.Hemavathi & R.Venkatalakshmi, Assistant Professor, SRM University, Kattankulathur School of Computing, Department of IT Unit 1: introduction 1 Disclaimer The contents
More informationGoals for Today. CS 133: Databases. Final Exam: Logistics. Why Use a DBMS? Brief overview of course. Course evaluations
Goals for Today Brief overview of course CS 133: Databases Course evaluations Fall 2018 Lec 27 12/13 Course and Final Review Prof. Beth Trushkowsky More details about the Final Exam Practice exercises
More informationIn This Lecture. The Relational Model. The Relational Model. Relational Data Structure. Unnamed and named tuples. New thing:scheme (and attributes)
Database Systems Lecture 3 Natasha Alechina In This Lecture Relational data integrity For more information Connolly and Begg chapter 3 E.F. Codd s paper `A Relational Model of Data for Large Shared Data
More informationDatabase Management Systems MIT Introduction By S. Sabraz Nawaz
Database Management Systems MIT 22033 Introduction By S. Sabraz Nawaz Recommended Reading Database Management Systems 3 rd Edition, Ramakrishnan, Gehrke Murach s SQL Server 2008 for Developers Any book
More informationCourse Logistics & Chapter 1 Introduction
CMSC 461, Database Management Systems Spring 2018 Course Logistics & Chapter 1 Introduction These slides are based on Database System Concepts book th edition, and the 2009 CMSC 461 slides by Dr. Kalpakis
More informationB.H.GARDI COLLEGE OF MASTER OF COMPUTER APPLICATION. Ch. 1 :- Introduction Database Management System - 1
Basic Concepts :- 1. What is Data? Data is a collection of facts from which conclusion may be drawn. In computer science, data is anything in a form suitable for use with a computer. Data is often distinguished
More informationDatabase!! Structured data collection!! Records!! Relationships. Enforces that data maintains certain consistency properties
Relational Databases Sam Madden Key ideas: Declarative programming Transactions Database Structured data collection Records Relationships Database management system (DBMS) Why? 1) Widely used 2) Several
More informationWho we are: Database Research - Provenance, Integration, and more hot stuff. Boris Glavic. Department of Computer Science
Who we are: Database Research - Provenance, Integration, and more hot stuff Boris Glavic Department of Computer Science September 24, 2013 Hi, I am Boris Glavic, Assistant Professor Hi, I am Boris Glavic,
More informationNot just UNIQUE: Exclusion Constraints. Jeff Davis Truviso, Inc.
Not just UNIQUE: Exclusion Constraints Jeff Davis Truviso, Inc. 1 Schedule Conflict One person can't be in two places at the same time Two people can't use an exclusive resource at the same time Business
More informationInformatics 1: Data & Analysis
Informatics 1: Data & Analysis Lecture 3: The Relational Model Ian Stark School of Informatics The University of Edinburgh Tuesday 24 January 2017 Semester 2 Week 2 https://blog.inf.ed.ac.uk/da17 Lecture
More informationCS 4604: Introduction to Database Management Systems. B. Aditya Prakash Lecture #5: Entity/Relational Models---Part 1
CS 4604: Introduction to Database Management Systems B. Aditya Prakash Lecture #5: Entity/Relational Models---Part 1 E/R: NOT IN BOOK! IMPORTANT: Follow only lecture slides for this topic! Differences
More informationCS425 Midterm Exam Summer C 2012
Q1) List five responsibilities of a database-management system. Q2) Fill in the terms in the right hand side of the table that match the description from the list below: Instance SQL Integrity constraints
More informationCourse and Contact Information. Course Description. Course Objectives
San Jose State University College of Science Department of Computer Science CS157A, Introduction to Database Management Systems, Sections 1 and 2, Fall2017 Course and Contact Information Instructor: Dr.
More informationIntroduction to Database Systems. Fundamental Concepts
Introduction to Database Systems Fundamental Concepts Werner Nutt 1 Characteristics of the DB Approach Insulation of application programs and data from each other Use of a ue to store the schema Support
More informationCS 525 Advanced Database Organization - Spring 2017 Mon + Wed 1:50-3:05 PM, Room: Stuart Building 111
CS 525 Advanced Database Organization - Spring 2017 Mon + Wed 1:50-3:05 PM, Room: Stuart Building 111 Instructor: Boris Glavic, Stuart Building 226 C, Phone: 312 567 5205, Email: bglavic@iit.edu Office
More informationDatabase Principle. Zhuo Wang Spring
Database Principle Zhuo Wang zhuowang@sjtu.edu.cn 2017 Spring Overview Data Database Database Management System Database System References Database System Concepts Abraham Silberschatz, Henry F. Korth,
More informationCSCI3030U Database Models
CSCI3030U Database Models CSCI3030U RELATIONAL MODEL SEMISTRUCTURED MODEL 1 Content Design of databases. relational model, semistructured model. Database programming. SQL, XPath, XQuery. Not DBMS implementation.
More informationSemistructured Data and Mediation
Semistructured Data and Mediation Prof. Letizia Tanca Dipartimento di Elettronica e Informazione Politecnico di Milano SEMISTRUCTURED DATA FOR THESE DATA THERE IS SOME FORM OF STRUCTURE, BUT IT IS NOT
More informationAnnouncement. Relational Database Design Part I. Keys. Relational model: review. Schema vs. data. More examples of keys
Announcement 2 Relational Database Design Part I CPS 116 troduction to Database Systems DB2 accounts created; change your password! Let me know if you have NOT received the email Pick up Gradiance registration
More informationMIT Database Management Systems Lesson 01: Introduction
MIT 22033 Database Management Systems Lesson 01: Introduction By S. Sabraz Nawaz Senior Lecturer in MIT, FMC, SEUSL Learning Outcomes At the end of the module the student will be able to: Describe the
More informationStorage and Indexing
CompSci 516 Data Intensive Computing Systems Lecture 5 Storage and Indexing Instructor: Sudeepa Roy Duke CS, Spring 2016 CompSci 516: Data Intensive Computing Systems 1 Announcement Homework 1 Due on Feb
More informationCSE 544 Principles of Database Management Systems. Lecture 4: Data Models a Never-Ending Story
CSE 544 Principles of Database Management Systems Lecture 4: Data Models a Never-Ending Story 1 Announcements Project Start to think about class projects If needed, sign up to meet with me on Monday (I
More informationCourse and Contact Information. Course Description. Course Objectives
San Jose State University College of Science Department of Computer Science CS157A, Introduction to Database Management Systems, Sections 1 and 2, Fall2016 Course and Contact Information Instructor: Dr.
More informationSymmetrically Exploiting XML
Symmetrically Exploiting XML Shuohao Zhang and Curtis Dyreson School of E.E. and Computer Science Washington State University Pullman, Washington, USA The 15 th International World Wide Web Conference
More informationMIS Database Systems.
MIS 335 - Database Systems http://www.mis.boun.edu.tr/durahim/ Ahmet Onur Durahim Learning Objectives Database systems concepts Designing and implementing a database application Life of a Query in a Database
More informationR & G Chapter 13. Implementation of single Relational Operations Choices depend on indexes, memory, stats, Joins Blocked nested loops:
Relational Query Optimization R & G Chapter 13 Review Implementation of single Relational Operations Choices depend on indexes, memory, stats, Joins Blocked nested loops: simple, exploits extra memory
More informationBIS Database Management Systems.
BIS 512 - Database Management Systems http://www.mis.boun.edu.tr/durahim/ Ahmet Onur Durahim Learning Objectives Database systems concepts Designing and implementing a database application Life of a Query
More informationSpring 2013 CS 122C & CS 222 Midterm Exam (and Comprehensive Exam, Part I) (Max. Points: 100)
Spring 2013 CS 122C & CS 222 Midterm Exam (and Comprehensive Exam, Part I) (Max. Points: 100) Instructions: - This exam is closed book and closed notes but open cheat sheet. - The total time for the exam
More information