On the Logical Modeling of ETL Processes

Size: px
Start display at page:

Download "On the Logical Modeling of ETL Processes"

Transcription

1 On the Logical Modeling of ETL Processes Panos Vassiliadis, Alkis Simitsis, Spiros Skiadopoulos National Technical University of Athens

2 What are Extract-Transform-Load (ETL) activities? Extract Transform & Clean Load Sources DSA DW Vassiliadis,Simitsis,Skiadopoulos. On the Logical Modeling of ETL Processes. CAiSE'02 Toronto,

3 Preliminaries Data types (name, domain) and Constants Attributes Schemata (finite lists of attributes) RecordSets (name, schema, extension) standing for tables and record files Function types (name, a finite list of parameter data types, and a single return data type). A function is an instance of a function type Vassiliadis,Simitsis,Skiadopoulos. On the Logical Modeling of ETL Processes. CAiSE'02 Toronto,

4 Elementary Activities Name Input Schemata Output Schema Rejections Schema Parameter List Output/Rejection Operational Semantics: an SQL statement describing the content passed to the output of the operation, with respect to its input. Data Provider/Consumer Semantics: is the output appended to the target, or overwritten? Are the input rows simply read, or removed from the sources as well? Vassiliadis,Simitsis,Skiadopoulos. On the Logical Modeling of ETL Processes. CAiSE'02 Toronto,

5 Naïve Example of Elementary Activity Parameters Input 1 Elementary Activity + Output Input 2! Rejected Rows Vassiliadis,Simitsis,Skiadopoulos. On the Logical Modeling of ETL Processes. CAiSE'02 Toronto,

6 Complex Example POPULATED_FIELD NotNull_A1 IN IN.A 1 OUT IN A 1 A 1 A 1 A 1 A 2 A 2 A 2 A 2 R A 3 A 3 A 3 A 3 A 4 A 4 A 4 A 4 Legend: OUT OPERATIONAL SEMANTICS R NotNull_A1 SK_A2 SELECT IN.A1 AS OUT.A1, IN.A2 AS OUT.A2,IN.A3 AS OUT.A3, IN.A4 AS OUT.A4 FROM NOTNULL_A1.IN WHERE IN.A1 NOT NULL Vassiliadis,Simitsis,Skiadopoulos. On the Logical Modeling of ETL Processes. CAiSE'02 Toronto,

7 A Typical ETL Scenario DS.PS_NEW 1.PKEY, DS.PS_OLD 1.PKEY SUPPKEY=1 DS.PS 1.PKEY, LOOKUP_PS.SKEY, SUPPKEY COST DATE DS.PS_NEW 1 DIFF 1 DS.PS 1 Add_SPK 1 SK 1 rejected $2 rejected A2EDate rejected U DS.PS_OLD 1 Log Log Log DS.PS_NEW 2.PKEY, DS.PS_OLD 2.PKEY SUPPKEY=2 DS.PS 2.PKEY, LOOKUP_PS.SKEY, SUPPKEY COST DATE=SYSDATE QTY>0 DS.PS_NEW 2 DIFF 2 DS.PS 2 Add_SPK 2 SK 2 NotNULL AddDate CheckQTY DS.PS_OLD 2 Log rejected Log rejected PKEY, DAY MIN(COST) S 1 _PARTSUPP FTP 1 DW.PARTSUPP Aggregate 1 V 1 DW.PARTSUPP.DATE, DAY PKEY, MONTH AVG(COST) S 2 _PARTSUPP FTP 2 TIME Aggregate 2 V 2 Vassiliadis,Simitsis,Skiadopoulos. On the Logical Modeling of ETL Processes. CAiSE'02 Toronto,

8 Template Activities Metamodel layer Elementary Activity ISA RecordSet Template layer Not Null Selection Aggregate IN Custom Layer myscenario mynot Null myselection Supplier LineItem PartSupp Vassiliadis,Simitsis,Skiadopoulos. On the Logical Modeling of ETL Processes. CAiSE'02 Toronto,

9 Template Activities Filters SELECTION (φ(a n+1,,a n+k )) UNIQUE VALUE (R.A) NOT NULL (R.A) DOMAIN MISMATCH (R.A,x low,x high ) PRIMARY KEY VIOLATION (R.A 1,,R.A K ) FOREIGN KEY VIOLATION ([R.A 1,,R.A K ],[S.A 1,,S.A K ]) Unary Transformations PUSH AGGREGATION ([A 1,,A k ],[γ 1 (A 1 ),,γ m (A m )]) PROJECTION([A 1,,A k ]) FUNCTION APPLICATION(f 1 (A 1 ),,f k (A k )) SURROGATE KEY ASSIGNMENT (R.PRODKEY,S.SKEY,x) TUPLE NORMALIZATION (R.A n+1,,r.a n+k,a CODE,A VALUE,h) TUPLE DENORMALIZATION (A CODE,A VALUE,R.A n+1,,r.a n+k,h ) Binary Transformations UNION(R,S) JOIN(R,S,[(A 1,B 1 ),,(A k,b k )]) DIFF(R,S,[(A 1,B 1 ),,(A k,b k )]) Vassiliadis,Simitsis,Skiadopoulos. On the Logical Modeling of ETL Processes. CAiSE'02 Toronto,

10 Example of Template Activity Activity type Input schema Parameters Output schema Rejection schema Equivalent SQL statement for the output of the activity DOMAIN MISMATCH (R.A,x low, x high ) [A 1,,A n ] [$FIELD,R.A] [$LOW,x low ] [$HIGH,x high ] [B 1,,B n ] [C 1,,C n ] SELECT A 1 AS B 1,, A n AS B n FROM R WHERE $FIELD IN [$LOW,$HIGH] Vassiliadis,Simitsis,Skiadopoulos. On the Logical Modeling of ETL Processes. CAiSE'02 Toronto,

11 The ARKTOS II project Administration of DW Metrics Logical Model for DW, Sources & Activities Logical Design Tuning Full Activity Description Physical Model for DW, Sources & Activities Conceptual Model for DW, Sources & Activities Reverse Engineering of Sources & Requirements Collection Software Construction Software & SW Metrics Vassiliadis,Simitsis,Skiadopoulos. On the Logical Modeling of ETL Processes. CAiSE'02 Toronto,

12 On-going & Future Work Conceptual model for the initial phases of ETL projects Logical model for activities Graph modeling for ETL scenarios [DMDW02] Optimization of ETL scenarios On-going development of a set of loosely coupled tools Vassiliadis,Simitsis,Skiadopoulos. On the Logical Modeling of ETL Processes. CAiSE'02 Toronto,

13 Conceptual Model S2. Li nei t ems S3. Li nei t ems {Dur at i on<4h} U DW. Li n e I t e ms Or d e r Ke y Or d e r Ke y Par t Key SK Par t Key Qt yfor Li ne1 Li ne Numbe r Qt yfor Li ne2 N Qu a l i t y Qt yfor Li ne3 Pr i ce Amount Key Amount N Di s c o u n t Ta x St at usforli ne1 N St at us St at usforli ne2 N Co mmi t Da t e St at usforli ne3 Shipdate Ameri can t o Eur ope a n Co mmi t Da t e Vassiliadis,Simitsis,Skiadopoulos. On the Logical Modeling of ETL Processes. CAiSE'02 Toronto,

14 Graph Modeling [DMDW 02] IN AddSPK1 OUT IN SK1 OUT PAR PAR PKEY PKEY PKEY PKEY PKEY QTY QTY QTY QTY QTY COST COST COST COST COST DATE DATE DATE DATE DATE Add_ const1 SUPPKEY SUPPKEY SUPPKEY DS.PS1 SKEY in out PKEY SOURCE 1 PKEY LU_PKEY LOOKUP_PS SOURCE SK LU_SOURCE LU_SKEY Vassiliadis,Simitsis,Skiadopoulos. On the Logical Modeling of ETL Processes. CAiSE'02 Toronto,

Logical Optimization of ETL Workflows

Logical Optimization of ETL Workflows Logical Optimization of ETL Workflows 1001 11001 111001 1001 1100011001 100011 100011 011001 10010 100101 10010 0101001 1001 1001 1001001 101001 010101001 010101001 1001001 1001001 1001001 1001001 1001

More information

Modeling ETL Activities as Graphs

Modeling ETL Activities as Graphs Modeling ETL Activities as Graphs Panos Vassiliadis, Alkis Simitsis, Spiros Skiadopoulos National Technical University of Athens, Dept. of Electrical and Computer Eng., Computer Science Division, Iroon

More information

Conceptual modeling for ETL

Conceptual modeling for ETL Conceptual modeling for ETL processes Author: Dhananjay Patil Organization: Evaltech, Inc. Evaltech Research Group, Data Warehousing Practice. Date: 08/26/04 Email: erg@evaltech.com Abstract: Due to the

More information

Graph-Based Modeling of ETL Activities with Multi-level Transformations and Updates

Graph-Based Modeling of ETL Activities with Multi-level Transformations and Updates Graph-Based Modeling of ETL Activities with Multi-level Transformations and Updates Alkis Simitsis 1, Panos Vassiliadis 2, Manolis Terrovitis 1, and Spiros Skiadopoulos 1 1 National Technical University

More information

Blueprints for ETL workflows

Blueprints for ETL workflows Blueprints for ETL workflows Panos Vassiliadis 1, Alkis Simitsis 2, Manolis Terrovitis 2, Spiros Skiadopoulos 2 1 University of Ioannina, Dept. of Computer Science, 45110, Ioannina, Greece pvassil@cs.uoi.gr

More information

Blueprints and Measures for ETL Workflows

Blueprints and Measures for ETL Workflows Blueprints and Measures for ETL Workflows Panos Vassiliadis 1, Alkis Simitsis 2, Manolis Terrovitis 2, Spiros Skiadopoulos 2 1 University of Ioannina, Dept. of Computer Science, Ioannina, Hellas pvassil@cs.uoi.gr

More information

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and education use, including for instruction at the authors institution

More information

Benchmarking ETL Workflows

Benchmarking ETL Workflows Benchmarking ETL Workflows Alkis Simitsis 1, Panos Vassiliadis 2, Umeshwar Dayal 1, Anastasios Karagiannis 2, Vasiliki Tziovara 2 1 HP Labs, Palo Alto, CA, USA, {alkis, Umeshwar.Dayal}@hp.com 2 University

More information

From conceptual design to performance optimization of ETL workflows: current state of research and open problems

From conceptual design to performance optimization of ETL workflows: current state of research and open problems The VLDB Journal (2017) 26:777 801 DOI 10.1007/s00778-017-0477-2 REGULAR PAPER From conceptual design to performance optimization of ETL workflows: current state of research and open problems Syed Muhammad

More information

ConceptualModelingforETLProcesses

ConceptualModelingforETLProcesses ConceptualModelingforETLProcesses PanosVassiliadis,AlkisSimitsis,SpirosSkiadopoulos 1 1 NationalTechnicalUniversityofAthens,Dept.ofElectricalandComputerEng., ComputerScienceDivision,IroonPolytechniou9,15773,Athens,Greece

More information

Relational Model, Relational Algebra, and SQL

Relational Model, Relational Algebra, and SQL Relational Model, Relational Algebra, and SQL August 29, 2007 1 Relational Model Data model. constraints. Set of conceptual tools for describing of data, data semantics, data relationships, and data integrity

More information

UNIVERSITY OF CINCINNATI

UNIVERSITY OF CINCINNATI UNIVERSITY OF CINCINNATI Date: 21-May-2010 I, Suma P Snehalatha, hereby submit this original work as part of the requirements for the degree of: Master of Science in Computer Engineering It is entitled:

More information

ARKTOS: A Tool For Data Cleaning and Transformation in Data Warehouse Environments

ARKTOS: A Tool For Data Cleaning and Transformation in Data Warehouse Environments ARKTOS: A Tool For Data Cleaning and Transformation in Data Warehouse Environments Panos Vassiliadis Zografoula Vagena Spiros Skiadopoulos Nikos Karayannidis Timos Sellis Knowledge and Database Systems

More information

Integrating ETL Processes from Information Requirements

Integrating ETL Processes from Information Requirements Integrating ETL Processes from Information Requirements Petar Jovanovic 1, Oscar Romero 1, Alkis Simitsis 2, and Alberto Abelló 1 1 Universitat Politècnica de Catalunya, BarcelonaTech Barcelona, Spain

More information

Arktos: towards the modeling, design, control and execution of ETL processes

Arktos: towards the modeling, design, control and execution of ETL processes Information Systems 26 (2001) 537 561 Arktos: towards the modeling, design, control and execution of ETL processes Panos Vassiliadis*, Zografoula Vagena, Spiros Skiadopoulos, Nikos Karayannidis, Timos

More information

Database Instance And Relational Schema Design A Fact Oriented Approach

Database Instance And Relational Schema Design A Fact Oriented Approach Database Instance And Relational Schema Design A Fact Oriented Approach File-oriented approaches create problems for organizations because of d) how master files maintain facts used by certain application

More information

CPS510 Database System Design Primitive SYSTEM STRUCTURE

CPS510 Database System Design Primitive SYSTEM STRUCTURE CPS510 Database System Design Primitive SYSTEM STRUCTURE Naïve Users Application Programmers Sophisticated Users Database Administrator DBA Users Application Interfaces Application Programs Query Data

More information

A Taxonomy of ETL Activities

A Taxonomy of ETL Activities Panos Vassiliadis University of Ioannina Ioannina, Greece pvassil@cs.uoi.gr A Taxonomy of ETL Activities Alkis Simitsis HP Labs Palo Alto, CA, USA alkis@hp.com Eftychia Baikousi University of Ioannina

More information

Data Warehouse and Data Mining

Data Warehouse and Data Mining Data Warehouse and Data Mining Lecture No. 03 Architecture of DW Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro Basic

More information

8) A top-to-bottom relationship among the items in a database is established by a

8) A top-to-bottom relationship among the items in a database is established by a MULTIPLE CHOICE QUESTIONS IN DBMS (unit-1 to unit-4) 1) ER model is used in phase a) conceptual database b) schema refinement c) physical refinement d) applications and security 2) The ER model is relevant

More information

Data Mining & Data Warehouse

Data Mining & Data Warehouse Data Mining & Data Warehouse Asso. Profe. Dr. Raed Ibraheem Hamed University of Human Development, College of Science and Technology Department of Information Technology 2016 2017 (1) Points to Cover Problem:

More information

What-if Analysis for Data Warehouse Evolution

What-if Analysis for Data Warehouse Evolution What-if Analysis for Data Warehouse Evolution George Papastefanatos 1, Panos Vassiliadis 2, Alkis Simitsis 3, and Yannis Vassiliou 1 1 National Technical University of Athens, Dept. of Electrical and Computer

More information

Database Applications (15-415)

Database Applications (15-415) Database Applications (15-415) The Relational Model Lecture 3, January 18, 2015 Mohammad Hammoud Today Last Session: The entity relationship (ER) model Today s Session: ER model (Cont d): conceptual design

More information

Chapter 9: Relational DB Design byer/eer to Relational Mapping Relational Database Design Using ER-to- Relational Mapping Mapping EER Model

Chapter 9: Relational DB Design byer/eer to Relational Mapping Relational Database Design Using ER-to- Relational Mapping Mapping EER Model Chapter 9: Relational DB Design byer/eer to Relational Mapping Relational Database Design Using ER-to- Relational Mapping Mapping EER Model Constructs to Relations Relational Database Design by ER- and

More information

Supplier-Parts-DB SNUM SNAME STATUS CITY S1 Smith 20 London S2 Jones 10 Paris S3 Blake 30 Paris S4 Clark 20 London S5 Adams 30 Athens

Supplier-Parts-DB SNUM SNAME STATUS CITY S1 Smith 20 London S2 Jones 10 Paris S3 Blake 30 Paris S4 Clark 20 London S5 Adams 30 Athens Page 1 of 27 The Relational Data Model The data structures of the relational model Attributes and domains Relation schemas and database schemas (decomposition) The universal relation schema assumption

More information

Relational Databases

Relational Databases Relational Databases Jan Chomicki University at Buffalo Jan Chomicki () Relational databases 1 / 49 Plan of the course 1 Relational databases 2 Relational database design 3 Conceptual database design 4

More information

EECS-3421a: Test #1 Design

EECS-3421a: Test #1 Design 2016 October 12 EECS-3421a: Test #1 1 of 14 EECS-3421a: Test #1 Design Electrical Engineering & Computer Science Lassonde School of Engineering York University Family Name: Given Name: Student#: EECS Account:

More information

Benchmark TPC-H 100.

Benchmark TPC-H 100. Benchmark TPC-H 100 vs Benchmark TPC-H Transaction Processing Performance Council (TPC) is a non-profit organization founded in 1988 to define transaction processing and database benchmarks and to disseminate

More information

Relational Model History. COSC 416 NoSQL Databases. Relational Model (Review) Relation Example. Relational Model Definitions. Relational Integrity

Relational Model History. COSC 416 NoSQL Databases. Relational Model (Review) Relation Example. Relational Model Definitions. Relational Integrity COSC 416 NoSQL Databases Relational Model (Review) Dr. Ramon Lawrence University of British Columbia Okanagan ramon.lawrence@ubc.ca Relational Model History The relational model was proposed by E. F. Codd

More information

Parser: SQL parse tree

Parser: SQL parse tree Jinze Liu Parser: SQL parse tree Good old lex & yacc Detect and reject syntax errors Validator: parse tree logical plan Detect and reject semantic errors Nonexistent tables/views/columns? Insufficient

More information

Complete. The. Reference. Christopher Adamson. Mc Grauu. LlLIJBB. New York Chicago. San Francisco Lisbon London Madrid Mexico City

Complete. The. Reference. Christopher Adamson. Mc Grauu. LlLIJBB. New York Chicago. San Francisco Lisbon London Madrid Mexico City The Complete Reference Christopher Adamson Mc Grauu LlLIJBB New York Chicago San Francisco Lisbon London Madrid Mexico City Milan New Delhi San Juan Seoul Singapore Sydney Toronto Contents Acknowledgments

More information

Rule-based Management of Schema Changes at ETL sources

Rule-based Management of Schema Changes at ETL sources Rule-based Management of Schema Changes at ETL sources George Papastefanatos 1, Panos Vassiliadis 2, Alkis Simitsis 3, Timos Sellis 1, Yannis Vassiliou 1 1 National Technical University of Athens, Athens,

More information

Introduction to Relational Databases. Introduction to Relational Databases cont: Introduction to Relational Databases cont: Relational Data structure

Introduction to Relational Databases. Introduction to Relational Databases cont: Introduction to Relational Databases cont: Relational Data structure Databases databases Terminology of relational model Properties of database relations. Relational Keys. Meaning of entity integrity and referential integrity. Purpose and advantages of views. The relational

More information

The Entity-Relationship Model. Overview of Database Design

The Entity-Relationship Model. Overview of Database Design The Entity-Relationship Model Chapter 2, Chapter 3 (3.5 only) Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1 Overview of Database Design Conceptual design: (ER Model is used at this stage.)

More information

Introduction to Data Management. Lecture #4 (E-R à Relational Design)

Introduction to Data Management. Lecture #4 (E-R à Relational Design) Introduction to Data Management Lecture #4 (E-R à Relational Design) Instructor: Mike Carey mjcarey@ics.uci.edu Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1 Announcements v Reminders:

More information

Contents. Database. Information Policy. C03. Entity Relationship Model WKU-IP-C03 Database / Entity Relationship Model

Contents. Database. Information Policy. C03. Entity Relationship Model WKU-IP-C03 Database / Entity Relationship Model Information Policy Database C03. Entity Relationship Model Code: 164323-03 Course: Information Policy Period: Spring 2013 Professor: Sync Sangwon Lee, Ph. D 1 Contents 01. Overview of Database Design 02.

More information

SQL and Incomp?ete Data

SQL and Incomp?ete Data SQL and Incomp?ete Data A not so happy marriage Dr Paolo Guagliardo Applied Databases, Guest Lecture 31 March 2016 SQL is efficient, correct and reliable 1 / 25 SQL is efficient, correct and reliable...

More information

The Basic (Flat) Relational Model. Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley

The Basic (Flat) Relational Model. Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley The Basic (Flat) Relational Model Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 3 Outline The Relational Data Model and Relational Database Constraints Relational

More information

Call: SAS BI Course Content:35-40hours

Call: SAS BI Course Content:35-40hours SAS BI Course Content:35-40hours Course Outline SAS Data Integration Studio 4.2 Introduction * to SAS DIS Studio Features of SAS DIS Studio Tasks performed by SAS DIS Studio Navigation to SAS DIS Studio

More information

Relational Data Model

Relational Data Model Relational Data Model 1. Relational data model Information models try to put the real-world information complexity in a framework that can be easily understood. Data models must capture data structure

More information

CONSTRAINTS AND UPDATES CHAPTER 3 (6/E) CHAPTER 5 (5/E)

CONSTRAINTS AND UPDATES CHAPTER 3 (6/E) CHAPTER 5 (5/E) 1 CONSTRAINTS AND UPDATES CHAPTER 3 (6/E) CHAPTER 5 (5/E) 3 LECTURE OUTLINE Constraints in Relational Databases Update Operations 4 SATISFYING INTEGRITY CONSTRAINTS Constraints are restrictions on the

More information

BIRKBECK (University of London)

BIRKBECK (University of London) BIRKBECK (University of London) BSc Examination for Internal Students School of Computer Science and Information Systems Database Management COIY028U - Course Unit Value: 1/2 May 2006 : Afternoon 14.30

More information

Fundamentals of Physical Design: State of Art

Fundamentals of Physical Design: State of Art Fundamentals of Physical Design: State of Art David Toman D. R. Cheriton School of Computer Science D. Toman (Waterloo) Physical Design: State of Art 1 / 13 Benefits of Database Technology 1 High-level/declarative

More information

Data Mining. Asso. Profe. Dr. Raed Ibraheem Hamed. University of Human Development, College of Science and Technology Department of CS (1)

Data Mining. Asso. Profe. Dr. Raed Ibraheem Hamed. University of Human Development, College of Science and Technology Department of CS (1) Data Mining Asso. Profe. Dr. Raed Ibraheem Hamed University of Human Development, College of Science and Technology Department of CS 2016 2017 (1) Points to Cover Problem: Heterogeneous Information Sources

More information

Relational Model: History

Relational Model: History Relational Model: History Objectives of Relational Model: 1. Promote high degree of data independence 2. Eliminate redundancy, consistency, etc. problems 3. Enable proliferation of non-procedural DML s

More information

D B M G. SQL language: basics. Managing tables. Creating a table Modifying table structure Deleting a table The data dictionary Data integrity

D B M G. SQL language: basics. Managing tables. Creating a table Modifying table structure Deleting a table The data dictionary Data integrity SQL language: basics Creating a table Modifying table structure Deleting a table The data dictionary Data integrity 2013 Politecnico di Torino 1 Creating a table Creating a table (1/3) The following SQL

More information

Chapter 4. The Relational Model

Chapter 4. The Relational Model Chapter 4 The Relational Model Chapter 4 - Objectives Terminology of relational model. How tables are used to represent data. Connection between mathematical relations and relations in the relational model.

More information

CS2300: File Structures and Introduction to Database Systems

CS2300: File Structures and Introduction to Database Systems CS2300: File Structures and Introduction to Database Systems Lecture 9: Relational Model & Relational Algebra Doug McGeehan 1 Brief Review Relational model concepts Informal Terms Formal Terms Table Relation

More information

CSE 132A Database Systems Principles

CSE 132A Database Systems Principles CSE 132A Database Systems Principles Prof. Alin Deutsch RELATIONAL DATA MODEL Some slides are based or modified from originals by Elmasri and Navathe, Fundamentals of Database Systems, 4th Edition 2004

More information

Databases Model the Real World. The Entity- Relationship Model. Conceptual Design. Steps in Database Design. ER Model Basics. ER Model Basics (Contd.

Databases Model the Real World. The Entity- Relationship Model. Conceptual Design. Steps in Database Design. ER Model Basics. ER Model Basics (Contd. The Entity- Relationship Model R &G - Chapter 2 A relationship, I think, is like a shark, you know? It has to constantly move forward or it dies. And I think what we got on our hands is a dead shark. Woody

More information

Locking Protocols for Materialized Aggregate Join Views

Locking Protocols for Materialized Aggregate Join Views Locking Protocols for Materialized Aggregate Join Views Gang Luo Jeffrey F. Naughton Curt J. Ellmann Michael W. Watzke University of Wisconsin-Madison NCR Advance Development Lab {gangluo, naughton}@cs.wisc.edu

More information

Where Are We? Next Few Lectures. Integrity Constraints Motivation. Constraints in E/R Diagrams. Keys in E/R Diagrams

Where Are We? Next Few Lectures. Integrity Constraints Motivation. Constraints in E/R Diagrams. Keys in E/R Diagrams Where Are We? Introduction to Data Management CSE 344 Lecture 15: Constraints We know quite a bit about using a DBMS Start with real-world problem, design ER diagram From ER diagram to relations -> conceptual

More information

CSC 742 Database Management Systems

CSC 742 Database Management Systems CSC 742 Database Management Systems Topic #5: Relational Model Spring 2002 CSC 742: DBMS by Dr. Peng Ning 1 Motivation A relation is a mathematical abstraction for a table The theory of relations provides

More information

From ER to Relational Model. Book Chapter 3 (part 2 )

From ER to Relational Model. Book Chapter 3 (part 2 ) From ER to Relational Model Book Chapter 3 (part 2 ) Logical DB Design: ER to Relational Translate Entity sets to tables: ssn name Employees lot CREATE TABLE Employees (ssn CHAR(11), name CHAR(20), lot

More information

Web Science & Technologies University of Koblenz Landau, Germany. Relational Data Model

Web Science & Technologies University of Koblenz Landau, Germany. Relational Data Model Web Science & Technologies University of Koblenz Landau, Germany Relational Data Model Overview Relational data model; Tuples and relations; Schemas and instances; Named vs. unnamed perspective; Relational

More information

CS 377 Database Systems

CS 377 Database Systems CS 377 Database Systems Relational Data Model Li Xiong Department of Mathematics and Computer Science Emory University 1 Outline Relational Model Concepts Relational Model Constraints Relational Database

More information

Objectives of logical design... Transforming the ERD diagram into relations. Relational database components. Mapping a composite attribute

Objectives of logical design... Transforming the ERD diagram into relations. Relational database components. Mapping a composite attribute Logical database design and the relational model Objectives of logical design... Translate the conceptual design into a logical database design that can be implemented on a chosen DBMS Input: conceptual

More information

Information Modeling and Relational Databases

Information Modeling and Relational Databases Information Modeling and Relational Databases Second Edition Terry Halpin Neumont University Tony Morgan Neumont University AMSTERDAM» BOSTON. HEIDELBERG LONDON NEW YORK OXFORD PARIS SAN DIEGO SAN FRANCISCO

More information

Relational Algebra and Calculus

Relational Algebra and Calculus and Calculus Marek Rychly mrychly@strathmore.edu Strathmore University, @ilabafrica & Brno University of Technology, Faculty of Information Technology Advanced Databases and Enterprise Systems 7 September

More information

CSE 544 Principles of Database Management Systems. Alvin Cheung Fall 2015 Lecture 8 - Data Warehousing and Column Stores

CSE 544 Principles of Database Management Systems. Alvin Cheung Fall 2015 Lecture 8 - Data Warehousing and Column Stores CSE 544 Principles of Database Management Systems Alvin Cheung Fall 2015 Lecture 8 - Data Warehousing and Column Stores Announcements Shumo office hours change See website for details HW2 due next Thurs

More information

Data Warehouse and Data Mining

Data Warehouse and Data Mining Data Warehouse and Data Mining Lecture No. 04-06 Data Warehouse Architecture Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology

More information

The Entity-Relationship Model

The Entity-Relationship Model The Entity-Relationship Model Chapter 2 Database Management Systems, R. Ramakrishnan and J. Gehrke 1 Overview of Database Design Conceptual design: (ER Model is used at this stage.) What are the entities

More information

Elena Baralis and Tania Cerquitelli 2013 Politecnico di Torino 1

Elena Baralis and Tania Cerquitelli 2013 Politecnico di Torino 1 Introduction The INSERT command The DELETE command The UPDATE command SQL language: basics D B M G 2 (1/3) Inserting tuples Deleting tuples Modifying tuples Introduction D B M G 4 (2/3) INSERT inserting

More information

Relational Model History. COSC 304 Introduction to Database Systems. Relational Model and Algebra. Relational Model Definitions.

Relational Model History. COSC 304 Introduction to Database Systems. Relational Model and Algebra. Relational Model Definitions. COSC 304 Introduction to Database Systems Relational Model and Algebra Dr. Ramon Lawrence University of British Columbia Okanagan ramon.lawrence@ubc.ca Relational Model History The relational model was

More information

CS275 Intro to Databases

CS275 Intro to Databases CS275 Intro to Databases The Relational Data Model Chap. 3 How Is Data Retrieved and Manipulated? Queries Data manipulation language (DML) Retrieval Add Delete Update An Example UNIVERSITY database Information

More information

Lyublena Antova, Christoph Koch, and Dan Olteanu Saarland University Database Group Saarbr ucken, Germany Presented By: Rana Daud

Lyublena Antova, Christoph Koch, and Dan Olteanu Saarland University Database Group Saarbr ucken, Germany Presented By: Rana Daud Lyublena Antova, Christoph Koch, and Dan Olteanu Saarland University Database Group Saarbr ucken, Germany 2007 1 Presented By: Rana Daud Introduction Application Scenarios I-SQL World-Set Algebra Algebraic

More information

Best Practices - Pentaho Data Modeling

Best Practices - Pentaho Data Modeling Best Practices - Pentaho Data Modeling This page intentionally left blank. Contents Overview... 1 Best Practices for Data Modeling and Data Storage... 1 Best Practices - Data Modeling... 1 Dimensional

More information

How Achaeans Would Construct Columns in Troy. Alekh Jindal, Felix Martin Schuhknecht, Jens Dittrich, Karen Khachatryan, Alexander Bunte

How Achaeans Would Construct Columns in Troy. Alekh Jindal, Felix Martin Schuhknecht, Jens Dittrich, Karen Khachatryan, Alexander Bunte How Achaeans Would Construct Columns in Troy Alekh Jindal, Felix Martin Schuhknecht, Jens Dittrich, Karen Khachatryan, Alexander Bunte Number of Visas Received 1 0,75 0,5 0,25 0 Alekh Jens Health Level

More information

Chapter 5. Relational Model Concepts 5/2/2008. Chapter Outline. Relational Database Constraints

Chapter 5. Relational Model Concepts 5/2/2008. Chapter Outline. Relational Database Constraints Chapter 5 The Relational Data Model and Relational Database Constraints Copyright 2004 Pearson Education, Inc. Chapter Outline Relational Model Concepts Relational Model Constraints and Relational Database

More information

Chapter 5. Relational Model Concepts 9/4/2012. Chapter Outline. The Relational Data Model and Relational Database Constraints

Chapter 5. Relational Model Concepts 9/4/2012. Chapter Outline. The Relational Data Model and Relational Database Constraints Chapter 5 The Relational Data Model and Relational Database Constraints Copyright 2004 Pearson Education, Inc. Chapter Outline Relational Model Constraints and Relational Database Schemas Update Operations

More information

2.2.2.Relational Database concept

2.2.2.Relational Database concept Foreign key:- is a field (or collection of fields) in one table that uniquely identifies a row of another table. In simpler words, the foreign key is defined in a second table, but it refers to the primary

More information

CSE 344 Final Examination

CSE 344 Final Examination CSE 344 Final Examination December 12, 2012, 8:30am - 10:20am Name: Question Points Score 1 30 2 20 3 30 4 20 Total: 100 This exam is open book and open notes but NO laptops or other portable devices.

More information

Two-level Data Staging ETL for Transaction Data

Two-level Data Staging ETL for Transaction Data Two-level Data Staging ETL for Transaction Data Xiufeng Liu University of Waterloo, Canada xiufeng.liu@uwaterloo.ca ABSTRACT In data warehousing, Extract-Transform-Load (ETL) extracts the data from data

More information

Modeling Your Data. Chapter 2. cs542 1

Modeling Your Data. Chapter 2. cs542 1 Modeling Your Data Chapter 2 cs542 1 Part II Discussion of the Model: Good Design/ Bad Design cs542 2 Design : The Obvious Use meaningful and descriptive s (it s for the human after all) Keep as simple

More information

D B M G 2. SQL language: basics. Update commands. Introduction The INSERT command The DELETE command The UPDATE command

D B M G 2. SQL language: basics. Update commands. Introduction The INSERT command The DELETE command The UPDATE command SQL language: basics DB M B G Introduction The INSERT command The DELETE command The UPDATE command D B M G 2 2013 Politecnico di Torino 1 Introduction DB M B G (1/3) Inserting tuples Deleting tuples Modifying

More information

Introduction to Data Management. Lecture #6 E-Rà Relational Mapping (Cont.)

Introduction to Data Management. Lecture #6 E-Rà Relational Mapping (Cont.) Introduction to Data Management Lecture #6 E-Rà Relational Mapping (Cont.) Instructor: Mike Carey mjcarey@ics.uci.edu Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1 It s time again for...

More information

CPS 216 Spring 2003 Homework #1 Assigned: Wednesday, January 22 Due: Monday, February 10

CPS 216 Spring 2003 Homework #1 Assigned: Wednesday, January 22 Due: Monday, February 10 CPS 216 Spring 2003 Homework #1 Assigned: Wednesday, January 22 Due: Monday, February 10 Note: This is a long homework. Start early! If you have already taken CPS 196.3 or an equivalent undergraduate course

More information

Chapter 5. The Relational Data Model and Relational Database Constraints. Slide 5-١. Copyright 2007 Ramez Elmasri and Shamkant B.

Chapter 5. The Relational Data Model and Relational Database Constraints. Slide 5-١. Copyright 2007 Ramez Elmasri and Shamkant B. Slide 5-١ Chapter 5 The Relational Data Model and Relational Database Constraints Chapter Outline Relational Model Concepts Relational Model Constraints and Relational Database Schemas Update Operations

More information

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 5-1

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 5-1 Slide 5-1 Chapter 5 The Relational Data Model and Relational Database Constraints Chapter Outline Relational Model Concepts Relational Model Constraints and Relational Database Schemas Update Operations

More information

COSC344 Database Theory and Applications. σ a= c (P) Lecture 3 The Relational Data. Model. π A, COSC344 Lecture 3 1

COSC344 Database Theory and Applications. σ a= c (P) Lecture 3 The Relational Data. Model. π A, COSC344 Lecture 3 1 COSC344 Database Theory and Applications σ a= c (P) S P Lecture 3 The Relational Data π A, C (H) Model COSC344 Lecture 3 1 Overview Last Lecture Database design ER modelling This Lecture Relational model

More information

The Relational Data Model and Relational Database Constraints

The Relational Data Model and Relational Database Constraints CHAPTER 5 The Relational Data Model and Relational Database Constraints Copyright 2017 Ramez Elmasri and Shamkant B. Navathe Slide 1-2 Chapter Outline Relational Model Concepts Relational Model Constraints

More information

Part I: Structured Data

Part I: Structured Data Inf1-DA 2011 2012 I: 24 / 117 Part I Structured Data Data Representation: I.1 The entity-relationship (ER) data model I.2 The relational model Data Manipulation: I.3 Relational algebra I.4 Tuple relational

More information

Data Warehousing. Jens Teubner, TU Dortmund Summer Jens Teubner Data Warehousing Summer

Data Warehousing. Jens Teubner, TU Dortmund Summer Jens Teubner Data Warehousing Summer Jens Teubner Data Warehousing Summer 2018 1 Data Warehousing Jens Teubner, TU Dortmund jens.teubner@cs.tu-dortmund.de Summer 2018 Jens Teubner Data Warehousing Summer 2018 160 Part VI ETL Process ETL Overview

More information

Conceptual Design. The Entity-Relationship (ER) Model

Conceptual Design. The Entity-Relationship (ER) Model Conceptual Design. The Entity-Relationship (ER) Model CS430/630 Lecture 12 Slides based on Database Management Systems 3 rd ed, Ramakrishnan and Gehrke Database Design Overview Conceptual design The Entity-Relationship

More information

01/01/2017. Chapter 5: The Relational Data Model and Relational Database Constraints: Outline. Chapter 5: Relational Database Constraints

01/01/2017. Chapter 5: The Relational Data Model and Relational Database Constraints: Outline. Chapter 5: Relational Database Constraints Chapter 5: The Relational Data Model and Relational Database Constraints: Outline Ramez Elmasri, Shamkant B. Navathe(2017) Fundamentals of Database Systems (7th Edition),pearson, isbn 10: 0-13-397077-9;isbn-13:978-0-13-397077-7.

More information

SQL. SQL DDL Statements

SQL. SQL DDL Statements SQL Structured Query Language Declarative Specify the properties that should hold in the result, not how to obtain the result Complex queries have procedural elements International Standard SQL1 (1986)

More information

Data warehouse design

Data warehouse design Database and data mining group, Data warehouse design DATA WAREHOUSE: DESIGN - Risk factors Database and data mining group, High user expectation the data warehouse is the solution of the company s problems

More information

ITCS 3160 DATA BASE DESIGN AND IMPLEMENTATION

ITCS 3160 DATA BASE DESIGN AND IMPLEMENTATION ITCS 3160 DATA BASE DESIGN AND IMPLEMENTATION JING YANG 2010 FALL Class 3: The Relational Data Model and Relational Database Constraints Outline 2 The Relational Data Model and Relational Database Constraints

More information

Lecture Query evaluation. Combining operators. Logical query optimization. By Marina Barsky Winter 2016, University of Toronto

Lecture Query evaluation. Combining operators. Logical query optimization. By Marina Barsky Winter 2016, University of Toronto Lecture 02.03. Query evaluation Combining operators. Logical query optimization By Marina Barsky Winter 2016, University of Toronto Quick recap: Relational Algebra Operators Core operators: Selection σ

More information

6 February 2014 CSE-3421M Test #1 w/ answers p. 1 of 14. CSE-3421M Test #1. Design

6 February 2014 CSE-3421M Test #1 w/ answers p. 1 of 14. CSE-3421M Test #1. Design 6 February 2014 CSE-3421M Test #1 w/ answers p. 1 of 14 CSE-3421M Test #1 Design Sur / Last Name: Given / First Name: Student ID: Instructor: Parke Godfrey Exam Duration: 75 minutes Term: Winter 2014 Answer

More information

Overview of db design Requirement analysis Data to be stored Applications to be built Operations (most frequent) subject to performance requirement

Overview of db design Requirement analysis Data to be stored Applications to be built Operations (most frequent) subject to performance requirement ITCS 3160 Data Base Design and Implementation Jing Yang 2010 Fall Class 12: Data Modeling Using the Entity-Relationship (ER) Model Overview of db design Requirement analysis Data to be stored Applications

More information

Data Modelling and Databases. Exercise Session 7: Integrity Constraints

Data Modelling and Databases. Exercise Session 7: Integrity Constraints Data Modelling and Databases Exercise Session 7: Integrity Constraints 1 Database Design Textual Description Complete Design ER Diagram Relational Schema Conceptual Modeling Logical Modeling Physical Modeling

More information

Final exam CSCC43 S14

Final exam CSCC43 S14 Final exam CSCC43 S14 You have 2h 50m (170 minutes) to complete this exam. Skip any question you do not know how to answer, or you will run out of time. Assume bag semantics for any problem where the distinction

More information

Reverse Query Processing

Reverse Query Processing Reverse Query Processing Carsten Binnig Donald Kossmann Eric Lo ETH Zurich, Switzerland Email: {firstname.lastname}@inf.ethz.ch University of Heidelberg, Germany Email: carsten.binnig@informatik.uni-heidelberg.de

More information

The Relational Model

The Relational Model The Relational Model What is the Relational Model Relations Domain Constraints SQL Integrity Constraints Translating an ER diagram to the Relational Model and SQL Views A relational database consists

More information

A database consists of several tables (relations) AccountNum

A database consists of several tables (relations) AccountNum rela%onal model Relational Model A database consists of several tables (relations) Customer Account Depositor CustID Name Street City State AccountNum Balance CustID AccountNum Columns in the tables are

More information

Jignesh M. Patel. Blog:

Jignesh M. Patel. Blog: Jignesh M. Patel Blog: http://bigfastdata.blogspot.com Go back to the design Query Cache from Processing for Conscious 98s Modern (at Algorithms Hardware least for Hash Joins) 995 24 2 Processor Processor

More information

Outline. 1 CS520-5) Data Exchange

Outline. 1 CS520-5) Data Exchange Outline 0) Course Info 1) Introduction 2) Data Preparation and Cleaning 3) Schema matching and mapping 4) Virtual Data Integration 5) Data Exchange 6) Data Warehousing 7) Big Data Analytics 8) Data Provenance

More information

elements of a relation are called tuples. Every tuple represents an entity or a relationship. Country name code population area...

elements of a relation are called tuples. Every tuple represents an entity or a relationship. Country name code population area... 2.2.1 Relations A (database) state associates each relation schema to a relation. elements of a relation are called tuples. Every tuple represents an entity or a relationship. (: Asia, area: 4.5E7) relations

More information

2.2 Relational Model (RM)

2.2 Relational Model (RM) 2.2 Relational Model (RM) Relational Model by Codd (1970): mathematical foundation: set theory, only a single structural concept Relation, entity/object types and relationship types are uniformly modeled

More information