V. Database Design CS448/ How to obtain a good relational database schema

Size: px
Start display at page:

Download "V. Database Design CS448/ How to obtain a good relational database schema"

Transcription

1 V. How to obtain a good relational database schema Deriving new relational schema from ER-diagrams Normal forms: use of constraints in evaluating existing relational schema CS448/648 1

2 Translating an E-R Model to a Relational Schema General approach is straightforward Each entity set maps to a new table Each attribute maps to a new table column Each relationship set maps to either new table columns or to a new table CS448/648 2

3 Introduction to Database Management Representing Strong Entity Sets Entity set with attributes table with attributes Entity of type row in table Primary key of entity set primary key of table Ex. Student Major StudentNum StudentName Student StudentNum StudentName Major CS448/648 3

4 Introduction to Database Management Representing Weak Entity Sets Weak entity set table Columns of table should include Attributes of the weak entity set Attributes of the identifying relationship set Primary key attributes of entity set for dominating entities Primary key of weak entity set primary key of table CS448/648 4

5 Representing Weak Entity Sets (cont d) Ex. Balance Account 1 AccNum Log N Transaction TransNum Date Amount Account AccNum Balance Transaction TransNum AccNum Date Amount CS448/648 5

6 Introduction to Database Management Representing Relationship Sets If the relationship set is an identifying relationship set for a weak entity set then no action needed If we can deduce the general cardinality constraint (1,1) for a component entity set then add following columns to table Attributes of the relationship set Primary key attributes of remaining component entity sets Otherwise: relationship set table CS448/648 6

7 Representing Relationship Sets (cont d) Columns of table should include Attributes of the relationship set Primary key attributes of each component entity set Primary key of table determined as follows If we can deduce the general cardinality constraint (0,1) for a component entity set, then choose the primary key attributes for Otherwise, choose primary key attributes of each component entity CS448/648 7

8 Representing Relationship Sets (cont d) Ex. Score Address Team HomeTeam Visitor Match Location TeamName LocName Team TeamName Location LocName Address Match HomeTeamName VisitorTeamName LocName Score Note that the role name of a component entity set should be prepended to its primary key attributes, if supplied CS448/648 8

9 Example Translation Course CourseNum CourseName Student StudentNum StudentName Section CourseNum SectionNum ProfNum EnrolledIn CourseNum SectionNum StudentNum Mark Professor ProfNum ProfName CS448/648 9

10 Representing Aggregation Tabular representation for aggregation of relationship set tabular representation for relationship set To represent relationship set involving aggregation of, treat the aggregation like an entity set whose primary key primary key of the table for CS448/648 10

11 Representing Aggregation (cont d) Ex. CourseNum Student N EnrolledIn N Course 1 StudentNum CourseAccount ExpirationDate 1 Account UserId Student Course Account StudentNum CourseNum UserId EnrolledIn StudentNum CourseNum CourseAccount UserId StudentNum CourseNum ExpirationDate CS448/648 11

12 Representing Specialization Create table for higher-level entity set, and treat specialized entity subsets like weak entity sets Ex. StudentNumber Student StudentName Graduate N (1, 1) SupervisedBy 1 (0, N) Professor Degrees ProfessorName Student StudentNumber StudentName Professor ProfessorName Graduate StudentNumber ProfessorName Degree StudentNumber Degree CS448/648 12

13 Representing Generalization Create a table for each lower-level entity set only Columns of new tables should include Attributes of lower level entity set Attributes of the superset The higher-level entity set can be defined as a view on the tables for the lower-level entity sets CS448/648 13

14 Representing Generalization (cont d) Ex. MakeAndModel LicenceNum Vehicle Price Tonnage Truck Car MaxSpeed AxelCount PassengerCount Truck LicenceNum MakeAndModel Price Tonnage AxelCount Car LicenceNum MakeAndModel Price MaxSpeed PassengerCount CS448/648 14

15 Normal Forms What is a good relational database schema? How can we evaluate an existing relational schema? Goals: Intuitive and straightforward changes Nonredundant storage of data We review: Boyce-Codd Normal Form (BCNF) Third Normal Form (3NF) CS448/648 15

16 Change Anomalies Assume we are given the E-R diagram Sno Sname Ino Supplied_Items City Iname Price CS448/648 16

17 Change Anomalies (cont d) This maps to Supplied Items Sno Sname City Ino Iname Price S1 Magna Ajax I1 Bolt 0.50 S1 Magna Ajax I2 Nut 0.25 S1 Magna Ajax I3 Screw 0.30 S2 Budd Hull I3 Screw 0.40 Problems 1. Update problems (e.g. changing name of supplier) 2. Insert problems (e.g. add a new item) 3. Delete problems (e.g. Budd no longer supplies screws) 4. Likely increase in space requirements CS448/648 17

18 Change Anomalies (cont d) Now compare to Supplier Sno Sname City S1 Magna Ajax S2 Budd Hull Item Ino I1 I2 I3 Iname Bolt Nut Screw Supplies Sno Ino Price S1 I S1 I S1 I S2 I CS448/648 18

19 Change Anomalies (cont d) But other extreme is also undesirable (information about relationships is lost) Snos Sno S1 S2 Inums Inum I1 I2 I3 Snames Sname Magna Budd Inames Iname Bolt Nut Screw Cities City Ajax Hull Prices Price CS448/648 19

20 Good What is a good relational database schema? Rule of thumb: Independent facts in separate tables or: Each relation schema should consist of a primary key and a set of mutually independent attributes CS448/648 20

21 Functional Dependencies Generalizes notion of superkey Used to characterize BCNF and 3NF Some notation: Allow projection operation on tuples Ex. when is the first tuple in Supplier, [Sno] = (S1) [Sname, City] = (Magna, Ajax) CS448/648 21

22 Examples of Functional Dependencies Consider EmpProj SIN PNum Hours EName PName PLoc Allowance Key constraint forbids two different rows and in EmpProj with [SIN, PNum] = [SIN, PNum] Also disallowed one SIN with two employee names one project number with two project names or two locations different allowances for the same number of hours at the same location Notation SIN EName PNum PName, Ploc PLoc, Hours Allowance CS448/648 22

23 Introduction to Database Management Formal Definitions Let be a relation schema, and. The functional dependency holds on if no legal instance of contains two tuples and with and functionally determines, is functionally dependent on Keys again: is a superkey for relation schema if dependency holds on Functional dependencies are constraints on all instances of a schema; they are design decisions based on the semantics of the attributes A relation can confirm that a functional dependency does not hold; the relation cannot confirm that a functional dependency must always hold. CS448/648 23

24 Formal Definitions (cont d) Ex. TEACH Teacher Course Text Smith Data Structures Bartram Smith Data Management Al-Nour Hall Compilers Hoffman Brown Data Structures Augenthaler Let denote a set of functional dependencies over. The closure of, denoted by is the set of all functional dependencies that are satisfied by every relation instance that satisfies (= logical implications of ) Note: CS448/648 24

25 Introduction to Database Management Reasoning About Functional Dependencies Need to be able to find superkeys decide membership in find minimal covers There are different sets of functional dependencies that have the same logical implications. Small simples sets are desirable, and are called minimal covers. CS448/648 25

26 Introduction to Database Management Reasoning About Functional Dependencies (cont d) Logical implications can be derived by using inference rules called Armstrong s axioms (reflexivity) (augmentation) (transitivity), Additional rules can be derived (union), (decomposition) The axioms are sound (anything derived from is in ) complete (anything in can be derived) CS448/648 26

27 Reasoning (cont d) Ex. Let consist of SIN, PNum Hours SIN EName PNum PName, PLoc PLoc, Hours Allowance A derivation of: SIN, PNum Allowance 1. SIN, PNum Hours ( ) 2. PNum PName, PLoc ( ) 3. PLoc, Hours Allowance ( ) 4. SIN, PNum PNum (reflexivity) 5. SIN, PNum PName, PLoc (transitivity, 4 and 2) 6. SIN, PNum PLoc (decomposition, 5) 7. SIN, PNum PLoc, Hours (union, 6, 1) 8. SIN, PNum Allowance (transitivity, 7 and 3) CS448/648 27

28 Introduction to Database Management Reasoning (cont d) There is a more efficient way of using Armstrong s axioms function begin := ; while true do if there exists (1) (2) then := else exit; return ; end, and such that Let be a relational schema and a set of functional dependencies on. Then Theorem: is a superkey of if and only if Theorem: if and only if CS448/648 28

29 Introduction to Database Management Finding Minimal Covers A minimal cover for can be computed in four steps. Note that each step must be repeated until it no longer succeeds in updating. Step 1. Replace with the pair and. Step 2. Remove. from if Step 3. Remove in from the left-hand-side of. in if is Step 4. Replace and in by. CS448/648 29

30 Introduction to Database Management Boyce-Codd Normal Form (BCNF) Formalization of the goal that independent relationships are stored in separate tables Let be a relation schema and a set of functional dependencies. A functional dependency is trivial if. Schema is in BCNF if and only if whenever, then either and is trivial, or is a superkey of A database schema is in BCNF is in BCNF if each relation schema CS448/648 30

31 BCNF (cont d) Why does BCNF avoid redundancy? For schema Supplied Items we had FD: Sno Sname, City Implies: supplier name Magna and city Ajax must be repeated for each item supplied by supplier S1. Assume FD holds over a schema that is in BCNF. This implies Sno is a superkey for each Sno value appears on one row only no need to repeat Sname and City values (Cf. the original Supplier table) CS448/648 31

32 Introduction to Database Management Computing a Normal Form What to do if a given relational schema is not in BCNF? Strategy: identify undesirable dependencies, decompose schema Let be a relation schema (= set of attributes). Collection of relation schemas is a decomposition of if A good decomposition does not lose information complicate checking of constraints CS448/648 32

33 Lossless-Join Decompositions We should be able to construct the original table from its decomposition Ex. Consider replacing Marks Student Assignment Group Mark Ann A1 G1 80 Ann A2 G3 60 Bob A1 G2 60 by decomposing (i.e. projecting) into two tables SGM Student Group Mark Ann G1 80 Ann G3 60 Bob G2 60 AM Assignment Mark A1 80 A2 60 A1 60 CS448/648 33

34 Lossless-Join Decompositions (cont d) But computing the natural join of SGM and AM produces Student Assignment Group Mark Ann A1 G1 80 Ann A2 G3 60 Ann A1 G3 60! Bob A2 G2 60! Bob A1 G2 60 We get extra data, spurious tuples, and would therefore lose information if we were to replace Marks by SGM and AM. If converse is true, if re-joining SGM and AM would always produce exactly the tuples in Marks, then we call SGM and AM a losslessjoin decomposition. CS448/648 34

35 Introduction to Database Management Identifying Lossless-Join Decompositions A decomposition of is lossless if and only if the common attributes of and form a superkey for either schema, that is or Ex. In the previous example we had = Student, Assignment, Group, Mark, = (Student, Assignment Group, Mark), = Student, Group, Mark, = Assignment, Mark Decomposition is lossy because a superkey of either SGM or AM is not CS448/648 35

36 Dependency Preservation Goal: efficient testing of constraints on the decomposed schema Ex. A table for a company database could be R Proj Dept Div with functional dependencies FD1: Proj FD2: Dept FD3: Proj Dept, Div, and Div Consider two decompositions = R1[Proj, Dept], R2[Dept, Div] = R1[Proj, Dept], R3[Proj, Div] Both are lossless. (Why?) CS448/648 36

37 Introduction to Database Management Dependency Preservation (cont d) Decomposition lets us test FD1 on table R1 and FD2 on table R2; if they are both satisfied, FD3 is automatically satisfied In decomposition we can test FD1 on table R1 and FD3 on table R3. Dependency FD2 is an interrelational constraint: testing it requires joining tables R1 and R3 Let be a relation schema and a set of functional dependencies on. A decomposition of is dependency preserving if there is an equivalent set of functional dependencies, none of which is interrelational in CS448/648 37

38 Introduction to Database Management Computing a Lossless-Join BCNF Decomposition function begin Result := ; while some Result and violate the BCNF condition do begin Replace by ; Add to Result; end; return Result; end No efficient procedure to do this exists. It is possible that no dependency preserving BCNF decomposition exists. Ex. Consider A, B, C and AB C, C B. CS448/648 38

39 Introduction to Database Management Third Normal Form (3NF) Let be a relation schema and a set of functional dependencies. Schema is in 3NF if and only if whenever, then either is trivial, or is a superkey of, or each attribute of A database schema is in 3NF is contained in a candidate key of and is in 3NF if each relation schema Because 3NF is looser than BCNF, it allows more redundancy A lossless-join, dependency preserving decomposition into 3NF relation schemas always exists. CS448/648 39

40 Introduction to Database Management Computing a 3NF Decomposition A lossless-join 3NF decomposition that is dependency preserving can be efficiently computed function begin Result := ; := a minimal cover for ; for each do Result := Result ; if there is no Result such that contains a candidate key for then begin compute a candidate key for ; Result := Result ; end; return Result; end CS448/648 40

Schema Refinement: Dependencies and Normal Forms

Schema Refinement: Dependencies and Normal Forms Schema Refinement: Dependencies and Normal Forms Grant Weddell David R. Cheriton School of Computer Science University of Waterloo CS 348 Introduction to Database Management Spring 2012 CS 348 (Intro to

More information

Schema Refinement: Dependencies and Normal Forms

Schema Refinement: Dependencies and Normal Forms Schema Refinement: Dependencies and Normal Forms Grant Weddell Cheriton School of Computer Science University of Waterloo CS 348 Introduction to Database Management Spring 2016 CS 348 (Intro to DB Mgmt)

More information

Schema Refinement: Dependencies and Normal Forms

Schema Refinement: Dependencies and Normal Forms Schema Refinement: Dependencies and Normal Forms M. Tamer Özsu David R. Cheriton School of Computer Science University of Waterloo CS 348 Introduction to Database Management Fall 2012 CS 348 Schema Refinement

More information

Translating Entity-Relationship to Relational Tables

Translating Entity-Relationship to Relational Tables Translating Entity-Relationship to Relational Tables Spring 2018 School of Computer Science University of Waterloo Databases CS348 (University of Waterloo) ER to Relational 1 / 39 E-R Diagram to Relational

More information

Translating Entity-Relationship to Relational Tables

Translating Entity-Relationship to Relational Tables Translating Entity-Relationship to Relational Tables Fall 2017 School of Computer Science University of Waterloo Databases CS348 (University of Waterloo) ER to Relational 1 / 39 Outline 1 ER to Relational

More information

CS 338 Functional Dependencies

CS 338 Functional Dependencies CS 338 Functional Dependencies Bojana Bislimovska Winter 2016 Outline Design Guidelines for Relation Schemas Functional Dependency Set and Attribute Closure Schema Decomposition Boyce-Codd Normal Form

More information

E-R Diagram to Relational Schema. Translating Entity-Relationship to Relational Tables. Representing Weak Entity Sets. Representing Strong Entity Sets

E-R Diagram to Relational Schema. Translating Entity-Relationship to Relational Tables. Representing Weak Entity Sets. Representing Strong Entity Sets E-R Diagram to Relational Schema Translating Entity-Relationship to Relational Tables Fall 2015 David Toman School of Computer Science University of Waterloo Main ideas: Each entity set maps to a new table

More information

Data Modeling and the Entity-Relationship Model

Data Modeling and the Entity-Relationship Model Data Modeling and the Entity-Relationship Model David Toman School of Computer Science University of Waterloo Introduction to Databases CS348 David Toman (University of Waterloo) ER Model 1 / 29 Overview

More information

Data Modeling and the Entity-Relationship Model

Data Modeling and the Entity-Relationship Model Data Modeling and the Entity-Relationship Model Grant Weddell Cheriton School of Computer Science University of Waterloo CS 348 Introduction to Database Management Winter 2017 CS 348 (Intro to DB Mgmt)

More information

Notes. These slides are based on a slide set provided by Prof. M. Tamer Öszu. CS 640 E-R Model Winter / 23. Notes

Notes. These slides are based on a slide set provided by Prof. M. Tamer Öszu. CS 640 E-R Model Winter / 23. Notes The Entity-Relationship Model Olaf Hartig David R. Cheriton School of Computer Science University of Waterloo CS 640 Principles of Database Management and Use Winter 203 These slides are based on a slide

More information

Normalization. Murali Mani. What and Why Normalization? To remove potential redundancy in design

Normalization. Murali Mani. What and Why Normalization? To remove potential redundancy in design 1 Normalization What and Why Normalization? To remove potential redundancy in design Redundancy causes several anomalies: insert, delete and update Normalization uses concept of dependencies Functional

More information

Chapter 10. Chapter Outline. Chapter Outline. Functional Dependencies and Normalization for Relational Databases

Chapter 10. Chapter Outline. Chapter Outline. Functional Dependencies and Normalization for Relational Databases Chapter 10 Functional Dependencies and Normalization for Relational Databases Chapter Outline 1 Informal Design Guidelines for Relational Databases 1.1Semantics of the Relation Attributes 1.2 Redundant

More information

Chapter 10. Normalization. Chapter Outline. Chapter Outline(contd.)

Chapter 10. Normalization. Chapter Outline. Chapter Outline(contd.) Chapter 10 Normalization Chapter Outline 1 Informal Design Guidelines for Relational Databases 1.1Semantics of the Relation Attributes 1.2 Redundant Information in Tuples and Update Anomalies 1.3 Null

More information

Relational Database design. Slides By: Shree Jaswal

Relational Database design. Slides By: Shree Jaswal Relational Database design Slides By: Shree Jaswal Topics: Design guidelines for relational schema, Functional Dependencies, Definition of Normal Forms- 1NF, 2NF, 3NF, BCNF, Converting Relational Schema

More information

Functional Dependencies and Normalization for Relational Databases Design & Analysis of Database Systems

Functional Dependencies and Normalization for Relational Databases Design & Analysis of Database Systems Functional Dependencies and Normalization for Relational Databases 406.426 Design & Analysis of Database Systems Jonghun Park jonghun@snu.ac.kr Dept. of Industrial Engineering Seoul National University

More information

Elmasri/Navathe, Fundamentals of Database Systems, Fourth Edition Chapter 10-2

Elmasri/Navathe, Fundamentals of Database Systems, Fourth Edition Chapter 10-2 Elmasri/Navathe, Fundamentals of Database Systems, Fourth Edition Chapter 10-2 Chapter Outline 1 Informal Design Guidelines for Relational Databases 1.1Semantics of the Relation Attributes 1.2 Redundant

More information

Informal Design Guidelines for Relational Databases

Informal Design Guidelines for Relational Databases Outline Informal Design Guidelines for Relational Databases Semantics of the Relation Attributes Redundant Information in Tuples and Update Anomalies Null Values in Tuples Spurious Tuples Functional Dependencies

More information

UNIT 3 DATABASE DESIGN

UNIT 3 DATABASE DESIGN UNIT 3 DATABASE DESIGN Objective To study design guidelines for relational databases. To know about Functional dependencies. To have an understanding on First, Second, Third Normal forms To study about

More information

Chapter 14. Database Design Theory: Introduction to Normalization Using Functional and Multivalued Dependencies

Chapter 14. Database Design Theory: Introduction to Normalization Using Functional and Multivalued Dependencies Chapter 14 Database Design Theory: Introduction to Normalization Using Functional and Multivalued Dependencies Copyright 2012 Ramez Elmasri and Shamkant B. Navathe Chapter Outline 1 Informal Design Guidelines

More information

Functional Dependencies and. Databases. 1 Informal Design Guidelines for Relational Databases. 4 General Normal Form Definitions (For Multiple Keys)

Functional Dependencies and. Databases. 1 Informal Design Guidelines for Relational Databases. 4 General Normal Form Definitions (For Multiple Keys) 1 / 13 1 Informal Design Guidelines for Relational Databases 1.1Semantics of the Relation Attributes 1.2 Redundant d Information in Tuples and Update Anomalies 1.3 Null Values in Tuples 1.4 Spurious Tuples

More information

COSC Dr. Ramon Lawrence. Emp Relation

COSC Dr. Ramon Lawrence. Emp Relation COSC 304 Introduction to Database Systems Normalization Dr. Ramon Lawrence University of British Columbia Okanagan ramon.lawrence@ubc.ca Normalization Normalization is a technique for producing relations

More information

Relational Design: Characteristics of Well-designed DB

Relational Design: Characteristics of Well-designed DB 1. Minimal duplication Relational Design: Characteristics of Well-designed DB Consider table newfaculty (Result of F aculty T each Course) Id Lname Off Bldg Phone Salary Numb Dept Lvl MaxSz 20000 Cotts

More information

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe CHAPTER 14 Basics of Functional Dependencies and Normalization for Relational Databases Slide 14-2 Chapter Outline 1 Informal Design Guidelines for Relational Databases 1.1 Semantics of the Relation Attributes

More information

Overview of E-R model. The Entity-Relationship (E-R) model. used for database (conceptual schema) design

Overview of E-R model. The Entity-Relationship (E-R) model. used for database (conceptual schema) design The Entity-Relationship (E-R) model Overview of E-R model used for database (conceptual schema) design Lecture topics: basic E-R modelling extensions to E-R modelling designing an E-R schema deriving relational

More information

This lecture. Databases -Normalization I. Repeating Data. Redundancy. This lecture introduces normal forms, decomposition and normalization.

This lecture. Databases -Normalization I. Repeating Data. Redundancy. This lecture introduces normal forms, decomposition and normalization. This lecture Databases -Normalization I This lecture introduces normal forms, decomposition and normalization (GF Royle 2006-8, N Spadaccini 2008) Databases - Normalization I 1 / 23 (GF Royle 2006-8, N

More information

The strategy for achieving a good design is to decompose a badly designed relation appropriately.

The strategy for achieving a good design is to decompose a badly designed relation appropriately. The strategy for achieving a good design is to decompose a badly designed relation appropriately. Functional Dependencies The single most important concept in relational schema design theory is that of

More information

Database Design Theory and Normalization. CS 377: Database Systems

Database Design Theory and Normalization. CS 377: Database Systems Database Design Theory and Normalization CS 377: Database Systems Recap: What Has Been Covered Lectures 1-2: Database Overview & Concepts Lecture 4: Representational Model (Relational Model) & Mapping

More information

Functional Dependencies & Normalization for Relational DBs. Truong Tuan Anh CSE-HCMUT

Functional Dependencies & Normalization for Relational DBs. Truong Tuan Anh CSE-HCMUT Functional Dependencies & Normalization for Relational DBs Truong Tuan Anh CSE-HCMUT 1 2 Contents 1 Introduction 2 Functional dependencies (FDs) 3 Normalization 4 Relational database schema design algorithms

More information

Chapter 7: Relational Database Design

Chapter 7: Relational Database Design Chapter 7: Relational Database Design Database System Concepts, 5th Ed. See www.db-book.com for conditions on re-use Chapter 7: Relational Database Design Features of Good Relational Design Atomic Domains

More information

Chapter 8: Relational Database Design

Chapter 8: Relational Database Design Chapter 8: Relational Database Design Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Chapter 8: Relational Database Design Features of Good Relational Design Atomic Domains

More information

Databases -Normalization I. (GF Royle, N Spadaccini ) Databases - Normalization I 1 / 24

Databases -Normalization I. (GF Royle, N Spadaccini ) Databases - Normalization I 1 / 24 Databases -Normalization I (GF Royle, N Spadaccini 2006-2010) Databases - Normalization I 1 / 24 This lecture This lecture introduces normal forms, decomposition and normalization. We will explore problems

More information

Unit 3 : Relational Database Design

Unit 3 : Relational Database Design Unit 3 : Relational Database Design Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Content Relational Model: Basic concepts, Attributes and Domains, CODD's Rules, Relational

More information

Chapter 14 Outline. Normalization for Relational Databases: Outline. Chapter 14: Basics of Functional Dependencies and

Chapter 14 Outline. Normalization for Relational Databases: Outline. Chapter 14: Basics of Functional Dependencies and Ramez Elmasri, Shamkant B. Navathe(2016) Fundamentals of Database Systems (7th Edition), pearson, isbn 10: 0-13-397077-9;isbn-13:978-0-13-397077-7. Chapter 14: Basics of Functional Dependencies and Normalization

More information

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Winter 2009 Lecture 4 - Schema Normalization

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Winter 2009 Lecture 4 - Schema Normalization CSE 544 Principles of Database Management Systems Magdalena Balazinska Winter 2009 Lecture 4 - Schema Normalization References R&G Book. Chapter 19: Schema refinement and normal forms Also relevant to

More information

CSE 562 Database Systems

CSE 562 Database Systems Goal CSE 562 Database Systems Question: The relational model is great, but how do I go about designing my database schema? Database Design Some slides are based or modified from originals by Magdalena

More information

CS 2451 Database Systems: Database and Schema Design

CS 2451 Database Systems: Database and Schema Design CS 2451 Database Systems: Database and Schema Design http://www.seas.gwu.edu/~bhagiweb/cs2541 Spring 2018 Instructor: Dr. Bhagi Narahari Relational Model: Definitions Review Relations/tables, Attributes/Columns,

More information

CS411 Database Systems. 05: Relational Schema Design Ch , except and

CS411 Database Systems. 05: Relational Schema Design Ch , except and CS411 Database Systems 05: Relational Schema Design Ch. 3.1-3.5, except 3.4.2-3.4.3 and 3.5.3. 1 How does this fit in? ER Diagrams: Data Definition Translation to Relational Schema: Data Definition Relational

More information

Mapping ER Diagrams to. Relations (Cont d) Mapping ER Diagrams to. Exercise. Relations. Mapping ER Diagrams to Relations (Cont d) Exercise

Mapping ER Diagrams to. Relations (Cont d) Mapping ER Diagrams to. Exercise. Relations. Mapping ER Diagrams to Relations (Cont d) Exercise CSC 74 Database Management Systems Topic #6: Database Design Weak Entity Type E Create a relation R Include all simple attributes and simple components of composite attributes. Include the primary key

More information

Database Design Principles

Database Design Principles Database Design Principles CPS352: Database Systems Simon Miner Gordon College Last Revised: 2/11/15 Agenda Check-in Design Project ERD Presentations Database Design Principles Decomposition Functional

More information

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Fall 2009 Lecture 3 - Schema Normalization

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Fall 2009 Lecture 3 - Schema Normalization CSE 544 Principles of Database Management Systems Magdalena Balazinska Fall 2009 Lecture 3 - Schema Normalization References R&G Book. Chapter 19: Schema refinement and normal forms Also relevant to this

More information

Example Examination. Allocated Time: 100 minutes Maximum Points: 250

Example Examination. Allocated Time: 100 minutes Maximum Points: 250 CS542 EXAMPLE EXAM Elke A. Rundensteiner Example Examination Allocated Time: 100 minutes Maximum Points: 250 STUDENT NAME: General Instructions: This test is a closed book exam (besides one cheat sheet).

More information

Normalisation. Normalisation. Normalisation

Normalisation. Normalisation. Normalisation Normalisation Normalisation Main objective in developing a logical data model for relational database systems is to create an accurate and efficient representation of the data, its relationships, and constraints

More information

Chapter 6: Relational Database Design

Chapter 6: Relational Database Design Chapter 6: Relational Database Design Chapter 6: Relational Database Design Features of Good Relational Design Atomic Domains and First Normal Form Decomposition Using Functional Dependencies Second Normal

More information

Functional Dependencies and Finding a Minimal Cover

Functional Dependencies and Finding a Minimal Cover Functional Dependencies and Finding a Minimal Cover Robert Soulé 1 Normalization An anomaly occurs in a database when you can update, insert, or delete data, and get undesired side-effects. These side

More information

Relational Database Design Theory. Introduction to Databases CompSci 316 Fall 2017

Relational Database Design Theory. Introduction to Databases CompSci 316 Fall 2017 Relational Database Design Theory Introduction to Databases CompSci 316 Fall 2017 2 Announcements (Thu. Sep. 14) Homework #1 due next Tuesday (11:59pm) Course project description posted Read it! Mixer

More information

Schema Refinement and Normal Forms

Schema Refinement and Normal Forms Schema Refinement and Normal Forms Chapter 19 Quiz #2 Next Wednesday Comp 521 Files and Databases Fall 2010 1 The Evils of Redundancy Redundancy is at the root of several problems associated with relational

More information

MODULE: 3 FUNCTIONAL DEPENDENCIES

MODULE: 3 FUNCTIONAL DEPENDENCIES MODULE: 3 (13 hours) Database design: functional dependencies - Inference Rules for Functional Dependencies - Closure -- Minimal Cover -Normal forms First-second and third normal forms Boyce- Codd normal

More information

Database design III. Quiz time! Using FDs to detect anomalies. Decomposition. Decomposition. Boyce-Codd Normal Form 11/4/16

Database design III. Quiz time! Using FDs to detect anomalies. Decomposition. Decomposition. Boyce-Codd Normal Form 11/4/16 Lecture 3 Quiz time! Database design III Functional dependencies cont. BCNF and 3NF What s wrong with this schema? {(, 2, Databases, Steven Van Acker ), (, 4, Databases, Rogardt Heldal )} Redundancy! Using

More information

Relational Database Design (II)

Relational Database Design (II) Relational Database Design (II) 1 Roadmap of This Lecture Algorithms for Functional Dependencies (cont d) Decomposition Using Multi-valued Dependencies More Normal Form Database-Design Process Modeling

More information

TDDD12 Databasteknik Föreläsning 4: Normalisering

TDDD12 Databasteknik Föreläsning 4: Normalisering What is Good Design TDDD12 Databasteknik Föreläsning 4: Normalisering Can we be sure that the translation from the EER diagram to relational tables results in a good database design? Or: Confronted with

More information

customer = (customer_id, _ customer_name, customer_street,

customer = (customer_id, _ customer_name, customer_street, Relational Database Design COMPILED BY: RITURAJ JAIN The Banking Schema branch = (branch_name, branch_city, assets) customer = (customer_id, _ customer_name, customer_street, customer_city) account = (account_number,

More information

We shall represent a relation as a table with columns and rows. Each column of the table has a name, or attribute. Each row is called a tuple.

We shall represent a relation as a table with columns and rows. Each column of the table has a name, or attribute. Each row is called a tuple. Logical Database design Earlier we saw how to convert an unorganized text description of information requirements into a conceptual design, by the use of ER diagrams. The advantage of ER diagrams is that

More information

CSCI 403: Databases 13 - Functional Dependencies and Normalization

CSCI 403: Databases 13 - Functional Dependencies and Normalization CSCI 403: Databases 13 - Functional Dependencies and Normalization Introduction The point of this lecture material is to discuss some objective measures of the goodness of a database schema. The method

More information

Functional dependency theory

Functional dependency theory Functional dependency theory Introduction to Database Design 2012, Lecture 8 Course evaluation Recalling normal forms Functional dependency theory Computing closures of attribute sets BCNF decomposition

More information

Database Systems. Basics of the Relational Data Model

Database Systems. Basics of the Relational Data Model Database Systems Relational Design Theory Jens Otten University of Oslo Jens Otten (UiO) Database Systems Relational Design Theory INF3100 Spring 18 1 / 30 Basics of the Relational Data Model title year

More information

FUNCTIONAL DEPENDENCIES

FUNCTIONAL DEPENDENCIES FUNCTIONAL DEPENDENCIES CS 564- Spring 2018 ACKs: Dan Suciu, Jignesh Patel, AnHai Doan WHAT IS THIS LECTURE ABOUT? Database Design Theory: Functional Dependencies Armstrong s rules The Closure Algorithm

More information

Functional Dependencies CS 1270

Functional Dependencies CS 1270 Functional Dependencies CS 1270 Constraints We use constraints to enforce semantic requirements on a DBMS Predicates that the DBMS must ensure to be always true. Predicates are checked when the DBMS chooses

More information

The Relational Model and Normalization

The Relational Model and Normalization The Relational Model and Normalization 1. Introduction 2 2. Relational Model Terminology 3 4. Normal Forms 11 5. Multi-valued Dependency 21 6. The Fifth Normal Form 22 The Relational Model and Normalization

More information

Carnegie Mellon Univ. Dept. of Computer Science /615 - DB Applications. Overview - detailed. Goal. Faloutsos & Pavlo CMU SCS /615

Carnegie Mellon Univ. Dept. of Computer Science /615 - DB Applications. Overview - detailed. Goal. Faloutsos & Pavlo CMU SCS /615 Faloutsos & Pavlo 15-415/615 Carnegie Mellon Univ. Dept. of Computer Science 15-415/615 - DB Applications Lecture #17: Schema Refinement & Normalization - Normal Forms (R&G, ch. 19) Overview - detailed

More information

To overcome these anomalies we need to normalize the data. In the next section we will discuss about normalization.

To overcome these anomalies we need to normalize the data. In the next section we will discuss about normalization. Anomalies in DBMS There are three types of anomalies that occur when the database is not normalized. These are Insertion, update and deletion anomaly. Let s take an example to understand this. Example:

More information

Normalization. Anomalies Functional Dependencies Closures Key Computation Projecting Relations BCNF Reconstructing Information Other Normal Forms

Normalization. Anomalies Functional Dependencies Closures Key Computation Projecting Relations BCNF Reconstructing Information Other Normal Forms Anomalies Functional Dependencies Closures Key Computation Projecting Relations BCNF Reconstructing Information Other Normal Forms Normalization Niklas Fors (niklas.fors@cs.lth.se) Normalization 1 / 45

More information

Steps in normalisation. Steps in normalisation 7/15/2014

Steps in normalisation. Steps in normalisation 7/15/2014 Introduction to normalisation Normalisation Normalisation = a formal process for deciding which attributes should be grouped together in a relation Normalisation is the process of decomposing relations

More information

The Relational Data Model

The Relational Data Model The Relational Data Model Lecture 6 1 Outline Relational Data Model Functional Dependencies Logical Schema Design Reading Chapter 8 2 1 The Relational Data Model Data Modeling Relational Schema Physical

More information

Relational Design 1 / 34

Relational Design 1 / 34 Relational Design 1 / 34 Relational Design Basic design approaches. What makes a good design better than a bad design? How do we tell we have a "good" design? How to we go about creating a good design?

More information

Relational Database Systems 1

Relational Database Systems 1 Relational Database Systems 1 Wolf-Tilo Balke Simon Barthel Institut für Informationssysteme Technische Universität Braunschweig www.ifis.cs.tu-bs.de 10.0 Introduction Up to now, we have learned......

More information

Redundancy:Dependencies between attributes within a relation cause redundancy.

Redundancy:Dependencies between attributes within a relation cause redundancy. Normalization Normalization: It is the process of removing redundant data from your tables in order to improve storage efficiency, data integrity and scalability. This improvement is balanced against an

More information

CS348: INTRODUCTION TO DATABASE MANAGEMENT (Winter, 2011) FINAL EXAMINATION

CS348: INTRODUCTION TO DATABASE MANAGEMENT (Winter, 2011) FINAL EXAMINATION CS348: INTRODUCTION TO DATABASE MANAGEMENT (Winter, 2011) FINAL EXAMINATION INSTRUCTOR: Grant Weddell TIME: 150 minutes WRITE YOUR NAME AND ID HERE: NOTE 1: This is a closed book examination. For example,

More information

Relational Design Theory. Relational Design Theory. Example. Example. A badly designed schema can result in several anomalies.

Relational Design Theory. Relational Design Theory. Example. Example. A badly designed schema can result in several anomalies. Relational Design Theory Relational Design Theory A badly designed schema can result in several anomalies Update-Anomalies: If we modify a single fact, we have to change several tuples Insert-Anomalies:

More information

Lecture 5 Design Theory and Normalization

Lecture 5 Design Theory and Normalization CompSci 516 Data Intensive Computing Systems Lecture 5 Design Theory and Normalization Instructor: Sudeepa Roy Duke CS, Fall 2017 CompSci 516: Database Systems 1 HW1 deadline: Announcements Due on 09/21

More information

Desirable database characteristics Database design, revisited

Desirable database characteristics Database design, revisited CMPT 354 Desirable database characteristics Database design, revisited Normal Forms First Normal Form Second Normal Form Third Normal Form Boyce-Codd Normal Form Normal forms and functional dependencies

More information

Database Management System Prof. Partha Pratim Das Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur

Database Management System Prof. Partha Pratim Das Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur Database Management System Prof. Partha Pratim Das Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur Lecture - 19 Relational Database Design (Contd.) Welcome to module

More information

Relational Database Systems 1. Christoph Lofi Simon Barthel Institut für Informationssysteme Technische Universität Braunschweig

Relational Database Systems 1. Christoph Lofi Simon Barthel Institut für Informationssysteme Technische Universität Braunschweig Relational Database Systems 1 Christoph Lofi Simon Barthel Institut für Informationssysteme Technische Universität Braunschweig www.ifis.cs.tu-bs.de 10.0 Introduction Up to now, we have learned... how

More information

Chapter 16. Relational Database Design Algorithms. Database Design Approaches. Top-Down Design

Chapter 16. Relational Database Design Algorithms. Database Design Approaches. Top-Down Design Chapter 16 Relational Database Design Algorithms Database Design Approaches Top-Down design (Starting with conceptual design) Bottom-Up Design (relational synthesis) 2 Top-Down Design Design conceptual

More information

CMU SCS CMU SCS CMU SCS CMU SCS whole nothing but

CMU SCS CMU SCS CMU SCS CMU SCS whole nothing but Faloutsos & Pavlo 15-415/615 Carnegie Mellon Univ. Dept. of Computer Science 15-415/615 - DB Applications Lecture #17: Schema Refinement & Normalization - Normal Forms (R&G, ch. 19) Overview - detailed

More information

Database Management System

Database Management System Database Management System Lecture 4 Database Design Normalization and View * Some materials adapted from R. Ramakrishnan, J. Gehrke and Shawn Bowers Today s Agenda Normalization View Database Management

More information

Birkbeck. (University of London) BSc/FD EXAMINATION. Department of Computer Science and Information Systems. Database Management (COIY028H6)

Birkbeck. (University of London) BSc/FD EXAMINATION. Department of Computer Science and Information Systems. Database Management (COIY028H6) Birkbeck (University of London) BSc/FD EXAMINATION Department of Computer Science and Information Systems Database Management (COIY028H6) CREDIT VALUE: 15 credits Date of examination: 9 June 2016 Duration

More information

Schema Refinement & Normalization Theory 2. Week 15

Schema Refinement & Normalization Theory 2. Week 15 Schema Refinement & Normalization Theory 2 Week 15 1 How do we know R is in BCNF? If R has only two attributes, then it is in BCNF If F only uses attributes in R, then: R is in BCNF if and only if for

More information

Functional Dependencies

Functional Dependencies Functional Dependencies Redundancy in Database Design A table Students-take-courses (stud-id, name, address, phone, crs-id, instructor-name, office) Students(stud-id, name, address, phone, ) Instructors(name,

More information

Databases The theory of relational database design Lectures for m

Databases The theory of relational database design Lectures for m Databases The theory of relational database design Lectures for mathematics students April 2, 2017 General introduction Look; that s why there s rules, understand? So that you think before you break em.

More information

Relational Database Systems 1 Wolf-Tilo Balke Hermann Kroll, Janus Wawrzinek, Stephan Mennicke

Relational Database Systems 1 Wolf-Tilo Balke Hermann Kroll, Janus Wawrzinek, Stephan Mennicke Relational Database Systems 1 Wolf-Tilo Balke Hermann Kroll, Janus Wawrzinek, Stephan Mennicke Institut für Informationssysteme Technische Universität Braunschweig www.ifis.cs.tu-bs.de 10.0 Introduction

More information

Lecture 11 - Chapter 8 Relational Database Design Part 1

Lecture 11 - Chapter 8 Relational Database Design Part 1 CMSC 461, Database Management Systems Spring 2018 Lecture 11 - Chapter 8 Relational Database Design Part 1 These slides are based on Database System Concepts 6th edition book and are a modified version

More information

UNIT -III. Two Marks. The main goal of normalization is to reduce redundant data. Normalization is based on functional dependencies.

UNIT -III. Two Marks. The main goal of normalization is to reduce redundant data. Normalization is based on functional dependencies. UNIT -III Relational Database Design: Features of Good Relational Designs- Atomic Domains and First Normal Form- Second Normal Form-Decomposition Using Functional Dependencies- Functional-Dependency Theory-Algorithms

More information

Lecture 6a Design Theory and Normalization 2/2

Lecture 6a Design Theory and Normalization 2/2 CompSci 516 Data Intensive Computing Systems Lecture 6a Design Theory and Normalization 2/2 Instructor: Sudeepa Roy 1 HW1 deadline: Announcements Due on 09/21 (Thurs), 11:55 pm, no late days Project proposal

More information

Applied Databases. Sebastian Maneth. Lecture 5 ER Model, Normal Forms. University of Edinburgh - January 30 th, 2017

Applied Databases. Sebastian Maneth. Lecture 5 ER Model, Normal Forms. University of Edinburgh - January 30 th, 2017 Applied Databases Lecture 5 ER Model, Normal Forms Sebastian Maneth University of Edinburgh - January 30 th, 2017 Outline 2 1. Entity Relationship Model 2. Normal Forms From Last Lecture 3 the Lecturer

More information

18. Relational Database Design

18. Relational Database Design 18. Relational Database Design The relational datamodel was introduced by Codd in 1970. It is the most widely used datamodel extended with the possibilities of the World Wide Web, because of its simplicity

More information

ADVANCED DATABASES ; Spring 2015 Prof. Sang-goo Lee (11:00pm: Mon & Wed: Room ) Advanced DB Copyright by S.-g.

ADVANCED DATABASES ; Spring 2015 Prof. Sang-goo Lee (11:00pm: Mon & Wed: Room ) Advanced DB Copyright by S.-g. 4541.564; Spring 2015 Prof. Sang-goo Lee (11:00pm: Mon & Wed: Room 301-203) ADVANCED DATABASES Copyright by S.-g. Lee Review - 1 General Info. Text Book Database System Concepts, 6 th Ed., Silberschatz,

More information

CS352 Lecture - Conceptual Relational Database Design

CS352 Lecture - Conceptual Relational Database Design CS352 Lecture - Conceptual Relational Database Design Objectives: last revised September 20, 2006 1. To define the concepts functional dependency and multivalued dependency 2. To show how to find the closure

More information

Homework 6: FDs, NFs and XML (due April 13 th, 2016, 4:00pm, hard-copy in-class please)

Homework 6: FDs, NFs and XML (due April 13 th, 2016, 4:00pm, hard-copy in-class please) Virginia Tech. Computer Science CS 4604 Introduction to DBMS Spring 2016, Prakash Homework 6: FDs, NFs and XML (due April 13 th, 2016, 4:00pm, hard-copy in-class please) Reminders: a. Out of 100 points.

More information

Schema Normalization. 30 th August Submitted By: Saurabh Singla Rahul Bhatnagar

Schema Normalization. 30 th August Submitted By: Saurabh Singla Rahul Bhatnagar Schema Normalization 30 th August 2011 Submitted By: Saurabh Singla 09010146 Rahul Bhatnagar 09010136 Normalization Consider the following ER diagram with some FD: Instructor iid A Student sid Department

More information

Concepts from

Concepts from 1 Concepts from 3.1-3.2 Functional dependencies Keys & superkeys of a relation Reasoning about FDs Closure of a set of attributes Closure of a set of FDs Minimal basis for a set of FDs 2 Plan How can we

More information

Part II: Using FD Theory to do Database Design

Part II: Using FD Theory to do Database Design Part II: Using FD Theory to do Database Design 32 Recall that poorly designed table? part manufacturer manaddress seller selleraddress price 1983 Hammers R Us 99 Pinecrest ABC 1229 Bloor W 5.59 8624 Lee

More information

Typical relationship between entities is ((a,b),(c,d) ) is best represented by one table RS (a,b,c,d)

Typical relationship between entities is ((a,b),(c,d) ) is best represented by one table RS (a,b,c,d) Mapping ER Diagrams to a relational database.. Binary relationships: Three possible configurations: 1. One table.. 2. Two tables.. 3. Three tables.. 1-1 relationships R(AB) - S(CD) Typical relationship

More information

Handout 3: Functional Dependencies and Normalization

Handout 3: Functional Dependencies and Normalization Virginia Tech. Computer Science CS 4604 Introduction to DBMS Spring 2015, Prakash Handout 3: Functional Dependencies and Normalization Q1: Inferring FDs Consider the following schemas and sets of functional

More information

Relational Database Design. Announcements. Database (schema) design. CPS 216 Advanced Database Systems. DB2 accounts have been set up

Relational Database Design. Announcements. Database (schema) design. CPS 216 Advanced Database Systems. DB2 accounts have been set up Relational Database Design CPS 216 Advanced Database Systems Announcements 2 DB2 accounts have been set up Let me know if you have not received an email from me regarding your account Recitation session

More information

CS211 Lecture: Database Design

CS211 Lecture: Database Design CS211 Lecture: Database Design Objectives: last revised November 21, 2006 1. To introduce the anomalies that result from redundant storage of data 2. To introduce the notion of functional dependencies

More information

Homework 6: FDs, NFs and XML (due April 15 th, 2015, 4:00pm, hard-copy in-class please)

Homework 6: FDs, NFs and XML (due April 15 th, 2015, 4:00pm, hard-copy in-class please) Virginia Tech. Computer Science CS 4604 Introduction to DBMS Spring 2015, Prakash Homework 6: FDs, NFs and XML (due April 15 th, 2015, 4:00pm, hard-copy in-class please) Reminders: a. Out of 100 points.

More information

Announcements (January 20) Relational Database Design. Database (schema) design. Entity-relationship (E/R) model. ODL (Object Definition Language)

Announcements (January 20) Relational Database Design. Database (schema) design. Entity-relationship (E/R) model. ODL (Object Definition Language) Announcements (January 20) 2 Relational Database Design Review for Codd paper due tonight via email Follow instructions on course Web site Reading assignment for next week (Ailamaki et al., VLDB 2001)

More information

CS352 Lecture - Conceptual Relational Database Design

CS352 Lecture - Conceptual Relational Database Design CS352 Lecture - Conceptual Relational Database Design Objectives: last revised September 16, 2004 1. To define the concepts functional dependency and multivalued dependency 2. To show how to find the closure

More information

DATABASTEKNIK - 1DL116

DATABASTEKNIK - 1DL116 1 DATABASTEKNIK - 1DL116 Spring 2004 An introductury course on database systems http://user.it.uu.se/~udbl/dbt-vt2004/ Kjell Orsborn Uppsala Database Laboratory Department of Information Technology, Uppsala

More information

MIDTERM EXAMINATION Spring 2010 CS403- Database Management Systems (Session - 4) Ref No: Time: 60 min Marks: 38

MIDTERM EXAMINATION Spring 2010 CS403- Database Management Systems (Session - 4) Ref No: Time: 60 min Marks: 38 Student Info StudentID: Center: ExamDate: MIDTERM EXAMINATION Spring 2010 CS403- Database Management Systems (Session - 4) Ref No: 1356458 Time: 60 min Marks: 38 BC080402322 OPKST 5/28/2010 12:00:00 AM

More information