Ontologies and Database Schema: What s the Difference? Michael Uschold, PhD Semantic Arts.

Similar documents
Ontology and database schema: What s the difference?

Approach for Mapping Ontologies to Relational Databases

LAB 2 Notes. Conceptual Design ER. Logical DB Design (relational) Schema Refinement. Physical DD

Fundamentals of Physical Design: State of Art

Chapter 4. Enhanced Entity- Relationship Modeling. Enhanced-ER (EER) Model Concepts. Subclasses and Superclasses (1)

Using Relational Databases for Digital Research

Course Book Academic Year

High Level Database Models

What is a Data Model?

Where is the Semantics on the Semantic Web?

Ontology-Driven Conceptual Modelling

Review -Chapter 4. Review -Chapter 5

Informal Design Guidelines for Relational Databases

Best Practices - Pentaho Data Modeling

Semi-Automatic Conceptual Data Modeling Using Entity and Relationship Instance Repositories

Ontology - based Semantic Value Conversion

Models versus Ontologies - What's the Difference and where does it Matter?

OVERVIEW OF DATABASE DEVELOPMENT

B.H.GARDI COLLEGE OF MASTER OF COMPUTER APPLICATION. Ch. 1 :- Introduction Database Management System - 1

Practical Database Design Methodology and Use of UML Diagrams Design & Analysis of Database Systems

High-Level Database Models (ii)

The Emerging Data Lake IT Strategy

Chapter 8: Enhanced ER Model

TOWARDS ONTOLOGY DEVELOPMENT BASED ON RELATIONAL DATABASE

The challenge of reasoning for OLF s s IO G2

Assignment Session : July-March


ADVANCED DATABASES ; Spring 2015 Prof. Sang-goo Lee (11:00pm: Mon & Wed: Room ) Advanced DB Copyright by S.-g.

Evaluation Guide for ASP.NET Web CMS and Experience Platforms

Database Management Systems MIT Introduction By S. Sabraz Nawaz

An Ontological Approach to Domain Engineering

; Spring 2008 Prof. Sang-goo Lee (14:30pm: Mon & Wed: Room ) ADVANCED DATABASES

Data about data is database Select correct option: True False Partially True None of the Above

Week. Lecture Topic day (including assignment/test) 1 st 1 st Introduction to Module 1 st. Practical

DATABASE MANAGEMENT SYSTEMS

DATABASE DEVELOPMENT (H4)

Model Driven Engineering (MDE)

CSE 562 Database Systems

Chapter 2 Entity-Relationship Data Modeling: Tools and Techniques. Fundamentals, Design, and Implementation, 9/e

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Fall 2009 Lecture 3 - Schema Normalization

8 Designing classes. Main concepts to be covered. Software changes. Change or die. The Zuul Classes. World of Zuul

Chapter 2: Entity-Relationship Model

Review The Big Picture

UML-Based Conceptual Modeling of Pattern-Bases

Enterprise Information Integration using Semantic Web Technologies:

Do-It-Yourself Data Migration

Semantics-Based Integration of Embedded Systems Models

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Winter 2009 Lecture 4 - Schema Normalization

The Semantic Planetary Data System

Software Architectures

Reference Resolution and Views Working Note 25

Conceptual Database Modeling

The Entity-Relationship Model

Lecture2: Database Environment

H1 Spring B. Programmers need to learn the SOAP schema so as to offer and use Web services.

Database Management System. Fundamental Database Concepts

Solved MCQ on fundamental of DBMS. Set-1

Schema Repository Database Evolution And Metamodeling

Chapter 2 Entity-Relationship Data Modeling: Tools and Techniques. Fundamentals, Design, and Implementation, 9/e

Improve SSIS Delivery with a Patterns-Based Approach. Meagan Longoria July 19, 2017

CPS510 Database System Design Primitive SYSTEM STRUCTURE

1.264 Midterm Exam Solutions Fall, Name:

Helmi Ben Hmida Hannover University, Germany

Essentials of Database Management

a paradigm for the Introduction to Semantic Web Semantic Web Angelica Lo Duca IIT-CNR Linked Open Data:

It Is What It Does: The Pragmatics of Ontology for Knowledge Sharing

LAB 3 Notes. Codd proposed the relational model in 70 Main advantage of Relational Model : Simple representation (relationstables(row,

Keyword Search over Hybrid XML-Relational Databases

SYLLABUS ADMIN DATABASE SYSTEMS I WEEK 2 THE ENTITY-RELATIONSHIP MODEL. Assignment #2 changed. A2Q1 moved to A3Q1

7. Query Processing and Optimization

SQL DDL. CS3 Database Systems Weeks 4-5 SQL DDL Database design. Key Constraints. Inclusion Constraints

(Extended) Entity Relationship

Fundamentals of Databases

Ontology as Knowledge Base for Spatial Data Harmonization

Review for Exam 1 CS474 (Norton)

Contents Contents 1 Introduction Entity Types... 37

Systems:;-'./'--'.; r. Ramez Elmasri Department of Computer Science and Engineering The University of Texas at Arlington

challenges in domain-specific modeling raphaël mannadiar august 27, 2009

DISCUSSION 5min 2/24/2009. DTD to relational schema. Inlining. Basic inlining

FUNDAMENTALS OF. Database S wctpmc. Shamkant B. Navathe College of Computing Georgia Institute of Technology. Addison-Wesley

Contents. Database. Information Policy. C03. Entity Relationship Model WKU-IP-C03 Database / Entity Relationship Model

BLU AGE 2009 Edition Agile Model Transformation

Overview of db design Requirement analysis Data to be stored Applications to be built Operations (most frequent) subject to performance requirement

Introduction to Database Systems CSE 414. Lecture 19: E/R Diagrams

Relational Model: History

ORACLE DATABASE 12C INTRODUCTION

CSE 344 JULY 30 TH DB DESIGN (CH 4)

FUNCTIONAL DEPENDENCIES

Software Design Fundamentals. CSCE Lecture 11-09/27/2016

Fundamentals of. Database Systems. Shamkant B. Navathe. College of Computing Georgia Institute of Technology PEARSON.

Chapter 4. The Relational Model

An overview of RDB2RDF techniques and tools

Chapter 8 The Enhanced Entity- Relationship (EER) Model

CSE 344 MAY 11 TH ENTITIES

The Unified Modelling Language. Example Diagrams. Notation vs. Methodology. UML and Meta Modelling

SKOS. COMP62342 Sean Bechhofer

Ch. 21: Object Oriented Databases

Protégé-2000: A Flexible and Extensible Ontology-Editing Environment

Chapter 7: Entity-Relationship Model

SEMANTIC WEB POWERED PORTAL INFRASTRUCTURE

Transcription:

Ontologies and Database Schema: What s the Difference? Michael Uschold, PhD Semantic Arts.

Objective To settle once and for all the question: What is the difference between an ontology and a database schema? Often asked. Never adequately answered. Page 2

Example: Hydraulics Pumping done-by Mechanical Device Hydraulic System Pump Engine Fuel System has-part Jet Engine supplies-fuel-to Hydraulic Pump Fuel Pump part-of connected-to Fuel Filter Aircraft Engine Driven Pump Ontology Logical DB Schema: IDEF1X Is this similarity just superficial? Page 3

The Basic Idea for Each For shared understanding. Ontology: defines a set of concepts and relationships that represent the content and structure of some subject matter in a formal language. For a database. Database Schema: defines the structure of a database in a formal language. loosely refers to any of: conceptual, logical, physical Pumping done-by Mechanical Device Hydraulic System Pump Engine Fuel System has-part Jet Engine supplies-fuel-to Hydraulic Pump Fuel Pump part-of connected-to Fuel Filter Aircraft Engine Driven Pump Page 4

Five Questions to get a Better Understanding 1. What is it for? 2. What does it look like? it : ontology or database schema 3. How do you build one? 4. How is it implemented and used? 5. Where are the semantics? Quick Poll: more alike? Are they more alike or different? Quick Poll: more different? Page 5

What is it for? DB Schema Ontology Focus Data Meaning Shared Understanding Core Purpose(s) Single purpose. Structure instances for efficient storage and querying. Human communication, interoperability, search, software engineering, Also for structuring instances. Thus Meaning lost. Instances optional. Page 6

What does it look like? Notation DB Schema Ontology Notation: syntax ER diagrams; no standard serialization syntax. Logic; no standard diagram notation syntax. Notation: semantics Minimal focus on formal semantics. Strong focus on formal semantics. Page 7

What does it look like? Expressivity Expressivity overlap Expressivity differences Cardinality constraints. DB Schema Entities Attributes, Relations Constraints No taxonomy. Constraints for integrity, foreign key, delete. Ontology Classes Properties Axioms Taxonomy is backbone. Constraints for meaning, consistency & integrity. Page 8

How to Build One? Starting point: Normalization: Optimization: Toss expensive constraints. Dirty data, lost meaning. DB Schema Scratch, rarely reuse. Standard rules in natural language, little tool support. Fundamental step. Manual, geared to specific queries for specific DB. Ontology Reuse if possible. No standard rules or guidelines. OntoClean Some tuning may be required. A black art. Ontology independent; Inference engine developers. Page 9

How is it Implemented and Used? Change Management, Agility, Flexibility Semantics hardwired in procedural code. DB Schema Locked into specific set of queries per DB. Tight coupling. Lost meaning. Hard to evolve and maintain. ETL tools to help. Ontology No query lock-in. Queries usable on other systems. Looser coupling. Semantics explicit. Potentially easier to evolve & maintain. Few tools. Still no picnic! Page 10

How is it Implemented and Used? Processing Engines Primary Focus: DB Schema SQL Engines Queries Reasoning with Views Data integrity Ontology Theorem Provers Derive new information from existing information. Consistency and integrity Standardized on SQL Both handle complex logical expressions. Less standardization Page 11

How is it Implemented and Used? Performance DB Schema Highly tuned for performance and scale. Not work well with too many joins. Ontology Full inferencing: much smaller scale. Reduced inferencing: reaching large scale. Tradeoff: Performance vs. Function & Flexibility RDB and Triple Stores both have a Niche. Page 12

Building Databases & Applications Gather Requirements Build Conceptual Schema (e.g., ER or UML model) Refine to Logical Schema (e.g., normalize, still ER or UML) Refine to Physical Schema Page 13

Refine to Physical Schema Define tables, columns, keys. Optimize to Specific Kinds of Queries. Create Data Dictionary (semantics for humans). Integrity Constraints Domain, Referential & Semantic integrity Do the best you can, little automation. Where are the Semantics? Page 14

Five Questions to get a Better Understanding 1. What is it for? 2. What does it look like? it : ontology or database schema 3. How do you build one? 4. How is it implemented and used? 5. Where are the semantics? Page 15

Where are the Semantics for Database Schema? Mainly in Conceptual Schema and Data Dictionary Designed for Humans Semantics don t evolve as DB and applications change Conceptual Schema semantics thrown away when building Physical Schema. Integrity constraints hardwired in procedural code Semantics are hardwired, lost, tossed, or out of date. You cannot maintain what you do not understand! Page 16

Quick Summary Database Schema Focus on DATA DB Constraints to ensure integrity may hint at meaning No ISA hierarchy SQL Engines querying, views data integrity Instances Central Data Dictionary separate artifact Ontology FOCUS on Meaning Ontology Axioms to specify meaning maybe for integrity ISA Hierarchy is Backbone Theorem Provers infer new information ensure consistency Instances Optional 'Comments' part of the ontology Page 17

More Detailed Summary: 24 Features/Aspects Core for DB Schema Secondary for DB Schema Unimportant for DB Schema Core for Ontology 1. Represented using a formal language. 2. Expressivity: types, properties, constraints. 3. Constraints for consistency checking. 4. Shared meaning of some subject matter. 5. Taxonomy. 6. Multi-purpose. 7. Embedded natural language definitions. 8. Constraints for meaning. 9. Constraints for ensuring self-consistency (not data). 16. Abstract types w/no instances. 17. Reused to build new ones. 18. Reused in unexpected ways. 19. Formal model-theoretic semantics. Secondary for Ontology Unimportant for Ontology 10. Efficient querying and storage for data. 11. Standardized diagram notation. 12. Separate natural language definitions (data dictionary). 13. Constraints for data integrity. 14. Industry-wide construction guidelines (normalization). 15. Scale to huge sizes. 20. Cardinality constraints for getting foreign keys right and ensuring tables created for many-to-many relationships. 21. Toss semantics after conceptual modeling. 22. Optimization for specific set of queries. 23. Sophisticated tool support for migrating data when schema evolve (ETL). 24. SQL for querying data. More Alike? Over 60% of features are common to both. The 3 features core to both are the most important: what is expressed and how. More Different? Only 12% of features are core to both. Can you convert one into the other? Page 18

More Alike? or More Different? Conversion? Conceptual Schema & Ontology No harder than between two different ontology languages Ontology & Logical Schema Some loss of information, a design artifact Ontology & Physical Schema Much loss of information, an implementation artifact My Big Fat Greek Wedding Toast: even though we are apples and oranges, we are all fruit. Page 19

Conclusion They are similar beasts. They evolved in different communities. Two cultures divided by a common idea? Speaking of weddings What do you get if you cross: DB schema technology/community? Ontology technology/community? Page 20

Ontology/Model-Driven Architecture & Development Basic Idea: Explicitly capture the semantics as formal ontologies. Base design- and implementation-level artifacts on the ontologies. Ontologies drive the applications, directly or indirectly. Benefits: Looser coupling. Semantics is explicit. Integration/Interoperability by design. Inference gives automated consistency checking. Conceptual, Real world Logical, Design world Physical, Implementation world Easier to evolve and maintain: Less hardwiring of semantics means easier to understand and change Ontologies evolve with the DB and applications. Page 21

Acknowledgements For answering countless questions as a cube-mate: John Thompson For reviews of a companion unpublished paper: Phil Bernstein, Tim Wilmering, Jun Yuan, Anhai Doan, Bill Andersen, Amit Sheth. For ideas on model-driven development: Simon Robe. Page 22