Reasoning with Expressive Description Logics: Theory and Practice

Similar documents
Description Logic: Axioms and Rules

DAML+OIL and Description Logic Reasoning

Ulrike Sattler. University of Manchester, UK. TU Dresden, Germany. Institut für Theoretische Informatik

Reasoning with DAML+OIL:

Optimised Classification for Taxonomic Knowledge Bases

OWL an Ontology Language for the Semantic Web

2 Subsumption testing. Abstract. 1 Introduction. 3 Performance

Explaining Subsumption in ALEHF R + TBoxes

OWL Rules, OK? Ian Horrocks Network Inference Carlsbad, CA, USA

DL Reasoner vs. First-Order Prover

Optimising Terminological Reasoning for Expressive Description Logics

l A family of logic based KR formalisms l Distinguished by: l Decidable fragments of FOL l Closely related to Propositional Modal & Dynamic Logics

THE DESCRIPTION LOGIC HANDBOOK: Theory, implementation, and applications

Ordering Heuristics for Description Logic Reasoning

Logical Foundations for the Semantic Web

Description Logics as Ontology Languages for Semantic Webs

OWL and tractability. Based on slides from Ian Horrocks and Franz Baader. Combining the strengths of UMIST and The Victoria University of Manchester

Principles of Knowledge Representation and Reasoning

Description Logics and OWL

Towards Efficient Reasoning for Description Logics with Inverse Roles

Scalability via Parallelization of OWL Reasoning

A Tableaux Decision Procedure for SHOIQ

Evaluating a modular Abox algorithm

Searching for the Holy Grail. Ian Horrocks Information Systems Group Oxford University Computing Laboratory

Language. DAML+OIL: a Reason-able Web Ontology. Manchester, UK. Ian Horrocks. University of Manchester.

Implementing an ALCRP(D) ABox Reasoner Progress Report. 2 The Implementation. 1 Introduction. 2.1 Preprocessing. 2.

Nonstandard Inferences in Description Logics

Description Logics. F. Description Logics. This section is based on material from Ian Horrocks:

Description Logic: A Formal Foundation for Ontology Languages and Tools

AI Fundamentals: Knowledge Representation and Reasoning. Maria Simi

Helmi Ben Hmida Hannover University, Germany

Description Logic. Eva Mráková,

DAML+OIL: an Ontology Language for the Semantic Web

OPTIMISING TABLEAUX DECISION PROCEDURES FOR DESCRIPTION LOGICS

Description Logic Systems with Concrete Domains: Applications for the Semantic Web

The revival of structural subsumption in tableau-based description logic reasoners

Week 4. COMP62342 Sean Bechhofer, Uli Sattler

SEMANTIC WEB AND COMPARATIVE ANALYSIS OF INFERENCE ENGINES

On the Scalability of Description Logic Instance Retrieval

Racer: An OWL Reasoning Agent for the Semantic Web

Optimization Techniques for Retrieving Resources Described in OWL/RDF Documents: First Results

The OWL Instance Store: System Description

Benchmarking DL Reasoners Using Realistic Ontologies

The Instance Store: DL Reasoning with Large Numbers of Individuals

! model construction

A Knowledge Compilation Technique for ALC Tboxes

Appendix 1. Description Logic Terminology

jcel: A Modular Rule-based Reasoner

Appendix 1. Description Logic Terminology

Presented By Aditya R Joshi Neha Purohit

Scalable Ontology-Based Information Systems

DAML+OIL: a Reason-able Web Ontology Language

Standardization of Ontologies

On the Reduction of Dublin Core Metadata Application Profiles to Description Logics and OWL

Introduction to Description Logics

Ontologies and OWL. Riccardo Rosati. Knowledge Representation and Semantic Technologies

RELATIONAL REPRESENTATION OF ALN KNOWLEDGE BASES

On the Scalability of Description Logic Instance Retrieval

Extending Datatype Support in Web Ontology Reasoning

Algebraic Tableau Algorithm for ALCOQ

Augmenting Concept Languages by Transitive Closure of Roles An Alternative to Terminological Cycles

Get my pizza right: Repairing missing is-a relations in ALC ontologies

Representing Product Designs Using a Description Graph Extension to OWL 2

Challenges in the Core of Ontology Support Systems

Extending the SHOIQ(D) Tableaux with DL-safe Rules: First Results

OntoXpl Exploration of OWL Ontologies

A Unified Logical Framework for Rules (and Queries) with Ontologies - position paper -

Description Logics for Conceptual Data Modeling in UML

Converting Instance Checking to Subsumption: A Rethink for Object Queries over Practical Ontologies

SEMANTICS. Retrieval by Meaning

Description Logics. Description Logics and Databases

Computing least common subsumers for FLE +

Intelligent Brokering of Environmental Information with the BUSTER System

Practical Reasoning for Description Logics with Functional Restrictions, Inverse and Transitive Roles, and Role Hierarchies

Ontologies and the Web Ontology Language OWL

Semantic Web. Part 3 The ontology layer 1: Ontologies, Description Logics, and OWL

Toward Multi-Viewpoint Reasoning with OWL Ontologies

Semantic Matching of Description Logics Ontologies

Ontology Development and Engineering. Manolis Koubarakis Knowledge Technologies

An Architecture for Semantic Enterprise Application Integration Standards

Description Logics and Disjunctive Datalog The Story so Far

A Solution for the Man-Man Problem in the Family History Knowledge Base

OWL extended with Meta-modelling

Knowledge Representation on the Web

From Lexicon To Mammographic Ontology: Experiences and Lessons

OWL 2 The Next Generation. Ian Horrocks Information Systems Group Oxford University Computing Laboratory

Lightweight Semantic Web Motivated Reasoning in Prolog

Racer - An Inference Engine for the Semantic Web

Extracting Finite Sets of Entailments from OWL Ontologies

Description logic reasoning using the PTTP approach

An Overview of Tableau Algorithms for Description Logics Λ

Mastro Studio: a system for Ontology-Based Data Management

Modularity in Ontologies: Introduction (Part A)

Role-depth Bounded Least Common Subsumers for EL + and ELI

Description Logics: A Logical Foundation of the Semantic Web and its Applications

Query Answering Systems in the Semantic Web

LTCS Report. Concept Descriptions with Set Constraints and Cardinality Constraints. Franz Baader. LTCS-Report 17-02

TrOWL: Tractable OWL 2 Reasoning Infrastructure

A Tool for Storing OWL Using Database Technology

Structure of This Presentation

Transcription:

Reasoning with Expressive Description Logics: Theory and Practice Ian Horrocks horrocks@cs.man.ac.uk University of Manchester Manchester, UK Reasoning with Expressive DLs p.1/39

Talk Outline Introduction to Description Logics (DLs) Reasoning techniques Implementing DL systems DL applications Including demos (time permitting) Reasoning with Expressive DLs p.2/39

Introduction to DLs Reasoning with Expressive DLs p.3/39

What are Description Logics? Based on concepts (classes) and roles Concepts (classes) are interpreted as sets of objects Roles are interpreted as binary relations on objects Descendants of semantic networks, frame based systems and KL-ONE Decidable fragments of FOL Many DLs are fragments of L2, C2 or the Guarded Fragment Closely related to propositional modal logics Also known as terminological logics, concept languages, etc. Key features of DLs are Well defined semantics (they are logics) Provision of inference services Reasoning with Expressive DLs p.4/39

DL Applications DLs have may applications including: Terminological KR (including Ontologies) Medical terminology/controled vocabulary (Galen) Bio-ontologies (Tambis, GO) Web based ontology languages (OIL, DAML+OIL) Configuration Classic system used to configure telecom equipment Database schema and query reasoning Schema design and query optimisation Interoperability and federation Query containment (w.r.t. schema) Reasoning with Expressive DLs p.5/39

DL System Architecture Knowledge Base Tbox (schema) Man =. Human Male Happy-Father =. Man has-child.female.... Abox (data) John : Happy-Father John, Mary : has-child. Inference System Interface Reasoning with Expressive DLs p.6/39

DL Constructors Particular DLs characterised by set of constructors provided for building complex concepts and roles from simpler ones Usually include at least: Conjunction ( ), disjunction ( ), negation ( ) Restricted (guarded) forms of quantification (, ) This basic DL is known as ALC Reasoning with Expressive DLs p.7/39

DL Syntax and Semantics Semantics given by interpretation I = ( I, I) Constructor Syntax Example Semantics atomic concept A Human A I I atomic role R has-child R I I I and for C, D concepts and R a role name conjunction C D Human Male C I D I disjunction C D Doctor Lawyer C I D I negation C Male I \ C exists restr. R.C has-child.male {x y. x, y R I y C I } value restr. R.C has-child.doctor {x y. x, y R I = y C I } Reasoning with Expressive DLs p.8/39

Other DL Constructors Many different DLs/DL constructors have been investigated, e.g. Constructor Syntax Example Semantics number restr. nr 3 has-child {x {y. x, y R I } n} nr 1 has-mother {x {y. x, y R I } n} inverse role R has-child { x, y y, x R I } trans. role R has-child (R I ) concrete domain f 1,..., f n.p earns spends < {x P (f1 I,..., f n I )}. Reasoning with Expressive DLs p.9/39

DL Knowledge Base (Tbox) Terminological part (Tbox) is set of axioms describing structure of domain Definition axioms introduce macros/names for concepts A. = C, A C Father. = Man has-child.human Human Animal Biped Inclusion (GCI) axioms assert subsumption relations C D (note C. = D equivalent to C. = D and D. = C) has-degree.masters has-degree.bachelors Reasoning with Expressive DLs p.10/39

DL Knowledge Base (Abox) Assertional part (Abox) is set of axioms describing concrete situation Concept assertions a : C John : Man has-child.female Role assertions a, b : R John, Mary : has-child Reasoning with Expressive DLs p.11/39

Basic Inference Problems Subsumption (structure knowledge, compute taxonomy) C D? Is C I D I in all interpretations? Subsumption w.r.t. Tbox T C T D? Is C I D I in all models of T? Consistency Is C consistent w.r.t. T? Is there a model I of T s.t. C I? KB Consistency Is T, A consistent? Is there a model I of T, A? Reasoning with Expressive DLs p.12/39

Reasoning Techniques Reasoning with Expressive DLs p.13/39

Subsumption and Satisfiability Subsumption transformed into satisfiability Tableaux algorithm used to test satisfiability Try to build model of concept C Model represented by tree T Nodes in T correspond to individuals in model Nodes labeled with sets of subconcepts of C Edges labeled with role names in C Start from root node labeled {C} Apply expansion rules to node labels until Expansion completed (tree represents valid model) Contradictions prove there is no model Non-deterministic expansion search (e.g., C D) Blocking ensures termination (with expressive DLs) Reasoning with Expressive DLs p.14/39

Tableaux Expansion Test satisfiability of S.C S.( C D) R.C R.( R.C)} where R is a transitive role L(w) = { S.C, S.( C D), R.C, R.( R.C)} w S R L(x) = {C, ( C D), D} x y L(y) = {C, R.C, R.( R.C)} R R blocked z L(z) = {C, R.C, R.( R.C)} Reasoning with Expressive DLs p.15/39

More Advanced Techniques Satisfiability w.r.t. a Terminology For each GCI C D T, add C D to every node label More expressive DLs Basic technique can be extended to deal with Role inclusion axioms (role hierarchy) Number restrictions Inverse roles Concrete domains Aboxes Extend expansion rules and use more sophisticated blocking strategy Forest instead of Tree (for Aboxes) Reasoning with Expressive DLs p.16/39

Implementing DL Systems Reasoning with Expressive DLs p.17/39

Naive Implementations Problems include: Space usage Storage required for tableaux datastructures Rarely a serious problem in practice Time usage Search required due to non-deterministic expansion Serious problem in practice Mitigated by: Careful choice of algorithm Highly optimised implementation Reasoning with Expressive DLs p.18/39

Careful Choice of Algorithm Transitive roles instead of transitive closure Deterministic expansion of R.C, even when R R + (Relatively) simple blocking conditions Cycles always represent (part of) valid cyclical models Direct algorithm/implementation instead of encodings GCI axioms can be used to encode additional operators/axioms Powerful technique, particularly when used with FL closure Can encode cardinality constraints, inverse roles, range/domain,... E.g., (domain R.C) R. C (FL) encodings introduce (large numbers of) axioms BUT even simple domain encoding is disastrous with large numbers of roles Reasoning with Expressive DLs p.19/39

Highly Optimised Implementation Optimisation performed at 2 levels Computing classification (partial ordering) of concepts Objective is to minimise number of subsumption tests Can use standard order-theoretic techniques E.g., use enhanced traversal that exploits information from previous tests Also use structural information from KB E.g., to select order in which to classify concepts Computing subsumption between concepts Objective is to minimise cost of single subsumption tests Small number of hard tests can dominate classification time Recent DL research has addressed this problem (with considerable success) Reasoning with Expressive DLs p.20/39

Optimising Subsumption Testing Optimisation techniques broadly fall into 2 categories Pre-processing optimisations Aim is to simplify KB and facilitate subsumption testing Largely algorithm independent Particularly important when KB contains GCI axioms Algorithmic optimisations Main aim is to reduce search space due to non-determinism Integral part of implementation But often generally applicable to search based algorithms Reasoning with Expressive DLs p.21/39

Pre-processing Optimisations Useful techniques include Normalisation and simplification of concepts Refinement of technique first used in KRIS system Lexically normalise and simplify all concepts in KB Combine with lazy unfolding in tableaux algorithm Facilitates early detection of inconsistencies (clashes) Absorption (simplification) of general axioms Eliminate GCIs by absorbing into definition axioms Definition axioms efficiently dealt with by lazy expansion Avoidance of potentially costly reasoning whenever possible Normalisation can discover obvious (un)satisfiability Structural analysis can discover obvious subsumption Reasoning with Expressive DLs p.22/39

Normalisation and Simplification Normalise concepts to standard form, e.g.: R.C R. C C D ( C D) Simplify concepts, e.g.: (D C) (A D) A C D R.... C... C... Lazily unfold concepts in tableaux algorithm Use names/pointers to refer to complex concepts Only add structure as required by progress of algorithm Detect clashes between lexically equivalent concepts Reasoning with Expressive DLs p.23/39

Absorption I Reasoning w.r.t. set of GCI axioms can be very costly GCI C D adds D C to every node label Expansion of disjunctions leads to search With 10 axioms and 10 nodes search space already 2 100 GALEN (medical terminology) KB contains thousands of axioms Reasoning w.r.t. primitive definition axioms is relatively efficient For CN D, add D only to node labels containing CN For CN D, add D only to node labels containing CN Can expand definitions lazily Only add definitions after other local (propositional) expansion Only add definitions one step at a time Reasoning with Expressive DLs p.24/39

Absorption II Transform GCIs into primitive definitions, e.g. CN C D CN D C CN C D CN D C Absorb into existing primitive definitions, e.g. CN A, CN D C CN A (D C) CN A, CN D C CN A (D C) Use lazy expansion technique with primitive definitions Disjunctions only added to relevant node labels Performance improvements often too large to measure At least four orders of magnitude with GALEN KB Reasoning with Expressive DLs p.25/39

Algorithmic Optimisations Useful techniques include Avoiding redundancy in search branches Davis-Putnam style semantic branching search Syntactic branching with no-good list Dependency directed backtracking Backjumping Dynamic backtracking Caching Cache partial models Cache satisfiability status (of labels) Heuristic ordering of propositional and modal expansion Min/maximise constrainedness (e.g., MOMS) Maximise backtracking (e.g., oldest first) Reasoning with Expressive DLs p.26/39

Dependency Directed Backtracking Allows rapid recovery from bad branching choices Most commonly used technique is backjumping Tag concepts introduced at branch points (e.g., when expanding disjunctions) Expansion rules combine and propagate tags On discovering a clash, identify most recently introduced concepts involved Jump back to relevant branch points without exploring alternative branches Effect is to prune away part of the search space Performance improvements with GALEN KB again too large to measure Reasoning with Expressive DLs p.27/39

Backjumping E.g., if R. A R.(A B) (C 1 D 1 )... (C n D n ) L(x) Backjump L(x) {C n-1 } Pruning L(x) {C 1 } L(x) { C 1, D 1 } x x x L(x) { C 2, D 2 } L(x) {C n } L(y) = {(A B), A, A, B} x y R x L(x) { C n, D n } y R L(y) = {(A B), A, A, B} clash clash...... Reasoning with Expressive DLs p.28/39

Caching Cache the satisfiability status of a node label Identical node labels often recur during expansion Avoid re-solving problems by caching satisfiability status When L(x) initialised, look in cache Use result, or add status once it has been computed Can use sub/super set caching to deal with similar labels Care required when used with blocking or inverse roles Significant performance gains with some kinds of problem Cache (partial) models of concepts Use to detect obvious non-subsumption C D if C D is satisfiable C D satisfiable if models of C and D can be merged If not, continue with standard subsumption test Can use same technique in sub-problems Reasoning with Expressive DLs p.29/39

DL applications Reasoning with Expressive DLs p.30/39

Terminological KR and Ontologies General requirement for medical terminologies Static lists/taxonomies difficult to build and maintain Need to be very large and highly interconnected Inevitably contain many errors and omissions Galen project replaced static hierarchy with DL Describe concepts (e.g., spiral fracture of left femur) Use DL classifier to build taxonomy Needed expressive DL and efficient reasoning Descriptions used transitive roles, inverses, GCIs etc. Even prototype KB was very large ( 3,000 concepts) Existing (incomplete) classifier took 24 hours to classify KB FaCT system (sound and complete) takes 60s Reasoning with Expressive DLs p.31/39

The Semantic Web Most existing Web resources only human understandable Markup (HTML) provides rendering information Textual/graphical information for human consumption Semantic Web aims at machine understandability Semantic markup will be added to web resources Markup will use Ontologies for shared understanding Requirement for DAML ontology language Should extend existing Web standards (XML, RDF, RDFS) Reasoning with Expressive DLs p.32/39

OIL and DAML+OIL Intuitive (frame) syntax plus high expressive power Well defined semantics via mapping to SHIQ DL Can use FaCT system to reason with OIL ontologies Extends existing Web standards (XML, RDF, RDFS) Effectively a DL with RDFS based syntax Can use DL reasoning with DAML+OIL E.g., OilEd ontology editor Frame based interface (e.g., Protegé, OntoEdit) Extended to capture whole of OIL/DAML+OIL languages Reasoning support from FaCT (via CORBA interface) Reasoning with Expressive DLs p.33/39

OilEd E.g., DAML+OIL medical terminology ontology Transitive roles capture transitive partonomy, causality, etc. Smoking causes.cancer plus Cancer causes.death Cancer FatalThing GCIs represent additional non-definitional knowledge Stomach-Ulcer. = Ulcer haslocation.stomach plus Stomach-Ulcer haslocation.lining-of-stomach Ulcer haslocation.stomach OrganLiningLesion Inverse roles capture e.g. cases/causedby relationship Death causedby.smoking PrematureDeath Smoking CauseOfPrematureDeath Cardinality restrictions add consistency constraints BloodPressure hasvalue.(high Low) 1hasValue plus High Low HighLowBloodPressure Reasoning with Expressive DLs p.34/39

Database Schema and Query Reasoning DLR (n-ary DL) can capture semantics of many datamodelling methodologies (e.g., EER) Satisfiability preserving mapping to SHIQ allows use of DL reasoners (e.g., FaCT, RACER) DL Abox can also capture semantics of conjunctive queries Can reason about query containment w.r.t. schema DL reasoning can be used to support, e.g. Schema design and integration Query optimisation Interoperability and federation E.g., I.COM Intelligent Conceptual Modelling tool (Enrico Franconi) Uses FaCT system to provide reasoning support for EER Reasoning with Expressive DLs p.35/39

Summary DLs are logic based KR formalisms DL systems provide efficient inference services Careful choice of logic/algorithm Highly optimised implementation Have proved effective in a range of applications Terminologies/Ontologies Databases Have been influential in development of Semantic Web Web standard ontology language will be DL based Reasoning with Expressive DLs p.36/39

Resources Slides from this talk www.cs.man.ac.uk/~horrocks/slides/leipzig-jun-01.pdf FaCT system www.cs.man.ac.uk/fact OIL www.ontoknowledge.org/oil/ DAML+OIL www.daml.org/language/ OilEd img.cs.man.ac.uk/oil I.COM www.cs.man.ac.uk/~franconi/icom/ Reasoning with Expressive DLs p.37/39

Select Bibliography F. Baader, E. Franconi, B. Hollunder, B. Nebel, and H.-J. Profitlich. An empirical analysis of optimization techniques for terminological representation systems or: Making KRIS get a move on. In B. Nebel, C. Rich, and W. Swartout, editors, Proc. of KR 92, pages 270 281. Morgan Kaufmann, 1992. F. Giunchiglia and R. Sebastiani. A SAT-based decision procedure for ALC. In Proc. of KR 96, pages 304 314. Morgan Kaufmann, 1996. V. Haarslev and R. Möller. High performance reasoning with very large knowledge bases: A practical case study. In Proc. of IJCAI 2001 (to appear). B. Hollunder and W. Nutt. Subsumption algorithms for concept languages. In Proc. of ECAI 90, pages 348 353. John Wiley & Sons Ltd., 1990. Reasoning with Expressive DLs p.38/39

Select Bibliography I. Horrocks. Optimising Tableaux Decision Procedures for Description Logics. PhD thesis, University of Manchester, 1997. I. Horrocks and P. F. Patel-Schneider. Comparing subsumption optimizations. In Proc. of DL 98, pages 90 94. CEUR, 1998. I. Horrocks and P. F. Patel-Schneider. Optimising description logic subsumption. Journal of Logic and Computation, 9(3):267 293, 1999. I. Horrocks and S. Tobies. Reasoning with axioms: Theory and practice. In Proc. of KR 00 pages 285 296. Morgan Kaufmann, 2000. E. Franconi and G. Ng. The i.com tool for intelligent conceptual modelling. In Proc. of (KRDB 00), August 2000. D. Fensel, F. van Harmelen, I. Horrocks, D. McGuinness, and P. F. Patel-Schneider. OIL: An ontology infrastructure for the semantic web. IEEE Intelligent Systems, 16(2):38 45, 2001. Reasoning with Expressive DLs p.39/39