Nonstandard Inferences in Description Logics

Similar documents
Description Logics as Ontology Languages for Semantic Webs

l A family of logic based KR formalisms l Distinguished by: l Decidable fragments of FOL l Closely related to Propositional Modal & Dynamic Logics

LTCS Report. Concept Descriptions with Set Constraints and Cardinality Constraints. Franz Baader. LTCS-Report 17-02

Augmenting Concept Languages by Transitive Closure of Roles An Alternative to Terminological Cycles

An Overview of Tableau Algorithms for Description Logics Λ

Principles of Knowledge Representation and Reasoning

Description Logics and OWL

AI Fundamentals: Knowledge Representation and Reasoning. Maria Simi

Computing least common subsumers for FLE +

Appendix 1. Description Logic Terminology

Appendix 1. Description Logic Terminology

From Tableaux to Automata for Description Logics

Week 4. COMP62342 Sean Bechhofer, Uli Sattler

! model construction

THE DESCRIPTION LOGIC HANDBOOK: Theory, implementation, and applications

Logik für Informatiker Logic for computer scientists. Ontologies: Description Logics

OWL 2 Profiles. An Introduction to Lightweight Ontology Languages. Markus Krötzsch University of Oxford. Reasoning Web 2012

SEMANTICS. Retrieval by Meaning

Querying Data through Ontologies

Ontologies and the Web Ontology Language OWL

RELATIONAL REPRESENTATION OF ALN KNOWLEDGE BASES

Knowledge representation in process engineering

Get my pizza right: Repairing missing is-a relations in ALC ontologies

On the Scalability of Description Logic Instance Retrieval

Description Logic. Eva Mráková,

Introduction to Description Logics

SQL, DLs, Datalog, and ASP: comparison

Ordering Heuristics for Description Logic Reasoning

A Knowledge Compilation Technique for ALC Tboxes

DL-Lite: Tractable Description Logics for Ontologies

Semantic reasoning for dynamic knowledge bases. Lionel Médini M2IA Knowledge Dynamics 2018

Converting Instance Checking to Subsumption: A Rethink for Object Queries over Practical Ontologies

Towards Efficient Reasoning for Description Logics with Inverse Roles

Description Logics for Conceptual Data Modeling in UML

A Scheme for Integrating Concrete Domains into Concept Languages

Description Logics. Structural Description Logics (Acknowledgement: Enrico Franconi) Object-Oriented Representations. Applications

Explaining Subsumption in ALEHF R + TBoxes

Role-depth Bounded Least Common Subsumers for EL + and ELI

University of Innsbruck

COMP4418 Knowledge Representation and Reasoning

Description Logics and Disjunctive Datalog The Story so Far

Optimised Classification for Taxonomic Knowledge Bases

Prime Implicate Normal Form for ALC Concepts

Complexity boundaries for full satisfiability of restricted UML class diagrams

(Conceptual) Clustering methods for the Semantic Web: issues and applications

Refining the Structure of Terminological Systems: Terminology = Schema + Views*

Description Logic: A Formal Foundation for Ontology Languages and Tools

Concrete Domains and Nominals United

Optimising Terminological Reasoning for Expressive Description Logics

Description Logics. F. Description Logics. This section is based on material from Ian Horrocks:

Reasoning and Query Answering in Description Logics

Rewriting Ontology-Mediated Queries. Carsten Lutz University of Bremen

Description Logics. Description Logics and Databases

Chapter 3 Complexity of Classical Planning

Inductive Concept Retrieval and Query Answering with Semantic Knowledge Bases through Kernel Methods

Description Logics Reasoning Algorithms Concluding Remarks References. DL Reasoning. Stasinos Konstantopoulos. IIT, NCSR Demokritos

Racer - An Inference Engine for the Semantic Web

A Tableau-based Federated Reasoning Algorithm for Modular Ontologies

Updating data and knowledge bases

Searching for the Holy Grail. Ian Horrocks Information Systems Group Oxford University Computing Laboratory

Survey of Temporal Knowledge Representation. (Second Exam)

The revival of structural subsumption in tableau-based description logic reasoners

Reasoning with Expressive Description Logics: Theory and Practice

A platform for distributing and reasoning with OWL-EL knowledge bases in a Peer-to-Peer environment

Exponential Speedup in U L Subsumption Checking Relative to General TBoxes for the Constructive Semantics

Implementing an ALCRP(D) ABox Reasoner Progress Report. 2 The Implementation. 1 Introduction. 2.1 Preprocessing. 2.

Practical Reasoning for Description Logics with Functional Restrictions, Inverse and Transitive Roles, and Role Hierarchies

OWL and tractability. Based on slides from Ian Horrocks and Franz Baader. Combining the strengths of UMIST and The Victoria University of Manchester

Presented By Aditya R Joshi Neha Purohit

Standardization of Ontologies

COMBINING INTEGER PROGRAMMING AND TABLEAU-BASED REASONING: A HYBRID CALCULUS FOR THE DESCRIPTION LOGIC SHQ

Evaluating a modular Abox algorithm

Research Article A Method of Extracting Ontology Module Using Concept Relations for Sharing Knowledge in Mobile Cloud Computing Environment

Scalability via Parallelization of OWL Reasoning

DAML+OIL and Description Logic Reasoning

Mastro Studio: a system for Ontology-Based Data Management

Ulrike Sattler. University of Manchester, UK. TU Dresden, Germany. Institut für Theoretische Informatik

Ontologies - Querying Data through Ontologies

On the Reduction of Dublin Core Metadata Application Profiles to Description Logics and OWL

Automatic Ontology Extension: Resolving Inconsistencies

Maurizio Lenzerini. Dipartimento di Ingegneria Informatica Automatica e Gestionale Antonio Ruberti

OWL 2 Syntax and Semantics Sebastian Rudolph

Modeling multidimensional database

Representing Product Designs Using a Description Graph Extension to OWL 2

OPTIMISING TABLEAUX DECISION PROCEDURES FOR DESCRIPTION LOGICS

Knowledge Representation

From Lexicon To Mammographic Ontology: Experiences and Lessons

On the Scalability of Description Logic Instance Retrieval

Knowledge Engineering with Semantic Web Technologies

Description Logic: Axioms and Rules

jcel: A Modular Rule-based Reasoner

Extending the SHOIQ(D) Tableaux with DL-safe Rules: First Results

Debugging Incoherent Terminologies

Efficient Recognition of Finite Satisfiability in UML Class Diagrams: Strengthening by Propagation of Disjoint Constraints

OWL 2 Profiles. An Introduction to Lightweight Ontology Languages. Маркус Крёч (Markus Krötzsch) University of Oxford. KESW Summer School 2012

Scalable Web Service Composition with Partial Matches

DL Reasoner vs. First-Order Prover

D20.1 v0.2 OWL Flight

Toward Multi-Viewpoint Reasoning with OWL Ontologies

Ontologies, OWL, OWL profiles

Transcription:

Nonstandard Inferences in Description Logics Franz Baader Theoretical Computer Science Germany Short introduction to Description Logics Application in chemical process engineering Non-standard inferences least common subsumer, most specific concept, rewriting, and matching

Description logics class of knowledge representation formalisms Descended from structured inheritance networks [Brachman 78]. Tried to overcome ambiguities in semantic networks and frames that were due to their lack of a formal semantics. Restriction to a small set of "epistemologically adequate" operators for defining concepts (classes). Importance of well-defined basic inference procedures: subsumption and instance problem. First realization: system KL-ONE [Brachman&Schmolze 85], many successor systems (Classic, Crack, FaCT, Flex, Kris, Loom, Race...). First application: natural language processing; now also other domains (configuration, medical terminology, databases, chemical process engineering, ontologies for the semantic web,...)

Description logic systems structure description language TBox defines terminology of the application domain reasoning component constructors for building complex concepts and roles out of atomic concepts and roles ABox states facts about a specific "world" derive implicitly represented knowledge (e.g., subsumption) "practical" algorithms formal, logic-based semantics knowledge base

Description language examples of typical constructors: C D, C, r. C, r. C, ( n r) A man Human Female that is married to a doctor, and married-to. Doctor has at least 5 children, ( 5 child) all of whom are professors. child. Professor TBox ABox definition of concepts Happy-man = Human... properties of individuals Happy-Man(Franz) child(franz,luisa) child(franz,julian)

Formal semantics based on interpretations as in predicate logic An interpretation I associates concepts C with sets C I and roles r with binary relations r I. The semantics of the constructors is defined through identities: (C D) I = C I D I ( n r) I = {d #{e (d,e) r I } n} ( r. C ) I = {d e: (d,e) r I e C I } ( r. C ) I = {d e: (d,e) r I e C I } I = A = C iff A I = C I I = C(a) iff a I C I I = r(a,b) iff (a I,b I ) r I model

Inferences make implicit knowledge explicit; are available as system services of DL systems Subsumption Is C a sub-concept of D? C D iff C I D I for all interpretations I. Instance Is e an instance of C w.r.t. the ABox A? A = C(e) iff e I C I for all models I of A.

Focus of DL research decidability/complexity application relevant concepts of reasoning must be definable requires restricted description some application domains language require very expressive DLs systems and complexity results efficient algorithms in practice available for various combinations for very expressive DLs? of constructors Reasoning feasible versus Expressivity sufficient

DL research historical overview Phase 1 mostly system development (KL-ONE, LOOM,...) early eighties expressive description languages, but no disjunction, negation, exist. quant. use of so-called structural subsumption algorithms (polynomial) no formal investigation of reasoning problems and properties of algorithms Phase 2 first formal investigations formal, logic-based semantics mideighties first undecidability and complexity results incompleteness of structural subsumption algorithms incompleteness as feature (Loom, Back) restrict expressive power (Classic)

Phase 3 tableau algorithms for DLs and thorough complexity analysis end eighties to mid-nineties Schmidt-Schauß and Smolka describe the first complete (tableau-based) subsumption algorithm for a non-trivial DL; ALC: propositionally closed (negation, disjunction, existential restrictions); complexity result: subsumption in ALC is PSPACE-complete. Exact worst-case complexity of satisfiability and subsumption for various DLs (DFKI, University of Rome I). Development of tableau-based algorithms for a great variety of DLs (DFKI, University of Rome I,,...). First DL systems with tableau algorithms: Kris (DFKI), Crack (IRST Trento); first optimization techniques for DL systems with tableau algorithms. Schild notices a close connection between DLs and modal logics.

ALC is a syntactic variant of multi-modal K [Schild 91] concept name A role name r propositional variable A modal parameter r C D t(c) t(d) translation C D t(c) t(d) t C t(c) r C <r>t(c) r C [r]t(c) interpretation I set of individuals dom(i) Kripke structure K = (W,R) set of worlds W interpretation of role names r I accessibility relation R r interpretation of concept names A I worlds in which A is true

Phase 4 algorithms and systems for very expressive DLs (e.g., without finite model property) late nineties Decidability results for very expressive DLs by translation into PDL (propositional dynamic logic) (Uni Roma I), strong complexity results; motivated by database applications. Intensive optimization of tableau algorithms (Uni Manchester, IRST Trento, Bell Labs, Uni Hamburg): very efficient systems for expressive DLs. Design of practical tableau algorithms for very expressive DLs (Uni Manchester, ); application to ontology reasoning for the semantic web.

Old and new inference problems Standard inference problems (like subsumption, instance) in Description Logics are well-investigated: Complexity results for a great variety of DLs. Optimized implementations for expressive DLs. Building and maintaining large knowledge bases requires support by additional nonstandard inference methods; e.g.: Bottom-up construction of knowledge bases requires least common subsumer, most specific concept, and rewriting. Search for partially specified concepts requires matching and unification.

Modelling chemical processes and plants Process Systems Engineering (Prof. Marquardt) Computer-aided modelling of chemical processes to analyze, simulate, and optimize the processes. Modelling tool ModKit that allows to build process models from standard building blocks. Library of standard building blocks, which must be extended continuously. Description of building blocks in a frame-based formalism (VeDa). Structured representation of the building blocks in a class hierarchy supports searching and browsing.

Applying standard inferences VeDa / ModKit for the automated structuring of the library. DL ALE / FaCT Class descriptions Building block descriptions Concept definitions (TBox) Individuals (ABox) subsumption instance algorithm translate transfer Computed subsumption hierarchy Computed instance relationships

Applying nonstandard inferences to support the bottom-up construction of knowledge bases new class description having the building blocks as instances translate C min minimal rewriting C least common subsumer (lcs) C a C c C b most specific concept (msc) building blocks translate a:a r r c:b s b:a ABox

Research work necessary to realize this approach Least common subsumer: existence and computation First results for Classic (AT&T) [Cohen&Hirsh94, Frazier&Pitt96]: incorrect treatment of inconsistency and attributes. First sound and complete treatment for ALN, ALE, and ALEN [B.&Küsters98, B.,Küsters&Molitor99, Küsters&Molitor01]. Most specific concept: existence, computation or approximation Existence and computation for ALN with cyclic definitions [B.&Küsters98]. Non-existence and approximation in ALE [Küsters&MolitorKI01]. Rewriting: general framework, complexity and computation Results for ALE, ALN, ALC and sub-languages [B.,Küsters&Molitor00]. Related results in database research: rewriting queries using views.

Least common subsumer of concept descriptions The concept description E is the least common subsumer (lcs) of the concept descriptions C 1,..., C n iff C 1 E,..., C n E, and subsumes all C i C 1 F,..., C n F implies E F is the least such description child. Male child. Doctor is in ALE the lcs of child. Top child. (Male Doctor) and child. (Male Student) child. (Doctor Female) In DLs with disjunction, lcs = disjunction and thus not interesting. Questions to be answered: Existence of the lcs, its size (binary, n-ary), how to compute it.

Results for the lcs in sublanguages of ALEN EL C D, r. C, Top ALE EL + r. C + atomic negation ALEN ALE + number restrictions Existence yes yes yes Size (2) polynomial exponential doubly exponential Size (n) exponential exponential doubly exponential Computation Ptime/Exptime Exptime 2-Exptime Approach Translate concept descriptions into description trees. Characterize subsumption via homomorphisms. Lcs as product of the description trees.

Most specific concept of an ABox individual The concept description E is the most specific concept (msc) of the individual b in the ABox A iff A = E(b), and A = F(b) implies E F b is an instance of E E is the least such description r b is an instance of A, r. A, r. r. A, r. r. r.a,... b : A There is no most specific ALE-concept description C such that b is an instance of C. A similar example shows that the msc need not exist in ALN.

Ways out two approaches to deal with the non-existence of the msc Allow for cyclic TBoxes with an appropriate fixpoint semantics: C = A r. C with gfp-semantics yields msc in the example. For ALN with cyclic definitions and gfp-semantics, the msc always exists and can effectively be computed [B.&Küsters98]. Use k-approximation of the msc: most specific concept of role depth at most k having b as an instance: A r. (A r. A) is the 2-approximation of the msc in the example. For ALE, the k-approximation always exists and can effectively be computed. More efficient approaches for sub-languages of ALE [Küsters&MolitorKI01].

Rewriting in DL our motivation Non-standard inference procedures for DLs, like computing the least common subsumer (lcs), matching and unification of concept descriptions, produce concept descriptions that are shown to the user for inspection, may be quite large, are unfolded, i.e., do not use concept names defined in the TBox. Inference service that automatically increases readability of concept descriptions?

Rewriting concepts using a TBox unfolded description C TBox T rewriting procedure description D T C using names defined in T minimal rewriting: D is of minimal size

Example Happy child. child. (Rich Happy) A = Happy child. child. Rich B = child. Happy C = child. (B child. Rich) rewriting procedure A child. B Happy C minimal

Results for the minimal rewriting problem in DLs [B.,Küsters&Molitor00] Complexity of the corresponding optimization problem: ALN: NP-hard, in Σ ALE: NP-hard, in PSPACE ALC: PSPACE-complete Nondeterministic algorithms computing all minimal rewritings in ALE and ALN. p 2 Heuristic rewriting algorithm for ALE behaves quite well in practice: descriptions of size 800 obtained as lcs in the process engineering application are rewritten into descriptions of size 10.

Matching in DL our motivation Design by modification: before defining a new building block from scratch, the process engineers try to locate a structurally similar one in the knowledge base, and then modify the existing block. Matching allows to look for concepts having a certain (partially specified) structure: Assume we look for concepts concerned with individuals having a son and a daughter sharing some characteristic: child. (Male X) child. (Female X) variable is a concept pattern expressing this. The substitution {X > Tall, Y > Tall} shows that this pattern matches the description ` child. (Male Tall) child. (Female Tall)

Matching in DL definition and results Let C be a concept and D be a pattern. The substitution σ is a matcher of C? D iff C σ(d). Matching in ALN [B., Küsters, Borgida, McGuinness 99]: Existence of matchers can be decided in polynomial time. Solvable problems have a unique least matcher, which can be computed in polynomial time. Matching problems are translated into formal language equations. Matching in ALE [B.&Küsters00]: Existence of matchers is an NP-complete problem. Solvable problems have finitely many minimal matchers, which can be computed in exponential time. Both the number and the size of matchers may be exponential. Approach depends on partial homomorphisms and lcs.

Conclusion Compared to the body of results for standard inferences, the research on nonstandard inferences is just at the beginning. For lcs, msc, and matching, we have a relatively clear picture for (sub-languages of) ALE and ALN. Other interesting nonstandard inferences: Unification, i.e., patterns on both sides of the equation: considerably harder than matching; Exptime-complete for FL 0 (C D, r. C). Approximation of concepts in one DL by concepts in another DL: we just started to work on this.