On practical aspects of enhancing semantic interoperability using SKOS and KOS alignment

Similar documents
Publishing Vocabularies on the Web. Guus Schreiber Antoine Isaac Vrije Universiteit Amsterdam

SKOS - Simple Knowledge Organization System

Library of Congress Controlled Vocabularies as Linked Data:

Ontologies SKOS. COMP62342 Sean Bechhofer

SKOS. COMP62342 Sean Bechhofer

Semantics. Matthew J. Graham CACR. Methods of Computational Science Caltech, 2011 May 10. matthew graham

A brief introduction to SKOS

Converting a thesaurus into an ontology: the use case of URBISOC

Agricultural bibliographic data sharing & interoperability in China

SWAD-Europe Deliverable 8.1 Core RDF Vocabularies for Thesauri

Unified Thesaurus feasibility study

Copyright 2012 Taxonomy Strategies. All rights reserved. Semantic Metadata. A Tale of Two Types of Vocabularies

Linguaggi Logiche e Tecnologie per la Gestione Semantica dei testi

New Dimensions in KOS

The Semantic Web DEFINITIONS & APPLICATIONS

Vocabulary and Semantics in the Virtual Observatory

Europeana and semantic alignment of vocabularies

Terminologies, Knowledge Organization Systems, Ontologies

Copyright 2012 Taxonomy Strategies. All rights reserved. Semantic Metadata. A Tale of Two Types of Vocabularies

A Semantic MediaWiki-Empowered Terminology Registry

Linked Data and RDF. COMP60421 Sean Bechhofer

Report from the W3C Semantic Web Best Practices Working Group

H1 Spring B. Programmers need to learn the SOAP schema so as to offer and use Web services.

Semantic Web Publishing. Dr Nicholas Gibbins 32/4037

Wondering about either OWL ontologies or SKOS vocabularies? You need both!

SWAD-Europe Deliverable 8.3: RDF Encoding of Multilingual Thesauri

Linked data for manuscripts in the Semantic Web

Europeana Data Model. Stefanie Rühle (SUB Göttingen) Slides by Valentine Charles

Semantic challenges in sharing dataset metadata and creating federated dataset catalogs

Introduction and background

Linked Data and RDF. COMP60421 Sean Bechhofer

The AGROVOC Concept Scheme - A Walkthrough

JENA: A Java API for Ontology Management

Vocabulary Alignment for archaeological Knowledge Organization Systems

Pulling Together, or

Semantic web. Tapas Kumar Mishra 11CS60R32

Chinese Agricultural Thesaurus and its application on data sharing & interoperability

How to Design & Build Semantic Applications with Linked Data #DCMI Dave Clarke CEO Synaptica

Terminology Services. Experimental Services for Controlled Vocabularies. OCLC Research April 2008

DCMI Abstract Model - DRAFT Update

The UNESCO Thesaurus

> Semantic Web Use Cases and Case Studies

Pattern-based design, part I

OSM Lecture (14:45-16:15) Takahira Yamaguchi. OSM Exercise (16:30-18:00) Susumu Tamagawa

Semantic Web In Depth: Resource Description Framework. Dr Nicholas Gibbins 32/4037

Application Services for Knowledge Organisation and System Integration

Semantic Web. Tahani Aljehani

Europeana update: aspects of the data

Migrating Thesauri to the Semantic Web Guidelines and case studies for generating RDF encodings of existing thesauri

Innovation in Thesaurus Management

Semantic Web. RDF and RDF Schema. Morteza Amini. Sharif University of Technology Spring 90-91

Semantic Web Technologies

Library of Congress BIBFRAME Pilot. NOTSL Fall Meeting October 30, 2015

Ontological Modeling: Part 2

RDF. Mario Arrigoni Neri

Semantic Web. Ontology Pattern. Gerd Gröner, Matthias Thimm. Institute for Web Science and Technologies (WeST) University of Koblenz-Landau

Using metadata schema registry as a core function to enhance usability and reusability of metadata schemas

Contents. G52IWS: The Semantic Web. The Semantic Web. Semantic web elements. Semantic Web technologies. Semantic Web Services

Business Rules in the Semantic Web, are there any or are they different?

Multi-agent and Semantic Web Systems: RDF Data Structures

PoolParty - Thesaurus Server

Semantic Technologies and CDISC Standards. Frederik Malfait, Information Architect, IMOS Consulting Scott Bahlavooni, Independent

JOHN SHEPHERDSON... AUTOMATIC KEYWORD GENERATION TECHNICAL SERVICES DIRECTOR UK DATA ARCHIVE UNIVERSITY OF ESSEX...

Linking Thesauri to the Linked Open Data Cloud for Improved Media Retrieval

0.1 Knowledge Organization Systems for Semantic Web

Mapping between Digital Identity Ontologies through SISM

H1 Spring C. A service-oriented architecture is frequently deployed in practice without a service registry

RESOURCES DESCRIPTION FRAMEWORK: RDF

netcdf-ld SKOS: demonstrating Linked Data vocabulary use within netcdf-compliant files

Oshiba Tadahiko National Diet Library Tokyo, Japan

The Semantic Web. Mansooreh Jalalyazdi

Is Linked Data the future of data integration in the enterprise?

VISUALIZATIONS AND INFORMATION WIDGETS

Converting the TheSoz to SKOS Zapilko, Benjamin; Sure, York Veröffentlichungsversion / Published Version Arbeitspapier / working paper

RDF Schema. Mario Arrigoni Neri

Linked.Art & Vocabularies: Linked Open Usable Data

Linked Open Data: a short introduction

Metadata Common Vocabulary: a journey from a glossary to an ontology of statistical metadata, and back

Linked data and its role in the semantic web. Dave Reynolds, Epimorphics

Linking Distributed Data across the Web

SEMANTIC WEB 03 RDF DATA MODEL RESOURCE DESCRIPTION FRAMEWORK IMRAN IHSAN ASSISTANT PROFESSOR, AIR UNIVERSITY, ISLAMABAD

Classification and reference vocabulary in Linked Environment Data

An Introduction to the Semantic Web. Jeff Heflin Lehigh University

Linked Open Europeana: Semantics for the Digital Humanities

BUILDING THE SEMANTIC WEB

Making BioPAX SPARQL

Metadata Standards and Applications. 6. Vocabularies: Attributes and Values

Content analysis and classification in mathematics

The Agricultural Ontology Server: A Tool for Knowledge Organisation and Integration

Semantiska webben DFS/Gbg

A representation framework for terminological ontologies

Putting ontology alignment in context: Usage scenarios, deployment and evaluation in a library case

SIERA: A Multilingual Knowledge Sharing Project

- What we actually mean by documents (the FRBR hierarchy) - What are the components of documents

Table of Contents. iii

A SKOS Core approach to implementing an M2M terminology mapping server

Recommended Practice for Software Requirements Specifications (IEEE)

SKOS Standards and Best Practises for USING Knowledge Organisation Systems ON THE Semantic Web

Ontologies and thesauri How to answer complex questions using interoperability?

Links, languages and semantics: linked data approaches in The European Library and Europeana. Valentine Charles, Nuno Freire & Antoine Isaac

Transcription:

On practical aspects of enhancing semantic interoperability using SKOS and KOS alignment Antoine ISAAC KRR lab, Vrije Universiteit Amsterdam National Library of the Netherlands ISKO UK Meeting, July 21, London

Agenda (Optional) Semantic Web refresher Representing KOSs using SKOS Main features Practical issues Demo SKOS and semantic alignment of KOSs

The Semantic Web: a web of resources Pointing at resources What? Knowledge objects everything that we may want to refer to including documents, persons How? Uniform Resource Identifiers (URIs) E.g., HTTP URLs: http://www.few.vu.nl/~aisaac/

A Web of resources Enhancing semantic interoperability using SKOS theirvoc:article http://ex.org/files/file1 myvoc:amsterdam Here: URIs (namespace and localname) Note: different locations!

Describing Semantic Web resources: RDF Pointing at resources: URIs Creating structured assertions involving resources What? Typed links between resources How? RDF (Resource Description Framework) Statements subject-predicate-object

Data in an RDF graph theirvoc:article rdf:type http://ex.org/files/file1 theirvoc:subject myvoc:amsterdam

The Semantic Web approach: A Web of (meta)data The_Netherlands Article subclassof Document type hascapital file1 type City Amsterdam defines partof paragraph3

What's the role of SKOS in this? Porting KOSs to the Semantic Web Reminder: SKOS is for publication and access, not replacement xxxx xxxx xxx xxx xxx xxxx xxxx xxxx xxx xxx xxx xxx xxxx xxxx xxx xxxx xxxx xxx

Agenda Semantic Web refresher Representing KOSs using SKOS Main features Practical issues Demo SKOS and semantic alignment of KOSs

SKOS (Simple Knowledge Organization System) SKOS offers a vocabulary to create RDF data representing KOS content Concepts and ConceptSchemes Lexical properties (preflabel, altlabel) Semantic relations (broader, related) Notes (scopenote, definition)

Conceptual resources with URIs prefix skos: <http://www.w3.org/2004/02/skos/core#> prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> prefix ex: <http://example.org/>

Labels as strings USE/UF functions, as in ISO2788 But a concept-oriented model! Concepts are first-order (RDF) entities Labels are RDF literals (simple string values) Labels are linked via the concept resource

(Multilingual) labels Multilingual functions, as in ISO5964 But a concept-oriented model, again

Semantic relations: broader, narrower and related Same function as BT/NT/RT links Similar intended meanings e.g. broader can cover partitive, generic, or class-instance relationships

Documenting concepts Enhancing semantic interoperability using SKOS

Example: thesaurus animals cats NT cats UF domestic cats RT wildcats BT animals SN used only for domestic cats domestic cats USE cats wildcats

Example: SKOS graph animals NT cats cats UF domestic cats RT wildcats BT animals SN used only for domestic cats domestic cats USE cats wildcats

Example: RDF XML serialization Enhancing semantic interoperability using SKOS animals NT cats cats UF domestic cats RT wildcats BT animals SN used only for domestic cats domestic cats USE cats wildcats <rdf:rdf> <skos:concept rdf:about="http://example.org/animals"> <skos:preflabel xml:lang="en">animals</skos:preflabel> <skos:narrower rdf:resource="http://example.org/cats"/> </skos:concept> <skos:concept rdf:about="http://example.org/cats"> <skos:preflabel xml:lang="en">cats</skos:preflabel> <skos:altlabel xml:lang="en">domestic cats</skos:altlabel> <skos:scopenote>used only for domestic cats</skos:scopenote> <skos:broader rdf:resource="http://example.org/animals"/> <skos:related rdf:resource="http://example.org/wildcats"/> </skos:concept> <skos:concept rdf:about="http://example.org/wildcats"> <skos:preflabel xml:lang="en">wildcats</skos:preflabel> </skos:concept> </rdf:rdf>

Agenda (Optional) Semantic Web refresher Representing KOSs using SKOS Main features Practical issues Demo SKOS and semantic alignment of KOSs

How does SKOS stand the test of application? How much interoperability does porting to SKOS really allow? Are there hidden caveats? Different ways to convert similar things Different interpretations of SKOS constructs? Things impossible to convert

Issue 1: where can URIs come from? Generated from pre-existing identifiers sh96011203 Using (less stable) labels is not always a good idea Logotypes (Printing)

Issue 1: "web-enabled" names? Do concept names have to refer to accessible documents? 1. (first approach) Whatever qualifies as URI, even it does not exist on the web http://example.org/animals 2. (better) Basic web-enabled name, resolves into some document http://catalogue.bnf.fr/ark:/12148/cb11931683w 3. (ideal) URI with content negotiation to serve SKOS/RDF, HTML http://lcsh.info/sh96011203#concept Cf. Best Practice Recipes for Publishing RDF Vocabularies http://www.w3.org/tr/swbp-vocab-pub/

Issue 2: different types of labels Notice: SKOS does not offer guidelines for good labels But it assumes some characteristics for the different kinds of labels, that could influence conversion (Hard) A concept has only one preflabel per language (Soft) No two concepts from a same concept scheme should have the same preflabel in a given language Cf. notion of descriptor Interop. enabler

Issue 2: classifications, notations and labels Notations - 21.51 "technique and materials" Can we use notations as SKOS preferred labels? They (are supposed to) make sense for users & be displayed They are unambiguous ex:21.51 skos:preflabel "21.51"@x-notation

Issue 2: classifications, notations and labels Captions could also be considered as preferred labels They are often displayed ex:21.51 skos:preflabel "21.51"@x-notation skos:preflabel "technique and materials"@en They can be ambiguous But the preflabel uniqueness constraint was soft! Yet experts could choose to have all captions as altlabels 21.00 "painting: general" - 21.01 "technique and materials" 21.50 "sculpture: general" - 21.51 "technique and materials" Interop. at risk!

Semantic relations: broader, narrower and related

Issue 3: semantics for semantic relations Is broader "transitive"? If yes, we can miss the original information ex:cats skos:broader skos:broader? ex:mammals skos:broader ex:animals

Issue 3: semantics for semantic relations broader is not transitive in general It has a super-property broadertransitive with semantics of has ancestors 1: every broader statement ("parent") logically implies a broadertransitive one ("ancestor") 2: broadertransitive is transitive!

Issue 3: semantics for semantic relations broader and narrower are inverse of each other skos:narrower X Y related is symmetric! skos:related X Y Assumption: there are not many exceptions in KOSs A non-symmetric specialization of related can be coined if needed

Semantics of SKOS SKOS semantics make assumptions that distinguish what could be regularly inferred from a statement broader and narrower are inverse of each other from what would be less agreed upon broader is transitive This answers some questions about what should be explicit or not in a SKOS conversion, and what can (shall) be inferred from it Important for specifying application, e.g. services Crucial for interoperability! Beware: this assumes reasoning, or a simulation of it! Interop. enabler Interop. at risk!

Example of custom extension for SKOS Creating a non symmetric specialization of related? my:nonsymmetricrelated rdfs:subpropertyof skos:related. Assertions of my:nonsymmetricproperty do not imply inference of reciprocal statements If RDFS semantics are applied (e.g. by a reasoner) there is inference of standard SKOS skos:related statements my:nonsymmetricrelated X Y Interop. enabler

Other KOS features which could harm interoperability Very difficult to represent in SKOS Synthesis of new subjects Using subdivisions: Brass bands--sponsorship Links to compound non-preferred terms Cf. Stella/Leonard/Nicholas Can be represented, but not really standard Qualifiers in labels: Technique (painting) Standard, but may not be used Groupings by Collection, cf. Doug/Ceri Cf. next presentations!

Is that damn thing useful? At least it's there! There is a proposed standard to represent KOS on the Semantic Web It allows to publish KOSs LCSH, Agrovoc It allows to develop applications with re-usable & interoperable components Cf. Doug/Ceri and Bernard

Benefit of SKOS Homogeneous SW representation of vocabularies (and metadata) amsterdam mary subject page1 creator john book2 creator haspart picture3 depicts Netherlands

Is that damn thing useful? For most aspects of a KOS, conversion is relatively smooth It makes some commitments more explicit Nothing compared to representation as a formal ontology Believe me! A basis for (your!) experience sharing Comparing conversion strategies Realizing the interoperability issues there Devising agreed extensions

Demo! KB Illuminated Manuscripts BNF Mandragore Manuscripts http://galjas.cs.vu.nl:33333/mandra-sv-icemandranewnone, amphibians

Demo (1) Enhancing semantic interoperability using SKOS Subject vocabulary, collection 1 Subjects

Demo (2) Hierarchical path from root to selected subject Possible specialisation for selected subject

Demo (3) Semantic alignment of subjects activated Document from Collection 2

Demo (4) Subject from voc2 aligned to voc1:amphibians Back

Demo: noticeable facts KOS-independent interface The French vocabulary has just replaced an English vocabulary that was used in a previous pilot Makes use of standard SKOS constructs broader, preflabel Can exploit standard alignment relations Semantic equivalence can be computed thanks to SKOS' seamless representation of multilingual labels It s actually a case of French-to-French alignment!

Agenda Semantic Web refresher Representing KOSs using SKOS Main features Practical issues Demo SKOS and semantic alignment of KOSs (time?) No time?

Aligning vocabularies xxxx xxxx xxx xxx xxx xxxx xxxx xxxx xxx xxx xxx xxx xxxx xxxx xxx xxxx xxxx xxx

Vocabulary Alignment Aim: find correspondences between different concepts with comparable meanings Doing it manually or (at least semi-)automatically Cf. ontology alignment in Semantic Web research Lexical Structural Statistical Background knowledge still is a difficult research problem!

Mapping concepts with SKOS ex1:animal ex1:platypus skos:broadmatch

SKOS contribution to mapping A common way to represent important info for KOS use cases Focusing on types of mapping relationships Note: can be used in combination with more complex formats developed by the ontology alignment community E.g. to give mappings a confidence measure Again with (debatable?) semantics broadmatch is a sub-property of broader Allows to seamlessly use mappings as basic KOS relationships Still keeps the difference at the statement level ex1:platypus skos:broadmatch skos:broader ex2:egglaying Animals

Conclusion Representing KOSs using SKOS Main features Practical issues Demo SKOS and semantic alignment of KOSs Despite some issues, SKOS provides a crucial contribution to enhance interoperability of KOSs

Thank you! SKOS site: http://www.w3.org/2004/02/skos Reminder: Comments highly welcome on SKOS Reference: http://www.w3.org/tr/skos-reference SKOS Primer: http://www.w3.org/tr/skos-primer