Mapping between Digital Identity Ontologies through SISM

Similar documents
Sindice Widgets: Lightweight embedding of Semantic Web capabilities into existing user applications.

Porting Social Media Contributions with SIOC

W3C Workshop on the Future of Social Networking, January 2009, Barcelona

Social Networks and Data Portability using Semantic Web technologies

From Online Community Data to RDF

E-Learning and Microformats: A Learning Object Harvesting Model and a Sample Application

Weaving SIOC into the Web of Linked Data

Semantics. Matthew J. Graham CACR. Methods of Computational Science Caltech, 2011 May 10. matthew graham

Extended Identity for Social Networks

The Semantic Web Revisited. Nigel Shadbolt Tim Berners-Lee Wendy Hall

Linguaggi Logiche e Tecnologie per la Gestione Semantica dei testi

Linked Data: What Now? Maine Library Association 2017

Report from the W3C Semantic Web Best Practices Working Group

Combining RDF Vocabularies for Expert Finding

Towards the Semantic Desktop. Dr. Øyvind Hanssen University Library of Tromsø

Opus: University of Bath Online Publication Store

Publishing Vocabularies on the Web. Guus Schreiber Antoine Isaac Vrije Universiteit Amsterdam

O.Curé [1 ] Mashup, Microformats, RDFa and GRDDL

Provenance-aware Faceted Search in Drupal

Contents. G52IWS: The Semantic Web. The Semantic Web. Semantic web elements. Semantic Web technologies. Semantic Web Services

Semantiska webben DFS/Gbg

Converting a thesaurus into an ontology: the use case of URBISOC

COMP6217 Social Networking Technologies Web evolution and the Social Semantic Web. Dr Thanassis Tiropanis

Semantic Web Publishing. Dr Nicholas Gibbins 32/4037

Ontology-based Architecture Documentation Approach

Bringing the Wiki-Way to the Semantic Web with Rhizome

Metadata Common Vocabulary: a journey from a glossary to an ontology of statistical metadata, and back

GenTax: A Generic Methodology for Deriving OWL and RDF-S Ontologies from Hierarchical Classifications, Thesauri, and Inconsistent Taxonomies

WebGUI & the Semantic Web. William McKee WebGUI Users Conference 2009

Table of Contents. iii

Semantic Web Test

int.ere.st: Building a Tag Sharing Service with the SCOT Ontology

A Lightweight Ontology for Rating Assessments

Taking a view on bio-ontologies. Simon Jupp Functional Genomics Production Team ICBO, 2012 Graz, Austria

Extracting knowledge from Ontology using Jena for Semantic Web

Google indexed 3,3 billion of pages. Google s index contains 8,1 billion of websites

DBpedia-An Advancement Towards Content Extraction From Wikipedia

Artificial Intelligence Techniques. Internet Applications 2

From the Web to the Semantic Web: RDF and RDF Schema

Springer Science+ Business, LLC

Interacting with Linked Data Part I: General Introduction

Two Layer Mapping from Database to RDF

University of Bath. Publication date: Document Version Publisher's PDF, also known as Version of record. Link to publication

SKOS - Simple Knowledge Organization System

Racer: An OWL Reasoning Agent for the Semantic Web

Are you afraid of Semantic Web?

An Architecture to Discover and Query Decentralized RDF Data

SemClip - Overcoming the Semantic Gap Between Desktop Applications

Library of Congress Controlled Vocabularies as Linked Data:

Standardization of Ontologies

A Privacy Preference Ontology (PPO) for Linked Data

Semantic Web Programming

Semantic Cloud Generation based on Linked Data for Efficient Semantic Annotation

An Evaluation of Geo-Ontology Representation Languages for Supporting Web Retrieval of Geographical Information

Data formats for exchanging classifications UNSD

> Semantic Web Use Cases and Case Studies

Vocabulary and Semantics in the Virtual Observatory

WSDL versioning. Facts Basic scenario. WSDL -Web Services Description Language SAWSDL -Semantic Annotations for WSDL and XML Schema

The Semantic Web & Ontologies

Introducing Linked Data

D WSMO Data Grounding Component

The Point of View Axis: Varying the Levels of Explanation Within a Generic RDF Data Browsing Environment

What is new in W3C land? 2009 Semantic Technology Conference San Jose, California, USA June 15, Ivan Herman, W3C

Efficient Querying of Web Services Using Ontologies

Linked data and its role in the semantic web. Dave Reynolds, Epimorphics

Ontology Exemplification for aspocms in the Semantic Web

Towards an Interlinked Semantic Wiki Farm

SPARQL เอกสารหล ก ใน มคอ.3

CHAPTER 1 INTRODUCTION

4 The StdTrip Process

Why You Should Care About Linked Data and Open Data Linked Open Data (LOD) in Libraries

Library of Congress BIBFRAME Pilot. NOTSL Fall Meeting October 30, 2015

Linked Data and RDF. COMP60421 Sean Bechhofer

Semantic Integration with Apache Jena and Apache Stanbol

case study The Asset Description Metadata Schema (ADMS) A common vocabulary to publish semantic interoperability assets on the Web July 2011

Provenance-Aware Faceted Search in Drupal

Prof. Dr. Christian Bizer

PAD: A Semantic Social Network

INF3580/4580 Semantic Technologies Spring 2015

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper.

Semantic Web Tools. Federico Chesani 18 Febbraio 2010

H1 Spring B. Programmers need to learn the SOAP schema so as to offer and use Web services.

SKOS. COMP62342 Sean Bechhofer

a paradigm for the Introduction to Semantic Web Semantic Web Angelica Lo Duca IIT-CNR Linked Open Data:

Knowledge Representation in Social Context. CS227 Spring 2011

A Model for Semantic Annotation of Environmental Resources: The TaToo Semantic Framework

Chapter 2 SEMANTIC WEB. 2.1 Introduction

Temporality in Semantic Web

Generation of Semantic Clouds Based on Linked Data for Efficient Multimedia Semantic Annotation

Linked Open Data: a short introduction

SWAD-Europe Deliverable 8.1 Core RDF Vocabularies for Thesauri

RDF /RDF-S Providing Framework Support to OWL Ontologies

H1 Spring C. A service-oriented architecture is frequently deployed in practice without a service registry

4 th Linked Data on the Web Workshop (LDOW 2011)

Multi-agent Semantic Web Systems: RDF Models

Deep integration of Python with Semantic Web technologies

OWL a glimpse. OWL a glimpse (2) requirements for ontology languages. requirements for ontology languages

XML related Data Exchange from the Test Machine into the Web-enabled Alloys-DB. Nagy M, Over HH, Smith A

The NEPOMUK project. Dr. Ansgar Bernardi DFKI GmbH Kaiserslautern, Germany

An Introduction to the Semantic Web. Jeff Heflin Lehigh University

Transcription:

Mapping between Digital Identity Ontologies through SISM Matthew Rowe The OAK Group, Department of Computer Science, University of Sheffield, Regent Court, 211 Portobello Street, Sheffield S1 4DP, UK m.rowe@dcs.shef.ac.uk Abstract. Various ontologies are available defining the semantics of digital identity information. Due to the rise in use of lowercase semantics, such ontologies are now used to add metadata to digital identity information within web pages. However concepts exist in these ontologies which are related and must be mapped together in order to enhance machinereadability of identity information on the web. This paper presents the Social identity Schema Mapping (SISM) vocabulary which contains a set of mappings between related concepts in distinct digital identity ontologies using OWL and SKOS mapping constructs. Key words: Semantic Web, Social Web, SKOS, OWL, FOAF, SIOC, PIMO, NCO, Microformats 1 Introduction The semantic web provides a web of machine-readable data. Ontologies form a vital component of the semantic web by providing conceptualisations of domains of knowledge which can then be used to provide a common understanding of some domain. A basic ontology contains a vocabulary of concepts and definitions of the relationships between those concepts. An agent reading a concept from an ontology can look up the concept and discover its properties and characteristics, therefore interpreting how it fits into that particular domain. Due to the great number of ontologies it is common for related concepts to be defined in separate ontologies, these concepts must be identified and mapped together. Web technologies such as Microformats, erdf and RDFa have allowed web developers to encode lowercase semantics within XHTML pages. It is commonplace for different vocabularies to be used within such pages, particularly when describing identity information. It must be declaratively specified to the agent how an unknown vocabulary is related to the vocabulary set that it knows and understands. The need to specify mappings between digital identity ontologies is currently being investigated by the W3C Social Web Incubator Group 1. Our contribution in this paper is the presentation of a mapping vocabulary between ontologies used to define digital identity information called the Social Identity 1 http://www.w3.org/2005/incubator/socialweb/

2 Matthew Rowe Schema Mapping (SISM). Each mapping provides a relation between two concepts in disparate ontologies where the relation defines the semantics of the mapping which can cover equivalence, associative and hierarchical relations. This paper is structured as follows: Section 2 defines the problems that have motivated the creation of mappings between digital identity ontologies. Section 3 presents the Social Identity Schema Mapping (SISM), detailing the semantics of the mappings, explaining the design choices and providing example mappings. Section 4 discusses how SISM is currently being used in applications. Section 5 presents related work and section 6 discusses the conclusions we have drawn from SISM and plans for future work. 2 Problem Definition 2.1 Monitoring Personal Information Online Metadata models can now be embedded within web pages using lowercase semantics: The Resource Description Framework in Attributes (RDFa) [8] and Microformats embeds ontology concepts and a set vocabulary of terms within the attributes of XHTML elements respectively. These metadata models can be gleaned from web pages through the use of Gleaning Resource Descriptions from Dialects of Language (GRDDL) [2] where transformations specified within a web page s header generate an RDF model from the page. The transformation specifies the ontologies to be used within the gleaned model, which may be unknown ontologies. This is a common problem when monitoring personal information on the web using automated means: Web pages are parsed in order to glean metadata models containing identity information, if the models use unknown ontologies then automated approaches are unable to interpret such models. Such approaches must be explicitly informed of the relationship between the parsed concepts and a known vocabulary of concepts. 2.2 Data Portability The majority of web users now have profiles on several distributed Social Web platforms. Such platforms open up their data through an API allowing it to be reused and combined with data from other platforms. In most cases the response returned by the API is XML according to an XML schema used by the platform and due to each platform using a distinct schema, reusing data automatically becomes limited. If we consider that each Social Web platform that a given web user is a member of contains a distinct facet of his/her digital identity, then compiling these facets together would create a complete profile of the web user. This profile could then be reused by the web user when accessing additional Social Web platforms, allowing personal information access to be controlled. 3 Social Identity Schema Mapping (SISM) SISM solves the problem of information heterogeneity by providing mappings between different identity ontologies. An agent is able to consult the mappings

Mapping between Digital Identity Ontologies through SISM 3 and derive a relation between a parsed concept and a known concept where the relation describes the semantics of the mapping in a machine-readable format. SISM is available on the web 2 as RDF containing mappings defined as triples containing a source concept as the subject and the target concept as the object, the predicate contains the semantics of the mapping. 3.1 Mapping Identity Ontologies SISM contains mappings between five ontologies which contain concepts used to define identity information: The Friend of a Friend ontology 3, the ontology for VCards 4, the XFN ontology 5, and the Nepomuk ontologies; Personal Information Model Ontology 6, and the Nepomuk Contact Ontology 7. Mapping constructs from the Web Ontology Language (OWL) [7] and the Simple Knowledge Organisation System (SKOS) [6] are used within SISM. OWL is a well established and widely used ontology language, and SKOS is also a widely used language for describing knowledge organisation systems such as thesauri. The combination of both OWL and SKOS constructs cover the range of mappings needed in order to map concepts from the available identity ontologies. OWL semantics employ strong bindings of equivalence and subsumption. If we want to say that two concepts from distinct ontologies are similar or related in some way but are not equivalent then we cannot express this using OWL constructs. Conversely, SKOS semantics employ relaxed bindings of a relation. If we want to denote a mapping of equivalence between concepts then OWL constructs are better suited given that SKOS constructs do not cover such equivalence. We will now discuss the design of SISM and explain which constructs were used and how they were applied. 3.2 Collections of Mappings Figure 1 shows mappings between the foaf:person and vcard:vcard classes where each class is defined as an instance of skos:concept. According to the SKOS primer [6] such concept definitions encompass all OWL class and property types (i.e. owl:class, owl:objectproperty and owl:datatypeproperty). SISM contains a concept scheme (instance of skos:conceptscheme) defined by the hash URI #sism. Each mapped concept is related to this concept scheme through the relation. This allows an agent to query the semantic web asking for all the schemes that a given concept belongs to. If we imagine that thousands of mapping vocabularies are published, each containing a SKOS concept scheme, then an agent is able to see the differing use that a given concept has been applied 2 http://purl.oclc.org/net/sism/0.1 3 http://xmlns.com/foaf/spec/ 4 http://www.w3.org/2006/vcard/ns 5 http://vocab.sindice.com/xfn 6 http://www.semanticdesktop.org/ontologies/2007/11/01/pimo/# 7 http://www.semanticdesktop.org/ontologies/2007/03/22/nco/

4 Matthew Rowe to and enrich its vocabulary of interpretable concepts. Mapped concepts are grouped into collections to organise them into logical sets. For instance, a user or agent could ask SISM for all the mappings which describe the name or the email address of a person. Collections are expressed as instances of skos:collection. As figure 1 shows, the collection of person concepts is identified as a resource using the hash URI #person. Concepts are placed within a collection using the relation to denote membership. #sism rdf:type skos:conceptscheme vcard:vcard pimo:person owl:equivalentclass foaf:person nco:personcontact #person rdf:type skos:collection Fig. 1. Collection of mapped Person concepts Figure 1 shows mappings between classes in different identity ontologies. We define the foaf:person and vcard:vcard concepts as being related in some way but not equivalent: vcard:vcard defines an information resource containing contact information whereas foaf:person defines an instance of a person. We therefore regard each concept to be related in some way, and express this using the relation. The relation between foaf:person and pimo:person is more straightforward as each ontology defines a resource as a person. Therefore we used the strict expressivity of OWL to relate the two concepts as being equivalent using the owl:equivalentclass relation. Hierarchical relations between concepts in SISM are dealt with through the use of the SKOS relationship definitions skos:broader and skos:narrower. SISM contains a collection of mappings between URL concepts, the foaf:homepage property is defined as having a broader concept defined by the vcard:url property. Therefore using skos:narrower, the inverse of skos:broader, vcard:url can also be defined as having a narrower concept defined by the foaf:homepage property. We chose SKOS constructs to express generalisation and specialisa-

Mapping between Digital Identity Ontologies through SISM 5 tion due to their relaxed semantics, OWL constructs also allow the definition of hierarchical relations but impose strict restrictions on subsumption. Equivalent property concepts in distinct schemas are defined as equivalent using the owl:equivalentproperty relation. 4 Applications 4.1 Transforming Metadata Models To date the main application of SISM has been the transformation of metadata models. As we explained in section 2, one of the main problems that motivated the creation of SISM was the increase of lowercase semantics within XHTML markup and the disparate ontologies used. We have successfully applied SISM to normalise metadata models to our desired identity ontologies by generating inference rules which when applied to a given RDF model transforms the model. Transformed models can then be integrated and the internal data compared more easily given the same ontological concepts. This approach has been successfully deployed in an application that automatically monitors personal information on the web [4]. 4.2 Interlinking Distributed Identity Fragments The second application of SISM has been to map XML schemas used by Social Web sites to concepts within SISM. Through this additional mapping metadata models, defined as XML, returned from querying Social Web platforms are lifted to RDF. By lifting these responses to RDF representations we have been able to integrate information from such platforms [5] thereby investigating how digital identity fragments distributed across the Social Web can be combined into a single complete profile which the user has control over. 5 Related Work The UMBEL project 8 has investigated and produced a lightweight ontology to provide a mapping layer between RDF models published on the web. This works by linking similar concepts in distinct ontologies to an upper concept, therefore providing mappings in an upper level. SISM differs by using collections for meditation rather then concepts. Work by [9] presents an approach to data portability across Social Web platforms by defining identity information using FOAF. However, such platforms will not consume RDF using FOAF, instead exported data must be lifted to RDF and then lowered again into the required form. Making social data portable and therefore reusable across Social Web platforms has been discussed in [1] through the use of SIOC 9. Social data such as shared content and discussions can be 8 http://www.umbel.org/ 9 http://www.sioc-project.org

6 Matthew Rowe described using SIOC and when the web user exports his content from one site to another it is machine-readable and therefore reusable. This strategy has been deployed as a fully functional WordPress plugin 10. 6 Conclusions and Future Work In this paper we have presented the Social Identity Schema Mapping (SISM), a currently deployed and very much in-use vocabulary of mappings between disparate digital identity ontologies. At the time of writing this paper SISM was the only mapping vocabulary for digital identity ontologies available on the web. The need for SISM is highlighted in the W3C Social Web Incubator Group s current work investigating the portability of social data and the overlap in conceptual elements available to define the semantics of such data. SISM is based on both SKOS and OWL constructs which capture the possible semantics of relations between concepts and is currently supporting two areas of work: transforming metadata models and interlinking identity fragments distributed across the Social Web. Future work will include the addition of more ontologies into SISM to cover the maximum range of digital identity ontologies. We hope that SISM becomes the basis for interpreting relations between digital identity ontologies, therefore we are currently implementing a RESTful service which returns an RDF model from a web page according to required ontologies. References 1. Bojars, U., Breslin, J.G., Peristeras, V., Tummarello, G,. and Decker, S.: Interlinking the Social Web with Semantics. Intelligent Systems, IEEE. Vol. 23, no. 3, pp. 29-40. (2008) 2. Connolly, D.: Gleaning Resource Descriptions from Dialects of Languages (GRDDL). W3C Recommendation. http://www.w3.org/tr/grddl/ (2007) 3. Khare, R.: Microformats: The next (small) thing on the Semantic Web?Internet Computing, IEEE. Vol. 10, pp 68-75. (2006) 4. Rowe, M.: Applying Semantic Social Graphs to Disambiguate Identity References. In: Proc. 6th European Semantic Web Conference. Heraklion, Crete. LNCS, vol. 5554, pp 461-475 (2009) 5. Rowe, M.: Interlinking Distributed Social Graphs. In: Proc. Linked Data on the Web Workshop, WWW09. Madrid, Spain. (2009) 6. Isaac, A. and Summers, E.: SKOS Simple Knowledge Organization System Primer. W3C Recommendation. http://www.w3.org/tr/skos-primer/ (2009) 7. Smith, M., Welty, C., and McGuinness, D.L.: OWL Web Ontology Guide. W3C Recommendation. http://www.w3.org/tr/owl-guide/ (2004) 8. W3C Working Group.: RDFa Primer: Bridging the Human and Data Webs. (2008) http://www.w3.org/tr/xhtml-rdfa-primer/, 9. Yeung, C., Liccardi, I., Lu, K., Seneviratne, O., and Berners-Lee, T.: Decentralization: The Future of Online Social Networking. W3C Workshop on the Future of Social Networking Position Papers. (2009) 10 http://wiki.sioc-project.org/w/sioc Import Plugin