The Personal Knowledge Workbench of the NEPOMUK Semantic Desktop

Similar documents
The NEPOMUK project. Dr. Ansgar Bernardi DFKI GmbH Kaiserslautern, Germany

Using idocument for Document Categorization in Nepomuk Social Semantic Desktop

The NEPOMUK project. Dr. Ansgar Bernardi DFKI GmbH Kaiserslautern, Germany

Semantic Desktop 2.0: The Gnowsis Experience

The NEPOMUK Project - On the way to the Social Semantic Desktop

Mymory: Enhancing a Semantic Wiki with Context Annotations

ConTag: A Semantic Tag Recommendation System

The NEPOMUK Project - On the Way to the Social Semantic Desktop

FedX: A Federation Layer for Distributed Query Processing on Linked Open Data

Payola: Collaborative Linked Data Analysis and Visualization Framework

An Annotation Tool for Semantic Documents

Porting Social Media Contributions with SIOC

Towards the Semantic Desktop. Dr. Øyvind Hanssen University Library of Tromsø

Linking Semantic Desktop Data to the Web of Data

GoNTogle: A Tool for Semantic Annotation and Search

Semantic Document Architecture for Desktop Data Integration and Management

Browsing the Semantic Web

Domain-specific Knowledge Management in a Semantic Desktop

Advanced Management of Research Publications based on the Lightroom Paradigm

SemSearch 2008, CEUR Workshop Proceedings, ISSN , online at CEUR-WS.org/Vol-334/ QuiKey a Demo. Heiko Haller

Utilizing, creating and publishing Linked Open Data with the Thesaurus Management Tool PoolParty

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper.

DBpedia-An Advancement Towards Content Extraction From Wikipedia

Towards a Semantic Wiki Experience Desktop Integration and Interactivity in WikSAR

Support for Reusable Explorations of Linked Data in the Semantic Web

Fusing Corporate Thesaurus Management with Linked Data using PoolParty

EFFICIENT INTEGRATION OF SEMANTIC TECHNOLOGIES FOR PROFESSIONAL IMAGE ANNOTATION AND SEARCH

Combining Fact and Document Retrieval with Spreading Activation for Semantic Desktop Search

Converging Web and Desktop Data with Konduit

LODatio: A Schema-Based Retrieval System forlinkedopendataatweb-scale

Gnowsis Adapter Framework: Treating Structured Data Sources as Virtual RDF Graphs

SemSearch: Refining Semantic Search

LinDA: A Service Infrastructure for Linked Data Analysis and Provision of Data Statistics

SWSE: Objects before documents!

Personalization in the EPOS project

Using Attention and Context Information for Annotations in a Semantic Wiki

RaDON Repair and Diagnosis in Ontology Networks

Visualizing semantic table annotations with TableMiner+

Semantic Exploitation of Engineering Models: An Application to Oilfield Models

A Context Model for Personal Knowledge Management

Dartgrid: a Semantic Web Toolkit for Integrating Heterogeneous Relational Databases

TagFS Tag Semantics for Hierarchical File Systems

PoolParty. Thesaurus Management Semantic Search Linked Data. ISKO UK, London September 14, Andreas Blumauer

Personalized Information Access in a Wiki Using Structured Tagging

Ontology Extraction from Heterogeneous Documents

Evolva: A Comprehensive Approach to Ontology Evolution

CHIP Demonstrator: Semantics-Driven Recommendations and Museum Tour Generation

Ylvi - Multimedia-izing the Semantic Wiki

Authoring and Maintaining of Educational Applications on the Web

Theme Identification in RDF Graphs

SemClip - Overcoming the Semantic Gap Between Desktop Applications

Eclipse as a Web 2.0 Application Position Paper

Integrated Knowledge Workbench

<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany

HyperSD: a Semantic Desktop as a Semantic Web Application

Mobile Query Interfaces

Utilizing Open Data for interactive knowledge transfer

SemaPlorer Interactive Semantic Exploration of Data and Media based on a Federated Cloud Infrastructure

Semantic-Aware Mashups for Personal Resources in SemanticLIFE and SocialLIFE

Federated Search Engine for Open Educational Linked Data

Web Portal : Complete ontology and portal

Crossing the Archival Borders

An Archiving System for Managing Evolution in the Data Web

CONTEXT-SENSITIVE VISUAL RESOURCE BROWSER

Oleksandr Kuzomin, Bohdan Tkachenko

Generation of Semantic Clouds Based on Linked Data for Efficient Multimedia Semantic Annotation

Durchblick - A Conference Assistance System for Augmented Reality Devices

ADAPTIVE RDF GRAPH REPLICATION FOR MOBILE SEMANTIC WEB APPLICATIONS *

A Tagging Approach to Ontology Mapping

Job Aid. Remote Access BAIRS Printing and Saving a Report. Table of Contents

Extracting knowledge from Ontology using Jena for Semantic Web

ALOE - A Socially Aware Learning Resource and Metadata Hub

Club Site Editing Guide

Development of an Ontology-Based Portal for Digital Archive Services

An Entity Name Systems (ENS) for the [Semantic] Web

User Configurable Semantic Natural Language Processing

Semantic Web. Lecture XIII Tools Dieter Fensel and Katharina Siorpaes. Copyright 2008 STI INNSBRUCK

A Tool for Storing OWL Using Database Technology

Getting the Most from Eclipse

From Handwriting Recognition to Ontologie-Based Information Extraction of Handwritten Notes

A B2B Search Engine. Abstract. Motivation. Challenges. Technical Report

Making Ontology Documentation with LODE

Cataloguing GI Functions provided by Non Web Services Software Resources Within IGN

A Performance and Scalability Metric for Virtual RDF Graphs

P2P Knowledge Management: an Investigation of the Technical Architecture and Main Processes

SALT - Semantically Annotated L A TEX for Scientific Publications

SEMANTIC WEB POWERED PORTAL INFRASTRUCTURE

Managing Learning Objects in Large Scale Courseware Authoring Studio 1

A Developer s Guide to the Semantic Web

A Unified Environment for Accessing a Suite of Accessibility Evaluation Facilities

Mapping between Digital Identity Ontologies through SISM

SERIMI Results for OAEI 2011

Medici for Digital Cultural Heritage Libraries. George Tsouloupas, PhD The LinkSCEEM Project

WEB PAGE RE-RANKING TECHNIQUE IN SEARCH ENGINE

Semantically Rich Recommendations in Social Networks for Sharing, Exchanging and Ranking Semantic Context

The role of vocabularies for estimating carbon footprint for food recipies using Linked Open Data

Developing Web-Based Applications Using Model Driven Architecture and Domain Specific Languages

Collaborative Semantic Tables

Bibster A Semantics-Based Bibliographic Peer-to-Peer System

An RDF NetAPI. Andy Seaborne. Hewlett-Packard Laboratories, Bristol

Transcription:

The Personal Knowledge Workbench of the NEPOMUK Semantic Desktop Gunnar Aastrand Grimnes, Leo Sauermann, and Ansgar Bernardi DFKI GmbH, Kaiserslautern, Germany gunnar.grimnes@dfki.de, leo.sauermann@dfki.de, ansgar.bernardi@dfki.de Abstract. In this paper we describe some of the personal information management features found in the PSEW prototype of the NEPO- MUK Semantic Desktop project. Focussing on importing and adding new knowledge to the system as well as navigating, browsing, and searching the existing knowledge we show how a user can use NEPOMUK to integrate her information across application boundaries and create and discover new structures in her information space. 1 The NEPOMUK Semantic Desktop The paper presents parts of the outcome of the now completed NEPOMUK project 1. NEPOMUK aimed to create a specification for a Social Semantic Desktop and a prototype implementation of this specification. This paper will only describe the details around the prototype implementation, for information on the Semantic Desktop specifications such as the ontologies, the architecture blueprint and service descriptions we refer the reader to the project deliverables available from the project website as well as [1,2]. The NEPOMUK project resulted in two parallele implementations of the Semantic Desktop, one written in C++ which is tightly integrated with the KDE4 desktop environment, and a cross platform version written in Java. This paper deals only with the Java version. Furthermore, we focus only the Personal Information Management (PIM) components of the NEPOMUK Semantic Desktop, that is, only the components used on a single desktop machine. NEPOMUK also includes social features allowing various kinds of distributed search and information sharing, but they will not be covered here. The NEPOMUK approach to personal information management aims to overcome the limitation of having a separate applications for each type of data, such as email, photo-manager, publication-managers, etc. Instead all the information is integrated into one information-space and the user is free to relate and annotate information elements across application boundaries. The central hub for working with this information-space is called the NEPOMUK Personal Semantic Workbench (PSEW) and a typical view is show in Figure 1. Here we see a range of different views on the user s data, and a person concept, Claudia, 1 http://nepomuk.semanticdesktop.org L. Aroyo et al. (Eds.): ESWC 2009, LNCS 5554, pp. 836 840, 2009. c Springer-Verlag Berlin Heidelberg 2009

The Personal Knowledge Workbench of the NEPOMUK Semantic Desktop 837 being editted. The central idea for personal information management in NEPO- MUK is semantic annotation, ranging from simple tagging to more sophisticated ontology based use of typed relations, class hierarchies, etc. Although semantically rich relations are possible in the framework, we have found the general hastag relation to be at a level of complexity the users can easily understand and work with, and is also enough for most applications [3]. In NEPOMUK we go beyond the simple text tags used by web2.0 sites such as del.icio.us or Flickr, and each tag is a PIMO concept; that is, tags belong to one of the classes in the Personal Information Modelling Ontology (PIMO). PIMO is the NEPOMUK way of letting the user express her world-model on the Semantic Desktop by extending and sub-classing the starting PIMO which provides general classes such as Document, Person, Company, etc. Users are free to get as involved as they wish, if they are happy with a flat list of tags, all tags can be left as the default pimo:topic, or they can create a more elaborate structuring scheme. Automated services that will automatically assign the correct type to known cities, people and countries are also available. From a technical point of view NEPOMUK works by adding all desktop knowledge into a central RDF store, paired with a Lucene index for full-text queries of the content of RDF literals [4]. We have defined a set of ontologies for representing desktop items, known collectively as NIE (Nepomuk Information Elements) 2 ; NIE includes ontologies for contacts (Nepomuk Contact Ontology NCO), messages (Messaging Ontology NMO). The PIMO ontology is used to organise and interrelate this knowledge. A set of core information handling services, such as PIMO manipulation, data-crawling, etc., are available through HTTP based interfaces and a Eclipse RCP GUI is built in top of these. Additional plugins for existing applications are also built using the HTTP interfaces. 2 Adding Knowledge When starting the use a new knowledge management tool, bootstrapping is an important problem. To the end-user, the value of the Semantic Desktop increases as more data is added, allowing richer interrelations and increased chances of finding interesting patterns. In NEPOMUK the Aperture framework 3 is used for importing existing data into the Semantic Desktop. Aperture can crawl information from a wide range of sources, such as file systems, email servers, web2.0 web-sites and existing desktop applications such as MS Outlook or Mozilla Thunderbird. From the crawled information sources Aperture can extract meta-data as wellasthetextualcontents of documents; most common document formats are supported, such MS Office documents, PDFs, HTML, JPEGs, PNGs, MP3s, other multi-media files, and many more. When NEPOMUK is first installed the user has a choice of what data-sources to setup and crawl, common data-sources such as files and email are automati- 2 http://www.semanticdesktop.org/ontologies/ 3 http://aperture.sourceforge.net

838 G.A. Grimnes, L. Sauermann, and A. Bernardi Fig. 1. The NEPOMUK Personal Semantic Workbench cally detected by the system. Crawling is done regularlyand iteratively, meaning changes to the original data will soon be picked up by NEPOMUK. Once NEPOMUK has been setup and existing information has been crawled, NEPOMUK offers several ways to add new information directly into the Semantic Desktop data-cloud. Firstly, NEPOMUK includes a DropBox, a special folder that is monitored for changes, when a new file is downloaded from the web it can be saved in this folder. NEPOMUK will detect the file and analyse the content a pop-up a dialog with suggested tags, as shown in Figure 2. Here the file prototypearchitecture.pdf has been dropped, and the tags Marco Andriotti and Technical Workpackage Meeting has been proposed, the user himself has added a new tag: cidprototype. Each tag in NEPOMUK can be associated with a folder on disk, when a tag with an associated folder is assigned in the DropBox, the file is automatically moved to the appropriate location as well as tagged with the correct concept in one quick operation. For content that is not downloaded into the DropBox NEPOMUK offers a bookmarklet for easy tagging of web-pages, as well as plugins for Mozilla Thunderbird and MS Outlook for tagging of emails. 3 Information Enrichment and Annotation Importing the knowledge into a common RDF store only solves part of the information integration problem, as the data still forms separate islands for each

The Personal Knowledge Workbench of the NEPOMUK Semantic Desktop 839 Fig. 2. The NEPOMUK DropBox data-source. Creating the relations that for instance links the author of a PDF on disk to the sender of an email from Thunderbird is an important part of NEPO- MUK and we call the process Local Data Alignment (LDA). Space does not allow us to go into details here, but NEPOMUK uses several heuristics for doing named entity merging; assigning the right class to known cities and countries; using the semantically enabled encyclopedia DBPedia 4 for finding additional information about concepts; and graph mining approaches for finding related resources. For crawled data, the data-alignment happens automatically. For manually created concepts the relations proposed by LDA are shows as recommendations in the user-interface. 4 Browsing and Searching One of the key benefits of using Semantic Web technology to structure desktop information is that one is no longer tied to a single scheme of categorisation, i.e. in a traditional file-system a file resides in one folder, and one folder only. If one has chosen to arrange ones files by project, there is no way to later go back and view them by year. On the Semantic Desktop we can offer a wide range of views on the same data, for example, a timeline view that shows what concepts have been active in a certain period; a map view that shows the geographical location of concepts; a facetted browser; a tree-based view of the personal ontology, etc. For example, Figure 1 shows the hierarchical class-browser for browsing concepts based on their class, a timeline view, a map view, as well as a concept being editted. 4 http://dbpedia.org

840 G.A. Grimnes, L. Sauermann, and A. Bernardi For the cases when browsing is not sufficient, NEPOMUK also offers a powerful search perspective, allowing the user to search her entire information space based either simple keyword queries as well as a more powerful structured query builder [5]. The structured query builder provides a visual interface for creating SPARQL queries by drag-dropping, and also offers heuristics for ontology relaxation when the exact ontology of the data queried is not known. 5 Summary We have briefly outlined the features the NEPOMUK Semantic Desktop offers for personal information management, focussing on the process of adding information into the system and later browsing and re-finding this information. The NEPOMUK system we have discussed is open source, and the system, as well as the test-data and scripts that will be used for the demo, are all available from the project website. References 1. Bernardi, A., Decker, S., van Elst, L., Grimnes, G., Groza, T., Handschuh, S., Jazayeri, M., Mesnage, C., Moeller, K., Reif, G., Sintek, M., Sauermann, L.: The Social Semantic Desktop: A New Paradigm Towards Deploying the Semantic Web on the Desktop. IGI Global (October 2008) 2. Groza, T., Handschuh, S., Moeller, K., Grimnes, G., Sauermann, L., Minack, E., Mesnage, C., Jazayeri, M., Reif, G., Gudjonsdottir, R.: The nepomuk project on the way to the social semantic desktop. In: Proceedings of the Third International Conference on Semantic Technologies (I-SEMANTICS 2007), Graz, Austria (2007) 3. Sauermann, L., Heim, D.: Evaluating long-term use of the gnowsis semantic desktop for pim. In: Sheth, A., et al. (eds.) ISWC 2008. LNCS, vol. 5318, pp. 467 482. Springer, Heidelberg (2008) 4. Minack, E., Sauermann, L., Grimnes, G., Fluit, C., Broekstra, J.: The sesame lucene sail: Rdf queries with full-text search. Technical Report 2008-1, NEPOMUK Consortium (February 2008) 5. Kawase, R., Minack, E., Nejdl, W., Araäjo, S., Schwabe, D.: Incremental end-user query construction for the semantic desktop. In: International Conference on Web Information Systems and Technologies (WEBIST), Lisboa, Protugal (2009)