Utilizing Open Data for interactive knowledge transfer

Similar documents
3 Publishing Technique

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper.

<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany

Exploring and Using the Semantic Web

Semantic Web and Natural Language Processing

Web 2.0, AJAX and RIAs

Browsing the Semantic Web

Utilizing, creating and publishing Linked Open Data with the Thesaurus Management Tool PoolParty

The NEPOMUK project. Dr. Ansgar Bernardi DFKI GmbH Kaiserslautern, Germany

Reducing Consumer Uncertainty

Managing Learning Objects in Large Scale Courseware Authoring Studio 1

COMP9321 Web Application Engineering

SEXTANT 1. Purpose of the Application

Introduction. October 5, Petr Křemen Introduction October 5, / 31

COMP9321 Web Application Engineering

OKKAM-based instance level integration

An Entity Name Systems (ENS) for the [Semantic] Web

DBpedia Extracting structured data from Wikipedia

Outline. 1 Introduction. 2 Semantic Assistants: NLP Web Services. 3 NLP for the Masses: Desktop Plug-Ins. 4 Conclusions. Why?

Ontology-based Architecture Documentation Approach

PROJECT PERIODIC REPORT

Fusing Corporate Thesaurus Management with Linked Data using PoolParty

SRI International, Artificial Intelligence Center Menlo Park, USA, 24 July 2009

Linking Spatial Data from the Web

VISO: A Shared, Formal Knowledge Base as a Foundation for Semi-automatic InfoVis Systems

DBpedia-An Advancement Towards Content Extraction From Wikipedia

EFFICIENT INTEGRATION OF SEMANTIC TECHNOLOGIES FOR PROFESSIONAL IMAGE ANNOTATION AND SEARCH

A Developer s Guide to the Semantic Web

Re-using Cool URIs: Entity Reconciliation Against LOD Hubs

> Semantic Web Use Cases and Case Studies

Linked Data. Department of Software Enginnering Faculty of Information Technology Czech Technical University in Prague Ivo Lašek, 2011

a paradigm for the Introduction to Semantic Web Semantic Web Angelica Lo Duca IIT-CNR Linked Open Data:

Leveraging the Social Web for Situational Application Development and Business Mashups

The Emerging Web of Linked Data

STS Infrastructural considerations. Christian Chiarcos

Semantic Web. Lecture XIII Tools Dieter Fensel and Katharina Siorpaes. Copyright 2008 STI INNSBRUCK

Semantic Web in a Constrained Environment

Google indexed 3,3 billion of pages. Google s index contains 8,1 billion of websites

Semantic e-science. Bibliographic Cloud

Linked Data: What Now? Maine Library Association 2017

ITARC Stockholm Olle Olsson World Wide Web Consortium (W3C) Swedish Institute of Computer Science (SICS)

ITARC Stockholm Olle Olsson World Wide Web Consortium (W3C) Swedish Institute of Computer Science (SICS)

A B2B Search Engine. Abstract. Motivation. Challenges. Technical Report

Springer Science+ Business, LLC

SLIPO. Scalable Linking and Integration of Big POI data. Giorgos Giannopoulos IMIS/Athena RC

W3C Workshop on the Future of Social Networking, January 2009, Barcelona

WebGUI & the Semantic Web. William McKee WebGUI Users Conference 2009

DB Project. Database Systems Spring 2013

The Semantic Web DEFINITIONS & APPLICATIONS

The Linking Open Data Project Bootstrapping the Web of Data

Service Integration - A Web of Things Perspective W3C Workshop on Data and Services Integration

Proposal for Implementing Linked Open Data on Libraries Catalogue

Semantic agents for location-aware service provisioning in mobile networks

How to Publish Linked Data on the Web - Proposal for a Half-day Tutorial at ISWC2008

The role of vocabularies for estimating carbon footprint for food recipies using Linked Open Data

Semantic Annotation, Search and Analysis

August 2012 Daejeon, South Korea

An Annotation Tool for Semantic Documents

Semantic Adaptation Approach for Adaptive Web-Based Systems

Management of Complex Product Ontologies Using a Web-Based Natural Language Processing Interface

A Formal Definition of RESTful Semantic Web Services. Antonio Garrote Hernández María N. Moreno García

The Personal Knowledge Workbench of the NEPOMUK Semantic Desktop

a paradigm for the Semantic Web Linked Data Angelica Lo Duca IIT-CNR Linked Open Data:

A service based on Linked Data to classify Web resources using a Knowledge Organisation System

Version 11

Collaborative editing of knowledge resources for cross-lingual text mining

PoolParty. Thesaurus Management Semantic Search Linked Data. ISKO UK, London September 14, Andreas Blumauer

Reducing Consumer Uncertainty Towards a Vocabulary for User-centric Geospatial Metadata

Scalable Data Analysis (CIS )

Ylvi - Multimedia-izing the Semantic Wiki

A generic approach to manage metadata standards

USING DECISION MODELS METAMODEL FOR INFORMATION RETRIEVAL SABINA CRISTIANA MIHALACHE *

KNOWLEDGE GRAPHS. Lecture 1: Introduction and Motivation. TU Dresden, 16th Oct Markus Krötzsch Knowledge-Based Systems

Web Services. Lecture I. Valdas Rapševičius. Vilnius University Faculty of Mathematics and Informatics

ICD Wiki Framework for Enabling Semantic Web Service Definition and Orchestration

Europeana update: aspects of the data

Semantic Web. Sumegha Chaudhry, Satya Prakash Thadani, and Vikram Gupta, Student 1, Student 2, Student 3. ITM University, Gurgaon.

A Semantic Web-Based Approach for Harvesting Multilingual Textual. definitions from Wikipedia to support ICD-11 revision

WaSABi 2014: Breakout Brainstorming Session Summary

Jie Bao, Paul Smart, Dave Braines, Nigel Shadbolt

Porting Social Media Contributions with SIOC

One Click Annotation

Tania Tudorache Stanford University. - Ontolog forum invited talk04. October 2007

RiMOM Results for OAEI 2009

OWLIM Reasoning over FactForge

Joining the BRICKS Network - A Piece of Cake

The Rich Web. Arnaud Dumont RAL Retreat * Nov 7-9, 2007

Mapping between Digital Identity Ontologies through SISM

Information Retrieval (IR) through Semantic Web (SW): An Overview

On Measuring the Lattice of Commonalities Among Several Linked Datasets

Lecture 1: Introduction and Motivation Markus Kr otzsch Knowledge-Based Systems

White Paper. EVERY THING CONNECTED How Web Object Technology Is Putting Every Physical Thing On The Web

Theme Identification in RDF Graphs

Enterprise Multimedia Integration and Search

Towards the Semantic Desktop. Dr. Øyvind Hanssen University Library of Tromsø

Semantic Document Architecture for Desktop Data Integration and Management

Web Services. Lecture I. Valdas Rapševičius Vilnius University Faculty of Mathematics and Informatics

INSPIRE & Linked Data: Bridging the Gap Part II: Tools for linked INSPIRE data

Prof. Dr. Christian Bizer

Linked data and its role in the semantic web. Dave Reynolds, Epimorphics

Transcription:

Utilizing Open Data for interactive knowledge transfer qkai (qualifying Knowledge Acquisition and Inquiry) PhD research project at the Leibniz University of Hanover Faculty of Electrical Engineering and Computer Science Institute of Systems Engineering System- and Computer Architecture 1

Outline Introduction What is the Social Semantic Web (S2W)? How do we define the term Open Data? Which role plays user interaction in here? Motivation and claims How to use Open Data in higher-layered applications? Refining the concept: Main requirements: resource handling, lightweight user interaction, quality of content. Main contribution and focus: The qkai application framework serves as conceptual basis and system specification. A hybrid data layer embeds distributed resources and non redundant enrichment. Incentive interaction with Open Data is intended on the example of knowledge games: instancing gaming components and sequences. Summary and outlook 2

Introduction: Social Semantic Web (S2W) S2W is currently a trend towards a next generation of web applications mixing up Web 2.0 and Semantic Web technologies. Web 2.0 embeds users for content creation and ranking (prosumers: producer + consumer, Mit-Mach-Web, Read-Write-Web). Desktop-alike web applications with Rich User Experience replace the traditional MVC2 model (RIA: Rich Internet Applications with usability focus). Semantic Web offers technology oriented data representation and processing in a formal, machine interpretable language (RDF: Resource Description Framework, W3C standard). Lack of user friendly, agile interaction scenarios in Semantic Web. Lack of standardized representation and reusability in Web 2.0. Promising combination. 3

Introduction: Linked Open Data (LOD) "A piece of knowledge is open if you are free to use, reuse, and redistribute it. (Open Knowledge Foundation) Knowledge bases are numerously available in RDF as huge LOD in SW, partially linked with each other, querying through SPARQL (Simple Protocol and RDF Query Language, W3C standard since 2008). Often redundant content in two representations as e.g. (X)HTML and RDF Wikipedia, Freebase, Dbpedia. Additionally there are lots of resources outside of LOD like (X)HTML, databases, further wikis, multimedia, 4

Introduction: the role of interaction Without interaction Open Data remains futile for the user. Obviously we need some kind of fancy interaction to deploy Open Data for further purpose. Examples for interaction with Open Data: Semantic browsing, searching, grouping: Twine, Freebase, Dbpedia, Sindice, Powerset, Swoogle, Stumpedia. Rating and ranking: Revyu. SPARQL query tools very technical, little intuitive. Searching, displaying, editing, annotating and grouping of content. There is still little continuative advantage without much Incentive for the user to interact and deduce new knowledge. 5

Motivation and requirements How to reuse Open Data targeted in higher-layered applications? Provide standard tasks of knowledge engineering: Acquisition, formalization, representation, visualization. Determine and enhance quality of content: Analyzes and enrichment of meta information. User s opinion and knowledge to annotate, rate and rank content. The more we know about a resource, the better we can reuse it. Tackle extended interaction and incentive for user's attendance: Motivating, enjoyable scenarios inaugurating knowledge games, uniting fun and learning is a proved concept (serious games, assessment), strong focus on usability and lightweight, intuitive interoperation. Interoperation as possibility to deduce new knowledge, AFAWK: No applications available so far that are based on Open Data for knowledge transfer and learning. 6

qkai concept and focus We want to offer scenarios based on user-oriented web services for rich and lightweight interaction with Open Data: Therefore we have to implement standard tasks of knowledge engineering for extended interaction as a generic application framework, available Java APIs for subtasks have to be combined and respectively extended, with a scalable, reusable and unifying software concept: Social Semantic Web and Service Oriented Architecture principles are considered as Dreamteam, retaining application autonomy with loosely coupling of remote resources and services becomes possible, paradigms like extensibility, addressability, stateless communication are fulfilled. Build a hybrid (Meta) data repository for distributed resources: to reach non redundant enrichment of existing resources through semantic interlinking. Qualify Open Data through meta data and interoperation: automated analyzes of meta data using the Aperture Java framework, at the user side we enrich content through interaction based on gaming joker options. Game-based knowledge transfer is instanced to simplify interoperation and enhance user participation. to enable a proposal of suitable use cases, API- and service compositions for deploying Open Data in knowledge transfer and learning. User-centered with Rich User Experience using Flex/Flash because of high design and functionality issues. 7

qkai application framework Designed according to a 5-tier-layer design, in combination of RIA model, mediator-wrapper concepts and SOA. SOA/REST (Representational State transfer) paradigms We get a stateful Client using Flex/Flash plugin. Desktop-alike applications with advantages like faster reaction to user requests, less network traffic and server load, offline usage possibility. We decided for a Rich Thin Client stateless services and server with communication over http protocol. Business logic remains at server side, Rich UI Engine delivers GUIs, the presentation logic is divided from visualization components. Mediation layer as business logic or controller. A mediator is a software module that exploits encoded knowledge about certain sets or subsets of data to create information for a higher layer of applications. (Wiederhold, 1992) Middleware, that connects available services with further technical components (Service Mediation). 8

qkai data layer: Linked Data How to annotate distributed resources with minimal redundancy, access to live resources and ability to change management instead of data dump copies? qkai produces statements about resources for every resource a new qkai URI is generated, following REST/Linked Data paradigms, we get a Linked Data repository with semantic interlinking using seealso or sameas description in RDF and OWL, qkai stores only new information and not the provenance source while adding a new node to LOD. It is the foundation of a kind of self-propelled data structure that grows by and by with increasing user interaction. 9

Quality of content Meta data can be seen as a quality feature: One example is a domain ranking visualized as tag clouds Meta data directly included in a resource like format, timeliness, author, provenance, language, They can be determined with help of the Java framework Aperture. Second level criteria: About which domain we get the most information right now? First level criteria: The more meta data we are snapping, the better we get to know the content. There is no absolute quality, but we can compare resources with each other (Open World Assumption) and weight them based on the amount and structure of meta information. Enrichment of a resource happens in the corresponding qkai URI file. User interaction for example with gaming jokers helps to enhance semantically correctness, after users have played jokers, ranking, annotation and even semantic correction becomes possible. Third level criteria: Employing NLP (Natural Language Processing) might detect some more information hidden inside the resource by entity recognition: NER (Named Entity Recognition), PoS (Part of Speech) Tagging or automated summarization, 10

Game-based question answer creation We focus on question-answer games first Creating questions and answers is the game itself so we do not have to create gaming content laboriously before starting to play Users already get rewarded for creating questions and answers or completing them to a knowledge unit,?! type (person, location, object), domain like computer science or architecture, ), gaming type or question type. qkai concepts use existing ontologies where possible to describe own domains, persons or themes? multiple choice, text-text assignment, image-text assignment or ordering questions are available as learning standards in LMS too. This question types can be easily converted into IMS/QTI format after in game creation, embedding of multimedia is also possible. Gaming concepts are described through ontologies and the user makes his choice, what he wants to ask for or what kind of questions he wants to answer! gaming elements are a question, a answer and their assignment as knowledge unit, YAGO, SKOS, Dublin Core, FOAF, Dbpedia,. Game-based rating and ranking of content and interaction is done by in game joker options and a global point system to reward any kind of interaction. 11

Examples of gaming components and sequences 12

Points of Interest (POI): personalized knowledge view At the beginning of an knowledge game or even at many other interaction scenarios - we have to define the domain and the search space the user wants to play or interact with: The web complies to the open world assumption so we are not able to present a absolute complete domain view. But we can offer what is available and let the user choose out of it. There are lots of domains and themes in the Web of Data. POI setter offers a frontend to limit the search space through the qkai service repository, suggest available content and concepts, filter and select. This can be done thematically, geographically or both. qkai builds a personalized knowledge view this way handled as RDF graphs. 13

qfoaf: Semantic user profil We need a sustainable user profile to store gaming results and personal information in a transparent manner. qfoaf is build as qkai resource in RDF at the beginning of a game as extended FOAF file (FriendOfaFriend) with unique URI for every user. It connects the user semantically with topics, knowledge units, other players, step by step while gaming e.g. http://qkai.org/foaf/msteinberg/foaf.rdf The qfoaf file can be enriched (semi)automated with given information by the user e.g. by including existing FOAF files, geocodes or interests. while gaming and interacting with game points and content like created questions, answers, knowledge units, ratings, with domains, locations, friends, 14

qchunk: Game goal and sequence The user gets presented small textual information chunks (out of Wikipedia) and has to guess the quested term with as less chunks as possible qchunk qkai first chunk multimedia chunks like zoom parts out of images are conceivable too. next chunk Chunk:?? is the capital of Lower Saxony founded in 1942. Answer: Hanover. solve points new game 15

SPARQLizer: Game-based SPARQL-Endpoint Translates questions build by the user into SPARQL requests and returns human readable answers out of SPARQL responses. Available concepts are shown to the user by form based GUI components he can choose of using auto completion and tag clouds. 16

SPARQLizer: example questions (questionizer) Which are the capitals of Europe? Which cities have famous buildings of the gothic epoch? Which people are born in Berlin before 1900? SELECT?name?birth?death?person WHERE {?person dbpedia2:birthplace <http://dbpedia.org/resource/berlin>.?person dbpedia2:birth?birth.?person foaf:name?name.?person dbpedia2:death?death FILTER (?birth < "1900-01-01"^^xsd:date). } ORDER BY?name Immanuel Becker Abraham Mendelssohn Bartholdy Achim von Arnim Adalbert von Preußen 17

SPARQLizer: Flickr text image question User gets presented randomized terms and images out of Flickr (Web 2.0 image sharing platform) and then has to assign the right term to the right image (multiple choice or drag & drop assignment). tags qkai images + terms solve points new game 18

qmap: Map-based geocode GUI Presents a gaming board for available knowledge units (questions and answers) Geocoded placement of locations, events, buildings, photos, persons, currently e.g. a photo often knows where it is taken. qmap provides a frontend for interaction with filtering, searching, editing and adding of information or knowledge units. OpenStreetMap or Yahoo Maps are good alternatives to Google Maps!? Map symbols are connected with different gaming interactions and information units.!?!?!?!???!! especially using Flash/Flex API.??!?!! 19

qrank Allows game-based rating and ranking of resources, information and knowledge units by joker options, Know-it-all-joker bounds player to add missing information, Nonsense-joker marks information as semantically wrong and defers it to review mode by other users, Explorer-joker with predefined time, history-joker enables lookups in played transaction protocol, qlink Offers game-based interlinking with new resources, Linked Open Data basis will be enhanced, new interlinking brings qpoints in qfoaf. qpoint Is a global point and level system documenting learning progress and personal knowledge, every interaction is rewarded with qpoints, based on an interaction catalog with assigned points, 20

Current work and outlook Current: We are implementing demonstrators für GUI and background functionality: qchunk, SPARQLizer, POI-Setter are work in progress. Backend services for qchunk and POI-Setter are almost done, Flex GUI frontend is currently under development. Embedding available resources as qkai knowledge base using Sindice web service and SPARQL is also work in progress. Next: Implementing SPARQLizer, qmap, qpoint, qfoaf, evaluation of finished gaming types with large student groups. Proof of concept. 1,5 years left for prototypically implementation and evaluation of gaming components and services. Long term objective: Atomic, simple and composite services for generic rearrangement in several scenarios. 21

Conclusion Requirements are derived to reuse Open Data for knowledge transfer and learning. Shown on the example of knowledge games: Summarizing the main aspects in qkai: Implemented atomic services allow rearrangement and are suitable for any other purpose in information and knowledge transfer not only for game-based use cases. Services are adaptable to several domains like computer science, city guiding, history or architecture studies, philosophy, music theory, We are designing and implementing a framework specification following REST, Linked Data und RIA paradigms, building a hybrid data store for distributed web resources with semantic interlinking, creating incentive game-based interoperation using scalable web services and rich interoperation, determining and enhancing contents quality by analyzing meta data and enriching content with user s in game annotations. Reusing and composing is a precept at all levels, the challenge is to merge and to expand: Resources, concepts (ontologies), frameworks, APIs, tools, libraries, We are aiming at standardized, machine- and human readable staging of Open Data with lightweight interoperation. 22

Thank you very much for your attention! 23