Joe Raad [1,2], Wouter Beek [3], Frank van Harmelen [3], Nathalie Pernelle [2], Fatiha Saïs [2]

Size: px
Start display at page:

Download "Joe Raad [1,2], Wouter Beek [3], Frank van Harmelen [3], Nathalie Pernelle [2], Fatiha Saïs [2]"

Transcription

1 DETECTING ERRONEOUS IDENTITY LINKS IN THE WEB OF DATA Joe Raad [1,2], Wouter Beek [3], Frank van Harmelen [3], Nathalie Pernelle [2], Fatiha Saïs [2] [1] INRA, Paris France [2] LRI, Orsay France [3] VU, Amsterdam Netherlands

2 MOTIVATION

3 5 LINKED OPEN DATA make your data available on the Web make it available as structured data make it available in a non-proprietary format use open standards from the W3C link your data to other data Tim Berners-Lee, 2010

4 WHY LINKING YOUR DATA? My Pharmacy Knowledge Graph mykg:drug33 rdfs:label "Dolasetron" ; mykg:treats mykg:nausea ; mykg:location geonames:paris. Freebase Knowledge Graph freebase:m.02jq1 freebase:objectname "Dolasetronum" ; freebase:objecttype freebase:medicine. Ok Google... where to find a medicine for Nausea in Paris?

5 HOW TO LINK YOUR DATA? owl:sameas (the Semantic Web Identity Predicate) x, owl:sameas, y means that: x = y ( P)(Px Py) there is one thing which has two names: x and y

6 "THE SAMEAS PROBLEM" (1/2) Like the WWW, the Semantic Web contains a number of incorrect (identity) statements Oversimplified heuristic generation Classical notion of identity is problematic Lack of formal standardized alternatives Concept dri of identity links targets

7 "THE SAMEAS PROBLEM" (2/2) The largest identity set contains 177,794 terms that 'should' refer to the same real world entity However: full list at:

8 HOW CAN WE DETECT ERRONEOUS SAMEAS LINKS? Source Trustworthiness [Cudre-Mauroux et al. 2009] UNA or Ontology Axioms Violation [de Melo 2013; Valdestilhas et al. 2017; Hogan et al. 2012; Papaleo et al. 2014] Content-based [Acosta et al. 2013; Paulheim et al. 2014; Cuzzola et al.,2015] Network Metrics [Guéret et al. 2012; Sarasua et al.,2017]

9 REQUIREMENTS High accuracy and recall Scalable to the LOD Tested on real world data No (or minimal) assumptions on the data (e.g. UNA, textual description, source trustworthiness) (No existing approach combines all these criteria)

10 APPROACH & EXPERIMENTS

11 OVERALL IDEA Use the community structure of the network containing solely owl:sameas links to assign an error degree for each link 4 MAIN STEPS

12 DATASET LOD-a-lot dataset: 28.3B unique triples [Fernandez et al. 2017]

13 1. EXTRACT THE EXPLICIT IDENTITY STATEMENTS prefix owl: < select distinct?s?p?o { bind (owl:sameas?p)?s?p?o } OUTPUT: 558.9M owl:sameas (179.7M terms)

14 2. PARTITION TO EQUALITY SETS :a owl:sameas :b :a owl:sameas :c :c owl:sameas :a :d owl:sameas :e :f owl:sameas :f Eq Set 1 Eq Set 2 OUTPUT: 48.9M Non-Singleton Equality Sets [Beek et al. 2018]

15 'BARACK OBAMA' EQUALITY SET These 440 identifiers denote the same thing (EqSet 5723)

16 3. DETECT THE COMMUNITY STRUCTURE IN EACH EQ SET We use the Louvain algorithm Detects non-overlapping communities Adapted to weighted networks Low computational complexity Outperforms other algorithms [Lancichinetti and Fortunato. 2009; Yang et al. 2016] [Blondel et al. 2008]

17 4. ASSIGN ERROR DEGREES Intra-Community Link Inter-Community Link Between 0 and 1 based on the weight of the link and the density of the community(ies)

18 COMMUNITIES - 'BARACK OBAMA' C0: person; C1: president; C2: government; C3: senator

19 EVALUATION

20 ERROR DEGREE DISTRIBUTION OF 556M OWL:SAMEAS

21 FINDING THE THRESHOLD (1/3) MANUAL EVALUATION OF 200 SAMEAS LINKS 1. All evaluated links with an error degree < 0.4 are correct

22 FINDING THE THRESHOLD (2/3) MANUAL EVALUATION OF 200 SAMEAS LINKS 2. The higher an error degree is, the more likely an owl:sameas link is erroneous

23 FINDING THE THRESHOLD (3/3) MANUAL EVALUATION OF 60 SAMEAS WITH ERR > Links with an err > 0.99 and belonging to large equality sets are more likely to be incorrect

24 EVALUATION / ACCURACY (1/2) Erroneous sameas: err > 0.99 (threshold) TP TN FP FN Total Table Table 'Obama' EqSet Total % Accuracy TP + TN Total

25 EVALUATION / ACCURACY (2/2) Erroneous sameas: err > 0.99 (threshold) 95 correct DBpedia-Freebase owl:sameas 78 / 95 links exist in our dataset 77 / 78 links have an err 0.99 (error degree ranging from 0.52 to 0.94) 98.7% Accuracy [Acosta et al. 2013]

26 EVALUATION / RECALL Manually chose 40 random different terms (e.g. dbr:facebook, dbr:strawberry, dbr:chair) Made sure they are not explicitly sameas (but some which are in the same equality set) Added all the possible 780 links between them 93% Recall 725/780 have an error degree > 0.99 (error degree ranging from 0.87 to )

27 ERROR DEGREES Intra-Community Link Inter-Community Link [0, 1] based on the weight of the link and the density of the community(ies)

28 SYMMETRY IMPACT (1/2) 1. Symmetrical owl:sameas links are more likely to be correct

29 SYMMETRY IMPACT (2/2) DISCARDED THE WEIGHT FROM THE ERROR DEGREE Manual evaluation of 30 sameas links with error degree > in the largest equality set Only 2 erroneous owl:sameas (17 correct and 11 can't tell) Accuracy drops from 88% to 11% (in the largest equality set)

30 WHO IS MESSING UP THE LOD? C0: person; C1: president; C2: government; C3: senator

31 WHO IS MESSING UP THE LOD? freebase:m.05b6w1g owl:sameas dbr:president_barack_obama freebase:m.05b6w1g owl:sameas dbr:president_obama freebase:m.05b6w1g freebase:object.name "Presidency of Barack Both owl:sameas links have are error degree of the only two links in the 'Obama' equality set with err > 0.99

32 HOW MESSED UP IS THE LOD? Our approach can give an approximation based on: 1. nbr of symmetrical sameas (450M with 98% chance of correctness) 2. nbr of non-symmetrical sameas with err degree 0.99 (105M with a 88.6% chance of correctness) 3. nbr of non-symmetrical sameas with err degree >0.99 (1.2M with an erroneous probability between 40 and 88%) We estimate that 4% of owl:sameas links in the LOD are erroneous (around 22.5M links) previous estimations: 2.8% by [Hogan et al. 2012] and 21% by [Halpin et al. 2010]

33 CONCLUSION & PERSPECTIVES

34 SUMMARY Approach for detecting erroneous identity links Uses the sameas network's community structure No assumptions on the data Scale to the Web (total runtime of 11 hours) High Accuracy (86%) and Recall (93%) Links, Error Degrees, and Eq Sets:

35 PERSPECTIVES Combine multiple community detection algorithms Consider the Equality Set size in the error degree Include duplicate owl:sameas triples Combine with state-of-the-art approaches Replace erroneous owl:sameas links with contextual identity links [Beek et al. 2016; Raad et al. 2017]

36 THANK YOU! J.Raad, W.Beek, F.van Harmelen, N.Pernelle, and F.Saïs Detecting Erroneous Identity Links on the Web using Network Metrics, ISWC W.Beek, J.Raad, J.Wielemaker, and F.van Harmelen sameas.cc: The Closure of 500M owl:sameas Statements, ESWC. 2018

KNOWLEDGE GRAPH COMPLETION PART 3: IDENTITY LINK VALIDATION

KNOWLEDGE GRAPH COMPLETION PART 3: IDENTITY LINK VALIDATION KNOWLEDGE GRAPH COMPLETION PART 3: IDENTITY LINK VALIDATION FATIHA SAÏS (1) NATHALIE PERNELLE (1) DANAI SYMEONIDOU (2) (1) LRI, PARIS SUD UNIVERSITY, CNRS, PARIS SACLAY UNIVERSITY (2) INRA, GAMMA TEAM

More information

sameas.cc: The Closure of 500M owl:sameas Statements

sameas.cc: The Closure of 500M owl:sameas Statements sameas.cc: The Closure of 500M owl:sameas Statements Wouter Beek 1, Joe Raad 2, Jan Wielemaker 1, and Frank van Harmelen 1 1 Dept. of Computer Science, VU University Amsterdam, NL {w.g.j.beek, J.Wielemaker,

More information

The Data Web and Linked Data.

The Data Web and Linked Data. Mustafa Jarrar Lecture Notes, Knowledge Engineering (SCOM7348) University of Birzeit 1 st Semester, 2011 Knowledge Engineering (SCOM7348) The Data Web and Linked Data. Dr. Mustafa Jarrar University of

More information

On Measuring the Lattice of Commonalities Among Several Linked Datasets

On Measuring the Lattice of Commonalities Among Several Linked Datasets On Measuring the Lattice of Commonalities Among Several Linked Datasets Michalis Mountantonakis and Yannis Tzitzikas FORTH-ICS Information Systems Laboratory University of Crete Computer Science Department

More information

01 INTRODUCTION TO SEMANTIC WEB

01 INTRODUCTION TO SEMANTIC WEB SEMANTIC WEB 01 INTRODUCTION TO SEMANTIC WEB FROM WEB 1.0 TO WEB 3.0 IMRAN IHSAN ASSISTANT PROFESSOR, AIR UNIVERSITY, ISLAMABAD WWW.IMRANIHSAN.COM QUESTIONS What is the Semantic Web? Why do we want it?

More information

Linked Open Data: a short introduction

Linked Open Data: a short introduction International Workshop Linked Open Data & the Jewish Cultural Heritage Rome, 20 th January 2015 Linked Open Data: a short introduction Oreste Signore (W3C Italy) Slides at: http://www.w3c.it/talks/2015/lodjch/

More information

Linked Data. The World is Your Database

Linked Data. The World is Your Database Linked Data Dave Clarke Synaptica CEO Gene Loh Synaptica Software Architect The World is Your Database Agenda 1. What is Linked Data, and why is it good for you (15 mins) What is Linked Data 2. How it

More information

Semantic Web Fundamentals

Semantic Web Fundamentals Semantic Web Fundamentals Web Technologies (706.704) 3SSt VU WS 2017/18 Vedran Sabol with acknowledgements to P. Höfler, V. Pammer, W. Kienreich ISDS, TU Graz December 11 th 2017 Overview What is Semantic

More information

The Semantic Web: A Vision or a Dream?

The Semantic Web: A Vision or a Dream? The Semantic Web: A Vision or a Dream? Ben Weber Department of Computer Science California Polytechnic State University May 15, 2005 Abstract The Semantic Web strives to be a machine readable version of

More information

DBpedia-An Advancement Towards Content Extraction From Wikipedia

DBpedia-An Advancement Towards Content Extraction From Wikipedia DBpedia-An Advancement Towards Content Extraction From Wikipedia Neha Jain Government Degree College R.S Pura, Jammu, J&K Abstract: DBpedia is the research product of the efforts made towards extracting

More information

FedX: Optimization Techniques for Federated Query Processing on Linked Data. ISWC 2011 October 26 th. Presented by: Ziv Dayan

FedX: Optimization Techniques for Federated Query Processing on Linked Data. ISWC 2011 October 26 th. Presented by: Ziv Dayan FedX: Optimization Techniques for Federated Query Processing on Linked Data ISWC 2011 October 26 th Presented by: Ziv Dayan Andreas Schwarte 1, Peter Haase 1, Katja Hose 2, Ralf Schenkel 2, and Michael

More information

Semantic Web Fundamentals

Semantic Web Fundamentals Semantic Web Fundamentals Web Technologies (706.704) 3SSt VU WS 2018/19 with acknowledgements to P. Höfler, V. Pammer, W. Kienreich ISDS, TU Graz January 7 th 2019 Overview What is Semantic Web? Technology

More information

Semantic Interoperability. Being serious about the Semantic Web

Semantic Interoperability. Being serious about the Semantic Web Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA 1 Being serious about the Semantic Web It is not one person s ontology It is not several people s common

More information

Linked Data Annotation and Fusion driven by Data Quality Evaluation

Linked Data Annotation and Fusion driven by Data Quality Evaluation Linked Data Annotation and Fusion driven by Data Quality Evaluation Ioanna Giannopoulou (1), Fatiha Saïs (1) and Rallou Thomopoulos (2) (1) LRI (Paris Sud University & CNRS) Bât. 650 Ada Lovelace, F-91405

More information

Semantic Web Information Management

Semantic Web Information Management Semantic Web Information Management Norberto Fernández ndez Telematics Engineering Department berto@ it.uc3m.es.es 1 Motivation n Module 1: An ontology models a domain of knowledge n Module 2: using the

More information

RiMOM Results for OAEI 2009

RiMOM Results for OAEI 2009 RiMOM Results for OAEI 2009 Xiao Zhang, Qian Zhong, Feng Shi, Juanzi Li and Jie Tang Department of Computer Science and Technology, Tsinghua University, Beijing, China zhangxiao,zhongqian,shifeng,ljz,tangjie@keg.cs.tsinghua.edu.cn

More information

Ontological Modeling: Part 2

Ontological Modeling: Part 2 Ontological Modeling: Part 2 Terry Halpin LogicBlox This is the second in a series of articles on ontology-based approaches to modeling. The main focus is on popular ontology languages proposed for the

More information

Ontology Matching and the Semantic Web

Ontology Matching and the Semantic Web Ontology Matching and the Semantic Web Heiko Paulheim TU Darmstadt, Knowledge Engineering Group / SAP Research Darmstadt WeRC Interdisciplinary Talk Series April 26th, 2011 April 26th, 2011 Department

More information

Semantic Technologies & Triplestores for BI

Semantic Technologies & Triplestores for BI Semantic Technologies & Triplestores for BI 1 st European Business Intelligence Summer School ebiss 2011 Marin Dimitrov (Ontotext) Jul 2011 ebiss 2011 #2 Contents Introduction to Semantic Technologies

More information

RKB, sameas and dotac

RKB, sameas and dotac RKB, sameas and dotac at 2009: Beyond the Repository Fringe Edinburgh 30-31 July 2009 Hugh Glaser & Ian Millard Linked Data Tim Berners-Lee http://www.w3.org/2009/talks/0204-ted-tbl/ the Semantic Web done

More information

Towards a Linked Semantic Web: Precisely, Comprehensively and Scalably Linking Heterogeneous Data in the Semantic Web

Towards a Linked Semantic Web: Precisely, Comprehensively and Scalably Linking Heterogeneous Data in the Semantic Web Lehigh University Lehigh Preserve Theses and Dissertations 2014 Towards a Linked Semantic Web: Precisely, Comprehensively and Scalably Linking Heterogeneous Data in the Semantic Web Dezhao Song Lehigh

More information

Avoiding Chinese Whispers: Controlling End-to-End Join Quality in Linked Open Data Stores

Avoiding Chinese Whispers: Controlling End-to-End Join Quality in Linked Open Data Stores Avoiding Chinese Whispers: Controlling End-to-End Join Quality in Linked Open Data Stores Jan-Christoph Kalo, Silviu Homoceanu, Jewgeni Rose, Wolf-Tilo Balke Technische Universität Braunschweig Mühlenpfordstraße

More information

Prof. Dr. Christian Bizer

Prof. Dr. Christian Bizer STI Summit July 6 th, 2011, Riga, Latvia Global Data Integration and Global Data Mining Prof. Dr. Christian Bizer Freie Universität ität Berlin Germany Outline 1. Topology of the Web of Data What data

More information

ProLD: Propagate Linked Data

ProLD: Propagate Linked Data ProLD: Propagate Linked Data Peter Kalchgruber University of Vienna, Faculty of Computer Science, Liebiggasse 4/3-4, A-1010 Vienna peter.kalchgruber@univie.ac.at Abstract. Since the Web of Data consists

More information

Leveraging Data and Structure in Ontology Integration

Leveraging Data and Structure in Ontology Integration Leveraging Data and Structure in Ontology Integration O. Udrea L. Getoor R.J. Miller Group 15 Enrico Savioli Andrea Reale Andrea Sorbini DEIS University of Bologna Searching Information in Large Spaces

More information

SRI International, Artificial Intelligence Center Menlo Park, USA, 24 July 2009

SRI International, Artificial Intelligence Center Menlo Park, USA, 24 July 2009 SRI International, Artificial Intelligence Center Menlo Park, USA, 24 July 2009 The Emerging Web of Linked Data Chris Bizer, Freie Universität Berlin Outline 1. From a Web of Documents to a Web of Data

More information

Keys and Pseudo-Keys Detection for Web Datasets Cleansing and Interlinking

Keys and Pseudo-Keys Detection for Web Datasets Cleansing and Interlinking Author manuscript, published in "Proc. 18th international conference on knowledge engineering and knowledge management (EKAW) 7603 (2012) 144-153" Keys and Pseudo-Keys Detection for Web Datasets Cleansing

More information

What you have learned so far. Interoperability. Ontology heterogeneity. Being serious about the semantic web

What you have learned so far. Interoperability. Ontology heterogeneity. Being serious about the semantic web What you have learned so far Interoperability Introduction to the Semantic Web Tutorial at ISWC 2010 Jérôme Euzenat Data can be expressed in RDF Linked through URIs Modelled with OWL ontologies & Retrieved

More information

New Approach to Graph Databases

New Approach to Graph Databases Paper PP05 New Approach to Graph Databases Anna Berg, Capish, Malmö, Sweden Henrik Drews, Capish, Malmö, Sweden Catharina Dahlbo, Capish, Malmö, Sweden ABSTRACT Graph databases have, during the past few

More information

Knowledge Graphs on the Web

Knowledge Graphs on the Web Knowledge Graphs on the Web Which information can we find in them and which can we not? 1 Introduction You ve seen this, haven t you? Linking Open Data cloud diagram 2017, by Andrejs Abele, John P. McCrae,

More information

Semantic Web in a Constrained Environment

Semantic Web in a Constrained Environment Semantic Web in a Constrained Environment Laurens Rietveld and Stefan Schlobach Department of Computer Science, VU University Amsterdam, The Netherlands {laurens.rietveld,k.s.schlobach}@vu.nl Abstract.

More information

Semantic Web. Tahani Aljehani

Semantic Web. Tahani Aljehani Semantic Web Tahani Aljehani Motivation: Example 1 You are interested in SOAP Web architecture Use your favorite search engine to find the articles about SOAP Keywords-based search You'll get lots of information,

More information

Cluster-based Instance Consolidation For Subsequent Matching

Cluster-based Instance Consolidation For Subsequent Matching Jennifer Sleeman and Tim Finin, Cluster-based Instance Consolidation For Subsequent Matching, First International Workshop on Knowledge Extraction and Consolidation from Social Media, November 2012, Boston.

More information

Semantics. KR4SW Winter 2011 Pascal Hitzler 1

Semantics. KR4SW Winter 2011 Pascal Hitzler 1 Semantics KR4SW Winter 2011 Pascal Hitzler 1 Knowledge Representation for the Semantic Web Winter Quarter 2011 Pascal Hitzler Slides 5 01/20+25/2010 Kno.e.sis Center Wright State University, Dayton, OH

More information

ONTOPARK: ONTOLOGY BASED PAGE RANKING FRAMEWORK USING RESOURCE DESCRIPTION FRAMEWORK

ONTOPARK: ONTOLOGY BASED PAGE RANKING FRAMEWORK USING RESOURCE DESCRIPTION FRAMEWORK Journal of Computer Science 10 (9): 1776-1781, 2014 ISSN: 1549-3636 2014 doi:10.3844/jcssp.2014.1776.1781 Published Online 10 (9) 2014 (http://www.thescipub.com/jcs.toc) ONTOPARK: ONTOLOGY BASED PAGE RANKING

More information

Logical reconstruction of RDF and ontology languages

Logical reconstruction of RDF and ontology languages Logical reconstruction of RDF and ontology languages Jos de Bruijn 1, Enrico Franconi 2, and Sergio Tessaris 2 1 Digital Enterprise Research Institute, University of Innsbruck, Austria jos.debruijn@deri.org

More information

How Redundant Is It? - An Empirical Analysis on Linked Datasets

How Redundant Is It? - An Empirical Analysis on Linked Datasets How Redundant Is It? - An Empirical Analysis on Linked Datasets Honghan Wu 1, Boris Villazon-Terrazas 2, Jeff Z. Pan 1, and Jose Manuel Gomez-Perez 2 1 Department of Computing Science, University of Aberdeen,

More information

SERIMI Results for OAEI 2011

SERIMI Results for OAEI 2011 SERIMI Results for OAEI 2011 Samur Araujo 1, Arjen de Vries 1, and Daniel Schwabe 2 1 Delft University of Technology, PO Box 5031, 2600 GA Delft, the Netherlands {S.F.CardosodeAraujo, A.P.deVries}@tudelft.nl

More information

Exploiting Semantics Where We Find Them

Exploiting Semantics Where We Find Them Vrije Universiteit Amsterdam 19/06/2018 Exploiting Semantics Where We Find Them A Bottom-up Approach to the Semantic Web Prof. Dr. Christian Bizer Bizer: Exploiting Semantics Where We Find Them. VU Amsterdam,

More information

Linked Data Querying through FCA-based Schema Indexing

Linked Data Querying through FCA-based Schema Indexing Linked Data Querying through FCA-based Schema Indexing Dominik Brosius and Steffen Staab Institute for Web Science and Technologies, University of Koblenz-Landau, {dbrosius, staab}@uni-koblenz.de Abstract.

More information

Revisiting Blank Nodes in RDF to Avoid the Semantic Mismatch with SPARQL

Revisiting Blank Nodes in RDF to Avoid the Semantic Mismatch with SPARQL Revisiting Blank Nodes in RDF to Avoid the Semantic Mismatch with SPARQL Marcelo Arenas 1, Mariano Consens 2, and Alejandro Mallea 1,3 1 Pontificia Universidad Católica de Chile 2 University of Toronto

More information

Semantic Web and Natural Language Processing

Semantic Web and Natural Language Processing Semantic Web and Natural Language Processing Wiltrud Kessler Institut für Maschinelle Sprachverarbeitung Universität Stuttgart Semantic Web Winter 2014/2015 This work is licensed under a Creative Commons

More information

Helmi Ben Hmida Hannover University, Germany

Helmi Ben Hmida Hannover University, Germany Helmi Ben Hmida Hannover University, Germany 1 Summarizing the Problem: Computers don t understand Meaning My mouse is broken. I need a new one 2 The Semantic Web Vision the idea of having data on the

More information

Understanding Billions of Triples with Usage Summaries

Understanding Billions of Triples with Usage Summaries Understanding Billions of Triples with Usage Summaries Shahan Khatchadourian and Mariano P. Consens University of Toronto shahan@cs.toronto.edu, consens@cs.toronto.edu Abstract. Linked Data is a way to

More information

Today: RDF syntax. + conjunctive queries for OWL. KR4SW Winter 2010 Pascal Hitzler 3

Today: RDF syntax. + conjunctive queries for OWL. KR4SW Winter 2010 Pascal Hitzler 3 Today: RDF syntax + conjunctive queries for OWL KR4SW Winter 2010 Pascal Hitzler 3 Today s Session: RDF Schema 1. Motivation 2. Classes and Class Hierarchies 3. Properties and Property Hierarchies 4. Property

More information

SWSE: Objects before documents!

SWSE: Objects before documents! Provided by the author(s) and NUI Galway in accordance with publisher policies. Please cite the published version when available. Title SWSE: Objects before documents! Author(s) Harth, Andreas; Hogan,

More information

Creating Large-scale Training and Test Corpora for Extracting Structured Data from the Web

Creating Large-scale Training and Test Corpora for Extracting Structured Data from the Web Creating Large-scale Training and Test Corpora for Extracting Structured Data from the Web Robert Meusel and Heiko Paulheim University of Mannheim, Germany Data and Web Science Group {robert,heiko}@informatik.uni-mannheim.de

More information

StdTrip+K: Design Rationale in the RDB-to-RDF process

StdTrip+K: Design Rationale in the RDB-to-RDF process StdTrip+K: Design Rationale in the RDB-to-RDF process Rita Berardi 1, Karin Breitman 1, Marco A. Casanova 1, Giseli Rabello Lopes 1, Adriana Pereira de Medeiros 2 1 Departamento de Informática Pontifícia

More information

Information-theoretic Analysis of Entity Dynamics on the Linked Open Data Cloud

Information-theoretic Analysis of Entity Dynamics on the Linked Open Data Cloud Information-theoretic Analysis of Entity Dynamics on the Linked Open Data Cloud Chifumi Nishioka 1,2 and Ansgar Scherp 1,2 1 Kiel University, Christian-Albrechts-Platz 4, 24118 Kiel, Germany 2 ZBW Leibniz

More information

A rule-based approach to address semantic accuracy problems on Linked Data

A rule-based approach to address semantic accuracy problems on Linked Data A rule-based approach to address semantic accuracy problems on Linked Data (ISWC 2014 - Doctoral Consortium) Leandro Mendoza 1 LIFIA, Facultad de Informática, Universidad Nacional de La Plata, Argentina

More information

Chapter 9. Classification and Clustering

Chapter 9. Classification and Clustering Chapter 9 Classification and Clustering Classification and Clustering Classification and clustering are classical pattern recognition and machine learning problems Classification, also referred to as categorization

More information

A Semantic Web-Based Approach for Harvesting Multilingual Textual. definitions from Wikipedia to support ICD-11 revision

A Semantic Web-Based Approach for Harvesting Multilingual Textual. definitions from Wikipedia to support ICD-11 revision A Semantic Web-Based Approach for Harvesting Multilingual Textual Definitions from Wikipedia to Support ICD-11 Revision Guoqian Jiang 1,* Harold R. Solbrig 1 and Christopher G. Chute 1 1 Department of

More information

KNOWLEDGE GRAPHS. Lecture 1: Introduction and Motivation. TU Dresden, 16th Oct Markus Krötzsch Knowledge-Based Systems

KNOWLEDGE GRAPHS. Lecture 1: Introduction and Motivation. TU Dresden, 16th Oct Markus Krötzsch Knowledge-Based Systems KNOWLEDGE GRAPHS Lecture 1: Introduction and Motivation Markus Krötzsch Knowledge-Based Systems TU Dresden, 16th Oct 2018 Introduction and Organisation Markus Krötzsch, 16th Oct 2018 Knowledge Graphs slide

More information

SPARQL Protocol And RDF Query Language

SPARQL Protocol And RDF Query Language SPARQL Protocol And RDF Query Language WS 2011/12: XML Technologies John Julian Carstens Department of Computer Science Communication Systems Group Christian-Albrechts-Universität zu Kiel March 1, 2012

More information

Semantic Web. Ontology Alignment. Morteza Amini. Sharif University of Technology Fall 95-96

Semantic Web. Ontology Alignment. Morteza Amini. Sharif University of Technology Fall 95-96 ه عا ی Semantic Web Ontology Alignment Morteza Amini Sharif University of Technology Fall 95-96 Outline The Problem of Ontologies Ontology Heterogeneity Ontology Alignment Overall Process Similarity (Matching)

More information

RDF Data Management: Reasoning on Web Data

RDF Data Management: Reasoning on Web Data RDF Data Management: Reasoning on Web Data Damian Bursztyn 1,2, François Goasdoué 3,1, Ioana Manolescu 1,2, Alexandra Roatiş 1,2 1 INRIA 2 Université Paris Sud 3 Université Rennes 1 France ioana.manolescu@inria.fr,

More information

INF4820 Algorithms for AI and NLP. Evaluating Classifiers Clustering

INF4820 Algorithms for AI and NLP. Evaluating Classifiers Clustering INF4820 Algorithms for AI and NLP Evaluating Classifiers Clustering Murhaf Fares & Stephan Oepen Language Technology Group (LTG) September 27, 2017 Today 2 Recap Evaluation of classifiers Unsupervised

More information

Semantic Web Systems Introduction Jacques Fleuriot School of Informatics

Semantic Web Systems Introduction Jacques Fleuriot School of Informatics Semantic Web Systems Introduction Jacques Fleuriot School of Informatics 11 th January 2015 Semantic Web Systems: Introduction The World Wide Web 2 Requirements of the WWW l The internet already there

More information

User Interests: Definition, Vocabulary, and Utilization in Unifying Search and Reasoning

User Interests: Definition, Vocabulary, and Utilization in Unifying Search and Reasoning User Interests: Definition, Vocabulary, and Utilization in Unifying Search and Reasoning Yi Zeng 1, Yan Wang 1, Zhisheng Huang 2, Danica Damljanovic 3, Ning Zhong 1,4, Cong Wang 1 1 International WIC Institute,

More information

BUBBLE RAP: Social-Based Forwarding in Delay-Tolerant Networks

BUBBLE RAP: Social-Based Forwarding in Delay-Tolerant Networks 1 BUBBLE RAP: Social-Based Forwarding in Delay-Tolerant Networks Pan Hui, Jon Crowcroft, Eiko Yoneki Presented By: Shaymaa Khater 2 Outline Introduction. Goals. Data Sets. Community Detection Algorithms

More information

Natasha Noy Stanford University USA

Natasha Noy Stanford University USA Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University US Being serious

More information

Discovering Concept Coverings in Ontologies of Linked Data Sources

Discovering Concept Coverings in Ontologies of Linked Data Sources Discovering Concept Coverings in Ontologies of Linked Data Sources Rahul Parundekar, Craig A. Knoblock and Jose-Luis Ambite {parundek,knoblock}@usc.edu, ambite@isi.edu University of Southern California

More information

How to Design & Build Semantic Applications with Linked Data #DCMI17. https://www.asist.org. Dave Clarke CEO Synaptica

How to Design & Build Semantic Applications with Linked Data #DCMI17. https://www.asist.org.  Dave Clarke CEO Synaptica How to Design & Build Semantic Applications with Linked Data https://www.asist.org #DCMI17 Dave Clarke CEO Synaptica http://dublincore.org/ Outline & Objectives 1. Linked Data: what is it and why is it

More information

Designing a self-medication application on Semantic Web technologies. Olivier Curé UPEM LIGM, France

Designing a self-medication application on Semantic Web technologies. Olivier Curé UPEM LIGM, France Designing a self-medication application on Semantic Web technologies Olivier Curé UPEM LIGM, France Overview Self-medication applications Symptom & Drug DB Overview Self-medication applications Symptom

More information

WebPIE: A Web-scale parallel inference engine using MapReduce

WebPIE: A Web-scale parallel inference engine using MapReduce WebPIE: A Web-scale parallel inference engine using MapReduce Jacopo Urbani a, Spyros Kotoulas a, Jason Maassen a, Frank van Harmelen a, Henri Bal a a Department of Computer Science, Vrije Universiteit

More information

H1 Spring C. A service-oriented architecture is frequently deployed in practice without a service registry

H1 Spring C. A service-oriented architecture is frequently deployed in practice without a service registry 1. (12 points) Identify all of the following statements that are true about the basics of services. A. Screen scraping may not be effective for large desktops but works perfectly on mobile phones, because

More information

Knowledge Representation for the Semantic Web

Knowledge Representation for the Semantic Web Knowledge Representation for the Semantic Web Winter Quarter 2011 Pascal Hitzler Slides 4 01/13/2010 Kno.e.sis Center Wright State University, Dayton, OH http://www.knoesis.org/pascal/ KR4SW Winter 2011

More information

OWLIM Reasoning over FactForge

OWLIM Reasoning over FactForge OWLIM Reasoning over FactForge Barry Bishop, Atanas Kiryakov, Zdravko Tashev, Mariana Damova, Kiril Simov Ontotext AD, 135 Tsarigradsko Chaussee, Sofia 1784, Bulgaria Abstract. In this paper we present

More information

Ontological Modeling: Part 11

Ontological Modeling: Part 11 Ontological Modeling: Part 11 Terry Halpin LogicBlox and INTI International University This is the eleventh in a series of articles on ontology-based approaches to modeling. The main focus is on popular

More information

The Emerging Web of Linked Data

The Emerging Web of Linked Data 4th Berlin Semantic Web Meetup 26. February 2010 The Emerging Web of Linked Data Prof. Dr. Christian Bizer Freie Universität Berlin Outline 1. From a Web of Documents to a Web of Data Web APIs and Linked

More information

Agenda. Introduction. Semantic Web Architectural Overview Motivations / Goals Design Conclusion. Jaya Pradha Avvaru

Agenda. Introduction. Semantic Web Architectural Overview Motivations / Goals Design Conclusion. Jaya Pradha Avvaru Semantic Web for E-Government Services Jaya Pradha Avvaru 91.514, Fall 2002 University of Massachusetts Lowell November 25, 2002 Introduction Agenda Semantic Web Architectural Overview Motivations / Goals

More information

Lecture 1: Introduction and Motivation Markus Kr otzsch Knowledge-Based Systems

Lecture 1: Introduction and Motivation Markus Kr otzsch Knowledge-Based Systems KNOWLEDGE GRAPHS Introduction and Organisation Lecture 1: Introduction and Motivation Markus Kro tzsch Knowledge-Based Systems TU Dresden, 16th Oct 2018 Markus Krötzsch, 16th Oct 2018 Course Tutors Knowledge

More information

Linked Open Europeana: Semantic Leveraging of European Cultural Heritage

Linked Open Europeana: Semantic Leveraging of European Cultural Heritage Linked Open Europeana: Semantic Leveraging of European Cultural Heritage http://www.slideshare.net/gradmans/ Prof. Dr. Stefan Gradmann Humboldt-Universität zu Berlin / School of Library and Information

More information

INF4820 Algorithms for AI and NLP. Evaluating Classifiers Clustering

INF4820 Algorithms for AI and NLP. Evaluating Classifiers Clustering INF4820 Algorithms for AI and NLP Evaluating Classifiers Clustering Erik Velldal & Stephan Oepen Language Technology Group (LTG) September 23, 2015 Agenda Last week Supervised vs unsupervised learning.

More information

Linked Data and RDF. COMP60421 Sean Bechhofer

Linked Data and RDF. COMP60421 Sean Bechhofer Linked Data and RDF COMP60421 Sean Bechhofer sean.bechhofer@manchester.ac.uk Building a Semantic Web Annotation Associating metadata with resources Integration Integrating information sources Inference

More information

A General Approach to Query the Web of Data

A General Approach to Query the Web of Data A General Approach to Query the Web of Data Xin Liu 1 Department of Information Science and Engineering, University of Trento, Trento, Italy liu@disi.unitn.it Abstract. With the development of the Semantic

More information

State of the Art of Semantic Web

State of the Art of Semantic Web State of the Art of Semantic Web Ali Alqazzaz Computer Science and Engineering Department Oakland University Rochester Hills, MI 48307, USA gazzaz86@gmail.com Abstract Semantic web is an attempt to provide

More information

The Semantic Web Revisited. Nigel Shadbolt Tim Berners-Lee Wendy Hall

The Semantic Web Revisited. Nigel Shadbolt Tim Berners-Lee Wendy Hall The Semantic Web Revisited Nigel Shadbolt Tim Berners-Lee Wendy Hall Today sweb It is designed for human consumption Information retrieval is mainly supported by keyword-based search engines Some problems

More information

Entity and Knowledge Base-oriented Information Retrieval

Entity and Knowledge Base-oriented Information Retrieval Entity and Knowledge Base-oriented Information Retrieval Presenter: Liuqing Li liuqing@vt.edu Digital Library Research Laboratory Virginia Polytechnic Institute and State University Blacksburg, VA 24061

More information

ECLT 5810 Evaluation of Classification Quality

ECLT 5810 Evaluation of Classification Quality ECLT 5810 Evaluation of Classification Quality Reference: Data Mining Practical Machine Learning Tools and Techniques, by I. Witten, E. Frank, and M. Hall, Morgan Kaufmann Testing and Error Error rate:

More information

Scaling Parallel Rule-based Reasoning

Scaling Parallel Rule-based Reasoning Scaling Parallel Rule-based Reasoning Martin Peters 1, Christopher Brink 1, Sabine Sachweh 1, and Albert Zündorf 2 1 University of Applied Sciences Dortmund, Germany, Department of Computer Science {martin.peters

More information

Introduction. October 5, Petr Křemen Introduction October 5, / 31

Introduction. October 5, Petr Křemen Introduction October 5, / 31 Introduction Petr Křemen petr.kremen@fel.cvut.cz October 5, 2017 Petr Křemen (petr.kremen@fel.cvut.cz) Introduction October 5, 2017 1 / 31 Outline 1 About Knowledge Management 2 Overview of Ontologies

More information

Presented by: Dimitri Galmanovich. Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Paşca, Warren Shen, Gengxin Miao, Chung Wu

Presented by: Dimitri Galmanovich. Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Paşca, Warren Shen, Gengxin Miao, Chung Wu Presented by: Dimitri Galmanovich Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Paşca, Warren Shen, Gengxin Miao, Chung Wu 1 When looking for Unstructured data 2 Millions of such queries every day

More information

The role of humans in crowdsourced semantics

The role of humans in crowdsourced semantics The role of humans in crowdsourced semantics Elena Simperl, University of Southampton* WIC@WWW2014 *with contributions by Maribel Acosta, KIT 07 April 2014 Crowdsourcing Web semantics: the great challenge

More information

A Formal Framework for Comparing Linked Data Fragments

A Formal Framework for Comparing Linked Data Fragments A Formal Framework for Comparing Linked Data Fragments Olaf Hartig 1, Ian Letter 2, and Jorge Pérez 3 1 Dept. of Computer and Information Science (IDA), Linköping University, Sweden olaf.hartig@liu.se

More information

Data Integration and Structured Search

Data Integration and Structured Search Data Integration and Structured Search RDF a language for linking data URIs are the words of the language. URIs provide global identity independently of application context. Domain Name system provides

More information

Semantic Technology. Chris Welty IBM Research

Semantic Technology. Chris Welty IBM Research Semantic Technology Chris Welty IBM Research What are semantic technologies Dates back to the 60s, 70s, 80s, 90s STRIPS, SNePS,, CG, KL-ONE, NIKL, CLASSIC, LOOM, RACER, etc Today we have standards Common

More information

Making Ontology Documentation with LODE

Making Ontology Documentation with LODE Proceedings of the I-SEMANTICS 2012 Posters & Demonstrations Track, pp. 63-67, 2012. Copyright 2012 for the individual papers by the papers' authors. Copying permitted only for private and academic purposes.

More information

Linking Trust to Data Quality

Linking Trust to Data Quality Linking Trust to Data Quality Davide Ceolin, Valentina Maccatrozzo, and Lora Aroyo VU University Amsterdam de Boelelaan, 1081a 1081HV Amsterdam, The Netherlands {d.ceolin, v.maccatrozzo, lora.aroyo}@vu.nl

More information

AI for Smart Cities Workshop AI*IA th Year Anniversary XIII Conference Turin (Italy), December 5, 2013

AI for Smart Cities Workshop AI*IA th Year Anniversary XIII Conference Turin (Italy), December 5, 2013 Fedelucio Narducci*, Matteo Palmonari*, Giovanni Semeraro *DISCO, University of Milan-Bicocca, Italy Department of Computer Science, University of Bari Aldo Moro, Italy!!"#$%&' "!('' #&&!))'*''''''''''''''''''

More information

The P2 Registry

The P2 Registry The P2 Registry -------------------------------------- Where the Semantic Web and Web 2.0 meet Digital Preservation David Tarrant, Steve Hitchcock & Les Carr davetaz / sh94r / lac @ecs.soton.ac.uk School

More information

CHAPTER 5 OPTIMAL CLUSTER-BASED RETRIEVAL

CHAPTER 5 OPTIMAL CLUSTER-BASED RETRIEVAL 85 CHAPTER 5 OPTIMAL CLUSTER-BASED RETRIEVAL 5.1 INTRODUCTION Document clustering can be applied to improve the retrieval process. Fast and high quality document clustering algorithms play an important

More information

Social Networks and Data Portability using Semantic Web technologies

Social Networks and Data Portability using Semantic Web technologies Social Networks and Data Portability using Semantic Web technologies Uldis Bojārs1, Alexandre Passant2, John Breslin1, Stefan Decker1 1 Digital Enterprise Research Institute, National University of Ireland,

More information

SPARQL. Fausto Giunchiglia and Mattia Fumagallli. University of Trento

SPARQL. Fausto Giunchiglia and Mattia Fumagallli. University of Trento SPARQL Fausto Giunchiglia and Mattia Fumagallli University of Trento Roadmap Introduction Basic query forms SELECT CONSTRUCT ASK DESCRIBE Other clauses and modifiers SPARQL Federated Query Exercises 2

More information

Type inference through the analysis of Wikipedia links

Type inference through the analysis of Wikipedia links Type inference through the analysis of Wikipedia links Andrea Giovanni Nuzzolese nuzzoles@cs.unibo.it Aldo Gangemi aldo.gangemi@cnr.it Valentina Presutti valentina.presutti@cnr.it Paolo Ciancarini ciancarini@cs.unibo.it

More information

Using RDF to Model the Structure and Process of Systems

Using RDF to Model the Structure and Process of Systems Using RDF to Model the Structure and Process of Systems Marko A. Rodriguez Jennifer H. Watkins Johan Bollen Los Alamos National Laboratory {marko,jhw,jbollen}@lanl.gov Carlos Gershenson New England Complex

More information

Man vs. Machine Dierences in SPARQL Queries

Man vs. Machine Dierences in SPARQL Queries Man vs. Machine Dierences in SPARQL Queries Laurens Rietveld 1 and Rinke Hoekstra 1,2 1 Department of Computer Science, VU University Amsterdam, The Netherlands {laurens.rietveld,rinke.hoekstra}@vu.nl

More information

Evaluating DBOWL: A Non-materializing OWL Reasoner based on Relational Database Technology

Evaluating DBOWL: A Non-materializing OWL Reasoner based on Relational Database Technology Evaluating DBOWL: A Non-materializing OWL Reasoner based on Relational Database Technology Maria del Mar Roldan-Garcia, Jose F. Aldana-Montes University of Malaga, Departamento de Lenguajes y Ciencias

More information

Proposal for Implementing Linked Open Data on Libraries Catalogue

Proposal for Implementing Linked Open Data on Libraries Catalogue Submitted on: 16.07.2018 Proposal for Implementing Linked Open Data on Libraries Catalogue Esraa Elsayed Abdelaziz Computer Science, Arab Academy for Science and Technology, Alexandria, Egypt. E-mail address:

More information

EFFICIENT INTEGRATION OF SEMANTIC TECHNOLOGIES FOR PROFESSIONAL IMAGE ANNOTATION AND SEARCH

EFFICIENT INTEGRATION OF SEMANTIC TECHNOLOGIES FOR PROFESSIONAL IMAGE ANNOTATION AND SEARCH EFFICIENT INTEGRATION OF SEMANTIC TECHNOLOGIES FOR PROFESSIONAL IMAGE ANNOTATION AND SEARCH Andreas Walter FZI Forschungszentrum Informatik, Haid-und-Neu-Straße 10-14, 76131 Karlsruhe, Germany, awalter@fzi.de

More information