Graph based semantic annotation for enriching documents with linked data

Size: px
Start display at page:

Download "Graph based semantic annotation for enriching documents with linked data"

Transcription

1 Graph based semantic annotation for enriching documents with linked data Estefanía Otero García Supervisors: Manuel Lama, Juan C. Vidal Centro Singular de Investigación en Tecnoloxías da Información UNIVERSIDADE DE SANTIAGO DE COMPOSTELA citius.usc.es

2 Description of the problem Learning Fruits Digital books with clear and consist text. They have a friendly interface and interactive audiovisual resources: The content is identical to the textbooks, but with links to other content to access to relevant and complementary information. They are based on an XML format that describes the structure of the LF. It facilitates their use by the e- learning applications. 2

3 Description of the problem Open problems PROB1 Get additional and complementary information about terms that: there are no links to complementary web pages or other content not provide explanations. 1oc 1oc PROB2 Search information inside the content of the LF: It is necessary to explore through links to external content and web pages 3

4 Solution ADEGA 1.0 Semantic annotation of the Learning Fruit content Automatically associate the set of relevant terms of the LF to instances of a densely populated ontology Each relevant term is associated with a graph of instances that contains information about the term in the LF context Each instance has a link to an external web page 4

5 Solution: Ontology Required ontology characteristics Represents general purpose encyclopedic knowledge It is populated with a huge number of instances that have a link to a document or web page 5

6 Solution: DBpedia DBpedia content is modeled using an shallow, corss-domain ontology based on standard vocabularies. 529 classes and 2,333 properties The population of the DBpedia ontology takes place extracting data from Wikipedia structured information Infobox templates Information from categories Each DBpedia instance is linked to the Wikipedia page that the information has been extracted 6

7 Solution: DBpedia Statistics (English version) Ontology size Entity number External links 2.46 billion RDF triples 4 million entities Persons 832,000 Films 78,000 Places 639,000 Music Albums 116,000 Organizations 209,000 Video games 18,500 Species 226,000 Diseases 5,600 Images 24,600,000 External web pages 27,600,000 RDF repositories 45,000,000 Wikipedia Categories 67,000,000 YAGO Categories 41,200,000 7

8 State of the art Annotation based on instance GENERATION Description These proposals identify the terms of the document to annotate and automatically create instances in the ontology. This requires: Identify the concept of ontology which the instance belongs. Assign values to the attributes of the instance. Advantage V1 It is no necessary ontologies with huge number of instances. Limitations L1 They require complex language processing techniques, combined with machine learning techniques to detect the concepts that instances and attribute belongs. L2 Apply to a very restricted number of general concepts (people, organizations and places). 8

9 State of the art Annotation based on instance SEARCH Description These proposals identify the terms of the document to annotate and then they search instances that best represents the semantics of a given term in a densely populated ontology. It is need: Apply disambiguation techniques between instances. Use context in searche. Advantages V1 It is not required natural language processing techniques as complex as proposal of the annotation based on instances generation Named Entity Recognition Syntactic and semantic similarity between terms Limitations L1 It is necessary to have an populated ontology with a huge number of instances 9

10 State of the art It does not exist any context based proposal that properly annotates a term with a semantic graph DBpedia Ranker discard relevant taxonomic relations 10

11 Framework ADEGA 1.0 All your Documents Enriched with Graph Annotations It identifies relevant terms that characterize the LF: context Each term is associated with a DBpedia instances Semantic graph is obtained by filtering instances using the context 11

12 Framework: Context ADEGA 1.0 Morphology Analysis Set of nouns, proper nouns and compound nouns extracted from the LF content POS Stanford Pharaoh Ancient Egypt Ramesses II temple Horus God Egypt Cleopatra VII tomb Piest Nile 12

13 Framework: Context ADEGA 1.0 Similarity Analysis Pharaoh Ancient Egypt Ramesses II temple Horus god Egypt Cleopatra VII tomb priest Nile Cluster of terms that are composed by words that share a similar meaning or arise from the same root. SoftTFIDF Hybridization metrics TF-IDF + Jaro-Winkler {Egypt, egyptian, egyptologist} {Cleopatra, Cleopatra VII} {god, gods} 13

14 Framework: Context ADEGA 1.0 {Egypt, egyptian, egyptologist} {Cleopatra, Cleopatra VII} {god, gods} Frequency Analysis Number of times the term appears in the LF fields LF Context Final relevance It is calculated using the frequency weighted by the relevance of each LF field (p α ), 14

15 Framework: Filter graph ADEGA 1.0 CONTEXT EXTRACTION LEVEL Paleolithic node Tebas Necropolis Faraones... ANOTATION Frequency Similarity Luxor forma parte de la antigua ciudad llamada Uaset (en egipcio antiguo), o también conocida como Tebas (en griego), denominada por Homero "La ciudad de las cien puertas", por las numerosas puertas Es la ciudad de los grandes templos del antiguo Egipto (Luxor y Karnak), y de las célebres necrópolis de la ribera... U R I I D E N T I F I C A T I O N #level #nodes , , ,950,620 There are graph nodes that are not relevant to semantically describe the terms of the document Context is used to discriminate the relevant instances for the semantic description of the LF 15

16 Framework: Filter graph ADEGA 1.0 Depth First Search algorithm (depth limited) The exploration determines the relevance of each node, which depends on the relevance of the children nodes that are connected n 1 DBpedia relation weight wr 12 wr 17 URI n 2 n 7 nodes wr wr 24 The node is 23 relevant if it n 3 n 4 exceeds a threshold Frequency Diversity wr 45 wr 46 Text n 5 n 6 nodes 16

17 Validation ADEGA 1.0 Learning Fruits #terms The landscape of the earth 7 The river civilizations of Mesopotamia 13 The landscape of Spain and Europe 10 The Paleolithic and our remote ancestors 10 Ancient Egypt 10 Ancient Egypt Assyrian empyre Babylonia Canary Island Cantabrian mountain Caspian sea Caucasus Cleopatra VII Desert Earth Egypt Enlil Euphrates Pharaoh Fossil Giza God Guadalquivi r Gudea Hammurabi Homo Erectus Homo Habilis Horus Human Ishtar Neanderthal Nile Oceanic climate Osiris Paleolithic Prehistory... 17

18 Validation: Results ADEGA

19 Validation: Comparative ADEGA 1.0 Comparative: ADEGA vs RelFinder RelFinder is set with a exploration depth of 2 levels and the context terms are introduced as the input. F1-score is used as a comparison parameter between ADEGA and RelFinder, using the same number of instances in both algorithms. 19

20 Computational Issues ADEGA 1.0 Each additional level of exploration increments exponentially the number of visited nodes Jump from level to 1implies visiting nodes Exploration results for 1 term Variable Value Averaged nodes visited 248, Average nodes discarded 199, Average nodes processed 48, Average literalsprocessed 44, Average URL processed 4, Average number of SPARQL queries 22, Mean time per query in ms 9.91 Mean time of ADEGA (depht = 3) in ms 376, % discarded nodes 91% text nodes The most costly nodes 67% of computational time was used to query DBpedia x10 terms (avg) = 50 min to obtain a solution 20

21 Publications Journal Publications "Semantic Linking of Learning Object Repositories to Dbpedia. Manuel Lama, Juan Carlos Vidal, Estefanía Otero-García, Alberto Bugarín, and Senén Barro. Educational Technology & Society 15, no. 4 (2012): JCR = Graph-based semantic annotation for enriching documents with linked data. Juan C. Vidal, Manuel Lama, Estefanía Otero-García, Alberto Bugarín. Knowledge-Based Systems (2013) JCR =

22 ADEGA Demo user interface All your Documents Enriched with Graph Annotations 27

23 ADEGA applications UNIVERSIA ANNOTATION Clasification of Universia resources 28

24 ADEGA applications MENTOR EMPRENDE MENTOR EMPRENDE 29

25 Questions?

DBPedia (dbpedia.org)

DBPedia (dbpedia.org) Matt Harbers Databases and the Web April 22 nd, 2011 DBPedia (dbpedia.org) What is it? DBpedia is a community whose goal is to provide a web based open source data set of RDF triples based on Wikipedia

More information

Semantic Web and Natural Language Processing

Semantic Web and Natural Language Processing Semantic Web and Natural Language Processing Wiltrud Kessler Institut für Maschinelle Sprachverarbeitung Universität Stuttgart Semantic Web Winter 2014/2015 This work is licensed under a Creative Commons

More information

Proposal for Implementing Linked Open Data on Libraries Catalogue

Proposal for Implementing Linked Open Data on Libraries Catalogue Submitted on: 16.07.2018 Proposal for Implementing Linked Open Data on Libraries Catalogue Esraa Elsayed Abdelaziz Computer Science, Arab Academy for Science and Technology, Alexandria, Egypt. E-mail address:

More information

Entity and Knowledge Base-oriented Information Retrieval

Entity and Knowledge Base-oriented Information Retrieval Entity and Knowledge Base-oriented Information Retrieval Presenter: Liuqing Li liuqing@vt.edu Digital Library Research Laboratory Virginia Polytechnic Institute and State University Blacksburg, VA 24061

More information

QAKiS: an Open Domain QA System based on Relational Patterns

QAKiS: an Open Domain QA System based on Relational Patterns QAKiS: an Open Domain QA System based on Relational Patterns Elena Cabrio, Julien Cojan, Alessio Palmero Aprosio, Bernardo Magnini, Alberto Lavelli, Fabien Gandon To cite this version: Elena Cabrio, Julien

More information

Programming Technologies for Web Resource Mining

Programming Technologies for Web Resource Mining Programming Technologies for Web Resource Mining SoftLang Team, University of Koblenz-Landau Prof. Dr. Ralf Lämmel Msc. Johannes Härtel Msc. Marcel Heinz Motivation What are interesting web resources??

More information

Semantic Web. Tahani Aljehani

Semantic Web. Tahani Aljehani Semantic Web Tahani Aljehani Motivation: Example 1 You are interested in SOAP Web architecture Use your favorite search engine to find the articles about SOAP Keywords-based search You'll get lots of information,

More information

Bioqueries: A Social Community Sharing Experiences while Querying Biological Linked Data (

Bioqueries: A Social Community Sharing Experiences while Querying Biological Linked Data ( Bioqueries: A Social Community Sharing Experiences while Querying Biological Linked Data (http://bioqueries.uma.es) María Jesús García-Godoy, Ismael Navas-Delgado, José Francisco Aldana Montes Computing

More information

Linked Data Evolving the Web into a Global Data Space

Linked Data Evolving the Web into a Global Data Space Linked Data Evolving the Web into a Global Data Space Anja Jentzsch, Freie Universität Berlin 05 October 2011 EuropeanaTech 2011, Vienna 1 Architecture of the classic Web Single global document space Web

More information

Linked Data in Archives

Linked Data in Archives Linked Data in Archives Publish, Enrich, Refine, Reconcile, Relate Presented 2012-08-23 SAA 2012, Linking Data Across Libraries, Archives, and Museums Corey A Harper Semantic Web TBL s original vision

More information

BioNav: An Ontology-Based Framework to Discover Semantic Links in the Cloud of Linked Data

BioNav: An Ontology-Based Framework to Discover Semantic Links in the Cloud of Linked Data BioNav: An Ontology-Based Framework to Discover Semantic Links in the Cloud of Linked Data María-Esther Vidal 1, Louiqa Raschid 2, Natalia Márquez 1, Jean Carlo Rivera 1, and Edna Ruckhaus 1 1 Universidad

More information

Knowledge Representation in Social Context. CS227 Spring 2011

Knowledge Representation in Social Context. CS227 Spring 2011 7. Knowledge Representation in Social Context CS227 Spring 2011 Outline Vision for Social Machines From Web to Semantic Web Two Use Cases Summary The Beginning g of Social Machines Image credit: http://www.lifehack.org

More information

Graph Exploration: Taking the User into the Loop

Graph Exploration: Taking the User into the Loop Graph Exploration: Taking the User into the Loop Davide Mottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany 2016/10/24 CIKM2016, Indianapolis, US Where we are Background (5

More information

A Semantic Web-Based Approach for Harvesting Multilingual Textual. definitions from Wikipedia to support ICD-11 revision

A Semantic Web-Based Approach for Harvesting Multilingual Textual. definitions from Wikipedia to support ICD-11 revision A Semantic Web-Based Approach for Harvesting Multilingual Textual Definitions from Wikipedia to Support ICD-11 Revision Guoqian Jiang 1,* Harold R. Solbrig 1 and Christopher G. Chute 1 1 Department of

More information

Named Entity Detection and Entity Linking in the Context of Semantic Web

Named Entity Detection and Entity Linking in the Context of Semantic Web [1/52] Concordia Seminar - December 2012 Named Entity Detection and in the Context of Semantic Web Exploring the ambiguity question. Eric Charton, Ph.D. [2/52] Concordia Seminar - December 2012 Challenge

More information

On the use of Abstract Workflows to Capture Scientific Process Provenance

On the use of Abstract Workflows to Capture Scientific Process Provenance On the use of Abstract Workflows to Capture Scientific Process Provenance Paulo Pinheiro da Silva, Leonardo Salayandia, Nicholas Del Rio, Ann Q. Gates The University of Texas at El Paso CENTER OF EXCELLENCE

More information

Building the Multilingual Web of Data. Integrating NLP with Linked Data and RDF using the NLP Interchange Format

Building the Multilingual Web of Data. Integrating NLP with Linked Data and RDF using the NLP Interchange Format Building the Multilingual Web of Data Integrating NLP with Linked Data and RDF using the NLP Interchange Format Presenter name 1 Outline 1. Introduction 2. NIF Basics 3. NIF corpora 4. NIF tools & services

More information

Discovering Names in Linked Data Datasets

Discovering Names in Linked Data Datasets Discovering Names in Linked Data Datasets Bianca Pereira 1, João C. P. da Silva 2, and Adriana S. Vivacqua 1,2 1 Programa de Pós-Graduação em Informática, 2 Departamento de Ciência da Computação Instituto

More information

The German DBpedia: A Sense Repository for Linking Entities

The German DBpedia: A Sense Repository for Linking Entities The German DBpedia: A Sense Repository for Linking Entities Sebastian Hellmann, Claus Stadler, and Jens Lehmann Abstract The modeling of lexico-semantic resources by means of ontologies is an established

More information

DBpedia Data Processing and Integration Tasks in UnifiedViews

DBpedia Data Processing and Integration Tasks in UnifiedViews 1 DBpedia Data Processing and Integration Tasks in Tomas Knap Semantic Web Company Markus Freudenberg Leipzig University Kay Müller Leipzig University 2 Introduction Agenda, Team 3 Agenda Team & Goal An

More information

A service based on Linked Data to classify Web resources using a Knowledge Organisation System

A service based on Linked Data to classify Web resources using a Knowledge Organisation System A service based on Linked Data to classify Web resources using a Knowledge Organisation System A proof of concept in the Open Educational Resources domain Abstract One of the reasons why Web resources

More information

Financial Dataspaces: Challenges, Approaches and Trends

Financial Dataspaces: Challenges, Approaches and Trends Financial Dataspaces: Challenges, Approaches and Trends Finance and Economics on the Semantic Web (FEOSW), ESWC 27 th May, 2012 Seán O Riain ebusiness Copyright 2009. All rights reserved. Motivation Changing

More information

Searching the Deep Web

Searching the Deep Web Searching the Deep Web 1 What is Deep Web? Information accessed only through HTML form pages database queries results embedded in HTML pages Also can included other information on Web can t directly index

More information

DBpedia-An Advancement Towards Content Extraction From Wikipedia

DBpedia-An Advancement Towards Content Extraction From Wikipedia DBpedia-An Advancement Towards Content Extraction From Wikipedia Neha Jain Government Degree College R.S Pura, Jammu, J&K Abstract: DBpedia is the research product of the efforts made towards extracting

More information

Semantic Web Information Management

Semantic Web Information Management Semantic Web Information Management Norberto Fernández ndez Telematics Engineering Department berto@ it.uc3m.es.es 1 Motivation n Module 1: An ontology models a domain of knowledge n Module 2: using the

More information

Domain Specific Semantic Web Search Engine

Domain Specific Semantic Web Search Engine Domain Specific Semantic Web Search Engine KONIDENA KRUPA MANI BALA 1, MADDUKURI SUSMITHA 2, GARRE SOWMYA 3, GARIKIPATI SIRISHA 4, PUPPALA POTHU RAJU 5 1,2,3,4 B.Tech, Computer Science, Vasireddy Venkatadri

More information

URI Disambiguation in the Context of Linked Data

URI Disambiguation in the Context of Linked Data http://dbpedia.org/resource/spain http://dbpedia.org/resource/tim_berners-lee http://acm.rkbexplorer.com/id/resource-p112732 URI Disambiguation in the Context of Linked Data http://sws.geonames.org/2510769

More information

Revealing the Modern History of Japanese Philosophy Using Digitization, Natural Language Processing, and Visualization

Revealing the Modern History of Japanese Philosophy Using Digitization, Natural Language Processing, and Visualization Revealing the Modern History of Japanese Philosophy Using Digitization, Natural Language Katsuya Masuda *, Makoto Tanji **, and Hideki Mima *** Abstract This study proposes a framework to access to the

More information

Using Linked Data and taxonomies to create a quick-start smart thesaurus

Using Linked Data and taxonomies to create a quick-start smart thesaurus 7) MARJORIE HLAVA Using Linked Data and taxonomies to create a quick-start smart thesaurus 1. About the Case Organization The two current applications of this approach are a large scientific publisher

More information

Data.gov Wiki: A Semantic Web Approach to Government Data

Data.gov Wiki: A Semantic Web Approach to Government Data Data.gov Wiki: A Semantic Web Approach to Government Data Li Ding, Dominic DiFranzo, Sarah Magidson, Alvaro Graves, James R. Michaelis, Xian Li, Deborah L. McGuinness, Jim Hendler Tetherless World Constellation

More information

Exploring and Using the Semantic Web

Exploring and Using the Semantic Web Exploring and Using the Semantic Web Mathieu d Aquin KMi, The Open University m.daquin@open.ac.uk What?? Exploring the Semantic Web Vocabularies Ontologies Linked Data RDF documents Example: Exploring

More information

10/10/13. Traditional database system. Information Retrieval. Information Retrieval. Information retrieval system? Information Retrieval Issues

10/10/13. Traditional database system. Information Retrieval. Information Retrieval. Information retrieval system? Information Retrieval Issues COS 597A: Principles of Database and Information Systems Information Retrieval Traditional database system Large integrated collection of data Uniform access/modifcation mechanisms Model of data organization

More information

Linking Thesauri to the Linked Open Data Cloud for Improved Media Retrieval

Linking Thesauri to the Linked Open Data Cloud for Improved Media Retrieval biblio.ugent.be The UGent Institutional Repository is the electronic archiving and dissemination platform for all UGent research publications. Ghent University has implemented a mandate stipulating that

More information

DBpedia Extracting structured data from Wikipedia

DBpedia Extracting structured data from Wikipedia DBpedia Extracting structured data from Wikipedia Anja Jentzsch, Freie Universität Berlin Köln. 24. November 2009 DBpedia DBpedia is a community effort to extract structured information from Wikipedia

More information

University of Rome Tor Vergata DBpedia Manuel Fiorelli

University of Rome Tor Vergata DBpedia Manuel Fiorelli University of Rome Tor Vergata DBpedia Manuel Fiorelli fiorelli@info.uniroma2.it 07/12/2017 2 Notes The following slides contain some examples and pictures taken from: Lehmann, J., Isele, R., Jakob, M.,

More information

The Data Web and Linked Data.

The Data Web and Linked Data. Mustafa Jarrar Lecture Notes, Knowledge Engineering (SCOM7348) University of Birzeit 1 st Semester, 2011 Knowledge Engineering (SCOM7348) The Data Web and Linked Data. Dr. Mustafa Jarrar University of

More information

August 2012 Daejeon, South Korea

August 2012 Daejeon, South Korea Building a Web of Linked Entities (Part I: Overview) Pablo N. Mendes Free University of Berlin August 2012 Daejeon, South Korea Outline Part I A Web of Linked Entities Challenges Progress towards solutions

More information

An aggregation system for cultural heritage content

An aggregation system for cultural heritage content An aggregation system for cultural heritage content Nasos Drosopoulos, Vassilis Tzouvaras, Nikolaos Simou, Anna Christaki, Arne Stabenau, Kostas Pardalis, Fotis Xenikoudakis, Eleni Tsalapati and Stefanos

More information

Contribution of OCLC, LC and IFLA

Contribution of OCLC, LC and IFLA Contribution of OCLC, LC and IFLA in The Structuring of Bibliographic Data and Authorities : A path to Linked Data BY Basma Chebani Head of Cataloging and Metadata Services, AUB Libraries Presented to

More information

Shrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India

Shrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 3 ISSN : 2456-3307 Some Issues in Application of NLP to Intelligent

More information

SPARQL (RDF Query Language)

SPARQL (RDF Query Language) Mustafa Jarrar: Lecture Notes on SPARQL RDF Query Language. Birzeit University, 2019 Version 4 SPARQL (RDF Query Language) Mustafa Jarrar Birzeit University 1 Watch this lecture and download the slides

More information

Constructing a Focused Taxonomy from a Document Collection

Constructing a Focused Taxonomy from a Document Collection Constructing a Focused Taxonomy from a Document Collection Olena Medelyan, Steve Manion, Jeen Broekstra, and Anna Divoli Anna Lan Huang and Ian Witten Why? Why Automatic Generation? Dynamic Fast Cheap

More information

3 Data, Data Mining. Chengkai Li

3 Data, Data Mining. Chengkai Li CSE4334/5334 Data Mining 3 Data, Data Mining Chengkai Li Department of Computer Science and Engineering University of Texas at Arlington Fall 2018 (Slides partly courtesy of Pang-Ning Tan, Michael Steinbach

More information

Jianyong Wang Department of Computer Science and Technology Tsinghua University

Jianyong Wang Department of Computer Science and Technology Tsinghua University Jianyong Wang Department of Computer Science and Technology Tsinghua University jianyong@tsinghua.edu.cn Joint work with Wei Shen (Tsinghua), Ping Luo (HP), and Min Wang (HP) Outline Introduction to entity

More information

Pub/Sub and Semantic Annotation Enablers for Future Internet in the PPP EU Project Test-Bed

Pub/Sub and Semantic Annotation Enablers for Future Internet in the PPP EU Project Test-Bed Activity Context-Aware System Architectures: Papers from the AAAI 2013 Workshop Pub/Sub and Semantic Annotation Enablers for Future Internet in the PPP EU Project Test-Bed Boris Moltchanov 1, Oscar Rodríguez

More information

Orchestrating Music Queries via the Semantic Web

Orchestrating Music Queries via the Semantic Web Orchestrating Music Queries via the Semantic Web Milos Vukicevic, John Galletly American University in Bulgaria Blagoevgrad 2700 Bulgaria +359 73 888 466 milossmi@gmail.com, jgalletly@aubg.bg Abstract

More information

datos.bne.es: a Library Linked Data Dataset

datos.bne.es: a Library Linked Data Dataset Undefined (2012) 1 6 1 IOS Press datos.bne.es: a Library Linked Data Dataset Daniel Vila-Suero a, Boris Villazón-Terrazas a and Asunción Gómez-Pérez a a Ontology Engineering Group, Facultad de Informática,

More information

Using Linked Open Data sources for Entity Disambiguation

Using Linked Open Data sources for Entity Disambiguation Using Linked Open Data sources for Entity Disambiguation Esther Villar Rodríguez OPTIMA Unit TECNALIA esther.villar@tecnalia.com Ana I. Torre Bastida OPTIMA Unit TECNALIA isabel.torre@tecnalia.com Ana

More information

16th International World Wide Web Conference Developers Track, May 11, DBpedia. Querying Wikipedia like a Database

16th International World Wide Web Conference Developers Track, May 11, DBpedia. Querying Wikipedia like a Database 16th International World Wide Web Conference Developers Track, May 11, 2007 DBpedia Querying Wikipedia like a Database Christian Bizer, Freie Universität Berlin Sören Auer, Universität Leipzig Georgi Kobilarov,

More information

Enhanced retrieval using semantic technologies:

Enhanced retrieval using semantic technologies: Enhanced retrieval using semantic technologies: Ontology based retrieval as a new search paradigm? - Considerations based on new projects at the Bavarian State Library Dr. Berthold Gillitzer 28. Mai 2008

More information

Linked Data. Department of Software Enginnering Faculty of Information Technology Czech Technical University in Prague Ivo Lašek, 2011

Linked Data. Department of Software Enginnering Faculty of Information Technology Czech Technical University in Prague Ivo Lašek, 2011 Linked Data Department of Software Enginnering Faculty of Information Technology Czech Technical University in Prague Ivo Lašek, 2011 Semantic Web, MI-SWE, 11/2011, Lecture 9 Evropský sociální fond Praha

More information

The Semantic Web DEFINITIONS & APPLICATIONS

The Semantic Web DEFINITIONS & APPLICATIONS The Semantic Web DEFINITIONS & APPLICATIONS Data on the Web There are more an more data on the Web Government data, health related data, general knowledge, company information, flight information, restaurants,

More information

Europeana update: aspects of the data

Europeana update: aspects of the data Europeana update: aspects of the data Robina Clayphan, Europeana Foundation European Film Gateway Workshop, 30 May 2011, Frankfurt/Main Overview The Europeana Data Model (EDM) Data enrichment activity

More information

Enrichment of Sensor Descriptions and Measurements Using Semantic Technologies. Student: Alexandra Moraru Mentor: Prof. Dr.

Enrichment of Sensor Descriptions and Measurements Using Semantic Technologies. Student: Alexandra Moraru Mentor: Prof. Dr. Enrichment of Sensor Descriptions and Measurements Using Semantic Technologies Student: Alexandra Moraru Mentor: Prof. Dr. Dunja Mladenić Environmental Monitoring automation Traffic Monitoring integration

More information

Linked Data and cultural heritage data: an overview of the approaches from Europeana and The European Library

Linked Data and cultural heritage data: an overview of the approaches from Europeana and The European Library Linked Data and cultural heritage data: an overview of the approaches from Europeana and The European Library Nuno Freire Chief data officer The European Library Pacific Neighbourhood Consortium 2014 Annual

More information

Chapter 16 Linked Data, Ontologies, and DBpedia

Chapter 16 Linked Data, Ontologies, and DBpedia Abstract Chapter 16 Linked Data, Ontologies, and DBpedia Alex Adamec The Semantic Web is a collaborative movement which promotes common data formats on the World Wide Web and aims to convert the currently

More information

Hierarchical Link Analysis for Ranking Web Data

Hierarchical Link Analysis for Ranking Web Data Hierarchical Link Analysis for Ranking Web Data Renaud Delbru, Nickolai Toupikov, Michele Catasta, Giovanni Tummarello, and Stefan Decker Digital Enterprise Research Institute, Galway June 1, 2010 Introduction

More information

Master Project. Various Aspects of Recommender Systems. Prof. Dr. Georg Lausen Dr. Michael Färber Anas Alzoghbi Victor Anthony Arrascue Ayala

Master Project. Various Aspects of Recommender Systems. Prof. Dr. Georg Lausen Dr. Michael Färber Anas Alzoghbi Victor Anthony Arrascue Ayala Master Project Various Aspects of Recommender Systems May 2nd, 2017 Master project SS17 Albert-Ludwigs-Universität Freiburg Prof. Dr. Georg Lausen Dr. Michael Färber Anas Alzoghbi Victor Anthony Arrascue

More information

OWLIM Reasoning over FactForge

OWLIM Reasoning over FactForge OWLIM Reasoning over FactForge Barry Bishop, Atanas Kiryakov, Zdravko Tashev, Mariana Damova, Kiril Simov Ontotext AD, 135 Tsarigradsko Chaussee, Sofia 1784, Bulgaria Abstract. In this paper we present

More information

An Entity Name Systems (ENS) for the [Semantic] Web

An Entity Name Systems (ENS) for the [Semantic] Web An Entity Name Systems (ENS) for the [Semantic] Web Paolo Bouquet University of Trento (Italy) Coordinator of the FP7 OKKAM IP LDOW @ WWW2008 Beijing, 22 April 2008 An ordinary day on the [Semantic] Web

More information

Linked Open Data in Aggregation Scenarios: The Case of The European Library Nuno Freire The European Library

Linked Open Data in Aggregation Scenarios: The Case of The European Library Nuno Freire The European Library Linked Open Data in Aggregation Scenarios: The Case of The European Library Nuno Freire The European Library SWIB14 Semantic Web in Libraries Conference Bonn, December 2014 Outline Introduction to The

More information

INTERCONNECTING AND MANAGING MULTILINGUAL LEXICAL LINKED DATA. Ernesto William De Luca

INTERCONNECTING AND MANAGING MULTILINGUAL LEXICAL LINKED DATA. Ernesto William De Luca INTERCONNECTING AND MANAGING MULTILINGUAL LEXICAL LINKED DATA Ernesto William De Luca Overview 2 Motivation EuroWordNet RDF/OWL EuroWordNet RDF/OWL LexiRes Tool Conclusions Overview 3 Motivation EuroWordNet

More information

Semantic Web Fundamentals

Semantic Web Fundamentals Semantic Web Fundamentals Web Technologies (706.704) 3SSt VU WS 2018/19 with acknowledgements to P. Höfler, V. Pammer, W. Kienreich ISDS, TU Graz January 7 th 2019 Overview What is Semantic Web? Technology

More information

Linked Data and RDF. COMP60421 Sean Bechhofer

Linked Data and RDF. COMP60421 Sean Bechhofer Linked Data and RDF COMP60421 Sean Bechhofer sean.bechhofer@manchester.ac.uk Building a Semantic Web Annotation Associating metadata with resources Integration Integrating information sources Inference

More information

Data integration perspectives from the LTB project

Data integration perspectives from the LTB project Data integration perspectives from the LTB project Michele Pasin Centre for Computing in the Humanities Kings College, London michele.pasin@ kcl.ac.uk SDH-SEMI-2010 Montreal, Canada, June 2010 Summary

More information

LODtagger. Guide for Users and Developers. Bahar Sateli René Witte. Release 1.0 July 24, 2015

LODtagger. Guide for Users and Developers. Bahar Sateli René Witte. Release 1.0 July 24, 2015 LODtagger Guide for Users and Developers Bahar Sateli René Witte Release 1.0 July 24, 2015 Semantic Software Lab Concordia University Montréal, Canada http://www.semanticsoftware.info Contents 1 LODtagger

More information

MIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion

MIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion MIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion Sara Lana-Serrano 1,3, Julio Villena-Román 2,3, José C. González-Cristóbal 1,3 1 Universidad Politécnica de Madrid 2 Universidad

More information

Query Expansion using Wikipedia and DBpedia

Query Expansion using Wikipedia and DBpedia Query Expansion using Wikipedia and DBpedia Nitish Aggarwal and Paul Buitelaar Unit for Natural Language Processing, Digital Enterprise Research Institute, National University of Ireland, Galway firstname.lastname@deri.org

More information

CHAPTER 5 SEARCH ENGINE USING SEMANTIC CONCEPTS

CHAPTER 5 SEARCH ENGINE USING SEMANTIC CONCEPTS 82 CHAPTER 5 SEARCH ENGINE USING SEMANTIC CONCEPTS In recent years, everybody is in thirst of getting information from the internet. Search engines are used to fulfill the need of them. Even though the

More information

LinkedMDB. The first linked data source dedicated to movies

LinkedMDB. The first linked data source dedicated to movies Oktie Hassanzadeh Mariano Consens University of Toronto April 20th, 2009 Madrid, Spain Presentation at the Linked Data On the Web (LDOW) 2009 Workshop LinkedMDB 2 The first linked data source dedicated

More information

Ontology Driven Semantic Search

Ontology Driven Semantic Search Ontology Driven Semantic Search DARIO BONINO, FULVIO CORNO, LAURA FARINETTI, ALESSIO BOSCA Dipartimento di Automatica ed Informatica Politecnico di Torino Corso Duca degli Abruzzi, 10129 Torino ITALY {dario.bonino,

More information

Theme Identification in RDF Graphs

Theme Identification in RDF Graphs Theme Identification in RDF Graphs Hanane Ouksili PRiSM, Univ. Versailles St Quentin, UMR CNRS 8144, Versailles France hanane.ouksili@prism.uvsq.fr Abstract. An increasing number of RDF datasets is published

More information

Ontology Engineering. CSE 595 Semantic Web Instructor: Dr. Paul Fodor Stony Brook University

Ontology Engineering. CSE 595 Semantic Web Instructor: Dr. Paul Fodor Stony Brook University Ontology Engineering CSE 595 Semantic Web Instructor: Dr. Paul Fodor Stony Brook University http://www3.cs.stonybrook.edu/~pfodor/courses/cse595.html Lecture Outline Constructing Ontologies Reusing Existing

More information

Discovering and Maintaining Links on the Web of Data

Discovering and Maintaining Links on the Web of Data International Semantic Web Conference Thursday, October 29, 2009 Discovering and Maintaining Links on the Web of Data Julius Volz, Chemnitz University of Technology Christian Bizer, Freie Universität Berlin

More information

Is Brad Pitt Related to Backstreet Boys? Exploring Related Entities

Is Brad Pitt Related to Backstreet Boys? Exploring Related Entities Is Brad Pitt Related to Backstreet Boys? Exploring Related Entities Nitish Aggarwal, Kartik Asooja, Paul Buitelaar, and Gabriela Vulcu Unit for Natural Language Processing Insight-centre, National University

More information

Searching the Deep Web

Searching the Deep Web Searching the Deep Web 1 What is Deep Web? Information accessed only through HTML form pages database queries results embedded in HTML pages Also can included other information on Web can t directly index

More information

LODtagger. Guide for Users and Developers. Bahar Sateli René Witte. Release 1.1 October 7, 2016

LODtagger. Guide for Users and Developers. Bahar Sateli René Witte. Release 1.1 October 7, 2016 LODtagger Guide for Users and Developers Bahar Sateli René Witte Release 1.1 October 7, 2016 Semantic Software Lab Concordia University Montréal, Canada http://www.semanticsoftware.info Contents 1 LODtagger

More information

Linking library data: contributions and role of subject data. Nuno Freire The European Library

Linking library data: contributions and role of subject data. Nuno Freire The European Library Linking library data: contributions and role of subject data Nuno Freire The European Library Outline Introduction to The European Library Motivation for Linked Library Data The European Library Open Dataset

More information

Imhotep / Stone Age Students / Ayers Rock / Glaciers / Tasmanian Devil / Virginia Beach / Squids (National Geographic School Bulletin, March 11, 1968

Imhotep / Stone Age Students / Ayers Rock / Glaciers / Tasmanian Devil / Virginia Beach / Squids (National Geographic School Bulletin, March 11, 1968 Imhotep / Stone Age Students / Ayers Rock / Glaciers / Tasmanian Devil / Virginia Beach / Squids (National Geographic School Bulletin, March 11,, Number 24) Western Civilization Since 1648 101 > Usilton

More information

Semantic Annotation, Search and Analysis

Semantic Annotation, Search and Analysis Semantic Annotation, Search and Analysis Borislav Popov, Ontotext Ontology A machine readable conceptual model a common vocabulary for sharing information machine-interpretable definitions of concepts in

More information

A simple method to extract abbreviations within a document using regular expressions

A simple method to extract abbreviations within a document using regular expressions A simple method to extract abbreviations within a document using regular expressions Christian Sánchez, Paloma Martínez Computer Science Department, Universidad Carlos III of Madrid Avd. Universidad, 30,

More information

Web Linked Data (RDF, Seman3c Web, Web of Data)

Web Linked Data (RDF, Seman3c Web, Web of Data) Web Linked Data (RDF, Seman3c Web, Web of Data) Graham Klyne e-research Centre, University of Oxford hep://annalist.net My background Involved in RDF/seman3c web/linked data for many years (and through

More information

Challenges for the Multilingual Semantic Web

Challenges for the Multilingual Semantic Web Challenges for the Multilingual Semantic Web Walther v. HAHN Universität Hamburg Arbeitsbereich Natürlichsprachliche Systeme Vogt-Kölln Straße 30 Hamburg, Germany, 22527 vhahn@informatik.uni-hamburg.de

More information

Event Registry. Gregor Leban. Jozef Stefan Institute

Event Registry. Gregor Leban. Jozef Stefan Institute Event Registry Gregor Leban Jozef Stefan Institute Event registry http://eventregistry.org/ Event registry is a system for collecting news content published globally automatic identification of world events

More information

Semantic Web Fundamentals

Semantic Web Fundamentals Semantic Web Fundamentals Web Technologies (706.704) 3SSt VU WS 2017/18 Vedran Sabol with acknowledgements to P. Höfler, V. Pammer, W. Kienreich ISDS, TU Graz December 11 th 2017 Overview What is Semantic

More information

SEXTANT 1. Purpose of the Application

SEXTANT 1. Purpose of the Application SEXTANT 1. Purpose of the Application Sextant has been used in the domains of Earth Observation and Environment by presenting its browsing and visualization capabilities using a number of link geospatial

More information

Supporting i*-based Context Models Construction through the DHARMA Ontology

Supporting i*-based Context Models Construction through the DHARMA Ontology Supporting i*-based Context Models Construction through the DHARMA Ontology Wilson Pérez 1, Karina Abad 1, Juan Pablo Carvallo 2, Xavier Franch 3 1 Universidad de Cuenca (UC), Cuenca, Ecuador 2 Universidad

More information

Chapter 6: Information Retrieval and Web Search. An introduction

Chapter 6: Information Retrieval and Web Search. An introduction Chapter 6: Information Retrieval and Web Search An introduction Introduction n Text mining refers to data mining using text documents as data. n Most text mining tasks use Information Retrieval (IR) methods

More information

ESSLLI 2012 Ontology-based Interpretation of Natural Language. Application: Ontology-based question answering

ESSLLI 2012 Ontology-based Interpretation of Natural Language. Application: Ontology-based question answering ESSLL 2012 Ontology-based nterpretation of Natural Language August 17, 2012 Application: Ontology-based question answering Philipp Cimiano Christina Unger Semantic Computing Group CTEC, Bielefeld University

More information

Semantics at ADS Labs. Rahul Dave, Alberto Accomazzi Sponsored by Microsoft Research and ADS

Semantics at ADS Labs. Rahul Dave, Alberto Accomazzi Sponsored by Microsoft Research and ADS Semantics at ADS Labs Rahul Dave, Alberto Accomazzi Sponsored by Microsoft Research and ADS The Cycle We put in proposals and hope we get grants. We use the grants to support ourselves, students, post-docs,

More information

Interactive Handwritten Text Recognition and Indexing of Historical Documents: the transcriptorum Project

Interactive Handwritten Text Recognition and Indexing of Historical Documents: the transcriptorum Project Interactive Handwritten Text Recognition and ing of Historical Documents: the transcriptorum Project Alejandro H. Toselli ahector@prhlt.upv.es Pattern Recognition and Human Language Technology Reseach

More information

COMP9321 Web Application Engineering

COMP9321 Web Application Engineering COMP9321 Web Application Engineering Semester 2, 2015 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 12 (Wrap-up) http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid=2411

More information

IJCSC Volume 5 Number 1 March-Sep 2014 pp ISSN

IJCSC Volume 5 Number 1 March-Sep 2014 pp ISSN Movie Related Information Retrieval Using Ontology Based Semantic Search Tarjni Vyas, Hetali Tank, Kinjal Shah Nirma University, Ahmedabad tarjni.vyas@nirmauni.ac.in, tank92@gmail.com, shahkinjal92@gmail.com

More information

A Hybrid Neural Model for Type Classification of Entity Mentions

A Hybrid Neural Model for Type Classification of Entity Mentions A Hybrid Neural Model for Type Classification of Entity Mentions Motivation Types group entities to categories Entity types are important for various NLP tasks Our task: predict an entity mention s type

More information

Technische Universität Dresden Fakultät Informatik. Wikidata. A Free Collaborative Knowledge Base. Markus Krötzsch TU Dresden.

Technische Universität Dresden Fakultät Informatik. Wikidata. A Free Collaborative Knowledge Base. Markus Krötzsch TU Dresden. Technische Universität Dresden Fakultät Informatik Wikidata A Free Collaborative Knowledge Base Markus Krötzsch TU Dresden IBM June 2015 Where is Wikipedia Going? Wikipedia in 2015: A project that has

More information

COMP9321 Web Application Engineering

COMP9321 Web Application Engineering COMP9321 Web Application Engineering Semester 1, 2017 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 12 (Wrap-up) http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid=2457

More information

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper.

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper. Semantic Web Company PoolParty - Server PoolParty - Technical White Paper http://www.poolparty.biz Table of Contents Introduction... 3 PoolParty Technical Overview... 3 PoolParty Components Overview...

More information

Boosting QAKiS with multimedia answer visualization

Boosting QAKiS with multimedia answer visualization Boosting QAKiS with multimedia answer visualization Elena Cabrio, Vivek Sachidananda, Raphael Troncy To cite this version: Elena Cabrio, Vivek Sachidananda, Raphael Troncy. Boosting QAKiS with multimedia

More information

Natural Language Processing with PoolParty

Natural Language Processing with PoolParty Natural Language Processing with PoolParty Table of Content Introduction to PoolParty 2 Resolving Language Problems 4 Key Features 5 Entity Extraction and Term Extraction 5 Shadow Concepts 6 Word Sense

More information

Flamenco on the Web. Sergio Oramas

Flamenco on the Web. Sergio Oramas Flamenco on the Web Sergio Oramas Overview Structured vs Unstructured data Flamenco on the Web FlaBase: A Flamenco Music Knowledge Base 3 Data Sources Structured vs. Unstructured 4 Structured Data Sources

More information