YAGO: a Multilingual Knowledge Base from Wikipedia, Wordnet, and Geonames

Size: px
Start display at page:

Download "YAGO: a Multilingual Knowledge Base from Wikipedia, Wordnet, and Geonames"

Transcription

1 1/38 YAGO: a Multilingual Knowledge Base from Wikipedia, Wordnet, and Geonames Fabian Suchanek 1, Johannes Hoffart 2, Joanna Biega 2, Thomas Rebele 1, Erdal Kuzey 2, Gerhard Weikum 2 1 Télécom ParisTech 2 Max Planck Institute for Informatics

2 Plan What is a knowledge base? YAGO Combining WordNet with Wikipedia Multilingual: facts and entities from 10 Wikipedias Time and space Accuracy Applications DBpedia IBM Watson AIDA Semantic Culturomics AMIE Partial completeness assumption Rule mining in knowledge bases Canonicalization of knowledge bases Wikilinks semantification Completeness Conclusion References 2/38

3 What is a knowledge base? 3/38 Mileva Marić married Albert Einstein Image source:

4 What is a knowledge base? 3/38 Mileva Marić married Albert Einstein has advisor Alfred Kleiner Image source:

5 What is a knowledge base? 3/38 Mileva Marić married Albert Einstein has advisor won prize Alfred Kleiner Nobel Prize in Physics Image source:

6 Where do knowledge bases help? 4/38 Applications of knowledge bases question answering semantic search intelligent personal assistants investigative journalism text analysis machine translation Hence, everybody does it Google: Knowledge Graph Amazon: Evi Microsoft: Satori

7 What is YAGO? 5/38 knowledge base with 10 million entities and>210 million facts multilingual facts from 10 languages focus on precision developed by Max-Planck Institute for Informatics and Télécom ParisTech

8 What is YAGO? 6/38 <Albert_Einstein> married advisor won Mileva Nobel Prize Alfred Kleiner

9 What does YAGO know? 7/38 theme name taxonomy-related facts simplified taxonomy main facts GeoNames facts meta-facts multilingual class labels maps to other knowledge bases raw information Wikipedia redirect links and anchor texts Themes of YAGO number of facts 95 m 17 m 55 m 39 m 203 m 787 k 4 m 296 m 471 m

10 What does YAGO know? - Sources 8/38 - Wikipedia - WordNet - GeoNames

11 YAGO 2006 idea: combine entities from Wikipedia with WordNet taxonomy 2007 YAGO: first version [SKW07] 2008 extract facts from infoboxes [SKW08] 2011 YAGO2: temporal and geographical meta facts [Hof+11a]; [Hof+13] 2013 YAGO2s: modular architecture [Suc+13] 2015 YAGO3: entities and facts from 10 languages [MBS15] Fabian Suchanek Johannes Hoffart Joanna Biega Gerhard Weikum Erdal Kuzey F. Mahdisoltani Gjergji Kasneci Thomas Rebele 9/38

12 YAGO - Combining WordNet with Wikipedia <wordnet_person>... rdfs:subclassof rdfs:subclassof <wordnet_physicist> rdfs:subclassof <wikicat_20th-century_physicists> Wordnet taxonomy Wikipedia categories rdf:type Albert Einstein 10/38

13 YAGO - Combining WordNet with Wikipedia Graph Browser w3id.org/yago/svgbrowser 11/38

14 YAGO - Multilingual: facts and entities from 10 Wikipedias 12/38

15 YAGO - Multilingual: facts and entities from 10 Wikipedias intermediate extractors clean facts deduplicate entities and facts check consistency 12/38

16 YAGO - Multilingual: facts and entities from 10 Wikipedias birth_place GEBURTSORT lieu de naissance (Einstein, Ulm) (Hawking, Oxford) (Euler, Basel)... (Einstein, Ulm) (Hawking, Oxford) (Euler, Basel)... (Einstein, Ulm) (Hawking, Oxford) (Euler, Basel)... 13/38

17 YAGO - Multilingual: facts and entities from 10 Wikipedias birth_place GEBURTSORT lieu de naissance (Einstein, Ulm) (Hawking, Oxford) (Euler, Basel)... (Einstein, Ulm) (Hawking, Oxford) (Euler, Basel)... (Einstein, Ulm) (Hawking, Oxford) (Euler, Basel)... P(S 1 S 2 ) > theta GEBURTSORT birth_place lieu de naissance birth_place... 13/38

18 YAGO - Multilingual: facts and entities from 10 Wikipedias YAGO architecture 14/38

19 YAGO - Time and space 15/38 Albert Einstein married Mileva Marić

20 YAGO - Time and space 15/38 Albert Einstein married Mileva Marić <id_tiwmcu_16x_11ovtp>

21 YAGO - Time and space Albert Einstein married Mileva Marić <id_tiwmcu_16x_11ovtp> source < 15/38

22 YAGO - Time and space Albert Einstein married Mileva Marić <id_tiwmcu_16x_11ovtp> source since until 1903-##-## 1919-##-## < 15/38

23 YAGO - Time and space 16/38 SPOTLX Browser w3id.org/yago/spotlx

24 YAGO - Accuracy 17/38 Screenshot of evaluation result 2 months evaluation, 15 participants evaluated 4412 facts of 76 relations (with 60m total facts) 98% facts of the sample were correct Wilson center: 95%, interval width: 4.2%

25 Applications What is a knowledge base? YAGO Combining WordNet with Wikipedia Multilingual: facts and entities from 10 Wikipedias Time and space Accuracy Applications DBpedia IBM Watson AIDA Semantic Culturomics AMIE Partial completeness assumption Rule mining in knowledge bases Canonicalization of knowledge bases Wikilinks semantification Completeness Conclusion References 18/38

26 Applications - DBpedia 19/38 research and community project knowledge base built from Wikipedia imported YAGO s taxonomy

27 Applications - IBM Watson 20/38 participated in Jeopardy (TV quizz show) first place (competitors: human champions) used YAGO s type hierarchy

28 Applications - AIDA 21/38 Named Entity Disambiguation with ADIA, [Hof+11b] github.com/yago-naga/aida

29 Applications - Semantic Culturomics in cooperation with Le Monde Average age of people by occupation, [HBS13] w3id.org/yago/mpi/le-monde 22/38

30 AMIE What is a knowledge base? YAGO Combining WordNet with Wikipedia Multilingual: facts and entities from 10 Wikipedias Time and space Accuracy Applications DBpedia IBM Watson AIDA Semantic Culturomics AMIE Partial completeness assumption Rule mining in knowledge bases Canonicalization of knowledge bases Wikilinks semantification Completeness Conclusion References 23/38

31 AMIE 24/38 AMIE type(x, pope) = diedin(x, Rome) Reference: [Gal+13] [Gal+15] Luis Galárraga Christina Teflioudi Katja Hose Fabian Suchanek

32 AMIE - Partial completeness assumption 25/38 iscitizenof...

33 AMIE - Partial completeness assumption 25/38 false iscitizenof...

34 AMIE - Partial completeness assumption 25/38 false unknown iscitizenof...

35 AMIE - Rule mining in knowledge bases almamater University of Zurich Albert Einstein worksat livedin livedin citizenof advises Alfred Kleiner Switzerland AMIE almamater(z,y), advises(x,z) livedin(x,y) = worksat(x,y) = iscitizenof(x,y) Diagram by Luis Galárraga Image source: 26/38

36 AMIE - Canonicalization of knowledge bases 27/38 graduaded from earned degree from University of Zurich Albert Einstein AMIE graduated from earned degree from Reference: [Gal+14] Diagram by Luis Galárraga Image source:

37 AMIE - Wikilinks semantification linksto won Albert Einstein is Nobel Prize linksto??? is Scientist Max Planck AMIE is(x, Scientist), linksto(x, Nobel Prize) Reference: [GSM15] = won(x, Nobel Prize) Diagram by Luis Galárraga Image source: 28/38

38 AMIE - Completeness 29/38 Albert Einstein married Mileva Marić Image source:

39 AMIE - Completeness Albert Einstein married Mileva Marić married Priscilla Presley? AMIE can mine completeness rules Image source: 29/38

40 AMIE - Completeness 30/38 closed world assumption partial completeness assumption popularity oracle no-change oracle star-pattern oracle class oracle AMIE oracle Reference: [Gal+17] Luis Galárraga Simon Razniewski Antoine Amarilli Fabian Suchanek

41 AMIE - Completeness 30/38 closed world assumption partial completeness assumption popularity oracle no-change oracle star-pattern oracle class oracle AMIE oracle morethan 1 (x, hasparent) = complete(x, hasparent) Reference: [Gal+17] Luis Galárraga Simon Razniewski Antoine Amarilli Fabian Suchanek

42 AMIE - Completeness 31/38 AMIE can predict incompleteness bornin: 100% diedin: 96% directed: 100% graduatedfrom: 87% haschild: 78% ismarriedto: 46%... and more

43 Conclusion What is a knowledge base? YAGO Combining WordNet with Wikipedia Multilingual: facts and entities from 10 Wikipedias Time and space Accuracy Applications DBpedia IBM Watson AIDA Semantic Culturomics AMIE Partial completeness assumption Rule mining in knowledge bases Canonicalization of knowledge bases Wikilinks semantification Completeness Conclusion References 32/38

44 Conclusion 33/38 knowledge base with 10 million entities and>210 million facts combines Wikipedia with Wordnet multilingual facts from 10 languages has facts about time and space contains>95% correct facts available at

45 Conclusion 33/38 knowledge base with 10 million entities and>210 million facts combines Wikipedia with Wordnet multilingual facts from 10 languages has facts about time and space contains>95% correct facts available at Announce: in the following weeks new data release YAGO goes Open Source

46 References What is a knowledge base? YAGO Combining WordNet with Wikipedia Multilingual: facts and entities from 10 Wikipedias Time and space Accuracy Applications DBpedia IBM Watson AIDA Semantic Culturomics AMIE Partial completeness assumption Rule mining in knowledge bases Canonicalization of knowledge bases Wikilinks semantification Completeness Conclusion References 34/38

47 References 35/38 Joanna Biega, Erdal Kuzey, and Fabian M. Suchanek. Inside YAGO2s: A transparent information extraction architecture. In: Proceedings of the 22nd international conference on World Wide Web companion International World Wide Web Conferences Steering Committee, 2013, pp Luis Galárraga, Christina Teflioudi, Katja Hose, and Fabian M. Suchanek. AMIE: association rule mining under incomplete evidence in ontological knowledge bases. In: WWW Luis Galárraga, Geremy Heitz, Kevin Murphy, and Fabian M. Suchanek. Canonicalizing Open Knowledge Bases. In: ACM Press, 2014, pp ISBN: DOI: / Luis Galárraga, Christina Teflioudi, Katja Hose, and Fabian M. Suchanek. Fast Rule Mining in Ontological Knowledge Bases with AMIE+. In: VLDBJ

48 References 36/38 Luis Galárraga, Simon Razniewski, Antoine Amarilli, and Fabian M. Suchanek. Predicting Completeness in Knowledge Bases. In: ACM Press, 2017, pp ISBN: DOI: / Luis Galarraga, Danai Symeonidou, and Jean-Claude Moissinac. Rule Mining for Semantifying Wikilinks. In: Linked Open Data Workshop at WWW Thomas Huet, Joanna Biega, and Fabian M. Suchanek. Mining history with Le Monde. In: Proceedings of the 2013 workshop on Automated knowledge base construction ACM Press, 2013, pp ISBN: DOI: / Johannes Hoffart, Fabian M. Suchanek, Klaus Berberich, and Gerhard Weikum. YAGO2: A Spatially and Temporally Enhanced Knowledge Base from Wikipedia. Research Report. Max-Planck-Institut für Informatik, 2010

49 References 37/38 Johannes Hoffart, Fabian M. Suchanek, Klaus Berberich, Edwin Lewis-Kelham, Gerard De Melo, and Gerhard Weikum. YAGO2: exploring and querying world knowledge in time, space, context, and many languages. In: Proceedings of the 20th international conference companion on World wide web ACM, 2011, pp Johannes Hoffart et al. Robust disambiguation of named entities in text. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing Association for Computational Linguistics, 2011, pp Johannes Hoffart, Fabian M Suchanek, Klaus Berberich, and Gerhard Weikum. YAGO2: A spatially and temporally enhanced knowledge base from Wikipedia. In: Artificial Intelligence 194 (2013) , pp Farzaneh Mahdisoltani, Joanna Biega, and Fabian Suchanek. YAGO3: A knowledge base from multilingual Wikipedias. In: 7th Biennial Conference on Innovative Data Systems Research CIDR 2015, 2015

50 References 38/38 Thomas Rebele, Fabian Suchanek, Johannes Hoffart, Joanna Biega, Erdal Kuzey, and Gerhard Weikum. YAGO: a multilingual knowledge base from Wikipedia, Wordnet, and Geonames. In: ISWC Fabian M. Suchanek, Gjergji Kasneci, and Gerhard Weikum. Yago: a core of semantic knowledge. In: Proceedings of the 16th international conference on World Wide Web ACM, 2007, pp Fabian M. Suchanek, Gjergji Kasneci, and Gerhard Weikum. Yago: A large ontology from wikipedia and wordnet. In: Web Semantics: Science, Services and Agents on the World Wide Web 6.3 (2008) , pp Fabian M. Suchanek, Johannes Hoffart, Erdal Kuzey, and Edwin Lewis-Kelham. YAGO2s: Modular High-Quality Information Extraction with an Application to Flight Planning.. In: BTW. Vol , pp Acknowledgements: Icons designed by Madebyoliver from Flaticon

YAGO: a Multilingual Knowledge Base from Wikipedia, Wordnet, and Geonames

YAGO: a Multilingual Knowledge Base from Wikipedia, Wordnet, and Geonames 1/24 a from Wikipedia, Wordnet, and Geonames 1, Fabian Suchanek 1, Johannes Hoffart 2, Joanna Biega 2, Erdal Kuzey 2, Gerhard Weikum 2 Who uses 1 Télécom ParisTech 2 Max Planck Institute for Informatics

More information

A Hitchhiker s Guide to Ontology

A Hitchhiker s Guide to Ontology A Hitchhiker s Guide to Ontology Fabian M. Suchanek Télécom ParisTech University, Paris, France 2 Fabian M. Suchanek 2003 2005 2008 money freedom permanent 3 Fabian M. Suchanek 2003: BSc in Cognitive Science

More information

A Hitchhiker s guide to Ontology

A Hitchhiker s guide to Ontology A Hitchhiker s guide to Ontology Fabian M. Suchanek Télécom ParisTech University Paris, France >me >intro 2 Fabian M. Suchanek 2003: 2005: 2008: money freedom permanent 3 Fabian M. Suchanek 2003: BSc in

More information

Enriching an Academic Knowledge base using Linked Open Data

Enriching an Academic Knowledge base using Linked Open Data Enriching an Academic Knowledge base using Linked Open Data Chetana Gavankar 1,2 Ashish Kulkarni 1 Yuan Fang Li 3 Ganesh Ramakrishnan 1 (1) IIT Bombay, Mumbai, India (2) IITB-Monash Research Academy, Mumbai,

More information

DB Project. Database Systems Spring 2013

DB Project. Database Systems Spring 2013 DB Project Database Systems Spring 2013 1 Database project YAGO Yet Another Great Ontology 2 About YAGO * A huge semantic knowledge base knowledge base a special kind of DB for knowledge management (e.g.,

More information

YAGO - Yet Another Great Ontology

YAGO - Yet Another Great Ontology YAGO - Yet Another Great Ontology YAGO: A Large Ontology from Wikipedia and WordNet 1 Presentation by: Besnik Fetahu UdS February 22, 2012 1 Fabian M.Suchanek, Gjergji Kasneci, Gerhard Weikum Presentation

More information

Advancing the utility of large knowledge bases

Advancing the utility of large knowledge bases Advancing the utility of large knowledge bases Fabian M. Suchanek Habilitation defense at the Université Pierre et Marie Curie in Paris, France Contributions à l avancement des grandes bases de connaissances

More information

Semantic Suggestions in Information Retrieval

Semantic Suggestions in Information Retrieval The Eight International Conference on Advances in Databases, Knowledge, and Data Applications June 26-30, 2016 - Lisbon, Portugal Semantic Suggestions in Information Retrieval Andreas Schmidt Department

More information

Knowledge Graph Embedding with Numeric Attributes of Entities

Knowledge Graph Embedding with Numeric Attributes of Entities Knowledge Graph Embedding with Numeric Attributes of Entities Yanrong Wu, Zhichun Wang College of Information Science and Technology Beijing Normal University, Beijing 100875, PR. China yrwu@mail.bnu.edu.cn,

More information

Doctorat ParisTech. TELECOM ParisTech. Rule Mining in Knowledge Bases

Doctorat ParisTech. TELECOM ParisTech. Rule Mining in Knowledge Bases 2013-ENST-0050 EDITE - ED 130 Doctorat ParisTech T H È S E pour obtenir le grade de docteur délivré par TELECOM ParisTech Spécialité «Informatique» présentée et soutenue publiquement par Luis Galárraga

More information

Georeferencing Wikipedia pages using language models from Flickr

Georeferencing Wikipedia pages using language models from Flickr Georeferencing Wikipedia pages using language models from Flickr Chris De Rouck 1, Olivier Van Laere 1, Steven Schockaert 2, and Bart Dhoedt 1 1 Department of Information Technology, IBBT, Ghent University,

More information

Cross-lingual Knowledge Graph Alignment via Graph Convolutional Networks

Cross-lingual Knowledge Graph Alignment via Graph Convolutional Networks Cross-lingual Knowledge Graph Alignment via Graph Convolutional Networks Zhichun Wang, Qingsong Lv, Xiaohan Lan, Yu Zhang College of Information Science and Technique Beijing Normal University, Beijing,

More information

On Statistical Characteristics of Real-life Knowledge Graphs

On Statistical Characteristics of Real-life Knowledge Graphs On Statistical Characteristics of Real-life Knowledge Graphs Wenliang Cheng, Chengyu Wang, Bing Xiao, Weining Qian, Aoying Zhou Institute for Data Science and Engineering East China Normal University Shanghai,

More information

AIDArabic A Named-Entity Disambiguation Framework for Arabic Text

AIDArabic A Named-Entity Disambiguation Framework for Arabic Text AIDArabic A Named-Entity Disambiguation Framework for Arabic Text Mohamed Amir Yosef, Marc Spaniol, Gerhard Weikum Max-Planck-Institut für Informatik, Saarbrücken, Germany {mamir mspaniol weikum}@mpi-inf.mpg.de

More information

Information Extraction for Ontology Learning

Information Extraction for Ontology Learning Information Extraction for Ontology Learning Fabian SUCHANEK Max Planck Institute for Informatics, Germany Abstract. In this chapter, we discuss how ontologies can be constructed by extracting information

More information

DBpedia As A Formal Knowledge Base An Evaluation

DBpedia As A Formal Knowledge Base An Evaluation DBpedia As A Formal Knowledge Base An Evaluation TOMASZ BOIŃSKI Gdańsk University of Technology Faculty of Electronics, Telecommunications and Informatics Narutowicza Street 11/12 80-233 Gdańsk POLAND

More information

VICKEY: Mining Conditional Keys on Knowledge Bases

VICKEY: Mining Conditional Keys on Knowledge Bases VICKEY: Mining Conditional Keys on Knowledge Bases Danai Symeonidou 1, Luis Galárraga 2, Nathalie Pernelle 3, Fatiha Saïs 3, Fabian Suchanek 4 1 INRA, France 2 Aalborg University, Denmark 3 LRI, France

More information

Papers for comprehensive viva-voce

Papers for comprehensive viva-voce Papers for comprehensive viva-voce Priya Radhakrishnan Advisor : Dr. Vasudeva Varma Search and Information Extraction Lab, International Institute of Information Technology, Gachibowli, Hyderabad, India

More information

Extracting Wikipedia Historical Attributes Data

Extracting Wikipedia Historical Attributes Data Extracting Wikipedia Historical Attributes Data Guillermo Garrido NLP & IR Group, UNED Madrid, Spain ggarrido@lsi.uned.es Jean-Yves Delort Google Research Zurich Switzerland jydelort@google.com Enrique

More information

Linking Entities in Chinese Queries to Knowledge Graph

Linking Entities in Chinese Queries to Knowledge Graph Linking Entities in Chinese Queries to Knowledge Graph Jun Li 1, Jinxian Pan 2, Chen Ye 1, Yong Huang 1, Danlu Wen 1, and Zhichun Wang 1(B) 1 Beijing Normal University, Beijing, China zcwang@bnu.edu.cn

More information

Question Answering Biographic Information and Social Networks Powered by the Semantic Web

Question Answering Biographic Information and Social Networks Powered by the Semantic Web Question Answering Biographic Information and Social Networks Powered by the Semantic Web Peter Adolphs, Xiwen Cheng, Tina Klüwer, Hans Uszkoreit & Feiyu Xu German Research Center for Artifical Intelligence

More information

Adventure in Data. Gerhard Weikum Max Planck Institute for Informatics and Saarland University

Adventure in Data. Gerhard Weikum Max Planck Institute for Informatics and Saarland University Adventure in Data Gerhard Weikum Max Planck Institute for Informatics and Saarland University http://mpi-inf.mpg.de/~weikum Thanks to a Great Team Fabian Suchanek Gjergji Kasneci Gerard Johannes de Melo

More information

NAGA: Searching and Ranking Knowledge. Gjergji Kasneci, Fabian M. Suchanek, Georgiana Ifrim, Maya Ramanath, and Gerhard Weikum

NAGA: Searching and Ranking Knowledge. Gjergji Kasneci, Fabian M. Suchanek, Georgiana Ifrim, Maya Ramanath, and Gerhard Weikum NAGA: Searching and Ranking Knowledge Gjergji Kasneci, Fabian M. Suchanek, Georgiana Ifrim, Maya Ramanath, and Gerhard Weikum MPI I 2007 5 001 March 2007 Authors Addresses Gjergji Kasneci Max-Planck-Institut

More information

Demand-Weighted Completeness Prediction for a Knowledge Base

Demand-Weighted Completeness Prediction for a Knowledge Base Demand-Weighted Completeness Prediction for a Knowledge Base Andrew Hopkinson and Amit Gurdasani and Dave Palfrey and Arpit Mittal Amazon Research Cambridge Cambridge, UK {hopkia, amitgurd, dpalfrey, mitarpit}@amazon.co.uk

More information

arxiv: v1 [cs.cl] 1 Aug 2017

arxiv: v1 [cs.cl] 1 Aug 2017 Hidekazu Oiwa * 1 Yoshihiko Suhara * 1 Jiyu Komiya 1 Andrei Lopatenko 1 arxiv:1708.00481v1 [cs.cl] 1 Aug 2017 Abstract Entity population, a task of collecting entities that belong to a particular category,

More information

Moving beyond Simple Collections of Facts

Moving beyond Simple Collections of Facts The The Future Future of of Knowledge Knowledge Graphs Graphs:: Moving beyond Moving beyond Simple Collections of Facts Simple Collections of Facts Gerard de Melo Gerard de Melo Assistant Professor, Tsinghua

More information

Mutual Disambiguation for Entity Linking

Mutual Disambiguation for Entity Linking Mutual Disambiguation for Entity Linking Eric Charton Polytechnique Montréal Montréal, QC, Canada eric.charton@polymtl.ca Marie-Jean Meurs Concordia University Montréal, QC, Canada marie-jean.meurs@concordia.ca

More information

A Korean Knowledge Extraction System for Enriching a KBox

A Korean Knowledge Extraction System for Enriching a KBox A Korean Knowledge Extraction System for Enriching a KBox Sangha Nam, Eun-kyung Kim, Jiho Kim, Yoosung Jung, Kijong Han, Key-Sun Choi KAIST / The Republic of Korea {nam.sangha, kekeeo, hogajiho, wjd1004109,

More information

{dhgupta, jannik.stroetgen, Abstract

{dhgupta, jannik.stroetgen, Abstract Dhruv Gupta 1,2, Jannik Strötgen 1, and Klaus Berberich 1,3 1 Max Planck Institute for Informatics, Saarbrücken, Germany 2 Saarbrücken Graduate School of Compute Science, Saarbrücken, Germany 3 htw saar,

More information

A Robust Number Parser based on Conditional Random Fields

A Robust Number Parser based on Conditional Random Fields A Robust Number Parser based on Conditional Random Fields Heiko Paulheim Data and Web Science Group, University of Mannheim, Germany Abstract. When processing information from unstructured sources, numbers

More information

DBpedia-An Advancement Towards Content Extraction From Wikipedia

DBpedia-An Advancement Towards Content Extraction From Wikipedia DBpedia-An Advancement Towards Content Extraction From Wikipedia Neha Jain Government Degree College R.S Pura, Jammu, J&K Abstract: DBpedia is the research product of the efforts made towards extracting

More information

Mining Wikipedia s Snippets Graph: First Step to Build A New Knowledge Base

Mining Wikipedia s Snippets Graph: First Step to Build A New Knowledge Base Mining Wikipedia s Snippets Graph: First Step to Build A New Knowledge Base Andias Wira-Alam and Brigitte Mathiak GESIS - Leibniz-Institute for the Social Sciences Unter Sachsenhausen 6-8, 50667 Köln,

More information

Fusion of Event Stream and Background Knowledge for Semantic-Enabled Complex Event Processing

Fusion of Event Stream and Background Knowledge for Semantic-Enabled Complex Event Processing Fusion of Event Stream and Background Knowledge for Semantic-Enabled Complex Event Processing Challenge Paper Kia Teymourian, Malte Rohde, Ahmad Hasan, and Adrian Paschke Freie Universität Berlin Institute

More information

Computer-assisted Ontology Construction System: Focus on Bootstrapping Capabilities

Computer-assisted Ontology Construction System: Focus on Bootstrapping Capabilities Computer-assisted Ontology Construction System: Focus on Bootstrapping Capabilities Omar Qawasmeh 1, Maxime Lefranois 2, Antoine Zimmermann 2, Pierre Maret 1 1 Univ. Lyon, CNRS, Lab. Hubert Curien UMR

More information

Tools and Infrastructure for Supporting Enterprise Knowledge Graphs

Tools and Infrastructure for Supporting Enterprise Knowledge Graphs Tools and Infrastructure for Supporting Enterprise Knowledge Graphs Sumit Bhatia, Nidhi Rajshree, Anshu Jain, and Nitish Aggarwal IBM Research sumitbhatia@in.ibm.com, {nidhi.rajshree,anshu.n.jain}@us.ibm.com,nitish.aggarwal@ibm.com

More information

A Survey on Information Extraction in Web Searches Using Web Services

A Survey on Information Extraction in Web Searches Using Web Services A Survey on Information Extraction in Web Searches Using Web Services Maind Neelam R., Sunita Nandgave Department of Computer Engineering, G.H.Raisoni College of Engineering and Management, wagholi, India

More information

eresearch A Max Planck Perspective Kurt Mehlhorn Max-Planck-Institut für Informatik Interim Head, Max Planck Digital Library

eresearch A Max Planck Perspective Kurt Mehlhorn Max-Planck-Institut für Informatik Interim Head, Max Planck Digital Library eresearch A Max Planck Perspective Kurt Mehlhorn Max-Planck-Institut für Informatik Interim Head, Max Planck Digital Library 1 Overview The MPG eresearch, a Definition Max Planck Digital Library (MPDL):

More information

Temponym Tagging: Temporal Scopes for Textual Phrases

Temponym Tagging: Temporal Scopes for Textual Phrases Temponym Tagging: Temporal Scopes for Textual Phrases Erdal Kuzey, Jannik Strötgen, Vinay Setty, Gerhard Weikum jannik.stroetgen@mpi-inf.mpg.de TempWeb Montréal, April 12, 2016 Why temporal information

More information

Yago: A Large Ontology from Wikipedia and WordNet. Fabian M. Suchanek, Gjergji Kasneci, Gerhard Weikum

Yago: A Large Ontology from Wikipedia and WordNet. Fabian M. Suchanek, Gjergji Kasneci, Gerhard Weikum Yago: A Large Ontology from Wikipedia and WordNet Fabian M. Suchanek, Gjergji Kasneci, Gerhard Weikum MPI I 2007 5-003 December 2007 Authors Addresses Fabian M. Suchanek Max-Planck-Institute for Computer

More information

Reasoning on Web Data Semantics

Reasoning on Web Data Semantics Reasoning on Web Data Semantics Oui. Peut-on préciser l'heure et le lieu? Merci Marie-Christine Rousset Université de Grenoble (UJF) et Institut Universitaire de France Amicalement Marie-Christine 1 Evolution

More information

A Self-Supervised Approach for Extraction of Attribute-Value Pairs from Wikipedia Articles

A Self-Supervised Approach for Extraction of Attribute-Value Pairs from Wikipedia Articles A Self-Supervised Approach for Extraction of Attribute-Value Pairs from Wikipedia Articles Wladmir C. Brandão 1, Edleno S. Moura 2, Altigran S. Silva 2, and Nivio Ziviani 1 1 Dep. of Computer Science,

More information

Building DBpedia Japanese and Linked Data Cloud in Japanese

Building DBpedia Japanese and Linked Data Cloud in Japanese Building DBpedia Japanese and Linked Data Cloud in Japanese Fumihiro Kato 1,2, Hideaki Takeda 1,3, Seiji Koide 1,2, and Ikki Ohmukai 1,3 1 National Institute of Informatics, 2-1-2, Chiyoda-ku, Tokyo, Japan

More information

DHTK: The Digital Humanities ToolKit

DHTK: The Digital Humanities ToolKit DHTK: The Digital Humanities ToolKit Davide Picca, Mattia Egloff University of Lausanne Abstract. Digital Humanities have the merit of connecting two very different disciplines such as humanities and computer

More information

Leveraging Community-built Knowledge for Type Coercion in Question Answering

Leveraging Community-built Knowledge for Type Coercion in Question Answering Leveraging Community-built Knowledge for Type Coercion in Question Answering Aditya Kalyanpur, J William Murdock, James Fan and Chris Welty IBM Research, 19 Skyline Drive, Hawthorne NY 10532 {adityakal,

More information

arxiv: v1 [cs.dc] 20 Jun 2016

arxiv: v1 [cs.dc] 20 Jun 2016 Debugging OpenStack Problems Using a State Graph Approach Yong Xiang, Hu Li, Sen Wang Tsinghua University {xiangy13, lihu12, wangs12} @mails.tsinghua.edu.cn Wei Xu Tsinghua University weixu@mail.tsinghua.edu.cn

More information

Oleksandr Kuzomin, Bohdan Tkachenko

Oleksandr Kuzomin, Bohdan Tkachenko International Journal "Information Technologies Knowledge" Volume 9, Number 2, 2015 131 INTELLECTUAL SEARCH ENGINE OF ADEQUATE INFORMATION IN INTERNET FOR CREATING DATABASES AND KNOWLEDGE BASES Oleksandr

More information

Building a Large-Scale Cross-Lingual Knowledge Base from Heterogeneous Online Wikis

Building a Large-Scale Cross-Lingual Knowledge Base from Heterogeneous Online Wikis Building a Large-Scale Cross-Lingual Knowledge Base from Heterogeneous Online Wikis Mingyang Li (B), Yao Shi, Zhigang Wang, and Yongbin Liu Department of Computer Science and Technology, Tsinghua University,

More information

Proposal for Implementing Linked Open Data on Libraries Catalogue

Proposal for Implementing Linked Open Data on Libraries Catalogue Submitted on: 16.07.2018 Proposal for Implementing Linked Open Data on Libraries Catalogue Esraa Elsayed Abdelaziz Computer Science, Arab Academy for Science and Technology, Alexandria, Egypt. E-mail address:

More information

YAGO: A Large Ontology from Wikipedia and WordNet

YAGO: A Large Ontology from Wikipedia and WordNet YAGO: A Large Ontology from Wikipedia and WordNet Fabian M. Suchanek, Gjergji Kasneci, Gerhard Weikum Max-Planck-Institute for Computer Science, Saarbruecken, Germany Abstract This article presents YAGO,

More information

Is Brad Pitt Related to Backstreet Boys? Exploring Related Entities

Is Brad Pitt Related to Backstreet Boys? Exploring Related Entities Is Brad Pitt Related to Backstreet Boys? Exploring Related Entities Nitish Aggarwal, Kartik Asooja, Paul Buitelaar, and Gabriela Vulcu Unit for Natural Language Processing Insight-centre, National University

More information

Type inference through the analysis of Wikipedia links

Type inference through the analysis of Wikipedia links Type inference through the analysis of Wikipedia links Andrea Giovanni Nuzzolese nuzzoles@cs.unibo.it Aldo Gangemi aldo.gangemi@cnr.it Valentina Presutti valentina.presutti@cnr.it Paolo Ciancarini ciancarini@cs.unibo.it

More information

Type-Constrained Representation Learning in Knowledge Graphs

Type-Constrained Representation Learning in Knowledge Graphs Type-Constrained Representation Learning in Knowledge Graphs Denis Krompaß 1,2, Stephan Baier 1, and Volker Tresp 1,2 Siemens AG, Corporate Technology, Munich, Germany Denis.Krompass@siemens.com Volker.Tresp@siemens.com

More information

Mapping WordNet Instances to Wikipedia

Mapping WordNet Instances to Wikipedia Mapping WordNet Instances to Wikipedia John P. McCrae Insight Centre for Data Analytics, National University of Ireland Galway Lexical vs. Encyclopedic Yellow (in a dictionary) Is a verb, noun and adjective

More information

TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES

TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES Mu. Annalakshmi Research Scholar, Department of Computer Science, Alagappa University, Karaikudi. annalakshmi_mu@yahoo.co.in Dr. A.

More information

A Scalable Approach to Incrementally Building Knowledge Graphs

A Scalable Approach to Incrementally Building Knowledge Graphs A Scalable Approach to Incrementally Building Knowledge Graphs Gleb Gawriljuk 1, Andreas Harth 1, Craig A. Knoblock 2, and Pedro Szekely 2 1 Institute of Applied Informatics and Formal Description Methods

More information

Mining Rules to Align Knowledge Bases

Mining Rules to Align Knowledge Bases Mining Rules to Align Knowledge Bases Luis Galárraga 1, Nicoleta Preda 2, Fabian M. Suchanek 1 1 Max-Planck Institute for Informatics, Saarbrücken, Germany 2 University of Versailles, PRiSM-CNRS, Versailles,

More information

Query-driven Data Completeness Assessment

Query-driven Data Completeness Assessment Query-driven Data Completeness Assessment Simon Razniewski Free University of Bozen-Bolzano, Italy Part I joint work with Flip Korn, Werner Nutt, Divesh Srivastava Background 2011-2014: PhD (on reasoning

More information

THE amount of Web data has increased exponentially

THE amount of Web data has increased exponentially 1 Entity Linking with a Knowledge Base: Issues, Techniques, and Solutions Wei Shen, Jianyong Wang, Senior Member, IEEE, and Jiawei Han, Fellow, IEEE Abstract The large number of potential applications

More information

RPI INSIDE DEEPQA INTRODUCTION QUESTION ANALYSIS 11/26/2013. Watson is. IBM Watson. Inside Watson RPI WATSON RPI WATSON ??? ??? ???

RPI INSIDE DEEPQA INTRODUCTION QUESTION ANALYSIS 11/26/2013. Watson is. IBM Watson. Inside Watson RPI WATSON RPI WATSON ??? ??? ??? @ INSIDE DEEPQA Managing complex unstructured data with UIMA Simon Ellis INTRODUCTION 22 nd November, 2013 WAT SON TECHNOLOGIES AND OPEN ARCHIT ECT URE QUEST ION ANSWERING PROFESSOR JIM HENDLER S IMON

More information

Annotating Spatio-Temporal Information in Documents

Annotating Spatio-Temporal Information in Documents Annotating Spatio-Temporal Information in Documents Jannik Strötgen University of Heidelberg Institute of Computer Science Database Systems Research Group http://dbs.ifi.uni-heidelberg.de stroetgen@uni-hd.de

More information

Databases & Information Retrieval

Databases & Information Retrieval Databases & Information Retrieval Maya Ramanath (Further Reading: Combining Database and Information-Retrieval Techniques for Knowledge Discovery. G. Weikum, G. Kasneci, M. Ramanath and F.M. Suchanek,

More information

Doctoral Thesis Proposal Learning Semantics of WikiTables

Doctoral Thesis Proposal Learning Semantics of WikiTables Doctoral Thesis Proposal Learning Semantics of WikiTables Chandra Sekhar Bhagavatula Department of Electrical Engineering and Computer Science Northwestern University csbhagav@u.northwestern.edu December

More information

Linking Entities in Short Texts Based on a Chinese Semantic Knowledge Base

Linking Entities in Short Texts Based on a Chinese Semantic Knowledge Base Linking Entities in Short Texts Based on a Chinese Semantic Knowledge Base Yi Zeng, Dongsheng Wang, Tielin Zhang, Hao Wang, and Hongwei Hao Institute of Automation, Chinese Academy of Sciences, Beijing,

More information

CN-DBpedia: A Never-Ending Chinese Knowledge Extraction System

CN-DBpedia: A Never-Ending Chinese Knowledge Extraction System CN-DBpedia: A Never-Ending Chinese Knowledge Extraction System Bo Xu 1,YongXu 1, Jiaqing Liang 1,2, Chenhao Xie 1,2, Bin Liang 1, Wanyun Cui 1, and Yanghua Xiao 1,3(B) 1 Shanghai Key Laboratory of Data

More information

PoS(EGICF12-EMITC2)031

PoS(EGICF12-EMITC2)031 based on Distributed Data Management for German Grid Projects Center for Information Services and High Performance Computing, Technische Universität Dresden E-mail: rene.jaekel@tu-dresden.de Steffen Metzger

More information

Introduction to Text Mining. Hongning Wang

Introduction to Text Mining. Hongning Wang Introduction to Text Mining Hongning Wang CS@UVa Who Am I? Hongning Wang Assistant professor in CS@UVa since August 2014 Research areas Information retrieval Data mining Machine learning CS@UVa CS6501:

More information

A Keyword-based Structured Query Language

A Keyword-based Structured Query Language Expressive and Flexible Access to Web-Extracted Data : A Keyword-based Structured Query Language Department of Computer Science and Engineering Indian Institute of Technology Delhi 22th September 2011

More information

Type inference through the analysis of Wikipedia links

Type inference through the analysis of Wikipedia links Type inference through the analysis of Wikipedia links Andrea Giovanni Nuzzolese ISTC-CNR, STLab CS Dept., University of Bologna, Italy nuzzoles@cs.unibo.it Valentina Presutti ISTC-CNR, Semantic Technology

More information

Discovering Semantic Relations from the Web and Organizing them with PATTY

Discovering Semantic Relations from the Web and Organizing them with PATTY Discovering Semantic Relations from the Web and Organizing them with PATTY Ndapandula Nakashole, Gerhard Weikum, Fabian Suchanek Max Planck Institute for Informatics, Saarbruecken, Germany {nnakasho,weikum,suchanek}@mpi-inf.mpg.de

More information

Semantic Web Systems Introduction Jacques Fleuriot School of Informatics

Semantic Web Systems Introduction Jacques Fleuriot School of Informatics Semantic Web Systems Introduction Jacques Fleuriot School of Informatics 11 th January 2015 Semantic Web Systems: Introduction The World Wide Web 2 Requirements of the WWW l The internet already there

More information

Peter T. Wood. Third Alberto Mendelzon International Workshop on Foundations of Data Management

Peter T. Wood. Third Alberto Mendelzon International Workshop on Foundations of Data Management Languages Languages query School of Computer Science and Information Systems Birkbeck, University of London ptw@dcs.bbk.ac.uk Third Alberto Mendelzon International Workshop on Foundations of Data Management

More information

Semantic Web: Extracting and Mining Structured Data from Unstructured Content

Semantic Web: Extracting and Mining Structured Data from Unstructured Content : Extracting and Mining Structured Data from Unstructured Content Web Science Lecture Besnik Fetahu L3S Research Center, Leibniz Universität Hannover May 20, 2014 1 Introduction 2 3 4 5 6 7 8 1 Introduction

More information

LIDER Survey. Overview. Number of participants: 24. Participant profile (organisation type, industry sector) Relevant use-cases

LIDER Survey. Overview. Number of participants: 24. Participant profile (organisation type, industry sector) Relevant use-cases LIDER Survey Overview Participant profile (organisation type, industry sector) Relevant use-cases Discovering and extracting information Understanding opinion Content and data (Data Management) Monitoring

More information

Real-time population of Knowledge Bases: Opportunities and Challenges. Ndapa Nakashole Gerhard Weikum

Real-time population of Knowledge Bases: Opportunities and Challenges. Ndapa Nakashole Gerhard Weikum Real-time population of Knowledge Bases: Opportunities and Challenges Ndapa Nakashole Gerhard Weikum AKBC Workshop at NAACL 2012 Real-time Data Sources In news and social media, the implicit query is:

More information

Clustering Tweets Containing Ambiguous Named Entities Based on the Co-occurrence of Characteristic Terms

Clustering Tweets Containing Ambiguous Named Entities Based on the Co-occurrence of Characteristic Terms DEIM Forum 2016 C5-5 Clustering Tweets Containing Ambiguous Named Entities Based on the Co-occurrence of Characteristic Terms Maike ERDMANN, Gen HATTORI, Kazunori MATSUMOTO, and Yasuhiro TAKISHIMA KDDI

More information

Enhanced retrieval using semantic technologies:

Enhanced retrieval using semantic technologies: Enhanced retrieval using semantic technologies: Ontology based retrieval as a new search paradigm? - Considerations based on new projects at the Bavarian State Library Dr. Berthold Gillitzer 28. Mai 2008

More information

Personalization of Search on Structured Data

Personalization of Search on Structured Data Universität des Saarlandes, FR Informatik Max-Planck-Institut für Informatik, AG 5 U N S A R I V E R S A V I E I T A S N I S S Personalization of Search on Structured Data Masterarbeit im Fach Informatik

More information

Towards Summarizing the Web of Entities

Towards Summarizing the Web of Entities Towards Summarizing the Web of Entities contributors: August 15, 2012 Thomas Hofmann Director of Engineering Search Ads Quality Zurich, Google Switzerland thofmann@google.com Enrique Alfonseca Yasemin

More information

What are the Important Properties of an Entity?

What are the Important Properties of an Entity? What are the Important Properties of an Entity? Comparing Users and Knowledge Graph Point of View Ahmad Assaf 1, Ghislain A. Atemezing 1, Raphaël Troncy 1 and Elena Cabrio 1,2 1 EURECOM, Sophia Antipolis,

More information

Generating Semantic Aspects for Queries. Dhruv Gupta Klaus Berberich Jannik Strötgen Demetrios Zeinalipour-Yazti

Generating Semantic Aspects for Queries. Dhruv Gupta Klaus Berberich Jannik Strötgen Demetrios Zeinalipour-Yazti Generating Semantic Aspects for Queries Dhruv Gupta Klaus Berberich Jannik Strötgen Demetrios Zeinalipour-Yazti MPI I 2017 5-001 September 2017 Authors Addresses Dhruv Gupta Max-Planck-Institut für Informatik

More information

1.

1. * 390/0/2 : 389/07/20 : 2 25-8223 ( ) 2 25-823 ( ) ISC SCOPUS L ISA http://jist.irandoc.ac.ir 390 22-97 - :. aminnezarat@gmail.com mosavit@pnu.ac.ir : ( ).... 00.. : 390... " ". ( )...2 2. 3. 4 Google..

More information

YAGO: A Core of Semantic Knowledge Unifying WordNet and Wikipedia

YAGO: A Core of Semantic Knowledge Unifying WordNet and Wikipedia YAGO: A Core of Semantic Knowledge Unifying WordNet and Wikipedia Fabian M. Suchanek Max-Planck-Institut Saarbrücken / Germany suchanek@mpii.mpg.de Gjergji Kasneci Max-Planck-Institut Saarbrücken / Germany

More information

Semantic Web Mining. Diana Cerbu

Semantic Web Mining. Diana Cerbu Semantic Web Mining Diana Cerbu Contents Semantic Web Data mining Web mining Content web mining Structure web mining Usage web mining Semantic Web Mining Semantic web "The Semantic Web is a vision: the

More information

DBPedia (dbpedia.org)

DBPedia (dbpedia.org) Matt Harbers Databases and the Web April 22 nd, 2011 DBPedia (dbpedia.org) What is it? DBpedia is a community whose goal is to provide a web based open source data set of RDF triples based on Wikipedia

More information

Text, Knowledge, and Information Extraction. Lizhen Qu

Text, Knowledge, and Information Extraction. Lizhen Qu Text, Knowledge, and Information Extraction Lizhen Qu A bit about Myself PhD: Databases and Information Systems Group (MPII) Advisors: Prof. Gerhard Weikum and Prof. Rainer Gemulla Thesis: Sentiment Analysis

More information

Google indexed 3,3 billion of pages. Google s index contains 8,1 billion of websites

Google indexed 3,3 billion of pages. Google s index contains 8,1 billion of websites Access IT Training 2003 Google indexed 3,3 billion of pages http://searchenginewatch.com/3071371 2005 Google s index contains 8,1 billion of websites http://blog.searchenginewatch.com/050517-075657 Estimated

More information

Entity Set Expansion with Meta Path in Knowledge Graph

Entity Set Expansion with Meta Path in Knowledge Graph Entity Set Expansion with Meta Path in Knowledge Graph Yuyan Zheng 1, Chuan Shi 1,2(B), Xiaohuan Cao 1, Xiaoli Li 3, and Bin Wu 1 1 Beijing Key Lab of Intelligent Telecommunications Software and Multimedia,

More information

SOFIE: A Self-Organizing Framework for Information Extraction. Fabian M. Suchanek, Mauro Sozio, Gerhard Weikum

SOFIE: A Self-Organizing Framework for Information Extraction. Fabian M. Suchanek, Mauro Sozio, Gerhard Weikum SOFIE: A Self-Organizing Framework for Information Extraction Fabian M. Suchanek, Mauro Sozio, Gerhard Weikum MPI I 2008 5-004 November 2008 Authors Addresses Fabian M. Suchanek Max-Planck-Institute for

More information

Adding High-Precision Links to Wikipedia

Adding High-Precision Links to Wikipedia Adding High-Precision Links to Wikipedia Thanapon Noraset Chandra Bhagavatula Doug Downey Department of Electrical Engineering & Computer Science Northwestern University Evanston, IL 60208 {nor csbhagav}@u.northwestern.edu,

More information

WebIsALOD: Providing Hypernymy Relations extracted from the Web as Linked Open Data

WebIsALOD: Providing Hypernymy Relations extracted from the Web as Linked Open Data WebIsALOD: Providing Hypernymy Relations extracted from the Web as Linked Open Data Sven Hertling and Heiko Paulheim Data and Web Science Group, University of Mannheim, Germany {sven,heiko}@informatik.uni-mannheim.de

More information

Web-Scale Extraction of Structured Data

Web-Scale Extraction of Structured Data Web-Scale Extraction of Structured Data Michael J. Cafarella University of Washington mjc@cs.washington.edu Jayant Madhavan Google Inc. jayant@google.com Alon Halevy Google Inc. halevy@google.com ABSTRACT

More information

Text-Mining, Structured Queries, and Knowledge Management on Web Document Corpora

Text-Mining, Structured Queries, and Knowledge Management on Web Document Corpora Text-Mining, Structured Queries, and Knowledge Management on Web Document Corpora Hamid Mousavi CSD, UCLA Los Angeles, CA hmousavi@cs.ucla.edu Maurizio Atzori University of Cagliari Cagliari, Italy atzori@unica.it

More information

Scholarly Big Data: Leverage for Science

Scholarly Big Data: Leverage for Science Scholarly Big Data: Leverage for Science C. Lee Giles The Pennsylvania State University University Park, PA, USA giles@ist.psu.edu http://clgiles.ist.psu.edu Funded in part by NSF, Allen Institute for

More information

PACLIC 30 Proceedings. 3 The Approach. 3.1 Classification based Model

PACLIC 30 Proceedings. 3 The Approach. 3.1 Classification based Model PACLIC 30 Proceedings Automatic Identifying Entity Type in Linked Data Qingliang Miao, Ruiyu Fang, Shuangyong Song, Zhongguang Zheng, Lu Fang, Yao Meng, Jun Sun Fujitsu R&D Center Co., Ltd. Chaoyang District,

More information

Use of graphs and taxonomic classifications to analyze content relationships among courseware

Use of graphs and taxonomic classifications to analyze content relationships among courseware Institute of Computing UNICAMP Use of graphs and taxonomic classifications to analyze content relationships among courseware Márcio de Carvalho Saraiva and Claudia Bauzer Medeiros Background and Motivation

More information

DBpedia Extracting structured data from Wikipedia

DBpedia Extracting structured data from Wikipedia DBpedia Extracting structured data from Wikipedia Anja Jentzsch, Freie Universität Berlin Köln. 24. November 2009 DBpedia DBpedia is a community effort to extract structured information from Wikipedia

More information

Enabling Complex Wikipedia Queries - Technical Report

Enabling Complex Wikipedia Queries - Technical Report Enabling Complex Wikipedia Queries - Technical Report Gilad Katz and Bracha Shapira Department of Information Systems Engineering, Ben-Gurion University of the Negev, Israel Abstract In this technical

More information

Jianyong Wang Department of Computer Science and Technology Tsinghua University

Jianyong Wang Department of Computer Science and Technology Tsinghua University Jianyong Wang Department of Computer Science and Technology Tsinghua University jianyong@tsinghua.edu.cn Joint work with Wei Shen (Tsinghua), Ping Luo (HP), and Min Wang (HP) Outline Introduction to entity

More information

Bc. Pavel Taufer. Named Entity Recognition and Linking

Bc. Pavel Taufer. Named Entity Recognition and Linking MASTER THESIS Bc. Pavel Taufer Named Entity Recognition and Linking Institute of Formal and Applied Linguistics Supervisor of the master thesis: Study programme: Study branch: RNDr. Milan Straka, Ph.D.

More information

From DBpedia to Wikipedia: Filling the Gap by Discovering Wikipedia Conventions

From DBpedia to Wikipedia: Filling the Gap by Discovering Wikipedia Conventions From DBpedia to Wikipedia: Filling the Gap by Discovering Wikipedia Conventions Diego Torres and Pascal Molli LIFIA - Facultad de Informática - UNLP, La Plata, Argentina Email: {diego.torres, alicia.diaz}@lifia.info.unlp.edu.ar

More information