YAGO: a Multilingual Knowledge Base from Wikipedia, Wordnet, and Geonames
|
|
- Cornelius Lindsey
- 6 years ago
- Views:
Transcription
1 1/38 YAGO: a Multilingual Knowledge Base from Wikipedia, Wordnet, and Geonames Fabian Suchanek 1, Johannes Hoffart 2, Joanna Biega 2, Thomas Rebele 1, Erdal Kuzey 2, Gerhard Weikum 2 1 Télécom ParisTech 2 Max Planck Institute for Informatics
2 Plan What is a knowledge base? YAGO Combining WordNet with Wikipedia Multilingual: facts and entities from 10 Wikipedias Time and space Accuracy Applications DBpedia IBM Watson AIDA Semantic Culturomics AMIE Partial completeness assumption Rule mining in knowledge bases Canonicalization of knowledge bases Wikilinks semantification Completeness Conclusion References 2/38
3 What is a knowledge base? 3/38 Mileva Marić married Albert Einstein Image source:
4 What is a knowledge base? 3/38 Mileva Marić married Albert Einstein has advisor Alfred Kleiner Image source:
5 What is a knowledge base? 3/38 Mileva Marić married Albert Einstein has advisor won prize Alfred Kleiner Nobel Prize in Physics Image source:
6 Where do knowledge bases help? 4/38 Applications of knowledge bases question answering semantic search intelligent personal assistants investigative journalism text analysis machine translation Hence, everybody does it Google: Knowledge Graph Amazon: Evi Microsoft: Satori
7 What is YAGO? 5/38 knowledge base with 10 million entities and>210 million facts multilingual facts from 10 languages focus on precision developed by Max-Planck Institute for Informatics and Télécom ParisTech
8 What is YAGO? 6/38 <Albert_Einstein> married advisor won Mileva Nobel Prize Alfred Kleiner
9 What does YAGO know? 7/38 theme name taxonomy-related facts simplified taxonomy main facts GeoNames facts meta-facts multilingual class labels maps to other knowledge bases raw information Wikipedia redirect links and anchor texts Themes of YAGO number of facts 95 m 17 m 55 m 39 m 203 m 787 k 4 m 296 m 471 m
10 What does YAGO know? - Sources 8/38 - Wikipedia - WordNet - GeoNames
11 YAGO 2006 idea: combine entities from Wikipedia with WordNet taxonomy 2007 YAGO: first version [SKW07] 2008 extract facts from infoboxes [SKW08] 2011 YAGO2: temporal and geographical meta facts [Hof+11a]; [Hof+13] 2013 YAGO2s: modular architecture [Suc+13] 2015 YAGO3: entities and facts from 10 languages [MBS15] Fabian Suchanek Johannes Hoffart Joanna Biega Gerhard Weikum Erdal Kuzey F. Mahdisoltani Gjergji Kasneci Thomas Rebele 9/38
12 YAGO - Combining WordNet with Wikipedia <wordnet_person>... rdfs:subclassof rdfs:subclassof <wordnet_physicist> rdfs:subclassof <wikicat_20th-century_physicists> Wordnet taxonomy Wikipedia categories rdf:type Albert Einstein 10/38
13 YAGO - Combining WordNet with Wikipedia Graph Browser w3id.org/yago/svgbrowser 11/38
14 YAGO - Multilingual: facts and entities from 10 Wikipedias 12/38
15 YAGO - Multilingual: facts and entities from 10 Wikipedias intermediate extractors clean facts deduplicate entities and facts check consistency 12/38
16 YAGO - Multilingual: facts and entities from 10 Wikipedias birth_place GEBURTSORT lieu de naissance (Einstein, Ulm) (Hawking, Oxford) (Euler, Basel)... (Einstein, Ulm) (Hawking, Oxford) (Euler, Basel)... (Einstein, Ulm) (Hawking, Oxford) (Euler, Basel)... 13/38
17 YAGO - Multilingual: facts and entities from 10 Wikipedias birth_place GEBURTSORT lieu de naissance (Einstein, Ulm) (Hawking, Oxford) (Euler, Basel)... (Einstein, Ulm) (Hawking, Oxford) (Euler, Basel)... (Einstein, Ulm) (Hawking, Oxford) (Euler, Basel)... P(S 1 S 2 ) > theta GEBURTSORT birth_place lieu de naissance birth_place... 13/38
18 YAGO - Multilingual: facts and entities from 10 Wikipedias YAGO architecture 14/38
19 YAGO - Time and space 15/38 Albert Einstein married Mileva Marić
20 YAGO - Time and space 15/38 Albert Einstein married Mileva Marić <id_tiwmcu_16x_11ovtp>
21 YAGO - Time and space Albert Einstein married Mileva Marić <id_tiwmcu_16x_11ovtp> source < 15/38
22 YAGO - Time and space Albert Einstein married Mileva Marić <id_tiwmcu_16x_11ovtp> source since until 1903-##-## 1919-##-## < 15/38
23 YAGO - Time and space 16/38 SPOTLX Browser w3id.org/yago/spotlx
24 YAGO - Accuracy 17/38 Screenshot of evaluation result 2 months evaluation, 15 participants evaluated 4412 facts of 76 relations (with 60m total facts) 98% facts of the sample were correct Wilson center: 95%, interval width: 4.2%
25 Applications What is a knowledge base? YAGO Combining WordNet with Wikipedia Multilingual: facts and entities from 10 Wikipedias Time and space Accuracy Applications DBpedia IBM Watson AIDA Semantic Culturomics AMIE Partial completeness assumption Rule mining in knowledge bases Canonicalization of knowledge bases Wikilinks semantification Completeness Conclusion References 18/38
26 Applications - DBpedia 19/38 research and community project knowledge base built from Wikipedia imported YAGO s taxonomy
27 Applications - IBM Watson 20/38 participated in Jeopardy (TV quizz show) first place (competitors: human champions) used YAGO s type hierarchy
28 Applications - AIDA 21/38 Named Entity Disambiguation with ADIA, [Hof+11b] github.com/yago-naga/aida
29 Applications - Semantic Culturomics in cooperation with Le Monde Average age of people by occupation, [HBS13] w3id.org/yago/mpi/le-monde 22/38
30 AMIE What is a knowledge base? YAGO Combining WordNet with Wikipedia Multilingual: facts and entities from 10 Wikipedias Time and space Accuracy Applications DBpedia IBM Watson AIDA Semantic Culturomics AMIE Partial completeness assumption Rule mining in knowledge bases Canonicalization of knowledge bases Wikilinks semantification Completeness Conclusion References 23/38
31 AMIE 24/38 AMIE type(x, pope) = diedin(x, Rome) Reference: [Gal+13] [Gal+15] Luis Galárraga Christina Teflioudi Katja Hose Fabian Suchanek
32 AMIE - Partial completeness assumption 25/38 iscitizenof...
33 AMIE - Partial completeness assumption 25/38 false iscitizenof...
34 AMIE - Partial completeness assumption 25/38 false unknown iscitizenof...
35 AMIE - Rule mining in knowledge bases almamater University of Zurich Albert Einstein worksat livedin livedin citizenof advises Alfred Kleiner Switzerland AMIE almamater(z,y), advises(x,z) livedin(x,y) = worksat(x,y) = iscitizenof(x,y) Diagram by Luis Galárraga Image source: 26/38
36 AMIE - Canonicalization of knowledge bases 27/38 graduaded from earned degree from University of Zurich Albert Einstein AMIE graduated from earned degree from Reference: [Gal+14] Diagram by Luis Galárraga Image source:
37 AMIE - Wikilinks semantification linksto won Albert Einstein is Nobel Prize linksto??? is Scientist Max Planck AMIE is(x, Scientist), linksto(x, Nobel Prize) Reference: [GSM15] = won(x, Nobel Prize) Diagram by Luis Galárraga Image source: 28/38
38 AMIE - Completeness 29/38 Albert Einstein married Mileva Marić Image source:
39 AMIE - Completeness Albert Einstein married Mileva Marić married Priscilla Presley? AMIE can mine completeness rules Image source: 29/38
40 AMIE - Completeness 30/38 closed world assumption partial completeness assumption popularity oracle no-change oracle star-pattern oracle class oracle AMIE oracle Reference: [Gal+17] Luis Galárraga Simon Razniewski Antoine Amarilli Fabian Suchanek
41 AMIE - Completeness 30/38 closed world assumption partial completeness assumption popularity oracle no-change oracle star-pattern oracle class oracle AMIE oracle morethan 1 (x, hasparent) = complete(x, hasparent) Reference: [Gal+17] Luis Galárraga Simon Razniewski Antoine Amarilli Fabian Suchanek
42 AMIE - Completeness 31/38 AMIE can predict incompleteness bornin: 100% diedin: 96% directed: 100% graduatedfrom: 87% haschild: 78% ismarriedto: 46%... and more
43 Conclusion What is a knowledge base? YAGO Combining WordNet with Wikipedia Multilingual: facts and entities from 10 Wikipedias Time and space Accuracy Applications DBpedia IBM Watson AIDA Semantic Culturomics AMIE Partial completeness assumption Rule mining in knowledge bases Canonicalization of knowledge bases Wikilinks semantification Completeness Conclusion References 32/38
44 Conclusion 33/38 knowledge base with 10 million entities and>210 million facts combines Wikipedia with Wordnet multilingual facts from 10 languages has facts about time and space contains>95% correct facts available at
45 Conclusion 33/38 knowledge base with 10 million entities and>210 million facts combines Wikipedia with Wordnet multilingual facts from 10 languages has facts about time and space contains>95% correct facts available at Announce: in the following weeks new data release YAGO goes Open Source
46 References What is a knowledge base? YAGO Combining WordNet with Wikipedia Multilingual: facts and entities from 10 Wikipedias Time and space Accuracy Applications DBpedia IBM Watson AIDA Semantic Culturomics AMIE Partial completeness assumption Rule mining in knowledge bases Canonicalization of knowledge bases Wikilinks semantification Completeness Conclusion References 34/38
47 References 35/38 Joanna Biega, Erdal Kuzey, and Fabian M. Suchanek. Inside YAGO2s: A transparent information extraction architecture. In: Proceedings of the 22nd international conference on World Wide Web companion International World Wide Web Conferences Steering Committee, 2013, pp Luis Galárraga, Christina Teflioudi, Katja Hose, and Fabian M. Suchanek. AMIE: association rule mining under incomplete evidence in ontological knowledge bases. In: WWW Luis Galárraga, Geremy Heitz, Kevin Murphy, and Fabian M. Suchanek. Canonicalizing Open Knowledge Bases. In: ACM Press, 2014, pp ISBN: DOI: / Luis Galárraga, Christina Teflioudi, Katja Hose, and Fabian M. Suchanek. Fast Rule Mining in Ontological Knowledge Bases with AMIE+. In: VLDBJ
48 References 36/38 Luis Galárraga, Simon Razniewski, Antoine Amarilli, and Fabian M. Suchanek. Predicting Completeness in Knowledge Bases. In: ACM Press, 2017, pp ISBN: DOI: / Luis Galarraga, Danai Symeonidou, and Jean-Claude Moissinac. Rule Mining for Semantifying Wikilinks. In: Linked Open Data Workshop at WWW Thomas Huet, Joanna Biega, and Fabian M. Suchanek. Mining history with Le Monde. In: Proceedings of the 2013 workshop on Automated knowledge base construction ACM Press, 2013, pp ISBN: DOI: / Johannes Hoffart, Fabian M. Suchanek, Klaus Berberich, and Gerhard Weikum. YAGO2: A Spatially and Temporally Enhanced Knowledge Base from Wikipedia. Research Report. Max-Planck-Institut für Informatik, 2010
49 References 37/38 Johannes Hoffart, Fabian M. Suchanek, Klaus Berberich, Edwin Lewis-Kelham, Gerard De Melo, and Gerhard Weikum. YAGO2: exploring and querying world knowledge in time, space, context, and many languages. In: Proceedings of the 20th international conference companion on World wide web ACM, 2011, pp Johannes Hoffart et al. Robust disambiguation of named entities in text. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing Association for Computational Linguistics, 2011, pp Johannes Hoffart, Fabian M Suchanek, Klaus Berberich, and Gerhard Weikum. YAGO2: A spatially and temporally enhanced knowledge base from Wikipedia. In: Artificial Intelligence 194 (2013) , pp Farzaneh Mahdisoltani, Joanna Biega, and Fabian Suchanek. YAGO3: A knowledge base from multilingual Wikipedias. In: 7th Biennial Conference on Innovative Data Systems Research CIDR 2015, 2015
50 References 38/38 Thomas Rebele, Fabian Suchanek, Johannes Hoffart, Joanna Biega, Erdal Kuzey, and Gerhard Weikum. YAGO: a multilingual knowledge base from Wikipedia, Wordnet, and Geonames. In: ISWC Fabian M. Suchanek, Gjergji Kasneci, and Gerhard Weikum. Yago: a core of semantic knowledge. In: Proceedings of the 16th international conference on World Wide Web ACM, 2007, pp Fabian M. Suchanek, Gjergji Kasneci, and Gerhard Weikum. Yago: A large ontology from wikipedia and wordnet. In: Web Semantics: Science, Services and Agents on the World Wide Web 6.3 (2008) , pp Fabian M. Suchanek, Johannes Hoffart, Erdal Kuzey, and Edwin Lewis-Kelham. YAGO2s: Modular High-Quality Information Extraction with an Application to Flight Planning.. In: BTW. Vol , pp Acknowledgements: Icons designed by Madebyoliver from Flaticon
YAGO: a Multilingual Knowledge Base from Wikipedia, Wordnet, and Geonames
1/24 a from Wikipedia, Wordnet, and Geonames 1, Fabian Suchanek 1, Johannes Hoffart 2, Joanna Biega 2, Erdal Kuzey 2, Gerhard Weikum 2 Who uses 1 Télécom ParisTech 2 Max Planck Institute for Informatics
More informationA Hitchhiker s Guide to Ontology
A Hitchhiker s Guide to Ontology Fabian M. Suchanek Télécom ParisTech University, Paris, France 2 Fabian M. Suchanek 2003 2005 2008 money freedom permanent 3 Fabian M. Suchanek 2003: BSc in Cognitive Science
More informationA Hitchhiker s guide to Ontology
A Hitchhiker s guide to Ontology Fabian M. Suchanek Télécom ParisTech University Paris, France >me >intro 2 Fabian M. Suchanek 2003: 2005: 2008: money freedom permanent 3 Fabian M. Suchanek 2003: BSc in
More informationEnriching an Academic Knowledge base using Linked Open Data
Enriching an Academic Knowledge base using Linked Open Data Chetana Gavankar 1,2 Ashish Kulkarni 1 Yuan Fang Li 3 Ganesh Ramakrishnan 1 (1) IIT Bombay, Mumbai, India (2) IITB-Monash Research Academy, Mumbai,
More informationDB Project. Database Systems Spring 2013
DB Project Database Systems Spring 2013 1 Database project YAGO Yet Another Great Ontology 2 About YAGO * A huge semantic knowledge base knowledge base a special kind of DB for knowledge management (e.g.,
More informationYAGO - Yet Another Great Ontology
YAGO - Yet Another Great Ontology YAGO: A Large Ontology from Wikipedia and WordNet 1 Presentation by: Besnik Fetahu UdS February 22, 2012 1 Fabian M.Suchanek, Gjergji Kasneci, Gerhard Weikum Presentation
More informationAdvancing the utility of large knowledge bases
Advancing the utility of large knowledge bases Fabian M. Suchanek Habilitation defense at the Université Pierre et Marie Curie in Paris, France Contributions à l avancement des grandes bases de connaissances
More informationSemantic Suggestions in Information Retrieval
The Eight International Conference on Advances in Databases, Knowledge, and Data Applications June 26-30, 2016 - Lisbon, Portugal Semantic Suggestions in Information Retrieval Andreas Schmidt Department
More informationKnowledge Graph Embedding with Numeric Attributes of Entities
Knowledge Graph Embedding with Numeric Attributes of Entities Yanrong Wu, Zhichun Wang College of Information Science and Technology Beijing Normal University, Beijing 100875, PR. China yrwu@mail.bnu.edu.cn,
More informationDoctorat ParisTech. TELECOM ParisTech. Rule Mining in Knowledge Bases
2013-ENST-0050 EDITE - ED 130 Doctorat ParisTech T H È S E pour obtenir le grade de docteur délivré par TELECOM ParisTech Spécialité «Informatique» présentée et soutenue publiquement par Luis Galárraga
More informationGeoreferencing Wikipedia pages using language models from Flickr
Georeferencing Wikipedia pages using language models from Flickr Chris De Rouck 1, Olivier Van Laere 1, Steven Schockaert 2, and Bart Dhoedt 1 1 Department of Information Technology, IBBT, Ghent University,
More informationCross-lingual Knowledge Graph Alignment via Graph Convolutional Networks
Cross-lingual Knowledge Graph Alignment via Graph Convolutional Networks Zhichun Wang, Qingsong Lv, Xiaohan Lan, Yu Zhang College of Information Science and Technique Beijing Normal University, Beijing,
More informationOn Statistical Characteristics of Real-life Knowledge Graphs
On Statistical Characteristics of Real-life Knowledge Graphs Wenliang Cheng, Chengyu Wang, Bing Xiao, Weining Qian, Aoying Zhou Institute for Data Science and Engineering East China Normal University Shanghai,
More informationAIDArabic A Named-Entity Disambiguation Framework for Arabic Text
AIDArabic A Named-Entity Disambiguation Framework for Arabic Text Mohamed Amir Yosef, Marc Spaniol, Gerhard Weikum Max-Planck-Institut für Informatik, Saarbrücken, Germany {mamir mspaniol weikum}@mpi-inf.mpg.de
More informationInformation Extraction for Ontology Learning
Information Extraction for Ontology Learning Fabian SUCHANEK Max Planck Institute for Informatics, Germany Abstract. In this chapter, we discuss how ontologies can be constructed by extracting information
More informationDBpedia As A Formal Knowledge Base An Evaluation
DBpedia As A Formal Knowledge Base An Evaluation TOMASZ BOIŃSKI Gdańsk University of Technology Faculty of Electronics, Telecommunications and Informatics Narutowicza Street 11/12 80-233 Gdańsk POLAND
More informationVICKEY: Mining Conditional Keys on Knowledge Bases
VICKEY: Mining Conditional Keys on Knowledge Bases Danai Symeonidou 1, Luis Galárraga 2, Nathalie Pernelle 3, Fatiha Saïs 3, Fabian Suchanek 4 1 INRA, France 2 Aalborg University, Denmark 3 LRI, France
More informationPapers for comprehensive viva-voce
Papers for comprehensive viva-voce Priya Radhakrishnan Advisor : Dr. Vasudeva Varma Search and Information Extraction Lab, International Institute of Information Technology, Gachibowli, Hyderabad, India
More informationExtracting Wikipedia Historical Attributes Data
Extracting Wikipedia Historical Attributes Data Guillermo Garrido NLP & IR Group, UNED Madrid, Spain ggarrido@lsi.uned.es Jean-Yves Delort Google Research Zurich Switzerland jydelort@google.com Enrique
More informationLinking Entities in Chinese Queries to Knowledge Graph
Linking Entities in Chinese Queries to Knowledge Graph Jun Li 1, Jinxian Pan 2, Chen Ye 1, Yong Huang 1, Danlu Wen 1, and Zhichun Wang 1(B) 1 Beijing Normal University, Beijing, China zcwang@bnu.edu.cn
More informationQuestion Answering Biographic Information and Social Networks Powered by the Semantic Web
Question Answering Biographic Information and Social Networks Powered by the Semantic Web Peter Adolphs, Xiwen Cheng, Tina Klüwer, Hans Uszkoreit & Feiyu Xu German Research Center for Artifical Intelligence
More informationAdventure in Data. Gerhard Weikum Max Planck Institute for Informatics and Saarland University
Adventure in Data Gerhard Weikum Max Planck Institute for Informatics and Saarland University http://mpi-inf.mpg.de/~weikum Thanks to a Great Team Fabian Suchanek Gjergji Kasneci Gerard Johannes de Melo
More informationNAGA: Searching and Ranking Knowledge. Gjergji Kasneci, Fabian M. Suchanek, Georgiana Ifrim, Maya Ramanath, and Gerhard Weikum
NAGA: Searching and Ranking Knowledge Gjergji Kasneci, Fabian M. Suchanek, Georgiana Ifrim, Maya Ramanath, and Gerhard Weikum MPI I 2007 5 001 March 2007 Authors Addresses Gjergji Kasneci Max-Planck-Institut
More informationDemand-Weighted Completeness Prediction for a Knowledge Base
Demand-Weighted Completeness Prediction for a Knowledge Base Andrew Hopkinson and Amit Gurdasani and Dave Palfrey and Arpit Mittal Amazon Research Cambridge Cambridge, UK {hopkia, amitgurd, dpalfrey, mitarpit}@amazon.co.uk
More informationarxiv: v1 [cs.cl] 1 Aug 2017
Hidekazu Oiwa * 1 Yoshihiko Suhara * 1 Jiyu Komiya 1 Andrei Lopatenko 1 arxiv:1708.00481v1 [cs.cl] 1 Aug 2017 Abstract Entity population, a task of collecting entities that belong to a particular category,
More informationMoving beyond Simple Collections of Facts
The The Future Future of of Knowledge Knowledge Graphs Graphs:: Moving beyond Moving beyond Simple Collections of Facts Simple Collections of Facts Gerard de Melo Gerard de Melo Assistant Professor, Tsinghua
More informationMutual Disambiguation for Entity Linking
Mutual Disambiguation for Entity Linking Eric Charton Polytechnique Montréal Montréal, QC, Canada eric.charton@polymtl.ca Marie-Jean Meurs Concordia University Montréal, QC, Canada marie-jean.meurs@concordia.ca
More informationA Korean Knowledge Extraction System for Enriching a KBox
A Korean Knowledge Extraction System for Enriching a KBox Sangha Nam, Eun-kyung Kim, Jiho Kim, Yoosung Jung, Kijong Han, Key-Sun Choi KAIST / The Republic of Korea {nam.sangha, kekeeo, hogajiho, wjd1004109,
More information{dhgupta, jannik.stroetgen, Abstract
Dhruv Gupta 1,2, Jannik Strötgen 1, and Klaus Berberich 1,3 1 Max Planck Institute for Informatics, Saarbrücken, Germany 2 Saarbrücken Graduate School of Compute Science, Saarbrücken, Germany 3 htw saar,
More informationA Robust Number Parser based on Conditional Random Fields
A Robust Number Parser based on Conditional Random Fields Heiko Paulheim Data and Web Science Group, University of Mannheim, Germany Abstract. When processing information from unstructured sources, numbers
More informationDBpedia-An Advancement Towards Content Extraction From Wikipedia
DBpedia-An Advancement Towards Content Extraction From Wikipedia Neha Jain Government Degree College R.S Pura, Jammu, J&K Abstract: DBpedia is the research product of the efforts made towards extracting
More informationMining Wikipedia s Snippets Graph: First Step to Build A New Knowledge Base
Mining Wikipedia s Snippets Graph: First Step to Build A New Knowledge Base Andias Wira-Alam and Brigitte Mathiak GESIS - Leibniz-Institute for the Social Sciences Unter Sachsenhausen 6-8, 50667 Köln,
More informationFusion of Event Stream and Background Knowledge for Semantic-Enabled Complex Event Processing
Fusion of Event Stream and Background Knowledge for Semantic-Enabled Complex Event Processing Challenge Paper Kia Teymourian, Malte Rohde, Ahmad Hasan, and Adrian Paschke Freie Universität Berlin Institute
More informationComputer-assisted Ontology Construction System: Focus on Bootstrapping Capabilities
Computer-assisted Ontology Construction System: Focus on Bootstrapping Capabilities Omar Qawasmeh 1, Maxime Lefranois 2, Antoine Zimmermann 2, Pierre Maret 1 1 Univ. Lyon, CNRS, Lab. Hubert Curien UMR
More informationTools and Infrastructure for Supporting Enterprise Knowledge Graphs
Tools and Infrastructure for Supporting Enterprise Knowledge Graphs Sumit Bhatia, Nidhi Rajshree, Anshu Jain, and Nitish Aggarwal IBM Research sumitbhatia@in.ibm.com, {nidhi.rajshree,anshu.n.jain}@us.ibm.com,nitish.aggarwal@ibm.com
More informationA Survey on Information Extraction in Web Searches Using Web Services
A Survey on Information Extraction in Web Searches Using Web Services Maind Neelam R., Sunita Nandgave Department of Computer Engineering, G.H.Raisoni College of Engineering and Management, wagholi, India
More informationeresearch A Max Planck Perspective Kurt Mehlhorn Max-Planck-Institut für Informatik Interim Head, Max Planck Digital Library
eresearch A Max Planck Perspective Kurt Mehlhorn Max-Planck-Institut für Informatik Interim Head, Max Planck Digital Library 1 Overview The MPG eresearch, a Definition Max Planck Digital Library (MPDL):
More informationTemponym Tagging: Temporal Scopes for Textual Phrases
Temponym Tagging: Temporal Scopes for Textual Phrases Erdal Kuzey, Jannik Strötgen, Vinay Setty, Gerhard Weikum jannik.stroetgen@mpi-inf.mpg.de TempWeb Montréal, April 12, 2016 Why temporal information
More informationYago: A Large Ontology from Wikipedia and WordNet. Fabian M. Suchanek, Gjergji Kasneci, Gerhard Weikum
Yago: A Large Ontology from Wikipedia and WordNet Fabian M. Suchanek, Gjergji Kasneci, Gerhard Weikum MPI I 2007 5-003 December 2007 Authors Addresses Fabian M. Suchanek Max-Planck-Institute for Computer
More informationReasoning on Web Data Semantics
Reasoning on Web Data Semantics Oui. Peut-on préciser l'heure et le lieu? Merci Marie-Christine Rousset Université de Grenoble (UJF) et Institut Universitaire de France Amicalement Marie-Christine 1 Evolution
More informationA Self-Supervised Approach for Extraction of Attribute-Value Pairs from Wikipedia Articles
A Self-Supervised Approach for Extraction of Attribute-Value Pairs from Wikipedia Articles Wladmir C. Brandão 1, Edleno S. Moura 2, Altigran S. Silva 2, and Nivio Ziviani 1 1 Dep. of Computer Science,
More informationBuilding DBpedia Japanese and Linked Data Cloud in Japanese
Building DBpedia Japanese and Linked Data Cloud in Japanese Fumihiro Kato 1,2, Hideaki Takeda 1,3, Seiji Koide 1,2, and Ikki Ohmukai 1,3 1 National Institute of Informatics, 2-1-2, Chiyoda-ku, Tokyo, Japan
More informationDHTK: The Digital Humanities ToolKit
DHTK: The Digital Humanities ToolKit Davide Picca, Mattia Egloff University of Lausanne Abstract. Digital Humanities have the merit of connecting two very different disciplines such as humanities and computer
More informationLeveraging Community-built Knowledge for Type Coercion in Question Answering
Leveraging Community-built Knowledge for Type Coercion in Question Answering Aditya Kalyanpur, J William Murdock, James Fan and Chris Welty IBM Research, 19 Skyline Drive, Hawthorne NY 10532 {adityakal,
More informationarxiv: v1 [cs.dc] 20 Jun 2016
Debugging OpenStack Problems Using a State Graph Approach Yong Xiang, Hu Li, Sen Wang Tsinghua University {xiangy13, lihu12, wangs12} @mails.tsinghua.edu.cn Wei Xu Tsinghua University weixu@mail.tsinghua.edu.cn
More informationOleksandr Kuzomin, Bohdan Tkachenko
International Journal "Information Technologies Knowledge" Volume 9, Number 2, 2015 131 INTELLECTUAL SEARCH ENGINE OF ADEQUATE INFORMATION IN INTERNET FOR CREATING DATABASES AND KNOWLEDGE BASES Oleksandr
More informationBuilding a Large-Scale Cross-Lingual Knowledge Base from Heterogeneous Online Wikis
Building a Large-Scale Cross-Lingual Knowledge Base from Heterogeneous Online Wikis Mingyang Li (B), Yao Shi, Zhigang Wang, and Yongbin Liu Department of Computer Science and Technology, Tsinghua University,
More informationProposal for Implementing Linked Open Data on Libraries Catalogue
Submitted on: 16.07.2018 Proposal for Implementing Linked Open Data on Libraries Catalogue Esraa Elsayed Abdelaziz Computer Science, Arab Academy for Science and Technology, Alexandria, Egypt. E-mail address:
More informationYAGO: A Large Ontology from Wikipedia and WordNet
YAGO: A Large Ontology from Wikipedia and WordNet Fabian M. Suchanek, Gjergji Kasneci, Gerhard Weikum Max-Planck-Institute for Computer Science, Saarbruecken, Germany Abstract This article presents YAGO,
More informationIs Brad Pitt Related to Backstreet Boys? Exploring Related Entities
Is Brad Pitt Related to Backstreet Boys? Exploring Related Entities Nitish Aggarwal, Kartik Asooja, Paul Buitelaar, and Gabriela Vulcu Unit for Natural Language Processing Insight-centre, National University
More informationType inference through the analysis of Wikipedia links
Type inference through the analysis of Wikipedia links Andrea Giovanni Nuzzolese nuzzoles@cs.unibo.it Aldo Gangemi aldo.gangemi@cnr.it Valentina Presutti valentina.presutti@cnr.it Paolo Ciancarini ciancarini@cs.unibo.it
More informationType-Constrained Representation Learning in Knowledge Graphs
Type-Constrained Representation Learning in Knowledge Graphs Denis Krompaß 1,2, Stephan Baier 1, and Volker Tresp 1,2 Siemens AG, Corporate Technology, Munich, Germany Denis.Krompass@siemens.com Volker.Tresp@siemens.com
More informationMapping WordNet Instances to Wikipedia
Mapping WordNet Instances to Wikipedia John P. McCrae Insight Centre for Data Analytics, National University of Ireland Galway Lexical vs. Encyclopedic Yellow (in a dictionary) Is a verb, noun and adjective
More informationTERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES
TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES Mu. Annalakshmi Research Scholar, Department of Computer Science, Alagappa University, Karaikudi. annalakshmi_mu@yahoo.co.in Dr. A.
More informationA Scalable Approach to Incrementally Building Knowledge Graphs
A Scalable Approach to Incrementally Building Knowledge Graphs Gleb Gawriljuk 1, Andreas Harth 1, Craig A. Knoblock 2, and Pedro Szekely 2 1 Institute of Applied Informatics and Formal Description Methods
More informationMining Rules to Align Knowledge Bases
Mining Rules to Align Knowledge Bases Luis Galárraga 1, Nicoleta Preda 2, Fabian M. Suchanek 1 1 Max-Planck Institute for Informatics, Saarbrücken, Germany 2 University of Versailles, PRiSM-CNRS, Versailles,
More informationQuery-driven Data Completeness Assessment
Query-driven Data Completeness Assessment Simon Razniewski Free University of Bozen-Bolzano, Italy Part I joint work with Flip Korn, Werner Nutt, Divesh Srivastava Background 2011-2014: PhD (on reasoning
More informationTHE amount of Web data has increased exponentially
1 Entity Linking with a Knowledge Base: Issues, Techniques, and Solutions Wei Shen, Jianyong Wang, Senior Member, IEEE, and Jiawei Han, Fellow, IEEE Abstract The large number of potential applications
More informationRPI INSIDE DEEPQA INTRODUCTION QUESTION ANALYSIS 11/26/2013. Watson is. IBM Watson. Inside Watson RPI WATSON RPI WATSON ??? ??? ???
@ INSIDE DEEPQA Managing complex unstructured data with UIMA Simon Ellis INTRODUCTION 22 nd November, 2013 WAT SON TECHNOLOGIES AND OPEN ARCHIT ECT URE QUEST ION ANSWERING PROFESSOR JIM HENDLER S IMON
More informationAnnotating Spatio-Temporal Information in Documents
Annotating Spatio-Temporal Information in Documents Jannik Strötgen University of Heidelberg Institute of Computer Science Database Systems Research Group http://dbs.ifi.uni-heidelberg.de stroetgen@uni-hd.de
More informationDatabases & Information Retrieval
Databases & Information Retrieval Maya Ramanath (Further Reading: Combining Database and Information-Retrieval Techniques for Knowledge Discovery. G. Weikum, G. Kasneci, M. Ramanath and F.M. Suchanek,
More informationDoctoral Thesis Proposal Learning Semantics of WikiTables
Doctoral Thesis Proposal Learning Semantics of WikiTables Chandra Sekhar Bhagavatula Department of Electrical Engineering and Computer Science Northwestern University csbhagav@u.northwestern.edu December
More informationLinking Entities in Short Texts Based on a Chinese Semantic Knowledge Base
Linking Entities in Short Texts Based on a Chinese Semantic Knowledge Base Yi Zeng, Dongsheng Wang, Tielin Zhang, Hao Wang, and Hongwei Hao Institute of Automation, Chinese Academy of Sciences, Beijing,
More informationCN-DBpedia: A Never-Ending Chinese Knowledge Extraction System
CN-DBpedia: A Never-Ending Chinese Knowledge Extraction System Bo Xu 1,YongXu 1, Jiaqing Liang 1,2, Chenhao Xie 1,2, Bin Liang 1, Wanyun Cui 1, and Yanghua Xiao 1,3(B) 1 Shanghai Key Laboratory of Data
More informationPoS(EGICF12-EMITC2)031
based on Distributed Data Management for German Grid Projects Center for Information Services and High Performance Computing, Technische Universität Dresden E-mail: rene.jaekel@tu-dresden.de Steffen Metzger
More informationIntroduction to Text Mining. Hongning Wang
Introduction to Text Mining Hongning Wang CS@UVa Who Am I? Hongning Wang Assistant professor in CS@UVa since August 2014 Research areas Information retrieval Data mining Machine learning CS@UVa CS6501:
More informationA Keyword-based Structured Query Language
Expressive and Flexible Access to Web-Extracted Data : A Keyword-based Structured Query Language Department of Computer Science and Engineering Indian Institute of Technology Delhi 22th September 2011
More informationType inference through the analysis of Wikipedia links
Type inference through the analysis of Wikipedia links Andrea Giovanni Nuzzolese ISTC-CNR, STLab CS Dept., University of Bologna, Italy nuzzoles@cs.unibo.it Valentina Presutti ISTC-CNR, Semantic Technology
More informationDiscovering Semantic Relations from the Web and Organizing them with PATTY
Discovering Semantic Relations from the Web and Organizing them with PATTY Ndapandula Nakashole, Gerhard Weikum, Fabian Suchanek Max Planck Institute for Informatics, Saarbruecken, Germany {nnakasho,weikum,suchanek}@mpi-inf.mpg.de
More informationSemantic Web Systems Introduction Jacques Fleuriot School of Informatics
Semantic Web Systems Introduction Jacques Fleuriot School of Informatics 11 th January 2015 Semantic Web Systems: Introduction The World Wide Web 2 Requirements of the WWW l The internet already there
More informationPeter T. Wood. Third Alberto Mendelzon International Workshop on Foundations of Data Management
Languages Languages query School of Computer Science and Information Systems Birkbeck, University of London ptw@dcs.bbk.ac.uk Third Alberto Mendelzon International Workshop on Foundations of Data Management
More informationSemantic Web: Extracting and Mining Structured Data from Unstructured Content
: Extracting and Mining Structured Data from Unstructured Content Web Science Lecture Besnik Fetahu L3S Research Center, Leibniz Universität Hannover May 20, 2014 1 Introduction 2 3 4 5 6 7 8 1 Introduction
More informationLIDER Survey. Overview. Number of participants: 24. Participant profile (organisation type, industry sector) Relevant use-cases
LIDER Survey Overview Participant profile (organisation type, industry sector) Relevant use-cases Discovering and extracting information Understanding opinion Content and data (Data Management) Monitoring
More informationReal-time population of Knowledge Bases: Opportunities and Challenges. Ndapa Nakashole Gerhard Weikum
Real-time population of Knowledge Bases: Opportunities and Challenges Ndapa Nakashole Gerhard Weikum AKBC Workshop at NAACL 2012 Real-time Data Sources In news and social media, the implicit query is:
More informationClustering Tweets Containing Ambiguous Named Entities Based on the Co-occurrence of Characteristic Terms
DEIM Forum 2016 C5-5 Clustering Tweets Containing Ambiguous Named Entities Based on the Co-occurrence of Characteristic Terms Maike ERDMANN, Gen HATTORI, Kazunori MATSUMOTO, and Yasuhiro TAKISHIMA KDDI
More informationEnhanced retrieval using semantic technologies:
Enhanced retrieval using semantic technologies: Ontology based retrieval as a new search paradigm? - Considerations based on new projects at the Bavarian State Library Dr. Berthold Gillitzer 28. Mai 2008
More informationPersonalization of Search on Structured Data
Universität des Saarlandes, FR Informatik Max-Planck-Institut für Informatik, AG 5 U N S A R I V E R S A V I E I T A S N I S S Personalization of Search on Structured Data Masterarbeit im Fach Informatik
More informationTowards Summarizing the Web of Entities
Towards Summarizing the Web of Entities contributors: August 15, 2012 Thomas Hofmann Director of Engineering Search Ads Quality Zurich, Google Switzerland thofmann@google.com Enrique Alfonseca Yasemin
More informationWhat are the Important Properties of an Entity?
What are the Important Properties of an Entity? Comparing Users and Knowledge Graph Point of View Ahmad Assaf 1, Ghislain A. Atemezing 1, Raphaël Troncy 1 and Elena Cabrio 1,2 1 EURECOM, Sophia Antipolis,
More informationGenerating Semantic Aspects for Queries. Dhruv Gupta Klaus Berberich Jannik Strötgen Demetrios Zeinalipour-Yazti
Generating Semantic Aspects for Queries Dhruv Gupta Klaus Berberich Jannik Strötgen Demetrios Zeinalipour-Yazti MPI I 2017 5-001 September 2017 Authors Addresses Dhruv Gupta Max-Planck-Institut für Informatik
More information1.
* 390/0/2 : 389/07/20 : 2 25-8223 ( ) 2 25-823 ( ) ISC SCOPUS L ISA http://jist.irandoc.ac.ir 390 22-97 - :. aminnezarat@gmail.com mosavit@pnu.ac.ir : ( ).... 00.. : 390... " ". ( )...2 2. 3. 4 Google..
More informationYAGO: A Core of Semantic Knowledge Unifying WordNet and Wikipedia
YAGO: A Core of Semantic Knowledge Unifying WordNet and Wikipedia Fabian M. Suchanek Max-Planck-Institut Saarbrücken / Germany suchanek@mpii.mpg.de Gjergji Kasneci Max-Planck-Institut Saarbrücken / Germany
More informationSemantic Web Mining. Diana Cerbu
Semantic Web Mining Diana Cerbu Contents Semantic Web Data mining Web mining Content web mining Structure web mining Usage web mining Semantic Web Mining Semantic web "The Semantic Web is a vision: the
More informationDBPedia (dbpedia.org)
Matt Harbers Databases and the Web April 22 nd, 2011 DBPedia (dbpedia.org) What is it? DBpedia is a community whose goal is to provide a web based open source data set of RDF triples based on Wikipedia
More informationText, Knowledge, and Information Extraction. Lizhen Qu
Text, Knowledge, and Information Extraction Lizhen Qu A bit about Myself PhD: Databases and Information Systems Group (MPII) Advisors: Prof. Gerhard Weikum and Prof. Rainer Gemulla Thesis: Sentiment Analysis
More informationGoogle indexed 3,3 billion of pages. Google s index contains 8,1 billion of websites
Access IT Training 2003 Google indexed 3,3 billion of pages http://searchenginewatch.com/3071371 2005 Google s index contains 8,1 billion of websites http://blog.searchenginewatch.com/050517-075657 Estimated
More informationEntity Set Expansion with Meta Path in Knowledge Graph
Entity Set Expansion with Meta Path in Knowledge Graph Yuyan Zheng 1, Chuan Shi 1,2(B), Xiaohuan Cao 1, Xiaoli Li 3, and Bin Wu 1 1 Beijing Key Lab of Intelligent Telecommunications Software and Multimedia,
More informationSOFIE: A Self-Organizing Framework for Information Extraction. Fabian M. Suchanek, Mauro Sozio, Gerhard Weikum
SOFIE: A Self-Organizing Framework for Information Extraction Fabian M. Suchanek, Mauro Sozio, Gerhard Weikum MPI I 2008 5-004 November 2008 Authors Addresses Fabian M. Suchanek Max-Planck-Institute for
More informationAdding High-Precision Links to Wikipedia
Adding High-Precision Links to Wikipedia Thanapon Noraset Chandra Bhagavatula Doug Downey Department of Electrical Engineering & Computer Science Northwestern University Evanston, IL 60208 {nor csbhagav}@u.northwestern.edu,
More informationWebIsALOD: Providing Hypernymy Relations extracted from the Web as Linked Open Data
WebIsALOD: Providing Hypernymy Relations extracted from the Web as Linked Open Data Sven Hertling and Heiko Paulheim Data and Web Science Group, University of Mannheim, Germany {sven,heiko}@informatik.uni-mannheim.de
More informationWeb-Scale Extraction of Structured Data
Web-Scale Extraction of Structured Data Michael J. Cafarella University of Washington mjc@cs.washington.edu Jayant Madhavan Google Inc. jayant@google.com Alon Halevy Google Inc. halevy@google.com ABSTRACT
More informationText-Mining, Structured Queries, and Knowledge Management on Web Document Corpora
Text-Mining, Structured Queries, and Knowledge Management on Web Document Corpora Hamid Mousavi CSD, UCLA Los Angeles, CA hmousavi@cs.ucla.edu Maurizio Atzori University of Cagliari Cagliari, Italy atzori@unica.it
More informationScholarly Big Data: Leverage for Science
Scholarly Big Data: Leverage for Science C. Lee Giles The Pennsylvania State University University Park, PA, USA giles@ist.psu.edu http://clgiles.ist.psu.edu Funded in part by NSF, Allen Institute for
More informationPACLIC 30 Proceedings. 3 The Approach. 3.1 Classification based Model
PACLIC 30 Proceedings Automatic Identifying Entity Type in Linked Data Qingliang Miao, Ruiyu Fang, Shuangyong Song, Zhongguang Zheng, Lu Fang, Yao Meng, Jun Sun Fujitsu R&D Center Co., Ltd. Chaoyang District,
More informationUse of graphs and taxonomic classifications to analyze content relationships among courseware
Institute of Computing UNICAMP Use of graphs and taxonomic classifications to analyze content relationships among courseware Márcio de Carvalho Saraiva and Claudia Bauzer Medeiros Background and Motivation
More informationDBpedia Extracting structured data from Wikipedia
DBpedia Extracting structured data from Wikipedia Anja Jentzsch, Freie Universität Berlin Köln. 24. November 2009 DBpedia DBpedia is a community effort to extract structured information from Wikipedia
More informationEnabling Complex Wikipedia Queries - Technical Report
Enabling Complex Wikipedia Queries - Technical Report Gilad Katz and Bracha Shapira Department of Information Systems Engineering, Ben-Gurion University of the Negev, Israel Abstract In this technical
More informationJianyong Wang Department of Computer Science and Technology Tsinghua University
Jianyong Wang Department of Computer Science and Technology Tsinghua University jianyong@tsinghua.edu.cn Joint work with Wei Shen (Tsinghua), Ping Luo (HP), and Min Wang (HP) Outline Introduction to entity
More informationBc. Pavel Taufer. Named Entity Recognition and Linking
MASTER THESIS Bc. Pavel Taufer Named Entity Recognition and Linking Institute of Formal and Applied Linguistics Supervisor of the master thesis: Study programme: Study branch: RNDr. Milan Straka, Ph.D.
More informationFrom DBpedia to Wikipedia: Filling the Gap by Discovering Wikipedia Conventions
From DBpedia to Wikipedia: Filling the Gap by Discovering Wikipedia Conventions Diego Torres and Pascal Molli LIFIA - Facultad de Informática - UNLP, La Plata, Argentina Email: {diego.torres, alicia.diaz}@lifia.info.unlp.edu.ar
More information