From Ontologies to Information Extraction and Back

Size: px
Start display at page:

Download "From Ontologies to Information Extraction and Back"

Transcription

1 From Ontologies to Information Extraction and Back LAMA SAEEDA D E PA R T M E N T O F C Y B E R N E T I C S A N D A R T I F I C I A L I N T E L L I G E N C E FA C U LT Y O F E L E C T R I C A L E N G I N E E R I N G C Z E C H T E C H N I C A L U N I V E R S I T Y I N P R A G U E 11/10/2017 LAMA SAEEDA - FEL, CVUT 1

2 Content IE and OBIE Motivation Information extraction and Ontology learning Applications Aviation safety Dataset exploration Conclusion 11/10/2017 LAMA SAEEDA - FEL, CVUT 2

3 Information extraction and ontologybased information extraction Flight had a prolonged loss of communication over Swiss territory. Zurich radar informed at about loss of contact and that also no contact on 121,5 MHz could be established. Geneva informed that ABC1234 has contacted them. Length of loss of comm. is approx. 11 minutes. ufo:trope ufo:object ufo:event UFO ontology aso:location aso:call-sign Aviation-Safety ontology A little semantics goes a long way 11/10/2017 LAMA SAEEDA - FEL, CVUT 3

4 Ontology-based information extraction Challenges: Availability of domain ontology Lack of complete knowledge graph that would serve all purposes (DBpedia) Ontology is essential in IE from textual corpus, and enriching the domain ontology will definitely enhance the process! 11/10/2017 LAMA SAEEDA - FEL, CVUT 4

5 11/10/2017 LAMA SAEEDA - FEL, CVUT 5

6 The iterative methodology Ontology learning Textual corpus Domain ontology Semantic annotation More extracted information leads to a richer ontology Richer ontology leads to extracting more information 11/10/2017 LAMA SAEEDA - FEL, CVUT 6

7 Information extraction Text pre processing Extraction Pre-processing, Rule-based / Machine removing Learning-based stop words, POS extraction tagging 11/10/2017 LAMA SAEEDA - FEL, CVUT 7

8 Ontology learning Terms Concepts Relations Axioms 11/10/2017 LAMA SAEEDA - FEL, CVUT 8

9 Ontology learning Terms Concepts Taxonomic relations Non-taxonomic relations Axioms statistic methods Text pre-processing X POS tagging X Sentence parsing X Latent semantic X Cooccurrence X X Clustering X X Term subsumption X Association rules Linguistic methods Seed words X Semantic lexicon X X X Sub-categorization frames X X Syntactic structure X X Dependency analysis X X Semantic templates X X Lexico-syntactic paterns X X Axiom templates X Logical methods Logical inference X X Inductive Logic X 11/10/2017 LAMA SAEEDA - FEL, CVUT 9

10 The iterative methodology 11/10/2017 LAMA SAEEDA - FEL, CVUT 10

11 Text pre processing Extraction methods Unseen text documents To be annotated Input Information Extraction Input Annotated textual corpus Input Ontology Learning Input Seed ontology Domain expert (user) Terms Concepts Relations Axioms 11/10/2017 LAMA SAEEDA - FEL, CVUT 11

12 Information extraction and semi-structured ontology learning iterative methodology - Example Doc1: Rice, a city in Minnesota Person Doc2: Mitch Fielder is mayor of Rice Doc3: Krnačova, mayor of Prague Location Politician Doc4: Tong wrote Rice City has-mayor Mayor Rice has-mayor M.Fielder 11/10/2017 LAMA SAEEDA - FEL, CVUT 12

13 11/10/2017 LAMA SAEEDA - FEL, CVUT 13

14 OBIE from aviation safety reports - Motivation Reporting Tool 11/10/2017 LAMA SAEEDA - FEL, CVUT 14

15 OBIE from aviation safety reports - Motivation Initial report Text Runway incursion caused by inadequate situational awareness and bad weather encounters Report analysis results weather encounter runway incursion 11/10/2017 LAMA SAEEDA - FEL, CVUT 15

16 OBIE from aviation safety reports - Motivation 11/10/2017 LAMA SAEEDA - FEL, CVUT 16

17 System users Semantic Annotations Aviation Domain experts Suggest automatically generated form to the user User input initial aviation safety report Textual aviation safety corpus Ontology Learning Aviation safety Ontology (ASO) Reporting tool Compliant with Generated form Generate form Report model Information extraction and ontology learning from aviation safety reports 11/10/2017 LAMA SAEEDA - FEL, CVUT 17

18 OBIE from aviation safety reports - Motivation Initial report Text Runway incursion caused by inadequate situational awareness and bad weather encounters Report analysis results weather encounter runway incursion 11/10/2017 LAMA SAEEDA - FEL, CVUT 18

19 In-flight shutdown, flight no. TVP7266, KTW-BOJ. Emergency landing in SOF. ENG 2 LOW OIL PRESS, low quantity, FADEC automatically stop the engine. DBpedia Spotlight Apache Stanbol Customized Techniques flight no ENG 2 LOW OIL PRESS engine FADEC ABC1234 In-flight shutdown, flight no. ABC1234, KTW-BOJ. Emergency landing in SOF. ENG 2 LOW OIL PRESS, low quantity, FADEC automatically stop the engine. KTW-BOJ RESTful web service SNAPSHOT-fd/rest/analyze Aviation safety Ontology (ASO) OBIE from aviation safety reports Extraction pipeline 11/10/2017 LAMA SAEEDA - FEL, CVUT 19

20 OBIE from aviation safety reports - Evaluation GATE (General Architecture for Text Engineering) 80 high quality annotated reports Recall With aviation vocabulary without aviation vocabulary with Aviation vocabulary Precision without aviation vocabulary 11/10/2017 LAMA SAEEDA - FEL, CVUT 20

21 OBIE from aviation safety reports Next steps Ontology is essential for information extraction Incompleteness of the ontology Experimenting with various ontology learning techniques and tools Creating the ontology learning pipeline 11/10/2017 LAMA SAEEDA - FEL, CVUT 21

22 11/10/2017 LAMA SAEEDA - FEL, CVUT 22

23 Temporal Knowledge extraction - Motivation Linked open data cloud Data size vs. data quality Dataset exploration Finding useful dataset without prior knowledge is difficult Temporal information Dataset management Temporal filters for search VoID descriptions could be enriched with a temporal dimension Source: "Linking Open Data cloud diagram 2017, by Andrejs Abele, John P. McCrae, Paul Buitelaar, Anja Jentzsch and Richard Cyganiak. 11/10/2017 LAMA SAEEDA - FEL, CVUT 23

24 Temporal Knowledge extraction - Motivation Structured data no defined schema Dataset summaries: Describe Explore Source: "Linking Open Data cloud diagram 2017, by Andrejs Abele, John P. McCrae, Paul Buitelaar, Anja Jentzsch and Richard Cyganiak. 11/10/2017 LAMA SAEEDA - FEL, CVUT 24

25 Temporal knowledge extraction - Temporal analysis Temporal data structure in datasets Structured data: e.g. xsd:date, xsd:time, xsd:gyear is-temporal Semi-structured: e.g. dcterms: description, dcterms: title has-temporal-potential 11/10/2017 LAMA SAEEDA - FEL, CVUT 25

26 Temporal knowledge extraction - Temporal analysis Temporal data in Czech Linked Open Data Cloud (Czech LOD) 11/10/2017 LAMA SAEEDA - FEL, CVUT 26

27 Temporal knowledge extraction - Temporal analysis Temporal data in Linked Open Data Vocabularies (LOV) Source 11/10/2017 LAMA SAEEDA - FEL, CVUT 27

28 Temporal knowledge extraction - Temporal analysis Dataset-based vs vocabulary-based CzLOD prov-o goodrelations schema.org dcterms Czech LOD has-temporal-potential Czech LOD is-temporal LOV has-temporal-potential LOV is-temporal is-temporal: Structured has-temporal-potential: semi-structured 11/10/2017 LAMA SAEEDA - FEL, CVUT 28

29 Temporal knowledge extraction - Temporal analysis SUTime (temporal taggers with F-measure score in TempEval-3) + extend the default rule files for SUTime { ruletype : "time, pattern : /dd/mm/yyyy/} 22/10/2017 { ruletype : "time, pattern : /dd-mm-yyyy/} {ruletype : "time, pattern : /[0-9]{1, 4}.?yyyy Sb /} 253/1992 Sb 11/10/2017 LAMA SAEEDA - FEL, CVUT 29

30 Temporal knowledge extraction - Temporal Descriptors Ontology (TDO) Temporal data in Czech Linked Open Data Cloud (Czech LOD) 11/10/2017 LAMA SAEEDA - FEL, CVUT 30

31 Temporal knowledge extraction - Temporal Descriptors Ontology (TDO) 11/10/2017 LAMA SAEEDA - FEL, CVUT 31

32 Temporal knowledge extraction Evaluation & results Temporal coverage of dataset DCAT voc dct:temporal (startdate, enddate) Temporal descriptor considers only structured temporal data Our approach consider actual data (structured + unstructured) 15 datasets: 46.6% of the cases matching (some cases there is no presence of TI description) 46% of the cases we were able to extract more accurate range e.g. Dataset ds:vavai/evaluation/2009 has no structured temporal info. 11/10/2017 LAMA SAEEDA - FEL, CVUT 32

33 Temporal knowledge extraction Conclusion Improve the temporal knowledge representation of the datasets Better exploring experience for users. Timeline visualization as a future work 11/10/2017 LAMA SAEEDA - FEL, CVUT 33

34 Conclusion Support the information extraction process Incomplete ontology Iterative approach Information extraction pipeline Ontology learning pipeline Temporal information extraction Evaluate the iterative approach to other approaches Evaluate involving experts (time, effort) Reduced with advanced iterations? 11/10/2017 LAMA SAEEDA - FEL, CVUT 34

University of Rome Tor Vergata DBpedia Manuel Fiorelli

University of Rome Tor Vergata DBpedia Manuel Fiorelli University of Rome Tor Vergata DBpedia Manuel Fiorelli fiorelli@info.uniroma2.it 07/12/2017 2 Notes The following slides contain some examples and pictures taken from: Lehmann, J., Isele, R., Jakob, M.,

More information

Linked data implementations who, what, why?

Linked data implementations who, what, why? Semantic Web in Libraries (SWIB18), Bonn, Germany 28 November 2018 Linked data implementations who, what, why? Karen Smith-Yoshimura OCLC Research Linking Open Data cloud diagram 2017, by Andrejs Abele,

More information

Introduction. October 5, Petr Křemen Introduction October 5, / 31

Introduction. October 5, Petr Křemen Introduction October 5, / 31 Introduction Petr Křemen petr.kremen@fel.cvut.cz October 5, 2017 Petr Křemen (petr.kremen@fel.cvut.cz) Introduction October 5, 2017 1 / 31 Outline 1 About Knowledge Management 2 Overview of Ontologies

More information

Building a Linked Open Data Knowledge Graph Henning Schoenenberger Michele Pasin. Frankfurt Book Fair 2017 October 11, 2017

Building a Linked Open Data Knowledge Graph Henning Schoenenberger Michele Pasin. Frankfurt Book Fair 2017 October 11, 2017 Building a Linked Open Data Knowledge Graph Henning Schoenenberger Michele Pasin Frankfurt Book Fair 2017 October 11, 2017 1 Springer Nature s Metadata Mission Statement We understand metadata as the gateway

More information

Knowledge-Driven Video Information Retrieval with LOD

Knowledge-Driven Video Information Retrieval with LOD Knowledge-Driven Video Information Retrieval with LOD Leslie F. Sikos, Ph.D., Flinders University ESAIR 15, 23 October 2015 Melbourne, VIC, Australia Knowledge-Driven Video IR Outline Video Retrieval Challenges

More information

Dataset Dashboard a SPARQL Endpoint Explorer

Dataset Dashboard a SPARQL Endpoint Explorer Dataset Dashboard a SPARQL Endpoint Explorer Petr Křemen, Lama Saeeda, Miroslav Blaško, and Michal Med Czech Technical University in Prague, Praha, Department of Cybernetics, Knowledge-based Software Systems

More information

SQL and Semi-structured data with PostgreSQL

SQL and Semi-structured data with PostgreSQL CS-E4610 Modern Database Systems 05.01.2018-05.04.2018 Tutorial 1 SQL and Semi-structured data with PostgreSQL FREDERICK AYALA-GÓMEZ PHD STUDENT I N COMPUTER SCIENCE, ELT E UNIVERSITY VISITING R ESEA RCHER,

More information

An Evaluation Dataset for Linked Data Profiling

An Evaluation Dataset for Linked Data Profiling An Evaluation Dataset for Linked Data Profiling Andrejs Abele, John McCrae, and Paul Buitelaar Insight Centre for Data Analytics, National University of Ireland, Galway, IDA Business Park, Lower Dangan,

More information

Annotating Spatio-Temporal Information in Documents

Annotating Spatio-Temporal Information in Documents Annotating Spatio-Temporal Information in Documents Jannik Strötgen University of Heidelberg Institute of Computer Science Database Systems Research Group http://dbs.ifi.uni-heidelberg.de stroetgen@uni-hd.de

More information

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper.

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper. Semantic Web Company PoolParty - Server PoolParty - Technical White Paper http://www.poolparty.biz Table of Contents Introduction... 3 PoolParty Technical Overview... 3 PoolParty Components Overview...

More information

Knowledge Engineering with Semantic Web Technologies

Knowledge Engineering with Semantic Web Technologies This file is licensed under the Creative Commons Attribution-NonCommercial 3.0 (CC BY-NC 3.0) Knowledge Engineering with Semantic Web Technologies Lecture 5: Ontological Engineering 5.3 Ontology Learning

More information

Emergent Models, Frameworks and Hardware Technologies for Big Data Analytics

Emergent Models, Frameworks and Hardware Technologies for Big Data Analytics International Symposium on Big Data and Cloud Computing Challenges (ISBCC-17) @ VIT Chennai Emergent Models, Frameworks and Hardware Technologies for Privatdozent Dr. rer. nat. habil. Sven Groppe University

More information

Enabling Semantic Search in Large Open Source Communities

Enabling Semantic Search in Large Open Source Communities Enabling Semantic Search in Large Open Source Communities Gregor Leban, Lorand Dali, Inna Novalija Jožef Stefan Institute, Jamova cesta 39, 1000 Ljubljana {gregor.leban, lorand.dali, inna.koval}@ijs.si

More information

A rule-based approach to address semantic accuracy problems on Linked Data

A rule-based approach to address semantic accuracy problems on Linked Data A rule-based approach to address semantic accuracy problems on Linked Data (ISWC 2014 - Doctoral Consortium) Leandro Mendoza 1 LIFIA, Facultad de Informática, Universidad Nacional de La Plata, Argentina

More information

Introduction to Text Mining. Hongning Wang

Introduction to Text Mining. Hongning Wang Introduction to Text Mining Hongning Wang CS@UVa Who Am I? Hongning Wang Assistant professor in CS@UVa since August 2014 Research areas Information retrieval Data mining Machine learning CS@UVa CS6501:

More information

Integrating Heterogeneous Data Sources in the Web of Data

Integrating Heterogeneous Data Sources in the Web of Data Integrating Heterogeneous Data Sources in the Web of Data Franck Michel 1 More data sources More opportunities 2 Example: study history of zoological knowledge First Natural History Encycloedia, 1485.

More information

Fuzzy Ontology-based Spatial Data Warehouse for Context-aware. search and recommendations.

Fuzzy Ontology-based Spatial Data Warehouse for Context-aware. search and recommendations. Fuzzy Ontology-based Spatial Data Warehouse for Context-aware Search and Recommendation Hajer Baazaoui-Zghal Riadi laboratory, University of Manouba, Tunis, Tunisia Keywords: Abstract: Fuzzy Ontology,

More information

Natural Language Interfaces to Ontologies. Danica Damljanović

Natural Language Interfaces to Ontologies. Danica Damljanović Natural Language Interfaces to Ontologies Danica Damljanović danica@dcs.shef.ac.uk Sponsored by Transitioning Applications to Ontologies: www.tao-project.eu GATE case study in TAO project collect software

More information

Multimedia Standards

Multimedia Standards Multimedia Standards SS 2017 Lecture 8 Prof. Dr.-Ing. Karlheinz Brandenburg Karlheinz.Brandenburg@tu-ilmenau.de Contact: Dipl.-Inf. Thomas Köllmer thomas.koellmer@tu-ilmenau.de Metadata Standards 2 What

More information

Context Sensitive Search Engine

Context Sensitive Search Engine Context Sensitive Search Engine Remzi Düzağaç and Olcay Taner Yıldız Abstract In this paper, we use context information extracted from the documents in the collection to improve the performance of the

More information

Natural Language Processing with PoolParty

Natural Language Processing with PoolParty Natural Language Processing with PoolParty Table of Content Introduction to PoolParty 2 Resolving Language Problems 4 Key Features 5 Entity Extraction and Term Extraction 5 Shadow Concepts 6 Word Sense

More information

PRIS at TAC2012 KBP Track

PRIS at TAC2012 KBP Track PRIS at TAC2012 KBP Track Yan Li, Sijia Chen, Zhihua Zhou, Jie Yin, Hao Luo, Liyin Hong, Weiran Xu, Guang Chen, Jun Guo School of Information and Communication Engineering Beijing University of Posts and

More information

Juggling the Jigsaw Towards Automated Problem Inference from Network Trouble Tickets

Juggling the Jigsaw Towards Automated Problem Inference from Network Trouble Tickets Juggling the Jigsaw Towards Automated Problem Inference from Network Trouble Tickets Rahul Potharaju (Purdue University) Navendu Jain (Microsoft Research) Cristina Nita-Rotaru (Purdue University) April

More information

Linked Open Data Cloud. John P. McCrae, Thierry Declerck

Linked Open Data Cloud. John P. McCrae, Thierry Declerck Linked Open Data Cloud John P. McCrae, Thierry Declerck Hitchhiker s guide to the Linked Open Data Cloud DBpedia Largest node in the linked open data cloud Nucleus for a web of open data Most data is

More information

Semantic image search using queries

Semantic image search using queries Semantic image search using queries Shabaz Basheer Patel, Anand Sampat Department of Electrical Engineering Stanford University CA 94305 shabaz@stanford.edu,asampat@stanford.edu Abstract Previous work,

More information

August 2012 Daejeon, South Korea

August 2012 Daejeon, South Korea Building a Web of Linked Entities (Part I: Overview) Pablo N. Mendes Free University of Berlin August 2012 Daejeon, South Korea Outline Part I A Web of Linked Entities Challenges Progress towards solutions

More information

Linked Data Profiling

Linked Data Profiling Linked Data Profiling Identifying the Domain of Datasets Based on Data Content and Metadata Andrejs Abele «Supervised by Paul Buitelaar, John McCrae, Georgeta Bordea» Insight Centre for Data Analytics,

More information

Towards Summarizing the Web of Entities

Towards Summarizing the Web of Entities Towards Summarizing the Web of Entities contributors: August 15, 2012 Thomas Hofmann Director of Engineering Search Ads Quality Zurich, Google Switzerland thofmann@google.com Enrique Alfonseca Yasemin

More information

Reducing Consumer Uncertainty Towards a Vocabulary for User-centric Geospatial Metadata

Reducing Consumer Uncertainty Towards a Vocabulary for User-centric Geospatial Metadata Meeting Host Supporting Partner Meeting Sponsors Reducing Consumer Uncertainty Towards a Vocabulary for User-centric Geospatial Metadata 105th OGC Technical Committee Palmerston North, New Zealand Dr.

More information

Knowledge Graphs on the Web

Knowledge Graphs on the Web Knowledge Graphs on the Web Which information can we find in them and which can we not? 1 Introduction You ve seen this, haven t you? Linking Open Data cloud diagram 2017, by Andrejs Abele, John P. McCrae,

More information

Semantic Annotation and Linking of Medical Educational Resources

Semantic Annotation and Linking of Medical Educational Resources 5 th European IFMBE MBEC, Budapest, September 14-18, 2011 Semantic Annotation and Linking of Medical Educational Resources N. Dovrolis 1, T. Stefanut 2, S. Dietze 3, H.Q. Yu 3, C. Valentine 3 & E. Kaldoudi

More information

Reducing Consumer Uncertainty

Reducing Consumer Uncertainty Spatial Analytics Reducing Consumer Uncertainty Towards an Ontology for Geospatial User-centric Metadata Introduction Cooperative Research Centre for Spatial Information (CRCSI) in Australia Communicate

More information

Automatic Lemmatizer Construction with Focus on OOV Words Lemmatization

Automatic Lemmatizer Construction with Focus on OOV Words Lemmatization Automatic Lemmatizer Construction with Focus on OOV Words Lemmatization Jakub Kanis, Luděk Müller University of West Bohemia, Department of Cybernetics, Univerzitní 8, 306 14 Plzeň, Czech Republic {jkanis,muller}@kky.zcu.cz

More information

Financial Events Recognition in Web News for Algorithmic Trading

Financial Events Recognition in Web News for Algorithmic Trading Financial Events Recognition in Web News for Algorithmic Trading Frederik Hogenboom fhogenboom@ese.eur.nl Erasmus University Rotterdam PO Box 1738, NL-3000 DR Rotterdam, the Netherlands October 18, 2012

More information

The R2R Framework: Christian Bizer, Andreas Schultz. 1 st International Workshop on Consuming Linked Data (COLD2010) Freie Universität Berlin

The R2R Framework: Christian Bizer, Andreas Schultz. 1 st International Workshop on Consuming Linked Data (COLD2010) Freie Universität Berlin 1 st International Workshop on Consuming Linked Data (COLD2010) November 8, 2010, Shanghai, China The R2R Framework: Publishing and Discovering i Mappings on the Web Christian Bizer, Andreas Schultz Freie

More information

Query Expansion using Wikipedia and DBpedia

Query Expansion using Wikipedia and DBpedia Query Expansion using Wikipedia and DBpedia Nitish Aggarwal and Paul Buitelaar Unit for Natural Language Processing, Digital Enterprise Research Institute, National University of Ireland, Galway firstname.lastname@deri.org

More information

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A.

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. Knowledge Retrieval Franz J. Kurfess Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. 1 Acknowledgements This lecture series has been sponsored by the European

More information

Self-tuning ongoing terminology extraction retrained on terminology validation decisions

Self-tuning ongoing terminology extraction retrained on terminology validation decisions Self-tuning ongoing terminology extraction retrained on terminology validation decisions Alfredo Maldonado and David Lewis ADAPT Centre, School of Computer Science and Statistics, Trinity College Dublin

More information

A Method for Semi-Automatic Ontology Acquisition from a Corporate Intranet

A Method for Semi-Automatic Ontology Acquisition from a Corporate Intranet A Method for Semi-Automatic Ontology Acquisition from a Corporate Intranet Joerg-Uwe Kietz, Alexander Maedche, Raphael Volz Swisslife Information Systems Research Lab, Zuerich, Switzerland fkietz, volzg@swisslife.ch

More information

ToC. n Introduction. n Linked Data. n Iterative and Incremental Linked Data Life cycle. n Q&A. Copyright 2017 FUJITSU LIMITED

ToC. n Introduction. n Linked Data. n Iterative and Incremental Linked Data Life cycle. n Q&A. Copyright 2017 FUJITSU LIMITED ToC n Introduction n Linked Data n Iterative and Incremental Linked Data Life cycle n Q&A 1 Personal Introduction http://boris.villazon.terrazas.name/ n Undergraduate University: Universidad Católica Boliviana

More information

Named Entity Detection and Entity Linking in the Context of Semantic Web

Named Entity Detection and Entity Linking in the Context of Semantic Web [1/52] Concordia Seminar - December 2012 Named Entity Detection and in the Context of Semantic Web Exploring the ambiguity question. Eric Charton, Ph.D. [2/52] Concordia Seminar - December 2012 Challenge

More information

The German DBpedia: A Sense Repository for Linking Entities

The German DBpedia: A Sense Repository for Linking Entities The German DBpedia: A Sense Repository for Linking Entities Sebastian Hellmann, Claus Stadler, and Jens Lehmann Abstract The modeling of lexico-semantic resources by means of ontologies is an established

More information

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal Heinrich Widmann, DKRZ DI4R 2016, Krakow, 28 September 2016 www.eudat.eu EUDAT receives funding from the European Union's Horizon 2020

More information

LIDER Survey. Overview. Number of participants: 24. Participant profile (organisation type, industry sector) Relevant use-cases

LIDER Survey. Overview. Number of participants: 24. Participant profile (organisation type, industry sector) Relevant use-cases LIDER Survey Overview Participant profile (organisation type, industry sector) Relevant use-cases Discovering and extracting information Understanding opinion Content and data (Data Management) Monitoring

More information

Linked data and its role in the semantic web. Dave Reynolds, Epimorphics

Linked data and its role in the semantic web. Dave Reynolds, Epimorphics Linked data and its role in the semantic web Dave Reynolds, Epimorphics Ltd @der42 Roadmap What is linked data? Modelling Strengths and weaknesses Examples Access other topics image: Leo Oosterloo @ flickr.com

More information

Incorporating ontological background knowledge into Information Extraction

Incorporating ontological background knowledge into Information Extraction Incorporating ontological background knowledge into Information Extraction Benjamin Adrian Knowledge Management Department, DFKI, Kaiserslautern, Germany benjamin.adrian@dfki.de Abstract. In my PhD work

More information

Archive of SID. A Semantic Ontology-based Document Organizer to Cluster elearning Documents

Archive of SID. A Semantic Ontology-based Document Organizer to Cluster elearning Documents A emantic Ontology-based Document Organizer to Cluster elearning Documents ara Alaee Electrical and Computer Engineering Department University of Tehran Tehran,Iran sara.alaee@ut.ac.ir Abstract Document

More information

M3 Framework: User s guide & tutorial

M3 Framework: User s guide & tutorial M3 Framework: User s guide & tutorial Creator Send Feedback Amelie Gyrard (Eurecom - Insight - NUIG/DERI) Designed and implemented by Amélie Gyrard, she was a PhD student at Eurecom under the supervision

More information

A DOMAIN INDEPENDENT APPROACH FOR ONTOLOGY SEMANTIC ENRICHMENT

A DOMAIN INDEPENDENT APPROACH FOR ONTOLOGY SEMANTIC ENRICHMENT A DOMAIN INDEPENDENT APPROACH FOR ONTOLOGY SEMANTIC ENRICHMENT ABSTRACT Tahar Guerram and Nacima Mellal Departement of Mathematics and Computer Science, University Larbi Ben M hidi of Oum El Bouaghi -

More information

INFORMATION EXTRACTION

INFORMATION EXTRACTION COMP90042 LECTURE 13 INFORMATION EXTRACTION INTRODUCTION Given this: Brasilia, the Brazilian capital, was founded in 1960. Obtain this: capital(brazil, Brasilia) founded(brasilia, 1960) Main goal: turn

More information

A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2

A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2 A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2 1 Department of Electronics & Comp. Sc, RTMNU, Nagpur, India 2 Department of Computer Science, Hislop College, Nagpur,

More information

Experiences with UIMA in NLP teaching and research. Manuela Kunze, Dietmar Rösner

Experiences with UIMA in NLP teaching and research. Manuela Kunze, Dietmar Rösner Experiences with UIMA in NLP teaching and research Manuela Kunze, Dietmar Rösner University of Magdeburg C Knowledge Based Systems and Document Processing Overview What is UIMA? First Experiments NLP Teaching

More information

How can Open Data and Linked Data help your organization and its Big Data? Mark Harrison

How can Open Data and Linked Data help your organization and its Big Data? Mark Harrison How can Open Data and Linked Data help your organization and its Big Data? Mark Harrison mark.harrison@cantab.net Outline Big Data, Open Data and Linked Data Open Data sources Linked Data technology Using

More information

Assessing the Quality of Natural Language Text

Assessing the Quality of Natural Language Text Assessing the Quality of Natural Language Text DC Research Ulm (RIC/AM) daniel.sonntag@dfki.de GI 2004 Agenda Introduction and Background to Text Quality Text Quality Dimensions Intrinsic Text Quality,

More information

Semantic Technologies and CDISC Standards. Frederik Malfait, Information Architect, IMOS Consulting Scott Bahlavooni, Independent

Semantic Technologies and CDISC Standards. Frederik Malfait, Information Architect, IMOS Consulting Scott Bahlavooni, Independent Semantic Technologies and CDISC Standards Frederik Malfait, Information Architect, IMOS Consulting Scott Bahlavooni, Independent Part I Introduction to Semantic Technology Resource Description Framework

More information

Parmenides. Semi-automatic. Ontology. construction and maintenance. Ontology. Document convertor/basic processing. Linguistic. Background knowledge

Parmenides. Semi-automatic. Ontology. construction and maintenance. Ontology. Document convertor/basic processing. Linguistic. Background knowledge Discover hidden information from your texts! Information overload is a well known issue in the knowledge industry. At the same time most of this information becomes available in natural language which

More information

WikiOnto: A System For Semi-automatic Extraction And Modeling Of Ontologies Using Wikipedia XML Corpus

WikiOnto: A System For Semi-automatic Extraction And Modeling Of Ontologies Using Wikipedia XML Corpus 2009 IEEE International Conference on Semantic Computing WikiOnto: A System For Semi-automatic Extraction And Modeling Of Ontologies Using Wikipedia XML Corpus Lalindra De Silva University of Colombo School

More information

State of the Art: Patterns in Ontology Engineering

State of the Art: Patterns in Ontology Engineering : Patterns in Ontology Engineering Eva Blomqvist ISSN 1404-0018 Research Report 04:8 : Patterns in Ontology Engineering Eva Blomqvist Information Engineering Research Group Department of Electronic and

More information

Towards high quality, industry-ready Linguistic Linked Licensed Data

Towards high quality, industry-ready Linguistic Linked Licensed Data Towards high quality, industry-ready Linguistic Linked Licensed Data Víctor Rodríguez-Doncel 1, Daniel Vila-Suero 1, Asunción Gómez-Pérez 1, Philipp Cimiano 2, John P. M c Crae 2, and Guadalupe Aguado-de-Cea

More information

Large Scale Semantic Annotation, Indexing, and Search at The National Archives Diana Maynard Mark Greenwood

Large Scale Semantic Annotation, Indexing, and Search at The National Archives Diana Maynard Mark Greenwood Large Scale Semantic Annotation, Indexing, and Search at The National Archives Diana Maynard Mark Greenwood University of Sheffield, UK 1 Burning questions you may have... In the last 3 years, which female

More information

LHD 2.0: A Text Mining Approach to Typing Entities In Knowledge Graphs

LHD 2.0: A Text Mining Approach to Typing Entities In Knowledge Graphs *Manuscript Click here to view linked References LHD 2.0: A Text Mining Approach to Typing Entities In Knowledge Graphs Tomáš Kliegr a,b,1, Ondřej Zamazal a,1 a Department of Information and Knowledge

More information

A Korean Knowledge Extraction System for Enriching a KBox

A Korean Knowledge Extraction System for Enriching a KBox A Korean Knowledge Extraction System for Enriching a KBox Sangha Nam, Eun-kyung Kim, Jiho Kim, Yoosung Jung, Kijong Han, Key-Sun Choi KAIST / The Republic of Korea {nam.sangha, kekeeo, hogajiho, wjd1004109,

More information

Introducing FREME: Deploying Linguistic Linked Data

Introducing FREME: Deploying Linguistic Linked Data Introducing FREME: Deploying Linguistic Linked Data Felix Sasaki1, Tatiana Gornostay2, Milan Dojchinovski3, Michele Osella4, Erik Mannens5, Giannis Stoitsis6, Phil Ritchie7, Kevin Koidl8 1 DFKI, felix.sasaki@dfki.de;

More information

Time Expression Analysis and Recognition Using Syntactic Token Types and General Heuristic Rules

Time Expression Analysis and Recognition Using Syntactic Token Types and General Heuristic Rules Time Expression Analysis and Recognition Using Syntactic Token Types and General Heuristic Rules Xiaoshi Zhong, Aixin Sun, and Erik Cambria Computer Science and Engineering Nanyang Technological University

More information

STS Infrastructural considerations. Christian Chiarcos

STS Infrastructural considerations. Christian Chiarcos STS Infrastructural considerations Christian Chiarcos chiarcos@uni-potsdam.de Infrastructure Requirements Candidates standoff-based architecture (Stede et al. 2006, 2010) UiMA (Ferrucci and Lally 2004)

More information

Source Modeling. Kristina Lerman University of Southern California. Based on slides by Mark Carman, Craig Knoblock and Jose Luis Ambite

Source Modeling. Kristina Lerman University of Southern California. Based on slides by Mark Carman, Craig Knoblock and Jose Luis Ambite Source Modeling Kristina Lerman University of Southern California Based on slides by Mark Carman, Craig Knoblock and Jose Luis Ambite Source Modeling vs. Schema Matching Schema Matching Align schemas between

More information

A service based on Linked Data to classify Web resources using a Knowledge Organisation System

A service based on Linked Data to classify Web resources using a Knowledge Organisation System A service based on Linked Data to classify Web resources using a Knowledge Organisation System A proof of concept in the Open Educational Resources domain Abstract One of the reasons why Web resources

More information

B2FIND and Metadata Quality

B2FIND and Metadata Quality B2FIND and Metadata Quality 3 rd EUDAT Conference 25 September 2014 Heinrich Widmann and B2FIND team 1 Outline B2FIND the EUDAT Metadata Service Semantic Mapping of Metadata Quality of Metadata Summary

More information

A Semantic Framework for Integrating and Publishing Linked Data on the Web. Aparna Padki

A Semantic Framework for Integrating and Publishing Linked Data on the Web. Aparna Padki A Semantic Framework for Integrating and Publishing Linked Data on the Web by Aparna Padki A Thesis Presented in Partial Fulfillment of the Requirements for the Degree Master of Science Approved June 2016

More information

Amine Hallili, PhD student Catherine Faron Zucker & Fabien Gandon, Advisors Elena Cabrio, Supervisor

Amine Hallili, PhD student Catherine Faron Zucker & Fabien Gandon, Advisors Elena Cabrio, Supervisor Amine Hallili, PhD student Catherine Faron Zucker & Fabien Gandon, Advisors Elena Cabrio, Supervisor 1 Headlines Introduction Motivations Research questions Chatbot Definition Categories Our Chatbot? Ongoing

More information

Language Resources and Linked Data

Language Resources and Linked Data Integrating NLP with Linked Data: the NIF Format Milan Dojchinovski @EKAW 2014 November 24-28, 2014, Linkoping, Sweden milan.dojchinovski@fit.cvut.cz - @m1ci - http://dojchinovski.mk Web Intelligence Research

More information

WebAnno: a flexible, web-based annotation tool for CLARIN

WebAnno: a flexible, web-based annotation tool for CLARIN WebAnno: a flexible, web-based annotation tool for CLARIN Richard Eckart de Castilho, Chris Biemann, Iryna Gurevych, Seid Muhie Yimam #WebAnno This work is licensed under a Attribution-NonCommercial-ShareAlike

More information

CHAPTER 5 EXPERT LOCATOR USING CONCEPT LINKING

CHAPTER 5 EXPERT LOCATOR USING CONCEPT LINKING 94 CHAPTER 5 EXPERT LOCATOR USING CONCEPT LINKING 5.1 INTRODUCTION Expert locator addresses the task of identifying the right person with the appropriate skills and knowledge. In large organizations, it

More information

NLP in practice, an example: Semantic Role Labeling

NLP in practice, an example: Semantic Role Labeling NLP in practice, an example: Semantic Role Labeling Anders Björkelund Lund University, Dept. of Computer Science anders.bjorkelund@cs.lth.se October 15, 2010 Anders Björkelund NLP in practice, an example:

More information

Linked Data: Standard s convergence

Linked Data: Standard s convergence Linked Data: Standard s convergence Enhancing the convergence between reporting standards Maria Mora Technical Manager maria.mora@cdp.net 1 Lets talk about a problem Lack of a perfect convergence between

More information

Creating Large-scale Training and Test Corpora for Extracting Structured Data from the Web

Creating Large-scale Training and Test Corpora for Extracting Structured Data from the Web Creating Large-scale Training and Test Corpora for Extracting Structured Data from the Web Robert Meusel and Heiko Paulheim University of Mannheim, Germany Data and Web Science Group {robert,heiko}@informatik.uni-mannheim.de

More information

Putting ontologies to work in NLP

Putting ontologies to work in NLP Putting ontologies to work in NLP The lemon model and its future John P. McCrae National University of Ireland, Galway Introduction In natural language processing we are doing three main things Understanding

More information

Topics in Opinion Mining. Dr. Paul Buitelaar Data Science Institute, NUI Galway

Topics in Opinion Mining. Dr. Paul Buitelaar Data Science Institute, NUI Galway Topics in Opinion Mining Dr. Paul Buitelaar Data Science Institute, NUI Galway Opinion: Sentiment, Emotion, Subjectivity OBJECTIVITY SUBJECTIVITY SPECULATION FACTS BELIEFS EMOTION SENTIMENT UNCERTAINTY

More information

Motivating Ontology-Driven Information Extraction

Motivating Ontology-Driven Information Extraction Motivating Ontology-Driven Information Extraction Burcu Yildiz 1 and Silvia Miksch 1, 2 1 Institute for Software Engineering and Interactive Systems, Vienna University of Technology, Vienna, Austria {yildiz,silvia}@

More information

Precise Medication Extraction using Agile Text Mining

Precise Medication Extraction using Agile Text Mining Precise Medication Extraction using Agile Text Mining Chaitanya Shivade *, James Cormack, David Milward * The Ohio State University, Columbus, Ohio, USA Linguamatics Ltd, Cambridge, UK shivade@cse.ohio-state.edu,

More information

Building Search Applications

Building Search Applications Building Search Applications Lucene, LingPipe, and Gate Manu Konchady Mustru Publishing, Oakton, Virginia. Contents Preface ix 1 Information Overload 1 1.1 Information Sources 3 1.2 Information Management

More information

Kristina Lerman University of Southern California. This lecture is partly based on slides prepared by Anon Plangprasopchok

Kristina Lerman University of Southern California. This lecture is partly based on slides prepared by Anon Plangprasopchok Kristina Lerman University of Southern California This lecture is partly based on slides prepared by Anon Plangprasopchok Social Web is a platform for people to create, organize and share information Users

More information

Information Retrieval CS Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science

Information Retrieval CS Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science Information Retrieval CS 6900 Razvan C. Bunescu School of Electrical Engineering and Computer Science bunescu@ohio.edu Information Retrieval Information Retrieval (IR) is finding material of an unstructured

More information

Statistical parsing. Fei Xia Feb 27, 2009 CSE 590A

Statistical parsing. Fei Xia Feb 27, 2009 CSE 590A Statistical parsing Fei Xia Feb 27, 2009 CSE 590A Statistical parsing History-based models (1995-2000) Recent development (2000-present): Supervised learning: reranking and label splitting Semi-supervised

More information

Shrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India

Shrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 3 ISSN : 2456-3307 Some Issues in Application of NLP to Intelligent

More information

Linked Data in Archives

Linked Data in Archives Linked Data in Archives Publish, Enrich, Refine, Reconcile, Relate Presented 2012-08-23 SAA 2012, Linking Data Across Libraries, Archives, and Museums Corey A Harper Semantic Web TBL s original vision

More information

Final Project Discussion. Adam Meyers Montclair State University

Final Project Discussion. Adam Meyers Montclair State University Final Project Discussion Adam Meyers Montclair State University Summary Project Timeline Project Format Details/Examples for Different Project Types Linguistic Resource Projects: Annotation, Lexicons,...

More information

Semantic Analysis. Compiler Architecture

Semantic Analysis. Compiler Architecture Processing Systems Prof. Mohamed Hamada Software Engineering Lab. The University of Aizu Japan Source Compiler Architecture Front End Scanner (lexical tokens Parser (syntax Parse tree Semantic Analysis

More information

Making Sense Out of the Web

Making Sense Out of the Web Making Sense Out of the Web Rada Mihalcea University of North Texas Department of Computer Science rada@cs.unt.edu Abstract. In the past few years, we have witnessed a tremendous growth of the World Wide

More information

Profiling Medical Journal Articles Using a Gene Ontology Semantic Tagger. Mahmoud El-Haj Paul Rayson Scott Piao Jo Knight

Profiling Medical Journal Articles Using a Gene Ontology Semantic Tagger. Mahmoud El-Haj Paul Rayson Scott Piao Jo Knight Profiling Medical Journal Articles Using a Gene Ontology Semantic Tagger Mahmoud El-Haj Paul Rayson Scott Piao Jo Knight Origin and Outcomes Currently funded through a Wellcome Trust Seed award Collaboration

More information

SAMPLE 2 This is a sample copy of the book From Words to Wisdom - An Introduction to Text Mining with KNIME

SAMPLE 2 This is a sample copy of the book From Words to Wisdom - An Introduction to Text Mining with KNIME 2 Copyright 2018 by KNIME Press All Rights reserved. This publication is protected by copyright, and permission must be obtained from the publisher prior to any prohibited reproduction, storage in a retrieval

More information

Watson & WMR2017. (slides mostly derived from Jim Hendler and Simon Ellis, Rensselaer Polytechnic Institute, or from IBM itself)

Watson & WMR2017. (slides mostly derived from Jim Hendler and Simon Ellis, Rensselaer Polytechnic Institute, or from IBM itself) Watson & WMR2017 (slides mostly derived from Jim Hendler and Simon Ellis, Rensselaer Polytechnic Institute, or from IBM itself) R. BASILI A.A. 2016-17 Overview Motivations Watson Jeopardy NLU in Watson

More information

DBpedia-An Advancement Towards Content Extraction From Wikipedia

DBpedia-An Advancement Towards Content Extraction From Wikipedia DBpedia-An Advancement Towards Content Extraction From Wikipedia Neha Jain Government Degree College R.S Pura, Jammu, J&K Abstract: DBpedia is the research product of the efforts made towards extracting

More information

Graph Exploration: Taking the User into the Loop

Graph Exploration: Taking the User into the Loop Graph Exploration: Taking the User into the Loop Davide Mottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany 2016/10/24 CIKM2016, Indianapolis, US Where we are Background (5

More information

Master Project. Various Aspects of Recommender Systems. Prof. Dr. Georg Lausen Dr. Michael Färber Anas Alzoghbi Victor Anthony Arrascue Ayala

Master Project. Various Aspects of Recommender Systems. Prof. Dr. Georg Lausen Dr. Michael Färber Anas Alzoghbi Victor Anthony Arrascue Ayala Master Project Various Aspects of Recommender Systems May 2nd, 2017 Master project SS17 Albert-Ludwigs-Universität Freiburg Prof. Dr. Georg Lausen Dr. Michael Färber Anas Alzoghbi Victor Anthony Arrascue

More information

Ontology Engineering. CSE 595 Semantic Web Instructor: Dr. Paul Fodor Stony Brook University

Ontology Engineering. CSE 595 Semantic Web Instructor: Dr. Paul Fodor Stony Brook University Ontology Engineering CSE 595 Semantic Web Instructor: Dr. Paul Fodor Stony Brook University http://www3.cs.stonybrook.edu/~pfodor/courses/cse595.html Lecture Outline Constructing Ontologies Reusing Existing

More information

EAD3 Study Group on Discovery Report and Discussion

EAD3 Study Group on Discovery Report and Discussion EAD3 Study Group on Discovery Report and Discussion Society of American Archivists Encoded Archival Standards Section April 12, 2017 Study Group members Sarah Buchanan, University of Missouri Michele Combs,

More information

A FRAMEWORK FOR MULTILINGUAL AND SEMANTIC ENRICHMENT OF DIGITAL CONTENT (NEW L10N BUSINESS OPPORTUNITIES) FREME WEBINAR HELD FOR GALA, 28 APRIL 2016

A FRAMEWORK FOR MULTILINGUAL AND SEMANTIC ENRICHMENT OF DIGITAL CONTENT (NEW L10N BUSINESS OPPORTUNITIES) FREME WEBINAR HELD FOR GALA, 28 APRIL 2016 Co-funded by the Horizon 2020 Framework Programme of the European Union Grant Agreement Number 644771 www.freme-project.eu A FRAMEWORK FOR MULTILINGUAL AND SEMANTIC ENRICHMENT OF DIGITAL CONTENT (NEW L10N

More information

Refining Ontologies by Pattern-Based Completion

Refining Ontologies by Pattern-Based Completion Refining Ontologies by Pattern-Based Completion Nadejda Nikitina and Sebastian Rudolph and Sebastian Blohm Institute AIFB, University of Karlsruhe D-76128 Karlsruhe, Germany {nikitina, rudolph, blohm}@aifb.uni-karlsruhe.de

More information

Implementing a Variety of Linguistic Annotations

Implementing a Variety of Linguistic Annotations Implementing a Variety of Linguistic Annotations through a Common Web-Service Interface Adam Funk, Ian Roberts, Wim Peters University of Sheffield 18 May 2010 Adam Funk, Ian Roberts, Wim Peters Implementing

More information