A Method for Semi-Automatic Ontology Acquisition from a Corporate Intranet

Size: px
Start display at page:

Download "A Method for Semi-Automatic Ontology Acquisition from a Corporate Intranet"

Transcription

1 A Method for Semi-Automatic Ontology Acquisition from a Corporate Intranet Joerg-Uwe Kietz, Alexander Maedche, Raphael Volz Swisslife Information Systems Research Lab, Zuerich, Switzerland fkietz, volzg@swisslife.ch AIFB, Univ. Karlsruhe, D Karlsruhe, Germany maedche@aifb.uni-karlsruhe.de Abstract This paper describes our actual and ongoing work in supporting semiautomatic ontology acquisition from a corporate intranet of an insurance company. A comprehensive architecture and a system for semi-automatic ontology acquisition supports processing semi-structured information (e.g. contained in dictionaries) and natural language documents and including existing core ontologies (e.g. GermaNet, WordNet). We present a method for acquiring a applicationtailored domain ontology from given heterogeneous intranet sources. 1 Introduction Ontologies have proved their usefulness in different applications scenarios, such as intelligent information integration, knowledge-based systems, natural language processing. The role of ontologies is to capture domain knowledge in a generic way and provide a commonly agreed upon understanding of a domain. The common vocabulary of an ontology, defining the meaning of terms and relations, is usually organized in a taxonomy. An ontology usually contains modeling primitives such as concepts, generic relations between concepts, and axioms. In this paper we describe our actual and ongoing work in supporting semi-automatic ontology acquisition from a corporate intranet of an insurance company. Corporate intranets have been developed in the last years to large information resources containing useful information, such as dictionaries, instruction catalogues, collection of interesting publications of the domain, internal information, etc. However, the information stored in corporate intranets is only accessible and searcheable using standard information retrieval techniques. This is due to that the information expressed in natural language is not machine processable. Nevertheless, a lot of useful concepts and conceptual structures of the domain and the company terminology is contained on the intranet. Our approach underlies different heterogeneous sources: First, a generic core ontology has been taken as a top level structure for our domain-specific goal ontology. It has been transformed into our ontology representation formalism. Second, a dictionary containing the most important 1500 corporate terms described with natural language definitions served as an input for the ontology construction. Third, we accessed around 1000 intranet documents containing business reports and general company information.

2 2 Architecture The general architecture of the core approach for semi-automatic ontology learning from natural language has been described in [5]. The main components of the system are a Text & Processing Management component, a Text Processing Server, a component containing different algorithms for ontology learning and pruning coming with a multistrategy learning result set, and the Ontology Engineering System OntoEdit [10]. semi-structured information, e.g. domain-specific dictionaries natural language texts Ontology WordNet feed feed Select core ontology GermaNet Text & Processing Management Built core ontology (XML tagged) text &selected algorithms uses Ontology Learning & Ontology Pruning Algorithms selected text & preprocessing method XMLtagged text Multi - Strategy Learning Result Set Stemming Text Processing Server POS tagging chunk parsing Information Extraction... uses Domain Ontology interacts domain lexicon models edits Proposes new Conceptual structures (concepts, isa relations, generic relations) OntoEdit Ontology Modeling Environment uses Lexical DB Figure1. Architecture of the Ontology Learning Approach The Text & Processing component supports efficient handling and processing of the different input sources (semi-structured information, natural language documents, core ontologies), the definition of linguistic processing steps, the application of several algorithms. The Text Processing Server (SMES) is an information extraction system including shallow parsing mechanisms at different processing levels. It is based on the core systems SMES (Saarbrücken Message Extraction System), a shallow text processor for German (cf. [8]). SMES is a system that performs syntactic analysis on natural language documents. In general, the Text Processing Server is organized in modules, such as a tokenizer, morphological and lexical processing, and chunk parsing that use lexical resources to produce mixed syntactic/semantic information. The results of text processing are stored in annotations using XML-tagged text.

3 The Ontology Learning & Pruning Component includes a pattern definition and matching module, several term extraction mechanisms based on term frequency/inverted document frequency [9], and an algorithm for discovering conceptual relations. A Multi-Strategy Learning Result Set is used to support the complex task of ontology learning: It is possible to combine results from different learning methods, that have been applied to different sources. As described by [6] multi-strategy learning architectures support balancing between advantages and disadvantages of different learning methods. The Ontology Engineering System OntoEdit supports the ontology engineer in semiautomatically adding newly discovered structures to the ontology. 1 In addition to core capabilities for structuring the ontology, the engineering environment provides some additional features for the purpose of documentation, maintenance, and ontology exchange. OntoEdit internally stores ontologies using an XML serialization of the ontology model. 3 The Acquisition Process Based on the architecture described above we developed an ontology acquisition methodology depicted in Figure 2. The acquisition cycle starts with selecting a core ontology (cf. subsection 3.1). In our case we decided to adopt the GermaNet lexical semantic network described in [2] and transformed it to our internal ontology representation. In general, core or top level ontologies like GermaNet, do not contain domain-specific concepts. First, domain-specific concepts are acquired from the sources and are embedded into the concept taxonomy. In our case as described in subsection 3.2 we used a domain-specific dictionary. Additionally, the documents have been linguistically processed and frequency-based term extraction mechanisms for concept acquisition have been applied. Second, the ontology is focused or pruned to the application domain. This step is further explained in subsection 3.3. Third, domain-specific conceptual relations are extracted from natural language documents (cf. subsection 3.4). The resulting domain-specific ontology can be further refined and improved repeating the acquisition cycle. Our cyclic approach supports the evolving nature of ontologies and natural language in general. We have to emphasize that the acquisition process described in the following is a semi-automatic process with necessary human intervention. 3.1 Transforming GermaNet GermaNet is the german counterpart to the well known WordNet. It builds a lexical semantic network for german words, where 3 different types of word classes are distinguished: nouns, verbs and adjectives. Words are grouped into sets of synomyms so called synsets. These sets are converted to concepts. Some semantic relations between synsets are converted to conceptual relations, especially the hypernym-hyponym 1 A comprehensive description of the ontology engineering system OntoEdit and the underlying methodology is given in [10].

4 Figure2. Semi-Automatic Ontology Acquisition Process relations are used to construct the concept taxonomy. We have to emphasize that other generic top-level ontologies like CyC, Dahlgren s or Sowa s ontology could also be used to start the domain ontology acquisition process. 3.2 Extracting domain-specific concepts and a concept hierarchy As already mentioned we used a dictionary of corporate terms to acquire domain specific concepts. Dictionary entries are considered as concepts. The entires are aligned into the taxonomy using their natural language descriptions. The descriptions are processed by the Text Processing Server. Several heuristic patterns for extracting taxonomic relations similar to [3] are used to acquire the taxonomic alignment. Additionally, all phrases have been extracted from the intranet documents using linguistic processing techniques. Frequent phrases which tend to be concepts have been detected using statistical techniques. 3.3 Domain-specific Pruning By now the generic core ontology has been extended with domain-specific components. In order to prune concepts that are domain-unspecific, concept frequencies are determined from the intranet corpus. By cumulating the frequencies of sub concepts within their super concepts, concept frequencies are propagated through the taxonomy. The frequencies are compared to concept frequencies acquired from generic corpora. Concepts that are less frequent than an user specified minimum are deleted from the ontology (see figure 3).

5 1. Given: ontology, corpus, mininimum frequency 2. Determine concept frequencies from corpus (a) Count concept frequencies in corpus (b) Propagate frequencies to super concepts (c) Compare to concept frequencies from generic corpora 3. Remove concepts that do not support minimum frequencies (a) Remove concept relations and restrictions from ontology (b) Remove concept from ontology Figure3. Focusing on the domain 3.4 Acquisition of Conceptual Relations We used two methods to learn conceptual relations. First, we used a statistical approach based on association rules as described in [4]. The algorithm uses the background knowledge from the concept taxonomy in order to propose relations at the appropriate level of abstraction. For instance, the linguistic processing may find that the word insurance agreement frequently co-occurs with each of the words policyholder and insurance salesman. From this statistical linguistic data our approach derives correlations at the conceptual level, viz. between the concept InsuranceAgreement and the concepts, PolicyHolder and InsuranceSalesman. The discovery algorithm determines support and confidence measures for the relationships between these three pairs, as well as for relationships at higher levels of abstraction, such as between InsuranceAgreement and Person. In a final step, the algorithm determines the level of abstraction most suited to describe the conceptual relationships by pruning appearingly less adequate ones. Here, the relation between InsuranceAgreement and Person may be proposed for inclusion in the ontology. Second, we used heuristic patterns defined as regular expressions to extract nontaxonomic conceptual relations from text as used in [7] for the acquisition of hyponyms. In our current work we have defined some patterns on top of the phrase-level processed documents, such as NP have NP. 4 Conclusions & Further Work In this paper we have described our recent and ongoing work in semi-automatic ontology acquisition from a corporate intranet. Based on our comprehensive architecture a new approach for supporting the overall process of engineering ontologies from text is described. It is mainly based on a given core ontology, which is extended with domain specific concepts. The resulting ontology is pruned and restricted to a specific application using a corpus-based mechanism for ontology pruning. On top of the ontology two approaches supporting the difficult task of determining non-taxonomic conceptual relationships are applied. In the future much work remains to be done. First, several techniques for evaluating the acquired ontology have to be developed. In our scenario we will apply ontology cross comparison techniques such as described in [4]. Additionally, applying the ontology on top of the intranet documents (e.g. a information retrieval scenario, a semantic

6 document annotation scenario such as described in [1]) will allow us an applicationspecific evaluation of the ontology using standard measures such as precision and recall. Second, our approach for multi-strategy learning is still in an early stage. We will have to elaborate how the results of different learning algorithms will have to be assessed and combined in the multi-strategy learning set. Nevertheless, an approach combing different resources on which different techniques are applied, seems promising for supporting the complex task of ontology learning from text. References 1. M. Erdmann, A. Maedche, H.-P. Schnurr, and Steffen Staab. From manual to semi-automatic semantic annotation: About ontology-based text annotation tools. In P. Buitelaar & K. Hasida (eds). Proceedings of the COLING 2000 Workshop on Semantic Annotation and Intelligent Content, Luxembourg, August B. Hamp and H. Feldweg. Germanet - a lexical-semantic net for german. In Proceedings of ACL workshop Automatic Information Extraction and Building of Lexical Semantic Resources for NLP Applications, Madrid., M.A. Hearst. Automatic acquisition of hyponyms from large text corpora. In Proceedings of the 14th International Conference on Computational Linguistics. Nantes, France, A. Maedche and S. Staab. Discovering conceptual relations from text. In Proceedings of ECAI IOS Press, Amsterdam, A. Maedche and S. Staab. Semi-automatic engineering of ontologies from text. In Proceedings of the 12th Internal Conference on Software and Knowledge Engineering. Chicago, USA. KSI, R. Michalski and K. Kaufmann. Data mining and knowledge discovery: A review of issues and multistrategy approach. In Machine Learning and Data Mining Methods and Applications. John Wiley, England, E. Morin. Automatic acquisition of semantic relations between terms from technical corpora. In Proc. of the Fifth International Congress on Terminology and Knowledge Engineering - TKE 99, G. Neumann, R. Backofen, J. Baur, M. Becker, and C. Braun. An information extraction core system for real world german text processing. In ANLP 97 Proceedings of the Conference on Applied Natural Language Processing, pages , Washington, USA, G. Salton. Automatic Text Processing. Addison-Wesley, S. Staab and A. Maedche. Ontology engineering beyond the modeling of concepts and relations. In Proceedings of the ECAI 2000 Workshop on Application of Ontologies and Problem-Solving Methods, 2000.

THE KLUWER INTERNATIONAL SERIES IN ENGINEERING AND COMPUTER SCIENCE

THE KLUWER INTERNATIONAL SERIES IN ENGINEERING AND COMPUTER SCIENCE THE KLUWER INTERNATIONAL SERIES IN ENGINEERING AND COMPUTER SCIENCE ONTOLOGY LEARNING FOR THE SEMANTIC WEB ONTOLOGY LEARNING FOR THE SEMANTIC WEB by Alexander Maedche University of Karlsruhe, Germany SPRINGER

More information

Enabling Semantic Search in Large Open Source Communities

Enabling Semantic Search in Large Open Source Communities Enabling Semantic Search in Large Open Source Communities Gregor Leban, Lorand Dali, Inna Novalija Jožef Stefan Institute, Jamova cesta 39, 1000 Ljubljana {gregor.leban, lorand.dali, inna.koval}@ijs.si

More information

Ontology-Based Information Extraction

Ontology-Based Information Extraction Ontology-Based Information Extraction Daya C. Wimalasuriya Towards Partial Completion of the Comprehensive Area Exam Department of Computer and Information Science University of Oregon Committee: Dr. Dejing

More information

Motivating Ontology-Driven Information Extraction

Motivating Ontology-Driven Information Extraction Motivating Ontology-Driven Information Extraction Burcu Yildiz 1 and Silvia Miksch 1, 2 1 Institute for Software Engineering and Interactive Systems, Vienna University of Technology, Vienna, Austria {yildiz,silvia}@

More information

From Manual to Semi-automatic Semantic Annotation: About Ontology-based Text Annotation Tools

From Manual to Semi-automatic Semantic Annotation: About Ontology-based Text Annotation Tools Linköping Electronic Articles in Computer and Information Science Vol. 6(2001): nr? From Manual to Semi-automatic Semantic Annotation: About Ontology-based Text Annotation Tools M. Erdmann A. Maedche H.-P.

More information

State of the Art: Patterns in Ontology Engineering

State of the Art: Patterns in Ontology Engineering : Patterns in Ontology Engineering Eva Blomqvist ISSN 1404-0018 Research Report 04:8 : Patterns in Ontology Engineering Eva Blomqvist Information Engineering Research Group Department of Electronic and

More information

Ontology Extraction from Heterogeneous Documents

Ontology Extraction from Heterogeneous Documents Vol.3, Issue.2, March-April. 2013 pp-985-989 ISSN: 2249-6645 Ontology Extraction from Heterogeneous Documents Kirankumar Kataraki, 1 Sumana M 2 1 IV sem M.Tech/ Department of Information Science & Engg

More information

A DOMAIN INDEPENDENT APPROACH FOR ONTOLOGY SEMANTIC ENRICHMENT

A DOMAIN INDEPENDENT APPROACH FOR ONTOLOGY SEMANTIC ENRICHMENT A DOMAIN INDEPENDENT APPROACH FOR ONTOLOGY SEMANTIC ENRICHMENT ABSTRACT Tahar Guerram and Nacima Mellal Departement of Mathematics and Computer Science, University Larbi Ben M hidi of Oum El Bouaghi -

More information

Ontology Learning and Reasoning Dealing with Uncertainty and Inconsistency

Ontology Learning and Reasoning Dealing with Uncertainty and Inconsistency Ontology Learning and Reasoning Dealing with Uncertainty and Inconsistency Peter Haase, Johanna Völker Institute AIFB, University of Karlsruhe, Germany {pha,jvo}@aifb.uni-karlsruhe.de Abstract. Ontology

More information

WikiOnto: A System For Semi-automatic Extraction And Modeling Of Ontologies Using Wikipedia XML Corpus

WikiOnto: A System For Semi-automatic Extraction And Modeling Of Ontologies Using Wikipedia XML Corpus 2009 IEEE International Conference on Semantic Computing WikiOnto: A System For Semi-automatic Extraction And Modeling Of Ontologies Using Wikipedia XML Corpus Lalindra De Silva University of Colombo School

More information

Making Sense Out of the Web

Making Sense Out of the Web Making Sense Out of the Web Rada Mihalcea University of North Texas Department of Computer Science rada@cs.unt.edu Abstract. In the past few years, we have witnessed a tremendous growth of the World Wide

More information

Question Answering Approach Using a WordNet-based Answer Type Taxonomy

Question Answering Approach Using a WordNet-based Answer Type Taxonomy Question Answering Approach Using a WordNet-based Answer Type Taxonomy Seung-Hoon Na, In-Su Kang, Sang-Yool Lee, Jong-Hyeok Lee Department of Computer Science and Engineering, Electrical and Computer Engineering

More information

Final Project Discussion. Adam Meyers Montclair State University

Final Project Discussion. Adam Meyers Montclair State University Final Project Discussion Adam Meyers Montclair State University Summary Project Timeline Project Format Details/Examples for Different Project Types Linguistic Resource Projects: Annotation, Lexicons,...

More information

A framework for retrieving conceptual knowledge from Web pages

A framework for retrieving conceptual knowledge from Web pages A framework for retrieving conceptual knowledge from Web pages Nacéra Bennacer, Lobna Karoui Ecole Supérieure d Electricité (Supélec), Plateau de Moulon 3 rue Joliot Curie, 91192 Gif-sur-Yvette cedex,

More information

Ontology Research and Development Part 1 A Review of Ontology Generation

Ontology Research and Development Part 1 A Review of Ontology Generation Ontology Research and Development Part 1 A Review of Ontology Generation Ying Ding Division of Mathematics and Computer Science Vrije Universiteit, Amsterdam (ying@cs.vu.nl) Schubert Foo Division of Information

More information

Metadata and the Semantic Web and CREAM 0

Metadata and the Semantic Web and CREAM 0 Metadata and the Semantic Web and CREAM 0 1 Siegfried Handschuh, 1;2 Steffen Staab, 1;3 Alexander Maedche 1 Institute AIFB, University of Karlsruhe, D-76128 Karlsruhe, Germany http://www.aifb.uni-karlsruhe.de/wbs

More information

A Tool-supported Methodology for Ontology-based Knowledge Management

A Tool-supported Methodology for Ontology-based Knowledge Management A Tool-supported Methodology for Ontology-based Knowledge Management York Sure Institute AIFB, University of Karlsruhe, D-76128 Karlsruhe, Germany http://www.aifb.uni-karlsruhe.de/wbs mailto:sure@aifb.uni-karlsruhe.de

More information

TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES

TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES Mu. Annalakshmi Research Scholar, Department of Computer Science, Alagappa University, Karaikudi. annalakshmi_mu@yahoo.co.in Dr. A.

More information

Shrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India

Shrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 3 ISSN : 2456-3307 Some Issues in Application of NLP to Intelligent

More information

Text Mining for Software Engineering

Text Mining for Software Engineering Text Mining for Software Engineering Faculty of Informatics Institute for Program Structures and Data Organization (IPD) Universität Karlsruhe (TH), Germany Department of Computer Science and Software

More information

Manually vs semiautomatic domain specific ontology building

Manually vs semiautomatic domain specific ontology building Facoltà di Lettere e Filosofia Corso di Laurea Specialistica in Comunicazione d impresa e pubblica Tesi di Laurea in Informatica per il Commercio Elettronico Manually vs semiautomatic domain specific ontology

More information

Data-Mining Algorithms with Semantic Knowledge

Data-Mining Algorithms with Semantic Knowledge Data-Mining Algorithms with Semantic Knowledge Ontology-based information extraction Carlos Vicient Monllaó Universitat Rovira i Virgili December, 14th 2010. Poznan A Project funded by the Ministerio de

More information

Ontology Construction from Text: Challenges and Trends

Ontology Construction from Text: Challenges and Trends Ontology Construction from Text: Challenges and Trends Abeer Al-Arfaj Department of Computer Science King Saud University Riyadh, Saudi Arabia AbdulMalik Al-Salman Department of Computer Science King Saud

More information

GernEdiT: A Graphical Tool for GermaNet Development

GernEdiT: A Graphical Tool for GermaNet Development GernEdiT: A Graphical Tool for GermaNet Development Verena Henrich University of Tübingen Tübingen, Germany. verena.henrich@unituebingen.de Erhard Hinrichs University of Tübingen Tübingen, Germany. erhard.hinrichs@unituebingen.de

More information

IR and AI: The role of ontology

IR and AI: The role of ontology IR and AI: The role of ontology Ying Ding Division of Mathematics & Computer Science Free University, Amsterdam De Boelelaan 1081a, 1081 HV Amsterdam, the Netherlands Email: ying@cs.vu.nl Abstract This

More information

Iterative Learning of Relation Patterns for Market Analysis with UIMA

Iterative Learning of Relation Patterns for Market Analysis with UIMA UIMA Workshop, GLDV, Tübingen, 09.04.2007 Iterative Learning of Relation Patterns for Market Analysis with UIMA Sebastian Blohm, Jürgen Umbrich, Philipp Cimiano, York Sure Universität Karlsruhe (TH), Institut

More information

Semantic Web. Ontology Engineering and Evaluation. Morteza Amini. Sharif University of Technology Fall 93-94

Semantic Web. Ontology Engineering and Evaluation. Morteza Amini. Sharif University of Technology Fall 93-94 ه عا ی Semantic Web Ontology Engineering and Evaluation Morteza Amini Sharif University of Technology Fall 93-94 Outline Ontology Engineering Class and Class Hierarchy Ontology Evaluation 2 Outline Ontology

More information

Knowledge Engineering with Semantic Web Technologies

Knowledge Engineering with Semantic Web Technologies This file is licensed under the Creative Commons Attribution-NonCommercial 3.0 (CC BY-NC 3.0) Knowledge Engineering with Semantic Web Technologies Lecture 5: Ontological Engineering 5.3 Ontology Learning

More information

Ngram Search Engine with Patterns Combining Token, POS, Chunk and NE Information

Ngram Search Engine with Patterns Combining Token, POS, Chunk and NE Information Ngram Search Engine with Patterns Combining Token, POS, Chunk and NE Information Satoshi Sekine Computer Science Department New York University sekine@cs.nyu.edu Kapil Dalwani Computer Science Department

More information

The Dictionary Parsing Project: Steps Toward a Lexicographer s Workstation

The Dictionary Parsing Project: Steps Toward a Lexicographer s Workstation The Dictionary Parsing Project: Steps Toward a Lexicographer s Workstation Ken Litkowski ken@clres.com http://www.clres.com http://www.clres.com/dppdemo/index.html Dictionary Parsing Project Purpose: to

More information

NLP - Based Expert System for Database Design and Development

NLP - Based Expert System for Database Design and Development NLP - Based Expert System for Database Design and Development U. Leelarathna 1, G. Ranasinghe 1, N. Wimalasena 1, D. Weerasinghe 1, A. Karunananda 2 Faculty of Information Technology, University of Moratuwa,

More information

Semantic Web. Ontology Engineering and Evaluation. Morteza Amini. Sharif University of Technology Fall 95-96

Semantic Web. Ontology Engineering and Evaluation. Morteza Amini. Sharif University of Technology Fall 95-96 ه عا ی Semantic Web Ontology Engineering and Evaluation Morteza Amini Sharif University of Technology Fall 95-96 Outline Ontology Engineering Class and Class Hierarchy Ontology Evaluation 2 Outline Ontology

More information

Ontology enrichment and indexing process

Ontology enrichment and indexing process Ontology enrichment and indexing process E. Desmontils, C. Jacquin, L. Simon Institut de Recherche en Informatique de Nantes 2, rue de la Houssinire B.P. 92208 44322 NANTES CEDEX 3 Ingénierie des Connaissances

More information

Text Mining. Munawar, PhD. Text Mining - Munawar, PhD

Text Mining. Munawar, PhD. Text Mining - Munawar, PhD 10 Text Mining Munawar, PhD Definition Text mining also is known as Text Data Mining (TDM) and Knowledge Discovery in Textual Database (KDT).[1] A process of identifying novel information from a collection

More information

Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator

Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator Vít Nováček, Pavel Smrž, Jan Pomikálek Faculty of Informatics, Masaryk University Botanická 68a, 602 00 Brno, Czech

More information

MIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion

MIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion MIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion Sara Lana-Serrano 1,3, Julio Villena-Román 2,3, José C. González-Cristóbal 1,3 1 Universidad Politécnica de Madrid 2 Universidad

More information

A GML SCHEMA MAPPING APPROACH TO OVERCOME SEMANTIC HETEROGENEITY IN GIS

A GML SCHEMA MAPPING APPROACH TO OVERCOME SEMANTIC HETEROGENEITY IN GIS A GML SCHEMA MAPPING APPROACH TO OVERCOME SEMANTIC HETEROGENEITY IN GIS Manoj Paul, S. K. Ghosh School of Information Technology, Indian Institute of Technology, Kharagpur 721302, India - (mpaul, skg)@sit.iitkgp.ernet.in

More information

A Framework for Ontology Life Cycle Management

A Framework for Ontology Life Cycle Management A Framework for Ontology Life Cycle Management Perakath Benjamin, Nitin Kumar, Ronald Fernandes, and Biyan Li Knowledge Based Systems, Inc., College Station, TX, USA Abstract - This paper describes a method

More information

Domain-specific Concept-based Information Retrieval System

Domain-specific Concept-based Information Retrieval System Domain-specific Concept-based Information Retrieval System L. Shen 1, Y. K. Lim 1, H. T. Loh 2 1 Design Technology Institute Ltd, National University of Singapore, Singapore 2 Department of Mechanical

More information

2 Experimental Methodology and Results

2 Experimental Methodology and Results Developing Consensus Ontologies for the Semantic Web Larry M. Stephens, Aurovinda K. Gangam, and Michael N. Huhns Department of Computer Science and Engineering University of South Carolina, Columbia,

More information

The HMatch 2.0 Suite for Ontology Matchmaking

The HMatch 2.0 Suite for Ontology Matchmaking The HMatch 2.0 Suite for Ontology Matchmaking S. Castano, A. Ferrara, D. Lorusso, and S. Montanelli Università degli Studi di Milano DICo - Via Comelico, 39, 20135 Milano - Italy {castano,ferrara,lorusso,montanelli}@dico.unimi.it

More information

Semantic Web. Ontology Alignment. Morteza Amini. Sharif University of Technology Fall 94-95

Semantic Web. Ontology Alignment. Morteza Amini. Sharif University of Technology Fall 94-95 ه عا ی Semantic Web Ontology Alignment Morteza Amini Sharif University of Technology Fall 94-95 Outline The Problem of Ontologies Ontology Heterogeneity Ontology Alignment Overall Process Similarity Methods

More information

CHAPTER 5 SEARCH ENGINE USING SEMANTIC CONCEPTS

CHAPTER 5 SEARCH ENGINE USING SEMANTIC CONCEPTS 82 CHAPTER 5 SEARCH ENGINE USING SEMANTIC CONCEPTS In recent years, everybody is in thirst of getting information from the internet. Search engines are used to fulfill the need of them. Even though the

More information

SEMANTIC MATCHING APPROACHES

SEMANTIC MATCHING APPROACHES CHAPTER 4 SEMANTIC MATCHING APPROACHES 4.1 INTRODUCTION Semantic matching is a technique used in computer science to identify information which is semantically related. In order to broaden recall, a matching

More information

Using ART2 Neural Network and Bayesian Network for Automating the Ontology Constructing Process

Using ART2 Neural Network and Bayesian Network for Automating the Ontology Constructing Process Available online at www.sciencedirect.com Procedia Engineering 29 (2012) 3914 3923 2012 International Workshop on Information and Electronics Engineering (IWIEE) Using ART2 Neural Network and Bayesian

More information

Clustering for Ontology Evolution

Clustering for Ontology Evolution Clustering for Ontology Evolution George Tsatsaronis, Reetta Pitkänen, and Michalis Vazirgiannis Department of Informatics, Athens University of Economics and Business, 76, Patission street, Athens 104-34,

More information

LexiRes: A Tool for Exploring and Restructuring EuroWordNet for Information Retrieval

LexiRes: A Tool for Exploring and Restructuring EuroWordNet for Information Retrieval LexiRes: A Tool for Exploring and Restructuring EuroWordNet for Information Retrieval Ernesto William De Luca and Andreas Nürnberger 1 Abstract. The problem of word sense disambiguation in lexical resources

More information

WEIGHTING QUERY TERMS USING WORDNET ONTOLOGY

WEIGHTING QUERY TERMS USING WORDNET ONTOLOGY IJCSNS International Journal of Computer Science and Network Security, VOL.9 No.4, April 2009 349 WEIGHTING QUERY TERMS USING WORDNET ONTOLOGY Mohammed M. Sakre Mohammed M. Kouta Ali M. N. Allam Al Shorouk

More information

Downloaded from jipm.irandoc.ac.ir at 5:49 IRDT on Sunday June 17th 2018

Downloaded from jipm.irandoc.ac.ir at 5:49 IRDT on Sunday June 17th 2018 5-83 ( ) 5-83 ( ) ISC SCOPUS L ISA http://jist.irandoc.ac.ir 390 5-3 - : fathian000@gmail.com : * 388/07/ 5 : 388/05/8 : :...... : 390..(Brank, Grobelnic, and Mladenic 005). Brank, ).(Grobelnic, and Mladenic

More information

A Survey of Ontology Learning Procedures

A Survey of Ontology Learning Procedures A Survey of Ontology Learning Procedures Lucas Drumond and Rosario Girardi Federal University of Maranhão, Computer Science Department, Av. dos Portugueses, São Luís, Brasil ldrumond@gmail.com,rgirardi@deinf.ufma.br

More information

Semantic Web. Ontology Alignment. Morteza Amini. Sharif University of Technology Fall 95-96

Semantic Web. Ontology Alignment. Morteza Amini. Sharif University of Technology Fall 95-96 ه عا ی Semantic Web Ontology Alignment Morteza Amini Sharif University of Technology Fall 95-96 Outline The Problem of Ontologies Ontology Heterogeneity Ontology Alignment Overall Process Similarity (Matching)

More information

Text Onto Miner - a Semi Automated Ontology Building System

Text Onto Miner - a Semi Automated Ontology Building System Text Onto Miner - a Semi Automated Ontology Building System Piotr Gawrysiak 1, Grzegorz Protaziuk 1, Henryk Rybinski 1 and Alexandre Delteil 2 1 ICS, Warsaw University of Technology 2 France Telecome R

More information

Question Answering Using XML-Tagged Documents

Question Answering Using XML-Tagged Documents Question Answering Using XML-Tagged Documents Ken Litkowski ken@clres.com http://www.clres.com http://www.clres.com/trec11/index.html XML QA System P Full text processing of TREC top 20 documents Sentence

More information

Knowledge Harvesting For Business

Knowledge Harvesting For Business 1 Knowledge Harvesting For Business Intelligence Nesrine Ben Mustapha and Marie-Aude Aufaure Ecole Centrale Paris, MAS Laboratory {Nesrine.Ben-Mustapha, Marie-Aude.Aufaure}@ecp.fr Summary. With the growth

More information

Semantic Interoperability. Being serious about the Semantic Web

Semantic Interoperability. Being serious about the Semantic Web Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA 1 Being serious about the Semantic Web It is not one person s ontology It is not several people s common

More information

2 Which Methodology for Building Ontologies? 2.1 A Work Still in Progress Many approaches (for a complete survey, the reader can refer to the OntoWeb

2 Which Methodology for Building Ontologies? 2.1 A Work Still in Progress Many approaches (for a complete survey, the reader can refer to the OntoWeb Semantic Commitment for Designing Ontologies: A Proposal Bruno Bachimont 1,Antoine Isaac 1;2, Raphaël Troncy 1;3 1 Institut National de l'audiovisuel, Direction de la Recherche 4, Av. de l'europe - 94366

More information

Refining Ontologies by Pattern-Based Completion

Refining Ontologies by Pattern-Based Completion Refining Ontologies by Pattern-Based Completion Nadejda Nikitina and Sebastian Rudolph and Sebastian Blohm Institute AIFB, University of Karlsruhe D-76128 Karlsruhe, Germany {nikitina, rudolph, blohm}@aifb.uni-karlsruhe.de

More information

Requirements for Information Extraction for Knowledge Management

Requirements for Information Extraction for Knowledge Management Requirements for Information Extraction for Knowledge Management Philipp Cimiano*, Fabio Ciravegna, John Domingue, Siegfried Handschuh*, Alberto Lavelli +, Steffen Staab*, Mark Stevenson AIFB, University

More information

Background and Context for CLASP. Nancy Ide, Vassar College

Background and Context for CLASP. Nancy Ide, Vassar College Background and Context for CLASP Nancy Ide, Vassar College The Situation Standards efforts have been on-going for over 20 years Interest and activity mainly in Europe in 90 s and early 2000 s Text Encoding

More information

KOSO: A Reference-Ontology for Reuse of Existing Knowledge Organization Systems

KOSO: A Reference-Ontology for Reuse of Existing Knowledge Organization Systems KOSO: A Reference-Ontology for Reuse of Existing Knowledge Organization Systems International Workshop on Knowledge Reuse and Reengineering over the Semantic Web (KRRSW 2008) ESWC 2008 Tenerife, Spain,

More information

Prediction-Based NLP System by Boyer-Moore Algorithm for Requirements Elicitation

Prediction-Based NLP System by Boyer-Moore Algorithm for Requirements Elicitation Prediction-Based NLP System by Boyer-Moore Algorithm for Requirements Elicitation Dr A.Sumithra 1, K.Poongothai 2, Dr S.Gavaskar 3 1 Associate Professor, Dept of Computer Science & Engineering, VSB College

More information

Schema-based Semantic Matching: Algorithms, a System and a Testing Methodology

Schema-based Semantic Matching: Algorithms, a System and a Testing Methodology Schema-based Semantic Matching: Algorithms, a System and a Testing Methodology Abstract. Schema/ontology/classification matching is a critical problem in many application domains, such as, schema/ontology/classification

More information

Semi-automatic creation of domain ontologies with centroid based crawlers. Carel Fenijn

Semi-automatic creation of domain ontologies with centroid based crawlers. Carel Fenijn Semi-automatic creation of domain ontologies with centroid based crawlers Carel Fenijn Graduate Thesis Doctoraal Linguistics Utrecht University, December 2007 Contents...................................

More information

Survey of Semantic Annotation Platforms

Survey of Semantic Annotation Platforms Survey of Semantic Annotation Platforms Lawrence Reeve College of Information Science and Technology Drexel University Philadelphia, PA 19104 USA larry.reeve@drexel.edu Hyoil Han College of Information

More information

Computer-assisted Ontology Construction System: Focus on Bootstrapping Capabilities

Computer-assisted Ontology Construction System: Focus on Bootstrapping Capabilities Computer-assisted Ontology Construction System: Focus on Bootstrapping Capabilities Omar Qawasmeh 1, Maxime Lefranois 2, Antoine Zimmermann 2, Pierre Maret 1 1 Univ. Lyon, CNRS, Lab. Hubert Curien UMR

More information

Personalized Terms Derivative

Personalized Terms Derivative 2016 International Conference on Information Technology Personalized Terms Derivative Semi-Supervised Word Root Finder Nitin Kumar Bangalore, India jhanit@gmail.com Abhishek Pradhan Bangalore, India abhishek.pradhan2008@gmail.com

More information

A MODEL-DRIVEN APPROACH OF ONTOLOGICAL COMPONENTS FOR ON- LINE SEMANTIC WEB INFORMATION RETRIEVAL

A MODEL-DRIVEN APPROACH OF ONTOLOGICAL COMPONENTS FOR ON- LINE SEMANTIC WEB INFORMATION RETRIEVAL Journal of Web Engineering, Vol. 6, No.4 (2007) 303-329 Rinton Press A MODEL-DRIVEN APPROACH OF ONTOLOGICAL COMPONENTS FOR ON- LINE SEMANTIC WEB INFORMATION RETRIEVAL HAJER BAAZAOUI ZGHAL 1, MARIE-AUDE

More information

Let s get parsing! Each component processes the Doc object, then passes it on. doc.is_parsed attribute checks whether a Doc object has been parsed

Let s get parsing! Each component processes the Doc object, then passes it on. doc.is_parsed attribute checks whether a Doc object has been parsed Let s get parsing! SpaCy default model includes tagger, parser and entity recognizer nlp = spacy.load('en ) tells spacy to use "en" with ["tagger", "parser", "ner"] Each component processes the Doc object,

More information

Eurown: an EuroWordNet module for Python

Eurown: an EuroWordNet module for Python Eurown: an EuroWordNet module for Python Neeme Kahusk Institute of Computer Science University of Tartu, Liivi 2, 50409 Tartu, Estonia neeme.kahusk@ut.ee Abstract The subject of this demo is a Python module

More information

COMP90042 LECTURE 3 LEXICAL SEMANTICS COPYRIGHT 2018, THE UNIVERSITY OF MELBOURNE

COMP90042 LECTURE 3 LEXICAL SEMANTICS COPYRIGHT 2018, THE UNIVERSITY OF MELBOURNE COMP90042 LECTURE 3 LEXICAL SEMANTICS SENTIMENT ANALYSIS REVISITED 2 Bag of words, knn classifier. Training data: This is a good movie.! This is a great movie.! This is a terrible film. " This is a wonderful

More information

Bibster A Semantics-Based Bibliographic Peer-to-Peer System

Bibster A Semantics-Based Bibliographic Peer-to-Peer System Bibster A Semantics-Based Bibliographic Peer-to-Peer System Peter Haase 1, Björn Schnizler 1, Jeen Broekstra 2, Marc Ehrig 1, Frank van Harmelen 2, Maarten Menken 2, Peter Mika 2, Michal Plechawski 3,

More information

Web Services Annotation and Reasoning

Web Services Annotation and Reasoning Web Services Annotation and Reasoning, W3C Workshop on Frameworks for Semantics in Web Services Web Services Annotation and Reasoning Peter Graubmann, Evelyn Pfeuffer, Mikhail Roshchin Siemens AG, Corporate

More information

KAON The Karlsruhe Ontology and Semantic Web Meta Project

KAON The Karlsruhe Ontology and Semantic Web Meta Project KAON The Karlsruhe Ontology and Semantic Web Meta Project Alexander Maedche 1 & Steffen Staab 2 1 Forschungszentrum Informatik, Karlsruhe, Germany, http://www.fzi.de/wim 2 Institut AIFB, Universität Karlsruhe,

More information

Data Collection & Data Preprocessing

Data Collection & Data Preprocessing Data Collection & Data Preprocessing Bayu Distiawan Natural Language Processing & Text Mining Short Course Pusat Ilmu Komputer UI 22 26 Agustus 2016 DATA COLLECTION Fakultas Ilmu Komputer Universitas Indonesia

More information

Web Information Retrieval using WordNet

Web Information Retrieval using WordNet Web Information Retrieval using WordNet Jyotsna Gharat Asst. Professor, Xavier Institute of Engineering, Mumbai, India Jayant Gadge Asst. Professor, Thadomal Shahani Engineering College Mumbai, India ABSTRACT

More information

Conceptual document indexing using a large scale semantic dictionary providing a concept hierarchy

Conceptual document indexing using a large scale semantic dictionary providing a concept hierarchy Conceptual document indexing using a large scale semantic dictionary providing a concept hierarchy Martin Rajman, Pierre Andrews, María del Mar Pérez Almenta, and Florian Seydoux Artificial Intelligence

More information

Leopold Franzens University Innsbruck. Ontology Learning. Institute of Computer Science STI - Innsbruck. Seminar Paper

Leopold Franzens University Innsbruck. Ontology Learning. Institute of Computer Science STI - Innsbruck. Seminar Paper Leopold Franzens University Innsbruck Institute of Computer Science STI - Innsbruck Ontology Learning Seminar Paper Applied Ontology Engineering (WS 2010) Supervisor: Dr. Katharina Siorpaes Michael Rogger

More information

Ontology Development and Evolution: Selected Approaches for Small-Scale Application Contexts

Ontology Development and Evolution: Selected Approaches for Small-Scale Application Contexts : Selected Approaches for Small-Scale Application Contexts Annika Öhgren ISSN 1404-0018 Research Report 04:7 : Selected Approaches for Small-Scale Application Contexts Annika Öhgren Information Engineering

More information

A Methodology for Extracting Knowledge about Controlled Vocabularies from Textual Data using FCA-Based Ontology Engineering

A Methodology for Extracting Knowledge about Controlled Vocabularies from Textual Data using FCA-Based Ontology Engineering A Methodology for Extracting Knowledge about Controlled Vocabularies from Textual Data using FCA-Based Ontology Engineering 1 st Simin Jabbari Information Management Institute University of Neuchâtel Neuchâtel

More information

Dynamic Ontology Evolution

Dynamic Ontology Evolution Dynamic Evolution Fouad Zablith Knowledge Media Institute (KMi), The Open University. Walton Hall, Milton Keynes, MK7 6AA, United Kingdom. f.zablith@open.ac.uk Abstract. Ontologies form the core of Semantic

More information

A Comprehensive Analysis of using Semantic Information in Text Categorization

A Comprehensive Analysis of using Semantic Information in Text Categorization A Comprehensive Analysis of using Semantic Information in Text Categorization Kerem Çelik Department of Computer Engineering Boğaziçi University Istanbul, Turkey celikerem@gmail.com Tunga Güngör Department

More information

Accessing Distributed Learning Repositories through a Courseware Watchdog

Accessing Distributed Learning Repositories through a Courseware Watchdog Accessing Distributed Learning Repositories through a Courseware Watchdog Christoph Schmitz, Steffen Staab, Rudi Studer, Gerd Stumme, Julien Tane Learning Lab Lower Saxony (L3S), Expo Plaza 1, D--30539

More information

Semantic Web Technologies Trends and Research in Ontology-based Systems

Semantic Web Technologies Trends and Research in Ontology-based Systems Semantic Web Technologies Trends and Research in Ontology-based Systems John Davies BT, UK Rudi Studer University of Karlsruhe, Germany Paul Warren BT, UK John Wiley & Sons, Ltd Contents Foreword xi 1.

More information

InsMT / InsMTL Results for OAEI 2014 Instance Matching

InsMT / InsMTL Results for OAEI 2014 Instance Matching InsMT / InsMTL Results for OAEI 2014 Instance Matching Abderrahmane Khiat 1, Moussa Benaissa 1 1 LITIO Lab, University of Oran, BP 1524 El-Mnaouar Oran, Algeria abderrahmane_khiat@yahoo.com moussabenaissa@yahoo.fr

More information

Learning to find transliteration on the Web

Learning to find transliteration on the Web Learning to find transliteration on the Web Chien-Cheng Wu Department of Computer Science National Tsing Hua University 101 Kuang Fu Road, Hsin chu, Taiwan d9283228@cs.nthu.edu.tw Jason S. Chang Department

More information

Ontology Population and Enrichment: State of the Art

Ontology Population and Enrichment: State of the Art Ontology Population and Enrichment: State of the Art Georgios Petasis, Vangelis Karkaletsis, Georgios Paliouras, Anastasia Krithara, and Elias Zavitsanos Institute of Informatics and Telecommunications,

More information

A Lightweight Approach to Semantic Tagging

A Lightweight Approach to Semantic Tagging A Lightweight Approach to Semantic Tagging Nadzeya Kiyavitskaya, Nicola Zeni, Luisa Mich, John Mylopoulus Department of Information and Communication Technologies, University of Trento Via Sommarive 14,

More information

Towards On-the-fly Ontology Construction - Focusing on Ontology Quality Improvement

Towards On-the-fly Ontology Construction - Focusing on Ontology Quality Improvement Towards On-the-fly Ontology Construction - Focusing on Ontology Quality Improvement Naoki Sugiura 1, Yoshihiro Shigeta 1 Naoki Fukuta 1, Noriaki Izumi 2, and Takahira Yamaguchi 1 1 Shizuoka University,

More information

Re-designing Online Terminology Resources for German Grammar

Re-designing Online Terminology Resources for German Grammar Re-designing Online Terminology Resources for German Grammar Project Report Karolina Suchowolec, Christian Lang, and Roman Schneider Institut für Deutsche Sprache (IDS), Mannheim, Germany {suchowolec,

More information

Korean NLP2RDF Resources

Korean NLP2RDF Resources Korean NLP2RDF Resources YoungG yun Hahm 1 K yung taelim 1 YoonYongun 2 Jung yeul Park 3 Ke y Sun Choi 1,2 (1) Division of Web Science and Technology, KAIST, Daejeon, South Korea (2) Departmentt of Computer

More information

FOAM Framework for Ontology Alignment and Mapping Results of the Ontology Alignment Evaluation Initiative

FOAM Framework for Ontology Alignment and Mapping Results of the Ontology Alignment Evaluation Initiative FOAM Framework for Ontology Alignment and Mapping Results of the Ontology Alignment Evaluation Initiative Marc Ehrig Institute AIFB University of Karlsruhe 76128 Karlsruhe, Germany ehrig@aifb.uni-karlsruhe.de

More information

Enriching Ontology Concepts Based on Texts from WWW and Corpus

Enriching Ontology Concepts Based on Texts from WWW and Corpus Journal of Universal Computer Science, vol. 18, no. 16 (2012), 2234-2251 submitted: 18/2/11, accepted: 26/8/12, appeared: 28/8/12 J.UCS Enriching Ontology Concepts Based on Texts from WWW and Corpus Tarek

More information

A Semantic Role Repository Linking FrameNet and WordNet

A Semantic Role Repository Linking FrameNet and WordNet A Semantic Role Repository Linking FrameNet and WordNet Volha Bryl, Irina Sergienya, Sara Tonelli, Claudio Giuliano {bryl,sergienya,satonelli,giuliano}@fbk.eu Fondazione Bruno Kessler, Trento, Italy Abstract

More information

Ontology Development. Qing He

Ontology Development. Qing He A tutorial report for SENG 609.22 Agent Based Software Engineering Course Instructor: Dr. Behrouz H. Far Ontology Development Qing He 1 Why develop an ontology? In recent years the development of ontologies

More information

WordNet-based User Profiles for Semantic Personalization

WordNet-based User Profiles for Semantic Personalization PIA 2005 Workshop on New Technologies for Personalized Information Access WordNet-based User Profiles for Semantic Personalization Giovanni Semeraro, Marco Degemmis, Pasquale Lops, Ignazio Palmisano LACAM

More information

NLP Final Project Fall 2015, Due Friday, December 18

NLP Final Project Fall 2015, Due Friday, December 18 NLP Final Project Fall 2015, Due Friday, December 18 For the final project, everyone is required to do some sentiment classification and then choose one of the other three types of projects: annotation,

More information

Organizing Resources on Tagging Systems using T-ORG

Organizing Resources on Tagging Systems using T-ORG Organizing Resources on Tagging Systems using T-ORG Rabeeh Abbasi 1, Steffen Staab 1 and Philipp Cimiano 2 1 ISWEB, University of Koblenz-Landa Germany http://isweb.uni-koblenz.de {abbasi,staab}@uni-koblenz.de

More information

AROMA results for OAEI 2009

AROMA results for OAEI 2009 AROMA results for OAEI 2009 Jérôme David 1 Université Pierre-Mendès-France, Grenoble Laboratoire d Informatique de Grenoble INRIA Rhône-Alpes, Montbonnot Saint-Martin, France Jerome.David-at-inrialpes.fr

More information

A Linguistic Approach for Semantic Web Service Discovery

A Linguistic Approach for Semantic Web Service Discovery A Linguistic Approach for Semantic Web Service Discovery Jordy Sangers 307370js jordysangers@hotmail.com Bachelor Thesis Economics and Informatics Erasmus School of Economics Erasmus University Rotterdam

More information

EFFICIENT INTEGRATION OF SEMANTIC TECHNOLOGIES FOR PROFESSIONAL IMAGE ANNOTATION AND SEARCH

EFFICIENT INTEGRATION OF SEMANTIC TECHNOLOGIES FOR PROFESSIONAL IMAGE ANNOTATION AND SEARCH EFFICIENT INTEGRATION OF SEMANTIC TECHNOLOGIES FOR PROFESSIONAL IMAGE ANNOTATION AND SEARCH Andreas Walter FZI Forschungszentrum Informatik, Haid-und-Neu-Straße 10-14, 76131 Karlsruhe, Germany, awalter@fzi.de

More information