Semantic Searching. John Winder CMSC 676 Spring 2015

Similar documents
IJCSC Volume 5 Number 1 March-Sep 2014 pp ISSN

SWSE: Objects before documents!

Shrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India

Maximizing the Value of STM Content through Semantic Enrichment. Frank Stumpf December 1, 2009

Comparison of Question Answering Systems Based on Ontology and Semantic Web in Different Environment

What can be done with the Semantic Web? An Overview of Watson-based Applications

Open Data Search Framework based on Semi-structured Query Patterns

Implementing a Variety of Linguistic Annotations

SCIENCE & TECHNOLOGY

GoNTogle: A Tool for Semantic Annotation and Search

Using idocument for Document Categorization in Nepomuk Social Semantic Desktop

An Integrated Digital Tool for Accessing Language Resources

An Ontology-Based Information Retrieval Model

Adaptive Model of Personalized Searches using Query Expansion and Ant Colony Optimization in the Digital Library

<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany

The Semantic Web: A New Opportunity and Challenge for Human Language Technology

INTERCONNECTING AND MANAGING MULTILINGUAL LEXICAL LINKED DATA. Ernesto William De Luca

Semantic-Based Information Retrieval for Java Learning Management System

Information Retrieval (IR) through Semantic Web (SW): An Overview

ONTOPARK: ONTOLOGY BASED PAGE RANKING FRAMEWORK USING RESOURCE DESCRIPTION FRAMEWORK

Semantic Annotation, Search and Analysis

Motivating Ontology-Driven Information Extraction

Semantic Search meets the Web

Semantic Web and Natural Language Processing

SemSearch: Refining Semantic Search

GoNTogle: A Tool for Semantic Annotation and Search

SEMANTIC WEB POWERED PORTAL INFRASTRUCTURE

Prior Art Retrieval Using Various Patent Document Fields Contents

Domain-specific Concept-based Information Retrieval System

Ontology Matching with CIDER: Evaluation Report for the OAEI 2008

Knowledge and Ontological Engineering: Directions for the Semantic Web

Exploring Semantic Constraints for Document Retrieval

The INCEpTION Platform: Machine-Assisted and Knowledge-Oriented Interactive Annotation

Finding Topic-centric Identified Experts based on Full Text Analysis

Incorporating ontological background knowledge into Information Extraction

Ontology Based Prediction of Difficult Keyword Queries

MEASUREMENT OF SEMANTIC SIMILARITY BETWEEN WORDS: A SURVEY

Semantic Web Applications and the Semantic Web in 10 Years. Based on work of Grigoris Antoniou, Frank van Harmelen

Constructing Virtual Documents for Keyword Based Concept Search in Web Ontology

OwlExporter. Guide for Users and Developers. René Witte Ninus Khamis. Release 1.0-beta2 May 16, 2010

Linked Data and cultural heritage data: an overview of the approaches from Europeana and The European Library

e-issn: p-issn:

Knowledge-based Word Sense Disambiguation using Topic Models Devendra Singh Chaplot

An Ontology Based Question Answering System on Software Test Document Domain

CHAPTER 1 INTRODUCTION

On Measuring the Lattice of Commonalities Among Several Linked Datasets

Semantic Web Fundamentals

Chapter 27 Introduction to Information Retrieval and Web Search

Reducing Consumer Uncertainty

Exploring the Use of Semantic Technologies for Cross-Search of Archaeological Grey Literature and Data

WASA: A Web Application for Sequence Annotation

Key-value stores. Berkeley DB. Bigtable

NLP AND ONTOLOGY MATCHING: A SUCCESSFUL COMBINATION FOR TRIALOGICAL LEARNING

STS Infrastructural considerations. Christian Chiarcos

Domain Independent Knowledge Base Population From Structured and Unstructured Data Sources

MSc Advanced Computer Science School of Computer Science The University of Manchester

Context Sensitive Search Engine

The Security Role for Content Analysis

Contributions to the Study of Semantic Interoperability in Multi-Agent Environments - An Ontology Based Approach

Text Mining. Munawar, PhD. Text Mining - Munawar, PhD

Open Research Online The Open University s repository of research publications and other research outputs

What you have learned so far. Interoperability. Ontology heterogeneity. Being serious about the semantic web

Dynamic Ontology Evolution

Development of an Ontology-Based Portal for Digital Archive Services

Linked Data Semantic Web Technologies 1 (2010/2011)

Enabling Semantic Search in Large Open Source Communities

An Archiving System for Managing Evolution in the Data Web

Semantic Interoperability. Being serious about the Semantic Web

A SURVEY OF DIFFERENT SEMANTIC AND ONTOLOGY BASED QUESTION ANSWERING SYSTEM

Semantic Web. Tahani Aljehani

Natural Language Query Processing for SPARQL generation - a Prototype System for SNOMEDCT

Semantics. KR4SW Winter 2011 Pascal Hitzler 1

EFFICIENT INTEGRATION OF SEMANTIC TECHNOLOGIES FOR PROFESSIONAL IMAGE ANNOTATION AND SEARCH

Metadata Topic Harmonization and Semantic Search for Linked-Data-Driven Geoportals -- A Case Study Using ArcGIS Online

Survey of Semantic Annotation Platforms

Weaving the Pedantic Web - Information Quality on the Web of Data

Learning Ontology-Based User Profiles: A Semantic Approach to Personalized Web Search

Simple library thesaurus alignment with SILAS

Challenges for the Multilingual Semantic Web

A SEMANTIC MATCHMAKER SERVICE ON THE GRID

Keyword Search in RDF Databases

TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES

A Keyword-Based on Semantic Web Search Engine

Query Expansion using Wikipedia and DBpedia

A service based on Linked Data to classify Web resources using a Knowledge Organisation System

An ontology-based approach for semantics ranking of the web search engines results

Linking Entities in Chinese Queries to Knowledge Graph

Automatic Ontology-Based Document Annotation for Arabic Information Retrieval

Proposal for Implementing Linked Open Data on Libraries Catalogue

QUERY EXPANSION USING WORDNET WITH A LOGICAL MODEL OF INFORMATION RETRIEVAL

NEW DEVELOPMENTS ON THE NEWISTIC PLATFORM. Horatiu Mocian and Ovidiu Dan. Short paper

Measuring Semantic Similarity between Words Using Page Counts and Snippets

Semantic Cloud Generation based on Linked Data for Efficient Semantic Annotation

YARS2: A Federated Repository for Querying Graph Structured Data from the Web

Exploring and Using the Semantic Web

Performance Assessment using Text Mining

A rule-based approach to address semantic accuracy problems on Linked Data

Bridges To Computing

Watson & WMR2017. (slides mostly derived from Jim Hendler and Simon Ellis, Rensselaer Polytechnic Institute, or from IBM itself)

The Documentalist Support System: a Web-Services based Tool for Semantic Annotation and Browsing

Transcription:

Semantic Searching John Winder CMSC 676 Spring 2015

Semantic Searching searching and retrieving documents by their semantic, conceptual, and contextual meanings Motivations: to do disambiguation to improve retrieval accuracy precision and recall to unite the Semantic Web

Semantic Web Standardizations data formats and schemas XML, RDF query languages RDQL, SPARQL Key Ideas metadata ontologies The Semantic Web Stack

Ontology An ontology is a knowledge base models hierarchies, relationships (is-a, has-a) uses formal languages (inspired by databases) Examples: A query to retrieve a list of paintings and their painters. WordNet (dog is-a canine is-a carnivore, etc.) ConceptNet

Main Advancements Vector Space model Boolean model no partial matching no clear ranking method requires parallel metadata rank by TF-IDF Semi-structured Vector Space Semantic Search System by Vallet et al. [2005] fully structured ontological mapping has worse recall keyword searching is flexible but has worse precision

Main Advancements (cont.) Query Expansion searching by meaning, beyond literal keywords given a query, map into ontology, find new relations returns documents even without search keywords being present in the documents examples: presidents of the French government reports on flooding for cities in Asia with populations under 50,000

Main Advancements (cont.) Generating queries search by keyword parses out entity/relations Semantic Ranking by entity ReConRank in SWSE by relationship by document annotations (Swoogle) Ontology-Based Semantic Search System by Fernandez et al. [2011]

Mimir: Semantic Search at Scale (2015) Mimir, annotation-based semantic search uses GATE to do NLP, extract entities/relationships open source, distributed (federated) system complex query parsing, indexing at three levels: tokens, annotations, sub-annotations applied to real world corpora (over 150 million docs) immunology dataset patent dataset, searching for prior art

Future Applications Recommender Systems build user profiles, use history to inform results Sentiment Analysis disambiguation to spot outliers in word usage Reasoning (Artificial Intelligence) inference: discovering new facts using ontologies to build... more ontologies

References Castells, Pablo, Fernandez, Miriam, and Vallet, David. An adaptation of the vector-space model for ontology-based information retrieval. Knowledge and Data Engineering, IEEE Transactions on, 19(2):261 272, 2007. Cunningham, Hamish, Maynard, Diana, Bontcheva, Kalina, and Tablan, Valentin. Gate: an architecture for development of robust hlt applications. In Proceedings of the 40th annual meeting on association for computational linguistics, pp. 168 175. Association for Computational Linguistics, 2002. Fernandez, Miriam, Cantador, Ivan, Lopez, Vanesa, Vallet, David, Castells, Pablo, and Motta, Enrico. Semantically enhanced information retrieval: an ontology-based approach. Web Semantics: Science, Services and Agents on the World Wide Web, 9(4):434 452, 2011. Havasi, Catherine, Speer, Robert, and Alonso, Jason. Conceptnet 3: a flexible, multilingual semantic network for common sense knowledge. In Recent advances in natural language processing, pp. 27 29, 2007. Hogan, Aidan, Harth, Andreas, Umbrich, Jrgen, Kinsella, Sheila, Polleres, Axel, and Decker, Stefan. Searching and browsing linked data with swse: The semantic web search engine. Web Semantics: Science, Services and Agents on the World Wide Web, 9(4):365 401, 2011. ISSN 1570-8268. doi:http://dx. doi.org/10.1016/j.websem.2011.06.004. fjwsg special issue on Semantic Search. Miller, George A. Wordnet: a lexical database for english. Communications of the ACM, 38(11):39 41, 1995. Russell, Stuart and Norvig, Peter. Artificial intelligence: A modern approach. Prentice-Hall, Englewood Cliffs, 3, 2003. Styltsvig, Henrik Bulskov. Ontology-based information retrieval. PhD thesis, Roskilde University, Denmark, 2006. Tablan, Valentin, Bontcheva, Kalina, Roberts, Ian, and Cunningham, Hamish. Mmir: An open-source semantic search framework for interactive information seeking and discovery. Web Semantics: Science, Services and Agents on the World Wide Web, 30(0):52 68, 2015. ISSN 1570-8268. doi: http://dx.doi.org/10.1016/j.websem.2014.10.002. Semantic Search. Vallet, David, Fernandez, Miriam, and Castells, Pablo. An ontology-based information retrieval model. In The Semantic Web: Research and Applications, pp. 455 470. Springer, 2005.