Information Retrieval and Knowledge Organisation

Similar documents
INFORMATION RETRIEVAL SYSTEM: CONCEPT AND SCOPE

Modelling in Enterprise Architecture. MSc Business Information Systems

Definitions. Lecture Objectives. Text Technologies for Data Science INFR Learn about main concepts in IR 9/19/2017. Instructor: Walid Magdy

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A.

DL User Interfaces. Giuseppe Santucci Dipartimento di Informatica e Sistemistica Università di Roma La Sapienza

Opus: University of Bath Online Publication Store

Information Retrieval CS Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science

Enterprise Knowledge Map: Toward Subject Centric Computing. March 21st, 2007 Dmitry Bogachev

Modeling Data and Documents

BBK3253 Knowledge Management Prepared by Dr Khairul Anuar

Document Clustering for Mediated Information Access The WebCluster Project

Enhanced retrieval using semantic technologies:

Information Retrieval CSCI

Content Enrichment. An essential strategic capability for every publisher. Enriched content. Delivered.

INFSCI 2140 Information Storage and Retrieval Lecture 2: Models of Information Retrieval: Boolean model. Final Group Projects

SDMX GLOBAL CONFERENCE

Business Architecture Implementation

More about Databases. Topics. More Taxonomy. And more Taxonomy. Computer Literacy 1 Lecture 18 30/10/2008

Multi-agent Semantic Web Systems: Data & Metadata

Information Retrieval (Part 1)

The Semantic Web DEFINITIONS & APPLICATIONS

INFS 427: AUTOMATED INFORMATION RETRIEVAL (1 st Semester, 2018/2019)

Information Retrieval and Organisation

SKOS. COMP62342 Sean Bechhofer

Semantic Annotation, Search and Analysis

Semantic Web. Tahani Aljehani

From Open Data to Data- Intensive Science through CERIF

Contextual Information Retrieval Using Ontology-Based User Profiles

Ontologies SKOS. COMP62342 Sean Bechhofer

Making Information Findable

Copyright 2012 Taxonomy Strategies. All rights reserved. Semantic Metadata. A Tale of Two Types of Vocabularies

Chapter 2 - Concepts and Definitions

A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2

Ontology-Based Web Query Classification for Research Paper Searching

Chapter 27 Introduction to Information Retrieval and Web Search

Advanced Topics in Software Engineering (02265) Ekkart Kindler

ISSUES IN INFORMATION RETRIEVAL Brian Vickery. Presentation at ISKO meeting on June 26, 2008 At University College, London

Metadata for Digital Collections: A How-to-Do-It Manual

Ontology-based Architecture Documentation Approach

7. Relational Calculus (Part I) 7.1 Introduction

For each use case, the business need, usage scenario and derived requirements are stated. 1.1 USE CASE 1: EXPLORE AND SEARCH FOR SEMANTIC ASSESTS

Ontology-based Navigation of Bibliographic Metadata: Example from the Food, Nutrition and Agriculture Journal

Information Retrieval

Envisioning Semantic Web Technology Solutions for the Arts

Barcode is a machine readable strip for automatic identification of items, by means of printed bars of different widths

- What we actually mean by documents (the FRBR hierarchy) - What are the components of documents

AutoFocus, an Open Source Facet-Driven Enterprise Search Solution

Social Search Networks of People and Search Engines. CS6200 Information Retrieval

COLUMN. Choosing the right CMS authoring tools. Three key criteria will determine the most suitable authoring environment NOVEMBER 2003

Things to consider when using Semantics in your Information Management strategy. Toby Conrad Smartlogic

In the following, we examine Find Database, Find ejournal, QuickSearch, MetaSearch, and My Space in NELLI.

Adaptable and Adaptive Web Information Systems. Lecture 1: Introduction

B2SAFE metadata management

Teiid Designer User Guide 7.5.0

Cambridge Books Online (CBO)

Library resources in philology

Wondering about either OWL ontologies or SKOS vocabularies? You need both!

Dynamic Models - A case study in developing curriculum regulation and conformity using Protege

Multimedia Information Systems

Introduction to Information Retrieval

Jumpstarting the Semantic Web

THE LIBRARY OF TRINITY COLLEGE DUBLIN, THE UNIVERSITY OF DUBLIN LEABHARLANN CHOLÁISTE NA TRÍONÓIDE, OLLSCOIL ÁTHA CLIATH

Information Retrieval

Individual Project. Agnieszka Jastrzębska Władysław Homenda Lucjan Stapp

Information Retrieval. Session 11 LBSC 671 Creating Information Infrastructures

TEXT PREPROCESSING FOR TEXT MINING USING SIDE INFORMATION

Preface...xi Coverage of this edition...xi Acknowledgements...xiii

Information mining and information retrieval : methods and applications

INTELLIGENT SYSTEMS OVER THE INTERNET

Semantic Web Search Model for Information Retrieval of the Semantic Data *

CHAPTER THREE INFORMATION RETRIEVAL SYSTEM

Lecture 1/2. Copyright 2007 STI - INNSBRUCK

The World Bank Enterprise Search Program. Luisita Guanlao The World Bank Group May 10, 2005

The Semantics of Semantic Interoperability: A Two-Dimensional Approach for Investigating Issues of Semantic Interoperability in Digital Libraries

FIBO Metadata in Ontology Mapping

Empowering People with Knowledge the Next Frontier for Web Search. Wei-Ying Ma Assistant Managing Director Microsoft Research Asia

CONTENTdm Basic Skills 1: Getting Started with CONTENTdm

THE LIBRARY OF TRINITY COLLEGE DUBLIN, THE UNIVERSITY OF DUBLIN LEABHARLANN CHOLÁISTE NA TRÍONÓIDE, OLLSCOIL ÁTHA CLIATH

Data is the new Oil (Ann Winblad)

or: How to be your own Good Fairy

@dvanced document management Holstraat 19, B-9000 Gent, tel: 09/ , fax: 09/

Semantic Image Retrieval Based on Ontology and SPARQL Query

Taxonomies and controlled vocabularies best practices for metadata

0.1 Upper ontologies and ontology matching

IMPROVING INFORMATION RETRIEVAL BASED ON QUERY CLASSIFICATION ALGORITHM

Eloquent WebSuite Planning Guide

Comp 336/436 - Markup Languages. Fall Semester Week 2. Dr Nick Hayward

A Content Based Image Retrieval System Based on Color Features

Guidance for data entry in Deakin Research Online

The Dublin Core Metadata Element Set

Semantic media application with user created content to enhance enjoying cultural heritage

Lockheed Martin DAML Project

Semantic Web Knowledge Representation in the Web Context. CS 431 March 24, 2008 Carl Lagoze Cornell University

Taxonomy Tools: Collaboration, Creation & Integration. Dow Jones & Company

Search Engines. Charles Severance

Table of contents for The organization of information / Arlene G. Taylor and Daniel N. Joudrey.

Comp 336/436 - Markup Languages. Fall Semester Week 4. Dr Nick Hayward

Ontology Summit2007 Survey Response Analysis. Ken Baclawski Northeastern University

Conceptual Interpretation of LOM Semantics and its Mapping to Upper Level Ontologies

Libraries, Repositories and Metadata. Lara Whitelaw, Metadata Development Manager, The Open University. 22 Oct 2003

Transcription:

Information Retrieval and Knowledge Organisation Knut Hinkelmann Content Information Retrieval Indexing (string search and computer-linguistic aproach) Classical Information Retrieval: Boolean, vector space model Extensions of classic retrieval methods Evaluating search methods User adaptation and feedback Knowledge Organisation: Thesaurus Associative Search Metadata and meta Classification schemes Information extraction Case-based reasoning Topic Maps Prof. Dr. Knut Hinkelmann 1

1 Introduction Information Retrieval and Knowledge Management: Process-based Organisational Memory maturisation implicit explicit tacit in heads of people self-aware in heads of people documented in documents/ databases formal program code bases people organisation information technology Information Retrieval (IR) deals with the representation, storage, organisation of, and access to information items (= explicit ) Prof. Dr. Knut Hinkelmann 3

Information Need Representation and organisation of information items should provide the user with easy access to the information items, he/she is interested in Characterisation of the user information need is not a simple problem Example: The user has to translate the information need into a query which can be processed by a search engine (IR system) Prof. Dr. Knut Hinkelmann 4 Key Goal of Information Retrieval Key goal of Information Retrieval: Given a user query, retrieve information that might be relevant to the user. query information items IR system relevant information items user In its most common form, the query is a set of key words which summarize the description of the information need. Prof. Dr. Knut Hinkelmann 5

Basic Concepts: Retrieval vs. Browsing Retrieval Browsing Database Retrieval Information need can be specified A query can be formulated which contrains the information that might be of interest Browsing Main objective not clearly defined or very broad Purpose might change during interaction with the system Example: User is interested in car racing in general. When finding information about formula 1, he might turn his interest into a specific driver or car manufacturer Prof. Dr. Knut Hinkelmann 6 Retrieval vs. Browsing censored Prof. Dr. Knut Hinkelmann 7

Filtering relevant information Retrieval and browsing are pulling actions the user requests the information in an interactive manner Alternatively, information can be pushed towards the user Examples: periodic news service, RSS feed In push scenario, the IR system has to filter information which might be relevant for the user Prof. Dr. Knut Hinkelmann 8 What is the difference between querying a database (data retrieval) and querying a a document repository (information retrieval)? Prof. Dr. Knut Hinkelmann 9

Basic Concepts: Data Retrieval vs. Information Retrieval data typed data (integer, string,...) well-defined structure and semantics query results are exact result: information items directly satisfying information need retrieve all items that satisfy clearly defined conditions a single erroneous object means failure data queries refer to the structure attribute, table etc. syntactic processing based on structure select c.name, c.phone from customer c, order o where c.customerno = o.customerno Prof. Dr. Knut Hinkelmann 10 t t i l t t d t information is often unstructured (text) may be semantically ambiguous query results are approximations result: documents containing information satisfying information need notion of relevance is in the center of IR retrieved objects might be inaccurate, small failures are tolerable information retrieval deals with content systems interprets content of information items find relevant documents in spite of differences in formulation and vocabulary Metadata user services store search description Metadata (Index) resources manuals product data $ price lists correspondence laws/ regulations (An index is the internal representation of metadata for efficient processing) Prof. Dr. Knut Hinkelmann 11

Use of Metadata Search for information resources using suitable criteria organisation and effective access to electronic resources (e.g. document management systems, digital libraries) unique identification of resources (storage) location, e.g. URL unique ID distinction of different resources data exchange between systems with different data structures and interfaces Prof. Dr. Knut Hinkelmann 12 Meta-data Example The meta-data of a map containing the name of the resource, the creator, die resolution, the copy right and a description. Prof. Dr. Knut Hinkelmann 13

Structured Metadata Examples user data (document) metadata name: ELENA-Ber creation: 18.3.2001 modification: 25.6.2001 format: Word document type: project report recipient: All All Life Life Insurance Inc. Inc. author: Smith Examples for Meta-data: library catalogue with description of books: author, title, publication date, publisher, key words, location document management systems distinguish between user data (resources, documents) and meta-data skill databases / yellow pages contain descriptions of people Prof. Dr. Knut Hinkelmann 14 General vs. application-specific metadata General metadata can be used for any kind of information Examples: author, date of creation, key words Application-specific metadata specific attributes examples: for a photograph: resolution for a piece of music: the style or the album specific attribute values examples: predefined key words, project names Prof. Dr. Knut Hinkelmann 15

Types of Metadata descriptive structural administrative provide information about the content and the objective of the resource (e.g. using key words) describe the structure/composition of the resource administrative information to deal with the resources (e.g. date of creation, access rights) Prof. Dr. Knut Hinkelmann 16 Kinds of Meta-data and Meta-Knowledge Textindex full-text: words contained in text documents key words: manually assigned to documents structured meta-data attributes and their values classification systems / taxonomies (hierarchy of) categories thesaurus terms and their relations semantic nets / ontologies concepts and relations meta-data meta-: organisation Prof. Dr. Knut Hinkelmann 17

Information Retrieval Methods intensity processing structured search classification degree of automation associative search text search (full-text, key words) source: Prof. Dr. Knut Hinkelmann 18