Katrin Weller & Isabella Peters

Similar documents
Power Tags as Tools for Social Knowledge Organization Systems

KOSO: A Reference-Ontology for Reuse of Existing Knowledge Organization Systems

0.1 Knowledge Organization Systems for Semantic Web

Generalizable Semantic Relations for Knowledge Representation

Chapter 27 Introduction to Information Retrieval and Web Search

INFORMATION RETRIEVAL SYSTEM: CONCEPT AND SCOPE

Against folksonomies: indexing blogs and podcasts for corporate knowledge management

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper.

TEXT PREPROCESSING FOR TEXT MINING USING SIDE INFORMATION

Ontology-based Architecture Documentation Approach

Information Retrieval

Taxonomy Tools: Collaboration, Creation & Integration. Dow Jones & Company

Semantic Technologies for Nuclear Knowledge Modelling and Applications

Kristina Lerman University of Southern California. This lecture is partly based on slides prepared by Anon Plangprasopchok

Natural Language Processing with PoolParty

Indexing and subject organisation

EFFICIENT INTEGRATION OF SEMANTIC TECHNOLOGIES FOR PROFESSIONAL IMAGE ANNOTATION AND SEARCH

Controlled Vocabularies & Folksonomies. LIS 390 IOE: Information Organization in Everyday Life Week 10-2 Instructors: Lee & Jones

The Semantic Web DEFINITIONS & APPLICATIONS

Copyright 2012 Taxonomy Strategies. All rights reserved. Semantic Metadata. A Tale of Two Types of Vocabularies

Metadata Standards and Applications. 6. Vocabularies: Attributes and Values

CS473: Course Review CS-473. Luo Si Department of Computer Science Purdue University

A Comparison between Users free-text queries and RILM index terms. Shuheng Wu Queens College, CUNY Yun F. Henshaw RILM

Business Benefits of Developing Effective Taxonomies. Cathrin Senn and Ian Davis Taxonomy Consultants

Proceedings. of the ISSI 2011 conference. 13th International Conference of the International Society for Scientometrics & Informetrics

Creating and Maintaining Vocabularies

Semantics to the Bookmarks: A Review of Social Semantic Bookmarking Systems

Terminology Management

CS 6320 Natural Language Processing

Ermergent Semantics in BibSonomy

Shrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India

SKOS. COMP62342 Sean Bechhofer

Ontologies SKOS. COMP62342 Sean Bechhofer

Collaborative Tagging: Providing User Created Organizational Structure for Web 2.0

Utilizing, creating and publishing Linked Open Data with the Thesaurus Management Tool PoolParty

Springer Science+ Business, LLC

A Design Rationale Representation for Model-Based Designs in Software Engineering

The UNESCO Thesaurus

Information Retrieval. Chap 7. Text Operations

Copyright 2012 Taxonomy Strategies. All rights reserved. Semantic Metadata. A Tale of Two Types of Vocabularies

Welcome to the Quantitative Trait Loci (QTL) Tutorial

CHAPTER 5 SEARCH ENGINE USING SEMANTIC CONCEPTS

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A.

Enterprise Multimedia Integration and Search

Ontology Summit2007 Survey Response Analysis. Ken Baclawski Northeastern University

Social Tagging. Kristina Lerman. USC Information Sciences Institute Thanks to Anon Plangprasopchok for providing material for this lecture.

The AGROVOC Concept Scheme - A Walkthrough

Semantic Web Systems Ontologies Jacques Fleuriot School of Informatics

Semantic Web and Web2.0. Dr Nicholas Gibbins

Taxonomies and controlled vocabularies best practices for metadata

Taxonomy for Self-service Delivery

Tags, Categories and Keywords

Collaborative Tagging: A New Way of Defining Keywords to Access Web Resources

TagFS Tag Semantics for Hierarchical File Systems

Introduction to Information Retrieval

Social Search Networks of People and Search Engines. CS6200 Information Retrieval

Information Retrieval. (M&S Ch 15)

ISKO UK. Content Architecture. Semantic interoperability in an international comprehensive knowledge organisation system.

Fusing Corporate Thesaurus Management with Linked Data using PoolParty

Knowledge organization on the Web ISKO-IWA meeting

> Semantic Web Use Cases and Case Studies

Enhanced retrieval using semantic technologies:

The Tagging Tangle: Creating a librarian s guide to tagging. Gillian Hanlon, Information Officer Scottish Library & Information Council

Semantic Technologies in a Chemical Context

slide courtesy of D. Yarowsky Splitting Words a.k.a. Word Sense Disambiguation Intro to NLP - J. Eisner 1

ConTag: A Semantic Tag Recommendation System

Taxonomy Tool. There are quite a few products on the market called taxonomy. Heather Hedden

The role of vocabularies for estimating carbon footprint for food recipies using Linked Open Data

Basics of SEO Published on: 20 September 2017

Enriching knowledge graphs with text processing techniques

University of Eastern Finland Library Heikki Laitinen UEF // University of Eastern Finland

Understanding the user: Personomy translation for tag recommendation

Taxonomies in support of search

Terminology Management Platform (TMP)

Caravela: Semantic Content Management with Automatic Information Integration and Categorization

Agenda. Web 2.0: User Generated Metadata. Why metadata? Introduction. Why metadata? Example for Annotations

Enhancing information services using machine to machine terminology services

A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2

<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany

Taxonomy Governance Checklist

Contrasting Knowledge Organization Systems for the description of research products: the case of overlapping in the agricultural domain

Exploratory Analysis: Clustering

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal

Query Refinement and Search Result Presentation

@toread and Cool: Subjective, Affective and Associative Factors in Tagging

CHAPTER 2 OVERVIEW OF TAG RECOMMENDATION

IBE101: Introduction to Information Architecture. Hans Fredrik Nordhaug 2008

Ontology-based Navigation of Bibliographic Metadata: Example from the Food, Nutrition and Agriculture Journal

WEB 2.0 AND FOLKSONOMY

4) DAVE CLARKE. OASIS: Constructing knowledgebases around high resolution images using ontologies and Linked Data

MSc Advanced Computer Science School of Computer Science The University of Manchester

State of the Art and Trends in Search Engine Technology. Gerhard Weikum

Mining Web Data. Lijun Zhang

Information Retrieval and Web Search

Information Retrieval and Knowledge Organisation

Semantic Image Retrieval Based on Ontology and SPARQL Query

Query Expansion using Wikipedia and DBpedia

The Data Organization

ELEC6910Q Analytics and Systems for Social Media and Big Data Applications Lecture 4. Prof. James She

THE GETTY VOCABULARIES TECHNICAL UPDATE

Transcription:

Tag Gardening Activities for Folksonomy Maintenance and Enrichment Katrin Weller & Isabella Peters Heinrich-Heine-University Düsseldorf Institute for Language and Information Dept. of Information Science Presentation at I-Semantics, Triple-I Conference 2008. Graz, 04. September 2008

Motivation How to to combine the the dynamics of of freely chosen tags with the the steadiness and and complexity of of controlled vocabularies?

Two Sides of the Same Coin Aim 1: Optimizing folksonomies for specific application scenarios. Aim 2: Maintaining and enriching knowledge organization systems (KOS, e.g. ontologies or thesauri) with folksonomy terms. Tag Gardening!

The Two Sides of Tag Gardening

Some Problems of Unstructured Folksonomies Spelling variants, translations, abbreviations and synonyms have to be considered for query formulation and indexing. Tags serve a variety of functions, not just content description (e.g. toread, @Henry ). Tags enable browsing by popularity but not navigation by meanings and semantic interrelations. Even on a personal level unstructured tags may become unmanageable.

Source: http://flickr.com/photos/jup3nep/2473905053/

Some problems of unstructured folksonomies Tag Tag cloud of of the the user MarmaladeToday s for for 1.157 bookmarks. Source: del.icio.us (5.03.2008).

Some approaches: Adding structure to folksonomies Del.icio.us Bibsonomy.org

Some solutions: Adding structure to folksonomies Flickr.com

The Folksonomy Tag Garden

Tag Gardening Term coined by James Governor: Like plants or animals, tags evolve in an emergent fashion, open to hybridisation. Stewardship can help grow and put roots down. Helping the darwinian process is tag gardening. We define tag gardening as any activity to edit, reeingineer, manipulate or organize tags in order to make them more productive and effective. Tag gardening is performed on top of existing folksonomies. Users may tag as usual, afterwards gardenig activities may be performed for optimization and user support.

Levels of Tag Gardening Document collection vs. single document level: Simple form: editing the tags of one single document. Complex task: handling all tags in a folksonomy system. Personal vs. collaborative level: Personomy level: single users edit the personal tags they use within a system. Folksonomy level: enabling a user community to collaboratively maintain all tags in use. Intra- and cross-platform level: Usual case: consider only tags within one platform. Broader view: for some cases the use of consistent tags across different platforms will be useful.

Tag Gardening Activities: 1. Weeding Removing bad tags : spelling variants (plural vs. singular, conflation of multi-word tags) and spam through pesticides. Aim: enhancing recall and a consistent indexing vocabulary. Achieved by type-ahead functionality during indexing, editing functionalities for tags after the application (remove, change, etc.), Natural Language Processing of index tags and search tags, indexing and retrieval tutorials or guidelines for users, authorized users as gardeners Simplest form of tag gardening

Tag Gardening Activities: 2. Seeding Extending the folksonomy with rarely used seedlings if high-frequency tags do not sufficiently discriminate resources. Aim: enhancing precision and expressiveness of the folksonomy. Achieved by displaying an inverse tag cloud during indexing or particular green house areas where the seedlings may develop and grow, discrete tag suggestions during indexing.

Tag Green Houses Problem: high-frequent tags provide high recall but low precision and low degrees of discrimination. Tag Tag Design Design on on Del.icio.us. Del.icio.us. Seedlings or Baby tags as additional entry points to explore document collections. Promoting baby tags via alternative display options like New tags / trends infrequent tags / inverse tag cloud

Tag Gardening Activities: 3. Landscape Architecture Shaping the folksonomy into flower beds, distinguishing similar looking plants, identifying their species, branding each species with labels and giving additional information regarding their application area. Aim: enhancing precision and expressiveness of the folksonomy by adding semantics. For query expansion along semantic relations, for enhanced navigation, as basis for semantic-oriented display. Achieved by conflation of multi-language tags, summarization of synonyms, distinction of homonyms, establishment of semantic relations, field-based tagging

Synonym interrelation and distinction of homonyms.

Hidden Semantic Relations in Folksonomies Source for image: http:// www.flickr.com

Tag Gardening Activities: 4. Fertilizing Combination of folksonomies and KOS during indexing and retrieval. Aim: enhancing precision and recall and the expressiveness of the folksonomies by adding semantics, for query expansion during retrieval via semantic relations, for enhanced indexing functionalities, for enhanced navigation within the folksonomy, as basis for semantic-oriented displays. Achieved by semantic-oriented tag suggestions during indexing and retrieval ( tag suggestions not based on tag popularity to avoid success breeds success-effect ) establishment of semantic relations by mappings to KOS (after indexing)

Tag Gardening Activities: Fertilizing Type 1 and 2 Fertilizing Type 1 Fertilizing a folksonomy with semantic structures from other knowledge organization systems. Fertilizing Type 2 Fertilizing a structured knowledge organization system (ontology, thesaurus) with user-generated terminologies.

Tag Gardening in Practice Possibilities Editing and deleting functionalities for tags on personal and document level. Detecting and labeling semantic relations. Use of power tags as candidates on document collection level. Co-occurence computations as starting points. Tools for personal tag gardening on personomy level. Collaborative tag gardening in (small) communities. Power users as chief gardeners in communities.

Gardening support: Power tags and cooccurences as candidates and starting points Steps: Detecting power tags on the resource level Computing co-occurences of power tags with the whole tag collection Result: distribution and power tags on cooccurence level Power relations as candidates for new folksonomy structuring approaches Data retrieved 2008-05-15 from del.icio.us

Personal Tag Gardening Our approach: TagCare (www.tagcare.org, currently under development) take care of your tags TagCare imports personal tags from different tagging platforms and allows to manage them centrally for better personal overview. Personal structured tag vocabulary for: Personal infomation management (PIM) Consistent tagging across platforms (functionalities under development). Search in folksonomy-based systems (e.g. use precooked personal synonym lists).

Personal Tag Gardening Useof TagCare firstversion Adding, deleting and editing of tags Hierarchical structuring of tags Interrelating of synonyms Labeling of otherwise related tags Statistics on tag frequencies

Tag Gardening Community Solution for small, closed communties and development of shared vocabularies: Community tagging plus single information architects. Information Information architect: architect: builds buildsa structured structuredthesaurus thesaurus and and enhances enhancesititwith withtags. tags. establishes establishesstructures structuresbetween loose loosetags (emergent (emergentsemantics). Community: Community: applies appliestags and and performs performsminor minorediting activities. activities.

Collaborative Tag Gardening Challenges and open questions Who may perform editing actions? Who is an authorized user? Which tags are spam? How can users collaborate, which aspects are determined collectively? Can tagging guidelines be applied?

Challenges and open questions The three elements of folksonomies (resources people tags) constitute a three-dimensional knowledge space: domain community expressiveness. Conflicts are caused, if all three elements should be considered to large extents simultaneously.

Conclusions & Outlook Folksonomies may be enriched with semantics to achieve a combination of vocabulary dynamics and structure. Aim: improved information retrieval on personomy and collection level. Manual and semi-automatic approaches may be combined. Tag gardening may first be applied to small communities or small application domains. Solutions for bigger communities are needed, chief gardeners as first approach. Future work: development of TagCare, analysis of types of semantic relations for folksonomy enrichment, tagging for support of ontology or thesaurus development.

Thank You Best regards from Düsseldorf! Contact: katrin.weller@uni-duesseldorf.de isabella.peters@uni-duesseldorf.de http://www.phil-fak.uni-duesseldorf.de/infowiss/