Fifth International Conference INFORMATION RESEARCH AND APPLICATIONS

Similar documents
MULTIMEDIA RETRIEVAL

ICDEM 2010, July 2010

Digital Presentation and Preservation of Cultural and Scientific Heritage International Conference

Features for Art Painting Classification Based on Vector Quantization of MPEG-7 Descriptors

General Image Database Model

INDIRECT SPATIAL DATA EXTRACTION FROM WEB DOCUMENTS

A Digital Library Framework for Reusing e-learning Video Documents

MPEG-7. Multimedia Content Description Standard

DIGITAL LIBRARY AND SEARCH ENGINE OF BULGARIAN FOLKLORE SONGS

MAIN DIFFERENCES BETWEEN MAP/REDUCE AND COLLECT/REPORT PARADIGMS. Krassimira Ivanova

PERFORMANCE EVALUATION OF ONTOLOGY AND FUZZYBASE CBIR

Krassimira Ivanova, Milena Dobreva, Peter Stanchev, Koen Vanhoof

CHAPTER 8 Multimedia Information Retrieval

Offering Access to Personalized Interactive Video

A Content Based Image Retrieval System Based on Color Features

Block Diagram. Physical World. Image Acquisition. Enhancement and Restoration. Segmentation. Feature Selection/Extraction.

Bulgarian Folk Songs in a Digital Library

Search Engines. Information Retrieval in Practice

Automatic Image Annotation by Classification Using Mpeg-7 Features

SOFIA: An Image Database with Image Retrieval by Image Content

An Efficient Approach for Color Pattern Matching Using Image Mining

Efficient Indexing and Searching Framework for Unstructured Data

WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS

MATRIX BASED SEQUENTIAL INDEXING TECHNIQUE FOR VIDEO DATA MINING

Efficient Content Based Image Retrieval System with Metadata Processing

Building OWL Ontology of Unique Bulgarian Bells Using Protégé Platform

MATRIX BASED INDEXING TECHNIQUE FOR VIDEO DATA

Semantic Video Indexing

D.9.1 Web Portal Creation and Launch

Human Brain Image Database System (HBIDS) for Epilepsy

FINAL REPORT. Appendix D15. Final Report

New Trends in Information Technologies

Archives in a Networked Information Society: The Problem of Sustainability in the Digital Information Environment

Computing Similarity between Cultural Heritage Items using Multimodal Features

A Review: Content Base Image Mining Technique for Image Retrieval Using Hybrid Clustering

The ToCAI Description Scheme for Indexing and Retrieval of Multimedia Documents 1

Tumor Detection and classification of Medical MRI UsingAdvance ROIPropANN Algorithm

Presentation of the Electronic Letters on Computer Vision and Image Analysis (ELCVIA)

Web Security Vulnerabilities: Challenges and Solutions

Introduzione alle Biblioteche Digitali Audio/Video

Image Processing (IP)

Delivery Context in MPEG-21

HOURS 7:30 AM - 4:30 PM

Image Mining Using Image Feature

Wavelet-based Texture Classification of Tissues in Computed Tomography

DIGITAL LIBRARY FOR BULGARIAN TRADITIONAL CULTURE AND FOLKLORE

An Intelligent System for Archiving and Retrieval of Audiovisual Material Based on the MPEG-7 Description Schemes

Social Behavior Prediction Through Reality Mining

USING METADATA TO PROVIDE SCALABLE BROADCAST AND INTERNET CONTENT AND SERVICES

Image Classification Using Wavelet Coefficients in Low-pass Bands

C H A P T E R Introduction

A Framework for Source Code metrics

Very Fast Image Retrieval

AN EFFICIENT BATIK IMAGE RETRIEVAL SYSTEM BASED ON COLOR AND TEXTURE FEATURES

Rough Feature Selection for CBIR. Outline

A Developer s Guide to the Semantic Web

Adaptive Medical Information Delivery Combining User, Task and Situation Models

Fast Image Matching on Web Pages

Crossing the Archival Borders

Visualization Stages, Sensory vs. Arbitrary symbols, Data Characteristics, Visualization Goals. Trajectory Reminder

Semantic Search in Heterogeneous Digital Repositories: Case Studies

GNDraw Software Application for Creating Generalized Nets

Museum Collections and the Semantic Web

Constructing Object Oriented Class for extracting and using data from data cube

VIDEO SEARCHING AND BROWSING USING VIEWFINDER

Finding Topic-centric Identified Experts based on Full Text Analysis

A Survey On Different Text Clustering Techniques For Patent Analysis

Information System on Literature in the Field of ICT for Environmental Sustainability

Contextion: A Framework for Developing Context-Aware Mobile Applications

CONTEXT-SENSITIVE VISUAL RESOURCE BROWSER

Using MILOS to build a Multimedia Digital Library Application: The PhotoBook experience

Abstract and Index and Web Discovery Services IEEE Partners

MIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion

An Information Storage System for Large-Scale Knowledge Resources

Many of these have already been announced via JIT weekly updates on the KB-L listserv, but are gathered here for a more complete look.

SHORTEST PATH ALGORITHM FOR QUERY PROCESSING IN PEER TO PEER NETWORKS

3D Surface Reconstruction of the Brain based on Level Set Method

MPEG-7 Audio: Tools for Semantic Audio Description and Processing

An Infrastructure for MultiMedia Metadata Management

xiii A. Hayden Lindsey IBM Distinguished Engineer and Director, Studio Tools Foreword

AUGMENTED REALITY AS A METHOD FOR EXPANDED PRESENTATION OF OBJECTS OF DIGITIZED HERITAGE. Alexander Kolev, Dimo Dimov

The Semantic Web Explained

Spray and Dynamic: Advanced Routing in Delay Tolerant Networks

An Introduction to Content Based Image Retrieval

Freedom of Access in Maine

ONTOLOGY BASED KNOWLEDGE EXTRACTION FROM FRUIT IMAGES

APPLYING THE POWER OF AI TO YOUR VIDEO PRODUCTION STORAGE

An Efficient Semantic Image Retrieval based on Color and Texture Features and Data Mining Techniques

Digital Archives: Extending the 5S model through NESTOR

Writing a Research Paper

A New Algorithm for Shape Detection

Application of PHP and MySQL for search and retrieval Web services in Web information systems

Studying conceptual models for publishing library data to the Semantic Web

Practical experiences towards generic resource navigation and visualization

Britannica Library is an online general reference resource for all ages and abilities.

New wavelet based ART network for texture classification

An Analysis of Image Retrieval Behavior for Metadata Type and Google Image Database

A Study on Metadata Extraction, Retrieval and 3D Visualization Technologies for Multimedia Data and Its Application to e-learning

Semantically Enhanced Hypermedia: A First Step

A Novel Image Classification Model Based on Contourlet Transform and Dynamic Fuzzy Graph Cuts

Transcription:

Fifth International Conference INFORMATION RESEARCH AND APPLICATIONS 26-30 June 2007, Varna P R O C E E D I N G S Volume 1 ITHEA SOFIA, 2007

Kr. Markov, Kr. Ivanova (Ed.) Proceedings of the Fifth International Conference Information Research and Applications i.tech 2007, Varna, Bulgaria Volume 1 Sofia, Institute of Information Theories and Applications FOI ITHEA 2007 First edition Printed in Bulgaria by Institute of Information Theories and Applications FOI ITHEA Sofia -1090, P.O. Box 775 e-mail: info@foibg.com www.foibg.com All rights reserved. 2007 Krassimir Markov, Krassimira Ivanova - Editors 2007 Institute of Information Theories and Applications FOI ITHEA - Publisher 2007 For all authors in the issue ISSN 1313-1109 (paperback) ISSN 1313-115X (CD) ISSN 1313-1222 (online) FSBN Volume 1: 978-954-16-2001-4 (paperback) 978-954-16-2002-1 (CD) 978-954-16-2003-8 (Online) FSBN Volume 2: 978-954-16-2004-5 (paperback) 978-954-16-2005-2 (CD) 978-954-16-2006-9 (Online)

Fifth International Conference I.TECH 2007 Multimedia Semantics and Cultural Heritage MULTIMEDIA RETRIEVAL - STATE OF THE ART Peter L. Stanchev (Keynote Speech) In this keynote speech we discuss the history, the state of art, and the future of multimedia retrieval [6]. Wide access to large information collections is of great importance in many aspects of everyday life. For this reason, significant effort has been spent in studying and developing techniques that support effective and efficient retrieval of multimedia data. Every day we are overwhelmed by information of many types: TV channels, news feeds, web sites, to name a few. Without an efficient and effective retrieval and filtering support, much time and effort is required in finding the information that we really need in this highly dynamic information age. The process of image description consists of extracting the global image characteristics, recognizing the imageobjects, and assigning semantics to these objects. The image data can be treated as physical image representation and its meaning as a logical image representation. The logical representation includes methods for describing the image and image-objects characteristics and the relationships among the image objects. Several visual descriptors exist for representing the physical content of an image, such as the MPEG-7 standard [5]. Historically, semantic retrieval was frequently based on computer vision. To reduce the semantic gap, the low-level content-based media features are frequently being converted to high-level concepts or terms. The MPEG-7 descriptors can be classified as general visual descriptors and domain specific descriptors. The former include color, texture, shape, and motion features. The latter includes face recognition descriptors. Color is one of the most widely used image and video retrieval feature. The MPEG-7 standard includes five color descriptors, which represent different aspects of the color and include color distribution, spatial layout, and spatial structure of the color. The image texture is one of the most important image characteristic in both human and computer image analysis and object recognition. Visual texture is a property of a region in an image. There are two texture descriptors in MPEG-7: a homogeneous texture descriptor and edge histogram descriptor. Both of these descriptors support search and retrieval based on content descriptions. MPEG-7 supports region and contour shape descriptors. Object shape features are very powerful when used in similarity retrieval. Although distance functions are not part of the standard, we will present the most used distance functions. A technique for improving the similarity search process of images in a Multimedia Content Management System is analyzed. The content based retrieval process integrates the search on different multimedia components, which are linked in XML structures. Depending on the specific characteristics of an image data set, some features can be more effective than others when performing similarity searches [2]. Based on this observation we propose a technique that predicts the effectiveness of the MPEG-7 image features that depends on a statistical analysis of

12 Multimedia Semantics and Cultural Heritage the specific data sets in the Multimedia Content Management System. This technique is validated through extensive experiments with human subjects. We illustrate several aspects of the fine art databases [4]. We showed that MPEG-7 descriptors can be used, but they give different results than applying on other type of images. The use of the Color Structure descriptor only produces sufficiently efficient results in the query search. The new generation Semantic Web languages, such as RDF(S) and OWL will play a major role. The integration of semantic understanding of pictures with personalized delivery raises new questions. The query language for this type of system is not yet scandalized but we hope that an emerging standard will come soon. We discuss the problems witch arise working with Magnetic Resonance (MR) images. As an illustration of medical image processing tools we discuss MR brain segmentation problems. Functional analysis of different medical systems is made. We emphasize on the fact that working with medical images is different from working with other kind of images. As an illustration two systems are presented [3]. The first system is MEDIMAGE, which is a multimedia database for Alzheimer s disease patients. It contains MR images, text and voice data and it is used to find some correlations of brain atrophy in Alzheimer s patients with different demographic factors. The second system is Epilepsy system, which includes image data from MRI and SPECT, scans and EEG analysis results and it is used for patients with epilepsy. We present a novel approach for efficient video stream filtering that is based on the use of the MPEG-7 descriptors and exploits the properties of metric spaces in order to reduce the computational load of the filtering receiver [1]. Among other types of information, Audio/Video can be considered today as a primary means of communication, due to its richness in informative content and to its appeal. This implies that the development of techniques supporting the retrieval of Audio/Video documents is of primary importance to enable the access for the general public as well as for professional users of significant asset of today s life. This process will gain more impetus from the adoption of standards to represent video content. The retrieval process is based on a simple schema: users specify their request needs (e.g. a set of keywords or a sample image) that are translated into a system query. The items in the archive are compared with the user s query, in order to determine if they are relevant for the user s request. To process this type of query, it is necessary to determine a set of properties of the objects stored in the archive (usually called features) and a similarity measure to compare queries and archive objects. Video features can be described by using the MPEG-7 standard. In case the similarity measure is metric, many possible approaches to create indexes can be adopted. These indexes allow improving the efficiency of the retrieval process, by comparing the query only with a limited number of objects in the archive. Our approach goes toward a solution of this problem, by proposing a novel approach to Audio/Video filtering that makes use of simple additional information sent together with the video. This allows avoiding the comparison of the filter with video features for many non-matching videos or video components that will not pass the filter in any case. The proposed approach requires that the measure of the similarity between the filter and the video representative is metric, and it is based on the use of the well known technique of pivots. The following systems are discussed: Tamino, TV-Anytime, MILOS, PicToSeek, Marvel, imgseek, IKONA, SIMPLIcity, ALIP, SIMBA, Viper, LCPD, Video Google, Cortina, Octagon, PicSOM, LIRE. Semantic retrieval in TREC is also presented. The specific challenges in the field are highlighted. Conclusion remarks about the future of the multimedia retrieval are drowning.

Fifth International Conference I.TECH 2007 13 Bibliography [1] Falchi F., Gennaro C., Savino P., Stanchev P., Efficient Video Stream Filtering, accepted for ACM Multimedia, 2007 [2] Stanchev P., Amato G., Falchi F., Gennaro C., Rabitti F., Savino P., Selection of MPEG-7 Image Features for Improving Image Similarity Search on Specific Data Sets, 7-th IASTED International Conference on Computer Graphics and Imaging, CGIM 2004, Kauai, Hawaii, 2004, 395-400. [3] Stanchev P., Fotouhi F., Siadat M-R., and Soltanian-Zadeh H., Multimedia Mining- A High Way to Intelligent Multimedia Documents, book chapter 7- Medical Multimedia and Multimodality Databases, D. Djeraba (Eds) - Kluwer Accademic Publishers, 2002, 139-160. [4] Stanchev P., Green Jr D.., Dimitrov B., Semantic Notation and Retrieval in Art and Architecture Image Collections, Journal of Digital Information Management, Vol. 3, No. 4, December 2005, 218-221 [5] Stanchev P., Introduction to MPEG 7: Multimedia Content Description Interface, International Conference on "Visualization, Imaging and Image Processing (VIIP 2004), September 6-8, 2004, Marbella, Spain [6] Stanchev P., Semantic Video and Image Retrieval - State of the Art and Challenges, ACMSE 2007, The 45th ACM Southeast Conference, Winston-Salem, North Carolina, USA, March 24, 2007, 512-513 Author's Information Peter Stanchev is a Professor of Computer science at Kettering University, Flint, Michigan, USA and Professor and Department Chair at the Institute of Mathematics and Informatics, Bulgarian Academy of Sciences, Sofia, Bulgaria. He is also affiliated with the Institute of Information Science and Technologies, Italian National Research Council, Pisa, Italy. He has published two books, more than one hundred and fifty papers in monographs, journal, and peer-reviewed conferences that have more than 250 citations. His research interests are in the field of Image Processing, Multimedia Systems, Database Systems, and Expert Systems. He lectures courses in the areas of: Database Systems, Information Retrieval, Data Mining, Human-computer interaction, Computing and Algorithms, Web technology, Computer Architecture, Design of Information Systems, Computer Operating Systems, Image Databases, Fuzzy sets, Decision Support Systems, Multimedia Databases, Expert Systems, Image Processing, Computer Graphics and Game design. He is a Co-Chair of the annual multimedia semantics workshops. Peter L. Stanchev e-mail: pstanche@kettering.edu; www.kettering.edu/~pstanche Kettering University; Flint, MI, USA 48504; Institute of Mathematics and Informatics, Sofia-1113, acad. G.Bontchev, bl.8, Bulgaria; CREATION OF A DIGITAL CORPUS OF BULGARIAN DIALECTS Nikola Ikonomov, Milena Dobreva Abstract: The paper presents our considerations related to the creation of a digital corpus of Bulgarian dialects. The dialectological archive of Bulgarian language consists of more than 250 audio tapes. All tapes were recorded between 1955 and 1965 in the course of regular dialectological expeditions throughout the country. The records typically contain interviews with inhabitants of small villages in Bulgaria. The topics covered are usually related to such issues as birth, everyday life, marriage, family relationship, death, etc. Only a few tapes contain folk songs from different regions of the country. Taking into account the progressive deterioration of the magnetic media and the realistic prospects of data loss, the Institute for Bulgarian Language at the Academy of Sciences launched in 1997 a project aiming at restoration