Trans-Media Pseudo-Relevance Feedback Methods in Multimedia Retrieval

Size: px
Start display at page:

Download "Trans-Media Pseudo-Relevance Feedback Methods in Multimedia Retrieval"

Transcription

1 Trans-Media Pseudo-Relevance Feedback Methods in Multimedia Retrieval Stephane Clinchant, Jean-Michel Renders, and Gabriela Csurka Xerox Research Centre Europe, 6 ch. de Maupertuis, Meylan, France FirstName.LastName@xrce.xerox.com Abstract. We present here some transmedia similarity measures that we recently designed by adopting some intermediate level fusion approaches. The main idea is to use some principles coming from pseudorelevance feedback and, more specifically, transmedia pseudo-relevance feedback for enriching the mono-media representation of an object with features coming from the other media. One issue that arises when adopting such a strategy is to determine how to compute the mono-media similarity between an aggregate of objects coming from a first (pseudo-)feedback step and one single multimodal object. We propose two alternative ways of adressing this issue, that result in what we called the transmedia document reranking and complementary feedback methods respectively. For the ImageCLEF - Photo Retrieval Task, it appears that monomedia retrieval performance is more or less equivalent for pure image and pure text content (around 20% MAP). Using our transmedia pseudofeedback-based similarity measures allowed us to dramatically increase the performance by 50% (relative). From a cross-lingual perspective, the use of domain-specific, corpus-adapted probabilistic dictionaries seems to offer better results than the use of a broader, more general standard dictionary. With respect to the monolingual baselines, multilingual runs show a slight degradation of retrieval performance ( 6 to 10% relative). Keywords: hybrid retrieval, trans-media relevance feedback. 1 Introduction and Related Works Up to now, many standard methods to tackle the problem of defining efficient trans-modal similarity measures and of solving the associated semantic gap use late fusion strategies. Basically, they rely on mono-modal analysis of multifacet objects, computing mono-modal similarities independently and then merging these mono-modal similarities by some simple aggregation operator. In contrast to these strategies, we propose here two intermediate level fusion approaches. Both approaches, similarly to [1,2,3], are based on Transmedia Relevance Pseudo-Feedback, i.e. they are mixed-modality extentions of Relevance Models in which the modality of data is switched during the (pseudo-) feedback process, from image to text or text to image. C. Peters et al. (Eds.): CLEF 2007, LNCS 5152, pp , c Springer-Verlag Berlin Heidelberg 2008

2 570 S. Clinchant, J.-M. Renders, and G. Csurka Our first approach, called Complementary Feedback (section 3.1), is similar to the approach suggested by [3]. However, while [3] uses classical text-based feedback algorithms (like Rocchio), we use a pseudo-feedback method issued from the language modelling approach to information retrieval, namely the mixture model method from Zhai and Lafferty [4] originally designed to enrich textual queries. In our second approach, called Transmedia Document Reranking Approach (section 3.2), we do not really extract a new query, nor enrich an existing one. This second approach uses the similarity computed in the other mode as a component of feedback, in order to rerank documents. So, this is a one step retrieval, contrarily to the first one (and the related works). This method is quite general since it can be applied to any textual/visual similarities or, equivalently, with any mono-modal (textual / visual) retrieval engine. This is not the case for the other methods: [1], for instance, is based on a specific similarity model both for texts and images. Moreover, as the alternative methods require a second retrieval step, the use of a particular choice of text feedback method depends implicitly on the underlying text retrieval model. Our method is free from such dependencies, since it works on similarities as basic components. Even if both approaches appear to be rather simpler than most alternative state-of-the-art approaches, they turned out to give superior results in the ImageClef PhotoRetrieval Track ([5]). 2 Monomedia Similarities 2.1 Cross-Entropy between Texts Starting from a traditional bag-of-word representation of pre-processed texts (here, preprocessing includes tokenization, lemmatization, word decompounding and standard stopword removal), we adopt the language modeling approach to information retrieval and we use the (asymmetric) cross-entropy function as similarity. Particular details of this textual similarity measure are given in [6]. 2.2 FisherVectorsforImages To compute the similarity measure between images I and J, we simply use the the L1-norm of the difference between their Fisher vectors (normalised gradient vector of the corresponding generative model, with unitary L1-norm; see details in [6,7]). 3 Cross-Media Similarities Based on Transmedia Relevance Feedback The main idea is the following: for a given image i, consider as new features the (textual) terms of the texts associated to the most similar images (from a purely visual viewpoint). We will denote this neighbouring set as N img (i). Its size is fixed a priori: this is typically the topn objects returned from a retrieval system

3 Trans-Media Pseudo-Relevance Feedback Methods in Multimedia Retrieval 571 (CBIR) or the N nearest-neighbours using some predefined visual similarity measures. Then, we can compute a new similarity with respect to any multimodal object j of the collection O as the textual similarity of this new representation of the image i with the textual parts of j. There are three families of approaches to compute the mono-media similarity between an aggregate of objects N img (i) and one single multimodal object: 1. aggregating N img (i) to form a single object (typically by concatenation) and then compute standard similarity between two objects; 2. use a method of pseudo feedback algorithm (for instance Rochhio s algorithm) to extract relevant, meaningfull features of an aggregate and finally use a mono-media similarity. 3. aggregating all similarity measures (assuming we can) between all possible couple of objects Methods of families 1 and 2 involve therefore the creation of a new single object (in this case, a text) and a new retrieval step (this time using a text retrieval system). The third family does not. 3.1 Complementary Pseudo-Feedback This approach, as the work presented in [3], belongs to the second family of aggregation strategies mentionned in the previous section but, contrarily to [3], our method uses the Language Modelling framework to realize the pseudo-feedback. Recall that the fundamental problem in transmedia feedback is to define how we compute the mono-media similarity between an aggregate of objects N img (i) (or N txt (i)) and one single multimodal object. The main idea here is to consider the set N img (i) as the relevance concept F and derive its corresponding language model (LM) θ F. Afterwards, we can use the cross-entropy function between θ F and the LM of the textual part of any object j in O as the new transmedia similarity. We adopt the framework given by the mixture model method from Zhai and Lafferty [4] (originally designed to enrich textual queries), to derive the LM associated with F. See [6] for practical details. Once θ F has been estimated, a new query LM can be obtained trough interpolation: θ new query = αθ old query +(1 α)θ F (1) where θ old query corresponds to the LM of the textual part of the query i. In a nearly dual way, starting from the textual part of the query, a similar scheme using N txt (i) can be adopted to derive a new visual representation (actually some generalized Fisher Vectors) of the relevance concept, this time relying on Rocchio s method that is more adapted to continuous feature representation. 3.2 Transmedia Document Reranking Unlike the Complementary Feedback, the Transmedia Document Reranking approach belongs to the third family of aggregation strategies mentionned in 3.

4 572 S. Clinchant, J.-M. Renders, and G. Csurka The main idea is to define a new cross-media similarity measure by aggregating all similarity measures (assuming we can) between all possible couple of objects retrieved by the Transmedia Relevance Feedback. More formally, if we denote by T (u) the text associated to multimodal object u and by ˆT (i) the new textual representation of image i, then the new crossmedia similarity measure w.r.t. the multimodal object j is: sim ImgTxt (i, j) =sim txt ( ˆT (i), T (j)) = sim txt (T (d), T (j)) (2) d N img(i) where sim txt is any textual similarity measure but, in a particular embodiment, we propose to use the cross-entropy function (e.g. the one based on Language Modelling, even if it is assymetric), that appears to be one of the most effective measures in purely textual information retrieval systems. This method can be seen as a reranking method. Suppose that q is some image query; if T (d) is the text of an image belonging to the initial feedback set N img (q), then the rank of the own neighbors of T (d) in the textual sense will be increased, even if they are not so similar from a purely visual viewpoint. In particular, this allows to define a similarity between a purely image query and a simple textual object without visual counterpart. By duality, we can define another cross-media similarity measure: for a given text i, we consider as new features the Fisher vectors of the images associated to the most similar texts (from a purely textual viewpoint) in the multimodal database. We will denote this neighbouring set as N txt (i). If we denote by I(u) the image associated to multimodal object u and by Î(i) the new visual representation of text i, then the new cross-media similarity measure is: sim TxtImg (i, j) =sim img (Î(i), I(j)) = d N txt(i) sim img (I(d), I(j)) (3) Finally, we can combine all the similarities to define a global similarity measure between two multi-modal objects i and j: for instance, using a linear combination, sim glob (i, j) =λ 1 sim txt (T (i), T (j)) +λ 2 sim img (I(i), I(j)) +λ 3 sim ImgTxt (i, j) +λ 4 sim TxtImg (i, j) In one embodiment, we use a simple weighted averages of these similarities, and optimize the weights through the use of a labelled/annotated training set. The main advantage of this method is that, using an aggregation strategy that belongs to family 3, it does not require any further retrieval step. Furthermore, it exploits all trans-modal paths (TXT-TXT, TXT-IMG, IMG-TXT and IMG-IMG) and combines them. Finally, we can pre-compute the monomodal similaritites (textual and visual) between all pairs of objects in the multimedia reference repository, as these computations are independent from the objects in the run-time application; once stored, these values can be re-injected into the translingual similarity equations at run-time, greatly reducing the computation time.

5 Trans-Media Pseudo-Relevance Feedback Methods in Multimedia Retrieval Experimental Results Table 1 shows the name of our ImageCLEF runs and the corresponding mean average precision measures. For a description of the task, the corpus and the queries, refer to [5]. Table 1. Official Runs Run Txt Img CF1 CF2 CF3 TR1 TR2 TR3 MAP Below is a detailed description of all the methods we used for the runs: Txt:This run was a pure text run: documents were basically preprocessed and each document was enriched using Flickr database. For each term of a document, its top 20 related tags from Flickr were added to the document (see details in [6]). Then, a unigram language model for each document is estimated, giving more weight to the original document terms. An additional step of pseudo-relevance feedback using the method explained in [4] is then performed. Img:This run is a pure image run: it uses Fisher Kernel metric to define the image similarity. As a query encompasses 3 visual sub-queries, we have to combine the similarity score with respect to these 3 subqueries. To this aim, the result lists from the image sub-queries are renormalized (by substracting the mean and dividing by the standard deviation) and merged by simple sum. CF1:This run uses both texts and images: it starts from query images only, to determine the relevance set N img (i) for each query i andthenimplements the the complementary (intermedia) feedback described in section 3.1. The size of the neighbouring set is 15. Refering to the notation of section 3.1, the value of α is 0.5. CF2:This runs works with the same principle as the previous run CF1. The main difference is that (target) english documents have been enriched with Flickr and that the initial query in German was translated by multiplying its Language Model by the probabilistic translation matrix extracted from the (small) parallel part of the corpus. Otherwise, it uses the same parameters as previously. CF3:This run uses the same process as in CF1, exceptthatitusesenglish queries to search for German annotations. English queries are translated with the probabilistic translation matrix extracted from the (small) parallel part of the corpus and the translated queries follow the same process as in CF1 but with different parameter : the size of the neighbouring set is 10, while the value of α is 0.7. TR1:This run uses both texts and images: it starts from query images only, to determine N img (i) for each query i (as in the previous run above) and then implements the method Transmedia Reranking method described in section 3.2. The size of the neighbouring set is 5.

6 574 S. Clinchant, J.-M. Renders, and G. Csurka TR2:It is basically the same algorithm as the preceding run TR1, except that the textual part of the data (annotations) is enriched with Flickr tags. TR3:This run uses the TR algorithm as in TR1 but, we merge the result lists from TR1 and from the purely text queries (Txt), by summing the relevance scores after normalisation (by substracting the mean and dividing by the standard deviation for each list). 3.4 Topic-Based Analysis of Results In order to better understand the possible correlations between the different methods and/or the systematic superiority of some of them, Figure 1 compares the Average Precisions for each pair of methods and for each topic. Methods are:text-only (TXT), image-only (IMG), our best Complementary Feedback (CF) and our best Transmedia Reranking (TR) approaches. Fig. 1. Average Precision values per topic, for six pairs of methods A deeper analysis of the individual topics leads to the following conclusions: From a purely visual aspect, search performance is better when the example images of the query are similar between themselves; search results degrade in the opposite case. See examples in Figure 2. The combination between text and image works better if the text query is complementary with respect to the visual information (see for instance left column of Figure 3). The combination does not perform well when either one of the media works very badly, especially the image, which is not suprising as the images were used for transmedia pseudo-relevance feedback (e.g. topics 3 and 32). There were also examples in which multi-media retrieval performance was poor, while individual mono-media retrieval worked not too bad (e.g. right

7 Trans-Media Pseudo-Relevance Feedback Methods in Multimedia Retrieval 575 Fig. 2. Left column: query images from topics for which the retrieval worked best: (a) 22 tennis player during rally, (b) 55 drawings in Peruvian desert and (c) 51 photos of goddaughters from Brasil. Right column: query images from topics for which the retrieval worked worst: (d) 9 tourist accomodation near Lake Titicaca, (e) 10 destinations in Venezuela and (e) 39 people in bad weather. Fig. 3. Left column: Query images from topics with best hybrid combinations: (a) 21 accomodations provided by host families, (b) 44 mountains in mainland Australia and (c) 48 vehicle in South Korea. Right column: query images from topics with worst hybrid combinations : (d) 38 Machu Picchu and Huayna Picchu in bad weather, (e) 11 black and white photos from Russia and (f) 3 religious statue in the foreground. column of 3). The reason might be that the retrieved images were incorrectly reranked based on their textual similarity with the query text. For example, for topic 38, non relevant images of Machu Picchu and Huayna Picchu (because not taken under showing bad weather condition) got better ranking,

8 576 S. Clinchant, J.-M. Renders, and G. Csurka with the effect of decreasing the precision (P20 falling down from 0.7 to 0.21 (for TR) and to 0.3 (for CF). 4 Conclusion With a slightly annotated corpus of images, also characterised by an abstraction level in the textual description that is significantly different from the one used in the queries, it appears that mono-media retrieval performance is more or less equivalent for pure image and pure text content (around 20% MAP). Using our transmedia pseudofeedback-based similarity measures allowed us to dramatically increase the performance by 50% (relative). Trying to model the textual relevance concept present in thetoprankeddocumentsissuedfrom a first (purely visual) retrieval and combining this with the textual part of the original query turns out to be the best strategy, being slightly superior to our transmedia document reranking method. From a cross-lingual perspective, the use of domain-specific, corpus-adapted probabilistic dictionaries seems to offer better results than the use of a broader, more general standard dictionary. With respect to the monolingual baseline, multilingual runs show a slight degradation of retrieval performance ( 6 to 10% relative). Acknowledgments. This work was partly funded by the French Government under the Infomagic project, part of the Pole CAP DIGITAL (IMVN) de Paris, Ile-de-France. The authors also want to thank Florent Perronin for his greatly appreciated help in applying some of the Generic Visual Categorizer components in our experiments. References 1. Lavrenko, V., Manmatha, R., Jeon, J.: A model for learning the semantics of pictures. In: NIPS (2003) 2. Chang, Y.C., Chen, H.H.: Approaches of using a word-image ontology and an annotated image corpus as intermedia for cross-language image retrieval. In: Working Notes of the CLEF Workshop, Alicante, Spain (2006) 3. Maillot, N., Chevallet, J.-P., Valea, V., Lim, J.H.: Ipal inter-media pseudo-relevance feedback approach to imageclef 2006 photo retrieval. In: Working Notes of the CLEF Workshop, Alicante, Spain (2006) 4. Zhai, C., Lafferty, J.D.: Model-based feedback in the language modeling approach to information retrieval. In: CIKM (2001) 5. Grubinger, M., Clough, P., Hanbury, A., Müller, H.: Overview of the ImageCLEFphoto 2007 photographic retrieval task. In: Working Notes of the CLEF Workshop, Budapest, Hungary (2007) 6. Clinchant, S., Renders, J.-M., Csurka, G.: Xrce s participation to ImageCLEF In: Working Notes of the CLEF Workshop, Budapest, Hungary (2007) 7. Perronnin, F., Dance, C.: Fisher kernels on visual vocabularies for image categorization. In: CVPR (2007)

XRCE s Participation to ImageCLEFphoto 2007

XRCE s Participation to ImageCLEFphoto 2007 XRCE s Participation to ImageCLEFphoto 2007 Stephane Clinchant, Jean-Michel Renders and Gabriela Csurka Xerox Research Centre Europe, 6 ch. de Maupertuis, 38240 Meylan, France FirstName.LastName@xrce.xerox.com

More information

Using an Image-Text Parallel Corpus and the Web for Query Expansion in Cross-Language Image Retrieval

Using an Image-Text Parallel Corpus and the Web for Query Expansion in Cross-Language Image Retrieval Using an Image-Text Parallel Corpus and the Web for Query Expansion in Cross-Language Image Retrieval Yih-Chen Chang and Hsin-Hsi Chen * Department of Computer Science and Information Engineering National

More information

Experiment for Using Web Information to do Query and Document Expansion

Experiment for Using Web Information to do Query and Document Expansion Experiment for Using Web Information to do Query and Document Expansion Yih-Chen Chang and Hsin-Hsi Chen * Department of Computer Science and Information Engineering National Taiwan University Taipei,

More information

XRCE s Participation in ImageCLEF 2009

XRCE s Participation in ImageCLEF 2009 XRCE s Participation in ImageCLEF 2009 Julien Ah-Pine 1, Stephane Clinchant 1,2, Gabriela Csurka 1, Yan Liu 1 1 Xerox Research Centre Europe, 6 chemin de Maupertuis 38240, Meylan France firstname.lastname@xrce.xerox.com

More information

IPAL at CLEF 2008: Mixed-Modality based Image Search, Novelty based Re-ranking and Extended Matching

IPAL at CLEF 2008: Mixed-Modality based Image Search, Novelty based Re-ranking and Extended Matching IPAL at CLEF 2008: Mixed-Modality based Image Search, Novelty based Re-ranking and Extended Matching Sheng Gao, Jean-Pierre Chevallet and Joo-Hwee Lim IPAL, Institute for Infocomm Research, A*Star, Singapore

More information

Evaluation of Automatically Assigned MeSH Terms for Retrieval of Medical Images

Evaluation of Automatically Assigned MeSH Terms for Retrieval of Medical Images Evaluation of Automatically Assigned MeSH Terms for Retrieval of Medical Images Miguel E. Ruiz 1 and Aurélie Névéol 2 1 University of North Texas, School of Library and Information Sciences P.O. Box 311068,

More information

Trans Media Relevance Feedback for Image Autoannotation

Trans Media Relevance Feedback for Image Autoannotation MENSINK et al.: TMRF FOR IMAGE AUTOANNOTATION 1 Trans Media Relevance Feedback for Image Autoannotation Thomas Mensink 12 thomas.mensink@xrce.xerox.com Jakob Verbeek 2 jakob.verbeek@inrialpes.fr Gabriela

More information

MIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion

MIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion MIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion Sara Lana-Serrano 1,3, Julio Villena-Román 2,3, José C. González-Cristóbal 1,3 1 Universidad Politécnica de Madrid 2 Universidad

More information

Overview of the ImageCLEFphoto 2008 Photographic Retrieval Task

Overview of the ImageCLEFphoto 2008 Photographic Retrieval Task Overview of the ImageCLEFphoto 2008 Photographic Retrieval Task Thomas Arni 1, Paul Clough 1, Mark Sanderson 1 and Michael Grubinger 2 1 Department of Information Studies, University of Sheffield, UK 2

More information

CACAO PROJECT AT THE 2009 TASK

CACAO PROJECT AT THE 2009 TASK CACAO PROJECT AT THE TEL@CLEF 2009 TASK Alessio Bosca, Luca Dini Celi s.r.l. - 10131 Torino - C. Moncalieri, 21 alessio.bosca, dini@celi.it Abstract This paper presents the participation of the CACAO prototype

More information

Semantic text features from small world graphs

Semantic text features from small world graphs Semantic text features from small world graphs Jurij Leskovec 1 and John Shawe-Taylor 2 1 Carnegie Mellon University, USA. Jozef Stefan Institute, Slovenia. jure@cs.cmu.edu 2 University of Southampton,UK

More information

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 2013 ISSN:

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 2013 ISSN: Semi Automatic Annotation Exploitation Similarity of Pics in i Personal Photo Albums P. Subashree Kasi Thangam 1 and R. Rosy Angel 2 1 Assistant Professor, Department of Computer Science Engineering College,

More information

Wikipedia Retrieval Task ImageCLEF 2011

Wikipedia Retrieval Task ImageCLEF 2011 Wikipedia Retrieval Task ImageCLEF 2011 Theodora Tsikrika University of Applied Sciences Western Switzerland, Switzerland Jana Kludas University of Geneva, Switzerland Adrian Popescu CEA LIST, France Outline

More information

Text-Based Clustering of the ImageCLEFphoto Collection for Augmenting the Retrieved Results

Text-Based Clustering of the ImageCLEFphoto Collection for Augmenting the Retrieved Results Text-Based Clustering of the ImageCLEFphoto Collection for Augmenting the Retrieved Results Osama El Demerdash, Leila Kosseim, and Sabine Bergler CLaC Laboratory - Department of Computer Science & Software

More information

Document Expansion for Text-based Image Retrieval at CLEF 2009

Document Expansion for Text-based Image Retrieval at CLEF 2009 Document Expansion for Text-based Image Retrieval at CLEF 2009 Jinming Min, Peter Wilkins, Johannes Leveling, and Gareth Jones Centre for Next Generation Localisation School of Computing, Dublin City University

More information

The Visual Concept Detection Task in ImageCLEF 2008

The Visual Concept Detection Task in ImageCLEF 2008 The Visual Concept Detection Task in ImageCLEF 2008 Thomas Deselaers 1 and Allan Hanbury 2,3 1 RWTH Aachen University, Computer Science Department, Aachen, Germany deselaers@cs.rwth-aachen.de 2 PRIP, Inst.

More information

Multimodal Medical Image Retrieval based on Latent Topic Modeling

Multimodal Medical Image Retrieval based on Latent Topic Modeling Multimodal Medical Image Retrieval based on Latent Topic Modeling Mandikal Vikram 15it217.vikram@nitk.edu.in Suhas BS 15it110.suhas@nitk.edu.in Aditya Anantharaman 15it201.aditya.a@nitk.edu.in Sowmya Kamath

More information

Metric Learning for Large Scale Image Classification:

Metric Learning for Large Scale Image Classification: Metric Learning for Large Scale Image Classification: Generalizing to New Classes at Near-Zero Cost Thomas Mensink 1,2 Jakob Verbeek 2 Florent Perronnin 1 Gabriela Csurka 1 1 TVPA - Xerox Research Centre

More information

Using Maximum Entropy for Automatic Image Annotation

Using Maximum Entropy for Automatic Image Annotation Using Maximum Entropy for Automatic Image Annotation Jiwoon Jeon and R. Manmatha Center for Intelligent Information Retrieval Computer Science Department University of Massachusetts Amherst Amherst, MA-01003.

More information

ResPubliQA 2010

ResPubliQA 2010 SZTAKI @ ResPubliQA 2010 David Mark Nemeskey Computer and Automation Research Institute, Hungarian Academy of Sciences, Budapest, Hungary (SZTAKI) Abstract. This paper summarizes the results of our first

More information

A Family of Contextual Measures of Similarity between Distributions with Application to Image Retrieval

A Family of Contextual Measures of Similarity between Distributions with Application to Image Retrieval A Family of Contextual Measures of Similarity between Distributions with Application to Image Retrieval Florent Perronnin, Yan Liu and Jean-Michel Renders Xerox Research Centre Europe (XRCE) Textual and

More information

An Experimental Investigation into the Rank Function of the Heterogeneous Earliest Finish Time Scheduling Algorithm

An Experimental Investigation into the Rank Function of the Heterogeneous Earliest Finish Time Scheduling Algorithm An Experimental Investigation into the Rank Function of the Heterogeneous Earliest Finish Time Scheduling Algorithm Henan Zhao and Rizos Sakellariou Department of Computer Science, University of Manchester,

More information

Overview of the ImageCLEFphoto 2007 Photographic Retrieval Task

Overview of the ImageCLEFphoto 2007 Photographic Retrieval Task Overview of the ImageCLEFphoto 2007 Photographic Retrieval Task Michael Grubinger 1,PaulClough 2, Allan Hanbury 3, and Henning Müller 4,5 1 Victoria University, Melbourne, Australia 2 Sheffield University,

More information

The University of Amsterdam at the CLEF 2008 Domain Specific Track

The University of Amsterdam at the CLEF 2008 Domain Specific Track The University of Amsterdam at the CLEF 2008 Domain Specific Track Parsimonious Relevance and Concept Models Edgar Meij emeij@science.uva.nl ISLA, University of Amsterdam Maarten de Rijke mdr@science.uva.nl

More information

IPL at ImageCLEF 2010

IPL at ImageCLEF 2010 IPL at ImageCLEF 2010 Alexandros Stougiannis, Anestis Gkanogiannis, and Theodore Kalamboukis Information Processing Laboratory Department of Informatics Athens University of Economics and Business 76 Patission

More information

Document Structure Analysis in Associative Patent Retrieval

Document Structure Analysis in Associative Patent Retrieval Document Structure Analysis in Associative Patent Retrieval Atsushi Fujii and Tetsuya Ishikawa Graduate School of Library, Information and Media Studies University of Tsukuba 1-2 Kasuga, Tsukuba, 305-8550,

More information

TEXT CHAPTER 5. W. Bruce Croft BACKGROUND

TEXT CHAPTER 5. W. Bruce Croft BACKGROUND 41 CHAPTER 5 TEXT W. Bruce Croft BACKGROUND Much of the information in digital library or digital information organization applications is in the form of text. Even when the application focuses on multimedia

More information

A New Measure of the Cluster Hypothesis

A New Measure of the Cluster Hypothesis A New Measure of the Cluster Hypothesis Mark D. Smucker 1 and James Allan 2 1 Department of Management Sciences University of Waterloo 2 Center for Intelligent Information Retrieval Department of Computer

More information

A Text-Based Approach to the ImageCLEF 2010 Photo Annotation Task

A Text-Based Approach to the ImageCLEF 2010 Photo Annotation Task A Text-Based Approach to the ImageCLEF 2010 Photo Annotation Task Wei Li, Jinming Min, Gareth J. F. Jones Center for Next Generation Localisation School of Computing, Dublin City University Dublin 9, Ireland

More information

A Formal Approach to Score Normalization for Meta-search

A Formal Approach to Score Normalization for Meta-search A Formal Approach to Score Normalization for Meta-search R. Manmatha and H. Sever Center for Intelligent Information Retrieval Computer Science Department University of Massachusetts Amherst, MA 01003

More information

Supervised Models for Multimodal Image Retrieval based on Visual, Semantic and Geographic Information

Supervised Models for Multimodal Image Retrieval based on Visual, Semantic and Geographic Information Supervised Models for Multimodal Image Retrieval based on Visual, Semantic and Geographic Information Duc-Tien Dang-Nguyen, Giulia Boato, Alessandro Moschitti, Francesco G.B. De Natale Department of Information

More information

An Axiomatic Approach to IR UIUC TREC 2005 Robust Track Experiments

An Axiomatic Approach to IR UIUC TREC 2005 Robust Track Experiments An Axiomatic Approach to IR UIUC TREC 2005 Robust Track Experiments Hui Fang ChengXiang Zhai Department of Computer Science University of Illinois at Urbana-Champaign Abstract In this paper, we report

More information

ImageCLEF 2008: visual feature analysis in segmented images

ImageCLEF 2008: visual feature analysis in segmented images SZTAKI @ ImageCLEF 2008: visual feature analysis in segmented images Bálint Daróczy Zsolt Fekete Mátyás Brendel Simon Rácz András Benczúr Dávid Siklósi Attila Pereszlényi Data Mining and Web search Research

More information

BossaNova at ImageCLEF 2012 Flickr Photo Annotation Task

BossaNova at ImageCLEF 2012 Flickr Photo Annotation Task BossaNova at ImageCLEF 2012 Flickr Photo Annotation Task S. Avila 1,2, N. Thome 1, M. Cord 1, E. Valle 3, and A. de A. Araújo 2 1 Pierre and Marie Curie University, UPMC-Sorbonne Universities, LIP6, France

More information

Patent Terminlogy Analysis: Passage Retrieval Experiments for the Intellecutal Property Track at CLEF

Patent Terminlogy Analysis: Passage Retrieval Experiments for the Intellecutal Property Track at CLEF Patent Terminlogy Analysis: Passage Retrieval Experiments for the Intellecutal Property Track at CLEF Julia Jürgens, Sebastian Kastner, Christa Womser-Hacker, and Thomas Mandl University of Hildesheim,

More information

Chapter 6: Information Retrieval and Web Search. An introduction

Chapter 6: Information Retrieval and Web Search. An introduction Chapter 6: Information Retrieval and Web Search An introduction Introduction n Text mining refers to data mining using text documents as data. n Most text mining tasks use Information Retrieval (IR) methods

More information

DUTH at ImageCLEF 2011 Wikipedia Retrieval

DUTH at ImageCLEF 2011 Wikipedia Retrieval DUTH at ImageCLEF 2011 Wikipedia Retrieval Avi Arampatzis, Konstantinos Zagoris, and Savvas A. Chatzichristofis Department of Electrical and Computer Engineering, Democritus University of Thrace, Xanthi

More information

Information Retrieval. (M&S Ch 15)

Information Retrieval. (M&S Ch 15) Information Retrieval (M&S Ch 15) 1 Retrieval Models A retrieval model specifies the details of: Document representation Query representation Retrieval function Determines a notion of relevance. Notion

More information

Medical Image Annotation in ImageCLEF 2008

Medical Image Annotation in ImageCLEF 2008 Medical Image Annotation in ImageCLEF 2008 Thomas Deselaers 1 and Thomas M. Deserno 2 1 RWTH Aachen University, Computer Science Department, Aachen, Germany 2 RWTH Aachen University, Dept. of Medical Informatics,

More information

Metric Learning for Large-Scale Image Classification:

Metric Learning for Large-Scale Image Classification: Metric Learning for Large-Scale Image Classification: Generalizing to New Classes at Near-Zero Cost Florent Perronnin 1 work published at ECCV 2012 with: Thomas Mensink 1,2 Jakob Verbeek 2 Gabriela Csurka

More information

VisoLink: A User-Centric Social Relationship Mining

VisoLink: A User-Centric Social Relationship Mining VisoLink: A User-Centric Social Relationship Mining Lisa Fan and Botang Li Department of Computer Science, University of Regina Regina, Saskatchewan S4S 0A2 Canada {fan, li269}@cs.uregina.ca Abstract.

More information

MIRS: Text Based and Content Based Image Retrieval Trupti S. Atre 1, K.V.Metre 2

MIRS: Text Based and Content Based Image Retrieval Trupti S. Atre 1, K.V.Metre 2 MIRS: Text Based and Content Based Image Retrieval Trupti S. Atre 1, K.V.Metre 2 Abstract The main goal of this paper is to show multimedia information retrieval task using the combination of textual pre-filtering

More information

Improving Information Retrieval Effectiveness in Peer-to-Peer Networks through Query Piggybacking

Improving Information Retrieval Effectiveness in Peer-to-Peer Networks through Query Piggybacking Improving Information Retrieval Effectiveness in Peer-to-Peer Networks through Query Piggybacking Emanuele Di Buccio, Ivano Masiero, and Massimo Melucci Department of Information Engineering, University

More information

The Stanford/Technicolor/Fraunhofer HHI Video Semantic Indexing System

The Stanford/Technicolor/Fraunhofer HHI Video Semantic Indexing System The Stanford/Technicolor/Fraunhofer HHI Video Semantic Indexing System Our first participation on the TRECVID workshop A. F. de Araujo 1, F. Silveira 2, H. Lakshman 3, J. Zepeda 2, A. Sheth 2, P. Perez

More information

BUAA AUDR at ImageCLEF 2012 Photo Annotation Task

BUAA AUDR at ImageCLEF 2012 Photo Annotation Task BUAA AUDR at ImageCLEF 2012 Photo Annotation Task Lei Huang, Yang Liu State Key Laboratory of Software Development Enviroment, Beihang University, 100191 Beijing, China huanglei@nlsde.buaa.edu.cn liuyang@nlsde.buaa.edu.cn

More information

CLEF-IP 2009: Exploring Standard IR Techniques on Patent Retrieval

CLEF-IP 2009: Exploring Standard IR Techniques on Patent Retrieval DCU @ CLEF-IP 2009: Exploring Standard IR Techniques on Patent Retrieval Walid Magdy, Johannes Leveling, Gareth J.F. Jones Centre for Next Generation Localization School of Computing Dublin City University,

More information

Image Query Expansion Using Semantic Selectional Restrictions

Image Query Expansion Using Semantic Selectional Restrictions Image Query Expansion Using Semantic Selectional Restrictions Osama El Demerdash, Sabine Bergler and Leila Kosseim Concordia University {osama el,bergler,kosseim}@cse.concordia.ca CLaC Laboratory - Department

More information

Improving Recognition through Object Sub-categorization

Improving Recognition through Object Sub-categorization Improving Recognition through Object Sub-categorization Al Mansur and Yoshinori Kuno Graduate School of Science and Engineering, Saitama University, 255 Shimo-Okubo, Sakura-ku, Saitama-shi, Saitama 338-8570,

More information

A fully-automatic approach to answer geographic queries: GIRSA-WP at GikiP

A fully-automatic approach to answer geographic queries: GIRSA-WP at GikiP A fully-automatic approach to answer geographic queries: at GikiP Johannes Leveling Sven Hartrumpf Intelligent Information and Communication Systems (IICS) University of Hagen (FernUniversität in Hagen)

More information

Medical Image Annotation in ImageCLEF 2008

Medical Image Annotation in ImageCLEF 2008 Medical Image Annotation in ImageCLEF 2008 Thomas Deselaers 1 and Thomas M. Deserno 2 1 RWTH Aachen University, Computer Science Department, Aachen, Germany deselaers@cs.rwth-aachen.de 2 RWTH Aachen University,

More information

CADIAL Search Engine at INEX

CADIAL Search Engine at INEX CADIAL Search Engine at INEX Jure Mijić 1, Marie-Francine Moens 2, and Bojana Dalbelo Bašić 1 1 Faculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, 10000 Zagreb, Croatia {jure.mijic,bojana.dalbelo}@fer.hr

More information

Image retrieval based on bag of images

Image retrieval based on bag of images University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2009 Image retrieval based on bag of images Jun Zhang University of Wollongong

More information

Overview of iclef 2008: search log analysis for Multilingual Image Retrieval

Overview of iclef 2008: search log analysis for Multilingual Image Retrieval Overview of iclef 2008: search log analysis for Multilingual Image Retrieval Julio Gonzalo Paul Clough Jussi Karlgren UNED U. Sheffield SICS Spain United Kingdom Sweden julio@lsi.uned.es p.d.clough@sheffield.ac.uk

More information

THE preceding chapters were all devoted to the analysis of images and signals which

THE preceding chapters were all devoted to the analysis of images and signals which Chapter 5 Segmentation of Color, Texture, and Orientation Images THE preceding chapters were all devoted to the analysis of images and signals which take values in IR. It is often necessary, however, to

More information

An Investigation of Basic Retrieval Models for the Dynamic Domain Task

An Investigation of Basic Retrieval Models for the Dynamic Domain Task An Investigation of Basic Retrieval Models for the Dynamic Domain Task Razieh Rahimi and Grace Hui Yang Department of Computer Science, Georgetown University rr1042@georgetown.edu, huiyang@cs.georgetown.edu

More information

Web Information Retrieval using WordNet

Web Information Retrieval using WordNet Web Information Retrieval using WordNet Jyotsna Gharat Asst. Professor, Xavier Institute of Engineering, Mumbai, India Jayant Gadge Asst. Professor, Thadomal Shahani Engineering College Mumbai, India ABSTRACT

More information

External Query Reformulation for Text-based Image Retrieval

External Query Reformulation for Text-based Image Retrieval External Query Reformulation for Text-based Image Retrieval Jinming Min and Gareth J. F. Jones Centre for Next Generation Localisation School of Computing, Dublin City University Dublin 9, Ireland {jmin,gjones}@computing.dcu.ie

More information

Ranking Web Pages by Associating Keywords with Locations

Ranking Web Pages by Associating Keywords with Locations Ranking Web Pages by Associating Keywords with Locations Peiquan Jin, Xiaoxiang Zhang, Qingqing Zhang, Sheng Lin, and Lihua Yue University of Science and Technology of China, 230027, Hefei, China jpq@ustc.edu.cn

More information

Alberto Messina, Maurizio Montagnuolo

Alberto Messina, Maurizio Montagnuolo A Generalised Cross-Modal Clustering Method Applied to Multimedia News Semantic Indexing and Retrieval Alberto Messina, Maurizio Montagnuolo RAI Centre for Research and Technological Innovation Madrid,

More information

GoNTogle: A Tool for Semantic Annotation and Search

GoNTogle: A Tool for Semantic Annotation and Search GoNTogle: A Tool for Semantic Annotation and Search Giorgos Giannopoulos 1,2, Nikos Bikakis 1,2, Theodore Dalamagas 2, and Timos Sellis 1,2 1 KDBSL Lab, School of ECE, Nat. Tech. Univ. of Athens, Greece

More information

MSRA Columbus at GeoCLEF2007

MSRA Columbus at GeoCLEF2007 MSRA Columbus at GeoCLEF2007 Zhisheng Li 1, Chong Wang 2, Xing Xie 2, Wei-Ying Ma 2 1 Department of Computer Science, University of Sci. & Tech. of China, Hefei, Anhui, 230026, P.R. China zsli@mail.ustc.edu.cn

More information

MSRA Columbus at GeoCLEF 2006

MSRA Columbus at GeoCLEF 2006 MSRA Columbus at GeoCLEF 2006 Zhisheng Li, Chong Wang 2, Xing Xie 2, Wei-Ying Ma 2 Department of Computer Science, University of Sci. & Tech. of China, Hefei, Anhui, 230026, P.R. China zsli@mail.ustc.edu.cn

More information

Modeling Spatial Relations for Image Retrieval by Conceptual Graphs

Modeling Spatial Relations for Image Retrieval by Conceptual Graphs Modeling Spatial Relations for Image Retrieval by Conceptual Graphs Carlos Hernández-Gracidas, L. Enrique Sucar and Manuel Montes-y-Gómez Computer Science Department National Institute of Astrophysics,

More information

Influence of Word Normalization on Text Classification

Influence of Word Normalization on Text Classification Influence of Word Normalization on Text Classification Michal Toman a, Roman Tesar a and Karel Jezek a a University of West Bohemia, Faculty of Applied Sciences, Plzen, Czech Republic In this paper we

More information

Improved Fisher Vector for Large Scale Image Classification XRCE's participation for ILSVRC

Improved Fisher Vector for Large Scale Image Classification XRCE's participation for ILSVRC Improved Fisher Vector for Large Scale Image Classification XRCE's participation for ILSVRC Jorge Sánchez, Florent Perronnin and Thomas Mensink Xerox Research Centre Europe (XRCE) Overview Fisher Vector

More information

Word Embedding for Social Book Suggestion

Word Embedding for Social Book Suggestion Word Embedding for Social Book Suggestion Nawal Ould-Amer 1, Philippe Mulhem 1, Mathias Géry 2, and Karam Abdulahhad 1 1 Univ. Grenoble Alpes, LIG, F-38000 Grenoble, France CNRS, LIG, F-38000 Grenoble,

More information

Clustering for Text and Image-Based Photo Retrieval at CLEF 2009

Clustering for Text and Image-Based Photo Retrieval at CLEF 2009 Clustering for ext and mage-based Photo Retrieval at CLEF 2009 Qian Zhu and Diana nkpen School of nformation echnology and Engineering University of Ottawa qzhu012@uottawa.ca, diana@site.uottawa.ca Abstract.

More information

Entity-Based Information Retrieval System: A Graph-Based Document and Query Model

Entity-Based Information Retrieval System: A Graph-Based Document and Query Model Entity-Based Search Entity-Based Information Retrieval System: A Graph-Based Document and Query Model Mohannad ALMASRI 1, Jean-Pierre Chevallet 2, Catherine Berrut 1 1 UNIVSERSITÉ JOSEPH FOURIER - GRENOBLE

More information

Cross-Language Evaluation Forum - CLEF

Cross-Language Evaluation Forum - CLEF Cross-Language Evaluation Forum - CLEF Carol Peters IEI-CNR, Pisa, Italy IST-2000-31002 Kick-off: October 2001 Outline Project Objectives Background CLIR System Evaluation CLEF Infrastructure Results so

More information

Robust Relevance-Based Language Models

Robust Relevance-Based Language Models Robust Relevance-Based Language Models Xiaoyan Li Department of Computer Science, Mount Holyoke College 50 College Street, South Hadley, MA 01075, USA Email: xli@mtholyoke.edu ABSTRACT We propose a new

More information

University of Santiago de Compostela at CLEF-IP09

University of Santiago de Compostela at CLEF-IP09 University of Santiago de Compostela at CLEF-IP9 José Carlos Toucedo, David E. Losada Grupo de Sistemas Inteligentes Dept. Electrónica y Computación Universidad de Santiago de Compostela, Spain {josecarlos.toucedo,david.losada}@usc.es

More information

Conceptual document indexing using a large scale semantic dictionary providing a concept hierarchy

Conceptual document indexing using a large scale semantic dictionary providing a concept hierarchy Conceptual document indexing using a large scale semantic dictionary providing a concept hierarchy Martin Rajman, Pierre Andrews, María del Mar Pérez Almenta, and Florian Seydoux Artificial Intelligence

More information

IMPROVING INFORMATION RETRIEVAL BASED ON QUERY CLASSIFICATION ALGORITHM

IMPROVING INFORMATION RETRIEVAL BASED ON QUERY CLASSIFICATION ALGORITHM IMPROVING INFORMATION RETRIEVAL BASED ON QUERY CLASSIFICATION ALGORITHM Myomyo Thannaing 1, Ayenandar Hlaing 2 1,2 University of Technology (Yadanarpon Cyber City), near Pyin Oo Lwin, Myanmar ABSTRACT

More information

Query Likelihood with Negative Query Generation

Query Likelihood with Negative Query Generation Query Likelihood with Negative Query Generation Yuanhua Lv Department of Computer Science University of Illinois at Urbana-Champaign Urbana, IL 61801 ylv2@uiuc.edu ChengXiang Zhai Department of Computer

More information

University of Alicante at NTCIR-9 GeoTime

University of Alicante at NTCIR-9 GeoTime University of Alicante at NTCIR-9 GeoTime Fernando S. Peregrino fsperegrino@dlsi.ua.es David Tomás dtomas@dlsi.ua.es Department of Software and Computing Systems University of Alicante Carretera San Vicente

More information

TagProp: Discriminative Metric Learning in Nearest Neighbor Models for Image Annotation

TagProp: Discriminative Metric Learning in Nearest Neighbor Models for Image Annotation TagProp: Discriminative Metric Learning in Nearest Neighbor Models for Image Annotation Matthieu Guillaumin, Thomas Mensink, Jakob Verbeek, Cordelia Schmid LEAR team, INRIA Rhône-Alpes, Grenoble, France

More information

A Survey on Multimedia Information Retrieval System

A Survey on Multimedia Information Retrieval System A Survey on Multimedia Information Retrieval System Pooja Shinde 1, Prof. J. V. Shinde 2 1 M.E Student, Kalyani Charitable Trust s Late G.N. Sapkal College of Engineering 2 Professor, Kalyani Charitable

More information

Exploiting Index Pruning Methods for Clustering XML Collections

Exploiting Index Pruning Methods for Clustering XML Collections Exploiting Index Pruning Methods for Clustering XML Collections Ismail Sengor Altingovde, Duygu Atilgan and Özgür Ulusoy Department of Computer Engineering, Bilkent University, Ankara, Turkey {ismaila,

More information

Multimodal information approaches for the Wikipedia collection at ImageCLEF 2011

Multimodal information approaches for the Wikipedia collection at ImageCLEF 2011 Multimodal information approaches for the Wikipedia collection at ImageCLEF 2011 R. Granados 1, J. Benavent 2, X. Benavent 2, E. de Ves 2, Ana García-Serrano 1 1 Universidad Nacional de Educación a Distancia,

More information

Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval O. Chum, et al.

Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval O. Chum, et al. Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval O. Chum, et al. Presented by Brandon Smith Computer Vision Fall 2007 Objective Given a query image of an object,

More information

Computing Similarity between Cultural Heritage Items using Multimodal Features

Computing Similarity between Cultural Heritage Items using Multimodal Features Computing Similarity between Cultural Heritage Items using Multimodal Features Nikolaos Aletras and Mark Stevenson Department of Computer Science, University of Sheffield Could the combination of textual

More information

TREC 2016 Dynamic Domain Track: Exploiting Passage Representation for Retrieval and Relevance Feedback

TREC 2016 Dynamic Domain Track: Exploiting Passage Representation for Retrieval and Relevance Feedback RMIT @ TREC 2016 Dynamic Domain Track: Exploiting Passage Representation for Retrieval and Relevance Feedback Ameer Albahem ameer.albahem@rmit.edu.au Lawrence Cavedon lawrence.cavedon@rmit.edu.au Damiano

More information

Informativeness for Adhoc IR Evaluation:

Informativeness for Adhoc IR Evaluation: Informativeness for Adhoc IR Evaluation: A measure that prevents assessing individual documents Romain Deveaud 1, Véronique Moriceau 2, Josiane Mothe 3, and Eric SanJuan 1 1 LIA, Univ. Avignon, France,

More information

Overview of the ImageCLEFphoto 2007 photographic retrieval task

Overview of the ImageCLEFphoto 2007 photographic retrieval task Overview of the ImageCLEFphoto 2007 photographic retrieval task Michael Grubinger 1, Paul Clough 2, Allan Hanbury 3, Henning Müller 4 1 Victoria University, Melbourne, Australia 2 Sheffield University,

More information

Dynamic Two-Stage Image Retrieval from Large Multimodal Databases

Dynamic Two-Stage Image Retrieval from Large Multimodal Databases Dynamic Two-Stage Image Retrieval from Large Multimodal Databases Avi Arampatzis, Konstantinos Zagoris, and Savvas A. Chatzichristofis Department of Electrical and Computer Engineering, Democritus University

More information

CMPSCI 646, Information Retrieval (Fall 2003)

CMPSCI 646, Information Retrieval (Fall 2003) CMPSCI 646, Information Retrieval (Fall 2003) Midterm exam solutions Problem CO (compression) 1. The problem of text classification can be described as follows. Given a set of classes, C = {C i }, where

More information

Prior Art Retrieval Using Various Patent Document Fields Contents

Prior Art Retrieval Using Various Patent Document Fields Contents Prior Art Retrieval Using Various Patent Document Fields Contents Metti Zakaria Wanagiri and Mirna Adriani Fakultas Ilmu Komputer, Universitas Indonesia Depok 16424, Indonesia metti.zakaria@ui.edu, mirna@cs.ui.ac.id

More information

MICC-UNIFI at ImageCLEF 2013 Scalable Concept Image Annotation

MICC-UNIFI at ImageCLEF 2013 Scalable Concept Image Annotation MICC-UNIFI at ImageCLEF 2013 Scalable Concept Image Annotation Tiberio Uricchio, Marco Bertini, Lamberto Ballan, and Alberto Del Bimbo Media Integration and Communication Center (MICC) Università degli

More information

Multi-Modal Interactive Approach to ImageCLEF 2007 Photographic and Medical Retrieval Tasks by CINDI

Multi-Modal Interactive Approach to ImageCLEF 2007 Photographic and Medical Retrieval Tasks by CINDI Multi-Modal Interactive Approach to ImageCLEF 2007 Photographic and Medical Retrieval Tasks by CINDI M. M. Rahman, Bipin C. Desai, Prabir Bhattacharya Dept. of Computer Science & Software Engineering,

More information

University of Amsterdam at INEX 2010: Ad hoc and Book Tracks

University of Amsterdam at INEX 2010: Ad hoc and Book Tracks University of Amsterdam at INEX 2010: Ad hoc and Book Tracks Jaap Kamps 1,2 and Marijn Koolen 1 1 Archives and Information Studies, Faculty of Humanities, University of Amsterdam 2 ISLA, Faculty of Science,

More information

Predicting Popular Xbox games based on Search Queries of Users

Predicting Popular Xbox games based on Search Queries of Users 1 Predicting Popular Xbox games based on Search Queries of Users Chinmoy Mandayam and Saahil Shenoy I. INTRODUCTION This project is based on a completed Kaggle competition. Our goal is to predict which

More information

A novel supervised learning algorithm and its use for Spam Detection in Social Bookmarking Systems

A novel supervised learning algorithm and its use for Spam Detection in Social Bookmarking Systems A novel supervised learning algorithm and its use for Spam Detection in Social Bookmarking Systems Anestis Gkanogiannis and Theodore Kalamboukis Department of Informatics Athens University of Economics

More information

Lab for Media Search, National University of Singapore 1

Lab for Media Search, National University of Singapore 1 1 2 Word2Image: Towards Visual Interpretation of Words Haojie Li Introduction Motivation A picture is worth 1000 words Traditional dictionary Containing word entries accompanied by photos or drawing to

More information

Evaluation Methods for Focused Crawling

Evaluation Methods for Focused Crawling Evaluation Methods for Focused Crawling Andrea Passerini, Paolo Frasconi, and Giovanni Soda DSI, University of Florence, ITALY {passerini,paolo,giovanni}@dsi.ing.unifi.it Abstract. The exponential growth

More information

EFFICIENT INTEGRATION OF SEMANTIC TECHNOLOGIES FOR PROFESSIONAL IMAGE ANNOTATION AND SEARCH

EFFICIENT INTEGRATION OF SEMANTIC TECHNOLOGIES FOR PROFESSIONAL IMAGE ANNOTATION AND SEARCH EFFICIENT INTEGRATION OF SEMANTIC TECHNOLOGIES FOR PROFESSIONAL IMAGE ANNOTATION AND SEARCH Andreas Walter FZI Forschungszentrum Informatik, Haid-und-Neu-Straße 10-14, 76131 Karlsruhe, Germany, awalter@fzi.de

More information

From Passages into Elements in XML Retrieval

From Passages into Elements in XML Retrieval From Passages into Elements in XML Retrieval Kelly Y. Itakura David R. Cheriton School of Computer Science, University of Waterloo 200 Univ. Ave. W. Waterloo, ON, Canada yitakura@cs.uwaterloo.ca Charles

More information

Tag Based Image Search by Social Re-ranking

Tag Based Image Search by Social Re-ranking Tag Based Image Search by Social Re-ranking Vilas Dilip Mane, Prof.Nilesh P. Sable Student, Department of Computer Engineering, Imperial College of Engineering & Research, Wagholi, Pune, Savitribai Phule

More information

CS473: Course Review CS-473. Luo Si Department of Computer Science Purdue University

CS473: Course Review CS-473. Luo Si Department of Computer Science Purdue University CS473: CS-473 Course Review Luo Si Department of Computer Science Purdue University Basic Concepts of IR: Outline Basic Concepts of Information Retrieval: Task definition of Ad-hoc IR Terminologies and

More information

An Edge-Based Approach to Motion Detection*

An Edge-Based Approach to Motion Detection* An Edge-Based Approach to Motion Detection* Angel D. Sappa and Fadi Dornaika Computer Vison Center Edifici O Campus UAB 08193 Barcelona, Spain {sappa, dornaika}@cvc.uab.es Abstract. This paper presents

More information

Automatically Generating Queries for Prior Art Search

Automatically Generating Queries for Prior Art Search Automatically Generating Queries for Prior Art Search Erik Graf, Leif Azzopardi, Keith van Rijsbergen University of Glasgow {graf,leif,keith}@dcs.gla.ac.uk Abstract This report outlines our participation

More information