Trans-Media Pseudo-Relevance Feedback Methods in Multimedia Retrieval
|
|
- Chrystal Garrett
- 5 years ago
- Views:
Transcription
1 Trans-Media Pseudo-Relevance Feedback Methods in Multimedia Retrieval Stephane Clinchant, Jean-Michel Renders, and Gabriela Csurka Xerox Research Centre Europe, 6 ch. de Maupertuis, Meylan, France FirstName.LastName@xrce.xerox.com Abstract. We present here some transmedia similarity measures that we recently designed by adopting some intermediate level fusion approaches. The main idea is to use some principles coming from pseudorelevance feedback and, more specifically, transmedia pseudo-relevance feedback for enriching the mono-media representation of an object with features coming from the other media. One issue that arises when adopting such a strategy is to determine how to compute the mono-media similarity between an aggregate of objects coming from a first (pseudo-)feedback step and one single multimodal object. We propose two alternative ways of adressing this issue, that result in what we called the transmedia document reranking and complementary feedback methods respectively. For the ImageCLEF - Photo Retrieval Task, it appears that monomedia retrieval performance is more or less equivalent for pure image and pure text content (around 20% MAP). Using our transmedia pseudofeedback-based similarity measures allowed us to dramatically increase the performance by 50% (relative). From a cross-lingual perspective, the use of domain-specific, corpus-adapted probabilistic dictionaries seems to offer better results than the use of a broader, more general standard dictionary. With respect to the monolingual baselines, multilingual runs show a slight degradation of retrieval performance ( 6 to 10% relative). Keywords: hybrid retrieval, trans-media relevance feedback. 1 Introduction and Related Works Up to now, many standard methods to tackle the problem of defining efficient trans-modal similarity measures and of solving the associated semantic gap use late fusion strategies. Basically, they rely on mono-modal analysis of multifacet objects, computing mono-modal similarities independently and then merging these mono-modal similarities by some simple aggregation operator. In contrast to these strategies, we propose here two intermediate level fusion approaches. Both approaches, similarly to [1,2,3], are based on Transmedia Relevance Pseudo-Feedback, i.e. they are mixed-modality extentions of Relevance Models in which the modality of data is switched during the (pseudo-) feedback process, from image to text or text to image. C. Peters et al. (Eds.): CLEF 2007, LNCS 5152, pp , c Springer-Verlag Berlin Heidelberg 2008
2 570 S. Clinchant, J.-M. Renders, and G. Csurka Our first approach, called Complementary Feedback (section 3.1), is similar to the approach suggested by [3]. However, while [3] uses classical text-based feedback algorithms (like Rocchio), we use a pseudo-feedback method issued from the language modelling approach to information retrieval, namely the mixture model method from Zhai and Lafferty [4] originally designed to enrich textual queries. In our second approach, called Transmedia Document Reranking Approach (section 3.2), we do not really extract a new query, nor enrich an existing one. This second approach uses the similarity computed in the other mode as a component of feedback, in order to rerank documents. So, this is a one step retrieval, contrarily to the first one (and the related works). This method is quite general since it can be applied to any textual/visual similarities or, equivalently, with any mono-modal (textual / visual) retrieval engine. This is not the case for the other methods: [1], for instance, is based on a specific similarity model both for texts and images. Moreover, as the alternative methods require a second retrieval step, the use of a particular choice of text feedback method depends implicitly on the underlying text retrieval model. Our method is free from such dependencies, since it works on similarities as basic components. Even if both approaches appear to be rather simpler than most alternative state-of-the-art approaches, they turned out to give superior results in the ImageClef PhotoRetrieval Track ([5]). 2 Monomedia Similarities 2.1 Cross-Entropy between Texts Starting from a traditional bag-of-word representation of pre-processed texts (here, preprocessing includes tokenization, lemmatization, word decompounding and standard stopword removal), we adopt the language modeling approach to information retrieval and we use the (asymmetric) cross-entropy function as similarity. Particular details of this textual similarity measure are given in [6]. 2.2 FisherVectorsforImages To compute the similarity measure between images I and J, we simply use the the L1-norm of the difference between their Fisher vectors (normalised gradient vector of the corresponding generative model, with unitary L1-norm; see details in [6,7]). 3 Cross-Media Similarities Based on Transmedia Relevance Feedback The main idea is the following: for a given image i, consider as new features the (textual) terms of the texts associated to the most similar images (from a purely visual viewpoint). We will denote this neighbouring set as N img (i). Its size is fixed a priori: this is typically the topn objects returned from a retrieval system
3 Trans-Media Pseudo-Relevance Feedback Methods in Multimedia Retrieval 571 (CBIR) or the N nearest-neighbours using some predefined visual similarity measures. Then, we can compute a new similarity with respect to any multimodal object j of the collection O as the textual similarity of this new representation of the image i with the textual parts of j. There are three families of approaches to compute the mono-media similarity between an aggregate of objects N img (i) and one single multimodal object: 1. aggregating N img (i) to form a single object (typically by concatenation) and then compute standard similarity between two objects; 2. use a method of pseudo feedback algorithm (for instance Rochhio s algorithm) to extract relevant, meaningfull features of an aggregate and finally use a mono-media similarity. 3. aggregating all similarity measures (assuming we can) between all possible couple of objects Methods of families 1 and 2 involve therefore the creation of a new single object (in this case, a text) and a new retrieval step (this time using a text retrieval system). The third family does not. 3.1 Complementary Pseudo-Feedback This approach, as the work presented in [3], belongs to the second family of aggregation strategies mentionned in the previous section but, contrarily to [3], our method uses the Language Modelling framework to realize the pseudo-feedback. Recall that the fundamental problem in transmedia feedback is to define how we compute the mono-media similarity between an aggregate of objects N img (i) (or N txt (i)) and one single multimodal object. The main idea here is to consider the set N img (i) as the relevance concept F and derive its corresponding language model (LM) θ F. Afterwards, we can use the cross-entropy function between θ F and the LM of the textual part of any object j in O as the new transmedia similarity. We adopt the framework given by the mixture model method from Zhai and Lafferty [4] (originally designed to enrich textual queries), to derive the LM associated with F. See [6] for practical details. Once θ F has been estimated, a new query LM can be obtained trough interpolation: θ new query = αθ old query +(1 α)θ F (1) where θ old query corresponds to the LM of the textual part of the query i. In a nearly dual way, starting from the textual part of the query, a similar scheme using N txt (i) can be adopted to derive a new visual representation (actually some generalized Fisher Vectors) of the relevance concept, this time relying on Rocchio s method that is more adapted to continuous feature representation. 3.2 Transmedia Document Reranking Unlike the Complementary Feedback, the Transmedia Document Reranking approach belongs to the third family of aggregation strategies mentionned in 3.
4 572 S. Clinchant, J.-M. Renders, and G. Csurka The main idea is to define a new cross-media similarity measure by aggregating all similarity measures (assuming we can) between all possible couple of objects retrieved by the Transmedia Relevance Feedback. More formally, if we denote by T (u) the text associated to multimodal object u and by ˆT (i) the new textual representation of image i, then the new crossmedia similarity measure w.r.t. the multimodal object j is: sim ImgTxt (i, j) =sim txt ( ˆT (i), T (j)) = sim txt (T (d), T (j)) (2) d N img(i) where sim txt is any textual similarity measure but, in a particular embodiment, we propose to use the cross-entropy function (e.g. the one based on Language Modelling, even if it is assymetric), that appears to be one of the most effective measures in purely textual information retrieval systems. This method can be seen as a reranking method. Suppose that q is some image query; if T (d) is the text of an image belonging to the initial feedback set N img (q), then the rank of the own neighbors of T (d) in the textual sense will be increased, even if they are not so similar from a purely visual viewpoint. In particular, this allows to define a similarity between a purely image query and a simple textual object without visual counterpart. By duality, we can define another cross-media similarity measure: for a given text i, we consider as new features the Fisher vectors of the images associated to the most similar texts (from a purely textual viewpoint) in the multimodal database. We will denote this neighbouring set as N txt (i). If we denote by I(u) the image associated to multimodal object u and by Î(i) the new visual representation of text i, then the new cross-media similarity measure is: sim TxtImg (i, j) =sim img (Î(i), I(j)) = d N txt(i) sim img (I(d), I(j)) (3) Finally, we can combine all the similarities to define a global similarity measure between two multi-modal objects i and j: for instance, using a linear combination, sim glob (i, j) =λ 1 sim txt (T (i), T (j)) +λ 2 sim img (I(i), I(j)) +λ 3 sim ImgTxt (i, j) +λ 4 sim TxtImg (i, j) In one embodiment, we use a simple weighted averages of these similarities, and optimize the weights through the use of a labelled/annotated training set. The main advantage of this method is that, using an aggregation strategy that belongs to family 3, it does not require any further retrieval step. Furthermore, it exploits all trans-modal paths (TXT-TXT, TXT-IMG, IMG-TXT and IMG-IMG) and combines them. Finally, we can pre-compute the monomodal similaritites (textual and visual) between all pairs of objects in the multimedia reference repository, as these computations are independent from the objects in the run-time application; once stored, these values can be re-injected into the translingual similarity equations at run-time, greatly reducing the computation time.
5 Trans-Media Pseudo-Relevance Feedback Methods in Multimedia Retrieval Experimental Results Table 1 shows the name of our ImageCLEF runs and the corresponding mean average precision measures. For a description of the task, the corpus and the queries, refer to [5]. Table 1. Official Runs Run Txt Img CF1 CF2 CF3 TR1 TR2 TR3 MAP Below is a detailed description of all the methods we used for the runs: Txt:This run was a pure text run: documents were basically preprocessed and each document was enriched using Flickr database. For each term of a document, its top 20 related tags from Flickr were added to the document (see details in [6]). Then, a unigram language model for each document is estimated, giving more weight to the original document terms. An additional step of pseudo-relevance feedback using the method explained in [4] is then performed. Img:This run is a pure image run: it uses Fisher Kernel metric to define the image similarity. As a query encompasses 3 visual sub-queries, we have to combine the similarity score with respect to these 3 subqueries. To this aim, the result lists from the image sub-queries are renormalized (by substracting the mean and dividing by the standard deviation) and merged by simple sum. CF1:This run uses both texts and images: it starts from query images only, to determine the relevance set N img (i) for each query i andthenimplements the the complementary (intermedia) feedback described in section 3.1. The size of the neighbouring set is 15. Refering to the notation of section 3.1, the value of α is 0.5. CF2:This runs works with the same principle as the previous run CF1. The main difference is that (target) english documents have been enriched with Flickr and that the initial query in German was translated by multiplying its Language Model by the probabilistic translation matrix extracted from the (small) parallel part of the corpus. Otherwise, it uses the same parameters as previously. CF3:This run uses the same process as in CF1, exceptthatitusesenglish queries to search for German annotations. English queries are translated with the probabilistic translation matrix extracted from the (small) parallel part of the corpus and the translated queries follow the same process as in CF1 but with different parameter : the size of the neighbouring set is 10, while the value of α is 0.7. TR1:This run uses both texts and images: it starts from query images only, to determine N img (i) for each query i (as in the previous run above) and then implements the method Transmedia Reranking method described in section 3.2. The size of the neighbouring set is 5.
6 574 S. Clinchant, J.-M. Renders, and G. Csurka TR2:It is basically the same algorithm as the preceding run TR1, except that the textual part of the data (annotations) is enriched with Flickr tags. TR3:This run uses the TR algorithm as in TR1 but, we merge the result lists from TR1 and from the purely text queries (Txt), by summing the relevance scores after normalisation (by substracting the mean and dividing by the standard deviation for each list). 3.4 Topic-Based Analysis of Results In order to better understand the possible correlations between the different methods and/or the systematic superiority of some of them, Figure 1 compares the Average Precisions for each pair of methods and for each topic. Methods are:text-only (TXT), image-only (IMG), our best Complementary Feedback (CF) and our best Transmedia Reranking (TR) approaches. Fig. 1. Average Precision values per topic, for six pairs of methods A deeper analysis of the individual topics leads to the following conclusions: From a purely visual aspect, search performance is better when the example images of the query are similar between themselves; search results degrade in the opposite case. See examples in Figure 2. The combination between text and image works better if the text query is complementary with respect to the visual information (see for instance left column of Figure 3). The combination does not perform well when either one of the media works very badly, especially the image, which is not suprising as the images were used for transmedia pseudo-relevance feedback (e.g. topics 3 and 32). There were also examples in which multi-media retrieval performance was poor, while individual mono-media retrieval worked not too bad (e.g. right
7 Trans-Media Pseudo-Relevance Feedback Methods in Multimedia Retrieval 575 Fig. 2. Left column: query images from topics for which the retrieval worked best: (a) 22 tennis player during rally, (b) 55 drawings in Peruvian desert and (c) 51 photos of goddaughters from Brasil. Right column: query images from topics for which the retrieval worked worst: (d) 9 tourist accomodation near Lake Titicaca, (e) 10 destinations in Venezuela and (e) 39 people in bad weather. Fig. 3. Left column: Query images from topics with best hybrid combinations: (a) 21 accomodations provided by host families, (b) 44 mountains in mainland Australia and (c) 48 vehicle in South Korea. Right column: query images from topics with worst hybrid combinations : (d) 38 Machu Picchu and Huayna Picchu in bad weather, (e) 11 black and white photos from Russia and (f) 3 religious statue in the foreground. column of 3). The reason might be that the retrieved images were incorrectly reranked based on their textual similarity with the query text. For example, for topic 38, non relevant images of Machu Picchu and Huayna Picchu (because not taken under showing bad weather condition) got better ranking,
8 576 S. Clinchant, J.-M. Renders, and G. Csurka with the effect of decreasing the precision (P20 falling down from 0.7 to 0.21 (for TR) and to 0.3 (for CF). 4 Conclusion With a slightly annotated corpus of images, also characterised by an abstraction level in the textual description that is significantly different from the one used in the queries, it appears that mono-media retrieval performance is more or less equivalent for pure image and pure text content (around 20% MAP). Using our transmedia pseudofeedback-based similarity measures allowed us to dramatically increase the performance by 50% (relative). Trying to model the textual relevance concept present in thetoprankeddocumentsissuedfrom a first (purely visual) retrieval and combining this with the textual part of the original query turns out to be the best strategy, being slightly superior to our transmedia document reranking method. From a cross-lingual perspective, the use of domain-specific, corpus-adapted probabilistic dictionaries seems to offer better results than the use of a broader, more general standard dictionary. With respect to the monolingual baseline, multilingual runs show a slight degradation of retrieval performance ( 6 to 10% relative). Acknowledgments. This work was partly funded by the French Government under the Infomagic project, part of the Pole CAP DIGITAL (IMVN) de Paris, Ile-de-France. The authors also want to thank Florent Perronin for his greatly appreciated help in applying some of the Generic Visual Categorizer components in our experiments. References 1. Lavrenko, V., Manmatha, R., Jeon, J.: A model for learning the semantics of pictures. In: NIPS (2003) 2. Chang, Y.C., Chen, H.H.: Approaches of using a word-image ontology and an annotated image corpus as intermedia for cross-language image retrieval. In: Working Notes of the CLEF Workshop, Alicante, Spain (2006) 3. Maillot, N., Chevallet, J.-P., Valea, V., Lim, J.H.: Ipal inter-media pseudo-relevance feedback approach to imageclef 2006 photo retrieval. In: Working Notes of the CLEF Workshop, Alicante, Spain (2006) 4. Zhai, C., Lafferty, J.D.: Model-based feedback in the language modeling approach to information retrieval. In: CIKM (2001) 5. Grubinger, M., Clough, P., Hanbury, A., Müller, H.: Overview of the ImageCLEFphoto 2007 photographic retrieval task. In: Working Notes of the CLEF Workshop, Budapest, Hungary (2007) 6. Clinchant, S., Renders, J.-M., Csurka, G.: Xrce s participation to ImageCLEF In: Working Notes of the CLEF Workshop, Budapest, Hungary (2007) 7. Perronnin, F., Dance, C.: Fisher kernels on visual vocabularies for image categorization. In: CVPR (2007)
XRCE s Participation to ImageCLEFphoto 2007
XRCE s Participation to ImageCLEFphoto 2007 Stephane Clinchant, Jean-Michel Renders and Gabriela Csurka Xerox Research Centre Europe, 6 ch. de Maupertuis, 38240 Meylan, France FirstName.LastName@xrce.xerox.com
More informationUsing an Image-Text Parallel Corpus and the Web for Query Expansion in Cross-Language Image Retrieval
Using an Image-Text Parallel Corpus and the Web for Query Expansion in Cross-Language Image Retrieval Yih-Chen Chang and Hsin-Hsi Chen * Department of Computer Science and Information Engineering National
More informationExperiment for Using Web Information to do Query and Document Expansion
Experiment for Using Web Information to do Query and Document Expansion Yih-Chen Chang and Hsin-Hsi Chen * Department of Computer Science and Information Engineering National Taiwan University Taipei,
More informationXRCE s Participation in ImageCLEF 2009
XRCE s Participation in ImageCLEF 2009 Julien Ah-Pine 1, Stephane Clinchant 1,2, Gabriela Csurka 1, Yan Liu 1 1 Xerox Research Centre Europe, 6 chemin de Maupertuis 38240, Meylan France firstname.lastname@xrce.xerox.com
More informationIPAL at CLEF 2008: Mixed-Modality based Image Search, Novelty based Re-ranking and Extended Matching
IPAL at CLEF 2008: Mixed-Modality based Image Search, Novelty based Re-ranking and Extended Matching Sheng Gao, Jean-Pierre Chevallet and Joo-Hwee Lim IPAL, Institute for Infocomm Research, A*Star, Singapore
More informationEvaluation of Automatically Assigned MeSH Terms for Retrieval of Medical Images
Evaluation of Automatically Assigned MeSH Terms for Retrieval of Medical Images Miguel E. Ruiz 1 and Aurélie Névéol 2 1 University of North Texas, School of Library and Information Sciences P.O. Box 311068,
More informationTrans Media Relevance Feedback for Image Autoannotation
MENSINK et al.: TMRF FOR IMAGE AUTOANNOTATION 1 Trans Media Relevance Feedback for Image Autoannotation Thomas Mensink 12 thomas.mensink@xrce.xerox.com Jakob Verbeek 2 jakob.verbeek@inrialpes.fr Gabriela
More informationMIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion
MIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion Sara Lana-Serrano 1,3, Julio Villena-Román 2,3, José C. González-Cristóbal 1,3 1 Universidad Politécnica de Madrid 2 Universidad
More informationOverview of the ImageCLEFphoto 2008 Photographic Retrieval Task
Overview of the ImageCLEFphoto 2008 Photographic Retrieval Task Thomas Arni 1, Paul Clough 1, Mark Sanderson 1 and Michael Grubinger 2 1 Department of Information Studies, University of Sheffield, UK 2
More informationCACAO PROJECT AT THE 2009 TASK
CACAO PROJECT AT THE TEL@CLEF 2009 TASK Alessio Bosca, Luca Dini Celi s.r.l. - 10131 Torino - C. Moncalieri, 21 alessio.bosca, dini@celi.it Abstract This paper presents the participation of the CACAO prototype
More informationSemantic text features from small world graphs
Semantic text features from small world graphs Jurij Leskovec 1 and John Shawe-Taylor 2 1 Carnegie Mellon University, USA. Jozef Stefan Institute, Slovenia. jure@cs.cmu.edu 2 University of Southampton,UK
More informationIJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 2013 ISSN:
Semi Automatic Annotation Exploitation Similarity of Pics in i Personal Photo Albums P. Subashree Kasi Thangam 1 and R. Rosy Angel 2 1 Assistant Professor, Department of Computer Science Engineering College,
More informationWikipedia Retrieval Task ImageCLEF 2011
Wikipedia Retrieval Task ImageCLEF 2011 Theodora Tsikrika University of Applied Sciences Western Switzerland, Switzerland Jana Kludas University of Geneva, Switzerland Adrian Popescu CEA LIST, France Outline
More informationText-Based Clustering of the ImageCLEFphoto Collection for Augmenting the Retrieved Results
Text-Based Clustering of the ImageCLEFphoto Collection for Augmenting the Retrieved Results Osama El Demerdash, Leila Kosseim, and Sabine Bergler CLaC Laboratory - Department of Computer Science & Software
More informationDocument Expansion for Text-based Image Retrieval at CLEF 2009
Document Expansion for Text-based Image Retrieval at CLEF 2009 Jinming Min, Peter Wilkins, Johannes Leveling, and Gareth Jones Centre for Next Generation Localisation School of Computing, Dublin City University
More informationThe Visual Concept Detection Task in ImageCLEF 2008
The Visual Concept Detection Task in ImageCLEF 2008 Thomas Deselaers 1 and Allan Hanbury 2,3 1 RWTH Aachen University, Computer Science Department, Aachen, Germany deselaers@cs.rwth-aachen.de 2 PRIP, Inst.
More informationMultimodal Medical Image Retrieval based on Latent Topic Modeling
Multimodal Medical Image Retrieval based on Latent Topic Modeling Mandikal Vikram 15it217.vikram@nitk.edu.in Suhas BS 15it110.suhas@nitk.edu.in Aditya Anantharaman 15it201.aditya.a@nitk.edu.in Sowmya Kamath
More informationMetric Learning for Large Scale Image Classification:
Metric Learning for Large Scale Image Classification: Generalizing to New Classes at Near-Zero Cost Thomas Mensink 1,2 Jakob Verbeek 2 Florent Perronnin 1 Gabriela Csurka 1 1 TVPA - Xerox Research Centre
More informationUsing Maximum Entropy for Automatic Image Annotation
Using Maximum Entropy for Automatic Image Annotation Jiwoon Jeon and R. Manmatha Center for Intelligent Information Retrieval Computer Science Department University of Massachusetts Amherst Amherst, MA-01003.
More informationResPubliQA 2010
SZTAKI @ ResPubliQA 2010 David Mark Nemeskey Computer and Automation Research Institute, Hungarian Academy of Sciences, Budapest, Hungary (SZTAKI) Abstract. This paper summarizes the results of our first
More informationA Family of Contextual Measures of Similarity between Distributions with Application to Image Retrieval
A Family of Contextual Measures of Similarity between Distributions with Application to Image Retrieval Florent Perronnin, Yan Liu and Jean-Michel Renders Xerox Research Centre Europe (XRCE) Textual and
More informationAn Experimental Investigation into the Rank Function of the Heterogeneous Earliest Finish Time Scheduling Algorithm
An Experimental Investigation into the Rank Function of the Heterogeneous Earliest Finish Time Scheduling Algorithm Henan Zhao and Rizos Sakellariou Department of Computer Science, University of Manchester,
More informationOverview of the ImageCLEFphoto 2007 Photographic Retrieval Task
Overview of the ImageCLEFphoto 2007 Photographic Retrieval Task Michael Grubinger 1,PaulClough 2, Allan Hanbury 3, and Henning Müller 4,5 1 Victoria University, Melbourne, Australia 2 Sheffield University,
More informationThe University of Amsterdam at the CLEF 2008 Domain Specific Track
The University of Amsterdam at the CLEF 2008 Domain Specific Track Parsimonious Relevance and Concept Models Edgar Meij emeij@science.uva.nl ISLA, University of Amsterdam Maarten de Rijke mdr@science.uva.nl
More informationIPL at ImageCLEF 2010
IPL at ImageCLEF 2010 Alexandros Stougiannis, Anestis Gkanogiannis, and Theodore Kalamboukis Information Processing Laboratory Department of Informatics Athens University of Economics and Business 76 Patission
More informationDocument Structure Analysis in Associative Patent Retrieval
Document Structure Analysis in Associative Patent Retrieval Atsushi Fujii and Tetsuya Ishikawa Graduate School of Library, Information and Media Studies University of Tsukuba 1-2 Kasuga, Tsukuba, 305-8550,
More informationTEXT CHAPTER 5. W. Bruce Croft BACKGROUND
41 CHAPTER 5 TEXT W. Bruce Croft BACKGROUND Much of the information in digital library or digital information organization applications is in the form of text. Even when the application focuses on multimedia
More informationA New Measure of the Cluster Hypothesis
A New Measure of the Cluster Hypothesis Mark D. Smucker 1 and James Allan 2 1 Department of Management Sciences University of Waterloo 2 Center for Intelligent Information Retrieval Department of Computer
More informationA Text-Based Approach to the ImageCLEF 2010 Photo Annotation Task
A Text-Based Approach to the ImageCLEF 2010 Photo Annotation Task Wei Li, Jinming Min, Gareth J. F. Jones Center for Next Generation Localisation School of Computing, Dublin City University Dublin 9, Ireland
More informationA Formal Approach to Score Normalization for Meta-search
A Formal Approach to Score Normalization for Meta-search R. Manmatha and H. Sever Center for Intelligent Information Retrieval Computer Science Department University of Massachusetts Amherst, MA 01003
More informationSupervised Models for Multimodal Image Retrieval based on Visual, Semantic and Geographic Information
Supervised Models for Multimodal Image Retrieval based on Visual, Semantic and Geographic Information Duc-Tien Dang-Nguyen, Giulia Boato, Alessandro Moschitti, Francesco G.B. De Natale Department of Information
More informationAn Axiomatic Approach to IR UIUC TREC 2005 Robust Track Experiments
An Axiomatic Approach to IR UIUC TREC 2005 Robust Track Experiments Hui Fang ChengXiang Zhai Department of Computer Science University of Illinois at Urbana-Champaign Abstract In this paper, we report
More informationImageCLEF 2008: visual feature analysis in segmented images
SZTAKI @ ImageCLEF 2008: visual feature analysis in segmented images Bálint Daróczy Zsolt Fekete Mátyás Brendel Simon Rácz András Benczúr Dávid Siklósi Attila Pereszlényi Data Mining and Web search Research
More informationBossaNova at ImageCLEF 2012 Flickr Photo Annotation Task
BossaNova at ImageCLEF 2012 Flickr Photo Annotation Task S. Avila 1,2, N. Thome 1, M. Cord 1, E. Valle 3, and A. de A. Araújo 2 1 Pierre and Marie Curie University, UPMC-Sorbonne Universities, LIP6, France
More informationPatent Terminlogy Analysis: Passage Retrieval Experiments for the Intellecutal Property Track at CLEF
Patent Terminlogy Analysis: Passage Retrieval Experiments for the Intellecutal Property Track at CLEF Julia Jürgens, Sebastian Kastner, Christa Womser-Hacker, and Thomas Mandl University of Hildesheim,
More informationChapter 6: Information Retrieval and Web Search. An introduction
Chapter 6: Information Retrieval and Web Search An introduction Introduction n Text mining refers to data mining using text documents as data. n Most text mining tasks use Information Retrieval (IR) methods
More informationDUTH at ImageCLEF 2011 Wikipedia Retrieval
DUTH at ImageCLEF 2011 Wikipedia Retrieval Avi Arampatzis, Konstantinos Zagoris, and Savvas A. Chatzichristofis Department of Electrical and Computer Engineering, Democritus University of Thrace, Xanthi
More informationInformation Retrieval. (M&S Ch 15)
Information Retrieval (M&S Ch 15) 1 Retrieval Models A retrieval model specifies the details of: Document representation Query representation Retrieval function Determines a notion of relevance. Notion
More informationMedical Image Annotation in ImageCLEF 2008
Medical Image Annotation in ImageCLEF 2008 Thomas Deselaers 1 and Thomas M. Deserno 2 1 RWTH Aachen University, Computer Science Department, Aachen, Germany 2 RWTH Aachen University, Dept. of Medical Informatics,
More informationMetric Learning for Large-Scale Image Classification:
Metric Learning for Large-Scale Image Classification: Generalizing to New Classes at Near-Zero Cost Florent Perronnin 1 work published at ECCV 2012 with: Thomas Mensink 1,2 Jakob Verbeek 2 Gabriela Csurka
More informationVisoLink: A User-Centric Social Relationship Mining
VisoLink: A User-Centric Social Relationship Mining Lisa Fan and Botang Li Department of Computer Science, University of Regina Regina, Saskatchewan S4S 0A2 Canada {fan, li269}@cs.uregina.ca Abstract.
More informationMIRS: Text Based and Content Based Image Retrieval Trupti S. Atre 1, K.V.Metre 2
MIRS: Text Based and Content Based Image Retrieval Trupti S. Atre 1, K.V.Metre 2 Abstract The main goal of this paper is to show multimedia information retrieval task using the combination of textual pre-filtering
More informationImproving Information Retrieval Effectiveness in Peer-to-Peer Networks through Query Piggybacking
Improving Information Retrieval Effectiveness in Peer-to-Peer Networks through Query Piggybacking Emanuele Di Buccio, Ivano Masiero, and Massimo Melucci Department of Information Engineering, University
More informationThe Stanford/Technicolor/Fraunhofer HHI Video Semantic Indexing System
The Stanford/Technicolor/Fraunhofer HHI Video Semantic Indexing System Our first participation on the TRECVID workshop A. F. de Araujo 1, F. Silveira 2, H. Lakshman 3, J. Zepeda 2, A. Sheth 2, P. Perez
More informationBUAA AUDR at ImageCLEF 2012 Photo Annotation Task
BUAA AUDR at ImageCLEF 2012 Photo Annotation Task Lei Huang, Yang Liu State Key Laboratory of Software Development Enviroment, Beihang University, 100191 Beijing, China huanglei@nlsde.buaa.edu.cn liuyang@nlsde.buaa.edu.cn
More informationCLEF-IP 2009: Exploring Standard IR Techniques on Patent Retrieval
DCU @ CLEF-IP 2009: Exploring Standard IR Techniques on Patent Retrieval Walid Magdy, Johannes Leveling, Gareth J.F. Jones Centre for Next Generation Localization School of Computing Dublin City University,
More informationImage Query Expansion Using Semantic Selectional Restrictions
Image Query Expansion Using Semantic Selectional Restrictions Osama El Demerdash, Sabine Bergler and Leila Kosseim Concordia University {osama el,bergler,kosseim}@cse.concordia.ca CLaC Laboratory - Department
More informationImproving Recognition through Object Sub-categorization
Improving Recognition through Object Sub-categorization Al Mansur and Yoshinori Kuno Graduate School of Science and Engineering, Saitama University, 255 Shimo-Okubo, Sakura-ku, Saitama-shi, Saitama 338-8570,
More informationA fully-automatic approach to answer geographic queries: GIRSA-WP at GikiP
A fully-automatic approach to answer geographic queries: at GikiP Johannes Leveling Sven Hartrumpf Intelligent Information and Communication Systems (IICS) University of Hagen (FernUniversität in Hagen)
More informationMedical Image Annotation in ImageCLEF 2008
Medical Image Annotation in ImageCLEF 2008 Thomas Deselaers 1 and Thomas M. Deserno 2 1 RWTH Aachen University, Computer Science Department, Aachen, Germany deselaers@cs.rwth-aachen.de 2 RWTH Aachen University,
More informationCADIAL Search Engine at INEX
CADIAL Search Engine at INEX Jure Mijić 1, Marie-Francine Moens 2, and Bojana Dalbelo Bašić 1 1 Faculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, 10000 Zagreb, Croatia {jure.mijic,bojana.dalbelo}@fer.hr
More informationImage retrieval based on bag of images
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2009 Image retrieval based on bag of images Jun Zhang University of Wollongong
More informationOverview of iclef 2008: search log analysis for Multilingual Image Retrieval
Overview of iclef 2008: search log analysis for Multilingual Image Retrieval Julio Gonzalo Paul Clough Jussi Karlgren UNED U. Sheffield SICS Spain United Kingdom Sweden julio@lsi.uned.es p.d.clough@sheffield.ac.uk
More informationTHE preceding chapters were all devoted to the analysis of images and signals which
Chapter 5 Segmentation of Color, Texture, and Orientation Images THE preceding chapters were all devoted to the analysis of images and signals which take values in IR. It is often necessary, however, to
More informationAn Investigation of Basic Retrieval Models for the Dynamic Domain Task
An Investigation of Basic Retrieval Models for the Dynamic Domain Task Razieh Rahimi and Grace Hui Yang Department of Computer Science, Georgetown University rr1042@georgetown.edu, huiyang@cs.georgetown.edu
More informationWeb Information Retrieval using WordNet
Web Information Retrieval using WordNet Jyotsna Gharat Asst. Professor, Xavier Institute of Engineering, Mumbai, India Jayant Gadge Asst. Professor, Thadomal Shahani Engineering College Mumbai, India ABSTRACT
More informationExternal Query Reformulation for Text-based Image Retrieval
External Query Reformulation for Text-based Image Retrieval Jinming Min and Gareth J. F. Jones Centre for Next Generation Localisation School of Computing, Dublin City University Dublin 9, Ireland {jmin,gjones}@computing.dcu.ie
More informationRanking Web Pages by Associating Keywords with Locations
Ranking Web Pages by Associating Keywords with Locations Peiquan Jin, Xiaoxiang Zhang, Qingqing Zhang, Sheng Lin, and Lihua Yue University of Science and Technology of China, 230027, Hefei, China jpq@ustc.edu.cn
More informationAlberto Messina, Maurizio Montagnuolo
A Generalised Cross-Modal Clustering Method Applied to Multimedia News Semantic Indexing and Retrieval Alberto Messina, Maurizio Montagnuolo RAI Centre for Research and Technological Innovation Madrid,
More informationGoNTogle: A Tool for Semantic Annotation and Search
GoNTogle: A Tool for Semantic Annotation and Search Giorgos Giannopoulos 1,2, Nikos Bikakis 1,2, Theodore Dalamagas 2, and Timos Sellis 1,2 1 KDBSL Lab, School of ECE, Nat. Tech. Univ. of Athens, Greece
More informationMSRA Columbus at GeoCLEF2007
MSRA Columbus at GeoCLEF2007 Zhisheng Li 1, Chong Wang 2, Xing Xie 2, Wei-Ying Ma 2 1 Department of Computer Science, University of Sci. & Tech. of China, Hefei, Anhui, 230026, P.R. China zsli@mail.ustc.edu.cn
More informationMSRA Columbus at GeoCLEF 2006
MSRA Columbus at GeoCLEF 2006 Zhisheng Li, Chong Wang 2, Xing Xie 2, Wei-Ying Ma 2 Department of Computer Science, University of Sci. & Tech. of China, Hefei, Anhui, 230026, P.R. China zsli@mail.ustc.edu.cn
More informationModeling Spatial Relations for Image Retrieval by Conceptual Graphs
Modeling Spatial Relations for Image Retrieval by Conceptual Graphs Carlos Hernández-Gracidas, L. Enrique Sucar and Manuel Montes-y-Gómez Computer Science Department National Institute of Astrophysics,
More informationInfluence of Word Normalization on Text Classification
Influence of Word Normalization on Text Classification Michal Toman a, Roman Tesar a and Karel Jezek a a University of West Bohemia, Faculty of Applied Sciences, Plzen, Czech Republic In this paper we
More informationImproved Fisher Vector for Large Scale Image Classification XRCE's participation for ILSVRC
Improved Fisher Vector for Large Scale Image Classification XRCE's participation for ILSVRC Jorge Sánchez, Florent Perronnin and Thomas Mensink Xerox Research Centre Europe (XRCE) Overview Fisher Vector
More informationWord Embedding for Social Book Suggestion
Word Embedding for Social Book Suggestion Nawal Ould-Amer 1, Philippe Mulhem 1, Mathias Géry 2, and Karam Abdulahhad 1 1 Univ. Grenoble Alpes, LIG, F-38000 Grenoble, France CNRS, LIG, F-38000 Grenoble,
More informationClustering for Text and Image-Based Photo Retrieval at CLEF 2009
Clustering for ext and mage-based Photo Retrieval at CLEF 2009 Qian Zhu and Diana nkpen School of nformation echnology and Engineering University of Ottawa qzhu012@uottawa.ca, diana@site.uottawa.ca Abstract.
More informationEntity-Based Information Retrieval System: A Graph-Based Document and Query Model
Entity-Based Search Entity-Based Information Retrieval System: A Graph-Based Document and Query Model Mohannad ALMASRI 1, Jean-Pierre Chevallet 2, Catherine Berrut 1 1 UNIVSERSITÉ JOSEPH FOURIER - GRENOBLE
More informationCross-Language Evaluation Forum - CLEF
Cross-Language Evaluation Forum - CLEF Carol Peters IEI-CNR, Pisa, Italy IST-2000-31002 Kick-off: October 2001 Outline Project Objectives Background CLIR System Evaluation CLEF Infrastructure Results so
More informationRobust Relevance-Based Language Models
Robust Relevance-Based Language Models Xiaoyan Li Department of Computer Science, Mount Holyoke College 50 College Street, South Hadley, MA 01075, USA Email: xli@mtholyoke.edu ABSTRACT We propose a new
More informationUniversity of Santiago de Compostela at CLEF-IP09
University of Santiago de Compostela at CLEF-IP9 José Carlos Toucedo, David E. Losada Grupo de Sistemas Inteligentes Dept. Electrónica y Computación Universidad de Santiago de Compostela, Spain {josecarlos.toucedo,david.losada}@usc.es
More informationConceptual document indexing using a large scale semantic dictionary providing a concept hierarchy
Conceptual document indexing using a large scale semantic dictionary providing a concept hierarchy Martin Rajman, Pierre Andrews, María del Mar Pérez Almenta, and Florian Seydoux Artificial Intelligence
More informationIMPROVING INFORMATION RETRIEVAL BASED ON QUERY CLASSIFICATION ALGORITHM
IMPROVING INFORMATION RETRIEVAL BASED ON QUERY CLASSIFICATION ALGORITHM Myomyo Thannaing 1, Ayenandar Hlaing 2 1,2 University of Technology (Yadanarpon Cyber City), near Pyin Oo Lwin, Myanmar ABSTRACT
More informationQuery Likelihood with Negative Query Generation
Query Likelihood with Negative Query Generation Yuanhua Lv Department of Computer Science University of Illinois at Urbana-Champaign Urbana, IL 61801 ylv2@uiuc.edu ChengXiang Zhai Department of Computer
More informationUniversity of Alicante at NTCIR-9 GeoTime
University of Alicante at NTCIR-9 GeoTime Fernando S. Peregrino fsperegrino@dlsi.ua.es David Tomás dtomas@dlsi.ua.es Department of Software and Computing Systems University of Alicante Carretera San Vicente
More informationTagProp: Discriminative Metric Learning in Nearest Neighbor Models for Image Annotation
TagProp: Discriminative Metric Learning in Nearest Neighbor Models for Image Annotation Matthieu Guillaumin, Thomas Mensink, Jakob Verbeek, Cordelia Schmid LEAR team, INRIA Rhône-Alpes, Grenoble, France
More informationA Survey on Multimedia Information Retrieval System
A Survey on Multimedia Information Retrieval System Pooja Shinde 1, Prof. J. V. Shinde 2 1 M.E Student, Kalyani Charitable Trust s Late G.N. Sapkal College of Engineering 2 Professor, Kalyani Charitable
More informationExploiting Index Pruning Methods for Clustering XML Collections
Exploiting Index Pruning Methods for Clustering XML Collections Ismail Sengor Altingovde, Duygu Atilgan and Özgür Ulusoy Department of Computer Engineering, Bilkent University, Ankara, Turkey {ismaila,
More informationMultimodal information approaches for the Wikipedia collection at ImageCLEF 2011
Multimodal information approaches for the Wikipedia collection at ImageCLEF 2011 R. Granados 1, J. Benavent 2, X. Benavent 2, E. de Ves 2, Ana García-Serrano 1 1 Universidad Nacional de Educación a Distancia,
More informationTotal Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval O. Chum, et al.
Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval O. Chum, et al. Presented by Brandon Smith Computer Vision Fall 2007 Objective Given a query image of an object,
More informationComputing Similarity between Cultural Heritage Items using Multimodal Features
Computing Similarity between Cultural Heritage Items using Multimodal Features Nikolaos Aletras and Mark Stevenson Department of Computer Science, University of Sheffield Could the combination of textual
More informationTREC 2016 Dynamic Domain Track: Exploiting Passage Representation for Retrieval and Relevance Feedback
RMIT @ TREC 2016 Dynamic Domain Track: Exploiting Passage Representation for Retrieval and Relevance Feedback Ameer Albahem ameer.albahem@rmit.edu.au Lawrence Cavedon lawrence.cavedon@rmit.edu.au Damiano
More informationInformativeness for Adhoc IR Evaluation:
Informativeness for Adhoc IR Evaluation: A measure that prevents assessing individual documents Romain Deveaud 1, Véronique Moriceau 2, Josiane Mothe 3, and Eric SanJuan 1 1 LIA, Univ. Avignon, France,
More informationOverview of the ImageCLEFphoto 2007 photographic retrieval task
Overview of the ImageCLEFphoto 2007 photographic retrieval task Michael Grubinger 1, Paul Clough 2, Allan Hanbury 3, Henning Müller 4 1 Victoria University, Melbourne, Australia 2 Sheffield University,
More informationDynamic Two-Stage Image Retrieval from Large Multimodal Databases
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases Avi Arampatzis, Konstantinos Zagoris, and Savvas A. Chatzichristofis Department of Electrical and Computer Engineering, Democritus University
More informationCMPSCI 646, Information Retrieval (Fall 2003)
CMPSCI 646, Information Retrieval (Fall 2003) Midterm exam solutions Problem CO (compression) 1. The problem of text classification can be described as follows. Given a set of classes, C = {C i }, where
More informationPrior Art Retrieval Using Various Patent Document Fields Contents
Prior Art Retrieval Using Various Patent Document Fields Contents Metti Zakaria Wanagiri and Mirna Adriani Fakultas Ilmu Komputer, Universitas Indonesia Depok 16424, Indonesia metti.zakaria@ui.edu, mirna@cs.ui.ac.id
More informationMICC-UNIFI at ImageCLEF 2013 Scalable Concept Image Annotation
MICC-UNIFI at ImageCLEF 2013 Scalable Concept Image Annotation Tiberio Uricchio, Marco Bertini, Lamberto Ballan, and Alberto Del Bimbo Media Integration and Communication Center (MICC) Università degli
More informationMulti-Modal Interactive Approach to ImageCLEF 2007 Photographic and Medical Retrieval Tasks by CINDI
Multi-Modal Interactive Approach to ImageCLEF 2007 Photographic and Medical Retrieval Tasks by CINDI M. M. Rahman, Bipin C. Desai, Prabir Bhattacharya Dept. of Computer Science & Software Engineering,
More informationUniversity of Amsterdam at INEX 2010: Ad hoc and Book Tracks
University of Amsterdam at INEX 2010: Ad hoc and Book Tracks Jaap Kamps 1,2 and Marijn Koolen 1 1 Archives and Information Studies, Faculty of Humanities, University of Amsterdam 2 ISLA, Faculty of Science,
More informationPredicting Popular Xbox games based on Search Queries of Users
1 Predicting Popular Xbox games based on Search Queries of Users Chinmoy Mandayam and Saahil Shenoy I. INTRODUCTION This project is based on a completed Kaggle competition. Our goal is to predict which
More informationA novel supervised learning algorithm and its use for Spam Detection in Social Bookmarking Systems
A novel supervised learning algorithm and its use for Spam Detection in Social Bookmarking Systems Anestis Gkanogiannis and Theodore Kalamboukis Department of Informatics Athens University of Economics
More informationLab for Media Search, National University of Singapore 1
1 2 Word2Image: Towards Visual Interpretation of Words Haojie Li Introduction Motivation A picture is worth 1000 words Traditional dictionary Containing word entries accompanied by photos or drawing to
More informationEvaluation Methods for Focused Crawling
Evaluation Methods for Focused Crawling Andrea Passerini, Paolo Frasconi, and Giovanni Soda DSI, University of Florence, ITALY {passerini,paolo,giovanni}@dsi.ing.unifi.it Abstract. The exponential growth
More informationEFFICIENT INTEGRATION OF SEMANTIC TECHNOLOGIES FOR PROFESSIONAL IMAGE ANNOTATION AND SEARCH
EFFICIENT INTEGRATION OF SEMANTIC TECHNOLOGIES FOR PROFESSIONAL IMAGE ANNOTATION AND SEARCH Andreas Walter FZI Forschungszentrum Informatik, Haid-und-Neu-Straße 10-14, 76131 Karlsruhe, Germany, awalter@fzi.de
More informationFrom Passages into Elements in XML Retrieval
From Passages into Elements in XML Retrieval Kelly Y. Itakura David R. Cheriton School of Computer Science, University of Waterloo 200 Univ. Ave. W. Waterloo, ON, Canada yitakura@cs.uwaterloo.ca Charles
More informationTag Based Image Search by Social Re-ranking
Tag Based Image Search by Social Re-ranking Vilas Dilip Mane, Prof.Nilesh P. Sable Student, Department of Computer Engineering, Imperial College of Engineering & Research, Wagholi, Pune, Savitribai Phule
More informationCS473: Course Review CS-473. Luo Si Department of Computer Science Purdue University
CS473: CS-473 Course Review Luo Si Department of Computer Science Purdue University Basic Concepts of IR: Outline Basic Concepts of Information Retrieval: Task definition of Ad-hoc IR Terminologies and
More informationAn Edge-Based Approach to Motion Detection*
An Edge-Based Approach to Motion Detection* Angel D. Sappa and Fadi Dornaika Computer Vison Center Edifici O Campus UAB 08193 Barcelona, Spain {sappa, dornaika}@cvc.uab.es Abstract. This paper presents
More informationAutomatically Generating Queries for Prior Art Search
Automatically Generating Queries for Prior Art Search Erik Graf, Leif Azzopardi, Keith van Rijsbergen University of Glasgow {graf,leif,keith}@dcs.gla.ac.uk Abstract This report outlines our participation
More information