Using an Image-Text Parallel Corpus and the Web for Query Expansion in Cross-Language Image Retrieval

Similar documents
Experiment for Using Web Information to do Query and Document Expansion

Evaluation of Automatically Assigned MeSH Terms for Retrieval of Medical Images

A fully-automatic approach to answer geographic queries: GIRSA-WP at GikiP

Trans-Media Pseudo-Relevance Feedback Methods in Multimedia Retrieval

Overview of the ImageCLEFphoto 2007 Photographic Retrieval Task

Clustering for Text and Image-Based Photo Retrieval at CLEF 2009

Image Query Expansion Using Semantic Selectional Restrictions

Using XML Logical Structure to Retrieve (Multimedia) Objects

Medical Image Annotation in ImageCLEF 2008

The CLEF 2005 Cross-Language Image Retrieval Track

MIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion

Overview of the ImageCLEF 2006 Photographic Retrieval and Object Annotation Tasks

CLEF-IP 2009: Exploring Standard IR Techniques on Patent Retrieval

NTUBROWS System for NTCIR-7. Information Retrieval for Question Answering

A Text-Based Approach to the ImageCLEF 2010 Photo Annotation Task

krones Academy - media suite User guide

External Query Reformulation for Text-based Image Retrieval

Wikipedia Retrieval Task ImageCLEF 2011

Applying the KISS Principle for the CLEF- IP 2010 Prior Art Candidate Patent Search Task

Multilingual Image Search from a user s perspective

QUICK REFERENCE GUIDE: SHELL SUPPLIER PROFILE QUESTIONNAIRE (SPQ)

University of Santiago de Compostela at CLEF-IP09

Overview of the ImageCLEFphoto 2008 Photographic Retrieval Task

Understanding the Query: THCIB and THUIS at NTCIR-10 Intent Task. Junjun Wang 2013/4/22

A Practical Passage-based Approach for Chinese Document Retrieval

Patent Terminlogy Analysis: Passage Retrieval Experiments for the Intellecutal Property Track at CLEF

Searching and Organizing Images Across Languages

Integrating textual and visual information for cross-language image retrieval: A trans-media dictionary approach

Comment Extraction from Blog Posts and Its Applications to Opinion Mining

LARGE-VOCABULARY CHINESE TEXT/SPEECH INFORMATION RETRIEVAL USING MANDARIN SPEECH QUERIES

Run on Earth User Guide

Multi-Modal Interactive Approach to ImageCLEF 2007 Photographic and Medical Retrieval Tasks by CINDI

ImageCLEF 2008: visual feature analysis in segmented images

Semantic Annotation of Web Resources Using IdentityRank and Wikipedia

Google Search Appliance

Query Expansion from Wikipedia and Topic Web Crawler on CLIR

CACAO PROJECT AT THE 2009 TASK

MSRA Columbus at GeoCLEF 2006

From CLIR to CLIE: Some Lessons in NTCIR Evaluation

ImageCLEF 2008 Bálint Daróczy

Evaluating a Conceptual Indexing Method by Utilizing WordNet

Content-Based Image Retrieval with LIRe and SURF on a Smartphone-Based Product Image Database

Cross Lingual Question Answering using CINDI_QA for 2007

Document Expansion for Text-based Image Retrieval at CLEF 2009

Cross-Language Retrieval with Wikipedia

Open Archive Toulouse Archive Ouverte

Experiments with Automatic Query Formulation in the Extended Boolean Model

WikiTranslate: Query Translation for Cross-Lingual Information Retrieval Using Only Wikipedia

Medical Image Annotation in ImageCLEF 2008

Cross-Language Evaluation Forum - CLEF

BUILDING CORPORA OF TRANSCRIBED SPEECH FROM OPEN ACCESS SOURCES

Overview of the ImageCLEFphoto 2007 photographic retrieval task

Overview of WebCLEF 2007

Evaluation and image retrieval

Document Structure Analysis in Associative Patent Retrieval

SIMATIC. Industrial PC Microsoft Windows 7. Safety instructions 1. Initial startup: Commissioning the operating. system

TCD-DCU at 2009: Document Expansion, Query Translation and Language Modeling

IPL at ImageCLEF 2010

MetaMoJi Share for Business Ver. 2 MetaMoJi Note for Business Ver. 2 Installation and Operation Guide

IPAL at CLEF 2008: Mixed-Modality based Image Search, Novelty based Re-ranking and Extended Matching

FAO TERM PORTAL User s Guide

GoNTogle: A Tool for Semantic Annotation and Search

ResPubliQA 2010

HUKB at NTCIR-12 IMine-2 task: Utilization of Query Analysis Results and Wikipedia Data for Subtopic Mining

Large-Scale Interactive Evaluation of Multilingual Information Access Systems the iclef Flickr Challenge

indexing and query processing. The inverted le was constructed for the retrieval target collection which contains full texts of two years' Japanese pa

CIRGDISCO at RepLab2012 Filtering Task: A Two-Pass Approach for Company Name Disambiguation in Tweets

CriES 2010

PKU at ImageCLEF 2008: Experiments with Query Extension Techniques. for Text-Base and Content-Based Image Retrieval

Task 3 Patient-Centred Information Retrieval: Team CUNI

3 Publishing Technique

Overview of the CLEF 2010 medical image retrieval track

Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach

Ranking Web Pages by Associating Keywords with Locations

DUTH at ImageCLEF 2011 Wikipedia Retrieval

User Experiments with the Eurovision Cross-Language Image Retrieval System. Paul Clough and Mark Sanderson. University of Sheffield. Sheffield, UK.

MetaMoJi Share for Business Ver. 3 MetaMoJi Note for Business Ver. 3 Administrator s Guide

SIMATIC. Industrial PC Microsoft Windows 7 (USB stick) Safety instructions 1. Initial startup: Commissioning the operating system

Overview of iclef 2008: search log analysis for Multilingual Image Retrieval

STRUCTURE-BASED QUERY EXPANSION FOR XML SEARCH ENGINE

TEXT CHAPTER 5. W. Bruce Croft BACKGROUND

University of Amsterdam at INEX 2010: Ad hoc and Book Tracks

Overview of the wikipediamm task at ImageCLEF 2008

Prior Art Retrieval Using Various Patent Document Fields Contents

The Wikipedia XML Corpus

Oracle Policy Automation Release Notes

Task3 Patient-Centred Information Retrieval: Team CUNI

INEX REPORT. Report on INEX 2012

Linking Entities in Chinese Queries to Knowledge Graph

Predicting Next Search Actions with Search Engine Query Logs

Time-Surfer: Time-Based Graphical Access to Document Content

Table of Contents. Welcome to Immobel. Below you will find documentation regarding our products and services.

Global Training Catalog

The University of Évora s Participation in

Profiling Web Archive Coverage for Top-Level Domain & Content Language

Building Simulated Queries for Known-Item Topics

led to different techniques for cross-language retrieval, ones which utilized the power of human indexing of documents to improve retrieval via bi-lin

Entity Linking at TAC Task Description

Annotating Spatio-Temporal Information in Documents

CroLOM: Cross-Lingual Ontology Matching System

Transcription:

Using an Image-Text Parallel Corpus and the Web for Query Expansion in Cross-Language Image Retrieval Yih-Chen Chang and Hsin-Hsi Chen * Department of Computer Science and Information Engineering National Taiwan University Taipei, Taiwan ycchang@nlg.csie.ntu.edu.tw, hhchen@csie.ntu.edu.tw Abstract. ImageCLEF2007 photo task is different from those of the previous years in two aspects. The caption field in the image annotations and the narrative field in the text queries are removed, and the example images in the visual queries are also removed from the image collection. In the new definition, the information that can be employed is less than before. Thus matching query words and annotations directly is not feasible. This paper explores the web to expand queries and documents. The experiments show that query expansion improves the performance 16.11%, however, document expansion brings in too much noise and the performance decreases 28.24%. The media mapping method based on an image-text parallel corpus is regarded as query expansion. The results of the formal runs show this method performs the best. Compared with the performance of the models without expansion, the MAP improves about 86.69%~143.12%. Integration of the external and the internal resources gains no benefits in the further experiments. 1 Introduction Image retrieval becomes more important since large scale image data are available on the web. In ImageCLEFphoto task, each topic, which is composed of a text query and a visual query, simulates the information need of users. In the previous years [1] [2], a text query includes topic and narrative fields in several different languages and a visual query includes two or three example images in the image data set. Each image in the image collection is annotated with title, location, date, notes and a detail caption. The task definitions of ImageCLEFphoto2007 [3] are changed in two aspects. First, the caption field in the image annotations and the narrative field in the queries are removed. These changes aim to reflect the information needs of the real world, i.e., image annotations and queries are usually short and rough. Second, the example images of the visual queries do not belong to the image collection. This change reflects that users may use their own photos as examples rather than images in the collection. When the caption field in the image annotation is removed, matching query words and image annotations becomes more challenging than before. We will explore an external resource, e.g., the web, to expand queries and documents. Through a web * Corresponding author. C. Peters et al. (Eds.): CLEF 2007, LNCS 5152, pp. 504 511, 2008. Springer-Verlag Berlin Heidelberg 2008

Using an Image-Text Parallel Corpus and the Web for Query Expansion 505 search engine such as Google, we can retrieve relevant web pages, and use them for expansion. Compared with query expansion of using pseudo relevance feedback in the corpus, the outside resource may bring in information that the target corpus may not have. However, the information retrieved from the web may contain noise at the same time. How to filter out noise is an important issue. In this paper, we restrict the search space to some kinds of web sites, e.g., Wikipedia, and investigate if it is helpful for retrieval. In addition to external resources, we employ internal resources such as an imagetext parallel corpus, i.e., the target collection itself. Under such a trans-media corpus, two approaches say, media mapping [4] and a trans-media dictionary [5] were proposed before. Media mapping approach, which can be regarded as a kind of pseudo relevance feedback across different media, has better performance than trans-media dictionary approach. We will employ media mapping to ImageCLEFphoto 2007, examine its performance in the new definitions, and analyze if the integration of external and internal resources is helpful. This paper is organized as follows. Section 2 introduces the three methods we explore. Section 3 specifies official and unofficial runs we design. Section 4 shows and discusses the experiment results. Finally, we conclude the remarks in Section 5. 2 Methods Three methods including query (document) expansion using the web and query expansion with media mapping via a cross-media corpus are presented in the following subsections. 2.1 Query Expansion Using the Web Queries and image annotations are both short in ImageCLEFphoto 2007. In this situation, we plan to expand the given queries and get more information. Several previous experiments have shown that query expansion using pseudo relevance feedback is very useful in this task. In this paper, we employ outside resource like the web for query expansion and analyze if it can bring in useful information. The best way to access the web is through a web search engine like Google. We submit a text query to retrieve relevant web pages. Because the language of a text query may be different from the language of image annotations, we have to introduce the language translation mechanism. There are two alternative ways to deal with this problem. The first is to submit a text query to the web search engine directly and then to translate the retrieved web pages into target language. The second is to translate a text query into target language and then to submit the translated query to the web search engine. The drawback of the first approach (i.e., translation after retrieval) is the cost to translate all the web pages we get. In contrast, the translation cost of the second approach (i.e., translation before retrieval) is relatively low. However, when there are named entities in queries, the second approach may get wrong translation and thus the retrieved web pages may be unrelated to the original query. In the experiments, we adopt the approach of translation before retrieval. Next, we have to select words from the retrieved results to expand the given query. The selection mechanism can filter out noisy information, but it may also lose some useful

506 Y.-C. Chang and H.-H. Chen information. Here, we adopt the simplest way, i.e., to employ the top ranked snippets to expand our query. For the issue of noise, we limit the websites we access to encyclopedia-based ones, e.g., Wikipedia, by adding a web site name as an extra query term when submitting a query to a web search engine. 2.2 Document Expansion Using the Web Direct keyword matching may not be workable after query expansion if the relevant documents do not mention the words in the expanded query. There are two alternative ways to deal with this problem. First, we can expand a query with the words appearing in a document. Query expansion using relevance feedback belongs to this type. Second, we may expand the documents. In this paper, we explore the document expansion using the web. This method is similar to the one used in query expansion. We consider the title field of an image annotation as a query, and submit it to the web search engine to get the top ranked snippets to expand documents. Because documents are in target language, language translation is not necessary during document expansion. That is the major difference from query expansion. Although document expansion avoids translation errors, expanding too many words may introduce noise. In document expansion, we restrict the selection scope as follows. Only those words nearby the words in the title field of image annotations are considered as candidates. We set a window size (e.g., 5) in the experiments. Besides the above noise issue, document expansion has a logical problem. Assume the word animal is in title field of an image annotation. When we expand hyponym words such as tiger, cat, dog, etc., we do not know which animals are actually mentioned in that image. If the image talks about rabbit, the wrong expansion may introduce erroneous terms. 2.3 Media Mapping with an Image-Text Parallel Corpus Media mapping method [4][6] regards the target collection as an image-text parallel corpus, and employs such an intermedia to translate a visual query into a text one, and vise versa. The intermedia link two kinds of media (i.e., text and image) in this paper. Media mapping method can be seen as relevance feedback across different media and used in query expansion. In ImageCLEFphoto2006, we created a new query using media mapping and merged the results of the new query and the original visual query. In ImageCLEFphoto 2007, we regard the media mapping as query expansion in the following way. First, we submit a visual query to a content-based information retrieval (CBIR) system. Because images and the corresponding text annotations are parallelized in the collection, we then rerank the top n returned images by using a text query. Finally, the text annotations of the top m images are added to the original text query. We submit the expanded query to a text retrieval system and get the final result. We can compare the results of query expansion using the web and the media mapping with intermedia. In addition, we can examine the feasibility of the media mapping method in the new task definitions. There are two new challenges. First, retrieving the related images in the intermedia via visual query becomes harder. In the past, the visual queries are images in the image collection, so that they always appear

Using an Image-Text Parallel Corpus and the Web for Query Expansion 507 in the top of the returned images. Second, the caption field in an image annotation has been removed beforehand, so that the text information we can get from the image counterparts is less than the one in the tasks of previous years. We are interested in if the media mapping method is still workable in the new environment. 3 Experiments We submitted 27 official runs including 18 cross-lingual runs for eight different languages, 8 mono-lingual runs for three different languages, and 1 run for visual query only. All the queries with different source languages were translated into target language (e.g., English) using SYSTRAN system. We adopted Okapi with BM25 formula for text retrieval. The experiments consider the following issues. First, we want to check if the retrieved web pages can bring in new information for query expansion. We examine the expanded words when the recall is improved. Second, we compare the results of query expansion runs that limit or do not limit the web sites. Third, we want to check the effects that document expansion achieves. The runs using both query expansion and document expansion are also checked. Fourth, we examine the performance of media mapping method. Then, we employ both media mapping and the web for query expansion, and check if the web can bring in new information that media mapping cannot do. Some of the above issues are verified in the official runs, while some are done in the unofficial runs. Our official runs are described as follows. A run is named by the format Source- Language-TargetLanguage-Automatic-FeedBack-Media, where DE (German), ES (Spanish), EN (English), FR (French), JP (Japanese), RU (Russian), ZHT (Traditional Chinese), ZHS (Simplified Chinese), AUTO (Automatic), NOFB (No Feedback), TE (Document Expansion), FBQE (Feedback and Query Expansion), TXT (Text), IMG (Image), and TXTIMG (Text and Image). 1. 8 cross-lingual runs that use text query only, and do not consider query expansion: ES-EN-AUTO-NOFB-TXT, FR-EN-AUTO-NOFB-TXT, RU-EN-AUTO-NOFB-TXT, PT-EN-AUTO-NOFB-TXT, JA-EN-AUTO-NOFB-TXT, IT-EN-AUTO-NOFB-TXT, ZHT-EN-AUTO-NOFB-TXT, ZHS-EN-AUTO-NOFB-TXT 2. 3 mono-lingual runs that use text query only, and do not consider query expansion: EN-EN-AUTO-NOFB-TXT, ES-ES-AUTO-NOFB-TXT, DE-DE-AUTO-NOFB-TXT 3. 3 mono-lingual runs that adopt the media mapping method for query expansion: ES-ES-AUTO-FBQE-TXTIMG, EN-EN-AUTO-FBQE-TXTIMG, DE-DE-AUTO-FBQE-TXTIMG 4. 8 cross-lingual runs that use the media mapping method for query expansion: PT-EN-AUTO-FBQE-TXTIMG, ES-EN-AUTO-FBQE-TXTIMG, RU-EN-AUTO-FBQE-TXTIMG, IT-EN-AUTO-FBQE-TXTIMG, ZHT-EN-AUTO-FBQE-TXTIMG, ZHS-EN-AUTO-FBQE-TXTIMG, JA-EN-AUTO-FBQE-TXTIMG, FR-EN-AUTO-FBQE-TXTIMG 5. 2 runs that expand query with the web, but do not consider document expansion: EN-EN-AUTO-QE-TXT-TOPIC, ZHT-EN-AUTO-QE-TXT-TOPIC 6. 2 runs that expand document with the web, but do not consider query expansion: EN-EN-AUTO-TE-TXT-CAPTION, ZHT-EN-AUTO-TE-TXT-CAPTION

508 Y.-C. Chang and H.-H. Chen 7. 1 run that use visual query and the media mapping method only: IMG-EN-AUTO-FB-TXTIMG Some unofficial runs are described as follows. 1. 2 runs that consider both query expansion and document expansion: EN-EN-AUTO-TE-QE-TXT, ZHT-EN-AUTO-TE-QE-TXT 2. 2 runs that expand query with the web and limit the search space: EN-EN-AUTO-QE-WIKI-TXT, ZHT-EN-AUTO-QE-WIKI-TXT 3. 2 runs that use both the web and the media mapping for query expansion: EN-EN-AUTO-QE-FBQE-TXTIMG, ZHT-EN-AUTO-QE-FBQE-TXTIMG. 4 Results and Discussions In the first set of experiments, we use the top one snippet returned by Google to expand the text queries. Table 1 shows the results of runs EN-EN-AUTO-QE-TXT-TOPIC, ZHT-EN-AUTO-QE-TXT-TOPIC, EN-EN-AUTO-NOFB-TXT, and ZHT-EN-AUTO- NOFB-TXT. In both cross-lingual and mono-lingual cases, the performance of systems with query expansion is better than that without query expansion. After expansion, both recall and precision are improved. In the original expectation, precision is decreased since we do not apply any strategies to filter noise. Table 1. Results of models with/without query expansion Query Language- Document Language Evaluation Metric Query Expansion Using the Web Query Without Expansion Traditional Chinese- MAP 0.1225 (+16.11 %) 0.1055 English Recall 0.4461 (+18.14 %) 0.3776 English-English MAP 0.1577 (+7.57 %) 0.1466 Recall 0.5439 (+14.84%) 0.4736 In the second set of experiments, we compare the results of query expansion with and without limiting the search space. Table 2 shows that the performance does not change very much after restricting the web sites for cross-lingual retrieval. The performance even has a little decrease in mono-lingual runs. We find that restrictive access may retrieve unrelated pages in some cases. The third set of experiments aims to evaluate the effects of document expansion. Table 3 summarizes the results. Document expansion does not take positive effects. In both cross-lingual and mono-lingual runs, recall and MAP are decreased when document expansion is introduced no matter whether query expansion is employed or not. The major reason may be that the strategy brings in too much noise. The fourth set of experiments examines the performance of the media mapping method in the new definitions. The results are shown in Table 4. Total 8 cross-lingual runs and 3 mono-lingual runs are tested. Media mapping achieves very good performance. Compared with the performance of the models without expansion, the MAP improves about 86.69%~143.12%. In last year, media mapping improves the performance about 71%~119%. This result shows that media mapping method is robust under different task definitions.

Using an Image-Text Parallel Corpus and the Web for Query Expansion 509 Table 2. Results of models with and without limiting the search space Run Name (cross-lingual/mono-lingual) Limitation Recall MAP ZHT-EN-AUTO-QE-TXT-TOPIC (cross-lingual) No 0.4461 0.1225 ZHT-EN-AUTO-QE-WIKI-TXT (cross-lingual) Yes 0.4713 0.1290 EN-EN-AUTO-QE-TXT-TOPIC (mono-lingual) No 0.5439 0.1577 EN-EN-AUTO-QE-WIKI-TXT English (mono-lingual) Yes 0.5102 0.1330 Table 3. Results of models using or not using document expansion Runs Name (cross-lingual/mono-lingual) Document Query Recall MAP Expansion Expansion ZHT-EN-AUTO-NOFB-TXT (cross) No No 0.3776 0.1055 ZHT-EN-AUTO-TE-TXT-CAPTION (cross) Yes No 0.3050 0.0757 ZHT-EN-AUTO-QE-TXT-TOPIC (cross) No Yes 0.4461 0.1225 ZHT-EN-AUTO-TE-QE-TXT (cross) Yes Yes 0.3562 0.0799 EN-EN-AUTO-NOFB-TXT (mono) No No 0.4736 0.1466 EN-EN-AUTO-TE-TXT-CAPTION (mono) Yes No 0.3729 0.1154 EN-EN-AUTO-QE-TXT-TOPIC (mono) No Yes 0.5439 0.1577 EN-EN-AUTO-TE-QE-TXT (mono) Yes Yes 0.4156 0.1203 Table 4. Results of using the media mapping as query expansion Query Language- Document Language Traditional Chinese-English Simplified Chinese-English Portuguese-English Spanish-English Russian-English Italian-English French-English Japanese-English English-English Spanish-Spanish German-German Metric Query Expansion using Media Mapping Without Expansion MAP 0.2565 (+143.12 %) 0.1055 Recall 0.6405 (+69.62 %) 0.3776 MAP 0.2565 (+143.12 %) 0.1055 Recall 0.6405 (+69.62 %) 0.3776 MAP 0.2820 (+109.35 %) 0.1347 Recall 0.6733 (+51.81 %) 0.4435 MAP 0.2785 (+96.12 %) 0.1420 Recall 0.6718 (+50.89 %) 0.4452 MAP 0.2731 (+100.66 %) 0.1361 Recall 0.6738 (+46.54 %) 0.4598 MAP 0.2705 (+130.60 %) 0.1173 Recall 0.6481 (+67.59 %) 0.3867 MAP 0.2669 (+95.96 %) 0.1362 Recall 0.6651 (+61.74 %) 0.4112 MAP 0.2551 (+117.29 %) 0.1174 Recall 0.6507 (+57.55 %) 0.4130 MAP 0.2737 (+86.69 %) 0.1466 Recall 0.6812 (+43.83 %) 0.4736 MAP 0.2792 (+92.02 %) 0.1454 Recall 0.6282 (+32.30 %) 0.4748 MAP 0.2449 (+128.87 %) 0.1070 Recall 0.5790 (+82.64 %) 0.3170

510 Y.-C. Chang and H.-H. Chen Tables 1 and 4 conclude that media mapping with an image-text parallel corpus and query expansion using the web are very useful, and the former is better than the latter. The last set of experiments checks if integrating the internal and the external resources gets better performance. Table 5 shows that such an integration does not have positive effects. MAP is decreased when the web is used. It may be due to that the external resource (i.e., the web) has more noise than the internal resource (i.e., the image-text parallel corpus). Table 5. Results of the models using both media mapping and the web Run Media Mapping the Web Recall MAP ZHT-EN-AUTO-FBQE-TXTIMG Yes No 0.6405 0.2565 ZHT-EN-AUTO-QE-FBQE-TXTIMG Yes Yes 0.6533 0.2255 EN-EN-AUTO-FBQE-TXTIMG Yes No 0.6812 0.2737 EN-EN-AUTO-QE-FBQE-TXTIMG Yes Yes 0.6738 0.2442 In the above sets of experiments, we compare the performance of different kinds of approaches. Media mapping with an image-text parallel corpus is the best of all. Table 6 summarizes the ranks of our official runs with media mapping approach in Image- CLEFphoto2007. Each row lists the language pair, total submitted runs and the rank of our runs. Compared with the runs of different participants, media mapping approach performs quite well in different language pairs. Except English mono-lingual and Simplified Chinese-English cross-lingual runs, our system ranks number 1 in the rest of official runs we submitted. That shows the robustness of our media mapping approach in integrating text and visual information. Table 6. Ranks of official runs using media mapping approach in ImageCLEFphoto2007 Mono-Lingual Run/Cross-Lingual Run Total Submitted Runs Rank English English 142 8 German German 30 1 Spanish Spanish 15 1 Simplified Chinese English 23 2 Tradition Chinese English 1 1 French English 21 1 Italian English 10 1 Japanese English 6 1 Portuguese English 9 1 Russian English 6 1 Spanish English 9 1 5 Conclusion This paper explores the use of the web for query and document expansion. The experiments show that the named entities expanded from the web are useful. Limiting the search web sites to Wikipedia seems not to improve the performance and may filter out some related webs. Document expansion brings in too much noise, so that

Using an Image-Text Parallel Corpus and the Web for Query Expansion 511 the performance decreases 28.24%. Regarding media mapping as query expansion improves the retrieval performance very much. It shows the robustness of media mapping method even the new task definition is more challenging than before. Integrating both the web and an image-text parallel corpus for query expansion cannot improve the performance. Acknowledgments. Research of this paper was partially supported by National Science Council, Taiwan, and Excellent Research Projects of National Taiwan University, under the contracts 95-2221-E-002-334 and 96R0062-AE00-02. References 1. Clough, P., Sanderson, M., Müller, H.: The CLEF 2004 Cross-Language Image Retrieval Track. In: Peters, C., Clough, P., Gonzalo, J., Jones, G.J.F., Kluck, M., Magnini, B. (eds.) CLEF 2004. LNCS, vol. 3491, pp. 597 613. Springer, Heidelberg (2005) 2. Clough, P., Müller, H., Deselaers, T., Grubinger, M., Lehmann, T.M., Jensen, J., Hersh, W.: The CLEF 2005 Cross-Language Image Retrieval Track. In: Peters, C., Clough, P., Gonzalo, J., Jones, G., Kluck, M., Magnini, B. (eds.) CLEF 2005. LNCS, vol. 4022, pp. 535 557. Springer, Heidelberg (2006) 3. Grubinger, M., Clough, P., Hanbury, A., Müller, H.: Overview of the ImageCLEFphoto 2007 Photographic Retrieval Task. In: Nardi, A., Peters, C. (eds.) Working Notes of the 2007 CLEF Workshop (2007) 4. Chang, Y.C., Chen, H.H.: Approaches of Using a Word-Image Ontology and an Annotated Image Corpus as Intermedia for Cross-Language Image Retrieval. In: Peters, C., Clough, P., Gey, F.C., Karlgren, J., Magnini, B., Oard, D.W., de Rijke, M., Stempfhuber, M. (eds.) CLEF 2006. LNCS, vol. 4730, pp. 625 632. Springer, Heidelberg (2007) 5. Lin, W.C., Chang, Y.C., Chen, H.H.: Integrating Textual and Visual Information for Cross- Language Image Retrieval: A Trans-Media Dictionary Approach. Information Processing and Management 43, 488 502 (2007) 6. Chen, H.H., Chang, Y.C.: Language Translation and Media Transformation in Cross- Language Image Retrieval. In: Sugimoto, S., Hunter, J., Rauber, A., Morishima, A. (eds.) ICADL 2006. LNCS, vol. 4312, pp. 350 359. Springer, Heidelberg (2006)