Evaluation of Automatically Assigned MeSH Terms for Retrieval of Medical Images

Miguel E. Ruiz 1 and Aurélie Névéol 2

1 University of North Texas, School of Library and Information Sciences, P.O. Box , Denton, Texas USA
meruiz@unt.edu
2 National Library of Medicine, Bldg. 38A, B1N-28A, 8600 Rockville Pike, Bethesda, MD USA
neveola@mail.nih.gov

C. Peters et al. (Eds.): CLEF 2007, LNCS 5152, pp. , © Springer-Verlag Berlin Heidelberg 2008

Abstract. This paper presents the results of the State University of New York at Buffalo (UB) team in collaboration with the National Library of Medicine (NLM) in the 2007 ImageCLEFmed task. We use a system that combines visual features (from a content-based image retrieval, CBIR, system) with text retrieval. We used the Medical Text Indexer (MTI) developed by NLM to automatically assign MeSH terms and UMLS concepts to the English free text annotations of the images. We also used an equivalent system called MAIF that automatically assigns MeSH terms and UMLS concepts to French free text. Our results indicate that the use of automatically assigned UMLS concepts improves retrieval performance significantly. We also identified specific aspects of the system that could be improved in the future, such as the method used to perform the automatic translation of medical terms and the addition of image classification to process queries targeted to a specific image modality.

1 Introduction

This paper presents the results of our participation in ImageCLEFmed 2007. In previous years we have used a method that maps the queries to Unified Medical Language System (UMLS) concepts and then uses these concepts to find translations of the English queries into French and German [1, 2]. This method has been successful in handling English queries to find the corresponding French and German translations. For this year's challenge, we focused on assessing 1) the use of an automatic indexing system providing Medical Subject Headings (MeSH terms) and UMLS concepts; and 2) the use of UMLS-based translation with French as the query language. The impact of both features on retrieval performance was analyzed.

2 System Description

The system that was used this year combines two publicly available systems:

SMART: This is an information retrieval system developed by Gerard Salton and his collaborators at Cornell University [3]. SMART implements a generalized vector space model representation of documents and queries.

This is an important feature since we wanted to include three different representations of the image annotations: free text, MeSH terms, and UMLS concepts.

Flexible Image Retrieval Engine (FIRE): This is an open source content-based image retrieval system developed at RWTH Aachen University, Germany [4].

For processing the annotations we also used two automatic text categorization tools that map free text to MeSH terms. We used the Medical Text Indexer (MTI), a tool developed at the U.S. National Library of Medicine (NLM), to assign MeSH terms to the English annotations. For processing French text we used the Medical Automatic Indexer for French (MAIF), a tool similar to MTI that uses NLP as well as statistical methods to assign MeSH terms to free text. We did not have a tool to perform a similar mapping of the German text. We also decided to add the concept unique identifier (CUI) from the UMLS so that we could match queries and documents using these language-independent concepts. Since MeSH is one of the vocabularies of UMLS, the UMLS concepts were assigned by looking up the UMLS identifiers that correspond to the MeSH terms.

3 Collection Preparation

As described in the ImageCLEFmed 2007 overview paper [5], the image collection used in this task consists of six sub-collections. Each collection has its own metadata in XML format for the image annotations. In order to process all collections uniformly we created a common XML schema and converted all the annotations to this new schema. Figure 1 shows the common metadata schema that was used.

English queries and documents were processed with MTI, which identifies the MeSH concepts present in the free text; the corresponding MeSH terms and UMLS concepts were then added. MTI uses NLP techniques (implemented in MetaMap) as well as a statistical K-Nearest-Neighbor (KNN) method that takes advantage of the entire MEDLINE collection [6].
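The enrichment step described above, in which each annotation gains MeSH terms and the UMLS concepts that correspond to them, can be pictured with a minimal sketch. Everything named here is an assumption made for illustration: `MESH_TO_CUI`, `AnnotatedImage`, and `enrich` are hypothetical, the concept identifiers are placeholders rather than real UMLS CUIs, and the list of assigned headings stands in for the output of an indexer such as MTI or MAIF.

```python
from dataclasses import dataclass, field
from typing import List

# Hypothetical MeSH-heading -> UMLS CUI lookup. The identifiers are
# placeholders for illustration, not real UMLS entries.
MESH_TO_CUI = {
    "Radiography, Thoracic": "C_EXAMPLE_1",
    "Pneumonia": "C_EXAMPLE_2",
}


@dataclass
class AnnotatedImage:
    """One image annotation with its three parallel representations."""
    image_id: str
    free_text: str
    mesh_terms: List[str] = field(default_factory=list)
    umls_cuis: List[str] = field(default_factory=list)


def enrich(image_id: str, free_text: str, assigned_mesh: List[str]) -> AnnotatedImage:
    """Attach MeSH headings and their CUIs to one annotation.

    `assigned_mesh` stands in for indexer output; headings without a
    known CUI are kept as MeSH terms but contribute no concept.
    """
    cuis = [MESH_TO_CUI[term] for term in assigned_mesh if term in MESH_TO_CUI]
    return AnnotatedImage(image_id, free_text, list(assigned_mesh), cuis)


doc = enrich(
    "img-001",
    "Chest x-ray showing pneumonia",
    ["Radiography, Thoracic", "Pneumonia", "Unknown Heading"],
)
print(doc.mesh_terms)
print(doc.umls_cuis)
```

Keeping the three representations as separate fields mirrors the idea of indexing them separately, so that a query can match on words, on MeSH terms, or on language-independent concepts.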
MTI is currently used at NLM as a semi-automatic and fully automatic indexing tool. For this task, we used the top 25 recommendations provided by the system run with default filtering.

French queries and documents were processed using a modified version of the MAIF described in [7]. MAIF is able to retrieve MeSH terms from biomedical text in French. It specifically retrieves main headings and main heading/subheading pairs. However, for the purpose of the ImageCLEF task, we only used MAIF to retrieve MeSH main headings, which were then mapped to UMLS concepts. We used a collection of 15,000 French citations available from CISMeF (Catalogue and Index of Online Health Information in French) for retrieving the French MeSH terms used in MAIF. The modified version of MAIF is similar to MTI in that it combines an NLP method and a statistical, knowledge-based method [7]. However, the two systems differ in the specific implementation of both methods.

The combination of these two approaches takes into account the relative score assigned to the terms by each method. The relative score of a term is obtained by dividing the score of the term by the sum of all the scores assigned by the corresponding method. Combining the methods in this way gives an advantage to terms retrieved by the NLP method: because the NLP approach tends to retrieve a smaller number of terms per document, the relative importance of each term tends to be higher than the relative importance of terms retrieved by the statistical method.

The final term selection is performed using the breakage function described in [8]. The score assigned to a MeSH candidate represents its likelihood of being a good indexing term: the higher the score, the more likely it is that the corresponding MeSH term is a good indexing candidate. Given a list of indexing candidates and their scores, the breakage function is meant to detect a break in the continuity of the scores, thereby highlighting the point in the candidate list where terms become significantly less likely to be correct indexing terms. The final set of MeSH main headings assigned to a document consists of all the terms ranked above this threshold.

Fig. 1. Common XML schema and Ctypes for indexing

Once the collections were converted into the common XML schema, we used SMART to parse the XML documents and create three indexes (also called Ctypes in SMART). Ctype 0 was used for indexing free text from the original annotations, Ctype 1 was used to index the MeSH terms automatically assigned using the medical text indexing tools (MTI for English text and MAIF for French text), and Ctype 2 was used to index the UMLS concepts that were identified by MTI or MAIF.

4 Retrieval Model

We used a generalized vector space model that combines the vector representation of each of the four indexes presented in Figure 1.
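To make the per-index vector representation concrete, the sketch below treats each text Ctype as its own bag of terms and computes one cosine similarity per Ctype. It is an illustration only, not the SMART implementation: raw term counts stand in for SMART's weighting schemes, and the names `CTYPES`, `cosine`, and `ctype_similarities`, together with the sample terms and concept identifiers, are hypothetical.

```python
import math
from collections import Counter

# Hypothetical labels for the three text indexes (Ctype 0, 1 and 2).
CTYPES = ("words", "mesh", "umls")


def cosine(u, v):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(weight * v[term] for term, weight in u.items() if term in v)
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # an empty vector matches nothing
    return dot / (norm_u * norm_v)


def ctype_similarities(doc, query):
    """Return one similarity per Ctype; a missing Ctype counts as empty."""
    return {
        ctype: cosine(Counter(doc.get(ctype, [])), Counter(query.get(ctype, [])))
        for ctype in CTYPES
    }


# Sample data with placeholder concept identifiers (not real UMLS CUIs).
doc = {
    "words": ["chest", "xray", "pneumonia"],
    "mesh": ["Pneumonia"],
    "umls": ["C_EX_1", "C_EX_2"],
}
query = {
    "words": ["pneumonia"],
    "umls": ["C_EX_1", "C_EX_2"],
}

sims = ctype_similarities(doc, query)
print(sims)
```

Because each Ctype yields its own similarity value, the contribution of words, MeSH terms, and UMLS concepts can later be weighted independently.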
The final retrieval model can be represented using the following formula:

\[ \mathrm{score}(\mathit{image}) = \alpha \cdot \mathrm{Score}_{CBIR} + \beta \cdot \mathrm{sim}_{Text}(d_i, q) \tag{1} \]

where $\alpha$ and $\beta$ are coefficients that weight the contribution of each system and $\mathrm{sim}_{Text}$ is defined as:

\[ \mathrm{sim}_{Text}(d_i, q) = \lambda \cdot \mathrm{sim}_{words}(d_i, q) + \mu \cdot \mathrm{sim}_{MeSHterms}(d_i, q) + \rho \cdot \mathrm{sim}_{UMLSConcepts}(d_i, q) \tag{2} \]

where $\lambda$, $\mu$ and $\rho$ are coefficients that control the contribution of each of the Ctypes. The values of these coefficients were set empirically to those that gave the best results on the 2006 topics. The similarity values are computed using cosine normalization (atc) for the documents and augmented term frequency (atn) for the queries. We also performed automatic retrieval feedback by retrieving 1,000 documents using the original query and assuming that the top n documents are relevant. This allowed us to select the top m terms ranked according to Rocchio's relevance feedback formula [9].

5 Experimental Results and Analysis

We submitted 7 official runs, which are shown in Table 1. A total of 5 runs use queries in English and 2 runs use queries in French. Translations of the queries into the other two languages were automatically generated by expanding the query with all the UMLS terms associated with the concepts assigned by MTI or MAIF. Among these runs, the highest scores were obtained by the runs that use the English queries and combine the text and image results, with Mean Average Precision (MAP) values of and (UB-NLM-UBTI 3 and UB-NLM-UBTI 1). Overall, these two runs perform well above the median run in ImageCLEFmed 2007 (Median MAP= ) and rank 5th and 6th among all automatic mixed runs. Unfortunately, our multilingual runs perform significantly worse (MAP 0.254). This indicates that our automatic translation approach decreases performance compared with using the English queries only. We suspect that the translations add terms that change the focus of the query.

Tables 2a-d show a series of unofficial runs that allow comparison of the methods that were used in our system.
Table 2a shows the performance obtained by using free text (English only), automatically assigned UMLS concepts, and CBIR retrieval using FIRE. Our baselines for free text and UMLS concepts are quite strong, since both perform above the median system. The CBIR baseline is quite weak compared with the text and concept baselines. However, compared with other visual-only runs it is around average. A query-by-query analysis of the results for the CBIR run shows that the MAP for 21 of the 30 queries is below , which is a major factor in the poor performance. The likely cause is that the queries require a specific image modality, while our CBIR system does not include an image classification module that could identify the image modality and filter out images that do not have the modality requested in the query.

Table 2b shows the results obtained using only English queries. Because the collection has predominantly English annotations, these runs correspond to our highest scoring official runs (UBTI 1 and UBTI 3). All these runs use the free text as well as the UMLS concepts automatically assigned to both queries and documents. These results confirm that the use of automatically identified concepts improves performance considerably compared with using free text only. We can also see that the merging formula that combines visual and text features works properly, even though the CBIR run contributes little to the overall MAP. Our two top scoring runs use text as well as image features. The best automatic run (MAP=0.3018) was not submitted but is only marginally better than our highest official run.

Table 1. Performance of Official Runs

Run name                Description          Type         MAP   Exact-P   P10   P20
UB-NLM-UBTI 3           English queries      Mixed run
UB-NLM-UBTI 1           English queries      Mixed run
UB-NLM-UBmixedMulti2    English cross-lang   Mixed run
UB-NLM-UBTextBL1        English queries      Text only
UB-NLM-UBTextBL2        English cross-lang   Text only
UB-NLM-UBTextFR         French cross-lang    Text only
UB-NLM-UBmixedFR        French cross-lang    Mixed run

Tables 2c and 2d show the performance of our cross-lingual runs. These runs use the UMLS automatic translations based on the UMLS concept mapping obtained from the English text. This actually harms performance significantly compared with using English-only queries. We believe this is due to the aggressive translation method that we used, since it seems to add terms that shift the focus of the query. We plan to explore this issue in more detail in our future research. Despite this, the results confirm that using UMLS concepts (which are language independent) improves performance with respect to using only free text translations. The use of the results from the CBIR system yields only small improvements in retrieval performance.

Table 2d shows the results of our cross-lingual runs that use French as the query language. Our official French runs used the same parameters as the English runs, and this seems to have harmed the results for French, since our unofficial runs show significantly better performance. These results are comparable to the best French cross-lingual results presented by other teams in the conference. However, the overall French cross-lingual results achieve only 56% of the English retrieval performance. This could be because the French resources we used (citation database and medical lexicon) are much smaller than the UMLS resources available for English.

Table 3 presents runs that use all the manually generated terms in English, French and German that were provided in the ImageCLEFmed topics. These queries achieve the highest score using our system, with a MAP of , which is comparable to the best manual run reported this year [5]. As in our previously presented experiments, the results with the manual queries show improvements when automatically generated UMLS concepts and pseudo relevance feedback are used. Use of the CBIR results yields a small improvement.

Table 2. Unofficial Runs

Run name                             MAP   Exact-P   P10   P20
(a) Baseline runs
  EN-free text only
  UMLS concepts only
  FIRE baseline (CBIR)
(b) English only runs
  EN-text-RF
  EN-text baseline + image
  EN-text rf + images
(c) Automatic English cross-lingual runs
  EN-Multi-Baseline
  EN-Multi + concepts
  EN-Multi + concepts + images
  EN-Multi-rf
(d) Automatic French cross-lingual runs
  FR-Multi-Baseline
  FR-Multi-Baseline + images
  FR-Multi-RF
  FR-Multi-RF + images

Table 3. Manual runs

Run name                                MAP   Exact-P   P10   P20
Multi-manual text only
Multi-Manual text+concepts
Multi-Manual Text+concepts + images
Multi-manual rf
Multi-manual rf + images

Table 4. Comparison of results by type of query

Type              Free text   UMLS concepts   CBIR   Combination
Visual
Visual-Semantic
Semantic

We performed a query-by-query analysis to understand how the different methods are affected by different types of queries. Table 4 shows the average MAP by groups of topics according to whether they are visual, semantic, or mixed (visual-semantic). As expected, the text-based and UMLS concept-based runs perform better on the semantic topics. The CBIR system performs slightly better on the visual and mixed topics, while its poorest performance is on the semantic topics. The combination shows consistent improvements in all three groups of topics.

6 Conclusions

From the results we can conclude that the use of UMLS concepts automatically assigned by MTI significantly improves performance for the retrieval of medical images with English annotations. We also confirm that our generalized vector space model works well for combining retrieval results from free text, UMLS concepts and CBIR systems. Despite the low performance of our CBIR system, the merging method is robust enough to maintain or even improve results. We also conclude that our methods work better for semantic queries while still performing well on visual and mixed visual-semantic queries.

Our cross-lingual results using French as the query language are relatively low and indicate that we need to improve our translation method based on UMLS mapping. We plan to explore this further in our future research. The low results from the CBIR system indicate that we need to address the image classification problem so that the CBIR results can make a more significant contribution to the overall fusion of results.

Acknowledgements

This work was supported in part by an appointment of A. Névéol and M. E. Ruiz to the NLM Research Participation Program. This program is administered by the Oak Ridge Institute for Science and Education through an interagency agreement between the U.S. Department of Energy and the National Library of Medicine. We also want to thank Dr. Alan Aronson and the Indexing Initiative Project team at the NLM for their support and for making the MTI system available for this project.

References

[1] Ruiz, M.: Combining image features, case descriptions and UMLS concepts to improve retrieval of medical images. In: Proceedings of the AMIA Annual Symposium, Washington, DC, pp. (2006)
[2] Ruiz, M.: UB at ImageCLEFmed In: Peters, C., Clough, P., Gonzalo, J., Jones, G., Kluck, M., Magnini, B. (eds.) CLEF LNCS, vol. 4730, pp. Springer, Heidelberg (2007)
[3] Salton, G. (ed.): The SMART Retrieval System: Experiments in Automatic Document Processing. Prentice-Hall, Englewood Cliffs (1983)
[4] Deselaers, T., Keysers, D., Ney, H.: Features for image retrieval: A quantitative comparison. In: Rasmussen, C.E., Bülthoff, H.H., Schölkopf, B., Giese, M.A. (eds.) DAGM LNCS, vol. 3175, pp. Springer, Heidelberg (2004)
[5] Müller, H., Deselaers, T., Kim, E., Kalpathy-Cramer, J., Deserno, T.M., Hersh, W.: Overview of the ImageCLEF 2007 medical retrieval and annotation tasks. In: Peters, C., et al. (eds.) CLEF LNCS, vol. Springer, Heidelberg (2008)
[6] Aronson, A., Mork, J., Gay, C., Humphrey, S., Rogers, W.: The NLM Indexing Initiative's Medical Text Indexer. In: MEDINFO, 11(Pt 1), pp. (2004)
[7] Névéol, A., Mork, J., Aronson, A., Darmoni, S.: Evaluation of French and English MeSH indexing systems with a parallel corpus. In: Proceedings of the AMIA Annual Symposium, pp. (2005)
[8] Névéol, A., Rogozan, A., Darmoni, S.: Automatic indexing of online health resources for a French quality controlled gateway. Information Processing and Management 42, (2006)
[9] Rocchio, J.J.: Relevance feedback in information retrieval. In: Salton, G. (ed.) The SMART Retrieval System: Experiments in Automatic Document Processing, Englewood Cliffs, NJ (1971)


Multilingual Image Search from a user s perspective Multilingual Image Search from a user s perspective Julio Gonzalo, Paul Clough, Jussi Karlgren QUAERO-Image CLEF workshop, 16/09/08 Finding is a matter of two fast stupid smart slow great potential for

More information

ECNU at 2017 ehealth Task 2: Technologically Assisted Reviews in Empirical Medicine

ECNU at 2017 ehealth Task 2: Technologically Assisted Reviews in Empirical Medicine ECNU at 2017 ehealth Task 2: Technologically Assisted Reviews in Empirical Medicine Jiayi Chen 1, Su Chen 1, Yang Song 1, Hongyu Liu 1, Yueyao Wang 1, Qinmin Hu 1, Liang He 1, and Yan Yang 1,2 Department

More information

AAUITEC at ImageCLEF 2015: Compound Figure Separation

AAUITEC at ImageCLEF 2015: Compound Figure Separation AAUITEC at ImageCLEF 2015: Compound Figure Separation Mario Taschwer 1 and Oge Marques 2 1 ITEC, Klagenfurt University (AAU), Austria, mario.taschwer@aau.at 2 Florida Atlantic University (FAU), Boca Raton,

More information

Chapter 6: Information Retrieval and Web Search. An introduction

Chapter 6: Information Retrieval and Web Search. An introduction Chapter 6: Information Retrieval and Web Search An introduction Introduction n Text mining refers to data mining using text documents as data. n Most text mining tasks use Information Retrieval (IR) methods

More information

Multimodal Medical Image Retrieval based on Latent Topic Modeling

Multimodal Medical Image Retrieval based on Latent Topic Modeling Multimodal Medical Image Retrieval based on Latent Topic Modeling Mandikal Vikram 15it217.vikram@nitk.edu.in Suhas BS 15it110.suhas@nitk.edu.in Aditya Anantharaman 15it201.aditya.a@nitk.edu.in Sowmya Kamath

More information

DUTH at ImageCLEF 2011 Wikipedia Retrieval

DUTH at ImageCLEF 2011 Wikipedia Retrieval DUTH at ImageCLEF 2011 Wikipedia Retrieval Avi Arampatzis, Konstantinos Zagoris, and Savvas A. Chatzichristofis Department of Electrical and Computer Engineering, Democritus University of Thrace, Xanthi

More information

Outline. Possible solutions. The basic problem. How? How? Relevance Feedback, Query Expansion, and Inputs to Ranking Beyond Similarity

Outline. Possible solutions. The basic problem. How? How? Relevance Feedback, Query Expansion, and Inputs to Ranking Beyond Similarity Outline Relevance Feedback, Query Expansion, and Inputs to Ranking Beyond Similarity Lecture 10 CS 410/510 Information Retrieval on the Internet Query reformulation Sources of relevance for feedback Using

More information

A Survey on Postive and Unlabelled Learning

A Survey on Postive and Unlabelled Learning A Survey on Postive and Unlabelled Learning Gang Li Computer & Information Sciences University of Delaware ligang@udel.edu Abstract In this paper we survey the main algorithms used in positive and unlabeled

More information

A New Approach for Automatic Thesaurus Construction and Query Expansion for Document Retrieval

A New Approach for Automatic Thesaurus Construction and Query Expansion for Document Retrieval Information and Management Sciences Volume 18, Number 4, pp. 299-315, 2007 A New Approach for Automatic Thesaurus Construction and Query Expansion for Document Retrieval Liang-Yu Chen National Taiwan University

More information

Medical image analysis and retrieval. Henning Müller

Medical image analysis and retrieval. Henning Müller Medical image analysis and retrieval Henning Müller Overview My background Our laboratory Current projects Khresmoi, MANY, Promise, Chorus+, NinaPro Challenges Demonstration Conclusions 2 Personal background

More information

Patent Terminlogy Analysis: Passage Retrieval Experiments for the Intellecutal Property Track at CLEF

Patent Terminlogy Analysis: Passage Retrieval Experiments for the Intellecutal Property Track at CLEF Patent Terminlogy Analysis: Passage Retrieval Experiments for the Intellecutal Property Track at CLEF Julia Jürgens, Sebastian Kastner, Christa Womser-Hacker, and Thomas Mandl University of Hildesheim,

More information

Book Recommendation based on Social Information

Book Recommendation based on Social Information Book Recommendation based on Social Information Chahinez Benkoussas and Patrice Bellot LSIS Aix-Marseille University chahinez.benkoussas@lsis.org patrice.bellot@lsis.org Abstract : In this paper, we present

More information

Relevance of Google Customized Search Engine vs. CISMeF Quality- Controlled Health Gateway

Relevance of Google Customized Search Engine vs. CISMeF Quality- Controlled Health Gateway Relevance of Google Customized Search Engine vs. CISMeF Quality- Controlled Health Gateway Jean-François Gehanno a, Gaétan Kerdelhué a, Saoussen Sakji a, Philippe Massari a, Michel Joubert b, Stéfan J.

More information

X. A Relevance Feedback System Based on Document Transformations. S. R. Friedman, J. A. Maceyak, and S. F. Weiss

X. A Relevance Feedback System Based on Document Transformations. S. R. Friedman, J. A. Maceyak, and S. F. Weiss X-l X. A Relevance Feedback System Based on Document Transformations S. R. Friedman, J. A. Maceyak, and S. F. Weiss Abstract An information retrieval system using relevance feedback to modify the document

More information

FIRE Flexible Image Retrieval Engine: ImageCLEF 2004 Evaluation

FIRE Flexible Image Retrieval Engine: ImageCLEF 2004 Evaluation FIRE Flexible Image Retrieval Engine: ImageCLEF 2004 Evaluation Thomas Deselaers, Daniel Keysers, and Hermann Ney Lehrstuhl für Informatik VI Computer Science Department, RWTH Aachen University D-52056

More information

TREC 2016 Dynamic Domain Track: Exploiting Passage Representation for Retrieval and Relevance Feedback

TREC 2016 Dynamic Domain Track: Exploiting Passage Representation for Retrieval and Relevance Feedback RMIT @ TREC 2016 Dynamic Domain Track: Exploiting Passage Representation for Retrieval and Relevance Feedback Ameer Albahem ameer.albahem@rmit.edu.au Lawrence Cavedon lawrence.cavedon@rmit.edu.au Damiano

More information

TEXT CHAPTER 5. W. Bruce Croft BACKGROUND

TEXT CHAPTER 5. W. Bruce Croft BACKGROUND 41 CHAPTER 5 TEXT W. Bruce Croft BACKGROUND Much of the information in digital library or digital information organization applications is in the form of text. Even when the application focuses on multimedia

More information

SNUMedinfo at TREC CDS track 2014: Medical case-based retrieval task

SNUMedinfo at TREC CDS track 2014: Medical case-based retrieval task SNUMedinfo at TREC CDS track 2014: Medical case-based retrieval task Sungbin Choi, Jinwook Choi Medical Informatics Laboratory, Seoul National University, Seoul, Republic of Korea wakeup06@empas.com, jinchoi@snu.ac.kr

More information

Inter and Intra-Document Contexts Applied in Polyrepresentation

Inter and Intra-Document Contexts Applied in Polyrepresentation Inter and Intra-Document Contexts Applied in Polyrepresentation Mette Skov, Birger Larsen and Peter Ingwersen Department of Information Studies, Royal School of Library and Information Science Birketinget

More information

CIRGDISCO at RepLab2012 Filtering Task: A Two-Pass Approach for Company Name Disambiguation in Tweets

CIRGDISCO at RepLab2012 Filtering Task: A Two-Pass Approach for Company Name Disambiguation in Tweets CIRGDISCO at RepLab2012 Filtering Task: A Two-Pass Approach for Company Name Disambiguation in Tweets Arjumand Younus 1,2, Colm O Riordan 1, and Gabriella Pasi 2 1 Computational Intelligence Research Group,

More information

Multi-Modal Interactive Approach to ImageCLEF 2007 Photographic and Medical Retrieval Tasks by CINDI

Multi-Modal Interactive Approach to ImageCLEF 2007 Photographic and Medical Retrieval Tasks by CINDI Multi-Modal Interactive Approach to ImageCLEF 2007 Photographic and Medical Retrieval Tasks by CINDI M. M. Rahman, Bipin C. Desai, Prabir Bhattacharya Dept. of Computer Science & Software Engineering,

More information

Tilburg University. Authoritative re-ranking of search results Bogers, A.M.; van den Bosch, A. Published in: Advances in Information Retrieval

Tilburg University. Authoritative re-ranking of search results Bogers, A.M.; van den Bosch, A. Published in: Advances in Information Retrieval Tilburg University Authoritative re-ranking of search results Bogers, A.M.; van den Bosch, A. Published in: Advances in Information Retrieval Publication date: 2006 Link to publication Citation for published

More information

Task3 Patient-Centred Information Retrieval: Team CUNI

Task3 Patient-Centred Information Retrieval: Team CUNI Task3 Patient-Centred Information Retrieval: Team CUNI Shadi Saleh and Pavel Pecina Charles University Faculty of Mathematics and Physics Institute of Formal and Applied Linguistics, Czech Republic {saleh,pecina}@ufal.mff.cuni.cz

More information

NovaSearch on medical ImageCLEF 2013

NovaSearch on medical ImageCLEF 2013 NovaSearch on medical ImageCLEF 2013 André Mourão, Flávio Martins and João Magalhães Universidade Nova de Lisboa, Faculdade de Ciências e Tecnologia, Caparica, Portugal, a.mourao@campus.fct.unl.pt, flaviomartins@acm.org,

More information

RMIT University at TREC 2006: Terabyte Track

RMIT University at TREC 2006: Terabyte Track RMIT University at TREC 2006: Terabyte Track Steven Garcia Falk Scholer Nicholas Lester Milad Shokouhi School of Computer Science and IT RMIT University, GPO Box 2476V Melbourne 3001, Australia 1 Introduction

More information

A Semantic Web-Based Approach for Harvesting Multilingual Textual. definitions from Wikipedia to support ICD-11 revision

A Semantic Web-Based Approach for Harvesting Multilingual Textual. definitions from Wikipedia to support ICD-11 revision A Semantic Web-Based Approach for Harvesting Multilingual Textual Definitions from Wikipedia to Support ICD-11 Revision Guoqian Jiang 1,* Harold R. Solbrig 1 and Christopher G. Chute 1 1 Department of

More information

A Multiple-stage Approach to Re-ranking Clinical Documents

A Multiple-stage Approach to Re-ranking Clinical Documents A Multiple-stage Approach to Re-ranking Clinical Documents Heung-Seon Oh and Yuchul Jung Information Service Center Korea Institute of Science and Technology Information {ohs, jyc77}@kisti.re.kr Abstract.

More information

Dealing with Syntactic Variation through a Locality-Based Approach

Dealing with Syntactic Variation through a Locality-Based Approach Dealing with Syntactic Variation through a Locality-Based Approach Jesús Vilares and Miguel A. Alonso Departamento de Computación, Universidade da Coruña Campus de Elviña s/n, 15071 A Coruña, Spain. {jvilares,

More information

Trans-Media Pseudo-Relevance Feedback Methods in Multimedia Retrieval

Trans-Media Pseudo-Relevance Feedback Methods in Multimedia Retrieval Trans-Media Pseudo-Relevance Feedback Methods in Multimedia Retrieval Stephane Clinchant, Jean-Michel Renders, and Gabriela Csurka Xerox Research Centre Europe, 6 ch. de Maupertuis, 38240 Meylan, France

More information

An Information Retrieval Approach for Source Code Plagiarism Detection

An Information Retrieval Approach for Source Code Plagiarism Detection -2014: An Information Retrieval Approach for Source Code Plagiarism Detection Debasis Ganguly, Gareth J. F. Jones CNGL: Centre for Global Intelligent Content School of Computing, Dublin City University

More information

Multimodal Information Spaces for Content-based Image Retrieval

Multimodal Information Spaces for Content-based Image Retrieval Research Proposal Multimodal Information Spaces for Content-based Image Retrieval Abstract Currently, image retrieval by content is a research problem of great interest in academia and the industry, due

More information

WeSeE-Match Results for OEAI 2012

WeSeE-Match Results for OEAI 2012 WeSeE-Match Results for OEAI 2012 Heiko Paulheim Technische Universität Darmstadt paulheim@ke.tu-darmstadt.de Abstract. WeSeE-Match is a simple, element-based ontology matching tool. Its basic technique

More information

Shrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India

Shrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 3 ISSN : 2456-3307 Some Issues in Application of NLP to Intelligent

More information

A New Measure of the Cluster Hypothesis

A New Measure of the Cluster Hypothesis A New Measure of the Cluster Hypothesis Mark D. Smucker 1 and James Allan 2 1 Department of Management Sciences University of Waterloo 2 Center for Intelligent Information Retrieval Department of Computer

More information

WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS

WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS 1 WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS BRUCE CROFT NSF Center for Intelligent Information Retrieval, Computer Science Department, University of Massachusetts,

More information

Semantic Estimation for Texts in Software Engineering

Semantic Estimation for Texts in Software Engineering Semantic Estimation for Texts in Software Engineering 汇报人 : Reporter:Xiaochen Li Dalian University of Technology, China 大连理工大学 2016 年 11 月 29 日 Oscar Lab 2 Ph.D. candidate at OSCAR Lab, in Dalian University

More information

A Multiclassifier based Approach for Word Sense Disambiguation using Singular Value Decomposition

A Multiclassifier based Approach for Word Sense Disambiguation using Singular Value Decomposition A Multiclassifier based Approach for Word Sense Disambiguation using Singular Value Decomposition Ana Zelaia, Olatz Arregi and Basilio Sierra Computer Science Faculty University of the Basque Country ana.zelaia@ehu.es

More information

IPL at CLEF 2013 Medical Retrieval Task

IPL at CLEF 2013 Medical Retrieval Task IPL at CLEF 2013 Medical Retrieval Task Spyridon Stathopoulos, Ismini Lourentzou, Antonia Kyriakopoulou, and Theodore Kalamboukis Information Processing Laboratory, Department of Informatics, Athens University

More information

Full-texts representation with Medical Subject Headings, and co-citations network reranking strategies for TREC 2014 Clinical Decision Support Track

Full-texts representation with Medical Subject Headings, and co-citations network reranking strategies for TREC 2014 Clinical Decision Support Track Full-texts representation with Medical Subject Headings, and co-citations network reranking strategies for TREC 2014 Clinical Decision Support Track J. Gobeill ab, A. Gaudinat a, E. Pasche c, P. Ruch ab

More information

From Passages into Elements in XML Retrieval

From Passages into Elements in XML Retrieval From Passages into Elements in XML Retrieval Kelly Y. Itakura David R. Cheriton School of Computer Science, University of Waterloo 200 Univ. Ave. W. Waterloo, ON, Canada yitakura@cs.uwaterloo.ca Charles

More information

Maintaining a Catalog of Manually-Indexed, Clinically-Oriented World Wide Web Content

Maintaining a Catalog of Manually-Indexed, Clinically-Oriented World Wide Web Content Maintaining a Catalog of Manually-Indexed, Clinically-Oriented World Wide Web Content William Hersh, M.D., Andrea Ball, M.L.S., Bikram Day, M.S., Mary Masterson, M.P.H., Li Zhang, M.S., Lynetta Sacherek,

More information

NUS-I2R: Learning a Combined System for Entity Linking

NUS-I2R: Learning a Combined System for Entity Linking NUS-I2R: Learning a Combined System for Entity Linking Wei Zhang Yan Chuan Sim Jian Su Chew Lim Tan School of Computing National University of Singapore {z-wei, tancl} @comp.nus.edu.sg Institute for Infocomm

More information

UMass at TREC 2017 Common Core Track

UMass at TREC 2017 Common Core Track UMass at TREC 2017 Common Core Track Qingyao Ai, Hamed Zamani, Stephen Harding, Shahrzad Naseri, James Allan and W. Bruce Croft Center for Intelligent Information Retrieval College of Information and Computer

More information

Document Structure Analysis in Associative Patent Retrieval

Document Structure Analysis in Associative Patent Retrieval Document Structure Analysis in Associative Patent Retrieval Atsushi Fujii and Tetsuya Ishikawa Graduate School of Library, Information and Media Studies University of Tsukuba 1-2 Kasuga, Tsukuba, 305-8550,

More information

A Multiclassifier based Approach for Word Sense Disambiguation using Singular Value Decomposition

A Multiclassifier based Approach for Word Sense Disambiguation using Singular Value Decomposition A Multiclassifier based Approach for Word Sense Disambiguation using Singular Value Decomposition Ana Zelaia, Olatz Arregi and Basilio Sierra Computer Science Faculty University of the Basque Country ana.zelaia@ehu.es

More information

Automatic prior art searching and patent encoding at CLEF-IP 10

Automatic prior art searching and patent encoding at CLEF-IP 10 Automatic prior art searching and patent encoding at CLEF-IP 10 1 Douglas Teodoro, 2 Julien Gobeill, 1 Emilie Pasche, 1 Dina Vishnyakova, 2 Patrick Ruch and 1 Christian Lovis, 1 BiTeM group, Medical Informatics

More information

Enriching Knowledge Domain Visualizations: Analysis of a Record Linkage and Information Fusion Approach to Citation Data

Enriching Knowledge Domain Visualizations: Analysis of a Record Linkage and Information Fusion Approach to Citation Data Enriching Knowledge Domain Visualizations: Analysis of a Record Linkage and Information Fusion Approach to Citation Data Marie B. Synnestvedt, MSEd 1, 2 1 Drexel University College of Information Science

More information

MedGIFT projects in medical imaging. Henning Müller

MedGIFT projects in medical imaging. Henning Müller MedGIFT projects in medical imaging Henning Müller Where we are 2 Who I am Medical informatics studies in Heidelberg, Germany (1992-1997) Exchange with Daimler Benz research, USA PhD in image processing,

More information

A Formal Approach to Score Normalization for Meta-search

A Formal Approach to Score Normalization for Meta-search A Formal Approach to Score Normalization for Meta-search R. Manmatha and H. Sever Center for Intelligent Information Retrieval Computer Science Department University of Massachusetts Amherst, MA 01003

More information

Automatically Generating Queries for Prior Art Search

Automatically Generating Queries for Prior Art Search Automatically Generating Queries for Prior Art Search Erik Graf, Leif Azzopardi, Keith van Rijsbergen University of Glasgow {graf,leif,keith}@dcs.gla.ac.uk Abstract This report outlines our participation

More information

Using Relevant Regions in Image Search and Query Refinement for Medical CBIR

Using Relevant Regions in Image Search and Query Refinement for Medical CBIR Using Relevant Regions in Image Search and Query Refinement for Medical CBIR Edward Kim 1, Sameer Antani 2, Xiaolei Huang 1, L.Rodney Long 2, Dina Demner-Fushman 2 1 Department of Computer Science and

More information

