Evaluation of Automatically Assigned MeSH Terms for Retrieval of Medical Images
|
|
- Magnus Nelson
- 5 years ago
- Views:
Transcription
1 Evaluation of Automatically Assigned MeSH Terms for Retrieval of Medical Images Miguel E. Ruiz 1 and Aurélie Névéol 2 1 University of North Texas, School of Library and Information Sciences P.O. Box , Denton, Texas USA meruiz@unt.edu 2 National Library of Medicine Bldg. 38A, B1N-28A, 8600 Rockville Pike, Bethesda, MD USA neveola@mail.nih.gov Abstract. This paper presents the results of the State University of New York at Buffalo (UB) team in collaboration with the National Library of Medicine (NLM) in the 2007 ImageCLEFmed task. We use a system that combines visual features (using a CBIR System) and text retrieval. We used the Medical Text Indexer (MTI) developed by NLM to automatically assign MeSH terms and UMLS concepts to the English free text annotations of the images. We also used an equivalent system called MAIF that automatically assigns MeSH and UMLS concepts to French free text. Our results indicate that the use of automatically assigned UMLS concepts improves retrieval performance significantly. We also identified specific aspects of the system that could be improved in the future, such as the method used to perform the automatic translation of medical terms and the addition of image classification to process queries targeted to a specific image modality. 1 Introduction ThispaperpresentstheresultsofourparticipationinimageCLEFmed2007.Inprevious years we have used a method that maps the queries to Unified Medical Language System (UMLS) concepts and then uses these concepts to find translations of the English queries into French and German [1, 2]. This method has been successful in handling English queries to find the corresponding French and German translations. For this year s challenge, we focused on assessing 1) the use of an automatic indexing system providing Medical subject Headings (MeSH terms) and UMLS concepts; and 2) the use of UMLS-based translation with French as the query language. The impact of both features on retrieval performance was analyzed. 2 System Description The system that was used this year combines two publicly available systems: SMART: This is an information retrieval system developed by Gerald Salton and his collaborators at Cornell University [3]. SMART implements a generalized vector space model representation of documents and queries. C. Peters et al. (Eds.): CLEF 2007, LNCS 5152, pp , c Springer-Verlag Berlin Heidelberg 2008
2 642 M.E. Ruiz and A. Névéol This is an important feature since we wanted to include three different representations of the image annotations: Free text, MeSH terms, and UMLS concepts. Flexible Image Retrieval Engine (FIRE): This is an open source content based image retrieval system developed at RWTH Aachen University, Germany [4]. For processing the annotations we also used two automatic text categorization tools that map free text to MeSH terms. We used the Medical Text Indexer (MTI) which is a tool developed at the U.S. National Library of Medicine (NLM) to assign MeSH terms to the English annotations. For processing French text we used Medical Automatic Indexer for French (MAIF) which is a tool similar to MTI that uses NLP as well as statistical methods to assign MeSH terms to free text. We did not have a tool to perform a similar mapping of the German text. We also decided to add the concept unique identifier (CUI) from the UMLS so that we could match queries and documents using these language independent concepts. Since MeSH is one of the vocabularies of UMLS, the assignment of the UMLS concepts was performed by getting the corresponding identifiers of the MeSH terms in UMLS. 3 Collection Preparation As described in the ImageCLEFmed 2007 overview paper [5] the image collection used in this task consists of six sub-collections. Each collection has its own metadata in XML format for the image annotations. In order to process all collections uniformly we created a common XML schema and converted all the annotation to this new schema. Figure 1 shows the common metadata schema that was used. English queries and documents were processed by parsing them using MTI to identify MeSH concepts present in the free text and then add the corresponding MeSH terms as well as the UMLS concepts. MTI uses NLP techniques (implemented in Metamap) as well as a statistical K-Nearest-Neighbor (KNN) method that takes advantage of the entire MEDLINE collection [6]. MTI is currently being used at NLM as a semi-automatic and fully automatic indexing tool. For this task, we used the top 25 recommendations provided by the system ran with default filtering. French queries and documents were processed using a modified version of the MAIF described in [7]. MAIF is able to retrieve MeSH terms from biomedical text in French. It specifically retrieves main headings and main heading/subheading pairs. However, for the purpose of the image-clef task, we only used MAIF to retrieve MeSH main headings that were then mapped to UMLS concepts. We used a collection of 15, 000 French citations available from CISMeF (Catalogue and Index of Online Health Information in French available at for retrieving the French MeSH terms used in MAIF. The modified version of MAIF is similar to MTI in that it combines a NLP method and a statistical, knowledge-based method [7]. However, the two systems differ in the specific implementation of both methods. The combination of these two approaches takes
3 Evaluation of Automatically Assigned MeSH Terms 643 Fig. 1. Common XML schema and Ctypes for indexing into account the relative score assigned to the terms by each method. The relative score of a term is obtained by dividing the score of the term by the sum of all the scores assigned by the corresponding method. Combining the methods in this way gives an advantage to terms retrieved by the NLP method. Because the NLP approach tends to retrieve a smaller number of terms per document, the relative importance of each term tends to be higher than the relative importance of terms retrieved by the statistical method. The final term selection is performed using the breakage function described in [8]. The score assigned to a MeSH candidate represents its likelihood to be a good indexing term: the higher the score, the more likely it is that the corresponding MeSH term is a good indexing candidate. Given a list of indexing candidates and the score that has been assigned to them, the breakage function is meant to detect a breach of continuity in the scores, therefore highlighting the point in the candidate list where terms become significantly less likely to be correct indexing terms. The final set of MeSH main headings assigned to a document consists of all the terms ranked above this threshold. Once the collections were converted in to the common XML schema we use SMART to parse the XML documents and create three indexes (also called Ctypes in SMART). Ctype 0 was used for indexing free text from the original annotations, Ctype 1 was used to index the MeSH terms automatically assigned using the medical text indexing tools (MTI for English text and MAIF for French text), and Ctype 2 was used to index the UMLS concepts that were identified by MTI or MAIF. 4 Retrieval Model We used a generalized vector space model that combines the vector representation of each of the four indexes presented in Figure 1. The final retrieval model can be represented using the following formula: score(image) =α Score CBIR + β sim Text (d i,q) (1)
4 644 M.E. Ruiz and A. Névéol where α and β are coefficients that weight the contribution of each system and sim Text is defined as: sim Text (d i,q)=λ sim words (d i,q)+μ sim MeSHterms (d i,q) +ρ sim UMLSConcepts (d i,q) (2) where λ, μ and ρ are coefficients that control the contribution of each of the ctypes. The values of these coefficients were computed empirically using the optimal results on the 2006 topics. The similarity values are computed using cosine normalization (atc) for the documents and augmented term frequency for the queries (atn). We also performed automatic retrieval feedback by retrieving 1, 000 documents using the original query and assuming that the top n documents are relevant. This allowed us to select the top m terms ranked according to Rocchio s relevance feedback formula [9]. 5 Experimental Results and Analysis We submitted 7 official runs which are shown in Table 1. A total of 5 runs use queries in English and 2 runs use queries in French. Translations of the queries into the other two languages were automatically generated by expanding the query with the all UMLS terms associated to the concepts assigned by MTI or MAIF. From these runs we can see that the highest score was obtained by runs that use the English queries and combine the text and image results obtaining a Mean Average Precision (MAP) value of and (UB-NLM-UBTI 3, and UB-NLM-UBTI 1). Overall these two runs perform well above the median run in imageclefmed 2007 (Median MAP= ) and rank 5th and 6th among all automatic mixed runs. Unfortunately our multilingual runs perform significantly below (MAP 0.254). This indicates that our automatic translation approach does decrease performance when compared to using the English queries only. We suspect that this could be due to the fact that the translations might be adding terms that change the focus of the query. Tables 2a-d show a series of unofficial runs that allow comparison of the methods that were used in our system. Table 2a shows the performance obtained by using free text (English only), automatically assigned UMLS concepts and the CBIR retrieval using FIRE. Our base lines for free text and UMLS concepts are quite strong since they both perform above the median system. The CBIR baseline is quite weak compared with the text and concept baselines. However, when compared to other visual only runs it is around average for CBIR runs. A query by query analysis of the results for the CBIR run shows that the MAP for 21 of the 30 queries is below which is a major factor for the poor performance shown. It appears that the fact that the queries require specific image modality seems to be a major factor since our CBIR system does not include an image classification module that could identify the image modality to filter out those images that do not have the requested modality in the query. Table 2b shows the results obtained using only English queries. Because the collection has predominantly English annotations we can see that these runs
5 Evaluation of Automatically Assigned MeSH Terms 645 Table 1. Performance of Official Runs Run name Description type MAP Exact-P P10 P20 UB-NLM-UBTI 3 English queries Mixed run UB-NLM-UBTI 1 English queries Mixed run UB-NLM-UBmixedMulti2 English cross-lang Mixed run UB-NLM-UBTextBL1 English queries Text only UB-NLM-UBTextBL2 English cross-lang Text only UB-NLM-UBTextFR French cross-lang Text only UB-NLM-UBmixedFR French cross-lang Mixed run correspond to our highest scoring official runs (UBTI 1 and UBTI 3). All these runs use the free text as well as the UMLS concepts automatically assigned to both queries and documents. These results confirm that the use of automatically identified concepts improves performance considerably when compared to using free text only. We can also see that the merging formula that combines visual and text features does work properly despite the fact that the CBIR run contributes little to the overall MAP. Our two top scoring runs use text as well as image features. The best automatic run (MAP=0.3018) was not submitted but is only marginally better than our highest official run. Table 2c and 2d show performance of our cross-lingual runs. These runs use the UMLS automatic translations based on the UMLS concept mapping obtained from the English text. We can see that this actually harms performance significantly compared with using English only queries. We believe that is due to the aggressive translation method that we tried to use since it seems to add terms that shift the focus of the query. We plan to explore this issue in more detail in our future research. Despite this result we can see that the results confirm that using UMLS concepts (which are language independent) improves performance with respect to using only free text translations. Also the use of the results from the CBIR system yield only small improvements in retrieval performance. Table 2d shows the result of our cross-lingual runs that use French as the query language. Our official French runs used the same parameters as the English runs and this seems to have harmed the results for French since the runs presented in our unofficial runs show significantly better performance. These results are comparable to the best French cross-lingual results presented by other teams in the conference. However, the overall French cross-lingual results achieve only 56% of the English retrieval performance.this could be due to the fact that the French resources we used (citation database and medical lexicon) are much smaller than the UMLS resources available for English. Table 3 presents runs that use all the manually generated terms in English, French and German that were provided in the ImageCLEFmed topics. These queries achieve the highest score using our system with a MAP of which is comparable to the best manual run reported this year [5]. As in our previously presented experiments, the results with the manual queries show improvements when automatically generated UMLS concepts and pseudo relevance feedback are used. Use of the CBIR results yields a small improvement.
6 646 M.E. Ruiz and A. Névéol Table 2. Unofficial Runs Run name MAP Exact-P P10 P20 (a) Baseline runs EN-free text only UMLS concepts only FIRE baseline (CBIR) (b) English only runs EN-text-RF EN-text baseline + image EN-text rf + images (c) Automatic English cross-lingual runs EN-Multi-Baseline EN-Multi + concepts EN-Multi + concepts + images EN-Multi-rf (d) Automatic French cross-lingual runs FR-Multi-Baseline FR-Multi-Baseline + images FR-Multi- RF FR-Multi-RF + images Table 3. Manual runs Run name MAP Exact-P P10 P20 Multi-manual text only Multi-Manual text+contepts Multi-Manual Text+concepts + images Multi-manual rf Multi-manual rf + images Table 4. Comparison of results by type of query Type Free text UMLS concepts CBIR Combination Visual Visual-Semantic Semantic We performed a query by query analysis to try to understand how the different methods proposed are affected by different types of queries. Table 4 shows the average MAP by groups of topics according to whether they are visual, semantic and mixed (visual-semantic). As expected the text based and UMLS concept based runs perform better in the semantic topics. The CBIR system performs slightly better in the visual and mixed topics while the poorest performance is
7 Evaluation of Automatically Assigned MeSH Terms 647 in the semantic topics. The combination shows consistent improvements in all three groups of topics. 6 Conclusions From the results we can conclude that the use of automatically assigned UMLS concepts using MTI significantly improves performance for the retrieval of medical images with English annotations. We also confirm that our generalized vector space model works well for combining retrieval results from free text, UMLS concepts and CBIR systems. Despite the low performance of our CBIR system the merging method is robust enough to maintain or even improve results. We also conclude that our methods work better for semantic queries while still achieving significantly high performance for visual or mixed visual semantic queries. Our cross-lingual results using French as the query language are relatively low and indicate that we need to work on improving our translation method based on UMLS mapping. We plan to explore this further in our future research. The low results from the CBIR system indicate that we need to address the image classification problem so that the CBIR results can give a more significant contribution to the overall fusion of results. Acknowledgements This work was supported in part by an appointment of A. Névéol and M. E. Ruiz to the NLM Research Participation Program. This program is administered by the Oak Ridge Institute for Science and Education trhough an interagency agreement between the U.S. Department of Energy and the National Library of Medicine. We also want to thank Dr. Alan Aronson and the Indexing Initiative Project team at the NLM for their support and for making the MTI system available for this project. References [1] Ruiz, M.: Combining image features, case descriptions and umls concepts to improve retrieval of medical images. In: Proceedings of the AMIA Annual Symposium, Washington, DC, pp (2006) [2] Ruiz, M.: Ub at imageclefmed In: Peters, C., Clough, P., Gonzalo, J., Jones, G., Kluck, M., Magnini, B. (eds.) CLEF LNCS, vol. 4730, pp Springer, Heidelberg (2007) [3] Salton, G. (ed.): The SMART Retrieval System: Experiments in Automatic Document Processing. Prentice-Hall, Englewood Cliffs (1983) [4] Deselaers, T., Keysers, D., Ney., H.: Features for image retrieval: A quantitative comparison. In: Rasmussen, C.E., Bülthoff, H.H., Schölkopf, B., Giese, M.A. (eds.) DAGM LNCS, vol. 3175, pp Springer, Heidelberg (2004)
8 648 M.E. Ruiz and A. Névéol [5] Müller, H., Deselaers, T., Kim, E., Kalpathy-Cramer, J., Deserno, T.M., Hersh, W.: Overview of the imageclef 2007 medical retrieval and annotation tasks. In: Peters, C., et al. (eds.) CLEF LNCS, vol Springer, Heidelberg (2008) [6] Aronson, A., Mork, J., Gay, C., Humphrey, S., Rogers, W.: The nlm indexing initiativeś medical text indexer. In: MEDINFO, 11(Pt 1), pp (2004) [7] Névéol, A., Mork, J., Aronson, A., Darmoni, S.: Evaluation of french and english mesh indexing systems with a parallel corpus. In: Proceedings of the AMIA Annual Symposium, pp (2005) [8] Névéol, A., Rogozan, A., Darmoni, S.: Automatic indexing of online health resources for a french quality controlled gateway. Information Processing and Management 42, (2006) [9] Rocchio, J.J.: Relevance feedback in information retrieval. In: Salton, G. (ed.) The SMART Retrieval System: Experiments in Automatic Document Processing, Englewood Cliff, NJ (1971)
Using an Image-Text Parallel Corpus and the Web for Query Expansion in Cross-Language Image Retrieval
Using an Image-Text Parallel Corpus and the Web for Query Expansion in Cross-Language Image Retrieval Yih-Chen Chang and Hsin-Hsi Chen * Department of Computer Science and Information Engineering National
More informationDocument Retrieval using Predication Similarity
Document Retrieval using Predication Similarity Kalpa Gunaratna 1 Kno.e.sis Center, Wright State University, Dayton, OH 45435 USA kalpa@knoesis.org Abstract. Document retrieval has been an important research
More informationMIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion
MIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion Sara Lana-Serrano 1,3, Julio Villena-Román 2,3, José C. González-Cristóbal 1,3 1 Universidad Politécnica de Madrid 2 Universidad
More informationThe NLM Medical Text Indexer System for Indexing Biomedical Literature
The NLM Medical Text Indexer System for Indexing Biomedical Literature James G. Mork 1, Antonio J. Jimeno Yepes 2,1, Alan R. Aronson 1 1 National Library of Medicine, Bethesda, MD, USA {mork,alan}@nlm.nih.gov
More informationOptimization of the PubMed Automatic Term Mapping
238 Medical Informatics in a United and Healthy Europe K.-P. Adlassnig et al. (Eds.) IOS Press, 2009 2009 European Federation for Medical Informatics. All rights reserved. doi:10.3233/978-1-60750-044-5-238
More informationExperiment for Using Web Information to do Query and Document Expansion
Experiment for Using Web Information to do Query and Document Expansion Yih-Chen Chang and Hsin-Hsi Chen * Department of Computer Science and Information Engineering National Taiwan University Taipei,
More informationA fully-automatic approach to answer geographic queries: GIRSA-WP at GikiP
A fully-automatic approach to answer geographic queries: at GikiP Johannes Leveling Sven Hartrumpf Intelligent Information and Communication Systems (IICS) University of Hagen (FernUniversität in Hagen)
More informationText- and Content-based Approaches to Image Retrieval for the ImageCLEF 2009 Medical Retrieval Track
Text- and Content-based Approaches to Image Retrieval for the ImageCLEF 2009 Medical Retrieval Track Matthew Simpson, Md Mahmudur Rahman, Dina Demner-Fushman, Sameer Antani, George R. Thoma Lister Hill
More informationEvaluation and image retrieval
Evaluation and image retrieval Henning Müller Thomas Deselaers Overview Information retrieval evaluation TREC Multimedia retrieval evaluation TRECVID, ImageEval, Benchathlon, ImageCLEF Past Future Information
More informationMedical Image Annotation in ImageCLEF 2008
Medical Image Annotation in ImageCLEF 2008 Thomas Deselaers 1 and Thomas M. Deserno 2 1 RWTH Aachen University, Computer Science Department, Aachen, Germany 2 RWTH Aachen University, Dept. of Medical Informatics,
More informationDocument Expansion for Text-based Image Retrieval at CLEF 2009
Document Expansion for Text-based Image Retrieval at CLEF 2009 Jinming Min, Peter Wilkins, Johannes Leveling, and Gareth Jones Centre for Next Generation Localisation School of Computing, Dublin City University
More informationClustering for Text and Image-Based Photo Retrieval at CLEF 2009
Clustering for ext and mage-based Photo Retrieval at CLEF 2009 Qian Zhu and Diana nkpen School of nformation echnology and Engineering University of Ottawa qzhu012@uottawa.ca, diana@site.uottawa.ca Abstract.
More informationApplying the KISS Principle for the CLEF- IP 2010 Prior Art Candidate Patent Search Task
Applying the KISS Principle for the CLEF- IP 2010 Prior Art Candidate Patent Search Task Walid Magdy, Gareth J.F. Jones Centre for Next Generation Localisation School of Computing Dublin City University,
More informationCLEF-IP 2009: Exploring Standard IR Techniques on Patent Retrieval
DCU @ CLEF-IP 2009: Exploring Standard IR Techniques on Patent Retrieval Walid Magdy, Johannes Leveling, Gareth J.F. Jones Centre for Next Generation Localization School of Computing Dublin City University,
More informationIPL at ImageCLEF 2010
IPL at ImageCLEF 2010 Alexandros Stougiannis, Anestis Gkanogiannis, and Theodore Kalamboukis Information Processing Laboratory Department of Informatics Athens University of Economics and Business 76 Patission
More informationA novel supervised learning algorithm and its use for Spam Detection in Social Bookmarking Systems
A novel supervised learning algorithm and its use for Spam Detection in Social Bookmarking Systems Anestis Gkanogiannis and Theodore Kalamboukis Department of Informatics Athens University of Economics
More informationCACAO PROJECT AT THE 2009 TASK
CACAO PROJECT AT THE TEL@CLEF 2009 TASK Alessio Bosca, Luca Dini Celi s.r.l. - 10131 Torino - C. Moncalieri, 21 alessio.bosca, dini@celi.it Abstract This paper presents the participation of the CACAO prototype
More informationClassification and retrieval of biomedical literatures: SNUMedinfo at CLEF QA track BioASQ 2014
Classification and retrieval of biomedical literatures: SNUMedinfo at CLEF QA track BioASQ 2014 Sungbin Choi, Jinwook Choi Medical Informatics Laboratory, Seoul National University, Seoul, Republic of
More informationCADIAL Search Engine at INEX
CADIAL Search Engine at INEX Jure Mijić 1, Marie-Francine Moens 2, and Bojana Dalbelo Bašić 1 1 Faculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, 10000 Zagreb, Croatia {jure.mijic,bojana.dalbelo}@fer.hr
More informationContent-Based Medical Image Retrieval Using Low-Level Visual Features and Modality Identification
Content-Based Medical Image Retrieval Using Low-Level Visual Features and Modality Identification Juan C. Caicedo, Fabio A. Gonzalez and Eduardo Romero BioIngenium Research Group National University of
More informationCS473: Course Review CS-473. Luo Si Department of Computer Science Purdue University
CS473: CS-473 Course Review Luo Si Department of Computer Science Purdue University Basic Concepts of IR: Outline Basic Concepts of Information Retrieval: Task definition of Ad-hoc IR Terminologies and
More informationBalancing Manual and Automatic Indexing for Retrieval of Paper Abstracts
Balancing Manual and Automatic Indexing for Retrieval of Paper Abstracts Kwangcheol Shin 1, Sang-Yong Han 1, and Alexander Gelbukh 1,2 1 Computer Science and Engineering Department, Chung-Ang University,
More informationSheffield University and the TREC 2004 Genomics Track: Query Expansion Using Synonymous Terms
Sheffield University and the TREC 2004 Genomics Track: Query Expansion Using Synonymous Terms Yikun Guo, Henk Harkema, Rob Gaizauskas University of Sheffield, UK {guo, harkema, gaizauskas}@dcs.shef.ac.uk
More informationContent-Based Image Retrieval with LIRe and SURF on a Smartphone-Based Product Image Database
Content-Based Image Retrieval with LIRe and SURF on a Smartphone-Based Product Image Database Kai Chen 1 and Jean Hennebert 2 1 University of Fribourg, DIVA-DIUF, Bd. de Pérolles 90, 1700 Fribourg, Switzerland
More informationMedical Image Annotation in ImageCLEF 2008
Medical Image Annotation in ImageCLEF 2008 Thomas Deselaers 1 and Thomas M. Deserno 2 1 RWTH Aachen University, Computer Science Department, Aachen, Germany deselaers@cs.rwth-aachen.de 2 RWTH Aachen University,
More informationUsing XML Logical Structure to Retrieve (Multimedia) Objects
Using XML Logical Structure to Retrieve (Multimedia) Objects Zhigang Kong and Mounia Lalmas Queen Mary, University of London {cskzg,mounia}@dcs.qmul.ac.uk Abstract. This paper investigates the use of the
More informationYork University at CLEF ehealth 2015: Medical Document Retrieval
York University at CLEF ehealth 2015: Medical Document Retrieval Andia Ghoddousi Jimmy Xiangji Huang Information Retrieval and Knowledge Management Research Lab Department of Computer Science and Engineering
More informationOverview of the CLEF 2009 Medical Image Annotation Track
Overview of the CLEF 2009 Medical Image Annotation Track Tatiana Tommasi 1, Barbara Caputo 1, Petra Welter 2, Mark Oliver Güld 2, and Thomas M. Deserno 2 1 Idiap Research Institute, Martigny, Switzerland,
More informationUniversity of Amsterdam at INEX 2010: Ad hoc and Book Tracks
University of Amsterdam at INEX 2010: Ad hoc and Book Tracks Jaap Kamps 1,2 and Marijn Koolen 1 1 Archives and Information Studies, Faculty of Humanities, University of Amsterdam 2 ISLA, Faculty of Science,
More informationSINAI at CLEF ehealth 2017 Task 3
SINAI at CLEF ehealth 2017 Task 3 Manuel Carlos Díaz-Galiano, M. Teresa Martín-Valdivia, Salud María Jiménez-Zafra, Alberto Andreu, and L. Alfonso Ureña López Department of Computer Science, Universidad
More informationExploring the Query Expansion Methods for Concept Based Representation
Exploring the Query Expansion Methods for Concept Based Representation Yue Wang and Hui Fang Department of Electrical and Computer Engineering University of Delaware 140 Evans Hall, Newark, Delaware, 19716,
More informationWikipedia Retrieval Task ImageCLEF 2011
Wikipedia Retrieval Task ImageCLEF 2011 Theodora Tsikrika University of Applied Sciences Western Switzerland, Switzerland Jana Kludas University of Geneva, Switzerland Adrian Popescu CEA LIST, France Outline
More informationPreliminary Work on Building a User Friendly Adaptive Clinical Documents Repository
Preliminary Work on Building a User Friendly Adaptive Clinical Documents Repository Enriko Aryanto Stanford University 121 Campus Dr. #3112A Stanford, CA 94305 1-650-497-7306 earyanto@stanford.edu Yang
More informationExternal Query Reformulation for Text-based Image Retrieval
External Query Reformulation for Text-based Image Retrieval Jinming Min and Gareth J. F. Jones Centre for Next Generation Localisation School of Computing, Dublin City University Dublin 9, Ireland {jmin,gjones}@computing.dcu.ie
More informationMultiple Terminologies in a Health Portal: Automatic Indexing and Information Retrieval
Multiple Terminologies in a Health Portal: Automatic Indexing and Information Retrieval Stéfan J. Darmoni, MD, PhD 1, Suzanne Pereira, PhD 1,2,3, Saoussen Sakji, MSc 1, Tayeb Merabti, MSc 1, É. Prieur,
More informationThis is the author s version of a work that was submitted/accepted for publication in the following source:
This is the author s version of a work that was submitted/accepted for publication in the following source: Koopman, Bevan, Bruza, Peter, Sitbon, Laurianne, & Lawley, Michael (2011) AEHRC & QUT at TREC
More informationA Study on Query Expansion with MeSH Terms and Elasticsearch. IMS Unipd at CLEF ehealth Task 3
A Study on Query Expansion with MeSH Terms and Elasticsearch. IMS Unipd at CLEF ehealth Task 3 Giorgio Maria Di Nunzio and Alexandru Moldovan Dept. of Information Engineering University of Padua giorgiomaria.dinunzio@unipd.it,alexandru.moldovan@studenti.unipd.it
More informationA RECOMMENDER SYSTEM FOR SOCIAL BOOK SEARCH
A RECOMMENDER SYSTEM FOR SOCIAL BOOK SEARCH A thesis Submitted to the faculty of the graduate school of the University of Minnesota by Vamshi Krishna Thotempudi In partial fulfillment of the requirements
More informationRecommending MeSH terms for annotating biomedical articles
Recommending MeSH terms for annotating biomedical articles Minlie Huang, 1,2 Aurélie Névéol, 2 Zhiyong Lu 2 1 State Key Laboratory of Intelligent Technology and Systems, Tsinghua National Laboratory for
More informationA Language Independent Author Verifier Using Fuzzy C-Means Clustering
A Language Independent Author Verifier Using Fuzzy C-Means Clustering Notebook for PAN at CLEF 2014 Pashutan Modaresi 1,2 and Philipp Gross 1 1 pressrelations GmbH, Düsseldorf, Germany {pashutan.modaresi,
More informationResPubliQA 2010
SZTAKI @ ResPubliQA 2010 David Mark Nemeskey Computer and Automation Research Institute, Hungarian Academy of Sciences, Budapest, Hungary (SZTAKI) Abstract. This paper summarizes the results of our first
More informationWord Indexing Versus Conceptual Indexing in Medical Image Retrieval
Word Indexing Versus Conceptual Indexing in Medical Image Retrieval (ReDCAD participation at ImageCLEF Medical Image Retrieval 2012) Karim Gasmi, Mouna Torjmen-Khemakhem, and Maher Ben Jemaa Research unit
More informationQuery Reformulation for Clinical Decision Support Search
Query Reformulation for Clinical Decision Support Search Luca Soldaini, Arman Cohan, Andrew Yates, Nazli Goharian, Ophir Frieder Information Retrieval Lab Computer Science Department Georgetown University
More informationMultilingual Image Search from a user s perspective
Multilingual Image Search from a user s perspective Julio Gonzalo, Paul Clough, Jussi Karlgren QUAERO-Image CLEF workshop, 16/09/08 Finding is a matter of two fast stupid smart slow great potential for
More informationECNU at 2017 ehealth Task 2: Technologically Assisted Reviews in Empirical Medicine
ECNU at 2017 ehealth Task 2: Technologically Assisted Reviews in Empirical Medicine Jiayi Chen 1, Su Chen 1, Yang Song 1, Hongyu Liu 1, Yueyao Wang 1, Qinmin Hu 1, Liang He 1, and Yan Yang 1,2 Department
More informationAAUITEC at ImageCLEF 2015: Compound Figure Separation
AAUITEC at ImageCLEF 2015: Compound Figure Separation Mario Taschwer 1 and Oge Marques 2 1 ITEC, Klagenfurt University (AAU), Austria, mario.taschwer@aau.at 2 Florida Atlantic University (FAU), Boca Raton,
More informationChapter 6: Information Retrieval and Web Search. An introduction
Chapter 6: Information Retrieval and Web Search An introduction Introduction n Text mining refers to data mining using text documents as data. n Most text mining tasks use Information Retrieval (IR) methods
More informationMultimodal Medical Image Retrieval based on Latent Topic Modeling
Multimodal Medical Image Retrieval based on Latent Topic Modeling Mandikal Vikram 15it217.vikram@nitk.edu.in Suhas BS 15it110.suhas@nitk.edu.in Aditya Anantharaman 15it201.aditya.a@nitk.edu.in Sowmya Kamath
More informationDUTH at ImageCLEF 2011 Wikipedia Retrieval
DUTH at ImageCLEF 2011 Wikipedia Retrieval Avi Arampatzis, Konstantinos Zagoris, and Savvas A. Chatzichristofis Department of Electrical and Computer Engineering, Democritus University of Thrace, Xanthi
More informationOutline. Possible solutions. The basic problem. How? How? Relevance Feedback, Query Expansion, and Inputs to Ranking Beyond Similarity
Outline Relevance Feedback, Query Expansion, and Inputs to Ranking Beyond Similarity Lecture 10 CS 410/510 Information Retrieval on the Internet Query reformulation Sources of relevance for feedback Using
More informationA Survey on Postive and Unlabelled Learning
A Survey on Postive and Unlabelled Learning Gang Li Computer & Information Sciences University of Delaware ligang@udel.edu Abstract In this paper we survey the main algorithms used in positive and unlabeled
More informationA New Approach for Automatic Thesaurus Construction and Query Expansion for Document Retrieval
Information and Management Sciences Volume 18, Number 4, pp. 299-315, 2007 A New Approach for Automatic Thesaurus Construction and Query Expansion for Document Retrieval Liang-Yu Chen National Taiwan University
More informationMedical image analysis and retrieval. Henning Müller
Medical image analysis and retrieval Henning Müller Overview My background Our laboratory Current projects Khresmoi, MANY, Promise, Chorus+, NinaPro Challenges Demonstration Conclusions 2 Personal background
More informationPatent Terminlogy Analysis: Passage Retrieval Experiments for the Intellecutal Property Track at CLEF
Patent Terminlogy Analysis: Passage Retrieval Experiments for the Intellecutal Property Track at CLEF Julia Jürgens, Sebastian Kastner, Christa Womser-Hacker, and Thomas Mandl University of Hildesheim,
More informationBook Recommendation based on Social Information
Book Recommendation based on Social Information Chahinez Benkoussas and Patrice Bellot LSIS Aix-Marseille University chahinez.benkoussas@lsis.org patrice.bellot@lsis.org Abstract : In this paper, we present
More informationRelevance of Google Customized Search Engine vs. CISMeF Quality- Controlled Health Gateway
Relevance of Google Customized Search Engine vs. CISMeF Quality- Controlled Health Gateway Jean-François Gehanno a, Gaétan Kerdelhué a, Saoussen Sakji a, Philippe Massari a, Michel Joubert b, Stéfan J.
More informationX. A Relevance Feedback System Based on Document Transformations. S. R. Friedman, J. A. Maceyak, and S. F. Weiss
X-l X. A Relevance Feedback System Based on Document Transformations S. R. Friedman, J. A. Maceyak, and S. F. Weiss Abstract An information retrieval system using relevance feedback to modify the document
More informationFIRE Flexible Image Retrieval Engine: ImageCLEF 2004 Evaluation
FIRE Flexible Image Retrieval Engine: ImageCLEF 2004 Evaluation Thomas Deselaers, Daniel Keysers, and Hermann Ney Lehrstuhl für Informatik VI Computer Science Department, RWTH Aachen University D-52056
More informationTREC 2016 Dynamic Domain Track: Exploiting Passage Representation for Retrieval and Relevance Feedback
RMIT @ TREC 2016 Dynamic Domain Track: Exploiting Passage Representation for Retrieval and Relevance Feedback Ameer Albahem ameer.albahem@rmit.edu.au Lawrence Cavedon lawrence.cavedon@rmit.edu.au Damiano
More informationTEXT CHAPTER 5. W. Bruce Croft BACKGROUND
41 CHAPTER 5 TEXT W. Bruce Croft BACKGROUND Much of the information in digital library or digital information organization applications is in the form of text. Even when the application focuses on multimedia
More informationSNUMedinfo at TREC CDS track 2014: Medical case-based retrieval task
SNUMedinfo at TREC CDS track 2014: Medical case-based retrieval task Sungbin Choi, Jinwook Choi Medical Informatics Laboratory, Seoul National University, Seoul, Republic of Korea wakeup06@empas.com, jinchoi@snu.ac.kr
More informationInter and Intra-Document Contexts Applied in Polyrepresentation
Inter and Intra-Document Contexts Applied in Polyrepresentation Mette Skov, Birger Larsen and Peter Ingwersen Department of Information Studies, Royal School of Library and Information Science Birketinget
More informationCIRGDISCO at RepLab2012 Filtering Task: A Two-Pass Approach for Company Name Disambiguation in Tweets
CIRGDISCO at RepLab2012 Filtering Task: A Two-Pass Approach for Company Name Disambiguation in Tweets Arjumand Younus 1,2, Colm O Riordan 1, and Gabriella Pasi 2 1 Computational Intelligence Research Group,
More informationMulti-Modal Interactive Approach to ImageCLEF 2007 Photographic and Medical Retrieval Tasks by CINDI
Multi-Modal Interactive Approach to ImageCLEF 2007 Photographic and Medical Retrieval Tasks by CINDI M. M. Rahman, Bipin C. Desai, Prabir Bhattacharya Dept. of Computer Science & Software Engineering,
More informationTilburg University. Authoritative re-ranking of search results Bogers, A.M.; van den Bosch, A. Published in: Advances in Information Retrieval
Tilburg University Authoritative re-ranking of search results Bogers, A.M.; van den Bosch, A. Published in: Advances in Information Retrieval Publication date: 2006 Link to publication Citation for published
More informationTask3 Patient-Centred Information Retrieval: Team CUNI
Task3 Patient-Centred Information Retrieval: Team CUNI Shadi Saleh and Pavel Pecina Charles University Faculty of Mathematics and Physics Institute of Formal and Applied Linguistics, Czech Republic {saleh,pecina}@ufal.mff.cuni.cz
More informationNovaSearch on medical ImageCLEF 2013
NovaSearch on medical ImageCLEF 2013 André Mourão, Flávio Martins and João Magalhães Universidade Nova de Lisboa, Faculdade de Ciências e Tecnologia, Caparica, Portugal, a.mourao@campus.fct.unl.pt, flaviomartins@acm.org,
More informationRMIT University at TREC 2006: Terabyte Track
RMIT University at TREC 2006: Terabyte Track Steven Garcia Falk Scholer Nicholas Lester Milad Shokouhi School of Computer Science and IT RMIT University, GPO Box 2476V Melbourne 3001, Australia 1 Introduction
More informationA Semantic Web-Based Approach for Harvesting Multilingual Textual. definitions from Wikipedia to support ICD-11 revision
A Semantic Web-Based Approach for Harvesting Multilingual Textual Definitions from Wikipedia to Support ICD-11 Revision Guoqian Jiang 1,* Harold R. Solbrig 1 and Christopher G. Chute 1 1 Department of
More informationA Multiple-stage Approach to Re-ranking Clinical Documents
A Multiple-stage Approach to Re-ranking Clinical Documents Heung-Seon Oh and Yuchul Jung Information Service Center Korea Institute of Science and Technology Information {ohs, jyc77}@kisti.re.kr Abstract.
More informationDealing with Syntactic Variation through a Locality-Based Approach
Dealing with Syntactic Variation through a Locality-Based Approach Jesús Vilares and Miguel A. Alonso Departamento de Computación, Universidade da Coruña Campus de Elviña s/n, 15071 A Coruña, Spain. {jvilares,
More informationTrans-Media Pseudo-Relevance Feedback Methods in Multimedia Retrieval
Trans-Media Pseudo-Relevance Feedback Methods in Multimedia Retrieval Stephane Clinchant, Jean-Michel Renders, and Gabriela Csurka Xerox Research Centre Europe, 6 ch. de Maupertuis, 38240 Meylan, France
More informationAn Information Retrieval Approach for Source Code Plagiarism Detection
-2014: An Information Retrieval Approach for Source Code Plagiarism Detection Debasis Ganguly, Gareth J. F. Jones CNGL: Centre for Global Intelligent Content School of Computing, Dublin City University
More informationMultimodal Information Spaces for Content-based Image Retrieval
Research Proposal Multimodal Information Spaces for Content-based Image Retrieval Abstract Currently, image retrieval by content is a research problem of great interest in academia and the industry, due
More informationWeSeE-Match Results for OEAI 2012
WeSeE-Match Results for OEAI 2012 Heiko Paulheim Technische Universität Darmstadt paulheim@ke.tu-darmstadt.de Abstract. WeSeE-Match is a simple, element-based ontology matching tool. Its basic technique
More informationShrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India
International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 3 ISSN : 2456-3307 Some Issues in Application of NLP to Intelligent
More informationA New Measure of the Cluster Hypothesis
A New Measure of the Cluster Hypothesis Mark D. Smucker 1 and James Allan 2 1 Department of Management Sciences University of Waterloo 2 Center for Intelligent Information Retrieval Department of Computer
More informationWEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS
1 WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS BRUCE CROFT NSF Center for Intelligent Information Retrieval, Computer Science Department, University of Massachusetts,
More informationSemantic Estimation for Texts in Software Engineering
Semantic Estimation for Texts in Software Engineering 汇报人 : Reporter:Xiaochen Li Dalian University of Technology, China 大连理工大学 2016 年 11 月 29 日 Oscar Lab 2 Ph.D. candidate at OSCAR Lab, in Dalian University
More informationA Multiclassifier based Approach for Word Sense Disambiguation using Singular Value Decomposition
A Multiclassifier based Approach for Word Sense Disambiguation using Singular Value Decomposition Ana Zelaia, Olatz Arregi and Basilio Sierra Computer Science Faculty University of the Basque Country ana.zelaia@ehu.es
More informationIPL at CLEF 2013 Medical Retrieval Task
IPL at CLEF 2013 Medical Retrieval Task Spyridon Stathopoulos, Ismini Lourentzou, Antonia Kyriakopoulou, and Theodore Kalamboukis Information Processing Laboratory, Department of Informatics, Athens University
More informationFull-texts representation with Medical Subject Headings, and co-citations network reranking strategies for TREC 2014 Clinical Decision Support Track
Full-texts representation with Medical Subject Headings, and co-citations network reranking strategies for TREC 2014 Clinical Decision Support Track J. Gobeill ab, A. Gaudinat a, E. Pasche c, P. Ruch ab
More informationFrom Passages into Elements in XML Retrieval
From Passages into Elements in XML Retrieval Kelly Y. Itakura David R. Cheriton School of Computer Science, University of Waterloo 200 Univ. Ave. W. Waterloo, ON, Canada yitakura@cs.uwaterloo.ca Charles
More informationMaintaining a Catalog of Manually-Indexed, Clinically-Oriented World Wide Web Content
Maintaining a Catalog of Manually-Indexed, Clinically-Oriented World Wide Web Content William Hersh, M.D., Andrea Ball, M.L.S., Bikram Day, M.S., Mary Masterson, M.P.H., Li Zhang, M.S., Lynetta Sacherek,
More informationNUS-I2R: Learning a Combined System for Entity Linking
NUS-I2R: Learning a Combined System for Entity Linking Wei Zhang Yan Chuan Sim Jian Su Chew Lim Tan School of Computing National University of Singapore {z-wei, tancl} @comp.nus.edu.sg Institute for Infocomm
More informationUMass at TREC 2017 Common Core Track
UMass at TREC 2017 Common Core Track Qingyao Ai, Hamed Zamani, Stephen Harding, Shahrzad Naseri, James Allan and W. Bruce Croft Center for Intelligent Information Retrieval College of Information and Computer
More informationDocument Structure Analysis in Associative Patent Retrieval
Document Structure Analysis in Associative Patent Retrieval Atsushi Fujii and Tetsuya Ishikawa Graduate School of Library, Information and Media Studies University of Tsukuba 1-2 Kasuga, Tsukuba, 305-8550,
More informationA Multiclassifier based Approach for Word Sense Disambiguation using Singular Value Decomposition
A Multiclassifier based Approach for Word Sense Disambiguation using Singular Value Decomposition Ana Zelaia, Olatz Arregi and Basilio Sierra Computer Science Faculty University of the Basque Country ana.zelaia@ehu.es
More informationAutomatic prior art searching and patent encoding at CLEF-IP 10
Automatic prior art searching and patent encoding at CLEF-IP 10 1 Douglas Teodoro, 2 Julien Gobeill, 1 Emilie Pasche, 1 Dina Vishnyakova, 2 Patrick Ruch and 1 Christian Lovis, 1 BiTeM group, Medical Informatics
More informationEnriching Knowledge Domain Visualizations: Analysis of a Record Linkage and Information Fusion Approach to Citation Data
Enriching Knowledge Domain Visualizations: Analysis of a Record Linkage and Information Fusion Approach to Citation Data Marie B. Synnestvedt, MSEd 1, 2 1 Drexel University College of Information Science
More informationMedGIFT projects in medical imaging. Henning Müller
MedGIFT projects in medical imaging Henning Müller Where we are 2 Who I am Medical informatics studies in Heidelberg, Germany (1992-1997) Exchange with Daimler Benz research, USA PhD in image processing,
More informationA Formal Approach to Score Normalization for Meta-search
A Formal Approach to Score Normalization for Meta-search R. Manmatha and H. Sever Center for Intelligent Information Retrieval Computer Science Department University of Massachusetts Amherst, MA 01003
More informationAutomatically Generating Queries for Prior Art Search
Automatically Generating Queries for Prior Art Search Erik Graf, Leif Azzopardi, Keith van Rijsbergen University of Glasgow {graf,leif,keith}@dcs.gla.ac.uk Abstract This report outlines our participation
More informationUsing Relevant Regions in Image Search and Query Refinement for Medical CBIR
Using Relevant Regions in Image Search and Query Refinement for Medical CBIR Edward Kim 1, Sameer Antani 2, Xiaolei Huang 1, L.Rodney Long 2, Dina Demner-Fushman 2 1 Department of Computer Science and
More information1.
* 390/0/2 : 389/07/20 : 2 25-8223 ( ) 2 25-823 ( ) ISC SCOPUS L ISA http://jist.irandoc.ac.ir 390 22-97 - :. aminnezarat@gmail.com mosavit@pnu.ac.ir : ( ).... 00.. : 390... " ". ( )...2 2. 3. 4 Google..
More informationCUNI team: CLEF ehealth Consumer Health Search Task 2018
CUNI team: CLEF ehealth Consumer Health Search Task 2018 Shadi Saleh and Pavel Pecina Charles University Faculty of Mathematics and Physics Institute of Formal and Applied Linguistics, Czech Republic {saleh,pecina}@ufal.mff.cuni.cz
More informationWSU-IR at TREC 2015 Clinical Decision Support Track: Joint Weighting of Explicit and Latent Medical Query Concepts from Diverse Sources
WSU-IR at TREC 2015 Clinical Decision Support Track: Joint Weighting of Explicit and Latent Medical Query Concepts from Diverse Sources Saeid Balaneshin-kordan, Alexander Kotov, and Railan Xisto Department
More informationRouting and Ad-hoc Retrieval with the. Nikolaus Walczuch, Norbert Fuhr, Michael Pollmann, Birgit Sievers. University of Dortmund, Germany.
Routing and Ad-hoc Retrieval with the TREC-3 Collection in a Distributed Loosely Federated Environment Nikolaus Walczuch, Norbert Fuhr, Michael Pollmann, Birgit Sievers University of Dortmund, Germany
More informationSiemens TREC-4 Report: Further Experiments with Database. Merging. Ellen M. Voorhees. Siemens Corporate Research, Inc.
Siemens TREC-4 Report: Further Experiments with Database Merging Ellen M. Voorhees Siemens Corporate Research, Inc. Princeton, NJ ellen@scr.siemens.com Abstract A database merging technique is a strategy
More informationMetadata Quality Assessment: A Phased Approach to Ensuring Long-term Access to Digital Resources
Metadata Quality Assessment: A Phased Approach to Ensuring Long-term Access to Digital Resources Authors Daniel Gelaw Alemneh University of North Texas Post Office Box 305190, Denton, Texas 76203, USA
More information